Projects
A collection of data science, ML, and engineering work
Collabrium — AI-Powered Collaboration Platform
Multi-modal RAG platform that lets small teams ingest PDFs, images, and math-heavy documents into a secure vector store, then query them through an intelligent AI teammate. Built as a Harvard AC215 capstone project.
Quantitative Finance Strategy with Machine Learning
Robust financial time-series workflow implementing pairs-trading mean-reversion with random forest classification, plus Augmented Dickey-Fuller and Breusch-Pagan testing for statistical validity.
PPE Object Detection — R-CNN vs. Transformer
Comparative investigation of R-CNN vs. Transformer object detection models on personal protective equipment data. Models can monitor PPE non-compliance in industry, preventing fines and liability.
Data Engineering Lakehouse
Complete end-to-end ML data pipeline with a bronze/silver/gold lakehouse schema built on PySpark. The resulting architecture feeds both BI and ML business needs efficiently.
Bird Strikes and You
A logistic regression analysis in R to determine what factors influence bird-strike damage costs. Animal mass and season were most predictive, and actionable recommendations were provided to airlines.