My Projects
Baseball Hall of Fame Analysis
An analysis of BBWAA Hall of Fame voting. This project aims to determine what factors make an MLB player likely to be inducted, analyze Hall of Fame trends over time, and predict likely future inductees.
- Technologies used: Python, Pandas, Scikit-learn, Plotly
- Key Features: Data Cleaning, Data Visualization, Predictive Modeling
Hand-Made Regression Algorithms
An overview of common regression techniques, with a handmade class for each algorithm. It also includes 3-D plots of training sets and the resulting regression surfaces to show how each regressor learns from data and makes predictions.
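As a rough illustration of what a handmade regressor can look like, here is a minimal sketch of ordinary least squares written with NumPy only. The class name `HandmadeLinearRegression` and the synthetic two-feature data are assumptions made for this example, not code taken from the project. With two features, the fitted model is a plane, which is the kind of surface that can be drawn in a 3-D plot.

```python
import numpy as np


class HandmadeLinearRegression:
    """Ordinary least squares with an intercept (illustrative sketch)."""

    def fit(self, X, y):
        X = np.asarray(X, dtype=float)
        y = np.asarray(y, dtype=float)
        # Prepend a column of ones so the first coefficient is the intercept.
        design = np.column_stack([np.ones(len(X)), X])
        # Solve the least-squares problem min ||design @ beta - y||^2.
        beta, *_ = np.linalg.lstsq(design, y, rcond=None)
        self.intercept_ = beta[0]
        self.coef_ = beta[1:]
        return self

    def predict(self, X):
        return np.asarray(X, dtype=float) @ self.coef_ + self.intercept_


# Synthetic, noise-free training set: y = 3*x1 - 2*x2 + 1 (made up for the demo).
rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(50, 2))
y = 3.0 * X[:, 0] - 2.0 * X[:, 1] + 1.0

model = HandmadeLinearRegression().fit(X, y)
```

Because the demo data has no noise, the fitted coefficients should recover the generating plane up to floating-point error. Evaluating `model.predict` on a grid of `(x1, x2)` points would give the heights of the regression surface.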
Baseball Simulation
Daily simulation of MLB games. This project develops a model that predicts the outcome of MLB plate appearances from pitch-by-pitch Statcast data. The model is then combined with expected lineups, weather, and other outside factors to simulate MLB games and predict player statlines.
- Technologies used: Python, Pandas, Scikit-learn, TensorFlow
- Key Features: Data Cleaning, Feature Engineering, Predictive Modeling, Simulations
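The simulation step can be sketched as a Monte Carlo draw over plate appearance outcomes. This is a simplified illustration, not the project's code: the outcome list, the function `simulate_statline`, and the probabilities are hypothetical stand-ins for what a trained model would output, and the sketch assumes the same probabilities apply to every plate appearance (no lineup, weather, or game-state effects).

```python
import numpy as np

# Hypothetical outcome categories for a single plate appearance.
OUTCOMES = ["out", "walk", "single", "double", "triple", "home_run"]


def simulate_statline(pa_probs, n_pa=4, n_sims=10_000, seed=0):
    """Average count of each outcome per game over n_sims simulated games."""
    probs = np.asarray(pa_probs, dtype=float)
    if probs.shape != (len(OUTCOMES),) or not np.isclose(probs.sum(), 1.0):
        raise ValueError("pa_probs must hold one probability per outcome, summing to 1")
    rng = np.random.default_rng(seed)
    # counts[i, j] = number of times outcome j occurred in simulated game i.
    counts = rng.multinomial(n_pa, probs, size=n_sims)
    return dict(zip(OUTCOMES, counts.mean(axis=0)))


# Made-up probabilities standing in for a model's prediction for one batter.
probs = [0.68, 0.08, 0.15, 0.05, 0.005, 0.035]
expected = simulate_statline(probs)
```

Each simulated game draws `n_pa` plate appearances, so the averages in `expected` approach `n_pa` times each probability as `n_sims` grows. A fuller simulation would presumably recompute the probabilities for every matchup and track baserunners and outs.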
News Recommendation System
A document retrieval system for recommending news articles across the political spectrum. The system starts from a ColBERT model checkpoint that is further trained on a corpus of news articles, and it is designed to recommend articles from diverse viewpoints to users based on their preferences.
- Technologies used: Python, NLP, Web Scraping
- Key Features: Text Similarity, Machine Learning, Recommendation Systems
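ColBERT scores a document by late interaction: each query token embedding is matched to its most similar document token embedding, and those maxima are summed. The sketch below shows that scoring rule with NumPy. The random arrays are placeholders for real encoder output, and the helper names `maxsim_score` and `rank_documents` are assumptions for this example rather than part of the project or of any ColBERT library.

```python
import numpy as np


def maxsim_score(query_emb, doc_emb):
    """Late-interaction score: sum over query tokens of the max cosine similarity."""
    q = query_emb / np.linalg.norm(query_emb, axis=1, keepdims=True)
    d = doc_emb / np.linalg.norm(doc_emb, axis=1, keepdims=True)
    sim = q @ d.T  # shape: (query tokens, document tokens)
    return float(sim.max(axis=1).sum())


def rank_documents(query_emb, doc_embs):
    """Return document indices ordered from highest to lowest score."""
    scores = [maxsim_score(query_emb, d) for d in doc_embs]
    return sorted(range(len(doc_embs)), key=lambda i: scores[i], reverse=True)


# Placeholder embeddings: 4 query tokens with 8 dimensions each.
rng = np.random.default_rng(0)
query = rng.normal(size=(4, 8))
# doc_a contains every query token plus unrelated tokens; doc_b is unrelated.
doc_a = np.vstack([query, rng.normal(size=(6, 8))])
doc_b = rng.normal(size=(10, 8))

ranking = rank_documents(query, [doc_b, doc_a])
```

Since `doc_a` contains an exact match for every query token, it reaches the maximum possible score (one per query token) and ranks ahead of `doc_b`. Recommending across viewpoints would need an additional step on top of this ranking, which the sketch does not attempt.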