My Projects

Baseball Hall of Fame Analysis

An analysis of BBWAA Hall of Fame voting. This project aims to determine what factors make an MLB player likely to be inducted, analyze Hall of Fame trends over time, and predict likely future inductees.

  • Technologies used: Python, Pandas, Scikit-learn, Plotly
  • Key Features: Data Cleaning, Data Visualization, Predictive Modeling

View Project

Project GitHub


Hand-Made Regression Algorithms

An overview of common regression techniques, and handmade classes for each algorithm. Also includes 3-D plotting of training sets and resulting regression surfaces to understand how each regressor learns data and makes predictions.

View Project

Project GitHub


Baseball Simulation

Daily Simulation of MLB games. This projet develops a model to predict the outcome of MLB plate appearances based pitch-by-pitch data from Statcast to precict plate appearence results. This model is then leveraged alongside expected lineups, weather, and other outside factors to simulate MLB games and predict outcomes for player statlines.

  • Technologies used: Python, Pandas, Scikit-learn, TensorFlow
  • Key Features: Data Cleaning, Feature Engineering, Predictive Modeling, Simulations

Project GitHub


News Recommendation System

A document retrieval system for use in recommending news articles across the political spectrum. The system leverages a COLBERT model checkpoint with further training on a corpus of news articles, and is designed to recommend articles from diverse viewpoints to users based on their preferences.

  • Technologies used: Python, NLP, Web Scraping
  • Key Features: Text Similarity, Machine Learning, Recommendation Systems

Project GitHub