Academic Projects
HireReady

Mar 2020 – May 2020

This project was a web application that provides interview preparation for data science-related job seekers by using machine learning algorithms to generate mock interview questions that are tailored to a specific job posting. It was selected as one of the best projects and received positive feedback during the presentation to a panel of venture capitalists. It utilized Flask for the back-end, Jinja for the front-end, as well as Docker and AWS Elastic Beanstalk for deployment.

See Project →

hireready

Individual Healthcare Cost Prediction

Jan 2020

This project predicted individual healthcare cost using data from U.S. Health Insurance Marketplace, U.S. Census, and Zillow Rental Values. It utilized Python and PySpark for scripting.

See Project Presentation →

health

Soccer Player Transfer Value Prediction

Nov 2019 – Dec 2019

This project predicted the value of a soccer player in the transfer market from their ratings and characteristics with a Random Forest model, using data from the FIFA 19 game.

See Project Presentation →

soccer

U.S. Health Insurance Marketplace Visualization using Census and Rent Cost

Nov 2019 – Dec 2019

This project processed and visualized various relationships between the U.S. Health Insurance Marketplace, U.S. Census, and Zillow Rental Values. It utilized Python and PySpark for scripting and Plotly for visualization.

See Project Presentation →

health

NBA Player of the Week & Salary Prediction

Oct 2019

This project used various data sources related to NBA players to predict how likely a player is to receive the Player of the Week award using logistic regression, as well as to predict his salary using linear regression, for a given NBA season.

See Project →

nba

NBA Player of the Week & Salary Visualization

Sep 2019 – Oct 2019

This project visualized various player statistics related to NBA players, including Player of the Week award, Most Valuable Player award, and salary, using multiple data sources. It utilized Python for scripting and ggplot for visualization.

See Project →

nba

SFO Air Traffic Passengers Visualization

Mar 2018 – May 2018

This project visualized various passenger statistics of San Francisco International Airport (SFO), using the dataset published by the SF Airport Commission through DataSF. It utilized R for scripting and ggplot for visualization.

See Project →

sfo

USF MEDA Practicum

Jan 2018 – May 2018

This project, in collaboration with the Mission Economic Development Agency, analyzed and visualized various distance and school placement metrics of elementary school students who are clients of MEDA, using publicly available school quality and demographic information combined with proprietary information.

meda

Flight Buddy

Sep 2017 – Dec 2017

This project was a concept for a mobile application that provides a platform for air travelers to share their flight experiences.

See Project Demo →

flight-buddy

SFO Air Traffic Landings Statistics

Apr 2017 – May 2017

This project visualized various landing statistics of San Francisco International Airport (SFO), using the dataset published by the SF Airport Commission through DataSF. It utilized JavaScript for scripting and D3.js for Visualization.

See Project →

sfo