Recent Projects & Publications


Tools Used:


Thesis pic

'How COVID-19 Impacted Tertiary Education:
Predicting the Educational Infrastructure of the Future'

An analysis of over 4.5 million observations regarding the impact of COVID-19 on tertiary education, with suggestions for the educational infrastructure of the future. Analysis included multiple linear regression, logistic regression, polynominal pipelines, and predictions trough classification with a support vector machine. Analysis was performed in Python (Pandas, NumPy, and SciKit-Learn) and R.

Submitted as my Senior Thesis in Data Science at Claremont McKenna College.

Search engine pic

Search Engine Project

A highly scalable search engine using the multi-petabyte common crawl dataset. Achieved a webpage rendering time of 0.006 seconds. Constructed using Docker, PostgreSQL, and Python.

Twitter pic

Twitter Coronavirus Project

Approximately 500 million tweets are sent everyday. Of those tweets, about 1% are geotagged. That is, the user's device includes location information about where the tweets were sent from. In this project, a dataset of all geotagged tweets that were sent in 2020 was analyzed, roughly 1.1 billion tweets.

The project used the MapReduce divide-and-conquer paradigm to create parallel code. It also involved working with multilingual text. The project stack included Docker, PostgreSQL, and Python.

RxRate pic

RxRate, an RxAll Company

According to the WHO, more than 1 million people die each year because of fake medications. The majority of these deaths occur in low- and middle-income countries, where an estimated 1 in 10 medical products is either substandard or falsified.

RxRate was created to directly combat this global crisis. The web-app rates the safety of pharmacies (on a novel pharmacy rating system I developed) and helps people steer clear of counterfeit drugs in the developing world. The RxRate web-app was created using JavaScript, Node.js, the Firebase SDK, and the Google Maps REST API.

Launched in September, 2019, RxRate achieved a bounce rate of 18%, a pages/session ratio of 2.73, and an average session duration of 00:03:15, in its first three months online.

CIE Paper pic

'Correlates of Innovation in Eastern European Economies'

A research paper outlining how lagging Eastern European countries can find economic success by implementing innovative institutional frameworks and industrial policies. The paper was developed through correlational research, with specific attention paid to the WIPO's Global Innovation Index.

The research paper was published by Claremont McKenna's Center for Innovation and Entrepreneurship (where I worked as a research analyst) in May 2020.

Trajectory pic

Trajectory – Basketball Physics

An iOS application I made in High School. 'Trajectory' aimed to help amateur basketball players improve their free-throw shot by measuring their angles of release and illustrating them on screen. These were shown next to an illustration of an angle of release that would guarantee a higher-percentage shot. The app was created with Swift in Xcode.

The application was published on the Apple App Store in 2017, but has since been removed. It generated over 50 downloads.

Professional Work Experience



Key Skills:

Data Analysis | Machine Learning | Data Warehousing
Data Visualization | Relational Database Management | ETL Pipelines
API Tools & Integration | CI/CD | Agile

German Football Association (DFB), Data Science Intern

DFB Pic
Simulated the Euro 2024 Qualifiers to predict the 24 teams to qualify for the tournament, improving resource allocation for the tournament by up to 20%; subsequently simulated the actual tournament to predict which teams Germany would most likely face, providing strategic insights for friendly match scheduling decisions ahead of the tournament

Reduced data ingestion expenses by 50% by championing, and then implementing, a transition from a PostgreSQL-based data infrastructure to one that leverages the capabilities of a serverless Databricks SQL Warehouse

Worked closely with Match Analysts to create, improve, and automate scouting tools, reducing manual effort by up to 40%

Expanded data tracking capabilities by 2% through the implementation of pipelines to integrate various new data sources

Completed various ad-hoc requests from the first team coaching staff, often delivering results under strict time-constraints


Los Angeles Dodgers, Data Consultant

LA Dodgers Pic
Analyzed a dataset of 400,000+ observations and developed logistic, polynomial, and random forest models trained with and without spin profile characteristics to determine whether batted-ball spin contributes to quality of contact in baseball

Presented a comprehensive, self-created R package and a Power BI Dashboard with key findings to the Dodgers R&D team


Crestron Electronics, Data Engineering Intern


Crestron Pic Constructed a platform to automate the Project Intake process at Crestron, providing users with a centralized database to manage projects and automatically assign relevant BPs to New Project Approval flows using Microsoft Power Platform

Reduced related business-processing times by 20–30% and followed an Agile development scheme throughout the internship, consisting of daily scrums and the use of Azure DevOps

Integrated the Power BI REST API with Power Automate and PowerPoint to allow users to instantly generate a complete, multi-slide .PPTX Presentation with specific and relevant information about their Project



Etesian Ecommerce, Software Engineering Intern


Etesian Pic
Designed UI, UX and member login portal of subsidiary company (SellGlobal) website using Wix, Docker and Ghost CMS

Constructed a member-specific client dashboard that highlights KPIs and automated the build for new sign-ups with Zapier



RxAll, Software Developer Intern

RxAll Pic
Developed a map-based web-app (RxRate) with the Google Maps API, Node.js, MySQL and Firebase that helps travelers find safe pharmacies and avoid counterfeit drugs in developing countries – iterated with feedback from UI/UX testing

Negotiated sign-ups from over 20 pharmacies in Yangon, Myanmar, through sales pitches and product demonstrations


About Me

Pic of me
I'm a recent graduate of Claremont McKenna College, graduating in December 2022 with a BA in Data Science. I hold three citizenships (USA, Germany, Canada) and I'm an avid world traveler and explorer, having visited 65 countries so far and counting. Please feel free to reach out!

Send me an email