This repository is an aggregate of smaller course-related capstone projects, associated with courses from Codecademy and Coursera. The goal of this repository is to reduce repository clutter and showcase achievements in my journey of data science and analytics. This repo is a compliment of my coursework from Columbia Data Analytics Bootcamp.
Content for the About section goes here.
Code/ Folder: 01-life-expectancy-gdp
Description: An exploration into GDP and Life expectancy for six countries between 2000 and 2015. This project was completed for the Codecademy Data Analytics Capstone.
Tech-stack: pandas
, numpy
, matplotlib
, seaborn
, scipy
Results/ Presentation: Summary Document
Code/ Folder: biodiversity.ipynb
Description: An exploration into a biodiversity dataset.
Tech-stack: pandas
, numpy
, matplotlib
, seaborn
, scipy
Results/ Presentation: Aquatic and mammalian species are at greatest risk and are consistent across all National Parks.
Code/ Folder: insurance_summary.ipynb
Description: A dive into US medical conditions and its impact on US medical insurance rates.
Tech-stack: pandas
, numpy
, matplotlib
, seaborn
Code/ Folder: summary_analysis.ipynb
Description: Using StackOverflow survey data aggregated between 2018 and 2020, I wanted to explore what are the most in-demand skills required to enter the data analytics industry.
Tech-stack: pandas
, numpy
, matplotlib
, seaborn
Results/ Presentation: Most in demand DMSs are MySQL, PostgreSQL, Microsoft SQL, MongoDB and SQLite
Code/ Folder: crash-EDA.ipynb
Description: This analysis uses publicly available data from NHTSA and the Pew Research Center to perform a causal analysis of smartphone usage and fatal auto accidents.
Tech-stack: pandas
, numpy
, matplotlib
, seaborn
, datetime
, scipy
Results/ Presentation: Although this data suggests that there is no relationship between smartphone usage and auto accidents, it does shed insight to the number of drivers who text and drive without smartphones. As demonstrated in the analysis, there is a notable decrease in auto accidents between the years 2009 and 2012. This may be attributed to the DOT's texting and driving ads shown on television during this time. Smartphone usage took off after 2011, this may suggest that there is a percentage of people in the population do text and drive without the need of a smartphone like iPhone or android.
Code/ Folder: exploratory.ipynb
Description: We seek to strengthen MSU Library's CTA language and encourage to click on the "Interact" link. This interact CTA allows students to access library resources such as hours, floorplans, faculty, and more.
Tech-stack: Google Analytics
, CrazyEgg
, pandas
, numpy
, matplotlib
, seaborn
, datetime
, scipy
, regex
Results/ Presentation: ab_test_ppt_pdf.pdf