Data Practice Projects from Codecademy

Description


This repo holds all the challenge and mini projects from Codecademy, relating to the following courses:

  • Python
    • Data Science: Natural Language Processing
    • Deep Learning with TensorFlow
    • Machine Learning/AI Engineering
    • Data Scientist: Machine Learning
    • Analyze Data with R
    • Data Science: Inference Specialist
    • Data Science: Analytics
    • Data Science Foundations
    • ML and AI Fundamentals
    • Master Statistics with Python
    • Fundamental Math for Data Science
    • Analyze Data with Python
    • Cleaning Data with Python
    • Visualize Data with Python
    • Software Engineering for Data Scientists
    • Software Engineering for Python
    • Learn Statistics with NumPy
    • Learn Python 3
  • SQL
    • Analyze Data with SQL
  • Bash/Terminal
    • Learn Command Line
  • JavaScript
    • Learn JavaScript

Projects and status

| Category | Project Title | Description |
| --- | --- | --- |
| Python - NLP | Presidential Speeches | Analyzing inaugural speeches with word embeddings to learn how presidents use language to convey their agenda |
| Python - NLP | News Analysis | Using tf-idf to analyze article contents and uncover the terms that best describe each article |
| Python - NLP | Predicting Mystery Postcard | Using a Naive Bayes classifier (NBC) to predict which friend sent a mystery letter |
| Python - NLP | Discover Insights from Classic Texts | Using noun phrase and verb phrase parsing to analyze classic texts |
| Python - Big Data with PySpark | Analyze Common Crawl Data with PySpark | Introduction to PySpark queries and RDDs |
| Python - Deep Learning | Classifying Galaxies with Conv Neural Networks | Using a dataset of 10,000 galaxy images of shape (128, 128, 3) to build a classification model that predicts the type of galaxy in each image |
| Python - Deep Learning | Deep Learning Classification with Heart Disease Data | Using a Kaggle dataset to predict the survival of patients with heart disease based on patient characteristics and serum panel |
| Python - Deep Learning | Deep Learning Regression with Admissions Data | Mini-project focused on building a deep learning regression model to predict college admission chances based on different features |
| Python - Deep Learning | Implementing Neural Networks | Using a neural network to predict life expectancy from WHO Global Health data |
| Python - ML | Book Recommender | Using the Surprise library to build a recommendation algorithm for book readers |
| Python - ML | Perceptron | Using a perceptron to model AND, OR, and XOR logic gates |
| Python - OOP | School Catalog | Refresher on OOP |
| Python - ML - Pipelines | Building ML Pipelines | Using UCI's bone marrow data to build a pipeline that includes preprocessing, cleaning, and selecting a classifier |
| Python - ML - Boosting | Predict whether an individual makes over $50K | Using Census data to predict whether a person makes more than $50K annually |
| Python - ML - Random Forests | Predict whether an individual makes over $50K | Using Census data to predict whether a person makes more than $50K annually |
| Python - ML - Hyperparameter Tuning | Classifying Raisins with Hyperparameter Tuning | Classifying different types of raisin grains using Murat Koklu's Kaggle dataset |
| Python - ML - Logistic Regression with CV | Predicting Wine Quality | Using data from UCI's Machine Learning Repository to predict wine quality using logistic regression with regularization techniques |
| Python - ML - Support Vector Machines | Predict Baseball Strikezones | Using a baseball dataset to find the decision boundary of a strike zone |
| Python - ML/NLP - Naive Bayes Classifier | Email Similarity | Using the fetch_20newsgroups dataset from sklearn.datasets to train a model that classifies email types |
| Python - ML - KMC | Handwriting Recognition with KMeans Clustering | Using the digits dataset from sklearn.datasets to build a k-means clustering algorithm that predicts handwritten digits |
| Python - ML - Wrapper Methods | Wrapper Methods | Determining which feature selection method improves the obesity prediction model |
| Python - ML - Decision Trees | Find the flag! | Generating decision trees using flag data |
| Python - ML - K Nearest Neighbors | Cancer Classifier | Using scikit-learn's neighbors module to predict whether a patient has breast cancer |
| Python - ML - Linear Regression | Honey Production | Using scikit-learn's linear_model module to predict the production output of honey by 2050 |
| Python - ML - Data Transformation | Transforming Data into Features | Transforming categorical data, scaling data, and working with datetime features |
| R - ML | Predicting Income with Social Data | Using R to uncover how age, education, and gender play a role in income |
| Python - ML - Logistic Regression | Predicting Credit Card Fraud | Using scikit-learn's LogisticRegression model to predict credit card fraud |
| Python - ML | Class Objects to Store Patient Information | Refresher on class objects and OOP |
| Python - Probability | True probability of coin flip | Visualizing the true probability of a coin flip |
| Data Analysis - BI | Intro to Tableau for Data Analysis | Learning to use Tableau to show the relationship between NYC trees and income |
| Data Analysis, Visualization, Python | Roller Coasters | Visualizing different roller coasters using functions |
| Data Analysis and Visualization | Board Slides for FoodWheel | Tracking business insights and consumer insights |
| Visualization | Constellations | Mapping the Orion star system |
| SQL - Engineering | First-Last-Touch-Attribution | Finding first/last touch attribution across different media channels |
| SQL - Engineering | Calculating Churn Rates | Calculating churn rates for Codeflix, a fictional company |
| Data Analytics - AB Testing | Ad effectiveness with AB Testing | AB testing for conversion rate |
| Data Visualization | Airline Trends | Visualizing airline trends |
| Data Analytics | NBA Trends | Analyze trends to explore associations between score, win rates, |
| Data Analytics - Exploratory Data Analysis | EDA: Diagnosing Diabetes | Analyzing diagnostic factors that affect diabetic outcomes in patients |
| Data Analytics | Exploring Mushrooms | Summarizing nominal characteristics of mushrooms |
| Data Analytics | Analyzing Internet Use with Python | Analyzing internet usage rates across the world between 1990 and 2020 using aggregation techniques |
| Data Analytics - Usage Funnel | Page Visits Funnel | Merging complementary datasets to determine the proportion of users who complete their purchases on a fictional t-shirt e-commerce website |
| Data Analytics | Product Defects | Using distributions to predict the number of product defects in a given day |
| Data Analytics | Analyzing High-Speed Railway Delays | Using EDA and aggregation methods to determine how weather impacts railway times |
| Data Analytics | Summarizing Automobile Evaluation Data | Summarizing categorical data of car manufacturers |
| Python - Concurrent Programming | Concurrent Programming | Learning concurrent programming |
| Data Analytics - Summaries | Analyze Electric Vehicle Stations in Python | Summarizing private and public charging stations in the US in Jupyter |
| Python - Context Managers | Aisha’s Greetings | Using context managers and constructors to generate card messages |
| Python - Error Logging | ATM Logging | Error logging practice |
| Python - Generators | Boredless Tourist | OOP practice |
| Python - Classes and Constructors | Basta Faloozin | Constructor practice |
| Python - Basic Statistics | Crunchie Munchies | Basic statistics with NumPy |
| Python - Basic Visualizations | Election Results | Basic data visualizations with pyplot |
| Python - Classes | School Catalogue | Practice with constructors and dunder methods |
| Python - Functions | Functional Programming | Learning how to write basic functions |
| Python - Functions | Scrabble | Learning basic function syntax |
| Python - Loops and Conditionals | Thread Shed | Practice with loops |
| Shell scripts | Building build script | |