Springboard Data Science Career Track

This is the coursework completed for the Data Science Career Track at Springboard.

https://www.springboard.com

Introduction

Springboard is an intensive, full-time online data science program:

600+ hours of curriculum, including video, articles and hands-on projects.
Developed and continuously updated with and by industry experts, to teach in-demand skills
Capstone Projects
Curriculum covers data wrangling, data storytelling, inferential statistics, data visualization, machine learning and big data
Weekly 1-on-1 mentorship by industry experts
Mentor matched to student profile and goals delivering relevant insight from the industry
All mentors are active data scientists at top technology companies
20% acceptance rate; we accept only the best applicants

The goal is to cover core data science fundamentals and get students job-ready.

Libraries/Modules

Install the following modules for each unit:

JSON Based Data Exercise	pandas, json, numpy
SQL Practice	Springboard SQL Website
API Mini-Project	requests, json, pandas, statistics
Frequentist Statistics	scipy, numpy, pandas, numpy.random, matplotlib.pyplot
Bootstrap Statistics	pandas, numpy, numpy.random, matplotlib.pyplot
Bayesian Inference	pymc3, pandas, numpy, numpy.random, matplotlib.pyplot, scipy
Linear Regression Boston Housing Data Set	numpy, pandas, scipy, matplotlib, sklearn, seaborn
Heights and Weights Logistic Regression	numpy, scipy, matplotlib, pandas, seaborn, sklearn, warnings
Predicting Movie Ratings from Reviews Using Naive Bayes	glob, numpy, scipy, matplotlib, pandas, seaborn, six.moves
Customer Segmentation Using Clustering	pandas, sklearn, matplotlib, seaborn
Find 2-3 Job Titles WordCloud	wordcloud, re, string, collections, nltk, bokeh
Spark Mini-Project DataBricks	pyspark, Spark SQL (requires registering on Databricks)
Take-Home Challenge Ultimate	pandas, json, plotly, bokeh, seaborn, matplotlib, numpy, sklearn, datetime, xgboost, keras
Take Home Challenge Relax	glob, numpy, datetime, tqdm, collections, seaborn, bokeh, sklearn, xgboost

Data Wrangling JSON

Practices data wrangling on a World Bank dataset for a school quality improvement project in Ethiopia.

Data Wrangling JSON
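
As a rough illustration of this kind of JSON wrangling, the sketch below parses a small array of records and counts projects per country using only the standard library. The records and the field names `countryname` and `project_name` are hypothetical placeholders, not the actual World Bank file, and the notebook itself lists pandas and numpy for this work.

```python
import json
from collections import Counter

# Hypothetical records standing in for the real dataset (assumption).
raw = '''[
  {"countryname": "Ethiopia", "project_name": "Project A"},
  {"countryname": "Kenya", "project_name": "Project B"},
  {"countryname": "Ethiopia", "project_name": "Project C"}
]'''


def count_projects_by_country(json_text):
    """Parse a JSON array of project records and count projects per country."""
    records = json.loads(json_text)
    return Counter(record["countryname"] for record in records)


counts = count_projects_by_country(raw)
print(counts.most_common(1))
```

A `Counter` keeps the grouping logic to one line; with a real file, the same function could be fed the text read from disk.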

SQL mini project

Uses Springboard's SQL website to wrangle data.

SQL Practice

API MINI Project

Uses the Quandl API to analyze stock prices from the Frankfurt Stock Exchange.

API MINI Project
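
The sketch below shows one way price data returned by an API could be summarized once it has been parsed from JSON. It makes no network call: the payload is hard-coded, its shape (`dataset`, `column_names`, `data`) is an assumption about the response format, and the prices are invented for illustration.

```python
import json
import statistics

# Hard-coded stand-in for an API response; shape and values are assumptions.
payload = json.loads('''{
  "dataset": {
    "column_names": ["Date", "Open", "High", "Low", "Close"],
    "data": [
      ["2017-01-04", 10.0, 12.0, 9.5, 11.0],
      ["2017-01-03", 9.0, 10.5, 8.5, 10.0],
      ["2017-01-02", 8.0, 9.5, 7.5, 9.0]
    ]
  }
}''')


def summarize(payload):
    """Turn rows into dicts keyed by column name, then compute simple stats."""
    dataset = payload["dataset"]
    rows = [dict(zip(dataset["column_names"], row)) for row in dataset["data"]]
    # Skip missing opening prices rather than failing on None.
    opens = [row["Open"] for row in rows if row["Open"] is not None]
    return {
        "highest_open": max(opens),
        "lowest_open": min(opens),
        "largest_daily_range": max(row["High"] - row["Low"] for row in rows),
        "median_close": statistics.median(row["Close"] for row in rows),
    }


summary = summarize(payload)
print(summary)
```

In practice the payload would come from an HTTP request (the unit lists `requests`), and only the parsing and summary steps would stay the same.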

Frequentist Inferential Part A and B

Part A covers the z-statistic, the t-statistic, and the central limit theorem.

Part B applies these ideas to a hospital medical charges dataset.
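
For reference, a one-sample t-statistic can be computed by hand as below. This is a minimal sketch with made-up numbers, not the medical charges data; the notebooks list scipy, which provides the same calculation along with p-values.

```python
import math
import statistics


def t_statistic(sample, mu0):
    """One-sample t-statistic: (sample mean - mu0) / (s / sqrt(n))."""
    n = len(sample)
    mean = statistics.mean(sample)
    s = statistics.stdev(sample)  # sample standard deviation (n - 1 denominator)
    return (mean - mu0) / (s / math.sqrt(n))


# Made-up sample for illustration only.
sample = [12, 14, 16, 18, 20]
t = t_statistic(sample, mu0=14)
print(round(t, 4))
```

The z-statistic has the same form but uses a known population standard deviation in place of `s`.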

Inferential Statistics - Bootstrapping

Uses the same medical charges dataset but applies the bootstrap method instead.

Inferential Stats Bootstrap
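
A percentile bootstrap confidence interval for the mean can be sketched as follows. The `charges` values are invented placeholders, the function name and defaults are illustrative, and the notebook itself lists numpy and pandas for this work.

```python
import random
import statistics


def bootstrap_mean_ci(data, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for the mean of `data`."""
    rng = random.Random(seed)  # seeded so the result is reproducible
    n = len(data)
    # Resample with replacement and record the mean of each resample.
    means = sorted(
        statistics.fmean(rng.choices(data, k=n)) for _ in range(n_resamples)
    )
    lower_idx = int(n_resamples * alpha / 2)
    upper_idx = int(n_resamples * (1 - alpha / 2)) - 1
    return means[lower_idx], means[upper_idx]


# Invented charges for illustration only.
charges = [1200.0, 3400.0, 2100.0, 8800.0, 1500.0,
           4300.0, 2700.0, 15000.0, 1900.0, 3600.0]
low, high = bootstrap_mean_ci(charges)
print(low, high)
```

Unlike the t-based interval, this approach makes no normality assumption; it relies only on resampling the observed data.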

Bayesian Inference

Uses the same medical charges dataset but applies Bayesian statistics.

Bayesian Stats

Linear Regression Boston Housing Data Set

This is a very quick run-through of some basic statistical concepts, adapted from Lab 4 in Harvard's CS109 course.

* Linear Regression Models
* Prediction using linear regression

Linear Regression
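
To make the two topics above concrete, here is a from-scratch sketch of fitting a line by ordinary least squares and using it for prediction. It is a standard-library stand-in with toy data, not the Boston housing data; the unit itself lists sklearn.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept (one feature)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(x, slope, intercept):
    """Predict y for a new x from the fitted line."""
    return slope * x + intercept


# Toy data lying exactly on y = 2x + 1 (illustration only).
xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]
slope, intercept = fit_line(xs, ys)
print(slope, intercept, predict(5, slope, intercept))
```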

Heights and Weights Logistic Regression

Logistic regression mini-project adapted from Lab 5 of CS109.

Logistic Regression

Predicting Movie Ratings from Reviews Using Naive Bayes

Performs basic text analysis on a subset of Rotten Tomatoes movie review data.

Naive Bayes Movie Reviews

Customer Segmentation Using Clustering

Clusters customers into groups using marketing and newsletter/email campaign data. This is an unsupervised learning exercise.

Customer Segmentations Clustering

Find 2-3 Job Titles WordCloud

An exercise that applies word cloud visualization to job postings to see what employers want in candidates.

Word Cloud
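
A word cloud is driven by word frequencies, so the counting step can be sketched on its own. The posting text and the tiny stopword set below are invented for illustration; the notebook lists `wordcloud` and `nltk`, which would handle rendering and a fuller stopword list.

```python
import re
from collections import Counter

# Tiny illustrative stopword set (assumption); nltk offers a fuller list.
STOPWORDS = {"and", "the", "in", "with", "of", "a", "to"}


def word_frequencies(text):
    """Lowercase the text, split into alphabetic words, and drop stopwords."""
    words = re.findall(r"[a-z]+", text.lower())
    return Counter(word for word in words if word not in STOPWORDS)


# Invented posting text for illustration only.
postings = (
    "Experience with Python and SQL. "
    "Python skills and machine learning experience."
)
freqs = word_frequencies(postings)
print(freqs.most_common(2))
```

The resulting counts map each word to its weight, which is the information a word cloud uses to size each term.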

Spark Mini-Project DataBricks

Learning to use Spark to handle a huge dataset. The data covers job types and payroll.

Spark

Take-Home Challenge Ultimate

The company is similar to Uber or Lyft. The goal was to figure out how to retain customers and convert more basic memberships to premium memberships.

Ultimate Take Home Challenge

Take Home Challenge Relax

Tries to predict which users will go on to become active users. The dataset resembles data from Slack or other online workspace companies.

Relax Challenge

Author

Justin Huang

jvhuang1786/DSCcareertrack
