Data Science Roadmap 2025

1. Introduction

Data science is a multidisciplinary field that examines raw and structured data sets to produce potentially actionable insights. It focuses on asking the right questions rather than on finding exact answers. Data scientists need skills centered on computer science, mathematics, and statistics. They analyze data with techniques such as machine learning, trend analysis, linear regression, and predictive modeling. The tools they use to apply these techniques include Python and R.

  • There are small differences between the job titles:

[Image: comparison of data science job titles]

  • For more details about each job title, you can see this Arabic video or this other video.

2. Levels

The roadmap is divided into 4 main levels, with the Advanced level split into two parts (A and B). Each level is divided into weeks, and each week has a set of tasks to complete. We will add the task links one by one as they are finished. Each level is designed to take 1-3 months on average; however, the actual time may vary from person to person.

  1. Entry: Good introduction to the field.
  2. Beginner: Data scientist toolkit and foundations.
  3. Intermediate: Dive deeper and build a solid understanding of working with data.
  4. Advanced A: Mathematics and Machine Learning.
  5. Advanced B: Deep Learning and specializing in a specific field.

2.1. Entry Level

It includes the following topics:

  • Data Literacy
  • Understanding Data Science
  • Introduction to Statistics
  • Python Basics
  • OOP in Python
📅 Phase 📚 Topics 📖 Resources ✅ Tasks
Week 1
  1. 📊 Data Literacy
  2. 🔍 Understanding Data Science
  1. 🏆 Complete the Data Literacy course.
  2. 🏆 Complete the Understanding Data Science course.
  3. 📝 MCQ Quiz.
Week 2
    📊 Introduction to Statistics
  1. 🏆 Complete the Introduction to Statistics course.
  2. 📝 MCQ Quiz.
Week 3 🐍 Python Basics
  1. 🏆 Complete the Introduction to Python Udacity course.
  2. 💡 Problem-solving exercises.
  3. 📝 Quiz.
Week 4 🐍 OOP in Python
  1. 🏆 Complete the OOP in Python course.
  2. 🎓 Capstone Project and presentation.
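
As a warm-up for the Week 4 capstone, the sketch below combines three Entry Level topics (statistics, Python basics, and OOP) using only the standard library. The `Dataset` class, its method names, and the sample values are hypothetical and chosen purely for illustration; they do not come from any of the courses.

```python
import statistics


class Dataset:
    """A tiny wrapper around a list of numbers (hypothetical example class)."""

    def __init__(self, name, values):
        if not values:
            raise ValueError("values must not be empty")
        self.name = name
        self.values = list(values)

    def mean(self):
        return statistics.mean(self.values)

    def median(self):
        return statistics.median(self.values)

    def spread(self):
        # Sample standard deviation; needs at least two values.
        return statistics.stdev(self.values)

    def summary(self):
        return {
            "name": self.name,
            "n": len(self.values),
            "mean": self.mean(),
            "median": self.median(),
        }


# Made-up sample data for illustration only.
heights = Dataset("heights_cm", [160, 165, 170, 175, 180])
print(heights.summary())
```

Keeping the data and the operations on it in one class is the main OOP idea here; a capstone project could extend it with more statistics or input validation.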

2.2. Beginner Level

It includes the following topics:

  • NumPy
  • Pandas
  • Matplotlib
  • Seaborn
  • Power BI
  • Git & GitHub
📅 Phase 📚 Topics 📖 Resources ✅ Tasks
Week 1 🔢 NumPy
  1. 🏆 Complete Keith Galli's NumPy tutorial.
  2. 📝 Quiz.
Week 2 🐼 Pandas
  1. 🏆 Complete Corey Schafer's course.
  2. 📝 Quiz.
Week 3 〽️ Matplotlib
  1. 🏆 Complete Understanding Data Visualization course.
  2. 🏆 Complete Corey Schafer's course.
  3. 💡 Practice and document all that you learned in a notebook.
Week 4 🌊 Seaborn
  1. 🏆 Complete Kimberly Fessel's course.
  2. 💡 Practice and document all that you learned in a notebook.
Week 5 📊 Power BI
  1. 🏆 Complete Alex The Analyst's course.
  2. 💡 Make a Power BI dashboard.
Week 6 🔗 Git & GitHub
  1. 🏆 Complete any course.
  2. 📝 Quiz.

2.3. Intermediate Level

It includes the following topics:

  • Regular Expressions (RegEx)
  • Data Cleaning
  • Feature Engineering
  • Exploratory Data Analysis
  • Web Scraping
  • Structured Query Language (SQL)
📅 Phase 📚 Topics 📖 Resources ✅ Tasks
Week 1 🔣 Regular Expressions (Regex)
  1. 🏆 Complete the Regular Expressions in Python course.
  2. 📝 Quiz.
Week 2 🧹 Data Cleaning
  1. 🏆 Complete the Cleaning Data in Python course.
  2. 📝 Read the blog and extract your insights from it.
  3. 💡 Practice in a notebook with any dataset.
Week 3 🛠 Feature Engineering
  1. 🏆 Complete the Feature Engineering for Machine Learning in Python course.
  2. 📝 Read the blog and extract your insights from it.
  3. 💡 Practice in a notebook with any dataset.
Week 4 🔍 Exploratory Data Analysis (EDA)
  1. 🏆 Complete the Exploratory Data Analysis in Python course.
  2. 📝 Read the blog and extract your insights from it.
  3. 💡 Practice in a notebook with any dataset.
Week 5 🕸 Web Scraping
  1. 🏆 Complete Codezilla's Web Scraping with Python course.
  2. 💡 Practice and scrape any website.
Week 6 🗃 Structured Query Language (SQL)
  1. 🏆 Complete the skill track.
  2. 💡 Practice problems on HackerRank.
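
To give a feel for how several Intermediate topics fit together, here is a minimal sketch that cleans a few messy records with regular expressions and then queries them with SQL. It uses only the standard library (`re` and `sqlite3`). The records, the `clean_row` helper, and the `contacts` table are invented for this example.

```python
import re
import sqlite3

# Made-up raw records with inconsistent spacing, casing, and phone formats.
raw_rows = [
    "  Alice SMITH ; +1 (555) 010-1234 ",
    "bob jones;555.010.5678",
    "Carol  Lee ; 555 010 9999",
]


def clean_row(row):
    """Split a 'name ; phone' record and normalise both fields."""
    name, phone = row.split(";")
    # Collapse runs of whitespace and use title case for names.
    name = re.sub(r"\s+", " ", name).strip().title()
    # Drop every non-digit character, then keep the last 10 digits.
    digits = re.sub(r"\D", "", phone)[-10:]
    return name, digits


cleaned = [clean_row(r) for r in raw_rows]

# Load the cleaned rows into an in-memory SQLite database and query them.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE contacts (name TEXT, phone TEXT)")
conn.executemany("INSERT INTO contacts VALUES (?, ?)", cleaned)
result = conn.execute(
    "SELECT name FROM contacts WHERE phone LIKE '555010%' ORDER BY name"
).fetchall()
conn.close()

print(cleaned)
print(result)
```

Real datasets will need more defensive parsing (missing separators, empty fields), which is a good exercise for the data cleaning week.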

2.4. Advanced A Level

It includes the following topics:

  • Math required for Machine Learning:
    • Linear Algebra
    • Multi-variate Calculus
  • Machine Learning Algorithms:
    • Supervised Learning
    • Unsupervised Learning
    • Ensemble Learning
  • Model Evaluation and Selection
  • APIs
📅 Phase 📚 Topics 📖 Resources ✅ Tasks
Week 1 🔢 Linear Algebra
  1. 🏆 Complete the Imperial College London course.
  2. 📝 Quiz.
Week 2 📈 Multi-variate Calculus
  1. 🏆 Complete the Imperial College London course.
  2. 📝 Quiz.

At this stage, you are ready to dive deep into the world of Machine Learning. The following resources are general and not divided into categories or weeks; you can follow them in parallel with the weekly base resources as supplementary material if you want.

Now let's continue the roadmap week by week.

📅 Phase 📚 Topics 📖 Resources ✅ Tasks
Weeks 3 - 4 🕵️ Supervised Learning
  1. 🏆 Complete Andrew Ng's course.
  2. 🏆 Complete the DataCamp course.
  3. 📝 Quiz.
Weeks 5 - 14 📖 Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 3rd Edition
    Every week, we will study a chapter from the book until we reach chapter 9, the end of the machine learning section in the book.
  1. 🏆 Study the chapter and summarize it.
  2. 📝 Quiz on each chapter.
Weeks 15 - 16
  • 🤖 Unsupervised Learning
  • ֎ Reinforcement Learning
  1. 🏆 Complete the DataCamp course.
  2. 🏆 Complete Andrew Ng's course.
  3. 📝 Quiz.
Weeks 17 - 18
  • 🌲 Ensemble Learning
  • ⚛ Neural Networks
  1. 🏆 Complete the DataCamp course.
  2. 🏆 Complete Andrew Ng's course.
  3. 📝 Quiz.
Week 20 🌐 APIs
  1. 🏆 Complete the DataCamp courses.
  2. 💡 Practice.

At this stage, you have a strong foundation in machine learning algorithms and how they work. You have also learned about APIs and how to use them. You are now ready to train models, practice on datasets, and build projects with the algorithms you learned. You may also implement a machine learning algorithm from scratch; it is great practice for understanding the algorithms more deeply.
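
As one possible starting point for the from-scratch exercise, the sketch below fits a one-variable linear regression with batch gradient descent in plain Python. The function name, the learning rate, the epoch count, and the toy data are all assumptions made for this illustration; they are not taken from any of the courses listed above.

```python
def fit_linear_regression(xs, ys, learning_rate=0.05, epochs=2000):
    """Fit y ≈ w * x + b by batch gradient descent on mean squared error."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        # Prediction errors for the current parameters.
        errors = [(w * x + b) - y for x, y in zip(xs, ys)]
        # Gradients of MSE = (1/n) * sum(error^2) with respect to w and b.
        grad_w = (2.0 / n) * sum(e * x for e, x in zip(errors, xs))
        grad_b = (2.0 / n) * sum(errors)
        # Step against the gradient.
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b


# Made-up, noise-free data generated from y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
w, b = fit_linear_regression(xs, ys)
print(round(w, 3), round(b, 3))
```

On this toy data the fitted parameters should approach `w = 2` and `b = 1`. Comparing the result against scikit-learn's `LinearRegression` on the same data is a useful sanity check.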

2.5. Advanced B Level

In this stage, you will enter the world of Deep Learning and NLP. It is divided into three phases:

  • Phase 1: Basic concepts of Deep Learning (NN, CNN, RNN, Backpropagation, Optimizers, etc.)
  • Phase 2: Transformers and LLMs.
  • Phase 3: NLP fields.

2.5.1. Phase 1: Basic Concepts of Deep Learning

📅 Phase 📚 Topics 📖 Resources ✅ Tasks
Weeks 1 - 3 🧠 Basic concepts of Deep Learning
  1. 🏆 Complete Andrew Ng's course.
  2. 🏆 Complete Andrej Karpathy's course.
  3. 🏆 Complete the DataCamp course.
  4. 📝 Practice on a dataset.
Weeks 3 - 4
  • 🚀 Optimizers
  • 🔦 PyTorch
  1. 🏆 Complete Andrew Ng's course.
  2. 🏆 Complete the PyTorch Tutorials.
  3. 📝 Practice in a notebook.
Week 5
  • 🛠️ Structuring Machine Learning projects
  • 🔁 Transfer Learning
  1. 🏆 Complete Andrew Ng's course.
  2. 🏆 Complete the PyTorch Tutorials.
  3. 📝 Practice in a notebook.
Weeks 6 - 8 🌀 Convolutional Neural Networks (CNN)
  1. 🏆 Complete Andrew Ng's course.
  2. 🏆 Complete Khaled El-Hady's videos.
  3. 📝 Choose one task (for example, segmentation), find an online tutorial on how to implement it, and try writing the code by hand.
Weeks 9 - 11 ⟳ Recurrent Neural Networks (RNN)
  1. 🏆 Complete Andrew Ng's course.
  2. 🏆 Complete DataCamp courses.
  3. 📝 Build a text classification project and implement as much of it yourself as you can.
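
Backpropagation, listed among the Phase 1 basics, is the chain rule applied layer by layer. The sketch below works it out by hand for a deliberately tiny network (one input, one tanh hidden unit, one linear output) and checks the result against finite differences. The network shape, parameter values, and function names are assumptions chosen for illustration; PyTorch's autograd does the same job automatically.

```python
import math


def forward(params, x, y):
    """Tiny network: x -> tanh hidden unit -> linear output, squared-error loss."""
    w1, b1, w2, b2 = params
    h = math.tanh(w1 * x + b1)
    y_hat = w2 * h + b2
    loss = 0.5 * (y_hat - y) ** 2
    return loss, h, y_hat


def backward(params, x, y):
    """Gradients of the loss with respect to each parameter via the chain rule."""
    w1, b1, w2, b2 = params
    _, h, y_hat = forward(params, x, y)
    d_y_hat = y_hat - y            # dL/dy_hat
    d_w2 = d_y_hat * h             # dL/dw2
    d_b2 = d_y_hat                 # dL/db2
    d_h = d_y_hat * w2             # dL/dh
    d_z = d_h * (1.0 - h * h)      # tanh'(z) = 1 - tanh(z)^2
    d_w1 = d_z * x                 # dL/dw1
    d_b1 = d_z                     # dL/db1
    return [d_w1, d_b1, d_w2, d_b2]


def numerical_gradient(params, x, y, eps=1e-6):
    """Central finite differences, used only to check the analytic gradients."""
    grads = []
    for i in range(len(params)):
        plus = list(params)
        minus = list(params)
        plus[i] += eps
        minus[i] -= eps
        grads.append((forward(plus, x, y)[0] - forward(minus, x, y)[0]) / (2 * eps))
    return grads


params = [0.5, -0.1, 0.8, 0.2]  # arbitrary starting values
analytic = backward(params, x=1.5, y=1.0)
numeric = numerical_gradient(params, x=1.5, y=1.0)
max_diff = max(abs(a - n) for a, n in zip(analytic, numeric))
print(max_diff)
```

A very small `max_diff` indicates that the hand-derived gradients agree with the numerical ones; the same gradient check is a handy debugging tool when implementing larger networks by hand.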

2.5.2. Phase 2: Transformers and LLMs

📅 Phase 📚 Topics 📖 Resources ✅ Tasks
Weeks 12 - 13 🤖 Transformers
  1. 🏆 Read the paper carefully and try to write down what you got from it.
  2. 📝 Build the transformer model from scratch while following along with the video; you don't have to build it on your own the first time.
Weeks 14 - 15 ֎ Large Language Models (LLMs)
  1. 🏆 Complete StatQuest's video.
  2. 🏆 Complete the Coursera course.
  3. 🏆 Finish the Hugging Face tutorial.
  4. 📝 Build a GPT model from scratch while following along with the video.
  5. 📝 Use the libraries you learned to build an NLP project.
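
Before building a full transformer from scratch, it can help to see its core operation in isolation. The sketch below implements scaled dot-product attention, softmax(QKᵀ / √d_k)V, in plain Python with nested lists. The function names and the toy inputs are assumptions for illustration; a real implementation would use PyTorch tensors and add masking, multiple heads, and learned projections.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(queries, keys, values):
    """Return (outputs, weights) for softmax(Q K^T / sqrt(d_k)) V."""
    d_k = len(keys[0])
    n_cols = len(values[0])
    outputs, weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in keys
        ]
        w = softmax(scores)
        # Weighted average of the value vectors.
        out = [sum(wj * v[c] for wj, v in zip(w, values)) for c in range(n_cols)]
        weights.append(w)
        outputs.append(out)
    return outputs, weights


# Made-up toy inputs: 2 queries, 3 key/value pairs, dimension 2.
Q = [[1.0, 0.0], [0.0, 1.0]]
K = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
V = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
outputs, weights = scaled_dot_product_attention(Q, K, V)
print(weights)
print(outputs)
```

Each row of `weights` sums to 1, so every output row is a weighted average of the value vectors, with larger weights on the keys that match the query best.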

2.5.3. Phase 3: NLP fields

  • NLP has many sub-fields; one of them is Retrieval-Augmented Generation (RAG).
  • Start by learning LangChain and LangGraph.
  • We also recommend Abu Bakr Soliman's course, which covers many of the concepts and tools needed to build a strong project, such as FastAPI, Docker, MongoDB, and the MVC design pattern.
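
To make the idea behind RAG concrete, here is a toy sketch of its retrieval step: pick the document most similar to the question and place it in the prompt. It deliberately swaps learned embeddings and a vector database for simple word counts and cosine similarity, and it does not use LangChain or LangGraph. All function names, documents, and the prompt wording are invented for illustration.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercase word counts; a crude stand-in for a learned embedding."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, documents, k=1):
    """Return the k documents most similar to the question."""
    q_vec = bag_of_words(question)
    ranked = sorted(
        documents, key=lambda d: cosine(q_vec, bag_of_words(d)), reverse=True
    )
    return ranked[:k]


# Made-up mini corpus for illustration only.
documents = [
    "Docker packages an application and its dependencies into a container.",
    "MongoDB stores data as flexible JSON-like documents.",
    "FastAPI is a Python framework for building web APIs.",
]
question = "Which framework helps with building APIs in Python?"
context = retrieve(question, documents, k=1)

# In a real RAG system this prompt would be sent to an LLM.
prompt = f"Answer using this context:\n{context[0]}\n\nQuestion: {question}"
print(prompt)
```

In a full project, the retrieved context and the question would be passed to a language model, and the word-count vectors would be replaced by embeddings stored in a database.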

> More will be added, and we will try to keep this roadmap updated with the latest resources.