• Stars
    star
    437
  • Rank 99,659 (Top 2 %)
  • Language
  • License
    Apache License 2.0
  • Created about 2 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Roadmap for Data Science

Logo

One-stop solution for all your Data Science learning needs.

All in one place, the best resources to learn Data Science with comprehensive and detailed roadmaps.
Go to website

Join our discord Antern Community and ask your questions there.

Data Science Roadmap 🤖

All in one place, the best resources to learn Data Science with comprehensive and detailed roadmaps. Data Science is a vast field and it is very difficult to find the best resources to learn it. This repository is an attempt to solve this problem. It contains the best resources to learn Data Science with comprehensive and detailed roadmaps.

It also contains the best resources to learn Machine Learning, Deep Learning, Data Analysis, Data Visualization, and much more. This repository is a one-stop solution for all your Data Science learning needs.

It is a continuously evolving repository and I will keep adding more resources to it. If you have any suggestions or want to contribute to this repository, feel free to open an issue or a pull request.

Table of Contents

I will divide the resources into different levels of learning and will also provide the best resources to learn each topic. The levels of learning are:

  • Getting ready to learn data science
  • Core Data Science Fundamentals
  • Intermediate Data Science
  • Advanced Data Science
  • Data Science Projects
  • Guide to Data Science Interviews

Let's get ready to learn data science 🚀

Topic Resources & Links
Linear Algebra Introduction to Linear Algebra by Gilbert Strang, Book, Linear Algebra by Antern, Course, Linear Algebra for Dummies Book by Mary Jane Sterling
Calculus Calculus for Dummies, Book , Single Variable Calculus Course by Antern
Statistics & Probability Statistics for Dummies, Book, Probability for Dummies, Book, Statistics and Probability Course by Antern
Basics of Information Theory Information Theory by d2l.ai

Evaluate your skills and prepare for Interviews 📝

Topic Resources & Links
Linear Algebra Questions Linear Algebra Interview questions
Statistics & Probability Interview Questions Link 1, Link 2, Link 3, Link 4

Learning Tip 1 💁: If you're are a beginnner and not able to answer interview questions, it's totally ok, you can look upto the solutions and solve similar types of problems on your own to practice those types of questions. You don't need to learn every concept, if you're not able to understand it, just skip it and move on to the next topic and review next day or try to get help from communities such as discord communities.

Interview Tip 1 💁: While answering questions, try to explain in such a way that you're building your solution from base, if you know the answer, start with explaining how you reached to that answer, don't tell your answer, explain your thought process. Interviews wants to check your problem solving skills. Even if you give wrong answer but your thought process is correct, interviewer might be impressed.

Programming Languages & Data Structures and Algorithms 🖥️

Lecture Topics Resources & Links
Core Python Durga Sir Python, or Corey Schafer
Intermediate Python Corey Schafer
Advance Python Durga Sir Advance Python
Core Software Engineering Principle Robust Python & Design Patterns
Data Structures and Algorithms Data Structures and Algorithms in Python, Introduction to Algorithms, MIT 6.006

Learning Tip 2 💁: If you're a beginner and learning python, it will require time to reiterate several times to understand a concept, & trust me it's totally worth it. As said learning Data science requires time and learning the hard way rather than shortcuts which will make you nowhere. So, don't get demotivated if you're not able to understand a concept, just keep trying and you'll get it.

Learning Tip 3 💁: Data structures and algorithms is becoming one of the important topics in data science interview as well in giant companies, so it's important to learn it. Not only from the perspective of interviews, learning it and solving problems using dsa makes your problem solving skill and criticial thinking much more better than before and you will be having several tools in your toolbox to solve any problem. So I suggest to learn a particular topic and solve several questions on it, we will soon be adding several problems on this page to practice for data science.

Core Data Science Fundamentals 🚀

According to Harvard business School, Data science is the process of deriving meaningful insights from raw data. Data science aims to make sense of the copious amounts of data, also referred to as big data, that today’s organizations maintain.

Frameworks 📚

Topics Resources & Links
Pandas Pandas user Guide, Getting started with Pandas,Python for Data Analysis: Data Wrangling with pandas, NumPy, and Jupyte, Book, Data School
Numpy Numpy Learn docs
Matplotlib Matplotlib Tutorial, Corey Schafer Matplotlib Tutorials

Data Visualization & Analysis

Topics Resources & Links
Data Analysis Python for Data Analysis, Head First Data Analysis: A Learner's Guide to Big Numbers, Statistics, and Good Decisions
Data Visualization Fundamentals of Data Visualization: A Primer on Making Informative and Compelling Figures

Learning Tip 4 💁: Learning frameworks is not a big deal, but the way you use frameworks to analyze data, visualize data and solve problems is what matters. So, I suggest to understand the CRUX of data analysis and data visualization and use the frameworks to build your solution. If you don't know the actual CRUX of data visualization, analysis, then there is no point in learning frameworks. and If you don't know how to work with data, then there is no point in learning ML.

SQL 📊

Topics Resources & Links
SQL SQL for Data Analysis Cathy Tanimura or Learning SQL
Practicing SQL SQL Cookbook by By Anthony Molinaro, DataLemur

Note: People usually have question around learning Big data tools in initial phases of data science, I personally think, it's not necessary to learn big data tools in initial phases of data science, but if you're interested in learning it, you can learn it later on. There are different perspectives on this, i would like you to check out the answers from this quora answer.

Intermediate Data Science 🚀

Machine Learning 🤖

Machine Learning Roadmap

I have made a separate page for machine learning, you can check it out here. I also given my personal opinion on machine learning and how to learn it in the most efficient way possible in the form of a video, you can check it out here, which got more than 150k views.

Advance Data Science 🚀

Topics Resources & Links
Deep Learning courses Yann LeCun’s Deep Learning Course at CDS, CS230 Deep Learning, Antern's ML002, Deep Learning: CS 182
Deep Learning books Deep Learning Book, Deep Learning with Python, Deep Learning for Coders with fastai and PyTorch
Natural Language Processing CS224n: Natural Language Processing with Deep Learning
Computer Vision Stanford Computer Vision
Machine Learning Operations MadewithML

Data Science Projects 📈

Before starting with any project, I would suggest you to go through this video, which will help you to understand the process of building a data science project which can help you to land a job.

We will be publishing a detailed blog and a video which walks you through a procedure to finding and building impactful data science project. It will be out soon, till then we suggest you to go through the following resources for inspiration:-

Taking part in competitions is also a great way to learn and build your portfolio, you can check out the following platforms for competitions:-

  • DrivenData
  • Kaggle
  • Analytics Vidhya
  • Zindi

Guide to Data Science Interviews

We will be publishing Interviews guide for every topic, but till then you can go through the following resources:-

Upcoming Topics 📚

This repository is a work in progress, we will be adding more topics in the future, you can check out the following topics which we will be adding in the future:-

  • Detailed Interview Guides
  • Detailed Project Guides
  • Detailed Guide to Data Science Portfolio
  • Detailed Guide to Data Science Resume
  • Detailed Guide to Data Science Cover Letter
  • Other ways to get spotted by recruiters

Contributions 🤝

We are open to contributions, if you want to contribute to this repository, you can check out the contributing guidelines. You can also contribute by sharing this repository with your friends and colleagues.

More Repositories

1

ML001-Project-Sources-Code-and-Learning-Materials

ML001 Sources Code and Learning Materials
Jupyter Notebook
308
star
2

mlops-roadmap

Roadmap for MLOps
56
star
3

Data-Science-Project-Loan-Prediction-System-With-Deployment-

Hi Everyone Glad to see your interest in this repo and welcome, we will be working on end to end data science project which is "Loan Prediction System" we will also make a website and integrate ml model in backend. It will be lot of fun over there.
Jupyter Notebook
16
star
4

ayush714

Config files for my GitHub profile.
Jupyter Notebook
14
star
5

Fashion-Clothes-Generation-Using-GANS-

Fashion Clothes Generation Using GANS
Jupyter Notebook
10
star
6

Covid-19-Fake-News-Detection-System-

Covid-19 Fake News Detection System
Jupyter Notebook
9
star
7

jekyll-serif-theme

SCSS
5
star
8

SMS_Spam_Detection

Python
5
star
9

retail-price-optimization-mlops

retail-price-optimization-mlops
3
star
10

Analytics-vidhya-Blog-Projects-and-Codes-

Jupyter Notebook
3
star
11

CS004-Core-Python-Programming-

Python
3
star
12

Iris-Flowers-Detection-System-

We will build a system that will detect flowers and we will not only code it and we will follow the whole ml pipeline as well as we will code an beautiful website, and we will use flask in backend.
Jupyter Notebook
3
star
13

Diabetes-Prediction-System

Jupyter Notebook
2
star
14

customer-satisfaction

Python
2
star
15

Breat-Cancer-Detection-Using-Logistic-Regression-From-Scratch-

This Repository is by me, Where I build a Breast Cancer Detection Using Logistic Regression From Scratch without using any libraries for training. And Also I explained each and every term, with the mathematical intution.
Jupyter Notebook
2
star
16

HR-Analytics-Prediction-

This was a Problem I have seen in Analytics Vidhya, So, Here is my Approach following full data science lifecycle.
Jupyter Notebook
2
star
17

Ship30For30

Ship30For30
Python
1
star
18

ComlpexNumber

Complex Number Class
Java
1
star
19

Sentiment-Analysis-Classifier-From-Scratch

Jupyter Notebook
1
star
20

Problems_Solved

Java
1
star
21

ayush-ml-blog

Jupyter Notebook
1
star
22

core-deep-learning-course

Core Deep Learning Course
1
star