Roadmap

Global Grading System

Uploaded by

m.akbari

Roadmap

Note: This is how I got into the field of data science and machine learning, so this document
should not be considered a standard way to approach the subject. It gives an overall view of
the steps towards becoming a data scientist or machine learning engineer. The items written in
blue are more advanced and not critical.

I have also added a few points about each part in the subsequent sections, and you can find the
motivation behind some parts in the remarks section.

1 Roadmap
The basic steps to become a data scientist:
• Start with a basic understanding of the Python and R programming languages
• Data retrieval techniques: become familiar with databases (relational, non-relational
and graph databases) and their query languages (SQL, the query languages of NoSQL
databases like MongoDB and Elasticsearch, Cypher for graph databases, and so on)
• Data models and data structures
• Learn as much maths as you can
• Basic machine learning algorithms (supervised and unsupervised algorithms, etc.)
• Dive into a real-world problem and learn how to translate a business problem into the
corresponding data science problem and translate the findings back into business
insight
• Hands-on advanced programming with Python (implementation of different ML algorithms,
especially implementing different neural networks using PyTorch and/or TensorFlow)
• Learn advanced mathematical techniques related to each specific problem
• Reinforcement Learning

1.1 Programming with Python and R

Although Python is the main programming language for data analysis and machine learning,
R is often the more convenient choice for statistical learning tasks. So you should learn how
to use R as well.
Both languages are fairly easy to learn. It should take you less than a month to get familiar
with the elements of both.
To learn Python and R, I suggest you look at one of the courses on Udacity. In my experience
they have good teachers, and the courses cover the most important material efficiently.
But if you use a course like this or read a book to learn Python/R, make sure that you have
learned the following subjects:

• Basic programs with Python/R and the different operators
• The notion of classes (which is not as comprehensive as in languages like Java or Scala,
mainly because they were added to Python later)
• Data manipulation libraries like dplyr (R) and pandas (Python)
• Data visualization techniques and libraries that help you visualize data, like matplotlib,
plotly, seaborn, etc. This part is essential, especially if you want to be a good data analyst.
• Libraries that help you connect to different databases, like psycopg2, pymongo and
elasticsearch
• Working with matrices and arrays: the NumPy library
For the deep learning part, I have attached one of Udacity's introductory courses on deep
learning using PyTorch. The GitHub projects of this course can be found in this repository:
deep learning with pytorch. You will need this material when you reach advanced programming
for machine learning and AI, so do not go through it before learning the basics of
implementing different ML algorithms.
Keep in mind that you do not need to learn all of this completely; if you miss some points,
do not worry. You only have to know the pillars of these programming languages. The actual
learning comes as you work on real projects. There you will use these tools thousands of
times, and you will eventually become comfortable with all of this material.

1.2 Databases
I am not an expert in this area, so this section is very elementary and covers only what you
need to know about databases and how to query them.
Learning some flavor of SQL is pretty straightforward. These are some flavors you could try:
MySQL, Microsoft SQL Server, PostgreSQL. The skeleton of all of them is the same, with a few
differences.
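To give a feel for that common skeleton, here is a minimal sketch using SQLite through
Python's built-in sqlite3 module. The tables, column names and rows are hypothetical and
invented only for illustration; the same SELECT / JOIN / GROUP BY pattern should carry over
to the other flavors with minor syntax differences.

```python
import sqlite3

# In-memory database with a hypothetical schema (illustration only).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL);
    INSERT INTO customers VALUES (1, 'Ada'), (2, 'Linus');
    INSERT INTO orders VALUES (1, 1, 20.0), (2, 1, 5.5), (3, 2, 12.0);
""")


def total_per_customer(connection):
    """Join the two tables and sum the order amounts per customer."""
    query = """
        SELECT c.name, SUM(o.amount) AS total
        FROM customers AS c
        JOIN orders AS o ON o.customer_id = c.id
        GROUP BY c.name
        ORDER BY total DESC
    """
    return connection.execute(query).fetchall()


totals = total_per_customer(conn)
print(totals)
```

The query returns one (name, total) row per customer, sorted by the total amount.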
For document-based databases like MongoDB, I strongly suggest that you go through the material
provided by the official website: MongoDB
The same holds for Elasticsearch. The best reference is the one provided by Elasticsearch.

∗∗∗
Graph databases are a little different. They store not only data points as entities, but also
the relations between them. Neo4j is a good example of a database of this kind (Neo4j query
language).

fig 1: How a graph database stores and retrieves data

Neo4j is not the only one you could work with. Oracle is also capable of storing data with the
desired structure, but as far as I know, many of the advanced Python libraries for graph learning
problems (like NetworkX) are compatible with Neo4j.
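The idea of storing relations next to the entities can be sketched with a toy in-memory graph.
This is only an illustration of the data model, not how Neo4j or any other graph database is
implemented; the class TinyGraph, the node names and the relation names are all hypothetical.

```python
from collections import defaultdict


class TinyGraph:
    """Toy property graph: nodes carry properties, edges carry a relation name."""

    def __init__(self):
        self.nodes = {}                 # node id -> dict of properties
        self.edges = defaultdict(list)  # node id -> list of (relation, target id)

    def add_node(self, node_id, **properties):
        self.nodes[node_id] = properties

    def add_relation(self, source, relation, target):
        self.edges[source].append((relation, target))

    def neighbours(self, source, relation):
        """Follow every outgoing edge of the given relation type."""
        return [target for rel, target in self.edges[source] if rel == relation]


graph = TinyGraph()
graph.add_node("alice", kind="Person")
graph.add_node("bob", kind="Person")
graph.add_node("acme", kind="Company")
graph.add_relation("alice", "KNOWS", "bob")
graph.add_relation("bob", "WORKS_AT", "acme")

# Two-hop question: where do the people that Alice knows work?
employers = [
    company
    for person in graph.neighbours("alice", "KNOWS")
    for company in graph.neighbours(person, "WORKS_AT")
]
print(employers)
```

Answering the two-hop question only requires following stored edges, which is the kind of
query a graph database is designed to make cheap.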
While learning the elements of data retrieval and getting familiar with data structures,
make sure that you understand the difference between structured and unstructured data, and
that you know how each database stores and indexes data.
Keep in mind that as a data scientist, you do not need to dive deep into these subjects. But as
part of data manipulation, you will have to know how to get your data in an efficient way.
The list presented above is by no means comprehensive, but these are the main references for
databases that store data in different ways. If you know how they work, you should be able to
work with almost any other database.

1.3 Mathematics
Almost all problems in the field of artificial intelligence and machine learning are mathematical
problems, so to enter the field you must have a good grasp of different mathematical
subjects. As a matter of fact, learn as much maths as you can. When you encounter various
AI/ML algorithms, it is usually crucial to know how they work and what mathematical
concepts are used to derive them. This not only helps you choose the best algorithms for
attacking a problem, but also equips you with ways to optimize the algorithms and even gives
you the ability to create new algorithms from existing ones.

The main mathematical concepts you must be familiar with are listed below. The ones written
in red are more advanced and require more time and effort, but I have attached some references
to help you get on board with them and understand the motivation behind them.
I have also attached slides I have taught from before, which give an overview of the concepts.
• Probability theory (frequentist view and Bayesian view) and Bayesian inference. Knowing a
few concepts from statistical analysis, like hypothesis testing, is also necessary
(slides)
• Linear Algebra (slides)
• Optimization methods (slides)

• Stochastic processes and time series analysis
• Topology (very basic)
• Markov chain Monte Carlo (MCMC) sampling methods
• Probabilistic graphical models (PGMs)
• Control theory, dynamic programming and Bellman equations (for reinforcement learning)
A good reference for the basic concepts that will be critical in machine learning
is Mathematics for Machine Learning. Another excellent one is Bishop's book, Pattern
Recognition and Machine Learning. I have attached both to the document I am sending.
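As a small taste of the frequentist and Bayesian views mentioned in the list above, here is a
minimal sketch for estimating the bias of a coin. The observed counts and the uniform Beta(1, 1)
prior are assumptions made up for this illustration. It relies on the standard fact that a Beta
prior combined with coin-flip data gives a Beta posterior, so the update is just adding counts.

```python
from fractions import Fraction


def beta_binomial_update(alpha, beta, heads, tails):
    """Posterior Beta parameters after observing the given coin flips."""
    return alpha + heads, beta + tails


def beta_mean(alpha, beta):
    """Mean of a Beta(alpha, beta) distribution."""
    return Fraction(alpha, alpha + beta)


# Hypothetical data: 7 heads and 3 tails.
heads, tails = 7, 3

# Frequentist point estimate: the observed frequency of heads.
frequentist_estimate = Fraction(heads, heads + tails)

# Bayesian estimate: start from a uniform Beta(1, 1) prior (an assumption),
# update it with the data and report the posterior mean.
prior = (1, 1)
posterior = beta_binomial_update(*prior, heads, tails)
bayesian_estimate = beta_mean(*posterior)

print(frequentist_estimate, posterior, bayesian_estimate)
```

With so little data the prior pulls the Bayesian estimate slightly towards 1/2; as more flips
are observed, the two estimates get closer to each other.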

1.4 Machine Learning

Machine learning is a pathway to artificial intelligence (AI). This subcategory of AI uses
algorithms to automatically learn insights and recognize patterns from data, applying that
learning to make increasingly better decisions. By studying and experimenting with machine
learning, programmers test the limits of how much they can improve the perception, cognition,
and action of a computer system. Deep learning, an advanced method of machine learning, goes a
step further. Deep learning models use large neural networks (networks loosely inspired by the
human brain) to learn complex patterns and make predictions independent of human input.
The basic machine learning algorithms are generally divided into two major categories:
supervised learning and unsupervised learning algorithms. In the former, we use labeled
data and the machine learns how to label unseen data. The latter is in general harder: the
machine has to detect the general patterns in the data without reference to any label or tag.
The main supervised and unsupervised learning algorithms you should learn are listed below:
♠ Supervised Learning
• Feature engineering
• Basic regression (Linear/Generalized Linear)
• Tree based classification/regression methods
• Support vector machines (SVMs)
• Kernel methods - These are not machine learning algorithms themselves, but they help you
embed your data into high-dimensional Hilbert spaces in order to get rid of the limits
imposed by working in a low dimension.
• Gradient-based methods (which in combination with tree-based methods give rise to
algorithms like XGBoost)
• Dimensionality reduction techniques, like PCA and MCA
• Nonlinear classifiers/regressors like multilayer neural networks (at this step, you do not
need to implement more advanced neural networks; they will come into play when you
are doing AI)
• Parameter learning methods - a deeper understanding of loss functions and of avoiding
overfitting and underfitting
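The remark on kernel methods in the list above can be made concrete with a minimal sketch.
It uses the homogeneous degree-2 polynomial kernel on 2-dimensional inputs, chosen here only
because its explicit feature map is small enough to write down; the function names and the
sample points are hypothetical. The kernel value computed in the original space equals an
ordinary dot product in the higher-dimensional feature space, so the embedding never has to
be built explicitly.

```python
import math


def dot(u, v):
    """Plain dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def poly2_kernel(x, y):
    """Degree-2 polynomial kernel k(x, y) = (x . y)^2, computed in input space."""
    return dot(x, y) ** 2


def poly2_features(x):
    """Explicit 3-dimensional feature map for 2-D input matching poly2_kernel."""
    x1, x2 = x
    return (x1 * x1, math.sqrt(2) * x1 * x2, x2 * x2)


x, y = (1.0, 2.0), (3.0, 0.5)

implicit = poly2_kernel(x, y)                          # no embedding needed
explicit = dot(poly2_features(x), poly2_features(y))   # embed first, then dot product

print(implicit, explicit)
```

Both numbers agree up to floating-point rounding. For richer kernels the feature space can be
very high-dimensional or even infinite-dimensional, which is why working with the kernel
directly is attractive.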
♠ Unsupervised Learning
• Basic clustering algorithms like K-means, self-organizing maps (SOMs) and others
• Embedding methods (like word2vec, doc2vec, etc.)
• Advanced clustering/data visualisation methods like t-SNE and UMAP
Again the list is not comprehensive, but in my opinion, every data scientist must know all of
them.
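To show what detecting patterns without labels looks like in practice, here is a minimal
K-means sketch in plain Python. The data points are made up, and the initial centroids are
simply the first k points, which is a simplifying assumption; library implementations use
more careful initialisation.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, iterations=10):
    """Alternate between assigning points to centroids and recomputing centroids."""
    centroids = [points[i] for i in range(k)]  # naive initialisation (assumption)
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest centroid.
        labels = [
            min(range(k), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        for j in range(k):
            members = [p for p, label in zip(points, labels) if label == j]
            if members:
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return centroids, labels


# Hypothetical data: two well-separated groups of 2-D points.
points = [(1.0, 1.0), (9.0, 9.0), (1.0, 2.0), (2.0, 1.0), (8.0, 9.0), (9.0, 8.0)]
centroids, labels = kmeans(points, k=2)
print(labels)
print(centroids)
```

No label is ever given to the algorithm; the two groups emerge only from the distances
between the points.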

1.5 Artificial Intelligence

Artificial Intelligence is the field of developing computers and robots that are capable of
behaving in ways that both mimic and go beyond human capabilities. AI-enabled programs can
analyze and contextualise data to provide information or automatically trigger actions without
human intervention. Today, artificial intelligence is at the heart of many technologies we use,
including smart devices and voice assistants such as Siri on Apple devices. Companies are
incorporating techniques such as natural language processing and computer vision (the ability
of computers to use human language and interpret images) to automate tasks, accelerate decision
making, and enable customer conversations with chatbots.

The main topics you should become familiar with are:

• Elements of Natural Language Processing (NLP) and text processing
• Different recurrent neural networks (naive RNN, LSTM, GRU, etc.). These are cornerstones
of NLP. They are capable of performing lots of amazing tasks like translation,
auto-completing search text, and almost all the rudimentary tasks that a machine is
expected to do in the area of text mining
• Image processing (machine vision): the well-known convolutional neural networks (CNNs)
and their variations
• Attention networks and transformers (for both text and image processing)
• Implementation of PGMs and their applications. A good example is the Restricted Boltzmann
Machine (RBM), which is one of the most popular algorithms for developing recommender
systems
• If you are interested (or if it is necessary for your job), you could learn about voice
recognition techniques.
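As one concrete example from the list above, the core operation of attention networks can be
sketched in a few lines. This is the scaled dot-product form commonly used in transformers,
written in plain Python for readability rather than speed; the tiny query, key and value
vectors are invented for illustration.

```python
import math


def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    highest = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention: each output is a weighted mix of the values."""
    scale = math.sqrt(len(keys[0]))
    outputs, all_weights = [], []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
        weights = softmax(scores)
        mixed = [
            sum(w * v[i] for w, v in zip(weights, values))
            for i in range(len(values[0]))
        ]
        outputs.append(mixed)
        all_weights.append(weights)
    return outputs, all_weights


# Hypothetical toy inputs: the single query points in the direction of the first key.
queries = [[5.0, 0.0]]
keys = [[1.0, 0.0], [0.0, 1.0]]
values = [[10.0, 0.0], [0.0, 10.0]]

outputs, weights = attention(queries, keys, values)
print(weights)
print(outputs)
```

Because the query is aligned with the first key, most of the weight goes to the first value.
In a real network the queries, keys and values are produced by learned linear layers.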

1.6 Reinforcement Learning

fig 2: How an agent interacts with the environment and takes actions.

Reinforcement learning [1] (RL) is the branch of AI where the machine learns to solve
complicated problems (like winning a game, placing the best advertisement on a webpage or
trading in the stock market) solely on its own. There is no explicit learning instruction
here: the machine learns by interacting with the environment and observing the rewards it gets
through this interaction. In recent years, reinforcement learning has gained lots of
attention, as it can solve a variety of different problems. So I encourage you to have a look
at the following course for more information on the subject: Google DeepMind's course, taught
by David Silver: Introduction to Reinforcement Learning.
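The interaction loop described above can be sketched with tabular Q-learning, one of the
standard introductory RL algorithms. The environment here is a hypothetical five-state
corridor invented for illustration: the agent starts at the left end and receives a reward
of 1 only when it reaches the right end. The learning rate, discount factor and number of
episodes are arbitrary choices.

```python
import random

N_STATES = 5      # states 0..4; state 4 is the goal (toy environment, assumption)
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Environment dynamics: returns (next_state, reward, done)."""
    next_state = max(state - 1, 0) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def q_learning(episodes=500, alpha=0.5, gamma=0.9, seed=0):
    """Learn action values purely from interaction and observed rewards."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Explore at random; Q-learning is off-policy, so it can still
            # learn the values of the greedy policy from random behaviour.
            action = rng.choice(ACTIONS)
            next_state, reward, done = step(state, action)
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


q_table = q_learning()
greedy_policy = [
    max(ACTIONS, key=lambda a: q_table[s][a]) for s in range(N_STATES - 1)
]
print(greedy_policy)
```

Nobody tells the agent that moving right is correct. The preference for moving right in the
greedy policy comes only from the reward observed at the goal, propagated backwards through
the update rule.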

1.7 Soft skills

To become a good data scientist, you should be able to communicate with almost all departments
in the workplace (either a company or a research facility). Depending on the company you are
planning to work in, you have to know what most of its parts do and what kinds of problems
each of them faces.

[1] This is more advanced, but I strongly recommend that you go over the material. The theory
behind reinforcement learning is very beautiful and the applications are fun!
