
List of Topics For Data Scientists

The document provides a comprehensive list of topics and subtopics relevant for Data Scientists and Data Analysts, including Python programming, database interaction, data analysis and visualization, statistical modeling, and optimization techniques. It includes references to online resources such as YouTube videos and books for further learning. Additionally, it covers specific algorithms and mathematical formulations related to SandMan, SandMix, and variable compactability.

Uploaded by

hr

List of topics for Data Scientists / Data Analysts

| SL No | Topics | Subtopics | Reference |
|---|---|---|---|
| 1 | Python Programming | Basic Python (data structures, functions, error handling) | https://www.youtube.com/watch?v=YYXdXT2l-Gg&list=PL-osiE80TeTskrapNbzXhwoFUiLCjGgY7 ; https://www.programiz.com/python-programming/getting-started |
| | | Data manipulation (Pandas, NumPy) | https://www.youtube.com/watch?v=ZyhVh-qRZPA&list=PL-osiE80TeTsWmV9i9c58mdDCSskIFdDS ; Book: Jake VanderPlas, Python Data Science Handbook |
| | | Data visualization (Matplotlib, Seaborn) | https://www.youtube.com/watch?v=UO98lJQ3QGI&list=PL-osiE80TeTvipOqomVEeZ1HRrcEvtZB_ |
| | | Git version control | https://www.youtube.com/watch?v=HVsySz-h9r4&list=PL-osiE80TeTuRUfjRe54Eea17-YfnOOAx |
| | | Object-oriented programming | https://www.youtube.com/watch?v=ZDa-Z5JzLYM&list=PL-osiE80TeTsqhIuOqKhwlXsIBIdSeYtc |
| | | Sklearn library | Book: Jake VanderPlas, Python Data Science Handbook |
| 2 | Database Interaction | SQLAlchemy, Pandas with SQL (optional) | https://www.youtube.com/watch?v=vKuKp10LQEM&t=69s |
| 3 | Data Analysis and Visualization | Descriptive statistics, probability distributions | |
| 4 | Statistical modelling | Regression: normal regression, lasso, ridge, random forest, SVR, KNN | https://www.statlearning.com/ |
| | | Dimension reduction: PCA, KPCA | For PCA and KPCA, follow the Ali Ghodsi lectures: https://www.youtube.com/watch?v=L-pQtGm3VS8 ; https://www.youtube.com/watch?v=jeOEXCFK30M |
| | | Classification: logistic, KNN, SVM, random forest | https://www.statlearning.com/ ; for SVM, follow the Ali Ghodsi lecture: https://www.youtube.com/watch?v=rLT4OFy-atc |
| | | Clustering: K-means, hierarchical, DBSCAN, fuzzy K-means | |
| 5 | Optimization | How to formulate and solve LP, QP and MILP optimization problems using Python | |
| 6 | SandMan and high influence - Algorithm | Mathematical formulation of the SandMan and high influence model | Lecture by Infosoft team |
| 7 | SandMix - Algorithm | Mathematical formulation of SandMix | Lecture by Infosoft team |
| 8 | Variable compactability (VCOMP-SP) - Algorithm | Mathematical formulation of VCOMP | Lecture by Infosoft team |
| 9 | Sand Analytics product demonstration | | Lecture by Infosoft team |
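
As a small illustration of the descriptive statistics and probability distribution topics listed under Data Analysis and Visualization, the sketch below computes a few summary measures and fits a normal distribution using only Python's standard library. The sample values and the choice of a normal distribution are assumptions made for illustration; they do not come from the original list or its references.

```python
import statistics
from statistics import NormalDist

# Hypothetical sample: made-up measurements, used only for illustration.
sample = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]

# Descriptive statistics
mean = statistics.mean(sample)        # arithmetic mean
median = statistics.median(sample)    # middle value of the sorted data
pop_std = statistics.pstdev(sample)   # population standard deviation

# Probability distribution: a normal distribution parameterised by the
# sample mean and population standard deviation (an assumed model).
dist = NormalDist(mu=mean, sigma=pop_std)
p_below_7 = dist.cdf(7.0)             # P(X <= 7) under the assumed model

print(f"mean={mean}, median={median}, std={pop_std}")
print(f"P(X <= 7) under the fitted normal: {p_below_7:.4f}")
```

For real datasets the same quantities would typically be computed with Pandas or NumPy, which the list covers under data manipulation.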
