DS Curriculum

5. Statistics & Data Visualization using Python
Learn to apply the concepts of Descriptive and Inferential Statistics to analyze data, using Python as a tool.

• Statistical Analysis using Python
• Prediction using Probability
• Data Sampling & Distribution
• Prediction using Probability Distribution
• Predicting events using Binomial and Poisson distribution
• Parametric Testing
• Non Parametric Testing
• Chi Square Testing

• Measures of Central Tendency
• Measures of Dispersion
• Fundamentals of probability (simple, joint and conditional)
• Probability distributions (Binomial, Poisson)
• Hypothesis testing (Z and T Test of means)
• ANOVA
• Chi Square Tests

6. Exploratory Data Analysis
Learn and apply the best practices of EDA in a systematic way (Univariate, Bivariate and Multi-variate Analysis) along with the statistical concepts. Present the result of this analysis visually.

• EDA using Descriptive Statistics
• EDA hackathon 1
• EDA using inferential Statistics
• EDA hackathon 2
• EDA hackathon 3

• EDA best practice demonstration
• Univariate, Bivariate and Multi-variate analysis
• Data visualization using graphs and charts

7. Web Scraping & Introduction to Text Analytics
Use Beautiful Soup to scrape the Web by learning how to work with Web APIs. Learn the fundamentals of Text Analytics using the spaCy library in Python.

• Working with APIs
• Breaking down HTML and JSON
• Scraping the Web for data
• Data cleaning using Regular Expressions
• Analyze Text Data to detect sentiments

• Working with Web APIs
• HTML and JSON fundamentals
• Scraping using Beautiful Soup
• Regular Expressions
• Fundamentals of Text Analysis using the spaCy library (stemming, lemmatization, stop words, phrase matching and tokenization using spaCy)

8. Unstructured Data Analysis using a No-SQL Database
Use MongoDB to store unstructured data. Query complex data structures to extract insights from real-world datasets.

• Justifying the need for a No-SQL Database
• Installing and Exploring MongoDB
• Creating database objects and inserting data
• Analyze unstructured data through MongoDB

• Document database
• MongoDB
• CRUD operations
• Embedded documents
• Data analysis using sorting, filtering, identifying correlations and projection

9. Exploratory Data Analysis using Tableau
This course will help you deep dive into Visual Analytics. It covers various advanced techniques for analyzing data to enhance your EDA skills.

• Build a view to explore data
• Structure data for analysis in Tableau
• Aggregate and disaggregate data in Tableau
• Creating Charts and Graphs
• Creating data hierarchies

• Dimensions & Measures
• Discrete (blue) or continuous (green); row level, aggregates or multi-level aggregation
• Concepts related to the use of graphs (Line, Bar, Stacked Bar, Pie, Spider, Word Cloud, …)

10. Story Telling using Tableau
Learn to build Dashboards and tell stories using Data.

• Creating Story Lines
• Data Blending in Tableau
• Adding Filters
• Adding actions to Dashboards

• Drill Downs
• Building Story lines
• Interactive Dashboards

11. Data Modelling using Supervised Machine Learning
Demonstrate the ability to deploy ML algorithms for predicting continuous variables and classification of categorical data. Apply feature engineering techniques to optimize the model accuracy.

• Building a mathematical model for prediction
• Predict continuous variable using Linear Regression
• Binomial classification of events/outcomes using Logistic Regression
• Multinomial classification using Naïve Bayes and KNN algorithms
• Classification and Regression using Decision Trees

• Linear Regression
• Logistic Regression
• Lasso and Ridge Regression
• Naïve Bayes Classifier
• Classification and Regression Trees
• Feature Engineering
• KNN

12. Data Modelling using Unsupervised Machine Learning
Demonstrate the ability to deploy ML algorithms for clustering similar data. Choose an ensemble of ML algorithms for a given problem and fine-tune the accuracy.

• Clustering data using K-Means
• Clustering data using DBSCAN & Hierarchical clustering
• Optimizing predictions using Random Forest
• Optimizing prediction using Bagging & Boosting techniques

• K-Means Clustering
• DBSCAN
• Hierarchical Clustering
• Random Forest
• Bagging
• Boosting

13. Deep Learning for Analytics
Understand the concept of neurons to build neural networks for analytics using Keras and TensorFlow. Apply these concepts to build neural net models for solving regression and classification problems.

• Building a single layer Neural Network for simple linear regression
• Building multi-layer neural networks for card churn analysis
• Building recurrent neural networks to predict stock prices
• The future of AI: XAI

• ANN
• CNN
• RNN
• XAI

14. NLP using Deep Learning Techniques
Learn advanced Natural Language Processing (NLP) using Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN) to build chat bots.

• Word embedding
• Sentiment analysis
• Topic modeling
• Text Summarization
• Building a chat bot

• GRU
• LSTM

15. Image Classification using Deep Learning
Learn to label, segment and classify image data using CNNs and Generative Adversarial Networks (GAN).

• Pre-processing and labeling of image data
• Image segmentation
• Object Detection
• Image classification
• Image creation
• Outlook from Image to Text
• Video Analysis

• CNN
• GAN

16. Time Series Data Analytics
Understand the significance of Time Series Data and apply statistical tools to predict an event in time. Optimize the predictions in Time Series Analysis using Deep Learning concepts.

• ETS
• Evaluating Forecast Models
• Auto Regression using statsmodels
• Forecasting using ARMA and ARIMA
• Forecasting using SARIMA and SARIMAX
• Time Series Analysis using Deep Learning

• Error-Trend-Seasonality
• ARIMA
• SARIMA
• ARIMAX

Capstone Project
Complete an end-to-end ML project.

• Complete a data analysis project including ML modelling

• Combine all knowledge & skills learnt to solve a problem
• Walk-through of the solution
• 2-3 member small agile team
• Final project presentation