Syllabus
Syllabus
Program
Curriculum
DATA SCIENCE TOOLKIT
1 INTRODUCTION TO PYTHON
Build a foundation for the most in-demand programming language of the 21st century.
3 INFERENTIAL STATISTICS
Build a strong statistical foundation and learn how to ‘infer’ insights from a huge population using a
small sample.
4 HYPOTHESIS TESTING
Understand how to formulate and validate hypothesis for a population to solve real-life
business problems.
MACHINE LEARNING I
1 LINEAR REGRESSION
Venture into the machine learning community by learning how one variable can be predicted
using several other variables through a housing dataset where you will predict the prices of
houses based on various factors.
3
LOGISTIC REGRESSION
Learn your first binary classification technique by determining which customers of a telecom
operator are likely to churn to help versus who are not to help business retain customers.
4 NAIVE BAYES
Understand the basic building blocks of Naive Bayes and learn how to build an SMS Spam
Ham Classifier using Naive Bayes technique.
5 MODEL SELECTION
Learn the pros and cons of simple and complex models and the different methods for quantifying
model complexity, along with regularisation and cross validation.
MACHINE LEARNING II
1 ADVANCED REGRESSION
Understand generalised regression and different feature selection techniques, along with the
perils of overfitting and how it can be countered using regularisation.
4
TREE MODELS
Learn how the human decision making process can be replicated using a decision tree and
other powerful ensemble algorithms.
6
BOOSTING
Learn how weak learners can be ‘boosted’ with the help of each other and become strong
learners using different boosting algorithms such as Adaboost, GBM, and XGBoost.
8
UNSUPERVISED LEARNING: PRINCIPAL COMPONENT ANALYSIS
Understand important concepts related to dimensionality reduction, the basic idea and the
learning algorithm of PCA, and its practical applications on supervised and unsupervised
problems.
DEEP LEARNING