BIRLA INSTITUTE OF TECHNOLOGY & SCIENCE, PILANI
WORK INTEGRATED LEARNING PROGRAMMES
COURSE HANDOUT
Part A: Content Design
Course Title Advanced Statistical Techniques for Analytics
Course No(s) SS ZG536
Credit Units 4
Course Author
Version No
Date January 2019
Course Description
This course covers the statistical techniques that are central to data analytics. It
covers models from descriptive statistics, inferential statistics, predictive analytics and
applied multivariate analytics. It also introduces the important tools used in data
analytics.
Course Objectives
CO1 Understand data visualisation and analysis, which are central to data analytics
CO2 Understand the predictive and inferential statistical models used in data analytics
CO3 Understand the tools used in data analytics
Text Books
T1 Probability and Statistics for Engineering and the Sciences, 8th Edition, Jay L. Devore
T2 Applied Logistic Regression, 3rd Edition, Hosmer and Lemeshow, Wiley
T3 Analysis of Financial Time Series, 3rd Edition, Ruey S. Tsay, Wiley
Reference Books
R1 Introduction to Time Series and Forecasting, 2nd Edition, Peter J. Brockwell and Richard
A. Davis, Springer
R2 Miller & Freund's Probability and Statistics for Engineers, 8th Edition, PHI
R3 R Programming for Data Science, Roger D. Peng
Modular Content Structure
1. Descriptive Statistics & Probability
1.1 Data Visualisation
1.2 Measures of Central Tendency
1.3 Measures of Variability
1.4 Probability – Axioms & Properties
1.5 Conditional probability & Bayes' theorem
1.6 Random Variables – Discrete & Continuous with Properties
1.7 Probability Distributions
1.7.1. Binomial distribution
1.7.2. Poisson distribution
1.7.3. Normal distribution
1.7.4. Exponential distribution
2. Inferential Statistics
2.1 Random Sampling and sample Bias
2.2 Central Limit Theorem
2.3 Estimation – Confidence Intervals
2.4 Testing of Hypothesis
2.4.1. Type I, Type II errors
2.4.2. Testing of Hypothesis – single sample (mean & proportion)
2.4.3. Inferences based on two samples (mean & proportion)
2.4.4. Chi Square Test
2.5 The Analysis of Variance - Single Factor and Two factor ANOVA
3. Predictive Analytics – 1 (Linear Regression)
3.1 Correlation
3.2 Sum of Least Squares Approximation
3.3 Simple linear regression
3.4 Gradient Descent Method
3.5 Ridge & Lasso Models
3.6 Nonlinear regression
3.7 Multiple regression analysis
4. Predictive Analytics – 2 (Logistic Regression)
4.1 Introduction
4.2 Logistic regression function
4.3 Polynomial Logistic Regression
4.4 Dependent variable prediction
5. Predictive Analytics – 3 (Forecasting Models)
5.1 Principles of Forecasting - Introduction
5.2 Time series Analysis
5.2.1. Smoothing & decomposition methods
5.2.2. ARIMA Model
5.2.3. GARCH models
5.2.4. Holt-Winters method
5.3 Moving Averages
5.4 Exponential smoothing
6. Applied Multivariate Analytics
6.1 Introduction
6.2 Multivariate Normal distribution
6.3 Principal Component Analysis
6.4 Factor Analysis
6.5 Discriminant Analysis
6.6 MANOVA
Learning Outcomes:
No.  Learning Outcome
LO1  Clear understanding of the difference between Business Intelligence and data analytics
LO2  Various statistical models & tools used in Data Analytics
Part B: Contact Session Plan
Academic Term II Semester 2018-2019
Course Title Advanced Statistical Techniques for Analytics
Course No SS ZG536
Lead Instructor Dr. Y.V.K. RAVI KUMAR
Course Contents
Contact Session 1: Descriptive Statistics & Probability (Module 1)
Contact Session  List of Topic Title                                         References
CS-1             Introduction, Data Visualization                            T1: Chapter 1
                 Measures of central tendency, Measures of variability
HW               Problems on descriptive statistics                          T1: Chapter 1
Lab              Implementation of Data visualization
Contact Session 2: Descriptive Statistics & Probability (Module 1)
Contact Session  List of Topic Title                                         References
CS-2             Introduction to probability, properties                     T1: Chapter 2
                 Conditional probability, Bayes' theorem
HW               Problems on conditional probability & Bayes' theorem        T1: Chapter 2
Lab              Implementation of Bayes' theorem
Contact Session 3: Descriptive Statistics & Probability (Module 1)
Contact Session  List of Topic Title                                         References
CS-3             Random variables – Discrete & Continuous                    T1: Chapters 3 & 4
                 Probability distributions
HW               Problems on random variables and probability distributions  T1: Chapters 3 & 4
Lab              Implementation of probability distributions (discrete)
Contact Session 4: Inferential Statistics (Module 2)
Contact Session  List of Topic Title                                         References
CS-4             Random Sampling, Central Limit Theorem                      T1: Chapters 5 & 7
                 Confidence Intervals, Type I and Type II Errors
HW               Problems on confidence intervals                            T1: Chapters 5 & 7
Lab              Implementation of probability distributions (continuous)
Contact Session 5: Inferential Statistics (Module 2)
Contact Session  List of Topic Title                                         References
CS-5             Tests of Hypotheses – single sample (mean, proportion)      T1: Chapter 8
HW               Problems on tests of hypotheses                             T1: Chapter 8
Lab              Implementation of tests of hypothesis
Contact Session 6: Inferential Statistics (Module 2)
Contact Session  List of Topic Title                                         References
CS-6             Tests of Hypotheses – two samples (mean, proportion)        T1: Chapter 9
                 Several proportions – Chi-Square test
HW               Problems on tests of hypotheses                             T1: Chapter 9
Lab              Implementation of Chi-Square test
Contact Session 7: Inferential Statistics (Module 2)
Contact Session  List of Topic Title                                         References
CS-7             The Analysis of Variance (ANOVA) – Single Factor &          T1: Chapters 10 & 11
                 Two-Factor ANOVA
HW               Problems on ANOVA                                           T1: Chapters 10 & 11
Lab              Implementation of ANOVA
Contact Session 8: Predictive Analytics – 1 (Module 3) & Revision
Contact Session  List of Topic Title                                         References
CS-8             Correlation & review of the topics covered in               T1: Chapter 5
                 sessions 1 to 7
HW               Problems on Correlation                                     T1: Chapter 5
Lab
Contact Session 9: Predictive Analytics – 1 (Module 3)
Contact Session  List of Topic Title                                         References
CS-9             Method of least squares approximation                       T1: Chapter 12
                 Simple linear regression, Gradient Descent method
HW               Problems on Regression                                      T1: Chapter 12
Lab              Implementation of Regression
Contact Session 10: Predictive Analytics – 1 (Module 3)
Contact Session  List of Topic Title                                         References
CS-10            Nonlinear and multiple regression, Ridge & Lasso models     T1: Chapter 13
HW               Problems on Regression                                      T1: Chapter 13
Lab              Implementation of Regression
Contact Session 11: Predictive Analytics – 2 (Logistic Regression) (Module 4)
Contact Session  List of Topic Title                                         References
CS-11            Introduction, Logistic regression function,                 T2
                 Polynomial Logistic Regression, Dependent
                 variable prediction
HW               Problems on Logistic regression                             T2
Lab              Implementation of Logistic regression
Contact Session 12: Predictive Analytics – 3 (Forecasting Models) (Module 5)
Contact Session  List of Topic Title                                         References
CS-12            Principles of Forecasting, Time Series Analysis,            T3: Chapters 1 & 2
                 Smoothing and decomposition methods
HW               Problems on Time series analysis                            T3: Chapters 1 & 2
Lab              Implementation of Time series analysis
Contact Session 13: Predictive Analytics – 3 (Forecasting Models) (Module 5)
Contact Session  List of Topic Title                                         References
CS-13            Moving Averages, exponential smoothing                      T3: Chapter 3
                 ARIMA Model, GARCH Model, Holt-Winters method
HW               Problems on Time series analysis                            T3: Chapter 3
Lab              Implementation of Time series analysis
Contact Session 14: Applied Multivariate Analytics (Module 6)
Contact Session  List of Topic Title                                         References
CS-14            Applied Multivariate Analytics – Introduction               T3: Chapter 9
                 Multivariate Normal distribution, Principal Component Analysis
HW               Problems on principal component analysis                    T3: Chapter 9
Lab              Implementation of principal component analysis
Contact Session 15: Applied Multivariate Analytics (Module 6)
Contact Session  List of Topic Title                                         References
CS-15            Factor Analysis, Discriminant Analysis, MANOVA              Ref Material
HW
Lab
Contact Session 16: Revision
Contact Session  List of Topic Title                                         References
CS-16            Revision
HW
Lab
Evaluation Scheme
Legend: EC = Evaluation Component; AN = After Noon Session; FN = Fore Noon Session
No    Name                Type         Duration  Weight  Day, Date, Session, Time
EC-1  Quiz-1              Online       -         5%      14th Feb to 28th Feb, 2019
      Quiz-2              Online       -         5%      14th March to 28th March, 2019
      Assignment-I        Online       -         10%     1st April to 15th April, 2019
EC-2  Mid-Semester Test   Closed Book  2 hours   30%
EC-3  Comprehensive Exam  Open Book    3 hours   50%
Important Information
Syllabus for Mid-Semester Test (Closed Book): Topics in Weeks 1-7
Syllabus for Comprehensive Exam (Open Book): All topics given in plan of study
Evaluation Guidelines:
1. For Closed Book tests: No books or reference material of any kind will be permitted.
Laptops/Mobiles of any kind are not allowed. Exchange of any material is not allowed.
2. For Open Book exams: Use of prescribed and reference textbooks, in original (not
photocopies), is permitted. Class notes/slides as reference material in filed or bound form are
permitted. However, loose sheets of paper will not be allowed. Use of calculators is permitted
in all exams. Laptops/Mobiles of any kind are not allowed. Exchange of any material is not
allowed.
3. If a student is unable to appear for the Regular Test/Exam due to genuine exigencies, the
student should follow the procedure to apply for the Make-Up Test/Exam. The genuineness of
the reason for absence in the Regular Exam shall be assessed prior to giving permission to
appear for the Make-up Exam. Make-Up Test/Exam will be conducted only at selected exam
centres on the dates to be announced later.
It shall be the responsibility of the individual student to maintain the self-study schedule
given in the course handout, attend the lectures, and take all the prescribed evaluation
components such as Assignment/Quiz, Mid-Semester Test and Comprehensive Exam according to the
evaluation scheme provided in the handout.