Data Analytics-1
Text Book:
Reference Book:
Note:
• Each session mentioned is for theory and is of 2 hours' duration. Lab assignments are
indicative; faculty need to assign adequate assignments for better practice.
• The trainer has to teach the statistical and probability concepts involved here in detail.
• The trainer must teach the SciPy package in detail.
Session 1 & 2:
Introduction to Business Analytics using some case studies
Data analytics Life Cycle:
Discovery
Data preparation
Model planning
Model building implementation
Quality assurance
Documentation
Management approval
Installation
PG-DAI Page 1 of 4
ACTS, Pune
Assignment –Lab: Set up a workspace in either Google Colab or Jupyter and go through the
documentation of the Pandas, NumPy, and Sklearn libraries.
Session 3:
Intelligent data analysis
Nature of Data
Analytic Processes and Tools
Analysis vs. Reporting
Modern Data Analytic Tools
Session 4:
Visualization and Exploring Data
Case studies: Making the Right Business Decisions Based on Data
Assignment –Lab: Load a sample dataset, explore the data, and draw some insights. Use the
Matplotlib and seaborn libraries for the visualizations.
Session 5:
Descriptive Statistical Measures
Summary Statistics - Central Tendency & Dispersion (Mean, Median, Mode, Quartiles,
Percentiles, Range, Interquartile Range, Standard Deviation,
Variance, and Coefficient of Variation)
Assignment –Lab: Load any dataset and find the mean, median, mode, and the other measures of
central tendency and dispersion listed above.
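The summary statistics in this session can be tried out with a minimal sketch like the one below. It uses only Python's standard `statistics` module; the `data` list is made up for illustration, and in the lab the values would normally come from a column of a loaded dataset (for example via Pandas).

```python
import statistics as st

# Made-up sample values, used only for illustration.
data = [2, 4, 4, 5, 7, 9, 11]

# Measures of central tendency
mean = st.mean(data)
median = st.median(data)
mode = st.mode(data)

# Quartiles (three cut points) using the module's default "exclusive" method
q1, q2, q3 = st.quantiles(data, n=4)

# Measures of dispersion
data_range = max(data) - min(data)
iqr = q3 - q1                      # interquartile range
variance = st.variance(data)       # sample variance (divides by n - 1)
std_dev = st.stdev(data)           # sample standard deviation
coeff_of_variation = std_dev / mean

print("mean:", mean, "median:", median, "mode:", mode)
print("quartiles:", q1, q2, q3, "range:", data_range, "IQR:", iqr)
print("variance:", variance, "std dev:", round(std_dev, 3),
      "CV:", round(coeff_of_variation, 3))
```

Note that different tools use different quartile conventions, so quartile values from Pandas or NumPy may differ slightly from the ones computed here.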
Session 6:
Sample & population, univariate and bivariate sampling, re-sampling
Sample Spaces and Events
Joint, Conditional and Marginal Probability
Bayes’ Theorem
Assignment –Lab: Load any dataset, apply a Naive Bayes classifier, and predict the output.
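As a starting point for this assignment, the following is a minimal from-scratch sketch of a categorical Naive Bayes classifier built directly on Bayes' Theorem. The toy `rows` data, the function names `train` and `predict`, and the choice of Laplace (add-one) smoothing are all assumptions made for illustration; in the lab, a library classifier such as scikit-learn's `GaussianNB` would typically be used instead.

```python
from collections import Counter, defaultdict

# Made-up toy data: (outlook, windy, play). The last field is the class label.
rows = [
    ("sunny", "no", "yes"),
    ("sunny", "yes", "no"),
    ("rainy", "yes", "no"),
    ("rainy", "no", "yes"),
    ("sunny", "no", "yes"),
    ("rainy", "yes", "no"),
]


def train(rows):
    """Count class frequencies and feature-value frequencies within each class."""
    class_counts = Counter(label for *_, label in rows)
    feature_counts = defaultdict(Counter)  # (feature index, label) -> value counts
    for *features, label in rows:
        for i, value in enumerate(features):
            feature_counts[(i, label)][value] += 1
    return class_counts, feature_counts


def predict(features, class_counts, feature_counts, n_values=2):
    """Return P(label | features), assuming features are independent given the label.

    n_values is the number of distinct values per feature (2 in the toy data);
    it is used for Laplace smoothing so unseen values never give probability 0.
    """
    total = sum(class_counts.values())
    scores = {}
    for label, count in class_counts.items():
        score = count / total  # prior P(label)
        for i, value in enumerate(features):
            # Smoothed likelihood P(value | label)
            score *= (feature_counts[(i, label)][value] + 1) / (count + n_values)
        scores[label] = score
    evidence = sum(scores.values())  # normalizing constant P(features)
    return {label: score / evidence for label, score in scores.items()}


class_counts, feature_counts = train(rows)
posterior = predict(("sunny", "no"), class_counts, feature_counts)
prediction = max(posterior, key=posterior.get)
print(posterior, "->", prediction)
```

The prior, likelihood, and evidence terms in `predict` correspond to the marginal, conditional, and normalizing probabilities covered in this session.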
Session 7:
Random Variable
Probability Distribution and Data
Continuous and discrete distribution – (Normal, Binomial, and Poisson distribution)
Central Limit Theorem
Assignment –Lab: Generate random numbers and check whether they follow a normal distribution
using the SciPy library.
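One possible way to approach this assignment is sketched below. It assumes NumPy and SciPy are installed and uses the Shapiro-Wilk test (`scipy.stats.shapiro`) as the normality check; the helper name `looks_normal`, the seed, the sample sizes, and the 0.05 significance level are illustrative choices, not requirements of the course.

```python
import numpy as np
from scipy import stats

# Fixed seed so the experiment is repeatable (the seed value is arbitrary).
rng = np.random.default_rng(seed=42)

normal_sample = rng.normal(loc=0.0, scale=1.0, size=1000)
uniform_sample = rng.uniform(low=-1.0, high=1.0, size=1000)


def looks_normal(sample, alpha=0.05):
    """Shapiro-Wilk test: the null hypothesis is that the sample is normal.

    Returns (decision, p_value). A p-value above alpha means the test found
    no evidence against normality; it does not prove the data are normal.
    """
    statistic, p_value = stats.shapiro(sample)
    return p_value > alpha, p_value


for name, sample in [("normal", normal_sample), ("uniform", uniform_sample)]:
    decision, p_value = looks_normal(sample)
    print(f"{name}: p-value = {p_value:.4g}, consistent with normal: {decision}")
```

A histogram or a Q-Q plot of the same samples is a useful visual complement to the test.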
Session 8:
Sampling and Estimation
Statistical Inference
Session 9:
Concepts of Correlation
Covariance
Pearson Correlation
Outliers
Assignment –Lab: Load any dataset, find the covariance and the Pearson correlation between two
fields, and determine how the two fields are related. Also handle the outliers in the data.
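The quantities in this assignment can be computed by hand, as in the minimal sketch below. The lists `xs`, `ys`, and `readings` are made-up values, the function names are hypothetical, and the 1.5 × IQR rule is only one common convention for flagging outliers; Pandas offers equivalent methods such as `DataFrame.cov` and `DataFrame.corr`.

```python
import math
from statistics import quantiles


def covariance(xs, ys):
    """Sample covariance (divides by n - 1)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)


def pearson(xs, ys):
    """Pearson correlation: covariance scaled by both standard deviations."""
    return covariance(xs, ys) / math.sqrt(covariance(xs, xs) * covariance(ys, ys))


def remove_outliers_iqr(values, k=1.5):
    """Keep only values inside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if low <= v <= high]


# Made-up data: ys is an exact linear function of xs.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 6, 8, 10]
cov_xy = covariance(xs, ys)
r_xy = pearson(xs, ys)
print("covariance:", cov_xy, "Pearson r:", r_xy)

# Made-up data with one obvious outlier (100).
readings = [10, 12, 11, 13, 12, 11, 100]
cleaned = remove_outliers_iqr(readings)
print("without outliers:", cleaned)
```

Dropping outliers is only one way to handle them; capping or investigating them may be more appropriate depending on the data.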
Session 12:
Predictive modelling and analysis
Application
Types
Benefits and challenges
The Future of predictive modelling
The Limitations of Predictive modelling
Predictive modelling Tools
Session 16:
Regression Analysis
Forecasting Techniques
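As an illustration of the simplest case covered here, the sketch below fits a straight line by ordinary least squares and uses it to forecast the next value. The data points and the names `fit_line` and `forecast` are made up for illustration; real lab work would more likely use a library such as scikit-learn or SciPy.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def forecast(x, slope, intercept):
    """Predict y for a new x using the fitted line."""
    return slope * x + intercept


# Made-up observations: period number and observed value.
periods = [1, 2, 3, 4, 5]
values = [3, 5, 7, 9, 11]

slope, intercept = fit_line(periods, values)
next_value = forecast(6, slope, intercept)
print("slope:", slope, "intercept:", intercept, "forecast for period 6:", next_value)
```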
Session 17:
Simulation and Risk Analysis
Optimization- Linear, Nonlinear
Session 18:
Overfitting and Its Avoidance:
Generalization
Holdout Evaluation Vs. Cross Validation
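The difference between the two evaluation schemes can be seen in how the data indices are split, as in this minimal sketch. The function names `holdout_split` and `kfold_indices`, the 25% test fraction, and the fold count are illustrative assumptions; scikit-learn provides ready-made equivalents.

```python
import random


def holdout_split(n_samples, test_fraction=0.25, seed=0):
    """Holdout: one random split into a training set and a single test set."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    n_test = int(n_samples * test_fraction)
    return indices[n_test:], indices[:n_test]  # (train, test)


def kfold_indices(n_samples, k):
    """Cross-validation: yield k (train, test) splits; each sample is tested once."""
    indices = list(range(n_samples))
    # Spread the remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, test
        start += size


train_idx, test_idx = holdout_split(10)
print("holdout train:", train_idx, "test:", test_idx)

folds = list(kfold_indices(10, k=3))
for i, (train, test) in enumerate(folds, start=1):
    print(f"fold {i}: train={train} test={test}")
```

With holdout, the performance estimate depends on one particular split; with cross-validation, the model is trained and scored k times and the scores are averaged.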
Session 19:
Decision Analytics
Evaluating Classifiers
Analytical Framework
Evaluation, Baseline Performance, and Implications for Investments in Data
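To make classifier evaluation against a baseline concrete, here is a minimal sketch that computes confusion-matrix counts, accuracy, precision, and recall, and compares accuracy with a majority-class baseline. The labels in `y_true` and `y_pred` are made up, and the function names are hypothetical.

```python
from collections import Counter


def confusion_counts(y_true, y_pred, positive=1):
    """Return (TP, FP, FN, TN) for a binary problem."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    return tp, fp, fn, tn


def majority_baseline_accuracy(y_true):
    """Accuracy of always predicting the most frequent class."""
    _, count = Counter(y_true).most_common(1)[0]
    return count / len(y_true)


# Made-up true labels and classifier predictions.
y_true = [1, 0, 1, 1, 0, 0, 1, 0, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0, 0, 0]

tp, fp, fn, tn = confusion_counts(y_true, y_pred)
accuracy = (tp + tn) / len(y_true)
precision = tp / (tp + fp)
recall = tp / (tp + fn)
baseline = majority_baseline_accuracy(y_true)

print("TP, FP, FN, TN:", tp, fp, fn, tn)
print("accuracy:", accuracy, "precision:", precision, "recall:", recall)
print("majority-class baseline accuracy:", baseline)
```

A classifier is only worth investing in if it clearly beats such a simple baseline on the metric that matters for the business decision.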
Session 20:
Evidence and Probabilities
Explicit Evidence Combination with Bayes' Rule
Probabilistic Reasoning
Session 21:
Factor Analysis
Directional Data Analytics
Functional Data Analysis
Session 22:
Introduction to KNIME
Assignment –Lab: Load any dataset and explore the KNIME tool.