Certificate Course of Data Analytics
Data analysis is the need of the hour. Today, organizations generate huge amounts of
data without knowing how to use it to their benefit. To change this, machine learning
and statistical techniques are now being used to develop predictive models from
existing data to forecast future outcomes.
Objective
To build a solid foundation in business analytics, this course is designed to impart
knowledge of machine learning and statistical methods for data analysis. The course
also provides sufficient knowledge of the Python programming language for machine
learning algorithms, and of Python/R programming for statistical methods. A brief
introduction to neural networks and deep learning is also included.
Target Audience
1. Students who have passed the 10+2 examination with Mathematics.
2. Professionals with a knowledge of Mathematics.
Data pre-processing and cleaning: data manipulation steps (sorting, filtering, duplicates,
merging, appending, subsetting, derived variables, data type conversions, renaming,
formatting, etc.), normalizing data, sampling, missing value treatment, outliers.
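A few of the pre-processing steps listed above can be sketched in plain Python. This is only an illustrative sketch using the standard library: the records, the column meanings (customer id, age, monthly spend), the median imputation and the 1.5 x IQR outlier rule are all assumptions chosen for the example, not a prescribed method of the course.

```python
import statistics

# Hypothetical records: (customer_id, age, monthly_spend); None marks a missing value.
raw = [
    (1, 25, 230.0),
    (2, 32, None),
    (2, 32, None),       # exact duplicate of the previous row
    (3, 41, 260.0),
    (4, 29, 240.0),
    (5, 38, 5000.0),     # suspiciously large value
]

# 1. Duplicates: drop exact repeats while keeping the original order.
rows = list(dict.fromkeys(raw))

# 2. Missing value treatment: fill a missing spend with the median of observed spends.
observed = [spend for _, _, spend in rows if spend is not None]
median_spend = statistics.median(observed)
rows = [(cid, age, spend if spend is not None else median_spend)
        for cid, age, spend in rows]

# 3. Outliers: flag values outside [Q1 - 1.5*IQR, Q3 + 1.5*IQR].
spends = [spend for _, _, spend in rows]
q1, _, q3 = statistics.quantiles(spends, n=4, method="inclusive")
iqr = q3 - q1
low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
outliers = [r for r in rows if not low <= r[2] <= high]
clean = [r for r in rows if low <= r[2] <= high]

# 4. Normalizing: min-max scale the ages of the remaining rows to [0, 1].
ages = [age for _, age, _ in clean]
min_age, max_age = min(ages), max(ages)
normalized_age = [(a - min_age) / (max_age - min_age) for a in ages]

print("outliers:", outliers)
print("normalized ages:", normalized_age)
```

In practice the same steps are usually carried out with a data-frame library; the sketch only shows the logic behind each step.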
Exploratory data analysis: data visualization using the matplotlib and seaborn libraries, creating
graphs (bar/line/pie/boxplot/histogram, etc.), summarizing data, descriptive statistics,
univariate analysis (distribution of data), bivariate analysis (cross tabs, distributions and
relationships, graphical analysis).
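As a small illustration of the descriptive statistics and cross tabs mentioned above, the following sketch summarizes one numeric variable and cross-tabulates two categorical ones. It uses only the Python standard library rather than matplotlib or seaborn, and the survey records and field names are hypothetical.

```python
import statistics
from collections import Counter

# Hypothetical survey records: (gender, purchased, age)
records = [
    ("F", "yes", 23), ("M", "no", 35), ("F", "yes", 31),
    ("M", "yes", 41), ("F", "no", 27), ("M", "no", 35),
]

# Univariate analysis: descriptive statistics for the age variable.
ages = [age for _, _, age in records]
summary = {
    "count": len(ages),
    "mean": statistics.mean(ages),
    "median": statistics.median(ages),
    "stdev": statistics.stdev(ages),
    "min": min(ages),
    "max": max(ages),
}

# Bivariate analysis: cross tab of gender against purchase decision.
crosstab = Counter((gender, purchased) for gender, purchased, _ in records)

print(summary)
for gender in ("F", "M"):
    print(gender, {p: crosstab[(gender, p)] for p in ("yes", "no")})
```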
Model Evaluation: Cross validation types (train & test, bootstrapping, k-fold validation),
parameter tuning, confusion matrices, basic evaluation metrics, precision-recall, ROC
curves.
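The ideas of k-fold validation, confusion matrices and precision-recall can be sketched as follows. This is a minimal illustration in plain Python: the helper names `k_fold_indices` and `confusion_counts` and the label vectors are hypothetical, and the folds are taken as contiguous blocks without shuffling, which is a simplifying assumption.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_idx, test_idx) for k contiguous folds of near-equal size."""
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in sizes:
        stop = start + size
        test_idx = list(range(start, stop))
        train_idx = [i for i in range(n_samples) if i < start or i >= stop]
        yield train_idx, test_idx
        start = stop


def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, fn, tn) for a binary classification problem."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    return tp, fp, fn, tn


# Each sample appears in exactly one test fold.
folds = list(k_fold_indices(n_samples=10, k=3))
for train_idx, test_idx in folds:
    print("train:", train_idx, "test:", test_idx)

# Hypothetical true labels and model predictions.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
tp, fp, fn, tn = confusion_counts(y_true, y_pred)
precision = tp / (tp + fp)   # share of predicted positives that are correct
recall = tp / (tp + fn)      # share of actual positives that are found
print("confusion counts:", (tp, fp, fn, tn))
print("precision:", precision, "recall:", recall)
```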
Case study
Association Rule Mining: Mining frequent itemsets, Apriori algorithm, market basket
analysis.
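The level-wise search behind the Apriori algorithm can be sketched as below. This is a simplified illustration, not an optimized implementation: the function name `apriori`, the use of an absolute support count as the threshold, and the shopping baskets are all assumptions made for the example.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support count} for every frequent itemset."""
    transactions = [frozenset(t) for t in transactions]

    # Level 1: count single items and keep the frequent ones.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    current = {s: c for s, c in counts.items() if c >= min_support}
    frequent = dict(current)

    k = 2
    while current:
        # Join step: combine frequent (k-1)-itemsets into candidate k-itemsets.
        candidates = {a | b for a, b in combinations(list(current), 2)
                      if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {c for c in candidates
                      if all(frozenset(s) in current
                             for s in combinations(c, k - 1))}
        # Count step: support is the number of transactions containing the candidate.
        counts = {c: sum(c <= t for t in transactions) for c in candidates}
        current = {s: c for s, c in counts.items() if c >= min_support}
        frequent.update(current)
        k += 1
    return frequent


# Hypothetical market baskets.
baskets = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "butter"},
    {"bread", "milk", "butter"},
]
result = apriori(baskets, min_support=3)

# Confidence of the rule bread -> milk: support(bread, milk) / support(bread).
confidence = result[frozenset({"bread", "milk"})] / result[frozenset({"bread"})]
print(len(result), "frequent itemsets; confidence(bread -> milk) =", confidence)
```

With a minimum support of 3, all single items and all pairs are frequent in these baskets, while the three-item set appears in only two baskets and is dropped.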
Case study
Case Study
References:
1. Kumar, U.D.: Business Analytics – The Science of Data-Driven Decision Making,
Wiley.
2. Laursen, G.H.N. and Thorlund, J.: Business Analytics for Managers –
Taking Business Intelligence Beyond Reporting, Wiley.
3. Johnson, R.A., Miller, I. and Freund, J.: Probability and Statistics for Engineers,
Pearson.
4. Jose, J. and Lal, S.P.: Introduction to Computing & Problem Solving with Python,
Khanna Publishers.
5. Bowles, M.: Machine Learning in Python – Essential Techniques for Predictive
Analysis, Wiley.
6. Larose, D.T. and Larose, C.T.: Data Mining and Predictive Analytics, Wiley.
7. Bishop, C.M.: Pattern Recognition & Machine Learning, Springer New York.
8. Flach, P.: Machine Learning, Wiley.
9. Deepa, S.N. and Sivanandam, S.N.: Principles of Soft Computing, Wiley.
10. Taha, H.A.: Operations Research – An Introduction, Prentice Hall.
11. Raschka, S.: Python Machine Learning.
Course Co-coordinators:
1. Dr. Sameer Anand
2. Dr. Ajay Jaiswal