Data Science
EXPLORATION OF INSIG
Courses
100% Live Interactive Sessions from Expert
Hybrid Model - Live + Classroom Project Sessions
Guaranteed Interview Calls
Doubt Clearing Session with Expert
Data Analytics Certification Program

Introduction [Orientation Courses]
Basics of computer architecture, Computer programming in C++, Java, Data types and Data structure, Programming Tools, Branching, loop, Function, Other Data Types, Array, Pointer, Components of object oriented system, Basics of mathematics, Linear algebra, Statistics [Mean, Median, Mode, Standard Deviation, Average, Probability, permutations, and combinations]

Python Basics
Types of code editors in Python, Introduction to Anaconda & Jupyter notebook, Flavors of Python, Introduction to Git, GitHub, Python Fundamentals, Source code vs Byte code vs Machine code, Compiler & Interpreter, Memory Management in Python

Python Programming Overview
Python Overview, Python 2.7 vs Python 3, Writing your First Python Program, Lines and Indentation, Python Identifiers, Various Operators and Operators Precedence, Getting input from User, Comments, Multi line Comments.

Strings, Decisions & Loop Control
Working With Numbers, Booleans and Strings, String types and formatting, String operations, Simple if Statement, if-else Statement, if-elif Statement. Introduction to while Loops, for Loops, Using continue and break.

Python Data Types
List, Tuples, Dictionaries
Python Lists, Tuples, Dictionaries, Accessing Values, Basic Operations, Indexing, Slicing, and Matrixes, Built-in Functions & Methods, Exercises on List, Tuples and Dictionary

Functions And Modules
Introduction To Functions
Defining & Calling Functions, Functions With Multiple Arguments. Anonymous Functions – Lambda, Using Built-In Modules, User-Defined Modules, Module Namespaces, Iterators and Generators

File I/O And Exception Handling
Opening and Closing Files, open Function, file Object Attributes, close() Method, Read, write, seek. Exception Handling, try-finally Clause, Raising an Exception, User-Defined Exceptions, Regular Expression - Search and Replace

Data Analysis Using Numpy
Introduction to Numpy. Array Creation, Printing Arrays, Basic Operation - Indexing, Slicing and Iterating, Shape Manipulation - Changing shape, stacking and splitting of array, Vector stacking, Broadcasting with Numpy, Numpy for Statistical Operation. Regular Expression Modifiers, Regular Expression Patterns

Data Analysis Using Pandas
Pandas: Introduction to Pandas, Importing data into Python, Pandas Data Frames, Indexing Data Frames, Basic Operations With Data frame, Renaming Columns, Subsetting and filtering a data frame.

Data Visualization using Matplotlib
Matplotlib: Introduction, plot(), Controlling Line Properties, Subplot with Functional Method, Multiple Plot, Working with Multiple Figures, Histograms

Data Visualization using Seaborn
Seaborn: Introduction to Seaborn and Visualizing statistical relationships, Import and Prepare data. Plotting with categorical data and Visualizing linear relationships. Seaborn Exercise

Statistics
Fundamentals of Math and Probability
Probability distribution function & cumulative distribution function. Conditional Probability, Bayes' Theorem, Problem solving for probability assignments, Random Experiments, Mutually Exclusive Events, Joint Events, Dependent & Independent Events
Introduction to Statistics, Statistical Thinking
Variable and its types: Quantitative, Categorical, Discrete, Continuous (all with examples). Five Point Summary and Box Plot, Outliers, Causes of Outliers, How to treat Outliers, IQR Method and Z-Score Method

Inferential Statistics
Central Limit Theorem, Point estimate and Interval estimate, Creating confidence interval for population parameter, Characteristics of Z-distribution and T-Distribution. Type of test and rejection region. Type of errors in Hypothesis Testing

Population & Sample
Population vs Sample, Sample Size, Simple Random Sampling, Systematic Sampling, Cluster Sampling, Stratified Sampling, Convenience Sampling, Quota Sampling, Snowball Sampling and Judgement Sampling

Descriptive Statistics
Measures of Central Tendency – Mean, Median and Mode. Measures of Dispersion – Standard Deviation, Variance, Range, IQR (Inter-Quartile Range). Measure of Symmetricity/Shape – Skewness and Kurtosis.

Hypothesis Testing
Type of test and Rejection Region, Type of errors - Type 1 Errors, Type 2 Errors. P value method, Z score Method. The Chi-Square Test of Independence. Regression. Factorial Analysis of Variance. Pearson Correlation Coefficients in Depth. Statistical Significance, Null and Alternative Hypothesis, One-tailed and Two-tailed Tests, Critical Value, Rejection region, Inference based on Critical Value

Binomial Distribution
Assumptions of Binomial Distribution, Normal Distribution, Properties of Normal Distribution, Z table, Empirical Rule of Normal Distribution & Central Limit Theorem and its Applications, T-Test, Analysis of variance (ANOVA), and Analysis of Covariance (ANCOVA), Regression analysis in ANOVA

Class Hands-on:
Problem solving for C.L.T, Problem solving Hypothesis Testing, Problem solving for T-test, Z-score test, Case study and model run for ANOVA, ANCOVA

Linear Algebra
Dot Product, Projecting Point on Axis. Matrices in Python, Element Indexing, Square Matrix, Triangular Matrix, Diagonal Matrix, Identity Matrix, Addition of Matrices, Scalar Multiplication, Matrix Multiplication, Matrix Transpose, Determinant, Trace

Data Processing & Exploratory Data Analysis
What is Data Wrangling? Data Pre-processing and cleaning. How to restructure the data? What is Data Integration and Transformation?

EDA
Finding and Dealing with Missing Values. What are Outliers? Using Z-scores to Find Outliers. Bivariate Analysis, Scatter Plots and Heatmaps. Introduction to Multivariate Analysis

Machine Learning
Definition, Examples, Importance of Machine Learning. Definition of ML Elements: Algorithm, Model, Predictor Variable, Response Variable, Training - Test Split, Steps in Machine Learning, ML Models Type: Supervised Learning, Unsupervised Learning and Reinforcement Learning.

Regression and Classification Models
Definition of regression, OLS Algorithm, Sum of Squares of residuals, Gradient Descent Algorithm, Cost Function. Evaluation Metrics for Regression Model: MAE, MSE, RMSE, R Square, Adjusted R Square

Linear Regression Model
Comparing MAE, MSE, and RMSE. Significance of Adjusted R square. Overfitting and Underfitting. Bias and Variance. Regularization methods: Ridge and Lasso, Multicollinearity, VIF. Using Python library Sklearn to create the Linear Regression Model and evaluate the model created.

Data Preprocessing
Types of Missing values (MCAR, MAR, MNAR), Methods to handle missing values. Outliers, Methods to handle outliers: IQR Method, Z-Score Method. Feature Scaling: Definition, Methods: Absolute Maximum Scaling, Min-Max Scaler, Normalization, Standardization, Robust Scaling. Encoding the data: Definition, Methods: OneHot Encoding, Mean Encoding, Label Encoding, Target Guided Ordinal Encoding

Logistic Regression Model
Definition. Why is it called the “Regression model”? Sigmoid Function, Transformation & Graph of Sigmoid Function

Evaluation Metrics for Classification Model
Confusion Matrix, Accuracy, Misclassification, TPR, FPR, TNR, Precision, Recall, F1 Score, ROC Curve, and AUC. Using Python library Sklearn to create the Logistic Regression Model and evaluate the model created

K Nearest Neighbours Model
Definition, Steps in KNN Model, Types of Distance: Manhattan Distance, Euclidean Distance, ‘Lazy Learner Model’. Confusion Matrix of Multi Class Classification, Using Python library Sklearn to create the K Nearest Neighbours Model and evaluate the model

Decision Tree Model
Definition, Basic Terminologies, Tree Splitting Constraints, Splitting Algorithms: CART, C4.5, ID3, CHAID, Splitting Methods: GINI, Entropy, Chi-Square, and Reduction in Variance, Using Python library Sklearn to create the Decision Tree Model and evaluate the model created

Random Forest Model
Ensemble Techniques: Bagging/bootstrapping & Boosting. Definition of Random Forest, OOB Score, K-Fold Cross-Validation, Hyperparameter Tuning, GridSearchCV, Variable Importance. Using Python library Sklearn to create the Random Forest Model and evaluate the model created. Use cases

Naive Bayes Model
Definition, Advantages, Bayes' Theorem Applicability, Disadvantages of Naive Bayes Model, Laplace's Correction, Types of Classifiers: Gaussian, Multinomial and Bernoulli. Using Python library Sklearn to create the Naive Bayes Model and evaluate the model created

SQL
RDBMS and SQL Operations. Single Table Queries - SELECT, WHERE, ORDER BY, DISTINCT, AND, OR. Multiple Table Queries: INNER, SELF, CROSS, and OUTER Join, Left Join, Right Join, Full Join, Union

Advanced SQL Operations
Data Aggregations and summarizing the data, Ranking Functions: Top-N Analysis, Advanced SQL Queries for Analytics

NoSQL, HBase & MongoDB
NoSQL Databases, Introduction to HBase, HBase Architecture, HBase Components, Storage Model of HBase. HBase vs RDBMS, Introduction to MongoDB, CRUD, Advantages of MongoDB over RDBMS

JSON Data & CRUD
Basics and CRUD Operation, Databases, Collection & Documents, Shell & MongoDB drivers, What is JSON Data, Create, Read, Update, Delete, Finding, Deleting, Updating, Inserting Elements, Working with Arrays, Understanding Schemas and Relations

Programming with SQL
Mathematical Functions, Variables, Conditional Logic, Loops, Custom Functions, Grouping and Ordering, Partitioning, Filtering Data, Subqueries

Power BI
Installing Power BI Desktop and Connecting to Data, Overview of the Workflow in Power BI Desktop, Introducing the Different Views of the Data Model, Query Editor Interface, Working on Data Model, Connecting Dataset with Power BI, Creating a Customer Segmentation Dashboard, Analyzing the Customer Segmentation Dashboard, Working with Time Series, Understanding aggregation and granularity, Filters and Slicers in Power BI, Maps, Scatterplots and BI Reports

Big Data & Spark Analytics
Distributed Architecture - A Brief Overview. Understanding Big Data, Introduction to Hadoop, Hadoop Architecture, HDFS, Overview of MapReduce Framework, Hadoop Master-Slave Architecture, MapReduce Architecture, Use cases of MapReduce

Getting to know PySpark
Introduction to Spark RDD, Introduction to Spark SQL and Data frames, Using R-Spark for machine learning
Hands-on: Installation and configuration of Spark, Using R-Spark for machine learning programming
PySpark Introduction, PySpark Environment Setup, PySpark - Spark context, RDD, Broadcast and Accumulator, SparkConf and SparkFiles, Spark MLlib Overview, Algorithms and utilities in Spark MLlib
MapReduce Use Case 1: YouTube data analysis
MapReduce Use Case 2: Uber data analytics
Spark RDD programming
Spark SQL and Data frame programming

Time Series
Introduction to Time Series Forecasting
Basics of Time Series Analysis and Forecasting, Method Selection in Forecasting, Moving Average (MA) Forecast Example, Different Components of Time Series Data, Log Based Differencing, Linear Regression for Detrending
Introduction to ARIMA Models
ARIMA Model Calculations, Manual ARIMA Parameter Selection, ARIMA with Explanatory Variables, Understanding Multivariate Time Series and their Structure, Checking for Stationarity and Differencing the MTS, Time series classification of smartphone data to predict user behavior, Performing Time Series Analysis on Stock Prices, Time series forecasting of sales data

Excel
Getting started with Excel
Creating a New Workbook, Navigating in Excel, Moving the Cell Pointer, Using Excel Menus, Creating Headers, Footers, and Page Numbers, Adjusting Page Margins and Orientation, Adding Print Titles and Gridlines, rows to repeat at top of each page
Formatting Fonts & Values, Adjusting Row Height and Column Width, Changing Cell Alignment, Adding Borders, Applying Colors and Patterns, Using the Format Painter, Merging Cells, Rotating Text, Formatting, Editing, finding, filtering, Referencing, Chart
Creating a PivotTable, Specifying the Data a PivotTable Analyzes, Changing a PivotTable's Calculation, Selecting What Appears in a PivotTable, Grouping Dates in a PivotTable, Updating a PivotTable, Formatting and Charting a PivotTable
Automating Tasks with Macros, Recording a Macro, Playing a Macro and Assigning a Macro Shortcut Key
Data Science & AI Certification Program
20 Assignments, 10 Minor Projects, 2 Major Projects
Minimum Qualification
Working professional having more than 8 years of experience in any domain (Technical/Non-Technical)
Qualification: BE/B.Tech (from any branch), BBA/MBA, MCA/M.Tech, B.Com, B.Sc (in any branch)

Model Training & Deployment using (AWS, GCP)

AWS (Amazon Web Services)
Deployment Strategies, Automations, Monitoring and Logging, Communication and Collaboration

GCP (Google Cloud Platform)
GCP Development Tools - Cloud SDK, Repositories, Plugins, Deployment Manager and Cloud Endpoint

Introduction to AWS and GCP Cloud ML Engine
Cloud ML Engine & AWS in Machine Learning Workflow, Components of AWS & Cloud ML Engine, GCP and AWS Console, gcloud command-line tool and REST API

Deploying Machine Learning Model
Deploying Models, Understanding training graphs and serving graphs, Check and adjust model size, Build an optimal prediction graph, Creating input function, Creating a model version, Getting Online Prediction

Training Machine Learning Model
Developing a trained model application, Running and monitoring a machine learning model, Using hyperparameter tuning, Using GPUs for training models in the cloud

Image Processing
Histogram equalization, Thresholding and Convolution, Sharpening and edge detection, Morphological transformations, Image pyramid

Data Science & AI Certification Program for Professionals