0% found this document useful (0 votes)
165 views13 pages

Integrated Data Science Certification - DexLab Analytics - Big Data Hadoop SAS R Analytics Predictive Modeling & Excel VBA

The document describes an Integrated Data Science Certification program that includes training in R Programming, Python, machine learning, forecasting, deep learning, and Tableau. The 160-hour program is offered both in a classroom setting and online, and costs INR 44,000 or USD 725 for installments and INR 41000 or USD 676 in a onetime payment. The program modules cover topics such as data management, data manipulation, graphics and plotting, web scraping, SQL, machine learning concepts, probability, sampling theory, and predictive modeling.

Uploaded by

Rohit Roy
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
165 views13 pages

Integrated Data Science Certification - DexLab Analytics - Big Data Hadoop SAS R Analytics Predictive Modeling & Excel VBA

The document describes an Integrated Data Science Certification program that includes training in R Programming, Python, machine learning, forecasting, deep learning, and Tableau. The 160-hour program is offered both in a classroom setting and online, and costs INR 44,000 or USD 725 for installments and INR 41000 or USD 676 in a onetime payment. The program modules cover topics such as data management, data manipulation, graphics and plotting, web scraping, SQL, machine learning concepts, probability, sampling theory, and predictive modeling.

Uploaded by

Rohit Roy
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 13

5/14/2020 Integrated Data Science Certification - DexLab Analytics | Big Data Hadoop SAS R Analytics Predictive Modeling &

Modeling & Excel VBA

Integrated Data Science Certification

R Programming, Python, Machine Learning, Forecasting, Deep Learning, Tableau

160 Hours
Weekday & Weekend

Classroom & Live Online

INR 44000 plus GST (installments)


INR 41000 plus GST (onetime payment)

USD 725(installments)
USD 676(onetime payment)

- INTEGRATED DATA SCIENCE CERTIFICATION TRAINING MODULE

R Programming

Introduction to R
What is R,What is S History of R, Features of R
Comparing Tools Installing R
R interfaces, R Library Data Types, Attributes
Entering Input, Explicit Coercion Data Frame, List, Factors, Vectors
Objects, Matrix Dimensions, Matrix Design CBind and RBind, Missing Value

Data management

Reading Data Writing data


Reading data files with tables Files connection
Reading lines of Text files

Dplyr (Data Manipulation)


https://www.dexlabanalytics.com/courses/data-science-machine-learning-certification?pd=Integrated Data Science Certification 1/13
5/14/2020 Integrated Data Science Certification - DexLab Analytics | Big Data Hadoop SAS R Analytics Predictive Modeling & Excel VBA

Sorting Data Subsetting Data


Merging Data Aggregating Data
Reshaping Data Control Structures
If, For loop Repeat, While loop
Next, Return Apply, Lapply, Sapply,
Tapply, Mapply

Graphics and Plotting using R


Basic plotting and Advance plotting (ggplot2) Graphical Parameters
Combining Plot Density Plot, Histogram
Dot Pot,Bar Plots,Line charts,Pie charts Box plot, Scatter Plot
Saving plots

User Defined Functions


Built-in Function,User-defined Function Calling a Function
Calling a Function with or without an Argument Lazy Evaluation of Function

Web Scrapping
Download files from internet Extract data from Wikipedia
Extract data from website Extract data from PDF
Extract data from Facebook

Sql with R
Connecting with Mysql server Connecting with Microsoft sql server
Table creating, editing, updating and merging

TidyR (Data Cleaning)

Clean datasets according standard Date and Times

https://www.dexlabanalytics.com/courses/data-science-machine-learning-certification?pd=Integrated Data Science Certification 2/13


5/14/2020 Integrated Data Science Certification - DexLab Analytics | Big Data Hadoop SAS R Analytics Predictive Modeling & Excel VBA

Pattern Matching Regular Expression

Project
Twitter Sentiment analysis

Python

Introduction
Course Curriculum Overview & Python 3 Installation Notes
Quick Note on Jupyter Notebook Python Installation — Windows, Ubuntu
IDE Selection Jupyter (iPython) Notebooks

Basic Command
Numbers Strings
Print Formatting Lists
Dictionaries Tuples
Files Sets and Booleans
Resources for More Basic Practice

Condition, Expression and Loop


Introduction to Python Statements if, elif, and elif Statements
for Loops while Loops
range() Exception Handling: try, except, finally
Errors and Exceptions Homework Errors and Exceptions – Solutions

Function and Importing & Exporting


Introduction to functional programming Function recipe
Function parameters and resuse Recursive functions
https://www.dexlabanalytics.com/courses/data-science-machine-learning-certification?pd=Integrated Data Science Certification 3/13
5/14/2020 Integrated Data Science Certification - DexLab Analytics | Big Data Hadoop SAS R Analytics Predictive Modeling & Excel VBA

Creating modules Lamda


Import & Export Text file
CSV Excel
JSON HTML

PLOT
Matplotlib Seaborn
Plotly

Library Numpy (array matrix)


Numpy

Library Panda (Data Frame)


Panda

Regular Expression

Pattern Matching and replacement Project

Date and Time and Missing Treatment


Date and Time Data wrangling and cleaning
Missing treatment

Python & MySQL Connection

Web Scraping using Python


Prepare Text Data with Scikit learn

https://www.dexlabanalytics.com/courses/data-science-machine-learning-certification?pd=Integrated Data Science Certification 4/13


5/14/2020 Integrated Data Science Certification - DexLab Analytics | Big Data Hadoop SAS R Analytics Predictive Modeling & Excel VBA

Core Analytics, Predictive Modeling And Machine


Learning

Introduction to Analytics
Evolution of Analytics Definition of Analytics
Scope of analytics in different industries

Types of Analytics
Descriptive Analysis Predictive Analysis
Prescriptive Analysis

Concepts of Analytics
Confirmatory & Exploratory Analysis Different Scale of Measurement-Nominal, Ordinal,
Interval
Ratio Attribute and Variable concept Graphical Representation of Data
Measures of Central Tendency-Mean, Median, Mode Measures of Dispersion-Range, Variance, Standard
Deviation
Measures of Location-Quartiles, Interquartile Range Outliers & Box Plot Graphs

Probability
Concept of Probability Probability mass function
Random Variables-Discrete and Continuous Binomial Distribution
Poisson Distribution Normal Distribution

Sampling Theory
Concept of sampling: Population and Sample Types of Sampling
Probability sampling-Simple, Stratified, Systematic Non probability Sampling-Convenience, Judgmental
Testing Of Hypothesis-Null and Alternative Type I error and Type II error

https://www.dexlabanalytics.com/courses/data-science-machine-learning-certification?pd=Integrated Data Science Certification 5/13


5/14/2020 Integrated Data Science Certification - DexLab Analytics | Big Data Hadoop SAS R Analytics Predictive Modeling & Excel VBA
Significance level Confidence Interval

Parametric Test
Concept of Parametric test Z test
T test Two independent sample T test
Paired sample T test

Association between Variables


Chi square Test for Independence Scatter Plot
Correlation Partial Correlation

Analysis Of Variance (ANOVA)


One-Way & Two-Way ANOVA Concept of Eigen Value and Eigen Vector

Machine Learning

Introduction
Introduction to Data Science and Artificial Introduction to Machine Learning
Intelligence
How Artificial Intelligence relates to Machine History of Machine Learning
Learning

Introduction of Basic Mathematical concepts used in Machine


Learning

Vectors
Vectors Operations (Addition, subtract,
multiplication)
Sparsh Vector
Dense Vector

https://www.dexlabanalytics.com/courses/data-science-machine-learning-certification?pd=Integrated Data Science Certification 6/13


5/14/2020 Integrated Data Science Certification - DexLab Analytics | Big Data Hadoop SAS R Analytics Predictive Modeling & Excel VBA

Eigen Values
Eigen Vectors

Type of Machine Learning

Introduction of MLLIB (Machine Learning libraries) Machine Learning algorithms in theory


Supervised Learning Unsupervised Learning
Recommendation Systems

Supervised Learning

Classification

Decision Tree Naïve Bayes


Gaussian Naïve Bayes Logistic Regression
Linear Discriminant Analysis K-Nearest Neighbor (KNN)
Support Vector Machine (SVM)

Regression
Linear Regression Ridge Regression
LASSO Regression ElasticNet Regression
Decision Tree Regressor KNN Regressor
Support Vector Regressor

Ensemble Algorithm

Boosting Random Forest


Extra Trees AdaBoost
Gradient Boosting Machine

Unsupervised Learning
https://www.dexlabanalytics.com/courses/data-science-machine-learning-certification?pd=Integrated Data Science Certification 7/13
5/14/2020 Integrated Data Science Certification - DexLab Analytics | Big Data Hadoop SAS R Analytics Predictive Modeling & Excel VBA

Dimension Reduction using PCA K Mean Clustering

Recommendation Systems

Collaborative Filtering Content Based Filtering


Singular Value Decomposition

Text Mining and Introduction to NLP

Regular Expressions Bag of Word Model

Projects and Case Studies (based on real time scenarios)


Recommender System (recommend movies to watch) Titanic Data (EDA and ML)
SMS spam classification Yelp reviews classification
Fake News Classifier Topic Modelling
Boston House price prediction Loan Default Prediction
Income Classification Prediction Irish data classification
Sonar VS Rock data classification Election data analysis
Cancer Classification Stock Market Prediction
Credit Risk Modelling

XGBoost Basics

Introduction to Gradient Boosting AdaBoost the First Boosting Algorithm


Introduction to XGBoost Algorithm XGBoost
XGBoost Model in Python with scikit-learn Problem Description: Predict Onset of Diabetes
Predictions with XGBoost Model Data Preparation for Gradient Boosting
Evaluate Models With k-Fold Cross-Validation Plot XGBoost Decision Tree

XGBoost Advanced
Save and Load Trained XGBoost Models XGBoost and Feature Selection
https://www.dexlabanalytics.com/courses/data-science-machine-learning-certification?pd=Integrated Data Science Certification 8/13
5/14/2020 Integrated Data Science Certification - DexLab Analytics | Big Data Hadoop SAS R Analytics Predictive Modeling & Excel VBA

Monitor Training Performance and Early Stopping Tune Multithreading Support for XGBoost
Train XGBoost Models in the Cloud with Amazon
Web Services

XGBoost Tuning
Tune the Number and Size of Decision Trees with Tune Learning Rate and Number of Trees with
XGBoost
Tuning Stochastic Gradient Boosting with XGBoost

Natural Language Processing (Deep Learning)

Introduction

Natural Language Processing foundations Deep Learning for Natural Language Processing
Keras Keras Model Life-Cycle
Keras Functional Models Project: Develop Deep Learning Models with
Keras

Data Preparation

How to Clean Text Manually and with NLTK Metamorphosis by Franz Kafka
Tokenization and Cleaning with NLTK Additional Text Cleaning

The Bag-of-Words Model Word Counts with CountVectorizer


Word Frequencies with TfidfVectorizer Hashing with HashingVectorizer

Prepare Text Data with Keras


Split Words with text to word sequence Encoding with one hot
Hash Encoding with hashing trick Tokenizer API

Bag-of-Words

https://www.dexlabanalytics.com/courses/data-science-machine-learning-certification?pd=Integrated Data Science Certification 9/13


5/14/2020 Integrated Data Science Certification - DexLab Analytics | Big Data Hadoop SAS R Analytics Predictive Modeling & Excel VBA

The Bag-of-Words Model Example of the Bag-of-Words Model


Managing Vocabulary Scoring Words

Project:

Prepare Movie Review Data for Sentiment Analysis Develop a Neural Bag-of-Words Model for Sentiment
Analysis

Word Embeddings

How to Develop Word Embedding with Genism Genism Python Library


Develop Word2Vec Embedding Visualize Word Embedding
Load Google’s Word2Vec Embedding Load Stanford’s GloVe Embedding
Learn and Load Word Embeddings in Keras

Text Classification

Neural Models for Document Classification Word Embeddings + CNN = Text Classification

Project:

Develop an Embedding + CNN Model for Sentiment Develop an n-gram CNN Model for Sentiment
Analysis Analysis

Language Modeling
Neural Language Modeling Statistical Language Modeling
Neural Language Models

Develop a Character-Based Neural Language Model


Sing a Song of Sixpence Generate Text
Develop a Word-Based Neural Language Model Jack and Jill Nursery Rhyme
Model 1: One-Word-In, One-Word-Out Sequences Model 2: Line-by-Line Sequence

https://www.dexlabanalytics.com/courses/data-science-machine-learning-certification?pd=Integrated Data Science Certification 10/13


5/14/2020 Integrated Data Science Certification - DexLab Analytics | Big Data Hadoop SAS R Analytics Predictive Modeling & Excel VBA

Model 3: Two-Words-In, One-Word-Out Sequence

Project:
Develop a Neural Language Model for Text The Republic by Plato
Generation

Image Captioning

Neural Image Caption Generation Describing an Image with Text


Encoder-Decoder Architecture

Neural Network Models for Caption Generation

Image Caption Generation Load and Use a Pre-Trained Object Recognition


Model
ImageNet The Oxford VGG Models
Load the VGG Model in Keras Develop a Simple Photo Classifier

BLEU Score

Bilingual Evaluation Understudy Score Calculate BLEU Scores


Cumulative and Individual BLEU Scores Prepare a Photo Caption Dataset for Modeling

Project:

Develop a Neural Image Caption Generation Model

Machine Translation

Statistical Machine Translation Neural Machine Translation


Encoder-Decoder Models for Neural Machine Encoder-Decoder Architecture for NMT
Translation
Sutskever NMT Model Cho NMT Model

https://www.dexlabanalytics.com/courses/data-science-machine-learning-certification?pd=Integrated Data Science Certification 11/13


5/14/2020 Integrated Data Science Certification - DexLab Analytics | Big Data Hadoop SAS R Analytics Predictive Modeling & Excel VBA
Configure Encoder-Decoder Models for Machine Encoder-Decoder Model for Neural Machine
Translation Translation

Project:
Develop a Neural Machine Translation Model German to English Translation Dataset

Major Projects
Chatbot using Deep Learning Recommender System

Tableau BI & Visualization


(Videos only)

Introduction
Overview of Tableau workbook, worksheets Dimension & Measures
Discrete and Continuous

Install Tableau Desktop and connect to database

Download Tableau Desktop (14 day Trial) How to connect Tableau Desktop to various sources
Create Data Extracts Live Connection
Custom SQL & Challenges Use of Mark/Size/Colour/Transparency/Label
Tool Tip/ Detail / View Highlight

Different Chart Types


Area Maps Scatter Plot
Heat Maps Bar Charts
Line Charts Histograms
Box and Whisker Pie Chart
Maps Dual Axis Charts

https://www.dexlabanalytics.com/courses/data-science-machine-learning-certification?pd=Integrated Data Science Certification 12/13


5/14/2020 Integrated Data Science Certification - DexLab Analytics | Big Data Hadoop SAS R Analytics Predictive Modeling & Excel VBA
Various other chart types

Data Organizing
Create Sets Create Bins
Create Groups Using Sets and Group to Increase the Efficiency of
Work.
Filters & Types of Filter Drilling, Alias
Sorting, Computed Sort, Axis Sort Hierarchies
Formatting Calculated Fields
Format Visualization Introduction
Size, Shapes, Details, Tooltips String / Date / Logical Functions
Labels, Annotations Calculated Column using single Formula.
Titles, Captions Aggregation / Dis-Aggregation
Legends, Highlight
Handle Null Values

Table Calculations
Using Table Calculations Difference between Calculated Field and Table
Calculations
Computation Direction for Table Calculation QA

Actions
Actions filters Action highlights

https://www.dexlabanalytics.com/courses/data-science-machine-learning-certification?pd=Integrated Data Science Certification 13/13

You might also like