0% found this document useful (0 votes)
19 views4 pages

DS Curriculum

Xyz
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
19 views4 pages

DS Curriculum

Xyz
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

Topics, Tools, and Techniques

Course Name Brief Description Learning Sprints


Covered
1. Data Analytics Learn to analyze numerical • Perform Basic Data Analysis • Excel functions (range, sort,
Using Excel and categorized data using Using Excel filter, grouping, Lookups)
MS Excel as a tool. Use • Measure Central • Statistical concepts (Measures of
Descriptive Statistical Tendencies using Statistical central Tendency and Dispersion)
techniques to analyse data. Techniques • Visualization using Graphs and
Visually present the analysis • Data Visualization using Charts (bar, pie, histogram,
using graphs, charts and Excel scatter plots, and trend lines)
dashboards using Excel • Measuring the Spread • EDA best practices
and Correlation of Data
• Story telling using data
• Exploratory Data Analysis
(EDA)
2. Analytics using Learn to represent data in • Working with RDBMS • ER modelling
SQL the form of tables in a • Creating Database Objects • Normalization fundamentals
Relational Database. Create, • Setting Constraints to • DDL (create tables, set
update and retrieve data Populate & Query Data constrains, insert data, modify
stored in MySQL (RDBMS) • Query & Summarize Data tables and drop tables)
database. Use Structured • Retrieve Data from Multiple • DML(query using grouping,
Query Language (SQL) to Tables filtering, conditional query, query
perform various operations logical operators)
on the database. • Joins, Subqueries & views
3. Introduction to Learn to apply problem • Working with the Python • Programming constructs (syntax
Programming solving techniques to Interpreter and semantics)
using Python decompose a problem into • Writing simple programs in • Writing simple programs using
computational steps. Then Python arithmetic and logical operators
to translate the steps into a • Writing modular programs (and, or, not)
program using the Python in Python • Decision making using
language. • Working with In-built data conditional operators
structures in Python • (if, else and elif)
• Working with files in Python • Looping constructs (for and while
loop)
• User defined functions
• Lambda functions
4. Python for Use NumPy and Pandas • Creating & manipulating • Series
Data Science libraries in Python to arrays (1D & 2D) using • 1 D NumPy Arrays
perform data analysis. NumPy • 2 D NumPy Arrays
Launch Python scripts/shell • Creating & Manipulating • Reading CSV files into Data
scripts to automate tasks. Series and Data Frames Frames
Connect Python program to • Manipulating Data Frames • Data Frame manipulation
databases to retrieve and • Advanced Data Frame functions
manipulate data. Manipulation operations • Data Visualization using
• Analyzing & visualizing data Matplotlib.
using Python libraries • SQL and Python Interface
• Connecting SQL and Python

5. Statistics & Learn to apply the concepts • Statistical Analysis using • Measures of Central Tendency
Data of Descriptive and Python • Measures of Dispersion
Visualization Inferential Statistics to • Prediction using • Fundamentals of probability
using Python analyze data, using Python Probability (simple, joint and conditional)
as a tool. • Data Sampling & • Probability distributions
Distribution (Binomial, Poisson’s)
• Prediction using • Hypothesis testing (Z and T Test
Probability Distribution of means)
• Predicting events using • ANOVA
Binomial and Poison’s • Chi Square Tests
distribution
• Parametric Testing
• Non Parametric Testing
• Chi Square Testing
6. Exploratory Learn and apply the best • EDA using Descriptive • EDA best practice demonstration
Data Analysis practices of EDA in a Statistics • Univariate, Bivariate and Multi-
systematic way (Univariate, • EDA hackathon 1 variate analysis
Bivariate and Multi-variate • EDA using inferential • Data visualization using graphs
Analysis) along with the Statistics and charts
statistical concepts. Present • EDA hackathon 2
the result of this analysis • EDA hackathon 3
visually.
7. Web Scraping Use Beautiful Soup to scrape • Working with API’s • Working with Web API’s
& Introduction the Web by learning how to • Breaking down HTML and • HTML and JSON fundamentals
to Text work with Web API’s. Learn JSON • Scraping using Beautiful Soup
Analytics the fundamentals of Text • Scraping the Web for data • Regular Expressions
Analytics using spaCy library • Data cleaning using Regular • Fundamentals of Text Analysis
in Python. Expressions using the spaCy library
• Analyze Text Data to detect (stemming, lemmatization, stop
sentiments words, phrase matching and
tokenization using spaCy)
8. Unstructured Use MongoDB to store • Justifying the need for No- • Document database
Data Analysis unstructured data. Querying SQL Database • MongoDB,
using a No-SQL complex data structures to • Installing and Exploring • CRUD operations
Database extract insights from real MongoDB • Embedded documents
world datasets. • Creating database objects • Data analysis using sorting,
and inserting data filtering, identifying correlations
• Analyze unstructured data and projection
through MongoDB
9. Exploratory This course will help you deep • Build a view to explore data • Dimensions & Measures
Data Analysis dive into Visual Analytics. It • Structure data for analysis in • Discrete (blue), or continuous
using Tableau covers various advanced Tableau (green) Row level or Aggregates
techniques for analyzing data • Aggregate and disaggregate or multi-level aggregation
to enhance your EDA skills data in Tableau • Concepts related to the use of
• Creating Charts and Graphs graphs - Line, Bar, Stacked Bar,
• Creating data hierarchies Pie, Spider, Word Cloud, …)
10. Story Telling Learn to build Dashboards • Creating Story Lines • Drill Downs
using Tableau and tell stories using Data. • Data Blending in Tableau • Building Story lines
• Adding Filters • Interactive Dashboards
• Adding actions to Dashboards
11. Data Modelling Demonstrate the ability to • Building a mathematic model • Linear Regression
using deploy ML algorithms for for prediction • Logistic Regression
Supervised predicting continuous • Predict continuous variable • Lasso and Ridge Regression
Machine variables and classification of using Linear Regression • Naïve Bayes Classifier
Learning categorical data. • Binomial classification of • Classification and Regression
Apply feature engineering events/outcomes using Trees
techniques to optimize the Logistic Regression • Feature Engineering
model accuracy. • Multinomial classification • KNN
using Naïve Bayes and KNN
algorithms
• Classification and Regression
using Decision Trees

12. Data Modelling Demonstrate the ability to • Clustering data using K- • K-Means Clustering
using Un- deploy ML algorithms for Means • DB Scan
supervised clustering similar data. Ability • Clustering data using DB Scan • Hierarchical Clustering
Machine to choose an ensemble of ML & Hierarchical clustering • Random Forest
Learning algorithms for a given • Optimizing predictions using • Bagging
problem and fine tune the Random Forest • Boosting
accuracy. • Optimizing prediction using
Bagging & Boosting
techniques
13. Deep Learning Understand the concept of • Building a single layer Neural • ANN
for Analytics neurons to build neural Network for simple linear • CNN
networks for analytics using regression • RNN
Keras and TensorFlow. Apply • Building multi-layer neural • XAI
these concepts to build neural networks for card churn
net models for solving analysis
regression and classification • Building recurrent neural
problems. networks to predict stock
prices
• The future of AI : XAI
14. NLP using Learn advanced Natural • Word embedding • GRU
Deep learning Language Processing (NLP) • Sentiment analysis • LSTM
Techniques using Convolutional Neural • Topic modeling
Networks (CNN) and • Text Summarization
Recurrent Neural Networks • Building chat bot
(RNN) to build chat bots
15. Image Learn to label, segment and • Pre-processing and labeling • CNN
Classification classify images data using of image data • GAN
using Deep CNNs and Generative • Image segmentation
Learning Adversarial Networks (GAN) Object Detection
• Image classification
• Image creation
• Outlook from Image to Text
• Video Analysis
16. Time Series Understand the significance • ETS • Error-Trend-Seasonality
Data Analytics of Time Series Data and apply • Evaluating Forecast Models • ARIMA
statistical tools to predict an • Auto Regression using Stats • SARIMA
event in time. Optimize the Model • ARIMAX
predictions in Time Series • Forecasting using ARMA and
Analysis using Deep Learning ARIMA
concepts. • Forecasting using SARIMA
and SARIMAX
• Time Series Analysis using
Deep Learning
Capstone Project Complete end-to-end ML • Complete a data analysis • Combine all knowledge & skills
project project including ML learnt to solve a problem
modelling • Walk-thru the solution
• 2-3 member small agile team
• Final project presentation

You might also like