0% found this document useful (0 votes)
148 views4 pages

20191120122749-Data Science Certification Training

This data science certification training course covers a range of topics including data analytics, visualization, predictive analytics, machine learning, natural language processing, web scraping, and integrating Python with Hadoop and Spark. Students will learn essential skills through hands-on practice with sample projects and will receive study materials upon completion. The course provides an overview of data science concepts and covers tools like Python, NumPy, Pandas, Scikit-Learn, and Matplotlib.

Uploaded by

avsrao123
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
148 views4 pages

20191120122749-Data Science Certification Training

This data science certification training course covers a range of topics including data analytics, visualization, predictive analytics, machine learning, natural language processing, web scraping, and integrating Python with Hadoop and Spark. Students will learn essential skills through hands-on practice with sample projects and will receive study materials upon completion. The course provides an overview of data science concepts and covers tools like Python, NumPy, Pandas, Scikit-Learn, and Matplotlib.

Uploaded by

avsrao123
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

Data Science certification Training

Course description
In this course you will learn data analytics, data exploration, data visualization, predictive analytics
and descriptive analytics techniques. Hands-on practice by implementing various real-life industry
experiences in various domains. Learn more about why data science and machine learning are
revolutionizing the world. In this course, you will get complete overview what the data science is
today

Student Take away


 Study Material
 Learning stuff
 Sample project for practice

Data Science certification Training online training curriculum


Introduction to Data Science
 What is Data Science?
 Data Scientists
 Examples of Data Science
 Python for Data Science

Data Analytics Overview


 Data Visualization
 Processes in Data Science
 Data Wrangling, Data Exploration, and Model Selection
 Exploratory Data Analysis or EDA
 Data Visualization
 Plotting
 Hypothesis Building and Testing

Statistical Analysis and Business Applications


 Introduction to Statistics
 Statistical and Non-Statistical Analysis
 Some Common Terms Used in Statistics
 Data Distribution: Central Tendency, Percentiles, Dispersion
 Histogram
 Bell Curve
 Hypothesis Testing
 Chi-Square Test
 Correlation Matrix
 Inferential Statistics
Python: Environment Setup and Essentials
 Introduction to Anaconda
 Installation of Anaconda Python Distribution - For Windows, Mac OS, and Linux
 Jupyter Notebook Installation
 Jupyter Notebook Introduction
 Variable Assignment
 Basic Data Types: Integer, Float, String, None, and Boolean; Typecasting
 Creating, accessing, and slicing tuples
 Creating, accessing, and slicing lists
 Creating, viewing, accessing, and modifying dicts
 Creating and using operations on sets
 Basic Operators: 'in', '+', '*'
 Functions
 Control Flow

Mathematical Computing with Python (NumPy)


 NumPy Overview
 Properties, Purpose, and Types of ndarray
 Class and Attributes of ndarray Object
 Basic Operations: Concept and Examples
 Accessing Array Elements: Indexing, Slicing, Iteration, Indexing with Boolean Arrays
 Copy and Views
 Universal Functions (ufunc)
 Shape Manipulation
 Broadcasting
 Linear Algebra

Scientific computing with Python (Scipy)


 SciPy and its Characteristics
 SciPy sub-packages
 SciPy sub-packages –Integration
 SciPy sub-packages – Optimize
 Linear Algebra
 SciPy sub-packages – Statistics
 SciPy sub-packages – Weave
 SciPy sub-packages - 10

Data Manipulation with Python (Pandas)


 Introduction to Pandas
 Data Structures
 Series
 Data Frame
 Missing Values
 Data Operations
 Data Standardization
 Pandas File Read and Write Support
 SQL Operation
Machine Learning with Python (Scikit–Learn)
 Introduction to Machine Learning
 Machine Learning Approach
 How Supervised and Unsupervised Learning Models Work
 Scikit-Learn
 Supervised Learning Models - Linear Regression
 Supervised Learning Models: Logistic Regression
 K Nearest Neighbours (K-NN) Model
 Unsupervised Learning Models: Clustering
 Unsupervised Learning Models: Dimensionality Reduction
 Pipeline
 Model Persistence
 Model Evaluation - Metric Functions

Natural Language Processing with Scikit-Learn


 NLP Overview
 NLP Approach for Text Data
 NLP Environment Setup
 NLP Sentence analysis
 NLP Applications
 Major NLP Libraries
 Scikit-Learn Approach
 Scikit - Learn Approach Built - in Modules
 Scikit - Learn Approach Feature Extraction
 Bag of Words
 Extraction Considerations
 Scikit - Learn Approach Model Training
 Scikit - Learn Grid Search and Multiple Parameters
 Pipeline

Data Visualization in Python using Matplotlib


 Introduction to Data Visualization
 Python Libraries
 Plots
 Matplotlib Features:
 Line Properties Plot with (x, y)
 Controlling Line Patterns and Colours
 Set Axis, Labels, and Legend Properties
 Alpha and Annotation
 Multiple Plots
 Subplots
 Types of Plots and Seaborne
Data Science with Python Web Scraping
 Web Scraping
 Common Data/Page Formats on The Web
 The Parser
 Importance of Objects
 Understanding the Tree
 Searching the Tree
 Navigating options
 Modifying the Tree
 Parsing Only Part of the Document
 Printing and Formatting
 Encoding

Python integration with Hadoop, Map Reduce and Spark


 Need for Integrating Python with Hadoop
 Big Data Hadoop Architecture
 Map Reduce
 Cloud era Quick Start VM Set Up
 Apache Spark
 Resilient Distributed Systems (RDD)
 PySpark
 Spark Tools
 PySpark Integration with Jupyter Notebook

You might also like