Supervised Learning with Python: Concepts and Practical Implementation Using Python, 1st Edition
Vaibhav Verdhan
Apress Standard
The publisher, the authors and the editors are safe to assume that the
advice and information in this book are believed to be true and
accurate at the date of publication. Neither the publisher nor the
authors or the editors give a warranty, expressed or implied, with
respect to the material contained herein or for any errors or omissions
that may have been made. The publisher remains neutral with regard
to jurisdictional claims in published maps and institutional affiliations.
—Vaibhav Verdhan
Limerick,
Ireland. June 2020
Acknowledgments
I would like to thank Apress publications, Celestin John, Shrikant
Vishwarkarma, and Irfan Elahi for the confidence shown and the
support extended. Many thanks to Dr. Eli Kling for the fantastic foreword
to the book. Special words for my family—Yashi, Pakhi, and Rudra—
without their support it would have been impossible to complete this
work.
Table of Contents
Chapter 1: Introduction to Supervised Learning
What Is ML?
Relationship Between Data Analysis, Data Mining, ML, and AI
Data, Data Types, and Data Sources
How ML Differs from Software Engineering
ML Projects
Statistical and Mathematical Concepts for ML
Supervised Learning Algorithms
Regression vs. Classification Problems
Steps in a Supervised Learning Algorithm
Unsupervised Learning Algorithms
Cluster Analysis
PCA
Semi-supervised Learning Algorithms
Technical Stack
ML’s Popularity
Use Cases of ML
Summary
Chapter 2: Supervised Learning for Regression Analysis
Technical Toolkit Required
Regression Analysis and Use Cases
What Is Linear Regression
Assumptions of Linear Regression
Measuring the Efficacy of Regression Problem
Example 1: Creating a Simple Linear Regression
Example 2: Simple Linear Regression for Housing Dataset
Example 3: Multiple Linear Regression for Housing Dataset
Nonlinear Regression Analysis
Identifying a Nonlinear Relationship
Assumptions for a Nonlinear Regression
Challenges with a Regression Model
Tree-Based Methods for Regression
Case Study: Petrol Consumption Using Decision Tree
Ensemble Methods for Regression
Case Study: Petrol Consumption Using Random Forest
Feature Selection Using Tree-Based Methods
Summary
Chapter 3: Supervised Learning for Classification Problems
Technical Toolkit Required
Hypothesis Testing and p-Value
Classification Algorithms
Logistic Regression for Classification
Assessing the Accuracy of the Solution
Case Study: Credit Risk
Additional Notes
Naïve Bayes for Classification
Case Study: Income Prediction on Census Data
k-Nearest Neighbors for Classification
Case Study: k-Nearest Neighbor
The Dataset
Business Objective
Tree-Based Algorithms for Classification
Types of Decision Tree Algorithms
Summary
Chapter 4: Advanced Algorithms for Supervised Learning
Technical Toolkit Required
Boosting Algorithms
Using Gradient Boosting Algorithm
SVM
SVM in 2-D Space
KSVM
Case Study Using SVM
Supervised Algorithms for Unstructured Data
Text Data
Use Cases of Text Data
Challenges with Text Data
Text Analytics Modeling Process
Text Data Extraction and Management
Preprocessing of Text Data
Extracting Features from Text Data
Case Study: Customer Complaints Analysis Using NLP
Word Embeddings
Case Study: Customer Complaints Analysis Using Word Embeddings
Image Data
Use Cases of Image Data
Challenges with Image Data
Image Data Management Process
Image Data Modeling Process
Fundamentals of Deep Learning
Artificial Neural Networks
Activation Functions
Loss Function in a Neural Network
Optimization in a Neural Network
Neural Network Training Process
Case Study 1: Create a Classification Model on Structured Data
Case Study 2: Image Classification Model
Summary
Chapter 5: End-to-End Model Development
Technical Toolkit Required
ML Model Development
Step 1: Define the Business Problem
Step 2: Data Discovery Phase
Step 3: Data Cleaning and Preparation
Duplicates in the Dataset
Categorical Variable Treatment in Dataset
Missing Values Present in the Dataset
Imbalance in the Dataset
Outliers in the Dataset
Other Common Problems in the Dataset
Step 4: EDA
Step 5: ML Model Building
Train/Test Split of Data
Finding the Best Threshold for Classification Algorithms
Overfitting vs. Underfitting Problem
Key Stakeholder Discussion and Iterations
Presenting the Final Model
Step 6: Deployment of the Model
Step 7: Documentation
Step 8: Model Refresh and Maintenance
Summary
Index
About the Author
Vaibhav Verdhan
has 12+ years of experience in data
science, machine learning, and artificial
intelligence. An MBA with an engineering
background, he is a hands-on technical
expert with acumen to assimilate and
analyze data. He has led multiple
engagements in ML and AI across
geographies and across retail, telecom,
manufacturing, energy, and utilities
domains. Currently he resides in Ireland
with his family and is working as a
Principal Data Scientist.
About the Technical Reviewer
Irfan Elahi
is a full stack customer-focused cloud
analytics specialist bearing the unique
and proven combination of diverse
consulting and technical competencies
(cloud, big data, and machine learning)
with a growing portfolio of successful
projects delivering substantial impact
and value in multiple capacities across
telecom, retail, energy, and health-care
sectors. Additionally, he is an analytics
evangelist as is evident from the
published book, Udemy courses,
blogposts, trainings, lectures, and
presentations with global reach.
© Vaibhav Verdhan 2020
V. Verdhan, Supervised Learning with Python
https://doi.org/10.1007/978-1-4842-6156-9_1
What Is ML?
When we post a picture on Facebook, shop at Amazon, tweet, or
watch videos on YouTube, each of these platforms is collecting data
about us. With each of these interactions, we leave behind our digital
footprints. The data points generated are collected and analyzed, and
ML allows these giants to make logical recommendations to us. Based
on the genre of videos we like, Netflix/YouTube can update our playlist;
based on the links we click and the statuses we react to, Facebook can
recommend posts to us; and by observing what types of products we
frequently purchase, Amazon can suggest our next purchase to suit our
budget! Amazing, right?
The short definition for ML is as follows: “In Machine Learning, we
study statistical/mathematical algorithms to learn the patterns from
the data which are then used to make predictions for the future.”
ML is not limited to online platforms alone. Its power has
been extended to multiple domains, geographies, and use cases. We
describe those use cases in detail in the last section of this
chapter.
So, in ML, we analyze vast amounts of data and uncover the
patterns in it. These patterns are then applied to real-world data to
make predictions for the future. This real-world data is unseen, and the
predictions help businesses shape their respective strategies. We
do not need to explicitly program computers to do these tasks; rather,
the algorithms make decisions based on historical data and
statistical models.
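The idea of learning a pattern from historical data and then applying it to unseen data can be illustrated with a minimal sketch. The example below is not taken from the book: the data values, the variable names, and the helper functions fit_line and predict are hypothetical, and a simple least-squares line stands in for whatever algorithm a real project would use.

```python
# Hypothetical historical data: advertising spend (x) and resulting sales (y).
# The numbers are made up purely for illustration.
historical_spend = [1.0, 2.0, 3.0, 4.0, 5.0]
historical_sales = [3.1, 4.9, 7.2, 8.8, 11.0]


def fit_line(xs, ys):
    """Learn a slope and intercept from data by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # slope = covariance(x, y) / variance(x)
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    slope = cov_xy / var_x
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(model, x):
    """Apply the learned pattern to a new, unseen input."""
    slope, intercept = model
    return slope * x + intercept


# "Training": the pattern is derived from the data, not hand-coded.
model = fit_line(historical_spend, historical_sales)

# "Prediction": the learned pattern is applied to a value not seen before.
unseen_spend = 6.0
predicted_sales = predict(model, unseen_spend)
print(f"slope={model[0]:.2f}, intercept={model[1]:.2f}")
print(f"predicted sales for spend {unseen_spend}: {predicted_sales:.2f}")
```

Note that no rule such as "sales are roughly twice the spend" is written into the program; the relationship is estimated from the historical values, which is the sense in which the computer is not explicitly programmed for the task.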
But how does ML fit into the larger data analysis landscape? We often
encounter terms like data analysis, data mining, ML, and artificial
intelligence (AI). Data science is also a loosely used phrase with no
exact definition available. It is worth exploring these terms now.
Figure 1-3 Data quality plays a vital role in development of an ML solution; a lot of
time and effort are invested in improving data quality