0% found this document useful (0 votes)
4 views31 pages

Machine Learning

Machine Learning And Business
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views31 pages

Machine Learning

Machine Learning And Business
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 31

Supervised machine

learning
D ATA S C I E N C E F O R B U S I N E S S L E A D E R S

Ramnath Vaidyanathan
VP of Product Research, DataCamp
What is supervised machine learning?
Machine learning: Predictions from data

Supervised machine learning: Predictions from data with labels and features
Recommendation systems

Email subject optimization

Churn prediction

DATA SCIENCE FOR BUSINESS LEADERS


Case study: churn prediction

DATA SCIENCE FOR BUSINESS LEADERS


Case study: churn prediction

DATA SCIENCE FOR BUSINESS LEADERS


Case study: churn prediction

DATA SCIENCE FOR BUSINESS LEADERS


Recap
Make a prediction based on data

Data has features and labels


Label: what we want to predict

Features: data that might predict the label

Trained model can make predictions

DATA SCIENCE FOR BUSINESS LEADERS


Model evaluation
Split historical data into training and testing sets

DATA SCIENCE FOR BUSINESS LEADERS


Model evaluation
Prediction Reality

Churn 0% 3%

Remain 100% 97%

DATA SCIENCE FOR BUSINESS LEADERS


Clustering
D ATA S C I E N C E F O R B U S I N E S S L E A D E R S

Ramnath Vaidyanathan
VP of Product Research, DataCamp
What is clustering?

Divide data into categories

Use cases
Customer segmentation

Image segmentation

Anomaly detection

DATA SCIENCE FOR BUSINESS LEADERS


Supervised Machine Learning Unsupervised Machine Learning

DATA SCIENCE FOR BUSINESS LEADERS


Case study: customer segmentation

DATA SCIENCE FOR BUSINESS LEADERS


Case study: customer segmentation
De ne features
Number of ights in the past year

Percent international

Advanced planning

Percent business class

DATA SCIENCE FOR BUSINESS LEADERS


Case study: customer segmentation
De ne number of clusters

DATA SCIENCE FOR BUSINESS LEADERS


Case study: customer segmentation

DATA SCIENCE FOR BUSINESS LEADERS


Clustering review
De nition

Divide unlabeled dataset into different categories

Steps

Select features

Select number of clusters

Use clusters to solve business problems

DATA SCIENCE FOR BUSINESS LEADERS


Special topics in
Machine Learning
D ATA S C I E N C E F O R B U S I N E S S L E A D E R S

Ramnath Vaidyanathan
VP of Product Research, DataCamp
Time series forecasting
Time is a feature

Accounts for weekly, monthly, or yearly trends

DATA SCIENCE FOR BUSINESS LEADERS


Seasonality
Weekly: Lower television viewership on Fridays

Monthly: Higher spending at end of pay periods

Yearly: Less ice cream in the winter

DATA SCIENCE FOR BUSINESS LEADERS


Natural Language Processing
Dataset is text
Customer reviews

Tweets

Medical records

Email subjects

Possible uses
Classifying sentiment

Clustering medical records

DATA SCIENCE FOR BUSINESS LEADERS


Word counts
Sentence Texans Giants football great

The Texans are a great football team. 1 0 1 1

The Giants are a great football team. 0 1 1 1

DATA SCIENCE FOR BUSINESS LEADERS


Problems with word counts: negation
Sentence Texans Giants football great not

The Giants are a great football team. 0 1 1 1 0

The Giants are not a great football team. 0 1 1 1 1

DATA SCIENCE FOR BUSINESS LEADERS


Word counts and synonyms
Word counts don't help us consider synonyms

Example: "blue"
"sky-blue"

"aqua"

"cerulean"

Want to group as a single feature

DATA SCIENCE FOR BUSINESS LEADERS


Word embeddings
Create features that group similar words

Features have a mathematical meaning:

king - man + woman = queen

DATA SCIENCE FOR BUSINESS LEADERS


Review
Time series forecasting
Time is a feature

Seasonality

Natural Language Processing (NLP)


Text as input data

Word counts

Word embeddings

DATA SCIENCE FOR BUSINESS LEADERS


Deep Learning and
Explainable AI
D ATA S C I E N C E F O R B U S I N E S S L E A D E R S

Ramnath Vaidyanathan
VP of Product Research, DataCamp
What is Deep Learning?
AKA: Neural Networks or Neural Nets

Special area of Machine Learning

Requires more data

Best when inputs that are images or text

DATA SCIENCE FOR BUSINESS LEADERS


Explainable AI
Deep Learning Explainable AI

Highly accurate predictions Understandable by humans

Better for "What?" Better for "Why?"

DATA SCIENCE FOR BUSINESS LEADERS


Case Study: Explainable AI
1. Prediction: What a customer is likely to do

2. Explanation: Why a customer is likely to do it

DATA SCIENCE FOR BUSINESS LEADERS


Case Study: Inexplicable AI
Prediction only: Which letter is this likely to be?

DATA SCIENCE FOR BUSINESS LEADERS


When to use Deep Learning
1. Is the training data complex?

2. Do we have a very large amount of training data?

3. Does the model need to be predictive or explanatory?

DATA SCIENCE FOR BUSINESS LEADERS

You might also like