INtro To ML
INtro To ML
Machine Learning
• Types of Machine Learning Problems
• Steps to solve a MachineLearning Problem
• Summary
1
6
What is a cat?
1
7
What is a cat?
1
8
Introduction to machine learning
Introduction to Machine Learning
20
Introduction to Machine Learning
21
Introduction to Machine Learning
Supervised
Unsupervised
Reinforcement
22
Introduction to Machine Learning
Types of machine learning problems
Learn through examples of which we knowthe
Supervised desired output (what we want topredict).
Reinforcement
23
Introduction to Machine Learning
Supervised Classification
Unsupervised
Regression
Output is continuous
Reinforcement (e.g., price, temperature)
24
Introduction to Machine Learning
Types of machine learning problems
Supervised
Reinforcement
25
Introduction to Machine Learning
Types of machine learning problems
Unsupervised
Reinforcement
26
Introduction to Machine Learning
Types of machine learning problems
Reinforcement
27
Introduction to Machine Learning
28
Introduction to Machine Learning
Data gathering
Might depend on humanwork
• Manual labeling for supervised learning.
• Domain knowledge. Maybe evenexperts.
May come for free, or “sortof”
• E.g., Machine Translation.
The more the better: Some algorithms need large amounts of data to be useful
(e.g., neural networks).
The quantity and quality of data dictate the model accuracy
29
Introduction to Machine Learning
Data preprocessing
Is there anything wrong with the data?
• Missing values
• Outliers
• Bad encoding (fortext)
• Wrongly-labeled examples
• Biased data
• Do we have many more samples of one class
than the rest?
It can be an art
• Requires thought and knowledge of thedata
Two steps:
• Variable transformation (e.g., dates into weekdays, normalizing)
• Feature creation (e.g., n-grams for texts, if word is capitalizedto detect
names, etc.) 32
Introduction to Machine Learning
Predict Adjust
34
Introduction to Machine Learning
Making predictions
Training Phase
Labels
Machine
Learning
Feature model
Samples extraction Features
Prediction Phase
Input Feature Features Trained Label
extraction classifier
35
Introduction to Machine Learning
Summary
• Machine Learning is intelligent use of data to answer questions
• Enabled by an exponential increase in computing power and data
availability
• Three big types of problems: supervised, unsupervised,reinforcement
• 5 stepsto every machine learning solution:
1. Data Gathering
2. Data Preprocessing
3. Feature Engineering
4. Algorithm Selection &Training
5. Making Predictions 36