ML Introduction
ML Introduction
Agenda
Arthur Samuel
Machine Learning
ARTIFICIAL INTELLIGENCE MACHINE LEARNING
AI stands for Artificial intelligence, where
ML stands for Machine Learning which is defined
ARTIFICIAL
intelligence INTELLIGENCE
is defined as ability to acquire VSasMACHINE LEARNING
the acquisition of knowledge or skill
and apply knowledge.
The aim is to increase chance of success and The aim is to increase accuracy, but it does not
not accuracy. care about success
It work as a computer program that does It is a simple concept machine takes data and learn
smart work from data.
The goal is to learn from data on certain task to
The goal is to simulate natural intelligence to
maximize the performance of machine on this
solve complex problem
task.
1
1
Machine Learning
Machine Learning Algorithms
Supervised
Learning
1
7
An example application
■ An emergency room in a hospital measures 17 variables (e.g.,
blood pressure, age, etc) of newly admitted patients.
■ A decision is needed: whether to put a new patient in an
intensive-care unit.
■ Due to the high cost of ICU, those patients who may survive
less than a month are given higher priority.
1
9
Another application
■ A credit card company receives thousands of applications for new
cards. Each application contains information about an applicant,
❑ age
❑ Marital status
❑ annual salary
❑ outstanding debts
❑ credit rating
❑ etc.
■ Problem: to decide whether an application should approved, or to
classify applications into two categories, approved and not
approved.
2
0
Machine learning and our focus
■ Like human learning from past experiences.
■ A computer does not have “experiences”.
■ A computer system learns from data, which represent some
“past experiences” of an application domain.
■ Our focus: learn a target function that can be used to predict
the values of a discrete class attribute, e.g., approve or not-
approved, and high-risk or low risk or a value.
■ The task is commonly called: Supervised learning,
classification, or inductive learning.
2
1
The data and the goal
■ Data: A set of data records (also called examples,
instances or cases) described by
❑ k attributes: A1, A2, … Ak.
❑ a class: Each example is labelled with a pre-defined
class.
■ Goal: To learn a classification or regression model
from the data that can be used to predict the
unseen data.
2
2
Two Major Processes of SL
■ Classification is about
predicting a label
and Regression is about
predicting a quantity.
■ Classification is the problem
of predicting a discrete class
label output.
■ Regression is the problem of
predicting a continuous
quantity output.
2
3
An example: data (loan application)
Approved or not
2
5
An example: the learning task
■ Learn a classification model from the data
❑ No (not approved)
2
6
Supervised learning process: two
steps
■ Learning (training): Learn a model using the training data
■ Testing: Test the model using unseen test data to assess the model
accuracy
2
7
What do we mean by learning?
■ Given
❑ a data set D,
❑ a task T, and
❑ a performance measure M,
a computer system is said to learn from D to perform the task T if
after learning the system’s performance on T improves as
measured by M. (Learn without being explicitly programmed)
2
8
An example
■ Data: Loan application data
■ Task: Predict whether a loan should be approved or not.
■ Performance measure: accuracy.
2
9
Lifecycle
3
0
Lifecycle
3
1
Fundamental assumption of
learning
Assumption: The distribution of training examples is identical to
the distribution of test examples (including future unseen
examples).
3
2
Unsupervised
Learning
3
4
3
5
Unsupervised learning
■ Unsupervised learning is a type of machine learning algorithm
used to draw inferences from datasets consisting of input data
without labeled responses.
■ The most common unsupervised learning method is cluster
analysis, which is used for exploratory data analysis to find
hidden patterns or grouping in data.
3
6
Clustering
■ Clustering is a technique for finding similarity groups in data,
called clusters.
3
7
Clustering
An Example
■ The data set has three natural groups of data points, i.e., 3 natural
clusters.
3
9
What is clustering for?
■ Let us see some real-life examples
■ Example 1: groups people of similar sizes together to make
“small”, “medium” and “large” T-Shirts.
❑ Tailor-made for each person: too expensive
❑ One-size-fits-all: does not fit all.
4
0
Association rule mining
4
1
Principle Component Analysis
Director AI Research , Facebook
4
4
Supervised vs. Unsupervised
Learning
■ Supervised learning: classification is seen as supervised
learning from examples.
❑ Supervision: The data (observations, measurements, etc.) are
labeled with pre-defined classes. It is like that a “teacher” gives
the classes (supervision).
❑
Test data are classified into these classes too.
■ Unsupervised learning (clustering)
❑ Class labels of the data are unknown
❑ Given a set of data, the task is to establish the existence of
classes or clusters in the data
4
5
Reinforcement
learning
4
6
Reinforcement
learning
4
7
Reinforcement learning
Reinforcement learning
“Throw a robot into a maze and let it find an exit“
4
9
Supervised Learning Vs
Reinforcement
Supervised Learning
Learning
Step: 1
Teacher: Does picture 1 show a car or a flower?
Learner: A flower.
Teacher: No, it’s a car.
Step: 2
Teacher: Does picture 2 show a car or a flower?
Learner: A car.
Teacher: Yes, it’s a car.
Step: 3 ....
5
0
Supervised Learning Vs
Reinforcement Learning
Reinforcement Learning
Step: 1
World: You are in state 9. Choose action A or C.
Learner: Action A.
World: Your reward is 100.
Step: 2
World: You are in state 32. Choose action B or E.
Learner: Action B.
World: Your reward is 50.
Step: 3 ....
5
1
5
2
Applications of
ML Algorithms
Thanks