Chapter 4: Machine Learning
Machine Learning
Can Machines Learn?
Learning ~ to improve automatically with experience.
We do not yet know how to make computers learn nearly
as well as people learn ~ machines and humans are two
different things.
“concept” ~ humans learn from their experience (trial and
error, or being guided – like an infant/student).
Example: a baby attempts to walk after falling down several
times. Pain and a sense of balance are the best guidance.
Problem ~ how can a machine learn these? Do we need
to attach sensory devices to detect pain? ~ how do we
represent pain? ~ as an “electronic pulse” of pain?
Machine Learning
Machine learning ~ draws on concepts from statistics,
artificial intelligence, philosophy, information theory,
biology, cognitive science, computational complexity and
control theory (and many more!).
How does a machine learn? A computer is said to learn from
experience E with respect to a class of tasks T and a
performance measure P, if its performance at tasks in T,
as measured by P, improves with experience E.
Need a well-defined learning problem, based on
three features: the class of tasks, the measure of
performance and the source of experience.
Well-Posed Learning Problems
A checkers learning problem:
Task T: playing checkers
Performance measure P: percent of games won
against opponents.
Training experience E: playing practice games against
itself.
A handwriting recognition problem:
Task T: recognizing and classifying handwritten words
within images.
Performance measure P: percent of words correctly
classified.
Training experience E: database of handwritten words.
General Machine Learning Model
[Figure: general machine learning model – training experience is
stored in a database, a machine learning algorithm builds a model,
the model is evaluated on new problems, and the learning
parameters are adjusted based on the performance results.]
Designing a Learning System
• Four basic steps to design a learning system:
  1. Choosing the training experience.
  2. Choosing the target function.
  3. Choosing a representation for the target function.
  4. Choosing a learning algorithm to approximate it.
1) Decision Tree Learning
[Figure: example decision tree on the attribute Height – internal
nodes test Height against thresholds such as < 1.3m, < 1.5m, > 1.8m
and > 2.0m; each leaf node assigns a class: Short, Medium or Tall.]
Example: Decision Tree Learning
[Figure: decision tree – the root node tests marital status
(SINGLE, MARRIED, DIVORCED); one branch is the leaf NO, the other
tests TAX (< 80 → NO, > 80 → YES).]
Why Decision Trees?
•Advantages:
•Easy to use and efficient.
•Tree structures are easy to interpret and understand.
•Direct representation of the learned rules.
•Disadvantages:
•Do not easily handle continuous data.
•Difficult to handle missing data.
•Correlations between attributes are ignored by the
decision tree process.
•Subtrees might be replicated.
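As a concrete illustration, here is a minimal sketch of decision
tree learning in Python using scikit-learn (the library choice and
the toy height data are assumptions for illustration, not part of
the slides):

from sklearn.tree import DecisionTreeClassifier, export_text

# Toy training set: height in metres -> class label
heights = [[1.2], [1.4], [1.6], [1.75], [1.9], [2.1]]
labels = ["Short", "Short", "Medium", "Medium", "Tall", "Tall"]

tree = DecisionTreeClassifier(max_depth=2)   # keep the tree small and readable
tree.fit(heights, labels)

print(export_text(tree, feature_names=["height"]))   # the learned split rules
print(tree.predict([[1.55]]))                        # classify a new instance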
2) Instance Based Learning
•Instance based learning ~ a straightforward approach to
approximating the target function.
•Basic idea: when a new query instance is encountered, a set of
similar related instances is retrieved from memory and used
to classify the new instance.
•Sometimes referred to as a “lazy learner” ~ the learning process
takes place only when a new instance must be classified.
•Main concept ~ find the nearest existing example that might be
similar to the new one!
•Common methods ~ K-Nearest Neighbor and Case Based
Reasoning.
K-Nearest Neighbor
•Named a “lazy learner” ~ the method requires comparison
with the training set, primarily based on the “nearest” distance.
Manhattan distance:   $M_d = \sum_{i=1}^{n} |x_i - y_i|$

Minkowski distance:   $Mink_d = \left( \sum_{i=1}^{n} |x_i - y_i|^q \right)^{1/q}$
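A minimal sketch of these two distance measures in plain Python
(the function names are illustrative):

def manhattan(x, y):
    # M_d = sum_i |x_i - y_i|
    return sum(abs(a - b) for a, b in zip(x, y))

def minkowski(x, y, q=2):
    # Mink_d = (sum_i |x_i - y_i|^q)^(1/q); q = 2 gives Euclidean
    return sum(abs(a - b) ** q for a, b in zip(x, y)) ** (1.0 / q)

print(manhattan([5, 1, 3], [3, 1, 3]))    # -> 2
print(minkowski([5, 1, 3], [3, 1, 3]))    # -> 2.0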
K-Nearest Neighbor (cont)
Instance   X1   X2   X3   CLASS
A           5    1    3   GOOD
B           3    1    3   GOOD
C           4    1    5   BAD
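Using the table above, a 1-nearest-neighbour sketch in Python
classifies a new instance by the class of its closest neighbour
(the query vector is an illustrative assumption):

# Training instances from the table: attribute vector and class
training = {
    "A": ([5, 1, 3], "GOOD"),
    "B": ([3, 1, 3], "GOOD"),
    "C": ([4, 1, 5], "BAD"),
}

def manhattan(x, y):
    return sum(abs(a - b) for a, b in zip(x, y))

query = [4, 1, 4]   # hypothetical new instance to classify
nearest = min(training.values(), key=lambda v: manhattan(query, v[0]))
print(nearest[1])   # class of the closest training instance -> BAD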
Case Based Reasoning
[Figure: the case based reasoning cycle – a retrieved solved case
is (2) REUSEd to give a suggested solution, the suggested solution
is (3) REVISEd into a confirmed solution, and the tested/repaired
case is retained in the case base.]
Case Based Reasoning (cont)
CASE F12:
Leaf color: green
Stalk color: green
Spot: yes
Spot condition: stripes
Panicle: yes
Disease: Bacterial Leaf Streak

CASE B3:
Leaf color: yellowish
Stalk color: green
Spot: no
Spot condition: no
Panicle: no
Disease: Bakanae

NEW CASE:
Leaf color: green
Stalk color: green
Spot: no
Spot condition: stripes
Panicle: yes
Disease: ?

Compare similarities (local): the new case is almost similar to
case F12 ~ possibly Bacterial Leaf Streak disease.
Case Based Reasoning (cont)
•Advantages:
•Easy to represent knowledge (as cases).
•Incremental learning (reuse, retain and
adaptation processes).
•Capable of handling missing values.
•Disadvantages:
•Exhaustive learning (more data, more memory).
•Cases must be updated regularly.
•Complex cases are sometimes hard to
represent.
3) Supervised Learning
Supervised Learning
Essential ingredient: the availability of an external
indicator (“teacher”). The teacher provides the desired or
target response for a particular training vector.
[Figure: supervised learning block diagram – the environment
provides inputs to both the teacher and the learner; the teacher’s
desired response is compared with the learner’s actual response to
form the error signal.]
Supervised Learning
Example: Multilayer Perceptron Neural Networks
•Inspired by the observation that biological learning systems
are built from very complex webs of interconnected neurons.
•Learning algorithm: error-correction learning (error
signal), where d_k(n) is the desired response and y_k(n) is the
actual response:

$e_k(n) = d_k(n) - y_k(n)$
[Figure: multilayer perceptron – input nodes connected through
weights to a hidden layer and on to the output nodes.]
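A minimal sketch of error-correction learning for a single linear
output unit in Python (the toy data and learning rate are
illustrative assumptions; a full multilayer perceptron would
propagate this error back through the hidden layer):

import random

def train(samples, lr=0.1, epochs=50):
    n = len(samples[0][0])
    w = [random.uniform(-0.5, 0.5) for _ in range(n)]
    b = 0.0
    for _ in range(epochs):
        for x, d in samples:                 # d = desired response
            y = sum(wi * xi for wi, xi in zip(w, x)) + b   # actual response
            e = d - y                        # e_k(n) = d_k(n) - y_k(n)
            w = [wi + lr * e * xi for wi, xi in zip(w, x)]
            b += lr * e
    return w, b

# Learn a noisy AND-like mapping from four training vectors
data = [([0, 0], 0), ([0, 1], 0), ([1, 0], 0), ([1, 1], 1)]
w, b = train(data)
print(w, b)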
4) Unsupervised Learning
Unsupervised Learning
Essential ingredient: no external “teacher” to
oversee the learning process (no specific examples
of the function to be learned by the network).
•A sequence of input vectors is provided, but NO target
vectors.
•Basically, similar groups of data are clustered
together (self-organized learning) – “winner takes all”
strategies ~ clustering.
•Examples: Kohonen Self Organizing Map (SOM) and
Adaptive Resonance Theory (ART).
Unsupervised Learning
Example: Kohonen Self Organizing Map
•Also known as a “topology preserving map”.
•The weight vector for a cluster unit serves as an
exemplar of the input patterns associated with that
cluster.
•Basically ~ the cluster unit whose weight vector matches
the input pattern most closely is chosen as the winner.
•Euclidean distance ~ the unit with the minimum distance is
the winner.
$E_d = \sqrt{ \sum_{i=1}^{n} (x_i - y_i)^2 }$
Kohonen SOM
[Figure: Kohonen SOM architecture – input nodes in the input layer
are fully connected through weights to the cluster units in the
output layer.]
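A minimal sketch of one SOM training step in Python: choose the
winning cluster unit by Euclidean distance, then move its weight
vector toward the input (the map size, data and learning rate are
illustrative; a full SOM also updates the winner’s neighbours):

import math

def euclidean(x, y):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

weights = [[0.2, 0.8], [0.9, 0.1]]   # one weight vector per cluster unit
x = [1.0, 0.0]                        # input pattern
lr = 0.5                              # learning rate

winner = min(range(len(weights)), key=lambda j: euclidean(x, weights[j]))
weights[winner] = [w + lr * (xi - w) for w, xi in zip(weights[winner], x)]
print(winner, weights[winner])        # unit 1 moves toward the input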
5) Reinforcement Learning
Reinforcement Learning
•Addresses the question of how an autonomous agent that
senses and acts in its environment can learn to choose
optimal actions to achieve its goals.
•Concept ~ each time the agent performs an action in its
environment, a reward or penalty is given (based on the
desirability of the resulting state).
•Task ~ the agent must learn which actions gain the most
reward (reinforcement signal) ~ a strengthened signal or
reward indicates satisfactory actions.
•Learning algorithms: Q-learning, adaptive heuristic
critic and temporal-difference methods.
Reinforcement Learning
Agent: Reinforcement Learning
[Figure: agent-environment interaction – at each step the agent
takes an action a_t, the environment returns a reward r_t and
moves to the next state, producing the sequence
s0, a0, r0, s1, a1, r1, s2, a2, r2, ...]
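A minimal sketch of the Q-learning update on a toy two-state
environment (the dynamics, learning rate alpha and discount factor
gamma are illustrative assumptions):

# Q-learning update rule:
#   Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))
import random

n_states, n_actions = 2, 2
Q = [[0.0] * n_actions for _ in range(n_states)]
alpha, gamma = 0.5, 0.9

def step(s, a):
    # Toy dynamics: action 1 in state 0 reaches state 1 with
    # reward +1; every other move returns to state 0 with reward 0.
    if s == 0 and a == 1:
        return 1, 1.0
    return 0, 0.0

s = 0
for _ in range(100):
    a = random.randrange(n_actions)      # explore randomly
    s_next, r = step(s, a)
    Q[s][a] += alpha * (r + gamma * max(Q[s_next]) - Q[s][a])
    s = s_next

print(Q)   # Q[0][1] should dominate: it is the rewarded action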