Classifiers (Support Vector Machines, Decision Trees, Nearest Neighbor Classification)

This document discusses supervised machine learning classifiers. It defines learning, supervised vs. unsupervised learning, and the classification process. The key stages are training a model on labeled data and then testing it on unseen test data to assess its accuracy. Several common classification techniques are mentioned, including decision trees, rule-based methods, nearest neighbors, neural networks, naive Bayes, and support vector machines. Feature selection and data preprocessing are also addressed.

Uploaded by

Sakhawat Ali


Pattern Recognition (60014703-3)

Lecture 3

Classifiers (Support Vector Machines, Decision Trees, Nearest Neighbor Classification)

Instructor: Amany Al Luhaybi

Source: Bing Liu, UIC

What is Learning?
• Herbert Simon: “Learning is any process by which a system improves performance from experience.”
• Tom Mitchell: “A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E.”

Learning

• Learning is essential for unknown environments.
• Learning is useful as a system construction method, i.e., expose the agent to reality rather than trying to write it down.
• Learning modifies the agent's decision mechanisms to improve performance.

Supervised learning
• Supervised learning resembles human learning from past experiences.
• A computer does not have “experiences”; it learns from data, which represent some “past experiences” of an application domain.
• Our focus: learn a target function that can be used to predict the value of a discrete class attribute.
• The task is commonly called supervised learning, classification, or inductive learning.

The data and the goal
• Data: a set of data records (also called examples, instances, or cases), each described by
   - k attributes: A1, A2, …, Ak;
   - a class: each example is labelled with a pre-defined class.
• Goal: to learn a classification model from the data that can be used to predict the classes of new (future, or test) cases/instances.

An example: data (loan application)
[Table: loan application records; class attribute: approved or not]

An example: the learning task
• Learn a classification model from the data.
• Use the model to classify future loan applications into
   - Yes (approved) and
   - No (not approved).
• What is the class for the following case/instance?

Supervised vs. unsupervised learning
• Supervised learning: classification is seen as supervised learning from examples.
   - Supervision: the data (observations, measurements, etc.) are labeled with pre-defined classes, as if a “teacher” gives the classes (supervision).
   - Test data are classified into these classes too.
• Unsupervised learning (clustering)
   - Class labels of the data are unknown.
   - Given a set of data, the task is to establish the existence of classes or clusters in the data.
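The contrast can be sketched in a few lines of Python. This is a minimal illustration, assuming one-dimensional toy data invented for the purpose (not taken from the lecture). The supervised half uses a nearest-centroid rule and the unsupervised half a two-cluster k-means; both are simple stand-ins chosen for brevity, not methods named in the slides.

```python
from statistics import mean

# Hypothetical toy data (assumed for illustration): one numeric attribute.
values = [1.0, 1.5, 2.0, 8.0, 8.5, 9.0]
labels = ["low", "low", "low", "high", "high", "high"]  # used only in the supervised case


def fit_centroids(xs, ys):
    """Supervised: use the given labels to compute one centroid per class."""
    return {c: mean(x for x, y in zip(xs, ys) if y == c) for c in set(ys)}


def predict(centroids, x):
    """Assign x to the pre-defined class whose centroid is nearest."""
    return min(centroids, key=lambda c: abs(x - centroids[c]))


def two_means(xs, iterations=10):
    """Unsupervised: 2-means clustering; no labels are used."""
    centers = [min(xs), max(xs)]  # simple deterministic initialisation
    clusters = [[], []]
    for _ in range(iterations):
        clusters = [[], []]
        for x in xs:
            nearest = 0 if abs(x - centers[0]) <= abs(x - centers[1]) else 1
            clusters[nearest].append(x)
        # Keep the old centre if a cluster happens to be empty.
        centers = [mean(c) if c else centers[i] for i, c in enumerate(clusters)]
    return clusters


centroids = fit_centroids(values, labels)
print(predict(centroids, 2.5))  # a new case is assigned to one of the given classes
print(two_means(values))        # groups are discovered without any labels
```

In the supervised half the class names come from the "teacher"; in the unsupervised half the algorithm only returns groups, and naming them is left to the analyst.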

Supervised learning process: two steps
• Learning (training): learn a model using the training data.
• Testing: test the model using unseen test data to assess the model's accuracy.
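The two steps can be sketched as follows. This is a minimal example, assuming hypothetical two-attribute toy data and a 1-nearest-neighbour classifier. The classifier is chosen only because it is short; the function names `train`, `classify`, and `evaluate` are illustrative, not from the lecture.

```python
# Hypothetical toy data (assumed for illustration): (feature vector, class label).
train_data = [((1.0, 1.0), "A"), ((1.2, 0.8), "A"), ((4.0, 4.2), "B"), ((3.8, 4.0), "B")]
test_data = [((0.9, 1.1), "A"), ((4.1, 3.9), "B"), ((2.0, 2.0), "B")]


def train(examples):
    """Step 1 (learning): for 1-nearest-neighbour the 'model' is the stored examples."""
    return list(examples)


def classify(model, x):
    """Predict the label of the stored example closest to x (squared Euclidean distance)."""
    def dist2(example):
        features, _ = example
        return sum((a - b) ** 2 for a, b in zip(features, x))
    return min(model, key=dist2)[1]


def evaluate(model, examples):
    """Step 2 (testing): fraction of unseen examples classified correctly."""
    correct = sum(classify(model, x) == y for x, y in examples)
    return correct / len(examples)


model = train(train_data)
accuracy = evaluate(model, test_data)
print(f"accuracy on unseen test data: {accuracy:.2f}")
```

The test examples carry labels too; the labels are hidden from the model during prediction and used only to score it.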

Fundamental assumption of learning
Assumption: the distribution of training examples is identical to the distribution of test examples (including future unseen examples).
• In practice, this assumption is often violated to a certain degree.
• Strong violations will clearly result in poor classification accuracy.
• To achieve good accuracy on the test data, training examples must be sufficiently representative of the test data.

Classification: Definition
• In classification, we predict labels y (classes) for inputs x.
• Given a collection of records (the training set):
   - Each record contains a set of attributes; one of the attributes is the class.
• Find a model for the class attribute as a function of the values of the other attributes.
• Goal: previously unseen records should be assigned a class as accurately as possible.
   - A test set is used to determine the accuracy of the model. Usually, the given data set is divided into training and test sets, with the training set used to build the model and the test set used to validate it.
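Dividing a given data set and validating on the held-out part can be sketched as below. This is a minimal example under stated assumptions: the records are synthetic, the 70/30 split and the seed are arbitrary choices, and the "model" is a single learned threshold on one attribute. `holdout_split` and `learn_threshold` are hypothetical helper names.

```python
import random

# Hypothetical records (assumed for illustration): (attribute value, class).
records = [(x, "pos" if x > 5 else "neg") for x in range(1, 11)]


def holdout_split(data, test_fraction=0.3, seed=0):
    """Shuffle a copy of the data and divide it into training and test sets."""
    shuffled = list(data)
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def learn_threshold(train):
    """Pick the threshold t with the fewest training errors for the rule 'pos if x > t'."""
    xs = sorted(x for x, _ in train)
    candidates = [(a + b) / 2 for a, b in zip(xs, xs[1:])]

    def errors(t):
        return sum(("pos" if x > t else "neg") != y for x, y in train)

    return min(candidates, key=errors)


def accuracy(t, data):
    """Fraction of records whose class is predicted correctly."""
    return sum(("pos" if x > t else "neg") == y for x, y in data) / len(data)


train_set, test_set = holdout_split(records)
threshold = learn_threshold(train_set)
print("training accuracy:", accuracy(threshold, train_set))
print("test accuracy:", accuracy(threshold, test_set))
```

Only the test-set figure estimates how the model behaves on previously unseen records; the training-set figure is shown for comparison.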

Illustrating Classification Task

Training Set
Tid  Attrib1  Attrib2  Attrib3  Class
1    Yes      Large    125K     No
2    No       Medium   100K     No
3    No       Small    70K      No
4    Yes      Medium   120K     No
5    No       Large    95K      Yes
6    No       Medium   60K      No
7    Yes      Large    220K     No
8    No       Small    85K      Yes
9    No       Medium   75K      No
10   No       Small    90K      Yes

Test Set
Tid  Attrib1  Attrib2  Attrib3  Class
11   No       Small    55K      ?
12   Yes      Medium   80K      ?
13   Yes      Large    110K     ?
14   No       Small    95K      ?
15   No       Large    67K      ?

Flow: Training Set → Learning algorithm → Learn Model (induction) → Model; Test Set → Apply Model (deduction).
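The induction and deduction steps can be tried on the table's own records. The slide does not say which learning algorithm is used, so this sketch assumes a very simple one: a one-attribute rule learner (in the spirit of OneR) restricted to the two categorical attributes. Attrib3 is ignored, and the helper names are hypothetical.

```python
from collections import Counter, defaultdict

# Training and test sets copied from the table (Attrib3 in thousands).
training_set = [
    # (Attrib1, Attrib2, Attrib3, Class)
    ("Yes", "Large", 125, "No"),
    ("No", "Medium", 100, "No"),
    ("No", "Small", 70, "No"),
    ("Yes", "Medium", 120, "No"),
    ("No", "Large", 95, "Yes"),
    ("No", "Medium", 60, "No"),
    ("Yes", "Large", 220, "No"),
    ("No", "Small", 85, "Yes"),
    ("No", "Medium", 75, "No"),
    ("No", "Small", 90, "Yes"),
]
test_set = [
    ("No", "Small", 55),
    ("Yes", "Medium", 80),
    ("Yes", "Large", 110),
    ("No", "Small", 95),
    ("No", "Large", 67),
]
CATEGORICAL = {"Attrib1": 0, "Attrib2": 1}  # assumption: the numeric Attrib3 is ignored


def learn_one_rule(rows):
    """Induction: pick the attribute whose value -> majority-class rule makes the fewest training errors."""
    best = None
    for name, idx in CATEGORICAL.items():
        counts = defaultdict(Counter)
        for row in rows:
            counts[row[idx]][row[-1]] += 1
        rule = {value: c.most_common(1)[0][0] for value, c in counts.items()}
        errors = sum(rule[row[idx]] != row[-1] for row in rows)
        if best is None or errors < best[0]:
            best = (errors, name, rule)
    return best


def apply_model(model, row):
    """Deduction: look up the predicted class for an unlabeled record."""
    _, name, rule = model
    return rule[row[CATEGORICAL[name]]]


model = learn_one_rule(training_set)
predictions = [apply_model(model, row) for row in test_set]
print(model)
print(predictions)
```

The predictions are only what this simple rule outputs for records 11 to 15; the slide leaves their classes as "?", and a learner that also used Attrib3 could answer differently.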

Examples of Classification Task
• Predicting tumor cells as benign or malignant
• Classifying credit card transactions as legitimate or fraudulent
• Categorizing news stories as finance, weather, entertainment, sports, etc.

Issues: Data Preparation
• Data cleaning
   - Preprocess data in order to reduce noise and handle missing values.
• Relevance analysis (feature selection)
   - Remove irrelevant or redundant attributes.
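Both preparation steps can be sketched on a small column-oriented data set. Everything here is an assumption made for illustration: the data are invented, missing values are filled with the column mean (one common choice among several), and "irrelevant or redundant" is reduced to its crudest form, constant columns and exact duplicate columns.

```python
from statistics import mean

# Hypothetical data set (assumed for illustration); None marks a missing value.
columns = {
    "income": [50.0, None, 70.0, 60.0],
    "income_copy": [50.0, None, 70.0, 60.0],  # redundant: duplicates "income"
    "country": ["X", "X", "X", "X"],          # irrelevant: constant for all records
    "age": [30.0, 40.0, None, 50.0],
}


def impute_mean(values):
    """Data cleaning: replace missing numeric values with the mean of the observed ones."""
    observed = [v for v in values if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in values]


def select_features(cols):
    """Relevance analysis: drop constant columns and exact duplicates of earlier columns."""
    kept = {}
    for name, values in cols.items():
        if len(set(values)) <= 1:
            continue  # a constant attribute cannot help separate classes
        if any(values == other for other in kept.values()):
            continue  # redundant copy of an attribute already kept
        kept[name] = values
    return kept


# Impute only the numeric columns; leave categorical ones untouched.
cleaned = {
    name: impute_mean(vals) if any(isinstance(v, float) for v in vals) else vals
    for name, vals in columns.items()
}
selected = select_features(cleaned)
print(list(selected))
print(selected["income"])
```

Real feature selection usually scores attributes against the class (for example by correlation or information gain); the exact-match checks above only catch the most obvious cases.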

Resources: Datasets
• UCI Repository: http://www.ics.uci.edu/~mlearn/MLRepository.html
• UCI KDD Archive: http://kdd.ics.uci.edu/summary.data.application.html
• Statlib: http://lib.stat.cmu.edu/
• Delve: http://www.cs.utoronto.ca/~delve/

Classification Techniques
• Decision Tree based Methods
• Rule-based Methods
• Memory based reasoning
• Neural Networks
• Naïve Bayes and Bayesian Belief Networks
• Support Vector Machines
