SlideShare a Scribd company logo
Very, Very Basic
Introduction to
Machine Learning
Classification
Josh Borts
Problem
Identify which of
a set of
categories a new
observation
belongs
Classification is
Supervised Learning
(we tell the system the classifications)
Clustering is
Unsupervised Learning
(the data determines the groupings (which we then
name))
Examples
Handwriting Recognition / OCR
Spam Filters
Blood Type Identification
Automatic Document Classification
Face Recognition
SHAZAM!!
Other
Examples
Credit Scoring
Text Sentiment
Extraction
Cohort
Assignment
Gesture
Recognition
Observations
an Observation can be described by
a fixed set of quantifiable properties
called Explanatory Variables or
Features
For example, a Doctor visits could result in the following Features:
• Weight
• Male/Female
• Age
• White Cell Count
• Mental State (bad, neutral, good,
great)
• Blood Pressure
• etc
Text Documents will have a set of Features that defines
the number of occurrences of each Word or n-gram in
the corpus of documents
Classifier
a Machine Learning Algorithm or
Mathematical Function that maps
input data to a category is known as
a Classifier
Examples:
• Linear Classifiers
• Quadratic Classifiers
• Support Vector Machines
• K-Nearest Neighbours
• Neural Networks
• Decision Trees
Most algorithms are best applied to Binary
Classification.
If you want to have multiple classes (tags) then use
multiple Binary Classifiers instead
Training
A Classifier has a set of variables that
need to set (trained). Different
classifiers have different algorithms to
optimize this process
Overfitting
Danger!!
The model fits only the data in was trained on.
New data is completely foreign
Introduction to Machine Learning Classifiers
Among competing
hypotheses, the one
with the fewest
assumptions should
be selected
Split the data into In-Sample (training) and
Out-Of-Sample (test)
How do we
Evaluate
Classifier
Performance?
Of course there are many ways we can
define Best Performance…
Accuracy
Sensitivity
Specifity
F1 Score
Likelihood
Cumulative Gain
Mean Reciprocal Rank
Average Precision
Algorithms
k-Nearest
Neighbor
Cousin of k-Means Clustering
Algorithm:
1) In feature space, find the k closest neighbors (often using
Euclidean distance (straight line geometry))
2) Assign the majority class from those neighbors
Decision
Tress
Can generate multiple decision
trees to improve accuracy
(Random Forest)
Can be learned by consecutively
splitting the data on an attribute pair
using Recursive Partitioning
New York & San
Fran housing by
Elevation and
Price
Introduction to Machine Learning Classifiers
Introduction to Machine Learning Classifiers
Linear
Classifier
Linear Combination of the Feature Vector and a Weight
Vector.
Can think of it as splitting a high-dimensional input space
with a hyperplane
Often the fastest classifier, especially when feature
space is sparse or large number of dimensions
Determining
the Weight
Vector
Can either use Generative or
Discriminative models to determine
the Weight Vector
Generative models attempt to model the conditional
probability function of an Observation Vector given a
Classification.
Examples include:
• LDA (Gaussian density)
• Naive Bayes Classifier (Multinomial Bernoulli events)
Examples include:
• Logistic Regression (maximum likelihood estimation assuming training set was
generated by a binomial model)
• Support Vector Machine (attempts to maximize the margin between the
decision hyperplane and the examples in the training set)
Discriminative models attempt to maximize the quality
of the output on a training set through an optimization
algorithm.
Neural
Network
Not going to get into the details, this time….
Introduction to Machine Learning Classifiers
Functional Imperative
functionalimperative.com
(647) 405-8994
@func_i

More Related Content

PPT
Classification (ML).ppt
PDF
Naive Bayes Classifier Tutorial | Naive Bayes Classifier Example | Naive Baye...
PDF
L2. Evaluating Machine Learning Algorithms I
PDF
Understanding Bagging and Boosting
PPTX
Linear regression
PDF
Feature selection
PDF
Classification Based Machine Learning Algorithms
PDF
Understanding random forests
Classification (ML).ppt
Naive Bayes Classifier Tutorial | Naive Bayes Classifier Example | Naive Baye...
L2. Evaluating Machine Learning Algorithms I
Understanding Bagging and Boosting
Linear regression
Feature selection
Classification Based Machine Learning Algorithms
Understanding random forests

What's hot (20)

PPTX
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
PPTX
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
PDF
Logistic regression in Machine Learning
ODP
Machine Learning with Decision trees
ODP
NAIVE BAYES CLASSIFIER
PPT
Decision tree
PDF
Machine learning
PPT
2.2 decision tree
PPT
Basics of Machine Learning
PPT
Machine Learning
PPTX
Introduction to ML (Machine Learning)
PPTX
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...
PPTX
Text clustering
PPTX
Classification and Regression
PDF
Dimensionality Reduction
PPTX
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
PPTX
Supervised and unsupervised learning
PDF
Machine Learning: Applications, Process and Techniques
PDF
Machine Learning Model Evaluation Methods
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Logistic regression in Machine Learning
Machine Learning with Decision trees
NAIVE BAYES CLASSIFIER
Decision tree
Machine learning
2.2 decision tree
Basics of Machine Learning
Machine Learning
Introduction to ML (Machine Learning)
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...
Text clustering
Classification and Regression
Dimensionality Reduction
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Supervised and unsupervised learning
Machine Learning: Applications, Process and Techniques
Machine Learning Model Evaluation Methods
Ad

Similar to Introduction to Machine Learning Classifiers (20)

PPT
Machine learning by Dr. Vivek Vijay and Dr. Sandeep Yadav
PPT
Free Ebooks Download ! Edhole.com
PDF
Presentation-19.08.2024hvug7gugyvuvugugugugugug
PPTX
Lecture 09(introduction to machine learning)
PDF
Introduction to Machine Learning Lecture
PPTX
Deep learning from mashine learning AI..
PDF
Computer Vision: Algorithms and Applications Richard Szeliski
PDF
An Introduction to Supervised Machine Learning and Pattern Classification: Th...
PDF
Machine Learning Algorithms Introduction.pdf
PPT
594503964-Introduction-to-Classification-PPT-Slides-1.ppt
PPT
Machine Learning Deep Learning Machine learning
PPTX
Unit 4 Classification of data and more info on it
PPTX
How Machine Learning Helps Organizations to Work More Efficiently?
PDF
MLHEP 2015: Introductory Lecture #1
PPTX
classification in Data Analysis Data Analysis.pptx
PDF
Machine Learning Classifiers
PPTX
Supervised learning
PDF
introducatio to ml introducatio to ml introducatio to ml
PPT
Machine Learning workshop by GDSC Amity University Chhattisgarh
PPT
[ppt]
Machine learning by Dr. Vivek Vijay and Dr. Sandeep Yadav
Free Ebooks Download ! Edhole.com
Presentation-19.08.2024hvug7gugyvuvugugugugugug
Lecture 09(introduction to machine learning)
Introduction to Machine Learning Lecture
Deep learning from mashine learning AI..
Computer Vision: Algorithms and Applications Richard Szeliski
An Introduction to Supervised Machine Learning and Pattern Classification: Th...
Machine Learning Algorithms Introduction.pdf
594503964-Introduction-to-Classification-PPT-Slides-1.ppt
Machine Learning Deep Learning Machine learning
Unit 4 Classification of data and more info on it
How Machine Learning Helps Organizations to Work More Efficiently?
MLHEP 2015: Introductory Lecture #1
classification in Data Analysis Data Analysis.pptx
Machine Learning Classifiers
Supervised learning
introducatio to ml introducatio to ml introducatio to ml
Machine Learning workshop by GDSC Amity University Chhattisgarh
[ppt]
Ad

Recently uploaded (20)

PDF
Modernizing your data center with Dell and AMD
PPTX
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Smarter Business Operations Powered by IoT Remote Monitoring
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Comunidade Salesforce São Paulo - Desmistificando o Omnistudio (Vlocity)
PDF
REPORT: Heating appliances market in Poland 2024
PDF
HCSP-Presales-Campus Network Planning and Design V1.0 Training Material-Witho...
PDF
madgavkar20181017ppt McKinsey Presentation.pdf
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
SAP855240_ALP - Defining the Global Template PUBLIC.pdf
PDF
Sensors and Actuators in IoT Systems using pdf
PDF
Reimagining Insurance: Connected Data for Confident Decisions.pdf
PDF
Advanced Soft Computing BINUS July 2025.pdf
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Chapter 2 Digital Image Fundamentals.pdf
Modernizing your data center with Dell and AMD
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
NewMind AI Monthly Chronicles - July 2025
Chapter 3 Spatial Domain Image Processing.pdf
Smarter Business Operations Powered by IoT Remote Monitoring
MYSQL Presentation for SQL database connectivity
Comunidade Salesforce São Paulo - Desmistificando o Omnistudio (Vlocity)
REPORT: Heating appliances market in Poland 2024
HCSP-Presales-Campus Network Planning and Design V1.0 Training Material-Witho...
madgavkar20181017ppt McKinsey Presentation.pdf
Understanding_Digital_Forensics_Presentation.pptx
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
20250228 LYD VKU AI Blended-Learning.pptx
SAP855240_ALP - Defining the Global Template PUBLIC.pdf
Sensors and Actuators in IoT Systems using pdf
Reimagining Insurance: Connected Data for Confident Decisions.pdf
Advanced Soft Computing BINUS July 2025.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Chapter 2 Digital Image Fundamentals.pdf

Introduction to Machine Learning Classifiers