Machine Learning Introduction
---------------------------------------------------------------------------
Human or Machine?
The Chinese Room Argument
Information Processing?
Topics in AI
What have we already achieved in AI?
• Board games – Chess, Checkers, Go, etc.
• Solving puzzles – Sudoku, etc.
• Route finding in a map
• Image/speech enhancement
  – Creating high-resolution images, noise suppression, …
Machine Learning?
• ML is a subset of AI.
• We focus on the mathematical/algorithmic aspects of learning which can be programmed on a machine.
• We look into various learning paradigms.
• Prerequisites of the course are
– Probability theory
– Linear algebra
– Calculus
Evaluation/Examinations
[Figure: a photo of a dog labeled "DOG". Captions: "This picture as it is may not be in the training set" and "Child has done more than just remembering".]
What is learning (pattern recognition)?
• The child has learnt what it is that is common among dogs … what it is that is common among cats … and also the distinguishing features/attributes.
• The child has learnt the pattern (regularity) behind all dogs and the pattern behind all cats.
• The child then recognizes a test image as having the particular pattern that is unique to dogs.
Basic concepts
Object → Feature vector x = (x1, x2, …, xd)
- A vector of observations (measurements).
- x is a point in the feature space X.
Task
- To design a classifier (decision rule) f : X → Y which decides the class label based on x.
An example
Object: a person (the set of objects is a set of persons).
Feature vector: x = (h, w), a vector of observations (height, weight).
Task:
- To design a classifier (decision rule) f : X → Y.
- Given the height and weight of a person, classify him/her.
Feature extraction
Task: to extract features which are good for classification.
Good features:
• Objects from the same class have similar feature values.
• Objects from different classes have different feature values.
[Figure: weight (w) vs. height (h) scatter plot; the "normal persons" and "overweight persons" clusters are separated by the line f(x1, x2) = w1x1 + w2x2 + b = 0.]
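To make such a linear decision rule concrete, here is a minimal Python sketch; the weights and bias are made-up illustrative values for the height/weight example, not parameters from the lecture:

import numpy as np

# Hypothetical linear decision rule for the height/weight example.
# w and b are made-up illustrative values, not learned parameters.
w = np.array([-0.55, 1.0])   # weights for (height in cm, weight in kg)
b = 30.0                     # bias term

def f(x):
    # Decision rule f : X -> Y: the label depends on which side of the
    # hyperplane w.x + b = 0 the feature vector x falls.
    return "overweight" if np.dot(w, x) + b > 0 else "normal"

print(f(np.array([170.0, 95.0])))   # -> overweight
print(f(np.array([170.0, 60.0])))   # -> normal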
Perceptron
• Perceptron is the name given to the linear classifier.
• If there exists a Perceptron that correctly classifies all training examples, then we say that the training set is linearly separable.
• In the late 1950s, Rosenblatt gave an algorithm for Perceptron learning on linearly separable data (a minimal sketch follows below).
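Below is a minimal sketch of Rosenblatt's Perceptron learning rule; the toy data and the epoch limit are assumptions for illustration, not from the slides:

import numpy as np

def perceptron_train(X, y, epochs=100):
    # Rosenblatt's rule: whenever an example is misclassified,
    # nudge the weights toward it. If the data are linearly
    # separable, this converges to a separating hyperplane.
    # X: (n, d) feature matrix; y: labels in {-1, +1}.
    w, b = np.zeros(X.shape[1]), 0.0
    for _ in range(epochs):
        mistakes = 0
        for xi, yi in zip(X, y):
            if yi * (np.dot(w, xi) + b) <= 0:   # misclassified (or on the boundary)
                w += yi * xi
                b += yi
                mistakes += 1
        if mistakes == 0:    # every training example classified correctly
            break
    return w, b

# Toy linearly separable data (made up for illustration)
X = np.array([[2.0, 1.0], [1.0, 2.0], [-1.0, -1.5], [-2.0, -0.5]])
y = np.array([1, 1, -1, -1])
w, b = perceptron_train(X, y)
print(np.sign(X @ w + b))    # -> [ 1.  1. -1. -1.]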
Perceptron
• For linearly separable data, many classifiers are possible.
• All of them do equally well on the training set; which one will do well on the unseen test set?
[Figure: two linearly separable classes (Class 1, Class 2) with several candidate separating lines.]
Maximizing the Margin: SVM
• IDEA: select the separating hyperplane that maximizes the margin!
[Figure: two classes plotted against Var1 and Var2; the margin width around the separating hyperplane is highlighted.]
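As a hedged illustration of the max-margin idea, here is a sketch using scikit-learn; the library choice and the toy data are assumptions, since the slides do not prescribe an implementation:

import numpy as np
from sklearn.svm import SVC

# Toy 2-D data (made up): two linearly separable classes
X = np.array([[2.0, 2.0], [3.0, 3.0], [-2.0, -2.0], [-3.0, -1.0]])
y = np.array([1, 1, -1, -1])

# A linear SVM with a very large C approximates the hard-margin SVM:
# among all separating hyperplanes it selects the one with the widest margin.
clf = SVC(kernel="linear", C=1e6).fit(X, y)

w, b = clf.coef_[0], clf.intercept_[0]
print("hyperplane w, b:", w, b)
print("margin width:", 2.0 / np.linalg.norm(w))   # geometric margin width = 2/||w||
print("support vectors:", clf.support_vectors_)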
Artificial Neural Networks
Generative Models
• Bayes
– Naïve Bayes (a minimal sketch follows below)
• Graphical models
– Belief networks
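As a hedged sketch of the generative idea behind Naïve Bayes (model P(x|y) and P(y), then classify with Bayes' rule); the data and the use of scikit-learn's GaussianNB are assumptions for illustration:

import numpy as np
from sklearn.naive_bayes import GaussianNB

# Made-up (height, weight) data for two classes
X = np.array([[160, 55], [170, 65], [175, 95], [165, 90]], dtype=float)
y = np.array([0, 0, 1, 1])          # 0 = normal, 1 = overweight

# GaussianNB fits a per-class Gaussian to each feature (the "naive"
# conditional-independence assumption) and predicts via Bayes' rule:
#   argmax_y  P(y) * prod_j P(x_j | y)
model = GaussianNB().fit(X, y)
print(model.predict([[168.0, 92.0]]))         # predicted class label
print(model.predict_proba([[168.0, 92.0]]))   # posterior P(y | x)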
Remember…
• No classifier is inherently better than any other: you need to make assumptions to generalize.
Slide credit: D. Hoiem
Generalization
Eigenfaces: the idea
• Think of a face as being a weighted combination of some "component" or "basis" faces.
• We then find (learn) a set of basis faces which best represent the differences between them.
• That is, apply PCA and choose the top eigenvectors (the eigenfaces).
• We can then store each face as a set of weights for those basis faces (a minimal sketch follows below).
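Here is a minimal numpy sketch of that PCA step; the random data standing in for face images, the image size, and the choice of k are all assumptions for illustration:

import numpy as np

# Stand-in for a face dataset: n flattened images of d pixels each
# (random values here purely for illustration).
rng = np.random.default_rng(0)
n, d, k = 100, 32 * 32, 10           # 100 "faces", 32x32 pixels, top-10 eigenfaces
faces = rng.random((n, d))

mean_face = faces.mean(axis=0)
centered = faces - mean_face

# SVD of the centered data: the rows of Vt are the principal directions
# (the eigenfaces). Keep the top k of them.
U, S, Vt = np.linalg.svd(centered, full_matrices=False)
eigenfaces = Vt[:k]                  # (k, d) basis faces

# Each face is stored as k weights with respect to this basis ...
weights = centered @ eigenfaces.T    # (n, k)
# ... and approximately reconstructed as a weighted sum of eigenfaces.
reconstruction = mean_face + weights @ eigenfaces
print(weights.shape, reconstruction.shape)    # (100, 10) (100, 1024)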
Eigenfaces: representing faces
• These basis faces can be differently weighted to represent any face.
[Figure: a face expressed as a weighted sum of basis faces, with example weights -8029, -1183, 2900, -2088, 1751, -4336, 1445, -669, 4238, -4221, 6193, 10549.]
SOME CHALLENGES …
• King – man + woman = Queen (a sketch of this vector arithmetic follows below)
• Face – emotion + surprise = Surprised face
• Interpretable models.
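A hedged sketch of the "King – man + woman" vector arithmetic; gensim and this particular pretrained model are assumptions, since the slides name no library:

# Requires: pip install gensim (downloads a large pretrained model on first use)
import gensim.downloader as api

vectors = api.load("word2vec-google-news-300")   # pretrained word2vec vectors

# king - man + woman is computed as vector arithmetic over the embeddings;
# with this model the nearest word is typically "queen".
result = vectors.most_similar(positive=["king", "woman"], negative=["man"], topn=1)
print(result)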
THANK YOU