ML Lecture#1

This document provides an overview of machine learning. It introduces the instructor and acknowledges sources for the slides. The agenda covers an introduction to machine learning, supervised learning, unsupervised learning, reinforcement learning, dimensionality reduction techniques, and neural networks. Textbooks and research papers on machine learning are listed. Key concepts in machine learning like learning from experience, supervised vs unsupervised learning, and example applications are discussed. Types of machine learning tasks like regression, classification, and reinforcement learning are also introduced.

Uploaded by

muhammadhzrizwan2002


Machine Learning

Instructor: Dr. Syed Usman

Slides Courtesy: These slides were assembled by Eric Eaton, with grateful acknowledgement of the many
others who made their course materials freely available online.
Agenda of today’s Class

• Introduction to the Course

• An Activity

• Your Introduction
Contents
• Introduction to Machine Learning

• Supervised Learning

• Unsupervised Learning

• Reinforcement Learning

• Dimensionality reduction techniques

• Neural Networks
Text/Reference Books

• Pattern Recognition & Machine Learning, 1st Edition, Chris Bishop

• Machine Learning: A Probabilistic Perspective, 1st Edition, Kevin P. Murphy

• Applied Machine Learning, online Edition, David Forsyth

• Latest Research Papers

What is Machine Learning?
“Learning is any process by which a system improves
performance from experience.”
- Herbert Simon

Definition by Tom Mitchell (1998):

Machine Learning is the study of algorithms that
• improve their performance P
• at some task T
• with experience E.
A well-defined learning task is given by <P, T, E>.
Traditional Programming
Data + Program → Computer → Output
Machine Learning
Data + Output → Computer → Program
Slide credit: Pedro Domingos
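The contrast between the two diagrams can be sketched in a few lines of Python. This is only an illustration: the temperature-conversion task, the function names, and the four example pairs are assumptions chosen for the sketch, not part of the original slides. In the first half a person writes the program; in the second, the computer receives data together with the desired outputs and returns a program (here via a closed-form least-squares line fit, one possible learner among many).

```python
# Traditional programming: a human writes the rule; data goes in, output comes out.
def celsius_to_fahrenheit(celsius):
    return celsius * 9.0 / 5.0 + 32.0


# Machine learning: data and desired outputs go in, a program comes out.
def learn_linear_program(inputs, outputs):
    """Fit output = slope * input + intercept by ordinary least squares."""
    n = len(inputs)
    mean_x = sum(inputs) / n
    mean_y = sum(outputs) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(inputs, outputs))
    variance = sum((x - mean_x) ** 2 for x in inputs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x

    def learned_program(x):
        return slope * x + intercept

    return learned_program


# Hypothetical training examples (data paired with the desired output).
data = [0.0, 10.0, 25.0, 100.0]
output = [32.0, 50.0, 77.0, 212.0]

learned = learn_linear_program(data, output)
print(learned(37.0), celsius_to_fahrenheit(37.0))
```

The learned function was never told the conversion formula; it recovers it from the examples, which is the point of the second diagram.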
When Do We Use Machine Learning?
ML is used when:
• Human expertise does not exist (navigating on Mars)
• Humans can’t explain their expertise (speech recognition)
• Models must be customized (personalized medicine)
• Models are based on huge amounts of data (genomics)

Learning isn’t always useful:

• There is no need to “learn” to calculate payroll
Based on slide by E. Alpaydin
A classic example of a task that requires machine learning:
It is very hard to say what makes a 2.
Slide credit: Geoffrey Hinton
Some more examples of tasks that are best
solved by using a learning algorithm
• Recognizing patterns:
– Facial identities or facial expressions
– Handwritten or spoken words
– Medical images
• Generating patterns:
– Generating images or motion sequences
• Recognizing anomalies:
– Unusual credit card transactions
– Unusual patterns of sensor readings in a nuclear power plant
• Prediction:
– Future stock prices or currency exchange rates
Slide credit: Geoffrey Hinton
Sample Applications
• Web search
• Computational biology
• Finance
• E-commerce
• Space exploration
• Robotics
• Information extraction
• Social networks
• Debugging software
• [Your favorite area]

Slide credit: Pedro Domingos
Samuel’s Checkers-Player
“Machine Learning: Field of study that gives
computers the ability to learn without being
explicitly programmed.” -Arthur Samuel (1959)

Defining the Learning Task
Improve on task T, with respect to performance metric P, based on experience E

T: Playing checkers
P: Percentage of games won against an arbitrary opponent
E: Playing practice games against itself

T: Recognizing hand-written words
P: Percentage of words correctly classified
E: Database of human-labeled images of handwritten words

T: Driving on four-lane highways using vision sensors
P: Average distance traveled before a human-judged error
E: A sequence of images and steering commands recorded while observing a human driver.
Slide credit: Ray Mooney
State of the Art Applications of Machine Learning
Autonomous Cars

• Nevada made it legal for autonomous cars to drive on roads in June 2011
• As of 2023, 34 states have legalized autonomous cars
Penn’s Autonomous Car (Ben Franklin Racing Team)
Autonomous Car Sensors

Autonomous Car Technology
• Path Planning
• Laser Terrain Mapping
• Adaptive Vision

Images and movies taken from Sebastian Thrun’s multimedia website.

Deep Learning in the Headlines

Deep Belief Net on Face Images
Feature hierarchy, from top layer to input:
• object models
• object parts (combination of edges)
• edges
• pixels
Based on materials by Andrew Ng
Learning of Object Parts

Slide credit: Andrew Ng
Training on Multiple Objects

Trained on 4 classes (cars, faces, motorbikes, airplanes).
Second layer: Shared features and object-specific features.
Third layer: More specific features.
Slide credit: Andrew Ng
Scene Labeling via Deep Learning

[Farabet et al. ICML 2012, PAMI 2013]

Inference from Deep Learned Models
Generating posterior samples from faces by “filling in” experiments
(cf. Lee and Mumford, 2003). Combine bottom-up and top-down inference.

Figure rows:
• Input images
• Samples from feedforward inference (control)
• Samples from full posterior inference
Slide credit: Andrew Ng
Machine Learning in Automatic Speech Recognition
A Typical Speech Recognition System

ML is used to predict phone states from the sound spectrogram

Deep learning has state-of-the-art results

# Hidden Layers       1     2     4     8     10    12
Word Error Rate %     16.0  12.8  11.4  10.9  11.0  11.1

Baseline GMM performance = 15.4%

[Zeiler et al. “On rectified linear units for speech recognition” ICASSP 2013]
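To read the word error rates above against the GMM baseline, a short calculation helps. The sketch below only re-uses the numbers quoted on the slide; the variable names are illustrative, and "relative reduction" is one common way of comparing error rates, chosen here as an assumption.

```python
# Numbers copied from the slide (Zeiler et al., ICASSP 2013).
hidden_layers = [1, 2, 4, 8, 10, 12]
word_error_rate = [16.0, 12.8, 11.4, 10.9, 11.0, 11.1]  # percent
baseline_gmm = 15.4  # percent

# Pick the depth with the lowest word error rate.
best_wer, best_layers = min(zip(word_error_rate, hidden_layers))

# Relative reduction in word error rate compared with the GMM baseline.
relative_reduction = (baseline_gmm - best_wer) / baseline_gmm

print(f"best depth: {best_layers} hidden layers, WER {best_wer}%")
print(f"relative reduction vs. GMM baseline: {relative_reduction:.1%}")
```

On these figures the best network (8 hidden layers) cuts the error rate by a bit under a third relative to the baseline, while the single-layer network is slightly worse than the baseline, and adding layers beyond 8 does not help further.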
Impact of Deep Learning in Speech Technology

Slide credit: Li Deng, MS Research
Types of Learning


• Supervised (inductive) learning

– Given: training data + desired outputs (labels)
• Unsupervised learning
– Given: training data (without desired outputs)
• Semi-supervised learning
– Given: training data + a few desired outputs
• Reinforcement learning
– Rewards from sequence of actions

Based on slide by Pedro Domingos
Supervised Learning: Regression
• Given (x1, y1), (x2, y2), ..., (xn, yn)
• Learn a function f (x) to predict y given x
– y is real-valued == regression
[Figure: September Arctic Sea Ice Extent (1,000,000 sq km) by Year, 1970–2020]
Data from G. Witt. Journal of Statistics Education, Volume 21,
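For the regression setting above, one common concrete choice is a straight line fitted by least squares. That choice is an assumption made for this sketch (the slide does not commit to a particular model), and the symbols $w_0$, $w_1$, $\bar{x}$ and $\bar{y}$ are introduced only here; $\bar{x}$ and $\bar{y}$ denote the sample means of the $x_i$ and $y_i$.

```latex
f(x) = w_0 + w_1 x, \qquad
(w_0, w_1) = \arg\min_{w_0, w_1} \sum_{i=1}^{n} \bigl(y_i - w_0 - w_1 x_i\bigr)^2

w_1 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sum_{i=1}^{n} (x_i - \bar{x})^2},
\qquad
w_0 = \bar{y} - w_1 \bar{x}
```

Once $w_0$ and $w_1$ are known, predicting $y$ for a new $x$ (for example, a future year) is a single evaluation of $f(x)$.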
Supervised Learning: Classification
• Given (x1, y1), (x2, y2), ..., (xn, yn)
• Learn a function f (x) to predict y given x
– y is categorical == classification
[Figure: Breast Cancer (Malignant / Benign): label 1 (Malignant) or 0 (Benign) plotted against Tumor Size]
Based on example by Andrew Ng
[Figure: Cancer (Malignant / Benign) plotted against Tumor Size, with a threshold on Tumor Size separating “Predict Benign” from “Predict Malignant”]
Based on example by Andrew Ng
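The threshold idea in the figure can be sketched as a very small classifier. Everything concrete here is an assumption for illustration: the tumor sizes and labels are made up, the function names are hypothetical, and picking the threshold with the fewest training errors is just one simple way to learn f(x), not necessarily the method behind the original figure.

```python
def predict(tumor_size, threshold):
    """Return 1 (malignant) if the size exceeds the threshold, else 0 (benign)."""
    return 1 if tumor_size > threshold else 0


def fit_threshold(sizes, labels):
    """Pick the candidate threshold with the fewest training errors."""
    ordered = sorted(sizes)
    # Candidate thresholds: midpoints between consecutive sorted sizes.
    candidates = [(a + b) / 2 for a, b in zip(ordered, ordered[1:])]

    def training_errors(threshold):
        return sum(predict(x, threshold) != y for x, y in zip(sizes, labels))

    return min(candidates, key=training_errors)


# Hypothetical training set: tumor sizes (arbitrary units) with labels.
sizes = [1.0, 1.5, 2.0, 2.5, 3.5, 4.0, 4.5, 5.0]
labels = [0, 0, 0, 0, 1, 1, 1, 1]

threshold = fit_threshold(sizes, labels)
print(threshold, predict(1.8, threshold), predict(4.2, threshold))
```

Real data is rarely this cleanly separable, which is one reason later lectures move to models such as logistic regression.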
Supervised Learning
• x can be multi-dimensional
– Each dimension corresponds to an attribute
– Example attributes: Clump Thickness, Uniformity of Cell Size, Uniformity of Cell Shape, …
[Figure: Age plotted against Tumor Size]
Based on example by Andrew Ng
Unsupervised Learning
• Given x1, x2, ..., xn (without labels)
• Output hidden structure behind the x’s
– E.g., clustering
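As one concrete instance of clustering, the sketch below runs k-means on one-dimensional points. The slide only says "clustering", so the choice of k-means, the sample points, and the starting centroids are all assumptions made for illustration.

```python
def k_means_1d(points, centroids, iterations=10):
    """Cluster 1-D points by alternating assignment and centroid update."""
    centroids = list(centroids)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        centroids = [
            sum(cluster) / len(cluster) if cluster else centroid
            for cluster, centroid in zip(clusters, centroids)
        ]
    return centroids, clusters


# Hypothetical unlabeled data with two visible groups.
points = [1.0, 1.2, 0.8, 5.0, 5.3, 4.7]
centroids, clusters = k_means_1d(points, centroids=[0.0, 6.0])
print(centroids, clusters)
```

No labels are supplied anywhere: the two groups are discovered from the positions of the points alone, which is what distinguishes this from the supervised examples.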
Unsupervised Learning
Genomics application: group individuals by genetic similarity
[Figure: Genes vs. Individuals]
[Source: Daphne Koller]
Unsupervised Learning

• Organize computing clusters
• Social network analysis
• Market segmentation
• Astronomical data analysis

Image credit: NASA/JPL-Caltech/E. Churchwell (Univ. of Wisconsin, Madison)

Slide credit: Andrew Ng
Unsupervised Learning
• Independent component analysis – separate a
combined signal into its original sources

Image credit: statsoft.com
Audio from http://www.ism.ac.jp/~shiro/research/blindsep.html
Reinforcement Learning
• Given a sequence of states and actions with
(delayed) rewards, output a policy
– Policy is a mapping from states → actions that
tells you what to do in a given state
• Examples:
– Credit assignment problem
– Game playing
– Robot in a maze
– Balance a pole on your hand

The Agent-Environment Interface

Agent and environment interact at discrete time steps: $t = 0, 1, 2, \ldots$
Agent observes state at step $t$: $s_t \in S$
produces action at step $t$: $a_t \in A(s_t)$
gets resulting reward: $r_{t+1} \in \mathbb{R}$
and resulting next state: $s_{t+1}$

$\ldots \; s_t \xrightarrow{a_t} r_{t+1},\, s_{t+1} \xrightarrow{a_{t+1}} r_{t+2},\, s_{t+2} \xrightarrow{a_{t+2}} r_{t+3},\, s_{t+3} \xrightarrow{a_{t+3}} \ldots$
Slide credit: Sutton & Barto
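The interaction loop described above can be sketched with a toy environment. The corridor world, its reward, and the always-go-right policy are hypothetical, chosen only to make the roles of state, action, reward, and next state visible; the policy here is fixed by hand, not learned.

```python
# Hypothetical corridor: states 0..4, with a reward for reaching state 4.
GOAL = 4


def environment_step(state, action):
    """Return (reward r_{t+1}, next state s_{t+1}) for taking action in state."""
    next_state = min(max(state + action, 0), GOAL)
    reward = 1.0 if next_state == GOAL else 0.0
    return reward, next_state


# A policy maps each state to the action to take there (here: always step right).
policy = {state: +1 for state in range(GOAL + 1)}

state = 0
trajectory = []
total_reward = 0.0
for t in range(10):  # discrete time steps t = 0, 1, 2, ...
    action = policy[state]  # agent produces a_t from s_t
    reward, next_state = environment_step(state, action)  # environment responds
    trajectory.append((state, action, reward))
    total_reward += reward
    state = next_state
    if state == GOAL:
        break

print(trajectory, total_reward)
```

A reinforcement learning algorithm would replace the hand-written `policy` dictionary with one improved from the observed rewards; note that the only non-zero reward arrives at the final step, a small instance of the delayed-reward problem.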
Reinforcement Learning

https://www.youtube.com/watch?v=4cgWya-wjgY
Inverse Reinforcement Learning
• Learn policy from user demonstrations

Stanford Autonomous Helicopter

http://heli.stanford.edu/
https://www.youtube.com/watch?v=VCdxqn0fcnE
Framing a Learning Problem

Designing a Learning System
• Choose the training experience
• Choose exactly what is to be learned
– i.e. the target function
• Choose how to represent the target function
• Choose a learning algorithm to infer the target
function from the experience

[Diagram: Environment/Experience supplies Training data to the Learner, which produces Knowledge; the Performance Element applies that Knowledge to Testing data]
Based on slide by Ray Mooney
Training vs. Test Distribution
• We generally assume that the training and test examples are independently drawn from the same overall distribution of data
– We call this “i.i.d.”, which stands for “independent and identically distributed”
• If examples are not independent, requires collective classification
• If test distribution is different, requires transfer learning
Slide credit: Ray Mooney
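In practice, a common way to approximate this assumption is to shuffle the available examples and hold some of them out for testing, so that both sets are drawn from the same pool. The helper name, the 25% test fraction, and the toy examples below are assumptions for illustration. Shuffling does not make dependent data independent; it only avoids an ordering-based mismatch between the two sets.

```python
import random


def train_test_split(examples, test_fraction=0.25, seed=0):
    """Shuffle the examples, then split them into training and test sets."""
    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


# Hypothetical labeled examples: (x, y) pairs.
examples = [(x, x % 2) for x in range(20)]
train, test = train_test_split(examples)
print(len(train), len(test))
```

The test set is never shown to the learner, so performance measured on it estimates how well the learned function handles unseen examples from the same distribution.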
ML in a Nutshell
• Tens of thousands of machine learning
algorithms
– Hundreds new every year

• Every ML algorithm has three components:
– Representation
– Optimization
– Evaluation
Slide credit: Pedro Domingos
Various Function Representations
• Numerical functions
– Linear regression
– Neural networks
– Support vector machines
• Symbolic functions
– Decision trees
– Rules in propositional logic
– Rules in first-order predicate logic
• Instance-based functions
– Nearest-neighbor
– Case-based
• Probabilistic Graphical Models
– Naïve Bayes
– Bayesian networks
– Hidden-Markov Models (HMMs)
– Probabilistic Context Free Grammars (PCFGs)
– Markov networks

Slide credit: Ray Mooney
Various Search/Optimization Algorithms
• Gradient descent
– Perceptron
– Backpropagation
• Dynamic Programming
– HMM Learning
– PCFG Learning
• Divide and Conquer
– Decision tree induction
– Rule learning
• Evolutionary Computation
– Genetic Algorithms (GAs)
– Genetic Programming (GP)
– Neuro-evolution

Slide credit: Ray Mooney
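Of the optimization families listed above, gradient descent is the easiest to show in a few lines. The sketch below minimizes mean squared error for a one-variable linear model; the model, the learning rate, the step count, and the data are all assumptions picked so the example converges quickly, not settings taken from the slides.

```python
def gradient_descent(xs, ys, learning_rate=0.05, steps=2000):
    """Minimize the mean squared error of y = w * x + b by gradient descent."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        # Gradients of (1/n) * sum((w*x + b - y)^2) with respect to w and b.
        grad_w = (2.0 / n) * sum((w * x + b - y) * x for x, y in zip(xs, ys))
        grad_b = (2.0 / n) * sum((w * x + b - y) for x, y in zip(xs, ys))
        # Step against the gradient.
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b


# Hypothetical data generated from y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
w, b = gradient_descent(xs, ys)
print(w, b)
```

In the terms of the preceding slides, the linear model is the representation, squared error is the evaluation, and the update loop is the optimization. A learning rate that is too large would make the updates diverge instead of converge.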
Evaluation
• Accuracy
• Precision and recall
• Squared error
• Likelihood
• Posterior probability
• Cost / Utility
• Margin
• Entropy
• K-L divergence
• etc.

Slide credit: Pedro Domingos
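A few of the measures listed above can be computed directly from predictions. The sketch below covers accuracy, precision, recall, and squared error for illustration; the function names and the sample labels are assumptions, and binary labels with 1 as the positive class are assumed throughout.

```python
def confusion_counts(y_true, y_pred):
    """Count true/false positives and negatives for binary labels (1 = positive)."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    return tp, tn, fp, fn


def accuracy(y_true, y_pred):
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    return (tp + tn) / (tp + tn + fp + fn)


def precision(y_true, y_pred):
    tp, _, fp, _ = confusion_counts(y_true, y_pred)
    return tp / (tp + fp) if tp + fp else 0.0


def recall(y_true, y_pred):
    tp, _, _, fn = confusion_counts(y_true, y_pred)
    return tp / (tp + fn) if tp + fn else 0.0


def mean_squared_error(y_true, y_pred):
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical labels and predictions.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 0, 1, 0, 1, 0, 0, 0]
print(accuracy(y_true, y_pred), precision(y_true, y_pred), recall(y_true, y_pred))
```

The three classification measures disagree on this sample, which is why the choice of evaluation measure matters: a classifier tuned for precision can look quite different from one tuned for recall.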
ML in Practice
• Understand domain, prior knowledge, and goals
• Data integration, selection, cleaning, pre-processing, etc.
• Learn models
• Interpret results
• Consolidate and deploy discovered knowledge
Loop: the steps are iterated rather than performed once.
Based on a slide by Pedro Domingos
Lessons Learned about Learning
• Learning can be viewed as using direct or indirect experience to approximate a chosen target function.

• Function approximation can be viewed as a search through a space of hypotheses (representations of functions) for one that best fits a set of training data.

• Different learning methods assume different hypothesis spaces (representation languages) and/or employ different search techniques.
Slide credit: Ray Mooney
A Brief History of Machine Learning
What We’ll Cover in this Course
• Supervised learning
– Decision tree induction
– Linear regression
– Logistic regression
– Support vector machines & kernel methods
– Model ensembles
– Bayesian learning
– Neural networks & deep learning
– Learning theory
• Unsupervised learning
– Clustering
– Dimensionality reduction
• Reinforcement learning
– Temporal difference learning
– Q learning
• Evaluation
• Applications

Our focus will be on applying machine learning to real applications

