0% found this document useful (0 votes)

94 views

01 Introduction

Uploaded by

ft ta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

94 views

01 Introduction

Uploaded by

ft ta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 49

CSE

5603
Introduction to
Machine Learning
Instructor: Belayneh M

1
Robot Image Credit: Viktoriya Sukhanova © 123RF.com
What We’ll Cover in this Course

• AI vs ML • Unsupervised learning

• ML – Clustering
• Supervised learning – Dimensionality reduction
– Decision tree induction • Reinforcement learning
– Linear regression – Temporal difference
learning
– Logistic regression
– Q learning
– Support vector machines
& kernel methods • Evaluation
– Model ensembles • Applications
– Bayesian learning
– Neural networks & deep
learning
– Learning theory
Artificial Intelligence (AI)
vs Machine learning (ML)
AI:
is the broader concept of machines being able to carry
out tasks in a way that we would consider “smart”.

ML:
an application of AI based around the idea that we
should really just be able to give machines access to
data and let them learn for themselves.
Big Data
• Widespread use of personal computers and wireless
communication leads to “big data”
• We are both producers and consumers of data
• Data is not random, it has structure, e.g., customer
behavior
• We need “big theory” to extract that structure from
data for
(a) Understanding the process
(b) Making predictions for the future
What is Machine Learning?
“Learning is any process by which a system improves
performance from experience.”
‐ Herbert Simon

Definition by Tom Mitchell (1998):
Machine Learning is the study of algorithms that
• improve their performance P
• at some task T
• with experience E.
A well‐defined learning task is given by <P, T, E>.
3
Traditional Programming

Data
Computer Output
Program

Machine Learning

Data
Computer Program
Output

4
Slide credit: Pedro Domingos
When Do We Use Machine Learning?
ML is used when:
• Human expertise does not exist (navigating on Mars)
• Humans can’t explain their expertise (speech recognition)
• Models must be customized (personalized medicine)
• Models are based on huge amounts of data (genomics)

Learning isn’t always useful:
• There is no need to “learn” to calculate payroll
5
Based on slide by E. Alpaydin
A classic example of a task that requires machine learning:
It is very hard to say what makes a 2

6
Slide credit: Geoffrey Hinton
Some more examples of tasks that are best
solved by using a learning algorithm
• Recognizing patterns:
– Facial identities or facial expressions
– Handwritten or spoken words
– Medical images
• Generating patterns:
– Generating images or motion sequences
• Recognizing anomalies:
– Unusual credit card transactions
– Unusual patterns of sensor readings in a nuclear power plant
• Prediction:
– Future stock prices or currency exchange rates
7
Slide credit: Geoffrey Hinton
Sample Applications
• Web search
• Computational biology
• Finance
• E‐commerce
• Space exploration
• Robotics
• Information extraction
• Social networks
• Debugging software
• [Your favorite area]

8
Slide credit: Pedro Domingos
Samuel’s Checkers‐Player
“Machine Learning: Field of study that gives
computers the ability to learn without being
explicitly programmed.” ‐Arthur Samuel (1959)

9
Defining the Learning Task
Improve on task T, with respect to
performance metric P, based on experience E
T: Playing checkers
P: Percentage of games won against an arbitrary opponent
E: Playing practice games against itself

T: Recognizing hand‐written words

P: Percentage of words correctly classified
E: Database of human‐labeled images of handwritten words

T: Driving on four‐lane highways using vision sensors

P: Average distance traveled before a human‐judged error
E: A sequence of images and steering commands recorded while
observing a human driver.

T: Categorize email messages as spam or legitimate.

P: Percentage of email messages correctly classified.
E: Database of emails, some with human‐given labels
10
Slide credit: Ray Mooney
State of the Art Applications of
Machine Learning

11
Autonomous Cars

• Nevada made it legal for
autonomous cars to drive on
roads in June 2011
• As of 2013, four states (Nevada,
Florida, California, and
Michigan) have legalized
autonomous cars
Penn’s Autonomous Car à
12
(Ben Franklin Racing Team)
Autonomous Car Sensors

13
Autonomous Car Technology
Path
Planning

Laser Terrain Mapping

Learning from Human Drivers
Adaptive Vision

Sebastian

Stanley

Images and movies taken from Sebastian Thrun’s multimedia w1e4bsite.
Deep Learning in the Headlines

15
Deep Belief Net on Face Images
object models

object parts
(combination
of edges)

edges

pixels
Based on materials 16
by Andrew Ng
Learning of Object Parts

17
Slide credit: Andrew Ng
Training on Multiple Objects

Trained on 4 classes (cars, faces,
motorbikes, airplanes).
Second layer: Shared‐features
and object‐specific features.
Third layer: More specific
features.

18
Slide credit: Andrew Ng
Scene Labeling via Deep Learning

[Farabet et al. ICML 2012, PAMI 2013] 19
Inference from Deep Learned Models
Generating posterior samples from faces by “filling in” experiments
(cf. Lee and Mumford, 2003). Combine bottom‐up and top‐down inference.

Input images

Samples from
feedforward
Inference
(control)

Samples from
Full posterior
inference

20
Slide credit: Andrew Ng
Machine Learning in
Automatic Speech Recognition
A Typical Speech Recognition System

ML used to predict of phone states from the sound spectrogram

Deep learning has state‐of‐the‐art results
# Hidden Layers 1 2 4 8 10 12

Word Error Rate % 16.0 12.8 11.4 10.9 11.0 11.1

Baseline GMM performance = 15.4%
[Zeiler et al. “On rectified linear units for speech
recognition” ICASSP 2013]
21
Impact of Deep Learning in Speech Technology

22
Slide credit: Li Deng, MS Research
Types of Learning

23
Types of Learning

• Supervised (inductive) learning
– Given: training data + desired outputs (labels)
• Unsupervised learning
– Given: training data (without desired outputs)
• Semi‐supervised learning
– Given: training data + a few desired outputs
• Reinforcement learning
– Rewards from sequence of actions

24
Based on slide by Pedro Domingos
Supervised Learning: Regression
• Given (x 1 , y1), (x 2 , y2), ..., (x n , yn)
• Learn a function f(x) to predict y givenx
– y is real‐valued == regression
9
September Arctic Sea Ice Extent

8
7
(1,000,000 sq km)

6
5
4
3
2
1
0
1970 1980 1990 2000 2010 2020
Year
26
Data from G. Witt. Journal of Statistics Education, Volume 21, Number 1 (2013)
Supervised Learning: Classification
• Given (x 1 , y1), (x 2 , y2), ..., (x n , yn)
• Learn a function f(x) to predict y givenx
– y is categorical == classification
Breast Cancer (Malignant / Benign)

1(Malignant)

0(Benign)
Tumor Size

27
Based on example by Andrew Ng
Supervised Learning: Classification
• Given (x 1 , y1), (x 2 , y2), ..., (x n , yn)
• Learn a function f(x) to predict y givenx
– y is categorical == classification
Breast Cancer (Malignant / Benign)

1(Malignant)

0(Benign)
Tumor Size

Based on example by Andrew Ng
Tumor Size 28
Supervised Learning: Classification
• Given (x 1 , y1), (x 2 , y2), ..., (x n , yn)
• Learn a function f(x) to predict y givenx
– y is categorical == classification
Breast Cancer (Malignant / Benign)

1(Malignant)

0(Benign)
Tumor Size
Predict Benign Predict Malignant

Based on example by Andrew Ng
Tumor Size 29
Supervised Learning
• x can be multi‐dimensional
– Each dimension corresponds to an attribute

‐ Clump Thickness
‐ Uniformity of Cell Size
Age ‐ Uniformity of Cell Shape
…

Tumor Size

30
Based on example by Andrew Ng
Unsupervised Learning
• Given x 1 , x 2 , ..., x n (without labels)
• Output hidden structure behind the x’s
– E.g., clustering

31
Unsupervised Learning
Genomics application: group individuals by genetic similarity
Genes

Individuals 32
[Source: Daphne Koller]
Unsupervised Learning

Organize computing clusters Social network analysis

Image credit: NASA/JPL‐Caltech/E. Churchwell (Univ. of Wisconsin, Madison)

Market segmentation Astronomical data analysis 33

Slide credit: Andrew Ng
Unsupervised Learning
• Independent component analysis – separate a
combined signal into its original sources

34
Image credit: statsoft.com Audio from http://www.ism.ac.jp/~shiro/research/blindsep.html
Unsupervised Learning
• Independent component analysis – separate a
combined signal into its original sources

35
Image credit: statsoft.com Audio from http://www.ism.ac.jp/~shiro/research/blindsep.html
Reinforcement Learning
• Given a sequence of states and actions with
(delayed) rewards, output a policy
– Policy is a mapping from states à actions that
tells you what to do in a given state
• Examples:
– Credit assignment problem
– Game playing
– Robot in a maze
– Balance a pole on your hand

36
The Agent‐Environment Interface

Agent and environment interact at discrete timesteps : t  0, 1, 2, K

Agent observes state at step t : st S
produces action at step t : at  A(st )
gets resulting reward : rt1 
and resulting next state : st 1

... rt +1 s rt +2 s rt +3 s ...

st a t+1
at+1 t +2
at+2 t +3 at+3
t
37
Slide credit: Sutton & Barto
Reinforcement Learning

https://www.youtube.com/watch?v=4cgWya‐wjgY 38
Inverse Reinforcement Learning
• Learn policy from user demonstrations

Stanford Autonomous Helicopter
http://heli.stanford.edu/
https://www.youtube.com/watch?v=VCdxqn0fcnE
39
Framing a Learning Problem

40
Designing a Learning System
• Choose the training experience
• Choose exactly what is to be learned
– i.e. the target function
• Choose how to represent the target function
• Choose a learning algorithm to infer the target
function from the experience

Training data Learner

Environment/
Experience Knowledge

Testing data
Performance
Element 41
Based on slide by Ray Mooney
Training vs. Test Distribution
• We generally assume that the training and
test examples are independently drawn from
the same overall distribution of data
– We call this “i.i.d” which stands for “independent
and identically distributed”

• If examples are not independent, requires
collective classification
• If test distribution is different, requires
transfer learning
42
Slide credit: Ray Mooney
ML in a Nutshell
• Tens of thousands of machine learning
algorithms
– Hundreds new every year

• Every ML algorithm has three components:
– Representation
– Optimization
– Evaluation

43
Slide credit: Pedro Domingos
Various Function Representations
• Numerical functions
– Linear regression
– Neural networks
– Support vector machines
• Symbolic functions
– Decision trees
– Rules in propositional logic
– Rules in first‐order predicate logic
• Instance‐based functions
– Nearest‐neighbor
– Case‐based
• Probabilistic Graphical Models
– Naïve Bayes
– Bayesian networks
– Hidden‐Markov Models (HMMs)
– Probabilistic Context Free Grammars (PCFGs)
– Markov networks

44
Slide credit: Ray Mooney
Various Search/Optimization
Algorithms
• Gradient descent
– Perceptron
– Backpropagation
• Dynamic Programming
– HMM Learning
– PCFG Learning
• Divide and Conquer
– Decision tree induction
– Rule learning
• Evolutionary Computation
– Genetic Algorithms (GAs)
– Genetic Programming (GP)
– Neuro‐evolution

45
Slide credit: Ray Mooney
Evaluation
• Accuracy
• Precision and recall
• Squared error
• Likelihood
• Posterior probability
• Cost / Utility
• Margin
• Entropy
• K‐L divergence
• etc.

47
Slide credit: Pedro Domingos
ML in Practice

• Understand domain, prior knowledge, and goals
• Data integration, selection, cleaning, pre‐processing, etc.
Loop • Learn models
• Interpret results
• Consolidate and deploy discovered knowledge

48
Based on a slide by Pedro Domingos
Lessons Learned about Learning
• Learning can be viewed as using direct or indirect
experience to approximate a chosen target function.

• Function approximation can be viewed as a search
through a space of hypotheses (representations of
functions) for one that best fits a set of training data.

• Different learning methods assume different
hypothesis spaces (representation languages) and/or
employ different search techniques.

49
Slide credit: Ray Mooney

Chapter - 2 Emergence and Development of Management Thought
56% (9)
Chapter - 2 Emergence and Development of Management Thought
58 pages
Job Order Costing
80% (5)
Job Order Costing
6 pages
PD Cen TR 12831-2-2017 PDF
100% (2)
PD Cen TR 12831-2-2017 PDF
34 pages
Event N Participants List v4
No ratings yet
Event N Participants List v4
30 pages
01_introduction (2)
No ratings yet
01_introduction (2)
51 pages
1.0_introduction
No ratings yet
1.0_introduction
50 pages
01 Introduction
No ratings yet
01 Introduction
50 pages
Introduction To Machine Learning: WWW - Seas.upenn - Edu/ Cis519
100% (1)
Introduction To Machine Learning: WWW - Seas.upenn - Edu/ Cis519
51 pages
Vikas Machine
No ratings yet
Vikas Machine
23 pages
Machine Learning Week2 (1)
No ratings yet
Machine Learning Week2 (1)
51 pages
AML All Merged PDF Class 1 To 8
No ratings yet
AML All Merged PDF Class 1 To 8
423 pages
01 Introduction
No ratings yet
01 Introduction
43 pages
ML Lecture#1
No ratings yet
ML Lecture#1
52 pages
Lecture 1
No ratings yet
Lecture 1
43 pages
Machine Learning Copy
No ratings yet
Machine Learning Copy
42 pages
Lecture 01 - Introduction To AML-Jan24
No ratings yet
Lecture 01 - Introduction To AML-Jan24
66 pages
Military AI-Week 02-Key Concept Machine Learning
No ratings yet
Military AI-Week 02-Key Concept Machine Learning
84 pages
Lecture 1 - Introduction (DONE!!)
No ratings yet
Lecture 1 - Introduction (DONE!!)
33 pages
01 Introduction 1
No ratings yet
01 Introduction 1
71 pages
CE802_Lec_IntroML_handouts
No ratings yet
CE802_Lec_IntroML_handouts
24 pages
Module 1-Basics of ML
No ratings yet
Module 1-Basics of ML
142 pages
Lecture Compiled
No ratings yet
Lecture Compiled
224 pages
DIR Notes 1
No ratings yet
DIR Notes 1
39 pages
ML Seminar Presentation
No ratings yet
ML Seminar Presentation
26 pages
New Microsoft Office Word Document
No ratings yet
New Microsoft Office Word Document
28 pages
Lec 1,2
No ratings yet
Lec 1,2
69 pages
CE469 - Introduction To Machine Learning: Lecturer Contact
No ratings yet
CE469 - Introduction To Machine Learning: Lecturer Contact
33 pages
AML - Mid Term - Merged
No ratings yet
AML - Mid Term - Merged
192 pages
Lecture 1
No ratings yet
Lecture 1
47 pages
1. ML Introduction
No ratings yet
1. ML Introduction
54 pages
Introduction To ML P2
No ratings yet
Introduction To ML P2
30 pages
part4
No ratings yet
part4
11 pages
Machine Learning Lecture-01
No ratings yet
Machine Learning Lecture-01
37 pages
Machine Learning: BE Sixth Semester 20CS610
No ratings yet
Machine Learning: BE Sixth Semester 20CS610
211 pages
lecture1 (1)
No ratings yet
lecture1 (1)
73 pages
Intro To ML
No ratings yet
Intro To ML
107 pages
Presentation of AI ML Session 1
No ratings yet
Presentation of AI ML Session 1
131 pages
MLT UINT1
No ratings yet
MLT UINT1
26 pages
Lecture1 PDF
No ratings yet
Lecture1 PDF
37 pages
ML Short U1-4
No ratings yet
ML Short U1-4
60 pages
A.I. Lecture 4 NEW
No ratings yet
A.I. Lecture 4 NEW
31 pages
Introduction To ML
No ratings yet
Introduction To ML
4 pages
ML Toppers Solutions 2019
No ratings yet
ML Toppers Solutions 2019
105 pages
Module 1
No ratings yet
Module 1
175 pages
Introduction To Machine Learning
100% (1)
Introduction To Machine Learning
11 pages
ML Lecture 2 Supervised Learning Setup
No ratings yet
ML Lecture 2 Supervised Learning Setup
38 pages
18.Overview
No ratings yet
18.Overview
18 pages
MLUnit_1
No ratings yet
MLUnit_1
131 pages
ML 3170724 Unit-1
No ratings yet
ML 3170724 Unit-1
27 pages
mlintro-4
No ratings yet
mlintro-4
28 pages
NPTEL_Week01_02_OverviewOfMachineLearning.pptx
No ratings yet
NPTEL_Week01_02_OverviewOfMachineLearning.pptx
12 pages
Lec1 Intoduction
No ratings yet
Lec1 Intoduction
34 pages
ML intro
No ratings yet
ML intro
28 pages
Machine Learning Techniques-bcds062!01!01[1]
No ratings yet
Machine Learning Techniques-bcds062!01!01[1]
66 pages
Machine Learning: Louis Fippo Fitime
No ratings yet
Machine Learning: Louis Fippo Fitime
37 pages
Lecture 1 Ai
No ratings yet
Lecture 1 Ai
38 pages
Lecture 1.1. Introduction
No ratings yet
Lecture 1.1. Introduction
48 pages
ML Merged
No ratings yet
ML Merged
433 pages
PDF Machine Learning
100% (1)
PDF Machine Learning
222 pages
presentation
No ratings yet
presentation
10 pages
Unit1 basic introduction
No ratings yet
Unit1 basic introduction
16 pages
Chapter 1 - Introduction
No ratings yet
Chapter 1 - Introduction
28 pages
DS Artificial Intelligence 3
No ratings yet
DS Artificial Intelligence 3
14 pages
Machine Minds AI for all: An Ethical Intelligence & Responsible Revolution
From Everand
Machine Minds AI for all: An Ethical Intelligence & Responsible Revolution
aarat
No ratings yet
23.transmit Audio and Video Content Over Network
No ratings yet
23.transmit Audio and Video Content Over Network
5 pages
Semantic Analysis: Natural Language Processing (CSE 5321)
No ratings yet
Semantic Analysis: Natural Language Processing (CSE 5321)
35 pages
Syntax and Parsing: Natural Language Processing (Cse 5321)
No ratings yet
Syntax and Parsing: Natural Language Processing (Cse 5321)
32 pages
Discourse and Pragmatic Processing: Natural Language Processing (CSE 5321)
100% (1)
Discourse and Pragmatic Processing: Natural Language Processing (CSE 5321)
18 pages
Morphological Analysis: Natural Language Processing (CSE 5321)
No ratings yet
Morphological Analysis: Natural Language Processing (CSE 5321)
23 pages
Disambiguation: Natural Language Processing (CSE 5321)
No ratings yet
Disambiguation: Natural Language Processing (CSE 5321)
14 pages
Case Analysis 2
No ratings yet
Case Analysis 2
2 pages
Chapter 6 & 7
No ratings yet
Chapter 6 & 7
45 pages
Chapter - 1 Management Overview
No ratings yet
Chapter - 1 Management Overview
52 pages
Victoriana - 2e - The Marylebone Mummy
100% (2)
Victoriana - 2e - The Marylebone Mummy
56 pages
Authentication Applications: Henric Johnson Blekinge Institute of Technology, Sweden Henric - Johnson@bth - Se
No ratings yet
Authentication Applications: Henric Johnson Blekinge Institute of Technology, Sweden Henric - Johnson@bth - Se
24 pages
NVEQ SWB IT L1 U5 Spreadsheet (Basic)
No ratings yet
NVEQ SWB IT L1 U5 Spreadsheet (Basic)
52 pages
Enhance Green Purchase Green Perceived Value, Risk, Green Trust
No ratings yet
Enhance Green Purchase Green Perceived Value, Risk, Green Trust
20 pages
Z - (Assignment1)
No ratings yet
Z - (Assignment1)
2 pages
Khands in Sri Japji Sahib
No ratings yet
Khands in Sri Japji Sahib
5 pages
Philippine History Assignment #3: Louela M. Naag IE22FB2
No ratings yet
Philippine History Assignment #3: Louela M. Naag IE22FB2
2 pages
SJK (C) Pei Hwa Year 5 English Language Assessment (1) Comprehension Paper 1 Hour 15 Minutes
No ratings yet
SJK (C) Pei Hwa Year 5 English Language Assessment (1) Comprehension Paper 1 Hour 15 Minutes
11 pages
Paper+30+ (2022 4 1) +Generation+Y+and+Z+Filipino+Consumers'+Purchasing
No ratings yet
Paper+30+ (2022 4 1) +Generation+Y+and+Z+Filipino+Consumers'+Purchasing
12 pages
NM - Vectors and Scalars - Lesson A
No ratings yet
NM - Vectors and Scalars - Lesson A
34 pages
TailRiskHedging SébastienJacques PDF
No ratings yet
TailRiskHedging SébastienJacques PDF
46 pages
Beyond Revenge The Evolution of the Forgiveness Instinct 1st Edition Michael Mccullough download pdf
100% (14)
Beyond Revenge The Evolution of the Forgiveness Instinct 1st Edition Michael Mccullough download pdf
60 pages
Live-Action and Animated Disney Films - An Analysis of Themes and
No ratings yet
Live-Action and Animated Disney Films - An Analysis of Themes and
33 pages
Heine, Heinrich - Poems (Holt, 1917)
No ratings yet
Heine, Heinrich - Poems (Holt, 1917)
328 pages
Energy and Shots For Diode Laser
No ratings yet
Energy and Shots For Diode Laser
2 pages
History
No ratings yet
History
3 pages
NIST Privacy Framework - Highlights From Version 1
No ratings yet
NIST Privacy Framework - Highlights From Version 1
17 pages
Ebooks File (Ebook PDF) Introduction To Psychology: Gateways To Mind and Behavior 15th Edition All Chapters
100% (2)
Ebooks File (Ebook PDF) Introduction To Psychology: Gateways To Mind and Behavior 15th Edition All Chapters
51 pages
N430 - EDUCATIONAL PSYCHOLOGY N6 QP 13 NOV 2019 - Edited
No ratings yet
N430 - EDUCATIONAL PSYCHOLOGY N6 QP 13 NOV 2019 - Edited
9 pages
Unit 21 PDF
No ratings yet
Unit 21 PDF
12 pages
AHRQ Safety Program For Mechanically Ventilated Patients: Daily Early Mobility Data Collection Tool
No ratings yet
AHRQ Safety Program For Mechanically Ventilated Patients: Daily Early Mobility Data Collection Tool
20 pages
Mock CAT - 04 PDF
No ratings yet
Mock CAT - 04 PDF
78 pages
Final Term Paper of Conflict Management
No ratings yet
Final Term Paper of Conflict Management
13 pages
Jesus in India PDF
100% (1)
Jesus in India PDF
56 pages
A Report On Deposit in RBB Bank Nepal
0% (1)
A Report On Deposit in RBB Bank Nepal
30 pages
SMCC 2122 Marking
No ratings yet
SMCC 2122 Marking
10 pages
Cost of Business Setup in Dubai
No ratings yet
Cost of Business Setup in Dubai
9 pages