CSE445 1 Intro To ML
• What is ML
• Types of ML
• Supervised/Unsupervised/Semi-Supervised/Reinforcement Learning
• Online/Batch Learning
• Instance/Model Based Learning
• Challenges of ML
Spam email/SMS:
Emails/SMS that contain unwanted or dangerous content.
Solution:
Use a spam filter to identify such emails/SMSs and flag them as spam.
Problem:
Spammers change their patterns, so there is a need to keep writing new rules forever.
Figure 1: The traditional approach to software design
Figure 4: A labeled training set for spam classification (example of supervised learning)
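As a minimal sketch of the supervised setup in Figure 4, the snippet below trains a simple Naive Bayes spam classifier on a tiny set of labeled messages. The messages and labels are invented purely for illustration, and scikit-learn is assumed to be available.

```python
# Sketch: supervised spam classification with scikit-learn.
# The tiny labeled dataset below is invented purely for illustration.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

messages = [
    "Win a FREE prize now",            # spam
    "Lowest price pills, click here",  # spam
    "Meeting moved to 3 pm",           # ham
    "Can you review my report?",       # ham
]
labels = [1, 1, 0, 0]  # 1 = spam, 0 = ham

# Turn raw text into word-count features, then fit the classifier.
vectorizer = CountVectorizer()
X = vectorizer.fit_transform(messages)
model = MultinomialNB().fit(X, labels)

# Flag a new, unseen message.
new = vectorizer.transform(["FREE prize waiting, click now"])
print(model.predict(new))  # expected: [1] -> flagged as spam
```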
Figure 5: A labeled training set for housing price prediction (example of supervised learning)
• We could turn this example into a classification problem by instead predicting whether the house sells for more or less than the asking price. Here we are classifying the houses into two discrete categories based on price.
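A hedged sketch of both framings is given below: regression on the continuous sale price, and binary classification against the asking price. The house sizes and prices are made-up numbers, and scikit-learn is assumed.

```python
# Sketch: the same housing data framed as regression and as classification.
# Sizes (sq ft), sale prices, and asking prices below are invented.
import numpy as np
from sklearn.linear_model import LinearRegression, LogisticRegression

size = np.array([[1000], [1500], [2000], [2500], [3000]])   # single feature
sale_price = np.array([200_000, 280_000, 360_000, 430_000, 520_000])
asking_price = np.array([210_000, 270_000, 370_000, 420_000, 500_000])

# (a) Regression: predict the continuous sale price from size.
reg = LinearRegression().fit(size, sale_price)
print(reg.predict([[1800]]))   # estimated price for a 1800 sq ft house

# (b) Classification: predict whether the house sells above its asking price.
above_asking = (sale_price > asking_price).astype(int)  # 1 = sold above asking
clf = LogisticRegression().fit(size, above_asking)
print(clf.predict([[1800]]))   # 0 or 1
```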
Supervised Learning Example (contd.)
Example 2:
• Can we classify a breast tumor as malignant or benign based on
tumor size?
Example 2 (contd.):
• This is an example of a classification problem
• Classify data into one of two discrete classes - no in between, either malignant or not
• In classification problems, the output can take on a discrete number of possible values
• e.g. there could be four values:
• 0 - benign
• 1 - type 1
• 2 - type 2
• 3 - type 3
Example 2 (contd.):
• In classification problems we can plot data in a different way
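A hedged sketch of Example 2 is shown below using scikit-learn's bundled breast cancer dataset and logistic regression. Note this is only a stand-in: the bundled dataset describes tumors by many features, not tumor size alone.

```python
# Sketch: binary classification of tumors as malignant vs. benign.
# Uses scikit-learn's bundled breast cancer dataset as a stand-in.
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)   # y: 0 = malignant, 1 = benign
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Fit a simple linear classifier and check accuracy on held-out data.
clf = LogisticRegression(max_iter=5000).fit(X_train, y_train)
print("test accuracy:", clf.score(X_test, y_test))
```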
Example 3:
• (a) Regression - Given a picture of a male/female, we have to
predict his/her age from the picture.
• (b) Classification - Given a picture of a male/female, we have to
predict whether he/she is of high-school, college, or graduate age.
• Another example of classification - Banks have to decide whether
or not to give someone a loan on the basis of their credit history.
• Clustering
• K-Means
• DBSCAN
• Hierarchical Cluster Analysis (HCA)
• Anomaly Detection and novelty detection
• One-class SVM
• Isolation Forest
• Visualization and dimensionality reduction
• Principal Component Analysis (PCA)
• Kernel PCA
• Locally Linear Embedding (LLE)
• t-Distributed Stochastic Neighbor Embedding (t-SNE)
• Association rule learning
• Apriori
• Eclat
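As a minimal sketch of two of the unsupervised techniques listed above (K-Means clustering and PCA for dimensionality reduction), the snippet below runs both on synthetic, unlabeled data generated just for illustration; scikit-learn is assumed.

```python
# Sketch: unsupervised learning on unlabeled data.
# make_blobs generates synthetic points purely for illustration.
from sklearn.datasets import make_blobs
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA

X, _ = make_blobs(n_samples=300, centers=3, n_features=5, random_state=0)

# Clustering: group the points into 3 clusters without using any labels.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X)
print(kmeans.labels_[:10])   # cluster id assigned to the first 10 points

# Dimensionality reduction: project the 5-D data down to 2-D for visualization.
X_2d = PCA(n_components=2).fit_transform(X)
print(X_2d.shape)            # (300, 2)
```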
• Online Learning
• Train the system incrementally by feeding data instances sequentially,
either individually or in small groups called mini-batches.
• Each learning step is fast and cheap
• The system can learn about new data on the fly as it arrives.
• Suitable for systems that receive data as a continuous flow (e.g. stock
prices) and need to adapt autonomously.
• Also suitable to train systems on huge datasets that cannot fit in one
machine’s main memory (out-of-core learning)
• Challenge: feeding the system bad data gradually degrades its performance (a minimal incremental-learning sketch follows below)
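The sketch below illustrates online (incremental) learning under the assumptions that scikit-learn is available and that its SGDClassifier with partial_fit stands in for the learner; the streamed mini-batches are synthetic and generated only for illustration.

```python
# Sketch: online learning -- the model is updated one mini-batch at a time,
# so the full dataset never has to fit in memory at once (out-of-core style).
import numpy as np
from sklearn.linear_model import SGDClassifier

model = SGDClassifier(random_state=0)
classes = np.array([0, 1])            # must be declared on the first partial_fit

rng = np.random.default_rng(0)
for step in range(100):               # each iteration simulates a new mini-batch
    X_batch = rng.normal(size=(32, 4))
    y_batch = (X_batch[:, 0] + X_batch[:, 1] > 0).astype(int)  # synthetic labels
    model.partial_fit(X_batch, y_batch, classes=classes)

# The model can keep learning on the fly as more data arrives.
print(model.predict(rng.normal(size=(5, 4))))
```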
Figure 14: The importance of data versus algorithms. Figure reproduced with permission from Banko and Brill (2001), "Learning Curves for Confusion Set Disambiguation."
Python Tutorials:
• https://www.w3schools.com/python/python_syntax.asp
• https://www.geeksforgeeks.org/how-to-use-jupyter-notebook-an-ultimate-guide/
• Google Colab tutorial: https://colab.research.google.com/drive/16pBJQePbqkz3QFV54L4NIkOn1kwpuRrj