0% found this document useful (0 votes)

5 views28 pages

ML Intro

The document outlines the fundamentals of machine learning, including definitions of learning and the data science process. It distinguishes between supervised, unsupervised, and semi-supervised learning, and discusses the stages of machine learning such as training and testing. Additionally, it provides examples related to cancer diagnosis to illustrate the application of machine learning algorithms in classification tasks.

Uploaded by

rtzvdpsw2x

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views28 pages

ML Intro

Uploaded by

rtzvdpsw2x

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 28

Machine Learning

Data Science Process

Learning?
• Herbert Simon: “Learning is any process by which a
system improves performance from experience.”

• There are two ways that a system can improve:

1. By acquiring new knowledge (e.g. acquiring new facts)
2. By adapting its behavior (e.g. solving problems more accurately )
• How to learn a machine using data?
Main types of Machine Learning
• Supervised learning(With a teacher): uses a
series of labelled examples with direct feedback

• Unsupervised/clustering learning (without a

teacher): no feedback

• Semi-supervised: in between supervised and

unsupervised learning (Some data is labeled but
most of it is unlabeled)
Supervised vs Unsupervised
• How many groups do we have in this figure?
• Can we apply supervised learning?
• What will you get if you apply unsupervised learning?
What do you think now?
Supervised vs Unsupervised
• Can you separate this data into two groups?
Supervised vs Unsupervised vs Semi-supervised
Example
• We have a dataset with two columns x1 and x2
X1 X2
1 2
5 3
… …
• We plot the data into two-dimensional space as follows

Q1) can you

divide the data
into two
groups?
• Q1) can you divide the data into two groups?
• Try to separate the points based on the distance
between the data points

X1 X2
1 2
5 3
... ...
Q2) If we give you the labels (a new column which
provides the class of each row) can you draw a line
that separte the two classes?

X1 X2 X3
(Label)
1 2 normal
(blue)
5 3 abnormal
(red)
.. .. ..
Examples of
ML Algorithms
Usual ML stages
• Hypothesis, data
• Training or learning (requires examples/data)
• Testing or generalization
Training
• Training is the acquisition of knowledge, skills, and competencies as
a result of teaching, practical skills and knowledge that relate to
specific useful competencies (wikipedia)
• Training requires scenarios or examples (data)
In machine learning we learn from the available data or examples

Training: The figure shows how the separating line is updated through the several training steps

Initial random line Updating the line after one Training is complete
training step
Testing
• How well the learned system works?
• Generalization
• Performance on unseen or unknown scenarios or data

• Which model performs the best?

Types of testing
• Evaluate performance by
testing on data NOT used for
training (both should be
randomly sampled)

• Cross validation methods

for small data sets

The more (relevant) data the

better.
Defining the Learning Task
Improve on task, T, with respect to
performance metric, P, based on experience, E.
T: Recognizing hand-written words
P: Percentage of words correctly classified
E: Database of human-labeled images of handwritten words

T: Driving on four-lane highways using vision sensors

P: Average distance traveled before a human-judged error
E: A sequence of images and steering commands recorded while
observing a human driver.

T: Categorize email messages as spam or legitimate.

P: Percentage of email messages correctly classified.
E: Database of emails, some with human-given labels
Suppose that we are done
with EDA and data is
ready for modelling, what
is next?
Cancer diagnosis
This is our data 103x5
Patient ID # of Tumors Avg Area Avg Density Diagnosis
1 5 20 118 M
2 3 15 130 B
3 7 10 52 B
4 2 30 100 M
... ... ... ... ...
100 3 19 100 M
101 4 16 95 M
102 9 22 125 B
103 1 14 80 M
Recall ML stages
Supervised Learning Classification

Training
Set

• Use this training set to learn how to classify patients

where diagnosis is not known:
Patient ID # of Tumors Avg Area Avg Density Diagnosis
101 4 16 95 ?
102 9 22 125 ? Test Set
103 1 14 80 ?

Will be predicted by
Input Data our model
Breast Cancer Diagnosis Linear Separation

Line produced
by our model
to separate
the two
classes

The plot of the training data into 2D, where:

red represents M cases and blue represents B cases
Predict the test data

The gray circles represent the test set

• The model predict the test data as following:

Patient ID # of Tumors Avg Area Avg Density Diagnosis

101 4 16 95 M Predicted by
102 9 22 125 M
the model
103 1 14 80 M

Actual
diagnosis

• How good is our model?

Examples of
ML Algorithms

ML ppt1
No ratings yet
ML ppt1
39 pages
Lecture#12 DM MS (DEIM) Spring 2025
No ratings yet
Lecture#12 DM MS (DEIM) Spring 2025
21 pages
WEEK 01 Merged
No ratings yet
WEEK 01 Merged
606 pages
AAI Lecture 9 SP 25
No ratings yet
AAI Lecture 9 SP 25
26 pages
1.0 Introduction
No ratings yet
1.0 Introduction
50 pages
MLintroduction
No ratings yet
MLintroduction
75 pages
Machine Learning Notes
100% (10)
Machine Learning Notes
19 pages
01 Introduction ML
No ratings yet
01 Introduction ML
48 pages
BE02000041 Funda of AI Unit 3 Basics of ML
No ratings yet
BE02000041 Funda of AI Unit 3 Basics of ML
86 pages
Aiml Co - 3,4 Notes
No ratings yet
Aiml Co - 3,4 Notes
98 pages
L10 Intro To Machine Learning 22112023 111237am
No ratings yet
L10 Intro To Machine Learning 22112023 111237am
38 pages
01 Introduction
No ratings yet
01 Introduction
51 pages
Chapter 5 Machine Learning
No ratings yet
Chapter 5 Machine Learning
96 pages
01ML Introduction
No ratings yet
01ML Introduction
80 pages
Machine Learning in Biostatistics - Master of Science in Biostatistics by Slidesgo
No ratings yet
Machine Learning in Biostatistics - Master of Science in Biostatistics by Slidesgo
15 pages
Machine Learning Week2
No ratings yet
Machine Learning Week2
51 pages
Unit No. 1
No ratings yet
Unit No. 1
73 pages
ML - 1 - Sovan - Introduction To ML
No ratings yet
ML - 1 - Sovan - Introduction To ML
83 pages
Pico Bricks Ebook 15
100% (1)
Pico Bricks Ebook 15
234 pages
Machine Learning - Introduction
No ratings yet
Machine Learning - Introduction
73 pages
A.I. Lecture 4 NEW
No ratings yet
A.I. Lecture 4 NEW
31 pages
Lec 7 - 8 - Machine Learning Introduction
No ratings yet
Lec 7 - 8 - Machine Learning Introduction
55 pages
Ch3-Machine Learning
No ratings yet
Ch3-Machine Learning
124 pages
Machine Learning - 1
No ratings yet
Machine Learning - 1
52 pages
Intro To ML
No ratings yet
Intro To ML
107 pages
L1 - SLM Notes (Bacground, ML)
No ratings yet
L1 - SLM Notes (Bacground, ML)
29 pages
Tesla, .. ? / Cold Fusion, Tesla, Zeropoint Energy Utilization.. Pseudoscience?// ( ) ! / Analysis of New Energy Paradigm: Including Controversial & Questionable Claims
100% (1)
Tesla, .. ? / Cold Fusion, Tesla, Zeropoint Energy Utilization.. Pseudoscience?// ( ) ! / Analysis of New Energy Paradigm: Including Controversial & Questionable Claims
498 pages
Sonu Dkash Updated PDF
No ratings yet
Sonu Dkash Updated PDF
21 pages
Lec1 - Introduction
No ratings yet
Lec1 - Introduction
55 pages
Lesson 2 - Fundamentals of Machine Learning and Deep Learning
No ratings yet
Lesson 2 - Fundamentals of Machine Learning and Deep Learning
100 pages
Machine Learning Unit1
No ratings yet
Machine Learning Unit1
151 pages
Introduction To ML Unit-1
No ratings yet
Introduction To ML Unit-1
90 pages
Chapter 7 - Artificial Intelligence Application
No ratings yet
Chapter 7 - Artificial Intelligence Application
29 pages
Unit 1
No ratings yet
Unit 1
62 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
15 pages
Chapter 01 Introduction To Machine Learning
No ratings yet
Chapter 01 Introduction To Machine Learning
59 pages
Advanced Mathematical Thinking
100% (2)
Advanced Mathematical Thinking
76 pages
1 Overview
No ratings yet
1 Overview
22 pages
ML 1
No ratings yet
ML 1
35 pages
Ch7 Introduction To Machine Learning
No ratings yet
Ch7 Introduction To Machine Learning
29 pages
Iconlibrary Production Oct2016
No ratings yet
Iconlibrary Production Oct2016
137 pages
Chapter 2
No ratings yet
Chapter 2
35 pages
Bloom's Revised Taxonomy of Educational Objectives
No ratings yet
Bloom's Revised Taxonomy of Educational Objectives
36 pages
DIR Notes 1
No ratings yet
DIR Notes 1
39 pages
2021 Machine Learning Intro
No ratings yet
2021 Machine Learning Intro
43 pages
Fundamentals of ML 1
No ratings yet
Fundamentals of ML 1
38 pages
Unit-1 ML
No ratings yet
Unit-1 ML
19 pages
10 1109@idea49133 2020 9170733
No ratings yet
10 1109@idea49133 2020 9170733
6 pages
The Secret Of Machine Learning
From Everand
The Secret Of Machine Learning
Mhd Arjunanta
No ratings yet
ML - Module 1
No ratings yet
ML - Module 1
30 pages
Annual Report-2014 PDF
No ratings yet
Annual Report-2014 PDF
108 pages
01 Introduction 1
No ratings yet
01 Introduction 1
71 pages
ETOM Processes
No ratings yet
ETOM Processes
45 pages
Eddf Fra
No ratings yet
Eddf Fra
173 pages
1 - Introduction
No ratings yet
1 - Introduction
82 pages
01 Introduction
No ratings yet
01 Introduction
43 pages
Lecture 1
No ratings yet
Lecture 1
47 pages
Lec1 Intoduction
No ratings yet
Lec1 Intoduction
34 pages
Module2 ch2
No ratings yet
Module2 ch2
36 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
24 pages
Chapter 1 Introduction To Machine Learning
No ratings yet
Chapter 1 Introduction To Machine Learning
29 pages
ML Doc1
No ratings yet
ML Doc1
14 pages
Lec 1,2
No ratings yet
Lec 1,2
69 pages
2-Capacity, Underfitting, overfitting-15-Jul-2020Material - I - 15-Jul-2020 - ML - Fundamentals
No ratings yet
2-Capacity, Underfitting, overfitting-15-Jul-2020Material - I - 15-Jul-2020 - ML - Fundamentals
35 pages
Portfolio Optimization
No ratings yet
Portfolio Optimization
53 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
10 pages
Intro Machine Learning
No ratings yet
Intro Machine Learning
4 pages
An Enlightenment To Machine Learning
100% (1)
An Enlightenment To Machine Learning
16 pages
Full The Lab Manual To Accompany The 8088 and 8086 Microprocessors Programming Interfacing Software Hardware and Applications 4th Edition Walter A. Triebel Ebook All Chapters
No ratings yet
Full The Lab Manual To Accompany The 8088 and 8086 Microprocessors Programming Interfacing Software Hardware and Applications 4th Edition Walter A. Triebel Ebook All Chapters
71 pages
Rheology and Transport Phenomena (FET)
No ratings yet
Rheology and Transport Phenomena (FET)
9 pages
Chapter 5 Gastrointestinal Agents Reviewer PDF
No ratings yet
Chapter 5 Gastrointestinal Agents Reviewer PDF
6 pages
The Band of Stability: Objectives
No ratings yet
The Band of Stability: Objectives
2 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
4 pages
Positive Results For Detection of Colchicine
No ratings yet
Positive Results For Detection of Colchicine
12 pages
Unit 12 Lexis: Commentary
No ratings yet
Unit 12 Lexis: Commentary
5 pages
Presentation 1 Adjectives-1
No ratings yet
Presentation 1 Adjectives-1
13 pages
On Dumpster Diving
No ratings yet
On Dumpster Diving
10 pages
Rubric For Oral Presentation
100% (1)
Rubric For Oral Presentation
1 page
Norton Introduction To Literature Shorter 12e The All Chapter Instant Download
100% (1)
Norton Introduction To Literature Shorter 12e The All Chapter Instant Download
24 pages
11 Watt Light
No ratings yet
11 Watt Light
14 pages
Burgess-What Is Literature
No ratings yet
Burgess-What Is Literature
4 pages
Installation Guide For Kaye Software On Win 10 and Win 8
No ratings yet
Installation Guide For Kaye Software On Win 10 and Win 8
16 pages
Purana - Padma Purana - Patalak - Estudies
No ratings yet
Purana - Padma Purana - Patalak - Estudies
5 pages
DE Experiment 7
No ratings yet
DE Experiment 7
9 pages
IIE Bachelor of Commerce in Law Factsheet 2020 (New) V1 PDF
No ratings yet
IIE Bachelor of Commerce in Law Factsheet 2020 (New) V1 PDF
2 pages
KSKD HOAS BOQ External Elevations
No ratings yet
KSKD HOAS BOQ External Elevations
7 pages
HRM Reflection 1
No ratings yet
HRM Reflection 1
2 pages
Drilling Engineering 30 Days Program
No ratings yet
Drilling Engineering 30 Days Program
2 pages
(Ebook) Idiot's Guides - Paper Airplanes by Nick Robinson PDF Download
100% (3)
(Ebook) Idiot's Guides - Paper Airplanes by Nick Robinson PDF Download
83 pages

ML Intro

Uploaded by

ML Intro

Uploaded by

Machine Learning

Data Science Process

• There are two ways that a system can improve:

• Unsupervised/clustering learning (without a

• Semi-supervised: in between supervised and

Q1) can you

• Which model performs the best?

• Cross validation methods

The more (relevant) data the

T: Driving on four-lane highways using vision sensors

T: Categorize email messages as spam or legitimate.

• Use this training set to learn how to classify patients

The plot of the training data into 2D, where:

The gray circles represent the test set

Patient ID # of Tumors Avg Area Avg Density Diagnosis

• How good is our model?

You might also like