0% found this document useful (0 votes)

43 views40 pages

Introduccion A ML

This document introduces machine learning concepts. It defines machine learning as a field that gives computers the ability to improve automatically through experience and use of data. Machine learning uses data to discover patterns and make predictions without being explicitly programmed. There are three main types of machine learning: supervised learning, unsupervised learning, and reinforcement learning, which differ based on the type of feedback available to the learning system. The document discusses various applications of machine learning such as classification, prediction, and pattern recognition.

Uploaded by

Roberto Valdez Jasso

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

43 views40 pages

Introduccion A ML

Uploaded by

Roberto Valdez Jasso

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 40

Introducción al

aprendizaje automático
TC3002B
Basic Concepts
Introduction

2023FJ - [email protected] 2
Ability to
use percepts from the outside world
not only for reacting,
What is but for improving actions in future events.

learning?
Implies that we know when and how to use this new knowledge.
When: pattern detected
How: algorithm created.

2023FJ - [email protected] 3
Example:
Imagine a supermarket chain with a hundred of stores selling
groceries to millions of customers.
Each sale has a lot of data that can be analised and converted
into information.
What is These information can be used to give people suggestions
when buying.
machine
learning? If we knew who would buy an item, we would just write
code for the computer to remind them.

Because we do not know, we collect data and hope to

extract enough information to recommend articles to
people.

2023FJ - [email protected] 4
Example:
In RoboCup agents play soccer.
There are 11 players against 11 players.
Each team has its own strategy for playing soccer.

What is If we knew which strategy a team is using, we would play a

counter-attack strategy to stop them.
machine
learning? Because we do not know their strategy, we collect data and try to
extract enough information to detect their strategies.

Once strategies are detected and classified, we could select the

best strategy to exploit this knowledge.

2023FJ - [email protected] 5
The computer algorithm should be able to:
Identify patterns in the data (When)

Construct a good and useful approximation of the solution to the

What is ML? … problem (How)

2023FJ - [email protected] 6
“Machine learning uses data and answers to discover rules behind
a problem” Chollet (2017)

“Machine learning is programming computers to optimize a

performance criterion using example data or past experience.”
Alpaydin, E. (2004)

What is ML? … Has a model defined for some parameters.

Learning is the execution of a computer program to optimize the
parameters of the model using training data or past experience.

Two types of models:

Predictive model: predictions in the future.
Descriptive model: gain knowledge from data.

2023FJ - [email protected] 7
“A computer program is said to learn from experience E with
respect to some class of tasks T and performance measure P, if its
performance at tasks in T, as measured by P, improves with
experience E.” Mitchell, T. (1997)

Example: handwriting recognition

What is ML? … Task T: recognizing and classifying handwritten words within
images.
Performance measure P: percent of words correctly classified
Training experience E: a database of handwritten words with given
classifications

2023FJ - [email protected] 8
Learning Agent

2023FJ - [email protected] 9
1. Which components of the performance element should be
learned?
Design of a
learning 2. What feedback is available to learn these components?
element
3. What representation is used for the components?

2023FJ - [email protected] 10
What can be learned?
Direct mapping from conditions on the current state to actions.

Means to infer relevant properties of the world from the percept

sequence.
Components
of the Information about the way world evolves and the results of possible
actions agent can take.
performance
Utility information indicating the desirability of world states.
element
Action-value information indicating desirability of actions.

Goals that describe classes of states whose achievement maximizes

the agent’s utility.

2023FJ - [email protected] 11
Components can be learned from appropriate feedback.
Example: training Tae Kwon Do, Driving a Taxi.

Type of feedback:
The most important factor in determining the nature of the learning
Feedback problem.

Three cases:
1. Supervised learning
2. Unsupervised learning
3. Reinforcement learning

2023FJ - [email protected] 12
Learning a function from examples of its inputs and outputs.
There is an input X, an output Y, and the task is to learn mapping
from input to output.

Outputs values can be provided

By a supervisor – someone feed the output.
Supervised By the environment – detected by sensors.

Learning
Examples:
Learn a condition-action rule for punching.
Learn to differentiate between a dog and a cat.
Regression
Classification.

2023FJ - [email protected] 13
Learning patterns in the input when no specific output values are
supplied.

Aim: to find regularities in the input.

Unsupervised
learning Example:
Learn to separate colors.
Learn when it might rain.
Learn how to detect people that will not pay their credit cards.

2023FJ - [email protected] 14
Mix between supervised and unsupervised learning

Some data is labelled – usually a very small part

Semi-
supervised Labelled data is used to create more data
learning
Learner learns to:
Generate labelled data and to
Detect regularities in the input

2023FJ - [email protected] 15
The output of the system is a sequence of actions.

Uses rewards to guide the sequence of actions

These actions are part of a policy.

A single action is not important.
Reinforcement The policy is what must be learned.

learning Agent must learn from reinforcement which actions are best, i.e. the
policy.

Examples:
Playing chess.
Driving politely.
Robot navigation.

2023FJ - [email protected] 16
Polynomials

Propositional logic

Representation Predicate calculus

of the learned
information Bayesian networks

Neural networks

Etc.

2023FJ - [email protected] 17
Learning associations
Learn how people associate
elements (ex. buying Knowledge extraction
groceries) Learning a rule from data – it
explains the data
Rules are a form of data
Classification compression
Learn to classify elements in
Applications of different categories
Outlier detection
machine Prediction
Data that does not belong to
a class
learning Learn to predict if some
action will happen
Regression problems
Learn the curve that best fits
Pattern recognition a function to a set of points
Learn to find familiar
patterns (characters, faces,
objects, etc.)

2023FJ - [email protected] 18
Artificial Intelligence

Bayesian methods

Computational complexity theory

Control theory
ML is
multidisciplinary Information theory

Philosophy

Psychology and neurobiology

Statistics

2023FJ - [email protected] 19
Designing a learning
system
Introduction

2023FJ - [email protected] 20
Hyper-
Data Model parameter
Collection Fitting tuning

ML Process
Data Model
Preparation Evaluation

2023FJ - [email protected] 21
1. Choosing the training experience
1. Feedback
2. Control of sequence of examples
3. Distribution of examples

Designing a 2.
1.
Choosing the target function
Function that is operational
learning
system 3. Choosing a representation for the target function
1. Expressive representation

4. Choosing a function approximation algorithm

1. Estimating training values
2. Adjusting the weights

2023FJ - [email protected] 22
How to build a
dataset

2023FJ - [email protected] 23
Structured Data
ML models learn from examples
Each example is called an instance or
pattern
Dataset is formed with multiple
examples
Structured Data is organized in rows
and columns
A column is called a feature

Images, videos and text are called

Unstructured Data
https://machinelearningmastery.com/wp-content/uploads/2013/12/Table-of-Data-Showing-an-Instance-Feature-and-Train-Test-Datasets.png

2023FJ - [email protected] 24
Dataset
organization
and division
Training set is usually 80% of
original set

Test set is usually 20%

Validation set is usually 20% of

training set

https://miro.medium.com/max/585/0*lbveKaL-MGRgppD8.png

2023FJ - [email protected] 25
When data is too big and we
can’t pass all data to computer
at once.

One Epoch is when an entire

dataset is passed forward and
backward through the learning
model only once.

Epoch, batch Batch size: divide dataset into

number of batches or sets or
& iteration parts.

Iterations is the number of

batches needed to complete
one epoch.

The number of batches is equal

to number of iterations for one
epoch.

https://towardsdatascience.com/epoch-vs-iterations-vs-batch-size-
4dfb9c7ce9c9
2023FJ - [email protected] 26
Data Wrangling
Data might be in different
files
Cleaning, structuring,
enriching raw data Data Preparation
Assure quality and useful Analysis and optimization of
data features
Select/remove features
Consider prediction needs
Data Cleansing and computation time
Data Missing values (delete?)
Unwanted characters
Preprocessing Unwanted elements

https://miro.medium.com/max/666/0*ScsuON73dMJDC9XO.png

2023FJ - [email protected] 27
Complete data
science
pipeline

https://developer.ibm.com/articles/ba-intro-data-science-1/

2023FJ - [email protected] 28
Model
Evaluation

2023FJ - [email protected] 29
N x N matrix Actual values
N = number of classes

Predicted values
Confusion Evaluates performance of a
Matrix classification model

Compares actual target https://www.python-course.eu/images/confusion.matrix_image.png

values with predicted values

2023FJ - [email protected] 30
True Positive (TP)
Predicted value matches
actual
Both were positive
True Negative (TN)
Predicted value matches
actual
Both are negative
Binary
confusion False Positive (FP)
Type I error
matrix Predicted value falsely
predicted
Actual value Negative
False Negative (FN)
Type II error
https://cdn.analyticsvidhya.com/wp-content/uploads/2020/04/Basic-Confusion-matrix.png Predicted value falsely
predicted
Actual value Positive

2023FJ - [email protected] 31
Accuracy
Fraction of predictions model correctly classified

!"##$%& '#$()%&)"*+
𝐴𝑐𝑐𝑢𝑟𝑎𝑐𝑦 = ,"&-. */01$# '#$()%&)"*+
Confusion
matrix metrics For binary classification
!"#!$
𝐴𝑐𝑐𝑢𝑟𝑎𝑐𝑦 = !"#!$#%"#%$

Very simple, but does not take into consideration class imbalances
and data unevenly distributed

2023FJ - [email protected] 32
Precision
Proportion predicted positives identified correctly
!"
𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛 =
!"#$"

Recall (Sensitivity)
Proportion actual positives identified correctly
!"
𝑅𝑒𝑐𝑎𝑙𝑙 = !"#$%

Confusion Specificity
Proportion actual negatives identified correctly
matrix 𝑆𝑝𝑒𝑐𝑖𝑓𝑖𝑐𝑖𝑡𝑦 = !%#$"
!%

metrics…
Precision used when FP is a higher concern than FN
From the predicted positives, how many are really positive?

Recall used when there is a high cost associated with FN

How many positive were correctly classified?
A higher recall ensures more actual positive values are being identified

2023FJ - [email protected] 33
F1 Score
Helps understand balance between Precision and Recall

2 '#$%)+)"* × #$%-.. ,6
𝐹1 = =2𝑥 =
Confusion !
3
!
"#$%&& '"#$()(*+
'#$%)+)"*5#$%-.. !
,6 5 ,(86589)

matrix Values range from 0 to 1

metrics… A value close to 1 means it is a better model

Used when
there is a need to balance this two metrics
Not easy to decide if Type I or Type II errors is preferred

2023FJ - [email protected] 34
When a classifier is not reporting the values we desire, we can
move the threshold for classification

Moving threshold can increase/decrease recall, precision and

ROC & AUC specificity values

ROC and AUC can help us determine the best threshold

Receiver Operator Characteristic (ROC)
Area Under the Curve (AUC)

2023FJ - [email protected] 35
ROC
Summarizes all confusion matrices produced with
different thresholds

Diagonal line is where TP rate is equal to FP rate

Points above the diagonal represent a good classifier

The best classifier would be (1,0)

The value of the threshold is the value used in the

classifier that produced the ROC graph

2023FJ - [email protected] 36
AUC

Helps to compare different ROC

graphs

The greater the value of the AUC, the

better the model is for classifing that
data

2023FJ - [email protected] 37
Issues in
machine
learning

2023FJ - [email protected] 38
What algorithms exist for learning general target functions from
specific training examples?

How much training data is sufficient?

Issues in When and how can prior knowledge guide the process of generalizing
from examples?
machine
learning What is the best strategy for choosing a useful training experience?

What is the best way to reduce the learning task to one or more
function approximation problems?

Can the learner learn to represent the target function?

2023FJ - [email protected] 39
Alpaydin, Ethem (2004). Introduction to Machine Learning. The MIT Press.

Mitchell, Tom (1997). Machine Learning. WCB McGraw-Hill.

Edwards, Gavin (2018). Machine Learning, an introduction. Towards Data

References Science (https://towardsdatascience.com/machine-learning-an-
introduction-23b84d51e6d0)

Josh Starmer (2019). ROC and AUC clearly explained! StatQuest with Josh
Starmer, YouTube Channel

2023FJ - [email protected] 40

BW4HANA Golden Deck 062018 Black JU
75% (4)
BW4HANA Golden Deck 062018 Black JU
74 pages
Ecommerce App Proposal
83% (6)
Ecommerce App Proposal
7 pages
Manitou Operators Manual - M50-M70 - EN
100% (1)
Manitou Operators Manual - M50-M70 - EN
178 pages
ML Cahp 1
No ratings yet
ML Cahp 1
35 pages
Chapter 5 AI
No ratings yet
Chapter 5 AI
40 pages
Lecture - 1 Introduction To ML
No ratings yet
Lecture - 1 Introduction To ML
38 pages
Unit I
No ratings yet
Unit I
132 pages
Complete ML
No ratings yet
Complete ML
325 pages
Machine Learning
No ratings yet
Machine Learning
42 pages
Unit-1 MLT
No ratings yet
Unit-1 MLT
51 pages
Lecture 1.2 Introduction To Machine Learning
No ratings yet
Lecture 1.2 Introduction To Machine Learning
31 pages
Machine Learning Overview
No ratings yet
Machine Learning Overview
7 pages
ML - 1 - Sovan - Introduction To ML
No ratings yet
ML - 1 - Sovan - Introduction To ML
83 pages
Csit (r22) 3-2 Machine Learning Digital Notes
No ratings yet
Csit (r22) 3-2 Machine Learning Digital Notes
120 pages
1 - Machine Learning Overview
No ratings yet
1 - Machine Learning Overview
56 pages
ENG6500 1 IntroductionToMLDL Part1
No ratings yet
ENG6500 1 IntroductionToMLDL Part1
63 pages
ML - Lecture - 1 Introduction To ML
No ratings yet
ML - Lecture - 1 Introduction To ML
29 pages
Introduction To ML
No ratings yet
Introduction To ML
4 pages
1 Intro
No ratings yet
1 Intro
18 pages
LM #01-Introduction To ML
No ratings yet
LM #01-Introduction To ML
33 pages
1 - Introduction
No ratings yet
1 - Introduction
82 pages
WEEK 01 Merged
No ratings yet
WEEK 01 Merged
606 pages
Lecture 2 - What Is ML
No ratings yet
Lecture 2 - What Is ML
17 pages
Lecture 01 Introducing ML 13102022 031101pm
No ratings yet
Lecture 01 Introducing ML 13102022 031101pm
36 pages
01 Introduction
No ratings yet
01 Introduction
51 pages
Ai - Foundations of Machine Learning I
No ratings yet
Ai - Foundations of Machine Learning I
40 pages
Class 1 C
No ratings yet
Class 1 C
14 pages
Ai - Foundations of Machine Learning I
No ratings yet
Ai - Foundations of Machine Learning I
39 pages
ML Unit 1 Notes
No ratings yet
ML Unit 1 Notes
134 pages
Module 1
No ratings yet
Module 1
175 pages
01 Introduction ML
No ratings yet
01 Introduction ML
48 pages
Fundamentals of ML 1
No ratings yet
Fundamentals of ML 1
38 pages
Unit 1
No ratings yet
Unit 1
43 pages
01 Introduction 1
No ratings yet
01 Introduction 1
71 pages
Artificial Intelligence: Chapter 5 - Machine Learning
No ratings yet
Artificial Intelligence: Chapter 5 - Machine Learning
30 pages
Firoz Topic 0
No ratings yet
Firoz Topic 0
24 pages
Machine 1
No ratings yet
Machine 1
35 pages
ML Module 1 Final
No ratings yet
ML Module 1 Final
134 pages
Introduction To Machine Learning: WWW - Seas.upenn - Edu/ Cis519
100% (1)
Introduction To Machine Learning: WWW - Seas.upenn - Edu/ Cis519
51 pages
ML Unit 1
No ratings yet
ML Unit 1
15 pages
Unit-1 Introduction To Machine Learning
No ratings yet
Unit-1 Introduction To Machine Learning
24 pages
ML - Week 1
No ratings yet
ML - Week 1
37 pages
Lecture Compiled
No ratings yet
Lecture Compiled
224 pages
01 Introduction
No ratings yet
01 Introduction
50 pages
AI Module 1 Simple Notes
No ratings yet
AI Module 1 Simple Notes
14 pages
Notes Unit 1
No ratings yet
Notes Unit 1
13 pages
CHP 1
No ratings yet
CHP 1
47 pages
Unit I MACHINE LEARNING
No ratings yet
Unit I MACHINE LEARNING
87 pages
Introduction To ML
No ratings yet
Introduction To ML
48 pages
Lecture - 2 Classification (Machine Learning Basic and KNN)
No ratings yet
Lecture - 2 Classification (Machine Learning Basic and KNN)
90 pages
01 Introduction
No ratings yet
01 Introduction
43 pages
BE02000041 Funda of AI Unit 3 Basics of ML
No ratings yet
BE02000041 Funda of AI Unit 3 Basics of ML
86 pages
01 Introduction ML
No ratings yet
01 Introduction ML
60 pages
ML Lecture#1
No ratings yet
ML Lecture#1
52 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
25 pages
Lecture 1 Ai
No ratings yet
Lecture 1 Ai
38 pages
Unit 1 ML
No ratings yet
Unit 1 ML
41 pages
Machine Learning
No ratings yet
Machine Learning
135 pages
Lecture 1
No ratings yet
Lecture 1
65 pages
Lec 7 - 8 - Machine Learning Introduction
No ratings yet
Lec 7 - 8 - Machine Learning Introduction
55 pages
Introduction To ML P2
No ratings yet
Introduction To ML P2
30 pages
Ai Unit5 Learning
No ratings yet
Ai Unit5 Learning
62 pages
Artificial Intelligence Algorithms
From Everand
Artificial Intelligence Algorithms
akosnemeth
No ratings yet
OLT Config
No ratings yet
OLT Config
16 pages
BTB Brochure
No ratings yet
BTB Brochure
7 pages
BS 1881-112 1983 Concrete Methods of Accelerated Curing of Test Cubes
No ratings yet
BS 1881-112 1983 Concrete Methods of Accelerated Curing of Test Cubes
11 pages
Traffic Management
No ratings yet
Traffic Management
6 pages
Profile
No ratings yet
Profile
4 pages
Interview Questions and Answers On SDLC
No ratings yet
Interview Questions and Answers On SDLC
13 pages
Full Stack Developer - Job Description CWSSG
No ratings yet
Full Stack Developer - Job Description CWSSG
2 pages
Calimpusan Raci Matrix
No ratings yet
Calimpusan Raci Matrix
4 pages
Issue 94 Radio Parts Newsletter - October 2013
No ratings yet
Issue 94 Radio Parts Newsletter - October 2013
8 pages
Samsung Np-r410 PCB Diagram
No ratings yet
Samsung Np-r410 PCB Diagram
48 pages
Project Title: Cracking Cooler Fin Fan Project No: 401004-00011
No ratings yet
Project Title: Cracking Cooler Fin Fan Project No: 401004-00011
4 pages
Module 3. Mech Safety.
No ratings yet
Module 3. Mech Safety.
45 pages
Rs 4.18 Rs 7.17 Rs 14.15 Rs 14.34 Rs 38.24: Telenor
No ratings yet
Rs 4.18 Rs 7.17 Rs 14.15 Rs 14.34 Rs 38.24: Telenor
2 pages
HP Aruba Certified Network Security Professional - HPE7-A02 Free Exam Questions (2024) - 6
No ratings yet
HP Aruba Certified Network Security Professional - HPE7-A02 Free Exam Questions (2024) - 6
4 pages
AIRLINX INRICO Brochure - Opt
No ratings yet
AIRLINX INRICO Brochure - Opt
22 pages
ACB Schneider
No ratings yet
ACB Schneider
3 pages
Innovation Models
No ratings yet
Innovation Models
16 pages
Solar Energy Minor
No ratings yet
Solar Energy Minor
3 pages
Lab No.3 Maham
No ratings yet
Lab No.3 Maham
9 pages
17a DARPS & DPS 12 & 100 Troubleshooting
No ratings yet
17a DARPS & DPS 12 & 100 Troubleshooting
18 pages
Indian Digital Marketing Spends by Industry
No ratings yet
Indian Digital Marketing Spends by Industry
19 pages
Tso Short Reference Notes: Default Function and PF Key Settings
No ratings yet
Tso Short Reference Notes: Default Function and PF Key Settings
15 pages
Rationale PDF
No ratings yet
Rationale PDF
3 pages
D Mart Grocery App Tech
No ratings yet
D Mart Grocery App Tech
6 pages
ALE Capabilities Brochure FV
No ratings yet
ALE Capabilities Brochure FV
64 pages
Pointers and Arrays in C Jensen 1.5
No ratings yet
Pointers and Arrays in C Jensen 1.5
64 pages
Resume Format
No ratings yet
Resume Format
2 pages

Introduccion A ML

Uploaded by

Introduccion A ML

Uploaded by

Introducción al

 Because we do not know, we collect data and hope to

What is  If we knew which strategy a team is using, we would play a

 Once strategies are detected and classified, we could select the

 Construct a good and useful approximation of the solution to the

 “Machine learning is programming computers to optimize a

What is ML? …  Has a model defined for some parameters.

 Two types of models:

 Example: handwriting recognition

 Means to infer relevant properties of the world from the percept

 Goals that describe classes of states whose achievement maximizes

 Outputs values can be provided

 Aim: to find regularities in the input.

 Some data is labelled – usually a very small part

 Uses rewards to guide the sequence of actions

 These actions are part of a policy.

Representation  Predicate calculus

 Computational complexity theory

 Psychology and neurobiology

4. Choosing a function approximation algorithm

Images, videos and text are called

Test set is usually 20%

Validation set is usually 20% of

 One Epoch is when an entire

Epoch, batch  Batch size: divide dataset into

 Iterations is the number of

 The number of batches is equal

 Compares actual target https://www.python-course.eu/images/confusion.matrix_image.png

 Recall used when there is a high cost associated with FN

matrix  Values range from 0 to 1

 Moving threshold can increase/decrease recall, precision and

 ROC and AUC can help us determine the best threshold

 Diagonal line is where TP rate is equal to FP rate

 Points above the diagonal represent a good classifier

 The best classifier would be (1,0)

 The value of the threshold is the value used in the

 Helps to compare different ROC

 The greater the value of the AUC, the

 How much training data is sufficient?

 Can the learner learn to represent the target function?

 Mitchell, Tom (1997). Machine Learning. WCB McGraw-Hill.

 Edwards, Gavin (2018). Machine Learning, an introduction. Towards Data

You might also like

Because we do not know, we collect data and hope to

What is If we knew which strategy a team is using, we would play a

Once strategies are detected and classified, we could select the

Construct a good and useful approximation of the solution to the

“Machine learning is programming computers to optimize a

What is ML? … Has a model defined for some parameters.

Two types of models:

Example: handwriting recognition

Means to infer relevant properties of the world from the percept

Goals that describe classes of states whose achievement maximizes

Outputs values can be provided

Aim: to find regularities in the input.

Some data is labelled – usually a very small part

Uses rewards to guide the sequence of actions

These actions are part of a policy.

Representation Predicate calculus

Computational complexity theory

Psychology and neurobiology

One Epoch is when an entire

Epoch, batch Batch size: divide dataset into

Iterations is the number of

The number of batches is equal

Compares actual target https://www.python-course.eu/images/confusion.matrix_image.png

Recall used when there is a high cost associated with FN

matrix Values range from 0 to 1

Moving threshold can increase/decrease recall, precision and

ROC and AUC can help us determine the best threshold

Diagonal line is where TP rate is equal to FP rate

Points above the diagonal represent a good classifier

The best classifier would be (1,0)

The value of the threshold is the value used in the

Helps to compare different ROC

The greater the value of the AUC, the

How much training data is sufficient?

Can the learner learn to represent the target function?

Mitchell, Tom (1997). Machine Learning. WCB McGraw-Hill.

Edwards, Gavin (2018). Machine Learning, an introduction. Towards Data