
Lecture 01

What is Machine Learning?


An Overview.

STAT 451: Intro to Machine Learning, Fall 2020


Sebastian Raschka
http://stat.wisc.edu/~sraschka/teaching/stat451-fs2020/

Sebastian Raschka STAT 451: Intro to ML Lecture 1: Introduction 1


Lecture 1 Overview

1. About this course

2. What is machine learning

3. Categories of machine learning

4. Notation

5. Approaching a machine learning application

6. Different machine learning approaches and motivations



Course Topics

Part 1: Introduction

Part 2: Computational foundations

Part 3: Tree-based methods

Part 4: Model evaluation

Part 5: Dimensionality reduction and unsupervised learning

Part 6: Bayesian learning

Part 7: Class project presentations



About this Course

For details -> http://stat.wisc.edu/~sraschka/teaching/stat451-fs2020/



Lecture 1 Overview

1. About this course

2. What is machine learning

3. Categories of machine learning

4. Notation

5. Approaching a machine learning application

6. Different machine learning approaches and motivations



What is Machine Learning?



"Machine learning is the hot new thing."
-- John L. Hennessy, President of Stanford (2000-2016)

Image Source: https://www.innovateli.com/hennessy-grad-keeps-gifting/



"A breakthrough in machine learning would be
worth ten Microsofts"
-- Bill Gates, Microsoft Co-founder

Image source: https://www.gatesnotes.com/Books



[...] machine learning is a subcategory within the field of computer
science, which allows you to implement artificial intelligence. So it’s
kind of a mechanism to get you to artificial intelligence.

-- Rana el Kaliouby, CEO at Affectiva

Image Source: https://fortune.com/2019/03/08/rana-el-kaliouby-ceo-affectiva/



(Excerpt from the accompanying lecture notes, "What is Machine Learning? An Overview.", Department of Statistics, University of Wisconsin–Madison, Fall 2018: http://stat.wisc.edu/~sraschka/teaching/stat479-fs2018/)

1.1 Machine Learning – The Big Picture

We develop (computer) programs to automate various kinds of processes. Originally developed as a subfield of Artificial Intelligence (AI), one of the goals behind machine learning was to replace the need for developing computer programs "manually." If programs are a means to automate processes, we can think of machine learning as "automating automation." In other words, machine learning lets computers "create" programs (often for making predictions) themselves. Machine learning is turning data into programs.

It is said that the term machine learning was first coined by Arthur Lee Samuel in 1959 [1]. One quote that almost every introductory machine learning resource includes is often accredited to Samuel, a pioneer of the field of AI:

"Machine learning is the field of study that gives computers the ability to learn without being explicitly programmed"
— Arthur L. Samuel, AI pioneer, 1959

Image Source: https://history-computer.com/ModernComputer/thinkers/images/Arthur-Samuel1.jpg

(This is likely not an original quote but a paraphrased version of Samuel's sentence "Programming computers to learn from experience should eventually eliminate the need for much of this detailed programming effort.")

"The field of machine learning is concerned with the question of how to construct computer programs that automatically improve with experience"
— Tom Mitchell, former chair of the Machine Learning department of Carnegie Mellon University

[1] Arthur L. Samuel. "Some studies in machine learning using the game of checkers". In: IBM Journal of Research and Development 3.3 (1959), pp. 210–229.
The Traditional Programming Paradigm

Inputs (observations) + Program (written by the Programmer) → Computer → Outputs



Inputs (observations) + Program (written by the Programmer) → Computer → Outputs

"Machine learning is the field of study that gives computers the ability to learn without being explicitly programmed"
— Arthur Samuel (1959)

The machine learning paradigm:
Inputs + Outputs → Computer → Program
"We will not only use the machines for their intelligence, we will also collaborate with them in ways that we cannot even imagine."
-- Fei-Fei Li, Director of Stanford's artificial intelligence lab

Image Source: https://en.wikipedia.org/wiki/Fei-Fei_Li#/media/File:Fei-Fei_Li_at_AI_for_Good_2017.jpg



A bit more concrete, Tom Mitchell's quote from his Machine Learning book [2]:

"A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E."
— Tom Mitchell, Professor at Carnegie Mellon University

Handwriting Recognition Example:

As an example, consider a handwriting recognition learning problem (from Mitchell's book):

• Task T: recognizing and classifying handwritten words within images
• Performance measure P: percent of words correctly classified
• Training experience E: a database of handwritten words with given classifications

1.2 Applications of Machine Learning

Email spam detection

[2] Tom M. Mitchell et al. "Machine learning. 1997". In: Burr Ridge, IL: McGraw Hill 45.37 (1997), pp. 870–877.


Some Applications of Machine Learning:






Lecture 1 Overview

1. About this course

2. What is machine learning

3. Categories of machine learning

4. Notation

5. Approaching a machine learning application

6. Different machine learning approaches and motivations



Categories of Machine Learning

Labeled data
Supervised Learning Direct feedback
Predict outcome/future

No labels/targets
Unsupervised Learning No feedback
Find hidden structure in data



Supervised Learning: Classification

(Scatter plot over features x1 and x2, showing two classes separated by a decision boundary.)
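Sketched in code, this kind of classification task looks roughly as follows (a hedged sketch assuming scikit-learn, which this course introduces later; the synthetic two-feature data and the logistic-regression model are illustrative choices, not from the slide):

```python
# Hedged sketch: learn a decision boundary from labeled 2-feature data.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

# n = 100 labeled examples, m = 2 features (x1, x2), two classes
X, y = make_classification(n_samples=100, n_features=2, n_informative=2,
                           n_redundant=0, random_state=0)

clf = LogisticRegression().fit(X, y)  # supervised: uses the labels y
print(clf.predict(X[:5]))             # predicted class labels for 5 examples
print(clf.score(X, y))                # fraction of correctly classified examples
```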
Supervised Learning: Regression

(Regression fit over feature x.)
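A minimal regression sketch to go with the slide (NumPy only; the noisy synthetic data and the least-squares fit are illustrative assumptions):

```python
# Hedged sketch: fit a line y = w*x + b to noisy data by least squares.
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(0, 10, size=50)
y = 2.0 * x + 1.0 + rng.normal(scale=0.5, size=50)  # true line plus noise

# np.polyfit with deg=1 returns the least-squares [slope, intercept]
w, b = np.polyfit(x, y, deg=1)
print(w, b)  # estimates should land near the true values 2.0 and 1.0
```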
Categories of Machine Learning

Labeled data
Supervised Learning Direct feedback
Predict outcome/future

No labels/targets
Unsupervised Learning No feedback
Find hidden structure in data

Decision process
Reinforcement Learning Reward system
Unsupervised Learning -- Clustering

(Scatter plot over features x1 and x2, with no class labels given.)
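Clustering finds this kind of hidden structure without ever seeing labels; a sketch assuming scikit-learn, with two synthetic blobs as made-up data:

```python
# Hedged sketch: k-means groups unlabeled points into clusters.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(loc=0.0, scale=0.3, size=(30, 2)),   # blob near (0, 0)
               rng.normal(loc=3.0, scale=0.3, size=(30, 2))])  # blob near (3, 3)

km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)  # no y anywhere
print(km.labels_[:5])        # cluster assignments found from structure alone
print(km.cluster_centers_)   # centers should sit near (0, 0) and (3, 3)
```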
Unsupervised Learning
-- Dimensionality Reduction



Categories of Machine Learning

Labeled data
Supervised Learning Direct feedback
Predict outcome/future

No labels/targets
Unsupervised Learning No feedback
Find hidden structure in data

Decision process
Reinforcement Learning Reward system
Learn series of actions



Reinforcement Learning

Agent → (Action) → Environment
Environment → (State, Reward) → Agent
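The agent/environment loop in the diagram can be sketched with a toy one-dimensional environment (everything here, including the random stand-in policy, is a made-up illustration; a real agent would learn from the rewards):

```python
# Hedged sketch of the agent/environment loop: state -> action -> reward.
import random

GOAL = 3  # the state that yields a reward

def step(state, action):
    """Toy environment dynamics: move one step left or right."""
    next_state = state + (1 if action == "right" else -1)
    reward = 1.0 if next_state == GOAL else 0.0
    return next_state, reward, next_state == GOAL

random.seed(0)
state, total_reward = 0, 0.0
for _ in range(100):
    action = random.choice(["left", "right"])  # placeholder random policy
    state, reward, done = step(state, action)  # environment responds
    total_reward += reward
    if done:
        break
print(state, total_reward)
```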



https://www.theverge.com/tldr/2017/7/10/15946542/deepmind-parkour-agent-reinforcement-learning



https://video.twimg.com/ext_tw_video/1111683489890332672/pu/vid/1200x674/WqUJEhUETw0M0gCl.mp4?tag=8



Lecture 1 Overview

1. About this course

2. What is machine learning

3. Categories of machine learning

4. Notation

5. Approaching a machine learning application

6. Different machine learning approaches and motivations



Supervised Learning Workflow
-- Overview
Training Data + Labels → Machine Learning Algorithm → Predictive Model
New Data → Predictive Model → Prediction



Supervised Learning Notation

Training set: 𝒟 = {⟨x[i], y[i]⟩, i = 1, …, n}

Unknown function: f(x) = y


Hypothesis: h(x) = ŷ

Classification: h : ℝᵐ → ___        Regression: h : ℝᵐ → ___



Data Representation

x = [x1, x2, ⋯, xm]ᵀ

Feature vector



Data Representation

x = [x1, x2, ⋯, xm]ᵀ        X = [x1ᵀ; x2ᵀ; ⋯; xnᵀ]

Feature vector        D___n m_________



Data Representation

x = [x1, x2, ⋯, xm]ᵀ

      x1[1]  x2[1]  ⋯  xm[1]
      x1[2]  x2[2]  ⋯  xm[2]
X =    ⋮      ⋮     ⋱   ⋮
      x1[n]  x2[n]  ⋯  xm[n]

Feature vector  _________________  ______________________  ______________________
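This notation maps directly onto array shapes; a sketch assuming NumPy (the bracketed superscript [i] indexes training examples, the subscript j indexes features):

```python
# Hedged sketch: the design matrix X as a NumPy array.
import numpy as np

n, m = 4, 3                          # n training examples, m features
X = np.arange(n * m).reshape(n, m)   # one training example per row
y = np.zeros(n)                      # one target per training example

x_2 = X[1]       # the feature vector x^[2] (row 2; Python indexes from 0)
x_1_2 = X[1, 0]  # the single feature value x_1^[2]

print(X.shape)    # (4, 3), i.e., (n, m)
print(x_2.shape)  # (3,),   i.e., (m,)
```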



Data Representation

m= _____

n= _____



Data Representation

x = [x1, x2, ⋯, xm]ᵀ        y = [y[1], y[2], ⋯, y[n]]ᵀ

Input features ______________ ______________



ML Terminology (Part 1)
▪ Training example: A row in the table representing the dataset. Synonymous with an observation, training record, training instance, training sample (in some contexts, sample refers to a collection of training examples).

▪ Feature: A column in the table representing the dataset. Synonymous with predictor, variable, input, attribute, covariate.

▪ Target: What we want to predict. Synonymous with outcome, output, ground truth, response variable, dependent variable, (class) label (in classification).

▪ Output / prediction: We use this term to distinguish it from targets; here, it means the output from the model.
Hypothesis Space
Entire hypothesis space
⊃ Hypothesis space a particular learning algorithm category has access to
⊃ Hypothesis space a particular learning algorithm can sample
⊃ Particular hypothesis (i.e., a model/classifier)
Classes of Machine Learning Algorithms

• Generalized linear models (e.g.,

• Support vector machines (e.g.,

• Artificial neural networks (e.g.,

• Tree- or rule-based models (e.g.,

• Graphical models (e.g.,

• Ensembles (e.g.,

• Instance-based learners (e.g.,



Lecture 1 Overview

1. About this course

2. What is machine learning

3. Categories of machine learning

4. Notation

5. Approaching a machine learning application

6. Different machine learning approaches and motivations



Supervised Learning Workflow
-- Overview
Training Data + Labels → Machine Learning Algorithm → Predictive Model
New Data → Predictive Model → Prediction



Raw Data → Preprocessing → Training Dataset (+ Labels) → Learning Algorithm → Final Model
                         → Test Dataset (+ Labels) → Evaluation
Final Model + New Data → Prediction

Preprocessing: Feature Extraction and Scaling, Feature Selection, Dimensionality Reduction, Sampling
Learning/Evaluation: Model Selection, Cross-Validation, Performance Metrics, Hyperparameter Optimization
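A sketch of that whole workflow with scikit-learn (the dataset, scaler, and model here are illustrative assumptions, not prescriptions from the slide):

```python
# Hedged sketch: preprocessing + learning + evaluation in one pipeline.
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score, train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_iris(return_X_y=True)

# Sampling: hold out a test set, used only for the final evaluation
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y)

# Preprocessing (feature scaling) chained with the learning algorithm
pipe = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))

# Model selection: cross-validation on the training data only
cv_scores = cross_val_score(pipe, X_train, y_train, cv=5)

pipe.fit(X_train, y_train)                   # final model on all training data
print(round(cv_scores.mean(), 2))            # cross-validated estimate
print(round(pipe.score(X_test, y_test), 2))  # evaluation on held-out data
```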



5 Steps for Approaching a Machine
Learning Application

1. Define the problem to be solved.

2. Collect (labeled) data.

3. Choose an algorithm class.

4. Choose an optimization metric or measure for learning the model.

5. Choose a metric or measure for evaluating the model.



Objective Functions
• Maximize the posterior probabilities (e.g., naive Bayes)

• Maximize a fitness function (genetic programming)

• Maximize the total reward/value function (reinforcement


learning)

• Maximize information gain/minimize child node impurities


(CART decision tree classification)

• Minimize a mean squared error cost (or loss) function (CART,


decision tree regression, linear regression, adaptive linear
neurons, ...)

• Maximize log-likelihood or minimize cross-entropy loss (or cost)


function

• Minimize hinge loss (support vector machine)
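Two of these objectives written out for a handful of made-up predictions (NumPy; the numbers are illustrative only):

```python
# Hedged sketch: computing an MSE cost and a cross-entropy loss by hand.
import numpy as np

# Mean squared error (regression objective)
y = np.array([1.0, 2.0, 3.0])
y_hat = np.array([1.5, 1.5, 3.0])
mse = np.mean((y - y_hat) ** 2)      # (0.25 + 0.25 + 0.0) / 3
print(mse)

# Cross-entropy loss (binary classification; minimizing it maximizes
# the log-likelihood of the labels under the predicted probabilities)
t = np.array([1, 0, 1])              # true labels
p = np.array([0.9, 0.2, 0.8])        # predicted P(class = 1)
xent = -np.mean(t * np.log(p) + (1 - t) * np.log(1 - p))
print(xent)
```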


Optimization Methods for
Different Learning Algorithms

• Combinatorial search, greedy search (e.g., decision trees)

• Unconstrained convex optimization (e.g.,

• Constrained convex optimization (e.g.,

• Nonconvex optimization, here: using backpropagation, chain rule,


reverse autodiff. (e.g.,

• Constrained nonconvex optimization (e.g.,
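Unconstrained convex optimization can be illustrated with plain gradient descent on the MSE objective of a one-parameter linear model (a toy example assumed for illustration; the learning rate would need tuning in practice):

```python
# Hedged sketch: gradient descent on J(w) = mean((w*x - y)^2).
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0])
y = 2.0 * x              # noise-free data, so the optimum is w = 2

w = 0.0                  # initial weight
lr = 0.05                # learning rate (illustrative choice)
for _ in range(200):
    grad = np.mean(2 * (w * x - y) * x)  # dJ/dw
    w -= lr * grad                       # step against the gradient
print(w)  # converges to 2.0
```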



Evaluation -- Misclassification Error

L(ŷ, y) = 1 if ŷ ≠ y, 0 if ŷ = y

ERR_𝒟test = (1/n) ∑ᵢ₌₁ⁿ L(ŷ[i], y[i])
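The 0/1 loss and the test-set error above, sketched in NumPy with made-up labels:

```python
# Hedged sketch: misclassification error = average 0/1 loss.
import numpy as np

def zero_one_loss(y_hat, y):
    """L(y_hat, y): 1 where the prediction is wrong, 0 where it is right."""
    return (y_hat != y).astype(int)

y_true = np.array([0, 1, 1, 0, 1])
y_pred = np.array([0, 1, 0, 0, 0])

err = zero_one_loss(y_pred, y_true).mean()  # (1/n) * sum of per-example losses
print(err)  # 2 of 5 examples misclassified -> 0.4
```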
i=1



ML Terminology (Part 2)

▪ Loss function: Often used synonymously with cost function; sometimes also called error function. In some contexts, the loss refers to the loss for a single data point, whereas the cost function refers to the overall (average or summed) loss over the entire dataset. Sometimes also called empirical risk.



Other Metrics in Future Lectures
• Accuracy (1-Error)
• ROC AUC
• Precision
• Recall
• (Cross) Entropy
• Likelihood
• Squared Error/MSE
• L-norms
• Utility
• Fitness
• ...

But more on other metrics in future lectures.



Lecture 1 Overview

1. About this course

2. What is machine learning

3. Categories of machine learning

4. Notation

5. Approaching a machine learning application

6. Different machine learning approaches and motivations



Pedro Domingos's 5 Tribes of Machine Learning

Source: Domingos, Pedro. The Master Algorithm: How the Quest for the Ultimate Learning Machine Will Remake Our World. Basic Books, 2015.



Breiman, Leo. "Statistical modeling: The two cultures (with comments and a rejoinder by the author)." Statistical Science 16.3 (2001): 199–231.

From the paper: "The statistical community has been committed to the almost exclusive use of data models. This commitment has led to irrelevant theory, questionable conclusions, and has kept statisticians from working on a large range of interesting current problems. Algorithmic modeling, both in theory and practice, has developed rapidly in fields outside statistics. It can be used both on large complex data sets and as a more accurate and informative alternative to data modeling on smaller data sets. If our goal as a field is to use data to solve problems, then we need to move away from exclusive dependence on data models and adopt a more diverse set of tools."

1. INTRODUCTION

Statistics starts with data. Think of the data as being generated by a black box in which a vector of input variables x (independent variables) go in one side, and on the other side the response variables y come out. Inside the black box, nature functions to associate the predictor variables with the response variables, so the picture is like this:

y ← [ nature ] ← x

There are two goals in analyzing the data:

Prediction. To be able to predict what the responses are going to be to future input variables;
Information. To extract some information about how nature is associating the response variables to the input variables.

There are two different approaches toward these goals:

The Data Modeling Culture. The analysis in this culture starts with assuming a stochastic data model for the inside of the black box. For example, a common data model is that data are generated by independent draws from: response variables = f(predictor variables, random noise, parameters). The values of the parameters are estimated from the data, and the model is then used for information and/or prediction. Thus the black box is filled in like this:

y ← [ linear regression, logistic regression, Cox model ] ← x

Model validation: yes–no using goodness-of-fit tests and residual examination. Estimated culture population: 98% of all statisticians.

The Algorithmic Modeling Culture. The analysis in this culture considers the inside of the box complex and unknown. Their approach is to find a function f(x), an algorithm that operates on x to predict the responses y. Their black box looks like this:

y ← [ decision trees, neural nets ] ← x

Model validation: measured by predictive accuracy. Estimated culture population: 2% of statisticians, many in other fields.
Evolved antenna (source: https://en.wikipedia.org/wiki/Evolved_antenna), designed via evolutionary algorithms; used on a 2006 NASA spacecraft.



Black Boxes vs Interpretability



Black Boxes vs Interpretability



Different Motivations for Studying
Machine Learning
• Engineers:

• Mathematicians, computer scientists, and statisticians:

• Neuroscientists:



Machine Learning, AI, and Deep Learning

AI: a non-biological system that is intelligent through rules.

Machine Learning (within AI): algorithms that learn models/representations/rules automatically from data/examples.

Deep Learning (within machine learning): algorithms that parameterize multilayer neural networks that then learn representations of data with multiple layers of abstraction.
Image by Jake VanderPlas; Source: https://speakerdeck.com/jakevdp/the-state-of-the-stack-scipy-2015-keynote?slide=8



Spam

https://en.wikipedia.org/wiki/Spam_(food)

"It has become the subject of a number of appearances in pop culture, notably
a Monty Python sketch which repeated the name many times, leading to its
name being borrowed for unsolicited electronic messages, especially email."



Spam

https://en.wikipedia.org/wiki/Spam_(food)

https://en.wikipedia.org/wiki/Monty_Python



Spam

https://en.wikipedia.org/wiki/Spam_(food)
https://en.wikipedia.org/wiki/Monty_Python

"Python's name is derived from the British comedy group Monty Python, whom Python creator Guido van
Rossum enjoyed while developing the language. "

https://en.wikipedia.org/wiki/Python_(programming_language)



ML Terminology (Part 3)
▪ Hypothesis: A hypothesis is a certain function that we believe (or hope) is
similar to the true function, the target function that we want to model.

▪ Model: In the machine learning field, the terms hypothesis and model are
often used interchangeably. In other sciences, they can have different
meanings.

▪ Learning algorithm: Again, our goal is to find or approximate the target


function, and the learning algorithm is a set of instructions that tries to
model the target function using our training dataset. A learning algorithm
comes with a hypothesis space, the set of possible hypotheses it
explores to model the unknown target function by formulating the final
hypothesis.

▪ Classifier: A classifier is a special case of a hypothesis (nowadays, often learned by a machine learning algorithm). A classifier is a hypothesis or discrete-valued function that is used to assign (categorical) class labels to particular data points.
Course Topics

Part 1: Introduction

Part 2: Computational foundations

Part 3: Tree-based methods

Part 4: Model evaluation

Part 5: Dimensionality reduction and unsupervised learning

Part 6: Bayesian learning

Part 7: Class project presentations



Part 1: Introduction

- Week 01: L01 - Course overview, introduction to machine learning

- Week 02: L02 - Introduction to Supervised Learning and k-Nearest Neighbors Classifiers

Part 2: Computational foundations

- Week 03: L03 - Using Python

- Week 03: L04 - Introduction to Python's scientific computing stack

- Week 04: L05 - Data preprocessing and machine learning with scikit-learn



Reading Assignments

• Raschka and Mirjalili: Python Machine Learning, 3rd ed., Ch 1

• Elements of Statistical Learning, Ch 01 (https://web.stanford.edu/~hastie/ElemStatLearn/)

• Optional: Breiman, Leo. "Statistical modeling: The two cultures (with comments and a rejoinder by the author)". Statistical Science 16.3 (2001): 199–231. https://projecteuclid.org/euclid.ss/1009213726

