Lecture 02 - Warming-Up and Data and Features - Plain
CS771: Intro to ML
Keep in mind: ML is like an exam
It’s the performance on the D-day which matters
In an exam, our success is measured based on how well we did on the questions in
the test (not on the questions we practiced on)
Likewise, in ML, the success of the learned model is measured based on how well it
predicts/fits the future test data (not the training data). Plus, of course, other
issues such as fairness also matter.
A Loose Taxonomy of ML

ML methods can be loosely divided by the kind of data they learn from:
Learning using labeled data (supervised learning)
Learning using unlabeled data (unsupervised learning)
Reinforcement Learning (RL)

“Labeled” means that, during training, for each input, the corresponding output is
available (i.e., the learner is explicitly told that a cat image is of a cat).

Note: RL doesn’t use “labeled” or “unlabeled” data in the traditional sense! In RL,
an agent learns via its interactions with an environment.

Many other specialized flavors of ML also exist, some of which include:
Semi-supervised Learning
Active Learning
Transfer Learning
Multitask Learning
Imitation Learning (somewhat related to RL)
Zero-Shot Learning
Few-Shot Learning
Continual Learning
A Typical Supervised Learning Workflow

Note: This example is for the problem of binary classification, a supervised
learning problem.

Labeled training data (a set of images, each labeled “cat” or “dog”) is passed
through “feature” extraction, and the ML algorithm (which outputs a “model”) learns
from the extracted features. At test time, a test image goes through the same
“feature” extraction, and the learned cat-vs-dog prediction model produces the
predicted label (cat/dog).

Feature extraction converts raw inputs to a numeric representation that the ML
algo can understand and work with. More on feature extraction later.

Question: Is feature extraction done “manually” as a pre-processing step before the
ML algo starts working? Can’t we “automate” this part? Can’t we “learn” good
features directly from raw inputs?

Indeed. Deep Learning algos do precisely that (feature + model learning).
More on Deep Learning later.
Pic credits: https://www.pinclipart.com/, http://www.pngtree.com
A Typical Unsupervised Learning Workflow

Note: This example is for the problem of data clustering, an unsupervised learning
problem. Here the input is unlabeled data (inputs without any output labels).
Geometric Perspective

Recall that feature extraction converts inputs into a numeric representation.

Basic fact: Inputs in ML problems can often be represented as points or vectors in
some vector space.

Dimensionality Reduction: An unsupervised learning problem. The goal is to compress
the size of each input without losing much information present in the data.
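As a concrete sketch of this geometric view, the snippet below represents inputs as vectors and compresses them with a simple PCA-style projection. All data here is synthetic, purely for illustration:

```python
import numpy as np

# Made-up data: 100 inputs, each a 50-dimensional feature vector
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 50))

# Center the data, then project onto the top-k right singular vectors
# (this is the basic idea behind PCA, a dimensionality reduction method)
k = 5
Xc = X - X.mean(axis=0)
_, _, Vt = np.linalg.svd(Xc, full_matrices=False)
Z = Xc @ Vt[:k].T  # compressed representation: 100 inputs, each now 5-dimensional

print(Z.shape)  # (100, 5)
```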
Perspective as Function Approximation

Supervised Learning (“predict output given input”) can usually be thought of as
learning a function f that maps each input to the corresponding output, e.g., a
function that maps an image to p(label=“cat” | image).

Unsupervised learning can be viewed this way too, but it is harder since we don’t
know the labels in this case.
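As a minimal sketch of this view (not part of the lecture’s methods), below a linear function f is fit from input-output pairs by least squares; the “true” function and data are made up for illustration:

```python
import numpy as np

# Toy supervised data: learn f mapping a 3-D input x to a scalar output y
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
true_w = np.array([2.0, -1.0, 0.5])           # made-up "true" function
y = X @ true_w + 0.01 * rng.normal(size=200)  # outputs with a little noise

# Approximate f with a linear function f(x) = x @ w, fit by least squares
w, *_ = np.linalg.lstsq(X, y, rcond=None)

# Prediction for a new input; should be close to 2 - 1 + 0.5 = 1.5
x_new = np.array([1.0, 1.0, 1.0])
print(x_new @ w)
```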
Data and Features

Features represent the semantics of the inputs. Being able to extract good features
is key to the success of ML algos.

Example (bag-of-words for text data): each sentence is represented as a binary
vector (each feature is a binary value, denoting the presence or absence of a
word). BoW is also called the “unigram” representation.
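A minimal bag-of-words sketch; the sentences and vocabulary below are made up for illustration:

```python
# Tiny made-up corpus
sentences = ["the cat sat", "the dog ran", "a cat and a dog"]

# Build the vocabulary from all words seen in the corpus
vocab = sorted({w for s in sentences for w in s.split()})

# Each sentence -> binary vector: 1 if the word is present, else 0
def bow_vector(sentence):
    words = set(sentence.split())
    return [1 if w in words else 0 for w in vocab]

for s in sentences:
    print(s, "->", bow_vector(s))
```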
Example: Feature Extraction for Image Data

A very simple feature extraction approach for image data is flattening (stacking
all the pixel values into a single long vector).

A histogram of visual patterns is another popular feature extraction method for
images.

Many other manual feature extraction techniques have been developed in the computer
vision and image processing communities (SIFT, HoG, and others).
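A minimal sketch of flattening, plus a crude pixel-intensity histogram, on a made-up image (real visual-pattern histograms are more involved):

```python
import numpy as np

# A made-up 7x7 grayscale "image" of pixel intensities in [0, 255]
rng = np.random.default_rng(0)
img = rng.integers(0, 256, size=(7, 7))

# Flattening: concatenate the rows into a single 49-dimensional feature vector
x = img.flatten()
print(x.shape)  # (49,)

# A crude histogram feature: counts of pixel intensities in 8 bins
hist, _ = np.histogram(img, bins=8, range=(0, 256))
print(hist.sum())  # 49, since every pixel falls in exactly one bin
```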
Pic credit: cat.uab.cat/Research/object-recognition
Feature Selection

Not all the extracted features may be relevant for learning the model (some may
even confuse the learner).

Feature selection (a step after feature extraction) can be used to identify the
features that matter, and discard the others, for more effective learning.

Example: to predict body-mass index (BMI) from the features {age, gender, height,
weight, eye color}, only height and weight are relevant. (Calculating BMI from this
data doesn’t require ML; this simple example is just to illustrate the idea of
feature selection.)

Feature selection is more common in supervised learning but can also be done for
unsupervised learning.
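One simple way to select features (illustrative, not the only one) is to keep those that correlate strongly with the output; the data and the threshold below are made up:

```python
import numpy as np

# Made-up data: 5 features, but only features 2 and 3 actually drive the output
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 5))
y = 3.0 * X[:, 2] - 2.0 * X[:, 3] + 0.1 * rng.normal(size=500)

# Heuristic: keep features whose absolute correlation with y exceeds a threshold
corrs = np.array([abs(np.corrcoef(X[:, j], y)[0, 1]) for j in range(5)])
selected = np.where(corrs > 0.3)[0]
print(selected)  # indices of the features that matter (here, 2 and 3)
```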
Some More Postprocessing: Feature Scaling

Even after feature selection, the features may not be on the same scale.

This can be problematic when comparing two inputs: features that have larger scales
may dominate the result of such comparisons.

It is therefore helpful to standardize the features (e.g., by bringing all of them
onto the same scale, such as between 0 and 1).
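A minimal min-max scaling sketch on made-up data with two features on very different scales:

```python
import numpy as np

# Made-up data: feature 0 (e.g., age in years) vs feature 1 (e.g., income)
X = np.array([[25.0, 50000.0],
              [35.0, 80000.0],
              [45.0, 20000.0]])

# Min-max scaling: bring every feature into the [0, 1] range
X_min = X.min(axis=0)
X_max = X.max(axis=0)
X_scaled = (X - X_min) / (X_max - X_min)
print(X_scaled)  # every column now ranges from 0 to 1
```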
[Figure: a deep network, in which the raw input passes through a feature learning
module (one or more layers); the learned features (penultimate layer) then feed
into a classification model learning module.]

Pic an adaptation of the original from: https://deepai.org/
Some Notation/Nomenclature/Convention

Supervised learning requires training data as input-output pairs
{(x_1, y_1), ..., (x_N, y_N)}.

Unsupervised learning requires training data as inputs only {x_1, ..., x_N}.
(RL and other flavors of ML problems also use similar notation.)

Each input x_n is (usually) a vector containing the values of the features (or
attributes, or covariates) that encode properties of the object it represents.
For example, for a 7 × 7 image, x_n can be a 49 × 1 vector of pixel intensities.

The size or length D of the input is commonly known as the data/input
dimensionality or feature dimensionality.

(In supervised learning) Each y_n is the output (or response, or label) associated
with input x_n (and its value is known for the training inputs). The output can be
a scalar, a vector of numbers, or even a structured object (more on this later).
Types of Features and Types of Outputs
Features as well as outputs can be real-valued, binary, categorical, ordinal, etc.
Real-valued: Pixel intensity, house area, house price, rainfall amount, temperature, etc.
Categorical/Discrete: Zipcode, blood group, or any “one from finitely many choices” value
Ordinal: Grade (A/B/C etc.) in a course, or any other type where relative values matter
Often, the features can be of mixed types (some real, some categorical, some ordinal, etc.)
Some Basic Operations on Inputs

Assume each input feature vector x_n to be of size D.

Given N inputs x_1, ..., x_N, their average (or mean) can be computed as
mu = (1/N) * sum_{n=1}^{N} x_n. What does such a “mean” represent?
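The mean computation can be sketched as follows (data made up for illustration):

```python
import numpy as np

# N = 4 made-up inputs, each a D = 3 dimensional feature vector (one per row)
X = np.array([[1.0, 2.0, 3.0],
              [3.0, 2.0, 1.0],
              [2.0, 2.0, 2.0],
              [6.0, 2.0, 6.0]])

# Mean input: mu = (1/N) * sum_n x_n, computed feature-wise
mu = X.mean(axis=0)
print(mu)  # [3. 2. 3.]
```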