
Warming-up to ML, and Some Simple Supervised Learners

(Distance-based “Local” Methods)

Piyush Rai

Introduction to Machine Learning (CS771A)

August 2, 2018

Announcements

Please sign up on Piazza if you haven't already

I'll be clearing all the add-drop requests by tomorrow

Maths refresher tutorial on Aug 4, 6:00-7:30pm in RM-101. It will mostly cover the basics of multivariate calculus, linear algebra, probability/statistics, and optimization (basically things you are expected to know for this course)
Some Notation/Nomenclature/Convention

Supervised Learning requires training data given as a set of input-output pairs {(x_n, y_n)}_{n=1}^N

Unsupervised Learning requires training data given as a set of inputs {x_n}_{n=1}^N

Each input x_n is (usually) a vector containing the values of the features or attributes or covariates that encode properties of the data it represents, e.g.,
Representing a 7 × 7 image: x_n can be a 49 × 1 vector of pixel intensities

Note: Good features can also be learned from data (feature learning) or extracted using hand-crafted rules defined by a domain expert. Having a good set of features is half the battle won!

Each y_n is the output or response or label associated with input x_n
The output y_n can be a scalar, a vector of numbers, or a structured object (more on this later)
Some Notation/Nomenclature/Convention

Will assume each input x_n to be a D × 1 column vector (its transpose x_n^T will be a row vector)

x_nd will denote the d-th feature of the n-th input

We will use X (the N × D feature matrix) to collectively denote all the N inputs
We will use y (the N × 1 output/response/label vector) to collectively denote all the N outputs

[Figure: the N × D feature matrix X, whose n-th row is x_n^T = (x_n1, x_n2, ..., x_nD), alongside the N × 1 vector y of outputs, whose n-th entry y_n is the output for input n]

Note: If each y_n itself is a vector (we will see such cases later) then we will use a matrix Y to collectively denote all the N outputs (with row n containing y_n) and also use boldfaced y_n
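As a concrete illustration of this notation, here is a minimal NumPy sketch (the sizes and values are made up):

import numpy as np

N, D = 5, 3                 # N = 5 inputs, each with D = 3 features (made-up sizes)
X = np.random.randn(N, D)   # feature matrix X: row n is x_n^T
y = np.random.randn(N)      # output vector y: entry n is y_n

x_2 = X[2]                  # the input x_n for n = 2 (a length-D vector)
x_21 = X[2, 1]              # x_nd: feature d = 1 of input n = 2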
Getting Features from Raw Data: A Simple Example

Consider the feature representation for some text data consisting of the following sentences:
John likes to watch movies
Mary likes movies too
John also likes football

Our feature "vocabulary" consists of 8 unique words

Here is the bag-of-words feature vector representation of these 3 sentences

Here the features are binary (presence/absence of each word)

Again, note that this may not necessarily be the best "feature" representation for a given task (which is why other techniques or feature learning may be needed)
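Below is a minimal Python sketch of how such binary bag-of-words vectors can be constructed. The vocabulary here is built directly from the three sentences; the slide's original figure is not reproduced, and its exact vocabulary and word ordering may differ (e.g., if some words were dropped):

# Build binary bag-of-words vectors for the three example sentences.
sentences = [
    "John likes to watch movies",
    "Mary likes movies too",
    "John also likes football",
]

# Vocabulary: one feature per unique (lower-cased) word in the sentences.
vocab = sorted({w.lower() for s in sentences for w in s.split()})

def bow_vector(sentence, vocab):
    """1 if the word occurs in the sentence, 0 otherwise (presence/absence)."""
    words = set(sentence.lower().split())
    return [1 if w in words else 0 for w in vocab]

X = [bow_vector(s, vocab) for s in sentences]   # 3 x |vocab| binary feature matrix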
Types of Features and Types of Outputs

Features (in vector x_n) as well as outputs y_n can be real-valued, binary, categorical, ordinal, etc.
Real-valued: Pixel intensity, house area, house price, rainfall amount, temperature, etc.
Binary: Male/female, adult/non-adult, or any yes/no or present/absent type values
Categorical/Discrete: Pincode, blood group, or any "which one from this finite set" type values
Ordinal: Grade (A/B/C etc.) in a course, or any other type where the relative order matters

Often, the features can be of mixed types (some real, some categorical, some ordinal, etc.)

Appropriate handling of different types of features may be very important (even if your algorithm is designed to "learn" good features, given a set of heterogeneous features)

In Supervised Learning, different types of outputs may require different types of learning models
Supervised Learning

Supervised Learning

Supervised Learning comes in many flavors. The flavor depends on the type of each output y_n

Regression: y_n ∈ R (real-valued scalar)

Multi-Output Regression: y_n ∈ R^M (real-valued vector containing M outputs)
[Illustration: a 5-dim output vector for a multi-output regression problem, e.g., (0.3, 0.1, 0.2, 0.8, 0.4)]

Binary Classification: y_n ∈ {−1, +1} or {0, 1} (output in classification is also called "label")

Multi-class Classification: y_n ∈ {1, 2, ..., M} or {0, 1, ..., M − 1} (one of M classes is the correct label)
[Illustration: a 5-dim one-hot label vector for a multi-class classification problem, e.g., (0, 0, 0, 1, 0)]

Multi-label Classification: y_n ∈ {−1, +1}^M or {0, 1}^M (a subset of the M labels are correct)
[Illustration: a 5-dim binary label vector for a multi-label classification problem, e.g., (1, 0, 1, 0, 0); unlike one-hot, there can be multiple 1s]

Note: Multi-label classification is also informally called "tagging" (especially in Computer Vision)
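A small NumPy sketch of these output types (the vectors simply mirror the 5-dimensional examples above; all values are made up):

import numpy as np

y_reg = 2.7                                          # regression: a real-valued scalar
y_multi_reg = np.array([0.3, 0.1, 0.2, 0.8, 0.4])    # multi-output regression: a vector in R^M

y_bin = +1                                           # binary classification: label in {-1, +1}

M, c = 5, 3                                          # multi-class with M classes; class index c
y_onehot = np.zeros(M)
y_onehot[c] = 1                                      # one-hot vector: (0, 0, 0, 1, 0)

y_multilabel = np.array([1, 0, 1, 0, 0])             # multi-label: several of the M labels can be 1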
Supervised Learning (Contd.)

Structured Prediction (a.k.a. Structured Output Learning): Each y_n is a structured object

One-Class Classification (a.k.a. outlier/anomaly/novelty detection): y_n is "1" or "everything else"
[Illustration: examples from the class being modeled (e.g., animals) vs. all other examples ("outliers", e.g., humans, vehicles, etc.)]

Ranking: Each y_n is a ranked list of relevant stuff for a given input/query x
Computing Distances/Similarities

Assuming all real-valued features, an input x_n ∈ R^{D×1} is a point in a D-dimensional vector space of reals

Standard rules of vector algebra apply on such representations, e.g.,

Euclidean distance b/w two points (say two images or two documents) x_n ∈ R^D and x_m ∈ R^D:

d(x_n, x_m) = ||x_n − x_m|| = \sqrt{(x_n − x_m)^T (x_n − x_m)} = \sqrt{\sum_{d=1}^{D} (x_{nd} − x_{md})^2}

Inner-product similarity b/w x_n and x_m (equal to the cosine similarity when x_n and x_m are unit-length vectors):

s(x_n, x_m) = ⟨x_n, x_m⟩ = x_n^T x_m = \sum_{d=1}^{D} x_{nd} x_{md}

ℓ1 distance between two points x_n and x_m:

d_1(x_n, x_m) = ||x_n − x_m||_1 = \sum_{d=1}^{D} |x_{nd} − x_{md}|
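These quantities are straightforward to compute; below is a minimal NumPy sketch (the two vectors are made-up stand-ins for x_n and x_m):

import numpy as np

x_n = np.array([1.0, 2.0, 3.0])
x_m = np.array([2.0, 0.0, 4.0])

euclidean = np.sqrt(np.sum((x_n - x_m) ** 2))                  # ||x_n - x_m||
inner = np.dot(x_n, x_m)                                       # <x_n, x_m> = x_n^T x_m
cosine = inner / (np.linalg.norm(x_n) * np.linalg.norm(x_m))   # cosine similarity
l1 = np.sum(np.abs(x_n - x_m))                                 # ||x_n - x_m||_1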
Our First (Supervised) Learning Algorithm
(need to know nothing except how to compute distances/similarities between points!)
Prototype based Classification

Given: N labeled training examples {(x_n, y_n)}_{n=1}^N from two classes

Assume green is the positive and red is the negative class

N_+ examples from the positive class, N_− examples from the negative class

Our goal: Learn a model to predict the label (class) y for a new test example x

A simple "distance from means" model: predict the class that has a closer mean

Note: The basic idea easily generalizes to more than 2 classes as well
Prototype based Classification: More Formally

What does the decision rule look like, mathematically?

The mean of each class is given by

μ_− = (1/N_−) \sum_{n: y_n = −1} x_n    and    μ_+ = (1/N_+) \sum_{n: y_n = +1} x_n

Euclidean distances from each mean are given by

||μ_− − x||^2 = ||μ_−||^2 + ||x||^2 − 2⟨μ_−, x⟩
||μ_+ − x||^2 = ||μ_+||^2 + ||x||^2 − 2⟨μ_+, x⟩

Decision Rule: If f(x) := ||μ_− − x||^2 − ||μ_+ − x||^2 > 0 then predict +1, otherwise predict −1
Prototype based Classification: The Decision Rule

We saw that our decision rule was

f(x) := ||μ_− − x||^2 − ||μ_+ − x||^2 = 2⟨μ_+ − μ_−, x⟩ + ||μ_−||^2 − ||μ_+||^2

Imp.: f(x) effectively denotes a hyperplane-based classification rule f(x) = w^T x + b, with the vector w = μ_+ − μ_− representing the direction normal to the hyperplane

Imp.: Can show that the rule is equivalent to f(x) = \sum_{n=1}^{N} α_n ⟨x_n, x⟩ + b, where the α's and b can be estimated from the training data (try this as an exercise)

This form of the decision rule is very important. Decision rules for many (in fact most) supervised learning algorithms can be written like this (a weighted sum of similarities with all the training inputs)
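A minimal NumPy sketch of the "distance from means" classifier, assuming binary labels in {-1, +1} and toy data (an illustration of the rule above, not a reference implementation):

import numpy as np

def fit_prototypes(X, y):
    """Compute the class means (prototypes) from the training data."""
    mu_pos = X[y == +1].mean(axis=0)
    mu_neg = X[y == -1].mean(axis=0)
    return mu_pos, mu_neg

def predict(x, mu_pos, mu_neg):
    """f(x) = ||mu_- - x||^2 - ||mu_+ - x||^2; predict +1 if f(x) > 0, else -1."""
    f = np.sum((mu_neg - x) ** 2) - np.sum((mu_pos - x) ** 2)
    return +1 if f > 0 else -1

# Toy usage
X = np.array([[1.0, 1.0], [1.2, 0.8], [-1.0, -1.0], [-0.8, -1.2]])
y = np.array([+1, +1, -1, -1])
mu_pos, mu_neg = fit_prototypes(X, y)
print(predict(np.array([0.9, 1.1]), mu_pos, mu_neg))   # expected: +1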
Be Careful when Computing Distances

Euclidean distance d(x_n, x_m) = \sqrt{(x_n − x_m)^T (x_n − x_m)} may not always be appropriate

An alternative (still Euclidean-like) can be to use the Mahalanobis distance

d_M(x_n, x_m) = \sqrt{(x_n − x_m)^T M (x_n − x_m)}

Shown below is an illustration of what M = diag(1, 2), i.e., the 2 × 2 matrix [1 0; 0 2], will do (note: figure not to scale)

[Figure: the original space vs. the "effective" space under the Mahalanobis transformation]

How do I know what's the right M for my data? Some options:

Set it based on some knowledge of what your data looks like
Learn it from data (called Distance Metric Learning¹ - a whole research area in itself)

Distance Metric Learning is one of the many approaches for feature learning from data

¹ Distance Metric Learning. See "A Survey on Metric Learning for Feature Vectors and Structured Data" by Bellet et al.
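A small NumPy sketch of the Mahalanobis distance, using the illustrative M = diag(1, 2) from above (note that M equal to the identity recovers the plain Euclidean distance):

import numpy as np

def mahalanobis(x_n, x_m, M):
    """d_M(x_n, x_m) = sqrt((x_n - x_m)^T M (x_n - x_m))."""
    d = x_n - x_m
    return np.sqrt(d @ M @ d)

M = np.diag([1.0, 2.0])            # the illustrative M from the slide
x_n = np.array([1.0, 0.0])
x_m = np.array([0.0, 1.0])
print(mahalanobis(x_n, x_m, M))    # sqrt(1*1 + 2*1) = sqrt(3)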
Prototype based Classification: Some Comments

A very simple supervised learner. Works for any number of classes. Trivial to implement. :-)

This simple approach, if using Euclidean distances, can only learn linear decision boundaries
A reason: The basic approach implicitly assumes that classes are roughly spherical and equi-sized

Several nice improvements/generalizations possible (some of which we will see in coming lectures)
Instead of a point (mean), model classes by prob. distributions (to account for class shapes/sizes)
Instead of Euclidean distances, can use non-Euclidean distances, distance metric learning, or "kernels"

Another limitation: Needs plenty of training data from each class to reliably estimate the means
But with a good feature learner, even ONE (or very few) example per class may be enough (a state-of-the-art "Few-Shot Learning" model actually uses Prototype based classification)
Another Simple Supervised Learner:
Nearest Neighbors

Nearest Neighbor

Another classic distance-based supervised learning method

The label y for x ∈ R^D will be the label of its nearest neighbor in the training data. Also known as one-nearest-neighbor (1-NN)

Euclidean/Mahalanobis distance can be used to find the nearest neighbor (or can use a learned distance metric)

We typically use more (K > 1) neighbors in practice

Note: The method is widely applicable - works for both classification and regression problems
K-Nearest Neighbors (K-NN)

Makes one-nearest-neighbor more robust by using more than one neighbor
Test time simply does a majority vote (or average) of the labels of the K closest training inputs

For a test input x, the averaging version of the prediction rule for K-nearest neighbors is

y = (1/K) \sum_{n ∈ N_K(x)} y_n

... where N_K(x) is the set of the K closest training inputs for x

Above assumes the K neighbors have equal (1/K) weights. Can also use distance-based weights

Note: The rule works for multi-label classification too, where each y_n ∈ {0, 1}^M is a binary vector
Averaging will give a real-valued "label score vector" y ∈ R^M using which we can find the best label(s)
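A minimal NumPy sketch of the equal-weight averaging rule above (the function name and toy interface are my own; for binary {0, 1} labels the returned average can be thresholded at 0.5, and for multi-label vectors it gives the label score vector):

import numpy as np

def knn_predict(X_train, y_train, x_test, K=3):
    """Average the labels of the K training inputs closest to x_test (Euclidean distance)."""
    dists = np.linalg.norm(X_train - x_test, axis=1)   # distance to every training point
    nearest = np.argsort(dists)[:K]                    # indices of the K closest inputs, N_K(x)
    return y_train[nearest].mean(axis=0)               # (1/K) * sum of their labels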
K-NN for Multi-Label Learning: Pictorial Illustration

Suppose K = 3. The label averaging for a multi-label learning problem will look like

y = (1/3) × [(1, 0, 0, 1, 0) + (1, 0, 1, 1, 0) + (1, 0, 0, 0, 1)] = (1, 0, 0.33, 0.66, 0.33)

(per-label rank by score: #1, #4, #3, #2, #3)

Note that we can use the final y to rank the labels based on the real-valued scores
Can use it to predict the best, best-2, best-3, and so on..

Note: This is why multi-label learning is often used in some ranking problems where we wish to predict a ranking of the possible labels an input can have
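The averaging in this picture can be reproduced directly (a small NumPy sketch):

import numpy as np

neighbor_labels = np.array([[1, 0, 0, 1, 0],   # label vectors of the K = 3 nearest neighbors
                            [1, 0, 1, 1, 0],
                            [1, 0, 0, 0, 1]])
scores = neighbor_labels.mean(axis=0)          # approximately (1, 0, 0.33, 0.66, 0.33)
ranking = np.argsort(-scores)                  # label indices ordered from highest to lowest score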
How to Select K: Cross-Validation

We can use cross-validation to select the "optimal" value of K

Cross-validation: Divide the training data into two parts, an actual training set and a validation set

[Figure: the training data is split into an (actual) training set and a validation set; the test data is kept aside and never ever touched while training]

Try different values of K and look at the accuracies on the validation set
Note: For each K, we typically try multiple splits of train and validation sets

Select the K that gives the best accuracy on the validation set

Never touch the test set (even if you have access to it) during training to choose the best K
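A minimal sketch of this procedure with a single train/validation split, assuming NumPy and the knn_predict sketch from earlier, plus binary {0, 1} labels (in practice, as noted, one would average accuracies over multiple splits):

import numpy as np

def choose_K(X, y, candidate_Ks=(1, 3, 5, 7, 9), val_frac=0.2, seed=0):
    """Hold out a validation set and pick the K with the best validation accuracy."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(X))
    n_val = int(val_frac * len(X))
    val_idx, tr_idx = idx[:n_val], idx[n_val:]
    best_K, best_acc = None, -1.0
    for K in candidate_Ks:
        preds = np.array([knn_predict(X[tr_idx], y[tr_idx], x, K) for x in X[val_idx]])
        acc = np.mean((preds > 0.5).astype(int) == y[val_idx])   # accuracy on the validation set
        if acc > best_acc:
            best_K, best_acc = K, acc
    return best_K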
ε-Ball Nearest Neighbors

Instead of the K nearest neighbors, can instead consider an ε-radius ball centered at the test point

[Figure: a ball of radius ε centered at the test point]

Just like selecting K, we can select the optimal ε via cross-validation

Have to be careful to choose ε so as to not get zero neighbors within the ε-ball :-)
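A small NumPy sketch of ε-ball prediction (the fallback to the single nearest neighbor when the ball is empty is just one simple way to handle the zero-neighbor issue noted above):

import numpy as np

def eps_ball_predict(X_train, y_train, x_test, eps=1.0):
    """Average the labels of all training points within distance eps of x_test."""
    dists = np.linalg.norm(X_train - x_test, axis=1)
    inside = dists <= eps
    if not np.any(inside):                 # empty ball: fall back to the single nearest neighbor
        inside = dists == dists.min()
    return y_train[inside].mean(axis=0)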
Some Aspects about Nearest Neighbor

A simple yet very effective method in practice (if given lots of training data)
Provably has an error-rate that is no worse than twice that of the "Bayes optimal" classifier, which assumes knowledge of the true data distribution for each class

Also called a memory-based or instance-based or non-parametric method
No "model" is learned here. The prediction step uses all the training data

Requires lots of storage (need to keep all the training data at test time)

Prediction can be slow at test time
For each test point, need to compute its distance from all the training points
Clever data structures or data-summarization techniques can provide speed-ups

Need to be careful in choosing the distance function to compute distances (especially when the data dimension D is very large)

The 1-NN can suffer if the data contains outliers (we will soon see a geometric illustration), or if the amount of training data is small. Using more neighbors (K > 1) is usually more robust
Geometry of 1-NN

1-NN induces a Voronoi tessellation of the input space

The Decision Boundary of 1-NN (for binary classification)

The decision boundary is composed of hyperplanes that form perpendicular bisectors of pairs of
points from different classes

Pic credit: Victor Lavrenko


Effect of Outliers on 1-NN

How the decision boundary can drastically change when the data contains some outliers

Pic credit: Victor Lavrenko


Effect of Varying K

Larger K leads to smoother decision boundaries

Too small K (e.g., K = 1) can lead to overfitting, too large K can lead to underfitting

Pic credit: Chris Bishop (PRML)
K -NN Behavior for Regression

Pic credit: Victor Lavrenko


K -NN Behavior for Regression

Pic credit: Alex Smola and Vishy Vishwanathan


Summary

Looked at two distance-based methods for classification/regression
A "Distance from Means" Method
Nearest Neighbors Method

Both are essentially "local" methods (look at local neighborhood of the test point)

Both are simple to understand and only require knowledge of basic geometry

Have connections to other more advanced methods (as we will see)

Need to be careful when computing the distances (learned Mahalanobis distance metrics, or "learned features" + Euclidean distance can often do wonders!)
