CSE 474/574 Introduction To Machine Learning Fall 2011 Assignment 3
The assignment is due at the beginning of the class on the above mentioned due
date. Submit a hard copy in class. Attach print out of code, figures/results of any coding
questions in your hard copy along with answers to other questions.
                C1              C2
Sample      x1     x2       x1     x2
  1         1.7    3.0      1.1   -1.3
  2         1.2    3.5      0.2    3.5
  3         1.5    5.0      1.5    4.3
  4         0.6    4.5      3.2    0.8
  5         0.5    4.0      4.0    2.7
Y(X̃) = X̃ W̃

where W̃ is a (D + 1) × K matrix whose kth column is the (D + 1) × 1 weight vector
for the class Ck, and X̃ is the augmented input matrix of size N × (D + 1).
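As an illustration of this matrix form (sketched in Python/NumPy rather than Matlab, using small made-up data and 1-of-K targets T that are not part of the assignment), W̃ can be fit by least squares, as in the textbook's §4.1.3, and class predictions read off row-wise from Y = X̃ W̃:

```python
import numpy as np

# Hypothetical data, NOT from the assignment: N = 4 points in D = 2 dims,
# K = 2 classes, with 1-of-K target rows in T
X = np.array([[1.7, 3.0], [1.2, 3.5], [1.1, -1.3], [3.2, 0.8]])
T = np.array([[1, 0], [1, 0], [0, 1], [0, 1]])

N, D = X.shape
Xt = np.hstack([np.ones((N, 1)), X])        # augmented X~, size N x (D+1)
# Least-squares fit: W~ = (X~^T X~)^{-1} X~^T T, one column per class
Wt = np.linalg.lstsq(Xt, T, rcond=None)[0]  # shape (D+1) x K

Y = Xt @ Wt                 # Y(X~) = X~ W~, row n holds y(x_n)
pred = Y.argmax(axis=1)     # assign each point to the class with largest y_k
print(pred)
```

With least squares and 1-of-K targets, each row of Y sums to one (a known property of this fit), though the entries are not valid probabilities in general.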
(a) Write a Matlab program to calculate w using the same data you generated
in 1b. Plot the projection line in the direction of w (as a line passing
through (0,0)) on the same figure as the marked data points. [12]
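Although the assignment asks for Matlab, the computation can be sketched in Python/NumPy. This assumes w is the Fisher criterion solution, w ∝ S_W⁻¹(m1 − m2), applied to the tabulated data above:

```python
import numpy as np

# Data from the table above (columns: x1, x2)
C1 = np.array([[1.7, 3.0], [1.2, 3.5], [1.5, 5.0], [0.6, 4.5], [0.5, 4.0]])
C2 = np.array([[1.1, -1.3], [0.2, 3.5], [1.5, 4.3], [3.2, 0.8], [4.0, 2.7]])

m1, m2 = C1.mean(axis=0), C2.mean(axis=0)   # class means
S1 = (C1 - m1).T @ (C1 - m1)                # within-class scatter of C1
S2 = (C2 - m2).T @ (C2 - m2)                # within-class scatter of C2
Sw = S1 + S2                                # total within-class scatter

# Fisher direction w ∝ Sw^{-1}(m1 - m2), oriented so C1 projects higher
w = np.linalg.solve(Sw, m1 - m2)
w = w / np.linalg.norm(w)                   # unit length, convenient for plotting
print(w)
```

The projection line through the origin can then be drawn by plotting the points t·w for a range of scalars t on top of the scatter plot of the two classes.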
(b) Find the values of the projected 1-D points y(x) for both classes. To construct a
discriminant, model the class-conditional densities p(y|Ck ) as 1-D Gaussian
distributions fit by maximum likelihood. Then, setting ln p(y|C1 ) = ln p(y|C2 )
under the assumption of equal priors, we can find a threshold y0 . Derive the solution
for y0 , and classify each data point as belonging to C1 if y(x) ≥ y0 and to
C2 otherwise. [12]
(c) Calculate the error rate (the fraction of misclassified samples) and comment on the result.
[6]
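A sketch of parts (b) and (c) together in Python/NumPy (assuming the Fisher direction from part (a) and the tabulated data): equating the two Gaussian log-densities gives a quadratic in y, and the root lying between the two projected means serves as the threshold y0:

```python
import numpy as np

# Data from the table above
C1 = np.array([[1.7, 3.0], [1.2, 3.5], [1.5, 5.0], [0.6, 4.5], [0.5, 4.0]])
C2 = np.array([[1.1, -1.3], [0.2, 3.5], [1.5, 4.3], [3.2, 0.8], [4.0, 2.7]])

# Fisher direction, as in part (a)
m1, m2 = C1.mean(axis=0), C2.mean(axis=0)
Sw = (C1 - m1).T @ (C1 - m1) + (C2 - m2).T @ (C2 - m2)
w = np.linalg.solve(Sw, m1 - m2)

y1, y2 = C1 @ w, C2 @ w                    # 1-D projections y(x) = w^T x

# Maximum-likelihood Gaussian fits p(y|Ck) = N(y | mu_k, s_k^2)
mu1, mu2 = y1.mean(), y2.mean()
s1, s2 = y1.std(), y2.std()                # ML estimate divides by N, not N-1

# ln p(y|C1) = ln p(y|C2) with equal priors reduces to the quadratic
# (1/s2^2 - 1/s1^2) y^2 - 2 (mu2/s2^2 - mu1/s1^2) y
#   + (mu2^2/s2^2 - mu1^2/s1^2) + 2 ln(s2/s1) = 0
a = 1 / s2**2 - 1 / s1**2
b = -2 * (mu2 / s2**2 - mu1 / s1**2)
c = mu2**2 / s2**2 - mu1**2 / s1**2 + 2 * np.log(s2 / s1)
roots = np.roots([a, b, c])
# keep the root between the two projected class means
y0 = next(r.real for r in roots if min(mu1, mu2) <= r.real <= max(mu1, mu2))

# Classify: C1 if y >= y0, else C2, and count misclassifications
errors = np.sum(y1 < y0) + np.sum(y2 >= y0)
error_rate = errors / (len(y1) + len(y2))
print(y0, error_rate)
```

When the two fitted variances happen to be equal, the quadratic term vanishes and the threshold reduces to the midpoint y0 = (mu1 + mu2)/2.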
(a) Using equations (4.57) and (4.58) from the textbook, derive the result (4.65) for
the posterior class probability in the two-class generative model with Gaussian
densities, and verify the results (4.66) and (4.67) for the parameters w and w0. [6]
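For reference, the derivation writes the posterior as a logistic sigmoid of the log-odds; with a shared covariance Σ the quadratic terms in x cancel, leaving a linear function of x whose parameters match (4.66) and (4.67):

```latex
p(C_1 \mid \mathbf{x})
  = \frac{p(\mathbf{x}\mid C_1)\,p(C_1)}
         {p(\mathbf{x}\mid C_1)\,p(C_1) + p(\mathbf{x}\mid C_2)\,p(C_2)}
  = \sigma(a),
\qquad
a = \ln\frac{p(\mathbf{x}\mid C_1)\,p(C_1)}{p(\mathbf{x}\mid C_2)\,p(C_2)}.

\text{With } p(\mathbf{x}\mid C_k) = \mathcal{N}(\mathbf{x}\mid \boldsymbol\mu_k,\boldsymbol\Sigma),
\text{ the terms } \mathbf{x}^{\mathsf T}\boldsymbol\Sigma^{-1}\mathbf{x}
\text{ cancel in } a, \text{ giving } a = \mathbf{w}^{\mathsf T}\mathbf{x} + w_0 \text{ with}

\mathbf{w} = \boldsymbol\Sigma^{-1}(\boldsymbol\mu_1 - \boldsymbol\mu_2),
\qquad
w_0 = -\tfrac12\,\boldsymbol\mu_1^{\mathsf T}\boldsymbol\Sigma^{-1}\boldsymbol\mu_1
      + \tfrac12\,\boldsymbol\mu_2^{\mathsf T}\boldsymbol\Sigma^{-1}\boldsymbol\mu_2
      + \ln\frac{p(C_1)}{p(C_2)}.
```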
(b) Comment on the discriminant function (decision boundary) in the case where the
two classes share the same covariance matrix. [4]
(c) Evaluate the discriminant function in Matlab using the 2-D data you generated
in 1c by computing the parameter w as a 2 × 1 vector and w0 as a scalar using
(4.66) and (4.67). Plot and verify the decision boundary on the same figure as
the data points. [6]
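The computation can be sketched in Python/NumPy (the assignment asks for Matlab). The synthetic two-Gaussian data below is a hypothetical stand-in for the data generated in 1c, with equal priors assumed so the ln-prior term in (4.67) vanishes:

```python
import numpy as np

rng = np.random.default_rng(0)
# Hypothetical stand-in for the 2-D data of 1c: two Gaussian classes with a
# shared covariance (means and covariance chosen for illustration only)
mu1, mu2 = np.array([1.0, 4.0]), np.array([2.0, 2.0])
Sigma = np.array([[1.0, 0.3], [0.3, 1.0]])
X1 = rng.multivariate_normal(mu1, Sigma, 100)
X2 = rng.multivariate_normal(mu2, Sigma, 100)

# ML estimates: class means and pooled (shared) covariance
m1, m2 = X1.mean(axis=0), X2.mean(axis=0)
S = ((X1 - m1).T @ (X1 - m1) + (X2 - m2).T @ (X2 - m2)) / (len(X1) + len(X2))

# Parameters of the linear discriminant, following (4.66) and (4.67)
# with equal priors
Sinv = np.linalg.inv(S)
w = Sinv @ (m1 - m2)                                 # 2 x 1 direction vector
w0 = -0.5 * m1 @ Sinv @ m1 + 0.5 * m2 @ Sinv @ m2    # scalar offset

# Decision boundary: the set {x : w^T x + w0 = 0}; the posterior for C1 is
# the logistic sigmoid of the discriminant value
posterior1 = 1.0 / (1.0 + np.exp(-(X1 @ w + w0)))
print(w, w0, posterior1.mean())
```

For plotting, the boundary line can be drawn by solving w[0]*x + w[1]*y + w0 = 0 for y over the x-range of the scatter plot.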
5. Logistic Regression
Show that for a linearly separable data set, the maximum likelihood solution for the
logistic regression model is obtained by finding a vector w whose decision boundary
w^T φ(x) = 0 separates the classes and then taking the magnitude of w to infinity. [6]