0% found this document useful (0 votes)

215 views4 pages

EXAMPLE Machine Learning (C395) Exam Questions

The document contains sample questions and solutions for a machine learning exam. The questions cover topics like gradient descent, deriving the LMS training rule, calculating information gain, designing neural networks, and applying genetic algorithms to optimize a decision tree for classifying cars. The solutions provide mathematical derivations, diagrams, and pseudocode to thoroughly explain the concepts.

Uploaded by

Nithin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

215 views4 pages

EXAMPLE Machine Learning (C395) Exam Questions

Uploaded by

Nithin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

EXAMPLE Machine Learning (C395) Exam Questions

(1) Question: Explain the principle of the gradient descent algorithm. Accompany
your explanation with a diagram. Explain the use of all the terms and constants
that you introduce and comment on the range of values that they can take.

Solution: Training can be posed as an optimization problem, in which the goal

is to optimize a function (usually to minimize a cost function E) with respect to
a number of free variables, usually weights wi. The gradient decent algorithm
begins from an initialization of the weights (e.g. a random initialization) and in
an iterative procedure updates the weights wi by a quantity Δwi, where Δwi = –α
(∂E / ∂wi) and (∂E / ∂wi) is the gradient of the cost function with respect to the
weights, while α is a constant which takes small values in order to keep the
updates low and avoid oscillations.

(2) Question: Derive the gradient descent training rule assuming that the target
function representation is:

od = w0 + w1x1 + … + wnxn.

Define explicitly the cost/error function E, assuming that a set of training

examples D is provided, where each training example d ∈ D is associated with
the target output td.

Solution: The error function: E = ∑d D (td – od)2

∈

The gradient decent algorithm: Δwi = –α (∂E / ∂wi)

First represent (∂E / ∂wi) in terms of the unit inputs xid, outputs od, and target
values td:
(∂E / ∂wi) = (∂∑d D (td – od)2) / ∂wi = ∑d D 2(td – od) (∂(td – od) / ∂wi) =
∈ ∈

∑d D 2(td – od) (–∂od / ∂wi) = –∑d D 2(td – od) (∂(w0 + … + wixid + … + wnxnd) /
∈ ∈

∂wi) = –∑d D 2(td – od) (xid)

∈

=> Δwi = α ∑d D 2(td – od) xid

∈
(3) Question: Prove that the LMS training rule performs a gradient descent to
minimize the cost/error function E defined in (2).

Solution: Given the target function representation

od = w0 + w1x1 + … + wnxn,
LMS training rule is a learning algorithm for choosing the set of weights wi to
best fit the set of training examples {< d, td >}, i.e., to minimize the squared
error E ≡ ∑d D (td – od)2.
∈

LMS training rule works as follows:

(∀ < d, td >) use the current weights wi to calculate od
(∀wi) wi ← wi + η(td – od)xid (*)

From (2) à (∂E / ∂wi) = –∑d D 2(td – od)xid à –(1/2xid)(∂E / ∂wi) = (td – od)
∈

Substitute this in (*) à (∀wi) wi ← wi + (η/2)(–∂E / ∂wi)

This shows that LMS alters weights in the very same proportion as does the
gradient descent algorithm (i.e., –∂E / ∂wi), proving that LMS performs gradient
descent.

(4) Question: Consider the following set of training examples:

Instance Classification a1 a2
1 + T T
2 + T T
3 - T F
4 + F F
5 - F T
6 - F T

What is the information gain of a2 relative to these training examples? Provide

the equation for calculating the information gain as well as the intermediate
results.

Solution:
Entropy E(S) = E([3+, 3-]) = -(3/6) log2 (3/6) - (3/6) log2 (3/6) = 1.
Gain (S, a2) = E(S) – (4/6)E(T) – (2/6)E(F) = 1 – 4/6 – 2/6 ≈ 0.
E(T) = E([2+, 2-]) = 1.
E(F) = E([1+, 1-]) = 1.

(5) Question: Suppose that we want to build a neural network that classifies two
dimensional data (i.e., X = [x1, x2]) into two classes: diamonds and crosses. We
have a set of training data that is plotted as follows:
X2

Draw a network that can solve this classification problem. Justify your choice of
the number of nodes and the architecture. Draw the decision boundary that your
network can find on the diagram.

Solution:
A solution is a multilayer FFNN with 2 inputs, one hidden layer with 4 neurons
and 1 output layer with 1 neuron. The network should be fully connected, that is
there should be connections between all nodes in one layer with all the nodes in
the previous (and next) layer. We have to use two inputs because the input data
is two dimensional. We use an output layer with one neuron because we have 2
classes. One hidden layer is enough because there is a single compact region
that contains the data from the crosses-class and does not contain data from the
diamonds-class. This region can have 4 lines as borders, therefore it suffices if
there are 4 neurons at the hidden layer. The 4 neurons in the hidden layer
describe 4 separating lines and the neuron at the output layer describes the
square that is contained between these 4 lines.

(6) Question: Suppose that we want to solve the problem of finding out what a
good car is by using genetic algorithms. Suppose further that the solution to the
problem can be represented by a decision tree as follows:

size
large small
mid

brand no sport
yes
Volvo BMW SUV engine no
no yes no
F12 V10 V8
no yes no
What is the appropriate chromosome design for the given problem? Which
Genetic Algorithm parameters need to be defined? What would be the suitable
values of those parameters for the given problem? Provide a short explanation
for each.
What is the result of applying a single round of the prototypical Genetic
Algorithm? Explain your answer in a clear and compact manner by providing
the pseudo code of the algorithm.

Solution:
size = {large, mid, small} → 100, 010, 001, 011, …, 111, 000
brand = {Volvo, BMW, SUV} → 100, 010, 001, 011, …, 111, 000
sport = {yes, no} → 10, 01, 11, 00
engine = {F12, V12, V8} → 100, 010, 001, 011, …, 111, 000
GoodCar = {yes, no} → 10, 01, 11, 00
→ chromosome design:
size brand sport engine GoodCar
100 100 11 111 01

Fitness function for the given problem can be defined as a Sigmoid function f(x)
= 1 / (1+ e-x), where x is the percentage of all training examples correctly
classified by a specific solution (chromosome).
Selection method – e.g., rank selection method can be used;
Crossover technique – 2-point crossover can be used for the given problem with
a crossover mask 1111110000011; the reason is that either size + brand or
sport + engine define the solution
Crossover rate – usually k = 60%
Mutation rate – usually 1%
Termination condition – e.g., all training examples are correctly classified

GA pseudo code:
Step 1: Choose initial population.
Step 2: Evaluate the fitness of individuals in the population.
Step 3: Select k individuals to reproduce; breed new generation through
crossover and mutation; evaluate the individual fitness of offspring; replace k
worse ranked part of population with offspring.
Step 4: Repeat step 3 until the termination condition is reached.

s1/2: 1010101001010, 1011111110101, 0100011111101, 0011111001010,

1011011110101
s3: 1010101110110 (fit 1), 0011111001001 (fit 0), 1010101111110 (fit 2),
1011011001001 (fit 1), 0011111110101 (fit 2), 1011011110101 (fit 3)
result: 1011111110101, 1011011110101, 1011011110101, 0011111110101,
1010101111110

Statistics With Computer Application
100% (6)
Statistics With Computer Application
17 pages
Feature Selection For High-Dimensional Data: Verónica Bolón-Canedo Noelia Sánchez-Maroño Amparo Alonso-Betanzos
No ratings yet
Feature Selection For High-Dimensional Data: Verónica Bolón-Canedo Noelia Sánchez-Maroño Amparo Alonso-Betanzos
163 pages
Deep Reinforcement Learning
No ratings yet
Deep Reinforcement Learning
406 pages
Machine Learning Techniques - Types of Machine Learning - Applications Mathematical Foundations of Machine Learning
No ratings yet
Machine Learning Techniques - Types of Machine Learning - Applications Mathematical Foundations of Machine Learning
15 pages
Total Listing Machine Learning
100% (1)
Total Listing Machine Learning
114 pages
WM T Code List
No ratings yet
WM T Code List
9 pages
Thematic Apperception Test
100% (6)
Thematic Apperception Test
26 pages
Deep Reinforcement Learning Based Optimization Techniques For Ene
No ratings yet
Deep Reinforcement Learning Based Optimization Techniques For Ene
152 pages
4.1 Reinforcement Learning 2
No ratings yet
4.1 Reinforcement Learning 2
31 pages
Minor Project Ii Report Text Mining: Reuters-21578: Submitted by
100% (1)
Minor Project Ii Report Text Mining: Reuters-21578: Submitted by
51 pages
Back Propagation Back Propagation Network Network Network Network
No ratings yet
Back Propagation Back Propagation Network Network Network Network
29 pages
Tensorflow 2 - 0 Slides PDF
No ratings yet
Tensorflow 2 - 0 Slides PDF
100 pages
RAG With Math
No ratings yet
RAG With Math
7 pages
Back-Propagation Is Very Simple. Who Made It Complicated
No ratings yet
Back-Propagation Is Very Simple. Who Made It Complicated
26 pages
Discussion 4 Pytorch
100% (1)
Discussion 4 Pytorch
37 pages
Algorithms in ML
No ratings yet
Algorithms in ML
15 pages
Instant Ebooks Textbook Deep Generative Modeling Jakub M. Tomczak Download All Chapters
No ratings yet
Instant Ebooks Textbook Deep Generative Modeling Jakub M. Tomczak Download All Chapters
49 pages
Reinforcement Learning I:: The Setting and Classical Stochastic Dynamic Programming Algorithms
No ratings yet
Reinforcement Learning I:: The Setting and Classical Stochastic Dynamic Programming Algorithms
42 pages
LSTM
No ratings yet
LSTM
42 pages
Scaling AI and ML
No ratings yet
Scaling AI and ML
4 pages
Btech CSE
No ratings yet
Btech CSE
17 pages
Stochastic Modelling & Its Applications
No ratings yet
Stochastic Modelling & Its Applications
19 pages
MACHINELEARING UNIT 1material
100% (1)
MACHINELEARING UNIT 1material
64 pages
Back Propagation Technique
No ratings yet
Back Propagation Technique
24 pages
Artificial Neural Networks: Part 1/3
No ratings yet
Artificial Neural Networks: Part 1/3
25 pages
(Treading On Python 2) Matt Harrison - Treading On Python Volume 2 - Intermediate Python 2 (2013, Hairysun)
No ratings yet
(Treading On Python 2) Matt Harrison - Treading On Python Volume 2 - Intermediate Python 2 (2013, Hairysun)
144 pages
What Is Naive Bayes Algorithm?
No ratings yet
What Is Naive Bayes Algorithm?
18 pages
RL Unit 2
No ratings yet
RL Unit 2
11 pages
U02Lecture07 Classification
100% (1)
U02Lecture07 Classification
56 pages
Module2.3 Hyperparameter Optimization
No ratings yet
Module2.3 Hyperparameter Optimization
29 pages
Radial Basis Functions With Adaptive Input and Composite Trend Representation For Portfolio Selection
100% (1)
Radial Basis Functions With Adaptive Input and Composite Trend Representation For Portfolio Selection
13 pages
Notes On Backpropagation
No ratings yet
Notes On Backpropagation
14 pages
Machine Learning: Andrew NG's Course From Coursera: Presentation
100% (1)
Machine Learning: Andrew NG's Course From Coursera: Presentation
4 pages
How To Code A Neural Network With Backpropagation in Python
No ratings yet
How To Code A Neural Network With Backpropagation in Python
133 pages
Adaline/Madaline:Applications
100% (1)
Adaline/Madaline:Applications
25 pages
Deep Learning@Ok Interviews
No ratings yet
Deep Learning@Ok Interviews
6 pages
Lectures 2 Heuristic Optimization Methods:: Combinatorial Optimization Complexity Theory When and Why To Use Heuristics
No ratings yet
Lectures 2 Heuristic Optimization Methods:: Combinatorial Optimization Complexity Theory When and Why To Use Heuristics
37 pages
ISyE 6669 Homework 15 PDF
No ratings yet
ISyE 6669 Homework 15 PDF
3 pages
Recurrent Neural Network Wiki
100% (1)
Recurrent Neural Network Wiki
7 pages
ANN Matlab
No ratings yet
ANN Matlab
13 pages
Modeling Mindsets
No ratings yet
Modeling Mindsets
113 pages
Machine Learning Lab Assignment CSE-716: S. M. Shafkat Raihan ID: 16701041 SESSION: 2015-16
No ratings yet
Machine Learning Lab Assignment CSE-716: S. M. Shafkat Raihan ID: 16701041 SESSION: 2015-16
9 pages
Neural Network and Their Applications
No ratings yet
Neural Network and Their Applications
2 pages
RNN and LSTM: YANG Jiancheng
No ratings yet
RNN and LSTM: YANG Jiancheng
15 pages
An Introduction To Mathematics Behind Neural Networks
No ratings yet
An Introduction To Mathematics Behind Neural Networks
5 pages
Deep Reinforcement Learning
No ratings yet
Deep Reinforcement Learning
47 pages
Risk-Constrained Markov Decision Processes
No ratings yet
Risk-Constrained Markov Decision Processes
6 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
3 pages
Machine Learning LAB MANUAL
No ratings yet
Machine Learning LAB MANUAL
56 pages
IEER
100% (1)
IEER
252 pages
IndiaInvestments Wiki
No ratings yet
IndiaInvestments Wiki
432 pages
Automatic Differentiation With Pytorch: Stat 479: Deep Learning, Spring 2019 Sebastian Raschka
No ratings yet
Automatic Differentiation With Pytorch: Stat 479: Deep Learning, Spring 2019 Sebastian Raschka
43 pages
Bayesian Learning
No ratings yet
Bayesian Learning
49 pages
Ps and Solution CS229
No ratings yet
Ps and Solution CS229
55 pages
Stanford University CS224d - Deep Learning For Natural Language Processing - Syllabus
No ratings yet
Stanford University CS224d - Deep Learning For Natural Language Processing - Syllabus
3 pages
08 Model Verification & Validation
No ratings yet
08 Model Verification & Validation
30 pages
Lecture 1: Introduction To Reinforcement Learning: David Silver
No ratings yet
Lecture 1: Introduction To Reinforcement Learning: David Silver
46 pages
Health and Safety in Relation To The Use of ICT Systems
No ratings yet
Health and Safety in Relation To The Use of ICT Systems
2 pages
Class Material - 1
No ratings yet
Class Material - 1
66 pages
Distributed Data Systems: BITS Pilani
No ratings yet
Distributed Data Systems: BITS Pilani
19 pages
Behavioral Finance
No ratings yet
Behavioral Finance
2 pages
Dimensions of Knowledge
No ratings yet
Dimensions of Knowledge
10 pages
TF Idf Algorithm
No ratings yet
TF Idf Algorithm
4 pages
QUESTION AND ANSWERS For An Angel in Disguise
No ratings yet
QUESTION AND ANSWERS For An Angel in Disguise
8 pages
Birman-Schiper-Stephenson Protocol Example
100% (1)
Birman-Schiper-Stephenson Protocol Example
3 pages
Midterm Solutions For Machine Learning
No ratings yet
Midterm Solutions For Machine Learning
13 pages
BSS Example PDF
No ratings yet
BSS Example PDF
3 pages
EXAMPLE Machine Learning (C395) Exam Questions
No ratings yet
EXAMPLE Machine Learning (C395) Exam Questions
4 pages
11U.Essay Writing 101
No ratings yet
11U.Essay Writing 101
18 pages
Marketing Plan
0% (1)
Marketing Plan
48 pages
Engineering 23 06 2017
No ratings yet
Engineering 23 06 2017
137 pages
What Is A Support Vector Machine?: Primer
No ratings yet
What Is A Support Vector Machine?: Primer
3 pages
Articulation Assignment Final
No ratings yet
Articulation Assignment Final
7 pages
Static Balancing
No ratings yet
Static Balancing
4 pages
CH 9
No ratings yet
CH 9
9 pages
Background Reading - R Tree With Examples
No ratings yet
Background Reading - R Tree With Examples
24 pages
Science Social Studies 2nd Quarter Lesson Plans
No ratings yet
Science Social Studies 2nd Quarter Lesson Plans
4 pages
Invisibility of Class Privilege
No ratings yet
Invisibility of Class Privilege
2 pages
BITS Pilani: Distributed Computing Global State & Snapshot Recording Algorithms
No ratings yet
BITS Pilani: Distributed Computing Global State & Snapshot Recording Algorithms
53 pages
Rs - Resources
No ratings yet
Rs - Resources
2 pages
BITS Pilani: Distributed Computing
No ratings yet
BITS Pilani: Distributed Computing
73 pages
Research Forum Script
No ratings yet
Research Forum Script
4 pages
Eoies Task12 B2.2 Cte
No ratings yet
Eoies Task12 B2.2 Cte
7 pages
Use-Case Model: Drawing System Sequence Diagrams
No ratings yet
Use-Case Model: Drawing System Sequence Diagrams
15 pages
BAB2202LAW 2034 Company Law Subject Overview - 2013
No ratings yet
BAB2202LAW 2034 Company Law Subject Overview - 2013
8 pages
Machine Learning 10-701 Final Exam May 5, 2015: Obvious Exceptions For Pacemakers and Hearing Aids
No ratings yet
Machine Learning 10-701 Final Exam May 5, 2015: Obvious Exceptions For Pacemakers and Hearing Aids
17 pages
T
No ratings yet
T
267 pages
Section I: Personal Information
No ratings yet
Section I: Personal Information
2 pages
AstroWeb Planetary Position, Lagna Chart
No ratings yet
AstroWeb Planetary Position, Lagna Chart
1 page
BU Subject Choice 316398 26oct2022 PDF
No ratings yet
BU Subject Choice 316398 26oct2022 PDF
1 page
TAP413 3 Force Moving Charge
No ratings yet
TAP413 3 Force Moving Charge
5 pages
Traceback
No ratings yet
Traceback
3 pages
Global Vector Control Response 2017-2030: Fourth Draft (Version 4.3)
No ratings yet
Global Vector Control Response 2017-2030: Fourth Draft (Version 4.3)
50 pages
Midterm Exam
No ratings yet
Midterm Exam
8 pages
ÔN TẬP CK
No ratings yet
ÔN TẬP CK
3 pages
3rd Term Report
No ratings yet
3rd Term Report
1 page
TensorFlow 2 Pocket Primer: A Quick Reference Guide for TensorFlow 2 Developers
From Everand
TensorFlow 2 Pocket Primer: A Quick Reference Guide for TensorFlow 2 Developers
Mercury Learning and Information
No ratings yet
Hopfield Networks: Fundamentals and Applications of The Neural Network That Stores Memories
From Everand
Hopfield Networks: Fundamentals and Applications of The Neural Network That Stores Memories
Fouad Sabry
No ratings yet
Hebbian Learning: Fundamentals and Applications for Uniting Memory and Learning
From Everand
Hebbian Learning: Fundamentals and Applications for Uniting Memory and Learning
Fouad Sabry
No ratings yet
Hybrid Neural Networks: Fundamentals and Applications for Interacting Biological Neural Networks with Artificial Neuronal Models
From Everand
Hybrid Neural Networks: Fundamentals and Applications for Interacting Biological Neural Networks with Artificial Neuronal Models
Fouad Sabry
No ratings yet

EXAMPLE Machine Learning (C395) Exam Questions

Uploaded by

EXAMPLE Machine Learning (C395) Exam Questions

Uploaded by

EXAMPLE Machine Learning (C395) Exam Questions

Solution: Training can be posed as an optimization problem, in which the goal

Define explicitly the cost/error function E, assuming that a set of training

Solution: The error function: E = ∑d D (td – od)2

The gradient decent algorithm: Δwi = –α (∂E / ∂wi)

∂wi) = –∑d D 2(td – od) (xid)

=> Δwi = α ∑d D 2(td – od) xid

Solution: Given the target function representation

LMS training rule works as follows:

Substitute this in (*) à (∀wi) wi ← wi + (η/2)(–∂E / ∂wi)

(4) Question: Consider the following set of training examples:

What is the information gain of a2 relative to these training examples? Provide

s1/2: 1010101001010, 1011111110101, 0100011111101, 0011111001010,

You might also like