0% found this document useful (0 votes)

27 views7 pages

IML21 Term1

The document outlines the structure and guidelines for timed remote assessments for various computing and engineering degrees at Imperial College London for the 2021-2022 academic year. It includes specific instructions for an open book assessment on machine learning, detailing the format, rules against plagiarism, and the types of questions to be answered. Additionally, it provides a framework for evaluating student performance and maintaining academic integrity during the examination process.

Uploaded by

Alexander Arzt

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

27 views7 pages

IML21 Term1

Uploaded by

Alexander Arzt

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

I MPERIAL C OLLEGE L ONDON

T IMED R EMOTE A SSESSMENTS 2021-2022

BEng Honours Degree in Computing Part III

BEng Honours Degree in Electronic and Information Engineering Part III
MEng Honours Degree in Electronic and Information Engineering Part III
MEng Honours Degree in Electronic and Information Engineering Part IV
BEng Honours Degree in Mathematics and Computer Science Part III
MEng Honours Degree in Mathematics and Computer Science Part III
MEng Honours Degrees in Computing Part III
MSc Advanced Computing
MSc Artificial Intelligence
MSc in Computing (Specialism)
for Internal Students of the Imperial College of Science, Technology and Medicine
This paper is also taken for the relevant assessments for the
Associateship of the City and Guilds of London Institute

PAPER COMP70050=COMP97101=COMP97151

INTRODUCTION TO MACHINE LEARNING (TERM1)

Tuesday 14 December 2021, 10:00

Writing time: 90 minutes
Upload time: 25 minutes

Answer ALL THREE questions

Open book assessment

This time-limited remote assessment has been designed to be open book. You may use resources which have been identified by the examiner to
complete the assessment and are included in the instructions for the examination. You must not use any additional resources when completing
this assessment.
The use of the work of another student, past or present, constitutes plagiarism. Giving your work to another student to use constitutes an
offence. Collusion is a form of plagiarism and will be treated in a similar manner. This is an individual assessment and thus should be completed
solely by you. The College will investigate all instances where an examination or assessment offence is reported or suspected, using plagiarism
software, vivas and other tools, and apply appropriate penalties to students. In all examinations we will analyse exam performance against
previous performance and against data from previous years and use an evidence-based approach to maintain a fair and robust examination. As
with all exams, the best strategy is to read the question carefully and answer as fully as possible, taking account of the time and number of marks
available.

Paper contains 3 questions

1a You are building a binary K-nearest neighbours classifier.
You are given the following table for 6 test instances.
Label represents the correct ground truth label for the test instance.
NN1 represents the label of the training instance closest to the test instance.
NN2 represents the label of the second closest training instance. NN3 the third
closest, NN4 the fourth closest, NN5 the fifth closest.

Predictions
# NN1 NN2 NN3 NN4 NN5 Label
(1-NN) (3-NN) (5-NN)
1 + + + × + + +
2 + × + + × + +
3 × + × + + + ×
4 × + × × × × ×
5 + × × × + × +
6 × + + × + × ×
i) Compute the predicted labels (+ or ×) for a 3-nearest neighbours and
5-nearest neighbours classifier for each test instance in the table above.
The predictions for a 1-nearest neighbour are provided as an example.
There is no need to provide any calculations for this question. Just provide
the predicted labels (either by filling up the table above, or writing them
down on a blank sheet).

ii) In one sentence, explain why increasing the K in a K-nearest neighbours

classifier generally results in better accuracy.

iii) Assume the weights for the five neighbours of test instance #6 are 0.6,
0.15, 0.1, 0.1, and 0.05 respectively. What is the prediction for a
distance-weighted nearest neighbour classifier for this test instance (+ or
×)? Justify your answer in one sentence (there is no need to show your
calculations).

© Imperial College London 2021 - 2022 Paper COMP70050=COMP97101=COMP97151 Page 1 of 6

b You would like to train a decision tree that classifies whether a book belongs to
either one of three genres: fantasy, romance, or horror.
You want to find out whether the frequency of the word love in the book is a
good attribute for your decision tree. You have discretised the frequency of the
word into two groups: low and high.
Out of the 100 books you have, 30 are fantasy, 40 are romance, and 30 are
horror.
Out of the 30 fantasy books, 15 mention the word love with low frequency, and
15 mention the word with high frequency.
Out of the 40 romance books, 10 mention the word love with low frequency, and
30 mention the word with high frequency.
Out of the 30 horror books, 25 mention the word love with low frequency, and 5
mention the word with high frequency.
Compute the information gain for selecting the frequency of the word love as
an attribute for your decision tree, with respect to the initial entropy of the whole
dataset of 100 books.
Please use log2 for all calculations, and show all intermediate calculations.
Hint: Entropies can be greater than 1 when there are more than two categories.
You may find the following calculations useful:
0.1 × log2 (0.1) = −0.3322
0.2 × log2 (0.2) = −0.4644
0.3 × log2 (0.3) = −0.5211
0.4 × log2 (0.4) = −0.5288
0.5 × log2 (0.5) = −0.5000
0.6 × log2 (0.6) = −0.4422
0.7 × log2 (0.7) = −0.3602
0.8 × log2 (0.8) = −0.2575
0.9 × log2 (0.9) = −0.1368
1.0 × log2 (1.0) = 0

The two parts carry equal marks.

© Imperial College London 2021 - 2022 Paper COMP70050=COMP97101=COMP97151 Page 2 of 6

2 You are building a machine learning model to predict the result of taking a
French language test, based on three features: the number of years the person has
studied French (x1 ), the number of years they have studied any foreign language
(x2 ), and the number of hours they spent preparing for this test (x3 ). The feature
values are normalised to be in a range between 0 and 1.
You have the following neural network architecture:

The network takes the three features as input. There is one hidden neuron (h)
with tanh activation. The output (ŷ) is a single neuron with linear activation,
predicting the resulting mark in a range between 0-10. The weights of the
network have been randomly initialised and can be seen on the diagram. The
network does not have any bias parameters.
If you need to make any assumptions, state them clearly in your answer.

a You are given normalised feature values for one test taker: 0.5 for the number of
years the person has studied French, 0.6 for the number of years they have
studied any foreign language, and 0.7 for the number of hours they spent
preparing for this test.
Find what result does the model predict for this person. Demonstrate your
calculations and intermediate steps.

b After the person takes the test, they receive mark 5.0 as their official result. You
now want to use this knowledge to improve your model.
Calculate the updated values for all the trainable parameters in this network after
one step of stochastic gradient descent using this datapoint. Use mean squared
error as the loss function and 0.5 as the learning rate.
Show the path of your calculations.

c Below is a table with predicted scores and true results for 5 test takers. You want
to evaluate how well the model is able to predict who passes the test (receives a
score ≥ 6.0). Calculate precision, recall and F-score for the system.

c Imperial College London 2021 - 2022 Paper COMP70050=COMP97101=COMP97151 Page 3 of 6

Person ID Predicted result True result
1 6.44 7.5
2 6.82 5
3 5.37 6.5
4 8.59 9
5 4.21 8

The three parts carry, respectively, 30%, 50%, and 20% of the marks.

c Imperial College London 2021 - 2022 Paper COMP70050=COMP97101=COMP97151 Page 4 of 6

3 After running the MAP-Elites algorithms several times independently, the
content of all the grids is reported in the table below:
index Gen/Phen BD Fitness
1 0.9 0.8 0.4 4 7 0.4
2 0.5 0.2 0.5 7 1 1.1
3 0.2 0.2 0.9 9 2 1.2
4 0.2 0.9 0.5 9 3 1.0
5 0.1 0.2 0.9 2 1 1.2
6 0.2 0.7 0.1 8 9 0.9
7 0.2 0.3 0.3 4 8 0.6
8 0.9 1.0 0.3 4 8 0.5
9 0.2 0.4 0.4 3 4 1.1
10 0.3 0.4 0.5 6 6 0.5
11 1.0 0.2 0.5 7 2 1.1
12 0.2 0.7 0.1 8 9 0.9
13 0.4 0.0 0.6 1 1 1.5
14 0.2 0.7 0.1 8 9 0.9
In this experiment, there is no difference between the genotype and the
phenotype (i.e., the function to develop the genotype into the phenotype is the
identity function). The behavioural descriptor (BD) is a 2D vector. The index
column corresponds to the row number of the table, which is provided to more
easily refer to elements in the table, if necessary.

a Initially, the grid was composed of 100 cells (i.e., 10x10 resolution). However,
we can observe that a significantly lower number of solutions has been reported.
To avoid this, we want to group all the found solutions into three clusters.
Apply one iteration of the k-Means algorithm with k = 3 and the following
data-points as initial values for the centroids. Show the details of your
calculations.
Please use the Manhattan distance (the L1-norm) as the distance metric for
this question.
Initial Centroids
c1 1 1
c2 9 1
c3 8 9

b To make this question independent from the previous one, let’s assume that we
ran k-Means with k = 4 and obtained the following centroids:
c1 1 3
c2 7 3
c3 5 7
c4 8 9

i) We want to replace the original grid of MAP-Elites, with the clusters and
centroids obtained from k-Means. For this, we need to update the function
of MAP-Elites “add to collection(solution)” that takes as input a solution
and adds it to the collection of solutions if the solution is either novel, or
better than previously encountered one. This function returns nothing.
Write the pseudo code of the new version of this function so that it uses
the clusters and centroids defined above instead of a grid.
You can assume that a solution has these three attributes: Genotype, BD,
Fitness.

ii) Assume that we start with an empty collection. Use your

“add to collection(solution)” function to potentially add in the collection
each sample of the dataset provided at the beginning of this question. Give
the content of the collection at the end of this process.

iii) Quantify the diversity, the performance and the QD score of the
collection created above.

c We use (µ + λ) − ES to improve the performance of the solution with index 14

(also reported in the table below). The parameters are set as follow: µ = 1 and
λ = 5, and the results of the evaluation of the offsprings (Ok ) from the first
generation is listed below:
index Gen/Phen Fitness
14 0.2 0.7 0.1 0.9
O1 0.3 0.7 0.1 0.7
O2 0.2 0.8 0.1 0.6
O3 0.2 0.7 0.2 0.7
O4 0.1 0.6 0.1 0.5
O5 0.3 0.8 0.1 0.8
i) Give the fitness and genotype of the parent(s) selected for the second
generation.

ii) Instead of using an ES algorithm, we want to use the gradient ascent

algorithm. Explain in two or three sentences how this can work in the
context of a black box problem.

The three parts carry, respectively, 40%, 40%, and 20% of the marks.

ML Midsem 2022
No ratings yet
ML Midsem 2022
8 pages
Machine Learning PYQ 2022 Ans
No ratings yet
Machine Learning PYQ 2022 Ans
17 pages
ML June 2024
No ratings yet
ML June 2024
12 pages
Introduction To Machine Learning IIT KGP Week 2
100% (1)
Introduction To Machine Learning IIT KGP Week 2
14 pages
AI Lecture 12-b
No ratings yet
AI Lecture 12-b
20 pages
MachineLearning MidTerm UMT Spring 2021
100% (1)
MachineLearning MidTerm UMT Spring 2021
12 pages
Midterm Solutions
No ratings yet
Midterm Solutions
8 pages
6.034 Quiz 2 November 17, 2003: Name Email
No ratings yet
6.034 Quiz 2 November 17, 2003: Name Email
20 pages
Department of Electrical Engineering School of Science and Engineering EE514/CS535 Machine Learning Homework 2
No ratings yet
Department of Electrical Engineering School of Science and Engineering EE514/CS535 Machine Learning Homework 2
8 pages
Id5059 23 2 1
No ratings yet
Id5059 23 2 1
8 pages
Winter 21 Exam 1
No ratings yet
Winter 21 Exam 1
17 pages
Ifjo 320 Fy 98324 Fo 3 F 2 Ifr
No ratings yet
Ifjo 320 Fy 98324 Fo 3 F 2 Ifr
6 pages
Machine 2021 Jan-Apr
No ratings yet
Machine 2021 Jan-Apr
45 pages
Final Compre - Solutions - Updated FoDS
No ratings yet
Final Compre - Solutions - Updated FoDS
12 pages
Final
No ratings yet
Final
13 pages
Ain Shams University Faculty of Engineering
No ratings yet
Ain Shams University Faculty of Engineering
8 pages
Ad
No ratings yet
Ad
5 pages
Drilling For Non Technical People
100% (5)
Drilling For Non Technical People
87 pages
MLvsMAP Merged
No ratings yet
MLvsMAP Merged
208 pages
Exam 2017
No ratings yet
Exam 2017
8 pages
Machine Learning Foundations and Applications Assignment 1 Due Date: 10 October, 2021
No ratings yet
Machine Learning Foundations and Applications Assignment 1 Due Date: 10 October, 2021
3 pages
University of Edinburgh College of Science and Engineering School of Informatics
No ratings yet
University of Edinburgh College of Science and Engineering School of Informatics
5 pages
Lab 6
No ratings yet
Lab 6
6 pages
DS3001 - DAV - Final Exam - Fall23 - v3
No ratings yet
DS3001 - DAV - Final Exam - Fall23 - v3
14 pages
Midterm Examination CS540-1: Introduction To Artificial Intelligence
No ratings yet
Midterm Examination CS540-1: Introduction To Artificial Intelligence
4 pages
Questions and Solutions On Linear Regression
No ratings yet
Questions and Solutions On Linear Regression
5 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
56 pages
Kinetic Theory of Gases Notes
No ratings yet
Kinetic Theory of Gases Notes
5 pages
Iml20 Term
No ratings yet
Iml20 Term
7 pages
INF8953CE Final Exam Questions 2020
No ratings yet
INF8953CE Final Exam Questions 2020
5 pages
CS246 Final Exam Solutions, Winter 2011
No ratings yet
CS246 Final Exam Solutions, Winter 2011
18 pages
MSBD5001 WrittenAssignment2 2024F
No ratings yet
MSBD5001 WrittenAssignment2 2024F
5 pages
IBM322 Last Year ETE
No ratings yet
IBM322 Last Year ETE
5 pages
Machine Learning PYQ 2021
No ratings yet
Machine Learning PYQ 2021
4 pages
2019-20-I ES Key
No ratings yet
2019-20-I ES Key
4 pages
Mid-Term A2 ML Solution
No ratings yet
Mid-Term A2 ML Solution
7 pages
Cs3002 Question Paper 2015.16 - Externalreviewed
No ratings yet
Cs3002 Question Paper 2015.16 - Externalreviewed
5 pages
ML 20230316 1
No ratings yet
ML 20230316 1
9 pages
22EE514
No ratings yet
22EE514
6 pages
Solutions: 10-601 Machine Learning, Midterm Exam: Spring 2008 Solutions
No ratings yet
Solutions: 10-601 Machine Learning, Midterm Exam: Spring 2008 Solutions
8 pages
2011 End Spring 2011 Computer Science Machine Learning
No ratings yet
2011 End Spring 2011 Computer Science Machine Learning
10 pages
Mid-Sem 9
No ratings yet
Mid-Sem 9
2 pages
2023-24 AIML ML Mid-Semester Regular QP Anwer-Keys
No ratings yet
2023-24 AIML ML Mid-Semester Regular QP Anwer-Keys
4 pages
ML End Sem Nov2024 Paper
No ratings yet
ML End Sem Nov2024 Paper
4 pages
Midterm 2006
No ratings yet
Midterm 2006
11 pages
DT 2023 24 Sols
No ratings yet
DT 2023 24 Sols
8 pages
ML 2024a QP Solution Full
No ratings yet
ML 2024a QP Solution Full
13 pages
(Fall 2011) CS-402 Data Mining - Final Exam-SUB - v03
No ratings yet
(Fall 2011) CS-402 Data Mining - Final Exam-SUB - v03
6 pages
15-381 Spring 2007 Assignment 6: Learning
No ratings yet
15-381 Spring 2007 Assignment 6: Learning
14 pages
2024 Machine Learning
No ratings yet
2024 Machine Learning
8 pages
AI2002 - Final Exam Paper-2024 - CS
No ratings yet
AI2002 - Final Exam Paper-2024 - CS
4 pages
Action Plan in Mathematics Grade 4-Integrity
100% (2)
Action Plan in Mathematics Grade 4-Integrity
2 pages
Cos4852 2018 A1
No ratings yet
Cos4852 2018 A1
11 pages
AI Final Spring 2021
No ratings yet
AI Final Spring 2021
3 pages
2022 CS244 End Sem Soln
No ratings yet
2022 CS244 End Sem Soln
6 pages
2023-24 ML End-Semester Make-Up QP Anwer-Keys
No ratings yet
2023-24 ML End-Semester Make-Up QP Anwer-Keys
9 pages
Power of Attorney Sample SBI
67% (3)
Power of Attorney Sample SBI
3 pages
Product Competitiveness Towards Profitability of Selected Bakeries in Metro Silang, Cavite Jenny Amil Cherry Anne Baysa Jefferson Canadalla
No ratings yet
Product Competitiveness Towards Profitability of Selected Bakeries in Metro Silang, Cavite Jenny Amil Cherry Anne Baysa Jefferson Canadalla
27 pages
Guieline Full
No ratings yet
Guieline Full
460 pages
Zapper Frequency Generator Hulda Clark Royal Rife
No ratings yet
Zapper Frequency Generator Hulda Clark Royal Rife
48 pages
Substation Automation
100% (1)
Substation Automation
49 pages
Real Test Bank Legal and Ethical Aspects of Health Information Management 4th Edition by Dana C McWay Ebook and TestBank Bundle Digital Bundle
No ratings yet
Real Test Bank Legal and Ethical Aspects of Health Information Management 4th Edition by Dana C McWay Ebook and TestBank Bundle Digital Bundle
351 pages
History of English Language
No ratings yet
History of English Language
10 pages
Concrete Hollow Blocks
No ratings yet
Concrete Hollow Blocks
6 pages
Most Favoured Nation Concept in Wto
100% (1)
Most Favoured Nation Concept in Wto
23 pages
Design and Installation of Deep Benchmarks
No ratings yet
Design and Installation of Deep Benchmarks
8 pages
CS236 Hw2 Answers
No ratings yet
CS236 Hw2 Answers
14 pages
Historia Royal Hotel & Spa 1 - New Bhupalpura, 100 FT Road, Udaipur, Rajasthan, 313001 Tel No +91 7891434548 Info@Historiaroyal - Com-2
No ratings yet
Historia Royal Hotel & Spa 1 - New Bhupalpura, 100 FT Road, Udaipur, Rajasthan, 313001 Tel No +91 7891434548 Info@Historiaroyal - Com-2
12 pages
Artistic Maps in GIMP
No ratings yet
Artistic Maps in GIMP
22 pages
PGVCL Vsja Result
No ratings yet
PGVCL Vsja Result
1,339 pages
ItaleriAcrylicPaint Conversion Chart
No ratings yet
ItaleriAcrylicPaint Conversion Chart
1 page
Potentio
No ratings yet
Potentio
12 pages
Relators Application For Order Requiring Citation
No ratings yet
Relators Application For Order Requiring Citation
63 pages
Media Ethics QP
No ratings yet
Media Ethics QP
2 pages
E010OBS
No ratings yet
E010OBS
4 pages
Improvised Project Rubric
No ratings yet
Improvised Project Rubric
1 page
Process Oils and Their Uses in Rubber
No ratings yet
Process Oils and Their Uses in Rubber
7 pages
Transistor 2sb1197k ROHM
No ratings yet
Transistor 2sb1197k ROHM
3 pages
Sol. Chp. 22 - (Part 1)
No ratings yet
Sol. Chp. 22 - (Part 1)
20 pages
Searchq 8070+Mytee+Lite&Rlz 1CDGOYI EnUS1063US1063&Oq 80&Gs LCRP EgZjaHJvbWUqDggBEEUYJxg7GIAEGIoFMggIABB
No ratings yet
Searchq 8070+Mytee+Lite&Rlz 1CDGOYI EnUS1063US1063&Oq 80&Gs LCRP EgZjaHJvbWUqDggBEEUYJxg7GIAEGIoFMggIABB
1 page
lpdsc212 - Yenny Gunawan - Tektonik Arsitektur Joglo-P PDF
No ratings yet
lpdsc212 - Yenny Gunawan - Tektonik Arsitektur Joglo-P PDF
31 pages
Phrasal Verbs
No ratings yet
Phrasal Verbs
10 pages
BMFM 33141 Group Assignment (Case Study) March 2022
No ratings yet
BMFM 33141 Group Assignment (Case Study) March 2022
3 pages
Ir 4426
No ratings yet
Ir 4426
12 pages
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
From Everand
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
SUJAUL CHOWDHURY
No ratings yet
IGNOU MCA Design and Analysis of Algorithms Previous Years Unsolved Papers MCS 211
From Everand
IGNOU MCA Design and Analysis of Algorithms Previous Years Unsolved Papers MCS 211
Manish Soni
No ratings yet
Fundamental Math
From Everand
Fundamental Math
Russell Pead
No ratings yet
A Complete Guide to M.C.Q (Class-10, Mathematics): CBSE MCQ Series, #1
From Everand
A Complete Guide to M.C.Q (Class-10, Mathematics): CBSE MCQ Series, #1
Er. Sajal Kumar Ghosh
No ratings yet
IGNOU BCA Introduction to Algorithm Design Previous Year Unsolved Papers BCS 042
From Everand
IGNOU BCA Introduction to Algorithm Design Previous Year Unsolved Papers BCS 042
Manish Soni
No ratings yet
IGNOU BCA Statistical Techniques Previous Year Unsolved Papers BCS 040
From Everand
IGNOU BCA Statistical Techniques Previous Year Unsolved Papers BCS 040
Manish Soni
No ratings yet