0% found this document useful (0 votes)

37 views3 pages

Lab3Block1 2021-1

(1) This document provides instructions for a machine learning lab assignment involving kernel methods, support vector machines, and neural networks. (2) The first part of the assignment asks students to implement kernel regression to predict hourly temperatures using data on weather station locations and measurements. (3) Later parts involve using support vector machines to classify spam data and training neural networks to learn functions like the sine function.

Uploaded by

Alex Widén

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views3 pages

Lab3Block1 2021-1

Uploaded by

Alex Widén

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

732A99/TDDE01/732A68 MACHINE LEARNING

LAB 3 BLOCK 1: KERNEL METHODS, SUPPORT VECTOR MACHINES AND NEURAL

NETWORKS

JOSE M. PEÑA
IDA, LINKÖPING UNIVERSITY, SWEDEN

I NSTRUCTIONS
The instructions and submission procedure from the previous labs apply to this lab as well.

R ESOURCES
The only R package that is allowed to solve the assignment 1 is the geosphere package
(specifically, the function distHaversine). The assignment 2 is designed to be solved with
the package kernlab. The assignment 3 is designed to be solved with the neuralnet
package.

1. K ERNEL M ETHODS
Implement a kernel method to predict the hourly temperatures for a date and place in Swe-
den. To do so, you are provided with the files stations.csv and temps50k.csv. These
files contain information about weather stations and temperature measurements in the stations
at different days and times. The data have been kindly provided by the Swedish Meteorological
and Hydrological Institute (SMHI).
You are asked to provide a temperature forecast for a date and place in Sweden. The
forecast should consist of the predicted temperatures from 4 am to 24 pm in an interval of 2
hours. Use a kernel that is the sum of three Gaussian kernels:
● The first to account for the physical distance from a station to the point of interest. For
this purpose, use the function distHaversine from the R package geosphere.
● The second to account for the distance between the day a temperature measurement
was made and the day of interest.
● The third to account for the distance between the hour of the day a temperature mea-
surement was made and the hour of interest.
Choose an appropriate smoothing coefficient or width for each of the three kernels above.
No cross-validation should be used. Instead, choose manually a width that gives large kernel
values to closer points and small values to distant points. Show this with a plot of the kernel
value as a function of distance. Help: Note that the file temps50k.csv may contain temper-
ature measurements that are posterior to the day and hour of your forecast. You must filter
such measurements out, i.e. they cannot be used to compute the forecast.
Finally, repeat the exercise above by combining the three kernels into one by multiplying
them, instead of summing them up. Compare the results obtained in both cases and elaborate
on why they may differ.
The only R package that is allowed to solve this assignment is the geosphere package
(specifically, the function distHaversine). Feel free to use the template below to solve the
assignment.

set.seed(1234567890)
library(geosphere)

stations <- read.csv("stations.csv")

1
2

temps <- read.csv("temps50k.csv")

st <- merge(stations,temps,by="station_number")

h_distance <- # These three values are up to the students

h_date <-
h_time <-
a <- 58.4274 # The point to predict (up to the students)
b <- 14.826
date <- "2013-11-04" # The date to predict (up to the students)
times <- c("04:00:00", "06:00:00", ..., "24:00:00")

temp <- vector(length=length(times))

# Students’ code here

plot(temp, type="o")

2. S UPPORT V ECTOR M ACHINES

The code in the file Lab3Block1 2021 SVMs St.R performs SVM model selection to clas-
sify the spam dataset. To do so, the code uses the function ksvm from the R package
kernlab, which also includes the spam dataset. All the SVM models to select from use
the radial basis function kernel (also known as Gaussian) with a width of 0.05. The C param-
eter varies between the models. Run the code in the file Lab3Block1 2021 SVMs St.R and
answer the following questions.
(1) Which filter do you return to the user ? filter0, filter1, filter2 or filter3?
Why?
(2) What is the estimate of the generalization error of the filter returned to the user? err0,
err1, err2 or err3? Why?
(3) Once a SVM has been fitted to the training data, a new point is essentially classified
according to the sign of a linear combination of the kernel function values between the
support vectors and the new point. You are asked to implement this linear combination
for filter3. You should make use of the functions alphaindex, coef and b that
return the indexes of the support vectors, the linear coefficients for the support vectors,
and the negative intercept of the linear combination. See the help file of the kernlab
package for more information. You can check if your results are correct by comparing
them with the output of the function predict where you set type = "decision".
Do so for the first 10 points in the spam dataset. Feel free to use the template provided
in the Lab3Block1 2021 SVMs St.R file.

3. N EURAL N ETWORKS
This assignment is to be solved with the neuralnet package.
(1) Train a neural network to learn the trigonometric sine function. To do so, sample 500
points uniformly at random in the interval [0, 10
10]. Apply the sine function to each point.
The resulting value pairs are the data points available to you. Use 25 of the 500 points
for training and the rest for test. Use one hidden layer with 10 hidden units. You do
not need to apply early stopping. Plot the training and test data, and the predictions of
the learned NN on the test data. You should get good results. Comment your results.
(2) In question (1), you used the default logistic (a.k.a. sigmoid) activation function, i.e.
act.fct = "logistic". Repeat question (1) with the following custom activation
functions: h1 (x) = x, h2 (x) = max{0, x} and h3 (x) = ln(1 + exp x) (a.k.a. linear, ReLU
and softplus). See the help file of the neuralnet package to learn how to use custom
activation functions. Plot and comment your results.
3

(3) Sample 500 points uniformly at random in the interval [0, 50

50], and apply the sine func-
tion to each point. Use the NN learned in question (1) to predict the sine function value
for these new 500 points. You should get mixed results. Plot and comment your results.
(4) In question (3), the predictions seem to converge to some value. Explain why this
happens. To answer this question, you may need to get access to the weights of the
NN learned. You can do it by running nn or nn$weights where nn is the NN learned.
(5) Sample 500 points uniformly at random in the interval [0, 10
10], and apply the sine func-
tion to each point. Use all these points as training points for learning a NN that tries
to predict x from sin(x)
sin(x), i.e. unlike before when the goal was to predict sin(x) from
x. Use the learned NN to predict the training data. You should get bad results. Plot
and comment your results. Help: Some people get a convergence error in this ques-
tion. It can be solved by stopping the training before reaching convergence by setting
threshold = 0.1.
Feel free to use the following template to solve the exercises above.

library(neuralnet)
set.seed(1234567890)

Var <- runif(500, 0, 10)

mydata <- data.frame(Var, Sin=sin(Var))
tr <- mydata[1:25,] # Training
te <- mydata[26:500,] # Test

# Random initialization of the weights in the interval [-1, 1]

winit <- # Your code here
nn <- neuralnet(# Your code here)

# Plot of the training data (black), test data (blue), and predictions (red)

plot(tr, cex=2)
points(te, col = "blue", cex=1)
points(te[,1],predict(nn,te), col="red", cex=1)

Practice Midterm
No ratings yet
Practice Midterm
4 pages
Machine Learning Methods in Environmental Sciences
100% (2)
Machine Learning Methods in Environmental Sciences
365 pages
Matlab Homework Experts 2
No ratings yet
Matlab Homework Experts 2
10 pages
HW 3
No ratings yet
HW 3
7 pages
DA All
No ratings yet
DA All
15 pages
Practice Midterm 2010
No ratings yet
Practice Midterm 2010
4 pages
Fun Least Squares
No ratings yet
Fun Least Squares
3 pages
Statlearn PDF
No ratings yet
Statlearn PDF
123 pages
12s 701 Final
No ratings yet
12s 701 Final
17 pages
Assignment 3
No ratings yet
Assignment 3
2 pages
Exercise - 3: DS203-2024-S1 Roll Number: 23B2215
No ratings yet
Exercise - 3: DS203-2024-S1 Roll Number: 23B2215
25 pages
HW 1
No ratings yet
HW 1
4 pages
2019-20-I ES Key
No ratings yet
2019-20-I ES Key
4 pages
Sol Eval 1
No ratings yet
Sol Eval 1
4 pages
Stanford ML
No ratings yet
Stanford ML
168 pages
Assignment 2
No ratings yet
Assignment 2
3 pages
Homework 2: SVM, Kernel Methods, Ensemble Learning, Learning Theory
No ratings yet
Homework 2: SVM, Kernel Methods, Ensemble Learning, Learning Theory
12 pages
Problem Sets
No ratings yet
Problem Sets
47 pages
ActivityGuide Rubric Phase2
No ratings yet
ActivityGuide Rubric Phase2
12 pages
Dda3020 2024F HW1
No ratings yet
Dda3020 2024F HW1
6 pages
Week 4 Linear Regression
No ratings yet
Week 4 Linear Regression
38 pages
HW 1
No ratings yet
HW 1
12 pages
Lecture Slides Week11
No ratings yet
Lecture Slides Week11
33 pages
1st Exam Question Paper
No ratings yet
1st Exam Question Paper
2 pages
Lecture Slides-Week11
No ratings yet
Lecture Slides-Week11
32 pages
Stochastic Gradient Descent 1
No ratings yet
Stochastic Gradient Descent 1
42 pages
Lecture10 Mid
No ratings yet
Lecture10 Mid
43 pages
Problemset2 PDF
No ratings yet
Problemset2 PDF
4 pages
Department of Electrical Engineering School of Science and Engineering EE514/CS535 Machine Learning Homework 1
No ratings yet
Department of Electrical Engineering School of Science and Engineering EE514/CS535 Machine Learning Homework 1
11 pages
Exam in Statistical Machine Learning Statistisk Maskininlärning (1RT700)
No ratings yet
Exam in Statistical Machine Learning Statistisk Maskininlärning (1RT700)
11 pages
Exercise 01
No ratings yet
Exercise 01
3 pages
DSCI 303: Machine Learning For Data Science Fall 2020
No ratings yet
DSCI 303: Machine Learning For Data Science Fall 2020
5 pages
Machine Learning Homework
No ratings yet
Machine Learning Homework
8 pages
Stat444 Notes
No ratings yet
Stat444 Notes
37 pages
Exercises Question
No ratings yet
Exercises Question
30 pages
50 Inference
No ratings yet
50 Inference
31 pages
ML 20231026 1
No ratings yet
ML 20231026 1
8 pages
1 Analytical Part (3 Percent Grade) : + + + 1 N I: y +1 I 1 N I: y 1 I
No ratings yet
1 Analytical Part (3 Percent Grade) : + + + 1 N I: y +1 I 1 N I: y 1 I
5 pages
Kernel PCA
No ratings yet
Kernel PCA
13 pages
CS 229, Public Course Problem Set #3: Learning Theory and Unsuper-Vised Learning
No ratings yet
CS 229, Public Course Problem Set #3: Learning Theory and Unsuper-Vised Learning
4 pages
KNN - Model: Train Test CL K
No ratings yet
KNN - Model: Train Test CL K
2 pages
CMU 2018s NinaBALCAN HW3
No ratings yet
CMU 2018s NinaBALCAN HW3
7 pages
Exam in Statistical Machine Learning Statistisk Maskininlärning (1RT700)
No ratings yet
Exam in Statistical Machine Learning Statistisk Maskininlärning (1RT700)
10 pages
ML Lab Experiments (1) - Pages-5
No ratings yet
ML Lab Experiments (1) - Pages-5
8 pages
Ps 2
No ratings yet
Ps 2
11 pages
Final F04soln
No ratings yet
Final F04soln
10 pages
hw3 Solutions PDF
No ratings yet
hw3 Solutions PDF
11 pages
Midterm 2010 Solutions
No ratings yet
Midterm 2010 Solutions
8 pages
ML 20240315
No ratings yet
ML 20240315
8 pages
HW 5
100% (1)
HW 5
11 pages
Assignment
No ratings yet
Assignment
7 pages
MidA F21
No ratings yet
MidA F21
8 pages
ML Labs
No ratings yet
ML Labs
46 pages
Week 1 HW
No ratings yet
Week 1 HW
3 pages
Lect 1
No ratings yet
Lect 1
24 pages
Stanford University CS 229, Autumn 2015 Midterm Examination
No ratings yet
Stanford University CS 229, Autumn 2015 Midterm Examination
25 pages
Module 4: Recommended Exercises: Problem 1: KNN (Exercise 2.4.7 in ISL Textbook, Slightly Modified)
No ratings yet
Module 4: Recommended Exercises: Problem 1: KNN (Exercise 2.4.7 in ISL Textbook, Slightly Modified)
6 pages
ML PG Assignment 3
No ratings yet
ML PG Assignment 3
3 pages
07au Midterm
No ratings yet
07au Midterm
17 pages
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
From Everand
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
César Pérez López
No ratings yet
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
BCG Turning Visibility Into Value in Digital Supply Chains
No ratings yet
BCG Turning Visibility Into Value in Digital Supply Chains
8 pages
XEmoAccent Embracing Diversity in Cross-Accent Emo
No ratings yet
XEmoAccent Embracing Diversity in Cross-Accent Emo
19 pages
Seminar Report
No ratings yet
Seminar Report
29 pages
Parisodhana 2025 Template
No ratings yet
Parisodhana 2025 Template
1 page
At2 Final
No ratings yet
At2 Final
23 pages
ChatGPT For Higher Education and Professional Development - A Guid
No ratings yet
ChatGPT For Higher Education and Professional Development - A Guid
135 pages
SOM Exemplo
No ratings yet
SOM Exemplo
5 pages
Computer Science 63
No ratings yet
Computer Science 63
12 pages
Attention Is All You Need - Transformer
No ratings yet
Attention Is All You Need - Transformer
12 pages
IT Practical File 9th
No ratings yet
IT Practical File 9th
23 pages
Intrusion Detection Using Pca With Random Forest
No ratings yet
Intrusion Detection Using Pca With Random Forest
6 pages
IOT and Embedded Based Smart Baby Cradle
No ratings yet
IOT and Embedded Based Smart Baby Cradle
5 pages
Cse Cic Ids Dataset
No ratings yet
Cse Cic Ids Dataset
19 pages
Python - Final 1
No ratings yet
Python - Final 1
17 pages
3CP10 MJJ Clustering Intro
No ratings yet
3CP10 MJJ Clustering Intro
18 pages
Random Forest Classifier
No ratings yet
Random Forest Classifier
9 pages
Machine Learning Laboratory Record Book: 1 Find S Algorithm
No ratings yet
Machine Learning Laboratory Record Book: 1 Find S Algorithm
22 pages
Media Piracy Detection Using Artificial Intelligence, Machine Learning and Data Mining
No ratings yet
Media Piracy Detection Using Artificial Intelligence, Machine Learning and Data Mining
3 pages
BUETK Students Employment Prediction Using Machine Learning
No ratings yet
BUETK Students Employment Prediction Using Machine Learning
5 pages
Unit 4 - Aia
No ratings yet
Unit 4 - Aia
32 pages
Congnizant Technoverse
No ratings yet
Congnizant Technoverse
8 pages
Machine Learning - Part 1
100% (1)
Machine Learning - Part 1
80 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
3 pages
A Tiered GAN Approach For Monet-Style Image Generation
No ratings yet
A Tiered GAN Approach For Monet-Style Image Generation
6 pages
AI-Driven V2V Communications: Developing A Machine Learning Framework For Vehicle-to-Vehicle (V2V) Communication and Intelligent Traffic Management
No ratings yet
AI-Driven V2V Communications: Developing A Machine Learning Framework For Vehicle-to-Vehicle (V2V) Communication and Intelligent Traffic Management
17 pages
Ds CV Sanjay Chhaba o
No ratings yet
Ds CV Sanjay Chhaba o
1 page
30 Days ML Projects Challenge
No ratings yet
30 Days ML Projects Challenge
288 pages
Survey Paper 2
No ratings yet
Survey Paper 2
31 pages
Sat - 75.Pdf - Analysis of Automatic Genger Prediction in Social Media by Using Xgboost Algorithm
No ratings yet
Sat - 75.Pdf - Analysis of Automatic Genger Prediction in Social Media by Using Xgboost Algorithm
11 pages

Lab3Block1 2021-1

Uploaded by

Lab3Block1 2021-1

Uploaded by

732A99/TDDE01/732A68 MACHINE LEARNING

LAB 3 BLOCK 1: KERNEL METHODS, SUPPORT VECTOR MACHINES AND NEURAL

stations <- read.csv("stations.csv")

temps <- read.csv("temps50k.csv")

h_distance <- # These three values are up to the students

temp <- vector(length=length(times))

# Students’ code here

2. S UPPORT V ECTOR M ACHINES

(3) Sample 500 points uniformly at random in the interval [0, 50

Var <- runif(500, 0, 10)

# Random initialization of the weights in the interval [-1, 1]

You might also like