Lab 03 - Linear Regression
BESE – 13B
3rd February 2025
This laboratory covers the Python implementation of linear regression. Linear regression is
a basic supervised learning technique in which parameters are trained on a dataset to fit a
model that best approximates that dataset.
Objectives
Theory
Linear regression is a very basic supervised learning technique. To calculate the loss on each
training example, the difference between the hypothesis and the label (y) is computed. The
hypothesis is a linear equation of the features (x) in the dataset, with the coefficients acting as
the weight parameters. These weight parameters are initialized to random values at the start
but are then trained over time to learn the model. The cost function is used to calculate the
error between the predicted ŷ and the actual y.
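As a concrete illustration, the hypothesis can be evaluated for all examples at once with a
vectorized NumPy expression. This is a minimal sketch; the names hypothesis, X, w, and b are
illustrative assumptions, not part of the handout:

import numpy as np

def hypothesis(X, w, b):
    # Linear hypothesis h(x) = w . x + b, evaluated for all m examples at once.
    # X is an (m, n) feature matrix, w an (n,) weight vector, b a scalar bias.
    # Returns an (m,) array of predictions y_hat.
    return X @ w + b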
A major problem in training is that the weights may fit the model only to the data it was
given. Such a model will not generalize to examples outside the dataset; this is referred to as
“overfitting”. Overfitting makes the machine learning implementation impractical for real-life
applications, where data has high variation. To prevent overfitting of the model, a
modification to the cost function and gradient descent is implemented. This modification is
called regularization and is itself controlled by a hyperparameter (lambda).
cost_function(X, y, lambd)
The X and y are the features and labels of the dataset, so the function can be applied to either
the training examples or the cross-validation examples. The lambd is the regularization
parameter (note that lambda is a reserved keyword in Python). The function calculates the
losses and returns the overall cost value. The cost function is given by:
J(w) = \frac{1}{2m} \sum_{i=1}^{m} \left( h(x^{(i)}) - y^{(i)} \right)^2 + \frac{\lambda}{2m} \sum_{j=1}^{n} w_j^2
Here m is the number of examples in the dataset and n is the total number of features (or
non-bias weights) in the hypothesis. Write the code for the cost function and run it on both
your training and cross-validation datasets to print out the cost. Provide the code and all
relevant screenshots of the final output.
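A minimal sketch of this function is given below, assuming a NumPy implementation in
which the weights w and the bias b are passed in explicitly. The handout's signature lists only
X, y, and lambd, so the extra parameters are an assumption of this sketch:

import numpy as np

def cost_function(X, y, w, b, lambd):
    # Regularized mean-squared-error cost.
    # X: (m, n) features, y: (m,) labels, w: (n,) weights, b: scalar bias.
    m = X.shape[0]
    y_hat = X @ w + b                              # predictions for all m examples
    squared_error = np.sum((y_hat - y) ** 2) / (2 * m)
    reg_term = (lambd / (2 * m)) * np.sum(w ** 2)  # the bias b is not regularized
    return squared_error + reg_term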
The alpha is the learning rate (hyperparameter 1) and lambd is the regularization parameter
(hyperparameter 2). The gradient descent algorithm is given as follows:
dw_j = \frac{\partial J}{\partial w_j} = \frac{1}{m} \sum_{i=1}^{m} \left( h(x^{(i)}) - y^{(i)} \right) x_j^{(i)} + \frac{\lambda}{m} w_j

db = \frac{\partial J}{\partial b} = \frac{1}{m} \sum_{i=1}^{m} \left( h(x^{(i)}) - y^{(i)} \right)

w_j := w_j - \alpha \frac{\partial J}{\partial w_j}

b := b - \alpha \frac{\partial J}{\partial b}
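These updates translate almost line-for-line into NumPy. The sketch below performs a single
update step; the name gradient_descent_step and its parameter list are assumptions, so adapt
them to your own code:

import numpy as np

def gradient_descent_step(X, y, w, b, alpha, lambd):
    # One update of regularized gradient descent for linear regression.
    m = X.shape[0]
    error = (X @ w + b) - y                     # residuals h(x^(i)) - y^(i)
    dw = (X.T @ error) / m + (lambd / m) * w    # dJ/dw_j for all j at once
    db = np.sum(error) / m                      # dJ/db (no regularization term)
    w = w - alpha * dw
    b = b - alpha * db
    return w, b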
For the submission, you will need to run the gradient descent algorithm once to update the
weights. You will need to print the weights, training cost and validation cost both before and
after the weight update. Provide the code and all relevant screenshots of the final output.
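One possible driver for this part is sketched below. It assumes the sketches above are in
scope, that X_train, y_train, X_val, y_val, alpha, and lambd have already been defined, and
that w and b have been initialized to random values; all of these names are assumptions:

# State before the update.
print("weights:", w, "bias:", b)
print("training cost:  ", cost_function(X_train, y_train, w, b, lambd))
print("validation cost:", cost_function(X_val, y_val, w, b, lambd))

# Run gradient descent once to update the weights.
w, b = gradient_descent_step(X_train, y_train, w, b, alpha, lambd)

# State after the update.
print("weights:", w, "bias:", b)
print("training cost:  ", cost_function(X_train, y_train, w, b, lambd))
print("validation cost:", cost_function(X_val, y_val, w, b, lambd))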