0% found this document useful (0 votes)

51 views10 pages

Programming Assignment3

Uploaded by

vidhishaanand017

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

51 views10 pages

Programming Assignment3

Uploaded by

vidhishaanand017

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Programming Assignment 3

In [2]: import numpy as np

import [Link] as plt

Question 1
Consider the following points on the plane
{(−3, −3.5), (−2, −1), (−2, −0.5), (−1, 0.5), (0, 1), (0, 2.5), (1, 3), (1, 4.8), (2, 6), (3, 7), (3, 10), }

Write a function poly_fit2d(data, d) that returns the coefficients of the best fitting polynomial of degree d through the points in set of points named data together with the
residual norm.
A. Find the best linear fit to the points. What is the residual norm?
B. Find the best cubic fit to the points. What is the residual norm?
C. Use MatplotLib to plot the points and the two curves on the same axes.
In [ ]: # Your Code Starts Here
data = [Link]([
(-3, -3.5), (-2, -1), (-2, -0.5), (-1, 0.5), (0, 1), (0, 2.5),
(1, 3), (1, 4.8), (2, 6), (3, 7), (3, 10)
])
def poly_fit2d(data,d):
x = data[:, 0]
y = data[:, 1]

A = [Link]((len(x), d+1))
for i in range(len(x)):
for j in range(d + 1):
A[i, j] = x[i] ** (d - j)

AtA = [Link]((d + 1, d + 1))

Atb = [Link](d + 1)

for i in range(d + 1):

for j in range(d + 1):
for k in range(len(x)):
AtA[i, j] += A[k, i] * A[k, j]
for k in range(len(x)):
Atb[i] += A[k, i] * y[k]

coeffs = [Link](AtA, Atb)

residuals = [Link](len(x))
for i in range(len(x)):
fitted_value = sum(coeffs[j] * (x[i] ** (d - j)) for j in range(d + 1))
residuals[i] = y[i] - fitted_value
residual_norm = [Link](sum(residual ** 2 for residual in residuals))

return coeffs, residual_norm

linear_coeffs, linear_residual_norm = poly_fit2d(data, 1)

cubic_coeffs, cubic_residual_norm = poly_fit2d(data, 3)

x_values = [Link](-4, 4, 100)

linear_fit = [sum(linear_coeffs[j] * (x ** (1 - j)) for j in range(2)) for x in x_values]

cubic_fit = [sum(cubic_coeffs[j] * (x ** (3 - j)) for j in range(4)) for x in x_values]

[Link](figsize=(10, 6))
[Link](data[:, 0], data[:, 1], 'o', label='Data Points', markersize=8)
[Link](x_values, linear_fit, label=f'Linear Fit (Residual Norm: {linear_residual_norm:.2f})')
[Link](x_values, cubic_fit, label=f'Cubic Fit (Residual Norm: {cubic_residual_norm:.2f})')
[Link]('x')
[Link]('y')
[Link]('Best Linear and Cubic Fits to Data Points')
[Link]()
[Link]()
Question 2
Study the Python notebook on SVD for image compression. Pick up your own high resolution color image to experiment with. What is the number of components to get a
k

decent reconstruction after decomposition. Display the relative error in the compression for the given value. Make a plot of relative errors for different values.
k k

In [63]: # Your Code Starts Here

img = [Link]("[Link]")

print("Original shape: ",[Link])

[Link](img)
[Link]()

Original shape: (1196, 1920, 3)

In [64]: def svd(image,k):
compressed = []
for n in range(3):
U, S, V = [Link](image[..., n])
U_k = U[:, :k]
S_k = [Link](S[:k])
V_k = V[:k, :]
compress = [Link](U_k, [Link](S_k, V_k))
[Link](compress)

compressed_img = [Link](compressed, axis=-1)

return compressed_img

def relative_error(original, compressed):

return [Link](original - compressed) / [Link](original)

img_rgb = img/255.0
k_values = [10, 20, 50, 100, 150, 200]
errors = []
for k in k_values:
compressed_img = svd(img_rgb, k)
error = relative_error(img_rgb, compressed_img)
[Link](error)

[Link](img1)
[Link]("off")
[Link]()
#when k = 200
In [79]: img2 = svd(img_rgb, 25)
img2 = [Link](img2, 0, 1)

[Link](img2)
[Link]("off")
[Link]()
#when k =25

In [82]: img3 = svd(img_rgb, 100)

img2 = [Link](img3, 0, 1)

[Link](img3)
[Link]("off")
[Link]()
#when k =100
Clipping input data to the valid range for imshow with RGB data ([0..1] for floats or [0..255] for integers). Got range [-0.15547262666145525..1.12
94772082186946].

Question 3
In designing a movie recommendation system, one creates a rating matrixR ∈ R
m×n
m for users and movies. The entries gives the rating of a movie (1-5) in the -th
n Rij j

column by a user in the -th row. The rating matrix is generally quite sparse. The missing entries are replaced by as a starter. Answer the following question for the
i 0

MovieLens dataset available below. MovieLens Dataset 100k

Read Sections 2 and 3 for application of SVD in recommendation systems of this paper by Sarvar et. al..
Part A: Load the dataset ([Link]) using pandas and create the rating matrix . This will require dataframe pivoting.
R

In [170… # Your Code Starts Here

import pandas as pd
data = pd.read_table("[Link]", header=None)
data = [Link](data)
[Link] = ["user_id", "item_id", "rating", "timestamp"]
[Link]()

R = [Link](index="user_id", columns="item_id", values="rating")

[Link]()
Out[170… item_id 1 2 3 4 5 6 7 8 9 10 ... 1673 1674 1675 1676 1677 1678 1679 1680 1681 1682
user_id
1 5.0 3.0 4.0 3.0 3.0 5.0 4.0 1.0 5.0 3.0 ... NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN
2 4.0 NaN NaN NaN NaN NaN NaN NaN NaN 2.0 ... NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN
3 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN ... NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN
4 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN ... NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN
5 4.0 3.0 NaN NaN NaN NaN NaN NaN NaN NaN ... NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN
5 rows × 1682 columns
In [ ]:

Part B: Replace missing entries in by the average movie ratings column-wise. Create
R Rnorm by subtracting the user average for every row.
In [172… # Your Code Starts Here
R_complete = [Link](lambda avg: [Link]([Link]()), axis = 0)

user_mean = R_complete.mean(axis = 1)

R_norm = R_complete.apply(lambda row: row - user_mean[[Link]], axis=1)

R_norm

Out[172… item_id 1 2 3 4 5 6 7 8 9 10 ... 1673 1674 1675 1676

user_id
1 1.910010 -0.089990 0.910010 -0.089990 -0.089990 1.910010 0.910010 -2.089990 1.910010 -0.089990 ... -0.089990 0.910010 -0.089990 -1.089990
2 0.919898 0.126005 -0.046768 0.470138 0.222224 0.496822 0.718368 0.915332 0.816220 -1.080102 ... -0.080102 0.919898 -0.080102 -1.08010
3 0.817893 0.145681 -0.027092 0.489814 0.241900 0.516498 0.738044 0.935008 0.835896 0.771035 ... -0.060426 0.939574 -0.060426 -1.060426
4 0.789061 0.116850 -0.055924 0.460982 0.213068 0.487666 0.709212 0.906176 0.807064 0.742203 ... -0.089257 0.910743 -0.089257 -1.08925
5 0.961597 -0.038403 -0.005070 0.511836 0.263922 0.538520 0.760066 0.957030 0.857918 0.793057 ... -0.038403 0.961597 -0.038403 -1.038403
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... .
939 0.771265 0.099053 -0.073721 0.443185 0.195272 0.469869 0.691416 0.888380 1.892946 0.724407 ... -0.107054 0.892946 -0.107054 -1.107054
940 0.820431 0.148219 -0.024554 -1.057888 0.244438 0.519035 0.942112 1.942112 -0.057888 0.773573 ... -0.057888 0.942112 -0.057888 -1.057888
941 1.919249 0.125356 -0.047417 0.469488 0.221575 0.496172 0.919249 0.914683 0.815570 0.750710 ... -0.080751 0.919249 -0.080751 -1.08075
942 0.778626 0.106415 -0.066359 0.450547 0.202633 0.477231 0.698777 0.895741 0.796629 0.731768 ... -0.099692 0.900308 -0.099692 -1.09969
943 0.804237 1.925918 -0.040749 0.476157 0.228244 0.502841 0.724387 0.921352 -0.074082 0.757379 ... -0.074082 0.925918 -0.074082 -1.07408
943 rows × 1682 columns
Part C: Perform the SVD of Rnorm as Rnorm = U SV
T
. Perform low-rank approximation of Rnorm by using k = 100 .
In [173… # Your Code Starts Here
import numpy as np
U, S, Vt = [Link](R_norm, full_matrices=False)
k = 100
U_k = U[:, :k]
S_k = [Link](S[:k])
Vt_k = Vt[:k, :]
R_norm_approx = U_k @ S_k @ Vt_k
R_norm_approx.shape

Out[173… (943, 1682)

Part D: By using the above low-rank representation, the predicted rating of movie by user could be given by
j i

1/2 1/2 T
Rij ≈ μi + [U(k) S ]i: [S V ]:j
(k) (k) (k)

where is the average of ratings by user .

μi i

In [174… # Your Code Starts Here

S_k_sqrt = [Link](S_k)
U_S_sqrt = U_k @ S_k_sqrt
S_sqrt_Vt = S_k_sqrt @ Vt_k

In [184… def predict_rating(user_id, item_id):

mu_i = user_mean[user_id]

user_features = U_S_sqrt[user_id, :]
movie_features = S_sqrt_Vt[:, item_id]
rating_approximation = mu_i + [Link](user_features, movie_features)

return rating_approximation

user_id = 1
item_id = 4
predicted_rating = predict_rating(user_id, item_id)
print(f"Predicted rating of movie {item_id} by user {user_id}: {predicted_rating}")

Predicted rating of movie 4 by user 1: 3.333059473007486

Part E: Make a tabular comparison of the actual ratings and predicted ratings for some randomly selected users and movies in the following format.
User Id Movie Id Actual Rating Predicted Rating
1234 987 5 4.89
2345 876 0 2.7
In [ ]: # Your Code Starts Here

random_users = [Link](R_complete.index, 10)

random_movies = [Link](R_complete.columns, 10)

user_ids = []
movie_ids = []
actual_ratings = []
predicted_ratings = []

for user_id, movie_id in zip(random_users, random_movies):

actual_rating = R_complete.loc[user_id, movie_id]
predicted_rating = predict_rating(user_id, movie_id)

user_ids.append(user_id)
movie_ids.append(movie_id)
actual_ratings.append(actual_rating)
predicted_ratings.append(predicted_rating)

results = [Link]({
"User Id": user_ids,
"Movie Id": movie_ids,
"Actual Rating": actual_ratings,
"Predicted Rating": predicted_ratings
})

[Link](10)

Out[ ]: User Id Movie Id Actual Rating Predicted Rating

0 431 834 2.200000 3.862701
1 348 1220 3.333333 3.585011
2 674 1297 2.833333 3.660598
3 10 1078 2.772727 2.117172
4 69 190 4.137097 3.965525
5 410 1429 2.750000 2.322252
6 728 528 4.132231 3.956549
7 179 872 3.095238 2.858756
8 350 1546 1.000000 2.989504
9 810 1047 2.835821 2.984848
In [ ]:

Data Science Using Python Lab Week8
No ratings yet
Data Science Using Python Lab Week8
23 pages
MFound HW3
No ratings yet
MFound HW3
4 pages
EE16A Homework 13: Polynomial Fitting
No ratings yet
EE16A Homework 13: Polynomial Fitting
23 pages
ML Labs
No ratings yet
ML Labs
14 pages
Python Programs for Data Analysis and Visualization
No ratings yet
Python Programs for Data Analysis and Visualization
8 pages
Module7 PCA Clustering November 9-13-2023
No ratings yet
Module7 PCA Clustering November 9-13-2023
41 pages
Mfds Assignment
No ratings yet
Mfds Assignment
20 pages
HW8 La
No ratings yet
HW8 La
18 pages
Exercise 01
No ratings yet
Exercise 01
3 pages
Advanced ML
No ratings yet
Advanced ML
38 pages
Weekly Homework X
No ratings yet
Weekly Homework X
15 pages
Math Equations
No ratings yet
Math Equations
9 pages
ModuleAr Merged
No ratings yet
ModuleAr Merged
42 pages
Support Vector Machine - Python Implementation Using CVXOPT - Data Blog
100% (1)
Support Vector Machine - Python Implementation Using CVXOPT - Data Blog
12 pages
Python Numerical Methods for Equations
No ratings yet
Python Numerical Methods for Equations
12 pages
Understanding DA and AI Concepts
No ratings yet
Understanding DA and AI Concepts
84 pages
Lab 2 SVM
No ratings yet
Lab 2 SVM
23 pages
EECS 275 Matrix Computation: Ming-Hsuan Yang
No ratings yet
EECS 275 Matrix Computation: Ming-Hsuan Yang
21 pages
ML Manual
No ratings yet
ML Manual
30 pages
Linear Algebra Project Guide
No ratings yet
Linear Algebra Project Guide
7 pages
SVD 4x4
No ratings yet
SVD 4x4
4 pages
Orf523 S24 HW1
No ratings yet
Orf523 S24 HW1
5 pages
HW 3
No ratings yet
HW 3
3 pages
Advanced Math & Engineering Homework
No ratings yet
Advanced Math & Engineering Homework
6 pages
Matrix Operations for Engineers
No ratings yet
Matrix Operations for Engineers
21 pages
L - AND - T - Project - Naveen 24cs002895
No ratings yet
L - AND - T - Project - Naveen 24cs002895
7 pages
Final2008f-Solution SVM PCA HMM BN
No ratings yet
Final2008f-Solution SVM PCA HMM BN
18 pages
z29YZf FQxeyQ6TifsFDaA CorrespondenceAnalysis Python Code
No ratings yet
z29YZf FQxeyQ6TifsFDaA CorrespondenceAnalysis Python Code
5 pages
Singular Value Decomposition in Data Science
No ratings yet
Singular Value Decomposition in Data Science
65 pages
Experiment 1
No ratings yet
Experiment 1
19 pages
AOE 5404 Homework 5: Due On Feb. 26, 2025
No ratings yet
AOE 5404 Homework 5: Due On Feb. 26, 2025
2 pages
Department of Electrical Engineering School of Science and Engineering
No ratings yet
Department of Electrical Engineering School of Science and Engineering
10 pages
Report System Identification and Modelling
No ratings yet
Report System Identification and Modelling
34 pages
NC Final SP 2024
No ratings yet
NC Final SP 2024
12 pages
Problem Set 2 Spring 2020 CS395T
No ratings yet
Problem Set 2 Spring 2020 CS395T
4 pages
NB 15
No ratings yet
NB 15
20 pages
Multirotor Aircraft Control Analysis
No ratings yet
Multirotor Aircraft Control Analysis
22 pages
Final2008f Solution
No ratings yet
Final2008f Solution
18 pages
Advances in Sketching for Linear Algebra
No ratings yet
Advances in Sketching for Linear Algebra
139 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
9 pages
Lab Changed
No ratings yet
Lab Changed
23 pages
Image Processing
No ratings yet
Image Processing
5 pages
Matlab
No ratings yet
Matlab
20 pages
2.3 SciPy-1
No ratings yet
2.3 SciPy-1
17 pages
Mlalllabprgs
No ratings yet
Mlalllabprgs
17 pages
CS6301 Homework2 KR
No ratings yet
CS6301 Homework2 KR
13 pages
Machine Learning Homework 1 Solutions
No ratings yet
Machine Learning Homework 1 Solutions
11 pages
MLLab Manual
No ratings yet
MLLab Manual
24 pages
Health Outcomes With Linear Regression
No ratings yet
Health Outcomes With Linear Regression
8 pages
Lab - 7 - 21130616 - TranhThanhVu - Ipynb - Colab
No ratings yet
Lab - 7 - 21130616 - TranhThanhVu - Ipynb - Colab
10 pages
Fifth
No ratings yet
Fifth
7 pages
Matrix Operations and Data Analysis in Python
No ratings yet
Matrix Operations and Data Analysis in Python
38 pages
Pinv For Modern ML
No ratings yet
Pinv For Modern ML
31 pages
Lab 6
No ratings yet
Lab 6
6 pages
SC 1
No ratings yet
SC 1
25 pages
HoangGiaUy 20020014 VNUK en - 2024
No ratings yet
HoangGiaUy 20020014 VNUK en - 2024
10 pages
Data Similarity & SVD Techniques
No ratings yet
Data Similarity & SVD Techniques
10 pages
Foundations of Data Science: Exercise 1
No ratings yet
Foundations of Data Science: Exercise 1
5 pages
Em Assignment
No ratings yet
Em Assignment
4 pages
Calculus III Quiz Analysis
No ratings yet
Calculus III Quiz Analysis
5 pages
ENGG295 Week 1 Maths Tutorial Guide
No ratings yet
ENGG295 Week 1 Maths Tutorial Guide
2 pages
What Is Transformation Matrix and How To Use It
No ratings yet
What Is Transformation Matrix and How To Use It
11 pages
Informative Target 1.5 Polynomials and Complex Zeros
No ratings yet
Informative Target 1.5 Polynomials and Complex Zeros
10 pages
Calculus Research Trends
No ratings yet
Calculus Research Trends
9 pages
Advanced Calculus for Students
100% (1)
Advanced Calculus for Students
4 pages
Imperfections
100% (1)
Imperfections
10 pages
Linear Systems and Signals - B. P. Lathi
No ratings yet
Linear Systems and Signals - B. P. Lathi
194 pages
Algebra and Trigonometry Jay Abramson All Chapter Instant Download
100% (1)
Algebra and Trigonometry Jay Abramson All Chapter Instant Download
40 pages
General Mathematics11 - q1 - Mod1 - Functions
No ratings yet
General Mathematics11 - q1 - Mod1 - Functions
43 pages
Mathematical Analysis
No ratings yet
Mathematical Analysis
4 pages
Differential Geometry 1
No ratings yet
Differential Geometry 1
3 pages
9.differential Equations 2ndPUC PYQs
No ratings yet
9.differential Equations 2ndPUC PYQs
2 pages
Numerical Integration CH 7
No ratings yet
Numerical Integration CH 7
25 pages
Non-Linear Programming With Constraints, "Lagrange's Method" - Example Problems Example
No ratings yet
Non-Linear Programming With Constraints, "Lagrange's Method" - Example Problems Example
6 pages
Solution Manual For Signals and Systems Analysis Using Transform Methods and MATLAB 3rd Edition
No ratings yet
Solution Manual For Signals and Systems Analysis Using Transform Methods and MATLAB 3rd Edition
52 pages
AP Precalculus 3.4-3.7 Review
No ratings yet
AP Precalculus 3.4-3.7 Review
3 pages
NDA 1 2024 Exam Analysis Overview
No ratings yet
NDA 1 2024 Exam Analysis Overview
12 pages
Logit Marginal Effects
No ratings yet
Logit Marginal Effects
12 pages
Chap1 Geometry of Complex Numbers
No ratings yet
Chap1 Geometry of Complex Numbers
7 pages
Inverse Trigonometric Functions
No ratings yet
Inverse Trigonometric Functions
4 pages
DFDP_15 Quiz 8: R Programming Concepts
No ratings yet
DFDP_15 Quiz 8: R Programming Concepts
5 pages
Mathematics For Economics and Finance: Answer Key To Final Exam
No ratings yet
Mathematics For Economics and Finance: Answer Key To Final Exam
13 pages
FLUENT Modeling Unsteady Flows
100% (1)
FLUENT Modeling Unsteady Flows
101 pages
Function: Examples by and
No ratings yet
Function: Examples by and
2 pages
Bowen 2009 Prelim Am p1
No ratings yet
Bowen 2009 Prelim Am p1
5 pages
Leather Iii To Viii PDF
No ratings yet
Leather Iii To Viii PDF
69 pages
Unit 5
No ratings yet
Unit 5
8 pages
Axial Deformations in Plane Frames Analysis
No ratings yet
Axial Deformations in Plane Frames Analysis
31 pages
41st Iranian Mathematics Conference Abstracts
No ratings yet
41st Iranian Mathematics Conference Abstracts
419 pages

Programming Assignment3

Uploaded by

Programming Assignment3

Uploaded by

Programming Assignment 3

In [2]: import numpy as np

AtA = [Link]((d + 1, d + 1))

for i in range(d + 1):

coeffs = [Link](AtA, Atb)

return coeffs, residual_norm

linear_coeffs, linear_residual_norm = poly_fit2d(data, 1)

x_values = [Link](-4, 4, 100)

linear_fit = [sum(linear_coeffs[j] * (x ** (1 - j)) for j in range(2)) for x in x_values]

In [63]: # Your Code Starts Here

print("Original shape: ",[Link])

Original shape: (1196, 1920, 3)

compressed_img = [Link](compressed, axis=-1)

def relative_error(original, compressed):

return [Link](original - compressed) / [Link](original)

In [66]: [Link](figsize=(8, 6))

In [82]: img3 = svd(img_rgb, 100)

MovieLens dataset available below. MovieLens Dataset 100k

In [170… # Your Code Starts Here

R = [Link](index="user_id", columns="item_id", values="rating")

R_norm = R_complete.apply(lambda row: row - user_mean[[Link]], axis=1)

Out[172… item_id 1 2 3 4 5 6 7 8 9 10 ... 1673 1674 1675 1676

Out[173… (943, 1682)

where is the average of ratings by user .

In [174… # Your Code Starts Here

In [184… def predict_rating(user_id, item_id):

Predicted rating of movie 4 by user 1: 3.333059473007486

random_users = [Link](R_complete.index, 10)

for user_id, movie_id in zip(random_users, random_movies):

Out[ ]: User Id Movie Id Actual Rating Predicted Rating

You might also like