Assignment 1 - Machine Learning
The choice of loss function is critical because it fundamentally defines the training objective
and optimization procedure. Different loss functions have significant implications for model
generalization, robustness, complexity, and probabilistic interpretation. Key reasons the
choice matters:
• It determines the error surface shape for optimization algorithms, affecting training
efficiency.
• It controls model complexity tradeoffs like overfitting and underfitting.
• It affects how well the model generalizes beyond the training data.
• It determines how robust the model is to noise and outliers in the data.
• It impacts the scale invariance and feature spaces the model operates in.
• It provides meaning and calibration to the predicted probabilities.
• It affects interpretability and intuitiveness of the training objective.
Question: List and define three common loss functions for regression tasks.
• Mean Squared Error (MSE): The average of squared differences between predictions
and true values. Squaring penalizes larger errors more heavily, making MSE sensitive to outliers; its value is in squared units of the target.
\mathrm{MSE} = \frac{1}{n} \sum_{i=1}^{n} \left( \text{actual}_i - \text{predicted}_i \right)^2
• Mean Absolute Error (MAE): The average of absolute differences between predictions
and true values. The penalty grows linearly with the error, making MAE robust to outliers; its value is in the same units as the target.
\mathrm{MAE} = \frac{1}{n} \sum_{i=1}^{n} \left| \text{actual}_i - \text{predicted}_i \right|
• Huber Loss: Combines the behaviour of MSE and MAE, acting quadratically for errors below a
threshold δ and linearly for larger ones. This provides robustness to outliers while remaining smooth near zero (a short Python sketch follows the formula below).
\mathrm{Huber} = \frac{1}{n} \sum_{i=1}^{n}
\begin{cases}
\tfrac{1}{2} \left( \text{actual}_i - \text{predicted}_i \right)^2, & \left| \text{actual}_i - \text{predicted}_i \right| \le \delta \\
\delta \left( \left| \text{actual}_i - \text{predicted}_i \right| - \tfrac{1}{2}\delta \right), & \text{otherwise}
\end{cases}
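To make the piecewise behaviour of the Huber loss concrete, a minimal Python sketch is given below. The function name huber_loss, the default threshold delta=1.0, and the sample values are illustrative only; errors are averaged over examples as in the formulas above.

# Mean Huber loss: quadratic for small errors, linear for large ones.
# huber_loss, delta, and the sample values below are illustrative, not from the assignment.
def huber_loss(actual, predicted, delta=1.0):
    total = 0.0
    for a, p in zip(actual, predicted):
        err = abs(a - p)
        if err <= delta:
            total += 0.5 * err ** 2                 # MSE-like region
        else:
            total += delta * (err - 0.5 * delta)    # MAE-like region
    return total / len(actual)

# The outlier (10.0 vs. 2.0) contributes linearly, not quadratically.
print(huber_loss([3.0, 5.0, 10.0], [2.8, 5.4, 2.0]))   # ≈ 2.53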
Answer:
A detailed comparison of mean squared error (MSE), cross-entropy loss, and absolute loss is
given below:
• While mean squared error (MSE) and absolute loss are both commonly used for regression,
MSE has some notable advantages and disadvantages compared to absolute loss. The
squaring of errors in MSE leads to a smooth, convex error surface that is straightforward
to optimize, unlike the sharp corner that the absolute value introduces at zero error. However, this squaring
also makes MSE much more sensitive to outliers than the more robust absolute loss, and it
reports the error in squared units of the target, which is less directly interpretable.
Additionally, MSE corresponds to an assumption of normally distributed noise, which may not fit all
problems appropriately.
• Comparing MSE and cross-entropy loss, used for regression and classification respectively,
there are clear tradeoffs. MSE penalizes errors only according to their squared magnitude, while cross-entropy heavily
penalizes confident misclassifications, whether false positives or false negatives (a numerical
illustration follows this list). The probabilistic output and
logarithmic scale of cross-entropy also provide advantages in terms of optimizing the
likelihood and calibrating confidence. However, MSE has a simpler quadratic form that is
easier to optimize than cross-entropy for some models.
• The absolute loss function contrasts sharply with cross-entropy loss on classification tasks
in a few ways. Absolute loss can be used, but it lacks the probabilistic justification that cross-
entropy provides. Cross-entropy is also generally easier to optimize efficiently than
absolute loss, whose error surface has sharp, non-differentiable corners. However, absolute loss is
less sensitive to data scaling and outliers than cross-entropy.
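The penalty asymmetry described above can be illustrated numerically. The short Python sketch below (the function names and probability values are made up for the example) compares squared error and binary cross-entropy for a positive example (true label 1) under a confident correct, an uncertain, and a confident wrong prediction:

import math

def squared_error(p, y):
    # Squared difference between predicted probability and the 0/1 label
    return (y - p) ** 2

def binary_cross_entropy(p, y, eps=1e-12):
    # Negative log-likelihood of the label; eps guards against log(0)
    return -(y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps))

for p in (0.9, 0.5, 0.01):   # predicted probability of the true class
    print(f"p={p}: squared={squared_error(p, 1):.3f}  cross-entropy={binary_cross_entropy(p, 1):.3f}")

Squared error is bounded by 1 no matter how wrong the prediction is, while cross-entropy grows without bound as the probability assigned to the true class approaches zero, which is why it penalizes confident misclassifications so much harder.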
Conclusion:
While all three loss functions have their merits, certain distinctions make each better suited to
specific tasks and models. MSE's smoothness and emphasis on large errors suit regression problems, cross-
entropy's probabilistic nature suits classification, and absolute loss provides greater robustness
when outliers are present.
Question: Which loss function would you choose for the following tasks? Justify your
answer.
Answer:
• For the image classification task, I would select the cross-entropy loss function. Cross-
entropy loss is the most appropriate for multi-class image classification because it measures
the divergence between the predicted class probabilities and the true label distribution. By
optimizing cross-entropy, the model is trained to maximize the likelihood of the correct
class. Cross-entropy heavily penalizes confident incorrect classifications, forcing the model
to learn subtle features that distinguish classes. It also has a clear interpretation as a negative
log-likelihood. Furthermore, cross-entropy supports proper calibration of the predicted
probabilities from the softmax output, and it is the standard loss for multi-class deep learning
classifiers. The smoothness of cross-entropy, and its convexity with respect to the output logits,
also allow straightforward optimization with gradient-based methods (see the sketch after this list).
• For the house price regression task, I would select mean squared error (MSE) as the loss
function. MSE naturally fits continuous-value regression problems by quantifying the
squared magnitude of errors. This penalizes larger deviations more heavily, which matches the
implicit assumption of Gaussian-like noise. Taking the square root (RMSE) reports the error
back in the original price units, giving an interpretable error measure. MSE is a
standard baseline loss for regression problems that is smooth, convex, and easy to optimize
with gradient descent. Its widespread use and interpretability make it the best
choice.
• For the customer prediction regression problem, I would also choose mean squared error
as the loss function. Since this is also a continuous value regression task, MSE is again the
most suitable loss for the same reasons described above. It directly optimizes for
quantitative accuracy in the numerical predictions. The intuitiveness, optimization
properties, and ubiquity of MSE make it the optimal choice over alternatives.
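To connect the classification choice to the softmax output mentioned above, here is a minimal, illustrative sketch of multi-class cross-entropy computed from raw class scores (the scores and the true-class index are made-up example values, not data from this assignment):

import math

def softmax(scores):
    # Turn raw scores (logits) into a probability distribution
    exps = [math.exp(s - max(scores)) for s in scores]   # subtract max for numerical stability
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(scores, true_class):
    # Negative log of the probability assigned to the correct class
    return -math.log(softmax(scores)[true_class])

print(cross_entropy([2.0, 0.5, -1.0], 0))   # confident and correct -> small loss (~0.24)
print(cross_entropy([-1.0, 0.5, 2.0], 0))   # confident but wrong   -> large loss (~3.24)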
# Menu-driven script: computes the squared loss, MSE, RMSE, and MAE on sample data
import math
def squared_loss(actual, predicted):              # sum of squared errors
    return sum((a - p) ** 2 for a, p in zip(actual, predicted))
def mse(actual, predicted):                       # Mean Squared Error
    return squared_loss(actual, predicted) / len(actual)
def rmse(actual, predicted):                      # Root Mean Squared Error
    return math.sqrt(mse(actual, predicted))
def mae(actual, predicted):                       # Absolute loss function (Mean Absolute Error, MAE)
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)
def display_table(actual, predicted):             # actual vs. predicted values, side by side
    print("____________________________")
    print(f"{'Actual':^13} | {'Predicted':^11}")
    print("------------- | -------------")
    for a, b in zip(actual, predicted):
        print(f"{a:^13} | {b:^11}")

actual_values = [3.0, 5.0, 2.5, 7.0]              # placeholder data (original values not preserved)
predicted_values = [2.8, 5.4, 2.9, 6.5]
display_table(actual_values, predicted_values)
while True:
    print('\n_________________________________________________________________')
    print("1. Display table")                     # labels for options 1 and 2 reconstructed
    print("2. Squared loss")
    print("3. MSE")
    print("4. RMSE")
    print("5. MAE")
    print("6. Exit")
    choice = int(input("Enter choice: "))
    if choice == 1:
        display_table(actual_values, predicted_values)
    elif choice == 2:
        loss = squared_loss(actual_values, predicted_values)
        print("Squared loss:", loss)
    elif choice == 3:
        print("MSE:", mse(actual_values, predicted_values))
    elif choice == 4:
        print("RMSE:", rmse(actual_values, predicted_values))
    elif choice == 5:
        print("MAE:", mae(actual_values, predicted_values))
    elif choice == 6:
        break
    else:
        print("Invalid choice, try again.")
________________The END________________