0% found this document useful (0 votes)

40 views9 pages

Cost Function in Machine Learning - Javatpoint

The document explains the concept of cost functions in machine learning, which measure how well a model predicts outcomes by calculating the difference between expected and predicted values. It discusses the importance of minimizing the cost function for model accuracy and introduces gradient descent as a method for optimization. Additionally, it outlines different types of cost functions, including regression and classification cost functions, and their specific applications.

Uploaded by

manoj walekar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

40 views9 pages

Cost Function in Machine Learning - Javatpoint

Uploaded by

manoj walekar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

9/12/24, 10:50 PM Cost Function in Machine Learning - Javatpoint

Cost Function in Machine Learning

A Machine Learning model should have a very high level of accuracy in order to perform well
with real-world applications. But how to calculate the accuracy of the model, i.e., how good or
poor our model will perform in the real world? In such a case, the Cost function comes into
existence. It is an important machine learning parameter to correctly estimate the model.

Cost function also plays a crucial role in understanding that how well your model estimates the
relationship between the input and output parameters.

In this topic, we will explain the cost function in Machine Learning, Gradient descent, and types
of cost functions.

What is Cost Function?

A cost function is an important parameter that determines how well a machine learning
model performs for a given dataset. It calculates the difference between the expected value
and predicted value and represents it as a single real number.

In machine learning, once we train our model, then we want to see how well our model is
performing. Although there are various accuracy functions that tell you how your model is
performing, but will not give insights to improve them. So, we need a function that can find
when the model is most accurate by finding the spot between the undertrained and
overtrained model.

https://www.javatpoint.com/cost-function-in-machine-learning 2/10
9/12/24, 10:50 PM Cost Function in Machine Learning - Javatpoint

In simple, "Cost function is a measure of how wrong the model is in estimating the
relationship between X(input) and Y(output) Parameter." A cost function is sometimes also
referred to as Loss function, and it can be estimated by iteratively running the model to
compare estimated predictions against the known values of Y.

The main aim of each ML model is to determine parameters or weights that can minimize the
cost function.

Why use Cost Function?

While there are different accuracy parameters, then why do we need a Cost function for the
Machine learning model. So, we can understand it with an example of the classification of data.
Suppose we have a dataset that contains the height and weights of cats & dogs, and we need
to classify them accordingly. If we plot the records using these two features, we will get a
scatter plot as below:

In the above image, the green dots are cats, and the yellow dots are dogs. Below are the three
possible solutions for this classification problem.

https://www.javatpoint.com/cost-function-in-machine-learning 3/10
9/12/24, 10:50 PM Cost Function in Machine Learning - Javatpoint

In the above solutions, all three classifiers have high accuracy, but the third solution is the best
because it correctly classifies each datapoint. The reason behind the best classification is that it
is in mid between both the classes, not close or not far to any of them.

To get such results, we need a Cost function. It means for getting the optimal solution; we need
a Cost function. It calculated the difference between the actual values and predicted values and
measured how wrong was our model in the prediction. By minimizing the value of the cost
function, we can get the optimal solution.

Gradient Descent: Minimizing the cost function

As we discussed in the above section, the cost function tells how wrong your model is? And
each machine learning model tries to minimize the cost function in order to give the best
results. Here comes the role of Gradient descent.

"Gradient Descent is an optimization algorithm which is used for optimizing the cost
function or error in the model." It enables the models to take the gradient or direction to
reduce the errors by reaching to least possible error. Here direction refers to how model
parameters should be corrected to further reduce the cost function. The error in your model
can be different at different points, and you have to find the quickest way to minimize it, to
prevent resource wastage.

Gradient descent is an iterative process where the model gradually converges towards a
minimum value, and if the model iterates further than this point, it produces little or zero
changes in the loss. This point is known as convergence, and at this point, the error is least, and
the cost function is optimized.

Below is the equation for gradient descent in linear regression:

In the gradient descent equation, alpha is known as the learning rate. This parameter decides
how fast you should move down to the slope. For large alpha, take big steps, and for small
alpha value, you need to take small steps.

https://www.javatpoint.com/cost-function-in-machine-learning 4/10
9/12/24, 10:50 PM Cost Function in Machine Learning - Javatpoint

Types of Cost Function

Cost functions can be of various types depending on the problem. However, mainly it is of
three types, which are as follows:

1. Regression Cost Function

2. Binary Classification cost Functions

3. Multi-class Classification Cost Function.

1. Regression Cost Function

Regression models are used to make a prediction for the continuous variables such as the price
of houses, weather prediction, loan predictions, etc. When a cost function is used with
Regression, it is known as the "Regression Cost Function." In this, the cost function is calculated
as the error based on the distance, such as:

Error= Actual Output-Predicted output

There are three commonly used Regression cost functions, which are as follows:

a. Means Error

In this type of cost function, the error is calculated for each training data, and then the mean of
all error values is taken.

It is one of the simplest ways possible.

The errors that occurred from the training data can be either negative or positive. While finding
mean, they can cancel out each other and result in the zero-mean error for the model, so it is
not recommended cost function for a model.

However, it provides a base for other cost functions of regression models.

b. Mean Squared Error (MSE)

Means Square error is one of the most commonly used Cost function methods. It improves the
drawbacks of the Mean error cost function, as it calculates the square of the difference between
the actual value and predicted value. Because of the square of the difference, it avoids any
possibility of negative error.

The formula for calculating MSE is given below:

https://www.javatpoint.com/cost-function-in-machine-learning 5/10
9/12/24, 10:50 PM Cost Function in Machine Learning - Javatpoint

Mean squared error is also known as L2 Loss.

In MSE, each error is squared, and it helps in reducing a small deviation in prediction as
compared to MAE. But if the dataset has outliers that generate more prediction errors, then
squaring of this error will further increase the error multiple times. Hence, we can say MSE is
less robust to outliers.

c. Mean Absolute Error (MAE)

Mean Absolute error also overcome the issue of the Mean error cost function by taking the
absolute difference between the actual value and predicted value.

The formula for calculating Mean Absolute Error is given below:

This means the Absolute error cost function is also known as L1 Loss. It is not affected by noise
or outliers, hence giving better results if the dataset has noise or outlier.

2. Binary Classification Cost Functions

Classification models are used to make predictions of categorical variables, such as predictions
for 0 or 1, Cat or dog, etc. The cost function used in the classification problem is known as the
Classification cost function. However, the classification cost function is different from the
Regression cost function.

One of the commonly used loss functions for classification is cross-entropy loss.

The binary Cost function is a special case of Categorical cross-entropy, where there is only one
output class. For example, classification between red and blue.

To better understand it, let's suppose there is only a single output variable Y

Cross-entropy(D) = - y*log(p) when y = 1

Cross-entropy(D) = - (1-y)*log(1-p) when y = 0

https://www.javatpoint.com/cost-function-in-machine-learning 6/10
9/12/24, 10:50 PM Cost Function in Machine Learning - Javatpoint

The error in binary classification is calculated as the mean of cross-entropy for all N training
data. Which means:

Binary Cross-Entropy = (Sum of Cross-Entropy for N data)/N

3. Multi-class Classification Cost Function

A multi-class classification cost function is used in the classification problems for which
instances are allocated to one of more than two classes. Here also, similar to binary class
classification cost function, cross-entropy or categorical cross-entropy is commonly used cost
function.

It is designed in a way that it can be used with multi-class classification with the target values
ranging from 0 to 1, 3, ….,n classes.

In a multi-class classification problem, cross-entropy will generate a score that summarizes the
mean difference between actual and anticipated probability distribution.

For a perfect cross-entropy, the value should be zero when the score is minimized.

← Prev Next →

For Videos Join Our Youtube Channel: Join Now

Feedback

Send your Feedback to [email protected]

Help Others, Please Share

https://www.javatpoint.com/cost-function-in-machine-learning 7/10
9/12/24, 10:50 PM Cost Function in Machine Learning - Javatpoint

Learn Latest Tutorials

Splunk SPSS Swagger Transact-SQL

Tumblr ReactJS Regex Reinforcement

Learning

R Programming RxJS React Native Python Design

Patterns

Python Pillow Python Turtle Keras

Preparation

Aptitude Logical Verbal Ability Interview

Reasoning Questions
Aptitude Verbal Ability
Reasoning Interview Questions

Company
Interview
Questions
Company Questions

https://www.javatpoint.com/cost-function-in-machine-learning 8/10
9/12/24, 10:50 PM Cost Function in Machine Learning - Javatpoint

Trending Technologies

Artificial AWS Tutorial Selenium Cloud

Intelligence tutorial Computing
AWS
Artificial Selenium Cloud Computing
Intelligence

Hadoop tutorial ReactJS Data Science Angular 7

Tutorial Tutorial Tutorial
Hadoop
ReactJS Data Science Angular 7

Blockchain Git Tutorial Machine DevOps

Tutorial Learning Tutorial Tutorial
Git
Blockchain Machine Learning DevOps

B.Tech / MCA

DBMS tutorial Data Structures DAA tutorial Operating

tutorial System
DBMS DAA
Data Structures Operating System

Computer Compiler Computer Discrete

Network tutorial Design tutorial Organization and Mathematics
Architecture Tutorial
Computer Network Compiler Design
Computer Discrete
Organization Mathematics

Ethical Hacking Computer Software html tutorial

Graphics Tutorial Engineering
Ethical Hacking Web Technology
Computer Graphics Software
Engineering

Cyber Security Automata C Language C++ tutorial

tutorial Tutorial tutorial
C++

https://www.javatpoint.com/cost-function-in-machine-learning 9/10
9/12/24, 10:50 PM Cost Function in Machine Learning - Javatpoint

Cyber Security Automata C Programming

Java tutorial .Net Python tutorial List of

Framework Programs
Java Python
tutorial
Programs
.Net

Control Data Mining Data

Systems tutorial Tutorial Warehouse
Tutorial
Control System Data Mining
Data Warehouse

https://www.javatpoint.com/cost-function-in-machine-learning 10/10

chp2 Cost Functions
No ratings yet
chp2 Cost Functions
7 pages
Top 7 Loss Functions To Evaluate Regression Models
No ratings yet
Top 7 Loss Functions To Evaluate Regression Models
8 pages
Cost Function Loss Function
No ratings yet
Cost Function Loss Function
7 pages
UNIT4 CostFunctions
No ratings yet
UNIT4 CostFunctions
23 pages
Cost Function
100% (1)
Cost Function
21 pages
Lecture 4 - Cost Function
No ratings yet
Lecture 4 - Cost Function
18 pages
Cost Function
No ratings yet
Cost Function
3 pages
Detailed Guide 7 Loss Functions Machine Learning Python Code
No ratings yet
Detailed Guide 7 Loss Functions Machine Learning Python Code
16 pages
Loss Functions
No ratings yet
Loss Functions
29 pages
Loss Function
No ratings yet
Loss Function
23 pages
Data Science - Machine Learning - Cost Function
No ratings yet
Data Science - Machine Learning - Cost Function
6 pages
What Is Machine Learning?
No ratings yet
What Is Machine Learning?
12 pages
Machine Learning
No ratings yet
Machine Learning
60 pages
What Is Machine Learning by Coursera
No ratings yet
What Is Machine Learning by Coursera
47 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
15 pages
Linear Regression For Absolute Beginners With Implementation in Python
No ratings yet
Linear Regression For Absolute Beginners With Implementation in Python
17 pages
Linear Regression: Level:4 Department: IT, Security
No ratings yet
Linear Regression: Level:4 Department: IT, Security
35 pages
Lecture3 - Linear Regression and Logistic Regression
No ratings yet
Lecture3 - Linear Regression and Logistic Regression
60 pages
ML:Introduction What Is Machine Learning?: Continuous and Discrete Data
No ratings yet
ML:Introduction What Is Machine Learning?: Continuous and Discrete Data
6 pages
C1 W1 Lab03 Cost Function Soln
No ratings yet
C1 W1 Lab03 Cost Function Soln
5 pages
Cost Function Mae Mse Lr
No ratings yet
Cost Function Mae Mse Lr
19 pages
ML Intro
No ratings yet
ML Intro
5 pages
Cost Function
No ratings yet
Cost Function
2 pages
Lecture7 Linear Regression
No ratings yet
Lecture7 Linear Regression
36 pages
Tom Mitchell Provides A More Modern Definition
No ratings yet
Tom Mitchell Provides A More Modern Definition
10 pages
ML Unit 3
No ratings yet
ML Unit 3
46 pages
ML Primer PDF
No ratings yet
ML Primer PDF
122 pages
ML Unit 3 1
No ratings yet
ML Unit 3 1
57 pages
Cost Function: y 2m 1 (Y ) 2m 1
No ratings yet
Cost Function: y 2m 1 (Y ) 2m 1
1 page
Linear Regression
No ratings yet
Linear Regression
37 pages
Linear Regression
No ratings yet
Linear Regression
38 pages
Linear Regression by IntuitiveAI v2.5
No ratings yet
Linear Regression by IntuitiveAI v2.5
5 pages
Deep Learning (Part 8) - Coursesteach
No ratings yet
Deep Learning (Part 8) - Coursesteach
16 pages
Cost Function - Coursera
No ratings yet
Cost Function - Coursera
1 page
Op Tim Ization
No ratings yet
Op Tim Ization
18 pages
(MLP) MidtermNote
No ratings yet
(MLP) MidtermNote
31 pages
Notes 1
No ratings yet
Notes 1
3 pages
Everything You Need To Know About Linear Regression - by Sushant Patrikar - Towards Data Science
No ratings yet
Everything You Need To Know About Linear Regression - by Sushant Patrikar - Towards Data Science
20 pages
Foundations of Machine Learning - 3
No ratings yet
Foundations of Machine Learning - 3
38 pages
Machine Learning: Introduction and Linear Regression
No ratings yet
Machine Learning: Introduction and Linear Regression
29 pages
W2 - LAB3 - Problem
No ratings yet
W2 - LAB3 - Problem
4 pages
ML Linear Model
No ratings yet
ML Linear Model
10 pages
GR 1 Report Week 7
No ratings yet
GR 1 Report Week 7
6 pages
C1 W1 Lab03 Cost Function Soln
No ratings yet
C1 W1 Lab03 Cost Function Soln
4 pages
(Machine Learning Coursera) Lecture Note Week 1
No ratings yet
(Machine Learning Coursera) Lecture Note Week 1
8 pages
C1 W1 Lab03 Cost Function Soln
No ratings yet
C1 W1 Lab03 Cost Function Soln
4 pages
Anuranan Das Summer of Sciences, 2019. Understanding and Implementing Machine Learning
No ratings yet
Anuranan Das Summer of Sciences, 2019. Understanding and Implementing Machine Learning
17 pages
ML Coursera
No ratings yet
ML Coursera
10 pages
Week_6
No ratings yet
Week_6
72 pages
C1 W1 Lab03 Cost Function Soln
No ratings yet
C1 W1 Lab03 Cost Function Soln
4 pages
Lecture 3
No ratings yet
Lecture 3
56 pages
Lec 3
No ratings yet
Lec 3
22 pages
Unit IV BPA GD
No ratings yet
Unit IV BPA GD
12 pages
ML Assignment
No ratings yet
ML Assignment
5 pages
ML:Introduction: Week 1 Lecture Notes
No ratings yet
ML:Introduction: Week 1 Lecture Notes
8 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
43 pages
Random Optimization: Fundamentals and Applications
From Everand
Random Optimization: Fundamentals and Applications
Fouad Sabry
No ratings yet
Computer Algebra: Fundamentals and Applications
From Everand
Computer Algebra: Fundamentals and Applications
Fouad Sabry
No ratings yet
Next Level Deep Machine Learning: Complete Tips and Tricks to Deep Machine Learning
From Everand
Next Level Deep Machine Learning: Complete Tips and Tricks to Deep Machine Learning
Joe Grant
No ratings yet
Backpropagation: Fundamentals and Applications for Preparing Data for Training in Deep Learning
From Everand
Backpropagation: Fundamentals and Applications for Preparing Data for Training in Deep Learning
Fouad Sabry
No ratings yet
List of Champion Sectors 08july2020
No ratings yet
List of Champion Sectors 08july2020
1 page
Sociology Mains 20012 - 14
No ratings yet
Sociology Mains 20012 - 14
10 pages
CO PO Mapping
No ratings yet
CO PO Mapping
4 pages
A Few Tips
No ratings yet
A Few Tips
1 page
Block Diagrams
No ratings yet
Block Diagrams
26 pages
Conversion, Obversion, and Contraposition of Categorical Syllogism
No ratings yet
Conversion, Obversion, and Contraposition of Categorical Syllogism
1 page
Discrete Course Outline
No ratings yet
Discrete Course Outline
2 pages
Large-Scale Multi-Class and Hierarchical Product Categorization For An E-Commerce Giant
No ratings yet
Large-Scale Multi-Class and Hierarchical Product Categorization For An E-Commerce Giant
11 pages
Machine Learning Textbook
No ratings yet
Machine Learning Textbook
191 pages
A Step by Step Backpropagation Example - Matt Mazur
No ratings yet
A Step by Step Backpropagation Example - Matt Mazur
17 pages
Gr11 Maths Statistics WS
No ratings yet
Gr11 Maths Statistics WS
9 pages
Quiz-2 COL100
No ratings yet
Quiz-2 COL100
4 pages
Gujarat Technological University: W.E.F. AY 2018-19
No ratings yet
Gujarat Technological University: W.E.F. AY 2018-19
3 pages
1.1.3 Transcript
No ratings yet
1.1.3 Transcript
1 page
Chapter3-Goodness of Fit Tests
No ratings yet
Chapter3-Goodness of Fit Tests
24 pages
Image Processing 4 Marks Questions
No ratings yet
Image Processing 4 Marks Questions
3 pages
Data Mining in CRM: Analytics-Intelligent Management of Product Life Cycle
No ratings yet
Data Mining in CRM: Analytics-Intelligent Management of Product Life Cycle
40 pages
Ty Ai&Ds Structure - Sem-I Ay 2025-26
No ratings yet
Ty Ai&Ds Structure - Sem-I Ay 2025-26
1 page
Week 4 Search Algorithms
No ratings yet
Week 4 Search Algorithms
27 pages
CH 24
No ratings yet
CH 24
19 pages
The Source Coding Theorem: M Ario S. Alvim (Msalvim@dcc - Ufmg.br)
No ratings yet
The Source Coding Theorem: M Ario S. Alvim (Msalvim@dcc - Ufmg.br)
62 pages
DAA Viva Questions
No ratings yet
DAA Viva Questions
6 pages
ME 704 Computational Methods in Thermal and Fluids Engineering (Introduction)
No ratings yet
ME 704 Computational Methods in Thermal and Fluids Engineering (Introduction)
3 pages
HK222 - Statistics Quality Control-002
No ratings yet
HK222 - Statistics Quality Control-002
4 pages
Btech Oe 7 Sem Machine Learning Koe073 2022
No ratings yet
Btech Oe 7 Sem Machine Learning Koe073 2022
1 page
Machine Intelligence
No ratings yet
Machine Intelligence
15 pages
SSRN 4080107
No ratings yet
SSRN 4080107
38 pages
Property of STI Weeks 5 - 6 SH1923
No ratings yet
Property of STI Weeks 5 - 6 SH1923
21 pages
2010 Canadian Computing Competition: Senior Division: Sponsor
No ratings yet
2010 Canadian Computing Competition: Senior Division: Sponsor
12 pages
Charles H. Bennett (Physicist)
No ratings yet
Charles H. Bennett (Physicist)
4 pages
Business Report Sparkling Dataset - TSF
No ratings yet
Business Report Sparkling Dataset - TSF
26 pages
Bits F464
No ratings yet
Bits F464
3 pages
The Scientific Method - NLP PDF
No ratings yet
The Scientific Method - NLP PDF
2 pages
Jurnal Jutrids C Indu-1
No ratings yet
Jurnal Jutrids C Indu-1
14 pages
Unit I SNM
No ratings yet
Unit I SNM
38 pages
Elliptic Curve Cryptography Master Thesis
100% (1)
Elliptic Curve Cryptography Master Thesis
6 pages

Cost Function in Machine Learning - Javatpoint

Uploaded by

Cost Function in Machine Learning - Javatpoint

Uploaded by

9/12/24, 10:50 PM Cost Function in Machine Learning - Javatpoint

Cost Function in Machine Learning

What is Cost Function?

Why use Cost Function?

Gradient Descent: Minimizing the cost function

Below is the equation for gradient descent in linear regression:

Types of Cost Function

1. Regression Cost Function

2. Binary Classification cost Functions

3. Multi-class Classification Cost Function.

1. Regression Cost Function

Error= Actual Output-Predicted output

It is one of the simplest ways possible.

However, it provides a base for other cost functions of regression models.

b. Mean Squared Error (MSE)

The formula for calculating MSE is given below:

Mean squared error is also known as L2 Loss.

c. Mean Absolute Error (MAE)

The formula for calculating Mean Absolute Error is given below:

2. Binary Classification Cost Functions

Cross-entropy(D) = - y*log(p) when y = 1

Cross-entropy(D) = - (1-y)*log(1-p) when y = 0

Binary Cross-Entropy = (Sum of Cross-Entropy for N data)/N

3. Multi-class Classification Cost Function

For Videos Join Our Youtube Channel: Join Now

Send your Feedback to [email protected]

Help Others, Please Share

Learn Latest Tutorials

Splunk SPSS Swagger Transact-SQL

Tumblr ReactJS Regex Reinforcement

R Programming RxJS React Native Python Design

Python Pillow Python Turtle Keras

Aptitude Logical Verbal Ability Interview

Artificial AWS Tutorial Selenium Cloud

Hadoop tutorial ReactJS Data Science Angular 7

Blockchain Git Tutorial Machine DevOps

DBMS tutorial Data Structures DAA tutorial Operating

Computer Compiler Computer Discrete

Ethical Hacking Computer Software html tutorial

Cyber Security Automata C Language C++ tutorial

Cyber Security Automata C Programming

Java tutorial .Net Python tutorial List of

Control Data Mining Data

You might also like