Unit - III

SKN Sinhgad College of Engineering, Korti, Pandharpur

Class: Bachelor of Engineering
Subject: Machine Learning

Department of Computer Science & Engineering

Presentation Prepared By
Mr. Subhash V. Pingale

Getting started with the Math Basics

Working with Data


Machine learning enables tasks such as:
1. Discovering criminal behavior and detecting criminals in action
2. Recommending the right product to the right person
3. Filtering and classifying data from the Internet at an enormous scale
4. Driving a car autonomously



The mathematical and statistical basis of machine learning makes outputting such useful results possible. Using math and statistics in this way enables the algorithms to understand anything with a numerical basis.
Consider, for example, a set of information useful for deciding whether to play tennis outside or not, something a machine can learn using the proper technique. The set of features is described as follows:
Outlook: Sunny, overcast, or rain
Temperature: Cool, mild, or hot
Humidity: High or normal
Windy: True or false
No matter what the information is, for a machine learning algorithm to correctly process it, it should always be transformed into a number.
Creating a Matrix
After you make all the data numeric, the machine learning algorithm requires that you turn the individual features into a matrix of features and the individual responses into a vector or a matrix (when there are multiple responses).
A matrix is a collection of numbers arranged in rows and columns, much like the squares on a chessboard. However, unlike a chessboard, which is always square, a matrix can have different numbers of rows and columns.
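To make the idea concrete, here is a minimal Python sketch (the numeric codes and the three sample records are illustrative assumptions, not a fixed convention) that turns the tennis features into a feature matrix X and a response vector y:

# Encode the categorical tennis features as numbers, then assemble
# the feature matrix X and the response vector y with NumPy.
import numpy as np

outlook = {"Sunny": 0, "Overcast": 1, "Rain": 2}
temperature = {"Cool": 0, "Mild": 1, "Hot": 2}
humidity = {"High": 0, "Normal": 1}
windy = {False: 0, True: 1}
play = {"No": 0, "Yes": 1}

# A few example observations: (outlook, temperature, humidity, windy, play)
records = [
    ("Sunny", "Hot", "High", False, "No"),
    ("Overcast", "Hot", "High", False, "Yes"),
    ("Rain", "Mild", "Normal", True, "Yes"),
]

# Each row of X is one observation; each column is one feature.
X = np.array([[outlook[o], temperature[t], humidity[h], windy[w]]
              for o, t, h, w, _ in records])
y = np.array([play[p] for *_, p in records])

print(X)  # 3 x 4 feature matrix
print(y)  # response vector of length 3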



Matrix Operations
 Addition
 Multiplication
A common use of matrix multiplication in machine learning is the linear prediction Y = Xb, where X is the feature matrix and b is a vector of coefficients.
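A short NumPy sketch of these operations (the numbers are made up for illustration):

# Matrix addition, matrix multiplication, and the linear prediction Y = Xb.
import numpy as np

A = np.array([[1, 2], [3, 4]])
B = np.array([[5, 6], [7, 8]])

print(A + B)   # element-wise addition
print(A @ B)   # matrix multiplication (rows of A times columns of B)

X = np.array([[1.0, 2.0],
              [1.0, 3.0],
              [1.0, 4.0]])      # 3 observations, 2 features
b = np.array([0.5, 2.0])        # coefficient vector
Y = X @ b                       # one prediction per row of X
print(Y)                        # [4.5 6.5 8.5]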



Glancing at advanced matrix operations
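The slide's details are not reproduced here; as a hedged illustration, two advanced operations that appear constantly in machine learning are the transpose and the inverse:

# Transpose and inverse in NumPy (illustrative examples; the original
# slide's exact content is not shown in this text).
import numpy as np

A = np.array([[4.0, 7.0],
              [2.0, 6.0]])

print(A.T)                 # transpose: rows become columns
A_inv = np.linalg.inv(A)   # inverse: A @ A_inv gives the identity
print(A @ A_inv)           # approximately [[1, 0], [0, 1]]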



Exploring the World of Probabilities

 Probability tells you the likelihood of an event, and you express it as a number. The probability of an event is measured in the range from 0 (no probability that an event occurs) to 1 (certainty that an event occurs). Intermediate values, such as 0.25, 0.5, and 0.75, say that the event will happen with a certain frequency when tried enough times. If you multiply the probability by an integer representing the number of trials, you get an estimate of how many times the event should happen on average across those trials.
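For instance, a fair coin with probability 0.5 tossed 1,000 times should come up heads about 500 times. A quick simulation (a minimal sketch using Python's random module) makes the point:

# Estimate of event count: probability times number of trials.
import random

p_heads = 0.5
trials = 1000

expected_heads = p_heads * trials          # 500 on average
observed_heads = sum(random.random() < p_heads for _ in range(trials))
print(expected_heads, observed_heads)      # observed varies around 500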



Exploring the World of Probabilities

 For example, when you toss a coin, if the coin is fair, the a priori probability of a head is 50 percent. No matter how many times you toss the coin, when faced with a new toss the probability for heads is still 50 percent. However, there are other situations in which, if you change the context, the a priori probability is not valid anymore, because something subtle happened and changed it. In this case, you can express this belief as an a posteriori probability, which is the a priori probability after something has happened to modify the count.



What is Probability? Probability can be defined as the ratio of the number of favorable outcomes to the total number of outcomes of an event.
Probability(Event) = Favorable Outcomes / Total Outcomes
Probability formula with the addition rule: Whenever an event is the union of two other events, say A and B, then
P(A or B) = P(A ∪ B) = P(A) + P(B) - P(A ∩ B)
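As a quick check of the addition rule (a minimal sketch; the single-die events A = "roll is even" and B = "roll is at least 4" are chosen purely for illustration):

# Verify P(A or B) = P(A) + P(B) - P(A and B) by enumerating the six
# equally likely outcomes of one fair die.
from fractions import Fraction

outcomes = range(1, 7)
A = {n for n in outcomes if n % 2 == 0}   # even rolls: {2, 4, 6}
B = {n for n in outcomes if n >= 4}       # at least 4: {4, 5, 6}

def p(event):
    return Fraction(len(event), 6)

lhs = p(A | B)                 # P(A or B) = 4/6 = 2/3
rhs = p(A) + p(B) - p(A & B)   # 1/2 + 1/2 - 1/3 = 2/3
print(lhs, rhs, lhs == rhs)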



Probability formula with the complementary rule: Whenever an event is the complement of another event; specifically, if A is an event, then P(not A) = 1 - P(A), also written P(A') = 1 - P(A), so that
P(A) + P(A') = 1.
Probability formula with the conditional rule: When event A is already known to have occurred and the probability of event B is desired, the conditional probability is
P(B|A) = P(A ∩ B) / P(A)
and, vice versa, P(A|B) = P(A ∩ B) / P(B).
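Continuing the same illustrative die example, the conditional rule can be checked directly from the counts:

# Conditional probability P(B|A) = P(A and B) / P(A) on one fair die,
# with A = "roll is even" and B = "roll is greater than 3".
from fractions import Fraction

outcomes = range(1, 7)
A = {n for n in outcomes if n % 2 == 0}   # {2, 4, 6}
B = {n for n in outcomes if n > 3}        # {4, 5, 6}

p_A = Fraction(len(A), 6)                 # 1/2
p_A_and_B = Fraction(len(A & B), 6)       # {4, 6} gives 1/3
print(p_A_and_B / p_A)                    # P(B|A) = 2/3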
Example: Find the probability of getting a number less than 5 when a die is rolled, using the probability formula.



Probability of getting a number less than 5
Given: Sample space S = {1, 2, 3, 4, 5, 6}
Getting a number less than 5: A = {1, 2, 3, 4}
Therefore, n(S) = 6 and n(A) = 4
Using the probability formula,
P(A) = n(A) / n(S) = 4/6 = 2/3



What is the probability of getting a sum of 9 when two
dice are thrown?



There is a total of 36 possibilities when we throw two dice.
To get the desired outcome, i.e., a sum of 9, we have the following favorable outcomes: (4,5), (5,4), (6,3), (3,6). There are 4 favorable outcomes.
Probability of an event: P(E) = (Number of favorable outcomes) ÷ (Total outcomes in the sample space)
Probability of getting a sum of 9 = 4 ÷ 36 = 1/9
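Both worked examples can be verified by brute-force enumeration; here is a minimal sketch for the two-dice case:

# Enumerate all 36 equally likely two-dice outcomes and count sums of 9.
from fractions import Fraction
from itertools import product

rolls = list(product(range(1, 7), repeat=2))      # 36 outcomes
favorable = [r for r in rolls if sum(r) == 9]     # (3,6), (4,5), (5,4), (6,3)
print(Fraction(len(favorable), len(rolls)))       # 1/9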



Interpreting Learning As Optimization

Supervised learning
Unsupervised learning
Reinforcement learning

The learning process



Loss Function
What’s a loss function?
At its core, a loss function is incredibly simple: It’s a
method of evaluating how well your algorithm models
your dataset. If your predictions are totally off, your
loss function will output a higher number. If they’re
pretty good, it’ll output a lower number. As you
change pieces of your algorithm to try and improve
your model, your loss function will tell you if you’re
getting anywhere.
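As a concrete illustration (a minimal sketch; squared error is just one common choice of loss), scoring two sets of predictions against the same targets shows how a loss function behaves:

# A loss function scores predictions against targets: worse predictions
# produce a higher number. Squared error is used here as an example.
import numpy as np

def squared_error_loss(y_true, y_pred):
    return np.mean((y_true - y_pred) ** 2)

y_true = np.array([3.0, 5.0, 7.0])
good = np.array([2.9, 5.1, 7.2])    # close to the targets
bad = np.array([8.0, 1.0, 0.0])     # far from the targets

print(squared_error_loss(y_true, good))  # small (0.02)
print(squared_error_loss(y_true, bad))   # large (30.0)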



Different types of loss functions
A lot of the loss functions that you see implemented
in machine learning can get complex and confusing.
But if you remember the end goal of all loss
functions—measuring how well your algorithm is
doing on your dataset—you can keep that
complexity in check.
We’ll run through a few of the most popular loss
functions currently being used, from simple to more
complex.



Exploring cost functions
A cost function is an important measure of how well a machine learning model performs for a given dataset.
It calculates the difference between the expected value and the predicted value and represents it as a single real number.



Why use a cost function?



Types of cost function
Regression Cost Function
Binary Classification Cost Function
Multi-class Classification Cost Function



Regression cost function
 Regression models are used to make predictions for continuous variables, such as house prices, weather, loan amounts, etc.
Error = Actual Output - Predicted Output
1. Mean Error
 In this cost function, the error for each training example is calculated, and then the mean of all these errors is derived.
 Calculating the mean of the errors is the simplest and most intuitive approach possible.
 The errors can be both negative and positive, so they can cancel each other out during summation, giving a zero mean error for the model.
2. Mean Squared Error (MSE)
This improves on the drawback we encountered in Mean Error above. Here the square of the difference between the actual and predicted value is calculated, which avoids any possibility of negative error.
It is measured as the average of the sum of squared differences between predictions and actual observations.

MSE = (sum of squared errors) / n



3. Mean Absolute Error (MAE)
This cost function addresses the shortcoming of mean error in a different way. Here the absolute difference between the actual and predicted value is calculated, which avoids any possibility of negative error.
So in this cost function, MAE is measured as the average of the sum of absolute differences between predictions and actual observations.

MAE = (sum of absolute errors) / n
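A minimal NumPy sketch of all three regression cost functions side by side (the sample values are made up for illustration):

# Mean Error, Mean Squared Error, and Mean Absolute Error on the same
# predictions. Note how positive and negative errors cancel in Mean Error.
import numpy as np

y_true = np.array([10.0, 20.0, 30.0])
y_pred = np.array([12.0, 18.0, 30.0])
errors = y_true - y_pred                 # [-2, 2, 0]

me = np.mean(errors)                     # 0.0: cancellation hides the error
mse = np.mean(errors ** 2)               # (4 + 4 + 0) / 3 = 2.67
mae = np.mean(np.abs(errors))            # (2 + 2 + 0) / 3 = 1.33
print(me, mse, mae)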


 With respect to your target, a good practice is to define the cost
function that works the best in solving your problem, and then to figure
out which algorithms work best in optimizing it to define the
hypothesis space you want to test.
 When you work with algorithms that don’t allow the cost function you
want, you can still indirectly influence their optimization process by
fixing their hyper-parameters and selecting your input features with
respect to your cost function. Finally, when you’ve gathered all the
algorithm results, you evaluate them by using your chosen cost
function and then decide on the final hypothesis with the best result
from your chosen error function.



Gradient Descent Algorithm

 Gradient Descent is one of the most widely used optimization algorithms in machine learning.
What is a Cost Function?
It is a function that measures the performance of a model for any given data. A cost function quantifies the error between predicted values and expected values.


What is Gradient Descent?
Gradient descent is an iterative optimization algorithm for finding the local minimum of a function.
Let's say you are playing a game where the players are at the top of a mountain and are asked to reach the lowest point of the mountain (a lake at its base). Additionally, they are blindfolded. So, what approach do you think would make you reach the lake?



To find the local minimum of a function using gradient
descent, we must take steps proportional to the
negative of the gradient (move away from the
gradient) of the function at the current point. If we take
steps proportional to the positive of the gradient
(moving towards the gradient), we will approach a
local maximum of the function, and the procedure is
called Gradient Ascent.
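A minimal sketch of gradient descent on a one-dimensional function (the function, learning rate, and starting point are illustrative choices):

# Gradient descent on f(x) = (x - 3)^2, whose minimum is at x = 3.
# Each step moves in the direction of the negative gradient.

def grad(x):
    return 2 * (x - 3)        # derivative of (x - 3)^2

x = 0.0                       # starting point
learning_rate = 0.1

for step in range(100):
    x = x - learning_rate * grad(x)   # step opposite to the gradient

print(x)                      # close to 3.0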



Batch Gradient Descent: This is a type of gradient descent which processes all the training examples for each iteration of gradient descent. If the number of training examples is large, batch gradient descent becomes computationally very expensive and is therefore not preferred; instead, we use stochastic gradient descent or mini-batch gradient descent.



Stochastic Gradient Descent: This is a type of gradient descent which processes one training example per iteration. The parameters are therefore updated after every single example, which makes it much faster than batch gradient descent. However, when the number of training examples is large, processing only one example at a time adds overhead for the system, because the number of iterations becomes quite large.



Mini-Batch Gradient Descent: This is a type of gradient descent which works faster than both batch gradient descent and stochastic gradient descent. Here b examples (where b < m, with m the total number of training examples) are processed per iteration. So even if the number of training examples is large, it is processed in batches of b training examples at a time. Thus, it works for large training sets, and with fewer parameter updates per pass than stochastic gradient descent. A sketch follows below.
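A minimal sketch of mini-batch gradient descent for a one-parameter linear model (the synthetic data, batch size b = 4, and learning rate are illustrative assumptions; setting b = 1 recovers stochastic gradient descent and b = m recovers batch gradient descent):

# Mini-batch gradient descent fitting w in y = w * x on synthetic data.
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(0, 1, size=20)
y = 2.0 * x + rng.normal(0, 0.01, size=20)   # true w is about 2.0

w, lr, b = 0.0, 0.5, 4
for epoch in range(200):
    order = rng.permutation(len(x))          # shuffle each epoch
    for start in range(0, len(x), b):
        idx = order[start:start + b]         # one mini-batch of b examples
        err = w * x[idx] - y[idx]
        w -= lr * np.mean(2 * err * x[idx])  # gradient of mean squared error
print(w)                                     # close to 2.0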



Learning Curves in Machine Learning
 A learning curve is just a plot showing the progress, over experience, of a specific metric related to learning during the training of a machine learning model. Learning curves are simply a mathematical representation of the learning process.
Typically, we have a measure of time or progress on the x-axis and a measure of error or performance on the y-axis.



Single Curves
The most popular example of a learning curve is loss
over time. Loss (or cost) measures our model error, or
“how bad our model is doing”. So, for now, the lower
our loss becomes, the better our model performance
will be.
In a typical plot of loss over time, we can see the expected behavior of the learning process.



Despite slight ups and downs, in the long term the loss decreases over time, so the model is learning.
Other examples of very popular learning curves are accuracy, precision, and recall. All of these capture model performance, so the higher they are, the better our model becomes.



Multiple Curves
One of the most widely used metrics combinations
is training loss + validation loss over time.
The training loss indicates how well the model is
fitting the training data, while the validation loss
indicates how well the model fits new data.
We will see this combination later on, but for now, the example below shows a typical way of plotting both metrics.
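This is a minimal sketch (the loss values are made-up placeholders; in practice they come from your training loop):

# Plot training loss and validation loss over epochs on one figure.
import matplotlib.pyplot as plt

epochs = range(1, 11)
train_loss = [0.90, 0.70, 0.55, 0.45, 0.38, 0.33, 0.30, 0.28, 0.27, 0.26]
val_loss = [0.95, 0.78, 0.65, 0.58, 0.54, 0.52, 0.51, 0.51, 0.52, 0.53]

plt.plot(epochs, train_loss, label="training loss")
plt.plot(epochs, val_loss, label="validation loss")
plt.xlabel("epoch")            # time / experience on the x-axis
plt.ylabel("loss")             # error on the y-axis
plt.legend()
plt.show()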



Optimization Learning Curves: learning curves calculated on the metric by which the parameters of the model are being optimized, such as loss or Mean Squared Error.
Performance Learning Curves: learning curves calculated on the metric by which the model will be evaluated and selected, such as accuracy or precision.



Performance curves for two different models



Thank you
