
UNIT03

Modelling and Evaluation


OUTLINE

• Selecting a Model: Predictive/Descriptive

• Training a Model for supervised learning

• Model representation and interpretability

• Evaluating performance of a model

• Improving performance of a model


• The basic learning process, irrespective of whether
the learner is a human or a machine, can be
divided into three parts:
– Data Input
– Abstraction
• abstraction is a significant step as it represents raw
input data in a summarized and structured format, such
that a meaningful insight is obtained from the data.
This structured representation of raw input data to the
meaningful pattern is called a model.
– Generalization
• Generalization searches through the huge set of
abstracted knowledge to come up with a small and
manageable set of key findings.
Selecting a model
• Y = f(X) + e
– where f is the target function
– X = independent variables/input
– Y = target variable/output
– e = random error term
– Cost function/error function
• helps to measure the extent to which the model goes wrong in
estimating the relationship between X and Y. In that sense, the cost
function tells how badly the model is performing.
– Loss function: a function defined on a single data point; the cost
function is typically an aggregate of the loss over all data points.
– Objective function
• The objective function takes in data and the model (along with its
parameters) as input and returns a value to be optimized during
training.
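The following minimal Python sketch (not from the slides; the function names are illustrative) shows a squared-error loss defined on individual data points and the mean squared error used as the corresponding cost function:

```python
import numpy as np

def squared_error_loss(y_true, y_pred):
    # Loss: defined on an individual data point
    return (y_true - y_pred) ** 2

def mse_cost(Y, Y_hat):
    # Cost: aggregates the loss over all data points and tells how badly
    # the model estimates the relationship between X and Y
    return np.mean(squared_error_loss(Y, Y_hat))

# Actual vs. predicted values of the target variable Y
Y = np.array([3.0, 5.0, 7.0])
Y_hat = np.array([2.5, 5.5, 6.0])
print(mse_cost(Y, Y_hat))  # 0.5
```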
• Supervised
– Classification
– Regression
• Unsupervised
– Clustering
– Association analysis
• Reinforcement
• Most important factors while selecting a model for
machine learning
– The kind of problem we want to solve using machine
learning and
– The nature of the underlying data.
• Machine learning algorithms are broadly of two
types:
– models for supervised learning, which primarily
focus on solving predictive problems, and
– models for unsupervised learning, which solve
descriptive problems.
• Predictive model
– Supervised learning
– The predictive models have a clear focus on what
they want to learn and how they want to learn.
– The models which are used for prediction of target
features of categorical value are known as
classification models.
• k-Nearest Neighbor (kNN), Naïve Bayes, and Decision
Tree.
– The models which are used for prediction of the
numerical value of the target feature of a data
instance are known as regression models.
• Linear Regression is a popular regression model.
(Logistic Regression, despite its name, is used for
classification rather than regression.)
• Descriptive Model
– Models for unsupervised learning
– Descriptive models which group together similar
data instances, i.e. data instances having a similar
value of the different features are called clustering
models.
• K-means
– Descriptive models used for pattern discovery in
transactional data (e.g. market basket analysis) are
called association analysis models.
Training a model
(for supervised learning)
• Hold out method
• K-fold Cross-validation method
• Bootstrap sampling
• Lazy vs. Eager learner
Holdout method
• Division of the input data is random
• Random numbers are used to assign data items
to the partitions. This method of partitioning
the input data into two parts – training data and
test data – by holding back a part of the input
data for validating the trained model is known
as the holdout method.
• Effect of a larger training set: generally a better-learned model
• Effect of a larger test set: a more reliable estimate of model performance
• Problem with the holdout method
– the division of data of different classes into the
training and test data may not be proportionate.
– Solution: stratification
• In stratified random sampling, the whole data is
broken into several homogeneous groups or strata,
and a random sample is selected from each stratum.
This ensures that the generated random partitions
have proportionate representation of each class.
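A minimal sketch of the holdout method with stratification, using scikit-learn's train_test_split; the data set and the split ratio are illustrative:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)

# Hold back 30% of the input data as test data; stratify=y keeps the
# class proportions the same in both partitions
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42, stratify=y
)
print(X_train.shape, X_test.shape)  # (105, 4) (45, 4)
```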
K-fold Cross-validation method
• Repeated holdout method
– Problem: the randomly chosen test sets may overlap
across repetitions; k-fold cross-validation avoids this
by using each partition exactly once as test data.
• Two approaches of the k-fold cross-validation
method
– 10-fold cross-validation (10-fold CV)
– Leave-one-out cross-validation (LOOCV)
Overall approach of k-fold cross-validation
Detailed approach for fold selection
• Leave-one-out cross-validation (LOOCV)
– Leave-one-out cross-validation (LOOCV) is an
extreme case of k-fold cross-validation that uses
one record or data instance at a time as the test
data. This is done to maximize the amount of data
used to train the model. The number of iterations
for which it has to be run is equal to the total
number of data instances in the input data set.
Hence, it is computationally very expensive and
not used much in practice.
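A minimal sketch of 10-fold cross-validation and LOOCV with scikit-learn; the kNN classifier and the data set are illustrative choices:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import KFold, LeaveOneOut, cross_val_score
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)
model = KNeighborsClassifier(n_neighbors=3)

# 10-fold CV: each of the 10 folds is used exactly once as test data
kfold_scores = cross_val_score(model, X, y,
                               cv=KFold(n_splits=10, shuffle=True, random_state=42))
print("10-fold CV accuracy:", kfold_scores.mean())

# LOOCV: one data instance at a time is held out (as many iterations as instances)
loo_scores = cross_val_score(model, X, y, cv=LeaveOneOut())
print("LOOCV accuracy:", loo_scores.mean())
```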
Bootstrap sampling
• This technique is particularly useful for input
data sets of small size, i.e. having a very small
number of data instances.
• It uses Simple Random Sampling with
Replacement (SRSWR).
• Bootstrapping can create one or more training
data sets having ‘n’ data instances, with some of
the data instances repeated multiple times.
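A minimal sketch of bootstrap sampling (SRSWR) with NumPy; the toy data set of 10 instances is illustrative:

```python
import numpy as np

rng = np.random.default_rng(42)
data = np.arange(10)  # a small input data set with n = 10 instances

# Draw n instances with replacement: some repeat, some are left out
bootstrap_sample = rng.choice(data, size=len(data), replace=True)
out_of_bag = np.setdiff1d(data, bootstrap_sample)

print("bootstrap training set:", bootstrap_sample)
print("out-of-bag instances:  ", out_of_bag)
```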
Lazy vs. Eager learner
• Eager learners take more time in the learning phase
than lazy learners. Algorithms that adopt the eager
learning approach include Decision Tree, Support
Vector Machine, and Neural Network.
• Lazy learners take very little time in training because
not much training actually happens. However, they
take quite some time in classification, as for each
tuple of test data a comparison-based assignment
of label happens. One of the most popular
algorithms for lazy learning is k-Nearest Neighbour.
MODEL REPRESENTATION AND
INTERPRETABILITY

• Underfitting
• Overfitting
• Bias – variance trade-off
• Underfitting
– When the target function is kept too simple
– Unavailability of a sufficient training data set
– Underfitting results in both poor performance on
the training data and poor generalization to the
test data.

– Underfitting can be avoided by
• using more training data
• increasing model complexity, e.g. by adding more
relevant features
• Overfitting
– a situation where the model has been designed in
such a way that it emulates the training data too
closely.
– Noise and outliers may get embedded in the model
– Overfitting results in good performance on the
training data set, but poor generalization and
hence poor performance on the test data set.
– Overfitting can be avoided by
• using re-sampling techniques like k-fold cross-validation
• holding back a validation data set
• removing the nodes which have little or no predictive
power for the given machine learning problem
(pruning; see the sketch below)
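A minimal sketch of controlling overfitting by limiting a decision tree's complexity (a pruning-style restriction via max_depth); the data set is illustrative and the exact scores will vary:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

full_tree = DecisionTreeClassifier(random_state=42).fit(X_train, y_train)
pruned_tree = DecisionTreeClassifier(max_depth=3, random_state=42).fit(X_train, y_train)

# The unconstrained tree typically fits the training data almost perfectly
# but tends to generalize worse than the constrained one
for name, tree in [("full tree", full_tree), ("depth-3 tree", pruned_tree)]:
    print(name, "train:", tree.score(X_train, y_train), "test:", tree.score(X_test, y_test))
```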
Bias – variance trade-off

• Bias
– Gap between predicted values and actual value
– Parametric models generally have high bias
making them easier to understand/interpret and
faster to learn. These algorithms have a poor
performance on data sets, which are complex in
nature and do not align with the simplifying
assumptions made by the algorithm.
– Underfitting results in high bias.
• Variance
– Errors due to variance arise from differences in the
training data sets used to train the model.
– The spread of the predicted values with respect to
each other
• Increasing a model's complexity typically lowers bias
but raises variance, and vice versa; a good model
strikes a balance between the two.
EVALUATING PERFORMANCE OF
A MODEL
• Supervised learning – classification
– Accuracy, error rate, kappa
– Sensitivity, specificity
– Precision, recall, F-measure
Understanding with cricket match win
example
• There are four possibilities with regards to the
cricket match win/loss prediction:
– The model predicted win and the team won (True Positive, TP)
– The model predicted win and the team lost (False Positive, FP)
– The model predicted loss and the team won (False Negative, FN)
– The model predicted loss and the team lost (True Negative, TN)
In this problem, the obvious class of interest is ‘win’.
• model accuracy is given by total number of
correct classifications (either as the class of
interest, i.e. True Positive or as not the class of
interest, i.e. True Negative) divided by total
number of classifications done.
Model accuracy = (TP + TN) / (TP + FP + FN + TN)
• Error rate: the percentage of misclassifications
Error rate = (FP + FN) / (TP + FP + FN + TN) = 1 − Model accuracy
• Kappa value of a model indicates the model accuracy
adjusted for the agreement expected purely by chance:
κ = (p_o − p_e) / (1 − p_e), where p_o is the observed
agreement (the model accuracy) and p_e is the expected
agreement by chance. Kappa value can be 1 at the
maximum, which represents perfect agreement between
the model’s predictions and the actual values.
• Sensitivity
– The sensitivity of a model measures the proportion
of positive cases (TP examples) which were
correctly classified.
Sensitivity = TP / (TP + FN)

• Specificity
– The specificity of a model measures the proportion
of negative examples which have been correctly
classified.
Specificity = TN / (TN + FP)
• There are two other performance measures of
a supervised learning model which are similar
to sensitivity and specificity.
– Precision: precision gives the proportion of
positive predictions which are truly positive.
Precision = TP / (TP + FP)
– Recall: recall indicates the proportion of correct
predictions of positives to the total number of
actual positives (it is the same measure as sensitivity).
Recall = TP / (TP + FN)
• Example:
Calculate model accuracy, error rate, kappa
value, sensitivity, specificity, precision, and recall
for the confusion matrix of the win/loss prediction
of the cricket match problem given below:
• F-measure is another measure of model
performance which combines precision and
recall. It takes the harmonic mean of
precision and recall:
F-measure = (2 × Precision × Recall) / (Precision + Recall)
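A minimal Python sketch that computes the measures above from a hypothetical win/loss confusion matrix (the counts below are illustrative, not the ones from the original example):

```python
TP, FP, FN, TN = 85, 4, 2, 9   # hypothetical confusion-matrix counts
total = TP + FP + FN + TN

accuracy    = (TP + TN) / total
error_rate  = (FP + FN) / total
sensitivity = TP / (TP + FN)            # same as recall
specificity = TN / (TN + FP)
precision   = TP / (TP + FP)
recall      = sensitivity
f_measure   = 2 * precision * recall / (precision + recall)

# Kappa: observed agreement adjusted for agreement expected by chance
p_o = accuracy
p_e = ((TP + FP) * (TP + FN) + (FN + TN) * (FP + TN)) / total ** 2
kappa = (p_o - p_e) / (1 - p_e)

print(accuracy, error_rate, kappa, sensitivity, specificity, precision, recall, f_measure)
```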
• Receiver operating characteristic (ROC) curves
– Receiver Operating Characteristic (ROC) curve
helps in visualizing the performance of a
classification model. It shows the efficiency of a
model in the detection of true positives while
avoiding the occurrence of false positives.
Supervised learning – regression

• A regression model which ensures that the
difference between the predicted and actual
values is low can be considered a good model.
• Simple linear regression: y = α + βx, where α is
the intercept and β is the slope of the fitted line.
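A minimal sketch of fitting a simple linear regression y = α + βx and measuring how far the predicted values are from the actual ones (MSE and R-squared); the synthetic data is illustrative:

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, r2_score

rng = np.random.default_rng(0)
x = rng.uniform(0, 10, size=50).reshape(-1, 1)
y = 2.0 + 3.0 * x.ravel() + rng.normal(0, 1, size=50)  # true alpha=2, beta=3 plus noise

model = LinearRegression().fit(x, y)
y_pred = model.predict(x)

print("alpha (intercept):", model.intercept_)
print("beta (slope):     ", model.coef_[0])
print("MSE:              ", mean_squared_error(y, y_pred))
print("R-squared:        ", r2_score(y, y_pred))
```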
Unsupervised learning – clustering

• What is clustering?
• Challenges which lie in the process of clustering:
– It is generally not known how many clusters can be formed
from a particular data set. This is completely open-ended in most
cases and is provided as a user input to a clustering algorithm.
– Even if the number of clusters is given, the same number of
clusters can be formed with different groups of data instances.
In other words, a clustering outcome cannot simply be judged
right or wrong, so its quality has to be measured.
• Popular approaches adopted for cluster quality
evaluation:
– Internal evaluation
– External evaluation
• Internal Evaluation
– The internal evaluation methods generally measure cluster quality
based on the homogeneity of data belonging to the same cluster and
the heterogeneity of data belonging to different clusters.
– The silhouette coefficient, one of the most popular internal
evaluation methods, uses distance (Euclidean or Manhattan distance
most commonly) between data elements as a similarity measure.
– The value of the silhouette width ranges between −1 and +1, with a high
value indicating high intra-cluster homogeneity and inter-cluster
heterogeneity.
• There are four clusters, namely clusters 1, 2, 3, and 4. Let’s consider
an arbitrary data element ‘i’ in cluster 1, represented by the asterisk.
a(i) is the average of the distances a_i1, a_i2, …, a_in1 of the different
data elements from the i-th data element in cluster 1, assuming
there are n1 data elements in cluster 1. Mathematically,
a(i) = (a_i1 + a_i2 + … + a_in1) / n1

• In the same way, let’s calculate the distances of the arbitrary data
element ‘i’ in cluster 1 from the different data elements of
another cluster, say cluster 4, and take the average of all those
distances:
b14(average) = (b_i1 + b_i2 + … + b_in4) / n4
• where n4 is the total number of elements in cluster 4. In the same
way, we can calculate the values of b12(average) and b13(average).
b(i) is the minimum of all these values.
• Hence, we can say that b(i) = minimum [b12(average),
b13(average), b14(average)]
• The silhouette width of ‘i’ is then s(i) = (b(i) − a(i)) / max{a(i), b(i)},
and the silhouette coefficient of the clustering is the average of
s(i) over all data elements.
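A minimal sketch of internal evaluation using the average silhouette coefficient with scikit-learn; the synthetic data and the choice of k = 4 clusters are illustrative:

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score

X, _ = make_blobs(n_samples=300, centers=4, random_state=42)
labels = KMeans(n_clusters=4, n_init=10, random_state=42).fit_predict(X)

# Average silhouette width over all data elements: values close to +1 indicate
# high intra-cluster homogeneity and inter-cluster heterogeneity
print("silhouette coefficient:", silhouette_score(X, labels, metric="euclidean"))
```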
• External Evaluation
– In this approach, the class label is known for the data
set subjected to clustering.
– The clustering algorithm is assessed based on how
close its results are to those known class labels. For
example, purity is one of the most popular measures
of cluster quality – it evaluates the extent to which
clusters contain a single class.
– For a data set having ‘n’ data instances and ‘c’
known class labels which generates ‘k’ clusters,
purity is measured as:
Purity = (1/n) × Σ over the k clusters of the count of the
majority (most frequent) class in each cluster
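A minimal sketch of computing purity from known class labels; the label arrays below are illustrative:

```python
import numpy as np
from sklearn.metrics import confusion_matrix

true_classes   = np.array([0, 0, 0, 1, 1, 1, 2, 2, 2])  # c = 3 known classes
cluster_labels = np.array([0, 0, 1, 1, 1, 1, 2, 2, 0])  # k = 3 generated clusters

# Rows: clusters, columns: known classes; take the majority-class count per cluster
contingency = confusion_matrix(cluster_labels, true_classes)
purity = contingency.max(axis=1).sum() / len(true_classes)
print("purity:", purity)  # 7 majority-class assignments out of 9 ≈ 0.78
```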
IMPROVING PERFORMANCE OF A MODEL
• Can we improve the performance of our model?
• Which model should be selected for which
machine learning task?
• We have already discussed earlier that model
selection is done based on several aspects:
– Type of learning task in hand, i.e. supervised or
unsupervised
– Type of the data, i.e. categorical or numeric
– Sometimes the problem domain
– Above all, experience in working with different models
to solve problems of diverse domains
• So, assuming that an appropriate model has been
selected, its performance can still be improved.
• Various methods to improve model
performance
– Model parameter tuning
• Model parameter tuning is the process of adjusting the
model fitting options.
• For example, the popular classification model
k-Nearest Neighbour (kNN) can be tuned by trying
different values of ‘k’, the number of nearest
neighbours to be considered (a sketch follows below).
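A minimal sketch of model parameter tuning: a grid search over different values of ‘k’ for kNN with 5-fold cross-validation (scikit-learn; the data set is illustrative):

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)

# Try several values of the model fitting option 'k' (n_neighbors)
param_grid = {"n_neighbors": [1, 3, 5, 7, 9, 11]}
search = GridSearchCV(KNeighborsClassifier(), param_grid, cv=5)
search.fit(X, y)

print("best k:", search.best_params_["n_neighbors"])
print("best cross-validated accuracy:", search.best_score_)
```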
• Ensemble
– Combining different models with diverse strengths is known as ensembling.
– An ensemble helps in averaging out the biases of the different underlying models and
also in reducing the variance. Ensemble methods combine weaker learners to
create stronger ones.
• Various methods of Ensemble
– bootstrap aggregating or bagging.
– Boosting
– Random Forest
• Following are the typical steps in the ensemble process:
– Build a number of models based on the training data.
– To diversify the models generated, the training data subset can be varied
using the allocation function. Sampling techniques like bootstrapping may be
used to generate unique training data sets.
– Alternatively, the same training data may be used but with quite different
models, e.g. SVM, neural network, kNN, etc.
– The outputs from the different models are combined using a combination
function. A very simple combination strategy, say for a prediction task
using an ensemble, is majority voting across the combined models. For
example, if 3 out of 5 models predict ‘win’ and 2 predict ‘loss’, the final
outcome of the ensemble using majority vote would be ‘win’.
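A minimal sketch of an ensemble that combines quite different models (SVM, kNN, decision tree) with majority voting as the combination function; the data set and the three base models are illustrative:

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import VotingClassifier
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

ensemble = VotingClassifier(
    estimators=[
        ("svm", SVC()),
        ("knn", KNeighborsClassifier(n_neighbors=5)),
        ("tree", DecisionTreeClassifier(random_state=42)),
    ],
    voting="hard",  # hard voting = majority vote of the combined models
)
print("ensemble CV accuracy:", cross_val_score(ensemble, X, y, cv=5).mean())
```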
