Module 5 – Prediction
• Introduction: Overview,
• Classification,
• Regression,
• Building a Prediction Model,
• Applying a Prediction Model,
• Simple Linear Regression,
• Simple Non-Linear Regression.
Machine learning is divided into two main categories:
• Supervised learning
• Unsupervised learning
In supervised learning, an algorithm learns the mapping function from the input (x) to the output (y).
A supervised learning technique:
Regression
How does regression work?
•Regression models use an algorithm to understand the
relationship between a dependent variable (the output) and
one or more independent variables (the inputs).
y = wX + b
Simple linear regression only has one y variable and one x variable.
Example: predicting umbrellas sold from rainfall.
• The independent variable x: rainfall measured in millimeters
• The dependent variable y: umbrellas sold
[Figure: scatterplot of umbrellas sold against rainfall, with a fitted line]
1 We fit a line y = wx + b through the data points, where w is the slope (weight)
2 We measure the distance between the line and each datapoint (the residuals)
3 We sum up the squared residuals; the best-fitting line minimizes this sum
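The steps above can be sketched with the closed-form least-squares estimates. The rainfall/umbrella numbers below are made up and chosen to lie exactly on a line, so the recovered slope and intercept are easy to check:

```python
import numpy as np

# Hypothetical data, constructed to lie exactly on y = 0.8*x + 8
# so that the fit can be verified.
rainfall = np.array([10.0, 12.0, 15.0, 18.0, 20.0, 25.0])   # x, in mm
umbrellas = np.array([16.0, 17.6, 20.0, 22.4, 24.0, 28.0])  # y, units sold

# Closed-form least-squares estimates for y = w*x + b:
# w minimizes the sum of squared residuals between line and points.
x_mean, y_mean = rainfall.mean(), umbrellas.mean()
w = np.sum((rainfall - x_mean) * (umbrellas - y_mean)) / np.sum((rainfall - x_mean) ** 2)
b = y_mean - w * x_mean

# The residuals and their sum of squares (what least squares minimizes).
residuals = umbrellas - (w * rainfall + b)
sse = np.sum(residuals ** 2)
```

Because the points sit exactly on a line, the fit recovers w = 0.8 and b = 8 with essentially zero residual sum of squares.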
y = w₁x₁ + w₂x₂ + … + b
•Multiple linear regression relates many independent variables
to one dependent variable
•It suits datasets with multiple features, such as the number of
bedrooms, age of the building, covered area, etc.
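The multiple-regression equation above can be fitted by ordinary least squares. A sketch with hypothetical housing data — the feature values and the "true" weights are invented so the fit can be verified:

```python
import numpy as np

# Hypothetical features: [bedrooms, age (years), covered area (100 m^2)]
X = np.array([
    [3.0, 10.0, 1.2],
    [2.0, 30.0, 0.8],
    [4.0,  5.0, 1.5],
    [3.0, 20.0, 1.0],
    [5.0,  2.0, 2.0],
])
# Prices generated from known (made-up) weights and intercept:
# y = 50*bedrooms - 1*age + 100*area + 20
true_w = np.array([50.0, -1.0, 100.0])
y = X @ true_w + 20.0

# Append a column of ones so the intercept b is estimated with the weights.
X1 = np.hstack([X, np.ones((len(X), 1))])
coef, *_ = np.linalg.lstsq(X1, y, rcond=None)
w, b = coef[:-1], coef[-1]
```

Since y was generated exactly from the model, least squares recovers the weights (50, −1, 100) and intercept 20.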
How can we evaluate the
performance of a regression model?
• We use performance evaluation metrics
• The most commonly used metrics are based on the difference
between the predicted and actual values of some test points:
• The mean of the squared differences is the Mean Squared
Error (MSE)
• The size of the error, in the same units as y, is measured by
taking the square root of MSE – the Root Mean Squared Error (RMSE)
Evaluating the performance of a regression model
using MSE & RMSE
MSE = (1/n) Σᵢ (yᵢ − ŷᵢ)²   (sum over i = 1, …, n)

RMSE = (MSE)^(1/2)

where:
MSE = Mean squared error        yᵢ = Observed values
n = Number of data points       ŷᵢ = Predicted values
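These two metrics can be computed directly; a minimal sketch with a small worked example (the test points are made up):

```python
import numpy as np

def mse(y_true, y_pred):
    """Mean of the squared differences between observed and predicted values."""
    y_true, y_pred = np.asarray(y_true, float), np.asarray(y_pred, float)
    return np.mean((y_true - y_pred) ** 2)

def rmse(y_true, y_pred):
    """Square root of MSE, expressed in the same units as y."""
    return np.sqrt(mse(y_true, y_pred))

# Hypothetical test points:
y_actual    = [3.0, 5.0, 7.0, 9.0]
y_predicted = [2.5, 5.0, 7.5, 10.0]
# Squared errors: 0.25, 0, 0.25, 1.0 -> MSE = 1.5 / 4 = 0.375
error = mse(y_actual, y_predicted)
root_error = rmse(y_actual, y_predicted)
```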
Simple Non-Linear Regression
• In situations where the relationship between two
variables is nonlinear, a simple way of generating a
regression equation is to transform the nonlinear
relationship to a linear relationship using a
mathematical transformation.
• A linear model can then be generated.
• Once a prediction has been made, the predicted value
is transformed back to the original scale.
• For example, in Table 7.10 two columns show a
nonlinear relationship.
Simple Non-Linear Regression
• Plotting these values results in the scatterplot in
Figure 7.13.
• There is no linear relationship between these two
variables and hence we cannot calculate a linear model
directly from the two variables.
• To generate a model, we transform x or y or both to
create a linear relationship.
• In this example, we transform the y variable using the
transformation y′ = −1/y (whose inverse is y = −1/y′).
• We now generate a new column, y′ (Table 7.12). Plotting x
against y′ shows an approximately linear relationship (see
Figure 7.14).
• Using x we can now calculate a
predicted value for the
transformed value of y (y’).
• To map this prediction of y′ back to the original scale,
we apply the inverse transformation, y = −1/y′.
• In Table 7.12, we have calculated
the predicted value for y’ and
transformed the number to
Predicted y.
• The Predicted y values are close to
the actual y values.
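The transform–fit–invert procedure can be sketched as below. The data are generated from y = −1/(0.5x + 1), a made-up relationship chosen so that the transform y′ = −1/y linearizes it exactly:

```python
import numpy as np

# Hypothetical nonlinear data: y = -1/(0.5*x + 1),
# so the transform y' = -1/y yields the exact line y' = 0.5*x + 1.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = -1.0 / (0.5 * x + 1.0)

# Step 1: transform y to create a linear relationship.
y_t = -1.0 / y

# Step 2: fit a linear model to (x, y').
w, b = np.polyfit(x, y_t, 1)

# Step 3: predict on the transformed scale, then invert (y = -1/y').
x_new = 10.0
y_t_pred = w * x_new + b
y_pred = -1.0 / y_t_pred
```

Because the transformed data lie exactly on a line, the fit recovers the slope 0.5 and intercept 1, and the back-transformed prediction at x = 10 matches −1/(0.5·10 + 1) = −1/6.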
• Some common nonlinear relationships are shown in Figure 7.15.
• The following transformations may create a linear relationship for the
charts shown:
Situation a: Transformations on the x, y or both x and y
variables such as log or square root
Situation b: Transformation on the x variable such as square
root, log or -1/x.
Situation c: Transformation on the y variable such as square
root, log or -1/y.
• This approach of creating simple nonlinear models can only be
used when there is a clear transformation of the data to a linear
relationship.
(See Figures 7.14 and 7.15.)
A supervised learning technique:
classification
What is classification?
•Classification is the process of categorizing a given set
of data into classes. The pre-defined classes act as our
labels, or ground truth.
•The model uses the features of an object to predict its
label. E.g., filtering spam from non-spam emails, or
classifying types of fruit based on their color, weight,
and size.
What types of problems does
classification solve?
There are two types of classification problems:
• Binary
• Multi-class
Support vector machine (SVM)
• What is a support vector machine (SVM)?
• A support vector machine (SVM) is a supervised ML technique
that can be used to solve classification and regression
problems. It is, however, mostly used for classification.
• In this algorithm, each data point is plotted in the feature
space. The SVM model then finds the boundary that best
separates the data samples into their classes.
A practical example: finding a 2D
plane that differentiates two classes
Let’s say we have a dataset of
different animals of two classes:
birds & fish
•There are only three features:
body weight, body length, and
daily food consumption
•We draw a 3D grid and plot all
these points
If there are more than three
features, we would have a hyper-
space
A hyper-space is a space with more than 3 dimensions (4D, 5D,
etc.), and a separating boundary in such a space is called a
hyper-plane.
•If a linear boundary separates the classes, the SVM is called
a Linear Kernel SVM
•For nonlinear boundaries, a Polynomial Kernel or other
advanced kernels are used
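As a sketch (assuming scikit-learn is available), a linear-kernel SVM on a tiny, made-up, linearly separable two-class dataset — the points and feature meanings are invented for illustration:

```python
from sklearn.svm import SVC

# Hypothetical 2D points for two linearly separable classes
# (say, fish vs. birds on two made-up features).
X = [[1.0, 1.0], [1.5, 0.5], [2.0, 1.0], [1.0, 0.5],   # class 0
     [4.0, 4.0], [4.5, 3.5], [5.0, 4.0], [4.0, 3.0]]   # class 1
y = [0, 0, 0, 0, 1, 1, 1, 1]

# A linear-kernel SVM finds the separating line with the widest margin.
clf = SVC(kernel="linear", C=1.0)
clf.fit(X, y)

# Classify two new points, one near each cluster.
pred = clf.predict([[1.2, 0.8], [4.2, 3.8]])
```

Since the classes are cleanly separable, the model classifies both training data and the two new points correctly.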
What is a Prediction Model
• Predictive models are used in many situations where an
estimate or forecast is required.
• Ex: To project sales or forecast the weather.
A portion of the observations is shown in the table below.
• A model to predict the car fuel efficiency was
built using:
• The MPG variable as the response and,
• The Cylinders, Displacement, Horsepower, Weight
and Acceleration variables as descriptors.
Ex: The observations in the table below could be presented to the model, and the model
would predict the MPG column.
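The build-then-apply workflow could be sketched as below; the observations and their values are invented stand-ins for the table, not the actual data:

```python
import numpy as np

# Hypothetical training observations: columns are
# Cylinders, Displacement, Horsepower, Weight, Acceleration.
X_train = np.array([
    [8, 307.0, 130.0, 3504.0, 12.0],
    [4,  97.0,  46.0, 1835.0, 20.5],
    [6, 199.0,  97.0, 2774.0, 15.5],
    [4, 120.0,  87.0, 2979.0, 19.5],
    [8, 318.0, 150.0, 3436.0, 11.0],
    [4, 113.0,  95.0, 2372.0, 15.0],
], dtype=float)
mpg_train = np.array([18.0, 26.0, 25.0, 21.0, 15.0, 24.0])  # response (MPG)

# Build: fit MPG = w . descriptors + b by least squares.
A = np.hstack([X_train, np.ones((len(X_train), 1))])
coef, *_ = np.linalg.lstsq(A, mpg_train, rcond=None)

# Apply: predict the MPG column for an unseen observation.
X_new = np.array([[4, 107.0, 90.0, 2430.0, 14.5]])
mpg_pred = np.hstack([X_new, np.ones((1, 1))]) @ coef
```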
• There are many methods for building prediction models
and they are often characterized based on the response
variable.
• When the response is a categorical variable, the model
is called a classification model.
• When the response is a continuous variable, then the
model is called a regression model.
• The table below summarizes some of the methods available:
• There are two distinct phases, each with a unique set of
processes and issues to consider:
• Building
• Applying
Building a Prediction Model
• Building:
• The prediction model is built using existing data
called the training set.
• This training set contains examples with values for
the descriptor and response variables.
• The training set is used to determine and quantify
the relationships between the input descriptors
and the output response variables.
• This set will be divided into observations used to
build the model and observations used to assess the
quality of any model built.
A. Preparing the Data set
• It is important to prepare a data set prior to modeling.
• Preparation should include the operations outlined
such as characterizing, cleaning, and transforming the
data.
• Particular care should be taken to determine whether
subsetting the data is needed to simplify the resulting
models.
B. Designing a Modelling Experiment:
• Building a prediction model is an experiment.
• It will be necessary to build many models, without
knowing in advance which model will be the ‘best’.
• This experiment should be appropriately designed to
ensure an optimal result.
• There are three major dimensions that should be
explored:
1. Different models:
• There are many different approaches to building prediction
models.
• A series of alternative models should be explored since all
models work well in different situations.
• The initial list of modeling techniques to be explored can be
based on the criteria previously defined as important to the
project.
3. Model parameters:
• Most predictive models can be optimized by fine-tuning
different model parameters.
• Building a series of models with different parameter
settings and comparing the quality of each model will allow
you to optimize the model.
• For example, when building a neural network model there
are a number of settings that will influence the quality of
the models built, such as the number of cycles or the
number of hidden layers.
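The parameter-sweep idea can be sketched with a simpler model than a neural network. Here a hypothetical polynomial-degree parameter is tuned by fitting one model per setting and comparing validation MSE; the data-generating function and noise level are made up:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data from a quadratic relationship plus noise.
x = np.linspace(-3, 3, 60)
y = 1.0 + 2.0 * x + 0.5 * x**2 + rng.normal(0, 0.2, x.size)

# Hold out every third point for validation.
val = np.arange(x.size) % 3 == 0
x_tr, y_tr, x_va, y_va = x[~val], y[~val], x[val], y[val]

# Build one model per parameter setting (the polynomial degree here)
# and compare validation MSE to choose the best setting.
results = {}
for degree in [1, 2, 3, 5, 8]:
    coeffs = np.polyfit(x_tr, y_tr, degree)
    pred = np.polyval(coeffs, x_va)
    results[degree] = np.mean((y_va - pred) ** 2)

best_degree = min(results, key=results.get)
```

A degree-1 model clearly underfits the quadratic data, so any degree of 2 or more beats it on the validation set; the same sweep-and-compare pattern applies to neural-network settings such as the number of hidden layers.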
• What counts as the ‘best’ model depends on the objective
of the modeling process defined at the start of the
project.
• Other issues, for example, the ability to explain how a
prediction was made, may also be important and
should be taken into account when assessing the
models generated.
• Wherever possible, when two or more models give
comparable results, the simpler model should be
selected.
C. Separating Test and Training Sets:
• The goal of building a predictive model is to generalize
the relationship between the input descriptors and the
output responses.
• The quality of the model depends on how well the
model is able to predict correctly for a given set of
input descriptors.
• If the model generalizes the input/output relationships
too much, it will miss real patterns and its accuracy will
be low – underfitting.
• If the model does not generalize the relationships
enough (it fits the training data too closely), then it
will have difficulty making predictions for observations
not included in the data set used to build the model –
overfitting.
• Hence, when assessing the quality of the model, it is
important to use a data set to build the model, which is
different from the data set used to test the accuracy of
the model.
• There are a number of ways for achieving this
separation of test and training set.
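One simple way of achieving this separation is a random split. A minimal sketch (the 25% test fraction, the seed, and the data are arbitrary choices):

```python
import numpy as np

def train_test_split(X, y, test_fraction=0.25, seed=42):
    """Randomly separate observations into a training and a test set."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(X))       # shuffle the observation indices
    n_test = int(len(X) * test_fraction)
    test_idx, train_idx = idx[:n_test], idx[n_test:]
    return X[train_idx], X[test_idx], y[train_idx], y[test_idx]

# Hypothetical dataset of 20 observations with 2 descriptors each.
X = np.arange(40, dtype=float).reshape(20, 2)
y = np.arange(20, dtype=float)
X_train, X_test, y_train, y_test = train_test_split(X, y)
```

The model is then built on (X_train, y_train) and its accuracy assessed on the held-out (X_test, y_test), which it never saw during fitting.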
Applying a Prediction Model
• Applying:
• Once a model has been built, a data set with no
output response variables can be fed into this
model and the model will produce an estimate for
this response.
• A measure that reflects the confidence in this
prediction is often calculated along with an
explanation of how the value was generated.
• Once a model has been built and verified, it can be
used to make predictions.
• Along with the presentation of the prediction, there
should be some indications of the confidence in this
value.
• During the data preparation step of the process, the
descriptors and/or the response variables may have been
translated to facilitate analysis.
• Once a prediction has been made, the variables should be
translated back into their original format prior to
presenting the information to the end user.
• For example, the log of the variable Weight was taken in
order to create a new variable log(Weight) since the original
variable was not normally distributed.
• This variable was used as a response variable in a model.
• Before any results are presented to the end user, the
log(Weight) response should be translated back to Weight
by applying the inverse of the log (exponentiation) and
presenting the value on the original weight scale.
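A sketch of this translate-then-translate-back step, using a hypothetical Weight variable constructed so that log(Weight) is exactly linear in x:

```python
import numpy as np

# Hypothetical skewed response: Weight grows exponentially with x,
# so log(Weight) is linear in x (here exactly: log(Weight) = 0.4*x + 2).
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
weight = np.exp(0.4 * x + 2.0)           # Weight on the original scale

# Translate for analysis: model the log of the response.
log_w = np.log(weight)
slope, intercept = np.polyfit(x, log_w, 1)

# Predict on the log scale, then translate back before presenting:
x_new = 6.0
log_pred = slope * x_new + intercept
weight_pred = np.exp(log_pred)           # inverse of the log transform
```

The user-facing prediction is weight_pred, on the original weight scale; log_pred on its own would be meaningless to the end user.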
• When applying these models to new data, some
criteria will need to be established as to which model
the observation will be presented to.
• For example, a series of models predicting house
prices in different locations such as coastal,
downtown, and suburbs were built.
• When applying these models to a new data set, the
observations should be applied only to the appropriate
model.