0% found this document useful (0 votes)

12 views39 pages

Pma 5

Uploaded by

927621bad030

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views39 pages

Pma 5

Uploaded by

927621bad030

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 39

PREDICTIVE MODELLING ANALYTICS

DIVYA M

SANTHANAM L
Corporate Trainer
UNIT V
INTRODUCTION TO MODEL
MODELLING ALGORITHMS
Modelling Algorithms are used provides machines the ability to
learn automatically by feeding lot of data.
TYPES OF MACHINE LEARNING

SUPERVISED LEARNING UNSUPERVISED REINFORCEMENT

LEARNING LEARNING
SUPERVISIED LEARNING
• Supervised learning is a technique in which we teach or train
the machine using data which is well labeled.
Supervised learning can be grouped further in three
categories of algorithms:
1. CLASSIFICATION
2. REGRESSION
3. SEGMENTATION
CLASSIFICATION
The model is trained in such a way that the output data is separated
into different labels (or categories) according to the given input data.
Output variable will be assigned to a category or class is Discrete
Value.
EXAMPLES:
1.To Predict the customer is
eligible for getting loan?
o/p: yes or no

2.Prdeict team India will win or

loss?
o/p: win or loss / yes or no
CLASSIFIER
The algorithm which implements the classification on a dataset is known as a
classifier. There are two types of Classifications:

• BINARY CLASSIFIER: This classification problem has only two possible

Outcomes.
Examples: YES or NO, MALE or FEMALE, SPAM or NOT SPAM, CAT or
DOG, etc.

• MULTI CLASS CLASSIFIER: This classification problem has more than two
outcomes.

Example: Classification of types of music.

REGRESSION
It is statistical method to model the relationship between a dependent
(target) and independent (predictor) variables with one or more
independent variables.

No labels defined-variable output is a continuous numerical value.

Examples:
1.To predict the whether for next 24
hours?
o/p: Continues value depends on
temperature

2.Predict share price?

o/p: Continues numeric range not
exacted.
1. Predict the House price in 2030?

2. Spam Email Detection

3. Temperature Forecasting

4. Stock Price Prediction

5. Medical Diagnosis

6. Predict Netflix Monthly income?

7. Predict Mr. Narendra Modi will win 2024?

8. Predict if an individual likes IPL?

SEGMENTATION
Segmentation, the technique of splitting customers into separate groups
depending on their attributes or behavior, makes this possible.

Cluster customers based on age, call usage, data usage, etc. in order to
divide them into gold, silver and bronze segments.

Each segment of customers can then be approached separately.

Cluster insurance claims and look for unusual cases within the groups.
This is also known as anomaly detection and is commonly used method to
detect fraud.
CREATING A MODEL IN IBM SPSS MODELER
When execute a model, a model nugget (a yellow diamond node)
is added to the stream canvas.
The model nugget stores the results of the analysis and is linked
to the modeling node.
The link ensures that when you rerun the model, for example with
other inputs, that the model nugget is updated with the new
results.
To view the model's results, open the model nugget. The output
depends on which model was executed. For example, you will have
a tree diagram when you run a CHAID node, a cluster profile
when you run a segmentation node, and a set of rules when you
execute an association model.
MODELLING PALETTE
The Modeling palette is organized into categories based on type of
models: each type is a sub palette.

Selecting one of the sub palettes will show all modeling nodes
suitable for that category.
Each type of model requires specific field roles:
❑ Supervised models require one of more input fields
(predictors) and a target field.
❑ Segmentation models only require input fields. The cluster
solution will be based on these fields. No target field is
specified.
❑ Association models involve rules where a field can appear both.
as input and as target
NEURAL NETWORKS
A Neural Network has an Input Layer, a Hidden Layer, and an
Output Layer.

The input layer consists of all predictors / input variables.

The output layer has the target variable.

Hidden Layer(s) are created automatically during model training.

A simple Neural Network is shown below:
The predictors have an effect on the target via a hidden layer. This extra
layer enables you to model non-linear relationships between the
predictors and the target.
Neural networks are generally viewed as powerful models but the
interpretation of the results is difficult. Even when you know the values
of all the coefficients (the connections in the diagram), it will still be
hard to establish the relationship between a predictor and the target
because of the hidden layer, and there can be more than one hidden
layer. That is why neural networks are referred to as black box models.
Neural Networks can score new data by just inputting the values of the
predictors and passing these values through the network, which will
return a value for the target.
INTRODUCTION TO LINEAR REGRESSION
Linear Regression is a linear approach to modeling the
relationship between a continuous target variable and one or
more predictor variables.

It is used to predict a continuous target by finding a linear

combination of predictors such that the correlation between the
actual values of the target and the predicted values of the target
is maximum.
Linear Node is available under the Modeling palette of SPSS Modeler.

Some examples of Linear Regression:

Predicting house prices (target) using input variables like total area
of house (square feet, square meter), number of rooms, distance
from nearest shopping mall, etc.
Predicting the weight of a person using input variables like height,
age, etc.
INTRODUCTION TO LOGISTIC REGRESSION
The logistic model is expressed in terms of a ratio: the probability that a
particular event occurs (a customer churns, a customer accepts an offer,
a claim is fraudulent, a customer does not pay back a loan, a student
passes an exam, and so forth) versus the probability that the event does
not occur. This ratio is known as odds.

Logistic Node can be found under the Modeling Palette

1. LOGISTIC REGRSSION
It is a simple and widely used algorithm for solving classification problems.

A classification algorithm used for binary classification, which estimates the

probability of belonging to a particular class using a sigmoid function.

In the Logistic Regression we will get a ‘S’ shaped sigmoid function .

This function is responsible for predicting values between 0 and 1.

THRESHOLD VALUE
The threshold value is a parameter to
determine the probability of the output
values.
The values that are higher than the
threshold value - probability of 1.
The values lower than the threshold
value - probability of 0.
Sigmoid or Logit function
It gives ‘S’ shaped curve that can taken any real
valued number and maps it to value between 0
and 1.

▪ 'z' is positive, the sigmoid function

approaches 1.

▪ 'z' is negative, the sigmoid function

approaches 0.
CASE STUDY:
Problem Statement: Predict the Bank Costumers Loan eligibility using
logistic regression.

Input data: Age Have Insurance

50 1
34 0
42 0
46 1
54 1
33 0
. .
. .
. .
23 0
1.Draw the Scatterplot for values
Have
Insurance

0.5

20 30 40 50 60 80

Age
2. Suppose Linear regression:-
Have
Insurance

0.5

20 30 40 50 60 80

Age
3. In Logistic Regression
Have
Insurance

0.5

20 30 40 50 60 80

Age
For example, when the probability that a customer pays back a loan is 4/5, then the
probability that that customer does not pay back the loan= 1- 4/5 = 1/5.

Therefore, the odds will be (4/5) / (1/5) = 4 for this customer. When the odds are 1,
you know that the probability for the event to occur equals the probability that the
event does not occur, and both probabilities are 0.5
The odds are linked to the predictors by the equation given as:

Here, exp (…) is another way to write e ^ (…), where e is, approximately,
the number 2.72.
Logistic Regression can be used for classification problems such as:
Predict whether a customer churns or not
Predict whether customer accepts an offer or not
Predict whether an email is spam or not
INTRODUCTION OF NEURAL NETWORKS
Neural networks attempt to solve problems using methods modeled on
how the human brain operates.

A typical neural network consists of several neurons arranged in layers to

create a network.

Each neuron can be thought of as a processing element that is given a

simple part of a task.

The connections between the neurons provide the network with the ability
to learn patterns and relationships in data.
MULTILAYER PERCEPTRON (MLP)
The Multilayer Perceptron consists of several processing units, the neurons, arranged in layers
to create a network.

The neurons in the input layer represent the predictors.

The neuron in the output layer represents the target.

Each neuron in the hidden layer receives an input based on a weighted combination of the
values of the neurons in the previous layer.

The neurons within the hidden layer are, in turn, combined to produce an output value, the
prediction.

This predicted value is compared to the actual value of the target and the difference between
the two values (the error) is fed back into the network (known as "back propagation"), which
in turn is updated
HOW DOES A MULTILAYER PERCEPTRON
NEURAL NETWORK LEARN?
Consider the example of a child learning the difference between an apple and a pear. The child
currently does not know the difference between an apple and a pear.

When shown the first example of a fruit, the child may look at the fruit and decide that it is round,
red in color and of a particular weight.

Not knowing what an apple or a pear actually looks like, the child may decide to place equal
importance on each of these factors.

The importance is what a network refers to as weights. At this stage the child is most likely to
randomly choose either an apple or a pear for the prediction. On being told the correct response, the
child will increase or decrease the relative importance of each of the factors to improve the decision
(reduce the error).
In a similar fashion, a Multilayer Perceptron network begins with
random weights placed on each of the inputs and generates a
predicted value of target.
On being told the actual value of the target, the network adjusts
these internal weights. In time, the child and the network will
hopefully make correct predictions.
RADIAL BASIS FUNCTION (RBF)
The Radial Basis Function (RBF) is a more recent type of network and
is quicker to train than the Multilayer Perceptron.
The RBF can be thought of performing a type of clustering within the
input space, encircling individual clusters of data by a number of
so-called basis functions.
If a data point falls within the region of activation of a particular basis
function, then the neuron corresponding to that basis function
responds most strongly.
The selection of the centers of each basis function is where difficulties
arise.
RBF networks are typically quicker to train than a MLP, and it can
model data that are clustered within the input space.

Classification
100% (2)
Classification
105 pages
ML 1 PPT Unit 1
No ratings yet
ML 1 PPT Unit 1
93 pages
The Basics of Data Analytics
89% (9)
The Basics of Data Analytics
17 pages
ML - MU - Unit - 2 - Supervised Learning-Classification Techniques
No ratings yet
ML - MU - Unit - 2 - Supervised Learning-Classification Techniques
153 pages
Lecture - 2 & 3
No ratings yet
Lecture - 2 & 3
62 pages
MLT Unit 2 Linear Regression
No ratings yet
MLT Unit 2 Linear Regression
26 pages
Machine Learning
No ratings yet
Machine Learning
115 pages
Supervised Learning
No ratings yet
Supervised Learning
187 pages
Machine Learning & AI
No ratings yet
Machine Learning & AI
38 pages
Machine Learning
No ratings yet
Machine Learning
133 pages
Ds Module 4
No ratings yet
Ds Module 4
73 pages
BAI 3303 Notes
No ratings yet
BAI 3303 Notes
12 pages
Machine Learning Basics
No ratings yet
Machine Learning Basics
39 pages
16 Comparison of Data Science Algorithms
No ratings yet
16 Comparison of Data Science Algorithms
13 pages
ML Unit-IV Notes
No ratings yet
ML Unit-IV Notes
49 pages
Machine Learning
No ratings yet
Machine Learning
100 pages
ML LVC 1 Post-Session Summary
No ratings yet
ML LVC 1 Post-Session Summary
15 pages
Chapter - 2-ML
No ratings yet
Chapter - 2-ML
63 pages
Supervised and Unsupervised Learning
No ratings yet
Supervised and Unsupervised Learning
92 pages
Machine Learning
No ratings yet
Machine Learning
14 pages
ML QB
No ratings yet
ML QB
13 pages
Unit-4 Pda
No ratings yet
Unit-4 Pda
111 pages
Predictive Analys
No ratings yet
Predictive Analys
34 pages
ML Introduction
No ratings yet
ML Introduction
76 pages
Notes5 Regression
No ratings yet
Notes5 Regression
14 pages
Slide 1
No ratings yet
Slide 1
29 pages
Unit 3 DSA
No ratings yet
Unit 3 DSA
69 pages
Supervised Learning
No ratings yet
Supervised Learning
24 pages
Machine Learning Supervised
No ratings yet
Machine Learning Supervised
42 pages
Unit 1 Machine Learning
No ratings yet
Unit 1 Machine Learning
68 pages
SMDS Unit 5
No ratings yet
SMDS Unit 5
21 pages
Machine Learning For Quants
No ratings yet
Machine Learning For Quants
13 pages
Linear Regression For ML Ass
No ratings yet
Linear Regression For ML Ass
99 pages
DAC ML Tutorial Final Deck
No ratings yet
DAC ML Tutorial Final Deck
150 pages
Machine Learning Reg
No ratings yet
Machine Learning Reg
45 pages
CE880 Lecture5 Slides
No ratings yet
CE880 Lecture5 Slides
32 pages
SDL Unit 1
No ratings yet
SDL Unit 1
7 pages
INTRODUCTION
No ratings yet
INTRODUCTION
51 pages
Big Data Analytics - Unit 3
No ratings yet
Big Data Analytics - Unit 3
55 pages
Machine Learning Ppts
No ratings yet
Machine Learning Ppts
38 pages
Regression Vs Classification in Machine Learning Explained!
No ratings yet
Regression Vs Classification in Machine Learning Explained!
10 pages
Module 3
No ratings yet
Module 3
63 pages
Fam QB Ans
No ratings yet
Fam QB Ans
9 pages
Machine Learning
No ratings yet
Machine Learning
32 pages
IB Biology Lab Manual
100% (4)
IB Biology Lab Manual
87 pages
Q No. 1 1.1machine Learning:: Machine Learning Is The Study of Computer Algorithms That Improve Automatically
No ratings yet
Q No. 1 1.1machine Learning:: Machine Learning Is The Study of Computer Algorithms That Improve Automatically
10 pages
Week - 03 Week04
No ratings yet
Week - 03 Week04
32 pages
Machine Learning (Chapter1)
No ratings yet
Machine Learning (Chapter1)
8 pages
Machine Learning
No ratings yet
Machine Learning
33 pages
Chapter 4 Classification
No ratings yet
Chapter 4 Classification
78 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
24 pages
CS601 - Machine Learning - Unit 1 - Notes - 1672759748
No ratings yet
CS601 - Machine Learning - Unit 1 - Notes - 1672759748
13 pages
AI Notes
No ratings yet
AI Notes
8 pages
Lecture 9
No ratings yet
Lecture 9
27 pages
Ijcrt 195700
No ratings yet
Ijcrt 195700
7 pages
AI ML 3 Updated
No ratings yet
AI ML 3 Updated
34 pages
ML 01 (Shubham)
No ratings yet
ML 01 (Shubham)
14 pages
ML 01 (Pranavv)
No ratings yet
ML 01 (Pranavv)
14 pages
Machine Learning Shortnote
No ratings yet
Machine Learning Shortnote
14 pages
Agenda: - Introduction - Basics - Classification - Clustering - Regression - Use-Cases
No ratings yet
Agenda: - Introduction - Basics - Classification - Clustering - Regression - Use-Cases
30 pages
Machine Learning
No ratings yet
Machine Learning
9 pages
Nursing Research 1 Notes
No ratings yet
Nursing Research 1 Notes
18 pages
Ba Projet Term 2
No ratings yet
Ba Projet Term 2
18 pages
Statistics and Probability: Performance Task 2 2 Quarter
100% (1)
Statistics and Probability: Performance Task 2 2 Quarter
4 pages
Psychology: (9th Edition) David Myers
No ratings yet
Psychology: (9th Edition) David Myers
58 pages
LECTURE NOTES - Practical Research I
No ratings yet
LECTURE NOTES - Practical Research I
11 pages
ECO - Chapter 2 SLRM
No ratings yet
ECO - Chapter 2 SLRM
40 pages
Forecasting Final
No ratings yet
Forecasting Final
51 pages
I.R. Chapter-2 Dr. K.pandit
No ratings yet
I.R. Chapter-2 Dr. K.pandit
10 pages
Ch-04: Data and Analysis - MCQs - PDF
No ratings yet
Ch-04: Data and Analysis - MCQs - PDF
8 pages
The Impact of Service Delivery System Effectiveness On Service Quality
No ratings yet
The Impact of Service Delivery System Effectiveness On Service Quality
26 pages
Chapter 2
No ratings yet
Chapter 2
73 pages
Influence of Recreational Activities On Stress Sub
No ratings yet
Influence of Recreational Activities On Stress Sub
11 pages
Arnason A, Sigurdsson SB, Gudmundsson A, Holme I, Engebretsen L, Bahr R. Risk Factors For Injuries in Football
No ratings yet
Arnason A, Sigurdsson SB, Gudmundsson A, Holme I, Engebretsen L, Bahr R. Risk Factors For Injuries in Football
14 pages
DCM PEB Tutorial
No ratings yet
DCM PEB Tutorial
28 pages
Output
No ratings yet
Output
18 pages
7 HR Data Sets For People Analytics - AIHR Analytics
No ratings yet
7 HR Data Sets For People Analytics - AIHR Analytics
14 pages
Brown Durbin CUSUM
No ratings yet
Brown Durbin CUSUM
15 pages
Multiphase Flowing BHP Prediction
No ratings yet
Multiphase Flowing BHP Prediction
15 pages
Machine Learning-Based Maternal Health Risk Predic
No ratings yet
Machine Learning-Based Maternal Health Risk Predic
15 pages
7 Regression
No ratings yet
7 Regression
96 pages
Audit Market Concentration and Audit Quality of Listed Industrial Firms in Nigeria
No ratings yet
Audit Market Concentration and Audit Quality of Listed Industrial Firms in Nigeria
13 pages
Hoffmann Post 2014 JBEE PDF
No ratings yet
Hoffmann Post 2014 JBEE PDF
6 pages
Bridgesetal2007-Patientpreferencemethods ISPOR
No ratings yet
Bridgesetal2007-Patientpreferencemethods ISPOR
4 pages
How Online Social Network Providers' Privacy Policies Impact Users' Information Sharing Behavior
No ratings yet
How Online Social Network Providers' Privacy Policies Impact Users' Information Sharing Behavior
11 pages
1 s2.0 S31 Main
No ratings yet
1 s2.0 S31 Main
6 pages
Mid Term Test BFT 64
No ratings yet
Mid Term Test BFT 64
3 pages
"Effect of Store Atmosphere On Consumer Purchase Intention": Munich Personal Repec Archive
No ratings yet
"Effect of Store Atmosphere On Consumer Purchase Intention": Munich Personal Repec Archive
10 pages
The Predictive Power of Adult Attachment Patterns On Interpersonal Cognitive Distortions of University Students
No ratings yet
The Predictive Power of Adult Attachment Patterns On Interpersonal Cognitive Distortions of University Students
10 pages
Machine Learning Interview Questions
From Everand
Machine Learning Interview Questions
Tech Interviews
4.5/5 (2)

Pma 5

Uploaded by

Pma 5

Uploaded by

PREDICTIVE MODELLING ANALYTICS

SUPERVISED LEARNING UNSUPERVISED REINFORCEMENT

2.Prdeict team India will win or

• BINARY CLASSIFIER: This classification problem has only two possible

Example: Classification of types of music.

No labels defined-variable output is a continuous numerical value.

2.Predict share price?

2. Spam Email Detection

4. Stock Price Prediction

6. Predict Netflix Monthly income?

7. Predict Mr. Narendra Modi will win 2024?

8. Predict if an individual likes IPL?

Each segment of customers can then be approached separately.

The input layer consists of all predictors / input variables.

The output layer has the target variable.

Hidden Layer(s) are created automatically during model training.

It is used to predict a continuous target by finding a linear

Some examples of Linear Regression:

Logistic Node can be found under the Modeling Palette

A classification algorithm used for binary classification, which estimates the

In the Logistic Regression we will get a ‘S’ shaped sigmoid function .

This function is responsible for predicting values between 0 and 1.

▪ 'z' is positive, the sigmoid function

▪ 'z' is negative, the sigmoid function

Input data: Age Have Insurance

A typical neural network consists of several neurons arranged in layers to

Each neuron can be thought of as a processing element that is given a

The neurons in the input layer represent the predictors.

The neuron in the output layer represents the target.

You might also like