
Regression

Lecture 04

School of Energy Science & Engineering


Topics today…

An Overview

• Introduction to Regression

• Types of Regression

• Key Concepts

• Applications

• Conclusion



Regression

Definition
Regression analysis is a statistical method for modelling the relationship between a dependent variable y (the output) and one or more independent variables x (the inputs).

Purpose
To predict and forecast outcomes, and to understand the strength and type of relationships.

For classification the output(s) is nominal; in regression the output is continuous.

Function Approximation
Many models could be used – the simplest is linear regression.
Fit the data with the best hyperplane which "goes through" the points. For each point, the difference between the predicted value and the actual observation is the residual.
Linear Regression
Linear regression is like fitting a line or (hyper)plane to a set of points.

[Figure: three panels – a single (original) feature fit with a line; data where a nonlinear curve is needed; two features fit with a plane (linear)]

For now, assume just one (input) independent variable x and one (output) dependent variable y.
Multiple linear regression assumes an input vector x.
Multivariate linear regression assumes an output vector y.


Types of Regression
• Linear Regression
• Multiple Regression
• Logistic Regression
• Polynomial Regression
• Ridge Regression
• Lasso Regression
• Elastic Net Regression
• Quantile Regression
• Non-Linear Regression



Simple Linear Regression
We "fit" the points with a line (i.e. hyperplane)
Which line should we use?
Choose an objective function
For simple linear regression we use sum squared
residue (SSR)
SS (predictedi – actuali)2 = SS (residuei)2
Thus, find the line which minimizes the sum of the
squared residues (e.g. least squares)
This exactly mimics the case assuming data points
were sampled from an actual target hyperplane with
Gaussian noise added

School of Energy Science & Engineering


Numerical
You are given the following dataset representing the relationship between the
number of hours studied and the scores achieved by students in a test.

Hours Studied (X) Test Score (Y)

2 50
3 60
5 80
7 90
8 95

A linear regression model is proposed as: Y=5X+40


1. Calculate the predicted test scores for each value of hours studied (X) using the given linear regression model.
2. Compute the Sum of Squared Errors (SSE) between the actual test scores and the predicted test scores.
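A minimal Python sketch of this exercise (the dataset and the model Y = 5X + 40 come from the problem statement above; the variable names are my own):

```python
# Dataset from the problem: hours studied (X) and actual test scores (Y)
X = [2, 3, 5, 7, 8]
Y = [50, 60, 80, 90, 95]

# Proposed model: Y = 5X + 40
predicted = [5 * x + 40 for x in X]

# Sum of Squared Errors between actual and predicted scores
sse = sum((y - p) ** 2 for y, p in zip(Y, predicted))

print(predicted)  # [50, 55, 65, 75, 80]
print(sse)        # 700
```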



Learning parameters
For the 2-d problem (a line) there are two coefficients: the bias and the coefficient of the independent variable (the y-intercept and the slope):

y = β0 + β1x

To find the values of the coefficients (weights) which minimize the objective function, we take the partial derivatives of the objective function (SSE) with respect to the coefficients, set these to 0, and solve:

β1 = (n∑xy − ∑x∑y) / (n∑x² − (∑x)²)
β0 = (∑y − β1∑x) / n
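As a sketch, these closed-form formulas translate directly into Python (the function name fit_simple_linear is my own, not from the slides):

```python
def fit_simple_linear(x, y):
    """Closed-form least-squares fit for y = b0 + b1*x."""
    n = len(x)
    sum_x, sum_y = sum(x), sum(y)
    sum_xy = sum(xi * yi for xi, yi in zip(x, y))
    sum_x2 = sum(xi ** 2 for xi in x)
    # Slope and intercept from the formulas above
    b1 = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
    b0 = (sum_y - b1 * sum_x) / n
    return b0, b1
```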



Numerical
Problem Statement:
Suppose we have a dataset that shows the number of hours studied by a
student and their corresponding scores on a test. The goal is to predict the
regression function for test score (y) based on the number of hours studied
(x).

Hours Studied (x) Test Score (y)


1 2
2 4
3 5
4 4
5 5
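Using the fit_simple_linear sketch from the previous slide on this dataset gives the regression function (a worked check, not part of the original slides):

```python
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

b0, b1 = fit_simple_linear(x, y)
print(b0, b1)  # 2.2 0.6  ->  y = 2.2 + 0.6x
```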



Multiple Linear Regression
y = β0 + β1x1 + β2x2 + β3x3 + … + βnxn

There is a closed form for finding multiple linear regression weights, which requires matrix inversion, etc. There are also iterative techniques to find the weights. One is the delta rule:

Δwᵢ = c(t − net)·xᵢ

Δwᵢ: the change in weight wᵢ
c: the learning rate
xᵢ: the input for that weight
t: the target output (the actual label or value we want to predict)
net: the net input, i.e. the predicted output (this could be the output from a neural network or another model)

For regression we use an output node which is not thresholded (it just computes a linear sum) and iteratively apply the delta rule; for regression, net is the output. The delta rule updates the weights until the SSE is minimized, thus solving multiple linear regression. There are other regression approaches that give different results by trying to better handle outliers and other statistical anomalies.
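A minimal sketch of one delta-rule epoch for a linear (unthresholded) output node, following the update rule above (the function name and data layout are my own; no bias weight is used here):

```python
def delta_rule_epoch(weights, data, c):
    """One pass of the delta rule over the training set.

    weights: list of weights, one per input
    data:    list of (inputs, target) pairs
    c:       learning rate
    """
    for inputs, target in data:
        # Linear (unthresholded) output node: net is the weighted sum
        net = sum(w * x for w, x in zip(weights, inputs))
        error = target - net
        # Delta rule: w_i <- w_i + c * (t - net) * x_i
        weights = [w + c * error * x for w, x in zip(weights, inputs)]
    return weights
```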



Linear Regression - Problem
Δwᵢ = c(t − net)·xᵢ
Assume we start with all weights as 1 (we don't use a bias weight here, though you usually will – otherwise the line is forced through the origin)

Remember for regression we use an output node which is not thresholded (just does
a linear sum) and iteratively apply the delta rule – thus the net is the output

What are the new weights after one iteration through the following training set using the delta rule with a learning rate c = 1? How does it generalize for the novel input (−.3, 0)?

x1 x2 Target y
.5 -.2 1
1 1 0



Δwᵢ = c(t − net)·xᵢ

Initial Setup
Initial weights: w1 = 1.00, w2 = 1.00
Learning rate: c = 1

Step 1: input (0.5, −0.2), target t = 1
net = w1·x1 + w2·x2 = 0.5·1 + (−0.2)·1 = 0.3
error = t − net = 1 − 0.3 = 0.7
Δw1 = c(t − net)·x1 = 0.7·0.5 = 0.35  →  w1 = 1 + 0.35 = 1.35
Δw2 = c(t − net)·x2 = 0.7·(−0.2) = −0.14  →  w2 = 1 − 0.14 = 0.86

Step 2: input (1, 1), target t = 0
net = 1·1.35 + 1·0.86 = 2.21
error = 0 − 2.21 = −2.21
Δw1 = −2.21·1 = −2.21  →  w1 = 1.35 − 2.21 = −0.86
Δw2 = −2.21·1 = −2.21  →  w2 = 0.86 − 2.21 = −1.35


Final Weights
After processing all the inputs in the training set, the
final weights are:
w1=−0.86
w2=−1.35
Generalization to Novel Input (−0.3,0)
Y = w1⋅(−0.3)+w2⋅0 = −0.86⋅(−0.3)+(−1.35)⋅0 = 0.258
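The same numbers can be checked with the delta_rule_epoch sketch from earlier (a verification, subject to floating-point rounding):

```python
weights = delta_rule_epoch([1.0, 1.0],
                           [([0.5, -0.2], 1), ([1.0, 1.0], 0)],
                           c=1.0)
print(weights)  # ~[-0.86, -1.35]

# Generalization to the novel input (-0.3, 0)
print(sum(w * x for w, x in zip(weights, [-0.3, 0.0])))  # ~0.258
```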



Practice Numerical
Δwᵢ = c(t − net)·xᵢ
Assume we start with all weights as 0
What are the new weights after one iteration through
the following training set using the delta rule with a
learning rate c = .2
How does it generalize for the novel input (1, .5)?

x1 x2 Target
.3 .8 .7
-.3 1.6 -.1
.9 0 1.3
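The delta_rule_epoch sketch from earlier applies to this practice set as well (a harness only; the answers are left to the reader):

```python
weights = delta_rule_epoch([0.0, 0.0],
                           [([0.3, 0.8], 0.7),
                            ([-0.3, 1.6], -0.1),
                            ([0.9, 0.0], 1.3)],
                           c=0.2)
novel = sum(w * x for w, x in zip(weights, [1.0, 0.5]))
```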



Linear Regression - Summary
One advantage of linear regression models (and linear classification) is the
potential to look at the weights to give insight into which input variables are
most important in predicting the output
The variables with the largest weight magnitudes have the highest
correlation with the output
1. A large positive weight implies that the output will increase when this
input is increased (positively correlated)
2. A large negative weight implies that the output will decrease when this
input is increased (negatively correlated)
3. A small or 0 weight suggests that the input is uncorrelated with the
output (at least at the 1st order)
Linear regression/classification can be used to find best "indicators"
1. Be careful not to confuse correlation with causality
2. Linear models cannot detect higher order correlations! The power of
more complex machine learning models!!



Linear regression for Classification

[Figure: weight on the x-axis; y-axis from "Not Obese" to "Obese" (1.0), with a fitted line]

• If the model result > 0.5, predict Obese
• If the model result < 0.5, predict Not Obese



Linear Regression
• Using data to predict something falls under the category of "machine learning"
• Calculate R² and determine if weight and size are correlated. Large values imply a large effect
• Calculate a p-value to determine if the R² value is statistically significant
• Use the line to predict size for a given weight

[Figure: size on the y-axis vs. weight on the x-axis, with a fitted regression line]
R² compares a measure of a good fit, SS(fit), to a measure of a bad fit, SS(mean):

R² = (SS(mean) − SS(fit)) / SS(mean)

• Size = 0.7 × weight + 0.86


Logistic Regression
• Logistic regression is similar to linear regression, except that it predicts whether something is True or False, instead of predicting something continuous like size

y = 1 / (1 + e^(−x))
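A one-line Python sketch of this logistic (sigmoid) function (the function name is my own):

```python
import math

def logistic(x):
    """The 'S'-shaped logistic (sigmoid) function: 1 / (1 + e^(-x))."""
    return 1.0 / (1.0 + math.exp(-x))
```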



Logistic Regression
• Instead of fitting a line to the data, logistic regression fits an "S"-shaped logistic function
• The curve tells you the probability that a mouse is obese based
on its weight

[Figure: logistic curve from "Not Obese" (0) to "Obese" (1); the curve goes from 0 to 1]


Logistic Regression
• We use a continuous variable (like weight) to predict obesity
• Although logistic regression tells us the probability that a mouse is obese or not, it's usually used for classification
• For example, if the probability that a mouse is obese is > 50%, then we'll classify it as obese; otherwise we'll classify it as "not obese"

[Figure: curve of the probability that a mouse is obese vs. weight]


Logistic Regression
• Logistic regression's ability to provide probabilities and classify new samples using continuous and discrete measurements makes it a popular machine learning method
• One big difference between linear regression and logistic
regression is how the line is fit to the data
• With linear regression, we fit the line using “least squares”. In
other words, we find the line that minimizes the sum of the
squares of these residuals (SSR)
• We also use the residuals to calculate R2 and to compare
simple models to complicated models
• Logistic regression doesn’t have the same concept of a
“residual”, so it can’t use least squares and it can’t calculate R2.
• Instead it uses something called “maximum likelihood”.



We find the line that minimizes the sum
of the squares of these residuals



Logistic Regression
• Pick a probability, scaled by weight, of observing an obese mouse, and use that to calculate the likelihood of observing a non-obese mouse that weighs this much
• Then we calculate the likelihood of observing all of the remaining mice
• Lastly, we multiply all of those likelihoods together. That's the likelihood of the data given the "S"-shaped line
• Finally, the curve with the maximum likelihood is selected

[Figure: candidate "S"-shaped curves for obese vs. not obese mice as a function of weight]
Log(odds)
The odds in favor of my team winning the game are 5 to 3.

We can write this as log(odds) = log(5/3)

Probability of winning: p = 5/8

log(odds) = log(p / (1 − p)) = log((5/8) / (1 − 5/8)) = log(5/3)


The y-axis in logistic regression is transformed from the "probability of obesity" to the "log(odds of obesity)".

[Figure: probability of obesity vs. weight, from "Not Obese" (0) to "Obese" (1)]

Log(odds of obesity) = log(odds) = log(p / (1 − p))
[Figure: log(odds of obesity) vs. weight; the transformed y-axis is a number line with tick marks at the values below]

log(0.5 / (1 − 0.5)) = 0
log(0.731 / (1 − 0.731)) = 1
log(0.88 / (1 − 0.88)) = 2
log(0.95 / (1 − 0.95)) = 3
The coefficients for the line in logistic regression

The new y-axis transforms the squiggly line into a straight line:

log(odds of obesity) = −3.48 + 1.83 × weight


Can't use least-squares to find the best fitting line; instead use maximum likelihood.

• First, project the original data points onto the candidate line
• This gives each sample a candidate log(odds) value
• Then transform the candidate log(odds) to candidate probabilities using this formula:

p = e^log(odds) / (1 + e^log(odds))


log(p / (1 − p)) = log(odds)

An equation that takes probability as input and outputs log(odds).

p = e^log(odds) / (1 + e^log(odds))

An equation that takes log(odds) as input and outputs probability.
An equation that takes probability as input and outputs log(odds):

log(p / (1 − p)) = log(odds)

Solving for p gives an equation that takes log(odds) as input and outputs probability:

p / (1 − p) = e^log(odds)
p = (1 − p) · e^log(odds)
p = e^log(odds) − p · e^log(odds)
p + p · e^log(odds) = e^log(odds)
p · (1 + e^log(odds)) = e^log(odds)
p = e^log(odds) / (1 + e^log(odds))
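These two conversions in Python, as a sketch (the function names are my own; math.log is the natural log, which is what these slides use):

```python
import math

def prob_to_log_odds(p):
    """Takes a probability, outputs log(odds): log(p / (1 - p))."""
    return math.log(p / (1.0 - p))

def log_odds_to_prob(log_odds):
    """Takes log(odds), outputs a probability: e^z / (1 + e^z)."""
    return math.exp(log_odds) / (1.0 + math.exp(log_odds))
```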
• Now we use each mouse's observed status (obese or not obese) to calculate its likelihood given the shape of the squiggly line
• Calculate the likelihood of the obese mice, given the shape of the squiggle
• The likelihood that a given mouse is obese, given the shape of the squiggle, is the same as its predicted probability
• The likelihoods that these mice are obese are 0.4, 0.85, 0.9, 0.98, 0.99

The likelihood for all of the obese mice is just the product of the individual likelihoods:
= 0.4 × 0.85 × 0.9 × 0.98 × 0.99



The predicted probabilities that the last two mice are obese are both 0.01, so the probability (and likelihood) that they are not obese is (1 − 0.01).

The likelihoods for the mice that are not obese:
(1 − 0.6), (1 − 0.03), (1 − 0.01), (1 − 0.01)

Likelihood of the data given the squiggle
= 0.4 × 0.85 × 0.9 × 0.98 × 0.99 × (1 − 0.6) × (1 − 0.03) × (1 − 0.01) × (1 − 0.01)

Log(likelihood of the data given the squiggle)
= log(0.4) + log(0.85) + log(0.9) + log(0.98) + log(0.99) + log(1 − 0.6) + log(1 − 0.03) + log(1 − 0.01) + log(1 − 0.01)

Log(likelihood of the data given the squiggle) = −2.1813
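A quick check of this arithmetic (math.log is the natural log, which reproduces the −2.1813 above):

```python
import math

# Predicted probabilities of obesity for the obese mice...
obese = [0.4, 0.85, 0.9, 0.98, 0.99]
# ...and for the mice that are not obese
not_obese = [0.6, 0.03, 0.01, 0.01]

log_lik = (sum(math.log(p) for p in obese)
           + sum(math.log(1 - p) for p in not_obese))
print(round(log_lik, 4))  # -2.1813
```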


The algorithm that finds the line with
the maximum likelihood is pretty smart
– each time it rotates the line, it does
so in a way that increases the log-
likelihood. Thus the algorithm can find
the optimal fit after a few rotations



Numerical
Consider a data set of 6 individuals, where 3 people are diagnosed with type 2 diabetes (T2D) and 3 are non-diabetic controls. The given plot shows the probability of having T2D on the y-axis and blood glucose level on the x-axis. Now, we have transformed the probability of T2D into log(odds of T2D) and drawn the candidate best fitting line.
If the log(odds) value of the candidate data point is −1.9, what will be the candidate probability of sample 'd' being non-diabetic? Also calculate the log-likelihood of the overall probability of T2D.
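As a sketch, the first part of this exercise is just the log(odds)-to-probability conversion from earlier (reusing the log_odds_to_prob sketch; sample 'd' is assumed to be the point with log(odds) = −1.9):

```python
p_t2d = log_odds_to_prob(-1.9)   # probability of being diabetic, ~0.13
p_non_diabetic = 1.0 - p_t2d     # probability of being non-diabetic, ~0.87
```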



Numerical
Consider a data set with 6 mice, where 3 are obese and 3 are not obese. We have calculated the log(odds) of obesity for each candidate data point by fitting line X.
The log(odds) of each candidate data point for line X is as follows:
log(odds) of a = +0.3, log(odds) of b = +1.2, log(odds) of c = +2, log(odds) of d = −1.8, log(odds) of e = −1.2, log(odds) of f = −0.1
Next, we rotate the line (Y) and calculate the log(odds of obesity) for all the candidate data points.
The log(odds) of each candidate data point for line Y is as follows:
log(odds) of a = +0.2, log(odds) of b = +0.5, log(odds) of c = +0.8, log(odds) of d = −0.9, log(odds) of e = −0.5, log(odds) of f = −0.2

(a) Calculate the log(likelihood) of all the given data points when fitting line X
(b) Calculate the log(likelihood) of all the given data points when fitting line Y
(c) Which line can be considered the best fitting line for the above scenario and why?
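A sketch of how (a) and (b) could be computed, reusing the log_odds_to_prob sketch from earlier and assuming a, b, c are the obese mice and d, e, f the not-obese mice (the slide does not state this explicitly):

```python
import math

def log_likelihood(log_odds_obese, log_odds_not_obese):
    """Sum of log-likelihoods: p for obese mice, (1 - p) for the rest."""
    ll = sum(math.log(log_odds_to_prob(z)) for z in log_odds_obese)
    ll += sum(math.log(1 - log_odds_to_prob(z)) for z in log_odds_not_obese)
    return ll

ll_x = log_likelihood([0.3, 1.2, 2.0], [-1.8, -1.2, -0.1])
ll_y = log_likelihood([0.2, 0.5, 0.8], [-0.9, -0.5, -0.2])
# The line with the larger (less negative) log-likelihood fits better.
```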



R² compares a measure of a good fit, SS(fit), to a measure of a bad fit, SS(mean):

R² = (SS(mean) − SS(fit)) / SS(mean)


Like linear regression, we need to find a measure of a good fit to compare with a measure of a bad fit.

Unfortunately, the residuals for logistic regression are all infinite, so we can't use them.


Project the data onto the best fitting line and then translate the log(odds) to probabilities:

p = e^log(odds) / (1 + e^log(odds))


Lastly, calculate the log-likelihood of the data given the best fitting squiggle:

Log(likelihood of the data given the squiggle)
= log(0.4) + log(0.85) + log(0.9) + log(0.98) + log(0.99) + log(1 − 0.6) + log(1 − 0.03) + log(1 − 0.01) + log(1 − 0.01) = −2.1813

We can call this LL(fit), the log-likelihood of the fitted line, and use it as a substitute for SS(fit):
LL(fit) = −2.1813
We need a measure of a poorly fitted line that is analogous to SS(mean):

R² = (SS(mean) − SS(fit)) / SS(mean)

R² = (??? − LL(fit)) / ???


Log(odds of obesity)
= log(number of obese mice / number of mice that are not obese)
= log(5/4) = 0.22


Translate the log(odds) back to a probability:

p = e^log(odds) / (1 + e^log(odds)) = e^0.22 / (1 + e^0.22) = 0.55


Calculate the log-likelihood of the data given the overall probability of obesity:

Log(likelihood of the data given the overall probability of obesity)
= log(0.55) + log(0.55) + log(0.55) + log(0.55) + log(0.55) + log(1 − 0.55) + log(1 − 0.55) + log(1 − 0.55) + log(1 − 0.55)
= −6.18

LL(overall probability) = −6.18
LL(overall probability): a measure of a bad fit. LL(fit): hopefully a measure of a good fit.

R² = (LL(overall probability) − LL(fit)) / LL(overall probability)

R² = (−6.18 − (−2.1813)) / −6.18 = 0.647
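A quick sketch reproducing this pseudo-R² from the two log-likelihoods (using the rounded values from the slides):

```python
ll_fit = -2.1813
ll_overall = -6.18

r2 = (ll_overall - ll_fit) / ll_overall
print(round(r2, 4))  # 0.647 with these rounded log-likelihoods
```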
Numerical
Calculate the log-likelihood of the data given the best fitting squiggle for malignant tumours. Then calculate R².

Malignant Non-Malignant
0.45 0.001
0.9 0.002
0.91 0.005
0.95 0.2
0.99 0.34
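A sketch of how this could be computed, assuming the table entries are the squiggle's predicted probabilities of malignancy for the malignant and non-malignant samples, and that the overall probability uses 5 malignant out of 10 samples (my own reading of the table):

```python
import math

malignant = [0.45, 0.9, 0.91, 0.95, 0.99]         # predicted p, malignant samples
non_malignant = [0.001, 0.002, 0.005, 0.2, 0.34]  # predicted p, non-malignant samples

ll_fit = (sum(math.log(p) for p in malignant)
          + sum(math.log(1 - p) for p in non_malignant))

# Overall probability of malignancy: 5 of 10 samples
p_overall = 0.5
ll_overall = (sum(math.log(p_overall) for _ in malignant)
              + sum(math.log(1 - p_overall) for _ in non_malignant))

r2 = (ll_overall - ll_fit) / ll_overall
```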

