
Supervised Classification:

Logistic Regression
NLP’s practical applications

● Machine translation
● Automatic speech recognition
  ○ Personalized assistants
  ○ Auto customer service
● Information Retrieval
  ○ Web Search
  ○ Question Answering
● Sentiment Analysis
● Computational Social Science
● Growing day by day

How?
● Machine learning:
  ○ Logistic regression
  ○ Probabilistic modeling
  ○ Recurrent Neural Networks
  ○ Transformers
● Algorithms, e.g.:
  ○ Graph analytics
  ○ Dynamic programming
● Data science
  ○ Hypothesis testing
Topics we will cover
● Supervised Classification
● Goal of logistic regression
● The “loss function” -- what logistic regression tries to optimize
● Adding Multiple Features
● Training and Test Sets
● Overfitting; Role of Regularization
Supervised Classification
X: features of N observations (e.g., words)
Y: class of each of N observations

GOAL: Produce a model that outputs the most likely class yi, given features xi.
The model is some function or rules from X to Y, matching the data as closely as possible.

i   x     y
0   0.00  0
1   0.50  0
2   1.00  1
3   0.25  0
4   0.75  1
Logistic Regression
Binary classification goal: Build a “model” that can estimate P(Y=1 | X=?),

i.e. given X, yield (or “predict”) the probability that Y=1.

In machine learning, the tradition is to use Y for the variable being predicted and X for the features used to make the prediction.

Example: Y: 1 if the target is a verb, 0 otherwise;
X: 1 if “was” occurs before the target, 0 otherwise.
Logistic Regression
Example: Y: 1 if the target is part of a proper noun, 0 otherwise;
X: number of capital letters in the target and surrounding words.

x   y
2   1
1   0
0   0
6   1
2   1
1   1

Adding the last observation (x=1, y=1) changes the optimal b_0, b_1.
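A minimal sketch (assuming scikit-learn is available; not from the original slides) of fitting b_0 and b_1 to this toy table and reading off the learned model:

# Minimal sketch, assuming scikit-learn: fit logistic regression to the toy
# "number of capital letters" data above and inspect the learned b_0, b_1.
import numpy as np
from sklearn.linear_model import LogisticRegression

X = np.array([[2], [1], [0], [6], [2], [1]])  # x: number of capital letters
y = np.array([1, 0, 0, 1, 1, 1])              # y: 1 = part of a proper noun

# Note: scikit-learn applies L2 regularization by default (C=1.0).
model = LogisticRegression().fit(X, y)

print("b_0 (intercept):", model.intercept_[0])
print("b_1 (slope):", model.coef_[0, 0])
print("P(Y=1 | x=3):", model.predict_proba([[3]])[0, 1])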
Logistic Regression on a single feature (x)

Yi ∊ {0, 1}; X is a single value and can be anything numeric.

HOW? Essentially, try different values of b_0 and b_1 until the learned curve is the “best fit” to the training data.

“best fit”: whatever maximizes the likelihood function:
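Written out (a sketch of the standard form, where p_i is the model’s predicted probability that y_i = 1):

L(b_0, b_1) = \prod_{i=1}^{N} p_i^{y_i} (1 - p_i)^{1 - y_i},
where p_i = \sigma(b_0 + b_1 x_i) and \sigma(z) = \frac{1}{1 + e^{-z}}.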
X can be multiple features
Often we want to make a classification based on multiple features:

● Number of capital letters surrounding: integer
● Begins with capital letter: {0, 1}
● Preceded by “the”? {0, 1}

(In the earlier plots the y-axis was Y, i.e. 1 or 0. To make room for multiple Xs, we drop the y-axis and instead show the decision point.)

We’re learning a linear (i.e. flat) separating hyperplane, but fitting it to a logit outcome.

(https://www.linkedin.com/pulse/predicting-outcomes-probabilities-logistic-regression-konstantinidis/)
Logistic Regression
Yi ∊ {0, 1}; X can be anything numeric.

The decision boundary is where b_0 + b_1x_1 + … + b_kx_k = 0.

We’re still learning a linear separating hyperplane, but fitting it to a logit outcome.

(https://www.linkedin.com/pulse/predicting-outcomes-probabilities-logistic-regression-konstantinidis/)
Logistic Regression
Example: Y: 1 if the target is part of a proper noun, 0 otherwise;
X1: number of capital letters in the target and surrounding words.
Let’s add a feature! X2: does the target word start with a capital letter?

x2   x1   y
1    2    1
0    1    0
0    0    0
1    6    1
1    2    1
1    1    1
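A minimal sketch (again assuming scikit-learn; not from the slides) of fitting both features; the model now learns one weight per feature:

# Minimal sketch, assuming scikit-learn: logistic regression on two features.
import numpy as np
from sklearn.linear_model import LogisticRegression

# Feature columns: x1 = number of capital letters, x2 = starts with a capital
X = np.array([[2, 1], [1, 0], [0, 0], [6, 1], [2, 1], [1, 1]])
y = np.array([1, 0, 0, 1, 1, 1])

model = LogisticRegression().fit(X, y)
print("intercept b_0:", model.intercept_[0])
print("weights b_1, b_2:", model.coef_[0])                 # one per feature
print("P(Y=1 | x1=3, x2=1):", model.predict_proba([[3, 1]])[0, 1])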
Machine Learning: How to setup data

“Corpus” (raw data: sequences of characters) → Feature Extraction → Data → training → Model

Feature extraction turns the corpus into rows of observations (e.g.: words, sentences, documents, users), each with its features and class:

i   x1    x2   y
0   0.00  0    0
1   0.50  1    0
2   1.00  1    1
3   0.25  0    0
4   0.75  0    1
…   …     …    …
N   0.35  1    0

A row of features might include, e.g.:
➔ number of capital letters
➔ whether “I” was mentioned or not
➔ k features indicating whether k words were mentioned or not

Multi-hot Encoding
● Each word gets an index in the vector
● 1 if present; 0 if not

Feature example: is each word present in the document?
[0, 1, 1, 0, 1, …, 1, 0, 1, 1, 0, 1, …, 1]_k

A related single feature: is the previous word “the”? {0, 1}

One-hot Encoding
● Each word gets an index in the vector
● All indices 0 except the present word

Feature example: which is the previous word?
previous word “was”:          [0, 1, 0, 0, 0, …, 0, 0, 0, 0, 0, 0, …, 0]_k
previous word “interesting”:  [0, 0, 1, 0, 0, …, 0, 0, 0, 0, 0, 0, …, 0]_k

Multiple One-hot encodings for one observation
(1) word before; (2) word after; (3) percent capitals (e.g. for “Interesting”, 0.09)

[0, 0, 0, 0, 1, 0, …, 0]_k   [0, …, 0, 1, 0, …, 0]_k
= [0, 0, 0, 0, 1, 0, …, 0, 0, …, 0, 1, 0, …, 0]_2k
with the percent-capitals feature appended:
[0, 0, 0, 0, 1, 0, …, 0, 0, …, 0, 1, 0, …, 0, 0.09]_2k+1
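A minimal sketch (plain Python, with a hypothetical toy vocabulary; not from the slides) of building these encodings:

# Minimal sketch (hypothetical toy vocabulary) of one-hot / multi-hot features.
vocab = ["the", "was", "interesting", "book", "read"]   # k = 5
index = {w: i for i, w in enumerate(vocab)}
k = len(vocab)

def one_hot(word):
    # All indices 0 except the one for `word` (if it is in the vocabulary).
    vec = [0] * k
    if word.lower() in index:
        vec[index[word.lower()]] = 1
    return vec

def multi_hot(words):
    # 1 at the index of every word that appears; 0 otherwise.
    vec = [0] * k
    for w in words:
        if w.lower() in index:
            vec[index[w.lower()]] = 1
    return vec

tokens = ["The", "book", "was", "interesting"]
t = 1                                         # target token: "book"
percent_capitals = sum(c.isupper() for c in tokens[t]) / len(tokens[t])

# One observation: one-hot(word before) + one-hot(word after) + percent capitals,
# giving a vector of length 2k + 1.
row = one_hot(tokens[t - 1]) + one_hot(tokens[t + 1]) + [percent_capitals]
print(row)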
Machine Learning Goal: Generalize to new data

80% Training Data → used to train the Model
20% Testing Data → used to check: does the model hold up?
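A minimal sketch (assuming scikit-learn; not from the slides) of an 80/20 split; evaluating only on held-out data is what tests generalization:

# Minimal sketch, assuming scikit-learn: hold out 20% of the data for testing.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))                      # toy features
y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(int)      # toy labels

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

model = LogisticRegression().fit(X_train, y_train)
print("train accuracy:", model.score(X_train, y_train))
print("test accuracy:", model.score(X_test, y_test))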


Logistic Regression - Regularization

x1    x2   x3    x4  x5   x6    Y
0.5   0    0.6   1   0    0.25  1
0     0.5  0.3   0   0    0     1
0     0    1     1   1    0.5   0
0     0    0     0   1    1     0
0.25  1    1.25  1   0.1  2     1

1.2 + -63*x1 + 179*x2 + 71*x3 + 18*x4 + -59*x5 + 19*x6 = logit(Y)

With six features and only five observations, these extreme coefficients fit the training data perfectly: “overfitting”.
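A minimal sketch (assuming scikit-learn; not from the slides) illustrating the effect: with 6 features and only 5 rows, a nearly unregularized fit (huge C) produces extreme weights, while the default L2 penalty keeps them small.

# Minimal sketch, assuming scikit-learn: compare a nearly unregularized fit
# (large C) with the default L2-regularized fit on the tiny table above.
import numpy as np
from sklearn.linear_model import LogisticRegression

X = np.array([[0.5, 0, 0.6, 1, 0, 0.25],
              [0, 0.5, 0.3, 0, 0, 0],
              [0, 0, 1, 1, 1, 0.5],
              [0, 0, 0, 0, 1, 1],
              [0.25, 1, 1.25, 1, 0.1, 2]])
y = np.array([1, 1, 0, 0, 1])

for C in [1e6, 1.0]:                          # large C ≈ no regularization
    model = LogisticRegression(C=C, max_iter=10000).fit(X, y)
    print(f"C={C}: intercept={model.intercept_[0]:.2f}, coefs={np.round(model.coef_[0], 2)}")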


Python Example
Overfitting (1-d non-linear example)

Underfit vs. Overfit

(image credit: Scikit-learn; in practice data are rarely this clear)
Logistic Regression - Regularization

What if only 2 predictors?

x1    x2   Y
0.5   0    1
0     0.5  1
0     0    0
0     0    0
0.25  1    1

A: better fit
0 + 2*x1 + 2*x2 = logit(Y)
Logistic Regression - Regularization

L1 Regularization - “The Lasso”
Zeros out features by adding a penalty that keeps the model from perfectly fitting the training data.
Set the betas that maximize the penalized likelihood L. Sometimes written as:

L2 Regularization - “Ridge”
Shrinks features by adding a penalty that keeps the model from perfectly fitting the training data.
Set the betas that maximize the penalized likelihood L.
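In one common notation (a sketch of the usual penalized objectives, with \lambda controlling the penalty strength and \log L the log-likelihood):

\hat{\beta}_{L1} = \arg\max_{\beta} \; \log L(\beta) - \lambda \sum_{j} |\beta_j|

\hat{\beta}_{L2} = \arg\max_{\beta} \; \log L(\beta) - \lambda \sum_{j} \beta_j^2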


Machine Learning Goal: Generalize to new data

80% Training Data → train the Model
10% Development Data → set the penalty
10% Testing Data → does the model hold up?
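A minimal sketch (assuming scikit-learn, where C is the inverse of the penalty strength; not from the slides) of using the development split to pick the L1 penalty:

# Minimal sketch, assuming scikit-learn: choose the L1 penalty strength on a
# development set, then evaluate once on the held-out test set.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 6))                    # toy features
y = (X[:, 0] - X[:, 1] > 0).astype(int)          # toy labels

# 80% train, 10% development, 10% test
X_train, X_rest, y_train, y_rest = train_test_split(X, y, test_size=0.2, random_state=0)
X_dev, X_test, y_dev, y_test = train_test_split(X_rest, y_rest, test_size=0.5, random_state=0)

best_C, best_acc = None, -1.0
for C in [0.01, 0.1, 1.0, 10.0]:                 # smaller C = stronger penalty
    model = LogisticRegression(penalty="l1", solver="liblinear", C=C)
    model.fit(X_train, y_train)
    acc = model.score(X_dev, y_dev)
    if acc > best_acc:
        best_C, best_acc = C, acc

final = LogisticRegression(penalty="l1", solver="liblinear", C=best_C)
final.fit(X_train, y_train)
print("chosen C:", best_C, "| test accuracy:", final.score(X_test, y_test))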


Logistic Regression - Review
● Classification: P(Y | X)
● Learn logistic curve based on example data
○ training + development + testing data
● Set betas based on maximizing the likelihood
○ “shifts” and “twists” the logistic curve
● Multivariate features: One-hot encodings
● Separation represented by hyperplane
● Overfitting
● Regularization
Example
See notebook on website.
Extra Material
One approach to finding the parameters which maximize the likelihood function...

Logistic Regression on a single feature (x)

Yi ∊ {0, 1}; X can be anything numeric.

“best fit”: whatever maximizes the likelihood function. Essentially, try different values until they best fit the training data. To estimate the betas, one can use reweighted least squares (Wasserman, 2005; Li, 2010).

This is just one way of finding the betas that maximize the likelihood function. In practice, we will use existing libraries that are fast and support additional useful steps like regularization.
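For intuition, a minimal sketch in plain Python (simple gradient ascent on the log-likelihood, not the reweighted least squares update cited above) on the toy single-feature data:

# Minimal sketch: gradient ascent on the log-likelihood for a single feature.
# (For intuition only; not the reweighted least squares method cited above.)
import math

xs = [2, 1, 0, 6, 2, 1]        # number of capital letters
ys = [1, 0, 0, 1, 1, 1]        # 1 = part of a proper noun

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

b0, b1, lr = 0.0, 0.0, 0.05
for _ in range(20000):
    # Gradient of sum_i [ y_i*log(p_i) + (1 - y_i)*log(1 - p_i) ]
    g0 = sum(y - sigmoid(b0 + b1 * x) for x, y in zip(xs, ys))
    g1 = sum((y - sigmoid(b0 + b1 * x)) * x for x, y in zip(xs, ys))
    b0 += lr * g0
    b1 += lr * g1

print("b_0:", b0, "b_1:", b1)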
