This document presents a method for training logistic regression models to optimize the F-measure performance metric instead of accuracy. It formulates F-measure as a rational function of model utilities to approximate its optimization. Experimental results on a text summarization task show this F-measure training can outperform maximum likelihood training.


Maximum Expected F-Measure Training of Logistic Regression Models


Martin Jansche, HLT 2005

presented by Philip Zigoris


Motivation

• Learning algorithms generally optimize 0-1 accuracy.
• Often this is not the performance measure we are concerned with.
• This tends to be the case with datasets heavily skewed towards one class, or when the cost of an error differs between classes.
Outline


Review: Logistic Regression

Review: F_alpha Performance Measure

Optimizing F_alpha
– Formulation and Algorithm
– Comparison to ML
– Experimental Results

Conclusion
Review: Logistic Regression

Sample: $(x_i, y_i) \in \mathbb{R}^k \times \{\pm 1\}$

Model: $\Pr(+1 \mid x, \theta) = \frac{1}{1 + e^{-x \cdot \theta}} = g(x \cdot \theta)$

Classifier: $y_{\mathrm{MAP}}(x) = \arg\max_y \Pr(y \mid x, \theta)$

Objective: $\theta^* = \arg\max_{\theta} \prod_i g\bigl(y_i (x_i \cdot \theta)\bigr)$
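As a reading aid, here is a minimal NumPy sketch of these definitions (the helper names g, prob_pos, y_map, and log_likelihood are mine, not from the slides):

```python
import numpy as np

def g(z):
    """Logistic sigmoid g(z) = 1 / (1 + exp(-z))."""
    return 1.0 / (1.0 + np.exp(-z))

def prob_pos(X, theta):
    """Pr(+1 | x, theta) = g(x . theta) for each row x of X.
    Assumes X already contains a constant column if a bias term is wanted."""
    return g(X @ theta)

def y_map(X, theta):
    """MAP classifier: predict +1 when Pr(+1 | x, theta) >= 1/2, else -1."""
    return np.where(prob_pos(X, theta) >= 0.5, 1, -1)

def log_likelihood(theta, X, y):
    """Log of the ML objective: sum_i log g(y_i * (x_i . theta)), with y_i in {+1, -1}."""
    return np.sum(np.log(g(y * (X @ theta))))
```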
Review: F-measure

Counts on a sample (rows: true label, columns: predicted label):

             Predicted +1   Predicted -1
  True +1    A              B
  True -1    C              D

A: true positives, B: misses, C: false alarms, D: true negatives

Precision: $P = A/(A+C)$    Recall: $R = A/(A+B)$

$F_\alpha(R, P) = \left(\frac{\alpha}{R} + \frac{1-\alpha}{P}\right)^{-1} = \frac{A}{A + \alpha B + (1-\alpha) C} = F_\alpha(A, B, C)$
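The two forms of F_α above are equivalent; a small sketch (my own helper names) that checks this for a concrete count vector:

```python
def f_alpha_from_pr(recall, precision, alpha=0.5):
    """F_alpha as an inverse-weighted combination of recall and precision."""
    return 1.0 / (alpha / recall + (1.0 - alpha) / precision)

def f_alpha_from_counts(A, B, C, alpha=0.5):
    """Equivalent form in terms of true positives A, misses B, false alarms C."""
    return A / (A + alpha * B + (1.0 - alpha) * C)

# Example: A=3, B=0, C=1 gives recall 3/3 = 1 and precision 3/4;
# both forms yield F_0.5 = 6/7.
assert abs(f_alpha_from_counts(3, 0, 1) - f_alpha_from_pr(1.0, 0.75)) < 1e-12
```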
Section 4: Relation to Expected Utility

Express F as a rational function of a vector-valued utility:

$U_S = \frac{1}{n} \begin{bmatrix} \sum_i I[y_{\mathrm{MAP}}(x_i) = +1]\, I[y_i = +1] \\ \sum_i I[y_{\mathrm{MAP}}(x_i) = -1]\, I[y_i = +1] \\ \sum_i I[y_{\mathrm{MAP}}(x_i) = +1]\, I[y_i = -1] \end{bmatrix} = \frac{1}{n} \begin{bmatrix} A \\ B \\ C \end{bmatrix}$
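A short sketch of this construction (the function name is mine): the empirical utility vector simply stacks the normalized counts obtained from hard MAP decisions on the sample.

```python
import numpy as np

def utility_vector(y_pred, y_true):
    """U_S = (1/n) * [A, B, C] for labels and predictions in {+1, -1}."""
    y_pred, y_true = np.asarray(y_pred), np.asarray(y_true)
    n = len(y_true)
    A = np.sum((y_pred == 1) & (y_true == 1))    # true positives
    B = np.sum((y_pred == -1) & (y_true == 1))   # misses
    C = np.sum((y_pred == 1) & (y_true == -1))   # false alarms
    return np.array([A, B, C]) / n
```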
(Approximately) Optimizing F

Similar to logistic regression, replace the hard decision with the model probability:

$I[y_{\mathrm{MAP}}(x_i) = +1] \approx \Pr(+1 \mid x_i, \theta) = g(x_i \cdot \theta)$

We can then approximate A, B, C (with $n_{\mathrm{pos}}$ the number of positive training examples):

$\tilde{A}(\theta) = \sum_{i:\, y_i = +1} g(x_i \cdot \theta)$

$\tilde{B}(\theta) = n_{\mathrm{pos}} - \tilde{A}(\theta)$

$\tilde{C}(\theta) = \tilde{m}_{\mathrm{pos}}(\theta) - \tilde{A}(\theta)$, where $\tilde{m}_{\mathrm{pos}}(\theta) = \sum_i g(x_i \cdot \theta)$

$\tilde{F}_\alpha(\theta) = \frac{\tilde{A}(\theta)}{\alpha\, n_{\mathrm{pos}} + (1-\alpha)\, \tilde{m}_{\mathrm{pos}}(\theta)}$
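A sketch of the resulting training procedure, assuming NumPy and SciPy are available (the function names and the choice of a generic quasi-Newton optimizer are mine; the slides only state that the smoothed objective can be optimized with standard techniques):

```python
import numpy as np
from scipy.optimize import minimize

def soft_f_alpha(theta, X, y, alpha=0.5):
    """Smoothed F_alpha: the hard decision I[y_MAP(x_i) = +1] is replaced by g(x_i . theta)."""
    y = np.asarray(y)
    p = 1.0 / (1.0 + np.exp(-(X @ theta)))   # g(x_i . theta) for every example
    A_soft = np.sum(p[y == 1])               # ~A: expected true positives
    m_pos = np.sum(p)                        # ~m_pos: expected number of predicted positives
    n_pos = np.sum(y == 1)                   # n_pos: actual number of positive examples
    return A_soft / (alpha * n_pos + (1.0 - alpha) * m_pos)

def train_max_f(X, y, alpha=0.5, theta0=None):
    """Maximize the smoothed F_alpha by minimizing its negation with BFGS."""
    if theta0 is None:
        theta0 = np.zeros(X.shape[1])
    result = minimize(lambda th: -soft_f_alpha(th, X, y, alpha), theta0, method="BFGS")
    return result.x
```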
Comparison to maximum likelihood: Toy dataset
x    y
0   +1
1   -1
2   +1
3   +1
Comparison to maximum likelihood: Toy dataset
Maximum likelihood gives the all-+1 classifier (0.35, 0.57):
• Recall is 1
• Precision is 3/4
• F.5 = 6/7 ≈ 0.86

Classifier trained with the F.5 approximation (20, 15):
• Gives the all-+1 classifier (results the same as ML)
• F.25 = 4/5 ≈ 0.8

Classifier trained with the F.25 approximation (-20, 15) labels the first two examples negative:
• F.5 = 4/5 ≈ 0.8
• F.25 = 8/9 ≈ 0.89
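To make the comparison concrete, the F_α values quoted above can be reproduced directly from the counts (a small self-contained check; the helper name is mine):

```python
def f_alpha(A, B, C, alpha):
    """Exact F_alpha from true positives A, misses B, false alarms C."""
    return A / (A + alpha * B + (1 - alpha) * C)

# Toy labels: x = 0, 1, 2, 3 with y = +1, -1, +1, +1.

# All-positive classifier (found by ML and by the F.5 approximation): A=3, B=0, C=1.
print(f_alpha(3, 0, 1, 0.50))   # 6/7 ~ 0.857
print(f_alpha(3, 0, 1, 0.25))   # 4/5 = 0.8

# Classifier found by the F.25 approximation (x=0 and x=1 labelled negative): A=2, B=1, C=0.
print(f_alpha(2, 1, 0, 0.50))   # 4/5 = 0.8
print(f_alpha(2, 1, 0, 0.25))   # 8/9 ~ 0.889
```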
Experiments: Text Summarization

Task: Classify sentences (and sentence-like units) as belonging to the summary


Data:
•3535 train, 408 test instances
•29 features (1 binary, 28 real/integer valued)
•All features present

Results: F-measure training can outperform maximum likelihood training on this task.

Data source: Sameer Maskey and Julia Hirschberg. Comparing lexical, acoustic/prosodic, structural and discourse features for speech summarization. In Proceedings of Interspeech 2005.
Conclusions
Main idea: Approximate the MAP classification decision with the probability itself. This gives a continuous objective over the parameters that can be optimized with standard techniques.
Main criticism:
Experiments are inconclusive.
