Log-likelihood
by Marco Taboga, PhD
The log-likelihood is, as the term suggests, the natural logarithm of the likelihood.
Definition
The following elements are needed to rigorously define the log-likelihood function:

- an observed sample $x$, regarded as the realization of a random vector $X$ whose distribution is unknown;

- a parameter $\theta$, belonging to a parameter space $\Theta$, that indexes the parametric family of distributions to which the distribution of $X$ is assumed to belong;

- the joint probability mass function $p(x;\theta)$ of $X$, if $X$ is discrete, or its joint probability density function $f(x;\theta)$, if $X$ is continuous.

When the joint probability mass (or density) function is considered as a function of $\theta$ for fixed $x$ (i.e., for the sample we have observed), it is called likelihood (or likelihood function) and it is denoted by $L(\theta;x)$. So,
$$L(\theta;x) = p(x;\theta)$$
if $X$ is discrete and
$$L(\theta;x) = f(x;\theta)$$
if $X$ is continuous.
Given all these elements, the log-likelihood function is the function $\ell$ defined by
$$\ell(\theta;x) = \ln L(\theta;x).$$
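As a quick illustration of the definition (not part of the original text), here is a minimal Python sketch that evaluates the log-likelihood of a sample of Bernoulli draws; the function name, the sample values, and the parameter value are all hypothetical choices made for the example.

import numpy as np

# The joint pmf of n independent Bernoulli(theta) draws is
# p(x; theta) = prod_i theta^x_i * (1 - theta)^(1 - x_i),
# so the log-likelihood is the sum of the log-pmf of each observation.
def bernoulli_log_likelihood(theta, x):
    """ell(theta; x) for i.i.d. Bernoulli draws."""
    x = np.asarray(x)
    return np.sum(x * np.log(theta) + (1 - x) * np.log(1 - theta))

x = np.array([1, 0, 1, 1, 0])            # the observed sample (held fixed)
print(bernoulli_log_likelihood(0.6, x))  # viewed as a function of theta

Note how the sample $x$ is held fixed while $\theta$ varies, exactly as in the definition above.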
Example
Suppose that we observe a vector
$$x = (x_1, \ldots, x_n)$$
whose entries are independent draws from a normal distribution. The probability density function of a generic draw $x_i$ is
$$f(x_i; \mu, \sigma^2) = \frac{1}{\sqrt{2\pi\sigma^2}} \exp\left(-\frac{(x_i - \mu)^2}{2\sigma^2}\right)$$
where $\mu$ and $\sigma^2$ are the parameters (mean and variance) of the normal distribution. With the notation used in the previous section, the parameter vector is
$$\theta = (\mu, \sigma^2).$$
The parametric family being considered is the set of all normal distributions (that can be obtained by varying the parameters $\mu$ and $\sigma^2$).
In order to stress the fact that the probability density depends on the two parameters, we write
$$f(x_i; \mu, \sigma^2).$$
The likelihood of the sample is
$$L(\mu, \sigma^2; x) = \prod_{i=1}^{n} f(x_i; \mu, \sigma^2)$$
because the joint density of a set of independent variables is equal to the product of their marginal densities (see the lecture on Independent random variables). Taking the logarithm of this product gives the log-likelihood
$$\ell(\mu, \sigma^2; x) = -\frac{n}{2}\ln(2\pi) - \frac{n}{2}\ln(\sigma^2) - \frac{1}{2\sigma^2}\sum_{i=1}^{n}(x_i - \mu)^2.$$
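To check the derivation numerically, here is a short Python sketch (an illustration added here, with made-up sample and parameter values) that compares the closed form above with the sum of per-observation log-densities computed by SciPy.

import numpy as np
from scipy.stats import norm

def normal_log_likelihood(mu, sigma2, x):
    """ell(mu, sigma^2; x) for i.i.d. normal draws, using the closed form."""
    x = np.asarray(x)
    n = x.size
    return (-n / 2 * np.log(2 * np.pi)
            - n / 2 * np.log(sigma2)
            - np.sum((x - mu) ** 2) / (2 * sigma2))

x = np.array([1.2, -0.3, 0.8, 2.1, 0.5])  # an illustrative sample
mu, sigma2 = 0.5, 1.5

# The closed form agrees with summing the log-density of each draw.
assert np.isclose(normal_log_likelihood(mu, sigma2, x),
                  norm.logpdf(x, loc=mu, scale=np.sqrt(sigma2)).sum())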
The log-likelihood is used in maximum likelihood estimation, in which the parameter $\theta$ is estimated by solving the maximization problem
$$\hat{\theta} = \operatorname*{argmax}_{\theta \in \Theta} \; \ell(\theta; x),$$
that is, by finding the parameter that maximizes the log-likelihood of the observed sample $x$. This is the same as maximizing the likelihood function because the natural logarithm is a strictly increasing function.
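For the normal example above, this maximization can be carried out numerically. The following Python sketch is one possible way to do it (the reparametrization of $\sigma^2$ through its logarithm is a choice made here to keep the variance positive, not something prescribed by the text); it recovers the well-known closed-form maximum likelihood estimates, i.e., the sample mean and the (biased) sample variance.

import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(0)
x = rng.normal(loc=2.0, scale=1.5, size=500)  # simulated sample

def negative_log_likelihood(params):
    # Minimizing -ell(theta; x) is equivalent to maximizing ell(theta; x).
    mu, log_sigma2 = params
    sigma2 = np.exp(log_sigma2)
    n = x.size
    return -(-n / 2 * np.log(2 * np.pi)
             - n / 2 * np.log(sigma2)
             - np.sum((x - mu) ** 2) / (2 * sigma2))

result = minimize(negative_log_likelihood, x0=np.array([0.0, 0.0]))
mu_hat, sigma2_hat = result.x[0], np.exp(result.x[1])

print(mu_hat, np.mean(x))      # both close to the sample mean
print(sigma2_hat, np.var(x))   # both close to the (biased) sample variance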
One may wonder why the log of the likelihood function is taken. There are several good reasons. To understand them, suppose that the sample is made up of independent observations (as in the example above). Then, the logarithm transforms a product of densities into a sum. This is very convenient because:

- the asymptotic properties of sums are easier to analyze (one can apply Laws of Large Numbers and Central Limit Theorems to these sums; see the proofs of consistency and asymptotic normality of the maximum likelihood estimator);

- products are not numerically stable: they tend to converge quickly to zero or to infinity, depending on whether the densities of the single observations are on average less than or greater than 1; sums are instead more stable from a numerical standpoint (as the sketch after this list illustrates); this is important because the maximum likelihood problem is often solved numerically on computers, where limited machine precision does not allow one to distinguish a very small number from zero and a very large number from infinity.
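The following Python sketch (an illustration added here, with a simulated sample) makes the second point concrete: with a few thousand observations, the product of densities underflows to zero in double precision, while the sum of log-densities remains perfectly representable.

import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)
x = rng.normal(size=10_000)

densities = norm.pdf(x)            # each density is typically well below 1
print(np.prod(densities))          # 0.0 -- the product has underflowed
print(np.sum(norm.logpdf(x)))      # a finite, well-behaved log-likelihood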
More examples
More examples of how to derive log-likelihood functions can be found in the lectures on maximum likelihood estimation of specific distributions.
More details
The log-likelihood and its properties are discussed in a more detailed manner in the
lecture on maximum likelihood estimation.