AIC and BIC
AIC = 2k − 2 ln(L̂)
Where:
k is the number of estimated parameters, and L̂ is the maximized value of the likelihood function, which measures how well the model explains the data. The log-likelihood is the natural logarithm of this likelihood.
ln(L̂) = ∑_{i=1}^{n} ln(f(y_i | θ))
Here, f(y_i | θ) is the probability density function evaluated at observation y_i, and θ represents the model parameters.
3. Count the Parameters (k): This includes all estimated parameters in the model. For
example, in a linear regression model, k would be the number of coefficients, including the
intercept.
4. Calculate AIC: Using the formula:
AIC = 2k − 2 ln(L̂)
This penalizes models with more parameters (to avoid overfitting) and rewards models with a
higher likelihood.
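As a minimal sketch of the steps above, the following computes the Gaussian log-likelihood of a simple linear fit and plugs it into the AIC formula. The function names and the toy data are illustrative, not from the original text:

```python
import numpy as np

def gaussian_log_likelihood(y, y_hat):
    """Log-likelihood of the residuals under a Gaussian error model,
    using the maximum-likelihood estimate of the error variance."""
    n = len(y)
    resid = y - y_hat
    sigma2 = np.mean(resid ** 2)  # MLE of the error variance
    return -0.5 * n * (np.log(2 * np.pi * sigma2) + 1)

def aic(log_lik, k):
    """AIC = 2k - 2 ln(L-hat)."""
    return 2 * k - 2 * log_lik

# Toy linear regression: y = b0 + b1*x + noise
rng = np.random.default_rng(0)
x = np.linspace(0, 10, 50)
y = 1.5 + 2.0 * x + rng.normal(scale=1.0, size=50)
b1, b0 = np.polyfit(x, y, 1)
y_hat = b0 + b1 * x

# k = 3: intercept, slope, and the estimated error variance sigma^2
print(aic(gaussian_log_likelihood(y, y_hat), k=3))
```

Note that the error variance counts as an estimated parameter here, which is why k = 3 rather than 2 for a straight-line fit.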
BIC = k ln(n) − 2 ln(L̂)
Where:
k is the number of estimated parameters, n is the number of observations, and L̂ is the maximized likelihood.
The term k ln(n) increases more rapidly with the number of parameters compared to AIC’s
penalty term 2k, making BIC more conservative.
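The two criteria can be compared directly on the same fit, since they share the −2 ln(L̂) term and differ only in the penalty. A small sketch (the log-likelihood value below is made up for illustration):

```python
import math

def aic(log_lik, k):
    """AIC = 2k - 2 ln(L-hat)."""
    return 2 * k - 2 * log_lik

def bic(log_lik, k, n):
    """BIC = k ln(n) - 2 ln(L-hat)."""
    return k * math.log(n) - 2 * log_lik

# Same fit, two criteria: with n = 100, each parameter costs
# ln(100) ≈ 4.61 under BIC versus a flat 2 under AIC.
log_lik, k, n = -120.0, 5, 100
print(round(aic(log_lik, k), 2))     # 250.0
print(round(bic(log_lik, k, n), 2))  # 263.03
```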
Key Differences:
Penalty Terms:
AIC penalizes models with more parameters using 2k, while BIC uses k ln(n). BIC's
penalty grows faster with the number of observations, making it more stringent for larger
datasets.
Focus: AIC is more focused on predictive accuracy, while BIC is more focused on finding
the "true" model by incorporating a stronger penalty for complexity.
Model Preference: AIC tends to select more complex models, while BIC leans toward
simpler models, especially as the dataset size grows.
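The crossover between the two penalties follows directly from the formulas: an extra parameter costs 2 under AIC but ln(n) under BIC, so BIC is the stricter criterion exactly when ln(n) > 2, i.e. n > e² ≈ 7.39. A quick check (helper names are illustrative):

```python
import math

def aic_penalty_per_param():
    """Marginal cost of one extra parameter under AIC."""
    return 2.0

def bic_penalty_per_param(n):
    """Marginal cost of one extra parameter under BIC."""
    return math.log(n)

# BIC overtakes AIC's penalty once n > e^2 ≈ 7.39.
for n in (5, 8, 1000):
    print(n, bic_penalty_per_param(n) > aic_penalty_per_param())
# 5 False
# 8 True
# 1000 True
```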
AICc = AIC + 2k(k + 1) / (n − k − 1)
Where:
n is the sample size and k is the number of estimated parameters.
This correction term accounts for small sample sizes and adjusts AIC upwards when n is small,
preventing overfitting in small datasets.
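A sketch of the correction, showing how it shrinks as n grows (the function name is illustrative):

```python
def aicc(aic_value, k, n):
    """Small-sample corrected AIC: AICc = AIC + 2k(k+1)/(n - k - 1)."""
    if n - k - 1 <= 0:
        raise ValueError("AICc requires n > k + 1")
    return aic_value + (2 * k * (k + 1)) / (n - k - 1)

# For k = 3 the correction is 24/(n - 4): large for small n, negligible for big n.
print(round(aicc(10.0, 3, 20), 3))    # 11.5
print(round(aicc(10.0, 3, 2000), 3))  # 10.012
```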
BICc (Corrected BIC): Though less commonly used, a corrected BIC would adjust for
small-sample bias in a similar way; however, BIC's k ln(n) penalty already pushes
strongly toward simpler models as the dataset grows.
Log-Likelihood
Counting parameters
https://en.wikipedia.org/wiki/Akaike_information_criterion
AIC = −2 log(L) + 2(p + q + l + 1)
where l = 1 if c ≠ 0 and l = 0 if c = 0.
Note that the last term in parentheses is the number of parameters in the model (including σ²,
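The parameter count in that formula can be sketched as a small helper, assuming p AR terms, q MA terms, the constant c when present, and σ² (the helper names are illustrative, not a library API):

```python
def arima_num_params(p, q, include_constant):
    """p AR terms + q MA terms + the constant c (if present) + sigma^2."""
    l = 1 if include_constant else 0
    return p + q + l + 1

def arima_aic(log_lik, p, q, include_constant):
    """AIC = -2 log(L) + 2(p + q + l + 1)."""
    return -2 * log_lik + 2 * arima_num_params(p, q, include_constant)

# An ARIMA model with p = 2, q = 1 and a constant: 2 + 1 + 1 + 1 = 5 parameters
print(arima_num_params(2, 1, True))  # 5
```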