
Paper SA01_05

SAS® Code to Select the Best Multiple Linear Regression Model
for Multivariate Data Using Information Criteria
Dennis J. Beal, Science Applications International Corporation, Oak Ridge, TN

ABSTRACT

Multiple linear regression is a standard statistical tool that regresses p independent variables against a
single dependent variable. The objective is to find a linear model that best predicts the dependent variable
from the independent variables. Information criteria uses the covariance matrix and the number of
parameters in a model to calculate a statistic that summarizes the information represented by the model by
balancing a trade-off between a lack of fit term and a penalty term. SAS® calculates Akaike's Information
Criteria (AIC) for all 2^p possible models for p ≤ 10 independent variables. AIC estimates a measure of
the difference between a given model and the “true” model. The model with the smallest AIC among all
competing models is deemed the best model. This paper provides SAS code that can be used to
simultaneously evaluate up to 1024 models to determine the best subset of variables that minimizes the
information criteria among all possible subsets. Simulated multivariate data are used to compare the
performance of AIC to select the true model with standard statistical techniques such as minimizing RMSE,
forward selection, backward elimination, and stepwise regression. This paper is for intermediate SAS users
of SAS/STAT who understand multivariate data analysis.

Key words: Akaike’s Information Criteria, multivariate linear regression, model selection

INTRODUCTION

Multiple linear regression is one of the statistical tools used for discovering relationships between variables.
It is used to find the linear model that best predicts the dependent variable from the independent variables.
A data set with p independent variables has 2^p possible subset models to consider since each of the p
variables is either included or excluded from the model, not counting interaction terms. Model diagnostics
are calculated for each model to help determine which model is “best”. These model diagnostics include the
root mean square error (RMSE) and the coefficient of determination (R²). A good linear model will have a
low RMSE and a high R² close to 1. However, these model diagnostics alone are insufficient to determine
the best model.

The usual techniques taught in statistics courses to find the best linear model include minimizing the RMSE,
maximizing R², forward selection, backward elimination and stepwise regression. This paper will compare
these techniques to minimizing the information criteria statistic on simulated data from several distributions.
SAS code for determining the best linear model will be shown.

COMMON STATISTICAL TECHNIQUES

Five common statistical techniques taught in most statistics courses to determine the best linear model
include minimizing the RMSE, maximizing R², forward selection, backward elimination and stepwise
regression.

The RMSE is a function of the sum of squared errors (SSE), number of observations n and the number of
parameters p and is shown in Eqn. (1).

RMSE = √(SSE / (n - p))                                                (1)

The RMSE is calculated for all possible subset models. Using this technique, the model with the smallest
RMSE is declared the best linear model. This approach does account for the number of parameters in the
model, since additional parameters decrease both the numerator and the denominator.
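
As an illustration of Eqn. (1), the following DATA step computes the RMSE from assumed values of SSE, n and p.
The numbers are hypothetical and serve only to show the calculation; they are not results from this paper.

data rmse_example;
  sse = 9300;                   /* hypothetical sum of squared errors */
  n   = 1000;                   /* hypothetical number of observations */
  p   = 4;                      /* hypothetical number of parameters */
  rmse = sqrt(sse / (n - p));   /* Eqn. (1) */
  put 'RMSE = ' rmse;
run;
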
The coefficient of determination R² is the percentage of the variability of the dependent variable that is
explained by the variation of the independent variables. Therefore, the R² value ranges from 0 to 1. R² is a
function of the total sum of squares (SST) and the SSE and is shown in Eqn. (2).

R² = 1 - SSE / SST                                                     (2)
The R² is calculated for all possible subset models. Using this technique, the model with the largest R² is
declared the best linear model. However, this technique has several disadvantages. First, the R² increases
with each variable included in the model. Therefore, this approach encourages including all variables in the
best model, although some variables may not significantly contribute to the model. This approach also
contradicts the principle of parsimony, which encourages as few parameters in a model as possible.
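
Similarly, R² from Eqn. (2) can be computed directly; the SSE and SST values in this sketch are hypothetical and
are used only to illustrate the calculation.

data rsq_example;
  sse = 9300;                   /* hypothetical sum of squared errors */
  sst = 1500000;                /* hypothetical total sum of squares */
  rsq = 1 - sse / sst;          /* Eqn. (2) */
  put 'R-square = ' rsq;
run;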

Forward selection begins with only the intercept term in the model. For each of the independent variables
the F statistic is calculated to determine each variable's contribution to the model. The variable with the
smallest p-value below a specified α cutoff value (e.g., 0.15), indicating statistical significance, is added to the
model. The model is rerun keeping this variable and recalculating F statistics on the remaining p - 1
independent variables. This process continues until no remaining variables have F statistic p-values below
the specified α. Once a variable is in the model, it remains in the model.

Backward elimination begins by including all variables in the model and calculating F statistics for each
variable. The variable with the largest p-value exceeding the specified α cutoff value is then removed from
the model. This process continues until no remaining variables have F statistic p-values above the specified
α. Once a variable is removed from the model, it cannot be added to the model again.

Stepwise regression is a modification of the forward selection technique in that variables already in the
model do not necessarily stay there. As in the forward selection technique, variables are added one at a
time to the model, as long as the F statistic p-value is below the specified α. After a variable is added,
however, the stepwise technique evaluates all of the variables already included in the model and removes
any variable that has an insignificant F statistic p-value exceeding the specified α. Only after this check is
made and the identified variables have been removed can another variable be added to the model. The
stepwise process ends when none of the variables excluded from the model has an F statistic significant at
the specified α and every variable included in the model is significant at the specified α.
Other model selection techniques not evaluated in this paper include the adjusted R² and Mallow's Cp. Hocking
(1976) and Sclove (1987) discuss the use of these and other statistical techniques in model selection.

INFORMATION CRITERIA

Information criteria is a measure of goodness of fit or uncertainty for the range of values of the data. In the
context of multiple linear regression, information criteria measures the difference between a given model
and the “true” underlying model. Akaike (1973) introduced the concept of information criteria as a tool for
optimal model selection. Akaike’s Information Criteria (AIC) is a function of the number of observations n,
the SSE and the number of parameters p, as shown in Eqn. (3).

AIC = n · ln(SSE / n) + 2p                                             (3)

The first term in Eqn. (3) is a measure of the model lack of fit while the second term is a penalty term for
additional parameters in the model. Therefore, as the number of parameters p included in the model
increases, the lack of fit term decreases while the penalty term increases. Conversely, as variables are
dropped from the model the lack of fit term increases while the penalty term decreases. The model with the
smallest AIC is deemed the “best” model since it minimizes the difference from the given model to the “true”
model.
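
To illustrate the trade-off in Eqn. (3), the following DATA step evaluates AIC for two hypothetical models fit to the
same n = 1000 observations. The SSE values and parameter counts are assumptions chosen only to show how the lack of
fit term and the penalty term move in opposite directions.

data aic_example;
  n = 1000;                          /* hypothetical number of observations */
  sse = 9400; p = 3;                 /* hypothetical smaller model */
  aic = n*log(sse/n) + 2*p;          /* Eqn. (3); log() is the natural logarithm in SAS */
  put 'AIC with 3 parameters = ' aic;
  sse = 9350; p = 6;                 /* hypothetical larger model with slightly smaller SSE */
  aic = n*log(sse/n) + 2*p;          /* smaller lack of fit term, larger penalty term */
  put 'AIC with 6 parameters = ' aic;
run;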

Akaike (1973) forms the basis for the concept of information criteria. Other references that use AIC for
model selection include Akaike (1987), Bozdogan (1987 and 2000) and Sawa (1978).
EXAMPLE DATA

A multivariate data set with 10 independent variables and one dependent variable was simulated from a
known “true” model that is a linear function of a subset of the independent variables. The following SAS
code simulates 1000 observations for these 10 independent X variables and one dependent Y variable. The
10 independent X variables come from normal, lognormal, exponential and uniform distributions with various
means and variances. Variables X5, X6 and X9 are correlated with other variables.

data a;
  do i = 1 to 1000;
    x1 = 10 + 5*rannor(0);                  * Normal(10, 25);
    x2 = exp(3*rannor(0));                  * lognormal;
    x3 = 5 + 10*ranuni(0);                  * uniform;
    x4 = 100 + 50*rannor(0);                * Normal(100, 2500);
    x5 = x1 + 3*rannor(0);                  * normal, correlated with x1;
    x6 = 2*x2 + ranexp(0);                  * lognormal and exponential mixture;
    x7 = 0.5*exp(4*rannor(0));              * lognormal;
    x8 = 10 + 8*ranuni(0);                  * uniform;
    x9 = x2 + x8 + 2*rannor(0);             * lognormal, uniform and normal mix;
    x10 = 200 + 90*rannor(0);               * normal(200, 8100);
    y = 3*x2 - 4*x8 + 5*x9 + 3*rannor(0);   * true model with no intercept term;
    output;
  end;
run;
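
Because X5, X6 and X9 are constructed from other simulated variables, the induced correlations can be verified with a
quick PROC CORR step. This check is optional and is not part of the model selection code.

proc corr data=a;
  var x1 x2 x5 x6 x8 x9;   * confirm the correlations among the simulated variables;
run;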

SAS CODE FOR AIC

The following SAS code from SAS/STAT computes AIC for all possible subsets of multiple regression
models for main effects. The selection=adjrsq option specifies that the adjusted R² method will be used to
select the model, although other selection options may also be used such as selection=rsquare.
The SSE option displays the sum of squared errors for each model, while the AIC option displays the AIC
statistic for each model. The first proc reg calculates AIC for all possible subsets of main effects using an
intercept term. The second proc reg calculates AIC for all possible subsets of main effects without an
intercept term by specifying the noint option. The output data sets est and est0 are combined, sorted
and printed from smallest AIC to largest. The model with the smallest AIC value is deemed the “best”
model. The SAS code presented in this paper uses the SAS System for personal computers version 8.2 (TS
level 02M0) running on a Windows 2000 platform.

proc reg data=a outest=est;
  model y = x1 x2 x3 x4 x5 x6 x7 x8 x9 x10 / selection=adjrsq sse aic;
  output out=out p=p r=r;
run; quit;

proc reg data=a outest=est0;
  model y = x1 x2 x3 x4 x5 x6 x7 x8 x9 x10 / noint selection=adjrsq sse aic;
  output out=out0 p=p r=r;
run; quit;

data estout;
  set est est0;
run;

proc sort data=estout;
  by _aic_;
run;

proc print data=estout(obs=8);
run;

COMPARISON OF AIC RESULTS WITH HEURISTIC METHODS

SAS will calculate the AIC for every possible subset of variables for models with up to 10 independent
variables. SAS confirmed the minimum AIC for all possible subsets of variables is 2239.73 with only the X2,
X8 and X9 variables in the model and no intercept term.
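
As a quick verification, the minimum AIC subset can be refit directly. The following sketch refits the selected model
(X2, X8 and X9 with no intercept term); the output data set name best is arbitrary.

proc reg data=a outest=best;
  model y = x2 x8 x9 / noint sse aic;
run; quit;

proc print data=best; run;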

Independent variables X1 through X10 were regressed against the dependent variable Y (generated from a true model
with no intercept term) using forward selection, backward elimination and stepwise regression with an assumed entry
and exit significance level of 0.15. An entry significance level of 0.15, specified in the slentry=0.15 option, means a
variable must have a p-value < 0.15 in order to enter the model during forward selection and stepwise
regression. An exit significance level of 0.15, specified in the slstay=0.15 option, means a variable must
have a p-value > 0.15 in order to leave the model during backward elimination and stepwise regression.

The following SAS code performs the forward selection method by specifying the option
selection=forward. The model diagnostics are output into the data set est1.

proc reg data=a outest=est1;
  model y = x1 x2 x3 x4 x5 x6 x7 x8 x9 x10 / slstay=0.15 slentry=0.15
        selection=forward ss2 sse aic;
  output out=out1 p=p r=r;
run; quit;

The following SAS code performs the backward elimination method by specifying the option
selection=backward. The model diagnostics are output into the data set est2.

proc reg data=a outest=est2;
  model y = x1 x2 x3 x4 x5 x6 x7 x8 x9 x10 / slstay=0.15 slentry=0.15
        selection=backward ss2 sse aic;
  output out=out2 p=p r=r;
run; quit;

The following SAS code performs stepwise regression by specifying the option selection=stepwise.
The model diagnostics are output into the data set est3.

proc reg data=a outest=est3;
  model y = x1 x2 x3 x4 x5 x6 x7 x8 x9 x10 / slstay=0.15 slentry=0.15
        selection=stepwise ss2 sse aic;
  output out=out3 p=p r=r;
run; quit;

The following SAS code calculates the RMSE for each possible subset model, sorts the models from
smallest to largest RMSE and then prints the best 10 models. Specifying adjrsq in the option
selection=adjrsq is not crucial since the goal is to minimize RMSE. Other choices for the selection
option are rsquare or CP. The model diagnostics are output into the data sets est4 and est5.

proc reg data=a outest=est4;
  model y = x1 x2 x3 x4 x5 x6 x7 x8 x9 x10 / selection=adjrsq sse aic adjrsq;
  output out=out p=p r=r;
run; quit;

proc reg data=a outest=est5;
  model y = x1 x2 x3 x4 x5 x6 x7 x8 x9 x10 / noint selection=adjrsq sse aic adjrsq;
  output out=out p=p r=r;
run; quit;

data both;
  set est4 est5;
run;

proc sort data=both;
  by _rmse_;
run;

proc print data=both(obs=10);
run;

Table 1 shows that all four heuristic methods include variables X2, X3, X8, X9 and X10 along with an
intercept term. A heuristic method is an approximate method that does not guarantee convergence to the
optimal model. The AIC for these models is 2240.3, which is higher than the AIC for the true model.
Forward, backward and stepwise regression methods selected the same model that includes a nonzero
intercept and variables X3 and X10 that are not part of the true underlying model. The minimized RMSE
method includes variables X3, X7 and X10 in addition to the nonzero intercept term that are not part of the
true underlying model and has the largest AIC of the methods in Table 1. The model that minimized AIC is
the true underlying model. Therefore, Table 1 illustrates how the forward, backward, stepwise regression
and minimizing RMSE heuristic methods fail to identify the underlying model which minimizes AIC.
Table 1. A comparison of AIC with four heuristic methods

Parameter estimates
Variable      True Model   Minimized AIC   Forward        Backward       Stepwise       Minimized
                                           selection      selection                     RMSE
Intercept                                   1.30456        1.30456        1.30456        1.30058
X1
X2                 3          3.0212        3.01216        3.01216        3.01216        3.0118
X3                                         -0.04905       -0.04905       -0.04905       -0.04722
X4
X5
X6
X7                                                                                      -6.415E-06
X8                -4         -3.98233      -4.02578       -4.02578       -4.02578       -4.02631
X9                 5          4.97887       4.98791        4.98791        4.98791        4.98827
X10                                        -0.00166       -0.00166       -0.00166       -0.0016849

Model diagnostics
R²                            1             1              1              1              1
Adjusted R²                   1             1              1              1              1
RMSE                          3.05985       3.05615        3.05615        3.05615        3.05613
AIC                        2239.73       2240.3         2240.3         2240.3         2241.27
F                          1.07E+10      6.44E+09       6.44E+09       6.44E+09       5.36E+09
Pr > F                       <.0001        <.0001         <.0001         <.0001         <.0001

CONCLUSION

SAS is a powerful tool that utilizes AIC to simultaneously evaluate all possible subsets of multiple regression
models to determine the best model for up to 10 independent variables. Using information criteria for
multivariate model selection has been shown to be superior to heuristic methods such as forward selection,
backward elimination, stepwise regression and minimizing RMSE using simulated data with a known
underlying model.

REFERENCES

Akaike, H. (1973). Information theory and an extension of the maximum likelihood principle. In B.N. Petrov
and F. Csaki (Eds.), Second international symposium on information theory, 267-281. Budapest:
Academiai Kiado.

Akaike, H. (1987). Factor analysis and AIC. Psychometrika, 52, 317-332.

Bozdogan, H. (1987). Model selection and Akaike’s information criterion (AIC): the general theory and its
analytical extensions, Psychometrika, 52, No. 3, 345-370.

Bozdogan, H. (2000). Akaike’s information criterion and recent developments in informational complexity.
Journal of Mathematical Psychology, 44, 62-91.

Hocking, R. R. (1976). The analysis and selection of variables in linear regression. Biometrics, 32, 1-49.

Sclove, S. L. (1987). Application of model selection criteria to some problems in multivariate analysis.
Psychometrika, 52, 333-343.

Sawa, T. (1978). Information criteria for discriminating among alternative regression models. Econometrica,
46, 1273-1282.
CONTACT INFORMATION

The author welcomes and encourages any questions, corrections, feedback and remarks. Contact the
author at:

Dennis J. Beal
Statistician / Risk Scientist
Science Applications International Corporation
P.O. Box 2501
151 Lafayette Drive
Oak Ridge, Tennessee 37831
phone: 865-481-8736
fax: 865-481-8714
e-mail: [email protected]

SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of
SAS Institute Inc. in the USA and other countries. ® indicates USA registration. Other brand and product
names are registered trademarks or trademarks of their respective companies.
