Chi Square
Introduction
The Chi-Square distribution is a staple of statistical analysis. It is often used to judge how far observed values fall from expected values. The simplest place to start is this: the Chi-Square distribution is what you get if you take observations from a standard Normal distribution, square them, and add them up. If we use $Z_1, Z_2$, and so forth to refer to draws from $N(0,1)$, then
$$Z_1^2 + Z_2^2 + \cdots + Z_N^2 = \sum_{i=1}^{N} Z_i^2 \sim \chi^2_N$$
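A quick simulation can make this concrete. The following sketch (my own illustration, not part of the original text; the sample sizes are arbitrary) sums squared standard Normal draws and compares the result against R's built-in Chi-Square quantiles:

N <- 5
sums <- replicate(10000, sum(rnorm(N)^2))
## The simulated quantiles should sit close to the theoretical ones:
quantile(sums, c(0.25, 0.50, 0.75))
qchisq(c(0.25, 0.50, 0.75), df = N)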
That means the sum of squared $Z$s has a Chi-Square distribution with $N$ degrees of freedom. The term degrees of freedom has some emotional and cognitive implications for psychologists, but it is really just a parameter for us. Things that are sums of squares have $\chi^2$ distributions. Now, suppose the numbers being added up are not standardized, but they are centered. That is to say, they have a Normal distribution with a mean of 0 and a standard deviation of $sd$. That means we would have to divide each observation by $sd$ in order to obtain the $Z_i$s, which are standardized Normal observations. Obviously,

$$\left(\frac{Y_1}{sd}\right)^2 + \left(\frac{Y_2}{sd}\right)^2 + \cdots + \left(\frac{Y_N}{sd}\right)^2 \sim \chi^2_N$$
Equivalently, suppose you think of the $Y_i$ as being proportional to the $Z_i$ in this way: $Y_i = sd \cdot Z_i$. The coefficient $sd$ is playing the role of a scaling coefficient, and without too much effort you find out that if some variable $x = \sum_i Z_i^2$ has a Chi-square distribution, $\chi^2_N$, then $sd \cdot x$ has a distribution equal to $sd \cdot \chi^2_N$. The elementary laws of expected values and variances dictate that

$$E(sd \cdot x) = sd \, E(x) \quad \text{and} \quad Var(sd \cdot x) = sd^2 \, Var(x)$$
In other words, the Chi-square distribution applies not just for a sum of squares of a standardized Normal distribution; up to a scaling factor, it describes a sum of squares of any Normal distribution that is centered around zero.
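To see the scaling at work, here is a small simulation sketch (my own; I write $s$ for the standard deviation the text calls $sd$, and the particular values are arbitrary). Summing squared draws from $N(0, s)$ gives a scaled Chi-Square, and dividing by $s^2$ recovers the standard one:

s <- 3
N <- 5
raw <- replicate(10000, sum(rnorm(N, mean = 0, sd = s)^2))
mean(raw)                 ## close to s^2 * N = 45
## Rescaling by s^2 recovers the standard Chi-Square:
quantile(raw / s^2, c(0.25, 0.50, 0.75))
qchisq(c(0.25, 0.50, 0.75), df = N)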
Mathematical Description
The probability density of $x = \sum_{i=1}^{N} Z_i^2$ is defined as:

$$f(x) = \frac{1}{2^{N/2}\,\Gamma(N/2)}\, x^{(N/2 - 1)}\, e^{-x/2}$$
It is defined on a range of positive numbers, $0 \leq x < \infty$. Because we are thinking of this value as a sum of squared values, it could not possibly be smaller than zero. It also assumes that $N > 0$, which is obviously true because we are thinking of the variable as a sum of $N$ squared items. Why does the $\chi^2$ have that functional form? Well, write down the probability model for a standardized Normal distribution, and then realize that the probability of a squared value of that standardized Normal is EXTREMELY easy to calculate if you know a little bit of mathematical statistics. The only fancy bit is that this formula uses our friend the Gamma function (see my handout on the Gamma distribution) to represent a factorial. But we have it on good authority (Robert V. Hogg and Allen T. Craig, Introduction to Mathematical Statistics, 4th ed., New York: Macmillan, 1978, p. 115) that $\Gamma(1/2) = \sqrt{\pi}$.
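As a quick numerical check (my own sketch, with arbitrary values of $N$ and $x$), the formula above can be typed out by hand and compared against R's dchisq:

gamma(1/2)           ## equals sqrt(pi), about 1.7724539
sqrt(pi)
N <- 4; x <- 3
x^(N/2 - 1) * exp(-x/2) / (2^(N/2) * gamma(N/2))   ## hand-rolled density
dchisq(x, df = N)                                  ## should agree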
Illustrations
The probability density function of the Chi-Square distribution changes quite a bit when one puts in different values of the parameter. With well-chosen parameter settings, a clear illustration of the Chi-square can be produced. Consider the following code, which can be used to create the illustration of 2 possible Chi-Square density functions in Figure 1.

xvals <- seq(0, 10, length.out = 1000)
chisquare1 <- dchisq(xvals, df = 1)
chisquare2 <- dchisq(xvals, df = 6)
matplot(xvals, cbind(chisquare1, chisquare2), type = "l",
        xlab = "possible values of x", ylab = "probability of x",
        ylim = c(0, 1), main = "Chi-Square Probability Densities")
text(0.4, 0.9, "df=1", pos = 4, col = 1)
text(4, 0.2, "df=6", pos = 4, col = 2)

The shape of the Chi-Square depends primarily on the degrees of freedom, and adjusting the degrees of freedom has a substantial impact on the shape of the distribution. Code along the following lines will produce example density functions for a variety of degrees of freedom; examples are shown in Figure 2.
Figure 1: $\chi^2$ Density Functions
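Here is a sketch of code that could produce Figure 2 (the df values 1, 2, 3, and 6 are my own choices for illustration; the original may have used different settings):

xvals <- seq(0, 20, length.out = 1000)
dfs <- c(1, 2, 3, 6)
dens <- sapply(dfs, function(d) dchisq(xvals, df = d))
matplot(xvals, dens, type = "l",
        xlab = "possible values of x", ylab = "probability of x",
        main = "Chi-Square Densities for Several df")
legend("topright", legend = paste("df =", dfs), col = 1:4, lty = 1:4)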
The Chi-Square distribution is a form of the Gamma distribution, and most treatments of the Chi-Square rely on the general results about the Gamma to state the characteristics of the special-case Chi-square. The Gamma distribution $G(\alpha, \beta)$ is a two-parameter distribution, with parameters shape ($\alpha$) and scale ($\beta$):

$$\text{Gamma probability density} = \frac{1}{\Gamma(\alpha)\,\beta^{\alpha}}\, x^{\alpha - 1}\, e^{-x/\beta}$$
Note that if the shape parameter of a Gamma distribution is $N/2$ and the scale parameter is equal to 2, then this probability density is identical to the Chi-square distribution with degrees of freedom equal to $N$. Since it is known that the expected value of a Gamma distribution is $\alpha\beta$ and the variance is $\alpha\beta^2$, that means that the expected value of a Chi-square for $N$ observations is

$$E(x) = N$$

and the variance of a Chi-square variable is

$$Var(x) = 2N$$
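That identity is easy to confirm numerically; a one-line sketch (the df value is arbitrary):

x <- seq(0.1, 10, by = 0.1)
all.equal(dchisq(x, df = 5), dgamma(x, shape = 5/2, scale = 2))  ## TRUE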
Figure 2: $\chi^2$ density functions for a variety of degrees of freedom
Now, if a variable is proportional to a Chi-Square $x_i$, say $y_i = \theta x_i$, we know that $y_i$ has a distribution $y_i \sim \theta \chi^2_N$ and the probability density is (via a change of variables)

$$f(y_i) = \frac{y_i^{(N/2 - 1)} \exp(-y_i / 2\theta)}{\theta^{N/2}\, 2^{N/2}\, \Gamma[N/2]}$$

and

$$E(y_i) = \theta N \qquad Var(y_i) = 2\theta^2 N$$

The mode (for $N > 2$) is

$$mode(y_i) = \theta(N - 2)$$

The Chi-Square is related to the Poisson distribution with parameter and expected value equal to $x_i/2$ by the following identity (for even $n$):

$$P[ChiSquare(n) \geq x_i] = P\left[Poisson\left(\frac{x_i}{2}\right) \leq \frac{n}{2} - 1\right]$$
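A numerical spot-check of this identity (a sketch; the values of n and x are chosen arbitrarily, with n even):

n <- 6; x <- 4.2
pchisq(x, df = n, lower.tail = FALSE)   ## P[ChiSquare(6) >= 4.2]
ppois(n/2 - 1, lambda = x/2)            ## P[Poisson(2.1) <= 2]; should match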
In statistical problems, we often confront 2 kinds of parameters. The slope coefficients of a regression model are one type, and we usually have priors that are single-peaked and symmetric. The prior for such a coefficient might be Uniform, Normal, or any other mathematically workable distribution. Sometimes other coefficients are not supposed to be symmetrical. For example, the variance of a distribution cannot be negative, so we need a distribution that is shaped to have a minimum at zero. The Gamma, or its special case the Chi-square, is an obvious candidate. The most important aspect of the Chi-square, however, is that it is very mathematically workable! If one is discussing a Normal distribution, for example, $N(\mu, \sigma^2)$, one must specify prior beliefs about the distributions of $\mu$ and $\sigma^2$. Recall that in Bayesian updating, we calculate the posterior probability as the product of the likelihood times the prior, so some formula that makes that result as simple as possible would be great.

$$p(\sigma^2 | y) \propto p(y | \sigma^2)\, p(\sigma^2)$$

From the story that we told about where Chi-square variables come from, it should be very obvious that if $y$ is Normal, we can calculate $p(y | \sigma^2)$ (assuming $\mu$ is taken as given for the moment). So all we need is a prior that makes $p(\sigma^2 | y)$ as simple as possible. If you choose $p(\sigma^2)$ to be Chi-squared, then it turns out to be very workable. Suppose you look at the numerator from the Chi-Square, and guess that you want to put $1/\sigma^2$ in place of $x_i$. You describe your prior opinion about $\sigma^2$ as

$$p(\sigma^2) \propto (\sigma^2)^{-N/2 - 1} \exp\left(-\frac{1}{2} S_0 / \sigma^2\right)$$
We use $N$ and $S_0$ as scaling factors to describe how our beliefs vary from one situation to another. $N$ is the degrees of freedom. Note that this is very convenient if your Normal theory for $y$ says:

$$p(y_i | \sigma^2) = \frac{1}{\sqrt{2\pi\sigma^2}} \exp\left(-\frac{1}{2\sigma^2}(y_i - \mu)^2\right)$$
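As a sanity check on that formula, here is a tiny sketch (my own; the values of $y_i$, $\mu$, and $\sigma^2$ are arbitrary) comparing the hand-written density against R's dnorm:

yi <- 1.3; mu <- 0; sigma2 <- 2
(1 / sqrt(2 * pi * sigma2)) * exp(-(yi - mu)^2 / (2 * sigma2))
dnorm(yi, mean = mu, sd = sqrt(sigma2))   ## should agree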
Suppose the sample size of the dataset is $n$. If you let

$$S = \sum_{i=1}^{n} (y_i - \mu)^2$$

represent the sum of squares, then we rearrange to find a posterior:
$$p(\sigma^2 | y) \propto (\sigma^2)^{-(N+n)/2 - 1} \exp\left(-\frac{1}{2}(S_0 + S) / \sigma^2\right)$$

Look how similar the prior is to the posterior. It gets confusing discussing $\sigma^2$ and $1/\sigma^2$. Bayesians don't usually talk about estimating the variance $\sigma^2$, but rather the precision, which is defined as

$$\tau = \frac{1}{\sigma^2}$$
Hence, the distribution of the precision is given as a Chi-Square variable, and if your prior is

$$prior: \quad p(\tau) \propto \tau^{N/2 - 1} \exp\left(-\frac{1}{2} S_0 \tau\right)$$

then the posterior is a Chi-Square variable:

$$(S_0 + S)\,\tau \sim \chi^2_{N+n}$$

If you really do want to talk about the variance, rather than the precision, then you are using a prior that is an INVERSE Chi-Square. Your prior is the inverse chi-square

$$\frac{S_0}{\sigma^2} \sim \chi^2_N$$
As a result, a prior for a variance parameter is often given as an inverse Chi-square, while the prior for a precision parameter is given as a Chi-square.
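To make the update concrete, here is a small simulation sketch (entirely my own; the prior settings and the simulated data are hypothetical), drawing posterior samples of $\sigma^2$ from the inverse Chi-Square form:

set.seed(1234)
mu <- 0                              ## treat mu as known, as in the text
N <- 3; S0 <- 3                      ## hypothetical prior df and prior sum of squares
y <- rnorm(20, mean = mu, sd = 2)    ## fake data with true variance 4
n <- length(y); S <- sum((y - mu)^2)
## Posterior says (S0 + S)/sigma^2 ~ ChiSquare(N + n), so invert the draws:
sigma2.draws <- (S0 + S) / rchisq(5000, df = N + n)
mean(sigma2.draws)                   ## should be near the true variance, 4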