Hypothesis Testing with R
In our day-to-day routine we come across many questions such as:
• How many liters of water should be allotted on average to a household in Delhi [the nation saw a heated debate on this during the Delhi elections a few years back]
• How much stock of apple juice should I order for the Powai location
• By what amount should we increase credit card limits for a certain group of customers
• Should the BCCI select Ishant Sharma for the Australia tour in spite of his poor performance in the last 4 Test matches in India
To figure out answers to these questions at a very basic level, you don't really need any hardcore statistics knowledge. For example, let's work through the first question from the list above, and then one more example.
To figure out how much water an average Delhi household needs, we could simply ask every household. But that is easier said than done: there are far too many households to cover, and the resources and time required would not justify the utility of the outcome. So what do we do instead? Rather than connecting with each and every household, we can talk to a handful of them. In other words, we take a small sample, and consider their average consumption as the general answer. But we need to be careful about which households we select as the sample. Water consumption needs might not be the same across all regions of Delhi, and there will be certain differences between residential and commercial entities. Our sample should contain observations from all these different segments [strata in the population] in order to truly represent the whole of Delhi.
Next, take the question of whether science students need additional tuition for language courses. The underlying thought here is that science students perform poorly in language courses in comparison to students coming from other streams. To figure out whether that is the case, we'll collect student performance data and look at the average performance of students from science and non-science streams. If science students are performing significantly worse than their counterparts, we'll take the decision in favor of providing them with additional tuition.
The common theme in solving both these problems was to collect data and then use it to verify or refute our claims, or to estimate the value of a parameter. What our methods lacked, though, was a rigorous statistical framework. We'll build that as we progress through this module.
We'll be discussing various concepts from here onwards which might seem slightly disconnected in the beginning, but everything will fall into place as we near the end. Starting with Population and Sample: we have already used these concepts above, so it's time to formally introduce them.
A population is a huge collection of data points. This can be anything, from the ages of all graduating engineers in India in the last decade to the number of pages in every book published on statistics in the last century. If we want to figure out a parameter [any characteristic] value for this "population", in theory we could measure each and every observation to find, say, the average value.

As we witnessed earlier, this is rarely feasible. For all practical purposes, to estimate the value of a population parameter [such as the average, standard deviation etc.] we work with a sample instead.
These "sample" observations need to be randomly chosen in order to avoid personal bias. They should also represent all strata present within the population.

The purpose of sampling from the population is to estimate a population parameter, such as the average age of graduating engineers or the average number of pages in statistics books.

We don't really know what the actual value of the population parameter is; estimates give us an idea of what it might be. As your sample size grows bigger and bigger, the estimate gets closer to the real value of the population parameter.
Since these samples contain randomly chosen observations, each new sample will give you a different estimate of the population parameter. What this means is that an estimate from a sample will always contain error. The term error here is not the same as mistake: errors are an inherent part of estimates by design. Hypothesis Testing is a framework to quantify these errors.
We have all seen those frequency bar charts at some point while working with Excel sheets. Let's consider the scores of 40 students in a quiz. The first few data points look like this:

    Score
1   7.5
2   10.7
3   6.7
4   16.4
Score Frequency
1.1 1
2.0 1
4.1 1
4.5 1
6.7 2
7.5 2
. . . full table truncated
[Bar chart: counts of individual quiz scores; x-axis Score (0 to 15), y-axis count]
This bar chart, however, is not very informative: many frequencies are simply equal and many intervals remain blank. It doesn't give us a good idea of what kind of values are more frequent and so on. Since we have a small number of data points, we can club them into classes. For example, all scores like 9.2, 9.4, 9.6 etc. can be clubbed into class 9. Let's see how the bar chart looks once we do this:
[Bar chart: counts after clubbing scores into classes; x-axis Score (0 to 15), y-axis count]
Let's convert these counts to frequency percentages by dividing them by the total count [40]:
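As a sketch of how such a chart can be produced in R [the scores vector below is simulated, since the actual quiz data is not reproduced in the text]:

library(ggplot2)

# simulated stand-in for the 40 quiz scores [hypothetical data]
set.seed(10)
scores=round(runif(40,0,16),1)
d=data.frame(Score=scores)
d$class=round(d$Score)   # club scores like 9.2, 9.4, 9.6 into class 9

# frequency percent = class count / total count [40]
ggplot(d,aes(x=class))+
  geom_bar(aes(y=(..count..)/sum(..count..)))+
  ylab("Frequency Percent")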
[Bar chart: Frequency Percent by score class; x-axis Score (0 to 15)]
By looking at this Frequency Percent chart, and assuming that this sample represents the entire population of students, I can say that 20% of students score in "class 11". Or in other words, if I randomly pick a student's score, the probability of it being in class 11 is 0.2, i.e. 20%.

If you ask me the probability that a student scores between 5 and 10, I will simply add all the probabilities associated with "class 5" through "class 10":
P (5 ≤ Score ≤ 10) = P (Score = 5)+P (Score = 6)+P (Score = 7)+P (Score = 8)+P (Score = 9)+P (Score = 10)
This teaches us that we can comment on probabilities of occurrence in an interval using cumulative probabilities [addition of probabilities]. Also, by collecting more data we can make our "classes" finer and finer, which gives us an idea of the frequency of occurrence of values at a much finer level. Let's see how this looks with 200 data points instead of just 40.
The resulting chart contains much more information in comparison to the 40-point version:
[Bar chart: Frequency Percent with 200 data points and finer classes; x-axis Score (0 to 20)]
We can keep collecting data until, let's say, we have practically infinite data points. Imagine that we now draw a curve joining the tops of all these fine bars. We could still comment on probabilities by looking at the point on the curve associated with those values of x.
[Chart: Frequency Percent bars joined into a smooth curve; x-axis Score (0 to 20)]
Let's say this curve is represented by y = f(x): if you pass in a value of x, it gives the relative likelihood of occurrence of that value. Now, instead of adding up the probabilities of occurrence to get interval probabilities, we can integrate:

P(a ≤ X ≤ b) = ∫ₐᵇ f(x) dx

This f(x) is nothing but the distribution curve of the population. By looking at this curve you can get an idea of the probability of occurrence of the values in your population.
The Normal distribution is such a curve. It has the following equation:

f(x) = (1/(σ√(2π))) · exp( −(x − µ)² / (2σ²) )
where µ and σ are the population mean and population standard deviation respectively. They are the parameters of this function: for different values of µ and σ, you get a different Normal distribution. This is similar to the general equation of a line, y = mx + c; for different values of the slope (m) and intercept (c) you get different lines.
Don't let this weird-looking equation intimidate you. It's just the equation associated with a particular distribution, and it satisfies the conditions required of such a curve [it is non-negative everywhere and the total area under it is 1].
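You can see this family-of-curves behavior using R's built-in dnorm, which computes f(x) for a given µ and σ; a quick sketch:

library(ggplot2)
x=seq(-6,6,by=0.01)
d=data.frame(x=x,
             f1=dnorm(x,mean=0,sd=1),    # one choice of mu and sigma
             f2=dnorm(x,mean=2,sd=0.5))  # another choice: a different curve
ggplot(d,aes(x))+
  geom_line(aes(y=f1),color="red")+
  geom_line(aes(y=f2),color="green")+
  ylab("f(x)")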
Standardization
We are jumping to another topic; have patience, things will connect eventually.
Consider data points for a variable X: x₁, x₂, x₃ ... xₙ. For these data points we can calculate the average and standard deviation as follows:

X̄ = (1/n) Σᵢ₌₁ⁿ xᵢ

Sₓ = √( (1/n) Σᵢ₌₁ⁿ (xᵢ − X̄)² )

Standardization means transforming each data point as zᵢ = (xᵢ − X̄)/Sₓ; the standardized values then have mean 0 and standard deviation 1.
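A quick sketch of standardization in R. Note that R's built-in sd divides by n − 1 rather than n, so it differs slightly from the Sₓ defined above for small samples:

x=c(7.5,10.7,6.7,16.4,9.2)   # any handful of data points
z=(x-mean(x))/sd(x)          # standardized values
mean(z)                      # ~0 [up to floating point error]
sd(z)                        # exactly 1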
Standard Normal Distribution
We have already seen the general equation of the normal distribution with parameters µ and σ:

f(x) = (1/(σ√(2π))) · exp( −(x − µ)² / (2σ²) )

Setting µ = 0 and σ = 1 gives the standard normal distribution:

f(x) = (1/√(2π)) · exp( −x²/2 )
Few results from the standard normal distribution In literature elsewhere you'll find the standard normal variable represented as Z. We'll follow the same notation.

The following result is self-explanatory, given that the mean of the standard normal distribution is 0 and normal distributions in general are symmetric:

P(Z ≥ 0) = P(Z ≤ 0) = 0.50

A few other results, which can also be broken into symmetrical halves as above, are:

P(−1 ≤ Z ≤ 1) = 0.682
P(−2 ≤ Z ≤ 2) = 0.954
P(−3 ≤ Z ≤ 3) = 0.997
Since the distribution is symmetric, as mentioned before, one-sided probabilities as well as remainder probabilities can be easily calculated. For example, using the results above:

P(0 ≤ Z ≤ 1) = 0.682/2 = 0.341

P(Z ≥ 2) = 1 − P(Z ≤ 0) − P(0 ≤ Z ≤ 2) = 1 − 0.5 − 0.954/2 = 0.023
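You can verify all of these results in R with pnorm, which gives P(Z ≤ z) for the standard normal distribution:

pnorm(1)-pnorm(-1)   # ~0.683 : P(-1 <= Z <= 1)
pnorm(2)-pnorm(-2)   # ~0.954 : P(-2 <= Z <= 2)
pnorm(3)-pnorm(-3)   # ~0.997 : P(-3 <= Z <= 3)
1-pnorm(2)           # ~0.023 : P(Z >= 2)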
Confidence Intervals
Here are a few more results from the standard normal distribution, similar to those listed above:

P(−1.645 ≤ Z ≤ 1.645) = 0.90
P(−1.96 ≤ Z ≤ 1.96) = 0.95
P(−2.576 ≤ Z ≤ 2.576) = 0.99

These probability results are straightforward. One way to interpret any of them is: if you randomly pick an observation from a population which follows the standard normal distribution, there is a 90% chance/probability that it'll fall in the interval [−1.645, 1.645].

Formally, [−1.645, 1.645] is called the 90% confidence interval for the standard normal distribution. Of course there can be other arbitrary intervals [a, b] such that:

P(a ≤ Z ≤ b) = 0.90

But there is only one possible 90% interval which is symmetric about the mean of the distribution. These symmetric intervals are called Confidence Intervals, also written as CI in short.
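In R, qnorm [the inverse of pnorm] gives these symmetric interval limits directly:

qnorm(c(0.05,0.95))     # ~(-1.645, 1.645) : 90% CI, leaving 5% in each tail
qnorm(c(0.025,0.975))   # ~(-1.96, 1.96)   : 95% CI for the standard normal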
Extrapolating results to a general normal distribution From our lesson on standardization, we can say that if we have a variable X following a general normal distribution,

X ∼ N(µ, σ²)

then

Z = (X − µ)/σ

will follow a normal distribution with mean 0 and standard deviation 1, which is the standard normal distribution. So, given any of the above probability results, we can do the following [let's say, for X, µ = 10 and σ = 2]:
P(−1.645 ≤ Z ≤ 1.645) = 0.90
P(−1.645 ≤ (X − µ)/σ ≤ 1.645) = 0.90
P(µ − 1.645σ ≤ X ≤ µ + 1.645σ) = 0.90

With µ = 10 and σ = 2 this becomes P(6.71 ≤ X ≤ 13.29) = 0.90.
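The same interval for a general normal distribution comes straight from qnorm with mean and sd arguments; with the assumed µ = 10 and σ = 2:

qnorm(c(0.05,0.95),mean=10,sd=2)   # ~(6.71, 13.29) : the 90% interval for X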
So far we have talked about populations and probabilities. Let's now watch sample averages in action: we create a large, skewed population, then look at the distribution of averages of repeated samples drawn from it.

library(ggplot2)

set.seed(1)
# a population of 20000 points from a right-skewed Beta(2,5) distribution
d=data.frame(X=rbeta(20000,2,5))
p=ggplot(d,aes(x=X))
p+geom_bar(aes(y=(..count..)/sum(..count..)))+ylab("Frequency Percent")
[Bar chart: Frequency Percent of X — visibly right-skewed]
# draw 1000 samples of size 100 each and record their averages
k=numeric(1000)
for (i in 1:1000){
j=sample(nrow(d),100)   # 100 observations picked at random from the population
k[i]=mean(d$X[j])
}
d1=data.frame(k)
p=ggplot(d1,aes(x=k))
p+geom_bar(aes(y=(..count..)/sum(..count..)))+ylab("Frequency Percent")+xlab("Sample Averages")
[Bar chart: distribution of the 1000 sample averages — roughly symmetric and bell-shaped]
Let's repeat the exercise with a very different population.

set.seed(1)
t=runif(20000)
set.seed(2000)
k=runif(20000)
# inverse-transform style construction; values of t above 1/1.332 make the
# sqrt argument negative and produce NaN, hence the na.rm=T when averaging below
X=ifelse(k>0.5,4+sqrt(1-1.332*t),4-sqrt(1-1.332*t))
d=data.frame(X=X)
p=ggplot(d,aes(x=X))
p+geom_bar(aes(y=(..count..)/sum(..count..)))+ylab("Frequency Percent")
[Bar chart: Frequency Percent of the second population X]
# again, 1000 sample averages of 100 observations each
k=numeric(1000)
for (i in 1:1000){
j=sample(nrow(d),100)
k[i]=mean(d$X[j],na.rm=T)
}
d1=data.frame(k)
p=ggplot(d1,aes(x=k))
p+geom_bar(aes(y=(..count..)/sum(..count..)))+ylab("Frequency Percent")+xlab("Sample Averages")
[Bar chart: distribution of the 1000 sample averages — again roughly bell-shaped]

In both cases the sample averages look approximately normal even though the populations are nothing like normal; this is the behavior the CLT describes, and we'll lean on it in the next section.
Hypothesis Testing
Now we have gone through all the tools necessary to take the next step. Consider that I believe the average marks obtained by high school students of the Maharashtra Board is 85. This is my null hypothesis, which I have great faith in:

H0 : µ = 85

Now somebody comes along and proposes that this null hypothesis is not true, and the average marks are not equal to 85. That is the alternate hypothesis:

Ha : µ ≠ 85
To check whether this claim against the null hypothesis is true, I will take a sample of student scores. We know that this sample estimate will never be exactly equal to 85 [or whatever the true mean is], owing to the inherent errors associated with estimates.

Now the question arises: when should I refute the claim made by Ha? Since I have great faith in H0, unless the sample presents extreme evidence against H0, I will stand by H0 and reject the claims of Ha.
What is extreme evidence, then? How do I draw the line? Thanks to the CLT, I know that sample averages follow a normal distribution with mean µ and variance σ²/n. Using this, I can build a 95% confidence interval around my H0, and if the sample I draw happens to have an average falling outside of this interval, I'd consider that to be extreme evidence against H0 and reject H0.
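As a sketch with made-up numbers [sample size n = 50 and population standard deviation σ = 10, neither of which is given in the text], the 95% acceptance region around H0: µ = 85 works out as:

mu0=85; sigma=10; n=50        # hypothetical values, for illustration only
se=sigma/sqrt(n)              # sd of sample averages, by the CLT
qnorm(c(0.025,0.975),mean=mu0,sd=se)
# ~(82.23, 87.77): a sample average outside this range is extreme evidence against H0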
Acceptance & Rejection Region There is no fixed rule for taking a 95% CI as the limit for accepting or rejecting H0. It only depicts how comfortable or keen we are to accept/reject H0. Let's call this 95% the size of my acceptance region. The size of my rejection region will then be 5% [1 − 0.95]. This is denoted by α.

The choice of α is a subjective decision of the business analyst and very much depends on the business process goal. Consider these extreme cases, where we'll discuss what kind of α we'd decide to take.
• I am a manufacturing plant owner, and I want to check whether the iron rings I produce are deviating from the diameter specifications. If my tests conclude that there is deviation, I'll have to dismantle and re-assemble the entire production line, which is a huge cost in comparison to carrying on manufacturing iron rings with small deviations. This implies that unless I get very extreme evidence against my null hypothesis [that the average diameter of the rings equals the specification], I will not reject the null hypothesis. In this case I would keep my α small.
• I am in the business of launching rockets. Fuel efficiency is of critical importance and should not deviate from the specifications. I get fuel barrels from an external supplier, and I want to check the quality of this fuel by measuring fuel efficiency. In this case I'd like to capture even small deviations, so I'll keep my α relaxed [larger].
One Sided and Two Sided Tests Consider this alternate hypothesis again:

Ha : µ ≠ 85

In order to conclude in favor of this, my extreme evidence can be on either side of the mean [85] proposed by my null hypothesis. Too small or too large, both count as extreme evidence in favor of the alternate hypothesis. Tests like this are called two sided tests. We have already done this once above. Now, if the alternate hypothesis is instead one of:

Ha : µ < 85 or Ha : µ > 85

then you need extreme evidence on the left/right hand side only. These are called one sided tests.
P-value and Alpha Once we have decided our α, we can conclude for or against H0 by checking whether the sample mean falls beyond the CI limits.

There is another way to look at this. Consider the diagram for a one sided test: essentially, α is nothing but the area under the distribution curve beyond the interval limit [in this case XL]. If the sample estimate falls beyond XL, i.e. in the shaded region, we conclude against H0.

Consider the diagram for another one sided test: the sample estimate falls in the shaded region beyond the interval limit, hence we conclude against H0, or reject H0. You can also see that the area under the curve beyond [−2.085] will definitely be less than α [the area under the curve beyond −1.645]. This area under the curve beyond [−2.085] is called the p-value associated with the sample estimate. From this you can see that if the p-value is less than α, the sample estimate lies in the rejection region, and vice versa.

Your software output for hypothesis tests will be in terms of p-values, and using the guidelines above you can interpret the results.
To interpret any hypothesis test you need the following:

1. H0: you need to know the null hypothesis associated with the test, so that you know what you are concluding in favor of or against.
2. α: you decide this depending upon your business process. The industry standard is 0.05.
3. Once you carry out the test on your sample: if the resulting p-value > α, you conclude in favor of H0; if the p-value < α, you conclude against H0.
We'll now formally discuss the various hypothesis tests, without getting into mathematical details.
One Sample T-test We have in fact already done this; let me however explain the name of the test. If you recall, sample averages follow a normal distribution for a large enough sample size, typically considered to be 30. If the sample size goes below 30, sample averages instead follow a T-distribution. The T-distribution is very similar to the normal, just a little thicker in the tails; in fact, for sample sizes above 30, the T-distribution and Normal distribution take almost identical values. You can use the T-distribution throughout the range of sample sizes and it won't make much difference. This is where the name T-test comes from.

H0 : µ = µ0
Ha : µ ≠ µ0 OR µ > µ0 OR µ < µ0
Paired Sample T-test Finding out whether an average is equal to some value is not the only kind of business problem we are interested in solving. The paired sample t-test is used to find the difference in the average value of a population parameter "before and after", although the term "before and after" might be misleading at times. Let me give you a few examples:

• You want to check whether a medicine for lowering blood sugar levels should be approved or not. You'll measure the blood sugar levels of, say, 100 patients before taking the medicine and after completing the medicine course. You'd check whether average sugar levels in these patients have gone down significantly after completing the course or not.
• You want to check whether performance in Mathematics and English is very different at a certain school. You'd check if average scores in Mathematics and English are significantly different.

What makes it "paired" is that both kinds of observations have the same source. In the first example, the before and after sugar levels belonged to the same patients. In the second example, the scores in Mathematics and English belonged to the same students.

H0 : µ1 = µ2 OR µ1 − µ2 = δ = 0
Ha : δ ≠ 0 OR δ > 0 OR δ < 0
Unpaired Two Sample T-test This is used when you want to check whether two group means are significantly different or not. For example:

• Whether corporate salaries for the same designation differ between males and females. You'd take samples from male workers and female workers separately and check whether their average salaries are different or not.

Note that here the observations are not paired; they do not have the same source. The hypotheses remain the same, just that the underlying statistical method changes slightly. We don't need to worry about that.

H0 : µ1 = µ2 OR µ1 − µ2 = δ = 0
Ha : δ ≠ 0 OR δ > 0 OR δ < 0
ANOVA : Analysis of Variance We'll first discuss what we use ANOVA for, then get into a little bit of the mathematics involved, which you can skip if you want to.

The T-test is limited to 2 groups at a time in your data. Now let's say you want to check whether the average yield of rice fields varies across states in India, so that the central government can plan farmer subsidies across states accordingly. You cannot use a T-test for that. The hypotheses for ANOVA are as follows:

H0 : the group means are all equal
Ha : at least one group mean is different from the rest

If the p-value for the test comes out to be < α, then you might be interested in which group is differing in mean. You can check that by doing a Bonferroni test in conjunction with ANOVA; we'll see an example of it towards the end.

Now for a little mathematics. You can consider ANOVA to be very similar to a linear regression model with all categorical variables. [If you haven't gone through linear regression yet, you can always revisit this to understand it better.]
Consider the total variance in the data, ignoring groups, for variable X:

Total sum of squares = SST = Σᵢ₌₁ⁿ (xᵢ − x̄)²
SST splits into two parts: the within-group sum of squares [SSW], which measures each observation against its own group mean, and the between-group sum of squares [SSB], which measures the group means against the overall mean. If all group means are equal, the within-group variance will be very close to SST and the between-group variance will be close to zero; that is, SSB will be close to zero. The ratio SSB/SSW [scaled by the respective degrees of freedom] follows an F-distribution, so we can test whether this ratio is significantly large. If the p-value for this F statistic comes out to be < α, we'll conclude that at least one of the group means is different.
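A sketch of this decomposition with two made-up groups, showing numerically that SST = SSW + SSB:

set.seed(3)
x=c(rnorm(30,5),rnorm(30,7))            # two hypothetical groups
g=rep(c("A","B"),each=30)

SST=sum((x-mean(x))^2)                                          # total
SSW=sum(tapply(x,g,function(v) sum((v-mean(v))^2)))             # within groups
SSB=sum(tapply(x,g,function(v) length(v)*(mean(v)-mean(x))^2))  # between groups
c(SST,SSW+SSB)                          # the two numbers match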
Chisq Test So far we have discussed only numeric variables. What about categorical variables? First we need to understand what we mean by categorical variables affecting each other. Consider this cross table of students' preference for learning programming, by gender:

\        Male  Female
Yes      30%   29%
No       70%   71%

You can see that the preference for programming is roughly the same for both genders, close to 30:70, with females only slightly different. Whether this slight difference is "significant" can be assessed through the chisq test.
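As a sketch, turning that percentage table into counts [assuming 100 respondents of each gender, a number not stated in the text] and running the test:

# hypothetical counts built from the percentages above
tab=matrix(c(30,29,
             70,71),nrow=2,byrow=TRUE,
           dimnames=list(c("Yes","No"),c("Male","Female")))
chisq.test(tab)   # large p-value: the slight difference is not significant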
Examples with R
We'll be looking at four kinds of hypothesis tests. The key to understanding the result of any hypothesis test is to know what the null hypothesis is and how to conclude by looking at the p-value. In short, you need to do this:

1. Know what the null hypothesis H0 is. Complementary to this is the alternate hypothesis Ha.
2. If the p-value of the test is smaller than alpha [the standard value is 0.05; you can take 0.1, 0.01 etc.], conclude against H0; otherwise, conclude in favor of H0.
# white wine quality data; the file uses ';' as separator
wq=read.csv("winequality-white.csv",sep=";")
One Sample T-test You are checking whether the mean of the variable in question is equal to the specified value [H0: µ = 6.10] or not. Changing α does not change the p-value of the test, as that depends on the data only; however, it does change the confidence interval being displayed.

In the example given below, the null hypothesis is that the mean of fixed.acidity = 6.10.
t.test(wq$fixed.acidity,mu = 6.10)
##
## One Sample t-test
##
## data: wq$fixed.acidity
## t = 62.598, df = 4897, p-value < 2.2e-16
## alternative hypothesis: true mean is not equal to 6.1
## 95 percent confidence interval:
## 6.831149 6.878426
## sample estimates:
## mean of x
## 6.854788
You can see that the p-value is < 2.2 × 10⁻¹⁶, which is very low, so we'll conclude in favor of the alternate hypothesis, which is stated as

alternative hypothesis: true mean is not equal to 6.1

You can also see the confidence interval of the mean according to the alternate hypothesis: [6.831149, 6.878426]. The mean you proposed in the null hypothesis does not fall in this interval; this is another indicator that the null hypothesis is not true.
t.test(wq$fixed.acidity,mu = 6.10,alternative ="less" )
##
## One Sample t-test
##
## data: wq$fixed.acidity
## t = 62.598, df = 4897, p-value = 1
## alternative hypothesis: true mean is less than 6.1
## 95 percent confidence interval:
## -Inf 6.874625
## sample estimates:
## mean of x
## 6.854788
You can see here that the p-value is large, in fact very close to 1, so we'll conclude against the alternate hypothesis that the mean is less than 6.1. Running the same test with alternative = "greater" instead:

t.test(wq$fixed.acidity,mu = 6.10,alternative ="greater" )

##
## One Sample t-test
##
## data: wq$fixed.acidity
## t = 62.598, df = 4897, p-value < 2.2e-16
## alternative hypothesis: true mean is greater than 6.1
## 95 percent confidence interval:
## 6.834951 Inf
## sample estimates:
## mean of x
## 6.854788
Here your decision will be in favor of the alternate hypothesis, that is, the mean is greater than 6.1.
Paired Two Sample T-test : The data for this is in SAS format; we'll use the function read.sas7bdat from the package sas7bdat.

In the example given below, H0: the mean of the differences between scores in write and read = 0.
library(sas7bdat)
library(dplyr)    # for glimpse
d=read.sas7bdat("hsb2.sas7bdat")
glimpse(d)
## Observations: 200
## Variables:
## $ id (dbl) 70, 121, 86, 141, 172, 113, 50, 11, 84, 48, 75, 60, 95...
## $ female (dbl) 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ...
## $ race (dbl) 4, 4, 4, 4, 4, 4, 3, 1, 4, 3, 4, 4, 4, 4, 3, 4, 4, 4, ...
## $ ses (dbl) 1, 2, 3, 3, 2, 2, 2, 2, 2, 2, 2, 2, 3, 3, 1, 1, 3, 2, ...
## $ schtyp (dbl) 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 2, ...
## $ prog (dbl) 1, 3, 1, 3, 2, 2, 1, 2, 1, 2, 3, 2, 2, 2, 2, 1, 2, 1, ...
## $ read (dbl) 57, 68, 44, 63, 47, 44, 50, 34, 63, 57, 60, 57, 73, 54...
## $ write (dbl) 52, 59, 33, 44, 52, 52, 59, 46, 57, 55, 46, 65, 60, 63...
## $ math (dbl) 41, 53, 54, 47, 57, 51, 42, 45, 54, 52, 51, 51, 71, 57...
## $ science (dbl) 47, 63, 58, 53, 53, 63, 53, 39, 58, 50, 53, 63, 61, 55...
## $ socst (dbl) 57, 61, 31, 56, 61, 61, 61, 36, 51, 51, 61, 61, 71, 46...
t.test(d$read,d$write,paired = TRUE)
##
## Paired t-test
##
## data: d$read and d$write
## t = -0.86731, df = 199, p-value = 0.3868
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -1.7841424 0.6941424
## sample estimates:
## mean of the differences
## -0.545
The p-value of the test is high [0.3868], so we'll conclude in FAVOUR of H0 and say that the means of the variables write and read are NOT different.

By default this assumes that H0 is δ = 0; you can change that by specifying mu = some other value.
t.test(d$read,d$write,paired = TRUE,mu=-0.50)
##
## Paired t-test
##
## data: d$read and d$write
## t = -0.071612, df = 199, p-value = 0.943
## alternative hypothesis: true difference in means is not equal to -0.5
## 95 percent confidence interval:
## -1.7841424 0.6941424
## sample estimates:
## mean of the differences
## -0.545
You can see that the p-value for the test is now very close to one, because the null hypothesis you are suggesting is very close to the result from the sample [the mean of the differences being −0.545].

You can also use the conf.level and alternative options here to achieve the same thing. I'm leaving that for you to try on your own.
Unpaired Two Sample T-test : We need to find out whether alcohol percentages in wine vary across quality ratings. First, let's see which quality classes exist:

unique(wq$quality)

## [1] 6 5 7 8 4 3 9

Remember that you can do this test only if the number of classes is two; for more classes you'll have to use ANOVA. Here the values come from the same variable but belong to different classes; they are not "paired".

In this example:

H0: the difference between mean alcohol content for quality=3 and quality=9 is equal to zero
However, before we go ahead and do the unpaired T-test for these two groups, we need to know one more thing: the underlying distribution changes slightly depending upon whether the variances of the two groups are the same or not. We can find that out by first doing a variance equivalence test [also known as an F-test].
var.test(wq$alcohol[wq$quality==3],wq$alcohol[wq$quality==9])
##
## F test to compare two variances
##
## data: wq$alcohol[wq$quality == 3] and wq$alcohol[wq$quality == 9]
## F = 1.459, num df = 19, denom df = 4, p-value = 0.7784
## alternative hypothesis: true ratio of variances is not equal to 1
## 95 percent confidence interval:
## 0.1701394 5.1921582
## sample estimates:
## ratio of variances
## 1.459002
The p-value for the test is very high, so we'll conclude against the stated alternate hypothesis, which says the true ratio of variances is not equal to 1. In other words, we have no evidence that the variances are unequal, so we can treat them as equal and run our unpaired t-test with var.equal = TRUE.
t.test(wq$alcohol[wq$quality==3],wq$alcohol[wq$quality==9],var.equal = TRUE)
##
## Two Sample t-test
##
## data: wq$alcohol[wq$quality == 3] and wq$alcohol[wq$quality == 9]
## t = -3.0837, df = 23, p-value = 0.005246
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -3.0659873 -0.6040127
## sample estimates:
## mean of x mean of y
## 10.345 12.180
Here the p-value for the test is pretty low, so we'll conclude in favor of the stated alternate hypothesis that the true difference in means is not equal to zero; in other words, the average alcohol content for wines of quality rating 3 and 9 is significantly different.

In the above test we checked whether the average alcohol content is statistically different for the classes defined by quality=3 and quality=9. Next you might be interested in whether alcohol averages differ across all the classes defined by the quality variable [not just two].
ANOVA In the case of ANOVA, the test is based on the F-distribution and the test statistic is called the F-statistic. The way to conclude in favour of/against H0 remains the same.

H0: the mean of the variable in question is the same in all classes
Ha: the mean in at least one class is different from the rest

The table here looks a bit different because the test is based on the F-distribution. We'll be looking at the p-value below Pr(>F). We see that the p-value [< 2 × 10⁻¹⁶] is very small, so we conclude that at least one class mean is different. Now, to figure out which class means differ, we need a pairwise Bonferroni test.
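The aov and pairwise calls that produce the tables referred to here are not shown in the text; a likely reconstruction, assuming the wq data frame from the earlier examples:

summary(aov(alcohol~factor(quality),data=wq))   # F-table with the Pr(>F) column
pairwise.t.test(wq$alcohol,wq$quality,p.adjust.method="bonferroni")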
##
## Pairwise comparisons using t tests with pooled SD
##
## data: wq$alcohol and wq$quality
##
## 3 4 5 6 7 8
## 4 1.00000 - - - - -
## 5 0.60054 0.00278 - - - -
## 6 1.00000 3.6e-05 < 2e-16 - - -
## 7 0.00068 < 2e-16 < 2e-16 < 2e-16 - -
## 8 1.1e-05 < 2e-16 < 2e-16 < 2e-16 0.06126 -
## 9 0.01566 0.00086 2.5e-05 0.02080 1.00000 1.00000
##
## P value adjustment method: bonferroni
By looking at this table, the p-values tell us that:

• alcohol content for quality rating 9 is significantly different from ratings (3,4,5,6) [all p-values are low], whereas it is not so different from ratings (7,8)
We have already compared the alcohol content of quality rating 9 with the rest of the quality ratings, so as we move forward, comparisons with rating 9 need not be repeated; the same applies to each subsequent quality rating.

• alcohol content for quality rating 8 is significantly different from ratings (3,4,5,6) [all p-values are low], while its difference from rating 7 falls just short of significance [p = 0.06]
• alcohol content for quality rating 7 is significantly different from ratings (3,4,5,6) [all p-values are low]
• alcohol content for quality rating 6 is significantly different from ratings (4,5) [all p-values are low] and is similar to quality rating 3
• alcohol content for quality rating 5 is significantly different from rating 4 [the p-value is low] and is similar to quality rating 3
• alcohol content for quality ratings 3 and 4 are similar too
Now you might be wondering how it is possible that 3 and 4 are similar, and 3 and 5 are similar, but 4 and 5 are significantly different.

Consider this scenario: you decide that a difference of 50 points in score is significant, but anything less than that is not. Say A scored 40 points, B scored 70 points and C scored 100 points. According to this criterion, A and B don't have significantly different scores, and the same goes for B and C taken together; but when you consider A and C, the difference is 60 points, which we know indicates a significant difference. The same thing happened in the scenario above.
Chisq Test First, a goodness-of-fit test: we check whether the categories of the variable race follow an assumed set of relative frequencies.

chisq.test(table(d$race),p=c(0.1,0.1,0.1,0.7))
##
## Chi-squared test for given probabilities
##
## data: table(d$race)
## X-squared = 5.0286, df = 3, p-value = 0.1697
This tells you whether the different categories of the variable race match your assumed relative frequencies, which here are (10%, 10%, 10%, 70%).

The p-value for the test comes out to be 0.1697, which is greater than the standard α value of 0.05; hence we conclude in favor of H0, i.e. the relative frequency distribution of the categories of race is consistent with (10%, 10%, 10%, 70%). You can play around with passing different test relative frequencies and see how the p-value changes.
chisq.test(table(d$schtyp,d$female))
##
## Pearson's Chi-squared test with Yates' continuity correction
##
## data: table(d$schtyp, d$female)
## X-squared = 0.00054009, df = 1, p-value = 0.9815
The p-value is 0.9815, which is larger than α. We conclude that the relative frequency distribution of schtyp is not affected by gender, and vice versa.

Next we want to do the same test for race vs socio-economic status.
chisq.test(table(d$race,d$ses))
##
## Pearson's Chi-squared test
##
## data: table(d$race, d$ses)
## X-squared = 18.516, df = 6, p-value = 0.005064
At the bottom of the result you see a warning that the Chi-squared approximation may be incorrect. This happens when some cells of the cross table have counts less than 5. Let's check the cross table.
table(d$race,d$ses)
##
## 1 2 3
## 1 9 11 4
## 2 3 5 3
## 3 11 6 3
## 4 24 73 48
In such a scenario we can use Fisher's exact test instead; the underlying hypothesis remains the same.
fisher.test(table(d$race,d$ses))
##
## Fisher's Exact Test for Count Data
##
## data: table(d$race, d$ses)
## p-value = 0.007329
## alternative hypothesis: two.sided
The result tells you that the p-value [0.007329] is very low, so you conclude against H0, which means socio-economic status is associated with race.
Normality Tests Although I personally am not a big fan of normality tests [hypothesis tests for normality], I am including them for completeness' sake. Here are two reasons for my reservations:

1. Normality tests are not very good at capturing what they should; I will demonstrate that later in this section.
2. Slight deviations from normality don't hurt as much as some people come to believe over time.

You can use the function shapiro.test to test the normality of a variable [containing fewer than 5000 observations]. For larger samples, you can do an Anderson-Darling test using the function ad.test found in the package nortest. Quick examples are given below.
# 400 points from a uniform distribution -- clearly not normal
x=runif(400)
shapiro.test(x)
##
## Shapiro-Wilk normality test
##
## data: x
## W = 0.95388, p-value = 7.382e-10
The null hypothesis for all normality tests is that the underlying distribution of the variable in question is Normal. The small p-value in the above example indicates that the distribution is not Normal.

If the sample size is larger than 5000, shapiro.test throws an error:
y=rbeta(6000,2,8)
shapiro.test(y)   # errors out: sample size must be between 3 and 5000
library(nortest)
ad.test(y)
##
## Anderson-Darling normality test
##
## data: y
## A = 78.74, p-value < 2.2e-16
The Anderson-Darling test reveals that, again, the underlying distribution is not normal. You can see that these normality tests give proper results for sizable deviations from normality. You must be wondering why, then, I warned you against them. I will get to that in a minute, but first let me show you another, simpler method to check if data is normal or not.
You can instead plot Q-Q plots: if your data points fall, for the most part, on the straight line shown in the plot, then the data is normal enough for you to relax. That's all.
qqnorm(x);qqline(x);
[Normal Q-Q plot for x: Sample Quantiles vs Theoretical Quantiles]
qqnorm(y);qqline(y);
[Normal Q-Q plot for y: Sample Quantiles vs Theoretical Quantiles]
You can see that in both cases a large portion of the data points does not fall on the straight line. You can also plot a density curve together with a normal curve, as discussed in the data viz module.
library(ggplot2)
df=data.frame(x,y)
ggplot(df,aes(x))+geom_density(color="red")+
  # compare against a normal curve with the same mean and sd as x
  stat_function(fun=dnorm,args=list(mean=mean(x),sd=sd(x)),color="green")+
  ggtitle("Visual Normality Test for x ")
[Density plot: Visual Normality Test for x — red sample density vs green normal curve]
ggplot(df,aes(y))+geom_density(color="red")+
  stat_function(fun=dnorm,args=list(mean=mean(y),sd=sd(y)),color="green")+
  ggtitle("Visual Normality Test for y ")
[Density plot: Visual Normality Test for y — red sample density vs green normal curve]
set.seed(1)
v1=rlnorm(20,0,0.4)   # only 20 points from a log-normal distribution
shapiro.test(v1)
##
## Shapiro-Wilk normality test
##
## data: v1
## W = 0.98049, p-value = 0.9403
Clearly the data comes from a log-normal distribution, not a Normal one, but the test tells you that the data is normal [the p-value is large]. If you had looked at it visually instead, you would have reached a better conclusion.
df=data.frame(v1)
ggplot(df,aes(x=v1))+geom_density(color="red")+
  stat_function(fun=dnorm,args=list(mean=mean(v1),sd=sd(v1)),color="green")+
  ggtitle("Visual Normality Test for v1 ")
[Density plot: Visual Normality Test for v1]
set.seed(1)
v2 = rt(60000,29)   # 60000 points from a t-distribution with 29 degrees of freedom
ad.test(v2)
##
## Anderson-Darling normality test
##
## data: v2
## A = 3.5228, p-value = 8.465e-09
This tells you that the data is not Normal, even though a T-distribution with 29 degrees of freedom is very close to the Normal distribution; with 60,000 points the test picks up even this practically irrelevant deviation.
df=data.frame(v2)
ggplot(df,aes(x=v2))+geom_density(color="red")+
  stat_function(fun=dnorm,args=list(mean=mean(v2),sd=sd(v2)),color="green")+
  ggtitle("Visual Normality Test for v2 ")
[Density plot: Visual Normality Test for v2 — the red density and the green normal curve nearly coincide]
Contact:
Email: [email protected]
Ph: 9910902849
In case of any doubts/questions regarding the contents of the study material, please email or WhatsApp on the number above.