0% found this document useful (0 votes)
15 views11 pages

Research Statistics Lesson 5

Uploaded by

Eena Flores
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
15 views11 pages

Research Statistics Lesson 5

Uploaded by

Eena Flores
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 11

Lab 4: Data Analysis in Excel

Analysis ToolPak
1. Load ToolPak add-in
1. Click on the green “File” tab  select “Option” An excel option dialog
appears, select “Add-ins”.

2. Click Go button,The Add-Ins dialog appears, Check “Analysis ToolPak”


and Ok.
The Data Analysis tool bar now appears under the Data tab.

2. Histogram
A histogram is a graphical representation of the distribution of numerical data.
It is an estimate of the probability distribution of a continuous variable
(quantitative variable).

1. First, enter the bin numbers (upper levels), for example TTU’s scale
(A2:A13)
2. On the Data tab, click Data Analysis.

Select Histogram and click OK.

3. . Select the range (student grade column)


4. Click in the Bin Range box and select the range (column TTU scale).
5. Click the Output Range option button, click in the Output Range box and
select any cell.
6. Check Chart Output.

3. Descriptive Statistics
1. On the Data tab, click Data Analysis.
2. Select Descriptive Statistics and click OK.
3. Select the range (Student grade) as the Input Range.
4. Select any cell as the Output Range.
5. Make sure Summary statistics is checked.
Result

4. Analysis of variance (ANOVA)

a. A single factor
A single factor or one-way ANOVA is used to test the null hypothesis that
the means of several groups are identical.

• Suppose that the benefit of a company is separated by different


regions as follows.
• Now we consider that there is a difference of benefit among
regions (i.e., single factor)?
• Hypothesis o H0 = MG1≈ MG2 ≈ MG3≈ MG4≈ MG5≈ MG6≈ MG7≈ MG8≈
MG9 o H1 : there is a difference among regions
• Check these hypothesis using Excel
1. Select Data tab  click Data Analysis  select Anova: single factor

2. Select Input range, choose group by Rows (enter alpha value, not
important), select output range (any cell)

3. Result
v v

4. Conclusion: F<Fcritical , we accept the hypothesis H0. If H0 is rejected,


means that at least one of the means is different. However, the ANOVA
does not tell you where the difference lies. You need a T-Test (later) to
test each pair of means.

b. Two-way ANOVA without replication


Similar as One-Way Anova, but we consider the influence of two factors
on a dependent variable. For example: there are 4 experts to predict the
development rate of 5 companies as follows.
Company Expert 1 Expert 2 Expert 2 Expert 4
C1 8 12 8.5 13
C2 14 10 9 11
C3 11 9 12 10
C4 9 13 10 13
C5 12 10 10 10
Question: is there a difference of mean of development rate among 5
companies and experts?

Using Excel to check this hypothesis as follows.

1. Select Data tab  click Data Analysis  select Anova: two factor without
replication  select input range and output range. (change alpha if
needed)
2. Result

3. Conclusion:
 F_Rows < F_Rows Crit and F_columns < F_columns Crit , we accept the
null hypothesis.
c. Two-way ANOVA with replication
Similar as two-way ANOVA without replication, however a sample has
some rows of data as following table.
Fertilizer Wheat Corn Soy Rice
Blend X 123 128 166 151
156 150 178 125

112 174 187 117

100 116 153 155

168 109 195 158

Blend Y 135 175 140 167


130 132 145 183

176 120 159 142

120 187 131 167

155 184 126 168

Blend Z 156 186 185 175


180 138 206 173

147 178 188 154

146 176 165 191

193 190 188 169

1. Select Data tab  click Data Analysis  select Anova: two factor with
replication  select input range and output range. (change alpha if
needed)  Indicate rows per sample, here is 5 (i.e., there are 5 data rows
per fertilizer).
2. Result
Anova: Two-Factor With Replication

SUMMARY Wheat Corn Soy Rice Total


Blend X

Count 5 5 5 5 20

Sum 659 677 879 706 2921

Average 131.8 135.4 175.8 141.2 146.05

Variance 844.2 707.8 278.7 354.2 782.3658

Blend Y

Count 5 5 5 5 20

Sum 716 798 701 827 3042

Average 143.2 159.6 140.2 165.4 152.1

Variance 498.7 978.3 165.7 217.3 511.0421

Blend Z

Count 5 5 5 5 20
Sum 822 868 932 862 3484

Average 164.4 173.6 186.4 172.4 174.2

Variance 443.3 428.8 212.3 175.8 330.6947

Total
Count 15 15 15 15
Sum 2197 2343 2512 2395
Average 146.4667 156.2 167.4667 159.6667
Variance 705.8381 871.0286 605.981 404.9524

ANOVA
Source of SS df MS F P-value F crit
Variation
Sample 8782.9 2 4391.45 9.933347 0.000245 3.190727
Columns 3411.65 3 1137.217 2.572355 0.064944 2.798061
Interaction 6225.9 6 1037.65 2.347138 0.045555 2.294601
Within 21220.4 48 442.0917

Total 39640.85 59

3. Conclusion:
• F_sample > F_sampleCrit  there is a difference according to
fertilizer
• F_columns < F_columnsCrit  accept the null hypothesis (there is
no difference among type of crop)
• F_interaction > F_interactionCrit  reject the null hypothesis (there
is a difference)

You might also like