0% found this document useful (0 votes)
192 views20 pages

MAS202Group1 Group-Assignment

MAS202Group1_Group-Assignment

Uploaded by

duongbths180618
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
192 views20 pages

MAS202Group1 Group-Assignment

MAS202Group1_Group-Assignment

Uploaded by

duongbths180618
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 20

GROUP ASSIGNMENT

Subject: Applied Statistics For Business (MAS202)

Lecturer : Mr. Nguyen Viet Anh


Class : MKT1825
Group :1
Members : Nguyen Thi Ngoc Anh – HS180711
: Nguyen Minh Thuy – HS180808
: Nguyen Dieu Linh – HS180381
: Nguyen Hoang An – HS186500
: Trieu Viet Hoang – HS180380

: Đo Xuan Toan – HS180234

1
Table of Contents
Part 1: Introduction ..................................................................................................................3
Part 2: Collecting sample data ...................................................................................................3
Part 3: Descriptive statistics.......................................................................................................4
1. Classified by age ..................................................................................................................................... 4

2. Numerical descriptive measures for high school GPA ............................................................................ 4

3. Numerical descriptive measures for first year GPA ................................................................................. 6

Part 4: Inferential statisticsProblem 1: Test a hypothesis that the average first year GPA is less than 3
at 0.05 level of significance. ..........................................................................................................7
Problem 2: Construct a 95% confidence interval for the average first-year GPA ........................................ 9

Problem 3: Test the hypothesis that the proportion of good students is 20% at a 0.05 level of significance.
.................................................................................................................................................................... 10

Problem 4: Construct a 95% confidence interval for the proportion of good students .............................. 12

Problem 5: Test the hypothesis that there is a difference in the average first year GPA between female
and male students at 0.05 level of significance .......................................................................................... 13

Problem 6: Test the hypothesis that there is a difference in the proportion of good students between
female and male students at 0.05 level of significance .............................................................................. 15

Problem 7: Use simple linear regression to predict first year GPA based on high school GPA ................. 17

Part 5: Conclusion .................................................................................................................. 19


1. Summarize the results ........................................................................................................................... 19

2. Restate the meaning of the project ........................................................................................................ 19

3. State the skills learned through project work ......................................................................................... 19

2
Part 1: Introduction

The transition from school to university presents a formidable challenge for many students. This
shift in academic rigor can be attributed to several key factors. Chief among these is that university
courses are characterized by a greater depth of knowledge and a faster pace of learning compared
to school. This demands a higher level of critical thinking, independent research, and analytical
skills, all of which freshmen may severely lack. Another contributing factor is the newfound
independence that comes with university life. Unlike school, where there is often a structured
timetable and close monitoring by teachers and parents, university students are expected to manage
their own time, which can be a daunting task for those unaccustomed to this level of autonomy.

GPA stands for Grade Point Average. It is a numerical value that represents a student's average
academic performance.

GPA plays a significant role in assessing a student's academic performance and can have various
implications depending on the educational level. In high school, GPA is often used to determine
eligibility for graduation and to evaluate a student's readiness for college admissions.

For college admissions, a high school GPA is an essential factor considered by admissions
committees. It provides insight into a student's academic abilities and serves as a measure of their
consistency and dedication to their studies. A strong high school GPA can enhance the chances of
gaining admission to competitive colleges and universities.

In college, GPA continues to be a crucial indicator of academic achievement. It helps determine


eligibility for scholarships, honors programs, and internships. Additionally, a college GPA can
influence future opportunities, such as admission to graduate programs or career prospects.

Understanding the significance of GPA in both high school and college is important for students
aiming to maximize their academic success and future prospects.

The purpose of this study was to assess the influence of high school GPA, and gender on GPA of
freshman students.

Part 2: Collecting sample data

• Data source: https://www.openintro.org/data/index.php?data=satgpa


• Our data is secondary data.

3
• The variables used are sex: Gender of the student. hs_gpa: High school grade point
average. fy_gpa: First year (college) grade point average.

• Gender is a categorical variable while both high school GPA and first year college GPA
are continuous variables. The dependent variable is first year college GPA. The
independent variable studied is high school GPA. We choose these two variables because
we are curious about the relationship between them.

Part 3: Descriptive statistics

1. Classified by age

Gender

48%
52%

Male Female

The data sample has 515 male students and 484 female students.
2. Numerical descriptive measures for high school GPA
hs_gpa

Mean 3.20

Standard Error 0.017095852


Median 3.2
Mode 4
Standard
Deviation 0.54

4
Sample Variance 0.291975902
Kurtosis -0.908886554
Skewness -0.180353821
Range 2.2
Minimum 1.8
Maximum 4
Sum 3193.6
Count 999

The minimum high school GPA is 1.8, the maximum is 4. The range is 2.2.
The average high school GPA is 3.2 with a standard deviation of 0.54.

5
The column chart shows that the number of students with high school GPAs between 3.6-
3.8 and 2.6-3 is the largest.
Looking at the boxplot chart, we see that the data has no outlier.
3. Numerical descriptive measures for first year GPA
fy_gpa

Mean 2.47
Standard Error 0.023411424
Median 2.47
Mode 2.24
Standard
Deviation 0.74
Sample Variance 0.547546689
Kurtosis -0.183197715
Skewness -0.216848085
Range 4
Minimum 0
Maximum 4
Sum 2466.82

6
Count 999

Minimum first-year GPA is 0, maximum is 4. The range is 4.


The average freshman GPA is 2.47 with a standard deviation of 0.74.
The column chart has a fairly symmetrical shape, most students have first-year GPAs from
2 to 3.2.
Looking at the box plot chart, we see that the data has 2 outlier values: 0 and 0.36.
Part 4: Inferential statistics Problem 1: Test a hypothesis that the average first year GPA is less
than 3 at 0.05 level of significance.

• Parameter to be estimated: The population average first year GPA

7
• H0: The population average first year GPA is at least 3.
• H1: The population average first year GPA is less than 3.
• Sample data:

Sample size 999

Sample mean 2.47

Sample standard deviation 0.74


Level of significance 0.05

• Implementation:

𝐻0: 𝜇 ≥ 3

𝐻1: 𝜇 < 3

Since population standard deviation is unknown, we use a t-test.

Critical −𝑡𝛼,𝑛−1 = −𝑡0.05,998 = value:

-1.65

Thus, reject H0 if test statistic < -1.65.

Test 𝑥̅ − 𝜇 2.47 − 3 statistic: 𝑡𝑠𝑡𝑎𝑡

8
-22.67

Since -22.67 < -1.65, then we reject H0.

• Conclusion: There is sufficient evidence that the average first-year GPA is less than 3 at a
0.05 level of significance.

Problem 2: Construct a 95% confidence interval for the average first-year GPA.

• Parameter to be estimated: The population's average first-year GPA


• Sample data:

Sample size 999

Sample mean 2.47

Sample standard deviation 0.74

• Implementation:

Confidence level 95%


Alpha 0.05

Since population standard deviation is unknown, we use a t distribution.

Critical 𝑡𝛼/2,𝑛−1 = 𝑡0.025,998 = value:

1.96

9
The confidence interval is

𝑆 𝑆
𝑥̅≤ 𝜇 ≤ 𝑥̅

2.

Lower limit is 2.42


Upper limit is 2.52

• Conclusion: We are 95% confident that the average first-year GPA is between 2.42 and
2.52.

Problem 3: Test the hypothesis that the proportion of good students is 20% at a 0.05 level of
significance.

• Parameter to be estimated: The population proportion of students who have first-year GPA
at least 3.2

• H0: The population proportion of students who have first year GPA at least 3.2 is 20%.
• H1: The population proportion of students who have first year GPA at least 3.2 is different
from 20%.
• Sample data:

Sample size 999

Number of students who have first year GPA at least 3.2 176

10
Proportion of students who have first year GPA at least 3.2 0.18
Level of
significance 0.05

• Implementation:

𝐻0: 𝜋 = 20%

𝐻1: 𝜋 ≠ 20%

Since test for proportion, we use a Z-test.


Critical value:

1.96
±𝑍𝛼/2 = ±𝑍0.025 = ±
Thus, reject H0 if test statistic < -1.96 and test statistic > 1.96.

Test statistic:

𝑝−𝜋 0.18 − 0.2


𝑍𝑠𝑡𝑎𝑡


𝑛 999 -1.88

11
Since -1.96 < -1.88 < 1.96, then we do not reject
H0.

• Conclusion: There is sufficient evidence that the proportion of good students is 20% at 0.05

level of significance.

Problem 4: Construct a 95% confidence interval for the proportion of good students.

• Parameter to be estimated: The population proportion of students who have first year GPA
at least 3.2

• Sample data:

Sample size 999

Number of students who have first year GPA at least 3.2 176
Proportion of students who have first year GPA at least 3.2 0.18

Confidence level 95%


Alpha 5%

Since confidence interval for proportion, we use a Z distribution.


Critical value:

𝑍𝛼/2 = 𝑍0.025 =
1.96

The confidence interval is

𝑝 ∗√
≤ ≤ ∗ √𝑝 ∗ (1 − 𝑝)
𝑛
12
0.18 ∗ (1 − 0.18) 0.18 ∗ (1 − 0.18)
0.18 − 1.96 ∗ √ √
≤ 𝜋 ≤ 0.18 + 1.96 ∗
999 999

Lower limit is 0.15


Upper limit is 0.20

• Implementation

• Conclusion: We are 95% confident that the proportion of good students is between 15%
and 20%.

Problem 5: Test the hypothesis that there is a difference in the average first year GPA between
female and male students at 0.05 level of significance.

• Parameter to be estimated: The difference in the average first year GPA between female
and male students.

• H0: There is no difference in the average first year GPA between female and male students.
• H1: There is significantly difference in the average first year GPA between female and
male students.
• Sample data:
Female Male
Mean 2.544587 2.398524
Variance 0.576608 0.510947
Observations 484 515

• Excel output:
Female Male
Mean 2.544587 2.398524

13
Variance 0.576608 0.510947
Observations 484 515
Pooled Variance 0.542757
Hypothesized Mean
Difference 0
df 997
t Stat 3.131697
P(T<=t) one-tail 0.000894
t Critical one-tail 1.646383
P(T<=t) two-tail 0.001789
t Critical two-tail 1.962346

• Implementation:

𝐻0: 𝜇1 − 𝜇2 = 0

𝐻1: 𝜇1 − 𝜇2 ≠ 0

Since population standard deviations are unknown, we use a t-test.


Critical value: ±𝑡𝛼2,𝑛1 + 𝑛2−2 = ± 𝑡0.025,998 = ±

1.96

Thus, reject H0 if test statistic < -1.96 or test statistic > 1.96.

Test statistic: 3.13

Since 3.13 > 1.96, then we reject H0.

14
• Conclusion: There is sufficient evidence that there is a difference in the average first year
GPA between female students and male students at 0.05 level of significance.

Problem 6: Test the hypothesis that there is a difference in the proportion of good students between
female and male students at 0.05 level of significance.

• Parameter to be estimated: The difference in the proportion of good students between


female and male students.

• H0: There is no difference in the proportion of good students between female and male
students.

• H1: There is significantly difference in the proportion of good students between female and
male students.
• Sample data:
Column1 Female Male Total
>= 3.2 102 74 176
< 3.2 382 441 823
Total 484 515 999

• Implementation:

𝐻0: 𝜋1 − 𝜋2 = 0

𝐻1: 𝜋1 − 𝜋2 ≠ 0

Since test for difference in proportions, we use a Z-test.

±𝑍𝛼 = ±𝑍0.025 = ±
2
Critical value:

15
1.96

Thus, reject H0 if test statistic < -1.96 or test statistic > 1.96.

Test statistic:

𝑋1 + 𝑋2 102 + 74 0.18
𝑝= = =
𝑛1 + 𝑛2 484 + 515

𝑋1 102 0.21
𝑝1 = = =
𝑛1 482

0.14
𝑋2 74
𝑝2 = = =
𝑛2 515

𝑍𝑠𝑡𝑎𝑡

16
Since 2.78 > 1.96, then we reject H0.

• Conclusion: There is sufficient evidence that there is a difference in the proportion of good

students between female and male students at 0.05 level of significance.

Problem 7: Use simple linear regression to predict first year GPA based on high school GPA.

• Identify two random variables X and Y

Dependent variable (Y) is first year GPA.


Independent variable (X) is high school GPA.

• The equation of the estimated regression line Scatter plot:

The regression equation is

𝑌̂ = 0.06 + 0.75 ∗ 𝑋 b0

= 0.06

17
Meaning: Because a student cannot have a high school GPA of 0, b0 has no practical
application.

b1 = 0.75
Meaning: The mean value of first year GPA increases by 0.75, on avearge, for each
additional 1 point of high school GPA.

• Use regression equation to predict a future value for Y

We can predict the first year GPA for a student with high school GPA of 3.5 by using the
formula:

𝑌̂ = 0.06 + 0.75 ∗ 3.5 = 2.70


• Measures of variation

R^2 = 30.27%
Meaning: 30.27% of the total variation in the first-year GPA is explained by variation in
the high school GPA.

• Test a hypothesis that a linear relationship exists between the high school GPA and firstyear
GPA at a 0.05 significance level.

𝐻0: 𝛽1 = 0 (no linear relationship)

𝐻1: 𝛽1 ≠ 0 (linear relationship does exist)

alpha =
0.05
Thus, reject H0 if p-value < 0.05.
p-value = 0.00

Since 0 < 0.05, then we reject H-

18
Conclusion: There is sufficient evidence that high school GPA affects first year GPA.

Part 5: Conclusion

1. Summarize the results

The average first-year GPA is between 2.42 and 2.52.

The proportion of good students is between 15% and 20%.

There is a difference in the average first year GPA between female students and male students.

There is a difference in the proportion of good students between female and male students.

The high school GPA affects the first year GPA.


2. Restate the meaning of the project

We should focus on studying from high school. That will create good habits for students, helping
them achieve high scores in higher levels of education.

3. State the skills learned through project work.

State the skills learned through project work.

Through doing the Mas project, we learn the following skills:

Research and data analysis skills:

You will learn how to collect data, process raw data, and turn it into useful information.

Statistical analysis and graphing help you better understand data and find trends and relationships.

Skills in using data analysis tools:

Excel: Calculate, draw charts, and perform statistical calculations.

Tableau or other tools to create charts and dashboards.

Statistical estimation and testing skills:

Estimating confidence intervals: Measuring uncertainty in estimates.

19
Hypothesis testing: Determine the difference between groups or parameters.

Report writing and presentation skills:

Know how to present analysis results clearly and logically.

Write a report with introduction, analysis, results, and conclusion sections.

Teamwork and communication skills:

During a project, you may have to work with other teammates to collect data or discuss results.

Communicate results effectively to colleagues or management.

20

You might also like