0% found this document useful (0 votes)
31 views12 pages

STATS 1023A Assignment 3

Uploaded by

dylan212005
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
31 views12 pages

STATS 1023A Assignment 3

Uploaded by

dylan212005
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 12

STATS 1023A/2037A

Assignment 3 - due November 11, 2024, at 11:55 pm

Dylan Perez

1) Consider the following values representing the age of group of 30 adults: 75, 90, 60, 95, 85,
84, 76, 74, 92, 62, 83, 80, 90, 68, 72, 79, 36, 65, 98, 70, 88, 99, 60, 82, 65, 79, 76, 80, 52, 75

a) (6 pts) Create a five-number summary for these ages. Show all your work.

The sorted data set is: 36, 52, 60, 60, 62, 65, 65, 68, 70, 72, 74, 75, 75, 76, 76, 79, 79, 80, 80, 82,
83, 84, 85, 88, 90, 90, 92, 95, 98, 99

Minimum = 36

Maximum = 99
76+79 155
The 15th value is 76, and the 16th value is 79. Therefore, 𝑚𝑒𝑑𝑖𝑎𝑛 = = = 77.5
2 2

The lower half consists of the first 15 values: 36, 52, 60, 60, 62, 65, 65, 68, 70, 72, 74, 75, 75,
76, 76.

First Quartile (Q1) = Median of lower half = 68

The upper half consists of the last 15 values: 79, 79, 80, 80, 82, 83, 84, 85, 88, 90, 90, 92, 95, 98,
99.

Third Quartile (Q3) = Median of upper half = 85

Final Five-Number Summary:

1) Minimum = 36
2) Q1 = 68
3) Median (Q2) = 77.5
4) Q3 = 85
5) Maximum = 99
b) (4 pts) Create a boxplot using the five-number summary from part (a). Show all your work.
Boxplot must be drawn by hand. You can use your tablet for drawing, but not a statistical
software like Excel or Python.
2) Suppose the mean height for adult males in the U.S. is about 70 inches and the
standard\ deviation is about 5 inches. Assume men’s heights follow a normal curve. Show all
your work.

a) (3 pts) What percentage of adult males are under 65 inches tall? Use the Empirical rule and
draw a picture that illustrates your rationale.
b) (3 pts) What percentage of adult males are between 65 and 85 inches tall? Use the Empirical
rule and draw a picture that illustrates your rationale.
c) (4 pts) How high must an adult male's height be to be placed in the top 7% of all adult males
in the U.S.? Use Z table (available on OWL under Tables). Show all your work.

Mean height (μ) = 70 inches

Standard deviation (σ) = 5 inches

100% − 7% = 93% , the height that has 93% of the distribution below it,

0.93 corresponds to a Z-score of 1.48

With formula 𝑥 = 𝜇 + 𝑍 ⋅ 𝜎 , we can plug in the values

𝑥 = 70 + 1.48 ⋅ 5 = 77.4 inches


Therefore, the adult male must be approximately 77.4 inches tall to be in the top 7% for males in
the U.S.
3) The IQ scores are approximately Normal, with a mean of 100 and a standard deviation of 15.
For each part below, use Z Table (available on OWL under Tables) and sketch a Normal curve
and shade the area representing the region. Show all your work. Round the z scores to 2 decimal
places.

a) (3 pts) What is the percentage of adults with scores between 65 and 120?

b) (3 pts) What is the percentage of adults with scores above 95?


4) (8 pts) In its February 24-26, 1995, edition (p. 7), USA Weekend gave statistics on the
changing status of which parent live with. As noted in the article, the number do not total 100%
because they are drawn from two sources: the U.S Census Bureau and America’s Children:
Resource from family, Government and the Economy by Donald Hernandez (New York: Russell
Sage Foundation, 1995). Using the data shown in the table below, draw a bar graph presenting
the information. Be sure to include all the components of a good statistical picture.
5) Suppose an algebra professor found that the correlation between study time (in hours) and
exam score (out of 100) is +0.80, and the regression line was found to be y = 20 + 4x. He arrived
at this equation through years of collecting data on his students, most of whom reported studying
anywhere from 0 to 20 hours for his exams.

a) (3 pts) Which is the response, and which is the explanatory variable? Explain.

• Response Variable: Exam score (y)


• Depends on the study time.
• Explanatory Variable: Study time (x)
• Study time is used to predict/explain the exam scores.
b) (2 pts) What meaning (if any) does the intercept have in this situation? Use words that a non-
statistics student would be able to understand.

• The intercept means that, according to the model, if a student doesn't study at all, they are
predicted to score 20 points on the exam.

c) (2 pts) What meaning (if any) does the slope have in this situation? Use words that a
nonstatistics student would be able to understand.

• The slope means that, according to the model, for every additional hour a student studies
for, their exam score is expected to increase by 4 points.

d) (2 pts) What is the expected score for a student who studies for 15 hours? Show your work.

• 𝑦 = 20 + 4𝑥 = 20 + 4(15) = 80
• Therefore, a student who studies for 15 hours is expected to score 80 points on the exam.
6) Are each of the following pairs of variables likely to have a positive correlation or a negative
correlation? Explain your answer.

a) (2 pts) Daily temperatures at noon in London and in Toronto measured for a year.

• Positive Correlation
• Both experience seasonal changes, meaning temperatures would likely rise and fall
according to the season.

b) (2 pts) Weights of automobiles and their gas mileage in average miles per gallon.

• Negative Correlation
• Heavier cars will generally require more fuel, resulting with fewer miles per gallon.
c) (2 pts) Hours of television watched and GPA for college students.

• Negative Correlation
• The more time you spend on Tv, there will be less time for studying. Leading to a
potential decrease in GPA.

d) (2 pts) Sizes in square feet of houses and their prices.

• Positive Correlation
• With larger homes prices tend to be higher, meaning these two variables are positively
related.
7) The table below gives the self-reported heights of 10 college women (“Daughter’s height”),
along with the heights of their mothers (“Mother’s height’’).

a) (8 pts) Draw (by hand, do not use a software) a scatter plot for these data, placing Mother’s
height on the horizontal axis and the Daughter’s height on the vertical axis. Comment on whether
or not it looks like there is a linear relationship and, if so, whether it is negative or positive.
In the scatter plot, there is an average positive linear relationship between mothers’ and
daughters’ heights: when mothers are taller, so are their daughters. This suggests that the taller
the mother, the taller the daughter, although this relationship doesn’t go entirely as planned.

b) (8 pts) Find the correlation between the mother’s and daughter’s height. Show all your work.
Round your answer to two decimal places.

c) (2 pts) Do the value and the sign (positive or negative) of the correlation in part b) make sense
based on the scatter plot from part a)? Explain.

• Yes, as both the correlation value and the average linear relationship are both positive.

8) Refer to the previous exercise about the relationship between mothers’ and daughters’ heights.
Suppose the mean height of the mothers is about 63.9 inches, the mean height of the daughters is
about 64.5 inches, and that SSX = 62.9, SSY = 68.5, SXY = 48.5. Round your answers to two
decimal places.

a) (6 pts) Based on the information provided calculate the slope and intercept of the regression
line. Interpret the slope in the context of the problem. Show your work.

b) (2 pts) Use your regression line to predict the height of the daughter of a mother who is 65
inches tall. Show your work.

You might also like