Exercise - 1 - On Lectures I and II - 2025
Exercise - 1 - On Lectures I and II - 2025
5. A sociologist wishes to estimate the proportion of all adults in a certain region who have
never been married. In a random sample of 1,320 adults, 145 have never married, hence
145∕1320 ≈ 0.11 or about 11% have never married.
(a) What is the population of interest?
(b) What is the parameter of interest?
(c) What is the statistic involved?
15. Number of observations are 30 and value of arithmetic mean is 15 then sum of all values
is
(a) 15
(b) 450
(c) 200
(d) 45
18. Consider the set of all students enrolled in a STAT 111. Suppose you are interested in
learning about the current grade point averages (GPAs) of this group.
(a) Define the population and variable of interest.
(b) Is the variable qualitative or quantitative?
(c) Suppose you determine the GPA of every member of the class. Would this represent
a census or a sample?
(d) Suppose you determine the GPA of 10 members of the class. Would this represent a
census or a sample?
(e) If you determine the GPA of every member of the class and then calculate the
average, how much reliability does this have as an "estimate" of the class average
GPA?
(f) What must be true in order for the sample of 10 students you select from your class to
be considered as a random sample?
20. The ages (in years) of 30 employees of a certain company are given below
55 25 34 66 28
32 28 26 56 48
47 61 25 24 31
24 28 44 37 51
47 35 32 36 50
37 36 27 27 44
(a) Represent the above information on a stem and leaf diagram.
24. How many classes are to be used for grouping a data set with 50 observations?
25. The International Rhino Federation estimates that there are 17,470 rhinoceroses living in the
wild of Africa and Asia. A breakdown of the number of rhinos of each species is reported in
the accompanying table:
Rhino Species Population estimate
African Black 3,610
African White 11,100
Asian Sumatran 300
Asian Javan 60
Asian Indian 2,400
Total 17,470
(a) Construct a table of relative and cumulative frequencies for the data.
(b) Construct a bar graph for the relative frequencies.
26. Fifteen students (named A-O) took classes and statistics and biology. The marks earned by
each of the students are shown below.
Student A B C D E F G H I J K L M N O
Statistics 74 53 67 63 77 57 60 47 76 54 80 92 53 52 80
Biology 60 68 64 66 71 66 55 71 82 73 84 59 63 55 79
27. Chance (Spring 2000) reported on a study to estimate the number of pennies required to fill a
coin collectors’ album. The data used in the study were obtained by noting the mint date on
each in a sample of 2,000 pennies. The distribution of mint dates is summarized in the
following table:
28. The following data represents the annual income (in thousand cedis) for a sample of 12
households in Ghana:
23 , 17 , 32 , 60 , 22 , 52 , 29 , 38 , 42 , 92 , 27 , 46.
(a) Construct a box and whisker plot for the data.
(b) Find the lower and upper inner fences.
(c) Determine the smallest and largest value within the two inner fences.
Stem Leaf
5 1
4 4 5 7
3 0 0 0 3 6
2 1 1 3 4 5 9 9
1 2 2 4 8
32. The table below displays the population (in millions) and the number of violent crimes (in
millions) in US from 1982 to 1989.
(a) Draw a line graph representing the trend in violent crime over time.
(b) Draw a scatter plot of population versus violent crime, comment on the scatter plot.
34. The following are figures on an oil well’s daily production in barrels.
214 , 204 , 226 , 198 , 243 , 225 , 207 , 203 , 209 , 200 , 217 , 202 , 208 , 212
205 , 220 , 200 , 208 , 191 , 202 , 201 , 208 , 200 , 198
(a) Construct a stem-and-leaf plot with stems 19, 20 , … , 24
(b) Use the stem- and leaf plot to find the quartiles of the data.
(c) Find the modal production.
(d) Calculate the 95th and 64th percentiles.
35. The table below shows how support by voters for the main political parties in England in 2015
varied with a range of demographic factors. Voters are classified by sex, age group, socio-
economic group, location, and ethnicity. Each figure given is an index showing support by a
group for a party. A figure of 100 indicates that support is at the national average level; a figure
of 105 shows that support is 5% higher than the national average level; a figure of 90 indicates
that support is 10% less than the national average level. (The socio-economic group AB is
upper middle class and middle class, C1 is lower middle class, C2 is skilled working class, DE
is working class and those at the lowest level of subsistence.)
(i) Summarise how sex is related to political party preference in the data. Draw a
suitable graph, or graphs, to illustrate your answer.
(ii) The political parties are shown in order from what are generally considered to be the
most left wing (Green) to most right wing (UKIP). Identify the main patterns in
political preferences as they vary by age. Draw a suitable graph, or graphs, to illustrate
your answer.
(iii) Of the sixteen categories in the table identify that with profile most like the national
average, and that with profile least like the national average. Discuss briefly why
these groups might be expected to have such profiles.
(iv) Show that about 60% of Green supporters are female. Find the corresponding
percentage for UKIP supporters. State any assumptions that you have made in
answering those questions.
(v) A newspaper article noted the figures of 92 for the level of support among people in
the 18–24 age range for the Liberal Democrats and for the Conservatives. The article
stated that support in this age-group was evenly split between these two parties. Explain
why that is the wrong conclusion to draw from the data, and state the correct
conclusion.