Prob & Stat (1) - 1
1. Identify the type of data (nominal, ordinal, interval, or ratio) represented by each of the
following. Confirm your answers by giving your own examples.
a) Blood group
b) Temperature (in Celsius)
c) Job satisfaction index (from 1 to 5)
d) Ethnic group
e) Number of heart attacks
f) Calendar year
g) Serum uric acid (mg/100ml)
h) Number of accidents in a 3-year period
i) Number of cases of each reportable disease reported by a health worker
j) The average weight gain of six 1-year-old dogs (on a special diet supplement)
was 950 grams last month.
k) The height of basketball players is considered a continuous variable.
l) Classification of automobiles as subcompact, compact, standard, and luxury
2. In each of the following statements describe whether inferential or descriptive statistics is
used.
a) In a survey of 100 civil service workers in Gondar town, 45 are satisfied with their jobs.
b) The number of car accidents each week in Addis Ababa.
c) A diet high in fruits and vegetables will lower blood pressure.
d) The number of gallons of milk sold each day at a grocery store.
3. In the last four semesters that an instructor taught Introduction to Statistics, 17, 19, 14, and 20
students passed the exam. Which of the following conclusions can be obtained from purely
descriptive measures and which can be obtained by inferential methods?
a) The next time the instructor teaches Introduction to Statistics, we can expect
approximately 17 students to pass the exam.
b) The last four semesters the instructor taught Introduction to Statistics, an average of
17 students passed the exam.
4. Identify each study as being either observational or experimental.
a) Subjects were randomly assigned to two groups, and one group was given an herb and the
other group a placebo. After 6 months, the numbers of respiratory tract infections each group
had were compared.
b) A researcher stood at a busy intersection to see if the color of the automobile that a person
drives is related to running red lights.
c) A researcher finds that people who are more hostile have higher total cholesterol levels than
those who are less hostile.
d) Subjects are randomly assigned to four groups. Each group is placed on one of four special
diets—a low-fat diet, a high-fish diet, a combination of low-fat diet and high-fish diet, and a
regular diet. After 6 months, the blood pressures of the groups are compared to see if diet has
any effect on blood pressure.
5. An insurance company has insured 30,000 cars over the last six years. The company would
like to know the number of cars involved in one or more accidents over this period. The manager
selected 500 cars from the files and made a record of cars that were involved in one or more car
accidents.
a) What is the population?
b) What is the sample?
6. Identify each of the following as a parameter or a statistic.
a) The mean age of 100 students in FBE Campus
b) The mean age of all students in FBE Campus
c) The mean income of 25 households in Bahir Dar
d) The mean income of all households in Bahir Dar
7. The table below shows the profit of a company (in millions of birr) in each of the years
1990-1994.
Year 1990 1991 1992 1993 1994
Which one of the following statement(s) is (are) descriptive and which statement(s) is (are)
inferential?
d) Based on the above table, the profit in 1995 is smaller than the profit in 1994.
8. A researcher randomly selects 100 students from all summer students in Peda Campus. She
asks each student his/her age and calculates the mean age of the 100 students to be 21.3 years.
a) What is the population?
9. Twenty-five patients were given a blood test to determine their blood type at Gondar Hospital.
The data set is as follows. Construct a frequency distribution and a pie chart (using degrees) for
the data.
A B B AB O
O O B AB B
B B O AB O
A O O O AB
AB A O B A
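One possible way to tabulate the blood types in Problem 9 is sketched below. It is only an illustration: the variable names are made up for this example, and the angle of each slice is taken as 360 times the relative frequency of the class.

```python
from collections import Counter

# Blood types of the 25 patients, read row by row from the problem.
blood_types = (
    "A B B AB O "
    "O O B AB B "
    "B B O AB O "
    "A O O O AB "
    "AB A O B A"
).split()

counts = Counter(blood_types)
n = len(blood_types)

# Each class gets a share of the 360-degree circle proportional to its frequency.
table = {group: (freq, 360 * freq / n) for group, freq in counts.items()}

for group, (freq, degrees) in table.items():
    print(f"{group:>2}  frequency={freq:2d}  angle={degrees:.1f} degrees")
```

The angles should add up to 360 degrees, which is a quick check on the tally.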
10. Seniors of a high school were interviewed on their plan after completing high school. The
following data give plans of 548 seniors of a high school. Construct the appropriate diagram(s).
PLAN NUMBER OF SENIORS
May attend college (Female, Male) 66, 80
11. Construct a grouped frequency distribution of the following data on the amount of time (in
hours) that 80 college students devoted to leisure activities during a typical school week:
23 24 18 14 20 24 24 26 23 21 16 15 19 20 22 14 13 20 19 27 29 22
38 28 34 46 23 19 21 31 16 28 19 18 12 27 15 21 25 16 30 17 22 29
29 18 25 20 16 11 17 12 15 24 25 21 22 17 18 15 21 20 23 18 17 15
16 26 23 22 11 16 18 20 23 19 17 15 20 12
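A minimal sketch of one way to build the grouped frequency distribution for Problem 11 follows. It assumes Sturges' rule for the number of classes and integer class limits starting at the smallest observation; other conventions are equally acceptable and will give a different table.

```python
import math

# Leisure time (hours) of the 80 students, copied from the problem.
hours = [
    23, 24, 18, 14, 20, 24, 24, 26, 23, 21, 16, 15, 19, 20, 22, 14, 13, 20, 19, 27, 29, 22,
    38, 28, 34, 46, 23, 19, 21, 31, 16, 28, 19, 18, 12, 27, 15, 21, 25, 16, 30, 17, 22, 29,
    29, 18, 25, 20, 16, 11, 17, 12, 15, 24, 25, 21, 22, 17, 18, 15, 21, 20, 23, 18, 17, 15,
    16, 26, 23, 22, 11, 16, 18, 20, 23, 19, 17, 15, 20, 12,
]

n = len(hours)
# Sturges' rule (an assumed choice): k = 1 + 3.322 * log10(n), rounded up.
k = math.ceil(1 + 3.322 * math.log10(n))
# Class width: range divided by k, rounded up to a whole hour.
width = math.ceil((max(hours) - min(hours)) / k)

# Inclusive integer class limits starting at the smallest observation.
lower = min(hours)
classes = [(lower + i * width, lower + (i + 1) * width - 1) for i in range(k)]
freq = [sum(lo <= x <= hi for x in hours) for lo, hi in classes]

for (lo, hi), f in zip(classes, freq):
    print(f"{lo:2d}-{hi:2d}: {f}")
```

The frequencies must sum to 80; if they do not, a class limit has been chosen so that some observation falls outside every class.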
12. The frequency distribution of the birth weight (in kilograms) of 30 children is given below.
Weight 1.9-2.3 2.3-2.7 2.7-3.1 3.1-3.5 3.5-3.9 3.9-4.3
No. of children 5 5 F3 4 4 3
15. The prices in 2008, expressed as ratios to the prices in 2009, for six chemicals of the Gondar
University biology laboratory are 2.2, 1.85, 1.8, 2.05, 1.04 and 1.75 birr. Calculate the mean
price ratio and the sample standard deviation of the data.
16. The following are the percentages of ash content in 12 samples of coal found in close proximity:
9.2, 14.1, 9.8, 12.4, 16.0, 12.6, 22.7, 18.9, 21.0, 14.5, 20.4, 16.9. Find the sample mean and the
sample standard deviation of the data.
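As a sketch of how the sample mean and sample standard deviation in Problem 16 could be checked, the snippet below uses Python's standard `statistics` module. Note that `statistics.stdev` divides by n - 1, which is assumed here to be the intended "sample" convention.

```python
import statistics

# Percentage ash content of the 12 coal samples, copied from the problem.
ash = [9.2, 14.1, 9.8, 12.4, 16.0, 12.6, 22.7, 18.9, 21.0, 14.5, 20.4, 16.9]

mean = statistics.mean(ash)
sd = statistics.stdev(ash)  # sample standard deviation (divisor n - 1)

print(f"mean = {mean:.3f}, sample sd = {sd:.3f}")
```

The same two calls apply unchanged to the other raw-data problems on this sheet that ask for a mean and a standard deviation.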
17. The sample mean and sample variance of five data values are, respectively, 104 and 16. If three
of the data values are 102, 100, 105, what are the other two data values?
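One way to approach Problem 17 is sketched below, assuming the sample variance uses the divisor n - 1. Writing u and v for the deviations of the two unknown values from the mean, the mean fixes u + v and the variance fixes u^2 + v^2, so u and v are the roots of a quadratic. The variable names are illustrative only.

```python
import math
import statistics

known = [102, 100, 105]
n, mean, var = 5, 104, 16

# Work with deviations u = x - mean and v = y - mean of the two unknown values.
dev_sum = n * mean - sum(known) - 2 * mean                      # u + v
dev_sq = (n - 1) * var - sum((k - mean) ** 2 for k in known)    # u^2 + v^2
product = (dev_sum ** 2 - dev_sq) / 2                           # u * v

# u and v are the roots of t^2 - (u + v) t + u v = 0.
disc = math.sqrt(dev_sum ** 2 - 4 * product)
x = mean + (dev_sum + disc) / 2
y = mean + (dev_sum - disc) / 2

print(f"x = {x:.3f}, y = {y:.3f}")

# Check: the completed data set reproduces the stated mean and variance.
full = known + [x, y]
print(statistics.mean(full), statistics.variance(full))
```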
18. The following is a sample of prices, rounded to the nearest cent, charged per gallon of standard
unleaded gasoline in the San Francisco Bay area in June 1997: 3.88, 3.90, 3.93, 3.90, 3.93, 3.96,
3.88, 3.94, 3.96, 3.88, 3.94, 3.99, 3.98. Find the mean of the data set and its standard deviation.
19. Four LaserJet printers print computer-written pages at the rates of 10, 8, 6, and 4 pages per
minute. Calculate the average print rate of the four printers per minute using the harmonic mean.
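The harmonic mean asked for in Problem 19 is n divided by the sum of the reciprocals of the rates. The short sketch below computes it both with the standard-library helper and directly from that definition.

```python
import statistics

rates = [10, 8, 6, 4]  # pages per minute, from the problem

# Library helper and the definition n / sum(1/x) should agree.
hm = statistics.harmonic_mean(rates)
manual = len(rates) / sum(1 / r for r in rates)

print(f"harmonic mean = {hm:.4f} pages per minute")
```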
20. A dietitian measures the amount of sugar (in grams) in one gram of each of 16 different cereals:
0.03 0.24 0.30 0.47 0.43 0.07 0.47 0.13 0.44 0.39 0.48 0.17 0.13 0.09 0.45 and 0.43.
Find the IQR, MD, variance, SD, CV, skewness and kurtosis of the amount of sugar.
21. A random sample of 10 boys is selected from the population of a certain camp, and each boy’s
weight and height are measured and recorded. The average weight of the boys in the sample is
32.66kg with a standard deviation of 3.9kg and the average height is 95.5cm with a standard
deviation of 5.2cm. Which measurement, weight or height, is less variable?
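Because weight and height are in different units, Problem 21 is usually answered by comparing coefficients of variation, CV = 100 * SD / mean. A small sketch (the function name is chosen here for illustration):

```python
def coefficient_of_variation(sd, mean):
    """Return the coefficient of variation as a percentage."""
    return 100 * sd / mean

cv_weight = coefficient_of_variation(3.9, 32.66)   # kg
cv_height = coefficient_of_variation(5.2, 95.5)    # cm

print(f"CV weight = {cv_weight:.2f}%, CV height = {cv_height:.2f}%")
```

The measurement with the smaller CV is the less variable one relative to its mean.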
22. Let's say you have a group of 8 people, and you want to choose a committee of 3 people from
this group. How many different committees can you form?
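Since the order of committee members does not matter, Problem 22 is a combination count, C(8, 3) = 8! / (3! 5!). A quick check in Python:

```python
import math

# Number of ways to choose 3 people out of 8 when order does not matter.
committees = math.comb(8, 3)

# Same value from the factorial formula n! / (r! (n - r)!).
by_formula = math.factorial(8) // (math.factorial(3) * math.factorial(5))

print(committees)
```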
23. A sample of five tests was taken to determine the compression strength (ksi) of concrete. Test
results are 2.5, 3.5, 2.2, 3.2, and 2.9 ksi. Compute the variance, standard deviation, coefficient of
variation, Q1, Q2, and Q3 of concrete strength.
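The quantities in Problem 23 can be sketched with the standard `statistics` module as below. Two conventions are assumed: the variance uses the divisor n - 1, and the quartiles use the default "exclusive" method of `statistics.quantiles`, which places the i-th quartile at position i(n + 1)/4 in the sorted data. Other quartile conventions give slightly different values.

```python
import statistics

strength = [2.5, 3.5, 2.2, 3.2, 2.9]  # ksi, from the problem

mean = statistics.mean(strength)
variance = statistics.variance(strength)   # sample variance (divisor n - 1)
sd = statistics.stdev(strength)
cv = 100 * sd / mean                       # coefficient of variation, in percent

# Quartiles at positions i(n + 1)/4 in the sorted data (default method).
q1, q2, q3 = statistics.quantiles(strength, n=4)

print(f"variance = {variance:.3f}, sd = {sd:.3f}, CV = {cv:.1f}%")
print(f"Q1 = {q1:.2f}, Q2 = {q2:.2f}, Q3 = {q3:.2f}")
```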
24. Exposing steel to a corrosive environment, such as in the case of a steel bridge spanning a
waterway or a cargo ship making voyages regularly, leads to loss of thickness of structural
components. A corroded steel plate was measured at 20 locations and produced the following
measurements in mm: 7.807, 8.886, 8.694, 8.185, 9.235, 8.526, 6.890, 8.953, 6.284, 6.533, 8.953,
8.112, 7.372, 9.640, 7.344, 8.837, 8.900, 9.048, 7.253, and 8.588. Then:
a) Construct a grouped frequency distribution.
b) Construct a histogram, a frequency polygon, and less-than and more-than ogives.
c) Calculate the standard deviation, quartiles and coefficient of variation.
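The cumulative frequencies needed for the less-than and more-than ogives in Problem 24 can be sketched as follows. The class boundaries (6.0 to 10.0 mm in steps of 1 mm) are an assumption made only for this illustration; a narrower class width is just as valid and gives a finer table.

```python
from itertools import accumulate

# Plate thickness (mm) at the 20 locations, copied from the problem.
thickness = [
    7.807, 8.886, 8.694, 8.185, 9.235, 8.526, 6.890, 8.953, 6.284, 6.533,
    8.953, 8.112, 7.372, 9.640, 7.344, 8.837, 8.900, 9.048, 7.253, 8.588,
]

# Assumed class boundaries; each class is lower <= x < upper.
edges = [6.0, 7.0, 8.0, 9.0, 10.0]
freq = [sum(lo <= x < hi for x in thickness) for lo, hi in zip(edges, edges[1:])]

# Less-than ogive: cumulative count below each upper boundary.
less_than = list(accumulate(freq))
# More-than ogive: count at or above each lower boundary.
more_than = [len(thickness) - c for c in [0] + less_than[:-1]]

for (lo, hi), f, lt, mt in zip(zip(edges, edges[1:]), freq, less_than, more_than):
    print(f"{lo:.1f}-{hi:.1f}: f={f:2d}  less than {hi:.1f}: {lt:2d}  more than {lo:.1f}: {mt:2d}")
```

Plotting `less_than` against the upper boundaries and `more_than` against the lower boundaries gives the two ogives.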
25. The following concrete strength data (in ksi) were collected using an ultrasonic nondestructive
testing method at different locations of an existing structure: 3.5, 3.2, 3.1, 3.5, 3.6, 3.2, 3.4, 2.9,
4.1, 2.6, 3.3, 3.5, 3.9, 3.8, 3.7, 3.4, 3.6, 3.5, 3.5, 3.7, 3.6, 3.8, 3.2, 3.4, 4.2, 3.6, 3.1, 2.9, 2.5, 3.5,
3.4, 3.2, 3.7, 3.8, 3.4, 3.6, 3.5, 3.2, 3.6, and 3.8. Then:
a) Construct a grouped frequency distribution.
b) Construct a histogram, a frequency polygon, and less-than and more-than ogives.
c) Calculate the standard deviation, quartiles and coefficient of variation.
26. The sample mean of the initial 99 values of a data set consisting of 198 values is equal to 120,
whereas the sample mean of the final 99 values is equal to 100. What can you conclude about the
sample mean of the entire data set?
a) Repeat when “sample mean” is replaced by “sample median.”
b) Repeat when “sample mean” is replaced by “sample mode.”
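For the main question in Problem 26, the overall mean is the size-weighted average of the two group means, which the short sketch below evaluates. The medians or modes of the two halves do not, by themselves, determine the median or mode of the combined data, so no similar formula applies to parts (a) and (b).

```python
# Group sizes and group means, from the problem.
n1, mean1 = 99, 120
n2, mean2 = 99, 100

# Combined mean = total sum / total count.
combined_mean = (n1 * mean1 + n2 * mean2) / (n1 + n2)

print(combined_mean)
```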
27. The following data represent the lifetimes (in hours) of a sample of 40 transistors:
112, 121, 126, 108, 141, 104, 136, 134
121, 118, 143, 116, 108, 122, 127, 140
113, 117, 126, 130, 134, 120, 131, 133
118, 125, 151, 147, 137, 140, 132, 119
110, 124, 132, 152, 135, 130, 136, 128
a) Construct a grouped frequency distribution.
b) Determine the sample mean, and standard deviation.
28. An experiment measuring the percent shrinkage on drying of 50 clay specimens produced the
following data:
18.2 21.2 23.1 18.5 15.6
20.8 19.4 15.4 21.2 13.4
16.4 18.7 18.2 19.6 14.3
16.6 24.0 17.6 17.8 20.2
17.4 23.6 17.5 20.3 16.6
19.3 18.5 19.3 21.2 13.9
20.5 19.0 17.6 22.3 18.4
21.2 20.4 21.4 20.3 20.1
19.6 20.6 14.8 19.7 20.5
18.0 20.8 15.8 23.1 17.0
a) Find the sample mean, sample standard deviation, sample quartiles
b) Construct grouped frequency distribution and hence find sample mean, sample standard
deviation, sample quartiles
c) Draw a histogram and boxplot of these data and comment on the distribution of the data.
d) Compare the results you obtained in (a) and (b) and give your conclusion.