Math 231
Math 231
Probability and
Statistics
Lectures Notes
1. Like professional people, you must be able to read and understand the
various statistical studies performed in your fields. To have this
understanding, you must be knowledgeable about the vocabulary, symbols,
concepts, and statistical procedures used in these studies.
2. You may be called on to conduct research in your field, since statistical
procedures are basic to research. To accomplish this, you must be able to
design experiments; collect, organize, analyze, and summarize data; and
possibly make reliable predictions or forecasts for future use. You must also
be able to communicate the results of the study in your own words.
1
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
3. You can also use the knowledge gained from studying statistics to become
better consumers and citizens. For example, you can make intelligent
decisions about what products to purchase based on consumer studies, about
government spending based on utilization studies, and so on.
1.1 Descriptive and Inferential Statistics
To gain knowledge about seemingly haphazard situations, statisticians collect
information for variables, which describe the situation.
Data are the values (measurements or observations) that the variables can assume.
Variables whose values are determined by chance are called random variables.
Example 1.1: Suppose that an insurance company studies its records over the past
several years and determines that, on average, 3 out of every 100 automobiles the
company insured were involved in accidents during a 1-year period. Although there is
no way to predict the specific automobiles that will be involved in an accident (random
occurrence), the company can adjust its rates accordingly, since the company knows the
general pattern over the long run. (That is, on average, 3% of the insured automobiles will
be involved in an accident each year).
A collection of data values forms a data set. Each value in the data set is called
a data value.
In statistics it is important to distinguish between a sample and a population.
manufacturing, agriculture, and mortality. The Census is conducted by the Bureau of the
Census, which is part of the Department of commerce.
Most of the time, due to the expense, time, size of population, medical concerns, etc.,
it is not possible to use the entire population for a statistical study; therefore, researchers
u s e samples.
A sample is a group of subjects selected from a population.
If the subjects of a sample are properly selected, most of the time they should possess the
same or similar characteristics as the subjects in the population.
However, the information obtained from a statistical sample is said to be biased if the
results from the sample of a population are radically different from the results of a
census of the population. Also, a sample is said to be biased if it does not represent
the population from which it has been selected.
The body of knowledge called statistics is sometimes divided into two main areas,
depending on how data are used. The two areas are
1. Descriptive statistics
2. Inferential statistics
presentation of data.
3
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Example1.3: A researcher may wish to know if a new drug will reduce the
number of heart attacks in men over age 70 years of age. For this study,
two groups of men over age 70 would be selected. One group would be
given the drug, and the other would be given a placebo (a substance with
no medical benefits or harm). Later, the number of heart at- tacks
occurring in each group of men would be counted, a statistical test would be run,
and a decision would be made about the effectiveness of the drug.
Example 1.4: A study conducted at Manatee Community College revealed that
students who attended class 95 to 100% of the time usually received an A in the
class. Students who attended class 80 to 90% of the time usually received a B or C
in the class. Students who attended class less than 80% of the time usually
received a D or an F or eventually withdrew from the class.
Based on this information, attendance and grades are related. The more you attend
class, the more
4
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
likely it is you will receive a higher grade. If you improve your attendance, your
grades will probably improve. Many factors affect your grade in a course. One factor
that you have considerable control over is attendance. You can increase your
opportunities for learning by attending class more often.
1.2.1 Qualitative variables are variables that have distinct categories according
to some characteristic or attribute.
For example, if subjects are classified according to gender (male or female), then
the variable gender is qualitative. Other examples of qualitative variables are
religious preference and geographic locations.
For example, the variable age is numerical, and people can be ranked in order
according to the value of their ages. Other examples of quantitative variables are
heights, weights, and body temperatures.
Quantitative variables can be further classified into two groups: discrete and
continuous.
(i) Discrete variables can be assigned values such as 0, 1, 2, 3 and are said
to be countable. Examples of discrete variables are the number of
children in a family, the number of students in a classroom, and the
number of calls received by a call center each day for a month.
5
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
a. The number of hours during a week that children ages 12 to 15 reported that
they watched television.
b. The number of touchdowns a quarterback scored each year in his college
football career.
c. The amount of money a person earns per week working at a fast-food restaurant.
d. The weights of the football players on the teams that play in the NFL this year.
Solution:
a. Continuous, since the variable time is measured.
b. Discrete, since the number of touchdowns is counted.
c. Discrete, since the smallest value that money can assume is in cents.
d. Continuous, since the variable weight is measured.
1.3 Types of Data
In statistical analysis, there are different kinds of data, whose values are closely
related to the nature of the variables. There are two main types of data that are
mostly observed in practical applications which further are of different types:
which each observation may fall. Categorical data can further be classified
as
1- Nominal Data: It is a variable whose measurement indicates a category or
characteristic, more than an exact mathematical measure. In nominal
variables, there is not a clear order among categories, and so a nominal
variable is just a label of a characteristic of the observational unit without
a rating scale (order). For example gender, eye colour, religion, brand.
2- Ordinal Data: It is a variable whose measurement indicates a clear
ordered category or characteristic. In ordinal variables, there is a clear
order among categories. So an ordinal variable points out a characteristic
of an observational unit that can be ranked regarding a rating scale. For
example, a student’s grades such as (A, B, C), clothing size (small,
medium, large).
• Numerical Data: This kind of data, also known as quantitative data, arise
when the observations are counts or measurements. For example, the
quantities such as number of students in the class, weight of an individual,
temperature at a particular place, etc. The numerical data can further be of
two types.
1- Discrete Data: The domain of discrete data is integers. For example,
number of houses in a society, number of chapters in a book, etc.
2- Continuous Data: The domain of a continuous variable or some interval
on real line. Continuous domains are lattices and are clearly well ordered.
measurement scales
In addition to being classified as qualitative or quantitative,
variables can be clas sified by how they are categorized, counted,
or measured.
For example, can the data be organized into specific categories,
such as area of residence (rural, suburban, or urban)? Can the data
values be ranked, such as first place, second place, etc.? Or are the
7
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
The next level of measurement is called the ordinal level. Data measured at
this level can be placed into categories, and these categories can be ordered,
or ranked. For example, from student evaluations, guest speakers might be
ranked as superior, average, or poor. Floats in a homecoming parade might
be ranked as first place, second place, etc. Note that precise measurement of
differences in the ordinal level of measurement does not exist. For instance,
when people are classified according to their build (small, medium, or large),
a large variation exists among the individuals in each class.
8
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
The third level of measurement is called the interval level. This level differs
from the ordinal level in that precise differences do exist between units. For
example, many standardized psychological tests yield values measured on
an interval scale. IQ is an ex- ample of such a variable. There is a meaningful
difference of 1 point between an IQ of 109 and an IQ of 110. Temperature is
another example of interval measurement, since there is a meaningful
difference of 1°F between each unit, such as 72 and 73°F. One property is
lacking in the interval scale: There is no true zero. For example, IQ tests do
not measure people who have no intelligence. For temperature, 0°F does not
mean no heat at all.
4. The ratio level of measurement possesses all the characteristics of
interval measurement, and there exists a true zero. In addition, true
ratios exist when the same variable is measured on two different
members of the population.
The final level of measurement is called the ratio level. Examples of ratio
scales are those used to measure height, weight, area, and number of phone
calls received. Ratio scales have differences between units (1 inch, 1 pound,
etc.) and a true zero. In addition, the ratio scale contains a true ratio between
values. For example, if one person can lift 200 pounds and another can lift
100 pounds, then the ratio between them is 2 to 1. Put another way, the first
person can lift twice as much as the second person.
a. Ratio
b. Nominal
c. Interval
d. Ordinal
10
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Exercises
2. When the sample does not represent the population, it is called a biased sample.
11
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
called
a. A confounding variable. b. An explanatory variable.
c. An outcome variable. d. An interfering variable.
12
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
45 46 64 57 85
92 51 71 54 48
Class limits Tally Frequency
27 66 76 55 69
f
54 44 54 75 46
27–35 / 1
61 68 78 61 83
36–44 /// 3
88 45 89 67 56
45–53 //// //// 9
81 58 55 62 38
54–62 //// //// //// 15
55 56 64 81 38
63–71 //// //// 10
49 68 91 56 68
72–80 /// 3
46 47 83 71 62
81–89 //// // 7
90–98 // 2
50
and
frequencies.
13
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
∑𝑓 = 𝑛
Two types of frequency distributions that are most often used are the categorical
frequency distribution and the grouped frequency distribution. The procedures
for constructing these distributions are shown now.
• The relative frequency of an event is defined as the number of times that the
event occurs during experimental trials, divided by the total number of trials
conducted.
𝑓
• ∑ =1
𝑛
𝑓
(ii) Percentage Frequency ( 𝑛 × 100% )
• Many times, percentage frequency distributions are displayed as tables or as bar graphs
or pie charts.
Example 2.2: Twenty-five army inductees were given a blood test to determine their blood type.
14
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Step 5: Find the percentage of values in each class by using the formula
𝑓
× 100%
𝑛
15
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Relative
Class Tally Frequency Frequency Percentage
A //// 5 0.2 20
B //// // 7 0.28 28
O //// //// 9 0.36 36
AB //// 4 0.16 16
Total 25 1 100%
Step 3: Find the numerical frequencies from the tallies, and find the frequencies.
Example 2.3: These data represent the record high temperatures in degrees Fahrenheit (°F) for each
of the 50 states. Construct a grouped frequency distribution for the data, using 7 classes.
112 100 12 12 13 11 10 11 10 112
16
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
7 0 4 8 5 0 9
110 118 11 11 11 12 11 11 10 109
7 6 8 2 4 4 5
107 112 11 11 11 11 11 12 10 110
4 5 8 7 8 2 6
116 108 11 12 11 12 11 11 10 111
0 1 3 0 9 1 4
120 113 12 11 10 11 11 11 11 114
0 7 5 0 8 2 4
Solution:
The procedure for constructing a grouped frequency distribution for numerical data follows.
➢ Select a starting point for the lowest class limit. This can be the smallest data
value or any convenient number less than the smallest data value. In this case,
100 is used. Add the width to the lowest score taken as the starting point to get
the lower limit of the next class.
Subtract one unit from the lower limit of the second class to get the upper
limit of the first class. Then add the width to each upper limit to get all
17
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
105 − 1 = 104
➢ Find the class boundaries by subtracting 0.5 from each lower class limit and
adding 0.5 to each upper class limit:
Step 3: Find the numerical frequencies from the tallies. The completed frequency distribution is
Class Class
limits boundaries Tally Frequency
100–104 99.5–104.5 // 2
105–109 104.5–109.5 //// /// 8
110–114 109.5–114.5 //// //// //// /// 18
115–119 114.5–119.5 //// //// /// 13
120–124 119.5–124.5 //// // 7
125–129 124.5–129.5 / 1
130–134 129.5–134.5 / 1
Total 50
2. It is preferable but not absolutely necessary that the class width be an odd
number. This ensures that the midpoint of each class has the same place value
as the data. The class midpoint Xm is obtained by adding the lower and upper
boundaries and dividing by 2, or adding the lower and upper limits and
dividing by 2:
lower boundary + upper boundary
Xm =
2
3. The classes must be mutually exclusive.
18
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
cumulative
Example 2.4: The cumulative frequency distribution for the data in Example 2.3 is
as follows:
Cumulative frequency
Less than 99.5 0
Less than 104.5 2
Less than 109.5 10
Less than 114.5 28
Less than 119.5 41
Less than 124.5 48
Less than 129.5 49
Less than 134.5 50
Cumulative frequencies are used to show how many data values are accumulated up to
and including a specific class. In Example 2.3, of the total record high temperatures 28
are less than or equal to 114°F. Forty-eight of the total record high temperatures are less
than or equal to 124°F.
➢ There are two types of cumulative frequency.
(1) Less than type
(2) Greater than type
• Histogram
19
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
• Bar graph
• Pie diagram
• Frequency Polygon
2.2.1 Histogram
The histogram is a graph that displays the data by using contiguous vertical
bars (unless the frequency of a class is 0) of various heights to represent the
frequencies of the classes.
➢ Frequencies are along the vertical axis and the classes are along the horizontal axis.
➢ The frequencies within each interval of a histogram are represented by a rectangle, the
size of the interval being the base and the frequency of that interval the height.
➢ The area of each rectangle in a histogram corresponds to the frequency within a given
interval, while the total area of a histogram corresponds to the total frequency (n) of
the distribution.
Blood Types
AB
0 1 2 3 4 5 6 7 8
BLOOD TYPES
A B O AB
11%
13%
46%
30%
➢ Using the midpoints for the x values and the frequencies as the y values, plot the points.
21
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
➢ Connect adjacent points with line segments. Draw a line back to the x axis at the
beginning and end of the graph, at the same distance that the previous and next
midpoints would be located.
➢ The frequency polygon and the histogram are two different ways to represent the same
data set.
➢ From Example 2.3:
Find the midpoints of each class. Recall that midpoints are found by adding the upper and
lower boundaries and dividing by 2:
99.5 + 104.5
= 102
2
Class Midpoints Frequency
boundaries
99.5–104.5 102 2
104.5–109.5 107 8
109.5–114.5 112 18
114.5–119.5 117 13
119.5–124.5 122 7
124.5–129.5 127 1
129.5–134.5 132 1
The Ogive is defined as the frequency distribution graph of a series. The Ogive is a graph of a
cumulative distribution, which explains data values on the horizontal plane axis and either the
22
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
➢ Cumulative frequency graphs are used to visually represent how many values are below
a certain upper class boundary.
To find out how many record high temperatures are less than 14.5°F?
Locate 114.5°F on the x axis, draw a vertical line up until it intersects the graph,
and then draw a horizontal line at that point to the y axis. The y axis value is 28.
Exercises
3. It is not important to keep the width of each class the same in a frequency distribution.
4. Frequency distributions can aid the researcher in drawing charts and graphs.
5. The type of graph used to represent data is determined by the type of data collected and by
the researcher’s purpose.
6. In construction of a frequency polygon, the class limits are used for the x axis.
7. Data collected over a period of time can be graphed by using a pie graph.
10. What graph should be used to show the relationship between the parts and the whole?
a. Histogram
b. Pie graph
c. Pareto chart
d. Ogive
11. Except for rounding errors, relative frequencies should add up to what sum?
a. 0
b. 1
24
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
c. 50
H C H M H A C A M
C M C A M A C C M
C C H A H H M
25
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Since the parameters are unknown, statistics (known in general) are used to approximate
(estimate) parameters.
We are now able to define a number of important summarized measures, starting with the
arithmetic average or mean.
Understanding descriptive statistics, their measures of center and their variability, helps form
the foundation of statistical analysis. Descriptive statistics tell us how frequently an
observation occurs, what is considered average, and how far data in our sample deviate from
being average. With these statistics, we are able to provide a summary of characteristics from
both large and small datasets. Measures of central tendency and variability provide valuable
information on their own, and form the cornerstone of the quantitative structures.
26
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
sample has been grouped around a value. Some important central tendency measures are the
Mean, Median and Mode.
3.1.1 Mean
The mean is the arithmetic average of all of the data points. It is also the most common measure
of central tendency and is the most widely understood. In fact, when most people think of
average, they are imagining the mean.
➢ The population mean, denoted by μ, is calculated by using all the values in the
population. The population mean is a parameter.
∑𝑁
𝑖=1 𝑋𝑖
𝜇=
𝑁
where N represents the total number of values in the population and 𝑋1 , 𝑋2 , … … . , 𝑋𝑁 are the
population values.
➢ The sample mean, denoted by 𝑋̅, is calculated by using sample data. The sample mean
is a statistic.
∑𝑛𝑖=1 𝑥𝑖
𝑥̅ =
𝑛
where n represents the total number of values in the sample and 𝑥1 , 𝑥2 , … … . , 𝑥𝑛 are the
sample values.
Example 3.1: The number of confirmed flu cases for a 9-year period is shown.
4 46 98 115 88 44 73 48 62
(ii) Find the mean for this sample, 98, 115, 88 and 44?
98 + 115 + 88 + 44
𝑥̅ = = 86.25.
4
27
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Example 3.2: The frequency distribution shows the salaries (in millions) for a specific year
of the top 25 CEOs in the United States. Find the mean.
Class Frequency
boundaries
15.5–20.5 13
20.5–25.5 6
25.5–30.5 4
30.5–35.5 1
35.5–40.5 1
Total 25
Solution:
Construct this table
28
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
15.5–20.5 13 18 234
20.5–25.5 6 23 138
25.5–30.5 4 28 112
30.5–35.5 1 33 33
35.5–40.5 1 38 38
n = 25 ∑ 𝑓. 𝑚𝑖
= 555
∑𝑛𝑖=1 𝑓 𝑚𝑖 555
𝑥̅ = = = 22.2.
𝑛 25
3.1.2 Median
The median is the halfway point in a data set. Before you can find this point, the data must be
arranged in ascending or increasing order. When the data set is ordered, it is called a data array. The
median either will be a specific value in the data set or will fall between two values.
➢ To find the median is
ordered sequence.
4- If n is even there is strictly no middle observation, but the median is defined by
𝑛 𝑛
convention as the average of the two middle observations, ( 2 ), ( 2 + 1) from either
end.
Example 3.3: The number of tornadoes that have occurred in the United States
over an 8-year period is as follows. Find the median.
29
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Solution:
Arrange the data values in ascending order.
The median of the grouped data can be calculated using the following formula;
𝑛⁄ − (∑ 𝑓)
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + ( 2
𝑚−1
) ℎ,
𝑓𝑚
where
L: Lower class boundary of the median class.
n: Sum of all frequencies.
𝑓𝑚 : The frequency of the median class.
(∑ 𝑓)𝑚−1 : Sum of frequency before the median class.
h: the size of the class interval.
Example 3.3: A mobile phone company examines the ages of 150 customers to start special
plans for them. Consider frequency table
Age(years) 10-19 20-29 30-39 40-49 50-59 60-69
Frequency 14 40 28 27 24 17
Find the Median.
Solution:
𝑛
Since n=150, 2 = 75
Age True class Frequency Cumulative
Frequency
10-19 9.5-19.5 14 14
20-29 19.5-29.5 40 54
30-39 29.5-39.5 28 82
30
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
3.1.3 Mode
The mode is defined as the observation in the sample which occurs most frequently if
there is such an observation.
A data set that has only one value that occurs with the greatest frequency is said to be
unimodal. If a data set has two values that occur with the same greatest frequency, both values
are considered to be the mode and the data set is said to be bimodal. If a data set has more than
two values that occur with the same greatest frequency, each value is used as the mode, and
the data set is said to be multimodal. When no data value occurs more than once, the data set
is said to have no mode. Note: Do not say that the mode is zero. That would be incorrect,
because in some data, such as temperature, zero can be an actual value. A data set can have
more than one mode or no mode at all.
Example 3.4: The data show the number of public libraries in a sample of eight states. Find the
mode.
Solution:
31
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Solution:
Since the values 104 and 109 both occur 5 times, the modes are 104 and 109. The data
set is said to be bimodal.
The mode of the grouped data can be calculated using the following formula;
∆1
𝑚𝑜𝑑𝑒 = 𝐿 + ( )ℎ
∆1 + ∆2
Where,
L: lower class boundary of model class.
∆𝟏 : excess of modal frequency over frequency of next lower class.
∆𝟐 : excess of model frequency over frequency of next higher class.
h: size of modal class interval.
Example 3.6: Calculate the mode for the data in Example 3.3.
Solution:
The modal class is the second class 19.5-29.5, from which we obtain
𝐿 = 19.5, 𝑓 = 40, 𝑓1 = 14, 𝑓2 = 28 𝑎𝑛𝑑 ℎ = 10
∆𝟏 = 𝑓 − 𝑓1 = 40 − 14 = 26
∆𝟐 = 𝑓 − 𝑓2 = 40 − 28 = 12
Then,
26
𝑚𝑜𝑑𝑒 = 19.5 + ( ) × 10 = 26.34.
26 + 12
The mode is not used widely in analytical statistics, other than as a descriptive measure, mainly
because of the ambiguity in its definition, as the fluctuations of small frequencies are apt to
produce spurious modes.
Notes:
1. The mode is used when the most typical case is desired.
2. The mode is the easiest average to compute.
32
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
3. The mode can be used when the data are nominal or categorical, such as religious preference,
gender, or political affiliation.
4. The mode is not always unique. A data set can have more than one mode, or the mode may
not exist for a data set.
When the majority of the data values fall to the right of the mean and cluster at the upper
end of the distribution, with the tail to the left, the distribution is said to be negatively skewed
or left-skewed. Also, the mean is to the left of the median, and the mode is to the right of the
median. As an example, a negatively skewed distribution results if the majority of students score
very high on an instructor’s examination. These scores will tend to cluster to the right of the
distribution.
When a distribution is extremely skewed, the value of the mean will be pulled toward the tail, but
the majority of the data values will be greater than the mean or less than the mean (depending on
which way the data are skewed); hence, the median rather than the mean is a more appropriate
measure of central tendency. An extremely skewed distribution can also affect other statistics.
33
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Example 3.7: Calculate the Range for the data in Example 3.4.
Solution:
𝑅 = 382 − 21 = 361.
The range for these data is quite large since it depends on the highest data value and the lowest data
value. To have a more meaningful statistic to measure the variability, statisticians use measures called
the variance and standard deviation.
3.3.2 Variance
Variability refers to the spread of a sample, its range, or distribution. A variability measure
gives us an idea of how a data sample is spread around a value. Two important variability
34
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
∑(𝑥𝑖 − 𝑥̅ ) = 0
𝑖=1
(1) The population variance is the average of the squares of the distance each value is from
the mean. The symbol for the population variance is 𝜎 2 ,
𝑁
1
𝜎 = ∑(𝑋𝑖 − 𝜇)2
2
𝑁
𝑖=1
2
∑𝑛𝑖=1 𝑥𝑖 2 − (∑𝑛𝑖=1 𝑥𝑖 )2⁄𝑛
𝑆 = .
𝑛−1
The sample standard deviation is the square root of the variance,
𝑆 = √𝑆 2 .
where n − 1 is the degrees of freedom of the sample variance, one degree of freedom is lost
since ∑𝑛𝑖=1 𝑥𝑖 is known.
35
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
The variance is always nonnegative, and it has the squared units of the sample values. If the
data observations are more spread, the variance is higher and vice versa.
Example 3.8: Find the variance and standard deviation for the data set in an example 3.1,
98, 115, 88 and 44.
Solution:
2
∑𝑛𝑖=1 𝑓 𝑥𝑖 2 − (∑𝑛𝑖=1 𝑓 𝑥𝑖 )2⁄𝑛
𝑆 = ,
𝑛−1
Where 𝑓 denotes the frequency.
(2) Grouped data
∑𝑛𝑖=1 𝑓 𝑚𝑖 2 − (∑𝑛𝑖=1 𝑓 𝑚𝑖 )2 ⁄𝑛
𝑆2 = ,
𝑛−1
Where 𝑓 denotes the frequency and 𝑚𝑖 is the interval midpoint.
Example 3.9: Find the sample variance and the sample standard deviation for the frequency
distribution of the data shown. The data represent the number of miles that 20 runners ran during
one week.
True Class Midpoint Frequency
5.5–10.5 8 1
10.5–15.5 13 2
15.5–20.5 18 3
20.5–25.5 23 5
25.5–30.5 28 4
30.5–35.5 33 3
35.5–40.5 38 2
36
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Solution:
Construct this table
5.5–10.5 8 1 8 64
10.5–15.5 13 2 26 338
15.5–20.5 18 3 54 972
20.5–25.5 23 5 115 2,645
25.5–30.5 28 4 112 3,136
30.5–35.5 33 3 99 3,267
35.5–40.5 38 2 76 2,888
∑ 𝑓 𝑚𝑖 𝑛
Total n=20
𝑖=1
∑ 𝑓 𝑚𝑖 2
= 490 𝑖=1
= 13,310
37
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Example 3.10: The mean of the number of sales of cars over a 3-month period is 87, and the
standard deviation is 5. The mean of the commissions is $5225, and the standard deviation is $773.
Compare the variations of the two.
Solution:
The coefficients of variation are
5
𝐶. 𝑉1 = × 100% = 5.7%,
87
773
𝐶. 𝑉2 = × 100% = 14.8%,
5225
Since the coefficient of variation is larger for commissions, the commissions are more variable
than the sales.
Example 3.11: Find the coefficient of variation for the data set in an example 3.9.
Solution:
Coefficient of variation is
8.3
𝐶. 𝑉 = × 100% = 0.122%.
24.5
To be able to evaluate our relative position when interested in comparing performance and
knowing a ranking.
Used to describe the position of a data value in relation to the rest of the data
38
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
cores
core ormula
yes
positive score means that a score is a o e the mean.
negative score means that a score is elo the mean.
score of means that a score is the exa t sa e as the mean.
Example
ath
istor
39
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Example
Test
Test
Test .
Test .
hich is higher
est is hi her therefore it is e er
t has a hi her rela e osi on
3.4.1 percentiles
In statistics, percentiles are used to understand and interpret data. The nth percentile of a set
of data is the value at which n percent of the data is below it. In everyday life, percentiles are
used to understand values such as test scores, health indicators, and other measurements. For
example, an 18-year-old male who is six and a half feet tall is in the 99th percentile for his
height. This means that of all the 18-year-old males, 99 percent have a height that is equal to
or less than six and a half feet. An 18-year-old male who is only five and a half feet tall, on the
other hand, is in the 16th percentile for his height, meaning only 16 percent of males his age
are the same height or shorter.
The percentiles divide the data set into 100 equal groups. Percentiles are symbolized by
𝑃1 , 𝑃2 , 𝑃3 , … . . , 𝑃99 and divide the distribution into 100 groups.
40
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
The percentile corresponding to a given value X is computed by using the following formula:
Example 3.12: The number of traffic violations recorded by a police department for a 10-day
period is shown. Find the percentile rank of 16.
22 19 25 24 18 15 9 12 16 20
Solution:
Arrange the data in order from lowest to highest.
9 12 15 16 18 19 20 22 24 25
Then substitute into the formula.
3 + 0.5
𝑃𝑒𝑟𝑐𝑒𝑛𝑡𝑖𝑙𝑒 = × 100 = 35𝑡ℎ
10
𝑛. 𝑃
𝑐=
100
where n = total number of values and p = percentile
3- If c is not a whole number, round up to the next whole number. Starting at the lowest
value, count over to the number that corresponds to the rounded- up value.
4- If c is a whole number, use the value halfway between the cth and (c + 1)st values
when counting up from the lowest value.
Example 3.13: Using the data in Example 3.12, find the value corresponding to the 65th
percentile and 30th percentile.
Solution:
Arrange the data in order from lowest to highest.
41
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
9 12 15 16 18 19 20 22 24 25
To compute 65th percentile
10 × 65
𝑐= = 6.5
100
Since c is not a whole number, round it up to the next whole number; in this case, it is c = 7.
Start at the lowest value and count over to the 7th value, which is 20. Hence, the value of 20
corresponds to the 65th percentile.
To compute 30th percentile
10 × 30
𝑐= = 3,
100
Since c is a whole number, use the value halfway between the c and c + 1 values when counting up from
the lowest. In this case, it is the third and fourth values.
9 12 15 16 18 19 20 22 24 25
3th 4th
The halfway value is between 15 and 16. It is 15.5. Hence, 15.5 corresponds to the 30th percentile.
er en les
alues of the variable that divide a ranked set into .
or e ample would be at .
( )
= + h i edian
r. abab abr
42
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Example
27
= 6.75
4
54
= 13.5
4
81
= 20.25
4
6.75 3
1 = 10 20.25 18
3 = 129. 9
13.5 10
2 =8
4
3.4.2 Quartiles
Quartiles divide the distribution into four equal groups, denoted by Q1, Q2, Q3.
Note that Q1 is the same as the 25th percentile; Q2 is the same as the 50th percentile, or the
median; Q3 corresponds to the 75th percentile, as shown:
2- Find the median of the data values. This is the value for 𝑄2 .
3- Find the median of the data values that fall below 𝑄2 , This is the value for 𝑄1.
4- Find the median of the data values that fall above 𝑄2 . This is the value for 𝑄3 .
43
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Example 3.14: Using the data in Example 3.12, find 𝑄1, 𝑄2 , and 𝑄3 .
Solution:
Arrange the data in order from lowest to highest.
9 12 15 16 18 19 20 22 24 25
9 12 15 16 18 19 20 22 24 25
18 + 19
𝑀𝑒𝑑𝑖𝑎𝑛 = = 18.5 = 𝑄2
2
9 12 15 16 18
𝑄1 = 15.
19 20 22 24 25
𝑄3 = 22.
44
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
. ( )
= . +
3.4.3 Deciles
Deciles divide the distribution into 10 groups, as shown. They are denoted by D1, D2, etc.
Note that D1 corresponds to P10; D2 corresponds to P20; etc. Deciles can be found by using the
formulas given for percentiles. Taken altogether then, these are the relationships among percentiles,
deciles, and quartiles.
Deciles are denoted by D1, D2, D3, . . . , D9, and they correspond to
P10, P20, P30, . . . , P90.
Quartiles are denoted by Q1, Q2, Q3 and they correspond to P25, P50, P75. The median
is the same as P50 or Q2 or D5.
45
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
, ..,
( )
= +
Example Find D2
46
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
= . +(
47
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
ol on
The data value is e tremel suspect. These are the
steps in checking for an outlier.
te ind
PrConstruction a Boxplot
Step 1 Find the five-number summary for the data.
Step 2 Draw a horizontal axis and place the scale on the axis. The scale should start on or
below the minimum data value and end on or above the maximum data value.
Step 3 Locate the lowest data value, Q1, the median, Q3, and the highest data value; then draw
a box whose vertical sides go through Q1 and Q3. Draw a vertical line through the
median. Finally, draw a line from the minimum data value to the left side of the box,
and draw a line from the maximum data value to the right side of the box.
Example
49
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
symmetric.
b. skewed.
c.
b.
c.
50
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Example
51
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
52
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
ste and leaf lot is a data lot that ses art of the data al e
as the ste and art of the data value as the leaf to form groups or
classes
ol on
te rran e the data in order
53
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
54
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
ol on
te rran e the data for oth data sets in order
te onstr t a ste and leaf lot sin the sa e di its as
ste s la e the di its
for the leaves for tlanta on the le side of the stem and the digits
for the leaves for hiladelphia on the right sideas shown.
55
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Exercises
(1) Determine whether each statement is true or false. If the statement is false, explain
why.
1. When the mean is computed for individual data, all values in the data set are used.
2. The mean cannot be found for grouped data when there is an open class.
3. A single, extremely large value can affect the median more than the mean.
4. One-half of all the data values will fall above the mode, and one-half will fall below the
mode.
5. In a data set, the mode will always be unique.
7.
10. If a person’s score on an exam corresponds to the 75th percentile, then that person
obtained 75 correct answers out of 100 questions.
(2) Select the best answer.
11. What is the value of the mode when all values in the data set are different?
a. 0 c. There is no mode.
b. 1 d. It cannot be determined unless the data values are given.
12. When data are categorized as, for example, places of residence (rural, suburban, urban), the
most appropriate measure of central tendency is the
a. Mean c. Mode
b. Median d. Midrange
15. A statistic that tells the number of standard deviations a data value is above or below the
mean is called
16. When a distribution is bell-shaped, approximately what percentage of data values will fall
within 1 standard deviation of the mean?
a.50% c. 95%
b.68% d. 99.7%
17. The number of highway miles per gallon of the 10 worst vehicles is shown.
12 15 13 14 15 16 17 16 17 18
Errors Frequency
0–2 1
3–5 3
6–8 4
9–11 1
12–14 1
19. The average number of newspapers for sale in an airport newsstand is 56 with a standard
deviation of 6. The average number of newspapers for sale in a convenience store is 44
with a standard deviation of 5. Which data set is more variable?
57
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
20. , use each boxplot to identify the maximum value, minimum value, median, first quartile, third quartile, and
interquartile range
1.
2.
50 55 60 65 70 75 80 85 90 95 100
58
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Chapter 4: Probability
What is Probability?
The word probability means chance or possibility of an outcome. It explains
the possibility of a particular event to occur. We often use sentences like -
‘It will probably rain today’, ‘he will probably pass the test’, ‘there is very less
probability of getting storm tonight’, ‘most probably the price of onion will
go high again’. In all these sentences we replace words like chance, doubt,
maybe, likely, etc with the word probability. Probability is basically the
prediction of an event which is either based on the study of previous
records or the number and type of possible outcomes.
Later, J. Cardan, an Italian Mathematician wrote the first book named 'Book
on Games of Chance' in 1663 that deals with the inception of probability.
This caught the attention of some of the great Mathematicians J. Bernoulli,
P. Laplace, A.A Markov and A.N.Kolmogorov.
59
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Uses of Probability
Probability is important to figure out if a particular thing is going to occur
in an event or not. It also helps us to predict future events and take action
accordingly. Below are the uses of probability in our day to day life.
jobs in the upcoming year so that the vacancy can be created accordingly.
Politicians can also analyze the rate of car and bike accidents increased in
past years so that they can take measures and reduce road accidents.
In the study of statistics, we consider experiments for which the outcome cannot
be predicted with certainty. Such experiments are called random experiments.
Some sample spaces for various probability experiments are shown here.
61
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Die 2
Die 1 1 2 3 4 5 6
1 (1, 1) (1, 2) (1, 3) (1, 4) (1, 5) (1, 6)
2 (2, 1) (2, 2) (2, 3) (2, 4) (2, 5) (2, 6)
3 (3, 1) (3, 2) (3, 3) (3, 4) (3, 5) (3, 6)
4 (4, 1) (4, 2) (4, 3) (4, 4) (4, 5) (4, 6)
5 (5, 1) (5, 2) (5, 3) (5, 4) (5, 5) (5, 6)
6 (6, 1) (6, 2) (6, 3) (6, 4) (6, 5) (6, 6)
4.2 An Event:
• S is an event ( I m p o s s i b l e e v e n t ).
• S is an event (Sure event).
Example:
62
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Ac is the complement of A.
Ac consists of all outcomes of but are not in A.
Ac occurs if A does not.
63
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
It is impossible that both events occur together (in the same time).
AB AB =
A and B are not A and B are mutually
mutually exclusive exclusive (disjoint)
(It is possible that both (It is impossible that both
events occur in the same events occur in the same
time) time)
64
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Classical probability assumes that all outcomes in the sample space are equally
likely to occur.
For example, when a single die is rolled, each outcome has the same probability
of occurring. Since there are six outcomes, each outcome has a probability of 1⁄6
.
Equally likely events are events that have the same probability of occurring.
n(S)
where n(E) is the number of outcomes in E and n(S) is the number of outcomes in the samplespace S.
EXAMPLE: A coin is tossed twice. What is the probability that at least one head
occurs?
SOLUTION: The sample space for this experiment is
S = {HH, HT, TH, TT}
If the coin is fair, each of the outcomes would be equally likely to
occur. If A represents the event of occurring at least1 head then, A
= {HH, HT, TH} hence, P(A) = 3/ 4
EXAMPLE : Throw an unbiased coin three times and observe the sequence of heads
and tails. Here the sample space is the collection of all possible sequences,
S = {HHH, HHT, HTH, THH, HTT, THT, TTH, TTT}
Since the outcomes are equally likely and mutually exclusive then the probability
of each outcome is 1/8. Let A be the event that two or more heads appear
consecutively, andB that all the tosses are the same. Then A = {HHH, HHT, THH}
and B = {HHH, TTT} Therefore
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
𝟑 𝟐
Solution 𝑷(𝑨) = , 𝑷(𝑩) =
𝟖 𝟖
𝟏 𝟒
𝑷(𝑨 ∩ 𝑩) = , 𝑷(𝑨 ∪ 𝑩) = ,
𝟖 𝟖
EXAMPLE: A fair die is tossed twice. What is the probability that sum of
upturned faces is 9?
SOLUTION: The sample space for this experiment is
S = {(i, j): i = 1,2,...,6; j = 1,2,...,6}
Since the die is fair (unbiased), each of the 36 possible outcomes would be equally
to occur. If A represents the event that the sum of the upturned faces is 9 then,
A = {(3,6) , (4,5), (5,4), (6,3)} Hence, P(A) = 4/36 = 1/9.
PROBABILITY OF AN EVENT.
1. 𝑃(𝐴) ≥ 0
2. 𝑃(𝑆) = 1 𝑎𝑛𝑑 𝑃(∅) = 0
3. 𝑃(𝐴 ∪ 𝐵) = 𝑃(𝐴) + 𝑃(𝐵) − 𝑃(𝐴 ∩ 𝐵)
𝑃(𝐴𝐶 ) = 1 − 𝑃(𝐴)
∴ ∑𝑘𝑖=1 𝑃(𝐴1 ) = 1
EXAMPLE: The probability that a student passes mathematics is 0.85, and the
probability that he passes English is 0.8. If the probability of passing at least one
courseis 0.9, what is the probability that he will pass both courses?
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
SOLUTION: If M is the event "passing mathematic" and E the event "passing
English", then by transposing the terms in the Additive Rule, we have
P(M E) = P(M) + P(E) – P(M E)0.9 = 0.85 + 0.80 – P(M E) P(M E) =0.75
284
1. 𝑃(𝐸1 ) =
630
258
2. 𝑃(𝐸2 ) =
630
63
3. 𝑃(𝐸3 ) =
630
25
4. 𝑃(𝐸4 ) =
630
5. 𝑃(𝐸4 ∪ 𝐸2 ) = 𝑃(𝐸4 ) + 𝑃(𝐸2 )
284
6. 𝑃(𝐸1𝐶 ) = 1 −
630
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
er tations
he n er of er tations of o je ts takin r o je ts at a ti e
order is i ortant
xa les
1. permutations of two digits from {0, 1}: 01, 10
2. permutations of three letters from { x, y, z }:
xyz, xzy, yxz, yzx, zxy, zyx
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
ol tion The first seat can be filled in an one of a s and when this has
been done there are 9 a s of filling second seat 8 a s of filling the third
seat and wa s of filling the fourth seat. Therefore Number of
arrangements of people taken at a time . . .
n general
Number of arrangements of different objects taken at a
𝒏!
time - …… -r+ 𝑃𝑟 𝑛⬚ = (𝒏−𝒓)!
This is also called the of m r p rm o
j k and is denoted b 𝑃 𝑛 .
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Example: How many three letter strings can be formed from the
letters in “compiler” if no letters can be repeated?
8!
Solution 𝑃(8,3) = = 336
5!
Example :
o ination
The number of combinations of objects taken from objects order is not
i ortant
𝒏 𝒏
( ) = 𝑪(𝒏, 𝒓) = 𝑪𝒓
𝒓
𝑷(𝒏, 𝒓)
=
𝒓!
𝒏!
=
𝒓! (𝒏 − 𝒓)!
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Example
#
Example
a. In how many ways can a committee with at least 1
freshman be selected?
Method 1 1 C(19, 1) C(34, 7) +
2 C(19, 2) C(34, 6) +
3 C(19, 3) C(34, 5) +
number of freshmen 4 C(19, 4) C(34, 4) +
on committee
5 C(19, 5) C(34, 3) +
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
ol tion
a. Total number of possible selections (5) (7)
2 3
5 6
b. Total number of possible selections ( ) ( )
2 2
c. Total number of possible selections (3) (7)
2 3
When you begin to solve a counting problem, it's always, always, always a good idea to create
a specific example (or two or three...) of the things you are trying to count.
If there are ways of doing one thing, and ways of doing another thing, then the
Multiplication Principle tells us there are ways of doing both things.
If you want to place things in positions, and you care about order, you can do it in ways.
(Doing so is called permuting items at a time.)
If you want to place things in positions, and you care about order, you can do it in
ways. (Doing so is called permuting items at a time.)
If you have items of one kind, items of another kind, and items of a third kind, then
there are ways of arranging the items. (Doing so is called counting the number
of distinguishable permutations.)
If you have items of one kind and items of another kind, then there are
ways of choosing the items without replacement and without regard to order.
(Doing so is called counting the number of combinations, and we say " choose ".)
6
of them there are ( )
3
(6)
3
ence the probabilit of selecting all humanities courses is
(10
3)
. Number of wa s to select one humanit and ps cholog of them there
6 4
are ( ). ( )
1 2
(𝟔) (𝟒)
𝟏 𝟐
the ro a ilit of sele tin one h anit and s holo is
(𝟏𝟎)
𝟑
5.1 INTRODUCTION
Solution If E= { (H, H)} denotes the event that both flips land heads, and
F ={ (H, H), (H, T)} the event that the first flip lands heads, then the desired
probability is given by
𝑷(𝑬 ∩ 𝑭) 𝑷({ (H, H)} ) 𝟏⁄ 𝟏
𝑷(𝑬|𝑭) = = = 𝟒=
𝑷(𝑭) 𝑷({ (H, H), (H, T)}) 𝟐⁄ 𝟐
𝟒
Solution Let Y denote the event that the marble selected is yellow, and
𝐶
let 𝐵 denote the event that it is not black.
𝑷(𝒀 ∩ 𝐵𝐶 ) 𝟓⁄
𝑷(𝒀|𝐵 = 𝐶)
= 𝟐𝟓
𝐶
𝑷(𝐵 ) 𝟏𝟓⁄
𝟐𝟓
ol tion
irst t e is defe ti e B e ond t e is defe ti e ∩ 𝑩 : 𝐵𝑜𝑡ℎ 𝑎𝑟𝑒 𝑑𝑒𝑓𝑒𝑐𝑡𝑖𝑣𝑒
15
𝑃(𝐴) = 𝑃(𝐵|𝐴) 4/
240
15 14
𝑃(𝐴 ∩ 𝐵) = 𝑃(𝐴). 𝑃(𝐵|𝐴) = .
240 239
2
= = .
1 = .
and are inde endent
3
=1 =1 ( + )
= + .
= 1 = .
1 = . ( )
and are inde endent
Example
If A and B are independent, P ( A | B ) .2, P (B ) .3.
Find
1 - P (A B )
2 − P (A B )
ol tion
red marble lue marble
3 3
a. {both are blue} ∩𝐵 . | .
5 5
2 2
b. {both are red} ∩𝑅 . | .
5 5
c. {one red and one blue} {red and blue or blue and red}
∩𝐵 ∩𝑅
. | . |
2 3 3 2
. .
5 5 5 5
o of total ro a ilit
roof
𝐴 = (𝐴 ∩ 𝐵1) ∪ (𝐴 ∩ 𝐵2) … . . (𝐴 ∩ 𝐵𝑛)
f (𝐴 ∩ 𝐵1), (𝐴 ∩ 𝐵2), … . . , (𝐴 ∩ 𝐵𝑛)
are t all ex l si e
P(𝐴 ∩ 𝐵1) + 𝑃(𝐴 ∩ 𝐵2) … . . +𝑃(𝐴 ∩ 𝐵𝑛)
. | . | ….. n. | n
hen
𝑃(𝐴) = ∑𝑛𝑖=1 𝑃(𝐴|𝐵𝑖 )𝑃(𝐵𝑖 )
Bayes' Formula
𝑃(𝐵𝑖 ∩𝐴) 𝑃(𝐵𝑖 ).𝑃(𝐴|𝐵𝑖 )
𝑃(𝐵𝑖 |𝐴) =
𝑃(𝐴) 𝑛
𝐴𝐵
∑𝑖=1 𝑃( | 𝑖 )𝑃(𝐵𝑖 )
rst bo second bo
1 1
2
, ( 2)
2
3 2
|
5
, |
7
( ) . ( | ) 21
|
( )
= =
. + . ( | ) 31
Dr. Rabab Sabry Intro. Prob and Stat Dr. Dina Ahmed
Exercises
1.From a box containing 5 black balls and 3 green balls, 2 balls are
drawn in succession, the first ball being replaced in the box before the
second draw is made.
(a) What is the probability that both balls are the same color?
(b) What is the probability that color is represented?
2- In a certain population of women 4% have had breast cancer, 25% are
smokers and 2.5% are smokers and have had breast cancer. A women
is selected at random from thepopulation:
(a) If she is a smoker, what is the probability that she alsohas breast
cancer?
(b) If she hasn't breast cancer, what is the probability thatshe is a
smoker?
(c) What is the probability that she is neither a smoker norhas breast
cancer?
3- Out of 5 mathematicians and 7 statisticians, a committee consisting of
2 maths and 3 statists. Is to be formed. In howmany ways this be done
if
(a) Any mathematician and any statistician can beincluded.
(b) One particular statistician must be on the committee.
(c) Two particular mathematicians cannot be on the committee.
5 Urn has white and black balls Urn white and black and Urn white and
black. n urn is selected at random and a ball drawn at random is found to be white.
ind the probabilit that Urn was selected.
6 A box contains 8 red, 3 white, and 9 blue balls. If 3 balls are drawn at random without
replacement, determine the probability that (a)all 3 are red, (b) all 3 are white, (c) 2 are
red and 1 is white, (d) at least 1 is white, (e) 1 of each color is drawn, (f) The balls are
drawn in the order red, white, blue
7 Of 10 girls in the class, 3 have blue eyes. If two of the 11 girls are chosen at random,
what is the probability that 12 a) Both have blue eyes 13 b) Neither has blue eyes 14 c) At least
one has blue eyes