QM
QM
Ans: at most 1-0.91=0.09 (or 9%) of the observations are outside of this
interval, either less than 1.7 or greater than 5. Since we can not assume that
distribution is symmetric, we do not know what part of the 9% is less than 1.7
and what part is more than 5. To be conservative, the best we can say is at
most 9% of the observations are more than 5 mins long.
Mean S.D
English $32,300 $1,175
CS $ 42,500 $ 2,375
One issue with this question is how probability questions can be worded. One might ask:
“What proportion of employees are female?” The answer is 43%. I could also ask “if I
choose an employee at random, what is the probability that they are female?” The answer
is again 43%.
(a) The proportion of employees who are female and exercise is 6%.
(b) The proportion of employees who are female and do not exercise during the
lunch hour is 37%?
(c) This question restricts attention to females only. This is a conditional probability.
Out of the 43% of female employees, 6% exercise so the (conditional) probability is
6/43=14%. In terms of a formula
(d) This question restricts attention to 18% of employees who exercise. The
proportion of these who are male is 12% so the (conditional) probability of being male is
67%. In terms of a formula
• The probability a randomly selected family belongs to the AAA
automobile club is 0.25. If a family belongs to AAA, the
probability they have more than one car is 0.45. Suppose a
family is randomly selected. What is the probability they have
more than one car and belong to AAA?
Ans:
Use joint probability (not conditional!!)
It is 0.1125
• Better Bedding in East Hartford, Connecticut, claims 99.4% of
all mattress deliveries are on time. Suppose two mattress
deliveries are selected at random.
Ans: 0.41
• If the picnic was good, what is the probability that it did not
rain on that day
Ans: 0.65
• Suppose 20% of clerical staff in an office smoke cigarrettes.
Research shows that 60% of smokers and 15% of nonsmokers
suffer a breathing illness by age 65.
• (a) Do these percentages indicate that smoking and breathing
illness are independent?
• (b) what's the probability that a randomly selected 65 year old
employee who has breathing illness smokes.
• In a manufacturing plant, there are three machines producing
50%, 30% and 20% respectively of the total output. Out of the
items produced by 1st machine, 4% are defective.
Corresponding percentages for 2nd and 3rd machines are 5 and
3 respectively. An item is drawn at random from the
production line.
• Find the probability that this item is defective .
• Given that this item defective, what is the conditional
probability that it has been produced by machine 2?
• You are trying to assess the effectiveness of an
advertising campaign for a product which is bought by
20% of people. A survey shows that 40% of people
recognise your advertisement while 12% of people
both recognise your advertisement and buy your
product.
• The total of the product column gives the probability of the data – here the
probability that someone recalls the campaign is 27.5%.
• The posterior distribution gives the probability of the states given the data.
Conditional on recalling the campaign, the probability of being a buyer is
32.8% - much higher than the overrall 12%.
• Just calculate 32.8% of k$12,200 which is $3,996. The average value of a target
customer to the company increases by $2,532 if they can just recall the details
of the campaign.
Bayes in Market Research
• You are considering launching a new product- male aftershave fragrance. It is very
difficult for new products in this market. On the basis of your experience, expert
judgment and (to some extent) historical records, you “guestimate” that there is
around 10% chance your product will be a success. It is much more likely to be a
flop. To try and reduce your risk, you are considering retaining a marketing
company to do some test marketing based on product testing and focus groups.
• Q-1: How much does the company’s service help you to reduce the
uncertainty in the product’s outcome? Does it actually reduce uncertainty
at all?
• Q-2: How much would the marketing company’s service be worth to you?
For the sake of argument, suppose that a success makes you INR
1,000,000 and a flop loses you INR 100,00
Example: Disease detection
19
• A game is being played between two players A and B
according to following rule: Each of them rolls a die with faces
1,2,..,6. If the total score is a perfect square, then A receives
as many rupees as total score, otherwise B receives from A as
many rupees as total score.
x: 1 2 3 4 5 6 7
f(x): 0.05 0.10 0.15 0.25 0.20 0.15 0.10
• In 4 weeks the number of trials (i.e. customers) is 32. Using the binomial
distribution with n=32 and p=0.18, the probability of 12 or more is 0.007.
• Over a week the mean is 1.4 and the standard deviation is 1.1. Over a
month, the mean is 5.8 and the standard deviation is 2.17. Over a year, the
mean is 74.9 and the standard deviation is 7.84. Obviously the mean
number of sales increases in proportion to the number of weeks. But the
standard deviation does not. The standard deviation of 7.84 over a year is
a much smaller proportion of the mean of 74.8 during a year, than it is for
a week. The yearly sales are much more predictable than the weekly sales.
• Can a Binomially distributed r.v. have expectation 10 and
variance 3?
• Three contractors A, B, and C are bidding for n contracts.
Suppose A has exactly half the chance as B; B has 4/5th as
likely chance as C to win a contract. Results for different
contracts are independent
• In each year they get to make the claim or not. The years are now trials
and the chance of success is 0.0182. The chance of at least one successful
year in 5 equals 1 minus the chance of zero successful years. The answer is
0.088 from typing 1-binomdist(0,5,.0182,true). With 55% chance of
outperforming this is 0.359.
You are determined that the underlying proportion of customers
who give your products the lowest possible rating for meeting
expectations on a 5 point scale is to be no higher than 10%.
Suppose it is exactly 10%.
• What proportion of people sitting the test will score less than
500?
• What percent of 14-year old boys have more than 240 mg/dl
of cholesterol?
• Scores on the SAT test in 2002 followed approximately
normal distribution N(504, 111) .
• What is the probability the heat pump will last for at least 5
yrs
• Suppose the heat pump lasts for five years. What is the
probability it will last for at least another 5 years.
• Distribution of life in hours of a certain kind of electric bubs is
known to be exponential. It is also known that a typical bulb
of this kind survives for at least 8 hrs with probability 0.64.
Find the probability that out of 5 randomly chosen bulbs of
this kind, at least 4 will survive for 12 hours or more.
• In upstate New York, milk tanker trucks follow a daily routine,
stopping at the same dairy farms every day. Farm output
varies due to weather, time of year, number of cows, and
other factors. From years of recorded data, the mean amount
of milk delivered for processing by a truck is 7750 liters, with a
standard deviation of 150 liters. Suppose 36 trucks are
randomly selected.
• Find the value m such that the probability the sample mean is
less than m is 0.1
• In 2003, a major vendor of antispam software claimed that
the proportion of email consisting of unsolicited spam was
0.40. Suppose 120 email messages are selected at random
Is there any evidence to suggest the new driver has been able to
shorten the route completion time. Use α=0.01
339 348 324 343 365 355 347 343 343 358
328 326 343 358 340 331 326 355 342 349
• Find the 95% C.I. for the difference in population mean pizza-
stone weights.
• Although smoking is banned in many public places in Europe,
a high proportion of European adults (ages 15 and older) still
smoke daily. In a random sample of 250 adults in Bulgaria, 92
were daily smokers, and in a random sample of 300 adults in
Greece, 114 were daily smokers. Is there any evidence to
suggest the true proportion of daily smokers is different in
Bulgaria and in Greece? Use level of significance=0.05