Name: Suresh Basic Statistics (Module - 4 ( - 2) )
Name: Suresh Basic Statistics (Module - 4 ( - 2) )
1. For each of the following statements, indicate whether it is True/False. If false, explain why.
I. The sample size of the survey should at least be a fixed percentage of the population
size in order to produce representative results.
Sol:- False, A sample size of 30% is good enough, but may or may not be adequate
II. The sampling frame is a list of every item that appears in a survey sample, including
those that did not respond to questions.
Sol:- True
III. Larger surveys convey a more accurate impression of the population than
smaller surveys.
Sol:- True
2. PC Magazine asked all of its readers to participate in a survey of their satisfaction with
different brands of electronics. In the 2004 survey, which was included in an issue of the
magazine that year, more than 9000 readers rated the products on a scale from 1 to 10.
The magazine reported that the average rating assigned by 225 readers to a Kodak compact
digital camera was 7.5. For this product, identify the following:
A. The population
Sol:- All the readers of PC Magazine
F. Any potential sources of bias or other problems with the survey or sample
Sol:- Selection of readers
3. For each of the following statements, indicate whether it is True/False. If false, explain why.
II. If the 95% confidence interval for the number of moviegoers who purchase
concessions is 30% to 45%, this means that fewer than half of all
moviegoers purchase concessions.
Sol:- True
III. The 95% Confidence-Interval for μ only applies if the sample data are nearly
normally distributed.
Sol:- True
X?
A. ¼
B. ½
C. ¾
D. 1
Sol: B. ½
5. In January 2005, a company that monitors Internet traffic (WebSideStory) reported that its
sampling revealed that the Mozilla Firefox browser launched in 2004 had grabbed a 4.6%
share of the market.
I. If the sample were based on 2,000 users, could Microsoft conclude that Mozilla has
a less than 5% share of the market?
Sol:-Yes
II. WebSideStory claims that its sample includes all the daily Internet users. If that’s
the case, then can Microsoft conclude that Mozilla has a less than 5% share of the
market?
Sol:-Yes
6. A book publisher monitors the size of shipments of its textbooks to university bookstores.
For a sample of texts used at various schools, the 95% confidence interval for the size of the
shipment was 250 ± 45 books. Which, if any, of the following interpretations of this interval
are correct?
Sol:- Correct
C. The procedure that produced this interval generates ranges that hold the population
mean for 95% of samples.
Sol:- Correct
D. If we get another sample, then we can be 95% sure that the mean of this second
sample is between 205 and 295.
Sol:- Wrong
E. We can be 95% confident that the range 160 to 340 holds the population mean.
Sol:- Wrong
7. Which is shorter: a 95% z-interval or a 95% t-interval for μ if we know that σ =s?
A. 600
B. 400
C. 550
D. 1000
9. Suppose we want the above margin of error to be based on a 98% confidence level.
What sample size (minimum) must we now use?
A. 1000
B. 757
C. 848
D. 543
10. Examine the following normal Quantile plots carefully. Which of these plots indicates that
the data?
1. Are nearly normal?
Sol:- Fig ‘C’
Sol:-Fig ‘D’
Sol:-Fig ‘B
11. For each of the following statements, indicate whether it is True/False. If false, explain why.
The manager of a warehouse monitors the volume of shipments made by the delivery team.
The automated tracking system tracks every package as it moves through the facility. A
sample of 25 packages is selected and weighed every day. Based on current contracts with
customers, the weights should have μ = 22 lbs. and σ = 5 lbs.
i) Before using a normal model for the sampling distribution of the average package
weights, the manager must confirm that weights of individual packages are normally
distributed.
Sol:- False
12. Auditors at a small community bank randomly sample 100 withdrawal transactions made
during the week at an ATM machine located near the bank’s main branch. Over the
past 2 years, the average withdrawal amount has been $50 with a standard deviation of
$40. Since audit investigations are typically expensive, the auditors decide to not initiate
further investigations if the mean transaction amount of the sample is between $45 and
$55. What is the probability that in any given week, there will be an investigation?
A. 1.25%
B. 2.5%
C. 10.55%
D. 21.1%
E. 50%
Sol:-
Given , Sample Size, n= 100
Population Standard Deviation, s = 40
Sample Mean, X = 50
Standard error SE =(s/sqrt(n))=((40/sqrt(100)) = 4
13. The auditors from the above example would like to maintain the probability of investigation
to 5%. Which of the following represents the minimum number transactions that they
should sample if they do not want to change the thresholds of 45 and 55? Assume that the
sample statistics remain unchanged.
A. 144
B. 150
C. 196
D. 250
E. Not enough information
Sol:-
Minimum number= 250
14. An educational startup that helps MBA aspirants write their essays is targeting
individuals who have taken GMAT in 2012 and have expressed interest in applying to FT
top 20 b-schools. There are 40000 such individuals with an average GMAT score of 720
and a standard deviation of 120. The scores are distributed between 650 and 790 with
a very long and thin tail towards the higher end resulting in substantial skewness.
Which of the following is likely to be true for randomly chosen samples of aspirants?
A. The standard deviation of the scores within any sample will be 120.
Sol:- False
B. The standard deviation of the mean of across several samples will be 120.
Sol:- False
C. The mean score in any sample will be 720.
Sol:- False
D. The average of the mean across several samples will be 720.
Sol:- True
E. The standard deviation of the mean across several samples will be 0.60
Sol:- False
I have just made a mistake , I have given feedback for part 1 in part2. so here I will give feedback for part2.
Q6
A. False, the range is given for 95% CI not for not for 100% CI.
B. False, CI can’t say about particular value it shows the avg. value.
C. True.
Rest of the answers are correct, please continue with your assignments
Hints:
1. Business Problem
1.1. Objective
1.2. Constraints (if any)
2. For each assignment the solution should be submitted in the below format
3. Research and Perform all possible steps for obtaining solution
4. For Basic Statistics explanation of the solutions should be documented in black and white
along with the codes.
One must follow these guidelines as well:
4.1. Be thorough with the concepts of Probability, Central Limit Theorem and Perform the
calculation stepwise
4.2. For True/False Questions, explanation is must.
4.3. R & Python code for Univariate Analysis (histogram, box plot, bar plots etc.) for data
distribution to be attached
5. All the codes (executable programs) should execute without errors