Asa QB

Download as pdf or txt
Download as pdf or txt
You are on page 1of 5

asked earlier

Applied Statistical Analysis - Question Bank

Unit -1
Briefly explain the difference between a census and a sample survey. Why is conducting a sample survey
preferable to conducting a census? ppt1,

Briefly describe the two meanings of the word statistics. Briefly explain the types of statistics. ppt1,

What are different probability sampling methods ? ppt1


Indicate which of the following variables are quantitative and which are qualitative.
a. Number of typographical errors in newspapers
b. Monthly TV cable bills
c. Spring break locations favoured by college students
d. Number of cars owned by families
e. Lottery revenues of states
Also classify them as discrete or continuous.

Explain the difference between cross-section and time-series data. Give an example of each of these two
types of data. ppt1

Explain the meaning of the following terms.


Quantitative variable , Qualitative variable , Discrete variable , Continuous variable , Quantitative data ,
Qualitative data ppt1

Differentiate between independent , dependant and control variables. ppt1-87


Explain classification of data types for quantitative and qualitative data.
ppt1-66
Briefly explain the terms population, sample, representative sample, random sample, sampling with
replacement, and sampling without replacement. ppt1-19

Unit -2
The following data represent the numbers of tornadoes that touched down during 1950 to 1994 in the 12
states that had the most tornadoes during this period . The data for these states are given in the
following order:
1113, 2009, 1374, 1137, 2110, 1086, 1166, 1039, 1673, 2300, 1139, 5490
mean,median,outlier,mode
a. Calculate the mean and median for these data.
b. Identify the outlier in this data set. Drop the outlier and recalculate the mean and median. Which of
these two summary measures changes by a larger amount when you drop the outlier?
c. Which is the better summary measure for these data, the mean or the median? Explain.

Nixon Corporation manufactures computer monitors. The following data are the numbers of computer
monitors produced at the company for a sample of 30 days.

24 32 27 23 33 33 29 25 23 28 freq dist table, relative frequency,


percentage,histogram,polygon
21 26 31 22 27 33 27 23 28 29
31 35 34 22 26 28 23 35 31 27
a. Construct a frequency distribution table using the classes 21–23, 24–26, 27–29, 30–32, and 33–35.
b. Calculate the relative frequencies and percentages for all classes.
c. Construct a histogram and a polygon for the percentage distribution.
d. For what percentage of the days is the number of computer monitors produced in the interval 27–29?
Thirty adults were asked which of the following conveniences they would find most difficult to do without
: television (T), refrigerator (R), air conditioning (A), public transportation (P), or microwave (M).
Their responses are listed below.
R A R P P T R M P A rel freq , bar graph
A R R T P P T R A A
R P A T R P R A P R

a. Prepare a frequency distribution table.


b. Calculate the relative frequencies and percentages for all categories.
c. What percentage of these adults named refrigerator or air conditioning as the convenience that they
would find most difficult to do without?
d. Draw a bar graph for the relative frequency distribution.

According to a 2008 survey of adult men and women, close to 70% of these adults said that men and
women possess equal traits for being leaders. Suppose 70% of the current population of adults holds this
view.
Using the binomial formula, find the probability that in a sample of 16 adults, the number who will hold
this view is
i) exactly 13 ii) exactly 16 iii) at least 11 iv) 9 to 12 probablity

For 50 airplanes that arrived late at an airport during a week, the time by which they were late was
observed.
In the following table, x denotes the time (in minutes) by which an airplane was late, and f denotes the
number of airplanes.

mean , variance , sd

Find the mean, variance, and standard deviation.

One of the most profitable items at A1’s Auto Security Shop is the remote starting system. Let x be the
number of such systems installed on a given day at this shop. The following table lists the frequency
distribution of x for the past 80 days.

probablity distribution,
graph, probability

a. Construct a probability distribution table for the number of remote starting systems installed on a
given day. Draw a graph of the probability distribution.
b. Are the probabilities listed in the table of part a exact or approximate probabilities of various
outcomes? Explain.
c. Find the following probabilities.
i) P(x = 3) ii) P(x >= 3) iii) P(2 <= x <= 4) iv) P(x< 4)

Unit -3
The management at Massachusetts Savings Bank is always concerned about the quality of service
provided to its customers. With the old computer system, a teller at this bank could serve, on average, 22
customers per hour. The management noticed that with this service rate,
the waiting time for customers was too long. Recently the management of the bank installed a new
computer system, expecting that it would increase the service rate and consequently make the customers
happier by reducing the waiting time. To check if the new computer system is more efficient than the old
system, the management of the bank took a random sample of 70 hours and found that during these
hours the mean number of customers served by tellers was 27 per hour with a standard deviation of 2.5.
Testing at the 1% significance level, would you conclude that the new computer system is more efficient
than the old computer
system? ( Population σ is NOT known) pg-410 - tb01

The Telephone Company provides long-distance telephone service in an area. According to the company’s
records, the average length of all long-distance calls placed through this company in 2009 was 12.44
minutes. The company’s management wanted to check if the mean length of the current long-distance
calls is different from 12.44 minutes. A sample of 150 such calls placed through this company produced a
mean length of 13.71 minutes. The standard deviation of all such calls is 2.65 minutes. Using the 2%
significance level, can you conclude that the mean length of all current long-distance calls is different from
12.44 minutes? ( Population σ is known) pg - 395 - tb01

A 2008 survey of low- and middle-income households conducted by Demos, a liberal public policy group,
showed that consumers aged 65 years and older had an average credit card debt of $10,235 and
consumers in the 50- to 64-year age group had an average credit card debt of $9342 at the time of the
survey (USA TODAY, July 28, 2009). Suppose that these averages were based on random samples of 1200
and 1400 people for the two groups, respectively. Further assume that the population standard
deviations for the two groups were $2800 and $2500, respectively. Let µ 1 and µ2 be the respective
population means for the two groups, people aged 65 years and older and people in the 50- to 64-year
age group. ( Population σ1 and σ2 is known)

(a) What is the point estimate of µ1- µ2 ?


(b) Construct a 97% confidence interval for µ1- µ2 . pg- 442 - tb01

A sample of 14 cans of Brand I diet soda gave the mean number of calories of 23 per can with a standard
deviation of 3 calories. Another sample of 16 cans of Brand II diet soda gave the mean number of
calories of 25 per can with a standard deviation of 4 calories. At the 1% significance level, can you
conclude that the mean numbers of calories per can are different for these two brands of diet soda?
Assume that the calories per can of diet soda are normally distributed for each of the two brands and
that the standard deviations for the two populations are equal. ( Population σ1 and σ2 NOT known
but equal )
pg- 450 - tb01

A 2008 survey of low- and middle-income households conducted by Demos, a liberal public policy group,
showed that consumers aged 65 years and older had an average credit card debt of $10,235 and
consumers in the 50- to 64-year age group had an average credit card debt of $9342 at the time of the
survey (USA TODAY, July 28, 2009). Suppose that these averages were based on random samples of 1200
and 1400 people for the two groups, respectively. Further assume that the population standard
deviations for the two groups were $2800 and $2500, respectively. Let µ 1 and µ2 be the respective
population means for the two groups, people aged 65 years and older and people in the 50- to 64-year
age group. ( Population σ1 and σ2 is known)

Test at the 1% significance level whether the population means for the 2008 credit card debts for the two
groups are different. pg - 443 tb01

Briefly explain the meaning of each of the following terms.

Null hypothesis Alternative hypothesis ppt3 - 5,7,8


Critical point(s) Significance level
Non-rejection region Rejection region
Tails of a test Two types of errors

What are the four possible outcomes for a test of hypothesis? Show these outcomes by writing a table.
Briefly describe the Type I and Type II errors.
ppt - 12
Unit -4
What are residuals ? What is a regression line ?
Find equation of a regression line from the observation table given below
X 1 2 3 4 5
Y -3 -1 0 1 2

Find the correlation coefficient of relationship between X and Y from the given observation table.

X 1 2 3 4 5 ppt4 - 2
Y -4 -2 -1 0 2
How is correlation represented graphically ? What are different linear correlation types? What will
you conclude if correlation coefficient between two variable values is 0.42 ?

A car rental company charges $50 a day and 20 cents per mile for renting a car. Let y be the total rental
charges (in dollars) for a car for one day and x be the miles driven. The equation for the relationship
between x and y is

y = 50 + 0.20 x

a. How much will a person pay who rents a car for one day and drives it 100 miles?
b. Suppose each of 20 persons rents a car from this agency for one day and drives it 100 miles. Will each
of them pay the same amount for renting a car for a day or do you expect each person to pay a
different amount? Explain.
c. Is the relationship between x and y exact or nonexact?

What is coefficient of determination ?

From above data of table on monthly incomes and food expenditures of seven households, calculate the
coefficient of determination for the regression line. ppt4 - 57

A regression line intersects Y axis at -5.5 and the change in dependant variable (Y) value per unit
change in independent variable (X) value is 115.50.

Write the equation of regression line and estimate the value of dependant variable if the value of
independent variable is 250.

Is the correlation between X and Y positive or negative?

Unit -5 (Mixed)
Explain the Wilcoxon Signed Rank Test ppt3 - 160
Explain the Mann Whitney U Test. ppt3 - 133

Briefly explain when a one-way ANOVA procedure is used to make a test of hypothesis. Describe the
assumptions that must hold true to apply the one-way analysis of variance procedure ppt3 - 112,
to test hypotheses.
Explain the Kruskal-Wallis Test
ppt3 - 175
A random sample of 20 acres gave a mean yield of wheat equal to 41.2 quintals per acre with a standard
deviation of 3 quintals. Assuming that the yield of wheat per acre is normally distributed, construct a 90%
confidence interval for the population mean.

Explain the various alternatives for decreasing the width of a confidence interval. Which is the best
alternative?

Let x denote the time taken to run a road race. Suppose x is approximately normally distributed with a
mean of 190 minutes and a standard deviation of 21 minutes. If one runner is selected at random, what is
the probability that this runner will complete this road race

a) in less than 160 minutes ? b) in 215 to 245 minutes ?

Briefly explain the main characteristics of a normal distribution. Illustrate with the help of graphs.

A household receives an average of 1.7 pieces of junk mail per day. Find the probability that this
household will receive exactly 3 pieces of junk mail on a certain day. Use the Poisson probability
distribution formula.

You might also like