0% found this document useful (0 votes)
11 views18 pages

Semester I

This document contains a question bank on descriptive statistics and probability. It includes 33 questions covering topics like mean, variance, standard deviation, probability, independence of attributes, regression, correlation, and Bayes' theorem. The questions range from proving identities and relationships to real-world word problems involving probability calculations.

Uploaded by

Prachi Panwar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views18 pages

Semester I

This document contains a question bank on descriptive statistics and probability. It includes 33 questions covering topics like mean, variance, standard deviation, probability, independence of attributes, regression, correlation, and Bayes' theorem. The questions range from proving identities and relationships to real-world word problems involving probability calculations.

Uploaded by

Prachi Panwar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 18

STAT C-101

DESCRIPTIVE STATISTICS
Descriptive Statistics Question Bank

1. In a frequency table, the upper boundary of each class interval has a constant ratio to the
lower boundary. Show that the geometric mean G may be expressed by the following
formula:

∑ ( )

where is the logarithm of the mid value of the first interval and c is the logarithm of the
ratio between upper and lower boundaries.
2. Prove that the sum of the squares of the deviations of a set of observations is minimum
when taken about mean.
3. A variate takes values a, ar, ar2, … , arn-1 each with frequency unity. Compute arithmetic
mean (A), geometric mean (G) and harmonic mean (H) and show that AH = G2.

4. If the deviations Xi = xi - M are small compared with the value of the mean M so that
(Xi/ M)3 and higher powers are neglected, prove that
( )

hence prove that

( )
√ ,

where G is the geometric mean of the values x1, x2, … , xn and V is the coefficient of
dispersion.
5. Show that in a discrete series if deviations xi = Xi – M, are small compared with the value of
the mean M so that (x/M)3 and higher powers of (x/M) are neglected,
(i) ( )

(ii) ( ) ( ) approx.
√ √
where H is the harmonic mean of the values x1, x2, … , xn and is the variance.

6. Find the mean deviation about mean and standard deviation of A.P. a, a+d, a+2d, ..., a+2nd
and verify that standard deviation is greater than mean deviation about mean.

7. Let r be the range and s be the standard deviation of a set of observations x1,x2,...xn. Prove

that s≤ r. Also prove that ( ) where


[ ∑( ̅) ]

8. If n1, n2 are the sizes ̅̅̅, ̅̅̅ the means and σ1, σ2 the standard deviations of two series, then
obtain the mean and variance of the combined series of size n1 + n2.
9. Find mean square deviation and variance if the variable takes values 0, 1, 2, ..., n with
frequencies given by the terms of binomial series ( ), ( ), ..., ( ) respectively.
10. For a random variable x, central moments of all order exist, then show that

where µj is the jth central moment.

11. If r is the rth absolute moment about zero, then the mean value of
{ | |( ) | |( )
} .
Show that (i) , and
( )
(ii) .
12. Establish the relationship between the moments about mean, in terms of moments about
any arbitrary point A. What are the Sheppard’s corrections to central moments.

13. Discuss the principle of least squares. Derive the normal equations for fitting the curve
Y=a*exp(bX + cX2) to the given set of n points *( ) +.

14. Two variables X and Y are known to be related to each other by the relation .
Derive the normal equations for fitting the given curve and estimate the constants ‘a’ and
‘b’from a given set of n points *( ) +.

15. Discuss the principle of least squares. Derive the normal equations for fitting the curve
Y=a*exp(cX2) to the given set of n points *( ) +

16. If X and Y are independent random variables, show that


r(X + Y, X - Y) = r2(X, X + Y) - r2(Y, X + Y)
where, r(X + Y, X - Y) is the correlation coefficient between X + Y and X - Y, r(X, X +
Y) is the correlation coefficient between X and X + Y and r(Y, X + Y) is the correlation
coefficient between Y and X + Y.
17. Define Spearman’s rank correlation coefficient. Let x1, x2, …, xn be the ranks of n
individuals according to a character A and y1, y2, …, yn be the corresponding ranks of the
individuals according to another character B. Obtain the rank correlation coefficient
between them.
18. X and Y ate two random variables with variances σx2 and σy2 respectively and r is the
coefficient of correlation between them. If U = X + kY and V = X + Y, find the value
of k so that U and V are uncorrelated.
19. Given that Y = kX + 4 and X = 4Y + 5 are the lines of regression of Y on X and X on Y
respectively, show that 0 ≤ k ≤ 1/4. If k = , find mean of two variables and the coefficient
of correlation between them.

20. Show that ( )( )


Also deduce that (i) R1.23 ≥ r12

(ii)
( )( )
( ) , provided all coefficients of zero order
( )
are equal to ρ.

21. X and Y are two random variables with same mean and the two regression equations are
Y = aX + b and X = αY + β. Show that . Also find the common mean.
22. If r12 and r13 are given, show that r23 must lie in the range:

( )

If r12 = k and r13 = - k, show that r23 will lie between -1 and 1 - 2k2.
23. What do you mean by independence of attributes? Give a criterion of independence of
attributes. If δ = (AB) – (AB)0, where (AB)0 is the value of (AB) under the hypothesis that
attributes A and B are independent, prove that
[(A) – (α)] [(B) –(β)] + 2Nδ = (AB)2 + (αβ)2 - (Aβ)2 - (αB)2

( ) ( ) ( ) ( ) ( ) ( )
24. If and ,
then, using the conditions of consistency of attributes show that
0 < y ≤ x ≤ ¼.

25. Given that (A) = (α) = (B) = (β) = (C) = (γ) = N/2 and (ABC) = (αβγ), then show that
2(ABC) = (AB) + (AC) + (BC) – N/2.
26. Show that for n attributes A1, A2, .... , An

( ) ( ) ( ) ( ) ( ) ( )

where N is the total number of observations.

27. Let A1, A2, …., An be the events in the domain of probability function P, such that
n  n


P  Ai  ≤  P[ A ] . Using this relationship, prove that:
i
 i 1  i 1

n  n
(i) P  Ai   1   P[ A ] , and i
 i 1  i 1

n  n
(ii) P  Ai    P[ A ] i  (n  1) .
 i 1  i 1

28. Three newspapers A, B, and C are published in a certain city. It is estimated from the
survey that of the adult population 20% read A, 16% read B, 14% read C, 8% read both A
and B, 5% read both A and C, 4% read both B and C, 2% read all three. Find the
percentage that read at least one of the papers? Also find the percentage that read both A
and B but does both read C?
29. A sportsman’s chance of shooting an animal at a distance r (>a) is a2/r2. He fires when r =
2a, and if he misses he reloads and fires when r = 3a, 4a, …. . If he misses at distance na,
the animal escapes. Find the probability that the animal escapes.

30. It is 8: 5 against the wife who is 40 years old living till she is 70 and 4: 3 against her
husband now 50 living till he is 80. Find the probability that (i) both will be alive, (ii) only
one will be alive and (iii) at least one will be alive 30 years hence.

31. A and B alternately cut a pack of cards and the pack is shuffled after each cut. If A starts
and the game is continued until one cuts a diamond, what are the respective chances of A
and B first cutting the diamond?

32. In a class of 75 students, 15 were considered to be very intelligent, 45 as medium and the
rest below average. The probability that a very intelligent student fails in a viva
examination is 0.005; the medium student failing has a probability 0.05; and the
corresponding probability for a below average student is 0.15. If a student is known to have
passed the viva-voce examination, what is the probability that he is below average?

33. A problem in Statistics is given to three students A, B and C, whose chance of solving it
are ½, ¾ and ¼ respectively. What is the probability that the problem will be solved if all
of them try independently.
34. State Bayes’ theorem.
In answering a multiple choice test, an examinee either knows the answer or he guesses or
he copies. Suppose each question has four choices. Let the probability that examinee copies
the answer is 1/6 and the probability that he guesses is 1/3. The probability that his answer
is correct given that he copied the answer is 1/8. Suppose an examinee answers a question
correctly, what is the probability that he really knows the answer?
STAT -C102
CALCULUS
Question Bank
Paper Name: Calculus
B.Sc. (Hon.) – 1st year (1st Sem.)

Set-A
Set- B
GENERIC ELECTIVE: 1
STATISTICAL METHODS
Question Bank for GE 1 – Statistical Methods

1. The mean and standard deviation of 20 items is found to be 10 and 2 respectively.


At the time of checking it was found that one item 8 was incorrect. Calculate the
mean and standard deviation if (i) the wrong item is omitted, and (ii) it is
replaced by 12.
2. A motor when travelling from rest travels the first twentieth of a mile at 6 miles
per hour and the next three twentieths at respectively 8, 12, 24 miles per hour.
Use the most appropriate measure of central tendency to find the average speed
of motor?
3. A person travels 900 miles by train at an average speed of 60 miles per hour; 300
miles by boat at an average of 25 miles per hour; 400 miles by plane at 350 miles
per hour and finally 15 miles by taxi at 25 miles per hour. Find the average speed
of the person for the entire distance using the most appropriate measure of central
tendency.
4. For the following frequency distribution:
Class 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70
Interval
Frequency 8 7 12 28 20 10 10
Represent the frequency distribution by suitable graph, compute mode and locate
it graphically.

5. The mean weight of 150 students in a certain class is 60 kilograms. The mean
weight of boys in the class is 70 kilograms and that of girls is 55 kilograms. Find
the number of boys and number of girls in the class.
6. For a group of 50 male workers the mean and standard deviation of their daily
wages are Rs 63 and Rs 9 respectively. For a group of 40 female workers these
values are Rs 54 and Rs 6 respectively. Find the mean and variance of the
combined group of 90 workers.
7. Mean monthly salary of 12 workers and 3 managers in a factory was Rs. 26000.
When one of the managers whose salary was Rs. 27500 was replaced with a new
manager, the mean salary of the team went down to Rs. 25000. What is the salary
of the new manager?
8. Establish the relationship between the moments about mean, in terms of moments
about any arbitrary point A.
9. The first three moments about point A = 5 are 2, 20 and 40. Determine mean and
first three central moments.
10. Explain with the help of a suitable example the term “dispersion”. Discuss
different measures of dispersion in details.
11. Karl Pearson’s coefficient of skewness of a distribution is 0.32, the standard
deviation is 6 and mean is 29. Find the mode and hence median of the
distribution.
12. Explain what are skewness and kurtosis. Also explain the methods of measuring
skewness and kurtosis of a frequency distribution.
13. Obtain correlation coefficient r(X, Y) based on the following data:

𝑛 = 30, ∑30𝑖=1 𝑥𝑖 = 150, ∑30𝑖=1 𝑦𝑖 = 90, ∑30𝑖=1 𝑦𝑖 2 = 450, ∑30𝑖=1 𝑥𝑖2 = 1200,

∑ 30
𝑖=1 𝑥𝑖𝑦𝑖 = 630.

Also interpret the result.

14. The coefficient of Spearman’s rank correlation coefficient between


microeconomics and statistics marks of 10 students was found to be 0.5. It was
later discovered that the difference in ranks in two subjects obtained by one of the
students was wrongly taken as 3 instead of 7. Find the correct value of coefficient
of rank correlation coefficient.
15. Calculate the Spearman’s rank correlation coefficient from the following data:

5 10 6 3 19 5 6 12 8 2 10 19
X
Y 8 3 2 9 12 3 17 18 22 12 17 20

16. If r12 and r13 are given, show that r23 must lie in the range:
1
𝑟12𝑟13 ± (1 − 𝑟122 − 𝑟132 + 𝑟122𝑟132)2
If r12 = k and r13 = -k, show that r23 will lie between -1 and 1 – 2k2.

17. If r12=0.6, r13= 0.7 and r23= 0.8, then obtain R3.122 and r23.1. Interpret the results.
18. State the principle of least squares and use it to fit a curve of the form Y = aebX.
19. State the principle of least square and use it to obtain the curve of best fit of the
form Y = a + bX + cX2.
20. The following results were obtained in the analysis of data on X and Y.
X Y

Average 4.9 28

Variance 4 25

Correlation coefficient between X and Y is 0.8.

(i) Obtain the two lines of regression.


(ii) Estimate value of Y when X = 10.
(iii) Estimate the value of X when Y = 20.
21. From the following data:
X 15 20 25 30 35 40 45 50

Y 10 15 20 20 22 25 26 28

(i) Obtain the two regression coefficients.


(ii) Also obtain the two lines of regression.

22. In a survey of population of 12000, information is gathered regarding three


attributes A, B and C. In the usual notations,
(A) = 980, (AB) = 450, (ABC)= 130, (B) = 1190, (AC) = 280, (C) = 600 and
(BC) = 250

(i) Compute (αβγ), (Aβγ), (αBC) and (ABγ).


(ii) Obtain the coefficient of association between A and B. Interpret the result.
23. The following table gives the distribution of students and also of regular players
among them, according to age completed in years:
Age in years 15 16 17 18 19 20

No. of students 250 200 150 120 100 80

Regular Players 200 150 90 48 30 12

Calculate the coefficient of association between majority and playing habit, on


the assumption that majority is attained in 18th year.

24. 1000 candidates of both sexes appeared at an examination. The girls outnumbered
the boys by 20% of the total. The number of candidates, who passed exceed the
number failed by 480. Equal number of girls and boys failed in the examination.
Prepare a 2 X 2 table and find coefficient of association. Comment.
25. State the principle of least squares. Obtain the normal equations for fitting the
exponential curve y=abx. Given the following data N = 10, ∑Ui = 5.56, ∑Xi = 55,
∑Ui Xi = 39.83 and ∑Xi2 = 385 where Ui =log Yi. Obtain the estimates of
A = log a and B = log b.

You might also like