PSM Syllabus
PSM Syllabus
Preamble: Study of this course provides the learners a clear understanding of fundamental
concepts in probability and statistics. This course covers the modern theory of probability and
statistics, important models of sampling, techniques of hypothesis testing and correlation &
regression. The course helps the learners to find varied applications in engineering and science
like disease modelling, climate prediction and computer networks.
Prerequisite: A sound knowledge in Calculus.
Make use of concepts of sampling and theory of estimation to solve application level
CO3
problems (Cognitive Knowledge Level: Apply)
Organize the basic concepts in hypothesis testing and develop decision procedures for the
CO4
most frequently encountered testing problems(Cognitive Knowledge Level: Apply)
Build statistical methods like correlation and regression analysis to interpret experimental
CO5
data (Cognitive Knowledge Level: Apply)
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CO1
CO2
CO3
CO4
CO5
CO6
Computer Science and Engineering (Data Science)
Conduct investigations of
PO4 complex problems PO10 Communication
Assessment Pattern
Remember 30 30 30
Understand 30 30 30
Apply 40 40 40
Analyze
Evaluate
Create
Mark Distribution
150 50 100 3
Computer Science and Engineering (Data Science)
Sample Size through the Approach Based on Precision Rate and Confidence Level, Determination
of Sample Size through the Approach Based on Bayesian Statistics
Text Books
1. Jay L. Devore, Probability and Statistics for Engineering and the Sciences, 8th edition,
Cengage, 2012
2. Research Methodology: Methods and Techniques: C.R. Kothari, New Age International
Publishers
Reference Books
1. HosseinPishro-Nik, Introduction to Probability, Statistics and Random Processes, Kappa
Research, 2014 (Also available online at www.probabilitycourse.com )
2. Sheldon M. Ross, Introduction to probability and statistics for engineers and scientists, 4th
edition, Elsevier, 2009.
3. T. VeeraRajan, Probability, Statistics and Random processes, Tata McGraw-Hill,2008
4. B.S. Grewal, Higher Engineering Mathematics, Khanna Publishers, 36 Edition, 2010
5. Levin R.I. and Rubin D.S., Statistics for Management, 7th edition, Prentice Hall of India
Pvt. Ltd., New Delhi, 2001.
6. Srivastava TN, Shailaja Rego, Statistics for Management, Tata McGraw Hill, 2008.
7. Anand Sharma, Statistics for Management, Himalaya Publishing House, Second Revised
edition, 2008.
8. Goon A.M., Gupta M.K. and Dasgupta B. (2002): Fundamentals of Statistics, Vol. I & II,
8th Edition. The World Press, Kolkata.
9. Miller, Irwin and Miller, Marylees (2006): John E. Freund’s Mathematical Statistics with
Applications, (7th Edition.), Pearson Education, Asia.
10. Sampling of Populations: Methods and Applications (2008): Paul S. Levy , Stanley
Lemeshow (Fourth Edition), John Wiley &Sons
Computer Science and Engineering (Data Science)
2. You are told that a 95% CI for expected lead content when traffic flow is 15, based on a
sample of n=10 observations is (462.1, 597.7). Calculate a CI with confidence level 99%
for expected lead content when traffic flow is 15.
3. A trucking company considered a multiple regression model for relating the dependent
variable y=total daily travel time for one of its drivers (hours) to the predictors x1=distance
travelled (miles) and x2=the number of deliveries made. Suppose that the model equation
is Y = -0.800 +0.060 x1+ 0.900x2+ ε. What is the mean value of travel time when distance
traveled is 50 miles and three deliveries are made?
Computer Science and Engineering (Data Science)
QP CODE:
PART A
1. Let X denote the number that shows up when an unfair die is tossed. Faces 1
to 5 of the die are equally likely, while face 6 is twice as likely as any other.
Find the probability distribution, mean and variance of X.
3. A random variable has a normal distribution with standard deviation 10. If the
probability that it will take on a value less than 82.5 is 0.82, what is the probability
that it will take on a value more than 58.3?
6. From a random sample of 36 New Delhi civil service personnel, the mean age
and the sample standard deviation were found to be 40 years and 4.5 years
Computer Science and Engineering (Data Science)
respectively. Construct a 95 per cent confidence interval for the mean age of
civil servants in New Delhi.
9. Let the test statistic T have a t distribution when H0 is true. Give the significance
level for the following situation Ha: µ>µ0, df=15, rejection region t ≥ 3.733.
10. Calculate the regression coefficient and obtain the lines of regression for the
following data
(10x3=30)
X 1 2 3 4 5 6 7
Y 9 8 10 12 11 13 14
Part B
(Answer any one question from each module. Each question carries 14 Marks)
11. (a) The probability mass function of a discrete random variable is P(x) = kx ; (7)
x = 1,2,3 where k is positive constant. Find (i) the value of k (ii) P(X ≤2)
(iii) E[X] (iv) var(1-X).
(b) Find the mean and variance of a binomial random variable (7)
OR
12. (a) Accidents occur at an intersection at a Poisson rate of 2 per day. What is the (7)
probability that there would be no accidents on a given day? What is the
probability that in January there are at least 3 days (not necessarily
consecutive) without any accidents?
(b) One fair die is rolled. Let X denote the number on the die and Y = 0 or 1, (7)
according as the die shows an even number or odd number. Find (i) the joint
probability distribution of X and Y, (ii) the marginal distributions. (iii) Are X
and Y independent?
Computer Science and Engineering (Data Science)
13. (a) The IQ of an individual randomly selected from a population is a normal (7)
distribution with mean 100 and standard deviation 15. Find the probability
that an individual has IQ (i) above 140 (ii) between 120 and 130.
(b) A continuous random variable X is uniformly distributed with mean 1 and (7)
variance 4/3. Find P(X < 0)?
OR
14. (a) The joint density function of random variables X and Y is given by (7)
(b) The lifetime of a certain type of electric bulb may be considered as an (7)
exponential random variable with mean 50 hours. Using central limit
theorem, find the approximate probability that 100 of these electric bulbs
will provide a total of more than 6000 hours of burning time.
15. (a) A market research survey in which 64 consumers were contacted and states (7)
that 64 percent of all consumers of a certain product were motivated by the
product’s advertising. Find the confidence limits for the proportion of
consumers motivated by advertising in the population, given a confidence
level equal to 0.95.
(b) Determine the size of the sample for estimating the true weight of the cereal (7)
containers for the universe with N = 5000 on the basis of the following
information:
(i) the variance of weight = 4 ounces on the basis of past records.
(ii) estimate should be within 0.8 ounces of the true average weight with 99%
probability.
OR
16. (a) The foreman of ABC mining company has estimated the average quantity of (7)
iron ore extracted to be 36.8 tons per shift and the sample standard deviation
to be 2.8 tons per shift, based upon a random selection of 4 shifts. Construct
a 90 percent confidence interval around this estimate.
Computer Science and Engineering (Data Science)
(b) What should be the size of the sample if a simple random sample from a (7)
population of 4000 items is to be drawn to estimate the percent defective
within 2 per cent of the true value with 95.5 per cent probability? What would
be the size of the sample if the population is assumed to be infinite in the
given case?
OR
18. (a) Lightbulbs of a certain type are advertised as having an average lifetime of
750 hours. The price of these bulbs is very favorable, so a potential customer
has decided to go ahead with a purchase arrangement unless it can be
conclusively demonstrated that the true average lifetime is smaller than what
is advertised. A random sample of 50 bulbs was selected, the lifetime of each
bulb determined, and the appropriate hypotheses were tested using Minitab,
resulting in the accompanying output.
Variable N Mean StDev SEMean Z P-Value
lifetime 50 738.44 38.20 5.40 -2.14 0
(7)
What conclusion would be appropriate for a significance level of 0.05? A
significance level of 0.01? What significance level and conclusion would you
recommend?
(b) The recommended daily dietary allowance for zinc among males older than
age 50 years is 15 mg/day. The article “Nutrient Intakes and Dietary Patterns
of Older Americans: A National Study” reports the following summary data
on intake for a sample of males age 65–74 years: n=115, x̄ =11.3, and s=6.43.
(7)
Does this data indicate that average daily zinc intake in the population of all
males ages 65–74 falls below the recommended allowance?
19. The flow rate y (m3/min) in a device used for air-quality measurement
depends on the pressure drop x (inches of water ) across the device’s filter.
Suppose that for x values between 5 and 20, the two variables are related
according to the simple linear regression model with true regression line
y = -0.12 + 0.095x
Computer Science and Engineering (Data Science)
(a)What is the expected change in flow rate associated with a 1 inch increase (7)
in pressure drop? Explain.
(b)What change in flow rate can be expected when pressure drop decreases (7)
by 5 inches?
OR
20. Suppose that in a certain chemical process the reaction time y (hr) is related to
the temperature (°F) in the chamber in which the reaction takes place
according to the simple linear regression model with equation y = 5.00 - 0.01x
and σ =0.075
(a)What is the expected change in reaction time for a 1°F increase and 10°F (7)
increase in temperature?
(b)What is the expected reaction time when temperature is 200°F and 250°F? (7)
Teaching Plan
No. of
Lecture
No Contents Hours
(45 hrs)
3.4 Concept of Standard Error, Estimation , Estimating the Population Mean(µ) 1 hour