MATHEMATICS FOR COMPUTER SCIENCE (BCS301) 2023

MODULE-3

Statistical Inference-1: Introduction. Sampling distributions, standard error. Levels of significance.
Tests of significance. Confidence limits. Simple sampling of attributes.
Test of significance for large samples. Comparison of large samples. Problems.

Population: A population consists of the totality of the observations with which we are concerned.

Examples: Groups of people, animals, or all possible outcomes of some system.

Sampling: A small section selected from the population is called a sample, and the process of drawing a
sample is called sampling. It is essential that the sample be a random selection.

Simple sampling: Random sampling in which each event has the same probability 𝑝 of success, and in which
the chance of success of each event is independent of the outcomes of previous trials, is known as
simple sampling.

Parameters: The statistical constants of the population such as mean (𝜇), standard deviation (𝜎) etc.
are called the parameters.

Statistic: The statistical constants for the sample drawn from the given population, such as mean ($\bar{x}$),
standard deviation (𝑆) etc., are called statistics.

Statistical inference: Generalization from the sample to the population is called statistical inference.

Sampling distribution: Consider all possible samples of size 𝑛 which can be drawn at random from a given
population. The frequency distribution of the means of these samples is called the sampling distribution of
the means. Similarly, the frequency distribution of the standard deviations of these samples is called the
sampling distribution of the S.D., and so on.

Standard error: The standard deviation of the sampling distribution is called the standard error (S.E.).
Thus the standard deviation of the sampling distribution of the means is called the standard error of the means.

Precision: The reciprocal of the standard error is called precision.

Statistical hypothesis: To make decisions about populations on the basis of sample information, we make
certain assumptions about the populations. Such assumptions are called statistical hypotheses.

Testing a hypothesis: First assume that the hypothesis is correct, and then compute the probability of the
observed sample. If this probability is less than a pre-assigned value, the hypothesis is rejected.

Errors:
Type I error: If a hypothesis is rejected while it should have been accepted, then we say that a type I error
has been committed.

Type II error: If a hypothesis is accepted while it should have been rejected, then we say that a type II error
has been committed.

Null hypothesis: The hypothesis formulated for the sake of rejecting it, under the assumption that it is true, is
called the null hypothesis and is denoted by $H_0$.

DEPARTMENT OF SCIENCE & HUMANITIES /C.E.C.


Level of significance: The probability level below which the hypothesis is rejected is called the
level of significance.

Critical region: The region of sample values for which the hypothesis is rejected is known as the
critical region.

Test of significance: The procedure which enables us to decide whether to accept or reject the hypothesis is
called test of significance.

Confidence limits: The 95% confidence limits for a sample statistic 𝑆 used to estimate 𝜇 are 𝑆 ± 1.96𝜎,
and the 99% confidence limits are 𝑆 ± 2.58𝜎, where 𝜎 is the standard error of 𝑆.

Simple sampling of attributes: The expected number of successes in a sample of size $n$ is $np$,
and the standard deviation is $\sqrt{npq}$, where $q = 1 - p$.

Mean proportion of successes $= \dfrac{np}{n} = p$.

Standard error of the proportion of successes $= \sqrt{\dfrac{pq}{n}}$.

Precision of the proportion of successes $= \sqrt{\dfrac{n}{pq}}$.
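The formulas above can be evaluated directly. The following is a minimal sketch, not part of the original notes; the function name `proportion_summary` and the sample values (p = 0.5, n = 400) are chosen here purely for illustration.

```python
import math


def proportion_summary(p: float, n: int) -> dict:
    """Evaluate the simple-sampling formulas for success probability p and sample size n."""
    if not 0 < p < 1 or n <= 0:
        raise ValueError("need 0 < p < 1 and n > 0")
    q = 1 - p
    se = math.sqrt(p * q / n)  # standard error of the proportion of successes
    return {
        "expected_successes": n * p,            # np
        "sd_successes": math.sqrt(n * p * q),   # sqrt(npq)
        "se_proportion": se,                    # sqrt(pq/n)
        "precision": 1 / se,                    # sqrt(n/(pq)), reciprocal of the S.E.
    }


# Illustrative values only: a fair coin tossed 400 times.
summary = proportion_summary(0.5, 400)
print(summary)
```

Note that the precision grows like the square root of n, so quadrupling the sample size only doubles the precision.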

Test of significance for large samples: If $x$ is the observed number of successes in a large sample, with
expected number $\mu = np$ and standard deviation $\sigma = \sqrt{npq}$, then the standard normal variate is
$z = \dfrac{x - \mu}{\sigma}$.

1. If |𝑧| < 1.96, difference between the observed and expected number of successes is not significant.
2. If |𝑧| > 1.96, difference is significant at 5% level of significance.
3. If |𝑧| > 2.58, difference is significant at 1% level of significance.
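The three rules above can be sketched as a small routine. This is an illustrative sketch only; the function names `z_for_successes` and `verdict` are introduced here and are not from the notes. The sample figures (216 successes in 400 trials with p = 0.5, and 3240 successes in 9000 trials with p = 1/3) are the ones used in the worked examples of this section.

```python
import math


def z_for_successes(x: int, n: int, p: float) -> float:
    """Standard normal variate z = (x - mu) / sigma with mu = np, sigma = sqrt(npq)."""
    q = 1 - p
    mu = n * p
    sigma = math.sqrt(n * p * q)
    return (x - mu) / sigma


def verdict(z: float) -> str:
    """Apply the 1.96 / 2.58 thresholds to |z|, strongest conclusion first."""
    if abs(z) > 2.58:
        return "significant at 1% level"
    if abs(z) > 1.96:
        return "significant at 5% level"
    return "not significant"


z_coin = z_for_successes(216, 400, 0.5)
z_die = z_for_successes(3240, 9000, 1 / 3)
print(round(z_coin, 4), verdict(z_coin))
print(round(z_die, 4), verdict(z_die))
```

The 1% check is made first because any |z| above 2.58 is also above 1.96.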
Examples:
1. A coin was tossed 400 times and the head turned up 216 times. Test the hypothesis that the coin is unbiased
at the 5% level of significance.
Solution: Suppose the coin is unbiased. Then the probability of getting a head in each toss is 0.5.
Therefore the expected number of successes is $\mu = np = 0.5 \times 400 = 200$,
and the observed number of successes is $x = 216$.
Since $\mu = 200$ and $\sigma = \sqrt{npq} = \sqrt{100} = 10$, we get $z = \dfrac{x - \mu}{\sigma} = \dfrac{16}{10} = 1.6 < 1.96$.

Hence the difference between the observed and expected number of successes is not significant.
That is, the hypothesis that the coin is unbiased is not rejected at the 5% level of significance.
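To connect this example with the definitions of type I error and level of significance, the sketch below computes the exact probability that a truly unbiased coin tossed 400 times would give |z| > 1.96 and so be wrongly rejected. It sums exact binomial probabilities rather than using the normal approximation; the function name `exact_rejection_probability` is an assumption made for illustration and does not come from the notes.

```python
from math import comb


def exact_rejection_probability(n: int, z_crit: float = 1.96) -> float:
    """P(|z| > z_crit) for a fair coin tossed n times, using exact binomial counts."""
    mu = n / 2                  # np with p = 1/2
    sigma = (n * 0.25) ** 0.5   # sqrt(npq) with p = q = 1/2
    # Count the outcomes (weighted by C(n, k)) that fall in the critical region.
    rejected = sum(comb(n, k) for k in range(n + 1) if abs(k - mu) / sigma > z_crit)
    return rejected / 2 ** n    # each of the 2^n toss sequences is equally likely


alpha = exact_rejection_probability(400)
print(f"probability of a type I error: {alpha:.4f}")
```

The result should come out close to 0.05, which is why 1.96 is used as the threshold for the 5% level.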

2. A die was thrown 9000 times and a throw of 5 or 6 was obtained 3240 times. On the assumption of random
throwing, do the data indicate an unbiased die?
Solution: Suppose the die is unbiased. Then the probability of throwing 5 or 6 in each throw is $p = \dfrac{1}{3}$.
Therefore the expected number of successes is $\mu = np = \dfrac{9000}{3} = 3000$,
and the observed number of successes is $x = 3240$.

Since $\mu = 3000$ and $\sigma = \sqrt{npq} = \sqrt{2000} = 44.7214$, we get
$z = \dfrac{x - \mu}{\sigma} = \dfrac{240}{44.7214} = 5.3666 > 2.58$.

Hence the difference is significant at the 1% level of significance, and the hypothesis is rejected at the
1% level of significance. That is, the die is biased.

3. In a locality containing 18000 families, a sample of 840 families was selected at random. Of these 840
families, 206 were found to have a monthly income of Rs 3000 or less. It is desired to estimate how
many of the 18000 families have a monthly income of Rs 3000 or less. Within what limits would you place
your estimate at the 1% level of significance?
Solution: Here $p = \dfrac{206}{840} = \dfrac{103}{420}$ and $q = \dfrac{317}{420}$.

∴ The standard error of the proportion of families having a monthly income of Rs 3000 or less is
$\sqrt{\dfrac{pq}{n}} = \sqrt{\dfrac{103 \times 317}{420 \times 420 \times 840}} = 0.0148 = 1.48\%$.

Since $p = \dfrac{206}{840}$, the mean proportion of successes is 24.52%.

The limits are $24.52 \pm 2.58 \times 1.48$, that is, 20.70% to 28.34%.

Therefore 3726 to 5101 families are expected to have a monthly income of Rs 3000 or less.
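The limits in this example can be checked numerically. The sketch below is illustrative only; the helper name `proportion_limits` is an assumption made here. Because it carries full precision instead of rounding the standard error to 1.48%, the family counts it prints may differ from the hand calculation by a few units.

```python
import math


def proportion_limits(successes: int, n: int, z: float = 2.58) -> tuple:
    """Limits p +/- z * sqrt(pq/n) for an observed proportion of successes."""
    p = successes / n
    se = math.sqrt(p * (1 - p) / n)  # standard error of the proportion
    return p - z * se, p + z * se


low, high = proportion_limits(206, 840)   # 99% limits, z = 2.58
families = 18000
print(f"proportion between {low:.4f} and {high:.4f}")
print(f"families between {low * families:.0f} and {high * families:.0f}")
```

Passing `z=1.96` instead would give the narrower 95% limits.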
Exercise:
1. A die is tossed 960 times and it falls with 5 upwards 184 times. Is the die biased?
2. 12 dice are thrown 3086 times and a throw of 2, 3, 4 is reckoned as a success. Suppose that 19142 throws of
2, 3, 4 have been made out. Do you think that this observed value deviates from the expected value? If so,
can the deviation from the expected value be due to fluctuations of simple sampling?
3. Balls are drawn from a bag containing equal numbers of black and white balls, each ball being replaced
before another is drawn. In 2250 drawings, 1018 black and 1232 white balls have been drawn. Do you suspect
some bias on the part of the drawer?
4. A sample of 1000 days is taken from meteorological records of certain district and 120 of them are found to
be foggy. What are the probable limits to the percentage of foggy days in the district?
5. In a group of 50 first cousins there were found to be 27 males and 23 females. Ascertain if the observed
proportions are inconsistent with the hypothesis that the sexes should be in equal proportion.

6. A random sample of 500 apples was taken from a large consignment and 65 were found to be bad. Estimate
the proportion of the bad apples in the consignment and standard error of the estimate. Deduce that the
percentage of bad apples in the consignment is between 8.5 and 17.5 .

7. 400 children are chosen in an industrial town and 150 are found to be underweight. Assuming the conditions
of simple sampling, estimate the percentage of children who are underweight in the industrial town and
assign limits within which the percentage probably lies.


8. In a sample of 500 people from a state 280 take tea, and rest take coffee. Can we assume that tea and coffee
are equally popular in the state at 5% level of significance?

Comparison of large samples: Two large samples of sizes $n_1$ and $n_2$ are taken from two populations, giving
proportions of successes $p_1$ and $p_2$ respectively.
1. If the proportions are the same in the two populations, then the common proportion of successes is
$p = \dfrac{n_1 p_1 + n_2 p_2}{n_1 + n_2}$, with $q = 1 - p$.
If $e$ is the standard error of the difference between $p_1$ and $p_2$, then $e^2 = \dfrac{pq}{n_1} + \dfrac{pq}{n_2}$.
2. If the proportions are not the same in the two populations, then
$e^2 = \dfrac{p_1 q_1}{n_1} + \dfrac{p_2 q_2}{n_2}$.

$\therefore z = \dfrac{|p_1 - p_2|}{e}$.
If $z > 2.58$, the difference between $p_1$ and $p_2$ is a real one.
If $z < 1.96$, the difference may be due to fluctuations of simple sampling.
If $1.96 < z < 2.58$, the difference is significant at the 5% level of significance.

Examples:

1. In a city A 20% of a random sample of 900 school boys had a certain slight physical defect.
In another city B, 18.5% of a random sample of 1600 school boys had the same defect. Is the difference
between the proportions significant?
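Ans: Given that $n_1 = 900$, $n_2 = 1600$, $p_1 = 0.2$, $p_2 = 0.185$,
$\therefore p = \dfrac{n_1 p_1 + n_2 p_2}{n_1 + n_2} = \dfrac{180 + 296}{2500} \approx 0.19$, $q = 1 - p = 0.81$.

$e^2 = \dfrac{pq}{n_1} + \dfrac{pq}{n_2} \approx 0.00027 \implies e \approx 0.016$.

$\therefore z = \dfrac{|p_1 - p_2|}{e} = \dfrac{0.015}{0.016} \approx 0.94 < 1.96$. The difference between the proportions is not significant.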

2. In two large populations there are 30% and 25% respectively of fair-haired people. Is this difference likely to
be hidden in samples of 1200 and 900 respectively from the two populations?
Ans: Given that $p_1 = 0.3$, $p_2 = 0.25$, $n_1 = 1200$ and $n_2 = 900$,
$e^2 = \dfrac{p_1 q_1}{n_1} + \dfrac{p_2 q_2}{n_2} = \dfrac{0.3 \times 0.7}{1200} + \dfrac{0.25 \times 0.75}{900} \approx 0.00038 \implies e \approx 0.0195$.
$\therefore z = \dfrac{|p_1 - p_2|}{e} = \dfrac{0.05}{0.0195} \approx 2.56 > 1.96$. The difference between the proportions is significant at the 5% level.

Hence it is unlikely that the difference will be hidden.
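Both cases of the comparison test can be sketched in one routine and applied to the figures of the two examples above. This is a hedged sketch: the function name `z_two_proportions` and its `pooled` flag are introduced here for illustration. The values are computed without intermediate rounding, so they can differ slightly in the second decimal place from a hand calculation that rounds $e$.

```python
import math


def z_two_proportions(p1: float, n1: int, p2: float, n2: int, pooled: bool = True) -> float:
    """z = |p1 - p2| / e for two large samples.

    pooled=True  : proportions assumed equal, e^2 = pq/n1 + pq/n2 with the common p.
    pooled=False : proportions assumed different, e^2 = p1*q1/n1 + p2*q2/n2.
    """
    if pooled:
        p = (n1 * p1 + n2 * p2) / (n1 + n2)
        e_squared = p * (1 - p) * (1 / n1 + 1 / n2)
    else:
        e_squared = p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2
    return abs(p1 - p2) / math.sqrt(e_squared)


# Example 1: school boys in cities A and B (common proportion).
z_cities = z_two_proportions(0.2, 900, 0.185, 1600, pooled=True)
# Example 2: fair-haired people in two populations (known, different proportions).
z_hair = z_two_proportions(0.3, 1200, 0.25, 900, pooled=False)
print(round(z_cities, 3), round(z_hair, 3))
```

The first value falls below 1.96 and the second lies between 1.96 and 2.58, matching the conclusions of the two examples.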

Exercise:
1. A machine produces 16 defective objects in a sample of 500. After the machine is overhauled, it produces 3
defective objects in a batch of 100. Has the machine been improved?
2. One type of aircraft is found to develop engine trouble in 5 flights out of 100, and another type in 7 flights
out of 200. Is there a significant difference between the two types of aircraft so far as engine defects are
concerned?