Statistics 1

Review

Measures of central tendency:

The single value which represents a group of values is termed a "measure of
central tendency", a measure of location, or an average.


Types of average:
1. Arithmetic Mean
2. Median
3. Mode
4. Geometric Mean
5. Harmonic Mean
Arithmetic Mean (A.M.): It is defined as the sum of the given observations divided
by the number of observations. The A.M. is measured in the same units as the
observations.
Let x1, x2, ..., xn be 'n' observations; then the A.M. is computed from the formula

A.M. = $\bar{x} = \frac{\sum x_i}{n}$, where $\sum x_i$ = sum of the given observations and
n = number of observations.

Median: The median is the middlemost item, the one that divides the distribution into two
equal parts when the items are arranged in ascending order of magnitude.
If the number of observations is odd, then the median is the middle value after
the values have been arranged in ascending or descending order of magnitude. In
the case of an even number of observations, there are two middle terms and the median is
obtained by taking the arithmetic mean of these two middle terms.

Mode: Mode is the value which occurs most frequently in a set of observations or

mode is the value of the variable which is predominant in the series.
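
The three averages can be computed directly. As a quick illustration, the following is a minimal Python sketch using the standard-library statistics module; the data values are made up so that the mode is unambiguous.

```python
# Minimal sketch of the three measures of central tendency, using Python's
# built-in statistics module. The data values are made up for illustration.
import statistics

observations = [4, 5, 5, 6, 7, 7, 7, 9]

mean = statistics.mean(observations)      # sum of observations / n = 50 / 8 = 6.25
median = statistics.median(observations)  # even n, so average of the two middle terms = 6.5
mode = statistics.mode(observations)      # most frequently occurring value = 7

print(mean, median, mode)
```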

Measures of Dispersion:
Dispersion means scattering of the observations among themselves or from a

central value (Mean/ Median/ Mode) of data. We study the dispersion to have an

idea about the variation.


Suppose that we have the distribution of the yields (kg per plot) of two groundnut
varieties from 5 plots each. The distribution may be as follows:
Variety 1: 46 48 50 52 54
Variety 2: 30 40 50 60 70

It can be seen that the mean yield for both varieties is 50 kg. But we cannot say
that the performances of the two varieties are the same. There is greater uniformity of
yields in the first variety, whereas there is more variability in the yields of the
second variety. The first variety may be preferred, since it is more consistent in yield
performance.

Types of dispersion:
1. Range
2. Quartile Deviation
3. Mean Deviation
4. Standard Deviation and Variance
5. Coefficient of Variation
6. Standard Error
Range: It is the difference between maximum value and minimum value.

Standard Deviation (σ): It is defined as the positive square root of the
arithmetic mean of the squares of the deviations of the given values from the
arithmetic mean. The square of the standard deviation is called the variance.

Let x1, x2, ..., xn be n observations; then the standard deviation is given by the
formula

S.D. (σ) = $\sqrt{\frac{\sum (x_i - \bar{x})^2}{n}}$, where A.M. = $\bar{x} = \frac{\sum x_i}{n}$ and n = number of observations.

Simplifying the above formula, we have

S.D. (σ) = $\sqrt{\frac{\sum x_i^2}{n} - \bar{x}^2}$

Example:
Calculate the S.D. for the values 5, 6, 7, 7, 9, 4, 5.
Here n = 7 and the mean is $\bar{x}$ = 43/7 ≈ 6.14, so

S.D. = $\sqrt{\frac{\sum (x_i - \bar{x})^2}{n}}$ = $\sqrt{\frac{16.86}{7}}$ ≈ 1.55.

Coefficient of Variation (C.V.):

The coefficient of variation is the ratio of the standard deviation to the
arithmetic mean, expressed as a percentage. The formula for the C.V. is

C.V. = $\frac{\text{S.D.}}{\text{A.M.}} \times 100$

The coefficient of variation will be small if the variation is small. Of two
groups, the one with the lower C.V. is said to be more consistent (a short
computational sketch follows the note below).
Note: 1. Standard deviation is an absolute measure of dispersion.
2. Coefficient of variation is a relative measure of dispersion.
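
To make the comparison of the two groundnut varieties concrete, here is a short Python sketch; the divisor n is used in the standard deviation, matching the definition given above.

```python
# Sketch: standard deviation and coefficient of variation for the two
# groundnut varieties above, using divisor n as in the definition given here.
import math

def std_dev(values):
    n = len(values)
    mean = sum(values) / n
    return math.sqrt(sum((x - mean) ** 2 for x in values) / n)

def coeff_variation(values):
    mean = sum(values) / len(values)
    return std_dev(values) / mean * 100   # expressed as a percentage

variety_1 = [46, 48, 50, 52, 54]
variety_2 = [30, 40, 50, 60, 70]

print(std_dev(variety_1), coeff_variation(variety_1))  # ~2.83 kg and ~5.66 %
print(std_dev(variety_2), coeff_variation(variety_2))  # ~14.14 kg and ~28.28 %
# Both varieties have mean 50 kg, but variety 1 has the lower C.V.,
# so it is the more consistent variety.
```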
NORMAL DISTRIBUTION

The Normal Distribution (N.D.) was first discovered by De Moivre as the limiting
form of the binomial model in 1733, and was later studied independently by Laplace
and Gauss.

The Normal distribution is probably the most important distribution in statistics. It
is a probability distribution of a continuous random variable and is often used to
model the distribution of discrete random variables as well as the distributions of
other continuous random variables. The basic shape of the normal distribution is that of
a bell; it has a single mode and is symmetric about its central value.

Definition: A random variable X is said to follow a Normal Distribution with
parameters μ and σ² if its density function is given by the probability law

f(x) = $\frac{1}{\sigma\sqrt{2\pi}}\, e^{-\frac{(x-\mu)^2}{2\sigma^2}}$,  −∞ < x < ∞; −∞ < μ < ∞; σ > 0

where π = a mathematical constant, approximately 22/7 (≈ 3.1416)
e = the Naperian base, approximately 2.7183
μ = population mean
σ = population standard deviation
x = a given value of the random variable in the range −∞ < x < ∞
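
The density formula can be transcribed directly into code. The sketch below does so; the parameter values (μ = 50, σ = 5) are arbitrary illustrations, not taken from the text.

```python
# Sketch: the normal density f(x) written directly from the formula above.
# The parameter values mu = 50 and sigma = 5 are arbitrary illustrations.
import math

def normal_pdf(x, mu, sigma):
    coefficient = 1.0 / (sigma * math.sqrt(2 * math.pi))
    exponent = -((x - mu) ** 2) / (2 * sigma ** 2)
    return coefficient * math.exp(exponent)

mu, sigma = 50, 5
print(normal_pdf(mu, mu, sigma))          # maximum height 1/(sigma*sqrt(2*pi)) ~ 0.0798
print(normal_pdf(mu + sigma, mu, sigma))  # smaller density one sigma away ~ 0.0484
```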
Characteristics of the Normal distribution and normal curve:

i. The curve is bell-shaped and symmetrical about the mean.

ii. The height of the normal curve is at its maximum at the mean. Hence the
mean and mode of the normal distribution coincide. Also, the number of
observations below the mean in a normal distribution is equal to the
number of observations above the mean. Hence the mean and median of the
N.D. coincide. Thus, for the N.D., mean = median = mode.

iii. As x moves away from μ, f(x) decreases rapidly; the maximum
probability density occurs at the point x = μ and is given by

p(x)max = $\frac{1}{\sigma\sqrt{2\pi}}$

The area under the normal curve is distributed as follows (a numerical check is
sketched after this list):

i) μ − σ < x < μ + σ covers 68.26% of the total area (0.6826)

ii) μ − 2σ < x < μ + 2σ covers 95.44% of the total area (0.9544)

iii) μ − 3σ < x < μ + 3σ covers 99.73% of the total area (0.9973)
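
These coverage figures can be checked numerically. The sketch below builds the normal cumulative distribution function from math.erf, so no external library is assumed.

```python
# Sketch: numerical check of the areas within 1, 2 and 3 standard deviations
# of the mean, using a normal CDF built from math.erf (standard library only).
import math

def normal_cdf(x, mu=0.0, sigma=1.0):
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))

mu, sigma = 0.0, 1.0
for k in (1, 2, 3):
    area = normal_cdf(mu + k * sigma, mu, sigma) - normal_cdf(mu - k * sigma, mu, sigma)
    print(k, round(area, 4))   # 1 -> 0.6827, 2 -> 0.9545, 3 -> 0.9973
```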

Standard Normal Distribution: If X is a normal random variable with mean μ and
standard deviation σ, then Z = $\frac{X - \mu}{\sigma}$ is a standard normal variate with zero mean
and standard deviation 1.
The probability density function of the standard normal variate z is

f(z) = $\frac{1}{\sqrt{2\pi}}\, e^{-z^2/2}$,  −∞ < z < ∞, and the total area under the curve is 1.

A graph representing the density function of the Normal probability distribution
is also known as a Normal Curve or a Bell Curve (see the figure below). To draw
such a curve, one needs to specify two parameters, the mean and the standard
deviation. The curve referred to below has a mean of zero and a standard deviation
of 1, i.e., (μ = 0, σ = 1). A Normal distribution with a mean of zero and a standard
deviation of 1 is also known as the Standard Normal Distribution.

[Figure: Standard Normal Distribution]
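
Standardising an observation is a one-line computation. The values below (μ = 50, σ = 5, x = 58) are made-up illustrations, not taken from the text.

```python
# Sketch: standardising a normal value with Z = (X - mu) / sigma.
# The population values mu = 50, sigma = 5 and the observation x = 58
# are made-up numbers for illustration.
def z_score(x, mu, sigma):
    return (x - mu) / sigma

print(z_score(58, 50, 5))   # 1.6 -> x lies 1.6 standard deviations above the mean
```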

Testing of Hypothesis

Introduction: Estimates based on sample values do not, in general, equal the true value
in the population, because of the inherent variation in the population. Different samples
will give different estimates of the true value. It has to be verified
whether the difference between the sample estimate and the population value is
due to sampling fluctuation or is a real difference. If the difference is due to sampling
fluctuation only, it can safely be said that the sample belongs to the population
under question; if the difference is real, we have every reason to believe that the
sample may not belong to the population under question. The following are a few
technical terms used in this context.

Hypothesis: An assumption made about any unknown characteristic is called a
hypothesis. It may or may not be true.
Ex: 1. μ = 2.3, where μ is the population mean.
2. σ = 2.1, where σ is the population standard deviation.
3. The population follows a Normal Distribution.

There are two types of hypotheses, namely the null hypothesis and the alternative
hypothesis.


Null Hypothesis: A null hypothesis is a statement about the population parameters. Such a
hypothesis, which is usually a hypothesis of no difference, is called the null hypothesis;
in other words, any statistical hypothesis under test is called the null hypothesis. It is
denoted by H₀.
1. H₀: μ = μ₀

2. H₀: μ₁ = μ₂

Alternative Hypothesis: Any hypothesis, which is complementary to the null

hypothesis, is called an alternative hypothesis, usually denoted by H1.


Ex: 1. H₁: μ ≠ μ₀

2. H₁: μ₁ ≠ μ₂

Population: In a statistical investigation the interest usually lies in the assessment

of the general magnitude and the study of variation with respect to one or more

characteristics relating to objects belonging to a group. This group of objects

under study is called the population or universe, i.e., the totality of all the objects under

study is called the population.

Sample: A finite subset of statistical objects in a population is called a sample and

the number of objects in a sample is called the sample size.

Parameter: A characteristic of the population values is known as a parameter. For
example, the population mean (μ) and the population variance (σ²).

In practice, parameter values are not known, and estimates based on the
sample values are generally used.

Statistic: A characteristic of the sample values is called a statistic. For example,
the sample mean ($\bar{x}$) and the sample variance (s²), where

$\bar{x} = \frac{\sum x_i}{n}$ and $s^2 = \frac{\sum (x_i - \bar{x})^2}{n - 1}$

Sampling distribution: The distribution of a statistic computed from all possible

samples is known as sampling distribution of that statistic.

Standard error: The standard deviation of the sampling distribution of a statistic is

known as its standard error, abbreviated as S.E.

S.E.($\bar{x}$) = $\frac{\sigma}{\sqrt{n}}$, where σ = population standard deviation and n = sample size.
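
As a small illustration of the formula, the sketch below computes the standard error of the sample mean; the numbers (σ = 14.14, n = 5) simply echo the second groundnut variety used earlier and are only illustrative.

```python
# Sketch: standard error of the sample mean, S.E. = sigma / sqrt(n).
# The values sigma = 14.14 and n = 5 echo the second groundnut variety above
# and are used purely for illustration.
import math

def standard_error(sigma, n):
    return sigma / math.sqrt(n)

print(standard_error(14.14, 5))   # ~6.32
```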

Random sampling: If the sampling units in a population are drawn independently,
with an equal chance of being included in the sample, then the sampling is called
random sampling.

Simple Hypothesis: A hypothesis is said to be simple if it
completely specifies the distribution of the population. For instance, in the case of a
normal population with mean μ and standard deviation σ, a simple null hypothesis
is of the form H₀: μ = μ₀ with σ known; knowledge of μ would then be enough to
specify the entire distribution.

Composite Hypothesis: If the hypothesis does not specify the distribution of the
population completely, it is said to be a composite hypothesis. Following are some
examples:

H₀: μ < μ₀ and σ is known

H₀: μ > μ₀ and σ is known
Types of Errors:

In testing of statistical hypothesis, there are four possible types of decisions


1. Rejecting H 0 when H 0 is true

2. Rejecting H 0 when H 0 is false


3. Accepting H0 when H 0 is true

4. Accepting H0 when H 0 is false


The 1st and 4th possibilities lead to erroneous decisions. Statisticians give specific
names to these concepts, namely Type I error and Type II error respectively.

The above decisions can be arranged in the following table:

                      H₀ is true                H₀ is false
Rejecting H₀          Type I error (wrong)      Correct decision
Accepting H₀          Correct decision          Type II error (wrong)

Type I error: Rejecting H₀ when H₀ is true.

Type II error: Accepting H₀ when H₀ is false.

The probabilities of Type I and Type II errors are denoted by α and β respectively.
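
The meaning of α as the Type I error rate can be illustrated by simulation: draw many samples from a population in which H₀ is actually true and count how often a test at the 5% level rejects it. The population values and sample size below are made-up assumptions.

```python
# Sketch: estimating the Type I error rate by simulation. Samples are drawn
# from a normal population in which H0 (mu = 50) is actually true, and we
# count how often a two-sided z-test at the 5% level rejects H0.
# The population values and the sample size are made-up illustrations.
import math
import random

random.seed(1)
mu0, sigma, n = 50.0, 5.0, 25
critical_z = 1.96             # two-sided critical value at the 5% level
trials, rejections = 10000, 0

for _ in range(trials):
    sample = [random.gauss(mu0, sigma) for _ in range(n)]
    sample_mean = sum(sample) / n
    z = (sample_mean - mu0) / (sigma / math.sqrt(n))
    if abs(z) >= critical_z:
        rejections += 1       # rejecting a true H0: a Type I error

print(rejections / trials)    # close to 0.05, the chosen level of significance
```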

Degrees of freedom: It is defined as the difference between the total number of

items and the total number of constraints.

If ‘n’ is the total number of items and ‘k’ the total number of constraints then the

degrees of freedom (d.f.) is given by d.f. = n- k

Level of significance (LOS): The maximum probability at which we would be willing
to risk a Type I error is known as the level of significance; that is, the size of the
Type I error is the level of significance. The levels of significance usually employed in
testing of hypotheses are 5% and 1%. The level of significance is always fixed in advance,
before collecting the sample information. An LOS of 5% means that the results obtained
will be true in 95 out of 100 cases and may be wrong in 5 out of 100 cases.

Critical value: While testing for the difference between the means of two
populations, our concern is whether the observed difference is too large to believe
that it has occurred just by chance. But then the question is: how much difference
should be treated as too large? Based on the sampling distribution of the means, it is
possible to define a cut-off or threshold value such that if the difference exceeds
this value, we say that it is not an occurrence by chance, and hence there is
sufficient evidence to claim that the means are different. Such a value is called the
critical value, and it is based on the level of significance.
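
When the test statistic follows the standard normal distribution, the critical value for a chosen level of significance can be read off with the standard library, as in this small sketch (statistics.NormalDist requires Python 3.8 or later).

```python
# Sketch: critical values of a two-sided test read from the standard normal
# distribution, using statistics.NormalDist (Python 3.8+).
from statistics import NormalDist

def two_sided_critical_value(level_of_significance):
    # Split the risk equally between the two tails and read off the cut-off.
    return NormalDist().inv_cdf(1 - level_of_significance / 2)

print(two_sided_critical_value(0.05))   # ~1.96
print(two_sided_critical_value(0.01))   # ~2.576
```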

Steps involved in test of hypothesis:

1. The null and alternative hypothesis will be formulated

2. Test statistic will be constructed

3. Level of significance will be fixed

4. The table (critical) values will be found out from the tables for a given level

of significance

5. The null hypothesis will be rejected at the given level of significance if the
value of the test statistic is greater than or equal to the critical value.
Otherwise the null hypothesis will be accepted.

6. In the case of rejection, the variation in the estimates will be called 'significant'
variation. In the case of acceptance, the variation in the estimates will be called
'not significant'. (A worked sketch of these steps is given below.)
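
As a worked sketch of these six steps, the following applies them to a one-sample z-test with a known population standard deviation; every number in it is a made-up illustration, not data from the text.

```python
# Sketch: the six steps above applied to a one-sample z-test with a known
# population standard deviation. All numbers are made-up illustrations.
import math
from statistics import NormalDist

# Step 1: formulate the hypotheses. H0: mu = 50 against H1: mu != 50.
mu0 = 50.0
sigma = 5.0                                  # assumed known
sample = [52, 55, 49, 53, 57, 51, 54, 56]    # made-up observations

# Step 2: construct the test statistic z = (x_bar - mu0) / (sigma / sqrt(n)).
n = len(sample)
x_bar = sum(sample) / n
z = (x_bar - mu0) / (sigma / math.sqrt(n))

# Step 3: fix the level of significance in advance.
alpha = 0.05

# Step 4: find the critical (table) value for that level.
critical = NormalDist().inv_cdf(1 - alpha / 2)   # ~1.96 for the 5% level

# Steps 5 and 6: compare the statistic with the critical value and conclude.
if abs(z) >= critical:
    print("Reject H0: the variation is significant.")
else:
    print("Accept H0: the variation is not significant.")
print(round(z, 3), round(critical, 3))
```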

*****
