BASIC STATISTICS (Definitions)

The document provides an overview of basic statistics concepts, including definitions of information, data, primary and secondary data, variables, and various types of data handling. It explains measures of central tendency such as mean, median, and mode, as well as measures of dispersion like range, variance, and standard deviation. Additionally, it covers methods for presenting data, including tabulation, frequency distribution, and graphical representations like histograms and frequency polygons.

Uploaded by

sabasalman32164


BASIC STATISTICS

INFORMATION:
Knowledge about something is called ‘Information’.
INFORMATION HANDLING:
Presenting information in a manageable way, so that useful conclusions can be drawn, is called
‘Information handling’.
DATA:
The numerical figures obtained from any field of study are known as ‘Data’.
Data can be obtained from existing sources, such as office records and published papers, or collected
directly from the field as needed.
PRIMARY DATA:
The data directly collected from its source is called ‘Primary data’.
SECONDARY DATA:
Data that have passed through some statistical treatment at least once are called
‘Secondary data’, e.g. raw data, once put in some order, become secondary data.
CONSTANT:
Any quantity that has a single value is called a ‘Constant’.
VARIABLE:
Any characteristic whose value can differ from one individual to another is called a
‘Variable’.
DISCRETE VARIABLE:
It can take only specific, separate values. Its values are usually whole numbers (counts), not
fractions.
CONTINUOUS VARIABLE:
It can take every possible value in a given interval (say a to b). It can be a whole figure or a fraction.
UNGROUPED DATA:
Numerical figures which are obtained first-hand and recorded as they stand are known as
‘Ungrouped data’.
TABULATION:
‘Tabulation’ of the data means to present the data in a classified form or into rows and columns.
CLASSIFICATION:
‘Classification’ is the process of arranging the data into groups or classes with similar characteristics.
The main function of classification is to condense a large number of observations into an easy,
understandable form.
FREQUENCY DISTRIBUTION:
A ‘Frequency distribution’ is a tabular arrangement that classifies data into different groups and
shows, against each group, the number of observations falling in it.
GROUPED DATA:
The data in the form of ‘Frequency distribution’ is called ‘Grouped data’.
CLASS LIMITS:
Each class or group is defined by two values, one small and one large, called the ‘Class limits’.
The smaller one is called ‘Lower class limit’ and the larger one is called ‘Upper class limit’.
SIZE OF CLASS INTERVAL:
The ‘size’ or ‘width’ of a class interval is defined as the difference between two consecutive lower
limits or two consecutive upper limits of a group. It is denoted by $h$.
$$h = \frac{\text{Range}}{\text{No. of groups}} = \frac{x_{max} - x_{min}}{k}$$
CLASS FREQUENCY:
The number of occurrences of items corresponding to a class interval is called ‘Class frequency’.
CLASS MARK:
‘Class mark’ is defined as the midpoint or average of a class.
It is obtained by dividing sum of lower and upper limits of a class by 2.
CLASS BOUNDARIES:
The real class limits of a class or group are called ‘Class boundaries’.

Class boundaries can be obtained from midpoints as (𝑥 ± ℎ⁄2).

CUMULATIVE FREQUENCY:
The total of the frequencies up to an upper class limit or boundary is called ‘Cumulative frequency’.
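The grouping terms defined above (class limits, class boundaries, class mark, class frequency and cumulative frequency) can be illustrated with a short Python sketch. The marks data, the choice of k = 5 classes and the helper name `frequency_table` are illustrative assumptions, not part of these notes. The data are assumed to be whole numbers, so each class boundary is taken 0.5 beyond the class limit.

```python
import math

def frequency_table(data, k):
    """Build a frequency distribution with k classes for whole-number data.

    Returns one dict per class holding its limits, boundaries, class mark,
    class frequency and 'less than' cumulative frequency.
    """
    x_min, x_max = min(data), max(data)
    # Size of class interval: h = range / k, rounded up to a whole number.
    h = math.ceil((x_max - x_min) / k)
    # If k classes of width h would not reach x_max, widen the classes by one.
    if x_min + k * h <= x_max:
        h += 1
    table, cumulative = [], 0
    for i in range(k):
        lower = x_min + i * h      # lower class limit
        upper = lower + h - 1      # upper class limit (whole-number data)
        f = sum(lower <= x <= upper for x in data)
        cumulative += f
        table.append({
            "limits": (lower, upper),
            "boundaries": (lower - 0.5, upper + 0.5),
            "mark": (lower + upper) / 2,
            "f": f,
            "cf": cumulative,
        })
    return table

marks = [12, 15, 21, 24, 27, 30, 33, 35, 38, 41, 44, 47, 50, 52, 58]
for row in frequency_table(marks, k=5):
    print(row)
```

For these marks the range is 46, so h = 10 and the classes are 12-21, 22-31, 32-41, 42-51 and 52-61, with frequencies 3, 3, 4, 3 and 2.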
HISTOGRAM:
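A ‘Histogram’ is a graph of adjacent rectangles drawn on the class boundaries.
In a histogram, the width of each rectangle corresponds to the size of the class interval and the height corresponds
to the class frequency. When the class intervals are unequal, the heights are adjusted (frequency divided by class width) so that the area of each rectangle stays proportional to its frequency.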
FREQUENCY POLYGON:
A ‘Frequency polygon’ is a many-sided closed figure in which class marks are taken along the x-axis and
frequencies along the y-axis.
CUMULATIVE FREQUENCY DISTRIBUTION:
A table showing cumulative frequencies against upper class boundaries is called ‘Cumulative frequency
distribution’. It is also called a ‘Less than cumulative frequency distribution’.
CUMULATIVE FREQUENCY POLYGON/OGIVE:
The graph of a Less than cumulative frequency distribution is called ‘Cumulative frequency polygon or
Ogive’ in which cumulative frequencies are plotted against upper class boundaries.
CENTRAL TENDENCY:
‘Central tendency’ is more or less a central value around which the data appear to be crowded.
It is a single representative value which shows tendency or behavior of the distribution of the variable under
study.
MEASURES OF CENTRAL TENDENCY:
The measures or techniques that are used to determine the central value are called ‘Measures of
central tendency’.
The following measures of central tendency will be discussed.
1. Arithmetic mean 2. Median 3. Mode
4. Geometric mean 5. Harmonic mean 6. Quartiles
ARITHMETIC MEAN:
The ‘Arithmetic mean’, or simply the ‘Mean’, is a single value obtained by dividing the sum of all observations
by their total number. It is denoted by $\bar{X}$.
FORMULAE OF ARITHMETIC MEAN:
1. DIRECT FORMULA FOR UNGROUPED DATA:
$$\bar{X} = \frac{\sum x}{n} = \frac{\text{Sum of all observations}}{\text{No. of observations}}$$
2. SHORT FORMULA FOR UNGROUPED DATA:
‘Deviation’ is defined as the difference between an observed value and a constant:
$D = x - A$, where $A$ is a constant called the ‘Assumed or Provisional mean’.
$$\bar{X} = A + \frac{\sum D}{n}$$
3. CODING FORMULA FOR UNGROUPED DATA:
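$$\bar{X} = A + \frac{\sum U}{n} \times h$$
where $U = \frac{x - A}{h} = \frac{D}{h}$ is the coded deviation and $h$ is a common factor of the deviations (for grouped data, usually the class interval size).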
4. DIRECT FORMULA FOR GROUPED DATA:
$$\bar{X} = \frac{\sum fx}{\sum f}$$
5. SHORT FORMULA FOR GROUPED DATA:
$$\bar{X} = A + \frac{\sum fD}{\sum f}$$
6. CODING FORMULA FOR GROUPED DATA:
$$\bar{X} = A + \frac{\sum fU}{\sum f} \times h$$
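The direct, short and coding formulae are three routes to the same value. The Python sketch below checks this on a small grouped data set; the class marks, frequencies, assumed mean A = 36.5 and interval size h = 10 are made-up illustration values, and the function names are hypothetical. Ungrouped data are the special case in which every frequency equals 1.

```python
def mean_direct(x, f):
    """Direct formula: sum(f*x) / sum(f)."""
    return sum(fi * xi for xi, fi in zip(x, f)) / sum(f)

def mean_short(x, f, A):
    """Short formula: A + sum(f*D) / sum(f), with D = x - A."""
    return A + sum(fi * (xi - A) for xi, fi in zip(x, f)) / sum(f)

def mean_coding(x, f, A, h):
    """Coding formula: A + (sum(f*U) / sum(f)) * h, with U = (x - A) / h."""
    return A + sum(fi * (xi - A) / h for xi, fi in zip(x, f)) / sum(f) * h

x = [16.5, 26.5, 36.5, 46.5, 56.5]   # class marks (illustrative)
f = [3, 3, 4, 3, 2]                  # class frequencies (illustrative)
A, h = 36.5, 10                      # assumed mean and class interval size

print(mean_direct(x, f), mean_short(x, f, A), mean_coding(x, f, A, h))
```

All three calls give about 35.17 (527.5 / 15), which shows that the choice of A and h affects only the amount of arithmetic, not the answer.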
MEDIAN:
The middlemost observation in an arranged data set is called the ‘Median’. It divides the data into two
equal parts, i.e. 50% of the data lie before the median and 50% after it. It is denoted by $\tilde{X}$.
FORMULAE FOR MEDIAN:
1. MEDIAN FOR UNGROUPED DATA:
CASE 1: When number of observations ‘n’ is odd.
$$\tilde{X} = \left(\frac{n+1}{2}\right)\text{th observation}$$
CASE 2: When number of observations ‘n’ is even.
$$\tilde{X} = \frac{1}{2}\left[\frac{n}{2}\text{th observation} + \left(\frac{n+2}{2}\right)\text{th observation}\right]$$
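The two cases for ungrouped data translate directly into code. This is a minimal Python sketch; the function name `median_ungrouped` and the sample values are illustrative assumptions. The positions in the formulae count from 1, while Python lists count from 0, hence the `- 1` in each index.

```python
def median_ungrouped(values):
    """Median of ungrouped data using the odd/even position rules."""
    data = sorted(values)          # the data must be arranged first
    n = len(data)
    if n % 2 == 1:
        # Odd n: the ((n + 1) / 2)th observation.
        return data[(n + 1) // 2 - 1]
    # Even n: average of the (n / 2)th and ((n + 2) / 2)th observations.
    return (data[n // 2 - 1] + data[(n + 2) // 2 - 1]) / 2

print(median_ungrouped([7, 3, 9, 1, 5]))   # arranged: 1 3 5 7 9, so 5
print(median_ungrouped([7, 3, 9, 1]))      # arranged: 1 3 7 9, so (3 + 7) / 2 = 5.0
```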
2. MEDIAN FOR GROUPED DATA:
$$\tilde{X} = l + \frac{h}{f}\left(\frac{n}{2} - c\right)$$
Where, 𝑙 : lower class boundary of median class.
ℎ : class interval size of median class.
𝑓 : frequency of the median class.
𝑐 : cumulative frequency of the class preceding the median class.
QUARTILES:
The values which divide an arranged data set into four equal parts are called ‘Quartiles’.
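FIRST QUARTILE is $Q_1 = l + \frac{h}{f}\left(\frac{n}{4} - c\right)$
THIRD QUARTILE is $Q_3 = l + \frac{h}{f}\left(\frac{3n}{4} - c\right)$
Here $l$, $h$, $f$ and $c$ are defined as for the median, but for the class containing the quartile.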
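The grouped-data formulae for the median and the quartiles have the same shape and differ only in the position (n/2, n/4 or 3n/4). The Python sketch below exploits this with one hypothetical helper, `grouped_position_value`, that takes the fraction as a parameter. The class boundaries and frequencies are illustrative values, and all classes are assumed to have the same size h.

```python
def grouped_position_value(lower_boundaries, freqs, h, fraction):
    """Value below which `fraction` of a grouped data set lies.

    fraction = 0.5 gives the median, 0.25 gives Q1 and 0.75 gives Q3.
    """
    n = sum(freqs)
    target = fraction * n          # n/2, n/4 or 3n/4
    c = 0                          # cumulative frequency of preceding classes
    for l, f in zip(lower_boundaries, freqs):
        if f > 0 and c + f >= target:
            # This class contains the required position.
            return l + (h / f) * (target - c)
        c += f
    raise ValueError("no class reaches the requested position")

lower_boundaries = [11.5, 21.5, 31.5, 41.5, 51.5]   # illustrative classes
freqs = [3, 3, 4, 3, 2]
h = 10

median = grouped_position_value(lower_boundaries, freqs, h, 0.5)
q1 = grouped_position_value(lower_boundaries, freqs, h, 0.25)
q3 = grouped_position_value(lower_boundaries, freqs, h, 0.75)
print(q1, median, q3)
```

With n = 15 the median position is 7.5, which falls in the class starting at 31.5 (c = 6, f = 4), so the median is 31.5 + (10/4)(7.5 - 6) = 35.25.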

MODE:
The most frequently occurring observation in a data set is called the ‘Mode’.
1. MODE FOR UNGROUPED DATA:
Mode = the most frequent observation
2. MODE FOR GROUPED DATA:
$$\hat{X} = l + \left[\frac{f_m - f_1}{2f_m - f_1 - f_2}\right] \times h$$
Where, 𝑙 : lower class boundary of modal class.
ℎ : class interval size of modal class.
𝑓𝑚 : frequency of modal class or maximum frequency.
𝑓1 : frequency of the class preceding the modal class.
𝑓2 : frequency of the class succeeding the modal class.
The empirical relation between Mean, Median and Mode, which holds approximately for moderately skewed distributions, is Mode = 3 Median − 2 Mean
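The grouped-data mode formula can be sketched in Python as follows. The function name `grouped_mode` and the sample classes are illustrative assumptions. Two choices in the sketch are also assumptions rather than rules from these notes: a missing neighbouring class is given frequency 0, and when several classes tie for the maximum frequency the first one is used.

```python
def grouped_mode(lower_boundaries, freqs, h):
    """Mode of grouped data: l + (fm - f1) / (2*fm - f1 - f2) * h."""
    # Index of the modal class (first class with the maximum frequency).
    m = max(range(len(freqs)), key=freqs.__getitem__)
    fm = freqs[m]
    f1 = freqs[m - 1] if m > 0 else 0                 # preceding class
    f2 = freqs[m + 1] if m + 1 < len(freqs) else 0    # succeeding class
    # Note: the denominator is zero when fm == f1 == f2; the formula
    # gives no answer in that case and this sketch does not handle it.
    return lower_boundaries[m] + (fm - f1) / (2 * fm - f1 - f2) * h

lower_boundaries = [11.5, 21.5, 31.5, 41.5, 51.5]   # illustrative classes
freqs = [3, 3, 4, 3, 2]
print(grouped_mode(lower_boundaries, freqs, h=10))
```

Here the modal class starts at 31.5 with fm = 4, f1 = 3 and f2 = 3, so the mode is 31.5 + (1/2)(10) = 36.5.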
GEOMETRIC MEAN:
The positive nth root of the product of ‘n’ observations is called the ‘Geometric mean’.
1. Basic formula of Geometric mean for Ungrouped data:
$$G.M. = \sqrt[n]{x_1 \cdot x_2 \cdot x_3 \cdots x_n}$$
2. Logarithmic formula of Geometric mean for Ungrouped data:
$$G.M. = (x_1 \cdot x_2 \cdot x_3 \cdots x_n)^{1/n} \Rightarrow \log G.M. = \log (x_1 \cdot x_2 \cdot x_3 \cdots x_n)^{1/n}$$
$$\log G.M. = \frac{1}{n} \log (x_1 \cdot x_2 \cdot x_3 \cdots x_n) \Rightarrow \log G.M. = \frac{1}{n} (\log x_1 + \log x_2 + \log x_3 + \cdots + \log x_n)$$
$$\log G.M. = \frac{1}{n} \sum \log x \Rightarrow G.M. = \text{antilog}\left(\frac{\sum \log x}{n}\right)$$
3. Logarithmic formula of Geometric mean for Grouped data:
$$G.M. = \text{antilog}\left(\frac{\sum f \log x}{\sum f}\right)$$
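The basic and logarithmic formulae give the same geometric mean, which a few lines of Python can confirm. This is a sketch with hypothetical function names and made-up values. It assumes positive observations and base-10 logarithms, so the antilog is a power of 10.

```python
import math

def geometric_mean_root(xs):
    """Basic formula: nth root of the product of the n observations."""
    return math.prod(xs) ** (1 / len(xs))

def geometric_mean_log(xs, fs=None):
    """Logarithmic formula: antilog(sum(f * log x) / sum(f)).

    With fs omitted every frequency is 1 (ungrouped data).
    """
    fs = fs or [1] * len(xs)
    mean_log = sum(f * math.log10(x) for x, f in zip(xs, fs)) / sum(fs)
    return 10 ** mean_log          # antilog to base 10

print(geometric_mean_root([2, 4, 8]))        # cube root of 64
print(geometric_mean_log([2, 4, 8]))         # same value via logarithms
print(geometric_mean_log([2, 8], [1, 2]))    # grouped: 2 once, 8 twice
```

The logarithmic route is the one suited to hand calculation with log tables; in code it also avoids the very large products that the basic formula can produce for long data sets.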
HARMONIC MEAN:
The reciprocal of the arithmetic mean of the reciprocals of the observations is called the ‘Harmonic
mean’.
1. Harmonic mean for Ungrouped data:
$$H.M. = \frac{n}{\sum \frac{1}{x}}$$
2. Harmonic mean for Grouped data:
$$H.M. = \frac{\sum f}{\sum \frac{f}{x}}$$
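A common use of the harmonic mean is averaging rates. The Python sketch below applies both formulae; the function name `harmonic_mean` and the speed figures are illustrative assumptions, and all observations are assumed to be non-zero.

```python
def harmonic_mean(xs, fs=None):
    """Harmonic mean: sum(f) / sum(f / x).

    With fs omitted every frequency is 1, which reduces the
    formula to n / sum(1 / x) for ungrouped data.
    """
    fs = fs or [1] * len(xs)
    return sum(fs) / sum(f / x for x, f in zip(xs, fs))

# Average speed over two equal distances covered at 40 and 60 km/h.
print(harmonic_mean([40, 60]))

# Grouped form: 40 observed once, 60 observed twice.
print(harmonic_mean([40, 60], [1, 2]))
```

The first call gives 48 (up to floating-point rounding), lower than the arithmetic mean of 50, because more time is spent travelling at the slower speed.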
PROPERTIES OF ARITHMETIC MEAN:
1. The mean of a variable whose observations are all equal to a constant ‘k’ is the constant ‘k’ itself.
2. Mean is affected by change in origin.
3. Mean is affected by change in scale.
4. Sum of deviations of observations from arithmetic mean is always zero.
Σ(𝑥 − 𝑥̅ ) = 0 (Ungrouped data)
Σf(𝑥 − 𝑥̅ ) = 0 (Grouped data)
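The four properties can be checked numerically. The sketch below uses Python's standard `statistics.mean`; the data values, the shift of 5 and the scale factor of 3 are arbitrary illustration choices.

```python
from statistics import mean

x = [4, 8, 10, 14]
x_bar = mean(x)                       # 36 / 4 = 9

# Property 1: the mean of identical observations is that constant.
constant_mean = mean([7, 7, 7])

# Property 2: change of origin (adding 5 to every value) shifts the mean by 5.
shifted_mean = mean([xi + 5 for xi in x])

# Property 3: change of scale (multiplying by 3) multiplies the mean by 3.
scaled_mean = mean([3 * xi for xi in x])

# Property 4: deviations from the mean sum to zero.
deviation_sum = sum(xi - x_bar for xi in x)

print(constant_mean, shifted_mean, scaled_mean, deviation_sum)
```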
WEIGHTED ARITHMETIC MEAN:
The relative importance of a number is called its weight.
When all the observations 𝑥1, 𝑥2 , 𝑥3 … … , 𝑥𝑛 are not equally important, certain weights 𝑤1 , 𝑤2 , 𝑤3 … … , 𝑤𝑛 are
associated with them depending on the importance or significance.
So, the ‘Weighted Arithmetic mean’ is defined as
$$\bar{x}_w = \frac{w_1 x_1 + w_2 x_2 + w_3 x_3 + \cdots + w_n x_n}{w_1 + w_2 + w_3 + \cdots + w_n} = \frac{\sum wx}{\sum w}$$
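As a brief illustration of the weighted arithmetic mean, the Python sketch below averages three scores with unequal weights. The scores, the weights and the function name `weighted_mean` are illustrative assumptions.

```python
def weighted_mean(xs, ws):
    """Weighted arithmetic mean: sum(w * x) / sum(w)."""
    return sum(w * x for x, w in zip(xs, ws)) / sum(ws)

scores = [70, 80, 90]
weights = [1, 2, 3]        # the last score counts three times as much as the first

print(weighted_mean(scores, weights))     # (70 + 160 + 270) / 6
print(weighted_mean(scores, [1, 1, 1]))   # equal weights: the ordinary mean, 80.0
```

With equal weights the formula reduces to the ordinary arithmetic mean, so weighting matters only when the observations differ in importance.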
MOVING AVERAGES:
‘Moving averages’ are defined as the successive arithmetic means computed over a fixed number of
consecutive periods (days, months, years, etc.), moving forward one period at a time.
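Moving averages can be computed with a short sliding-window loop. In this Python sketch the function name `moving_averages`, the monthly figures and the window of 3 periods are illustrative assumptions.

```python
def moving_averages(values, window):
    """Arithmetic means of each run of `window` consecutive values."""
    return [
        sum(values[i:i + window]) / window
        for i in range(len(values) - window + 1)
    ]

monthly_sales = [10, 12, 14, 16, 18]
print(moving_averages(monthly_sales, window=3))   # [12.0, 14.0, 16.0]
```

A series of n values yields n - window + 1 moving averages, so five values with a 3-period window give three averages.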
DISPERSION:
Statistically, ‘Dispersion’ means the spread or scatter of observations in a data set.
The purpose of finding Dispersion is to study the behavior of each unit of population around the average
value. It also helps in comparison of two sets of data in more detail.
The spread or scatter in a data set can be seen in two ways:
1. The spread of observations between two extreme observations in a data set.
2. The spread of observations around an average value, say their arithmetic mean.
MEASURES OF DISPERSION:
The measures that are used to determine the degree or extent of variation in a data set are called
‘Measures of dispersion’.
The following measures of dispersion will be discussed:
1. Range 2. Variance 3. Standard Deviation
Dispersion is not affected by change in origin but it is affected by change in scale.
RANGE:
The extent of variation between two extreme observations in a data set is called ‘Range’.
1. Range for Ungrouped data:
Range = $x_{max} - x_{min} = x_m - x_0$
2. Range for Grouped data:
Range = (Upper class boundary of last class) − (Lower class boundary of first class)
OR
Range = Maximum midpoint − Minimum midpoint
VARIANCE:
The mean of squared deviations of observations from their arithmetic mean is called ‘Variance’.
1. Proper mean or Definitional formula for Ungrouped data:
$$S^2 = \frac{\sum (x - \bar{x})^2}{n}$$
2. Direct or Computational formula for Ungrouped data:
$$S^2 = \frac{\sum x^2}{n} - \left(\frac{\sum x}{n}\right)^2$$
3. Proper mean or Definitional formula for Grouped data:
$$S^2 = \frac{\sum f(x - \bar{x})^2}{\sum f}$$
4. Direct or Computational formula for Grouped data:
$$S^2 = \frac{\sum fx^2}{\sum f} - \left(\frac{\sum fx}{\sum f}\right)^2$$
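The definitional and computational formulae are algebraically equivalent, which the Python sketch below checks on small data sets. The function names and data are illustrative assumptions. As in the formulae above, the divisor is n (or Σf), not n − 1, so the result matches `statistics.pvariance` from the standard library rather than `statistics.variance`.

```python
from statistics import pvariance

def variance_definitional(x, f=None):
    """Mean of squared deviations from the arithmetic mean."""
    f = f or [1] * len(x)              # ungrouped data: every frequency is 1
    n = sum(f)
    x_bar = sum(fi * xi for xi, fi in zip(x, f)) / n
    return sum(fi * (xi - x_bar) ** 2 for xi, fi in zip(x, f)) / n

def variance_computational(x, f=None):
    """Mean of squares minus square of the mean."""
    f = f or [1] * len(x)
    n = sum(f)
    mean_of_squares = sum(fi * xi ** 2 for xi, fi in zip(x, f)) / n
    mean = sum(fi * xi for xi, fi in zip(x, f)) / n
    return mean_of_squares - mean ** 2

x = [4, 8, 10, 14]
print(variance_definitional(x))    # (25 + 1 + 1 + 25) / 4 = 13.0
print(variance_computational(x))   # 376 / 4 - 9**2 = 13.0
print(pvariance(x))                # standard-library cross-check
```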
STANDARD DEVIATION:
The positive square root of mean of squared deviations of observations from their arithmetic mean is
called ‘Standard deviation’.
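1. Proper mean or Definitional formula for Ungrouped data:
$$S = \sqrt{\frac{\sum (x - \bar{x})^2}{n}}$$
2. Direct or Computational formula for Ungrouped data:
$$S = \sqrt{\frac{\sum x^2}{n} - \left(\frac{\sum x}{n}\right)^2}$$
3. Proper mean or Definitional formula for Grouped data:
$$S = \sqrt{\frac{\sum f(x - \bar{x})^2}{\sum f}}$$
4. Direct or Computational formula for Grouped data:
$$S = \sqrt{\frac{\sum fx^2}{\sum f} - \left(\frac{\sum fx}{\sum f}\right)^2}$$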
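The standard deviation, together with the earlier statement that dispersion is unaffected by a change of origin but affected by a change of scale, can be checked with a short Python sketch. The function name `std_dev`, the data, the shift of 5 and the scale factor of 3 are illustrative assumptions.

```python
import math

def std_dev(x):
    """Positive square root of the mean of squared deviations (divisor n)."""
    n = len(x)
    x_bar = sum(x) / n
    return math.sqrt(sum((xi - x_bar) ** 2 for xi in x) / n)

x = [4, 8, 10, 14]
s = std_dev(x)                                # square root of 13
s_shifted = std_dev([xi + 5 for xi in x])     # change of origin
s_scaled = std_dev([3 * xi for xi in x])      # change of scale

print(s, s_shifted, s_scaled)
```

Adding 5 to every observation leaves S unchanged, while multiplying every observation by 3 multiplies S by 3 (and the variance by 9).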
