QT Module 2
QT Module 2
FOR BUSINESS
Module:2 - Statistical Survey
Syllabus
❖Planning and design of enquiry
❖Statistical units
❖Executing a survey
❖Data Sources(Primary &Secondary)
❖Methods of collecting primary and secondary data
❖Sampling & Census
❖Types of sampling
❖Theoretical base of sampling
❖Statistical Laws
❖Statistical errors
❖Editing,coding,classification &Tabulation
Statistical Survey
Survey: means a search for truth, knowledge or information
Statistics: means data
Statistical survey :A method of studying a problem by collecting numerical figures
through survey.
ii. Units of Analysis and interpretation: units in which statistical data are analyzed
and interpreted. For eg. ratios, percentages, coefficients etc.
Units of Presentation: units by which attributes are presented.
Essentials of good statistical units : Simple & Specific , Stable, Suitable to the enquiry ,
Homogeneous, Self explanatory
Collection of the data
Methods that are to be employed for obtaining the required information from the units
under investigation.
Types of data – Primary and Secondary Data
Primary Data : Original Data Collected from primary sources. Data collected for the first
time . Data originated directly as a result of problem under investigation.
Merits of primary data:
❑ collects data specific to the problem under study.
❑ no doubt about the quality of the data.
❑ possible to obtain additional data.
Demerits of Primary Data:
❑ Lot of time, labour, skill and money required.
❑ In some cases it is not possible to collect primary data
Methods of collecting primary data
1. Observation Method : Data is collected with the help of observation by observer or
by personally going to the field.
Features of observation
➢ Observation is Systematic (planning) Observation is Specific:
➢ Observation is an Affair of Eyes, Definite Aim
➢ Direct method – field visit Use Only for primary data collection
Only used among educated responded Can be used even among illiterate
respondents
Time and Quite expensive both in time and Requires less time and money
Money money terms.
Basic terms
Population /universe: Totality of objects under consideration
Finite: If number of units can be determined
Infinite: If number of units cannot be determined
Sample: Representative part of the population
Sampling: Process of selecting sample from a population
Sampling design: Procedure designed for selecting sample.
Law of inertia of large numbers: Law of Inertia of Large Numbers states, “Other things
being equal, as the sample size increases, the results tend to be more reliable and
accurate.”
Statistical errors
Difference between a value obtained from a data collection process and the 'true' value for
the population. Statistical errors can be classified as :
Sampling Error: Arising as a result of unrepresentativeness of sample taken. Sampling
errors are of two types:
a)Biased sampling error: Sample is chosen in a way that makes some individuals less
likely to be included in the sample than others. arises if there is bias in selection of sample
b)Unbiased sampling error: The investigator has not intentionally tampered with the
sample and that the difference between the population and sample have occurred by chance
1. Non sampling errors: Errors arising in statistical investigation not because of sampling. It
occurs at the planning stage, fieldwork stage as well as at tabulation and computation
stage. Measurement of errors
1.Standard Error / Absolute Error: Difference between true value and estimated value
Absolute error = Actual value- Estimated value
2.Relative error: Relative Error is the percentage/ ratio of the estimated error.
Relative error = Absolute error/ Estimated value
Editing of data
Detection of errors or irregularities in collected data and their elimination.
a) Editing of primary data
Primary data is raw data characterized by in accuracy, heterogeneity, inconsistency,
incompleteness etc. Therefore editing is necessary to make the data Accurate,
Consistency, Adequacy , Completeness and Homogeneity.
b) Editing of secondary data
Editing is done to figure out in consistencies, probable errors and omissions. IT makes the
data suitable for the purpose of present investigation.
Secondary data is edited to ensure Reliability, Suitability, Adequacy, Accuracy and to
identify Unit of measurement and analysis. Reliability of data can be ensured by verifying
the source from which data is collected.
Coding of data: “Coding is the process of assigning numbers or letters or combination
of both to the data collected for the purpose of purpose of proper identification and
presentation in a secret manner.”
Classification of data
“ Process of arranging data in groups or classes according to resemblances and
similarities is technically called classification”
Features of classification
❑ Exhaustiveness: Each and every item of data must be presented in any of one class.
❑ Mutually Exclusive: Each item of data must be presented in one class only
❑ Flexibility: Possible to make adjustments to new situation and circumstance
❑ Homogeneous : Variables included in all classes must be homogeneous
❑ Suitability: Suited for the purpose and object of the study
❑ Stability: It should be stable
❑ Accuracy: Ensure accuracy of data
❑ Clarity: It presents data in a simple and clear form
❑ Condensation: It helps in shortening the huge mass of data
❑ Presentation and analysis: It is essential for presentation of data and application of
tools of analysis.
Methods of classification
➢ Geographical classification: Data classified according to the geographical differences.
Data are classified on the basis of states , districts , villages, zones etc
➢ Chronological classification: When data are collected for a number of period and arranged
chronologically. Data are classified on the basis of year, month, week, day etc.
➢Qualitative classification: Classified on the basis of presence or absence of an attribute.
a) Two fold or Dichotomous classification : Classification on the basis of one attribute
leads to creation of two groups. Eg. on the basis of literacy data can be divided into two
literate and illiterate
b) Manifold classification: Classification on the basis of more than one attribute
Eg. When population is divided on the basis of gender and literacy there can be four
groups: Male literate , Male illiterate, Female literate, Female illiterate
➢Quantitative Classification: Classification on the basis of measurable characteristics.
Variable: Variable is a value that can change depending on conditions.
Attributes: Qualitative Characteristics like gender, education etc.
Statistical series
systematic arrangement of the items into some logical orders. The data so arranged is called
a statistical series. Types of statistical series:
Statistical series can be classified according to their general characteristic into:
1. Time series: When data are arranged on the basis of time or chronologically.
year 2015 2016 2017 2018 2019 2020
Sales 200 500 600 700 900 1000
2.Spatial series: When data are arranged or presented in relation to place
Town A B C D E F
Population 20 15 23 45 55 16
3. Condition series: When data are presented with reference to some conditions
Division of sector Private sector Public sector Joint sector
Productive capital 4800 6200 700
On the basis of their construction :
1. Individual series: In individual series the items are listed singly , showing the observation
relating to them
Marks: 20 30 40 56 25 88 96 63
2.Discrete series: When the exact measurement of various items are shown along with their
corresponding frequencies(numbers)
Marks 50 60 40 55 65 89 90 89 78
Numbers 10 12 15 15 20 18 5 15 0
3. Continuous series: When approximate measurements are shown as class intervals along
with their corresponding frequencies.