Unit1 BusStat
Unit1 BusStat
Module 1
Presented by
Dr.Analjyoti Basu
Assistant Professor
School of Commerce, Presidency University
UNIVARIATE, BIVARIATE
AND MULTIVARIATE
DISCRETE QUANTITAVE
DATA
CONTINIOUS QUANTITAVE
Types of statistical data-
meanings(Quality & quantity)
QUANTITATIVE
Involving or relating to considerations of amount or size. Capable of being measured.
These are expressed in meaningful numbers that represents quantity of some
characteristics possessed by an element. So, the QUANTITATIVE data indicate how
much, how many.
For example- The quantity of grains exported in a single year(in Metric tonnes).
QUALITATIVE
Qualitative means relating to the nature or standard of something, rather than to its
quantity. involving or relating to distinctions based on quality or qualities. Thus
Qualitative data provide only lebels or names for categories of elements under
study. The data could be identified by non-numeric description or numeric codes.
For example- The quality of Indian Intellectual resource is being praised world wide.
Types of statistical data-
meanings(Quality vs quantity)
Types of statistical data-
meanings(Quality vs quantity)
Types of statistical data-
meanings(nominal & ordinal)
NOMINAL
Nominal data is data that can be labeled or classified but cannot be ordered in a
meaningful way. So it involves different categorization but not meaningful ordering.
For example- Mode of transportation, you may have the categories of car, bus, train,
tram or bicycle.
Mode of transaction by Indian’s- Cash, Credit cards, Debit cards, E-Payment.
ORDINAL
ordinal data are the type of data in which the values follow a natural order. One of the
most notable features of ordinal data is that the differences between the data values
cannot be determined or are meaningless. Generally, the data categories lack the
width representing the equal increments of the underlying attribute.
For example- Preferred ranking of transportation, you may have in the categories of car,
bus, train, tram or bicycle in ranks.
Preferred ranking of transaction by Indian’s- Cash, Credit cards, Debit cards, E-
Payment in ranks.
Ordinal data is data that can be labelled ,classified, ordered and ranked.
Types of statistical data-
meanings(nominal vs ordinal)
ORDINAL SCALE EXAMPLE
Types of statistical data-
meanings(discrete & continious)
DISCRETE
It constitutes a separate entity that Constitutes of distinct or unconnected elements or non
continuous.
For example-
• Number of shares held by a person could be 5,10,or 15 but not 15.555, 20.05 etc..
• Number of times India was hit by the COVID waves- 1, 2, or 3 but not 1.3,or 2.66 or
1.57 etc.
CONTINIOUS
It can take any value between two numbers, no matter how small. The measure can be any value
on the scale.
For Example-
• People's heights could be any value (within the range of human heights), not just certain
fixed heights.
• The time taken by Ushan Bolt to complete a race.
Types of statistical data-
meanings(discrete vs continious)
Both data types are important for statistical analysis. However, some major
differences need to be noted before drawing any conclusions or making decisions. The
key differences are:
Discrete data is the type of data that has clear spaces between values. Continuous
data is data that falls in a constant sequence.
To accurately represent discrete data, the bar graph is used. Histogram or line
graphs are used to represent continuous data graphically. A diagram of the discrete
function shows a distinct point that remains unconnected. While in a continuous
function graph, the points are connected with an unbroken line.
Discrete data contains distinct or separate values. Continuous data includes any
value within the preferred range.
Types of statistical data-
meanings(discrete vs continious)
Types of statistical data-qualitative &
quantitative
NOMINAL
Nominal data is data that can be labeled or classified but cannot be ordered in a meaningful way. So it involves
different categorization but not meaningful ordering.
For example-
• Preferred mode of transportation, you may have the categories of car, bus, train, tram or bicycle.
• Preferred mode of transaction by Indian’s- Cash, Credit cards, Debit cards, E-Payment.
QUALITATIVE
Qualitative means relating to the nature or standard of something, rather than to its quantity. involving or
relating to distinctions based on quality or qualities. Thus Qualitative data provide only lebels or names for
categories of elements under study. The data could be identified by non-numeric description or numeric
codes.
For example-
The quality of Indian Intellectual resource is being praised world wide.
QUALITATIVE
Qualitative means relating to the nature or standard of something, rather than to its quantity.
involving or relating to distinctions based on quality or qualities. Thus Qualitative data provide
only lebels or names for categories of elements under study. The data could be identified by
non-numeric description or numeric codes.
For example-
The quality of Indian Intellectual resource is being praised world wide .
QUANTITATIVE
Involving or relating to considerations of amount or size. Capable of being
measured. These are expressed in meaningful numbers that represents
quantity of some characteristics possessed by an element. So, the
QUANTITATIVE data indicate how much, how many.
For example- The quantity of grains exported in a single year(in Metric tonnes).
Univariate analysis is the simplest form of analyzing data. Uni means one, so in
other words the data has only one variable.Univariate data requires to analyze
each variable separately. Data is gathered for the purpose of answering a question.
Like all the other data, univariate data can be visualized using graphs, images or
other analysis tools after the data is measured, collected, reported, and analyzed.
Types of statistical data-univariate data
Types of statistical data-univariate data
Some univariate data consists of numbers (such as the height of 65 inches or the
weight of 100 pounds), while others are nonnumerical (such as eye colors of brown or
blue). Generally, the terms categorical univariate data and numerical univariate data
are used to distinguish between these types.
Categorical univariate data
Categorical univariate data consists of non-numerical observations that may be placed
in categories. It includes labels or names used to identify an attribute of each element.
Categorical univariate data usually use either nominal or ordinal scale of measurement.
Numerical univariate data
Numerical univariate data consists of observations that are numbers. This type of
univariate data can be classified even further into two
subcategories: discrete and continuous. A numerical univariate data is discrete if the
set of all possible values is finite or countably infinite. Discrete univariate data are
usually associated with counting (such as the number of books read by a person). A
numerical univariate data is continuous if the set of all possible values is an interval of
numbers. Continuous univariate data are usually associated with measuring (such as
the weights of people).
Types of statistical data-univariate
data(Examples)
Data for two variables (usually two types of related data).The data sets that involve
information on two characteristics for each subjects.
Example: Ice cream sales versus the temperature on that day. The two variables are
Ice Cream Sales and Temperature.
(If only one set of data is there, such as just Temperature, it is called "Univariate
Data")
In statistics, bivariate data is data on each of two variables, where each value of one
of the variables is paired with a value of the other variable. Typically it would be of
interest to investigate the possible association between the two variables. The
association can be studied via a tabular or graphical display, or via sample statistics
which might be used for inference.
The Bi-variate data gives rise to the concept of dependent and independent
variables.
Types of statistical data-Bivariate
data(Examples)
Types of statistical data-Bivariate
data(Examples)
These data points typically consist of successive measurements made from the same
source over a time interval and are used to track change over time.
Cross-sectional data are the result of a data collection, carried out at a single point
in time.
With cross-sectional data, the main interest doesn’t lies in the change of data over
time, but in the current time.
Each scale is an incremental level of measurement, meaning, each scale fulfills the function
of the previous scale, and all survey question scales such as Likert, Semantic
Differential, Dichotomous, etc, are the derivation of this these 4 fundamental levels of
variable measurement.
Nominal scale is a naming scale, where variables are simply “named” or labeled, with no
specific order. Ordinal scale has all its variables in a specific order, beyond just naming them.
Interval scale offers labels, order, as well as, a specific interval between each of its variable
options. Ratio scale bears all the characteristics of an interval scale, in addition to that, it can
also accommodate the value of “zero” on any of its variables.
Types of measurement scales
Nominal
NominalScales
Scales
Ordinal
Ordinal(Ranking)
(Ranking)Scales
Scales
Interval
IntervalScales
Scales
Ratio
RatioScales
Scales
Types of measurement scales-nominal scale
The numbers serve only as labels or tags for
identifying and classifying objects.
When used for identification, there is a strict one-to-
one correspondence between the numbers and the
objects.
The numbers do not reflect the amount of the
characteristic possessed by the objects.
The only permissible operation on the numbers in a
nominal scale is counting.
Only a limited number of statistics, all of which are
based on frequency counts, are permissible, e.g.,
percentages, and mode.
Types of measurement scales-nominal
scale(examples)
Are you a management graduate?
Yes-1
No-2
0 to 10 Scale
15.2 14.1 13.4
Ratio Time to
Finish, in
Seconds.
Primary & secondary data.
Primary & secondary data.
• Primary data: Information that is developed or
gathered by the researcher specifically for the
research project at hand.
1. OBSERVATION METHOD
2. INTERVIEW METHOD
3. QUESTIONNAIRE METHOD
4. DISTRIBUTOR AUDITS.
5. CONSUMER PANELS
6. DEPTH ANALYSIS
7. CONTENT ANALYSIS
Primary data sources.
OBSERVATION METHOD
The main characteristics of observation method are as follows:-
1. The information is sought by the investigators’ own direct
observation without asking from the respondent.
2. The information obtained under the method relates to what
is currently happening.
3. This technique is not complicated by the past behavior or
future attentions or attitudes.
4. This method is independent of the respondents
Primary data sources.
INTERVIEW METHOD
The main characteristics of interview Method are as
follows:-
The main characteristics of Questionnaire are:
1.Here the specially appointed enumerators go for
recording the answers of the questions.
2.Enumerators explain the meaning of the particular
questions and helps to answer it.
3.The questions may be structured or unstructured.
Primary data sources.
QUESTIONNAIRE METHOD
The main characteristics of interview Method are as
follows:-
1.In these method questionnaire is send to the persons
concerned with the request to answer the questions
and returned to the asker.
2.The questioners are mailed to the respondents.
3.The respondents have to answer the questions by
their own.
4.The method is extensively used in different economic
and business surveys.
Primary data sources.
DISTRIBUTOR AUDITS.
Census
Gathers information from every individual in a certain
group.
Since data from the entire population is used, there is no
sampling variance.
Provides detailed information about smaller groups.
Can be quite costly, particularly for large populations, due
to census tally workers as well as hiring temporary census
home visitors.
Includes an uncomfortable visit from a government
worker if the census is not filled out on time.
Census vs sampling.
Sampling
Gathers information from only a section of the
population
May have a significant degree of sample variance,
since the data is derived from only a small section
of a population
May not provide enough information about
smaller groups or smaller geographical sections
of a place
Costs much less than a census, since data is
gathered from only a small section of a group
THANK YOU