B.ed 1.5 Year (Allama Iqbal Open University) B.ed 1.5 Year (Allama Iqbal Open University)
B.ed 1.5 Year (Allama Iqbal Open University) B.ed 1.5 Year (Allama Iqbal Open University)
8614
The instructor may see that the average exam score of her class is
gradually declining over the course of the year just by looking at this
bar chart. This can enable the teacher to enhance her teaching
techniques or possibly provide her students a survey to seek for
feedback.
Q.2 Discuss different types of data. Also elaborate difference between
primary and secondary data?
Answer
Data:
Data are the facts and figures that are collected, analyzed, and
summarized for presentation and interpretation. Data may be classified
as either quantitative or qualitative.
In the discipline of data science, data is crucial. Large-scale experiments
using raw data are conducted to produce insightful results that support
numerous business objectives. Having a better understanding of the
various statistical data kinds can help you choose the one that best
meets your needs.
Types of data:
The quantitative data can be classified into two different types based on
the data sets. The two different classifications of numerical data are
discrete data and continuous data.
Discrete Data:
Discrete data can take only discrete values. Discrete information
contains only a finite number of possible values. Those values cannot be
subdivided meaningfully. Here, things can be counted in whole
numbers.
Example: Number of students in the class
Continuous Data:
Continuous data is data that can be calculated. It has an infinite number
of probable values that can be selected within a given specific range.
Example: Temperature range
Types of data in research:
In research, different methods are used to collect data, all of which fall
into two categories, i.e. primary data and secondary data. It is a common
classification based upon who collected the data.
Primary data:
For the first time, the researcher creates primary data to solve his study
challenge. It also goes by the name first-hand raw data. Numerous
techniques can be used to gather the data, including surveys,
observations, physical examinations, postal questionnaires,
questionnaires completed and sent by enumerators, in-person and
telephone interviews, focus groups, case studies, etc.
Secondary data:
Any dataset gathered by a party other than the one using it is referred to
as secondary data (also known as second-party data). Secondary sources
of information are quite helpful. They make it possible for researchers
and data analysts to create sizable, superior databases that aid in the
resolution of corporate issues. Examples of secondary data are censuses
data, publications, internal records of the organizations, reports, books,
journal articles, websites etc.
Difference between primary and secondary data:
Primary data Secondary data
1. Primary data refers to the data Secondary data is already
originated by the researcher for existing data, collected by other
the first time. researchers, agencies, and
organizations
2. Primary data is real time data Secondary data relates to past
3. Primary data is collected to The purpose behind collection of
address the problem in hand secondary data is different from
the problem in hand
4. Collection of primary data is a Collection of secondary data is
laborious process. easy and rapid.
5. Sources of primary data are Sources of secondary are
survey, observations, physical censuses data, publications,
testing, mailed questionnaire, internal records of the
questionnaire filled and sent by organizations, reports, books,
enumerators, personal journal articles, websites etc.
interviews, telephonic
interviews, focus groups
discussion, case studies, etc.
6. Collection of primary data Collection of secondary data is
requires a large amount of expensive and easily available
resources like time, cost, and
human resources.
7. Primary data is specific to the Secondary data is neither specific
researcher’s needs. He can to researcher needs nor has he
control the quality of research. control over the quality of data
8. Primary data is available in the Secondary data has undergone
raw form. some statistical procedures and is
refined from primary data.
Q.4 Pie Chart is common way to depict data. Discuss its usage and
drawbacks?
Answer
Pie chart:
A pie chart is a type of graph that represents the data in the circular
graph. The slices of pie show the relative size of the data, and it is a type
of pictorial representation of data. A pie chart requires a list of
categorical variables and numerical variables. Here, the term “pie”
represents the whole, and the “slices” represent the parts of the whole.
The “pie chart” is also known as a “circle chart”, dividing the circular
statistical graphic into sectors or sections to illustrate the numerical
problems. Each sector denotes a proportionate part of the whole. To find
out the composition of something, Pie-chart works the best at that time.
In most cases, pie charts replace other graphs like the bar graph, line
plots, histograms, etc.
Types:
Donut Plot
A donut plot is a variation of the pie chart where the center portion of
the chart is removed.
3D Pie Chart
The 3D form is often used to improve the aesthetics of the pie chart.
Formula:
The pie chart is an important type of data representation. It contains
different segments and sectors in which each segment and sector of a pie
chart forms a specific portion of the total(percentage). The sum of all the
data is equal to 360°.
The total value of the pie is always 100%.
To work out with the percentage for a pie chart, follow the steps given
below:
Categorize the data
Calculate the total
Divide the categories
Convert into percentages
Finally, calculate the degrees
Uses:
A pie chart is a circle that is divided into areas, or slices. Each slice
represents the count or percentage of the observations of a level for the
variable. Pie charts are often used in business. Examples include
showing percentages of types of customers, percentage of revenue from
different products, and profits from different countries. Pie charts can be
helpful for showing the relationship of parts to the whole when there are
a small number of levels.
Advantages of a Pie Chart:
.A simple and easy-to-understand picture.
.It represents data visually as a fractional part of a whole, which can be
an effective communication tool for the even uninformed audience.
.It enables the audience to see a data comparison at a glance to make an
immediate analysis or to understand information quickly.
.The need for readers to examine or measure underlying numbers
themselves can be removed by using this chart.
.To emphasize points you want to make, you can manipulate pieces of
data in the pie chart.
Drawbacks of using Pie Chart:
.Pie charts lose their effectiveness when they contain too many pieces of
information.
.If there are too many pieces of data, they may become congested and
difficult to understand, and even adding data labels and numbers may
not help.
.As this chart only displays one data set, you will need a series to
compare multiple sets.
.This may make it harder for readers to quickly comprehend and digest
material
.Comparing the data slices presents challenges because the reader must
account for angles and contrast non-adjacent slices.
.Readers draw incorrect conclusions when decisions are based on visual
impact rather than data analysis.
Q.5 What do you understand by ‘measure of dispersion’? Also
briefly discuss some common measures of dispersion?
Answer:
Dispersion:
Dispersion refers to the ‘distribution’ of objects over a large region. The
degree to which numerical data are dispersed or squished around an
average value is referred to as dispersion in statistics. It is, in a nutshell,
the dispersion of data. A vast amount of data will always be widely
dispersed or firmly packed. Data that is widely dispersed – 0, 30, 60, 90,
120, With tiny data grouped densely – 1, 2, 2, 3, 3, 4, 4…
Understanding Dispersion:
The term “dispersion” refers to how dispersed a set of data is. The
measure of dispersion is always a non-negative real number that starts at
zero when all the data is the same and rises as the data gets more varied.
The homogeneity or heterogeneity of the scattered data is defined by
dispersion measures. It also refers to how data differs from one another.
Measures of Dispersion:
1.Range:
Range refers to the difference between each series’ minimum and
maximum values. The range offers us a good indication of how
In simple words, standard deviation tells how tightly all the scores are
clustered around the mean in a data set. When the scores are close to the
mean, standard deviation is small. And large standard deviation tells that
scores are spread apart. Standard deviation is simply square root of
variance, i.e.
Standard deviation Ϭ = √ Variance
Or
Ϭ = √ Ʃ (X – X)2 / n
Ϭ is a Greek letter “Sigma”
Merits of Standard Deviation
• Squaring the deviations overcomes the drawback of ignoring signs
in mean deviations
• Suitable for further mathematical treatment
• Least affected by the fluctuation of the observations
• The standard deviation is zero if all the observations are constant
• Independent of change of origin
Demerits of Standard Deviation
• Not easy to calculate
• Difficult to understand for a layman
• Dependent on the change of scale
Varience(σ2):
In simple terms, the variance can be calculated by obtaining the sum of
the squared distance of each term in the distribution from the Mean, and
then dividing this by the total number of the terms in the distribution.
It basically shows how far a number, for example, a student’s mark in an
exam, is from the Mean of the entire class.
Formula:
(σ2) = ∑ ( X − μ)2 / N