0% found this document useful (0 votes)

29 views45 pages

Collection of Data Part 2 Edited MLIS

This document defines key terms used in statistics and data analysis, including variables, data, experiments, parameters, and statistics. It provides examples to illustrate these concepts, distinguishing between population and sample, and qualitative and quantitative variables that can be nominal, ordinal, discrete, or continuous. It also discusses methods for collecting and organizing data through frequency distributions shown in tables, histograms, polygons, bar graphs, and smooth curves. The shape of distributions is addressed.

Uploaded by

Whieslyn Cole

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views45 pages

Collection of Data Part 2 Edited MLIS

Uploaded by

Whieslyn Cole

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 45

Variable: A characteristic about each

individual element of a population or sample.

Data (plural): The set of values collected for
the variable from each of the elements
belonging to the sample.
Experiment: A planned activity whose results
yield a set of data.
Parameter: A numerical value summarizing all
the data of an entire population.
Statistic: A numerical value summarizing the
sample data.
EXAMPLE: A COLLEGE DEAN IS INTERESTED IN
LEARNING ABOUT THE AVERAGE AGE OF
FACULTY. IDENTIFY THE BASIC TERMS IN THIS
SITUATION.

Population
The age of all faculty members at the college.
Sample
Any subset of that population. Like, we might
select 10 faculty members and determine their age.
Variable
the “age” of each faculty member.
EXAMPLE: A COLLEGE DEAN IS INTERESTED IN
LEARNING ABOUT THE AVERAGE AGE OF FACULTY.
IDENTIFY THE BASIC TERMS IN THIS SITUATION.

Data
It would be the age of a specific faculty member.
Data
 It would be the set of values in the sample.
EXAMPLE: A COLLEGE DEAN IS INTERESTED IN
LEARNING ABOUT THE AVERAGE AGE OF FACULTY.
IDENTIFY THE BASIC TERMS IN THIS SITUATION.

Experiment
The method used to select the ages forming the
sample and determining the actual age of each faculty
member in the sample.
EXAMPLE: A COLLEGE DEAN IS INTERESTED IN
LEARNING ABOUT THE AVERAGE AGE OF FACULTY.
IDENTIFY THE BASIC TERMS IN THIS SITUATION.

Parameter
The “average” age of all faculty at the college.
Statistic
The “average” age for all faculty in the sample.
Two kinds of variables:
Qualitative, or Attribute, or Categorical,
Variable:
Quantitative, or Numerical, Variable:
Two kinds of variables:
Qualitative, or Attribute, or Categorical,
Variable: A variable that categorizes or
describes an element of a population.
Note: Arithmetic operations, such as addition
and averaging, are not meaningful for data
resulting from a qualitative variable.
Two kinds of variables:
Quantitative, or Numerical, Variable: A
variable that quantifies an element of a
population.
Note: Arithmetic operations such as addition
and averaging, are meaningful for data
resulting from a quantitative variable.
Example: Identify each of the following examples as
attribute (qualitative) or numerical (quantitative)
variables.
 The residence hall for each student in a statistics class.
 (Attribute)
 The amount of gasoline pumped by the next 10
customers at the local Savemore.
 (Numerical)
 The amount of radon in the basement of each of 25
homes in a new development.
 (Numerical)
Example: Identify each of the following examples as
attribute (qualitative) or numerical (quantitative)
variables.
The color of the baseball cap worn by each of 20
students.
(Attribute)
The length of time to complete a mathematics
homework assignment.
(Numerical)
The state in which each truck is registered when
stopped and inspected at a weigh station.
(Attribute)
Qualitative and quantitative variables may be further
subdivided:

Nominal
Qualitative
Ordinal
Variable
Discrete
Quantitative
Continuous
Nominal Variable: A qualitative variable that
categorizes (or describes, or names) an element of a
population.
Nominal scales are used for labeling
variables, without any quantitative value.
“Nominal” scales could simply be called
“labels.”
Ordinal Variable: A qualitative variable that
incorporates an ordered position, or ranking.
-With ordinal scales, it is the order of the
values is what’s important and significant, but
the differences between each one is not really
known.
-Ordinal scales are typically measures of non-numeric
concepts like satisfaction, happiness, discomfort, etc.
-Advanced note: The best way to determine central
tendency on a set of ordinal data is to use the mode or
median; the mean cannot be defined from an ordinal
set.
 Discrete Variable: A quantitative variable that
can assume a countable number of values.
Intuitively, a discrete variable can assume
values corresponding to isolated points along a
line interval. That is, there is a gap between any
two values.
 Discrete Data can only take certain values.
 Example:
 1. the number of students in a class
 2. the results of rolling 2 dice
Continuous Variable: A quantitative variable that can assume
an uncountable number of values. Intuitively, a continuous
variable can assume any value along a line interval, including
every possible value between any two values. Continuous Data
can take any value (within a range) Examples:
A person's height: could be any value (within the range of human
heights), not just certain fixed heights,
Time in a race: you could even measure it to fractions of a
second,
A dog's weight,
The length of a leaf,
 Collecting Data
1. Data from a designed of experiment (primary
data)
2. Data from a survey (primary data)
3. Data from an observational study (primary
data)
4. Data from a published source (secondary data)
 Definition :Representative Sample:
 A representative sample exhibits characteristics
typical of those possessed by the target population.
 The most common way to satisfy the representative
sample requirement is to select a random sample.
 A random sample ensures that every subset of fixed
size in the population has the same chance of being
included in the sample.
 Definition : Random Sample:

 A random sample of n experimental units

is a sample selected from the population
in such a way that every different sample
of size n has an equal chance of selection.
Collection of Data

 Statistics very often involves the collection of data.

There are many ways to obtain data, and the World
Wide Web is one of them. The advantages and
disadvantages of common data collecting method
are discussed below.
Chapter 2: Frequency
Distributions
24
Frequency Distributions

 After collecting data, the first task for a

researcher is to organize and simplify the
data so that it is possible to get a general
overview of the results. This is the goal of
descriptive statistical techniques.

 One method for simplifying and organizing data

is to construct a frequency distribution.

25
Frequency Distributions (cont.)

 A frequency distribution is an organized

tabulation showing exactly how many
individuals are located in each category on
the scale of measurement.
 A frequency distribution presents an
organized picture of the entire set of
scores, and it shows where each individual
is located relative to others in the
distribution.

26
FREQUENCY DISTRIBUTIONS
(CONT.)

A table that organizes data values into classes

or intervals along with number of values that
fall in each class (frequency, f ).
1. Ungrouped Frequency Distribution – for
data sets with few different values. Each
value is in its own class.

2. Grouped Frequency Distribution: for data

sets with many different values, which
are grouped together in the classes.
Grouped and Ungrouped
Frequency Distributions
Ungrouped Grouped

Courses Frequency, f Age of Frequency, f

Taken Voters
1 25 18-30 202
2 38 31-42 508
3 217 43-54 620
4 1462 55-66 413
5 932 67-78 158
6 15 78-90 32
Frequency Distribution Graphs

 In a frequency distribution graph, the score categories (X

values) are listed on the X axis and the frequencies are
listed on the Y axis.
 When the score categories consist of numerical scores
from an interval or ratio scale, the graph should be
either a histogram or a polygon.
Histograms

 In a histogram, a bar is centered above each score (or class

interval) so that the height of the bar corresponds to the
frequency and the width extends to the real limits, so that
adjacent bars touch.
Polygons

 In a polygon, a dot is centered above each score so that

the height of the dot corresponds to the frequency. The
dots are then connected by straight lines. An additional
line is drawn at each end to bring the graph back to a zero
frequency.

32
Bar graphs

 When the score categories (X values) are

measurements from a nominal or an ordinal scale, the
graph should be a bar graph.
 A bar graph is just like a histogram except that gaps
or spaces are left between adjacent bars.

34
Smooth curve
 If the scores in the population are measured on an
interval or ratio scale, it is customary to present the
distribution as a smooth curve rather than a jagged
histogram or polygon.
 The smooth curve emphasizes the fact that the
distribution is not showing the exact frequency for
each category.

36
Frequency distribution graphs

 Frequency distribution graphs are useful because they

show the entire set of scores.
 At a glance, you can determine the highest score, the
lowest score, and where the scores are centered.
 The graph also shows whether the scores are clustered
together or scattered over a wide range.

38
Shape
A graph shows the shape of the distribution.
A distribution is symmetrical if the left side of the
graph is (roughly) a mirror image of the right side.
One example of a symmetrical distribution is the bell-
shaped normal distribution.
On the other hand, distributions are skewed when
scores pile up on one side of the distribution, leaving a
"tail" of a few extreme values on the other side.

39
Positively and Negatively
Skewed Distributions
 In a positively skewed distribution, the scores tend to
pile up on the left side of the distribution with the tail
tapering off to the right.
 In a negatively skewed distribution, the scores tend
to pile up on the right side and the tail points to the
left.

40
Time Series
(Paired data)

Time Series
 Data set is composed of quantitative entries taken at regular
intervals over a period of time.
 e.g., The amount of precipitation measured each day for
one month.
 Use a time series chart to graph.

Quantitative
data
time
Time-Series Graph
Number of Screens at Drive-In Movies
Theaters

Figure 2-8
44 Graphing Qualitative Data Sets

Pie Chart
 A circle is divided into sectors that
represent categories.

Pareto Chart
• A vertical bar graph in which the
height of each bar represents
frequency or relative frequency.

Frequency

Categories
Constructing Pareto Charts
 Create a bar for each category, where the height of the bar can
represent frequency or relative frequency.
 The bars are often positioned in order of decreasing height,
with the tallest bar positioned at the left.

Statistics: a QuickStudy Laminated Reference Guide
From Everand
Statistics: a QuickStudy Laminated Reference Guide
BarCharts Publishing, Inc.
No ratings yet
Mass and Energy Balances - Basic Principles For Calculation, Design, and Optimization of Macro - Nano Systems
100% (8)
Mass and Energy Balances - Basic Principles For Calculation, Design, and Optimization of Macro - Nano Systems
276 pages
Stats For PGDM
No ratings yet
Stats For PGDM
52 pages
Lecture 01 Introduction To Statistics PPT 06022025 095924am
No ratings yet
Lecture 01 Introduction To Statistics PPT 06022025 095924am
40 pages
Statistics - Basic Concepts
No ratings yet
Statistics - Basic Concepts
29 pages
Data Types: and Its Representation Session - 2 & 3
No ratings yet
Data Types: and Its Representation Session - 2 & 3
33 pages
Introduction Book 1
No ratings yet
Introduction Book 1
41 pages
Introduction To Stati Stics: There Are Three Kinds of Lies: Lies, Damned Lies, A ND Statistics." (B.Disraeli)
No ratings yet
Introduction To Stati Stics: There Are Three Kinds of Lies: Lies, Damned Lies, A ND Statistics." (B.Disraeli)
39 pages
Lesson 1: Engineering Data Analysis First Semester - A.Y. 2021 - 2022
100% (1)
Lesson 1: Engineering Data Analysis First Semester - A.Y. 2021 - 2022
4 pages
Intro To Statistics Lecture
No ratings yet
Intro To Statistics Lecture
41 pages
M 301 - Ch1 - Introduction To Statistics
No ratings yet
M 301 - Ch1 - Introduction To Statistics
96 pages
Introduction To Statistics: There Are Three Kinds of Lies: Lies, Damned Lies, and Statistics." (B.Disraeli)
No ratings yet
Introduction To Statistics: There Are Three Kinds of Lies: Lies, Damned Lies, and Statistics." (B.Disraeli)
26 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
39 pages
Course Introduction Inferential Statistics Prof. Sandy A. Lerio
No ratings yet
Course Introduction Inferential Statistics Prof. Sandy A. Lerio
46 pages
Data Management (1)
No ratings yet
Data Management (1)
46 pages
Biostatistics Notes-Numbered
No ratings yet
Biostatistics Notes-Numbered
21 pages
Unit 1 - Examining Distributions
No ratings yet
Unit 1 - Examining Distributions
80 pages
Introduction To Statistics
No ratings yet
Introduction To Statistics
45 pages
Statistic Reviewer
No ratings yet
Statistic Reviewer
9 pages
Lecture No 01 Statistics 13-2-24
No ratings yet
Lecture No 01 Statistics 13-2-24
34 pages
Introduction To Statistics Presentation of Data
No ratings yet
Introduction To Statistics Presentation of Data
20 pages
Topic 1 Descriptive Statistics SV
No ratings yet
Topic 1 Descriptive Statistics SV
113 pages
MAT 361 Lecture 15 16
No ratings yet
MAT 361 Lecture 15 16
40 pages
Nature of Statistics Part 2
No ratings yet
Nature of Statistics Part 2
48 pages
Statistics Review
No ratings yet
Statistics Review
59 pages
3rd QTR Stats Reviewer
No ratings yet
3rd QTR Stats Reviewer
24 pages
Ns Statistics 2022
No ratings yet
Ns Statistics 2022
70 pages
Stats Reviewer
No ratings yet
Stats Reviewer
5 pages
Part1 141104090445 Conversion Gate01
No ratings yet
Part1 141104090445 Conversion Gate01
27 pages
SLIDES Statistics-Chapter 2
No ratings yet
SLIDES Statistics-Chapter 2
31 pages
Introduction To Statistics: "There Are Three Kinds of Lies: Lies, Damned Lies, and Statistics." (B.Disraeli)
No ratings yet
Introduction To Statistics: "There Are Three Kinds of Lies: Lies, Damned Lies, and Statistics." (B.Disraeli)
32 pages
Statistics A Review
No ratings yet
Statistics A Review
47 pages
PROBABILITY Lecture 1 - 2 - 3
No ratings yet
PROBABILITY Lecture 1 - 2 - 3
63 pages
Lecture 1
No ratings yet
Lecture 1
28 pages
Engineering Data Analysis
No ratings yet
Engineering Data Analysis
4 pages
Introduction To Statistics and SPSS
100% (1)
Introduction To Statistics and SPSS
110 pages
1 Biostatistics LECTURE 1
100% (1)
1 Biostatistics LECTURE 1
64 pages
Data Management
No ratings yet
Data Management
44 pages
Math Notes Module 4A
No ratings yet
Math Notes Module 4A
4 pages
Review of Statistical Concepts
No ratings yet
Review of Statistical Concepts
60 pages
Basic Concepts in Statistics
No ratings yet
Basic Concepts in Statistics
42 pages
Biostatics For Nurses
No ratings yet
Biostatics For Nurses
74 pages
Emdad Rahman
No ratings yet
Emdad Rahman
85 pages
Stat 2017
No ratings yet
Stat 2017
397 pages
What Is Statistics?: "Statistics Is A Way To Get Information From Data"
No ratings yet
What Is Statistics?: "Statistics Is A Way To Get Information From Data"
220 pages
Trust Wallet Spamming
No ratings yet
Trust Wallet Spamming
50 pages
ABE 322 Sta Class 1-2
No ratings yet
ABE 322 Sta Class 1-2
35 pages
1 Descriptive Part
No ratings yet
1 Descriptive Part
13 pages
Math 5
No ratings yet
Math 5
3 pages
Biostatistics Biochemistry 1
No ratings yet
Biostatistics Biochemistry 1
22 pages
1st Mid
No ratings yet
1st Mid
19 pages
Ae 9 Reviewer
No ratings yet
Ae 9 Reviewer
7 pages
Basic Statistical Concepts - Measures of Location
No ratings yet
Basic Statistical Concepts - Measures of Location
14 pages
Basic and Valuable Concepts of Statistics
No ratings yet
Basic and Valuable Concepts of Statistics
16 pages
Physics
No ratings yet
Physics
6 pages
365 Data Science - Statistics: Glossary Section Lesson Word
No ratings yet
365 Data Science - Statistics: Glossary Section Lesson Word
5 pages
Sta 131 Complete Note
No ratings yet
Sta 131 Complete Note
33 pages
Statistics I Essentials
From Everand
Statistics I Essentials
Emil G. Milewski
No ratings yet
Image Histogram: Unveiling Visual Insights, Exploring the Depths of Image Histograms in Computer Vision
From Everand
Image Histogram: Unveiling Visual Insights, Exploring the Depths of Image Histograms in Computer Vision
Fouad Sabry
No ratings yet
Descriptive Statistics: Six Sigma Thinking, #3
From Everand
Descriptive Statistics: Six Sigma Thinking, #3
Sumeet Savant
No ratings yet
Introduction To Business Statistics Through R Software: Software
From Everand
Introduction To Business Statistics Through R Software: Software
Editor IJSMI
No ratings yet
Chi Squareedited
No ratings yet
Chi Squareedited
39 pages
Collection Development Plan Template and Guide
100% (1)
Collection Development Plan Template and Guide
11 pages
When To Use Descriptive Statistics - Part 3 - MLIS
No ratings yet
When To Use Descriptive Statistics - Part 3 - MLIS
55 pages
Anova
No ratings yet
Anova
32 pages
406d PDF
No ratings yet
406d PDF
6 pages
00-Qe20-00014 Rev B - Draf 021625
No ratings yet
00-Qe20-00014 Rev B - Draf 021625
9 pages
물리 교재 28단원
No ratings yet
물리 교재 28단원
26 pages
Bda Important Questions
100% (1)
Bda Important Questions
4 pages
Two Radii and A Chord Make An Isosceles Triangle
No ratings yet
Two Radii and A Chord Make An Isosceles Triangle
3 pages
Aryabhatta 2021 Class VIII QP PDF
No ratings yet
Aryabhatta 2021 Class VIII QP PDF
14 pages
G2 Bayesian Analysis
No ratings yet
G2 Bayesian Analysis
4 pages
1st Year Honours Syllabus Statistics Physics
No ratings yet
1st Year Honours Syllabus Statistics Physics
16 pages
Ch04 Cost Volume Profit Analysis
0% (1)
Ch04 Cost Volume Profit Analysis
21 pages
Polarization Through Quarter
No ratings yet
Polarization Through Quarter
10 pages
Experiment 2.4 DL
No ratings yet
Experiment 2.4 DL
4 pages
02 Chapter 3 - Weight Volume Relationships
No ratings yet
02 Chapter 3 - Weight Volume Relationships
42 pages
Trigonometry Sheet - 05
No ratings yet
Trigonometry Sheet - 05
10 pages
10 General Aptitude - GQB (Ddpanda)
No ratings yet
10 General Aptitude - GQB (Ddpanda)
71 pages
Maintaining Test Methods in The User's Laboratory: Standard Guide For
No ratings yet
Maintaining Test Methods in The User's Laboratory: Standard Guide For
4 pages
Asset Pricing
No ratings yet
Asset Pricing
23 pages
Submitted in Partial Fulfilment For The Award of Degree of
No ratings yet
Submitted in Partial Fulfilment For The Award of Degree of
13 pages
Take Home Section Logarithms
No ratings yet
Take Home Section Logarithms
6 pages
Partial Differential Equation Part C Upto 21oct
No ratings yet
Partial Differential Equation Part C Upto 21oct
7 pages
G7 Q1 Week 01
No ratings yet
G7 Q1 Week 01
8 pages
ChE 3323 Syllabus 2016
No ratings yet
ChE 3323 Syllabus 2016
5 pages
Semantic Danielou-53: (4th of October 2014)
100% (1)
Semantic Danielou-53: (4th of October 2014)
20 pages
FYP Final Report
No ratings yet
FYP Final Report
40 pages
Quadratic Equation - Arjuna Jee 2.0 2025
No ratings yet
Quadratic Equation - Arjuna Jee 2.0 2025
15 pages
RelaySimTest Brochure ENU
No ratings yet
RelaySimTest Brochure ENU
8 pages
Math-12th Sample Question Papers (Solved) 2024-25
No ratings yet
Math-12th Sample Question Papers (Solved) 2024-25
21 pages
6625 ImproveTransmission VM 20131021 Web
No ratings yet
6625 ImproveTransmission VM 20131021 Web
13 pages
Physics Electrostatics MCQ
No ratings yet
Physics Electrostatics MCQ
8 pages
EDAN
No ratings yet
EDAN
2 pages

Collection of Data Part 2 Edited MLIS

Uploaded by

Collection of Data Part 2 Edited MLIS

Uploaded by

Variable: A characteristic about each

individual element of a population or sample.

 A random sample of n experimental units

 Statistics very often involves the collection of data.

 After collecting data, the first task for a

 One method for simplifying and organizing data

 A frequency distribution is an organized

A table that organizes data values into classes

2. Grouped Frequency Distribution: for data

Courses Frequency, f Age of Frequency, f

 In a frequency distribution graph, the score categories (X

 In a histogram, a bar is centered above each score (or class

 In a polygon, a dot is centered above each score so that

 When the score categories (X values) are

 Frequency distribution graphs are useful because they

You might also like