0% found this document useful (0 votes)

81 views

Data Handling

This document discusses statistics and data collection and representation. It covers topics such as samples and populations, categorical and numerical data, and measures of central tendency. Specifically, it defines key statistical terms like population, sample, and inference. It provides examples of how to organize categorical data using dot plots, tally tables, and graphs. It also demonstrates how to display numerical data using stem-and-leaf plots and discusses calculating the mean or average of a data set.

Uploaded by

Jose Arturo Gonzalez

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

81 views

Data Handling

Uploaded by

Jose Arturo Gonzalez

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 39

DATA COLLECTION &

REPRESENTATION
(STATISTICS)
TEACHER: JOSE ARTURO GONZALEZ
THINKING PROBLEM
Manuel and Santiago play for the same basketball team. Unfortunately, during practice Manuel
suffered an injury and could only play half the season. The points scored by both boys in each
match were:

Manuel: 17,21,15,23,18,12,27,15,22,31,28,25
Santiago: 19,19,13,10,15,15,24,18,26,27,23,13,20,24,18,26,19,25,8,26,21,23,26,19

Which player’s performance was better?

Things to think about:

• Would it be fair to simply total the points each player scored for the season?
• How could we display the data in a meaningful way?
• What would be the “best” way to solve the problem?
INTRODUCTION
Statistics deals with the collection, organization, display, analysis and interpretation of data.
Individuals and many groups such as businesses and government agencies collect data. This data is then
transformed into information that will later be used to determine whether changes are needed, or if
changes that have been made were successful.
For example, governments every so often perform what is known as a census. A census is conducted to
gather data from all of the country’s population. Once this data is transformed into information it is
used to help make decisions which will affect that country’s future. The government might have to
consider how much money it needs to be provided for health care in the years ahead because the
number of elderly people has increased.
Thinking exercise:
• What might a government decide needs to be done if the country’s birth rate has increased?
• What might a government decide needs to be done if the country’s birth rate has decreased and
it’s elderly population is increasing?
Usually the results of the collection and interpretation of data is displayed by using graphs, tables and
diagrams.
SAMPLES AND POPULATIONS
Important words used in statistics:
Population: Refers to the whole group of people or objects from where we are collecting data.

Sample: A representative group chosen from the population to take part in the survey, be measured,
or be tested.
Random Sample: A sample selected so that any person or object has the same possibility of being
selected than any other.

Inference: A conclusion we can make based on the information that was collected and interpreted.

Consider the following example, suppose we want to determine how many students at CCB like vanilla
ice cream. What could the population be? How could we select a sample? What might an inference be?
SAMPLES AND POPULATIONS
Special Cases:
When a government carries out a CENSUS it involves gathering information from everyone in the
population. This process is very expensive and takes lots of time.

Because of the previous statement many governments may decide to gather the required information
from a sample of the population. To do this, and to make any inference real it is critical that the results
be as typical of the whole population as possible. To ensure this, it is important to randomly select the
sample and to make the sample as large as is practical.

Class Discussion (To be answered and submitted using classroom)

1. Discuss why would:
a. Apparel manufacturers like to know the body measurements of people in different age groups.
b. CCB’s restaurant be interested in the types and quantities of food consumed.
c. Meteorologists be interested in temperature, rainfall and atmospheric pressure measurements
throughout the country and throughout the world.
2. For each of the three situations given in question 1, discuss how information could be collected.
SAMPLES AND POPULATIONS
Example:
SAMPLES AND POPULATIONS
Exercises:
SAMPLES AND POPULATIONS
Exercises:
SAMPLES AND POPULATIONS
Exercises:

5 Scientist in the jungle want to find the best estimate for the lion population. They
tagged and released 20 lions as part of a research project. Later, they found 160
lions, 8 of which where tagged. Find the nearest whole number that best estimates
the lion population?

6 Juanita works in an Ornithology Department. Students asked her to find out the
best estimate of the local bird population. So she tied a belt around the legs of 40
birds. A few days later, he observed 520 birds, 34 of which had belts. To the
nearest whole number, what is the best estimate for the bird population?
CATEGORICAL DATA
When we talk about categorical data we refer to data which can be placed in categories.

An example could be if we stand at a street intersection and record the color of the different cars driving
past the intersection. In this case we could use the following code for the colors; R for Red, B for blue, G
for green, W for white and O for all other colors.

We could then obtain the following results after observing a 50 car sample:
BGWWR OGWRW OOBBG OGRWR WWWGB
BBGGW WWWOG WOBWW RWWRB OOBWR

Once we have our categorical data, we first organize it in groups. To do this we can either use a:
a. a dot plot or
b. a tally and frequency table.
At this point we can identify key features of the data. For example, the mode. The mode is the most
frequently occurring category.
A dot plot is a graph used to display data, each dot represents one data value. They can be horizontal
or vertical.
CATEGORICAL DATA

Example:
CATEGORICAL DATA
(DOT PLOT)

Exercises:
CATEGORICAL DATA
(DOT PLOT)

Exercises:
CATEGORICAL DATA
(TALLY & FREQUENCY TABLES)

If the problem we are studying has lots of data, it might be easier to use a tally and frequency table. This
tool will help us in the data collection process.

The tally part is used to keep a count of data in each category. The frequency simply summarizes the
tally, meaning it lets us know the total number of each category.

This type of table is sometimes called a frequency distribution table or simply a frequency table.
Example:
CATEGORICAL DATA
(TALLY & FREQUENCY TABLES)

Example:
CATEGORICAL DATA
(TALLY & FREQUENCY TABLES)
Exercises:
CATEGORICAL DATA
(TALLY & FREQUENCY TABLES)

Exercises:
GRAPHS OF CATEGORICAL DATA
Bar Graphs
Bar Graphs consist of rectangular shaped columns of equal width. The height of each column represents
the number of observations (frequency) of the different categories.
Example:
GRAPHS OF CATEGORICAL DATA
Bar Graphs
Exercises:
GRAPHS OF CATEGORICAL DATA
Bar Graphs
Exercises:
GRAPHS OF CATEGORICAL DATA
Pie Chart
Pie Charts are a useful of showing how a quantity is divided up. A full pie/circle represents the whole
quantity. We can then divide the pie into wedges or slices to show the frequency of each category.

The table opposite shows the results when 8th grade students were asked
“What is your favorite fruit?”
!
There are 60 kids in the sample, so each person is entitled to "# 𝑡ℎ of the
!
pie chart. "# 𝑡ℎ of 360ª is 6ª, so we can determine the angles of the
different wedges in the pie chart.

13 x 6ª = 78ª for orange

21 x 6ª = 126ª for apple
10 x 6ª = 60ª for banana
7x 6ª = 42ª for pineapple
9x 6ª = 54ª for pear
GRAPHS OF CATEGORICAL DATA
Example:
GRAPHS OF CATEGORICAL DATA
Pie Chart
Exercises:
GRAPHS OF CATEGORICAL DATA
Pie Chart
Exercises:
NUMERICAL DATA
When we talk about NUMERICAL DATA, we refer to data which is in number form.

Numerical data can be arranged using either a stem-and-leaf plot or a tally and frequency table. As in
the case of categorical data, numerical data can also be presented by a bar/column graph.

STEM-AND-LEAF PLOTS

A stem-and-leaf plot can be used to show a set of data in order.

Consider the weights (kg) of firefighter recruits:

101, 91, 83, 84, 72, 93, 67, 85, 79, 87, 78, 89, 68, 80, 107, 70, 85, 64, 95, 76, 87, 74, 68, 59, 82, 77

For each data value, the units digit will be the leaf, and the digits before it determines the stem on which
the leaf is placed.

For this example the stem labels are 5, 6, 7, 8, 9, and 10. These will be written under one another in
Ascending order.
NUMERICAL DATA
Once the stems have been recorded we start to look at each dada value. The first value is 101, here 10
is the stem and 1 is the leaf. So we record a 1to the right of the stem label 10. The next value we see is
91. Here its stem label is 9 and its leaf would be 1. Again we record a 1 to the right of the stem label 9.
We proceed to record all the data in an un ordered stem-and-leaf plot.
NUMERICAL DATA
Example:
NUMERICAL DATA
Exercises:
NUMERICAL DATA
Exercises:
WORKING WITH NUMERICAL DATA
Example:
WORKING WITH NUMERICAL DATA
Exercises:
WORKING WITH NUMERICAL DATA
Exercises:
MEASURES OF CENTRAL TENDENCY
The mean or average of a set of numbers is an important measure of their middle (central tendency). We
Talk about averages all the time. For example:
MEAN OR AVERAGE

• The average speed of a car

• Average height or weight
• The average score of an exam
• The average income for a country.
The mean or average is the total sum of all numbers in the data set divided by the number of observations.

Example:
MEASURES OF CENTRAL TENDENCY
Exercises:
MEAN OR AVERAGE
MEASURES OF CENTRAL TENDENCY
Exercises:
MEAN OR AVERAGE
MEASURES OF CENTRAL TENDENCY
The Median of a data set is dependent on whether the number of observations in the data set is odd or
even. To determine the median, first reorder the data set from the smallest to the largest then if the
MEDIAN & MODE

number of observations is odd, then the median is the observation in the middle of the data set. If the
number of observations is even, then the median is the average of the two middle observations.
MEASURES OF CENTRAL TENDENCY
The Mode for a data set is the observation that occurs the most often. It is not uncommon for a data set
to have more than one mode. This happens when two or more observation occur with equal frequency in
the data set. A data set with two modes is called bimodal. A data set with three modes is called
MEDIAN & MODE

trimodal.
MEASURE OF VARIABILITY
The Range for a data set is the difference between the largest value and smallest value contained in the
data set. First reorder the data set from smallest to largest then subtract the first observation from the
last observation.
RANGE

Motor Learning
100% (2)
Motor Learning
39 pages
Painless Statistics
From Everand
Painless Statistics
Barron's Educational Series
No ratings yet
Statistics 22-23 (IDU)
No ratings yet
Statistics 22-23 (IDU)
41 pages
Data Handling 20-21 Part 1 Samples and Populations
No ratings yet
Data Handling 20-21 Part 1 Samples and Populations
11 pages
Intro To Statistics Lecture
No ratings yet
Intro To Statistics Lecture
41 pages
Stats For PGDM
No ratings yet
Stats For PGDM
52 pages
4th Grade 7 Reviewer
No ratings yet
4th Grade 7 Reviewer
2 pages
Part1 141104090445 Conversion Gate01
No ratings yet
Part1 141104090445 Conversion Gate01
27 pages
Chapter 4
No ratings yet
Chapter 4
23 pages
Data Management Notes
No ratings yet
Data Management Notes
2 pages
Data Management ( 1)
No ratings yet
Data Management ( 1)
46 pages
SLIDES Statistics-Chapter 2
No ratings yet
SLIDES Statistics-Chapter 2
31 pages
Statistics- slide 2
No ratings yet
Statistics- slide 2
15 pages
Introduction To Stati Stics: There Are Three Kinds of Lies: Lies, Damned Lies, A ND Statistics." (B.Disraeli)
No ratings yet
Introduction To Stati Stics: There Are Three Kinds of Lies: Lies, Damned Lies, A ND Statistics." (B.Disraeli)
39 pages
EDA - First Quiz Reviewer
No ratings yet
EDA - First Quiz Reviewer
5 pages
EDA - Midterms - Reviewer
No ratings yet
EDA - Midterms - Reviewer
7 pages
Statistics
No ratings yet
Statistics
49 pages
Introduction To Statistics: "There Are Three Kinds of Lies: Lies, Damned Lies, and Statistics." (B.Disraeli)
No ratings yet
Introduction To Statistics: "There Are Three Kinds of Lies: Lies, Damned Lies, and Statistics." (B.Disraeli)
32 pages
5.1 Visual Displays of Data
No ratings yet
5.1 Visual Displays of Data
8 pages
Quantitative and Qualitative
No ratings yet
Quantitative and Qualitative
41 pages
Notes
No ratings yet
Notes
71 pages
Meeting 1 To 2 Statistics
No ratings yet
Meeting 1 To 2 Statistics
85 pages
Lesson 5 Statistics
No ratings yet
Lesson 5 Statistics
119 pages
Defining Statistics and Basic Terms
No ratings yet
Defining Statistics and Basic Terms
31 pages
Data-managementmmw (1)
No ratings yet
Data-managementmmw (1)
26 pages
Statistics 2ND Sem Reviewer
No ratings yet
Statistics 2ND Sem Reviewer
5 pages
Written Report Gathering and Organizing Data
No ratings yet
Written Report Gathering and Organizing Data
13 pages
Lecture 01 Introduction to Statistics Ppt 06022025 095924am
No ratings yet
Lecture 01 Introduction to Statistics Ppt 06022025 095924am
40 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
58 pages
Data Types: and Its Representation Session - 2 & 3
No ratings yet
Data Types: and Its Representation Session - 2 & 3
33 pages
Intro of Statistics - Ogive
No ratings yet
Intro of Statistics - Ogive
35 pages
Statistics for Business and Economics
No ratings yet
Statistics for Business and Economics
6 pages
STATISTICS
No ratings yet
STATISTICS
4 pages
3rd-qtr-stats-reviewer
No ratings yet
3rd-qtr-stats-reviewer
24 pages
Lesson 1: Engineering Data Analysis First Semester - A.Y. 2021 - 2022
100% (1)
Lesson 1: Engineering Data Analysis First Semester - A.Y. 2021 - 2022
4 pages
Intro To Statistics
No ratings yet
Intro To Statistics
35 pages
1st Mid
No ratings yet
1st Mid
19 pages
MATH-Lesson-1-2
No ratings yet
MATH-Lesson-1-2
64 pages
Module 0. Review on Statistics
No ratings yet
Module 0. Review on Statistics
76 pages
Biostat Aguila Mission Solis (1)
No ratings yet
Biostat Aguila Mission Solis (1)
44 pages
Engineering Data Analysis
No ratings yet
Engineering Data Analysis
4 pages
AA SL - Unit 1a - Representing Data (Statistics)
No ratings yet
AA SL - Unit 1a - Representing Data (Statistics)
74 pages
Math 5
No ratings yet
Math 5
3 pages
Topic 5 Data Management (Statistics)
No ratings yet
Topic 5 Data Management (Statistics)
116 pages
MATH 361 (Autosaved)
No ratings yet
MATH 361 (Autosaved)
17 pages
1_Intro_Statistics_Data
No ratings yet
1_Intro_Statistics_Data
7 pages
Data Handling Notes and Exercises
No ratings yet
Data Handling Notes and Exercises
16 pages
Data Management: Bryan S. Ambre
100% (2)
Data Management: Bryan S. Ambre
104 pages
Lesson 5 - Quantitative Analysis and Interpretation of Data
No ratings yet
Lesson 5 - Quantitative Analysis and Interpretation of Data
78 pages
1 Stats Intro 14022024 105127am
No ratings yet
1 Stats Intro 14022024 105127am
26 pages
Week 1
No ratings yet
Week 1
6 pages
Revision SB Chap 2 7
No ratings yet
Revision SB Chap 2 7
55 pages
Notebook PDF v2
No ratings yet
Notebook PDF v2
182 pages
Unit1 - 2charts and Graphs
No ratings yet
Unit1 - 2charts and Graphs
26 pages
Mathematics in The Modern World
No ratings yet
Mathematics in The Modern World
50 pages
Data Management
No ratings yet
Data Management
44 pages
DATA MANAGEMENT (MMW)
No ratings yet
DATA MANAGEMENT (MMW)
6 pages
Statistics and Probability: Bill Thaddeus Padasas
No ratings yet
Statistics and Probability: Bill Thaddeus Padasas
102 pages
INTRODUCTION
No ratings yet
INTRODUCTION
16 pages
Statistics I Essentials
From Everand
Statistics I Essentials
Emil G. Milewski
No ratings yet
Business Statistics I Essentials
From Everand
Business Statistics I Essentials
Louise Clark
5/5 (5)
Lesson 5 Fact Opinion
No ratings yet
Lesson 5 Fact Opinion
2 pages
Inventory of Sample Action Research Conducted by Teachers
100% (1)
Inventory of Sample Action Research Conducted by Teachers
3 pages
Investment in Intangible Assets and Economic Complexity 2025 Research Policy
No ratings yet
Investment in Intangible Assets and Economic Complexity 2025 Research Policy
14 pages
K-7_Sanjay Talukdar, M&E_Final
No ratings yet
K-7_Sanjay Talukdar, M&E_Final
30 pages
Thesis About Solid Waste Management in The Philippines
75% (4)
Thesis About Solid Waste Management in The Philippines
5 pages
International Bus System Benchmarking: Performance Measurement Development, Challenges, and Lessons Learned
No ratings yet
International Bus System Benchmarking: Performance Measurement Development, Challenges, and Lessons Learned
15 pages
Analysis of Churn Behavior of Consumers in Indian Telecom Sector
No ratings yet
Analysis of Churn Behavior of Consumers in Indian Telecom Sector
12 pages
Vietnam STI Report - World Bank
No ratings yet
Vietnam STI Report - World Bank
129 pages
Jurnal 1
No ratings yet
Jurnal 1
20 pages
Hire Glocal - Best Rated HR - Recruitment Consultants - Top Job Placement Agency in Jalgaon - Executive Search Service
100% (4)
Hire Glocal - Best Rated HR - Recruitment Consultants - Top Job Placement Agency in Jalgaon - Executive Search Service
16 pages
Nurs 05: Community Health Nursing 1 Community Health Nursing of Individual and Family As Client
No ratings yet
Nurs 05: Community Health Nursing 1 Community Health Nursing of Individual and Family As Client
23 pages
Rlee-Piggott - Course Design Matrix Rev
No ratings yet
Rlee-Piggott - Course Design Matrix Rev
5 pages
Assying Gold and Jewellery
No ratings yet
Assying Gold and Jewellery
11 pages
Ensayos Sobre La Censura Musical
100% (1)
Ensayos Sobre La Censura Musical
4 pages
Effect of Mobile Banking On The Saving Practices of Low-Income Users in Kathmandu Valley
No ratings yet
Effect of Mobile Banking On The Saving Practices of Low-Income Users in Kathmandu Valley
70 pages
Excerpt From "Ghost Fleet" by PW Singer and August Cole.
No ratings yet
Excerpt From "Ghost Fleet" by PW Singer and August Cole.
2 pages
Listening and Critical Thinking
No ratings yet
Listening and Critical Thinking
24 pages
Unit Test 4A
No ratings yet
Unit Test 4A
5 pages
Andreas, 2011
No ratings yet
Andreas, 2011
21 pages
Chapter 2 Introducing Product Management and Managing Product Managers
No ratings yet
Chapter 2 Introducing Product Management and Managing Product Managers
9 pages
Music Room Acoustics
No ratings yet
Music Room Acoustics
188 pages
AERS
No ratings yet
AERS
16 pages
Experiment Design - A Property of A Spring - Oscillation
No ratings yet
Experiment Design - A Property of A Spring - Oscillation
3 pages
Penelitian Epidemiologi: (Observational Dan Analitik) : Dr. Lukman Waris Univ Alma Ata Yogyakarta Rabu, 30 Oktober 2019
No ratings yet
Penelitian Epidemiologi: (Observational Dan Analitik) : Dr. Lukman Waris Univ Alma Ata Yogyakarta Rabu, 30 Oktober 2019
59 pages
GroupDiscussion PreRequisite
No ratings yet
GroupDiscussion PreRequisite
5 pages
Brain Drain Brain Gain PDF
No ratings yet
Brain Drain Brain Gain PDF
24 pages
Get Responsible Innovation: Business Opportunities and Strategies For Implementation Katharina Jarmai PDF Ebook With Full Chapters Now
100% (1)
Get Responsible Innovation: Business Opportunities and Strategies For Implementation Katharina Jarmai PDF Ebook With Full Chapters Now
52 pages
Strategic Audit and Its Importance
No ratings yet
Strategic Audit and Its Importance
11 pages
Data Mining Primer
No ratings yet
Data Mining Primer
15 pages

Data Handling

Uploaded by

Data Handling

Uploaded by

DATA COLLECTION &

Which player’s performance was better?

Things to think about:

Class Discussion (To be answered and submitted using classroom)

13 x 6ª = 78ª for orange

A stem-and-leaf plot can be used to show a set of data in order.

Consider the weights (kg) of firefighter recruits:

• The average speed of a car

You might also like