02 Data and Preliminary Data Analysis - Print

The document provides information on preliminary data analysis techniques, including: 1) Definitions of key terms like raw data, frequency distribution, class intervals, and measures of central tendency. 2) Procedures for forming frequency distributions from raw data by determining class intervals and frequencies. 3) Descriptions of common measures of central tendency - the arithmetic mean, median, and mode - and how to calculate them from grouped or ungrouped data.

Uploaded by

Cahyo Agung Saputra


Data

DR. FLORENTINA PUNGKY PRAMESTI, ST., MT.

Preliminary Data Analysis

 Raw data: collected data that have not been organized numerically, e.g., the set of heights of 100 male students obtained from an alphabetical listing of university records
 Array: an arrangement of raw numerical data in ascending or
descending order of magnitude
 Frequency distribution (frequency table): a tabular
arrangement of data by classes together with the corresponding
class frequencies
 Class Intervals And Class Limits
 Class Boundaries. These can also serve as the class symbol. They should
not coincide with actual observations ➔ to avoid ambiguity
 The size, or width, of a class interval is the difference between the lower and
upper class boundaries
 The class mark is the midpoint of the class interval

Forming frequency distributions

 Determine the largest and smallest numbers in the raw data and thus find the
range.
 Divide the range into a convenient number of class intervals having the same
size. If this is not feasible, use class intervals of different sizes or open class
intervals. The number of class intervals is usually between 5 and 20, depending
on the data. Class intervals are also chosen so that the class marks (or midpoints)
coincide with the actually observed data. This tends to lessen the so-called
grouping error involved in further mathematical analysis. However, the class
boundaries should not coincide with the actually observed data.
 Determine the number of observations falling into each class interval; that is, find
the class frequencies. This is best done by using a tally, or score sheet.
 Table 1 shows a frequency distribution of the weekly wages of 65 employees at the P&R Company.
 Five new employees were hired at weekly wages of $285.34, $316.83, $335.78, $356.21, and $374.50. Construct a frequency distribution of wages for the 70 employees.
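The three steps for forming a frequency distribution can be sketched in Python as follows. This is only an illustration: the function name `frequency_distribution`, its parameters, and the height values are hypothetical and are not the P&R Company data. The class boundaries are chosen at half-units so that no observation falls on a boundary, and the class marks coincide with whole-number observations.

```python
from math import floor

def frequency_distribution(data, first_boundary, width, n_classes):
    """Count observations in equal-width classes defined by class boundaries.

    Returns rows of (lower boundary, upper boundary, class mark, frequency).
    """
    counts = [0] * n_classes
    for x in data:
        k = floor((x - first_boundary) / width)  # index of the class containing x
        if not 0 <= k < n_classes:
            raise ValueError(f"{x} lies outside the class intervals")
        counts[k] += 1  # the "tally" step
    table = []
    for k, f in enumerate(counts):
        lower = first_boundary + k * width
        table.append((lower, lower + width, lower + width / 2, f))
    return table

# Hypothetical heights (inches), whole numbers only
heights = [61, 64, 67, 70, 73, 62, 65, 68, 66, 69, 67, 68, 71, 66, 67]

# Step 1: range = largest - smallest
data_range = max(heights) - min(heights)

# Steps 2 and 3: five classes of width 3 starting at boundary 59.5
table = frequency_distribution(heights, first_boundary=59.5, width=3, n_classes=5)
for lower, upper, mark, f in table:
    print(f"{lower:5.1f} - {upper:5.1f}   mark {mark:5.1f}   f = {f}")
```

Starting the first class at 59.5 rather than 60 is the design choice that keeps every whole-number observation strictly inside one class.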
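Graphical representation of frequency distributions

 Histogram (or frequency histogram): a set of rectangles whose bases are the class intervals and whose areas are proportional to the class frequencies

 Frequency polygon: a line graph of the class frequencies plotted against the class marks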

Cumulative-frequency distributions and ogives

 Ogive: cumulative-frequency polygon

 Can you make it?
Case

 The final grades in mathematics of 80 students at State University are recorded in the accompanying table.
Measuring the central tendency

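 Arithmetic mean: $\bar{X} = \dfrac{\sum_{j=1}^{N} X_j}{N}$

 Weighted arithmetic mean: $\bar{X} = \dfrac{\sum w_j X_j}{\sum w_j}$

 Arithmetic mean from grouped data (coding method): $\bar{X} = A + \left(\dfrac{\sum f_j u_j}{N}\right) c$

$c$ : size of the class intervals (all of equal size)
$A$ : any guessed or assumed arithmetic mean (which may be any number)
$d_j = X_j - A$ : deviations of the class marks from $A$, expressed as $c\,u_j$, where $u_j$ can be positive or negative integers or zero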
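A short Python sketch of the grouped-data mean with an assumed mean A may help. It assumes the usual coding formula, mean = A + c(Σ f·u)/N, and uses hypothetical class marks and frequencies; the name `coded_mean` is illustrative only.

```python
def coded_mean(class_marks, freqs, assumed_mean, width):
    """Grouped-data mean via deviations d_j = X_j - A expressed as c * u_j."""
    n = sum(freqs)
    u = [(x - assumed_mean) / width for x in class_marks]  # coded deviations
    return assumed_mean + width * sum(f * uj for f, uj in zip(freqs, u)) / n

# Hypothetical grouped data: class marks and class frequencies
marks = [61, 64, 67, 70, 73]
freqs = [5, 18, 42, 27, 8]

mean_coded = coded_mean(marks, freqs, assumed_mean=67, width=3)

# Cross-check against the direct formula sum(f * X) / N
mean_direct = sum(f * x for f, x in zip(freqs, marks)) / sum(freqs)
print(round(mean_coded, 2), round(mean_direct, 2))
```

Any value of A gives the same result; choosing A equal to a class mark near the center simply keeps the u values small.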
Median: either the middle value or the arithmetic mean of the two middle values of a set of numbers arranged in order of magnitude.
The set of numbers 3, 4, 4, 5, 6, 8, 8, 8, and 10 has median 6.

The set of numbers 5, 5, 7, 9, 11, 12, 15, and 18 has median ½(9 + 11) = 10.
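The two median examples can be checked with a minimal Python sketch. The function name `median` is chosen here for illustration (the standard library's `statistics.median` does the same job).

```python
def median(values):
    """Middle value of the sorted data, or the mean of the two middle values."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]                       # odd count: single middle value
    return (ordered[mid - 1] + ordered[mid]) / 2  # even count: average the two

odd_set = [3, 4, 4, 5, 6, 8, 8, 8, 10]
even_set = [5, 5, 7, 9, 11, 12, 15, 18]
print(median(odd_set))   # 6
print(median(even_set))  # 10.0
```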

For grouped data

$\text{Median} = L_1 + \left(\dfrac{N/2 - (\sum f)_1}{f_{\text{median}}}\right) c$

$L_1$ : lower class boundary of the median class (i.e., the class containing the median)
$N$ : number of items in the data (i.e., total frequency)
$(\sum f)_1$ : sum of frequencies of all classes lower than the median class
$f_{\text{median}}$ : frequency of the median class
$c$ : size of the median class interval
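The symbols just defined fit the usual interpolation formula for the median of grouped data, L1 + ((N/2 − (Σf)1) / fmedian)·c, which is assumed here to be the formula intended. The sketch below applies it to hypothetical class boundaries and frequencies; `grouped_median` is an illustrative name.

```python
def grouped_median(boundaries, freqs):
    """Median by interpolation within the median class.

    boundaries: the lower boundary of every class plus the last upper boundary,
    so len(boundaries) == len(freqs) + 1.
    """
    n = sum(freqs)
    if n == 0:
        raise ValueError("no observations")
    below = 0  # (sum f)_1: total frequency of classes below the current one
    for k, f in enumerate(freqs):
        if below + f >= n / 2:  # first class whose cumulative count reaches N/2
            lower = boundaries[k]
            width = boundaries[k + 1] - boundaries[k]
            return lower + (n / 2 - below) / f * width
        below += f

# Hypothetical grouped data
boundaries = [59.5, 62.5, 65.5, 68.5, 71.5, 74.5]
freqs = [5, 18, 42, 27, 8]
print(round(grouped_median(boundaries, freqs), 2))
```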
Mode: the value which occurs with the greatest frequency
The set 2, 2, 5, 7, 9, 9, 9, 10, 10, 11, 12, and 18 has mode 9.

The set 3, 5, 8, 10, 12, 15, and 16 has no mode

The set 2, 3, 4, 4, 4, 5, 5, 7, 7, 7, and 9 has two modes, 4 and 7, and is called bimodal

A distribution having only one mode is called unimodal
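The three example sets can be checked with a small Python sketch. The function name `modes` is illustrative, and the rule for "no mode" (all distinct values occur equally often) is one common convention, assumed here.

```python
from collections import Counter

def modes(values):
    """Return the sorted list of modes; an empty list means no mode."""
    counts = Counter(values)
    # Assumed convention: several distinct values, all equally frequent -> no mode
    if len(counts) > 1 and len(set(counts.values())) == 1:
        return []
    top = max(counts.values())
    return sorted(v for v, c in counts.items() if c == top)

print(modes([2, 2, 5, 7, 9, 9, 9, 10, 10, 11, 12, 18]))  # [9]
print(modes([3, 5, 8, 10, 12, 15, 16]))                   # []
print(modes([2, 3, 4, 4, 4, 5, 5, 7, 7, 7, 9]))           # [4, 7]
```

A result of length 1 corresponds to a unimodal set and a result of length 2 to a bimodal set.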

For grouped data

$\text{Mode} = L_1 + \left(\dfrac{\Delta_1}{\Delta_1 + \Delta_2}\right) c$

where $L_1$ : lower class boundary of the modal class (i.e., the class containing the mode)
$\Delta_1$ : excess of modal frequency over frequency of next-lower class
$\Delta_2$ : excess of modal frequency over frequency of next-higher class
$c$ : size of the modal class interval
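These symbols fit the usual grouped-data mode formula, L1 + (Δ1 / (Δ1 + Δ2))·c, which is assumed here. The sketch uses hypothetical data, assumes a single modal class, and treats a missing neighbouring class as having frequency 0; `grouped_mode` is an illustrative name.

```python
def grouped_mode(boundaries, freqs):
    """Mode of grouped data from the modal class and its two neighbours."""
    k = max(range(len(freqs)), key=freqs.__getitem__)  # index of the modal class
    lower_f = freqs[k - 1] if k > 0 else 0               # assumption: 0 if absent
    upper_f = freqs[k + 1] if k + 1 < len(freqs) else 0  # assumption: 0 if absent
    d1 = freqs[k] - lower_f  # excess over the next-lower class
    d2 = freqs[k] - upper_f  # excess over the next-higher class
    width = boundaries[k + 1] - boundaries[k]
    return boundaries[k] + d1 / (d1 + d2) * width

# Hypothetical grouped data
boundaries = [59.5, 62.5, 65.5, 68.5, 71.5, 74.5]
freqs = [5, 18, 42, 27, 8]
print(round(grouped_mode(boundaries, freqs), 2))
```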
EMPIRICAL RELATION BETWEEN THE MEAN, MEDIAN, AND MODE

For unimodal frequency curves that are moderately skewed (asymmetrical), we have the empirical relation

Mean − mode = 3(mean − median)

THE GEOMETRIC MEAN G

$G = \sqrt[N]{X_1 X_2 \cdots X_N}$

THE HARMONIC MEAN H

$H = \dfrac{N}{\sum_{j=1}^{N} 1/X_j}$

The geometric mean of the numbers 2, 4, and 8 is ….
The harmonic mean of the same numbers is ….

RELATION BETWEEN THE ARITHMETIC, GEOMETRIC, AND HARMONIC MEANS

$H \le G \le \bar{X}$
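THE ROOT MEAN SQUARE

$\text{RMS} = \sqrt{\dfrac{\sum_{j=1}^{N} X_j^2}{N}}$

The RMS of the set 1, 3, 4, 5, and 7 is $\sqrt{(1^2 + 3^2 + 4^2 + 5^2 + 7^2)/5} = \sqrt{20} \approx 4.47$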
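The geometric mean, harmonic mean, and root mean square can be computed with a few lines of Python, using their standard definitions for positive numbers. The function names are illustrative. The example reuses the sets 2, 4, 8 and 1, 3, 4, 5, 7 from the slides and checks the ordering H ≤ G ≤ arithmetic mean.

```python
from math import prod, sqrt

def arithmetic_mean(xs):
    return sum(xs) / len(xs)

def geometric_mean(xs):
    # N-th root of the product (positive values assumed)
    return prod(xs) ** (1 / len(xs))

def harmonic_mean(xs):
    # Reciprocal of the arithmetic mean of the reciprocals
    return len(xs) / sum(1 / x for x in xs)

def root_mean_square(xs):
    return sqrt(sum(x * x for x in xs) / len(xs))

numbers = [2, 4, 8]
a = arithmetic_mean(numbers)
g = geometric_mean(numbers)
h = harmonic_mean(numbers)
print(round(h, 3), round(g, 3), round(a, 3))  # H <= G <= arithmetic mean

rms = root_mean_square([1, 3, 4, 5, 7])
print(round(rms, 2))
```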

QUARTILES, DECILES, AND PERCENTILES

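Standard deviation for grouped data

$s = w \sqrt{\dfrac{\sum f_i X_i^2 - \left(\sum f_i X_i\right)^2 / n}{n - 1}}$

$w$ : class width
$f_i$ : frequency
$X_i$ : class midpoint, or deviation from an arbitrary origin in units of the class width (when actual class midpoints are used, take $w = 1$)
$n - 1$ : degrees of freedom (page 39)
➔ When dealing with a sample, use the degrees of freedom $(n - m)$
➔ When dealing with a population, use $n$

Standard deviation for ungrouped data

$\sigma = \sqrt{\dfrac{\sum_{i=1}^{n} (x_i - \mu)^2}{n}}$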
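The two standard deviation formulas can be sketched in Python as below. The reading of the grouped formula is an assumption based on the listed symbols: a square root of (Σ f·X² − (Σ f·X)²/n) divided by n − 1 for a sample or by n for a population, multiplied by the class width w when X is a coded deviation. The data and the names `grouped_std` and `population_std` are hypothetical.

```python
from math import sqrt

def grouped_std(x, freqs, w=1.0, sample=True):
    """Standard deviation of grouped data.

    x: class midpoints (use w=1) or coded deviations in class-width units
       (use w = class width).
    sample: divide by n - 1 if True, by n if False.
    """
    n = sum(freqs)
    s1 = sum(f * xi for f, xi in zip(freqs, x))
    s2 = sum(f * xi * xi for f, xi in zip(freqs, x))
    ss = s2 - s1 * s1 / n  # sum of squared deviations about the mean
    return w * sqrt(ss / (n - 1 if sample else n))

def population_std(xs):
    """Ungrouped data: root of the mean squared deviation from the mean."""
    mu = sum(xs) / len(xs)
    return sqrt(sum((xi - mu) ** 2 for xi in xs) / len(xs))

# Hypothetical grouped data: midpoints, coded deviations about 67, frequencies
marks = [61, 64, 67, 70, 73]
u = [-2, -1, 0, 1, 2]
freqs = [5, 18, 42, 27, 8]

print(round(grouped_std(marks, freqs), 4))     # midpoints, w = 1
print(round(grouped_std(u, freqs, w=3), 4))    # coded deviations, w = 3
print(population_std([1, 3, 4, 5, 7]))
```

Both grouped calls give the same value, which is the point of the coded form: the arithmetic is done on small integers and the factor w restores the original units.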
Notes
Here we will learn about the construction of the ogive (cumulative-frequency curve) and the cumulative-frequency polygon. There are two methods of constructing them, but the drawing technique is the same for both.

1) Less than method

2) More than method

Less than method:

First prepare a less-than type cumulative frequency table.
1) On the x-axis use the upper limits of the classes.
2) Mark the less-than type cumulative frequencies on the y-axis.
3) Plot the points using the upper limits and the corresponding cumulative frequencies.
4) Join the points by a freehand curve to get the ogive; to get the cumulative-frequency polygon, join the points by line segments.

More than method:

First prepare a more-than type cumulative frequency table.
1) On the x-axis use the lower limits of the classes.
2) Mark the more-than type cumulative frequencies on the y-axis.
3) Plot the points using the lower limits and the corresponding cumulative frequencies.
4) Join the points by a freehand curve to get the ogive; to get the cumulative-frequency polygon, join the points by line segments.
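The cumulative frequency tables needed for both methods can be built with a short Python sketch. The class limits and frequencies are hypothetical, and the function names are illustrative; the returned (x, y) pairs are the points to plot and join.

```python
from itertools import accumulate

def less_than_points(upper_limits, freqs):
    """Pairs (upper limit, number of observations below that limit)."""
    return list(zip(upper_limits, accumulate(freqs)))

def more_than_points(lower_limits, freqs):
    """Pairs (lower limit, number of observations at or above that limit)."""
    total = sum(freqs)
    below = accumulate([0] + list(freqs[:-1]))  # observations below each lower limit
    return list(zip(lower_limits, (total - b for b in below)))

# Hypothetical class limits and frequencies
lower_limits = [59.5, 62.5, 65.5, 68.5, 71.5]
upper_limits = [62.5, 65.5, 68.5, 71.5, 74.5]
freqs = [5, 18, 42, 27, 8]

print(less_than_points(upper_limits, freqs))
print(more_than_points(lower_limits, freqs))
```

The less-than points rise from the first class frequency to the total frequency, while the more-than points fall from the total frequency to the last class frequency.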
