
ENGINEERING

DATA ANALYSIS
FIRST SEMESTER
– A.Y. 2021 - 2022
LESSON 1
PART 1
◉ Descriptive and Inferential Statistics
◉ Population and Sample
◉ Types of Measurements and Scales
◉ Data Collection and Presentation
Statistics – is the study of the collection, organization, examination, summarization, manipulation,
interpretation, and presentation of quantitative data. It deals with all aspects of data including the planning of
data collection in terms of the design of surveys and experiments.
TWO MAJOR FUNCTIONS OF STATISTICS
 Descriptive Statistics – are brief descriptive coefficients that summarize a given data set, which can be
a representation of either the entire population or a sample drawn from it. Descriptive statistics are broken down into
measures of central tendency and measures of variability. In short, descriptive statistics help describe
and understand the features of a specific data set by giving short summaries about the sample and
measures of the data.
 Inductive/Inferential Statistics – are techniques that allow us to use samples to make generalizations
about the populations from which the samples were drawn. It is, therefore, important that the sample
accurately represent the population. The process of achieving this is called sampling. Inferential
statistics recognize that sampling naturally incurs sampling error, and thus a sample is not expected to
perfectly represent the population.
Some definitions:
Population – is a set of similar items or events which is of interest for some question or experiment. A
statistical population can be a group of actually existing objects or a hypothetical and potentially infinite group
of objects conceived as a generalization from experience.
Parameter – is any numerical quantity that characterizes a given population or some aspect of it. This means the
parameter tells us something about the whole population.
Data Sample – is a set of data collected and/or selected from a statistical population by a defined procedure.
Statistics - are numbers that summarize data from a sample.
Variable - the characteristic that is being studied. A variable may be qualitative or quantitative.
Typically, the population is very large, making a census or a complete enumeration of all the values in
the population either impractical or impossible. The sample usually represents a subset of manageable size.
Samples are collected and statistics are calculated from the samples so that one can make inferences or
extrapolations from the sample to the population.

POPULATION AND SAMPLE


In the language of statistics, one of the most basic concepts is sampling. In most statistical problems, a specified
number of measurements or data – a sample – is drawn from a much larger body of measurements, called the
population.
A population is the set of all measurements of interest to the investigator.
A sample is a subset of measurements selected from the population of interest.
TYPES OF MEASUREMENTS
 Continuous Data – is information that can be measured on a continuum or scale. Continuous data can
have almost any numeric value and can be meaningfully subdivided into finer and finer increments,
depending upon the precision of the measurement system. Ex. Standard Normal Distribution
 Discrete Data – is information that can be categorized into a classification. Discrete data is based on
counts. Only a finite number of values is possible, and the values cannot be subdivided meaningfully. It
typically consists of things counted in whole numbers. Ex. Binomial Probability Distribution
MEASUREMENT SCALES
 Nominal – used for labeling variables, without any quantitative value. Nominal scales could simply be
called “labels”. A good way to remember all of this is that “nominal” sounds a lot like “name”, and
nominal scales are kind of like “names” or labels.
Note: a subtype of nominal scale with only two categories (e.g. male/female) is called “dichotomous”.
 Ordinal – with ordinal scales, it is the order of the values that is important and significant, but the
differences between them are not really known. Ordinal scales are typically measures of non-numeric
concepts like satisfaction, happiness, discomfort, etc. “Ordinal” is easy to remember because it sounds
like “order”, and that’s the key to remember with ordinal scales – it is the order that matters, but that’s
all you really get from these.
 Interval – are numeric scales in which we know not only the order, but also the exact differences
between the values. The classic example of an interval scale is Celsius temperature because the
difference between each value is the same. “Interval” itself means “space between”, which is the
important thing to remember – interval scales not only tell us about order, but also about the value
between each item.
Here’s the problem with interval scales: they don’t have a “true zero”. For example, there is no such
thing as “no temperature”. Without a true zero, it is impossible to compute ratios. With interval data, we
can add and subtract, but cannot multiply or divide.
 Ratio – they tell us about the order, they tell us the exact value between units, and they also have an
absolute zero – which allows for a wide range of both descriptive and inferential statistics to be applied.
Good examples of ratio variables include height and weight.
COLLECTION OF DATA
Simple Random Sample – is a subset of a statistical population in which each member of the subset has an
equal probability of being chosen. An example of a simple random sample would be the names of 25 employees
being chosen out of a hat from a company of 250 employees. In this case, the population is all 250 employees,
and the sample is random because each employee has an equal chance of being chosen.
Stratified Sampling – is a method of sampling that involves the division of a population into smaller groups
known as strata. In stratified random sampling, or stratification, the strata are formed based on members’ shared
attributes or characteristics.
- Stratified random sampling is also called proportional random sampling or quota random sampling.
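Both sampling schemes can be illustrated with a short Python sketch. The employee roster, department strata, and sample sizes below are hypothetical and serve only to show the mechanics of equal-probability selection and proportional allocation.

```python
import random

random.seed(1)  # fixed seed so the illustration is reproducible

# Simple random sample: 25 employees drawn from a roster of 250,
# each with an equal chance of being chosen (hypothetical roster).
employees = [f"Employee-{i}" for i in range(1, 251)]
simple_random_sample = random.sample(employees, k=25)

# Stratified sample: divide the population into strata (here, by
# department) and draw from each stratum in proportion to its size.
strata = {
    "Engineering": [f"Eng-{i}" for i in range(1, 151)],  # 150 members
    "Finance":     [f"Fin-{i}" for i in range(1, 61)],   # 60 members
    "HR":          [f"HR-{i}" for i in range(1, 41)],    # 40 members
}
total = sum(len(members) for members in strata.values())
sample_size = 25

stratified_sample = []
for name, members in strata.items():
    n_stratum = round(sample_size * len(members) / total)  # proportional allocation
    stratified_sample.extend(random.sample(members, k=n_stratum))

print(len(simple_random_sample), len(stratified_sample))  # 25 and 25
```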
TABULAR AND GRAPHICAL METHODS IN DESCRIPTIVE STATISTICS
Frequency Distribution – is a list or graph that displays frequency of various outcomes in a sample. Each entry
in the table contains the frequency or count of the occurrences of values within a particular group or interval,
and in this way, the table summarizes the distribution of values in the sample.
Raw Data – are collected data that have not been organized numerically.
Array – an arrangement of raw data in ascending or descending order of magnitude.
Frequency – the number of times a value appears in the listing
Relative Frequency – actual frequency of the observation divided by the total frequency
Range – is the difference between the largest and smallest values
Class Intervals – range of values in a class consisting of a lower limit and an upper limit
Ungrouped Data – when the data is small (n ≤ 30) or when there are few distinct values, the data may be
organized without grouping.
Relative Frequency = f / Σf
Grouped Data – statistical data generated in large masses (n > 30) can be assessed by grouping the data into
different classes.
Frequency Distribution from Raw Data
1. Find the range (R)
2. Decide on a suitable number of classes.
m = 1 + 3.3 log n, where m = number of classes and n = number of observations
3. Determine the class size.
c = R / m
4. Find the number of observations in each class. This is the class frequency.
Class marks – the midpoint of the class interval. Ex. 31.5 is the class mark of 28-35.
Class boundaries – a point that represents halfway or dividing point between successive classes.
Ex. 35.5 is the upper boundary of 28-35. Its lower boundary is 27.5.
Where:
Class Boundary = (Upper limit of the first class + Lower limit of the second class) / 2 = (89 + 90) / 2 = 89.5
Class Mark = (Lower limit + Upper limit) / 2 = (80 + 89) / 2 = 84.5
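A minimal Python sketch of steps 1-4 above, together with the relative-frequency, class-mark, and class-boundary formulas. The 40 exam scores are made-up values used only to illustrate the computation.

```python
import math

# Hypothetical raw data: n = 40 exam scores (grouped data, since n > 30).
scores = [52, 88, 67, 73, 91, 58, 64, 77, 85, 69,
          72, 80, 95, 61, 70, 83, 55, 66, 74, 90,
          59, 68, 79, 86, 63, 71, 76, 82, 57, 65,
          75, 89, 60, 78, 84, 92, 62, 81, 87, 94]

n = len(scores)
R = max(scores) - min(scores)        # 1. range (95 - 52 = 43)
m = round(1 + 3.3 * math.log10(n))   # 2. number of classes (here, 6)
c = math.ceil(R / m)                 # 3. class size, rounded up (here, 8)

# 4. tally the class frequencies, then compute class marks, boundaries,
#    and relative frequencies for each class interval.
lower = min(scores)
for _ in range(m):
    upper = lower + c - 1
    freq = sum(lower <= x <= upper for x in scores)
    class_mark = (lower + upper) / 2
    boundaries = (lower - 0.5, upper + 0.5)
    rel_freq = freq / n
    print(f"{lower}-{upper}  f={freq}  mark={class_mark}  "
          f"boundaries={boundaries}  rel={rel_freq:.3f}")
    lower = upper + 1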

Cumulative Frequency – total frequency of all values either “less than” or “more than” any class boundary.
Frequency Histogram - a graph that uses vertical columns to show frequencies.
- there should not be any gaps between the bars.
Frequency Polygon - a frequency polygon is very similar to a histogram. In fact, they are almost identical,
except that frequency polygons can be used to compare sets of data or to display a cumulative frequency
distribution. In addition, a histogram uses rectangles while a frequency polygon resembles a line graph.
CUMULATIVE FREQUENCY POLYGON / OGIVE
- An ogive graph plots cumulative frequency on the y-axis and class boundaries along
the x-axis. It’s very similar to a histogram, only instead of rectangles, an ogive has a
single point marking where the top right of the rectangle would be.
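One possible way to draw the histogram, frequency polygon, and ogive with matplotlib, assuming the hypothetical frequency table built in the sketch above (class size 8, frequencies 5, 8, 8, 8, 8, 3); all values are illustrative.

```python
import matplotlib.pyplot as plt

# Hypothetical grouped data: class marks, class boundaries, and frequencies.
class_marks = [55.5, 63.5, 71.5, 79.5, 87.5, 95.5]
boundaries  = [51.5, 59.5, 67.5, 75.5, 83.5, 91.5, 99.5]
frequencies = [5, 8, 8, 8, 8, 3]

fig, (ax1, ax2, ax3) = plt.subplots(1, 3, figsize=(12, 3))

# Frequency histogram: adjacent vertical bars with no gaps between them.
ax1.bar(class_marks, frequencies, width=8, edgecolor="black")
ax1.set_title("Frequency histogram")

# Frequency polygon: a line graph through the class marks.
ax2.plot(class_marks, frequencies, marker="o")
ax2.set_title("Frequency polygon")

# Ogive: "less than" cumulative frequency plotted at the class boundaries.
cumulative = [0]
for f in frequencies:
    cumulative.append(cumulative[-1] + f)
ax3.plot(boundaries, cumulative, marker="o")
ax3.set_title("Ogive (cumulative frequency)")

plt.tight_layout()
plt.show()
```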

STEMPLOT
- typically used when there is a moderate amount of quantitative data to analyze; stem plots of
more than 50 observations are unusual. The name "stem plot" comes from the fact that there is one "stem",
holding the largest place-value digits, to the left and the "leaves" to the right.

1. Select one or more leading digits for the stem values. The remaining digits become the leaves.
2. List all the possible stem values in a vertical column.
3. Record the leaf for every observation beside the corresponding stem value. Indicate the unit for stems
and leaves in the display.
4. A display having between 5 and 20 stems is recommended.
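A small Python sketch of this procedure, using the tens digit as the stem and the ones digit as the leaf; the data values are hypothetical.

```python
from collections import defaultdict

# Hypothetical data: a moderate number of two-digit observations.
data = [52, 88, 67, 73, 91, 58, 64, 77, 85, 69, 72, 80, 95, 61, 70, 83]

# 1. The leading digit (tens) is the stem; the remaining digit (ones) is the leaf.
stems = defaultdict(list)
for x in sorted(data):
    stems[x // 10].append(x % 10)

# 2-3. List every possible stem in a vertical column and record each leaf
#      beside its stem; state the units of stems and leaves in the display.
print("Stem | Leaves   (stem = tens, leaf = ones)")
for stem in range(min(stems), max(stems) + 1):
    leaves = "".join(str(leaf) for leaf in stems.get(stem, []))
    print(f"  {stem}  | {leaves}")
```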

Pie chart – is the familiar circular graph that shows how the measurements are distributed among the categories.
Bar chart – shows the same distribution of measurements in categories, with the height of the bar measuring
how often a particular category was observed.
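A minimal matplotlib sketch of both charts, using made-up category counts.

```python
import matplotlib.pyplot as plt

# Hypothetical categorical data: how often each category was observed.
categories = ["Category A", "Category B", "Category C", "Category D"]
counts = [12, 8, 5, 15]

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(9, 4))

# Pie chart: how the measurements are distributed among the categories.
ax1.pie(counts, labels=categories, autopct="%1.1f%%")
ax1.set_title("Pie chart")

# Bar chart: bar height measures how often each category was observed.
ax2.bar(categories, counts)
ax2.set_title("Bar chart")

plt.tight_layout()
plt.show()
```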
Example (Introduction to Probability and Statistics by Mendenhall and Beaver, 13th edition, 2009, p. 12).
