
STPDF1 - Recalling Basic Concepts

The document provides an overview of engineering data analysis and basic statistical concepts. It discusses objectives like recalling statistical terminology, types of data, and levels of measurement. It also covers topics such as descriptive statistics, which involves organizing and summarizing data, and inferential statistics, which involves using samples to draw conclusions about populations. The document defines key terms and provides examples to illustrate statistical concepts.

Uploaded by EunnicePanaligan

Engineering Data Analysis

Recalling Basic Concepts
MPS Department | FEU Institute of Technology

Subtopic 1
OBJECTIVES

• Recall basic statistical concepts and sampling techniques

Subtopic 1
Recalling Basic Concepts
• Basic Statistical Terminologies
• Types of Data
• Levels of Measurement
1. Data are everywhere.
2. Statistical techniques are used to make many decisions that affect our lives.
3. No matter what your career, you will make professional decisions that involve data. An understanding of statistical methods will help you make these decisions effectively.
• Statistics – the science of collecting, organizing, presenting, analyzing, and interpreting data to assist in making more effective decisions.
• Statistical analysis – used to manipulate, summarize, and investigate data so that useful decision-making information results.
The study of statistics has two major branches: descriptive statistics
and inferential statistics.
Statistics
• Descriptive statistics – Involves the organization, summarization, and display of data.
• Inferential statistics – Involves using a sample to draw conclusions about a population.

Population – The entire set of individuals or objects of interest, or the measurements obtained from all individuals or objects of interest
Sample – A portion, or part, of the population of interest
Descriptive statistics consists of the collection, organization, summarization, and presentation of data.

• Collect data
  • e.g., Survey
• Present data
  • e.g., Tables and graphs
• Summarize data
  • e.g., Sample mean = $\frac{\sum X_i}{n}$
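
As a minimal sketch of the "summarize data" step, the sample mean can be computed directly from its definition (the sum of the observations divided by n). The weights below are hypothetical values chosen only for illustration; they do not come from the course material.

```python
from statistics import mean

# Hypothetical sample of five weights in kg (illustrative values only)
weights = [68.0, 72.5, 70.0, 65.5, 74.0]

# Sample mean from the definition: sum of the observations divided by n
n = len(weights)
sample_mean = sum(weights) / n

# The standard library function gives the same result
assert abs(sample_mean - mean(weights)) < 1e-9

print(sample_mean)
```

Writing out the sum and the division mirrors the formula; in practice `statistics.mean` does the same job in one call.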
Inferential statistics consists of generalizing from samples to populations, performing estimations and hypothesis tests, determining relationships among variables, and making predictions.

• Estimation
  • e.g., Estimate the population mean weight using the sample mean weight
• Hypothesis testing
  • e.g., Test the claim that the population mean weight is 70 kg

Inference is the process of drawing conclusions or making decisions about a population based on sample results.
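
The two examples above (estimating a mean weight and testing the claim of 70 kg) might be sketched as follows. This is only an illustration under stated assumptions: the ten sample weights are hypothetical, a one-sample t test is one common choice rather than a method named in the slides, and the critical value 2.262 is the standard two-sided t-table entry for a 5% significance level with 9 degrees of freedom.

```python
from math import sqrt
from statistics import mean, stdev

# Hypothetical sample of 10 weights in kg (illustrative values only)
sample = [71.2, 69.8, 72.5, 68.4, 70.9, 73.1, 69.5, 71.8, 70.2, 72.6]

# Estimation: the sample mean is a point estimate of the population mean
n = len(sample)
x_bar = mean(sample)
s = stdev(sample)  # sample standard deviation (n - 1 in the denominator)

# Hypothesis testing: one-sample t statistic for H0: population mean = 70 kg
mu_0 = 70.0
t_stat = (x_bar - mu_0) / (s / sqrt(n))

# Two-sided critical value from a t table for alpha = 0.05 and df = 9
t_crit = 2.262
reject_h0 = abs(t_stat) > t_crit

print(f"point estimate = {x_bar:.2f} kg, t = {t_stat:.2f}, reject H0: {reject_h0}")
```

The decision rule compares the size of the t statistic with the critical value; a different sample or significance level would need a different table entry.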
• In a recent study, volunteers who had less than 6 hours of sleep were four times more likely to answer incorrectly on a science test than were participants who had at least 8 hours of sleep. Decide which part is the descriptive statistic and what conclusion might be drawn using inferential statistics.

The statement “four times more likely to answer incorrectly” is a descriptive statistic. An inference drawn from the sample is that all individuals who sleep less than 6 hours are more likely to answer science questions incorrectly than individuals who sleep at least 8 hours.
Determine the area of statistics to which each of the following statements belongs:
1. The average points per game, percent of free throws made, average number of rebounds per game,
and average number of fouls per game as well as several other measures for players in the NBA are
computed.
Ans. Descriptive
2. Ten percent of the boxes of cereal sampled by a quality technician are found to be under the labeled
weight. Based on this finding, the filling machine is adjusted to increase the amount of fill.
Ans. Inferential
3. A student determines the average weekly amount spent for food in the past 3 months
Ans. Descriptive
4. Based on a study of 500 single parent households by a social researcher, a magazine reports that
25% of all single parent households are headed by a high school dropout.
Ans. Inferential
5. A researcher claims that a new drug will reduce the number of heart attacks in men over 70 years old.
Ans. Inferential
A population consists of all subjects (human or otherwise) that are being
studied.
A sample is a group of subjects selected from a population.

In a recent survey, 250 college students at Union College were asked if they go to the library regularly. 35 of the students said yes. Identify the population and the sample.
• Population: responses of all students at Union College
• Sample: responses of the 250 students in the survey
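
The figures in this example are enough to compute one descriptive statistic for the sample, sketched below. The variable names are illustrative. The result describes only the 250 surveyed students; extending it to all Union College students would be an inference.

```python
# Figures from the survey example: 250 students sampled, 35 answered "yes"
sample_size = 250
yes_responses = 35

# Descriptive statistic: proportion of the sample that goes to the library regularly
sample_proportion = yes_responses / sample_size

print(f"{sample_proportion:.0%} of the sampled students said yes")
```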
Data consist of information coming from observations, counts, measurements, or responses. Most data can be put into the following categories:
• Qualitative – data are measurements that each fall into one of several categories (hair color, ethnic groups, and other attributes of the population).
• Quantitative – data are observations that are measured on a numerical scale (distance traveled to college, number of children in a family, etc.).
Statistical data are usually obtained by counting or measuring items.
• Primary data are collected specifically for the analysis desired.
• Secondary data have already been compiled and are available for statistical analysis.

• A variable is an item of interest that can take on many different numerical values. Variables whose values are determined by chance are called random variables.
• A constant has a fixed numerical value.


Data sets can consist of two types of data: qualitative data and quantitative data.

Data
• Qualitative Data – Consists of attributes, labels, or nonnumerical entries.
• Quantitative Data – Consists of numerical measurements or counts.
Qualitative data are generally described by words or letters. They are not as widely used as quantitative data because many numerical techniques do not apply to them. For example, it does not make sense to find an average hair color or blood type.
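
A short sketch of why averaging does not apply to qualitative data: category counts and the most frequent category are well defined, but an arithmetic mean of labels is not. The hair-color observations are hypothetical, and the example relies on Python's standard `statistics` module raising `TypeError` when asked for the mean of strings.

```python
from collections import Counter
from statistics import mean, mode

# Hypothetical hair-color observations (illustrative values only)
hair_colors = ["brown", "black", "brown", "blonde", "black", "brown"]

# Counting how often each category occurs is meaningful for qualitative data
counts = Counter(hair_colors)
most_common_color = mode(hair_colors)

# An arithmetic average is not defined for labels, so mean() raises TypeError
try:
    mean(hair_colors)
    mean_is_defined = True
except TypeError:
    mean_is_defined = False

print(counts, most_common_color, mean_is_defined)
```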

Qualitative data can be separated into two subgroups:

• dichotomic, if it takes the form of a word with two options (gender: male or female)
• polynomic, if it takes the form of a word with more than two options (education: primary school, secondary school, and university)
Quantitative data are always numbers and are the result of counting or measuring
attributes of a population.

Quantitative data can be separated into two subgroups:

• discrete, if it is the result of counting (the number of students of a given ethnic group in a class, the number of books on a shelf, ...)
• continuous, if it is the result of measuring (distance traveled, weight of luggage, ...)
Determine whether the data are discrete or continuous.

1. Distance when you throw a baseball
Ans. Continuous
2. Number of pages of a newly published book
Ans. Discrete
3. Amount of possible yields (in grams) from a certain chemical reaction carried out in the laboratory
Ans. Continuous
4. Sum of points in tossing a pair of dice
Ans. Discrete
• Nominal – consists of categories in each of which the number of respective observations is recorded. The categories are in no logical order and have no particular relationship. The categories are said to be mutually exclusive since an individual, object, or measurement can be included in only one of them.
• Ordinal – contains more information than nominal data. Consists of distinct categories in which order is implied. Values in one category are larger or smaller than values in other categories (e.g., rating: excellent, good, fair, poor).
• Interval – a set of numerical measurements in which the distance between numbers is of a known, constant size.
• Ratio – consists of numerical measurements where the distance between numbers is of a known, constant size and, in addition, there is a nonarbitrary zero point.
The level of measurement determines which statistical calculations are meaningful. The
four levels of measurement are: nominal, ordinal, interval, and ratio.

Levels of Measurement (lowest to highest):
1. Nominal
2. Ordinal
3. Interval
4. Ratio
Data at the nominal level of measurement are qualitative only.

Nominal: Categorized using names, labels, or qualities. No mathematical computations can be made at this level.

Examples:
• Colors in the US flag
• Names of students in your class
• Textbooks you are using this semester
Data at the ordinal level of measurement are qualitative or quantitative.

Ordinal: Arranged in order, but differences between data entries are not meaningful.

Examples:
• Class standings: freshman, sophomore, junior, senior
• Numbers on the back of each player’s shirt
• Top 50 songs played on the radio
Data at the interval level of measurement are quantitative. A zero entry simply represents a position on a scale; the entry is not an inherent zero.

Interval: Arranged in order, and the differences between data entries can be calculated.

Examples:
• Temperatures
• Years on a timeline
• Atlanta Braves World Series victories
Data at the ratio level of measurement are similar to the interval level, but a zero entry (absolute zero) is meaningful.

Ratio: A ratio of two data values can be formed so one data value can be expressed as a multiple of another.

Examples:
• Ages
• Grade point averages
• Weights
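
The difference between the interval and ratio levels can be illustrated numerically. In the sketch below (hypothetical values, illustrative variable names), the ratio of two temperatures changes when the unit changes from Celsius to Fahrenheit, because the zero point of each scale is arbitrary. The ratio of two weights stays the same in kilograms and in pounds, because zero weight is a true zero.

```python
# Hypothetical temperatures in degrees Celsius (interval level: zero is arbitrary)
temp_a_c, temp_b_c = 10.0, 20.0

# Differences are meaningful at the interval level
difference_c = temp_b_c - temp_a_c

# The Celsius ratio changes when the same temperatures are written in
# Fahrenheit, so "twice as hot" is not a meaningful statement
ratio_c = temp_b_c / temp_a_c
temp_a_f, temp_b_f = temp_a_c * 9 / 5 + 32, temp_b_c * 9 / 5 + 32
ratio_f = temp_b_f / temp_a_f

# Hypothetical weights in kg (ratio level: zero means "no weight")
weight_a_kg, weight_b_kg = 40.0, 80.0
ratio_kg = weight_b_kg / weight_a_kg
ratio_lb = (weight_b_kg * 2.20462) / (weight_a_kg * 2.20462)

print(ratio_c, ratio_f, ratio_kg, ratio_lb)
```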


[Figure: example images illustrating the nominal, ordinal, interval, and ratio levels of measurement]
Level of measurement | Put data in categories | Arrange data in order | Subtract data values | Determine if one data value is a multiple of another
Nominal              | Yes                    | No                    | No                   | No
Ordinal              | Yes                    | Yes                   | No                   | No
Interval             | Yes                    | Yes                   | Yes                  | No
Ratio                | Yes                    | Yes                   | Yes                  | Yes
Source: Elementary Statistics by Bluman
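
One possible way to encode the table above in a program is a lookup from each level to the operations that are meaningful at that level. The names `MEANINGFUL_OPERATIONS` and `is_meaningful`, and the short operation labels, are hypothetical and introduced only for this sketch.

```python
# Operations that are meaningful at each level, transcribed from the table above
MEANINGFUL_OPERATIONS = {
    "nominal": {"categorize"},
    "ordinal": {"categorize", "order"},
    "interval": {"categorize", "order", "subtract"},
    "ratio": {"categorize", "order", "subtract", "multiple"},
}


def is_meaningful(level: str, operation: str) -> bool:
    """Return True if the operation is meaningful at the given level."""
    return operation in MEANINGFUL_OPERATIONS[level.lower()]


# Example: subtraction is meaningful for interval data, multiples are not
print(is_meaningful("Interval", "subtract"), is_meaningful("Interval", "multiple"))
```

Each level's set contains the set of the level below it, which matches the "lowest to highest" ordering of the four levels.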
• Elementary Statistics by Bluman
• https://psihomedeor.ro/blog/psihoterapie-consiliere-psihologica/page/8/
• https://docplayer.es/50880990-Preparacion-de-propuestas-en-horizonte-puerto-real-18-de-junio-de-2015.html
• http://eceresearchmethods.tripod.com/sitebuildercontent/sitebuilderfiles/methres.pdf
• https://www.researchgate.net/profile/Ahmad_Al_Musawi/publication/323358065_Chapter_Two_Understanding_Data_Arabic/links/5a8fe72745851535bcd41d4b/Chapter-Two-Understanding-Data-Arabic.pdf
• https://quizizz.com/
• https://lh3.googleusercontent.com/PKumoPGYGNf7Sm9JIDGKpRyWYsFTpWJcmPU051kKVqJiJa2NZMgelCgMWvluEqAvf4q80eE=s85
• https://www.shutterstock.com/search/fahrenheit+thermometer
• https://www.gs1-us.info/product-number/
• https://quizlet.com/283507662/statistics-polit-and-beck-2018-chapter-14-flash-cards/
• https://www.mymarketresearchmethods.com/types-of-data-nominal-ordinal-interval-ratio/
