Prelim
Prelim
G DATA
ANALYSIS
Instructor:
BJERN CORTADO-LOAÑA
BASIC STATISTICS
• Statistics in Engineering
• Collecting Engineering Data
• Data summary and Presentation
• Probability Distributions
- Discrete Probability Distribution
- Continuous Probability Distribution
• Sampling Distributions of the Mean and
Proportion
STATISTICS IN ENGINEERING
Statistics is the area of science that deals with collection,
organization, analysis, and interpretation of data.
It also deals with methods and techniques that can be used to draw
conclusions about the characteristics of a large number of data
points, commonly called a population.
By using smaller subset of the entire data called sample.
Because many aspects of engineering practice involve working with
data, obviously some knowledge of statistics is important to an
engineer.
Specifically, statistical techniques can
be a powerful aid in designing new
products and systems, improving
existing designs, and improving
production process.
Engineers apply physical and chemical
laws and mathematics to design,
develop, test, and supervise various
products and services.
Engineers perform tests to learn how
things behave under stress, and at
what point they might fail.
• As engineers perform experiments, they collect data that can be
used to explain relationships better and to reveal information about
the quality of products and services they provide.
COLLECTING ENGINEERING DATA
Direct observation
the simplest method of observing data
advantage: Relatively inexpensive
drawbacks: difficult to produce useful information since it does not consider all
aspects regarding the issues.
Experiments
more expensive method but better way to produce data
data produced are called experimental
Surveys
most familiar method of data collection
Personal interview
has the advantage of having higher expected response rate
fewer incorrect respondents
DATA PRESENTATION
MODE:
the value with the highest frequency in a data set.
* It is important to note that there can be more than one mode and if
no number occurs more than once in the set, then there is no mode
for that set of numbers.
EXAMPLE: UNGROUPED DATA
EXAMPLE: GROUPED DATA
• A large variance means that the
individual scores (data) of the sample
deviate a lot from the mean.
• A small variance indicates the scores
(data) deviate little from the mean.