0% found this document useful (0 votes)
49 views5 pages

Reviewer in IE-SAN1

This document discusses basic concepts in statistics including definitions of statistics, probability, types of data, and methods of presenting data. It explains how to construct frequency distribution tables to summarize qualitative and quantitative data by enumerating the frequencies of each category and calculating their relative percentages. The three measures of central tendency - mode, median, and range - are also introduced as common ways to condense and represent all the values in a data set using a single number.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
49 views5 pages

Reviewer in IE-SAN1

This document discusses basic concepts in statistics including definitions of statistics, probability, types of data, and methods of presenting data. It explains how to construct frequency distribution tables to summarize qualitative and quantitative data by enumerating the frequencies of each category and calculating their relative percentages. The three measures of central tendency - mode, median, and range - are also introduced as common ways to condense and represent all the values in a data set using a single number.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 5

Reviewer in IE-SAN1

Lesson 1: Basic Concepts of Probability and 2. Qualitative - It is non numerical data and is
Statistics subdivided into Two Types:
- Categorical data: are purely descriptive
Statistics - Statistical knowledge helps you use
and imply no ordering of any kind such
the proper methods to collect the data, employ the
as sex, area of residence.
correct analyses, and effectively present the
- Ordinal data: are those which imply some
results. Statistics is a crucial process behind how
kind of ordering like Level of education,
we make discoveries in science, make decisions
Socio-economic status, and Degree of
based on data, and make predictions.
severity of disease.
Probability - plays a vital role in the day to day
life. In the weather forecast, sports and gaming
strategies, buying or selling insurance, online Presentation of Data
shopping, and online games, determining blood
The first step in statistical analysis is to present
groups, and analyzing political strategies.
data in an easy way to be understood. The two
basic ways for data presentation are:
Definition of Statistics 1. Tabular presentation.
2. Graphical presentation
 Statistics is the science of dealing with
numbers.
 It is used for Collection, Summarization,
Tabulation
Presentation and Analysis of data.
 Statistics provides a way of organizing data to Some rules for the construction tables:
get information on a wider and more formal
1. The table must be self-explanatory.
(objective) basis than relying on personal
2. Title: written at the top of table to define
experience (subjective).
precisely the content, the place and the
time.
3. Clear heading of the columns and rows and
Types of data
units of measurements
Any aspect of an individual that is measured, is 4. The size of the table depends on the
called variable. Variables are either: number of classes. Usually lie between 2
and 10 rows or classes. Its selection
1. Quantitative - it is numerical data.
depends on the form of data and the
- Discrete data: are usually whole
requirement of the distribution. Too small
numbers, such as number of cases of
may obscure some information and too
certain disease, number of hospital beds
long will not differ from raw data.
(no decimal fraction).
- Continuous data: it implies the Types of tables
measurement on a continuous scale e.g.
o For Qualitative data, draw a simple table e.g.,
height, weight, age (a decimal fraction can
List Table: count the number of observations
be present).
(frequencies) in each category.
o For Quantitative data, we have to form 3. Enumerate
frequency distribution Table. - the individuals in each blood group i.e.
individuals with blood group A are 6 and
those with blood group B are 6, AB are 5
List Table: and blood group O are 3
- Make sure that the total number of
 A table consisting of two columns, the first individuals in all blood groups is 20 (the
giving an identification of the observational number of the studied group).
unit and the second giving the value of 4. Calculate The relative frequency
variable for that unit. - (%) of each blood group by dividing the
 Example: number of patients in each hospital frequency of that group over the total
department are: number of individuals and multiplied by
Medicine 100 patients 100
- the percentage of group A = 6/20 x 100,
Surgery 80 patients
and the same for group AB = 5/20 x 100
ENT 28 patients and group O = 3/20 x 100.

Frequency Distribution Table:


In Conclusion:
 are used for presentation of qualitative (and
o We can conclude from this table that blood
quantitative Discrete) data
 By recording the number of observations in groups A & B are the most common groups
each category. and the rarest is group O (depending on the
 These counts are called frequencies. percentage of each group).
 For Quantitative Continuous Data consists of a o So presenting data in table is beneficial in
series of classes (intervals) together with the deducing facts and information than raw data.
number of observations (frequency) whose
values fall within the interval of each class.
 Example: Assume we have a group of 20
individuals whose blood groups were as Lesson 2: Central of Tendency
followed: A, AB, AB, O, В, А, A, B, B, AB, O, AB,
AB, A, B, B, B, A, O, A. We want to present these
data by table. Measure of Central Tendency
 Usually when two or more different data sets
How to Construct a Frequency Distribution are to be compared it is necessary to condense
Tables the data, but for comparison the condensation
of data set into frequency distribution and
1. Put a title visual presentation are not enough.
- Distribution of the studied individuals  It is then necessary to summarize the data
according to their blood group. e.g. set in a single value. Such a value usually
2. Draw a table (Columns & Rows), somewhere in the center and represent the
- First column - Studied Variable - " Blood entire data set
Group",
- 2nd column - heading - "Frequency-
Number"
- 3rd column – heading - " Percentage %"
Mode b. According to Consumer Reports, the brand
with the lowest overall taste rating costs
 The set of data that occurs most frequently,
35cents/ounce.
it is also uncommon for data set to have more
than one mode. Solution:
 This happens when two or more elements
Eliminate that brand and find the median
occur with equal frequency in the data set.
price per ounce for the remaining barbecue-
 Example: Single Mode
flavored chips. Again, order the data. Note
Data Set = 2, 5, 9, 3, 5, 4, 7
that there are on add number of entries, so
Mode = 5
the median is simply the middle value.
 Example: Bimodal
Data Set = 2, 5, 2, 3, 5, 4, 7 18 19 19 27 28
Mode = 2 and 5
Median = Middle Value = 19 cents
 Example: Trimodal
Data Set = 2, 5, 2, 7, 5, 4, 7
Mode = 2, 5, and 7
Range
 A data set is the difference between the
Median largest value and smallest value contained
in the data set.
 The set of data that depends on whether the
 Steps in determining the Range
number of elements in the data set is odd
1. Reorder the data set from smallest to
or even.
largest.
 How to find the Median:
2. Subtract the last element to the first
1. Order the data from smallest to largest.
element.
2. For an odd number of data values
distribution, Median = Middle data value
 Example:
3. For an even number of data value in the
Data Set = 2, 5, 9, 3, 5, 4, 7
distribution, Median = Sum of middle two
Reorder = 2, 3, 4, 5,5,7,9
values / 2
Subtract = 9 – 2 = 7
Range = 7
 Example:
A consumer report on barbecue - flavored
potato chip price per ounce was released, and
the prices are 19, 19, 27, 28, 18, & 35. Difference Between Sample & Population
a. Find the median. Population Sample
Solution: The measurable The measurable
quality is called a quality is called a
18 19 19 27 28 35 parameter statistics
The population is a The population is a
19+27 complete set subset of the
Median=
2 population
Median=23 cents Reports are true Reports have a margin
representation of of error and
opinion confidence interval
It contains all It is a subset that
members of a specified represents entire
group population
Mean
 An average that uses the exact value of each 56
¿ =7
entry. 8
 How to find the Mean:
Variance
1. Compute ∑ x that is, find the sum of all the
data values.  The term variance refers to a statistical
2. Divide the total by the number of data measurement of the spread between
values. numbers in a data set. More specifically,
- Sample statistic x variance measures how far each number in

x=
∑x the set is from the mean and thus from every
n other number in the set.
- Population parameter μ  Steps in Calculating the Variance
1. Find the mean of the data set. Add all data
μ=
∑x values and divide by the sample size n.
N
2. Find the squared difference from the mean
- Where,
for each data value. Subtract the mean
n = number of data values in the sample
from each data value and square the
N= number of data values in the
result.
population
3. Find the sum of all the squared
Trimmed Mean differences. The sum of squares is all the
squared differences added together.
 A measure of center that is more resistant
4. Calculate the variance. Variance is the
than the mean but still sensitive to specific
sum of squares divided by the number of
data values is the trimmed mean. A trimmed
data points.
mean is the mean of the data values left
after "trimming" a specified percentage of
- Population
the smallest and largest data values from the
data set. Usually, a 5% trimmed mean is used. Variance=σ =

2 ( x i−μ )
2

This implies that we trim the lowest 5% of the n


data as well as the highest 5% of the data. - Sample Set

Variance=s =

2 ( x i−x )
2

HOW TO COMPUTE A 5% TRIMMED MEAN


n−1
1. Order the data from smallest to largest.
2. Delete the bottom 5% of the data and the top
5% of the data. Note: If the calculation of 5% Standard Deviation
of the number of data values does not produce
 A standard deviation is a statistic that
a whole number, round to the nearest Integer.
measures the dispersion of a dataset
3. Compute the mean of the remaining 90% of
relative to its mean. The standard deviation is
the data.
calculated as the square root of variance by
Example: Find the sample statistic mean of 6, 8, determining each data point's deviation
11, 5, 2, 9, 7, 8 relative to the mean. If the data points are
further from the mean, there is a higher
x=
∑x deviation within the data set; thus, the more
n spread out the data, the higher the standard
6+8+11+5+2+ 9+7+8 deviation.
¿  Steps in Calculating the Standard Deviation
8
1. The mean value is calculated by adding all 8 8−6=2 22
the data points and dividing by the number 10 10−6=4 4
2

of data points. 10 10−6=4 42


∑ x =36 ∑ ( x −x )2 =70

2. The variance for each data point is Solving for Sample Variance:
calculated by subtracting the mean from
the value of the data point. Each of those 2
s=
∑ ( x i−x )
2

resulting values is then squared and the n−1


results summed. The result is then divided 2 70
s=
by the number of data points less one. 5
2
3. The square root of the variance-result from s =14
no. 2 is then used to find the standard
deviation.
Solving for the Sample Standard Deviation:
 Formula:

Sample Standard Population Standard


s=

s= √ 14
∑ ( x−x )2
n−1

√ ∑ ( x−x )
√ ∑ ( x− x )
2 2
s=3.47
s= σ=
n−1 n

 Example:
Big Blossom greenhouse was commissioned to
develop an extra-large rose for the Rose Bowl
Parade. A random sample of blossoms from
Hybrid A bushes yielded the following
diameters (in inches) for mature peak blooms.
2, 3, 3, 8, 10, 10

Find the Sample Variance and the Standard


Deviation

Solution:

x=
∑x
n
36
x=
6
x=6 inches
x x−x ( x−x )2
2 2−6=−4 −4
2

3 3−6=−3 −3
2

3 3−6=−3 −3
2

You might also like