Trust Wallet Spamming
STATISTICAL ANALYSIS IN
ENGINEERING RESEARCH
by
Akachukwu Nwaobi
Definition of Statistics
196 142 169 174 162 151 129 186 168 201
209 126 132 156 109 122 148 163 169 109
The three distributions below represent different data conditions. In each distribution, look for the region
where the most common values fall. Even though the shapes and types of data differ, each distribution has a
central tendency: the area in the distribution where the most common values are located.
Arithmetic Mean
• For a set of observations: Mean, $\bar{x} = \frac{\sum x}{n}$
• For a set of grouped data: Mean, $\bar{x} = \frac{\sum fx}{\sum f}$
where,
$\bar{x}$ = the mean value of the set of given data.
f = frequency of each class
x = mid-interval value of each class
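The two mean formulas above can be checked numerically. The sketch below is only an illustration: it assumes the 20 raw values listed near the start of this lecture as the "set of observations", and it borrows the class mid-points and frequencies from the seed-yam frequency table used later in the percentile example as the grouped data. The variable names are hypothetical.

```python
# Raw observations: the 20 values listed near the start of the lecture
# (assumed here to be one sample; the slides do not say what they measure).
observations = [196, 142, 169, 174, 162, 151, 129, 186, 168, 201,
                209, 126, 132, 156, 109, 122, 148, 163, 169, 109]

# Mean of raw observations: sum of x divided by n
raw_mean = sum(observations) / len(observations)

# Grouped data: mid-interval value x and frequency f of each class,
# taken from the seed-yam table in the percentile example.
midpoints = [50, 70, 90, 110, 130, 150, 170, 190]
frequencies = [6, 28, 35, 55, 30, 15, 12, 9]

# Mean of grouped data: sum of f*x divided by sum of f
grouped_mean = sum(f * x for f, x in zip(frequencies, midpoints)) / sum(frequencies)

print(round(raw_mean, 2))      # 156.05
print(round(grouped_mean, 2))  # 112.42
```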
GEOMETRIC MEAN
https://www.cuemath.com/data/geometric-mean/
Example:
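• Let 2, 4, 8, 16 and 18 be the 5 items needed for the
distribution of grains in a field. Determine the geometric
mean.
• Solution: $GM = \sqrt[5]{2 \times 4 \times 8 \times 16 \times 18}$

x        Log x
2        0.3010
4        0.6021
8        0.9031
16       1.2041
18       1.2553
Total    4.2656

GM = antilog(4.2656 / 5) = antilog(0.8531) ≈ 7.13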
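As a rough check on the worked example, the geometric mean can be computed both directly as the fifth root of the product and through the logarithm table route. This is a minimal sketch using only the Python standard library; the variable names are illustrative.

```python
import math

values = [2, 4, 8, 16, 18]
n = len(values)

# Direct route: n-th root of the product of the items
gm_root = math.prod(values) ** (1 / n)

# Logarithm route: antilog of the mean of the base-10 logs
log_sum = sum(math.log10(v) for v in values)
gm_log = 10 ** (log_sum / n)

print(round(log_sum, 4))  # 4.2656
print(round(gm_root, 2))  # 7.13
print(round(gm_log, 2))   # 7.13
```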
Harmonic Mean
Median = 109.45 g (grouped seed-yam weight data; last rows of the frequency table)

Class (g)    Mid point    f      CF
140-160      150          15     169
160-180      170          12     181
180-200      190          9      190
Total                     190
Mode
For grouped data: $\text{Mode} = l + \left[\frac{f_s}{f_p + f_s} \times C\right]$
l = lower limit of the modal class
fp = the frequency of the class preceding the modal class
fs = the frequency of the class succeeding the modal class
C = class interval
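The grouped-data mode formula as stated in this section can be sketched in a few lines. The inputs below are an assumption for illustration: they take the class 100-120 (frequency 55) from the seed-yam table in the percentile example as the modal class, with neighbouring frequencies 35 and 30 and a class interval of 20. The function name is hypothetical.

```python
def grouped_mode(l, fp, fs, c):
    """Mode = l + [fs / (fp + fs)] * C, the formula given in this section.

    l  : lower limit of the modal class
    fp : frequency of the class preceding the modal class
    fs : frequency of the class succeeding the modal class
    c  : class interval
    """
    return l + (fs / (fp + fs)) * c


# Assumed inputs: modal class 100-120 from the seed-yam table
mode = grouped_mode(l=100, fp=35, fs=30, c=20)
print(round(mode, 2))  # 109.23
```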
• Quartiles (4-quantiles): Three quartiles split the data into four parts.
• Deciles (10-quantiles): Nine deciles split the data into 10 parts.
• Percentiles (100-quantiles): 99 percentiles split the data into 100 parts
Quartiles
• Quartiles are three values that split sorted data into four parts, each
with an equal number of observations. Quartiles are a type
of quantile.
• First quartile: Also known as Q1, or the lower quartile. This is the
number halfway between the lowest number and the middle
number.
• Second quartile: Also known as Q2, or the median. This is the middle
number halfway between the lowest number and the highest
number.
• Third quartile: Also known as Q3, or the upper quartile. This is the
number halfway between the middle number and the highest
number.
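To make the three quartiles concrete, the sketch below uses `statistics.quantiles` from the Python standard library on the 20 raw values listed near the start of this lecture (reused here purely as sample numbers). Its default `method="exclusive"` places the i-th quartile at position i(n+1)/4 in the sorted data and interpolates between neighbours, which is one common convention; other software may give slightly different values.

```python
import statistics

# Sample numbers: the 20 raw values listed near the start of the lecture
data = sorted([196, 142, 169, 174, 162, 151, 129, 186, 168, 201,
               209, 126, 132, 156, 109, 122, 148, 163, 169, 109])

# n=4 asks for the three cut points that split the data into four parts
q1, q2, q3 = statistics.quantiles(data, n=4)

print(q1)  # 129.75 (lower quartile)
print(q2)  # 159.0  (median)
print(q3)  # 172.75 (upper quartile)

# The second quartile agrees with the median
print(q2 == statistics.median(data))  # True
```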
Percentiles
• Assume that the elements in a data set are rank ordered from
the smallest to the largest. The values that divide a rank-ordered
set of elements into 100 equal parts are called percentiles.
• An element having a percentile rank of Pi would have a greater
value than i percent of all the elements in the set. Thus, the
observation at the 50th percentile would be denoted P50, and it
would be greater than 50 percent of the observations in the set.
An observation at the 50th percentile would correspond to the
median value in the set.
QUICK NOTE
• $P_x = l + \left[\frac{\frac{x \cdot n}{100} - cf}{f} \times C\right]$
• C = class interval
P75 (or Q3) = $\frac{x(n+1)}{100}$th item = $\frac{75(14+1)}{100}$th item
= 55.75 kg
Example: Compute the 25th and 75th Percentile

X = Weight of seed yams (g)    Mid point    No. of insects (f)    CF
40-60                          50           6                     6
60-80                          70           28                    34
80-100                         90           35                    69
100-120                        110          55                    124
120-140                        130          30                    154
140-160                        150          15                    169
160-180                        170          12                    181
180-200                        190          9                     190
Total                                       190

$P_x = l + \left[\frac{\frac{x \cdot n}{100} - cf}{f} \times C\right]$

$P_{25} = 80 + \left[\frac{\frac{25 \times 190}{100} - 34}{35} \times 20\right]$ = 87.71 g

$P_{75} = 120 + \left[\frac{\frac{75 \times 190}{100} - 124}{30} \times 20\right]$ = 132.33 g
Find P50 or Q2
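The grouped percentile formula can be applied mechanically once the percentile class is located. The following is a minimal sketch, assuming the seed-yam frequency table from the example; here l is taken as the lower limit of the class containing the (x·n/100)th item, cf as the cumulative frequency before that class, and f as that class's frequency. The function name `grouped_percentile` is hypothetical.

```python
# (lower limit, upper limit, frequency) for each class in the seed-yam table
classes = [(40, 60, 6), (60, 80, 28), (80, 100, 35), (100, 120, 55),
           (120, 140, 30), (140, 160, 15), (160, 180, 12), (180, 200, 9)]


def grouped_percentile(x, classes):
    """P_x = l + [(x*n/100 - cf) / f] * C for grouped data."""
    if not 0 < x <= 100:
        raise ValueError("x must be in the interval (0, 100]")
    n = sum(f for _, _, f in classes)
    target = x * n / 100          # position of the percentile item
    cf = 0                        # cumulative frequency before the current class
    for lower, upper, f in classes:
        if cf + f >= target:      # percentile item falls in this class
            return lower + (target - cf) / f * (upper - lower)
        cf += f
    raise ValueError("frequencies do not reach the target position")


p25 = grouped_percentile(25, classes)
p50 = grouped_percentile(50, classes)
p75 = grouped_percentile(75, classes)
print(round(p25, 2))  # 87.71
print(round(p50, 2))  # 109.45
print(round(p75, 2))  # 132.33
```

The 25th and 75th percentile values reproduce the worked results of 87.71 g and 132.33 g, and the same call with x = 50 gives P50 (Q2).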
IMPORTANT CHARACTERISTICS OF A GOOD AVERAGE
• An average is a representative item of a distribution. It should possess the following
properties:
The mean satisfies all the properties except that it is affected by the presence of
extreme items. For example, if the items are 5, 6, 7, 7, 7, 8 and 9, then the mean,
median and mode are all equal to 7. If the last value is 30 instead of 9, the mean becomes
10, whereas the median and mode are unchanged. Though the median and mode are
better in this respect, they do not satisfy the other properties. Hence the mean is the best
average among these three.
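The effect of an extreme item on the three averages can be verified with the Python standard library. This sketch simply replays the example above; the list names are illustrative.

```python
import statistics

items = [5, 6, 7, 7, 7, 8, 9]
with_extreme = [5, 6, 7, 7, 7, 8, 30]  # last value replaced by 30

# Mean, median and mode of the original items: all equal to 7
original = (statistics.mean(items),
            statistics.median(items),
            statistics.mode(items))

# Only the mean moves when the extreme item is introduced
shifted = (statistics.mean(with_extreme),
           statistics.median(with_extreme),
           statistics.mode(with_extreme))

print(original)  # (7, 7, 7)
print(shifted)   # (10, 7, 7)
```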
Lecture 3
Measures of Dispersion
• RANGE
• QUARTILE DEVIATION
• MEAN DEVIATION
• STANDARD DEVIATION
Range
Suppose we have the distribution of the yield (kg/plot) of two maize varieties, each grown on
5 plots:
Variety 1 : 45 42 42 41 40
Variety 2 : 54 48 42 33 30
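Taking the range as the difference between the largest and smallest observation, the two varieties can be compared with a short sketch. The helper name `value_range` is hypothetical; the means are printed only to show that the two varieties have similar averages but very different spreads.

```python
variety_1 = [45, 42, 42, 41, 40]  # yield in kg/plot
variety_2 = [54, 48, 42, 33, 30]  # yield in kg/plot


def value_range(values):
    """Range = largest observation minus smallest observation."""
    return max(values) - min(values)


range_1 = value_range(variety_1)
range_2 = value_range(variety_2)
mean_1 = sum(variety_1) / len(variety_1)
mean_2 = sum(variety_2) / len(variety_2)

print(range_1, mean_1)  # 5 42.0
print(range_2, mean_2)  # 24 41.4
```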