Week 01statistics
Week 01statistics
Statistics
BS-1402
Instructor
LECTURES for week # 01
Engr.Syed Muhammad Khubaib
Sweden
Topics to be Covered in week 1
By
R.E. Walpole
Grading Policy
Assignments 15%
Quizzes 10%
Midterm 25%
Final Exam 50%
Statistics
• The science of collecting, organizing,
analyzing and interpreting data.
• Making data meaningful
• Statistical Question:
• A question where we expect to get a
variety of answers and you are interested
in the distribution, tendency of those
answers
Statistics
• Numerical data/facts that has been
arranged systematically اعداد و ش مار
Mean
• Mean = Average
Median
• Median: The middle number; found by
ordering all data points and picking out the
one in the middle (or if there are two
middle numbers, taking the mean of those
two numbers)
• It is the value separating the upper and
lower half of the data set (after putting the
data set into order)
• Median of even data set?
Mode
• Mode is the most frequently appearing
number in the data set.
• This may not necessarily be equal to the
mean of the data set
Statistics
Descriptiv Inferential
e Statistics Statistics
Descriptive Statistics
• Involves the description of collected data
• Qualitative Data
• Non-numeric: eye color, ethnicity, marital
status
Variable
• That quantity which varies for every
individual
Variable
Quantitativ
Qualitative
e
Discrete Continuous
Continuous Variable
• Height of a person
Interval
Scale
Nominal Scale
• The classification or grouping of
observations into mutually exclusive
qualitative categories is said to constitute
a nominal scale
• Students can be classified as male/female
• Classification based on nationality
Ordinal/ Ranking Scale
• Includes the properties of nominal scale
and has additional quality of
ordering/ranking.
• Students can be ranked by class
performance: excellent, good, fair,
average, poor
• Rainfall can be heavy, moderate, light
(difference between two ranks may not be
same)
Interval Scale
• A measurement scale with a constant
interval size but no true zero point:
thermometer
• 104oF is equal to 40oC but there is no true
zero. -15oC still has some average kinetic
energy in the molecules.
Ratio Scale
• A special type of interval scale where there
is a true zero point. Height, weight, volume
etc.
Biased Error
Error
Random Error
Errors
• Biased Errors: when the measurement
instrument has an error.
• Example of short scale measuring cloth
• Cumulative error as measurement quantity
increases
• Random errors: sometime measurement is
less or more than the actual.
Compensating error. Chance error
Statistical Inference
• A statistical inference is a prediction or
estimate or a generalization about a
population based on based on information
contained in a sample.
• We use information from a small group to
generalize conclusions about the whole
population
Elements of Statistical Inference
• There are 5 elements of a statistical
inference:
• Population (frame)
• Variable of interest
• The inference itself
• An estimate of reliability
Measure of Reliability
• A statement (usually quantified) about the
degree of uncertainty associated with an
inference
• Note: Only way to be certain that the
inference is correct is to include the whole
population in the sample (which is not
possible due to our limitations)
• Hence our inference is based on a sample
smaller than the population
Reliability
• It is important to determine and report the
reliability of each inference
Example
• A large number complaints about under-
filled paint cans
• So the retailer starts inspecting the paint
cans coming in from the supplier
• Under filled cans will be returned to the
supplier
• A shipment contained 2,440 gallon size
cans and the retailer sampled 50 cans on
a weight scale capable of measuring
weight up to 4 decimal places.
• Properly filled cans weigh 10 pounds
Example
a. Describe the population
b. Describe the variable of interest
c. Describe the sample
d. Describe the inference
e. Describe the measure of uncertainty of
our inference
Bound on the Estimation Error
• It is simply a number that our estimation
error ( the difference b/w the average
weight of the sample and the average
weight of the population of cans) is not
likely to exceed.
• So the bound is the measure of
uncertainty of our inference or the
reliability of our inference.
• Essentially the inference is incomplete
without the measure of its reliability
Example
• When the weights of 50 paint cans are
used to estimate the avg weight of cans,
the estimate will not exactly mirror the
entire population.
• For example: In the sample of 50 cans if
the avg weigh is 9 pounds it does not
mean that the whole population avg is also
9 pounds.
Example
• Nevertheless, we can use sound statistical
reasoning to ensure that our sampling
procedure will generate estimate that is
almost certainly within a specified limit of
the true mean weight of the population.
Example
• For example a reasoning might ensure
that the estimate is accurate to within one
pound of the actual population mean.
• Consequently we can write (9±1) as the
mean weight of the population
• The interval of 1 pound represents the
measure of reliability of the inference.
Population Vs Sample
• Population: The collection of all
individuals, items or data under
consideration in a statistical study