Statistics Lecture Notes - UNIT 1
Statistics Lecture Notes - UNIT 1
OUTCOMES
This unit deals with the role of statistics in the data analysis process. Concepts that are basic to the study of
statistics are discussed.
We live in an era where we are faced with increasing amounts of information, referred to as data. To perform
many tasks efficiently we need a basic understanding of statistical methods. The field of statistics covers a
problem-solving process that seeks answers to questions through data. To be an informed consumer of
information you must be able to:
extract information from tables and graphs
follow numerical arguments
understand the basics of how data should be gathered, summarised and analysed to draw statistical
conclusions
1
Example 1.1
As part of a weekly check to access the calibration of a filling machine, the quality control manager randomly
selects 50 bottles of beer that were filled on a specific day.
4. Interpret results
1.2. DEFINTION
Statistics is the scientific discipline that provides methods to help us make sense of data by:
collecting data in a methodical way
analysing data using methods to organise and summarise data using tables, graphs and numbers
interpret data to draw conclusions or to answer questions
2
The field of statistics is subdivided into descriptive and inferential statistics:
Descriptive statistics includes the collection and summarising of data to give an overview of the
information collected
Inferential statistics is the process of making an estimate, prediction or decision about a population
based on sample data
o A population is almost always very large
o A sample is drawn and data summarised using descriptive techniques
o The results are used to make decisions about the population
o Reliability of decisions/conclusions are measured
Confidence level: the proportion of times that an estimating procedure will be correct
in the long run
Significance level: how frequently the conclusion will be wrong
3
Quantitative variables are further classified as discrete or continuous
o Discrete variables are countable
Values are obtained through counting
E.g. the number of students in the class
o Continuous variables have infinite number of possible values that are not countable
Values are obtained through measuring or weighing
E.g. weight, length, time taken to complete a task, age, etc.
Example 1.2
Distinguish between qualitative and quantitative variables.
1. Gender
2. Temperature
3. Postal code
Example 1.3
Distinguish between discrete and continuous variables.
2. Number of cars that arrive at the KFC drive-through between 10h00 and 12h00
3. Distances different model cars with the same tank capacity can drive in city driving conditions
4. Temperature
4
1.4. MEASUREMENT
Measurement is the process we use to assign a value to the observations or elements of a variable. There are
four levels or scales of measurement: nominal, ordinal, interval, ratio. The analyses depend on the scale used
to measure a variable.
5
1.4.4. 1.4.4 Ratio scale
Data can be arranged in order
Both differences between data values and ratios of data values are meaningful
This scale must contain a zero value that indicates that nothing exists for the variable at the zero point
Arithmetic operations can be performed on the numeric values themselves.
E.g. money:
o The zero point is meaningful, i.e. at zero you have none
o The difference between R10 and R20 is the same as the difference between R50 and R60
o R10 is twice as much as R5
Arithmetic operations can be performed on the numbers themselves
Variables such as distance, height, weight, and time use the ratio scale
Activity 1.1
Categorise these measurements according to level.
Variable/measurements Measurement scale
1. Species of fish in the Vaaldam
2. Cost of rod and reel
3. Time of return home
4. Rating area of fishing area: Poor, fair, good
5. Number of fish caught
6. Temperature of water
Activity 1.2
The student council at a university with 10000 students is interested in the proportion of students who favour
a change in the admission requirements at the university. Two hundred students are interviewed to determine
their attitude towards this proposed change. Of the 200, 64 (32%) are in favour of the change. The student
council announced that less than 35% of all students are in favour of a change.
6
c) Classify the variable in terms of type and measurement scale