0% found this document useful (0 votes)
56 views83 pages

STAT 2301 Introduction To Statistical Analysis Lecture Notes-1

The document provides an introduction to statistical analysis, covering key concepts such as statistics, descriptive statistics, and basic statistical terms. It explains the difference between theoretical and applied statistics, types of variables, and data sources. Additionally, it discusses methods for organizing and presenting data, including frequency distributions and graphical representations.

Uploaded by

malisenrichard80
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
56 views83 pages

STAT 2301 Introduction To Statistical Analysis Lecture Notes-1

The document provides an introduction to statistical analysis, covering key concepts such as statistics, descriptive statistics, and basic statistical terms. It explains the difference between theoretical and applied statistics, types of variables, and data sources. Additionally, it discusses methods for organizing and presenting data, including frequency distributions and graphical representations.

Uploaded by

malisenrichard80
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 83

Introduction to Statistical Analysis

Enock Mwenitete, BSc. (Statistics & Mathematics) - UNIMA, MSc. (Mathe-


matical Sciences - Statistics & Data Science) - Stellenbosch University, RSA.
October 27, 2023
LECTURE 1

1
S TATISTICS

• Statistics: Is the group of methods used to collect, analyze,


present and interpret data to make decision.
• Statistical methods help us to make scientific and intelligent
decision.
• Statistics has two aspects:
1. Theoretical or Mathematical statistics - deals with the
development, derivation, and proof of statistical theorems,
formula, rules and laws.
2. Applied statistics - invloves the applications of those
theorems, formulas, rules and laws to solve real-world problems.

1
D ESCRIPTIVE STATISTICS

• Data set: a collection of organized and structured data.


• Element: is a single data point. Elements - are individual units
of data that make up the data set.
• Observation: a case within the data set
Example:
Suppose we have information on the test scores of students
enrolled in the introduction to statistical analysis class.
• data set - whole set of numbers that represent the scores of
students.
• element - name of each student.
• observation - score of each student.

2
D ESCRIPTIVE STATISTICS

• A data set in its original form is usually large.


• Its not helpful to draw conclusion or make decision from original
data set.
• There is need to reduce data to manageable size by constructing
tables, drawing graphs or calculating summary measures such
as averages.
Portion of statistics that helps us to do this type of statistical
analysis is called descriptive statistics.
• Descriptive statistics - consists of methods for organizing,
displaying and describing data by using tables, graphs, and
summary measures.

3
BASIC STATISTICS TERMS

Population: Consists of all elements, individuals whose


characteristics are being studied.
Target population: population that is being studied.
Sample: A portion of the population selected for study.
Census & sample survey
Census: a survey that includes every member of the population.
Sample survey: a technique of collecting information from a portion
of the population.
Representative sample & Random sample
Representative sample: a sample that represents the
characteristics of the population as closely as possible.
Random sample: sample drawn in such a way that each element of
the population has chance of being selected.
Simple random sampling: all sample of the same size selected
from a population have the same chance of being selected.
4
Variable: is the characteristic under study that assumes different
values for different elements. (e.g., income of household, student
scores, student height)
Constant - value is fixed.
Observation/Measurement: the value of a variable for an element.

5
T YPES OF VARIABLES

1. Quantitative variables
• quantitative variable: a variable that can be measured
numerically.
• quantitative data: data collected on quantitative variable.

a. Discrete variable: a variable whose values are countable (discrete


variable assume only certain values with no intermediate values).
b. Continuous variables: a variable that can assume any numerical
value over a certain intervals (e.g., time taken to complete exams).

5
2. Qualitative or categorical variable
- a variable that cannot assume a numerical value but can be
classified into two or more non-numeric categories.
-qualitative data: data collected on qualitative variable.

6
C ROSS - SECTION & T IME SERIES DATA

1. Cross-section Data
- data collected on different elements at the same point in time or for
the same period of time.

6
2. Time series data
- data collected on the same element for the same variable at
different points in time or for different points of time.

7
S OURCES OF DATA

Internal sources
External sources
Survey and experiments

8
R EVISION : M ATHEMATICAL NOTATION

Mathematical notation helps express a mathematical relationship


concisely.
Summation notation: denote sum of values.
e.g., x1 = 75, x2 = 80, x3 = 35, x4 = 97, x5 = 88
x1 + x2 + x3 + x4 + x5 = 75 + 80 + 35 + 97 + 88 = 375
P
x = x1 + x2 + x3 + x4 + x5 = 375
Example: Given:
m: 12 15 20 30
f: 5 9 10 16
Compute: (a). m, (b). f 2 , (c). mf , (d). m2 f
P P P P

9
P RESENTATION OF DATA

organizing and graphing data


Raw data: data recorded in the sequence in which they are collected
and before they are processed or ranked.

10
- The data in above tables are also called Ungrouped data
-Ungrouped data set: contains information on each member of a
sample or population individually.
11
O RGANIZING AND GRAPHING QUALITATIVE DATA

- Frequency distributions for qualitative data: lists all categories


and the number of elements that belong to each of the categories.
Example:
Relative frequency of a category = frequency of that category
Sum of all frequencies
Percentage= Relative frequency × 100
Example:

11

You might also like