EECM3724 Unit 1 Ch1 Slides 2022
EECM3724 Unit 1 Ch1 Slides 2022
From 19 /07/2022
Overview • Most successful managers and decision
makers use statistical data for effective
decision making. Economists,
From chapter 1: accountants, financial analysts,
marketers, producers use data. These
• Statistics
days, a buzzword is BIG DATA.
• Data • Big Data is described by four
characteristics: Volume (quantity of
• Scales of measurement stored data), Variety (type and nature),
Velocity (the speed at which data is
• Nominal, ordinal, interval, ratio
generated and processed), and veracity
• Qualitative/Categorical and quantitative data (the quality of the captured data).
Epidemiologists are currently using data
• Cross-sectional and time series data to make sense of the COVID-19
pandemic.
• Descriptive statistics • So you need this stuff and it starts here.
• Process of statistical inference • This section defines the basic terms in
•
statistics for economics.
Objectives
• Understand the meaning of the terms: elements, variables and observations, as
• Understand the role a sample plays in making statistical inferences about the
population.
Note: when you get to the end of this unit and having attempted suggested questions, come back to this slide and see if the
objectives are met.
Statistics
• Statistics is the art and science of collecting, analyzing, presenting and
interpreting data
• The social, natural, life and medical sciences are all concerned with the
gathering, organising, summarising and presentation of information.
So with this course we are introducing you to the art and science of data use for informed decisions.
How do you gather it? How do you organize it? How do you summarize it? The project component of the
module will give you a platform for hands-on experience.
Statistics
• Statistics is all about using DATA from the past to extract the INFORMATION we
need to MAKE DECISIONS about the future!
I am sure you are getting anxious now but also excited. You are imagining these things. Element, variable,
observation – all these describe data – in a data set. Let us make these aspects clearer.
Data, Data Sets,
Elements, Variables, and Observations
Variables
Putting it into
Element perspective: An
Names Stock Annual Earn/ element like
Exchange Sales(RM) Share(R) LandCare has
Company three variables
(stock exchange
Dataram NQ 73.10 0.86 it belongs to;
its annual sales,
EnergySouth N 74.00 1.67 and its earnings
Keystone N 365.70 0.86 per share) with
observations
LandCare NQ 111.40 0.33
which are NQ,
Psychemedics N 17.60 0.13 111.40 and
0.33.
Data Set
Scales of measurement
• In statistics, data collections requires one of the following scales of measurement: nominal,
ordinal, interval and ratio.
• These measurement scales are ways to sub-categorize different types of data and
determines the amount of information contained in the data
• The scale indicates the most appropriate data summarization and statistical analyses.
• When data for a variable are labels or names used to identify an attribute without
any quantitative value. Using example above, these are NQ or N.
• Alternatively, a numeric code can be used for gender were 1 denotes “female”
and 2 denotes “male”. Coding a ‘male’ as a ‘2’ does not make him of more value.
• In my South African data with a variable “Province” I can code the nine provinces
with arbitrary numbers. It does not make one province better than the other.
Scales of measurement
• Ordinal scale:
• The data have properties of nominal data, but the order or rank of the data is
meaningful.
• While the order is important, the differences between the codes is not really
known.
• Example, a student can rate his/her health status using non-numeric labels such
as poor, good or excellent.
• The data have properties of ordinal data and the interval between observations is
expressed in terms of a fixed unit of measure.
• Data are always numeric, and we know both the order and the exact differences
between the values.
• The scale also has an absolute zero that indicates that no numbers exist below
the zero point of the variable.
• Ratio data gives the ultimate–order and variables such as distance, height and
weight use the ratio scale.
• There are more alternatives for statistical analysis if the data is quantitative,
while the options are limited if the data is qualitative/categorical.
•
Qualitative/Categorical Data
• Data labels or names are used to identify an attribute of each element and the data
represent groups.
• Each observation can be placed in only one group, and the groups are mutually exclusive.
• Can either be nominal or ordinal scale of measurement
• Can be non-numeric or numeric.
• Appropriate statistical analyses are limited.
• Categorical often referred to as qualitative data as well
• However, a difference exist between categorical and qualitative data.
• Qualitative data does not take on numeric values and can only be used to describe the
types of categorical data that are nominal.
• We are unable to refer to ordinal data as qualitative as meaningful numeric values can be
assigned to the data.
• Question: Is all qualitative data categorical and is all categorical data qualitative?
• Can you identify examples?
Quantitative data
• It takes numerical values and represent some kind of measurement.
•
Classification of Scales of measurement
Data
Categorical Quantitative
• So the data comes from different individuals, firms, countries at a single point in time.
• Example is the daily closing price of a certain stock recorded over the last six weeks.
We have talked a lot about the classification of data. It is not just about classification. Data has to communicate something to
us. The next slides takes a look at this. It is just highlights. We will cover detail about these issues as the course unfolds.
Descriptive statistics
• Most of the statistical information in newspapers, magazines, company reports,
and other publications consists of data that are summarized and presented in a
Inferential statistics
body of methods used to draw
conclusions or inferences
about characteristics of
populations based on sample
data.
Attempt the exercises at the end of the chapter in your book
Next is:
Unit 1 (continued): Graphical and Numerical of Data