0% found this document useful (0 votes)
77 views22 pages

7.data Analysis 1

Chi-square test for nominal variables. T-test or ANOVA for comparing means of interval/ratio variables between two or more groups. Correlation or regression for examining relationships between two interval/ratio variables. Nonparametric equivalents like Mann-Whitney U or Kruskal-Wallis H can be used for ordinal data.

Uploaded by

Pamela Gabriel
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
77 views22 pages

7.data Analysis 1

Chi-square test for nominal variables. T-test or ANOVA for comparing means of interval/ratio variables between two or more groups. Correlation or regression for examining relationships between two interval/ratio variables. Nonparametric equivalents like Mann-Whitney U or Kruskal-Wallis H can be used for ordinal data.

Uploaded by

Pamela Gabriel
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 22

Data Analysis

What Statistical Test is Appropriate?


Statistics: The science of collecting, describing,
and interpreting data.
Two areas of statistics:
Descriptive Statistics: collection, presentation,
and description of sample data.
Inferential Statistics: making decisions and
drawing conclusions about populations.
Statistics

Descriptive Statistics Inferential Statistics


• Gives numerical and • Provides procedures
graphic procedures to to draw inferences
summarize a collection about a population
of data in a clear and from a sample
understandable way

Make predictions
Describing data
Levels of Measurement
• helps you decide how to interpret the data
from that variable
• knowing the level of measurement helps you
decide what statistical analysis is appropriate
on the values that were assigned.
Nominal Measurement
• The values “name” the attribute uniquely.
• “Nominal” scales could simply be called
“labels.”
• The value does not imply any ordering of the
cases, for example, jersey numbers in
football.
• Even though player 32 has higher number
than player 19, you can’t say from the data
that he’s greater than or more than the
other.
Other example of nominal data
• Gender
• Marital status
• Grade level
• Color of eyes
• Name of school
• Type of car
Example of nominal data
Gender Frequency Count
male 15
Female 35
Ordinal Measurement
When attributes/ data can be rank-ordered
or have natural ordering.
The ranking of favorite sports, the order of
people's place in a line, the order of
runners finishing a race or more often
the choice on a rating scale from 1 to 5.
Ordinal Level of Measurement
• With ordinal data you cannot state with certainty
whether the intervals between each value are
equal.
• A survey item – Satisfaction with registration
(very dissatisfied), 2 (dissatisfied), 3 (neutral), 4
(satisfied), and 5 (very satisfied)
• For example, we often using rating scales (Likert
questions). On a 5 point scale, the difference
between a 2 and a 3 is not necessarily the same
difference as the difference between a 4 and a 5.
Interval Level of Measurement
• Possesses both nominal and ordinal
characteristics
• Data of this type could have a value on a
continuous scale but there is no zero point where
the trait does not exist.
• The intervals between each number are the same
and the magnitude of what is measured and
represented by the numbers has equal intervals.
• Most common is I.Q. and temperature.
Ratio Level of Measurement
Variables measured at the ratio level have all
the characteristics of nominal-, ordinal-, and
interval-level measures (categories that have
names, order, and equal intervals), and the
categories include a true zero point. 
 Example is :  age, number of children, number
of pets, years of education, number of arrests,
years on the job, number of marriages, and so
on.
Example of ratio
• Age:  the categories have names (1 year old,
22 years old, etc.); the
categories have an inherent order from
youngest to oldest; the intervals between the
categories are equal (1 year); and it is possible
to be 0 years old.
Other examples: number of children, number of
pets, years of education, number of arrests,
years on the job
Test your knowledge
• Military title
• Categorization of clothing: hat, shirt, shoes, pants 
• heat measured in degrees centigrade. 
• Pain level (mild, moderate, severe)
• A score on a 5-point quiz measuring knowledge of algebra
• City of birth
• Number of marriages
• Height
• Weight
• Stage of cancer (stage I, II, III, IV)
• Education level (elementary, secondary, college)
• Satisfaction level (very dissatisfied, dissatisfied, neutral, satisfied, very satisfied)
• Agreement level (strongly disagree, disagree, neutral, agree, strongly agree)
• Interval and Ratio data are sometimes
referred to as parametric 
• Nominal and Ordinal data are referred to
as nonparametric.
• Parametric means that it meets certain
requirements with respect to parameters of
the population (for example, the data will be
normal--the distribution parallels the normal
or bell curve). In addition, it means that
numbers can be added, subtracted,
multiplied, and divided.
What appropriate statistical test?
Descriptive Statistics
Analysis of data that helps describe, show or
summarize data in a meaningful way such that,
for example, patterns might emerge from the
data.
Do not allow us to make conclusions beyond the
data we have analysed or reach conclusions
regarding any hypotheses we might have made.
They are simply a way to describe our data.
Measures of Central Tendency
• Quantitative data:
– Mode – the most
frequently occurring
observation
– Median – the middle
value in the data
– Mean – arithmetic
average
• Qualitative data:
– Mode – always
appropriate
– Mean – never
appropriate
Measures of Variability
Measures of Variability
2. Variance: (use all data points):

average of the distance that each score is from the mean


(Squared deviation from the mean)

Notation for variance


s2

3. Standard Deviation= SD= s2

4. Standard Error of the mean = SEM = SD/ n


Inferential Statistics

• Two nominal variables


Chi-square –can only read the significance of the
relationship of the two variables; is not a measure of
the degree of relationship
 
• Interval/ratio and nominal variables
Analysis of Variance (ANOVA) –used to find out if
there are significant differences between the means of
two groups
 
Inferential Statistics
• Two interval/ratio variables
Linear Regression or Correlation (Pearson-
Product Moment Correlation)– shows the pattern
of relationship between variables; a negative
valence signifies an inverse or indirect
relationship, not the absence of a relationship

• Ordinal and nominal variables


Analysis of Variance with ranks

You might also like