Variables and Data Presentation
• Dr. Nadia Elemam
Variable
• An item of data
• Examples:
– gender
– test scores
– weight
• Value varies from one observation to
another
What are the 7 types of variables in
research?
• Independent and dependent variables
• Active and attribute variables
• Continuous, discrete and categorical variables
• Extraneous variables
• Demographic variables
Variables
• The dependent variable is the variable that
the researcher measures; it is called
dependent because its value is expected to
depend on (be influenced by) the independent
variable.
Qualitative (Categorical) Data
• Non-overlapping categories or
characteristics
• Examples:
– Completes/Incompletes
– Professions
– Gender
Quantitative Data
• Frequencies
• Measurements
Discrete
[Figure: pie chart "Performance Appraisals" with segments Same, Easier and Much Easier]
[Figure: bar chart by quarter (1st Qtr to 4th Qtr) for East, West and North, scale 0 to 90]
Histogram
[Figure: histogram of Score (49 to 99) against Frequency (0 to 12)]
Boxplot
[Figure: boxplot of C1, scale 20 to 110]
Stem and Leaf Plot
 7 | 5
 8 | 3
 8 | 7999
 9 | 23
 9 | 66789
10 |
10 | 688
11 | 2244
11 | 788
12 | 4
12 | 8
13 |
13 | 8
14 | 1
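A stem-and-leaf plot can be built by splitting each value into a stem (all digits but the last) and a leaf (the last digit). The following is a minimal sketch under that assumption; the function name `stem_and_leaf` and the scores are hypothetical, and it prints one row per stem rather than the two rows per stem used on the slide.

```python
from collections import defaultdict

def stem_and_leaf(values):
    """Return stem-and-leaf rows as (stem, leaves) pairs.

    The stem is every digit except the last; the leaf is the last digit.
    """
    rows = defaultdict(list)
    for v in sorted(values):
        stem, leaf = divmod(v, 10)
        rows[stem].append(leaf)
    # Include empty stems so gaps in the data stay visible.
    stems = range(min(rows), max(rows) + 1)
    return [(s, "".join(str(d) for d in rows.get(s, []))) for s in stems]

# Hypothetical scores, chosen only for illustration.
scores = [75, 83, 87, 89, 92, 96, 108, 112]
for stem, leaves in stem_and_leaf(scores):
    print(f"{stem:>2} | {leaves}")
```

Keeping the empty stems matters: a row with no leaves shows a gap in the distribution that would otherwise be hidden.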
Analyze Ranked Data
• Frequency tables
• Mode, Median, Quartiles
• Graphs:
– Bar Charts
– Dot Plots, Pie Charts
– Line Charts (2 variables)
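As a rough illustration of a frequency table and the mode, the sketch below counts how often each response occurs. The responses and the helper name `frequency_table` are assumptions made for the example, not data from the slides.

```python
from collections import Counter
from statistics import mode

# Hypothetical ranked responses (not from the slides).
responses = ["Same", "Easier", "Easier", "Much Easier", "Easier", "Same"]

def frequency_table(items):
    """Return (value, count, relative frequency) rows, most frequent first."""
    counts = Counter(items)
    total = len(items)
    return [(value, n, n / total) for value, n in counts.most_common()]

for value, n, share in frequency_table(responses):
    print(f"{value:<12} {n:>2} {share:.0%}")

print("Mode:", mode(responses))
```

The same counts could feed a bar chart or pie chart, since both display frequencies per category.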
Data Example
Suggest some ways you could analyze these
items.
Pie Charts
• Segment - percentage of the whole that
falls into each category
[Figure: pie chart "Native Language" with segments including English and Spanish; visible labels 55% and 25%]
Bar Charts
• Vertical scale - frequencies or relative
frequencies
• Horizontal scale - categories
• Allows comparisons
[Figure: bar chart by Product (B41, BA 42, B41F, C21, Other), Before Training vs. After Training, scale 0 to 20]
Constructing Bar Charts
• All bars should have the same width
• Gaps between the bars - no connection
between them
• Bars may appear in any order
• Can be used to represent two categorical
variables simultaneously
Graphs: Measured
Continuous Quantitative Data
• Histograms
• Stem and Leaf
• Box plots
• Line Graphs
• XY Scatter Charts (2 variables)
Histograms
• Distributions of continuous variables
• Drawn without gaps
• Non-overlapping intervals
• Intervals - generally the same length
• Number of values in each interval - class
frequency
• Relative frequencies
[Figure: histogram "Grade Distribution", Grade (59 to 99) against Frequency (0 to 12)]
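The class frequencies behind a histogram can be computed by assigning each value to one of several equal-width, non-overlapping intervals. This is only a sketch: the function `class_frequencies`, the grades and the class limits are all assumed for illustration.

```python
def class_frequencies(values, start, width, n_classes):
    """Count values in n_classes equal-width, non-overlapping intervals.

    Interval i covers [start + i*width, start + (i+1)*width).
    Values outside the covered range are ignored.
    """
    counts = [0] * n_classes
    for v in values:
        i = int((v - start) // width)
        if 0 <= i < n_classes:
            counts[i] += 1
    return counts

# Hypothetical grades; the class limits 50, 60, ..., 100 are also assumed.
grades = [55, 62, 68, 71, 74, 77, 79, 83, 85, 88, 91, 95]
counts = class_frequencies(grades, start=50, width=10, n_classes=5)
relative = [c / len(grades) for c in counts]
print(counts)
print(relative)
```

Dividing each class frequency by the number of observations gives the relative frequencies, which sum to 1.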
XY Scatter Chart
• Two variables
• Variables: quantitative and continuous.
• Plot pairs - rectangular coordinate system
[Figure: scatter chart "Absent by Age", vertical axis DaysAbsent (5 to 20)]
[Figure: line chart of Sales (x$10,000) by Month (Jan to June)]
they do not have to be
continuous variables).
Basic Principles for Constructing All
Plots
• Data should stand out clearly from
background
• The information should be clearly labeled
– title
– axes, bars, pie segments, etc. - include units
that are needed to interpret data
– scale including starting points.
Principles cont.
• Source
• No clutter
• Minimize information or data on one
graph.
• Try several approaches
Describing Data
• Shape of the Distribution
– Symmetry
– Skewness
– Modality: number of peaks (a peak is a most
frequently occurring value)
– Unimodal, bimodal or uniform
[Figure: two histograms of Grade (59 to 99) against Frequency (0 to 12), titled "Right Skewed" and "Left Skewed"]
[Figure: histogram of Grade (59 to 99) against Frequency (0 to 12), labeled "Symmetrical"]
Describing Data
• Centrality
• Spread
• Extreme values
Measures of Centrality
• Mean
• Median
• Mode
Mean
        T1    T2    T3
        85    85    85
        90    90    90
        75    35    75
        90    90   110
Sum    340   300   360
Mean    85    75    90
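The T1, T2 and T3 columns can be checked with a few lines of code. The sketch below recomputes each sum and mean; the variable names are illustrative only.

```python
from statistics import mean

# Columns T1, T2 and T3 from the Mean table.
t1 = [85, 90, 75, 90]
t2 = [85, 90, 35, 90]   # one unusually low score
t3 = [85, 90, 75, 110]  # one unusually high score

for name, scores in [("T1", t1), ("T2", t2), ("T3", t3)]:
    print(name, sum(scores), mean(scores))
```

Changing a single score (75 to 35, or 90 to 110) moves the mean by 10 or 5 points, which shows how sensitive the mean is to extreme values.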
Median
• Central point
• Half of the data has a lower value than the
median
• Half of the data has a higher value than
the median
• Not affected by extremely large or small
values
Find the Median
85 90 75 92 95 Data
75 85 90 92 95 Sorted Data
Median is 90.
Find the Median
95 90 92 85 Data
85 90 92 95 Sorted Data
Median:
(90 + 92)/2 = 91
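Both median examples follow the same procedure: sort the data, then take the middle value, or the average of the two middle values when the count is even. A minimal sketch, with `median` written out by hand for clarity (Python's `statistics.median` does the same job):

```python
def median(values):
    """Middle value of the sorted data.

    With an even count, return the average of the two middle values.
    """
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2

print(median([85, 90, 75, 92, 95]))  # odd count
print(median([95, 90, 92, 85]))      # even count
```

The two calls reproduce the results on the slides, 90 and 91.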
Measures of Spread
• Range
– 85 90 75 92 95 Scores
– Range: 75 to 95, or 20
• Variance/Standard Deviation
• Quartiles
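For the five scores in the range example, the range and the variance and standard deviation can be computed as follows. The slides do not say whether the sample or population form is intended, so this sketch shows both as an assumption; the sample form divides by n - 1 and the population form by n.

```python
from statistics import pstdev, pvariance, stdev, variance

scores = [85, 90, 75, 92, 95]

value_range = max(scores) - min(scores)
print("Range:", min(scores), "to", max(scores), "or", value_range)

# Sample statistics divide by n - 1; population statistics divide by n.
print("Sample variance:", variance(scores))
print("Sample SD:", stdev(scores))
print("Population variance:", pvariance(scores))
print("Population SD:", pstdev(scores))
```

The standard deviation is the square root of the variance, so it is expressed in the same units as the scores.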
Quartiles
• The lower quartile is the same as the 25th
percentile.
– 25% of the scores are lower and
– 75% of the scores are higher than the lower
quartile.
• The upper quartile is the same as the 75th
percentile.
– 75% of the scores are lower and
– 25% of the scores are higher than the upper
quartile.
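Quartiles can be computed with Python's `statistics.quantiles` (available from Python 3.8). Several conventions exist for interpolating quartiles, and the slides do not name one, so the `method="inclusive"` choice here is an assumption; the scores are reused from the median example.

```python
from statistics import quantiles

scores = [75, 85, 90, 92, 95]

# method="inclusive" treats the data as the whole population of interest;
# the default "exclusive" method can give different cut points.
lower_q, median_q, upper_q = quantiles(scores, n=4, method="inclusive")
print(lower_q, median_q, upper_q)
```

With small data sets the different conventions can disagree noticeably, so it is worth stating which one was used when reporting quartiles.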
Correlation
• r will be a positive number.
[Figure: scatter plot of Y (0 to 8) against X (0 to 6)]
Negative Relationship
• r will be a negative number.
[Figure: scatter plot of Y (0 to 8) against X (0 to 6)]
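The sign of r can be illustrated with the Pearson correlation coefficient. The slides do not give a formula, so the standard definition is assumed here; the function name `pearson_r` and the data points are hypothetical.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient r for paired observations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

# Hypothetical points, invented for illustration.
x = [1, 2, 3, 4, 5]
y_up = [2, 3, 5, 6, 7]     # Y tends to rise with X
y_down = [7, 6, 5, 3, 2]   # Y tends to fall as X rises

print(pearson_r(x, y_up))    # positive
print(pearson_r(x, y_down))  # negative
```

r always lies between -1 and 1; values near either end indicate points that fall close to a straight line.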