Graphical Descriptive Statistics Lec 1
Graphical Descriptive Statistics Lec 1
Techniques
Adeel Nasir
Learning Objectives
• Types of Data and Information
• Describing a Set of Nominal Data
• Describing the Relationship between Two Nominal Variables and
Comparing Two or More Nominal Data Sets
Types of Data and Information
• Variable
• A variable is some characteristic of a population or sample.
• the mark on a statistics exam is a characteristic of statistics exams that is certainly of interest to readers of this
book.
• Not all students achieve the same mark. The marks will vary from
student to student, thus the name variable
• The price of a stock is another variable. The prices of most stocks vary daily
• Values
• The values of the variable are the possible observations of the variable
• The values of statistics exam marks are the integers between 0 and 100 (assuming the exam
is marked out of 100)
• The values of a stock price are real numbers that are usually measured in dollars and cents (sometimes in fractions
of a cent)
Types of Data and Information
• Data
• Data are the observed values of a variable
• suppose that we observe the following midterm test marks of 10 students:
67 74 71 83 93 55 48 82 68 62
• Types of Data
• Interval Data
• Nominal Data
• Ordinal Data
Types of Data and Information
• Interval Data
• Interval data are real numbers, such as heights, weights, incomes, and
distances.
• We also refer to this type of data as quantitative or numerical
• Nominal Data
• The values of nominal data are categories
• responses to questions about marital status produce nominal data. The values
of this variable are single, married, divorced, and widowed
• single = 1, married = 2, divorced = 3, widowed = 4
• Nominal data are also called qualitative or categorical
Types of Data and Information
• Ordinal Data
• Ordinal data appear to be nominal, but the difference is that the order of their values has meaning
• at the completion of most college and university courses, students are asked to evaluate the
course
• poor, fair, good, very good, and excellent
• We give values in order
• Poor = 1, Fair = 2, Good = 3, Very good = 4, Excellent = 5
• Here is another example. Suppose that you are given the following list of the most
active stocks traded on the NASDAQ in descending order of magnitude:
Order Most Active Stocks
1 Microsoft
2 Cisco Systems
3 Dell Computer
4 Sun Microsystems
5 JDS Uniphase
DESCRIBING A SET OF NOMINAL DATA
• We describe nominal data by calculation on nominal data is to count
the frequency or compute the percentage that each value of the variable represents
• Frequency Distribution
• Frequency distribution summarize the data in a table, which presents the categories and their
counts
• Relative Frequency Distribution
• lists the categories and the proportion with which each occurs.
• We can use graphical techniques to present a picture of the data.
• There are two graphical methods we can use: the bar chart and the pie chart
Example
• “Last week were you working full-time, part-time, going to school, keeping house,
or what”? The responses were
1. Working full-time
2. Working part-time
3. Temporarily not working
4. Unemployed, laid off
5. Retired
6. School
7. Keeping house
8. Other
The responses were recorded using the codes 1, 2, 3, 4, 5, 6, 7, and 8, respectively. The
first 150 observations are listed here.
Responses
Bar and Pie Charts
• A bar chart is often used to display frequencies
• a pie chart graphically shows relative frequencies
DESCRIBING THE REL ATIONSHIP BET WEEN TWO NOMINAL
VARIABLES AND COMPARING TWO OR MORE NOMINAL
DATA SETS
• situations where we wish to depict the relationship between
variables; in such cases, bivariate methods are required.
• cross-classification table (also called a cross-tabulation table) is used
to describe the relationship between two nominal variables
Cross classification table for frequencies
Row Relative Frequencies
GRAPHICAL TECHNIQUES TO
DESCRIBE A SET OF INTERVAL DATA
• Mostly we use histograms to describe the interval data