Introduction To Statistics

This document provides a summary of lecture notes for an introduction to statistics class covering chapters 3-5 on graphs and descriptive statistics. It discusses different types of graphs like histograms, bar charts, and pie charts that can be used to summarize categorical and quantitative variables. Key concepts covered include frequency, measures of center and spread, and Simpson's paradox. Students are given various exercises to work through involving creating graphs and interpreting data sets.

Uploaded by

Ahmed Kadem Arab


Please sign in (SIGNATURES) as you come in to class.
Signing in saves my voice, since I won't have to take attendance
aloud (this is only to settle the class roster).

Introduction to Statistics
Lecture Notes
Chapters 3-5
What's up with the PowerPoint?
I don't usually use slides, but I am going to try
them to save my voice somewhat.

Notes: Still working on getting the class roster
settled. There has been some movement on the
waitlist; I will keep in touch as things develop.
Be sure you've signed in!
The first homework is posted (on our course
website), but isn't due until next Friday (the
4th). The additional problem is NOT optional;
"additional" just means it is not a book problem.
Handouts for Today
There is one handout on graphs/descriptive
statistics going around. Save this to use
tomorrow in class.

There is a second handout: the anonymous
survey largely designed by the class on Monday.
Please go ahead and take a few minutes to fill
this out (no names!) and get it back to me. We'll
take a look at this data next week in lab.

If you missed class Monday, I have extra course
syllabuses at the front as well.
The W's of a Data Set
Who: the observations. The population is the set of all objects
for which you are interested in obtaining the value of some
parameter. Since we usually can't observe all
objects, we take a sample of objects (a subset of the
overall population) to observe.
Note: There is NO such thing as a "population sample"
or "sample population."
What: the variables
Why: why was the data collected?
How: how was the data collected? (related to
design/sampling in chapters 12-13)
When/Where: more information that could be relevant
Chapters 3-5 Overview
Covers basic graphs and descriptive statistics for
both categorical and quantitative variables
This is what you would do as a preliminary
analysis for a variable.

Recall: a data set can have multiple variables in
it.

These chapters focus mostly on univariate (single
variable) analyses. There is one comparative
graph, a side-by-side boxplot, in Chapter 5.
3 Rules of Data Analysis
Rule 1 - Make a picture
Rule 2 - Make a picture (really, before you do
anything else)
Rule 3 - Make a picture (really, we mean a
well-chosen picture for your variables)
Categorical Variable Prelim Analysis
Frequency tables (one variable): summarize
counts by category
Contingency tables (2 or more variables):
summarize counts by category for multiple
variables
Bar charts
Pie charts
Frequency
What is frequency?
Frequency is the number of objects/cases per
category
You can also look at relative frequency.
Relative frequency is the number of objects/cases
per category divided by the total number of
objects.
Hence it gives proportions for each category out of
the total.
It is often converted to %.
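The two definitions above can be sketched in a few lines of Python. The eye-color data here is made up purely for illustration; only the counting and dividing steps come from the slide.

```python
from collections import Counter

# Hypothetical categorical data: one observed category per case.
eye_colors = ["brown", "blue", "brown", "green", "brown", "blue", "brown", "blue"]

# Frequency: the number of cases in each category.
frequency = Counter(eye_colors)

# Relative frequency: each count divided by the total number of cases.
total = sum(frequency.values())
relative_frequency = {cat: count / total for cat, count in frequency.items()}

# Print a small frequency table, with relative frequency shown as a percent.
for cat, count in frequency.most_common():
    print(f"{cat:<6} {count:>2} {relative_frequency[cat]:6.1%}")
```

The relative frequencies always add up to 1 (100%), which is a quick check that no case was dropped.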
Bar Charts
One bar per category; height is determined
by frequency or relative frequency
Order of categories is arbitrary.
Because of that, a bar chart does NOT let you talk about the shape of a
distribution.

Area principle: the area taken up by each part of a graph should be
proportional to the value it represents. This is often violated when people try
to make graphs "cool" and go 3-D, etc. (see
Example passed around).
Pie Charts
Take 100% of cases and divide up the 360
degrees of the circle based on relative frequencies.

We will mostly use bar charts rather than pie charts.

Note that for bar charts you do not need to
create bars for 100% of the cases. You could
look at the top three risk factors for a disease,
etc. However, we usually do have 100% of
cases shown.
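As a rough sketch of how a pie chart is laid out, each category's slice angle is its relative frequency times 360 degrees. The three categories and their shares below are hypothetical.

```python
# Hypothetical relative frequencies for three categories (they sum to 1).
relative_frequency = {"A": 0.50, "B": 0.30, "C": 0.20}

# Each slice gets its share of the 360 degrees in a circle.
slice_degrees = {cat: 360 * share for cat, share in relative_frequency.items()}

for cat, degrees in slice_degrees.items():
    print(f"{cat}: {degrees:.0f} degrees")
```

Because the shares sum to 1, the slice angles sum to 360, which is why a pie chart (unlike a bar chart) only makes sense when 100% of the cases are shown.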
Contingency Tables - Example
See first page of Handout
Totals for rows/columns give marginal
distributions for each variable.
You can also look at conditional distributions:
fix a row or column and work solely within that
row or column.

Concept of independence (will formalize later):
If the conditional distribution of one variable is the same for all
categories of another variable, then the two
variables are independent.
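Here is a minimal sketch of marginal and conditional distributions for a two-way table. The table (housing status by car ownership) and its counts are invented for illustration and are not the handout's data.

```python
# Hypothetical contingency table: rows are housing status,
# columns are the answer to "do you own a car?".
counts = {
    "resident": {"yes": 20, "no": 60},
    "commuter": {"yes": 30, "no": 10},
}

# Marginal totals: sum across each row, and down each column.
row_totals = {row: sum(cols.values()) for row, cols in counts.items()}
col_totals = {}
for cols in counts.values():
    for col, n in cols.items():
        col_totals[col] = col_totals.get(col, 0) + n
grand_total = sum(row_totals.values())

# Marginal distributions: totals as proportions of all cases.
marginal_rows = {row: t / grand_total for row, t in row_totals.items()}
marginal_cols = {col: t / grand_total for col, t in col_totals.items()}

# Conditional distributions: fix a row and work solely within it.
conditional = {
    row: {col: n / row_totals[row] for col, n in cols.items()}
    for row, cols in counts.items()
}

for row, dist in conditional.items():
    print(row, dist)
```

In this made-up table the conditional distribution of car ownership is very different for residents and commuters, so by the informal rule on the slide the two variables do not look independent.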
On Your Own
Text has some discussion of segmented bar
charts and side-by-side bar charts (feel free to read or
skip)
Simpson's Paradox
Something that can happen when you aggregate
categorical data
Looking at overall averages or % can be misleading
Can get different results looking at the breakdown by group
Berkeley Discrimination Data Example (see bottom
of page one of the handout)
Claims of Sexual Discrimination in 1973 Graduate
School Admissions
Overall, 44.28% of males who applied were admitted,
while only 34.58% of females were admitted.
Look what happens when you break down by the 6
largest departments though! (Try this on your own or
with a partner.) Is there evidence of discrimination
against females at the dept. level? What is going on?
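To see how the reversal can happen, here is a small sketch with made-up numbers. These are NOT the Berkeley figures from the handout; the two departments and all counts are hypothetical and chosen only to make the effect easy to see.

```python
# Made-up (admitted, applied) counts per department and group.
# These are illustrative only and are NOT the Berkeley data.
admissions = {
    "Dept X": {"men": (80, 100), "women": (18, 20)},
    "Dept Y": {"men": (4, 20), "women": (30, 100)},
}

def rate(admitted, applied):
    """Admission rate as a proportion."""
    return admitted / applied

# Admission rate within each department.
by_dept = {
    dept: {group: rate(*pair) for group, pair in groups.items()}
    for dept, groups in admissions.items()
}

# Aggregate rate: pool admitted and applied counts across departments.
overall = {}
for group in ("men", "women"):
    admitted = sum(admissions[dept][group][0] for dept in admissions)
    applied = sum(admissions[dept][group][1] for dept in admissions)
    overall[group] = rate(admitted, applied)

print(by_dept)
print(overall)
```

In this toy example women have the higher admission rate in each department, yet the lower rate overall, because most women applied to the department that admits few applicants of either group.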
Quantitative Variables - Preliminary Analysis
Graphs
Dot plot: won't use much, read about on your own
Stem and leaf: won't use much, read about on
your own
Histogram
Boxplot (chapter 5)
QQ plot (Friday or next week)
Time plot (Friday or next week)
Descriptive statistics
Measures of center: mean, median
Measures of spread: standard deviation, IQR, range
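As a sketch, the measures listed above can be computed with Python's standard `statistics` module. The data are the eleven pre-test scores that appear in the stem-and-leaf slide of these notes. Quartile conventions differ between textbooks and software, so the IQR below (from the module's default method) may not match a hand calculation exactly.

```python
import statistics

# Pre-test scores from the stem-and-leaf slide in these notes.
scores = [5, 11, 13, 21, 34, 36, 45, 47, 48, 48, 49]

# Measures of center.
mean = statistics.mean(scores)
median = statistics.median(scores)

# Measures of spread.
std_dev = statistics.stdev(scores)              # sample standard deviation (divides by n - 1)
q1, _, q3 = statistics.quantiles(scores, n=4)   # quartiles, default "exclusive" method
iqr = q3 - q1
data_range = max(scores) - min(scores)

print(f"mean={mean:.2f} median={median} sd={std_dev:.2f} IQR={iqr} range={data_range}")
```

The mean (about 32.5) sits below the median (36) here, which is what you would expect when a few low scores pull the average down.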
Describing the distribution of a quantitative
variable
You should focus on three things when
describing the distribution of a quantitative
variable:
Shape: unimodal (one peak), bimodal (two peaks),
multimodal (many peaks), bell-shaped, skewed left
(tail to the left), skewed right (tail to the right),
symmetric, uniform (no peaks, basically flat)
Center: estimate the center (or use a descriptive
statistic)
If multiple peaks, report the peak locations
Spread: estimate the spread (can use a
descriptive statistic)
Dot Plot - On Your Own
Most basic quantitative graph
Use for a low number of observations (<50)
Basically, draw a number line and place a dot
above it for each value you have observed.
[Figure: example dot plot from Wikipedia]
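A dot plot is simple enough to sketch in plain text: one column per value on the number line, and one mark stacked for each observation. The eight whole-number values below are hypothetical, and the layout is just one possible way to draw it.

```python
from collections import Counter

# Hypothetical small data set of whole-number observations.
values = [2, 3, 3, 4, 4, 4, 5, 7]

counts = Counter(values)
low, high = min(values), max(values)
tallest = max(counts.values())

# Build the plot from the top row down: a "*" wherever a value
# has at least `level` observations stacked above the number line.
lines = []
for level in range(tallest, 0, -1):
    row = "".join(
        " * " if counts.get(v, 0) >= level else "   "
        for v in range(low, high + 1)
    )
    lines.append(row.rstrip())

# The number line itself, one 3-character column per value.
axis = "".join(f"{v:^3}" for v in range(low, high + 1))
lines.append(axis.rstrip())

dot_plot = "\n".join(lines)
print(dot_plot)
```

Note that 6 still gets a column even though nothing was observed there, so gaps in the data stay visible.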
Stem and Leaf - On Your Own
Your book discusses lots of options for these,
including split leaves (which is something
R/Rcmdr will do).
Basics: You take your values and set a stem,
maybe the tens place. Then the leaves are the ones place.
For each stem, you list the leaves that go with it
in numeric order.
Usually works decently for fewer than 100
observations
Try it. Suppose you have scores on a pre-test for
an at-risk youth group as follows:
5, 11, 13, 21, 34, 36, 45, 47, 48, 48, 49
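One way to check your hand-drawn display is the short sketch below, which uses the tens place as the stem and the ones place as the leaf, as described above. The `stem | leaves` layout is just one common convention.

```python
from collections import defaultdict

# Pre-test scores from the slide.
scores = [5, 11, 13, 21, 34, 36, 45, 47, 48, 48, 49]

# Split each score into a stem (tens place) and a leaf (ones place).
stems = defaultdict(list)
for score in sorted(scores):
    stem, leaf = divmod(score, 10)
    stems[stem].append(leaf)

# List every stem from smallest to largest, including any empty ones,
# with its leaves in numeric order.
lines = []
for stem in range(min(stems), max(stems) + 1):
    leaves = " ".join(str(leaf) for leaf in stems.get(stem, []))
    lines.append(f"{stem} | {leaves}".rstrip())

display = "\n".join(lines)
print(display)
```

Turned on its side, the display looks like a histogram with bins of width 10, and here it shows the scores piling up in the 40s with a tail toward the low scores.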
Histogram
Take the quantitative variable and break it up into
piles or bins (usually the same width).
Count the number of observations in each bin or pile.
Plot the frequencies per bin.
Usually no spaces between bins (if there is a space, it is a real gap in the data,
NOT a separator like in a bar chart).
You DO need to know the boundaries. (5,10], (10,15]
as bins IS different from [5,10), [10,15). (If anyone
needs me to explain open/closed brackets, please
ask.)
Technology lets us vary the width of the bins (and so, effectively,
the number of bins)
You can also use unequal bin widths, but then you
need to plot something called density, not frequency.
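The sketch below illustrates both points with made-up numbers: the same data counted with right-closed and left-closed bins, and bar heights as density for unequal bin widths. Density is taken here to be relative frequency divided by bin width, so that each bar's area equals its relative frequency; the helper names `count_bins` and `densities` are hypothetical.

```python
def count_bins(values, edges, closed="right"):
    """Count values per bin; closed='right' means (lo, hi], 'left' means [lo, hi)."""
    counts = [0] * (len(edges) - 1)
    for x in values:
        for i in range(len(edges) - 1):
            lo, hi = edges[i], edges[i + 1]
            in_bin = (lo < x <= hi) if closed == "right" else (lo <= x < hi)
            if in_bin:
                counts[i] += 1
                break
    return counts

def densities(counts, edges):
    """Bar height = relative frequency / bin width, so bar areas sum to 1."""
    total = sum(counts)
    return [c / (total * (edges[i + 1] - edges[i])) for i, c in enumerate(counts)]

# Hypothetical data with several values sitting exactly on a boundary.
values = [5, 10, 10, 12, 15]
edges = [5, 10, 15]
right_closed = count_bins(values, edges, closed="right")  # bins (5,10], (10,15]
left_closed = count_bins(values, edges, closed="left")    # bins [5,10), [10,15)
print(right_closed, left_closed)

# Hypothetical counts for unequal bins [0,10), [10,20), [20,50).
unequal_edges = [0, 10, 20, 50]
unequal_counts = [4, 4, 2]
bar_heights = densities(unequal_counts, unequal_edges)
print(bar_heights)
```

With right-closed bins the value 5 falls outside both bins, and with left-closed bins the value 15 does, so the two conventions give different counts for the same data. In the unequal-width example the widest bin gets the shortest bar even though it is three times as wide, which keeps the picture honest under the area principle.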
Examples
See page 2 of the handout
Try to describe the shape of each histogram

Then see page 3 of the handout

We're going to create a histogram by hand if there
is time
If no time, you can do this on your own.
Cookie Lab
Time Permitting (otherwise, Friday)

The last page (to turn in) is not due till the end
of class tomorrow. So don't worry if we don't
get to it today. You can look at it tonight or
tomorrow in class (I'll give the last five minutes of
class for you to work on it).
