COMM 291 Tutorial 1
Week 2
TAs: Ophir Greif & Hima Kattumuri
Emails: [email protected]
[email protected]
What is statistics?
➔ Data and their computations
➔ Methods to analyze data
Why is statistics important?
➔ Helps make decisions under uncertainty, with partial knowledge
Why should I learn statistics?
➔ Applicable to all segments of business
Why should we care?
➔ There is so much data today, and it needs to be analyzed and made useful
Definitions
➔ Variable: a characteristic being recorded
➔ Data: a collection of values
➔ Observations: synonym for data
➔ Data table: a table containing data, also known as a spreadsheet
➔ Record: a row in a spreadsheet
➔ Database: multiple spreadsheets combined
Types of data
➔ Data can be categorical or quantitative
➔ Categorical: data can be classified into distinct bins
◆ Few options, much repetition
◆ a. Binary: two options
◆ b. Nominal: more than two options, unordered
◆ c. Ordinal: more than two options, ordered
➔ Identifier variables: unique to you
◆ E.g. UBC number, Social Insurance Number
➔ String variables
◆ Dates and times (can be made quantitative, e.g. age)
◆ E.g. identifier variables
When:
➔ Tue. 12:30–2:00 pm (PST) - Hima
➔ Wed. 4:00–5:30 pm (PST) - Ophir
➔ Th. 12:30–2:00 pm (PST) - Alternating
Note: The three sessions are identical. Recordings will
be posted at the end of every week.
Office Hours: after tutorials + before exams. Email me if you need any help!
Lecture 1: Introduction
What is statistics?
➔ Data and their computations
➔ Methods to analyze data
➔ Quantitative
◆ Have units
◆ Allow arithmetic operations
◆ Can be discrete or continuous
➔ Strings
◆ Identifier Variable
● E.g. Student number, Social Insurance number, UPS tracking number.
Example: Sauder BCom International Students
List the variables in the data set. Indicate whether each variable is
treated as categorical or quantitative, or neither. If the variable is
quantitative, state the units. If the variable is categorical, state the type.
Problem #2 - Solution
➔ Identifier Variable/Strings
◆ None
➔ Categorical
◆ Sex (nominal)
◆ Major (nominal)
◆ Only child (binary)
➔ Quantitative
◆ Age (years)
◆ Height (inches)
◆ Weight (pounds)
◆ GPA (4.33 scale)
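The classification in this solution can also be written down as a small lookup table. The Python sketch below is only an illustration: the dictionary layout and the helper names `group_by_kind` and `describe` are assumptions made for this example, not something from the lecture. The variable names, types, and units are taken from the solution above.

```python
# Classification from the Problem #2 solution.
# The dictionary layout is an illustrative assumption, not part of the tutorial.
VARIABLES = {
    "Sex": ("categorical", "nominal"),
    "Major": ("categorical", "nominal"),
    "Only child": ("categorical", "binary"),
    "Age": ("quantitative", "years"),
    "Height": ("quantitative", "inches"),
    "Weight": ("quantitative", "pounds"),
    "GPA": ("quantitative", "4.33 scale"),
}


def group_by_kind(variables):
    """Return {kind: [variable names]}, keeping the order of the table."""
    groups = {}
    for name, (kind, _detail) in variables.items():
        groups.setdefault(kind, []).append(name)
    return groups


def describe(name, variables=VARIABLES):
    """Describe one variable: categorical ones get a type, quantitative ones get units."""
    kind, detail = variables[name]
    label = "type" if kind == "categorical" else "units"
    return f"{name}: {kind} ({label}: {detail})"


if __name__ == "__main__":
    for kind, names in group_by_kind(VARIABLES).items():
        print(kind, "->", ", ".join(names))
    print(describe("GPA"))
```

Writing the table this way makes the rule from the problem statement explicit: every quantitative variable must come with units, and every categorical variable must come with a type.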
Lecture 2: Context and Units Matter
➔ Which country is best to live in?
◆ Answer depends on what you are looking for (e.g. average income,
greenhouse emissions, quality of education, etc.)
➔ Pay attention to units
◆ Different measurement systems
◆ E.g. $/pounds, km/miles, kg/pounds
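To see why units matter, here is a minimal Python sketch that puts two measurements on a common scale before comparing them. The function names are made up for this illustration. The conversion factors are the exact definitions of the international mile and the avoirdupois pound. Currency ($ vs. pounds) is left out on purpose because exchange rates change over time.

```python
# Exact conversion factors (by definition).
KM_PER_MILE = 1.609344
KG_PER_POUND = 0.45359237


def miles_to_km(miles):
    return miles * KM_PER_MILE


def km_to_miles(km):
    return km / KM_PER_MILE


def pounds_to_kg(pounds):
    return pounds * KG_PER_POUND


def kg_to_pounds(kg):
    return kg / KG_PER_POUND


def heavier_in_kg(weight_kg, weight_lb):
    """Compare a weight recorded in kg with one recorded in pounds, both in kg."""
    return weight_kg > pounds_to_kg(weight_lb)


if __name__ == "__main__":
    # The raw numbers suggest 150 > 70, but the units differ.
    print(heavier_in_kg(70, 150))
```

Comparing the raw numbers 70 and 150 would give the wrong answer here: 150 pounds is about 68 kg, so the 70 kg measurement is the heavier one.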
Problem 4: Which country had the highest GDP per person in 2020?
A. United States
B. Qatar
C. Luxembourg
D. Singapore
E. Canada
Source: http://www.270towin.com/
Lecture 2: Data Quality
➔ Where did the data come from? Is the source reliable?
➔ How well are variables defined? Be specific!
➔ Level of measurement and accuracy (use rounding)
➔ How were the data collected (e.g. objective vs. subjective)?
➔ Missing data (what is missing and does it matter?)
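The missing-data question can be checked mechanically once the data are in a table. The sketch below is one possible way to do it in plain Python. The records, the field names, and the function name `missing_counts` are hypothetical and exist only for this illustration. A missing value is assumed to be stored as `None` or left out of the record.

```python
def missing_counts(records, fields):
    """Count, for each field, how many records have no value for it."""
    counts = {field: 0 for field in fields}
    for record in records:
        for field in fields:
            # A field counts as missing if the key is absent or its value is None.
            if record.get(field) is None:
                counts[field] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical records, invented for this example only.
    students = [
        {"age": 20, "gpa": 3.5},
        {"age": None, "gpa": 3.9},
        {"gpa": None},
    ]
    print(missing_counts(students, ["age", "gpa"]))
```

Counting is only the first half of the question. Whether the gaps matter still depends on the context, for example on whether the values are missing at random or only for a particular group.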
Problem #7: Is Climate Change Real?
Source: NASA
Take-Away Message
➔ Data classification determines how we can phrase our
questions and conduct data analysis. Units matter!
➔ The same variable may be viewed as categorical or
quantitative, depending on the situation.
➔ Understanding the context of the data is the first step!
➔ Pay attention to the quality of collected data (e.g. sources,
level of accuracy, missing data)
Thank you and see you next week!