0% found this document useful (0 votes)

2 views

Chapter 1 Notes

The document outlines a structured approach to data analysis, including defining problems, collecting data, and modeling. It emphasizes the importance of understanding the context of data, identifying variables as categorical or quantitative, and the role of data mining in deriving actionable insights. Key definitions related to data types, variables, and analytics processes are also provided.

Uploaded by

burnsburner29

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

Chapter 1 Notes

Uploaded by

burnsburner29

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Test #1 – Chapter 1-6 Notes

Plan (1–2)
1. Define the problem.
2. Collect and/or find data and
identify the variables.
Do (3–6)
3. Prepare and wrangle data.
4. Characterize the data.
5. Explore the data.
Summarize
Visualize
6. Model (if appropriate).
Check conditions and
assumptions for modeling.
Fit the model and make the
necessary calculations.
Report (7)
7. Communicate and present.

- When companies try to obtain actionable information from data that may have been collected in the course
of doing business (such as records of transactions or a customer database) it is usually called data mining.
Sometimes the analysis is called predictive analytics if it focuses on future performance
- Newspaper journalists know that the lead paragraph of a good story should establish the “Five W’s”: who,
what, when, where, and (if possible) why. Often, we add how to the list as well. Answering these questions
connects the data to the business problem at hand.

- The columns are called variables. You’ll usually find the name of the variable at the top of the column as in
Table 1.1. We call cases by different names, depending on the situation. Individuals who answer a survey
are referred to as respondents. People on whom we experiment are subjects or (in an attempt to
acknowledge the importance of their role in the experiment) participants, but animals, plants, websites, and
other inanimate subjects are often called experimental units. Often we call cases just what they are: for
example, customers, economic quarters, or companies. When referring to a transaction, rows are often
called records. In Table 1.1, the rows are the individual orders, or purchase records. A common place to
find the who of the table is the leftmost column. It’s often an identifying variable for the cases, in this
example, the order number.
- A general term for a data table like the one shown in Table 1.1 is a spreadsheet, a name that comes from
bookkeeping ledgers of financial information. The data were typically spread across facing pages of a
bound ledger, the book used by an accountant for keeping records of expenditures and sources of income.

- When the values of a variable are simply the names of categories we call it a categorical, or qualitative,
variable. When the values of a variable are measured numerical quantities, we call it a quantitative variable.
Descriptive responses to questions are often categories

- Identifier variables are categorical variables whose only purpose is to assign a unique identifier code to
each individual in the data set. Your student ID number, social security number, and phone number are all
identifiers. Identifier variables are crucial in this era of Big Data because, by uniquely identifying the cases,
they make it possible to combine data from different sources and provide unique labels.
- The identifiers in Table 1.2 are the Customer Number, Product ID, and Transaction Number. Variables like
UPS Tracking Number and Social Security Number are other examples of identifiers.

- When the values of a categorical variable have an intrinsic order, we can say that the variable is ordinal. By
contrast, a categorical variable with unordered categories is sometimes called nominal. Values can be
individually ordered (e.g., the ranks of employees based on the number of days they’ve worked for the
company) or ordered in classes (e.g., Freshman, Sophomore, Junior, Senior).

- The quantitative variable Total Revenue in Table 1.4 is an example of a time series. A time series is an
ordered sequence of values of a single quantitative variable measured at regular intervals over time. Time
series are common in business. Typical measuring points are months, quarters, or years, but virtually any
consistently spaced time interval is possible.

list of the variables and their descriptions.

Account ID – categorical (nominal, identifier)
Pre Spending – quantitative (units $)
Post Spending – quantitative (units $)
Age – categorical (ordinal). Could be quantitative if we had more precise information
Segment – categorical (nominal)
Enroll? – categorical (nominal)
Offer – categorical (nominal)
Segment Spend – quantitative (units $)
- All these data are cross-sectional. We do not have successive values over time.

Understand the business context of the data and the problem you are trying to solve to be successful when making
decisions from data.
• Who, what, why, where, when (and how)—the W’s—help nail down the context of the data.
• We must know who, what, and why to be able to say anything useful based on the data. The who are the cases
(or records or rows). The what are the variables. A variable gives information about each of the cases. The why
helps us decide which way to treat the variables.
• Stop and identify the W’s whenever you have data, and be sure you can identify the cases and the variables.

Identify whether a variable is being used as categorical or quantitative.

• Categorical variables identify a category for each case. Usually we think about the counts of cases that fall in
each category. (An exception is an identifier variable that just names each case.)
• Quantitative variables record measurements or amounts of something; they must have units.
• Sometimes we may treat the same variable as categorical or quantitative depending on what we want to learn
from it, which means some variables can’t be pigeonholed as one type or the other.

Big Data - The collection and analysis of data sets so large and complex that traditional methods typically brought
to bear on the problem would be overwhelmed.

Business analytics - The process of using statistical analysis and modeling to drive business decisions.

Categorical (or qualitative) variable - A variable that names categories (whether with words or numerals) is called categorical
or qualitative.

Context - The context ideally tells who was measured, what was measured, how the data were collected, where the
data were collected, and when and why the study was performed.

Cross-sectional data - Data taken from situations that vary over time but measured at a single time instant are said to be a
cross-section of the time series.

Data - Recorded values, whether numbers or labels, together with their context.

Case - A case is an individual about whom or which we have data. Also called a record or row.

Data mining (or predictive analytics) - The process of using a variety of statistical tools to analyze large databases or data
warehouses.
Data table - An arrangement of data in which each row represents a case and each column represents a variable.

Data warehouse - A large database of information collected by a company or other organization usually to record transactions
that the organization makes, but also used for analysis via data mining.

Experimental unit - An individual in a study for which or for whom data values are recorded. Human experimental units are
usually called subjects or participants.

Identifier variable - A categorical variable that records a unique value for each case, used to name or identify it.

Metadata - Auxiliary information about variables in a database, typically including how, when, and where (and possibly
why) the data were collected; who each case represents; and the definitions of all the variables.

Nominal variable - The term “nominal” can be applied to a variable whose values are used only to name categories.

Ordinal variable - The term “ordinal” can be applied to a variable whose categorical values possess some kind of order.
Participant A human experimental unit. Also called a subject.

Quantitative variable - A variable in which the numbers are values of measured quantities with units. Record Information about
an individual in a database.

Relational database - A relational database stores and retrieves information. Within the database, information is kept in data
tables that can be “related” to each other.

Respondent - Someone who answers, or responds to, a survey.

Spreadsheet - A spreadsheet is a layout designed for accounting that is often used to store and manage data tables. Excel is a
common example of a spreadsheet program.

Subject - A human experimental unit. Also called a participant.

Time series Data - measured over time. Usually the time intervals are equally spaced or regularly spaced (e.g., every week,
every quarter, or every year).

Units - A quantity or amount adopted as a standard of measurement, such as dollars, hours, or grams.

Variable - A variable holds information about the same characteristic for many cases.

Quantitative data are data about numeric variables (e.g. how many; how much; or
how often). Qualitative data are measures of 'types' and may be represented by a
name, symbol, or a number code. Qualitative data are data about categorical variables
(e.g. what type).

242 Assignment 1 Interview Protocol
No ratings yet
242 Assignment 1 Interview Protocol
6 pages
Two Decades of Research On Business Intelligence System Adoption, Utilization and Success - A Systematic Literature Review PDF
No ratings yet
Two Decades of Research On Business Intelligence System Adoption, Utilization and Success - A Systematic Literature Review PDF
37 pages
Chapter 1 Data and Decisions
No ratings yet
Chapter 1 Data and Decisions
3 pages
Chapter1_StatisticsDeskriptive
No ratings yet
Chapter1_StatisticsDeskriptive
74 pages
Chapters 1 and 2 Statistics
No ratings yet
Chapters 1 and 2 Statistics
7 pages
Quantitative Methods - I (Statistics)
No ratings yet
Quantitative Methods - I (Statistics)
30 pages
Statistics Notes
No ratings yet
Statistics Notes
7 pages
Statistics
No ratings yet
Statistics
2 pages
Session_2_3___4_ISDA_ (1)
No ratings yet
Session_2_3___4_ISDA_ (1)
22 pages
Introduction To STATISTICS-new
No ratings yet
Introduction To STATISTICS-new
44 pages
Ch1_2
No ratings yet
Ch1_2
37 pages
Lesson 1 Introduction To Statistics
No ratings yet
Lesson 1 Introduction To Statistics
9 pages
QM 1
No ratings yet
QM 1
58 pages
Quantitative Methods 3
No ratings yet
Quantitative Methods 3
174 pages
Doane Chapter 02
No ratings yet
Doane Chapter 02
82 pages
Introduction and Data Collection
No ratings yet
Introduction and Data Collection
3 pages
Report Stat
No ratings yet
Report Stat
21 pages
UNIT 1-Module 1-Teaching
No ratings yet
UNIT 1-Module 1-Teaching
26 pages
UNIT 1-Module 1
No ratings yet
UNIT 1-Module 1
39 pages
Notes (Chapter 1 - 3)
No ratings yet
Notes (Chapter 1 - 3)
15 pages
1 Introduction To Statistics
No ratings yet
1 Introduction To Statistics
89 pages
(Buiness Statistics) Chapter 1 2
No ratings yet
(Buiness Statistics) Chapter 1 2
33 pages
W1_Data
No ratings yet
W1_Data
30 pages
Measurement Scale: Dr. Myint Moe Moe Khin Professor / Head Department of Statistics Monywa University of Economics
No ratings yet
Measurement Scale: Dr. Myint Moe Moe Khin Professor / Head Department of Statistics Monywa University of Economics
27 pages
PAS 111 Week 1
No ratings yet
PAS 111 Week 1
3 pages
Section 6 Data - Statistics For Quantitative Study
No ratings yet
Section 6 Data - Statistics For Quantitative Study
142 pages
2035 CH1 Notes
No ratings yet
2035 CH1 Notes
32 pages
01 Introduction
No ratings yet
01 Introduction
50 pages
MAT114 (Online) Packet 1
No ratings yet
MAT114 (Online) Packet 1
113 pages
Business Analytics (MIS171) Summary Notes
No ratings yet
Business Analytics (MIS171) Summary Notes
6 pages
Descriptive Statistics: Overview of Using Data
No ratings yet
Descriptive Statistics: Overview of Using Data
47 pages
Statistical Data
No ratings yet
Statistical Data
2 pages
AF Notes W2
No ratings yet
AF Notes W2
2 pages
Sbs1e PPT Chapter02
No ratings yet
Sbs1e PPT Chapter02
24 pages
Eco2061 Week 2
No ratings yet
Eco2061 Week 2
68 pages
Introduction To STATISTICS-new
100% (1)
Introduction To STATISTICS-new
46 pages
Business Statistics Introduction
No ratings yet
Business Statistics Introduction
38 pages
Notes (Chapter 1 - 3)
No ratings yet
Notes (Chapter 1 - 3)
15 pages
MGT 1103
No ratings yet
MGT 1103
4 pages
Data Managemennt
No ratings yet
Data Managemennt
20 pages
Bsa s01 s02 Ppt-In-class
No ratings yet
Bsa s01 s02 Ppt-In-class
125 pages
TOPIC 1 - Introduction To Statistics in Relation To
No ratings yet
TOPIC 1 - Introduction To Statistics in Relation To
47 pages
BBFH 103 Notes
No ratings yet
BBFH 103 Notes
38 pages
Stat For ds-1 (IITM BS Degree)
No ratings yet
Stat For ds-1 (IITM BS Degree)
109 pages
Statistics
No ratings yet
Statistics
24 pages
Business Statistics Introduction. 1
No ratings yet
Business Statistics Introduction. 1
18 pages
Introduction To Statistics CH 1
No ratings yet
Introduction To Statistics CH 1
29 pages
Basic Statistics: Chapter One
No ratings yet
Basic Statistics: Chapter One
15 pages
Nature of Statistics
100% (1)
Nature of Statistics
7 pages
Nature of Statistics
No ratings yet
Nature of Statistics
7 pages
Statisticts for business
No ratings yet
Statisticts for business
29 pages
Introduction and Data Collection
No ratings yet
Introduction and Data Collection
3 pages
Introduction and Data Collection
No ratings yet
Introduction and Data Collection
3 pages
business Analytics (tanya pandey) mba m3a
No ratings yet
business Analytics (tanya pandey) mba m3a
64 pages
MGS2150_Lecture1
No ratings yet
MGS2150_Lecture1
46 pages
L1 Intro Data Analytics
No ratings yet
L1 Intro Data Analytics
2 pages
Descriptive Statistics: Instructor: Maira Sami
No ratings yet
Descriptive Statistics: Instructor: Maira Sami
55 pages
2 Types of Data
No ratings yet
2 Types of Data
44 pages
EIE2003 Lecture 1
No ratings yet
EIE2003 Lecture 1
6 pages
CH 01
No ratings yet
CH 01
11 pages
Unit 5 Chapter 1 Data Analysis For Decision Making
No ratings yet
Unit 5 Chapter 1 Data Analysis For Decision Making
33 pages
Business Statistics I Essentials
From Everand
Business Statistics I Essentials
Louise Clark
5/5 (5)
Chapter 2
No ratings yet
Chapter 2
23 pages
Chapter 3
No ratings yet
Chapter 3
21 pages
Chapter 1
No ratings yet
Chapter 1
24 pages
Chapter 2 – The Totality of Decisions
No ratings yet
Chapter 2 – The Totality of Decisions
2 pages
Chapter 2 - Developing marketing strategies and a marketing plan
No ratings yet
Chapter 2 - Developing marketing strategies and a marketing plan
8 pages
2 - Marketing Strategy Fundamentals
No ratings yet
2 - Marketing Strategy Fundamentals
3 pages
Chapter 2 - Cost Concepts and Cost Behaviours
No ratings yet
Chapter 2 - Cost Concepts and Cost Behaviours
77 pages
1 - Managing for the Future
No ratings yet
1 - Managing for the Future
2 pages
Chapter 4 - Price Elasticity of Demand
No ratings yet
Chapter 4 - Price Elasticity of Demand
14 pages
Chapter 1 Video Slides
No ratings yet
Chapter 1 Video Slides
12 pages
Chapter 4 - Job-based Pay Structures and Job Evaluation & Person-Based Pay Structures
No ratings yet
Chapter 4 - Job-based Pay Structures and Job Evaluation & Person-Based Pay Structures
6 pages
Chapter 5 - Defining External Competitiveness
No ratings yet
Chapter 5 - Defining External Competitiveness
7 pages
Chapter 16 - Financial Leverage and Capital Structure Policy
No ratings yet
Chapter 16 - Financial Leverage and Capital Structure Policy
50 pages
Chapter 3 – Defining Internal Alignment & Job Analysis
No ratings yet
Chapter 3 – Defining Internal Alignment & Job Analysis
5 pages
Chapter 2 - UCC and CCA
No ratings yet
Chapter 2 - UCC and CCA
3 pages
Chapter 5
No ratings yet
Chapter 5
2 pages
Chapter 15 - Raising Capital
No ratings yet
Chapter 15 - Raising Capital
7 pages
Chapter 14 – Cost of Capital
No ratings yet
Chapter 14 – Cost of Capital
6 pages
Chapter 1 – The Pay Model
No ratings yet
Chapter 1 – The Pay Model
3 pages
Chapter 8
No ratings yet
Chapter 8
3 pages
Chapter 6
No ratings yet
Chapter 6
4 pages
Chapter 1
No ratings yet
Chapter 1
2 pages
Chapter 7
No ratings yet
Chapter 7
2 pages
ENHANCING ENGLISH AS A FOREIGN LANGUAGE ACADEMIC WRITING THROUGH AI AND PEER-ASSISTED LEARNING
No ratings yet
ENHANCING ENGLISH AS A FOREIGN LANGUAGE ACADEMIC WRITING THROUGH AI AND PEER-ASSISTED LEARNING
34 pages
Handout Methodology
No ratings yet
Handout Methodology
2 pages
Abiy 000652632 Busi 1359 Mba Thesis Isc-Bse
0% (1)
Abiy 000652632 Busi 1359 Mba Thesis Isc-Bse
63 pages
Where To Put Definition of Terms in Research Paper
No ratings yet
Where To Put Definition of Terms in Research Paper
8 pages
Syllabus: Cambridge IGCSE Sociology
No ratings yet
Syllabus: Cambridge IGCSE Sociology
23 pages
Turninglightaround, Secret of The Golden Flower
No ratings yet
Turninglightaround, Secret of The Golden Flower
28 pages
Diagnostic Test in PR2 - 1ST Sem 2022 2023 - Apr
No ratings yet
Diagnostic Test in PR2 - 1ST Sem 2022 2023 - Apr
4 pages
Proposal Literature THE INVINCIBLE MAN
No ratings yet
Proposal Literature THE INVINCIBLE MAN
26 pages
Applied Online Mentoring: Its Implication
No ratings yet
Applied Online Mentoring: Its Implication
13 pages
Open Works Final Digital Version PDF
No ratings yet
Open Works Final Digital Version PDF
203 pages
Smu Proposed Four Chapter Format
No ratings yet
Smu Proposed Four Chapter Format
11 pages
Gondim Junqueira & Souza 2016
No ratings yet
Gondim Junqueira & Souza 2016
12 pages
Using Teaching Courseware To Enhance Classroom Int
No ratings yet
Using Teaching Courseware To Enhance Classroom Int
13 pages
The Learning Styles of Selected Senior
No ratings yet
The Learning Styles of Selected Senior
12 pages
Using Authentic Videos To Improve English Listening Skills of Dong Nai Technology University Non English Majored Students
No ratings yet
Using Authentic Videos To Improve English Listening Skills of Dong Nai Technology University Non English Majored Students
6 pages
Udd Med Filipino Summer Final Exam
No ratings yet
Udd Med Filipino Summer Final Exam
6 pages
Research Proposal
No ratings yet
Research Proposal
19 pages
Conceptualization of Research: What? Why? How?
No ratings yet
Conceptualization of Research: What? Why? How?
68 pages
Belete Final Assignment Proposal
No ratings yet
Belete Final Assignment Proposal
47 pages
ayehu research
No ratings yet
ayehu research
37 pages
Ethical Issues in Collaborative Action Research
100% (1)
Ethical Issues in Collaborative Action Research
18 pages
Panorama Spring07
No ratings yet
Panorama Spring07
83 pages
FInal Marketing ALP
No ratings yet
FInal Marketing ALP
62 pages
Metareflective Essay - Final
No ratings yet
Metareflective Essay - Final
5 pages
Practical Research 12-Onda
100% (1)
Practical Research 12-Onda
11 pages
Dynamic Systems and Performance in Team Sports
No ratings yet
Dynamic Systems and Performance in Team Sports
5 pages
Campus Bullying in The Senior High School A Qualitative Case Study
100% (1)
Campus Bullying in The Senior High School A Qualitative Case Study
9 pages
A Study of The MRO Supply Chain For Paper Mills: Final Research Report
No ratings yet
A Study of The MRO Supply Chain For Paper Mills: Final Research Report
51 pages

Chapter 1 Notes

Uploaded by

Chapter 1 Notes

Uploaded by

Test #1 – Chapter 1-6 Notes

list of the variables and their descriptions.

Identify whether a variable is being used as categorical or quantitative.

Respondent - Someone who answers, or responds to, a survey.

Subject - A human experimental unit. Also called a participant.

You might also like