Ch1 - Basics
Ch1 - Basics
Example:
Price of a book at a retail store is $200, while the
price of the same book sold online is $100. The
ratio property shows that retail stores charge twice
the online price.
Categorical and Quantitative
Data
• Data that can be grouped into specific categories.
For example if the class has 15 males and 12
females, this would be categorical data. Although
there is a numerical value, the class of male or
females bring meaning to the data.
• Categorical is either nominal or ordinal
• Data that uses numerical values to indicate how
much or how many are referred to as quantitative
data. Used by obtaining interval or ratio scale of
measurement
Cross Sectional Data
• Data that is collected at the same or approximately
the same point in time
• For example, if you were comparing the price of 3
different stocks at the same time each year, this
would be cross sectional data.
• Another example may be if you were comparing
population of 5 different countries at the same point
in time.
Time Series Data
• Data collected over several periods.
• The following picture of SIRI stock (www.yahoofinance.com)
shows time series data
Time Series Data Continued
• Useful in Business and Economics in helping identify
trends, what has happened in the past and
potentially future trends
• Typically, pretty easy to understand
• In Accounting balance sheets or other incomes
statements may be reviewed and are good examples
of time series data.
DATA Sources
• Data can be obtained from either existing sources or from
surveys and experimental studies
• Existing Archival Data Sources – This could be from an
already existing database about employees or operations.
Routinely information obtained by HR may go into an existing
data. Data such as addresses, phone numbers, etc. may
already be obtained. In quantitative research, often, archival
data is used for analysis. These are data that already exists
somewhere.
• Customer databases are also a common existing database
(such as vendors for billing and ordering).
Observational Data
Statistical Studies – Observational
• In observational (nonexperimental) studies no attempt is made to
control or influence the variables of interest. In other words, the
dependent variable is not being controlled.
• Example - Studies of smokers and nonsmokers are observational
studies because researchers do not determine or control who will
smoke and who will not smoke.
Data Sources
• Statistical Studies– We’ve all had to fill our surveys for one
reason or another. The main goal of a survey is the collection
of data. Can be classified as experimental or observational.
• Experimental – Variable of Interest is first identified. Then one or
multiple variables are identified to determine how they interact
with one another. If a companies has Blood Pressure in mind, they
may change drug levels to see how the changes affect the main
variable, Blood Pressure. Thus, we often hear the phrase
“experimental drug” because it’s being tested or going through
some sort of study.
• Observational – There is no need or desire to control variables.
• Surveys are a good example of observational
Descriptive Statistics
• These are the numbers that are used to describe the
data that has been collected
• In research, descriptive statistics can be the number of
subjects/participants, Mean, Standard deviation, etc.
Please review to Word Table in Assignment
• Descriptive statistics are typically summarized in
graphs, tables, etc.
• Most common is the Mean (average or central
tendency)
• Know how to find Mean in Odd and Even number sets
Statistical Inference
• When we draw conclusions about the data collected, this is
statistical inference.
• These are estimates and Test hypothesis about the
characteristics of a sample population. Later we will discuss
the alternate and null hypothesis
• To make a statistical Inference, one might study data, the
outcome of data and make a hypothesis about a the study that
was conducted.
• For example, average lifespan of how long a battery might last
or a light bulb might burn would be done by studying data and
making a statistical inference.
Data Mining
• Deals with methods for developing useful decision-making
information from large databases.
• Analysts “mine the data” in the data warehouse, giving it the
name data mining.
• You may have heard the phrase “data driven” before. This
deals with taking data from the database and making data
minded decision. For example, you may have a room
utilization database that show how often your conference
rooms are used. From this bigger database you can sort data
to determine which rooms are used more frequently. This may
help you support budgetary decisions to buy additional
resources for these rooms.