Business Statistics Notes
Business Statistics Notes
5. Data Visualization: Tools like charts, graphs, and dashboards are used to
visually represent data, making it easier to interpret and communicate
insights.
11. Data Mining and Machine Learning: These techniques help uncover
hidden patterns and insights in large datasets, contributing to data-driven
decision-making.
1
12. Big Data: Businesses are dealing with ever-increasing volumes of data,
requiring specialized tools and techniques to manage and analyze big data
effectively.
What is data data sets variables and observations with example in business
statistics
In business statistics, it's important to understand the concepts of data, data sets,
variables, and observations. Let me explain each of these with examples:
1. Data: Data are individual pieces of information. They can represent facts,
figures, measurements, or descriptions. In business statistics, data are often
used to make informed decisions and gain insights. Examples of data in a
business context include sales figures, customer names, product prices, or
employee salaries.
2. Data Sets: A data set is a collection of data points or values. It's a structured
way to organize and store data for analysis. In a business context, a data set
might be a spreadsheet containing sales data for a specific month, an
inventory list, or a customer database.
2
Quantitative Variable Example: Sales revenue is a quantitative
variable. It can take on numeric values such as $1,000, $5,000, etc.
In business statistics, the scale of measurement and categories of data help classify
and understand the nature of the data being analyzed. Here's a table that outlines
the four commonly used scales of measurement and the categories of data
associated with each:
3
are not consistent. Dissatisfied), Educational
attainment levels (e.g.,
High School, Bachelor's,
Master's, Ph.D.), Ranks in a
competition
Certainly, let's explain the concept of the scale of measurement and categories of
data in business statistics without a table:
2. Ordinal Scale (Categorical Data): In the ordinal scale, data categories have
a meaningful order or rank, but the intervals between them are not
consistent. Ordinal data provide information about the relative position of
items in a list but don't convey how much difference exists between them.
Examples include customer satisfaction ratings (e.g., Very Satisfied,
4
Satisfied, Neutral, Dissatisfied, Very Dissatisfied), educational attainment
levels (e.g., High School, Bachelor's, Master's, Ph.D.), and ranks in a
competition.
3. Interval Scale (Numerical Data): Data at the interval scale have categories
with a meaningful order and consistent intervals between values, but there is
no true zero point. Interval data can be measured and differences between
values are meaningful, but the absence of a true zero means that we can't say
one value is "twice" or "half" of another. Examples include temperature
(measured in Celsius or Fahrenheit), IQ scores, and years (e.g., 2020, 2021,
2022).
4. Ratio Scale (Numerical Data): The ratio scale includes data with categories
having a meaningful order, consistent intervals between values, and a true
zero point, indicating the absence of the measured attribute. Ratio data
provide the most information and support all mathematical operations.
Examples include height (in inches or centimeters), weight (in pounds or
kilograms), revenue (in dollars), age (in years), and the number of
employees.
Ethical guidelines for statistical practices are crucial to ensure the integrity,
accuracy, and responsible use of statistical methods. Here are some key ethical
principles for statisticians and researchers:
5
4. Data Quality: Strive for data accuracy and quality. Document data sources,
cleaning processes, and any adjustments made to the data.
7. Responsible Data Handling: Handle data with care and protect it from
unauthorized access. Use secure storage and transmission methods to
prevent data breaches.
10. Use of Appropriate Methods: Select statistical methods that are suitable
for the data and research questions. Misusing statistical methods can lead to
incorrect conclusions.
6
14. Data Ownership and Sharing: Clarify data ownership and sharing
arrangements. Make data available to other researchers whenever possible,
while respecting privacy and confidentiality.
4. Percentiles: Percentiles divide a dataset into 100 equal parts. For example,
the 25th percentile is the value below which 25% of the data falls.
7
6. Measures of Shape: Skewness and kurtosis are used to describe the shape
of a data distribution. Skewness measures the degree of asymmetry, and
kurtosis measures the degree of "peakedness" or "flatness."
1. Distribution Shape:
2. Score in Statistics:
8
In statistics, a "score" typically refers to an individual data point or observation
within a dataset. Scores can represent various types of measurements or attributes,
such as test scores, survey responses, temperatures, or any other quantitative
values.
The term "score" is often used in the context of measurement or assessment. For
example, a test score could represent a student's performance on an exam. The
scores can be used for analysis, comparison, and making inferences about a
population.
Covariance and correlation are two statistical concepts that describe the
relationship between two or more variables in a dataset. They are often used to
analyze how variables change in relation to each other.
Covariance:
scssCopy code
Cov(X, Y) = Σ (X - X) * (Y - Ȳ)] / (n - 1)
9
Where:
X Ȳ X Y
Units: The unit of covariance is the product of the units of the two variables
being measured.
Correlation:
cssCopy code
Interpretation:
11
4. Events and Complements: In business statistics, you can define events and
their complements. The complement of an event A is the event "not A." For
example, if you're interested in the probability of a product being sold, the
complement is the probability of it not being sold.
12
1. Discrete Probability Distribution: This type of distribution is used when
the random variable can take on only a countable number of distinct values.
Each of these values has a probability associated with it. Common examples
of discrete probability distributions include the binomial distribution (used
for events with two outcomes, like success/failure) and the Poisson
distribution (used for counting the number of events in a fixed interval).
Sample:
13
Samples should be selected in a way that avoids bias and is representative of
the population. Various sampling methods, such as random sampling or
stratified sampling, are used to achieve this.
Sampling Distribution:
When you repeatedly take random samples from a population and calculate a
statistic (e.g., the sample mean) for each sample, you'll get a range of values
for that statistic.
14
What is meant by point estimation
2. Sample Data: To perform point estimation, you need a sample drawn from
the population. The point estimate is calculated based on this sample.
3. Single Value: A point estimate is a single value that is considered the best
approximation of the population parameter. It's often repre
( , for the sample mean) and calculated using a specific
formula or method.
4. Uncertainty: Point estimates provide a single value but do not account for
uncertainty. They do not give any information about the range of possible
values that the true parameter might take. For this reason, point estimates are
often accompanied by confidence intervals to provide a sense of the
uncertainty in the estimate.
( ) u ( )
Interval estimation
16
confidence interval is typically of the form: "Point Estimate ± Margin of
Error."
1. Formulating Hypotheses:
Null Hypothesis (H0): The null hypothesis is the default or status quo
assumption. It typically states that there is no effect, no difference, or
no relationship in the population. It represents what you are trying to
test or investigate.
2. Collecting Data:
You use statistical methods to analyze the sample data and calculate a
test statistic.
4. Comparing Results:
5. Making a Decision:
6. Interpreting Results:
If you reject the null hypothesis, you accept the alternative hypothesis,
and you conclude that there is evidence to support your claim.
If you fail to reject the null hypothesis, you do not conclude that the
null hypothesis is true; you simply do not have enough evidence to
support the claim.
Hypothesis testing is used in a wide range of fields, from scientific research and
quality control in manufacturing to marketing and social sciences. It allows
researchers and analysts to make data-driven decisions, draw conclusions, and
assess the significance of results. The process of hypothesis testing helps ensure
that claims about populations are based on sound statistical evidence.
18
19