Lesson 5 Probability Distributions 1
Lesson 5 Probability Distributions 1
Probability distributions describe how the values of a random variable are distributed.
Among these, the normal distribution is one of the most important and widely used in statistics.
Here we will discuss the normal distribution and its properties, how to visualize and analyze
probability distributions using SPSS and Excel, and the applications of the normal distribution in
business.
Properties:
1. Symmetry: The distribution is symmetric around the mean μ.
▪ In data analysis, if the data is normally distributed, it means the mean, median, and
mode are all equal and located at the center of the distribution. This helps in
understanding that the data is evenly distributed around the central value.
▪ In performance evaluation, knowing that data is symmetric allows analysts to use the
mean as a reliable measure of central tendency.
2. Mean, Median, and Mode: In a normal distribution, the mean, median, and mode are all
equal and located at the center of the distribution.
3. Bell-Shaped Curve: The curve is bell-shaped and extends infinitely in both directions,
approaching but never touching the horizontal axis.
▪ The bell-shaped curve indicates that most of the data points are concentrated around
the mean, with fewer data points as you move away from the mean.
▪ In quality control, the bell-shaped curve helps in identifying that most products or
processes perform near the average level, with fewer instances of extreme
performance.
4. 68-95-99.7 / Empirical Rule: Approximately 68% of the data falls within one standard
deviation (σ) of the mean, 95% within two standard deviations, and 99.7% within three
standard deviations.
▪ This rule allows analysts to understand the distribution of data around the mean in a
predictable manner. Approximately 68% of data falls within one standard deviation,
95% within two, and 99.7% within three.
▪ In finance, this rule is used to assess risk. For instance, an investor can predict that
about 95% of the time, the returns on an investment will fall within two standard
deviations of the mean return, providing a measure of expected variability.
5. Asymptotic: The tails of the distribution approach the horizontal axis but never touch it.
▪ The tails of the normal distribution approaching the horizontal axis but never touching
imply that extreme values are possible but rare.
▪ In risk management, this property helps in understanding that while extreme events
(like financial crashes or exceptional profits) are unlikely, they are still possible and
should be considered in risk assessments.
Example 1
The grades on a history midterm at Almond have a mean of 85 and a standard deviation of 2 .
Michael scored 86 on the exam.
Michael's z-score is 0.5. His grade was half of a standard deviation above the mean.
Example 2
The grades on a midterm have a mean of 82 and a standard deviation of 4 .
Michael scored 74 on the exam.
Find the z-score for Michael's exam grade.
Michael's z-score is -2 . His grade was two standard deviations below the mean.
Practical Applications and Examples
1. Quality Control and Process Improvement
▪ Example: A factory producing light bulbs can use the normal distribution to monitor
the lifetime of the bulbs. By understanding the mean lifetime and standard deviations,
the factory can identify if the production process is consistent or if there are defects.
2. Finance and Investment Analysis
▪ Example: Analyzing stock returns with the normal distribution helps in understanding
the expected return and risk. Using the 68-95-99.7 rule, investors can predict the
range within which future returns are likely to fall.
3. Healthcare and Epidemiology
▪ Example: In epidemiology, the normal distribution can be used to analyze the spread
of a disease. Understanding the mean and variability of the incubation period helps in
predicting and managing outbreaks.
4. Real Estate Market Analysis
▪ Example: Real estate analysts use the normal distribution to understand the
distribution of house prices in a given area. By analyzing the mean and standard
deviation of house prices, they can provide better pricing strategies and investment
advice.
5. Education and Testing
▪ Example: Standardized test scores are often normally distributed. Educators and
psychologists use the normal distribution to interpret test scores and understand
student performance relative to the population.
Problem 3. If the mean of a dataset is 100 and the standard deviation is 15, what
percentage of data lies between 55 and 145?
Solution: Using the 68-95-99.7 rule:
• The range 55 to 145 represents three standard deviations from the mean (100 ± 3(15)).
• According to the rule, approximately 99.7% of the data falls within three standard
deviations of the mean.
Answer: 99.7%
Problem 4: A dataset has a normal distribution with a mean of 80 and a standard
deviation of 20. What is the probability that a randomly selected value from this
dataset is between 40 and 120?
Solution: Using the 68-95-99.7 rule:
• The range 40 to 120 represents two standard deviations from the mean (80 ±
2(20)).
• According to the rule, approximately 95% of the data falls within two standard
deviations of the mean.
Answer: 95%
Answer: 2.5%
Using SPSS and Excel to Visualize and Analyze Probability Distributions
SPSS:
1. Visualizing Normal Distribution:
o Data Entry: Enter your data into SPSS Data View.
o Histogram: Go to Graphs > Chart Builder, select Histogram, and drag it to the
chart preview area. Specify the variable you want to plot.
o Normal Curve: Check the option to display the normal curve in the histogram
properties.
o Example: Plotting the test scores of students to see if they follow a normal
distribution.
2. Analyzing Normal Distribution:
o Descriptive Statistics: Go to Analyze > Descriptive Statistics > Descriptives to
get the mean, standard deviation, skewness, and kurtosis.
o Normality Tests: Use Analyze > Descriptive Statistics > Explore and then check
Normality plots with tests to perform tests like Shapiro-Wilk and Kolmogorov-
Smirnov.
o Example: Testing whether the heights of a group of people follow a normal
distribution.
Excel:
1. Visualizing Normal Distribution:
o Data Entry: Enter your data into an Excel spreadsheet.
o Histogram: Select your data, go to Insert > Insert Statistic Chart > Histogram.
Adjust the bin width if necessary.
o Normal Curve: Use a scatter plot with a line (plot the normal distribution curve
using the NORM.DIST function).
o Example: Visualizing the distribution of daily sales figures.
2. Analyzing Normal Distribution:
o Descriptive Statistics: Use Data Analysis (available in the Data tab if the
Analysis ToolPak is installed) to get descriptive statistics.
o Normality Tests: Excel does not directly support normality tests, but you can use
add-ins or perform calculations manually.
o Example: Checking the distribution of customer satisfaction scores.
Applications of the Normal Distribution in Business
1. Quality Control:
o Process Improvement: Companies use the normal distribution to monitor
production processes and ensure they meet quality standards. Control charts are
based on the assumption of normality.
o Example: Monitoring the thickness of paper produced in a factory to ensure it
stays within specified limits.
2. Finance:
o Stock Returns: The returns on stocks and other financial assets are often
assumed to be normally distributed for risk assessment and portfolio
management.
o Example: Using the mean and standard deviation of historical returns to assess
the risk of an investment portfolio.
3. Marketing:
o Customer Preferences: Analyzing normally distributed customer satisfaction
scores to improve products or services.
o Example: Surveying customers to determine the average satisfaction level and
targeting improvements to shift the mean satisfaction higher.
4. Human Resources:
o Performance Evaluation: Employee performance ratings can be analyzed using
the normal distribution to identify outstanding or underperforming employees.
o Example: Using performance appraisal scores to determine the average
performance and identify employees for rewards or further training.
5. Inventory Management:
o Demand Forecasting: Predicting future demand for products based on past
sales data assumed to follow a normal distribution.
o Example: A retail store uses past sales data to forecast future demand and
manage inventory levels.
Conclusion
Understanding the normal distribution and its properties is essential for statistical analysis and
interpretation. Using tools like SPSS and Excel, you can visualize and analyze probability
distributions effectively. The normal distribution has numerous applications in business, helping
organizations make data-driven decisions in quality control, finance, marketing, human
resources, and inventory management.