0% found this document useful (0 votes)
38 views8 pages

Visualizing Distributions

Distributions refer to how data is spread out or distributed in a dataset. They provide insights into the central tendency and variability of data, enabling predictions and conclusions about populations based on samples. Distributions form the foundation for statistical inference and hypothesis testing. There are several ways to visualize distributions, including histograms which use bins to show frequency or density, density plots which use kernel smoothing for smoother distributions, violin plots which combine box plots and density plots, and box and whisker plots which use quartiles and whiskers to show variability.

Uploaded by

sathvik94912
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
38 views8 pages

Visualizing Distributions

Distributions refer to how data is spread out or distributed in a dataset. They provide insights into the central tendency and variability of data, enabling predictions and conclusions about populations based on samples. Distributions form the foundation for statistical inference and hypothesis testing. There are several ways to visualize distributions, including histograms which use bins to show frequency or density, density plots which use kernel smoothing for smoother distributions, violin plots which combine box plots and density plots, and box and whisker plots which use quartiles and whiskers to show variability.

Uploaded by

sathvik94912
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 8

What are Distributions?

Distributions refer to the way data is spread out or distributed. They


provide valuable insights into the characteristics of a dataset and are
crucial in data analysis.
• Importance of Distributions
• Distributions help us understand the central tendency and variability
of data.
• They enable us to make predictions and draw conclusions about the
population based on sample data.
• Distributions are the foundation for statistical inference and
hypothesis testing.
Types of Distributions
Histograms

• Histograms are one of the most straightforward ways


to visualize the distribution of a continuous variable.
• Data is divided into bins, and the height of each bar
represents the frequency or density of data points
within that bin.
• Histograms provide a sense of the shape, center, and
spread of the distribution.
Density Plot
• A Density Plot visualises the distribution of data over
a continuous interval or time period. This chart is a
variation of a Histogram that uses kernel smoothing
to plot values, allowing for smoother distributions by
smoothing out the noise. The peaks of a Density Plot
help display where values are concentrated over the
interval.
Violin Plot
• A Violin Plot is used to visualise the distribution of
the data and its probability density.
• This chart is a combination of a Box Plot and a
Density Plot that is rotated and placed on each side
(to show the distribution shape of the data). The
white dot in the middle is the median value and
the thick black bar in the centre represents the
interquartile range. The thin black line extending
from it represents the upper (max) and lower (min)
adjacent values in the data. Sometimes the graph
marker is clipped from the end of this line.
Box and Whisker Plot
• A Box and Whisker Plot is a convenient way of
visually displaying the data distribution through
their quartiles.
• The lines extending parallel from the boxes are
known as the “whiskers”, which are used to
indicate variability outside the upper and lower
quartiles. Outliers are sometimes plotted as
individual dots that are in-line with whiskers. Box
Plots can be drawn either vertically or horizontally.
THANK YOU

You might also like