0% found this document useful (0 votes)

11 views8 pages

Advanced Plot Types With Matplotlib

The document provides an overview of advanced plot types in Matplotlib, including scatter plots, histograms, density plots, and box plots. Each plot type is explained with key features, example code, and descriptions of how to create and customize them using Python. The document emphasizes the importance of visualizing data distributions and relationships through these various plotting techniques.

Uploaded by

Riya Shah

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views8 pages

Advanced Plot Types With Matplotlib

Uploaded by

Riya Shah

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Advanced Plot Types in Matplotlib

1. Scatter Plots: A scatter plot is a two-dimensional data visualization that

uses individual data points to represent values for two different variables.
Each point on the plot corresponds to a single observation in the dataset,
with the x and y coordinates indicating the values of the two variables.
a. Key Features:

• Markers: Points on the plot representing individual data observations.

• Axes: The x and y axes represent the variables being compared.

• Patterns: Patterns or trends in the scatter plot may reveal relationships

or correlations between the variables.

b. Creating and Customizing Scatter Plots: Scatter plots are effective

for visualizing the relationship between two continuous variables.

Example Code:

import matplotlib.pyplot as plt

import numpy as np

# Generate random data

np.random.seed(42)
x = np.random.rand(50)
y = 2 * x + 1 + 0.1 * np.random.randn(50)

# Create a scatter plot

plt.scatter(x, y, color='red', marker='o',
label='Scatter Plot')

# Add title and labels

plt.title('Scatter Plot Example')
plt.xlabel('X-axis Label')
plt.ylabel('Y-axis Label')

# Show the plot

plt.legend()

1|Page
Output

This code generates a scatter plot

using randomly generated data
points (x and y) and demonstrates
how to customize the appearance
of the plot using Matplotlib
functions. The resulting plot will
have red circular markers, a title,
and labeled axes. The legend will
indicate that the plot represents a
scatter plot.

Import Libraries:
• matplotlib.pyplot: This imports the pyplot module from the
Matplotlib library in Python.
• NumPy: It is used for numerical operations and generating random
data.
Generate Random Data:
• np.random.seed(42): Sets the random seed to ensure
reproducibility of the random data.
• x = np.random.rand(50): Generates an array of 50 random values
between 0 and 1.
• y = 2 * x + 1 + 0.1 * np.random.randn(50): Creates a corresponding
array 'y' using a linear relationship with some random noise.

Create a Scatter Plot:

plt.scatter(x, y, color='red', marker='o',
label='Scatter Plot')
• plt.scatter(): Creates a scatter plot with x as the x-axis values, y as
the y-axis values.
• color='red': Sets the color of the markers to red.
• marker='o': Specifies that circular markers ('o') should be used.
• label='Scatter Plot': Adds a label to the scatter plot for later use in
the legend.

2|Page
Add Title and Labels:

• plt.title(): Sets the title of the plot.

• plt.xlabel(): Sets the label for the x-axis.
• plt.ylabel(): Sets the label for the y-axis.

Display the plots:

• plt.legend(): Displays the legend using the label provided in the

scatter plot.
• plt.show(): Displays the entire plot.

Let us now understand what histograms are and how to make them.

2. Histograms: A histogram is a graphical representation of the

distribution of a dataset. It is a way to visualize the underlying frequency
distribution of continuous data.
a. Key Features:
• Bins: The range of values is divided into intervals, or "bins," and the
number of data points falling into each bin is represented by the
height of a bar.
• Frequency: The height of each bar indicates the frequency of data
points within a specific bin.
b. Creating Histograms

import matplotlib.pyplot as plt

import numpy as np

# Generate random data

np.random.seed(42)
data = np.random.randn(1000)

# Create a histogram
plt.hist(data, bins=30, color='blue', alpha=0.7)
plt.title('Histogram Example')
plt.xlabel('Values')
plt.ylabel('Frequency')
plt.show()

3|Page
Output

This code generates a histogram to

visualize the distribution of random
data. The histogram represents the
frequency (or count) of values in
different bins, providing insight into
the shape of the data distribution.
The plot has a title, and the x-axis
and y-axis are labelled for clarity.

Import Libraries:
• matplotlib.pyplot: This imports the pyplot module from the
Matplotlib library in Python.
• NumPy: It is used for numerical operations and generating
random data.

Generate Random Data:

• np.random.seed(42): This sets the random seed for
reproducibility. It ensures that if you run the code again, you will
get the same random numbers.
• np.random.randn(1000): This generates an array of 1000 random
numbers from a standard normal distribution (mean=0, standard
deviation=1).

Create a Histogram:
plt.hist(data, bins=30, color='blue', alpha=0.7)

• plt.hist: This function is used to create a histogram.

• data: The array of random data to be plotted.
• bins=30: This specifies the number of bins (intervals) in the
histogram. In this case, there are 30 bins.
• color='blue': This sets the color of the bars in the histogram to blue.
• alpha=0.7: This controls the transparency of the bars, where 0 is
fully transparent and 1 is fully opaque. Here, it's set to 0.7.

4|Page
Add Title and Labels:

plt.title('Histogram Example')
plt.xlabel('Values')
plt.ylabel('Frequency')

• plt.title: Adds a title to the plot.

• plt.xlabel: Adds a label to the x-axis.
• plt.ylabel: Adds a label to the y-axis.

Display the Plot:

plt.show() function is used to display the histogram.

Now let’s learn to create density plots.

3. Density Plots: A density plot is a smoothed, continuous representation of

a dataset's distribution. It is often used to estimate the probability
density function of the underlying population.
a. Key Features:
Kernel Density Estimation (KDE): The density plot is created by
smoothing the histogram using a kernel function, providing a
continuous estimate of the distribution.
b. Creating density plots to show the probability density of a dataset:

plt.hist(data, bins=30, color='blue', alpha=0.7)

Output

This code creates a density

plot using Matplotlib with a
histogram and a kernel
density estimate (KDE)
overlaid on top of it.

5|Page
The Python code can be explained as follows:
Generate Example Data:
The code generates a random dataset of 1000 points drawn from a
standard normal distribution (mean = 0, standard deviation = 1).
Histogram:

plt.hist(data, bins=30, density=True, alpha=0.7,

color='skyblue', edgecolor='black')

This line generates a histogram using plt.hist(). The density=True

argument normalizes the histogram to represent a probability density. The
bins parameter controls the number of bins in the histogram. The alpha,
color, and edgecolor parameters are used for visual customization.

Kernel Density Estimate (KDE):

xmin, xmax = plt.xlim()

x = np.linspace(xmin, xmax, 100)
kde = (1 / (data.std() * np.sqrt(2 * np.pi))) *
np.exp(-0.5 * ((x - data.mean()) / data.std())**2)
plt.plot(x, kde, linewidth=2, color='darkred')

This part manually adds a kernel density estimate (KDE) to the plot. It
calculates a Gaussian KDE using a set of points (x) and then plots the
KDE on top of the histogram.

Labels and Title:

plt.title('Density Plot of the Data')

plt.xlabel('X-axis label')
plt.ylabel('Density')

These lines add a title and axis labels to the plot for better understanding.

The combination of the histogram and the KDE provides a visualization

of the probability density of the dataset. The histogram represents the
distribution of the data, while the KDE smooths out the distribution to
give a more continuous estimate of the underlying probability density
function.

6|Page
4. Box Plots: A box plot (also known as a whisker plot) is a graphical
representation of the distribution of a dataset. It provides a summary of
the central tendency, spread, and shape of the distribution. Box plots are
particularly useful for comparing distributions across different
categories or groups.
a. Key Features:
• Box: It depicts the interquartile range (IQR), the span from the
first quartile (Q1) to the third quartile (Q3), showcasing the middle
50% of the data.
• Whiskers: They stretch from the box to the minimum and
maximum values within a specified range, typically 1.5 times the
interquartile range. Data beyond the whiskers are outliers, often
shown individually.
• Median Line: Inside the box, a line denotes the dataset's median.
• Outliers: Data points beyond the whiskers are outliers and are
usually plotted separately.
b. Constructing Box Plots: Box plots are excellent for visualizing the
distribution of a dataset and identifying outliers. Here's an example:
import numpy as np
import matplotlib.pyplot as plt

# Generate some example data

np.random.seed(42)
data = np.random.randn(1000)

# Create a density plot using Matplotlib

plt.hist(data, bins=30, density=True, alpha=0.7,
color='skyblue', edgecolor='black')

# Add a kernel density estimate (KDE) using Gaussian

kernel
xmin, xmax = plt.xlim()
x = np.linspace(xmin, xmax, 100)
kde = (1 / (data.std() * np.sqrt(2 * np.pi))) * np.exp(-
0.5 * ((x - data.mean()) / data.std())**2)
plt.plot(x, kde, linewidth=2, color='darkred')

# Add labels and title

7|Page
plt.title('Density Plot of the Data')
plt.xlabel('X-axis label')
plt.ylabel('Density')

# Display the plot

plt.show()

Output:

This code generates a box plot with

three categories of random data, each
with a different standard deviation.
The resulting plot provides a visual
representation of the distribution of
each dataset, including key statistics
such as the median, quartiles, and
potential outliers. The
patch_artist=True option adds
colours to the boxes for better
visualization.

Generating Random Data:

np.random.seed(42)
data = [np.random.normal(0, std, 100) for std in
range(1, 4)]
These lines are setting random seed for reproducibility and
generating three sets of random data with increasing standard
deviations (1, 2, and 3).

Creating box plot

plt.boxplot(data, vert=True, patch_artist=True)

boxplot function is used to create a box plot. The data variable

contains the random datasets, vert=True specifies a vertical
orientation, and patch_artist=True fills the boxes with colors.

8|Page

M2 - Problem Set - Introduction To Statistics-2021 - Lagios
No ratings yet
M2 - Problem Set - Introduction To Statistics-2021 - Lagios
15 pages
Creating and Customizing Advanvced Plots
No ratings yet
Creating and Customizing Advanvced Plots
10 pages
L34, 35 Matplotlib
No ratings yet
L34, 35 Matplotlib
4 pages
CHAPTER-2 Data Visualization
No ratings yet
CHAPTER-2 Data Visualization
4 pages
Unit 5
No ratings yet
Unit 5
10 pages
Boxplot, Histogram Codes With Explanations
No ratings yet
Boxplot, Histogram Codes With Explanations
2 pages
Data Visualization Lab3
No ratings yet
Data Visualization Lab3
23 pages
Data Visualization
No ratings yet
Data Visualization
35 pages
Data Visualization Using Matplotlib in Python
No ratings yet
Data Visualization Using Matplotlib in Python
15 pages
Chapter1.3 - Data Visualization
No ratings yet
Chapter1.3 - Data Visualization
27 pages
42 Histograms2
No ratings yet
42 Histograms2
6 pages
Introduction To Matplotlib Using Python For Beginners
No ratings yet
Introduction To Matplotlib Using Python For Beginners
14 pages
ML Week 7
No ratings yet
ML Week 7
12 pages
Data Visualization
No ratings yet
Data Visualization
18 pages
Datascienece
No ratings yet
Datascienece
18 pages
Unit 4 (2) Python
No ratings yet
Unit 4 (2) Python
27 pages
Data Visualization Exp. 3
No ratings yet
Data Visualization Exp. 3
3 pages
Lab 3
No ratings yet
Lab 3
14 pages
Description of Data Visualization Tools
No ratings yet
Description of Data Visualization Tools
15 pages
Data Visualization Using Matplotlib and Seaborn
No ratings yet
Data Visualization Using Matplotlib and Seaborn
28 pages
Data Visualization - 1 by Matplot Lib
No ratings yet
Data Visualization - 1 by Matplot Lib
19 pages
DATA VISUALIZATION - Part 4
No ratings yet
DATA VISUALIZATION - Part 4
12 pages
Data Visualisation PyPlot
No ratings yet
Data Visualisation PyPlot
47 pages
Data Visulation
No ratings yet
Data Visulation
8 pages
Data Visualization Using Matplotlib
No ratings yet
Data Visualization Using Matplotlib
10 pages
Matplotlib in Python
No ratings yet
Matplotlib in Python
43 pages
Data Visualization
No ratings yet
Data Visualization
17 pages
2.5. Introduction To Matplotlib - 2
No ratings yet
2.5. Introduction To Matplotlib - 2
60 pages
42 Histograms
No ratings yet
42 Histograms
5 pages
Machinelearning Prac
No ratings yet
Machinelearning Prac
17 pages
FDS Unit 5 JPR
No ratings yet
FDS Unit 5 JPR
64 pages
Matplotlib Bov
No ratings yet
Matplotlib Bov
12 pages
Mat Plot Lib
No ratings yet
Mat Plot Lib
18 pages
BarPlot and Histogram
No ratings yet
BarPlot and Histogram
28 pages
Unit 1 - Chap 2 - Data Visualisation
No ratings yet
Unit 1 - Chap 2 - Data Visualisation
29 pages
Data Visualization Using Matplotlib
No ratings yet
Data Visualization Using Matplotlib
30 pages
11 PlottingExperimental
No ratings yet
11 PlottingExperimental
40 pages
Matplotlib
No ratings yet
Matplotlib
5 pages
Mat Plot Lib
No ratings yet
Mat Plot Lib
12 pages
Histogram
No ratings yet
Histogram
16 pages
Mat Plot Lib
No ratings yet
Mat Plot Lib
22 pages
Graphs Using Matplotlib
No ratings yet
Graphs Using Matplotlib
23 pages
Mat Plot Lib
No ratings yet
Mat Plot Lib
8 pages
19 Matplotlib
No ratings yet
19 Matplotlib
26 pages
Matplot Lib Practicals
No ratings yet
Matplot Lib Practicals
24 pages
Matplotlib Starter: Import As Import As Import As
No ratings yet
Matplotlib Starter: Import As Import As Import As
24 pages
Data Analysis Graphs
No ratings yet
Data Analysis Graphs
9 pages
Unit 05
No ratings yet
Unit 05
26 pages
Adobe Scan Jan 08, 2025
No ratings yet
Adobe Scan Jan 08, 2025
1 page
Matplotlib
No ratings yet
Matplotlib
13 pages
UNIT - 3 Matplotlib
No ratings yet
UNIT - 3 Matplotlib
10 pages
Data Visualization
No ratings yet
Data Visualization
48 pages
Pyplot
No ratings yet
Pyplot
14 pages
Lab 10
No ratings yet
Lab 10
16 pages
Content From Jose Portilla's Udemy Course Learning Python For Data Analysis and Visualization Notes by Michael Brothers, Available On
No ratings yet
Content From Jose Portilla's Udemy Course Learning Python For Data Analysis and Visualization Notes by Michael Brothers, Available On
13 pages
Class 1 Data Visualization in Python Using Matplotlib
No ratings yet
Class 1 Data Visualization in Python Using Matplotlib
13 pages
Data Visualization
No ratings yet
Data Visualization
26 pages
Data Visualisation
No ratings yet
Data Visualisation
5 pages
Data Visualization - Matplotlib PDF
100% (1)
Data Visualization - Matplotlib PDF
15 pages
To Matplotlib: Anas Irtaza Ashmal
No ratings yet
To Matplotlib: Anas Irtaza Ashmal
15 pages
Ams 310 HW 1
No ratings yet
Ams 310 HW 1
9 pages
Histogram Steps V V Imp
No ratings yet
Histogram Steps V V Imp
15 pages
Chapter - 5 - Correlation and Regression
100% (1)
Chapter - 5 - Correlation and Regression
70 pages
Basic Statistical Functions in Excel
No ratings yet
Basic Statistical Functions in Excel
16 pages
Introduction To Biostatistics
No ratings yet
Introduction To Biostatistics
53 pages
Cbsnews 20250327 Flying
No ratings yet
Cbsnews 20250327 Flying
8 pages
Quantitative Techniques: Rajveer Singh Bhatia Rimjhim Khandelwal
No ratings yet
Quantitative Techniques: Rajveer Singh Bhatia Rimjhim Khandelwal
18 pages
Sampling Distribution of The Sample Mean
No ratings yet
Sampling Distribution of The Sample Mean
34 pages
UPDATED - Correlation Worksheet
No ratings yet
UPDATED - Correlation Worksheet
5 pages
A Basic Overview of Statistical Tests That Are Used Commonly
No ratings yet
A Basic Overview of Statistical Tests That Are Used Commonly
25 pages
Correlation: Self Instructional Study Material Programme: M.A. Development Studies
No ratings yet
Correlation: Self Instructional Study Material Programme: M.A. Development Studies
21 pages
Lesson 9 Using Macros For Analytics
No ratings yet
Lesson 9 Using Macros For Analytics
96 pages
GMS 2014 Module 2
No ratings yet
GMS 2014 Module 2
113 pages
STAT Act1
No ratings yet
STAT Act1
16 pages
Chapter13-Using IBM SPSS Statistic
No ratings yet
Chapter13-Using IBM SPSS Statistic
17 pages
Frequency Table: Data: GROUP 2-Mary Help of Christians
No ratings yet
Frequency Table: Data: GROUP 2-Mary Help of Christians
2 pages
Quiz 1: Formulas
No ratings yet
Quiz 1: Formulas
7 pages
Statistics Lesson 7 Sampling
No ratings yet
Statistics Lesson 7 Sampling
16 pages
LU 3 Descriptive Statistics in SPSS
No ratings yet
LU 3 Descriptive Statistics in SPSS
60 pages
Chapter 7 HW
No ratings yet
Chapter 7 HW
20 pages
PETA 1 Statistics and Probability
No ratings yet
PETA 1 Statistics and Probability
14 pages
Mba 103 PDF
No ratings yet
Mba 103 PDF
2 pages
Measures of Dispersion Kurtosis and Skewness
No ratings yet
Measures of Dispersion Kurtosis and Skewness
19 pages
Output SPSS
No ratings yet
Output SPSS
13 pages
Mean, Median, and Mode Review (Article) - Khan Academy
No ratings yet
Mean, Median, and Mode Review (Article) - Khan Academy
6 pages
4th Periodical Examination in Mathematics 10
No ratings yet
4th Periodical Examination in Mathematics 10
8 pages
3 Ways To Calculate A Pearson's Correlation Coefficient in Excel
No ratings yet
3 Ways To Calculate A Pearson's Correlation Coefficient in Excel
4 pages
Tutorial 5
No ratings yet
Tutorial 5
2 pages
Spearman Rank Correlation:, Y, - . - , Y) R (Y) R (X)
No ratings yet
Spearman Rank Correlation:, Y, - . - , Y) R (Y) R (X)
6 pages