0% found this document useful (0 votes)

19 views9 pages

Data Analysis Graphs

This document discusses various types of plots that can be created in Python using the matplotlib and seaborn libraries for data visualization. It covers common plot types like line plots, scatter plots, histograms, bar plots, and also specialized plots like pseudocolor plots, box plots, residual plots, and KDE plots. Code snippets are provided for creating each type of plot.

Uploaded by

sid

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views9 pages

Data Analysis Graphs

Uploaded by

sid

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

4/24/24, 9:43 PM about:blank

Data Visualization commands in Python

Estimated Effort: 20 mins

Visualizations play a key role in data analysis. In this reading, you'll be introduced to various forms of graphs and plots that you can create with your data in Python
that help you in visualising your data for better analysis.

The two major libraries used to create plots are matplotlib and seaborn. We will learn the prominent plotting functions of both these libraries as applicable to Data
Analysis.

Importing libraries
You can import the above mentioned libraries as shown below.

a. matplotlib
1. 1

1. from matplotlib import pyplot as plt

Copied!

Alternatively, the command can also be written as:

1. 1

1. import matplotlib.pyplot as plt

Copied!

Note that most of the plots that are of interest to us in this library are contained in the pyplot subfolder of the package.

matplotlib functions return a plot object which requires additional statements to display. While using matplotlib in Jupyter Notebooks, we require the graph to be
displayed inside the notebook interface itself. It is, therefore, essential to add the following 'magic' statement after loading the library.

1. 1

1. %matplotlib inline

Copied!

b. seaborn
Seaborn is usually imported in a code using the following statement:
1. 1

1. import seaborn as sns

Copied!

matplotlib functions
1. Standard Line Plot

The simplest and most fundamental plot is a standard line plot. The function expects two arrays as input, x and y, both of the same size. x is treated as an independent
variable and y as the dependent one. The graph is plotted as shortest line segments joining the x,y point pairs ordered in terms of the variable x.

Syntax:

1. 1

1. plt.plot(x,y)

Copied!

A sample plot is shown in the image below.

about:blank 1/9
4/24/24, 9:43 PM about:blank

2. Scatter plot

Scatter plots are graphs that present the relationship between two variables in a data set. It represents data points on a two-dimensional plane. The independent
variable or attribute is plotted on the X-axis, while the dependent variable is plotted on the Y-axis.

Scatter plots are used in either of the following situations:

When we have paired numerical data

When there are multiple values of the dependent variable for a unique value of an independent variable
In determining the relationship between variables in some scenarios

Syntax:

1. 1

1. plt.scatter(x,y)

Copied!

Here, x contains the independent variable, and y contains the dependent variable. You have the option to change the size, color, and shape of the markers with
additional attributes in the function.

about:blank 2/9
4/24/24, 9:43 PM about:blank
A sample scatter plot is shared below.

3. Histogram

A histogram is an important visual representation of data in categorical form. To view the data in a "Binned" form, we may use the histogram plot with a number of
bins required or even with the data points that mark the bin edges. The x-axis represents the data bins, and the y-axis represents the number of elements in each of the
bins.

Syntax:

1. 1

1. plt.hist(x,bins)

Copied!

An example of a histogram plot is shown below. Use an additional argument, edgecolor, for better clarity of plot.
Consider the graph shown below. The left graph is the histogram plot for a data set, plotted without setting the edgecolor. The right one is the same graph but has the
edgecolor argument set as the color black.

4. Bar plot

A bar plot is used for visualizing catogorical data. The y-axis represents the average value of data points belonging to a particular category, while the x-axis
represents the number of elements in the different categories.

about:blank 3/9
4/24/24, 9:43 PM about:blank
Syntax:
1. 1

1. plt.bar(x,height)

Copied!

Here, x is the categorical variable, and height is the number of values belonging to the category. You can adjust the width of each bin using an additional width
argument in the function.

A sample graph is shown below.

5. Pseudo Color Plot

A pseudocolor plot displays matrix data as an array of colored cells (known as faces). This plot is created as a flat surface in the x-y plane. The surface is defined by a
grid of x and y coordinates that correspond to the corners (or vertices) of the faces. Matrix C specifies the colors at the vertices. The color of each face depends on the
color of one of its four surrounding vertices. Of the four vertices, the one that comes first in the x-y grid determines the color of the face.

In this course, you use the pcolor plot for visualizing the contents of a pivot table that has been grouped on the basis of 2 parameters. Those parameters then represent
the x and y-axis components that create the grid. The values in the pivot table are the average values of a third parameter. These values act as the code for the color
the cell is going to take.

Syntax:

1. 1

1. plt.pcolor(C)

Copied!

You can define an additional cmap argument to specify the color scheme of the plot.

Two sample pcolor plots are shown below, created for same data but for different color schemes.

about:blank 4/9
4/24/24, 9:43 PM about:blank

seaborn functions
1. Regression plot

A regression plot draws a scatter plot of two variables, x and y, and then fits the regression model and plots the resulting regression line along with a 95% confidence
interval for that regression. The x and y parameters can be shared as the dataframe headers to be used, and the data frame itself is passed to the function as well.

Syntax:

1. 1

1. sns.regplot(x = 'header_1',y = 'header_2',data= df)

Copied!

A sample regression plot is shared below.

about:blank 5/9
4/24/24, 9:43 PM about:blank

2. Box and whisker plot

A box plot (or box-and-whisker plot) shows the distribution of quantitative data in a way that facilitates comparisons between variables or across levels of a
categorical variable. The box shows the quartiles of the dataset while the whiskers extend to show the rest of the distribution, except for points that are determined to
be "outliers".

Consider the Box and whisker plot interpretation figure shown below.

The plot uses whiskers to represent Minimum value to 25% quartile data and 75% quartile to Maximum value data. The range between 25% quartile and 75%
quartile is considered as the Inter-Quartile Range. Outliers are generally classified as being outside 1.5 times the interquartile range.

about:blank 6/9
4/24/24, 9:43 PM about:blank
A sample box plot is shown below

3. Residual Plot

A residual plot is used to display the quality of polynomial regression. This function will regress y on x as a polynomial regression and then draw a scatterplot of the
residuals.
Residuals are the differences between the observed values of the dependent variable and the predicted values obtained from the regression model. In other words, a
residual is a measure of how much a regression line vertically misses a data point, meaning how far off the predictions are from the actual data points.

Syntax:
1. 1

1. sns.residplot(data=df,x='header_1', y='header_2')

Copied!

Alternatively:
1. 1

1. sns.residplot(x=df['header_1'], y=df['header_2'])

Copied!

A sample plot is shown below.

about:blank 7/9
4/24/24, 9:43 PM about:blank

4. KDE plot

A Kernel Density Estimate (KDE) plot is a graph that creates a probability distribution curve for the data based upon its likelihood of occurrence on a specific value.
This is created for a single vector of information. It is used in the course in order to compare the likely curves of the actual data with that of the predicted data.

Syntax:
1. 1

1. sns.kdeplot(X)

Copied!

A sample graph made for a random set of values is shown below.

about:blank 8/9
4/24/24, 9:43 PM about:blank

5. Distribution Plot

This plot has the capacity to combine the histogram and the KDE plots. This plot creates the distribution curve using the bins of the histogram as a reference for
estimation. You can optionally keep or discard the histogram from being displayed. In the context of the course, this plot can be used interchangeably with the KDE
plot.

Syntax:
1. 1

1. sns.distplot(X,hist=False)

Copied!

Here, keeping the argument hist as True would plot the histogram along with the distribution plot. Both variations are shown in the image below.

Conclusion
This concludes the summary of the different types of plots being used in this course for the purpose of visualization.

Author(s)
Abhishek Gagneja

Changelog
Date Version Changed by Change Description
2023-10-05 0.2 Steve Hord QA pass with edits
2023-09-28 0.1 Abhishek Gagneja Initial version created

about:blank 9/9

Data Analysis and Visualisation With Python
No ratings yet
Data Analysis and Visualisation With Python
42 pages
SAFe Agilist
No ratings yet
SAFe Agilist
4 pages
Mivec Fault
No ratings yet
Mivec Fault
1 page
Pandas Cheat Sheet 2
No ratings yet
Pandas Cheat Sheet 2
12 pages
Seaborn
No ratings yet
Seaborn
7 pages
Seaborn 2
No ratings yet
Seaborn 2
49 pages
Matplot Lib Practicals
No ratings yet
Matplot Lib Practicals
24 pages
Seaborn: Key Features
No ratings yet
Seaborn: Key Features
5 pages
Introduction To Matplotlib Using Python For Beginners
No ratings yet
Introduction To Matplotlib Using Python For Beginners
14 pages
Data Visualisation
No ratings yet
Data Visualisation
5 pages
Data Visualization Using Matplotlib in Python
No ratings yet
Data Visualization Using Matplotlib in Python
15 pages
Matplotlib
No ratings yet
Matplotlib
9 pages
Lecture 2.3
No ratings yet
Lecture 2.3
25 pages
Visualization With Help of PANDAS
No ratings yet
Visualization With Help of PANDAS
83 pages
An Introduction To Seaborn
No ratings yet
An Introduction To Seaborn
42 pages
Data Visualization Using Matplotlib and Seaborn
No ratings yet
Data Visualization Using Matplotlib and Seaborn
28 pages
Matplotlib and Seaborn Functions A Quick Overview
No ratings yet
Matplotlib and Seaborn Functions A Quick Overview
2 pages
Matplotlib
No ratings yet
Matplotlib
5 pages
Sl-3 Assignment No.8
No ratings yet
Sl-3 Assignment No.8
21 pages
Data Visualization Using Matplotlib
No ratings yet
Data Visualization Using Matplotlib
30 pages
Unit 05
No ratings yet
Unit 05
26 pages
CHAPTER-2 Data Visualization
No ratings yet
CHAPTER-2 Data Visualization
4 pages
Python Plots
No ratings yet
Python Plots
47 pages
Day 15
No ratings yet
Day 15
20 pages
Class 1 Data Visualization in Python Using Matplotlib
No ratings yet
Class 1 Data Visualization in Python Using Matplotlib
13 pages
Data Visualization Part 2
No ratings yet
Data Visualization Part 2
18 pages
Data Visualization
No ratings yet
Data Visualization
35 pages
Content From Jose Portilla's Udemy Course Learning Python For Data Analysis and Visualization Notes by Michael Brothers, Available On
No ratings yet
Content From Jose Portilla's Udemy Course Learning Python For Data Analysis and Visualization Notes by Michael Brothers, Available On
13 pages
MATPLOTLIB NOTES Pandas
No ratings yet
MATPLOTLIB NOTES Pandas
17 pages
Description of Data Visualization Tools
No ratings yet
Description of Data Visualization Tools
15 pages
Mfds QnA
No ratings yet
Mfds QnA
8 pages
Unit 5 Plotting - Matplotlib in Python
No ratings yet
Unit 5 Plotting - Matplotlib in Python
15 pages
Data Visualization in Python With Libraries
No ratings yet
Data Visualization in Python With Libraries
28 pages
ProgrammingForDS12 Viz
No ratings yet
ProgrammingForDS12 Viz
25 pages
Data Visualization With Matplotlib
No ratings yet
Data Visualization With Matplotlib
20 pages
Visualization
No ratings yet
Visualization
18 pages
19 Matplotlib
No ratings yet
19 Matplotlib
26 pages
Data Visualization With Python
No ratings yet
Data Visualization With Python
36 pages
01 Matplotlib
No ratings yet
01 Matplotlib
2 pages
DEV Lecture Notes Unit II
No ratings yet
DEV Lecture Notes Unit II
57 pages
Datascienece
No ratings yet
Datascienece
18 pages
Mat Plot Lib
No ratings yet
Mat Plot Lib
18 pages
Python Unit 4.notes
No ratings yet
Python Unit 4.notes
50 pages
Data Visualization
No ratings yet
Data Visualization
33 pages
Dev Lecture Notes UNIT-2
No ratings yet
Dev Lecture Notes UNIT-2
57 pages
Visualization Library Documentation
No ratings yet
Visualization Library Documentation
16 pages
Mat Plot Lib
No ratings yet
Mat Plot Lib
22 pages
Data Visulation
No ratings yet
Data Visulation
8 pages
Unit 2
No ratings yet
Unit 2
36 pages
Matplotlib Notes
No ratings yet
Matplotlib Notes
5 pages
Unit 5
No ratings yet
Unit 5
10 pages
Mat Plot Lib
No ratings yet
Mat Plot Lib
12 pages
Lecture Week3
No ratings yet
Lecture Week3
51 pages
A9bf73 - Introduction To Matplotlib
No ratings yet
A9bf73 - Introduction To Matplotlib
18 pages
Python
No ratings yet
Python
29 pages
Matplotlib in Python
No ratings yet
Matplotlib in Python
43 pages
DS - UNIT - IV - QB & Ans
No ratings yet
DS - UNIT - IV - QB & Ans
27 pages
Exp 8
No ratings yet
Exp 8
19 pages
Graphs with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
From Everand
Graphs with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
Peter Kattan
4/5 (2)
Co-Clustering: Models, Algorithms and Applications
From Everand
Co-Clustering: Models, Algorithms and Applications
Gérard Govaert
No ratings yet
Image Histogram: Unveiling Visual Insights, Exploring the Depths of Image Histograms in Computer Vision
From Everand
Image Histogram: Unveiling Visual Insights, Exploring the Depths of Image Histograms in Computer Vision
Fouad Sabry
No ratings yet
Line Drawing Algorithm: Mastering Techniques for Precision Image Rendering
From Everand
Line Drawing Algorithm: Mastering Techniques for Precision Image Rendering
Fouad Sabry
No ratings yet
3 Regression Diagnostics
100% (1)
3 Regression Diagnostics
53 pages
Student Dropout Prediction
No ratings yet
Student Dropout Prediction
11 pages
Speech and Language Processing 3rd Edition Daniel Jurafsky James H Martin Download
100% (1)
Speech and Language Processing 3rd Edition Daniel Jurafsky James H Martin Download
29 pages
Ladd
No ratings yet
Ladd
1 page
Preparing Product Roadmaps - A Pragmatic Guide PDF
No ratings yet
Preparing Product Roadmaps - A Pragmatic Guide PDF
109 pages
DDR3 and LPDDR3 Measurement and Analysis: 6 Series MSO Opt. 6-CMDDR3 and Opt. 6-DBDDR3 Application Datasheet
No ratings yet
DDR3 and LPDDR3 Measurement and Analysis: 6 Series MSO Opt. 6-CMDDR3 and Opt. 6-DBDDR3 Application Datasheet
14 pages
ANSWER SHEET IN Statisctics and Probabilty: Written Work
No ratings yet
ANSWER SHEET IN Statisctics and Probabilty: Written Work
1 page
Nicolet In10 MX-PS51511
No ratings yet
Nicolet In10 MX-PS51511
4 pages
MCA Cloud Storage Report
No ratings yet
MCA Cloud Storage Report
13 pages
Marketing Cell, BTCL.: Bangladesh Telecommunications Company Limited
No ratings yet
Marketing Cell, BTCL.: Bangladesh Telecommunications Company Limited
42 pages
Practice English Literacy Questions With Answer Keys and Discussion
No ratings yet
Practice English Literacy Questions With Answer Keys and Discussion
10 pages
TIOBE Programming Community Index For December 2011
No ratings yet
TIOBE Programming Community Index For December 2011
8 pages
Accomplishment Report Format
No ratings yet
Accomplishment Report Format
6 pages
CURVIC1
No ratings yet
CURVIC1
48 pages
DX Diag
No ratings yet
DX Diag
31 pages
KJRP-86I, A Installation and Owner's Manual
No ratings yet
KJRP-86I, A Installation and Owner's Manual
25 pages
Mustafa Awni CV PDF
No ratings yet
Mustafa Awni CV PDF
1 page
Course Expert: Prof. Arunkumar Khannur, Course Code: 17CS61 Course Name: Cryptography, Network Security and Cyber Law Module: 01 & Part of 02
No ratings yet
Course Expert: Prof. Arunkumar Khannur, Course Code: 17CS61 Course Name: Cryptography, Network Security and Cyber Law Module: 01 & Part of 02
4 pages
Na
No ratings yet
Na
75 pages
Ya1w Series
No ratings yet
Ya1w Series
72 pages
Imp Web Address
No ratings yet
Imp Web Address
7 pages
RD 01 Mus 2
No ratings yet
RD 01 Mus 2
9 pages
5G Fixed Wireless Gigabit Services Today
No ratings yet
5G Fixed Wireless Gigabit Services Today
32 pages
Coding Resources Coding Clinic, Encoders, Automated Coding
No ratings yet
Coding Resources Coding Clinic, Encoders, Automated Coding
11 pages
161 - Course Details B.E.electrical Engineering
No ratings yet
161 - Course Details B.E.electrical Engineering
36 pages
431-342-02 Using Mitutoyo DP-1 VR
No ratings yet
431-342-02 Using Mitutoyo DP-1 VR
2 pages
Designing E-Courses For Hearing Impaired Students
No ratings yet
Designing E-Courses For Hearing Impaired Students
14 pages
In-Place Expansion of 32-Bit Aggregates To 64-Bit PDF
No ratings yet
In-Place Expansion of 32-Bit Aggregates To 64-Bit PDF
25 pages