0% found this document useful (0 votes)

48 views

Area Plots, Histogram and Bar Plots in Python

The document discusses various data visualization techniques including area plots, histograms, and bar plots using pandas and matplotlib. It provides examples of creating stacked and unstacked area plots to visualize immigration trends over time for different countries. It also demonstrates how to generate histograms to view the frequency distribution of immigration numbers from different countries and years. Modifications like changing bin sizes, transparency, colors are discussed to improve histogram plots.

Uploaded by

Amudha priya

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

48 views

Area Plots, Histogram and Bar Plots in Python

Uploaded by

Amudha priya

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Area

Plots, Histograms, and Bar Plots

Importing required libraries
%matplotlib inline
import matplotlib.pyplot as plt
import pandas as pd
import numpy as np

Loading data
Note: All steps that are performed below are explain in detail in Tutorial

df = pd.read_excel(
'https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/IBMDeveloperSkillsNetwork-DV0101EN-SkillsNetw
sheet_name='Canada by Citizenship',
skiprows=range(20),
skipfooter=2)

print('Data read into a pandas dataframe!')

Data read into a pandas dataframe!

# in pandas axis=0 represents rows (default) and axis=1 represents columns.

df.drop(['AREA','REG','DEV','Type','Coverage'], axis=1, inplace=True)
df.rename(columns={'OdName':'Country', 'AreaName':'Continent', 'RegName':'Region'}, inplace=True)
df['Total'] = df.sum(axis=1)
df.set_index('Country', inplace=True)

C:\Users\Meer Moazzam\AppData\Local\Temp\ipykernel_8116\3820691460.py:4: FutureWarning: Dropping of nuisance co

lumns in DataFrame reductions (with 'numeric_only=None') is deprecated; in a future version this will raise Typ
eError. Select only valid columns before calling the reduction.
df['Total'] = df.sum(axis=1)

df.head()

Continent Region DevName 1980 1981 1982 1983 1984 1985 1986 ... 2005 2006 2007 2008 2009 2010 2011 2012 2013

Country

Southern Developing
Afghanistan Asia 16 39 39 47 71 340 496 ... 3436 3009 2652 2111 1746 1758 2203 2635 2004
Asia regions

Southern Developed
Albania Europe 1 0 0 0 0 0 1 ... 1223 856 702 560 716 561 539 620
Europe regions

Northern Developing
Algeria Africa 80 67 71 69 63 44 69 ... 3626 4807 3623 4005 5393 4752 4325 3774 4331
Africa regions

American Developing
Oceania Polynesia 0 1 0 0 0 0 0 ... 0 1 0 0 0 0 0 0
Samoa regions

Southern Developed
Andorra Europe 0 0 0 0 0 0 2 ... 0 1 1 0 0 0 0 1
Europe regions

5 rows × 38 columns

Area Plots
In the last module, we created a line plot that visualized the top 5 countries that contribued the most immigrants to Canada from 1980 to
2013. With a little modification to the code, we can visualize this plot as a cumulative plot, also knows as a Stacked Line Plot or Area
plot.

df.sort_values(['Total'], ascending=False, axis=0, inplace=True)

# get the top 5 entries

df_top5 = df.head()
years=list(range(1980,2014))
# transpose the dataframe
df_top5 = df_top5[years].transpose()

df_top5.head()

Country India China United Kingdom of Great Britain and Northern Ireland Philippines Pakistan

1980 8880 5123 22045 6051 978

1981 8670 6682 24796 5921 972

1982 8147 3308 20620 5249 1201

1983 7338 1863 10015 4562 900

1984 5704 1527 10170 3801 668

Area plots are stacked by default. And to produce a stacked area plot, each column must be either all positive or all negative values (any
NaN , i.e. not a number, values will default to 0). To produce an unstacked plot, set parameter stacked to value False .

# let's change the index values of df_top5 to type integer for plotting
df_top5.index = df_top5.index.map(int)
df_top5.plot(kind='area',
stacked=False,
figsize=(20, 10)) # pass a tuple (x, y) size

plt.title('Immigration Trend of Top 5 Countries')

plt.ylabel('Number of Immigrants')
plt.xlabel('Years')

plt.show()

The unstacked plot has a default transparency (alpha value) at 0.5. We can modify this value by passing in the alpha parameter.

df_top5.plot(kind='area',
alpha=0.25, # 0 - 1, default value alpha = 0.5
stacked=False,
figsize=(20, 10))

plt.title('Immigration Trend of Top 5 Countries')

plt.ylabel('Number of Immigrants')
plt.xlabel('Years')

plt.show()
Question: Use the scripting layer to create a stacked area plot of the 5 countries that contributed the least to immigration to Canada
from 1980 to 2013. Use a transparency value of 0.45.

### type your answer here

Click here for a sample python solution

Histograms
A histogram is a way of representing the frequency distribution of numeric dataset. The way it works is it partitions the x-axis into bins,
assigns each data point in our dataset to a bin, and then counts the number of data points that have been assigned to each bin. So the y-
axis is the frequency or the number of data points in each bin. Note that we can change the bin size and usually one needs to tweak it so
that the distribution is displayed nicely.

Question: What is the frequency distribution of the number (population) of new immigrants from the various countries to Canada in
2013?

Before we proceed with creating the histogram plot, let's first examine the data split into intervals. To do this, we will us Numpy's
histrogram method to get the bin ranges and frequency counts as follows:

# let's quickly view the 2013 data

df[2013].head()

Country
India 33087
China 34129
United Kingdom of Great Britain and Northern Ireland 5827
Philippines 29544
Pakistan 12603
Name: 2013, dtype: int64

# np.histogram returns 2 values

count, bin_edges = np.histogram(df[2013])

print(count) # frequency count

print(bin_edges) # bin ranges, default = 10 bins

[178 11 1 2 0 0 0 0 1 2]
[ 0. 3412.9 6825.8 10238.7 13651.6 17064.5 20477.4 23890.3 27303.2
30716.1 34129. ]

We can easily graph this distribution by passing kind=hist to plot() .

df[2013].plot(kind='hist', figsize=(8, 5))

# add a title to the histogram

plt.title('Histogram of Immigration from 195 Countries in 2013')
# add y-label
plt.ylabel('Number of Countries')
# add x-label
plt.xlabel('Number of Immigrants')
plt.show()

In the above plot, the x-axis represents the population range of immigrants in intervals of 3412.9. The y-axis represents the number of
countries that contributed to the aforementioned population.

Notice that the x-axis labels do not match with the bin size. This can be fixed by passing in a xticks keyword that contains the list of
the bin sizes, as follows:

# 'bin_edges' is a list of bin intervals

count, bin_edges = np.histogram(df[2013])

df[2013].plot(kind='hist', figsize=(8, 5), xticks=bin_edges)

plt.title('Histogram of Immigration from 195 countries in 2013') # add a title to the histogram
plt.ylabel('Number of Countries') # add y-label
plt.xlabel('Number of Immigrants') # add x-label

plt.show()

We can also plot multiple histograms on the same plot. For example, let's try to answer the following questions using a histogram.

Question: What is the immigration distribution for Pakistan, China, and India for years 1980 - 2013?

# let's quickly view the dataset

df.loc[['Pakistan', 'China', 'India'], years]

1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 ... 2004 2005 2006 2007 2008 2009 2010 2011 2012

Country

Pakistan 978 972 1201 900 668 514 691 1072 1334 2261 ... 13399 14314 13127 10124 8994 7217 6811 7468 11227

China 5123 6682 3308 1863 1527 1816 1960 2643 2758 4323 ... 36619 42584 33518 27642 30037 29622 30391 28502 33024

India 8880 8670 8147 7338 5704 4211 7150 10189 11522 10343 ... 28235 36210 33848 28742 28261 29456 34235 27509 30933

3 rows × 34 columns

df.loc[['Pakistan', 'China', 'India'], years].plot.hist()

<AxesSubplot:ylabel='Frequency'>
That does not look right!

Don't worry, you'll often come across situations like this when creating plots. The solution often lies in how the underlying dataset is
structured.

Instead of plotting the population frequency distribution of the population for the 3 countries, pandas instead plotted the population
frequency distribution for the years .

This can be easily fixed by first transposing the dataset, and then plotting as shown below.

# transpose dataframe

df_new=df.loc[['Pakistan', 'China', 'India'], years].transpose()

# generate histogram
df_new.plot(kind='hist', figsize=(10, 6))

plt.title('Histogram of Immigration from Pakistan, China, and India from 1980 - 2013')
plt.ylabel('Number of Years')
plt.xlabel('Number of Immigrants')

plt.show()

Let's make a few modifications to improve the impact and aesthetics of the previous plot:
increase the bin size to 15 by passing in bins parameter;
set transparency to 60% by passing in alpha parameter;
label the x-axis by passing in x-label parameter;
change the colors of the plots by passing in color parameter.

# let's get the x-tick values

count, bin_edges = np.histogram(df_new, 15)

# un-stacked histogram
df_new.plot(kind ='hist',
figsize=(10, 6),
bins=15,
alpha=0.6,
xticks=bin_edges,
color=['coral', 'darkslateblue', 'mediumseagreen']
)

plt.title('Histogram of Immigration from Pakistan, China, and India from 1980 - 2013')
plt.ylabel('Number of Years')
plt.xlabel('Number of Immigrants')

plt.show()

If we do not want the plots to overlap each other, we can stack them using the stacked parameter. Let's also adjust the min and max
x-axis labels to remove the extra gap on the edges of the plot. We can pass a tuple (min,max) using the xlim paramater, as show
below.

count, bin_edges = np.histogram(df_new, 15)

xmin = bin_edges[0] - 10 # first bin value is 31.0, adding buffer of 10 for aesthetic purposes
xmax = bin_edges[-1] + 10 # last bin value is 308.0, adding buffer of 10 for aesthetic purposes

# stacked Histogram
df_new.plot(kind='hist',
figsize=(10, 6),
bins=15,
xticks=bin_edges,
color=['coral', 'darkslateblue', 'mediumseagreen'],
stacked=True,
xlim=(xmin, xmax)
)

plt.title('Histogram of Immigration from Pakistan, China, and India from 1980 - 2013')
plt.ylabel('Number of Years')
plt.xlabel('Number of Immigrants')

plt.show()
Question: Use the scripting layer to display the immigration distribution for Greece, Albania, and Bulgaria for years 1980 - 2013? Use an
overlapping plot with 15 bins and a transparency value of 0.35.

### type your answer here

Click here for a sample python solution

Bar Charts (Dataframe)

A bar plot is a way of representing data where the length of the bars represents the magnitude/size of the feature/variable. Bar graphs
usually represent numerical and categorical variables grouped in intervals.

To create a bar plot, we can pass one of two arguments via kind parameter in plot() :

kind=bar creates a vertical bar plot

kind=barh creates a horizontal bar plot

Vertical bar plot

In vertical bar graphs, the x-axis is used for labelling, and the length of bars on the y-axis corresponds to the magnitude of the variable
being measured. Vertical bar graphs are particularly useful in analyzing time series data. One disadvantage is that they lack space for
text labelling at the foot of each bar.

Let's start off by analyzing the effect of Iceland's Financial Crisis:

The 2008 - 2011 Icelandic Financial Crisis was a major economic and political event in Iceland. Relative to the size of its economy,
Iceland's systemic banking collapse was the largest experienced by any country in economic history. The crisis led to a severe economic
depression in 2008 - 2011 and significant political unrest.

Question: Let's compare the number of Icelandic immigrants (country = 'Iceland') to Canada from year 1980 to 2013.

# step 1: get the data

df_iceland = df.loc['Iceland', years]
df_iceland.head()

1980 17
1981 33
1982 10
1983 9
1984 13
Name: Iceland, dtype: object

# step 2: plot data

df_iceland.plot(kind='bar', figsize=(10, 6))

plt.xlabel('Year') # add to x-label to the plot

plt.ylabel('Number of immigrants') # add y-label to the plot
plt.title('Icelandic immigrants to Canada from 1980 to 2013') # add title to the plot

plt.show()
The bar plot above shows the total number of immigrants broken down by each year. We can clearly see the impact of the financial crisis;
the number of immigrants to Canada started increasing rapidly after 2008.

Let's annotate this on the plot using the annotate method of the scripting layer or the pyplot interface. We will pass in the following
parameters:

s : str, the text of annotation.

xy : Tuple specifying the (x,y) point to annotate (in this case, end point of arrow).
xytext : Tuple specifying the (x,y) point to place the text (in this case, start point of arrow).
xycoords : The coordinate system that xy is given in - 'data' uses the coordinate system of the object being annotated (default).
arrowprops : Takes a dictionary of properties to draw the arrow:
arrowstyle : Specifies the arrow style, '->' is standard arrow.
connectionstyle : Specifies the connection type. arc3 is a straight line.
color : Specifies color of arrow.
lw : Specifies the line width.

df_iceland.plot(kind='bar', figsize=(10, 6), rot=90) # rotate the xticks(labelled points on x-axis) by 90 degrees

plt.xlabel('Year')
plt.ylabel('Number of Immigrants')
plt.title('Icelandic Immigrants to Canada from 1980 to 2013')

# Annotate arrow
plt.annotate('', # s: str. Will leave it blank for no text
xy=(32, 70), # place head of the arrow at point (year 2012 , pop 70)
xytext=(28, 20), # place base of the arrow at point (year 2008 , pop 20)
xycoords='data', # will use the coordinate system of the object being annotated
arrowprops=dict(arrowstyle='->', connectionstyle='arc3', color='blue', lw=2)
)

plt.show()

Let's also annotate a text to go over the arrow. We will pass in the following additional parameters:
Let's also annotate a text to go over the arrow. We will pass in the following additional parameters:

rotation : rotation angle of text in degrees (counter clockwise)

df_iceland.plot(kind='bar', figsize=(10, 6), rot=90)

plt.xlabel('Year')
plt.ylabel('Number of Immigrants')
plt.title('Icelandic Immigrants to Canada from 1980 to 2013')

# Annotate arrow
plt.annotate('', # s: str. will leave it blank for no text
xy=(32, 70), # place head of the arrow at point (year 2012 , pop 70)
xytext=(28, 20), # place base of the arrow at point (year 2008 , pop 20)
xycoords='data', # will use the coordinate system of the object being annotated
arrowprops=dict(arrowstyle='->', connectionstyle='arc3', color='blue', lw=2)
)

# Annotate Text
plt.annotate('2008 - 2011 Financial Crisis', # text to display
xy=(28, 30), # start the text at at point (year 2008 , pop 30)
rotation=72.5, # based on trial and error to match the arrow
va='bottom', # want the text to be vertically 'bottom' aligned
ha='left', # want the text to be horizontally 'left' algned.
)

plt.show()

Horizontal Bar Plot

Sometimes it is more practical to represent the data horizontally, especially if you need more room for labelling the bars. In horizontal bar
graphs, the y-axis is used for labelling, and the length of bars on the x-axis corresponds to the magnitude of the variable being measured.
As you will see, there is more room on the y-axis to label categorical variables.

Question: Using the scripting later and the df_can dataset, create a horizontal bar plot showing the total number of immigrants to
Canada from the top 15 countries, for the period 1980 - 2013. Label each country with the total immigrant count.

### type your answer here

Click here for a sample python solution

Thank you

Author
Moazzam Ali

Rareswans High Quality Dorks Tutorial
No ratings yet
Rareswans High Quality Dorks Tutorial
13 pages
Kintsch-1988-The Role of Knowledge in Discourse Comprehension
No ratings yet
Kintsch-1988-The Role of Knowledge in Discourse Comprehension
20 pages
PAGCOR Technical Standards Version 1 0 PDF
No ratings yet
PAGCOR Technical Standards Version 1 0 PDF
60 pages
CompTIA Networkplus Rapid Review Exam N10 005
100% (2)
CompTIA Networkplus Rapid Review Exam N10 005
362 pages
Data Visualization - New
No ratings yet
Data Visualization - New
5 pages
Modulo 8. Data Visualization With Python
No ratings yet
Modulo 8. Data Visualization With Python
30 pages
Course3 Notes
No ratings yet
Course3 Notes
44 pages
DV0101EN-2-2-1-Area-Plots-Histograms-and-Bar-Charts-py-v2.0: 1 Exploring Datasets With Pandas and Matplotlib
No ratings yet
DV0101EN-2-2-1-Area-Plots-Histograms-and-Bar-Charts-py-v2.0: 1 Exploring Datasets With Pandas and Matplotlib
29 pages
Plotting Directly With Matplotlib: Objectives
No ratings yet
Plotting Directly With Matplotlib: Objectives
28 pages
Data Visualization with Python
No ratings yet
Data Visualization with Python
42 pages
Using Python For Data Analysis - July 2018 - Slides
No ratings yet
Using Python For Data Analysis - July 2018 - Slides
43 pages
DataVisualizationUsingPython
No ratings yet
DataVisualizationUsingPython
3 pages
Intreoduction To Python Basic Plots With Matplolib
No ratings yet
Intreoduction To Python Basic Plots With Matplolib
37 pages
Unit 4 - Statistical Thinking
No ratings yet
Unit 4 - Statistical Thinking
59 pages
Matplotlib Notes
No ratings yet
Matplotlib Notes
5 pages
Intermediate Python
No ratings yet
Intermediate Python
22 pages
Matplotlib
No ratings yet
Matplotlib
10 pages
WEBINTEL GUIDED LAB ACTIVITY Introduction To Pandas
No ratings yet
WEBINTEL GUIDED LAB ACTIVITY Introduction To Pandas
1 page
Gnuplot Demo
No ratings yet
Gnuplot Demo
5 pages
Line Plot (1) : Datacamp Courses-Jhu-Genomics-Demo
No ratings yet
Line Plot (1) : Datacamp Courses-Jhu-Genomics-Demo
22 pages
DVPD_LABfile[1]
No ratings yet
DVPD_LABfile[1]
41 pages
Advanced_Plot_Types_with_Matplotlib
No ratings yet
Advanced_Plot_Types_with_Matplotlib
8 pages
Histrogram: A Histogram Is A Graph Showing Frequency Distributions
No ratings yet
Histrogram: A Histogram Is A Graph Showing Frequency Distributions
10 pages
Matplotlib (2)
No ratings yet
Matplotlib (2)
5 pages
Data Visualization Using Matplotlib
No ratings yet
Data Visualization Using Matplotlib
10 pages
DATA VISUALIZATION - Part 4
No ratings yet
DATA VISUALIZATION - Part 4
12 pages
Python Fundamental - 2
No ratings yet
Python Fundamental - 2
49 pages
Data Visual Iz
No ratings yet
Data Visual Iz
54 pages
intro-to-pandas-world-happiness
No ratings yet
intro-to-pandas-world-happiness
20 pages
19_Matplotlib
No ratings yet
19_Matplotlib
26 pages
BDA File
No ratings yet
BDA File
26 pages
Chapter1.3 - Data Visualization
No ratings yet
Chapter1.3 - Data Visualization
27 pages
CHAPTER-2 Data Visualization
No ratings yet
CHAPTER-2 Data Visualization
4 pages
Wa0029.
No ratings yet
Wa0029.
16 pages
XII IP CH 3 Plotting With Pyplot
No ratings yet
XII IP CH 3 Plotting With Pyplot
52 pages
graphs using matplotlib
No ratings yet
graphs using matplotlib
23 pages
Intermediate Python
No ratings yet
Intermediate Python
22 pages
lecture4
No ratings yet
lecture4
60 pages
pandas (1)
No ratings yet
pandas (1)
25 pages
Python Cheat Sheet For Excel Users
100% (2)
Python Cheat Sheet For Excel Users
5 pages
exp_2_sdk_ok
No ratings yet
exp_2_sdk_ok
18 pages
42.Histograms
No ratings yet
42.Histograms
5 pages
Pierian Data - Python For Finance & Algorithmic Trading Course Notes
No ratings yet
Pierian Data - Python For Finance & Algorithmic Trading Course Notes
11 pages
12 IP-Data Visualization (Part-2) - Note
No ratings yet
12 IP-Data Visualization (Part-2) - Note
20 pages
NCERT Plotting Ch - 4 Ex Solutons
No ratings yet
NCERT Plotting Ch - 4 Ex Solutons
6 pages
Cheat Python
No ratings yet
Cheat Python
8 pages
Dictionaries, Part 1: Hugo Bowne-Anderson
No ratings yet
Dictionaries, Part 1: Hugo Bowne-Anderson
60 pages
Python Data Vis
No ratings yet
Python Data Vis
43 pages
DSBDAL - Assignment No 9
No ratings yet
DSBDAL - Assignment No 9
12 pages
Data Visualization
No ratings yet
Data Visualization
17 pages
Data Visualization With Python
No ratings yet
Data Visualization With Python
34 pages
Python 3 Labo
No ratings yet
Python 3 Labo
30 pages
Chapter2 PDF
No ratings yet
Chapter2 PDF
60 pages
Chapter 2 - part 2 - (Histogram)
No ratings yet
Chapter 2 - part 2 - (Histogram)
18 pages
42.Histograms2
No ratings yet
42.Histograms2
6 pages
Unit 1 - Chap 2 - Data Visualisation
No ratings yet
Unit 1 - Chap 2 - Data Visualisation
29 pages
Experiment - 2.3 Krikita
No ratings yet
Experiment - 2.3 Krikita
12 pages
DSBDL Write Ups 8 To 10
No ratings yet
DSBDL Write Ups 8 To 10
7 pages
Unit 5
No ratings yet
Unit 5
10 pages
SESION 12 (Pandas)
No ratings yet
SESION 12 (Pandas)
41 pages
Sl-3 Assignment No.8
No ratings yet
Sl-3 Assignment No.8
21 pages
Data Visualization
No ratings yet
Data Visualization
48 pages
Python Program - X Pyplot-Ii
No ratings yet
Python Program - X Pyplot-Ii
9 pages
Math Workbook - Grade 8
From Everand
Math Workbook - Grade 8
Beverly Nance
5/5 (3)
CGV Lab Ver3
No ratings yet
CGV Lab Ver3
9 pages
1ep18cs037-Janvee Dixit
No ratings yet
1ep18cs037-Janvee Dixit
19 pages
Think Parallel Brochure - Hybridmode
No ratings yet
Think Parallel Brochure - Hybridmode
1 page
Millet Leaf Product Catalogue
No ratings yet
Millet Leaf Product Catalogue
6 pages
Mat202 Linear-Algebra TH 1.10 Ac26 PDF
No ratings yet
Mat202 Linear-Algebra TH 1.10 Ac26 PDF
2 pages
Help Manual - Nakshatra Jyotish
No ratings yet
Help Manual - Nakshatra Jyotish
24 pages
04.fourier Analysis
No ratings yet
04.fourier Analysis
18 pages
NIOS 7.3.0 ReleaseNotes
No ratings yet
NIOS 7.3.0 ReleaseNotes
32 pages
Short Research On Voice Control System Based On Artificial Intelligence Assistant
No ratings yet
Short Research On Voice Control System Based On Artificial Intelligence Assistant
2 pages
Meraki Datasheet Ms125 Series en
No ratings yet
Meraki Datasheet Ms125 Series en
7 pages
Download Complete Virtual Training Basics 2nd Edition Cindy Huggett PDF for All Chapters
100% (2)
Download Complete Virtual Training Basics 2nd Edition Cindy Huggett PDF for All Chapters
50 pages
WP Wan Macsecdep Aug2016
No ratings yet
WP Wan Macsecdep Aug2016
48 pages
Efficient Transformer Survey-dual
No ratings yet
Efficient Transformer Survey-dual
56 pages
Strivers Problem Solving Java
No ratings yet
Strivers Problem Solving Java
11 pages
Notice of Non Consent Liability
No ratings yet
Notice of Non Consent Liability
3 pages
The State of Product Management Annual Report 2023
No ratings yet
The State of Product Management Annual Report 2023
34 pages
Inkscape Tutorial For Beginners: How To Make A Yoga Classes Flyer
No ratings yet
Inkscape Tutorial For Beginners: How To Make A Yoga Classes Flyer
21 pages
Tables and Graphs (Practical Research 2)
50% (2)
Tables and Graphs (Practical Research 2)
44 pages
Parent Project Name - XXX Number & Name
No ratings yet
Parent Project Name - XXX Number & Name
11 pages
Massachusetts - ITS55 - IBM Passport Advantage Software Pricing 05-10-2022 Vrs2
No ratings yet
Massachusetts - ITS55 - IBM Passport Advantage Software Pricing 05-10-2022 Vrs2
4,994 pages
CSCS 323: Course Title Course Code Credits Instructor
No ratings yet
CSCS 323: Course Title Course Code Credits Instructor
34 pages
POSMV_NMEA_formats
No ratings yet
POSMV_NMEA_formats
15 pages
Solidworks-Weldments 01
No ratings yet
Solidworks-Weldments 01
21 pages
Professional Experience: Bochra - Feki@insat - Ucar.tn (+216) 25300602 Bochra Feki Bochra906 Tunisia, Ariana
No ratings yet
Professional Experience: Bochra - Feki@insat - Ucar.tn (+216) 25300602 Bochra Feki Bochra906 Tunisia, Ariana
1 page
Cyber Law Notes (Becom 6TH Sem)
No ratings yet
Cyber Law Notes (Becom 6TH Sem)
34 pages
2001-12 The Computer Paper - Ontario Edition
No ratings yet
2001-12 The Computer Paper - Ontario Edition
104 pages
The Chunnel Case Study
No ratings yet
The Chunnel Case Study
12 pages
Ece Tech Questions
No ratings yet
Ece Tech Questions
4 pages
Practical 1: Aim: Study of Different Types of Cables
No ratings yet
Practical 1: Aim: Study of Different Types of Cables
61 pages

Area Plots, Histogram and Bar Plots in Python

Uploaded by

Area Plots, Histogram and Bar Plots in Python

Uploaded by

Area

Plots, Histograms, and Bar Plots

print('Data read into a pandas dataframe!')

Data read into a pandas dataframe!

# in pandas axis=0 represents rows (default) and axis=1 represents columns.

C:\Users\Meer Moazzam\AppData\Local\Temp\ipykernel_8116\3820691460.py:4: FutureWarning: Dropping of nuisance co

df.sort_values(['Total'], ascending=False, axis=0, inplace=True)

# get the top 5 entries

1980 8880 5123 22045 6051 978

1981 8670 6682 24796 5921 972

1982 8147 3308 20620 5249 1201

1983 7338 1863 10015 4562 900

1984 5704 1527 10170 3801 668

plt.title('Immigration Trend of Top 5 Countries')

plt.title('Immigration Trend of Top 5 Countries')

### type your answer here

Click here for a sample python solution

# let's quickly view the 2013 data

# np.histogram returns 2 values

print(count) # frequency count

We can easily graph this distribution by passing kind=hist to plot() .

df[2013].plot(kind='hist', figsize=(8, 5))

# add a title to the histogram

# 'bin_edges' is a list of bin intervals

df[2013].plot(kind='hist', figsize=(8, 5), xticks=bin_edges)

# let's quickly view the dataset

df.loc[['Pakistan', 'China', 'India'], years].plot.hist()

df_new=df.loc[['Pakistan', 'China', 'India'], years].transpose()

# let's get the x-tick values

count, bin_edges = np.histogram(df_new, 15)

### type your answer here

Click here for a sample python solution

Bar Charts (Dataframe)

kind=bar creates a vertical bar plot

Vertical bar plot

Let's start off by analyzing the effect of Iceland's Financial Crisis:

# step 1: get the data

# step 2: plot data

plt.xlabel('Year') # add to x-label to the plot

s : str, the text of annotation.

rotation : rotation angle of text in degrees (counter clockwise)

df_iceland.plot(kind='bar', figsize=(10, 6), rot=90)

Horizontal Bar Plot

### type your answer here

Click here for a sample python solution

You might also like