0% found this document useful (0 votes)

43 views5 pages

Data Visualization - New

This document discusses various data visualization techniques in Python such as line plots, area plots, histograms, and bar charts. It provides code examples for reading data from Excel files and manipulating DataFrames. Methods like .plot(), .hist(), and .annotate() are used to generate the visualizations. Both the scripting layer and artist layer approaches in Matplotlib are covered.

Uploaded by

WHITE YT

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

43 views5 pages

Data Visualization - New

Uploaded by

WHITE YT

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

Data Visualization – Python

Read Data from Excel:

- Import numpy as np
- Import pandas as pd
- From __future__ import print_function #adds compatibility to python 2
- !pip install xlrd
- Print('xlrd installed!')
- Df_can = pd.read_excel('https:.....", sheetname = 'Canada', skiprows = range(20), skip_footer = 2)

- .tolist() – pentru a transforma in lista o serie sau altceva

- .shape() – pentru a vedea marimea dataframe
- .isnull().sum() – pentru a vedea suma valorilor nule
- Df.loc[label] – filter by the labels of the index/column
- Df.iloc[index] – filter by the positions of the index
- Df_can.set_index('Country', inplace = True) – pentru a schimba index-ul
- Df_can.reset_index() – pentru a reseta index-ul
- print(df_can.loc['Japan', [1980, 1981, 1982, 1983, 1984, 1984]]) – loc pentru valori
- print(df_can.iloc[87, [3, 4, 5, 6, 7, 8]]) – iloc pentru index
- df_can.columns = list(map(str, df_can.columns)) - convert the column names into strings
- years = list(map(str, range(1980, 2014))) – lista cu anii de la 1980 pana in 2014
- df_can[(df_can['Continent']=='Asia') & (df_can['Region']=='Southern Asia')] -filtram
- # note: When using 'and' and 'or' operators, pandas requires we use '&' and '|' instead of
'and' and 'or'
- haiti.index = haiti.index.map(int) # let's change the index values of Haiti to type integer for
plotting

What is a line plot and why use it?

A line chart or line plot is a type of plot which displays information as a series of data points
called 'markers' connected by straight line segments. It is a basic type of chart common in
many fields. Use line plot when you have a continuous data set. These are best suited for
trend-based visualizations of data over a period of time.
- haiti.plot(kind='line')
- plt.title('Immigration from Haiti')
- plt.ylabel('Number of immigrants')
- plt.xlabel('Years')
- plt.show()
- # annotate the 2010 Earthquake.
- # syntax: plt.text(x, y, label)
- plt.text(2000, 6000, '2010 Earthquake') # see note below
- Since the x-axis (years) is type 'integer', we specified x as a year. The y axis (number of
immigrants) is type 'integer', so we can just specify the value y = 6000.
- plt.text(2000, 6000, '2010 Earthquake') # years stored as type int
- If the years were stored as type 'string', we would need to specify x as the index position of
the year. Eg 20th index is year 2000 since it is the 20th year with a base year of 1980.
- plt.text(20, 6000, '2010 Earthquake') # years stored as type int
- df_CI = df_CI.transpose() – pentru a modifica coloanele cu randurile.

AREA PLOT
- Also know as area chart or area graph.
- Commonly used to represent cumulated totals using numers or percentages over time.
- Is commonly used when trying to compare two or more quantities.

- Import matplotlib as mpl

- Import matplotlib.pyplot as plt
- Df_top5.plot(kind = 'area')
- Plt.title('Immigration trend of top 5 counteries')
- Plt.ylabel('Number of immigrants')
- Plt.xlabel('Years')
- Plt.show()

- df_top5.index = df_top5.index.map(int) # let's change the index values of df_top5 to type integer
for plotting
- df_top5.plot(kind='area', stacked=False,figsize=(20, 10), # pass a tuple (x, y) size)
- plt.title('Immigration Trend of Top 5 Countries')
- plt.ylabel('Number of Immigrants')
- plt.xlabel('Years')
- plt.show()

HISTOGRAMS
- Import matplotlib as mpl
- Import matplotlib.pyplot as plt
- Df_canada['2013'].plot(kind = 'hist', figsize = (10,6))
- Plt.title('Histogram of immigration from 195 countries in 2013')
- Plt.ylabel('Number of countries')
- Plt.xlabel('Number of immigrants')
- Plt.show()
BINS HISTOGRAMA

- Import matplotlib as mpl

- Import matplotlib.pyplot as plt
- Import numpy as np
- Count, bin_edges = np.histogram(df_canada['2013']) # 10 parti egale
- Df_canada['2013'].plot(kind = 'hist', xticks = bin_edges)
- Plt.title('Histogram of immigration from 195 countries in 2013')

- # np.histogram returns 2 values

- count, bin_edges = np.histogram(df_can['2013'])
- print(count) # frequency count
- print(bin_edges) # bin ranges, default = 10 bins

- # transpose dataframe
- df_t = df_can.loc[['Denmark', 'Norway', 'Sweden'], years].transpose()
- df_t.head()

- increase the bin size to 15 by passing in bins parameter

- set transparency to 60% by passing in alpha paramemter
- label the x-axis by passing in x-label paramater
- change the colors of the plots by passing in color parameter

- # let's get the x-tick values

- count, bin_edges = np.histogram(df_t, 15)
- # un-stacked histogram
- df_t.plot(kind ='hist', figsize=(10, 6),bins=15,alpha=0.6,xticks=bin_edges,color=['coral',
'darkslateblue', 'mediumseagreen'])
- plt.title('Histogram of Immigration from Denmark, Norway, and Sweden from 1980 - 2013')
- plt.ylabel('Number of Years')
- plt.xlabel('Number of Immigrants')
- plt.show()
BAR CHART

- To create a bar plot, we can pass one of two arguments via kind parameter in plot():
- kind=bar creates a vertical bar plot
- kind=barh creates a horizontal bar plot

Let's annotate this on the plot using the annotate method of the scripting layer or the pyplot
interface. We will pass in the following parameters:

- s: str, the text of annotation.

- xy: Tuple specifying the (x,y) point to annotate (in this case, end point of arrow).
- xytext: Tuple specifying the (x,y) point to place the text (in this case, start point of arrow).
- xycoords: The coordinate system that xy is given in - 'data' uses the coordinate system of the
object being annotated (default).
- arrowprops: Takes a dictionary of properties to draw the arrow:
- arrowstyle: Specifies the arrow style, '->' is standard arrow.
- connectionstyle: Specifies the connection type. arc3 is a straight line.
- color: Specifes color of arror.
- lw: Specifies the line width.

- df_iceland.plot(kind='bar', figsize=(10, 6), rot=90) # rotate the xticks(labelled points on x-axis)

by 90 degrees

# Annotate arrow
plt.annotate("",# s: str. Will leave it blank for no text
xy=(32, 70), # place head of the arrow at point (year 2012 , pop 70)
xytext=(28, 20), # place base of the arrow at point (year 2008 , pop 20)
xycoords='data', # will use the coordinate system of the object being annotated
arrowprops=dict(arrowstyle='->', connectionstyle='arc3', color='blue', lw=2)
)
# annotate value labels to each country

- for index, value in enumerate(df_top15):

- label = format(int(value), ',') # format int with commas

# place text at the end of bar (subtracting 47000 from x, and 0.1 from y to make it fit within the
bar)

- plt.annotate(label, xy=(value - 47000, index - 0.10), color='white')

- plt.show()

Unlike a histogram, a bar chart is commonly used to compare the values of a variable at a given point in
time.
# let's examine the types of the column labels

- all(isinstance(column, str) for column in df_can.columns)

So let's change them all to string type.

- df_can.columns = list(map(str, df_can.columns))

# finally, let's create a list of years from 1980 - 2013
# this will come in handy when we start plotting the data

- years = list(map(str, range(1980, 2014)))

*Option 2: Artist layer (Object oriented method) - using an Axes instance from Matplotlib (preferred)
*You can use an Axes instance of your current plot and store it in a variable (eg. ax). You can add more
elements by calling methods with a little change in syntax (by adding "set_" to the previous methods). For
example, use ax.set_title() instead of plt.title() to add title, or ax.set_xlabel() instead of plt.xlabel() to add
label to the x-axis.
This option sometimes is more transparent and flexible to use for advanced plots (in particular when
having multiple plots, as you will see later).
In this course, we will stick to the scripting layer, except for some advanced visualizations where we will
need to use the artist layer to manipulate advanced aspects of the plots.
# option 2: preferred option with more flexibility

- ax = df_top5.plot(kind='area', alpha=0.35, figsize=(20, 10))

- ax.set_title('Immigration Trend of Top 5 Countries')
- ax.set_ylabel('Number of Immigrants')
- ax.set_xlabel('Years')

Tilted Working Plane Command Specifications: FANUC Series 16 FANUC Series 18
100% (1)
Tilted Working Plane Command Specifications: FANUC Series 16 FANUC Series 18
54 pages
ωt + Φ) A = Amplitude Φ = Phase angle Ω = Frequency and t = Time period
No ratings yet
ωt + Φ) A = Amplitude Φ = Phase angle Ω = Frequency and t = Time period
12 pages
Course3 Notes
No ratings yet
Course3 Notes
44 pages
DV0101EN-2-2-1-Area-Plots-Histograms-and-Bar-Charts-py-v2.0: 1 Exploring Datasets With Pandas and Matplotlib
No ratings yet
DV0101EN-2-2-1-Area-Plots-Histograms-and-Bar-Charts-py-v2.0: 1 Exploring Datasets With Pandas and Matplotlib
29 pages
Modulo 8. Data Visualization With Python
No ratings yet
Modulo 8. Data Visualization With Python
30 pages
Plotting Directly With Matplotlib: Objectives
No ratings yet
Plotting Directly With Matplotlib: Objectives
28 pages
Data Visualization with Python
No ratings yet
Data Visualization with Python
42 pages
Area Plots, Histogram and Bar Plots in Python
No ratings yet
Area Plots, Histogram and Bar Plots in Python
9 pages
Session 13, Data Visualization
No ratings yet
Session 13, Data Visualization
13 pages
WEBINTEL GUIDED LAB ACTIVITY Introduction To Pandas
No ratings yet
WEBINTEL GUIDED LAB ACTIVITY Introduction To Pandas
1 page
Basic Line Plot Using Matplotlib
No ratings yet
Basic Line Plot Using Matplotlib
9 pages
Pandas Complete + Visualisation Summary of IBM Visualization
No ratings yet
Pandas Complete + Visualisation Summary of IBM Visualization
21 pages
Summary: Introduction To Data Visualization Tools
No ratings yet
Summary: Introduction To Data Visualization Tools
13 pages
pandas (1)
No ratings yet
pandas (1)
25 pages
DVA Practical
No ratings yet
DVA Practical
19 pages
Data Visualisation Using Pyplot
No ratings yet
Data Visualisation Using Pyplot
20 pages
Cheat Python
No ratings yet
Cheat Python
8 pages
Using Python For Data Analysis - July 2018 - Slides
No ratings yet
Using Python For Data Analysis - July 2018 - Slides
43 pages
DMV Unit-4-1.pdf
No ratings yet
DMV Unit-4-1.pdf
10 pages
Pandas Cheat Sheet
No ratings yet
Pandas Cheat Sheet
2 pages
Pandas
No ratings yet
Pandas
13 pages
Data Visualization With Python
No ratings yet
Data Visualization With Python
34 pages
Python Cheat Sheet For Excel Users
100% (2)
Python Cheat Sheet For Excel Users
5 pages
Pierian Data - Python For Finance & Algorithmic Trading Course Notes
No ratings yet
Pierian Data - Python For Finance & Algorithmic Trading Course Notes
11 pages
Data Visualization Python Tutorial
No ratings yet
Data Visualization Python Tutorial
9 pages
BDA File
No ratings yet
BDA File
26 pages
DataVisualizationUsingPython
No ratings yet
DataVisualizationUsingPython
3 pages
20 June BA Class
No ratings yet
20 June BA Class
17 pages
intro-to-pandas-world-happiness
No ratings yet
intro-to-pandas-world-happiness
20 pages
visualization.rst
No ratings yet
visualization.rst
33 pages
Python-Pandas Notes
No ratings yet
Python-Pandas Notes
5 pages
Lab 10
No ratings yet
Lab 10
16 pages
Exercises Part2
No ratings yet
Exercises Part2
7 pages
matplotlib-cheat-sheet
No ratings yet
matplotlib-cheat-sheet
6 pages
Python Cheatsy
No ratings yet
Python Cheatsy
1 page
Unit 3 CHP 1
No ratings yet
Unit 3 CHP 1
18 pages
LAB RECORD SET 3
No ratings yet
LAB RECORD SET 3
5 pages
LAB4_EDA_desc_analysis
No ratings yet
LAB4_EDA_desc_analysis
26 pages
exp_2_sdk_ok
No ratings yet
exp_2_sdk_ok
18 pages
Bokeh Cheat Sheet Python For Data Science: 3 Renderers & Visual Customizations
No ratings yet
Bokeh Cheat Sheet Python For Data Science: 3 Renderers & Visual Customizations
26 pages
Data Visualization part 2
No ratings yet
Data Visualization part 2
18 pages
Pandaspythonfordatascience
No ratings yet
Pandaspythonfordatascience
1 page
Pandas Python For Data Science
100% (1)
Pandas Python For Data Science
1 page
Pandas Python For Data Science
No ratings yet
Pandas Python For Data Science
1 page
Pandas - Cheat - Sheet
No ratings yet
Pandas - Cheat - Sheet
6 pages
World Happiness Report
No ratings yet
World Happiness Report
7 pages
UNIT-IV - Matplotlib
No ratings yet
UNIT-IV - Matplotlib
10 pages
Expt 2 EDAV
No ratings yet
Expt 2 EDAV
24 pages
Data Visualization
No ratings yet
Data Visualization
48 pages
Add Data Labels
No ratings yet
Add Data Labels
25 pages
ProgrammingForDS12_viz
No ratings yet
ProgrammingForDS12_viz
25 pages
2,3. Introduction Pandas & Matplotlib - Copy
No ratings yet
2,3. Introduction Pandas & Matplotlib - Copy
32 pages
Pandas
No ratings yet
Pandas
36 pages
GRAPHS USING MATPLOTLIB
No ratings yet
GRAPHS USING MATPLOTLIB
9 pages
Line Plot (1) : Datacamp Courses-Jhu-Genomics-Demo
No ratings yet
Line Plot (1) : Datacamp Courses-Jhu-Genomics-Demo
22 pages
Assignment 4 On Visualization On Graph With Solution
No ratings yet
Assignment 4 On Visualization On Graph With Solution
14 pages
Data Visualization
No ratings yet
Data Visualization
24 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
C Language Programming Codes
From Everand
C Language Programming Codes
Durgesh
No ratings yet
Computer Engineering Laboratory Solution Primer
From Everand
Computer Engineering Laboratory Solution Primer
Karan Bhandari
No ratings yet
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
CSS Grid Layout
From Everand
CSS Grid Layout
Abdelfattah Ragab
No ratings yet
Chap 2
100% (1)
Chap 2
13 pages
RD Sharma Maths Class7 Solution Chapter 24
No ratings yet
RD Sharma Maths Class7 Solution Chapter 24
15 pages
Bianconi 2013
No ratings yet
Bianconi 2013
10 pages
Grade 11 Tech - Science (Learner Book) - Eng
No ratings yet
Grade 11 Tech - Science (Learner Book) - Eng
242 pages
O Level (P1) Coordinate Geometery Question'S
No ratings yet
O Level (P1) Coordinate Geometery Question'S
17 pages
3 - Complex Numbers
100% (1)
3 - Complex Numbers
10 pages
CBSE G+09 Coordinate+Geometry EIQ
No ratings yet
CBSE G+09 Coordinate+Geometry EIQ
6 pages
4024 s10 QP 22 PDF
No ratings yet
4024 s10 QP 22 PDF
12 pages
IGCSE Physics Exam Overview - 2023-2025 Syllabus
No ratings yet
IGCSE Physics Exam Overview - 2023-2025 Syllabus
34 pages
Bauer Ultrasonic Koden en
No ratings yet
Bauer Ultrasonic Koden en
1 page
Group Theory: Symmetry Operations
No ratings yet
Group Theory: Symmetry Operations
4 pages
Исатаев С.С - - Mechanics - laboratory practicum in physics-КазНУ (2016) PDF
No ratings yet
Исатаев С.С - - Mechanics - laboratory practicum in physics-КазНУ (2016) PDF
220 pages
Hw1 Cive210 Vectors-Forces Solution
No ratings yet
Hw1 Cive210 Vectors-Forces Solution
17 pages
Math SBG11
No ratings yet
Math SBG11
486 pages
Tutorial Fluent
100% (1)
Tutorial Fluent
39 pages
AutoCAD Coordinate Systems
No ratings yet
AutoCAD Coordinate Systems
2 pages
Python Geospatial Development - Third Edition - Sample Chapter
No ratings yet
Python Geospatial Development - Third Edition - Sample Chapter
32 pages
ISC-2025 Sample Question Paper - 4
No ratings yet
ISC-2025 Sample Question Paper - 4
6 pages
Ship Hydrodynamics Lecture Notes Part 2 Propeller Geometry
No ratings yet
Ship Hydrodynamics Lecture Notes Part 2 Propeller Geometry
10 pages
Math G8 Q2 POST TEST
No ratings yet
Math G8 Q2 POST TEST
3 pages
2. FRQ3 Task Models A - H AP PC Exam Review
No ratings yet
2. FRQ3 Task Models A - H AP PC Exam Review
16 pages
Dokumen - Tips Catia v5 Surfaces Catiacatia Shape Design Acatia v5 Surfaces Your Notes Lesson
No ratings yet
Dokumen - Tips Catia v5 Surfaces Catiacatia Shape Design Acatia v5 Surfaces Your Notes Lesson
54 pages
Solidworks Teacher Guide Lesson9: School'S Name Teacher'S Name Date
100% (1)
Solidworks Teacher Guide Lesson9: School'S Name Teacher'S Name Date
33 pages
Chapter 1 Grade 7 - Coordinates and Design
No ratings yet
Chapter 1 Grade 7 - Coordinates and Design
11 pages
Circles Level 4
No ratings yet
Circles Level 4
2 pages
Winols Diy Guide
100% (6)
Winols Diy Guide
27 pages
Xgo-Mini Protocol v1
No ratings yet
Xgo-Mini Protocol v1
16 pages
Blender Quick Start Guide 11-2016 PDF
No ratings yet
Blender Quick Start Guide 11-2016 PDF
23 pages

Data Visualization - New

Uploaded by

Data Visualization - New

Uploaded by

Data Visualization – Python

Read Data from Excel:

- .tolist() – pentru a transforma in lista o serie sau altceva

What is a line plot and why use it?

- Import matplotlib as mpl

- Import matplotlib as mpl

- # np.histogram returns 2 values

- increase the bin size to 15 by passing in bins parameter

- # let's get the x-tick values

- s: str, the text of annotation.

- df_iceland.plot(kind='bar', figsize=(10, 6), rot=90) # rotate the xticks(labelled points on x-axis)

- for index, value in enumerate(df_top15):

- plt.annotate(label, xy=(value - 47000, index - 0.10), color='white')

- all(isinstance(column, str) for column in df_can.columns)

So let's change them all to string type.

- df_can.columns = list(map(str, df_can.columns))

- years = list(map(str, range(1980, 2014)))

- ax = df_top5.plot(kind='area', alpha=0.35, figsize=(20, 10))

You might also like