A2: Data Visualization using Python
Course Objective
This course introduces students to data visualization in the field of exploratory data
science using Python.
Course Learning Outcomes
On successful completion of the course, students will be able to:
1. Import and export data using files.
2. Perform data handling using NumPy arrays.
3. Load data into Pandas DataFrames and manipulate it.
4. Visualize data using the Matplotlib library.
Syllabus
Unit 1 Introduction: Motivation for using Python for Data Visualization, Essential Python
Libraries: NumPy, Pandas, Matplotlib, Import and Export of Data using files.
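A minimal sketch of the Unit 1 topic "Import and Export of Data using files" is given here for orientation. It assumes pandas is installed and uses the CSV format; the file name, column names and values are made up for illustration and are not taken from any real data set.

```python
import tempfile
from pathlib import Path

import pandas as pd

# Illustrative, made-up values (not real measurements).
rainfall = pd.DataFrame(
    {
        "region": ["R1", "R2", "R3"],
        "rainfall_mm": [120.5, 98.0, 143.25],
    }
)

with tempfile.TemporaryDirectory() as tmp:
    csv_path = Path(tmp) / "rainfall.csv"  # hypothetical file name

    # Export: write the DataFrame to a CSV file without the row index.
    rainfall.to_csv(csv_path, index=False)

    # Import: read the same file back into a new DataFrame.
    loaded = pd.read_csv(csv_path)

print(loaded)
```

In a practical, the temporary file would typically be replaced by a CSV file downloaded from a data portal and read with the same pd.read_csv call.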
Unit 2 Array manipulation using NumPy: NumPy arrays: Creating NumPy arrays, Data Types
for NumPy arrays, Arithmetic with NumPy Arrays, Basic Indexing and Slicing, swapping axes,
transposing arrays.
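The Unit 2 topics can be tried out with a short sketch such as the following. It assumes NumPy is installed; the array contents and variable names are arbitrary and chosen only to keep the shapes easy to follow.

```python
import numpy as np

# Creating arrays: a 2x3 array holding the integers 0..5.
a = np.arange(6).reshape(2, 3)

# Data types: convert the integer array to 32-bit floats.
as_float = a.astype(np.float32)

# Arithmetic is element-wise.
total = a + a      # [[0, 2, 4], [6, 8, 10]]
squares = a * a    # [[0, 1, 4], [9, 16, 25]]

# Basic indexing and slicing.
first_row = a[0]        # [0, 1, 2]
last_two_cols = a[:, 1:]  # [[1, 2], [4, 5]]

# Transposing a 2-D array swaps rows and columns.
transposed = a.T        # shape (3, 2)

# Swapping axes of a 3-D array: axes 0 and 2 are exchanged.
cube = np.arange(24).reshape(2, 3, 4)
swapped = cube.swapaxes(0, 2)  # shape (4, 3, 2)

print(transposed.shape, swapped.shape)
```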
Unit 3 Data Manipulation using Pandas: Data Structures in Pandas: Series, DataFrame, Index
objects, Loading data into Pandas DataFrames, Working with DataFrames, Grouped and aggregate
calculations.
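As one possible illustration of the Unit 3 topics, the sketch below builds a Series and a DataFrame, inspects an Index object, filters rows, and performs a grouped aggregate calculation. It assumes pandas is installed; the names, departments and marks are invented for the example.

```python
import pandas as pd

# A Series is a one-dimensional labelled array.
marks = pd.Series([72, 85, 90], index=["s1", "s2", "s3"], name="marks")

# A DataFrame is a table of columns that share one row index.
# All values below are made up for illustration.
df = pd.DataFrame(
    {
        "department": ["CS", "CS", "Maths", "Maths"],
        "student": ["s1", "s2", "s3", "s4"],
        "marks": [72, 85, 90, 64],
    }
)

# Row and column labels are stored in Index objects.
column_labels = df.columns

# Working with DataFrames: boolean filtering of rows.
high_scorers = df[df["marks"] >= 80]

# Grouped and aggregate calculations: mean and maximum marks per department.
summary = df.groupby("department")["marks"].agg(["mean", "max"])

print(summary)
```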
Unit 4 Plotting and Visualization: Using matplotlib to plot data: figures, subplots, markings,
color and line styles, labels and legends, Plotting functions in Pandas: line, bar, and scatter
plots, histograms, stacked bars, boxplots.
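The Unit 4 topics can be sketched roughly as follows: one figure with two subplots, a line plot styled with colors, line styles, labels and a legend, and a stacked bar chart drawn through the Pandas plotting interface. The sketch assumes matplotlib and pandas are installed; the data values, column names and output file name are made up for illustration.

```python
import tempfile
from pathlib import Path

import matplotlib

matplotlib.use("Agg")  # non-interactive backend, so no display is needed
import matplotlib.pyplot as plt
import pandas as pd

# Made-up values for two hypothetical series over four years.
df = pd.DataFrame(
    {
        "year": [2019, 2020, 2021, 2022],
        "north": [10, 12, 9, 14],
        "south": [7, 11, 13, 12],
    }
)

# One figure containing two subplots side by side.
fig, (ax_line, ax_bar) = plt.subplots(1, 2, figsize=(10, 4))

# Line plot with explicit color, line style, marker and legend label.
ax_line.plot(df["year"], df["north"], color="tab:blue",
             linestyle="--", marker="o", label="north")
ax_line.plot(df["year"], df["south"], color="tab:orange",
             linestyle="-", marker="s", label="south")
ax_line.set_xlabel("Year")
ax_line.set_ylabel("Value")
ax_line.set_title("Line plot")
ax_line.legend()

# Stacked bar chart using the plotting function built into Pandas.
df.plot(x="year", y=["north", "south"], kind="bar", stacked=True, ax=ax_bar)
ax_bar.set_ylabel("Value")
ax_bar.set_title("Stacked bars")

fig.tight_layout()

# Save the figure; the file name is hypothetical.
out_path = Path(tempfile.gettempdir()) / "unit4_sketch.png"
fig.savefig(out_path)
```

In an interactive session the backend line can be dropped and plt.show() used instead of saving to a file.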
References
1. McKinney W. Python for Data Analysis: Data Wrangling with Pandas, NumPy and IPython.
2nd edition. O’Reilly Media, 2018.
2. Chen D. Y. Pandas for Everyone: Python Data Analysis. Pearson, 2018.
Additional References
(i) Molin S. Hands-On Data Analysis with Pandas. Packt Publishing, 2019.
Suggested Practical List
Use a data set of your choice from the Open Data Portal (https://data.gov.in/) for the
following exercises.
1. Practicals based on NumPy.
2. Practicals based on Pandas.
3. Practicals based on Data Visualization using matplotlib.