Experiment No 2
OBJECTIVES :
• Learn the basics of NumPy
• Gain an introduction to the DataFrame and Series data structures of the Pandas library
• Access and manipulate data within a DataFrame and Series
• Create visualizations using Matplotlib and explore the distributions of, and relationships
between, variables using Seaborn
THEORY :
• NUMPY
NumPy is a library for the Python programming language that adds support for large,
multi-dimensional arrays and matrices, along with a large collection of high-level
mathematical functions that operate on these arrays.
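As a minimal sketch of the ideas above, the snippet below builds a small two-dimensional array and applies a few of NumPy's array operations to it. The array values and variable names are illustrative assumptions, not part of the original experiment.

```python
import numpy as np

# Illustrative data (an assumption): a 2-D array with 2 rows and 3 columns.
a = np.array([[1, 2, 3],
              [4, 5, 6]])

shape = a.shape              # dimensions of the array: (rows, columns)
doubled = a * 2              # arithmetic is applied element by element
col_sums = a.sum(axis=0)     # sum down each column
row_means = a.mean(axis=1)   # mean across each row
product = a @ a.T            # matrix product of a with its transpose (2 x 2)

print(shape)
print(doubled)
print(col_sums, row_means)
print(product)
```

The `axis` argument selects the dimension that is collapsed: `axis=0` reduces over rows (one result per column), while `axis=1` reduces over columns (one result per row).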
• PANDAS
Pandas is a column-oriented data analysis API. It's a great tool for handling and analyzing
input data, and many ML frameworks support pandas data structures as inputs.
The primary data structures in pandas are implemented as two classes:
• DataFrame, which you can imagine as a relational data table, with rows and named
columns.
• Series, which is a single column. A DataFrame contains one or more Series and a name for
each Series.
The data frame is a commonly used abstraction for data manipulation.
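The relationship between a DataFrame and its Series can be sketched as follows. The column names and values here are hypothetical and chosen only for illustration.

```python
import pandas as pd

# Hypothetical data (an assumption): three rows and three named columns.
df = pd.DataFrame({
    "student": ["A", "B", "C"],
    "marks": [72, 85, 90],
    "hours": [5, 8, 9],
})

# Selecting one column by name returns a Series.
marks = df["marks"]
is_series = isinstance(marks, pd.Series)

# Access a single value by row label and column name.
second_row_marks = df.loc[1, "marks"]

# Keep only the rows that satisfy a condition (boolean indexing).
top = df[df["marks"] > 80]

# Add a new column computed from two existing Series.
df["marks_per_hour"] = df["marks"] / df["hours"]

print(df)
print(top)
```

Because each column is a Series, arithmetic such as `df["marks"] / df["hours"]` is applied row by row without an explicit loop.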
• MATPLOTLIB
Matplotlib is a plotting library for the Python programming language and its numerical
mathematics extension NumPy. It provides an object-oriented API for embedding plots into
applications using general-purpose GUI toolkits like Tkinter, wxPython, Qt, or GTK.
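A minimal sketch of the object-oriented API is given below: a Figure and an Axes object are created explicitly, and the plot is configured through methods on the Axes. The sine curve, the labels, and the choice of the non-interactive "Agg" backend are assumptions made so that the example runs without a display.

```python
import io

import matplotlib
matplotlib.use("Agg")  # non-interactive backend; an assumption so no GUI is needed
import matplotlib.pyplot as plt
import numpy as np

# Illustrative data: 100 points of a sine curve over one period.
x = np.linspace(0, 2 * np.pi, 100)
y = np.sin(x)

# Create the Figure and Axes objects, then draw on the Axes.
fig, ax = plt.subplots()
ax.plot(x, y, label="sin(x)")
ax.set_xlabel("x")
ax.set_ylabel("sin(x)")
ax.set_title("Sine curve")
ax.legend()

# Render the figure as PNG into an in-memory buffer instead of a file.
buf = io.BytesIO()
fig.savefig(buf, format="png")
```

In an interactive session or notebook, the backend line can be dropped and `plt.show()` used to display the figure.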
• SEABORN
Seaborn is a Python data visualization library based on matplotlib. It provides a high-level
interface for drawing attractive and informative statistical graphics.
PROGRAM:
Numpy Tutorial
Pandas Tutorial
Matplotlib Tutorial
Seaborn Tutorial
CONCLUSION : We learnt the basics of NumPy and how to access and manipulate data within
the DataFrame and Series data structures of the pandas library. We also created
visualizations using Matplotlib and explored the distributions of, and relationships between,
variables in the Tips dataset using various Seaborn plots.