0% found this document useful (0 votes)

81 views4 pages

Pandas - Jupyter Notebook

Pandas is a Python library built on NumPy for data manipulation and analysis. It contains two main data structures - Series for 1D data and DataFrame for 2D tabular data. DataFrame is the most widely used data structure in Pandas for data analysis and manipulation. It allows storing and manipulating data efficiently by providing functions for aggregation like sum, mean, count etc.

Uploaded by

Maximus Aranha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

81 views4 pages

Pandas - Jupyter Notebook

Uploaded by

Maximus Aranha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Python Libraries - Pandas - Pandas Basics

Pandas is a library built using NumPy specifically for data analysis.you will be using Pandas heavily
for data manipulation,visuilization,building machine learning models,etc.

There are two main data structures in pandas:

• series

• dataframes

The default way to store data in dataframes,and thus manipilating dataframes quickly in probable the most important skill set for datya analysis.

In [1]:

1 pip install pandas

Requirement already satisfied: pandas in c:\users\student\anaconda3\lib\site-packages (1.4.4)

Requirement already satisfied: pytz>=2020.1 in c:\users\student\anaconda3\lib\site-packages (from pandas) (2022.1)
Requirement already satisfied: numpy>=1.18.5 in c:\users\student\anaconda3\lib\site-packages (from pandas) (1.21.5)
Requirement already satisfied: python-dateutil>=2.8.1 in c:\users\student\anaconda3\lib\site-packages (from pandas) (2.8.2)
Requirement already satisfied: six>=1.5 in c:\users\student\anaconda3\lib\site-packages (from python-dateutil>=2.8.1->panda
s) (1.16.0)
Note: you may need to restart the kernel to use updated packages.

In [3]:

1 import pandas as pd

In [4]:

1 # The Pandas series

2 #creating a numeric pandas series
3 s = pd.Series([2,4,5,6,9])
4 print(s)
5 print(type(s))
0 2
1 4
2 5
3 6
4 9
dtype: int64
<class 'pandas.core.series.Series'>

In [5]:

1 #creating a series of type datetime

2 data_series = pd.date_range(start = '11-09-2017', end= '12-12-2017')
3 data_series
4 #type (data_series)
Out[5]:

DatetimeIndex(['2017-11-09', '2017-11-10', '2017-11-11', '2017-11-12',

'2017-11-13', '2017-11-14', '2017-11-15', '2017-11-16',
'2017-11-17', '2017-11-18', '2017-11-19', '2017-11-20',
'2017-11-21', '2017-11-22', '2017-11-23', '2017-11-24',
'2017-11-25', '2017-11-26', '2017-11-27', '2017-11-28',
'2017-11-29', '2017-11-30', '2017-12-01', '2017-12-02',
'2017-12-03', '2017-12-04', '2017-12-05', '2017-12-06',
'2017-12-07', '2017-12-08', '2017-12-09', '2017-12-10',
'2017-12-11', '2017-12-12'],
dtype='datetime64[ns]', freq='D')

The Dataframe
Dataframe is the most widely used data-structure in data analysis.It is a table with rows andcolumns,with rows having index and columns having meaningful
data.

creating dataframes from dictionaries.

EXAMPLE - 1
In [8]:

1 country = ['United States','Australia','India','Russia','Morrocco']

2 symbol = ['US','AU','IND','RUS','MOR']
3 dic_world = {"country":country,"symbol":symbol}

In [9]:

1 print(dic_world)
{'country': ['United States', 'Australia', 'India', 'Russia', 'Morrocco'], 'symbol': ['US', 'AU', 'IND', 'RUS', 'MOR']}

In [10]:

1 dic_world["country"]
2
Out[10]:

['United States', 'Australia', 'India', 'Russia', 'Morrocco']

In [11]:

1 dic_world["symbol"]
Out[11]:

['US', 'AU', 'IND', 'RUS', 'MOR']

In [12]:

1 data = pd.DataFrame(dic_world)

In [13]:

1 print(type(data))
2

In [14]:

1 print(data)
2

country symbol
0 United States US
1 Australia AU
2 India IND
3 Russia RUS
4 Morrocco MOR

In [15]:

1 print(data["country"])

0 United States
1 Australia
2 India
3 Russia
4 Morrocco
Name: country, dtype: object

In [16]:

1 print(data["symbol"])
2
0 US
1 AU
2 IND
3 RUS
4 MOR
Name: symbol, dtype: object

EXAMPLE-2
In [18]:

1 #defining data to create lists for dictionary

2 cars_per_cap = [809,731,588,18,200,70,45]
3 country = ['United states','Australia','Japan','India','Russia','Morroco','Egypt']
4 drives_right = [False,True,True,True,False,False,False]
5
In [19]:

1 #creating the dictionaries to state the entries as key:value pair.

2 cars_dict = {"cars_per_cap":cars_per_cap,"country":country,"drives_right":drives_right}

In [20]:

1 print(cars_dict)

{'cars_per_cap': [809, 731, 588, 18, 200, 70, 45], 'country': ['United states', 'Australia', 'Japan', 'India', 'Russia', 'M
orroco', 'Egypt'], 'drives_right': [False, True, True, True, False, False, False]}

In [21]:

1 print(cars_dict['cars_per_cap'])
[809, 731, 588, 18, 200, 70, 45]

In [22]:

1 cars = pd.DataFrame(cars_dict)

AGGREGATION FUNCTION
In [24]:

1 cars
Out[24]:

cars_per_cap country drives_right

0 809 United states False

1 731 Australia True

2 588 Japan True

3 18 India True

4 200 Russia False

5 70 Morroco False

6 45 Egypt False

In [25]:

1 cars.cars_per_cap

Out[25]:

0 809
1 731
2 588
3 18
4 200
5 70
6 45
Name: cars_per_cap, dtype: int64

In [26]:

1 print(cars.cars_per_cap.max())
809

In [27]:

1 print(cars.cars_per_cap.min())

In [28]:

1 print(cars.cars_per_cap.mean())
351.57142857142856

In [29]:

1 print(cars.cars_per_cap.std())
345.59555222005633

In [30]:

1 print(cars.cars_per_cap.count())

7
In [39]:

1 country = ['United states','Australia','Japan','India','Russia','Morroco','Egypt']

2 cars_per_cap = [809,731,588,18,200,70,45]

In [41]:

1 lst = [['tom','reacher',25],['krish','pete',30],['nick','wilson',26],['julie', 'jonny', 28]]

2 df = pd.DataFrame(lst,columns = ['FName','LName','Age'],dtype = float)
3 df

C:\Users\student\AppData\Local\Temp\ipykernel_9292\3002031254.py:2: FutureWarning: Could not cast to float64, falling back

to object. This behavior is deprecated. In a future version, when a dtype is passed to 'DataFrame', either all columns will
be cast to that dtype, or a TypeError will be raised.
df = pd.DataFrame(lst,columns = ['FName','LName','Age'],dtype = float)

Out[41]:

FName LName Age

0 tom reacher 25.0

1 krish pete 30.0

2 nick wilson 26.0

3 julie jonny 28.0

In [42]:

1 df.Age.max()
Out[42]:

30.0

In [43]:

1 df.Age.min()
Out[43]:

25.0

In [44]:

1 df.Age.mean()
Out[44]:

27.25

In [45]:

1 df.Age.std()

Out[45]:

2.217355782608345

In [46]:

1 df.Age.count()
Out[46]:

In [ ]:

USER REQUIREMENTS TEMPLATE For A Supervisory Control and Data Acquisition (SCADA) Process Control System
100% (2)
USER REQUIREMENTS TEMPLATE For A Supervisory Control and Data Acquisition (SCADA) Process Control System
66 pages
Maths Clinic Gr12 ENG SmartPrep v1.0 1 PDF
50% (8)
Maths Clinic Gr12 ENG SmartPrep v1.0 1 PDF
69 pages
Stepper Motors Catalog
100% (1)
Stepper Motors Catalog
35 pages
Khasdar Krida Mahotsav-2025 Events Information
No ratings yet
Khasdar Krida Mahotsav-2025 Events Information
3 pages
TSA 9 Week Intermediate Program
No ratings yet
TSA 9 Week Intermediate Program
20 pages
Pandas Complete Notes
No ratings yet
Pandas Complete Notes
105 pages
FM 8900S PDF
No ratings yet
FM 8900S PDF
4 pages
Consultant Empanelment Form
No ratings yet
Consultant Empanelment Form
24 pages
ISTQB FL Chap 1
No ratings yet
ISTQB FL Chap 1
10 pages
Class 11 - Introduction To Data Structures
No ratings yet
Class 11 - Introduction To Data Structures
50 pages
Ec 1403 Satellite Communication: Eographical Nformation Ystem (Gis)
No ratings yet
Ec 1403 Satellite Communication: Eographical Nformation Ystem (Gis)
23 pages
Lab Mpls LDP Configuration
No ratings yet
Lab Mpls LDP Configuration
17 pages
IP Practical File - Reference
No ratings yet
IP Practical File - Reference
98 pages
Python Pandas ch-2
No ratings yet
Python Pandas ch-2
56 pages
Pandas
No ratings yet
Pandas
25 pages
Python Cheat Sheet: Pandas - Numpy - Sklearn Matplotlib - Seaborn BS4 - Selenium - Scrapy
100% (4)
Python Cheat Sheet: Pandas - Numpy - Sklearn Matplotlib - Seaborn BS4 - Selenium - Scrapy
11 pages
Aes Unit Ii
No ratings yet
Aes Unit Ii
104 pages
Unit 04 Pandas
No ratings yet
Unit 04 Pandas
46 pages
Nabil 201 Chap 06
No ratings yet
Nabil 201 Chap 06
37 pages
Meta Search Engine Using Distributed Information Retrieval
No ratings yet
Meta Search Engine Using Distributed Information Retrieval
35 pages
Pandas
No ratings yet
Pandas
82 pages
P03 Introduction To Pandas Ans
No ratings yet
P03 Introduction To Pandas Ans
45 pages
WBNR Fslcup Lect5mpc5607b PDF
No ratings yet
WBNR Fslcup Lect5mpc5607b PDF
138 pages
Java Foundation With Data Structures Topic: Installation Guide For JDK and Eclipse
No ratings yet
Java Foundation With Data Structures Topic: Installation Guide For JDK and Eclipse
7 pages
Lecture 3 - Pandas
No ratings yet
Lecture 3 - Pandas
37 pages
Pandas
No ratings yet
Pandas
36 pages
Pandas
No ratings yet
Pandas
63 pages
Unit 2
No ratings yet
Unit 2
81 pages
Pandas DataFrame Notes
100% (1)
Pandas DataFrame Notes
10 pages
Features and Applications of The P82B715 I2C-bus Extender
No ratings yet
Features and Applications of The P82B715 I2C-bus Extender
29 pages
Python Data Processing
No ratings yet
Python Data Processing
36 pages
Pandas
No ratings yet
Pandas
57 pages
Unit 04 Pandas
No ratings yet
Unit 04 Pandas
46 pages
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
100% (1)
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
10 pages
PR 1 Research
No ratings yet
PR 1 Research
57 pages
Create A Data Frame
No ratings yet
Create A Data Frame
25 pages
Pandas
No ratings yet
Pandas
21 pages
Expert Evaluation Form PK
No ratings yet
Expert Evaluation Form PK
2 pages
Pandas
No ratings yet
Pandas
49 pages
Pandas & Numpy
No ratings yet
Pandas & Numpy
32 pages
EPG REST Integration V17
No ratings yet
EPG REST Integration V17
48 pages
Short Notes On Pandas
No ratings yet
Short Notes On Pandas
21 pages
IP Slybuss
No ratings yet
IP Slybuss
21 pages
Pandas Notes
No ratings yet
Pandas Notes
19 pages
Python Programs
No ratings yet
Python Programs
29 pages
Ip Notes
No ratings yet
Ip Notes
20 pages
Pandas in Py: A Detailed Overview Into Series and Dataframe Functions in Pandas
No ratings yet
Pandas in Py: A Detailed Overview Into Series and Dataframe Functions in Pandas
21 pages
Pandas Shan Ver2
No ratings yet
Pandas Shan Ver2
25 pages
18 Pandas
No ratings yet
18 Pandas
33 pages
Final Formatted After Iloc Loc
No ratings yet
Final Formatted After Iloc Loc
34 pages
Class 12 Practical File
No ratings yet
Class 12 Practical File
29 pages
14 Pandas
No ratings yet
14 Pandas
25 pages
DS Assignment-2 (20csu073)
No ratings yet
DS Assignment-2 (20csu073)
21 pages
Jupyter Notebook Viewer1
No ratings yet
Jupyter Notebook Viewer1
17 pages
Sylvesters Theorem
No ratings yet
Sylvesters Theorem
3 pages
Unit 4
No ratings yet
Unit 4
36 pages
BioBlocksLab - A Portable DIY Bio Lab Using BioBlocks Language - ScienceDirect
No ratings yet
BioBlocksLab - A Portable DIY Bio Lab Using BioBlocks Language - ScienceDirect
14 pages
Python Pandas New Sylabus
No ratings yet
Python Pandas New Sylabus
53 pages
Ip Study
No ratings yet
Ip Study
18 pages
ML Unit-2 Notes
No ratings yet
ML Unit-2 Notes
17 pages
Lab-3 Pandas Library
No ratings yet
Lab-3 Pandas Library
14 pages
Data Handling and CSV 2024 - 2025
No ratings yet
Data Handling and CSV 2024 - 2025
12 pages
Introduction To Pandas & Data Structures
No ratings yet
Introduction To Pandas & Data Structures
11 pages
Demystifying Noise Spectre Example
No ratings yet
Demystifying Noise Spectre Example
20 pages
Pandas DataFrame Notes
No ratings yet
Pandas DataFrame Notes
10 pages
Data Science Programming In Python
From Everand
Data Science Programming In Python
Anita Raichand
No ratings yet
Mohit
No ratings yet
Mohit
19 pages
Cheat Python
No ratings yet
Cheat Python
8 pages
2.1 Pandas Objects
No ratings yet
2.1 Pandas Objects
10 pages
Python Cheat Sheet For Excel Users
100% (2)
Python Cheat Sheet For Excel Users
5 pages
Multiple Integral Cartesian Polar
No ratings yet
Multiple Integral Cartesian Polar
19 pages
Introduction To Pandas
No ratings yet
Introduction To Pandas
26 pages
04 Introduction To Python-1
No ratings yet
04 Introduction To Python-1
29 pages
4
No ratings yet
4
6 pages
Chapter 2 Q & A
No ratings yet
Chapter 2 Q & A
2 pages
Class 16 - Doubly Linked List
No ratings yet
Class 16 - Doubly Linked List
15 pages
MentorApp Golden Template AY 2022-23-1 2 Compressed 1687926099903
No ratings yet
MentorApp Golden Template AY 2022-23-1 2 Compressed 1687926099903
28 pages
Pandas Data Structures: Sections
No ratings yet
Pandas Data Structures: Sections
13 pages
CSE (DS) Scheme 2023-2024
No ratings yet
CSE (DS) Scheme 2023-2024
11 pages
HW 1
No ratings yet
HW 1
3 pages
Class 12 Panda Project
No ratings yet
Class 12 Panda Project
13 pages
NZ Business Courses
No ratings yet
NZ Business Courses
77 pages
Ab
No ratings yet
Ab
13 pages
YMXzlCEuTdyGA9xnO3s2 - PRO Athlete Handbook
No ratings yet
YMXzlCEuTdyGA9xnO3s2 - PRO Athlete Handbook
20 pages
Pandas Cheat Sheet
No ratings yet
Pandas Cheat Sheet
1 page
PandasGUIA PYTHON-04
No ratings yet
PandasGUIA PYTHON-04
1 page
EI Lecture 15-3-2011
No ratings yet
EI Lecture 15-3-2011
2 pages
Construction of Transmission Line Catenary From Survey Data
No ratings yet
Construction of Transmission Line Catenary From Survey Data
7 pages
Attendance Below 75%
No ratings yet
Attendance Below 75%
1 page
Pandas - Cheat - Sheet (1) - 240511 - 113437
No ratings yet
Pandas - Cheat - Sheet (1) - 240511 - 113437
1 page
Tacacs Huawei
No ratings yet
Tacacs Huawei
1 page
A
No ratings yet
A
3 pages
Lab 3 (Saha)
No ratings yet
Lab 3 (Saha)
7 pages
Pandas DataFrameObject
No ratings yet
Pandas DataFrameObject
4 pages
Cheat Sheet: The Pandas Dataframe Object I: Preliminaries Get Your Data Into A Dataframe
No ratings yet
Cheat Sheet: The Pandas Dataframe Object I: Preliminaries Get Your Data Into A Dataframe
12 pages
Sec B Ce
No ratings yet
Sec B Ce
2 pages
Gamma Function
No ratings yet
Gamma Function
10 pages
Pandas Basics Cheat Sheet Python For Data Science: Retrieving Series/Dataframe Information
No ratings yet
Pandas Basics Cheat Sheet Python For Data Science: Retrieving Series/Dataframe Information
1 page
CyberBean Computer Notes For Class 5th
No ratings yet
CyberBean Computer Notes For Class 5th
11 pages
Indicative Grade Profile 2022-23 (NTU NUS SMU)
No ratings yet
Indicative Grade Profile 2022-23 (NTU NUS SMU)
4 pages
CSEDs Maximus 51
No ratings yet
CSEDs Maximus 51
2 pages
U-4 Resume, Cover Letter, Job Application Letter
No ratings yet
U-4 Resume, Cover Letter, Job Application Letter
3 pages
Dsa QB 2023-24
No ratings yet
Dsa QB 2023-24
3 pages
WEBINTEL GUIDED LAB ACTIVITY Introduction To Pandas
No ratings yet
WEBINTEL GUIDED LAB ACTIVITY Introduction To Pandas
1 page
Question bank-CNS
No ratings yet
Question bank-CNS
6 pages
How to a Developers Guide to 4k: Developer edition, #3
From Everand
How to a Developers Guide to 4k: Developer edition, #3
Xinc Cyberwizard
No ratings yet
Enhanced Virtual Internship Report Maximus Aranha
No ratings yet
Enhanced Virtual Internship Report Maximus Aranha
4 pages
EC3032 - Power Electronics
No ratings yet
EC3032 - Power Electronics
6 pages
100 Public Reports On Bugcrowd
No ratings yet
100 Public Reports On Bugcrowd
3 pages
Your Charges in Detail - 7400447196: Monthly Rentals
No ratings yet
Your Charges in Detail - 7400447196: Monthly Rentals
5 pages
Exam AZ-800: Administering Windows Server Hybrid Core Infrastructure Preparation
From Everand
Exam AZ-800: Administering Windows Server Hybrid Core Infrastructure Preparation
Georgio Daccache
No ratings yet
Cao Assignment 51.
No ratings yet
Cao Assignment 51.
1 page
Medical Certificate 18
No ratings yet
Medical Certificate 18
1 page
Aga A2 0101 Ap
No ratings yet
Aga A2 0101 Ap
1 page

Pandas - Jupyter Notebook

Uploaded by

Pandas - Jupyter Notebook

Uploaded by

Python Libraries - Pandas - Pandas Basics

There are two main data structures in pandas:

1 pip install pandas

Requirement already satisfied: pandas in c:\users\student\anaconda3\lib\site-packages (1.4.4)

1 # The Pandas series

1 #creating a series of type datetime

DatetimeIndex(['2017-11-09', '2017-11-10', '2017-11-11', '2017-11-12',

creating dataframes from dictionaries.

1 country = ['United States','Australia','India','Russia','Morrocco']

['United States', 'Australia', 'India', 'Russia', 'Morrocco']

['US', 'AU', 'IND', 'RUS', 'MOR']

1 #defining data to create lists for dictionary

1 #creating the dictionaries to state the entries as key:value pair.

cars_per_cap country drives_right

0 809 United states False

1 731 Australia True

2 588 Japan True

4 200 Russia False

1 country = ['United states','Australia','Japan','India','Russia','Morroco','Egypt']

1 lst = [['tom','reacher',25],['krish','pete',30],['nick','wilson',26],['julie', 'jonny', 28]]

C:\Users\student\AppData\Local\Temp\ipykernel_9292\3002031254.py:2: FutureWarning: Could not cast to float64, falling back

FName LName Age

0 tom reacher 25.0

1 krish pete 30.0

2 nick wilson 26.0

3 julie jonny 28.0

You might also like