0% found this document useful (0 votes)

596 views1 page

Pandas Basics Cheat Sheet Python For Data Science: Retrieving Series/Dataframe Information

This document provides a summary of key Pandas functions for working with DataFrames and Series. It covers reading and writing data to common file types like CSV and Excel. It also discusses selecting and filtering DataFrames, applying functions, descriptive statistics, and alignment of indexes during arithmetic operations. The Pandas library is built on NumPy and provides easy-to-use data structures and analysis tools for Python.

Uploaded by

locuto

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

596 views1 page

Pandas Basics Cheat Sheet Python For Data Science: Retrieving Series/Dataframe Information

Uploaded by

locuto

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

> I/O > Retrieving Series/DataFrame Information

Python For Data Science Read and Write to CSV Basic Information

Pandas Basics Cheat Sheet >>> pd.read_csv(‘file.csv’, header=None, nrows=5)

>>> df.to_csv('myDataFrame.csv')
>>>
>>>
>>>
df.shape #(rows,columns)

df.index #Describe index

df.columns #Describe DataFrame columns

>>> df.info() #Info on DataFrame

Learn Pandas Basics online at www.DataCamp.com Read and Write to Excel >>> df.count() #Number of non-NA values

>>> pd.read_excel(‘file.xlsx’)

>>> df.to_excel('dir/myDataFrame.xlsx', sheet_name='Sheet1')

Summary
Read multiple sheets from the same file df.sum() #Sum of values

Pandas
>>>
>>> df.cumsum() #Cummulative sum of values

>>> xlsx = pd.ExcelFile(‘file.xls’)

>>> df.min()/df.max() #Minimum/maximum values

>>> df = pd.read_excel(xlsx, 'Sheet1')

>>> df.idxmin()/df.idxmax() #Minimum/Maximum index value

>>> df.describe() #Summary statistics

The Pandas library is built on NumPy and provides easy-to-use data

structures and data analysis tools for the Python programming language. Read and Write to SQL Query or Database Table >>>
>>>
df.mean() #Mean of values

df.median() #Median of values

Use the following import convention: >>> from sqlalchemy import create_engine

>>> engine = create_engine('sqlite:///:memory:')

>>> import pandas as pd >>>

>>>
pd.read_sql("SELECT * FROM my_table;", engine)

pd.read_sql_table('my_table', engine)
> Applying Functions
>>> pd.read_sql_query("SELECT * FROM my_table;", engine)
read_sql() is a convenience wrapper around read_sql_table() and read_sql_query() >>> f = lambda x: x*2

> Pandas Data Structures >>> df.to_sql('myDf', engine) >>> df.apply(f) #Apply function

>>> df.applymap(f) #Apply function element-wise

Series
> Selection Also see NumPy Arrays
> Data Alignment
A one-dimensional labeled array
a 3
capable of holding any data type b -5 Getting Internal Data Alignment
Index
c 7 >>> s['b'] #Get one element

NA values are introduced in the indices that don’t overlap:

d 4 -5

>>> s = pd.Series([3, -5, 7, 4], index=['a', 'b', 'c', 'd']) >>> df[1:] #Get subset of a DataFrame
>>> s3 = pd.Series([7, -2, 3], index=['a', 'c', 'd'])

Country Capital Population

>>> s + s3

1 India New Delhi 1303171035

a 10.0

Dataframe 2 Brazil Brasília 207847528 b NaN

c 5.0

Selecting, Boolean Indexing & Setting

d 7.0
A two-dimensional labeled data structure

with columns of potentially different types

By Position Arithmetic Operations with Fill Methods
Columns Country Capital Population
>>> df.iloc[[0],[0]] #Select single value by row & column

0 Belgium Brussels 11190846 'Belgium'

You can also do the internal data alignment yourself with the help of the fill methods:
Index 1 India New Delhi 1303171035 >>> df.iat([0],[0])
>>> s.add(s3, fill_values=0)

'Belgium' a 10.0

2 Brazil Brasilia 207847528

b -5.0

By Label
>>> data = {'Country': ['Belgium', 'India', 'Brazil'],
c 5.0

'Capital': ['Brussels', 'New Delhi', 'Brasília'],

>>> df.loc[[0], ['Country']] #Select single value by row & column labels
d 7.0

'Population': [11190846, 1303171035, 207847528]}

'Belgium'
>>> s.sub(s3, fill_value=2)

>>> df = pd.DataFrame(data,
>>> df.at([0], ['Country'])
>>> s.div(s3, fill_value=4)

columns=['Country', 'Capital', 'Population']) 'Belgium' >>> s.mul(s3, fill_value=3)

By Label/Position

> Dropping
>>> df.ix[2] #Select single row of subset of rows

Country Brazil

Capital Brasília

Population 207847528

>>> s.drop(['a', 'c']) #Drop values from rows (axis=0)

>>> df.ix[:,'Capital'] #Select a single column of subset of columns

>>> df.drop('Country', axis=1) #Drop values from columns(axis=1) 0 Brussels

1 New Delhi

2 Brasília

>>> df.ix[1,'Capital'] #Select rows and columns

> Asking For Help 'New Delhi'

Boolean Indexing
>>> help(pd.Series.loc) >>> s[~(s > 1)] #Series s where value is not >1

>>> s[(s < -1) | (s > 2)] #s where value is <-1 or >2

>>> df[df['Population']>1200000000] #Use filter to adjust DataFrame

> Sort & Rank Setting

>>> s['a'] = 6 #Set index a of Series s to 6

>>> df.sort_index() #Sort by labels along an axis

Learn Data Skills Online at
>>> df.sort_values(by='Country') #Sort by the values along an axis

>>> df.rank() #Assign ranks to entries

www.DataCamp.com

Python Unit 3 4
No ratings yet
Python Unit 3 4
92 pages
Data Handling Using Pandas-1
No ratings yet
Data Handling Using Pandas-1
60 pages
Pandas Complete Notes
No ratings yet
Pandas Complete Notes
105 pages
Python Pandas Tutorial For Beginners
No ratings yet
Python Pandas Tutorial For Beginners
203 pages
Pandas Notes
No ratings yet
Pandas Notes
44 pages
DAP 3 Module
No ratings yet
DAP 3 Module
62 pages
Lecture 5
No ratings yet
Lecture 5
36 pages
P Unit-4 NP
No ratings yet
P Unit-4 NP
30 pages
Pandas Tutorial
No ratings yet
Pandas Tutorial
7 pages
Pandas
No ratings yet
Pandas
26 pages
Pandas
No ratings yet
Pandas
21 pages
Cheat Python
No ratings yet
Cheat Python
8 pages
Unit 3
No ratings yet
Unit 3
10 pages
Python For Data Science 1662157639
No ratings yet
Python For Data Science 1662157639
6 pages
Pandas Cheat Sheet........
No ratings yet
Pandas Cheat Sheet........
11 pages
Pandas
No ratings yet
Pandas
13 pages
Ip Study
No ratings yet
Ip Study
18 pages
Python Cheatsy
No ratings yet
Python Cheatsy
1 page
Cheat Sheet
No ratings yet
Cheat Sheet
10 pages
Pandas Cheet Sheet
No ratings yet
Pandas Cheet Sheet
1 page
The Pandas Library
No ratings yet
The Pandas Library
39 pages
PandasGUIA PYTHON-04
No ratings yet
PandasGUIA PYTHON-04
1 page
Pandas DataFrame Notes
No ratings yet
Pandas DataFrame Notes
10 pages
Pandas - Cheat - Sheet (1) - 240511 - 113437
No ratings yet
Pandas - Cheat - Sheet (1) - 240511 - 113437
1 page
Pandas Cheat Sheet
No ratings yet
Pandas Cheat Sheet
1 page
Unit 2
No ratings yet
Unit 2
81 pages
Lab-3 Pandas Library
No ratings yet
Lab-3 Pandas Library
14 pages
Pandas Data Structures: Sections
No ratings yet
Pandas Data Structures: Sections
13 pages
DevOps Session 3 Pandas
No ratings yet
DevOps Session 3 Pandas
33 pages
Pandas Handbook
No ratings yet
Pandas Handbook
33 pages
Cheat Sheet: The Pandas Dataframe Object: Column Index (DF - Columns)
No ratings yet
Cheat Sheet: The Pandas Dataframe Object: Column Index (DF - Columns)
6 pages
Unit 4
No ratings yet
Unit 4
36 pages
12 Pandas
No ratings yet
12 Pandas
9 pages
Cheat Sheet - Pandas
No ratings yet
Cheat Sheet - Pandas
12 pages
Pandas DataFrame Notes
100% (1)
Pandas DataFrame Notes
10 pages
Introduction To Pandas For Data Analysis
No ratings yet
Introduction To Pandas For Data Analysis
6 pages
Python Pandas New Sylabus
No ratings yet
Python Pandas New Sylabus
53 pages
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
100% (1)
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
10 pages
Pandas DataFrame Notes
100% (1)
Pandas DataFrame Notes
6 pages
Pandas
No ratings yet
Pandas
4 pages
Pandaspythonfordatascience
No ratings yet
Pandaspythonfordatascience
1 page
Pandas Dataframe Export The CSV File
No ratings yet
Pandas Dataframe Export The CSV File
9 pages
WEBINTEL GUIDED LAB ACTIVITY Introduction To Pandas
No ratings yet
WEBINTEL GUIDED LAB ACTIVITY Introduction To Pandas
1 page
Pandas DataFrameObject
No ratings yet
Pandas DataFrameObject
4 pages
Pandas
No ratings yet
Pandas
5 pages
Pandas DataFrame Notes
No ratings yet
Pandas DataFrame Notes
13 pages
Cheat Sheet: The Pandas Dataframe Object I: Preliminaries Get Your Data Into A Dataframe
No ratings yet
Cheat Sheet: The Pandas Dataframe Object I: Preliminaries Get Your Data Into A Dataframe
12 pages
Pandas
No ratings yet
Pandas
8 pages
Javascript
No ratings yet
Javascript
158 pages
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
100% (1)
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
12 pages
Etap 16 Demo Install Guide en
No ratings yet
Etap 16 Demo Install Guide en
4 pages
Oracle SQL Tuning: For Day-to-Day Data Warehouse Support
No ratings yet
Oracle SQL Tuning: For Day-to-Day Data Warehouse Support
68 pages
Pandas Python For Data Science
100% (1)
Pandas Python For Data Science
1 page
Pandas Cheat Sheet
100% (1)
Pandas Cheat Sheet
2 pages
10000002623ec Connection To 3rd Party Autom Sys
No ratings yet
10000002623ec Connection To 3rd Party Autom Sys
16 pages
Pandas Python For Data Science
No ratings yet
Pandas Python For Data Science
1 page
Financial Statements Guide
No ratings yet
Financial Statements Guide
78 pages
MySQL - Learn Data Analytics Together's Group
No ratings yet
MySQL - Learn Data Analytics Together's Group
96 pages
Introduction To Python Part 3
No ratings yet
Introduction To Python Part 3
2 pages
Salesforce Single Sign On
No ratings yet
Salesforce Single Sign On
22 pages
Gujarat Technological University
No ratings yet
Gujarat Technological University
1 page
Seaborn Cheat Sheet Python For Data Science: 3 Plotting With Seaborn 3 Plotting With Seaborn
No ratings yet
Seaborn Cheat Sheet Python For Data Science: 3 Plotting With Seaborn 3 Plotting With Seaborn
1 page
Screenshot 2025-03-12 at 1.07.27 AM
No ratings yet
Screenshot 2025-03-12 at 1.07.27 AM
69 pages
Control Engineering August 2020
No ratings yet
Control Engineering August 2020
70 pages
Log
No ratings yet
Log
100 pages
USB Meter Reader: B Etjenings Vejledning
No ratings yet
USB Meter Reader: B Etjenings Vejledning
72 pages
Checklist - For - Software Verification Plan
No ratings yet
Checklist - For - Software Verification Plan
7 pages
EasyScopeX Install Guide
No ratings yet
EasyScopeX Install Guide
12 pages
Discover The Secret To Legendary IT Service
No ratings yet
Discover The Secret To Legendary IT Service
24 pages
Case Study Boeing by Ridwan
No ratings yet
Case Study Boeing by Ridwan
4 pages
Moisture Analyzers Guide
No ratings yet
Moisture Analyzers Guide
16 pages
1.what Is Opactch in Oracle?
No ratings yet
1.what Is Opactch in Oracle?
5 pages
Bokeh Cheat Sheet Python For Data Science: 3 Renderers & Visual Customizations
0% (1)
Bokeh Cheat Sheet Python For Data Science: 3 Renderers & Visual Customizations
1 page
Oracle Database 19c Auto-Indexing
No ratings yet
Oracle Database 19c Auto-Indexing
15 pages
Jupyter Cheat Sheet Python For Data Science: Working With Different Programming Languages Widgets
No ratings yet
Jupyter Cheat Sheet Python For Data Science: Working With Different Programming Languages Widgets
1 page
GIS Demystified Skyler
No ratings yet
GIS Demystified Skyler
6 pages
Database Management Systems
No ratings yet
Database Management Systems
41 pages
Adaptive Server Enterprise: Performance and Tuning Series: Monitoring Tables
No ratings yet
Adaptive Server Enterprise: Performance and Tuning Series: Monitoring Tables
66 pages
AWR Warehouse: An Introduction
No ratings yet
AWR Warehouse: An Introduction
38 pages
Proximity Beacon Campaigns On: Your App
No ratings yet
Proximity Beacon Campaigns On: Your App
37 pages
European Food Research and Technology A - Zeitschrift Für Lebensmittel-Untersuchung Und - Forschung A - Incl
No ratings yet
European Food Research and Technology A - Zeitschrift Für Lebensmittel-Untersuchung Und - Forschung A - Incl
11 pages
Frontend
No ratings yet
Frontend
1 page
3a Memo Vs Letter
No ratings yet
3a Memo Vs Letter
2 pages
Matplotlib Cheat Sheet Python For Data Science: Plotting Cutomize Plot Plotting Routines
No ratings yet
Matplotlib Cheat Sheet Python For Data Science: Plotting Cutomize Plot Plotting Routines
1 page
BES103 PythonLab4
No ratings yet
BES103 PythonLab4
4 pages
Network Security Lab
No ratings yet
Network Security Lab
3 pages
3 - Automated Classification of Neonatal Amplitude-Integrated EEG Based On Gradient Boosting Method
No ratings yet
3 - Automated Classification of Neonatal Amplitude-Integrated EEG Based On Gradient Boosting Method
8 pages
Using List Views React Native
No ratings yet
Using List Views React Native
3 pages
Anviz C2Slim QuickGuide EN 8.28.2018
No ratings yet
Anviz C2Slim QuickGuide EN 8.28.2018
2 pages
Python For Data Science: Advanced Indexing Data Wrangling in Pandas Cheat Sheet Combining Data
No ratings yet
Python For Data Science: Advanced Indexing Data Wrangling in Pandas Cheat Sheet Combining Data
1 page
Importing Data Cheat Sheet Python For Data Science: Pickled Files Exploring Your Data
No ratings yet
Importing Data Cheat Sheet Python For Data Science: Pickled Files Exploring Your Data
1 page
The Pathologies of Big Data Summary
No ratings yet
The Pathologies of Big Data Summary
2 pages
MiFi 2372 Datasheet
No ratings yet
MiFi 2372 Datasheet
2 pages
Evaluation of Some SMTP Testing, Email Verification, Header Analysis, SSL Checkers, Email Delivery, Email Forwarding and WordPress Email Tools
From Everand
Evaluation of Some SMTP Testing, Email Verification, Header Analysis, SSL Checkers, Email Delivery, Email Forwarding and WordPress Email Tools
Hidaia Mahmood Alassouli
No ratings yet
JDK Tutorials - Herong's Tutorial Examples
From Everand
JDK Tutorials - Herong's Tutorial Examples
Herong Yang
No ratings yet
Mastering C++ Network Automation
From Everand
Mastering C++ Network Automation
Justin Barbara
No ratings yet
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
From Everand
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
Marcus Richards
No ratings yet
Mastering C++ Network Automation: Run Automation across Configuration Management, Container Orchestration, Kubernetes, and Cloud Networking
From Everand
Mastering C++ Network Automation: Run Automation across Configuration Management, Container Orchestration, Kubernetes, and Cloud Networking
Justin Barbara
No ratings yet
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
From Everand
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
Manish Soni
No ratings yet
Inspiring Powershell Articles
From Everand
Inspiring Powershell Articles
Murat Yildirimoglu
No ratings yet

Pandas Basics Cheat Sheet Python For Data Science: Retrieving Series/Dataframe Information

Uploaded by

Pandas Basics Cheat Sheet Python For Data Science: Retrieving Series/Dataframe Information

Uploaded by

> I/O > Retrieving Series/DataFrame Information

Pandas Basics Cheat Sheet >>> pd.read_csv(‘file.csv’, header=None, nrows=5)

df.index #Describe index

df.columns #Describe DataFrame columns

>>> df.info() #Info on DataFrame

>>> df.to_excel('dir/myDataFrame.xlsx', sheet_name='Sheet1')

>>> xlsx = pd.ExcelFile(‘file.xls’)

>>> df.min()/df.max() #Minimum/maximum values

>>> df = pd.read_excel(xlsx, 'Sheet1')

>>> df.describe() #Summary statistics

The Pandas library is built on NumPy and provides easy-to-use data

df.median() #Median of values

>>> engine = create_engine('sqlite:///:memory:')

>>> import pandas as pd >>>

>>> df.applymap(f) #Apply function element-wise

NA values are introduced in the indices that don’t overlap:

Country Capital Population

1 India New Delhi 1303171035

Dataframe 2 Brazil Brasília 207847528 b NaN

Selecting, Boolean Indexing & Setting

with columns of potentially different types

0 Belgium Brussels 11190846 'Belgium'

2 Brazil Brasilia 207847528

'Capital': ['Brussels', 'New Delhi', 'Brasília'],

'Population': [11190846, 1303171035, 207847528]}

columns=['Country', 'Capital', 'Population']) 'Belgium' >>> s.mul(s3, fill_value=3)

>>> s.drop(['a', 'c']) #Drop values from rows (axis=0)

>>> df.drop('Country', axis=1) #Drop values from columns(axis=1) 0 Brussels

>>> df.ix[1,'Capital'] #Select rows and columns

> Asking For Help 'New Delhi'

>>> df[df['Population']>1200000000] #Use filter to adjust DataFrame

> Sort & Rank Setting

>>> s['a'] = 6 #Set index a of Series s to 6

>>> df.sort_index() #Sort by labels along an axis

>>> df.rank() #Assign ranks to entries

You might also like