Cheat Sheet: Python For Data Science

This document is a cheat sheet for using the Python pandas library in data science. It covers importing and exporting data, viewing DataFrame contents, selecting and slicing data, sorting, grouping, descriptive statistics, plotting, and creating test data. Pandas brings labeled data structures, similar to R data frames, to Python, along with easy-to-use tools for data analysis and manipulation.

Uploaded by

Shishir Ray

PYTHON FOR DATA SCIENCE CHEAT SHEET

Python Pandas

What is Pandas?

It is a library that provides easy-to-use data structures and data analysis tools for the Python programming language.

Import Convention

import pandas as pd - Import pandas

Pandas Data Structure

• Series:
s = pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd'])
• Data Frame:
data_mobile = {'Mobile': ['iPhone', 'Samsung', 'Redmi'], 'Color': ['Red', 'White', 'Black'], 'Price': ['High', 'Medium', 'Low']}
df = pd.DataFrame(data_mobile, columns=['Mobile', 'Color', 'Price'])

Importing Data

• pd.read_csv(filename)
• pd.read_table(filename)
• pd.read_excel(filename)
• pd.read_sql(query, connection_object)
• pd.read_json(json_string)

Exporting Data

• df.to_csv(filename)
• df.to_excel(filename)
• df.to_sql(table_name, connection_object)
• df.to_json(filename)

Create Test/Fake Data

• pd.DataFrame(np.random.rand(4,3)) - 3 columns and 4 rows of random floats (requires import numpy as np)
• pd.Series(new_series) - Creates a series from an iterable new_series

Operations

View DataFrame Contents:
• df.head(n) - Look at the first n rows of the DataFrame.
• df.tail(n) - Look at the last n rows of the DataFrame.
• df.shape - Gives the number of rows and columns (an attribute, so no parentheses).
• df.info() - Information on index, datatypes and memory usage.
• df.describe() - Summary statistics for numerical columns.

Selection:
• iloc
  • df.iloc[0] - Select first row of data frame
  • df.iloc[1] - Select second row of data frame
  • df.iloc[-1] - Select last row of data frame
  • df.iloc[:,0] - Select first column of data frame
  • df.iloc[:,1] - Select second column of data frame
• loc
  • df.loc[0, 'column1'] - Select a single value by row label and column label
  • df.loc['row1':'row3', 'column1':'column3'] - Select a slice by labels (both endpoints are included)

Sort:
• df.sort_index() - Sorts by labels along an axis
• df.sort_values(by='Column label') - Sorts by the values along an axis
• df.sort_values(column1) - Sorts values by column1 in ascending order
• df.sort_values(column2, ascending=False) - Sorts values by column2 in descending order

Operations - Group By

Arithmetic Operations:
• df.groupby(column) - Returns a groupby object for values from one column
• df.groupby([column1,column2]) - Returns a groupby object for values from multiple columns
• df.groupby(column1)[column2].mean() - Returns the mean of the values in column2, grouped by the values in column1
• df.groupby(column1)[column2].median() - Returns the median of the values in column2, grouped by the values in column1

Functions

Mean:
• df.mean() - mean of each column
Median:
• df.median() - median of each column
Standard Deviation:
• df.std() - standard deviation of each column
Max:
• df.max() - highest value in each column
Min:
• df.min() - lowest value in each column
Count:
• df.count() - number of non-null values in each DataFrame column
Describe:
• df.describe() - Summary statistics for numerical columns

Plotting

• Histogram: df.plot.hist()
• Scatter Plot: df.plot.scatter(x='column1', y='column2')
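
The selection, sorting and grouping entries can be tried together on a small DataFrame. The following is a minimal sketch, not part of the original sheet: it reuses the Mobile/Color column names from the Data Frame example, but the extra row and the numeric Price values are invented for illustration so that the aggregations have something to compute.

```python
import pandas as pd

# Hypothetical sample data: the numeric 'Price' values and the extra
# 'Samsung' row are assumptions made for this illustration only.
df = pd.DataFrame(
    {
        "Mobile": ["iPhone", "Samsung", "Redmi", "Samsung"],
        "Color": ["Red", "White", "Black", "Black"],
        "Price": [900, 700, 200, 500],
    }
)

# shape is an attribute, not a method: (number of rows, number of columns)
n_rows, n_cols = df.shape

# Position-based selection with iloc
first_row = df.iloc[0]      # first row, returned as a Series
first_col = df.iloc[:, 0]   # first column, returned as a Series

# Label-based selection with loc: row label 0, column label 'Price'
first_price = df.loc[0, "Price"]

# loc slices include both endpoints, for rows and for columns
subset = df.loc[0:1, "Mobile":"Color"]

# Sort by Price, highest first
by_price = df.sort_values("Price", ascending=False)

# Group by Mobile, then aggregate the Price column
mean_price = df.groupby("Mobile")["Price"].mean()
median_price = df.groupby("Mobile")["Price"].median()

print(mean_price)
```

With the default integer index, row label 0 and row position 0 happen to coincide, which is why df.loc[0, ...] and df.iloc[0] pick the same row here. On a DataFrame with string or reordered labels they would generally differ.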
FURTHERMORE:
Python for Data Science Certification Training Course
