Importing Data Python Cheat Sheet PDF

Python provides several methods for importing different data file types into Python for data analysis. These include importing Excel, CSV, text, HDF5, pickled, SAS, Stata, and database files using NumPy, Pandas, h5py, pickle, and SQLAlchemy. Pandas and NumPy are commonly used to import array and table data, while other libraries provide functionality for specific file types. The data can then be explored and queried within Python.

Uploaded by

ruler3382

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

409 views1 page

Importing Data Python Cheat Sheet PDF

Uploaded by

ruler3382

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

Python For Data Science Cheat Sheet Excel Spreadsheets Pickled Files

>>> file = 'urbanpop.xlsx' >>> import pickle

Importing Data >>> data = pd.ExcelFile(file) >>> with open('pickled_fruit.pkl', 'rb') as file:
pickled_data = pickle.load(file)
>>> df_sheet2 = data.parse('1960-1966',
Learn Python for data science Interactively at www.DataCamp.com skiprows=[0],
names=['Country',
'AAM: War(2002)'])
>>> df_sheet1 = data.parse(0, HDF5 Files
parse_cols=[0],
Importing Data in Python skiprows=[0], >>> import h5py
>>> filename = 'H-H1_LOSC_4_v1-815411200-4096.hdf5'
names=['Country'])
Most of the time, you’ll use either NumPy or pandas to import >>> data = h5py.File(filename, 'r')
your data: To access the sheet names, use the sheet_names attribute:
>>> import numpy as np >>> data.sheet_names
>>> import pandas as pd Matlab Files
Help SAS Files >>> import scipy.io
>>> filename = 'workspace.mat'
>>> from sas7bdat import SAS7BDAT >>> mat = scipy.io.loadmat(filename)
>>> np.info(np.ndarray.dtype)
>>> help(pd.read_csv) >>> with SAS7BDAT('urbanpop.sas7bdat') as file:
df_sas = file.to_data_frame()

Text Files Exploring Dictionaries

Stata Files Accessing Elements with Functions
Plain Text Files >>> data = pd.read_stata('urbanpop.dta') >>> print(mat.keys()) Print dictionary keys
>>> filename = 'huck_finn.txt' >>> for key in data.keys(): Print dictionary keys
>>> file = open(filename, mode='r') Open the file for reading print(key)
>>> text = file.read() Read a file’s contents Relational Databases meta
quality
>>> print(file.closed) Check whether file is closed
>>> from sqlalchemy import create_engine strain
>>> file.close() Close file
>>> print(text) >>> engine = create_engine('sqlite://Northwind.sqlite') >>> pickled_data.values() Return dictionary values
>>> print(mat.items()) Returns items in list format of (key, value)
Use the table_names() method to fetch a list of table names: tuple pairs
Using the context manager with
>>> with open('huck_finn.txt', 'r') as file:
>>> table_names = engine.table_names() Accessing Data Items with Keys
print(file.readline()) Read a single line
print(file.readline()) Querying Relational Databases >>> for key in data ['meta'].keys() Explore the HDF5 structure
print(file.readline()) print(key)
>>> con = engine.connect() Description
>>> rs = con.execute("SELECT * FROM Orders") DescriptionURL
Table Data: Flat Files >>> df = pd.DataFrame(rs.fetchall()) Detector
>>> df.columns = rs.keys() Duration
GPSstart
Importing Flat Files with numpy >>> con.close()
Observatory
Files with one data type Using the context manager with Type
UTCstart
>>> filename = ‘mnist.txt’ >>> with engine.connect() as con:
>>> print(data['meta']['Description'].value) Retrieve the value for a key
>>> data = np.loadtxt(filename, rs = con.execute("SELECT OrderID FROM Orders")
delimiter=',', String used to separate values df = pd.DataFrame(rs.fetchmany(size=5))
df.columns = rs.keys()
skiprows=2,
usecols=[0,2],
Skip the first 2 lines
Read the 1st and 3rd column
Navigating Your FileSystem
dtype=str) The type of the resulting array Querying relational databases with pandas
Magic Commands
Files with mixed data types >>> df = pd.read_sql_query("SELECT * FROM Orders", engine)
>>> filename = 'titanic.csv' !ls List directory contents of files and directories
>>> data = np.genfromtxt(filename, %cd .. Change current working directory
%pwd Return the current working directory path
delimiter=',',
names=True, Look for column header
Exploring Your Data
dtype=None)
NumPy Arrays os Library
>>> data_array = np.recfromcsv(filename) >>> data_array.dtype Data type of array elements >>> import os
>>> data_array.shape Array dimensions >>> path = "/usr/tmp"
The default dtype of the np.recfromcsv() function is None. >>> wd = os.getcwd() Store the name of current directory in a string
>>> len(data_array) Length of array
>>> os.listdir(wd) Output contents of the directory in a list
Importing Flat Files with pandas >>> os.chdir(path) Change current working directory
pandas DataFrames >>> os.rename("test1.txt", Rename a file
>>> filename = 'winequality-red.csv' "test2.txt")
>>> data = pd.read_csv(filename, >>> df.head() Return first DataFrame rows
nrows=5, >>> os.remove("test1.txt") Delete an existing file
Number of rows of file to read >>> df.tail() Return last DataFrame rows >>> os.mkdir("newdir") Create a new directory
header=None, Row number to use as col names >>> df.index Describe index
sep='\t', Delimiter to use >>> df.columns Describe DataFrame columns
comment='#', Character to split comments >>> df.info() Info on DataFrame
na_values=[""]) String to recognize as NA/NaN >>> data_array = data.values Convert a DataFrame to an a NumPy array DataCamp
Learn R for Data Science Interactively

Matlab Python Xref
No ratings yet
Matlab Python Xref
17 pages
Gakhov Time Series Forecasting With Python
No ratings yet
Gakhov Time Series Forecasting With Python
66 pages
Pandas Plotting Capabilities
No ratings yet
Pandas Plotting Capabilities
27 pages
A Taste of Python Discrete and Fast Fourier Transforms
No ratings yet
A Taste of Python Discrete and Fast Fourier Transforms
11 pages
The Matplotlib User's Guide
No ratings yet
The Matplotlib User's Guide
868 pages
Python Regex & Socket Guide
No ratings yet
Python Regex & Socket Guide
237 pages
Customer Data Analysis & Feature Engineering
No ratings yet
Customer Data Analysis & Feature Engineering
35 pages
R For MATLAB Users - Mathesaurus
No ratings yet
R For MATLAB Users - Mathesaurus
12 pages
Lesson 5 Python For Loops While Loops
No ratings yet
Lesson 5 Python For Loops While Loops
7 pages
Oop Assignment
No ratings yet
Oop Assignment
16 pages
Openpyxl Documentation: Release 2.0.2
No ratings yet
Openpyxl Documentation: Release 2.0.2
47 pages
R Markdown
No ratings yet
R Markdown
15 pages
INFO2180
No ratings yet
INFO2180
67 pages
Excel Automation with xlwings
No ratings yet
Excel Automation with xlwings
214 pages
Pandas for Data Analysts
100% (1)
Pandas for Data Analysts
64 pages
100 Pandas Exercises
No ratings yet
100 Pandas Exercises
6 pages
Matlab To Numpy PDF
No ratings yet
Matlab To Numpy PDF
14 pages
Midsem Regular MFDS 22-12-2019 Answer Key PDF
No ratings yet
Midsem Regular MFDS 22-12-2019 Answer Key PDF
5 pages
An Overview of Practical Time Series Forecasting Using Pytho
No ratings yet
An Overview of Practical Time Series Forecasting Using Pytho
30 pages
Writing Reproducible Reports: Knitr With R Markdown
No ratings yet
Writing Reproducible Reports: Knitr With R Markdown
24 pages
Scikit Learn Docs
100% (1)
Scikit Learn Docs
2,201 pages
Undergraduate Fundamentals of Machine Learning
No ratings yet
Undergraduate Fundamentals of Machine Learning
163 pages
Curse NG
No ratings yet
Curse NG
464 pages
SciPy Essentials for Data Scientists
No ratings yet
SciPy Essentials for Data Scientists
48 pages
Vmls Python Companion
No ratings yet
Vmls Python Companion
192 pages
ENG 202: Computers and Engineering Object Oriented Programming in PYTHON
No ratings yet
ENG 202: Computers and Engineering Object Oriented Programming in PYTHON
56 pages
Ggplot2 Cheatsheet 2.1
No ratings yet
Ggplot2 Cheatsheet 2.1
2 pages
Cheatsheet Machine Learning Tips and Tricks PDF
No ratings yet
Cheatsheet Machine Learning Tips and Tricks PDF
2 pages
Python Guide for Multivariate Analysis
No ratings yet
Python Guide for Multivariate Analysis
47 pages
Numerical Analysis For Engineer - 1
No ratings yet
Numerical Analysis For Engineer - 1
18 pages
Python NumPy for Beginners
No ratings yet
Python NumPy for Beginners
50 pages
Machine Learning With Python
No ratings yet
Machine Learning With Python
487 pages
Python and Finance DATACAMP Chapter 1
No ratings yet
Python and Finance DATACAMP Chapter 1
30 pages
Python Cheat Sheet: Print Print ("Hello World") Input Input ("What's Your Name")
100% (1)
Python Cheat Sheet: Print Print ("Hello World") Input Input ("What's Your Name")
16 pages
Linear Regression
100% (1)
Linear Regression
51 pages
Programming Essentials
No ratings yet
Programming Essentials
5 pages
Statistics and Machine Learning in Python
No ratings yet
Statistics and Machine Learning in Python
218 pages
Variable Selection
No ratings yet
Variable Selection
15 pages
Statistical Models
No ratings yet
Statistical Models
35 pages
Expected Value Markov Chains
No ratings yet
Expected Value Markov Chains
10 pages
Matlab Python Cheatsheet Formulae PDF
100% (1)
Matlab Python Cheatsheet Formulae PDF
17 pages
Python Interview Questions
No ratings yet
Python Interview Questions
61 pages
A Summer Training Report On "Python Language"
No ratings yet
A Summer Training Report On "Python Language"
20 pages
Python Guide
No ratings yet
Python Guide
162 pages
CSE-Machine Learning & Big Data - WSS Source Book
100% (1)
CSE-Machine Learning & Big Data - WSS Source Book
181 pages
Python Data Importing Guide
No ratings yet
Python Data Importing Guide
1 page
2.1 Importing Python Data
No ratings yet
2.1 Importing Python Data
1 page
Working With Data in Python
No ratings yet
Working With Data in Python
5 pages
Unit 3
No ratings yet
Unit 3
110 pages
Pandas 1
No ratings yet
Pandas 1
64 pages
Python Data Import
100% (1)
Python Data Import
28 pages
Python Libraries for Data Science
No ratings yet
Python Libraries for Data Science
53 pages
Importing Data in Python
No ratings yet
Importing Data in Python
13 pages
Python Data Import/Export with Pandas
No ratings yet
Python Data Import/Export with Pandas
6 pages
Exp - 1 - Introduction To Data Analytics and Python Fundamentals - SDK - Ok
No ratings yet
Exp - 1 - Introduction To Data Analytics and Python Fundamentals - SDK - Ok
9 pages
Unit6 - Working With Data
No ratings yet
Unit6 - Working With Data
29 pages
Data Type in Python
No ratings yet
Data Type in Python
20 pages
Unit - V
No ratings yet
Unit - V
29 pages
Pandas for Data Science Beginners
No ratings yet
Pandas for Data Science Beginners
2 pages
Pandas Documentation PDF
No ratings yet
Pandas Documentation PDF
86 pages
Grad School Finder for MS in US
No ratings yet
Grad School Finder for MS in US
8 pages
Top Universities For Engineering: Yocket Virtual Fair
No ratings yet
Top Universities For Engineering: Yocket Virtual Fair
20 pages
Ch16 ClimateChange
No ratings yet
Ch16 ClimateChange
24 pages
SQL Server Interview Questions and Answers PDF
No ratings yet
SQL Server Interview Questions and Answers PDF
22 pages
Grad School Finder for MS Applicants
No ratings yet
Grad School Finder for MS Applicants
8 pages
Ch15-Future of The Arctic
No ratings yet
Ch15-Future of The Arctic
24 pages
CMPE Quiz 5
No ratings yet
CMPE Quiz 5
2 pages
Competititve Programmer's Handbook
No ratings yet
Competititve Programmer's Handbook
21 pages
Ch15-Future of The Arctic
No ratings yet
Ch15-Future of The Arctic
24 pages
Economics Final Exam Study Guide Final Version
No ratings yet
Economics Final Exam Study Guide Final Version
314 pages
Urban Street Cleanliness Assessment Using Mobile Edge Computing and Deep Learning
No ratings yet
Urban Street Cleanliness Assessment Using Mobile Edge Computing and Deep Learning
13 pages
Networking Practice Problems
No ratings yet
Networking Practice Problems
7 pages
Leak Off Test Guideline
100% (2)
Leak Off Test Guideline
4 pages
Spanish Vocabulary Study Guide
No ratings yet
Spanish Vocabulary Study Guide
1 page
100 Important Verbs
No ratings yet
100 Important Verbs
3 pages
Solaris P2V with ZFS Snapshots Guide
No ratings yet
Solaris P2V with ZFS Snapshots Guide
12 pages
Nebra Helium Miner EMMC SD Upgrade
No ratings yet
Nebra Helium Miner EMMC SD Upgrade
8 pages
Trick 2-WPS Office
No ratings yet
Trick 2-WPS Office
3 pages
Linux System Administration Guide
No ratings yet
Linux System Administration Guide
28 pages
Linux Commands For Devops
No ratings yet
Linux Commands For Devops
21 pages
Linux Disk Encryption Guide
No ratings yet
Linux Disk Encryption Guide
44 pages
Perhitungan Hidrograf Satuan Metode Gama I Bangunan Konservasi:Wadas Putih S-2 Sungai: S. Serayu
No ratings yet
Perhitungan Hidrograf Satuan Metode Gama I Bangunan Konservasi:Wadas Putih S-2 Sungai: S. Serayu
10 pages
Lab 16 Abestros Jharine
No ratings yet
Lab 16 Abestros Jharine
34 pages
Manifest NonUFSFiles Win64
No ratings yet
Manifest NonUFSFiles Win64
116 pages
Understanding Operating Systems Sixth Edition: File Management
No ratings yet
Understanding Operating Systems Sixth Edition: File Management
118 pages
Grading and Returning Papers Using A Moodle Assignment
No ratings yet
Grading and Returning Papers Using A Moodle Assignment
5 pages
WBL7372 Release Note
No ratings yet
WBL7372 Release Note
5 pages
How To Install ETAP 16
75% (4)
How To Install ETAP 16
13 pages
Linux Command cheatsheet-AZ
No ratings yet
Linux Command cheatsheet-AZ
6 pages
Windows Backup Batch Script
No ratings yet
Windows Backup Batch Script
2 pages
UCSER Worksheet 1 File Management
No ratings yet
UCSER Worksheet 1 File Management
2 pages
01 Introduction To Linux System Os 2024
No ratings yet
01 Introduction To Linux System Os 2024
50 pages
A+ Guide To IT Technical Support, 9th Edition: Maintaining Windows
No ratings yet
A+ Guide To IT Technical Support, 9th Edition: Maintaining Windows
76 pages
Lom Log
No ratings yet
Lom Log
255 pages
ZFS NFS Share Configuration
No ratings yet
ZFS NFS Share Configuration
7 pages
COMP6153 Operating System: Practicum Case
No ratings yet
COMP6153 Operating System: Practicum Case
9 pages
Assignment - 2 - File System and Permissions-2023 - Final
No ratings yet
Assignment - 2 - File System and Permissions-2023 - Final
3 pages
#Example .PLG
No ratings yet
#Example .PLG
2 pages
MT6572 Android Scatter
0% (1)
MT6572 Android Scatter
6 pages
Magisk
No ratings yet
Magisk
21 pages
GMDSOFT Catalog Red Compressed
No ratings yet
GMDSOFT Catalog Red Compressed
2 pages
MiniTool Partition Wizard Crackaplpw PDF
No ratings yet
MiniTool Partition Wizard Crackaplpw PDF
3 pages
Lab 2 - Unix Command
No ratings yet
Lab 2 - Unix Command
10 pages
Package Cstats
No ratings yet
Package Cstats
6 pages
Command List-74
No ratings yet
Command List-74
3 pages

Importing Data Python Cheat Sheet PDF

Uploaded by

Importing Data Python Cheat Sheet PDF

Uploaded by

Python For Data Science Cheat Sheet Excel Spreadsheets Pickled Files

>>> file = 'urbanpop.xlsx' >>> import pickle

Text Files Exploring Dictionaries

You might also like