Python Data Science: Lists & NumPy Basics

The document provides an overview of Python basics like variables, data types, operations and functions. It covers key Python concepts like lists, strings, NumPy arrays and related methods. The document also lists popular Python libraries for data science and machine learning.

Uploaded by teega

Python For Data Science Cheat Sheet
Python Basics
Learn More Python for Data Science Interactively at [Link]

Variables and Data Types

Variable Assignment
>>> x=5
>>> x
5

Calculations With Variables
>>> x+2             Sum of two variables
7
>>> x-2             Subtraction of two variables
3
>>> x*2             Multiplication of two variables
10
>>> x**2            Exponentiation of a variable
25
>>> x%2             Remainder of a variable
1
>>> x/float(2)      Division of a variable
2.5

Types and Type Conversion
str()      '5', '3.45', 'True'     Variables to strings
int()      5, 3, 1                 Variables to integers
float()    5.0, 1.0                Variables to floats
bool()     True, True, True        Variables to booleans

Asking For Help
>>> help(str)

Strings
>>> my_string = 'thisStringIsAwesome'
>>> my_string
'thisStringIsAwesome'

String Operations
>>> my_string * 2
'thisStringIsAwesomethisStringIsAwesome'
>>> my_string + 'Innit'
'thisStringIsAwesomeInnit'
>>> 'm' in my_string
True

String Operations (index starts at 0)
>>> my_string[3]
>>> my_string[4:9]

String Methods
>>> my_string.upper()              String to uppercase
>>> my_string.lower()              String to lowercase
>>> my_string.count('w')           Count String elements
>>> my_string.replace('e', 'i')    Replace String elements
>>> my_string.strip()              Strip whitespaces

Lists (also see NumPy Arrays)
>>> a = 'is'
>>> b = 'nice'
>>> my_list = ['my', 'list', a, b]
>>> my_list2 = [[4,5,6,7], [3,4,5,6]]

Selecting List Elements (index starts at 0)
Subset
>>> my_list[1]         Select item at index 1
>>> my_list[-3]        Select 3rd last item
Slice
>>> my_list[1:3]       Select items at index 1 and 2
>>> my_list[1:]        Select items after index 0
>>> my_list[:3]        Select items before index 3
>>> my_list[:]         Copy my_list
Subset Lists of Lists
>>> my_list2[1][0]     my_list[list][itemOfList]
>>> my_list2[1][:2]

List Operations
>>> my_list + my_list
['my', 'list', 'is', 'nice', 'my', 'list', 'is', 'nice']
>>> my_list * 2
['my', 'list', 'is', 'nice', 'my', 'list', 'is', 'nice']
>>> my_list2 > 4
True                   (Python 2 only; Python 3 raises TypeError)

List Methods
>>> my_list.index(a)         Get the index of an item
>>> my_list.count(a)         Count an item
>>> my_list.append('!')      Append an item at a time
>>> my_list.remove('!')      Remove an item
>>> del(my_list[0:1])        Remove an item
>>> my_list.reverse()        Reverse the list
>>> my_list.extend('!')      Append an item
>>> my_list.pop(-1)          Remove an item
>>> my_list.insert(0,'!')    Insert an item
>>> my_list.sort()           Sort the list

Libraries

Import libraries
>>> import numpy
>>> import numpy as np
Selective import
>>> from math import pi

Library categories: Data analysis, Machine learning, Scientific computing, 2D plotting

Install Python
- Leading open data science platform powered by Python
- Free IDE that is included with Anaconda
- Create and share documents with live code, visualizations, text, ...

Numpy Arrays (also see Lists)
>>> my_list = [1, 2, 3, 4]
>>> my_array = [Link](my_list)
>>> my_2darray = [Link]([[1,2,3],[4,5,6]])

Selecting Numpy Array Elements (index starts at 0)
Subset
>>> my_array[1]          Select item at index 1
2
Slice
>>> my_array[0:2]        Select items at index 0 and 1
array([1, 2])
Subset 2D Numpy arrays
>>> my_2darray[:,0]      my_2darray[rows, columns]
array([1, 4])

Numpy Array Operations
>>> my_array > 3
array([False, False, False, True], dtype=bool)
>>> my_array * 2
array([2, 4, 6, 8])
>>> my_array + [Link]([5, 6, 7, 8])
array([6, 8, 10, 12])

Numpy Array Functions
>>> my_array.shape              Get the dimensions of the array
>>> [Link](other_array)         Append items to an array
>>> [Link](my_array, 1, 5)      Insert items in an array
>>> [Link](my_array,[1])        Delete items in an array
>>> [Link](my_array)            Mean of the array
>>> [Link](my_array)            Median of the array
>>> np.corrcoef(my_array)       Correlation coefficient
>>> [Link](my_array)            Standard deviation

DataCamp
Learn Python for Data Science Interactively
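The list operations in this sheet are easy to mix up, so here is a minimal sketch of how slicing, `append`, `extend` and `*` behave. The list contents mirror the sheet's `my_list`; the other variable names are illustrative only.

```python
# Illustrative list, mirroring the sheet's my_list example.
my_list = ['my', 'list', 'is', 'nice']

# Slicing returns a new list; the stop index is excluded.
middle = my_list[1:3]          # items at index 1 and 2
copy_of_list = my_list[:]      # shallow copy, independent of my_list

# append adds its argument as one single item.
copy_of_list.append('!!')

# extend iterates over its argument, so a string is added
# one character at a time.
extended = my_list[:]
extended.extend('!!')

# * repeats the list; it does not multiply the items.
doubled = my_list * 2

print(middle)
print(copy_of_list)
print(extended)
print(len(doubled))
```

Because `my_list[:]` is a copy, the original list is unchanged by the `append` and `extend` calls above.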
Python For Data Science Cheat Sheet
Jupyter Notebook

Saving/Loading Notebooks
- Create new notebook
- Open an existing notebook
- Make a copy of the current notebook
- Rename notebook
- Save current notebook and record checkpoint
- Revert notebook to a previous checkpoint
- Preview of the printed notebook
- Download notebook as:
  - IPython notebook
  - Python
  - HTML
  - Markdown
  - reST
  - LaTeX
  - PDF
- Close notebook & stop running any scripts

Writing Code And Text

Code and text are encapsulated by 3 basic cell types: markdown cells, code cells, and raw NBConvert cells.

Edit Cells
- Cut currently selected cells to clipboard
- Copy cells from clipboard to current cursor position
- Paste cells from clipboard above current cell
- Paste cells from clipboard below current cell
- Paste cells from clipboard on top of current cell
- Delete current cells
- Revert “Delete Cells” invocation
- Split up a cell from current cursor position
- Merge current cell with the one above
- Merge current cell with the one below
- Move current cell up
- Move current cell down
- Adjust metadata underlying the current notebook
- Find and replace in selected cells
- Remove cell attachments
- Copy attachments of current cell
- Paste attachments of current cell
- Insert image in selected cells

Insert Cells
- Add new cell above the current one
- Add new cell below the current one

Executing Cells
- Run selected cell(s)
- Run current cells down and create a new one below
- Run current cells down and create a new one above
- Run all cells
- Run all cells above the current cell
- Run all cells below the current cell
- Change the cell type of current cell
- toggle, toggle scrolling and clear current outputs
- toggle, toggle scrolling and clear all output

View Cells
- Toggle display of Jupyter logo and filename
- Toggle display of toolbar
- Toggle display of cell action icons:
  - None
  - Edit metadata
  - Raw cell format
  - Slideshow
  - Attachments
  - Tags
- Toggle line numbers in cells

Working with Different Programming Languages

Kernels provide computation and communication with front-end interfaces like the notebooks. There are three main kernels: IPython, IRkernel and IJulia.
Installing Jupyter Notebook will automatically install the IPython kernel.
- Interrupt kernel
- Restart kernel
- Interrupt kernel & clear all output
- Restart kernel & run all cells
- Connect back to a remote notebook
- Run other installed kernels

Widgets

Notebook widgets provide the ability to visualize and control changes in your data, often as a control like a slider, textbox, etc.
You can use them to build interactive GUIs for your notebooks or to synchronize stateful and stateless information between Python and JavaScript.
- Download serialized state of all widget models in use
- Save notebook with interactive widgets
- Embed current widgets

Command Mode and Edit Mode toolbar (numbered items)
1. Save and checkpoint
2. Insert cell below
3. Cut cell
4. Copy cell(s)
5. Paste cell(s) below
6. Move cell up
7. Move cell down
8. Run current cell
9. Interrupt kernel
10. Restart kernel
11. Display characteristics
12. Open command palette
13. Current kernel
14. Kernel status
15. Log out from notebook server

Asking For Help
- Walk through a UI tour
- List of built-in keyboard shortcuts
- Edit the built-in keyboard shortcuts
- Notebook help topics
- Description of markdown available in notebook
- Information on unofficial Jupyter Notebook extensions
- Python help topics
- IPython help topics
- NumPy help topics
- SciPy help topics
- Matplotlib help topics
- SymPy help topics
- Pandas help topics
- About Jupyter Notebook
Python For Data Science Cheat Sheet
NumPy Basics

NumPy

The NumPy library is the core library for scientific computing in Python. It provides a high-performance multidimensional array object, and tools for working with these arrays.

Use the following import convention:
>>> import numpy as np

NumPy Arrays
[Figure: a 1D array (axis 0), a 2D array (axis 0, axis 1) and a 3D array (axis 0, axis 1, axis 2)]

Creating Arrays
>>> a = [Link]([1,2,3])
>>> b = [Link]([(1.5,2,3), (4,5,6)], dtype = float)
>>> c = [Link]([[(1.5,2,3), (4,5,6)], [(3,2,1), (4,5,6)]],
               dtype = float)

Initial Placeholders
>>> [Link]((3,4))                      Create an array of zeros
>>> [Link]((2,3,4),dtype=np.int16)     Create an array of ones
>>> d = [Link](10,25,5)                Create an array of evenly spaced values (step value)
>>> [Link](0,2,9)                      Create an array of evenly spaced values (number of samples)
>>> e = [Link]((2,2),7)                Create a constant array
>>> f = [Link](2)                      Create a 2X2 identity matrix
>>> [Link]((2,2))                      Create an array with random values
>>> [Link]((3,2))                      Create an empty array

I/O

Saving & Loading On Disk
>>> [Link]('my_array', a)
>>> [Link]('[Link]', a, b)
>>> [Link]('my_array.npy')

Saving & Loading Text Files
>>> [Link]("[Link]")
>>> [Link]("my_file.csv", delimiter=',')
>>> [Link]("[Link]", a, delimiter=" ")

Data Types
>>> np.int64       Signed 64-bit integer types
>>> np.float32     Standard single-precision floating point
>>> [Link]         Complex numbers represented by 128 bits (two 64-bit floats)
>>> [Link]         Boolean type storing TRUE and FALSE values
>>> [Link]         Python object type
>>> np.string_     Fixed-length string type
>>> np.unicode_    Fixed-length unicode type

Inspecting Your Array
>>> [Link]         Array dimensions
>>> len(a)         Length of array
>>> [Link]         Number of array dimensions
>>> [Link]         Number of array elements
>>> [Link]         Data type of array elements
>>> [Link]         Name of data type
>>> [Link](int)    Convert an array to a different type

Asking For Help
>>> [Link]([Link])

Array Mathematics

Arithmetic Operations
>>> g = a - b                Subtraction
array([[-0.5,  0. ,  0. ],
       [-3. , -3. , -3. ]])
>>> [Link](a,b)              Subtraction
>>> b + a                    Addition
array([[ 2.5,  4. ,  6. ],
       [ 5. ,  7. ,  9. ]])
>>> [Link](b,a)              Addition
>>> a / b                    Division
array([[ 0.66666667,  1.  ,  1.  ],
       [ 0.25      ,  0.4 ,  0.5 ]])
>>> [Link](a,b)              Division
>>> a * b                    Multiplication
array([[  1.5,   4. ,   9. ],
       [  4. ,  10. ,  18. ]])
>>> [Link](a,b)              Multiplication
>>> [Link](b)                Exponentiation
>>> [Link](b)                Square root
>>> [Link](a)                Print sines of an array
>>> [Link](b)                Element-wise cosine
>>> [Link](a)                Element-wise natural logarithm
>>> [Link](f)                Dot product
array([[ 7.,  7.],
       [ 7.,  7.]])

Comparison
>>> a == b                   Element-wise comparison
array([[False,  True,  True],
       [False, False, False]], dtype=bool)
>>> a < 2                    Element-wise comparison
array([True, False, False], dtype=bool)
>>> np.array_equal(a, b)     Array-wise comparison

Aggregate Functions
>>> [Link]()                 Array-wise sum
>>> [Link]()                 Array-wise minimum value
>>> [Link](axis=0)           Maximum value along axis 0 (one per column)
>>> [Link](axis=1)           Cumulative sum of the elements
>>> [Link]()                 Mean
>>> [Link]()                 Median
>>> [Link]()                 Correlation coefficient
>>> [Link](b)                Standard deviation

Copying Arrays
>>> h = [Link]()             Create a view of the array with the same data
>>> [Link](a)                Create a copy of the array
>>> h = [Link]()             Create a deep copy of the array

Sorting Arrays
>>> [Link]()                 Sort an array
>>> [Link](axis=0)           Sort the elements of an array's axis

Subsetting, Slicing, Indexing (also see Lists)

Subsetting
>>> a[2]                     Select the element at the 2nd index
3
>>> b[1,2]                   Select the element at row 1 column 2 (equivalent to b[1][2])
6.0

Slicing
>>> a[0:2]                   Select items at index 0 and 1
array([1, 2])
>>> b[0:2,1]                 Select items at rows 0 and 1 in column 1
array([ 2.,  5.])
>>> b[:1]                    Select all items at row 0 (equivalent to b[0:1, :])
array([[1.5, 2., 3.]])
>>> c[1,...]                 Same as [1,:,:]
array([[ 3.,  2.,  1.],
       [ 4.,  5.,  6.]])
>>> a[ : :-1]                Reversed array a
array([3, 2, 1])

Boolean Indexing
>>> a[a<2]                   Select elements from a less than 2
array([1])

Fancy Indexing
>>> b[[1, 0, 1, 0],[0, 1, 2, 0]]        Select elements (1,0),(0,1),(1,2) and (0,0)
array([ 4. ,  2. ,  6. ,  1.5])
>>> b[[1, 0, 1, 0]][:,[0,1,2,0]]        Select a subset of the matrix’s rows and columns
array([[ 4. ,  5. ,  6. ,  4. ],
       [ 1.5,  2. ,  3. ,  1.5],
       [ 4. ,  5. ,  6. ,  4. ],
       [ 1.5,  2. ,  3. ,  1.5]])

Array Manipulation

Transposing Array
>>> i = [Link](b)            Permute array dimensions
>>> i.T                      Permute array dimensions

Changing Array Shape
>>> [Link]()                 Flatten the array
>>> [Link](3,-2)             Reshape, but don’t change data

Adding/Removing Elements
>>> [Link]((2,6))            Return a new array with shape (2,6)
>>> [Link](h,g)              Append items to an array
>>> [Link](a, 1, 5)          Insert items in an array
>>> [Link](a,[1])            Delete items from an array

Combining Arrays
>>> [Link]((a,d),axis=0)     Concatenate arrays
array([ 1,  2,  3, 10, 15, 20])
>>> [Link]((a,b))            Stack arrays vertically (row-wise)
array([[ 1. ,  2. ,  3. ],
       [ 1.5,  2. ,  3. ],
       [ 4. ,  5. ,  6. ]])
>>> np.r_[e,f]               Stack arrays vertically (row-wise)
>>> [Link]((e,f))            Stack arrays horizontally (column-wise)
array([[ 7.,  7.,  1.,  0.],
       [ 7.,  7.,  0.,  1.]])
>>> np.column_stack((a,d))   Create stacked column-wise arrays
array([[ 1, 10],
       [ 2, 15],
       [ 3, 20]])
>>> np.c_[a,d]               Create stacked column-wise arrays

Splitting Arrays
>>> [Link](a,3)              Split the array horizontally into 3 equal parts
[array([1]),array([2]),array([3])]
>>> [Link](c,2)              Split the array vertically into 2 equal parts
[array([[[ 1.5,  2. ,  3. ],
         [ 4. ,  5. ,  6. ]]]),
 array([[[ 3.,  2.,  1.],
         [ 4.,  5.,  6.]]])]
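The arithmetic and indexing entries above rely on broadcasting and on index arrays, which a short runnable sketch may make clearer. The arrays `a` and `b` reuse the values given in this sheet; the function names (`np.array` and so on) are the standard NumPy ones and are an assumption here, since the extracted text elides them as `[Link]`.

```python
import numpy as np

# Same values as the sheet's a and b (constructor name assumed to be np.array).
a = np.array([1, 2, 3])
b = np.array([(1.5, 2, 3), (4, 5, 6)], dtype=float)

# Broadcasting: the 1D array a (shape (3,)) is applied to each row of b (shape (2, 3)).
diff = a - b

# Boolean indexing: the mask a < 2 keeps only the elements where it is True.
small = a[a < 2]

# Fancy indexing: row indices and column indices are paired element by element,
# selecting (1,0), (0,1), (1,2) and (0,0).
picked = b[[1, 0, 1, 0], [0, 1, 2, 0]]

# Slicing: rows 0 and 1 of column 1.
column = b[0:2, 1]

print(diff)
print(small, picked, column)
```

Note that `a - b` works only because the trailing dimension of both arrays is 3; shapes that do not line up this way raise a `ValueError`.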
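The difference between a view and a copy (see Copying Arrays) is easiest to see by modifying each one. This is a minimal sketch assuming the elided calls are the standard `ndarray.view()` and `ndarray.copy()` methods; the variable names are illustrative.

```python
import numpy as np

a = np.array([1, 2, 3])

view_of_a = a.view()   # new array object that shares a's data buffer
copy_of_a = a.copy()   # independent array with its own data

view_of_a[0] = 99      # also changes a, because the data is shared
copy_of_a[1] = -1      # leaves a untouched

print(a)                                  # first element is now 99
print(np.shares_memory(a, view_of_a))     # the view shares memory with a
print(np.shares_memory(a, copy_of_a))     # the copy does not
```

Basic slices such as `a[0:2]` also return views, so writing into a slice modifies the original array as well.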
Python For Data Science Cheat Sheet
SciPy - Linear Algebra

SciPy

The SciPy library is one of the core packages for scientific computing that provides mathematical algorithms and convenience functions built on the NumPy extension of Python.

Interacting With NumPy (also see NumPy)
>>> import numpy as np
>>> a = [Link]([1,2,3])
>>> b = [Link]([(1+5j,2j,3j), (4j,5j,6j)])
>>> c = [Link]([[(1.5,2,3), (4,5,6)], [(3,2,1), (4,5,6)]])

Index Tricks
>>> [Link][0:5,0:5]                  Create a dense meshgrid
>>> [Link][0:2,0:2]                  Create an open meshgrid
>>> np.r_[[3,[0]*5,-[Link]j]         Stack arrays vertically (row-wise)
>>> np.c_[b,c]                       Create stacked column-wise arrays

Shape Manipulation
>>> [Link](b)                        Permute array dimensions
>>> [Link]()                         Flatten the array
>>> [Link]((b,c))                    Stack arrays horizontally (column-wise)
>>> [Link]((a,b))                    Stack arrays vertically (row-wise)
>>> [Link](c,2)                      Split the array horizontally into 2 equal parts
>>> [Link](d,2)                      Split the array vertically into 2 equal parts

Polynomials
>>> from numpy import poly1d
>>> p = poly1d([3,4,5])              Create a polynomial object

Vectorizing Functions
>>> def myfunc(a):
...     if a < 0:
...         return a*2
...     else:
...         return a/2
>>> [Link](myfunc)                   Vectorize functions

Type Handling
>>> [Link](c)                        Return the real part of the array elements
>>> [Link](c)                        Return the imaginary part of the array elements
>>> np.real_if_close(c,tol=1000)     Return a real array if complex parts close to 0
>>> [Link]['f']([Link])              Cast object to a data type

Other Useful Functions
>>> [Link](b,deg=True)               Return the angle of the complex argument
>>> g = [Link](0,[Link],num=5)       Create an array of evenly spaced values (number of samples)
>>> g [3:] += [Link]
>>> [Link](g)                        Unwrap
>>> [Link](0,10,3)                   Create an array of evenly spaced values (log scale)
>>> [Link]([c<4],[c*2])              Return values from a list of arrays depending on conditions
>>> [Link](a)                        Factorial
>>> [Link](10,3,exact=True)          Combine N things taken at k time
>>> misc.central_diff_weights(3)     Weights for Np-point central derivative
>>> [Link](myfunc,1.0)               Find the n-th derivative of a function at a point

Linear Algebra (also see NumPy)

You’ll use the linalg and sparse modules. Note that [Link] contains and expands on [Link].
>>> from scipy import linalg, sparse

Creating Matrices
>>> A = [Link]([Link]((2,2)))
>>> B = [Link](b)
>>> C = [Link]([Link]((10,5)))
>>> D = [Link]([[3,4], [5,6]])

Basic Matrix Routines

Inverse
>>> A.I                              Inverse
>>> [Link](A)                        Inverse
>>> A.T                              Transpose matrix
>>> A.H                              Conjugate transposition
>>> [Link](A)                        Trace

Norm
>>> [Link](A)                        Frobenius norm
>>> [Link](A,1)                      L1 norm (max column sum)
>>> [Link](A,[Link])                 L inf norm (max row sum)

Rank
>>> [Link].matrix_rank(C)            Matrix rank

Determinant
>>> [Link](A)                        Determinant

Solving linear problems
>>> [Link](A,b)                      Solver for dense matrices
>>> E = [Link](a).T                  Solver for dense matrices
>>> [Link](D,E)                      Least-squares solution to linear matrix equation

Generalized inverse
>>> [Link](C)                        Compute the pseudo-inverse of a matrix (least-squares solver)
>>> linalg.pinv2(C)                  Compute the pseudo-inverse of a matrix (SVD); removed from recent SciPy releases

Creating Sparse Matrices
>>> F = [Link](3, k=1)               Create a 2X2 identity matrix
>>> G = [Link]([Link](2))            Create a 2x2 identity matrix
>>> C[C > 0.5] = 0
>>> H = sparse.csr_matrix(C)         Compressed Sparse Row matrix
>>> I = sparse.csc_matrix(D)         Compressed Sparse Column matrix
>>> J = sparse.dok_matrix(A)         Dictionary Of Keys matrix
>>> [Link]()                         Sparse matrix to full matrix
>>> sparse.isspmatrix_csc(A)         Identify sparse matrix

Sparse Matrix Routines

Inverse
>>> [Link](I)                        Inverse
Norm
>>> [Link](I)                        Norm
Solving linear problems
>>> [Link](H,I)                      Solver for sparse matrices

Sparse Matrix Functions
>>> [Link](I)                        Sparse matrix exponential

Sparse Matrix Decompositions
>>> la, v = [Link](F,1)              Eigenvalues and eigenvectors
>>> [Link](H, 2)                     SVD

Matrix Functions

Addition
>>> [Link](A,D)                      Addition
Subtraction
>>> [Link](A,D)                      Subtraction
Division
>>> [Link](A,D)                      Division
Multiplication
>>> [Link](D,A)                      Multiplication
>>> [Link](A,D)                      Dot product
>>> [Link](A,D)                      Vector dot product
>>> [Link](A,D)                      Inner product
>>> [Link](A,D)                      Outer product
>>> [Link](A,D)                      Tensor dot product
>>> [Link](A,D)                      Kronecker product

Exponential Functions
>>> [Link](A)                        Matrix exponential
>>> linalg.expm2(A)                  Matrix exponential (Taylor Series); removed from recent SciPy releases
>>> linalg.expm3(D)                  Matrix exponential (eigenvalue decomposition); removed from recent SciPy releases

Logarithm Function
>>> [Link](A)                        Matrix logarithm

Trigonometric Functions
>>> [Link](D)                        Matrix sine
>>> [Link](D)                        Matrix cosine
>>> [Link](A)                        Matrix tangent

Hyperbolic Trigonometric Functions
>>> [Link](D)                        Hyperbolic matrix sine
>>> [Link](D)                        Hyperbolic matrix cosine
>>> [Link](A)                        Hyperbolic matrix tangent

Matrix Sign Function
>>> [Link](A)                        Matrix sign function

Matrix Square Root
>>> [Link](A)                        Matrix square root

Arbitrary Functions
>>> [Link](A, lambda x: x*x)         Evaluate matrix function

Decompositions

Eigenvalues and Eigenvectors
>>> la, v = [Link](A)                Solve ordinary or generalized eigenvalue problem for square matrix
>>> l1, l2 = la                      Unpack eigenvalues
>>> v[:,0]                           First eigenvector
>>> v[:,1]                           Second eigenvector
>>> [Link](A)                        Unpack eigenvalues

Singular Value Decomposition
>>> U,s,Vh = [Link](B)               Singular Value Decomposition (SVD)
>>> M,N = [Link]
>>> Sig = [Link](s,M,N)              Construct sigma matrix in SVD

LU Decomposition
>>> P,L,U = [Link](C)                LU Decomposition

Asking For Help
>>> help([Link])
>>> [Link]([Link])
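As a hedged illustration of the dense solver and the eigenvalue routine listed in this sheet, the sketch below solves a small system and checks the results numerically. It assumes the elided calls are `scipy.linalg.solve`, `scipy.linalg.eig` and `scipy.linalg.det`; the matrix reuses the values of the sheet's `D`, while the right-hand side `rhs` is a made-up example.

```python
import numpy as np
from scipy import linalg

# Same values as the sheet's D, stored as a plain 2D array.
D = np.array([[3.0, 4.0], [5.0, 6.0]])
rhs = np.array([1.0, 2.0])          # hypothetical right-hand side

# Solve D @ x = rhs for x.
x = linalg.solve(D, rhs)
residual = D @ x - rhs              # should be numerically zero

# Eigen-decomposition: column j of eigenvectors pairs with eigenvalues[j].
eigenvalues, eigenvectors = linalg.eig(D)
eig_check = D @ eigenvectors - eigenvectors * eigenvalues

det_D = linalg.det(D)

print(x)
print(eigenvalues)
print(det_D)
```

Checking the residual rather than trusting the solver output is a cheap habit that catches shape mistakes and badly conditioned systems early.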
Python For Data Science Cheat Sheet
Pandas Basics

Pandas

The Pandas library is built on NumPy and provides easy-to-use data structures and data analysis tools for the Python programming language.

Use the following import convention:
>>> import pandas as pd

Pandas Data Structures

Series
A one-dimensional labeled array capable of holding any data type

Index    Value
a        3
b        -5
c        7
d        4

>>> s = [Link]([3, -5, 7, 4], index=['a', 'b', 'c', 'd'])

DataFrame
A two-dimensional labeled data structure with columns of potentially different types

Index    Country    Capital      Population
0        Belgium    Brussels     11190846
1        India      New Delhi    1303171035
2        Brazil     Brasília     207847528

>>> data = {'Country': ['Belgium', 'India', 'Brazil'],
            'Capital': ['Brussels', 'New Delhi', 'Brasília'],
            'Population': [11190846, 1303171035, 207847528]}
>>> df = [Link](data,
                columns=['Country', 'Capital', 'Population'])

I/O

Read and Write to CSV
>>> pd.read_csv('[Link]', header=None, nrows=5)
>>> df.to_csv('[Link]')

Read and Write to Excel
>>> pd.read_excel('[Link]')
>>> df.to_excel('dir/[Link]', sheet_name='Sheet1')
Read multiple sheets from the same file
>>> xlsx = [Link]('[Link]')
>>> df = pd.read_excel(xlsx, 'Sheet1')

Read and Write to SQL Query or Database Table
>>> from sqlalchemy import create_engine
>>> engine = create_engine('sqlite:///:memory:')
>>> pd.read_sql("SELECT * FROM my_table;", engine)
>>> pd.read_sql_table('my_table', engine)
>>> pd.read_sql_query("SELECT * FROM my_table;", engine)
read_sql() is a convenience wrapper around read_sql_table() and read_sql_query()
>>> df.to_sql('myDf', engine)

Asking For Help
>>> help([Link])

Selection (also see NumPy Arrays)

Getting
>>> s['b']                           Get one element
-5
>>> df[1:]                           Get subset of a DataFrame
   Country    Capital      Population
1  India      New Delhi    1303171035
2  Brazil     Brasília     207847528

Selecting, Boolean Indexing & Setting

By Position
>>> [Link]([0],[0])                  Select single value by row & column
'Belgium'
>>> [Link]([0],[0])
'Belgium'

By Label
>>> [Link]([0], ['Country'])         Select single value by row & column labels
'Belgium'
>>> [Link]([0], ['Country'])
'Belgium'

By Label/Position
>>> [Link][2]                        Select single row of subset of rows
Country       Brazil
Capital       Brasília
Population    207847528
>>> [Link][:,'Capital']              Select a single column of subset of columns
0    Brussels
1    New Delhi
2    Brasília
>>> [Link][1,'Capital']              Select rows and columns
'New Delhi'

Boolean Indexing
>>> s[~(s > 1)]                      Series s where value is not >1
>>> s[(s < -1) | (s > 2)]            s where value is <-1 or >2
>>> df[df['Population']>1200000000]  Use filter to adjust DataFrame

Setting
>>> s['a'] = 6                       Set index a of Series s to 6

Dropping
>>> [Link](['a', 'c'])               Drop values from rows (axis=0)
>>> [Link]('Country', axis=1)        Drop values from columns (axis=1)

Sort & Rank
>>> df.sort_index()                  Sort by labels along an axis
>>> df.sort_values(by='Country')     Sort by the values along an axis
>>> [Link]()                         Assign ranks to entries

Retrieving Series/DataFrame Information

Basic Information
>>> [Link]                           (rows,columns)
>>> [Link]                           Describe index
>>> [Link]                           Describe DataFrame columns
>>> [Link]()                         Info on DataFrame
>>> [Link]()                         Number of non-NA values

Summary
>>> [Link]()                         Sum of values
>>> [Link]()                         Cumulative sum of values
>>> [Link]()/[Link]()                Minimum/maximum values
>>> [Link]()/[Link]()                Minimum/Maximum index value
>>> [Link]()                         Summary statistics
>>> [Link]()                         Mean of values
>>> [Link]()                         Median of values

Applying Functions
>>> f = lambda x: x*2
>>> [Link](f)                        Apply function
>>> [Link](f)                        Apply function element-wise

Data Alignment

Internal Data Alignment
NA values are introduced in the indices that don’t overlap:
>>> s3 = [Link]([7, -2, 3], index=['a', 'c', 'd'])
>>> s + s3
a    10.0
b    NaN
c    5.0
d    7.0

Arithmetic Operations with Fill Methods
You can also do the internal data alignment yourself with the help of the fill methods:
>>> [Link](s3, fill_value=0)
a    10.0
b    -5.0
c    5.0
d    7.0
>>> [Link](s3, fill_value=2)
>>> [Link](s3, fill_value=4)
>>> [Link](s3, fill_value=3)
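To make the selection entries concrete, here is a small sketch built on the sheet's Country/Capital/Population table. It assumes the elided accessors are the standard pandas `iloc` (position-based) and `loc` (label-based) indexers; the result variable names are illustrative.

```python
import pandas as pd

data = {'Country': ['Belgium', 'India', 'Brazil'],
        'Capital': ['Brussels', 'New Delhi', 'Brasília'],
        'Population': [11190846, 1303171035, 207847528]}
df = pd.DataFrame(data, columns=['Country', 'Capital', 'Population'])

# Position-based: row 0, column 0.
first_country = df.iloc[0, 0]

# Label-based: row label 1, column label 'Capital'.
capital = df.loc[1, 'Capital']

# Boolean indexing: keep only rows where the condition holds.
large = df[df['Population'] > 1200000000]

# Dropping a column returns a new DataFrame; df itself is unchanged.
without_country = df.drop('Country', axis=1)

print(first_country, capital)
print(large)
print(without_country.columns.tolist())
```

With the default integer index, row labels and row positions coincide, which is why `loc[1, ...]` and `iloc[1, ...]` pick the same row here; they diverge as soon as the index is relabelled or filtered.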
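The data alignment behaviour described in this sheet can be reproduced in a few lines. This sketch uses the sheet's `s` and `s3` values and assumes the elided fill method is `Series.add`; the names `aligned` and `filled` are illustrative.

```python
import pandas as pd

s = pd.Series([3, -5, 7, 4], index=['a', 'b', 'c', 'd'])
s3 = pd.Series([7, -2, 3], index=['a', 'c', 'd'])

# Plain addition aligns on the index; 'b' has no partner in s3, so it becomes NaN.
aligned = s + s3

# add() with fill_value substitutes 0 for the missing partner before adding.
filled = s.add(s3, fill_value=0)

print(aligned)
print(filled)
```

The results are floating point even though both inputs are integers, because NaN can only be represented in a float column.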
Python For Data Science Cheat Sheet
Scikit-Learn

Scikit-learn

Scikit-learn is an open source Python library that implements a range of machine learning, preprocessing, cross-validation and visualization algorithms using a unified interface.

A Basic Example
>>> from sklearn import neighbors, datasets, preprocessing
>>> from sklearn.model_selection import train_test_split
>>> from [Link] import accuracy_score
>>> iris = datasets.load_iris()
>>> X, y = [Link][:, :2], [Link]
>>> X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=33)
>>> scaler = [Link]().fit(X_train)
>>> X_train = [Link](X_train)
>>> X_test = [Link](X_test)
>>> knn = [Link](n_neighbors=5)
>>> [Link](X_train, y_train)
>>> y_pred = [Link](X_test)
>>> accuracy_score(y_test, y_pred)

Loading The Data (also see NumPy & Pandas)

Your data needs to be numeric and stored as NumPy arrays or SciPy sparse matrices. Other types that are convertible to numeric arrays, such as Pandas DataFrame, are also acceptable.
>>> import numpy as np
>>> X = [Link]((10,5))
>>> y = [Link](['M','M','F','F','M','F','M','M','F','F'])
>>> X[X < 0.7] = 0

Training And Test Data
>>> from sklearn.model_selection import train_test_split
>>> X_train, X_test, y_train, y_test = train_test_split(X,
                                                        y,
                                                        random_state=0)

Preprocessing The Data

Standardization
>>> from [Link] import StandardScaler
>>> scaler = StandardScaler().fit(X_train)
>>> standardized_X = [Link](X_train)
>>> standardized_X_test = [Link](X_test)

Normalization
>>> from [Link] import Normalizer
>>> scaler = Normalizer().fit(X_train)
>>> normalized_X = [Link](X_train)
>>> normalized_X_test = [Link](X_test)

Binarization
>>> from [Link] import Binarizer
>>> binarizer = Binarizer(threshold=0.0).fit(X)
>>> binary_X = [Link](X)

Encoding Categorical Features
>>> from [Link] import LabelEncoder
>>> enc = LabelEncoder()
>>> y = enc.fit_transform(y)

Imputing Missing Values
>>> from [Link] import Imputer
>>> imp = Imputer(missing_values=0, strategy='mean', axis=0)
>>> imp.fit_transform(X_train)

Generating Polynomial Features
>>> from [Link] import PolynomialFeatures
>>> poly = PolynomialFeatures(5)
>>> poly.fit_transform(X)

Create Your Model

Supervised Learning Estimators

Linear Regression
>>> from sklearn.linear_model import LinearRegression
>>> lr = LinearRegression(normalize=True)        (the normalize parameter was removed in scikit-learn 1.2)

Support Vector Machines (SVM)
>>> from [Link] import SVC
>>> svc = SVC(kernel='linear')

Naive Bayes
>>> from sklearn.naive_bayes import GaussianNB
>>> gnb = GaussianNB()

KNN
>>> from sklearn import neighbors
>>> knn = [Link](n_neighbors=5)

Unsupervised Learning Estimators

Principal Component Analysis (PCA)
>>> from [Link] import PCA
>>> pca = PCA(n_components=0.95)

K Means
>>> from [Link] import KMeans
>>> k_means = KMeans(n_clusters=3, random_state=0)

Model Fitting

Supervised learning
>>> [Link](X, y)                                 Fit the model to the data
>>> [Link](X_train, y_train)
>>> [Link](X_train, y_train)

Unsupervised Learning
>>> k_means.fit(X_train)                         Fit the model to the data
>>> pca_model = pca.fit_transform(X_train)       Fit to data, then transform it

Prediction

Supervised Estimators
>>> y_pred = [Link]([Link]((2,5)))               Predict labels
>>> y_pred = [Link](X_test)                      Predict labels
>>> y_pred = knn.predict_proba(X_test)           Estimate probability of a label

Unsupervised Estimators
>>> y_pred = k_means.predict(X_test)             Predict labels in clustering algos

Evaluate Your Model’s Performance

Classification Metrics

Accuracy Score
>>> [Link](X_test, y_test)                       Estimator score method
>>> from [Link] import accuracy_score            Metric scoring functions
>>> accuracy_score(y_test, y_pred)

Classification Report
>>> from [Link] import classification_report     Precision, recall, f1-score and support
>>> print(classification_report(y_test, y_pred))

Confusion Matrix
>>> from [Link] import confusion_matrix
>>> print(confusion_matrix(y_test, y_pred))

Regression Metrics

Mean Absolute Error
>>> from [Link] import mean_absolute_error
>>> y_true = [3, -0.5, 2]
>>> mean_absolute_error(y_true, y_pred)

Mean Squared Error
>>> from [Link] import mean_squared_error
>>> mean_squared_error(y_test, y_pred)

R² Score
>>> from [Link] import r2_score
>>> r2_score(y_true, y_pred)

Clustering Metrics

Adjusted Rand Index
>>> from [Link] import adjusted_rand_score
>>> adjusted_rand_score(y_true, y_pred)

Homogeneity
>>> from [Link] import homogeneity_score
>>> homogeneity_score(y_true, y_pred)

V-measure
>>> from [Link] import v_measure_score
>>> v_measure_score(y_true, y_pred)

Cross-Validation
>>> from sklearn.model_selection import cross_val_score
>>> print(cross_val_score(knn, X_train, y_train, cv=4))
>>> print(cross_val_score(lr, X, y, cv=2))

Tune Your Model

Grid Search
>>> from sklearn.model_selection import GridSearchCV
>>> params = {"n_neighbors": [Link](1,3),
              "metric": ["euclidean", "cityblock"]}
>>> grid = GridSearchCV(estimator=knn,
                        param_grid=params)
>>> [Link](X_train, y_train)
>>> print(grid.best_score_)
>>> print(grid.best_estimator_.n_neighbors)

Randomized Parameter Optimization
>>> from sklearn.model_selection import RandomizedSearchCV
>>> params = {"n_neighbors": range(1,5),
              "weights": ["uniform", "distance"]}
>>> rsearch = RandomizedSearchCV(estimator=knn,
                                 param_distributions=params,
                                 cv=4,
                                 n_iter=8,
                                 random_state=5)
>>> [Link](X_train, y_train)
>>> print(rsearch.best_score_)
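The basic example in this sheet can be written out as a self-contained script. The sketch below assumes the elided names are `StandardScaler`, `KNeighborsClassifier` and `sklearn.metrics.accuracy_score`, which match the descriptions but are not spelled out in the extracted text; the `_std` variable names are illustrative.

```python
from sklearn import datasets
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.preprocessing import StandardScaler

# Load the iris data and keep the first two features, as in the sheet.
iris = datasets.load_iris()
X, y = iris.data[:, :2], iris.target

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=33)

# Fit the scaler on the training split only, then apply it to both splits
# so that no information from the test set leaks into preprocessing.
scaler = StandardScaler().fit(X_train)
X_train_std = scaler.transform(X_train)
X_test_std = scaler.transform(X_test)

knn = KNeighborsClassifier(n_neighbors=5)
knn.fit(X_train_std, y_train)

y_pred = knn.predict(X_test_std)
accuracy = accuracy_score(y_test, y_pred)
print(accuracy)
```

The same fit-on-train, transform-both pattern applies to the other preprocessing steps listed above, such as normalization and imputation.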
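Cross-validation and grid search live in `sklearn.model_selection` in current scikit-learn releases. The following sketch shows both on the iris data; the parameter grid follows the sheet, except that `n_neighbors` is written as an explicit list, and `cv=4` for the grid search is an assumption made for this example.

```python
from sklearn import datasets
from sklearn.model_selection import GridSearchCV, cross_val_score
from sklearn.neighbors import KNeighborsClassifier

iris = datasets.load_iris()
X, y = iris.data, iris.target

knn = KNeighborsClassifier(n_neighbors=5)

# 4-fold cross-validation: one accuracy score per fold.
scores = cross_val_score(knn, X, y, cv=4)

# Exhaustive search over every combination in the grid.
params = {"n_neighbors": [1, 2],
          "metric": ["euclidean", "cityblock"]}
grid = GridSearchCV(estimator=knn, param_grid=params, cv=4)
grid.fit(X, y)

print(scores)
print(grid.best_score_)
print(grid.best_params_)
```

`GridSearchCV` clones the estimator for each candidate, so the `knn` object passed in is left unfitted; the refitted winner is available as `grid.best_estimator_`.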
Python For Data Science Cheat Sheet Plot Anatomy & Workflow
Plot Anatomy Workflow
Matplotlib Axes/Subplot The basic steps to creating plots with matplotlib are:
Learn Python Interactively at [Link] 1 Prepare data 2 Create plot 3 Plot 4 Customize plot 5 Save plot 6 Show plot
>>> import [Link] as plt
>>> x = [1,2,3,4] Step 1
>>> y = [10,20,25,30]
Matplotlib Y-axis Figure
>>>
>>>
fig = [Link]() Step 2
ax = fig.add_subplot(111) Step 3
>>> [Link](x, y, color='lightblue', linewidth=3) Step 3, 4
Matplotlib is a Python 2D plo!ing library which produces >>> [Link]([2,4,6],
publication-quality figures in a variety of hardcopy formats [5,15,25],
color='darkgreen',
and interactive environments across marker='^')
platforms. X-axis
>>> ax.set_xlim(1, 6.5)
>>> [Link]('[Link]')

1 Prepare The Data Also see Lists & NumPy

>>> [Link]() Step 6

1D Data 4 Customize Plot

>>> import numpy as np Colors, Color Bars & Color Maps Mathtext
>>> x = [Link](0, 10, 100)
>>> y = [Link](x) >>> [Link](x, x, x, x**2, x, x**3) >>> [Link](r'$sigma_i=15$', fontsize=20)
>>> z = [Link](x) >>> [Link](x, y, alpha = 0.4)
>>> [Link](x, y, c='k') Limits, Legends & Layouts
2D Data or Images >>> [Link](im, orientation='horizontal')
>>> im = [Link](img, Limits & Autoscaling
>>> data = 2 * [Link]((10, 10)) cmap='seismic')
>>> data2 = 3 * [Link]((10, 10)) >>> [Link](x=0.0,y=0.1) Add padding to a plot
>>> [Link]('equal') Set the aspect ratio of the plot to 1
>>> Y, X = [Link][-[Link]j, -[Link]j] Markers >>> [Link](xlim=[0,10.5],ylim=[-1.5,1.5]) Set limits for x-and y-axis
>>> U = -1 - X**2 + Y
>>> V = 1 + X - Y**2 >>> fig, ax = [Link]() >>> ax.set_xlim(0,10.5) Set limits for x-axis
>>> from [Link] import get_sample_data >>> [Link](x,y,marker=".") Legends
>>> img = [Link](get_sample_data('axes_grid/bivariate_normal.npy')) >>> [Link](x,y,marker="o") >>> [Link](title='An Example Axes', Set a title and x-and y-axis labels
ylabel='Y-Axis',
Linestyles
2
xlabel='X-Axis')
Create Plot >>> [Link](x,y,linewidth=4.0)
>>> [Link](loc='best') No overlapping plot elements
>>> [Link](x,y,ls='solid')
Ticks
>>> import [Link] as plt >>> [Link](ticks=range(1,5), Manually set x-ticks
>>> [Link](x,y,ls='--') ticklabels=[3,100,-12,"foo"])
Figure >>> [Link](x,y,'--',x**2,y**2,'-.') >>> ax.tick_params(axis='y', Make y-ticks longer and go in and out
>>> [Link](lines,color='r',linewidth=4.0) direction='inout',
>>> fig = [Link]() length=10)
>>> fig2 = [Link](figsize=[Link](2.0)) Text & Annotations
Subplot Spacing
Axes >>> [Link](1, >>> fig3.subplots_adjust(wspace=0.5, Adjust the spacing between subplots
-2.1, hspace=0.3,
All plo!ing is done with respect to an Axes. In most cases, a 'Example Graph', left=0.125,
style='italic') right=0.9,
subplot will fit your needs. A subplot is an axes on a grid system. >>> [Link]("Sine", top=0.9,
>>> fig.add_axes() xy=(8, 0), bottom=0.1)
>>> ax1 = fig.add_subplot(221) # row-col-num xycoords='data', >>> fig.tight_layout() Fit subplot(s) in to the figure area
xytext=(10.5, 0),
>>> ax3 = fig.add_subplot(212) textcoords='data', Axis Spines
>>> fig3, axes = [Link](nrows=2,ncols=2) arrowprops=dict(arrowstyle="->", >>> [Link]['top'].set_visible(False) Make the top axis line for a plot invisible
>>> fig4, axes2 = [Link](ncols=3) connectionstyle="arc3"),) >>> [Link]['bottom'].set_position(('outward',10)) Move the bo!om axis line outward

3 Plo!ing Routines 5 Save Plot

1D Data Vector Fields Save figures
>>> [Link]('[Link]')
>>> fig, ax = [Link]() >>> axes[0,1].arrow(0,0,0.5,0.5) Add an arrow to the axes
>>> lines = [Link](x,y) Draw points with lines or markers connecting them >>> axes[1,1].quiver(y,z) Plot a 2D field of arrows Save transparent figures
>>> [Link](x,y) Draw unconnected points, scaled or colored >>> axes[0,1].streamplot(X,Y,U,V) Plot streamlines of a 2D vector field >>> [Link]('[Link]', transparent=True)
>>> axes[0,0].bar([1,2,3],[3,4,5]) Plot vertical rectangles (constant width)
>>> axes[1,0].barh([0.5,1,2.5],[0,1,2]) Plot horizontal rectangles (constant height)
>>> axes[1,1].axhline(0.45) Draw a horizontal line across axes
>>> axes[0,1].axvline(0.65) Draw a vertical line across axes
Data Distributions
>>> [Link](y) Plot a histogram
6 Show Plot
>>> [Link]()
>>> [Link](x,y,color='blue') Draw filled polygons >>> [Link](y) Make a box and whisker plot
>>> ax.fill_between(x,y,color='yellow') Fill between y-values and 0 >>> [Link](z) Make a violin plot
2D Data or Images Close & Clear
>>> fig, ax = [Link]() >>> [Link]() Clear an axis
>>> axes2[0].pcolor(data2) Pseudocolor plot of 2D array >>> [Link]() Clear the entire figure
>>> im = [Link](img, Colormapped or RGB arrays >>> axes2[0].pcolormesh(data) Pseudocolor plot of 2D array
cmap='gist_earth', >>> [Link]() Close a window
interpolation='nearest', >>> CS = [Link](Y,X,U) Plot contours
>>> axes2[2].contourf(data1) Plot filled contours
vmin=-2,
vmax=2) >>> axes2[2]= [Link](CS) Label a contour plot DataCamp
Learn Python for Data Science Interactively
Matplotlib 2.0.0 - Updated on: 02/2017
Python For Data Science Cheat Sheet 3 Plotting With Seaborn
Seaborn Axis Grids
Learn Data Science Interactively at [Link] >>> g = [Link](titanic, Subplot grid for plotting conditional >>> h = [Link](iris) Subplot grid for plotting pairwise
col="survived", relationships >>> h = [Link]([Link]) relationships
row="sex") >>> [Link](iris) Plot pairwise bivariate distributions
>>> g = [Link]([Link],"age") >>> i = [Link](x="x", Grid for bivariate plot with marginal
>>> [Link](x="pclass", Draw a categorical plot onto a y="y", univariate plots
y="survived", Facetgrid data=data)
Statistical Data Visualization With Seaborn hue="sex",
data=titanic)
>>> i = [Link]([Link],
[Link])
The Python visualization library Seaborn is based on >>> [Link](x="sepal_width", Plot data and regression model fits >>> [Link]("sepal_length", Plot bivariate distribution
y="sepal_length", across a FacetGrid "sepal_width",
matplotlib and provides a high-level interface for drawing hue="species", data=iris,
attractive statistical graphics. data=iris) kind='kde')

Categorical Plots Regression Plots

Make use of the following aliases to import the libraries: >>> [Link](x="sepal_width", Plot data and a linear regression
Scatterplot
>>> import [Link] as plt >>> [Link](x="species", Scatterplot with one y="sepal_length", model fit
>>> import seaborn as sns data=iris,
y="petal_length", categorical variable
data=iris) ax=ax)
The basic steps to creating plots with Seaborn are: >>> [Link](x="species", Categorical scatterplot with Distribution Plots
y="petal_length", non-overlapping points
1. Prepare some data data=iris) >>> plot = [Link](data.y, Plot univariate distribution
2. Control figure aesthetics Bar Chart kde=False,
color="b")
3. Plot with Seaborn >>> [Link](x="sex", Show point estimates and
y="survived", confidence intervals as Matrix Plots
4. Further customize your plot hue="class", rectangular bars
>>> [Link](uniform_data,vmin=0,vmax=1) Heatmap
data=titanic)
>>> import [Link] as plt Count Plot
>>> import seaborn as sns
>>> tips = sns.load_dataset("tips") Step 1
>>> sns.set_style("whitegrid") Step 2
>>> [Link](x="deck",
data=titanic,
Show count of observations
4 Further Customizations Also see Matplotlib
palette="Greens_d")
>>> g = [Link](x="tip", Step 3
Point Plot Axisgrid Objects
y="total_bill",
data=tips, >>> [Link](x="class", Show point estimates and >>> [Link](left=True) Remove left spine
aspect=2) y="survived", confidence intervals with >>> g.set_ylabels("Survived") Set the labels of the y-axis
>>> g = (g.set_axis_labels("Tip","Total bill(USD)"). hue="sex", scatterplot glyphs >>> g.set_xticklabels(rotation=45) Set the tick labels for x
set(xlim=(0,10),ylim=(0,100))) data=titanic, >>> g.set_axis_labels("Survived", Set the axis labels
Step 4 palette={"male":"g", "Sex")
>>> [Link]("title")
>>> [Link](g) Step 5 "female":"m"}, >>> [Link](xlim=(0,5), Set the limit and ticks of the
markers=["^","o"], ylim=(0,5), x-and y-axis
linestyles=["-","--"]) xticks=[0,2.5,5],
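The five Seaborn steps listed above can be sketched end to end as follows. This is a minimal example under stated assumptions: the small `tips` DataFrame holds hypothetical values standing in for the built-in dataset (so the sketch runs offline), `ci=None` is added to skip the bootstrap confidence band, and the plot is written to an in-memory buffer instead of being shown.

```python
import io

import matplotlib
matplotlib.use("Agg")  # non-interactive backend; omit in a notebook
import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

# Step 1: prepare some data (hypothetical values, not the real "tips" dataset)
tips = pd.DataFrame({
    "tip": [1.0, 1.7, 3.5, 3.3, 3.6, 4.7, 2.0, 3.1],
    "total_bill": [17.0, 10.3, 21.0, 23.7, 24.6, 25.3, 8.8, 26.9],
})

# Step 2: control figure aesthetics
sns.set_style("whitegrid")

# Step 3: plot with Seaborn (returns a FacetGrid)
g = sns.lmplot(x="tip", y="total_bill", data=tips, aspect=2, ci=None)

# Step 4: further customize the plot
g = g.set_axis_labels("Tip", "Total bill (USD)").set(xlim=(0, 10), ylim=(0, 100))

# Step 5: save the plot; pass a file name instead of a buffer to write to disk
buf = io.BytesIO()
g.savefig(buf, format="png")
```

Because `set_axis_labels` and `set` both return the grid itself, the customization calls in step 4 can be chained.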

1
Boxplot yticks=[0,2.5,5])
Data Also see Lists, NumPy & Pandas >>> [Link](x="alive", Boxplot
Plot
y="age",
>>> import pandas as pd hue="adult_male",
>>> import numpy as np >>> [Link]("A Title") Add plot title
data=titanic)
>>> uniform_data = [Link](10, 12) >>> [Link]("Survived") Adjust the label of the y-axis
>>> [Link](data=iris,orient="h") Boxplot with wide-form data >>> [Link]("Sex") Adjust the label of the x-axis
>>> data = [Link]({'x':[Link](1,101),
'y':[Link](0,4,100)}) Violinplot >>> [Link](0,100) Adjust the limits of the y-axis
>>> [Link](x="age", Violin plot >>> [Link](0,10) Adjust the limits of the x-axis
Seaborn also offers built-in data sets: y="sex", >>> [Link](ax,yticks=[0,5]) Adjust a plot property
>>> titanic = sns.load_dataset("titanic") hue="survived", >>> plt.tight_layout() Adjust subplot params
>>> iris = sns.load_dataset("iris") data=titanic)

2 Figure Aesthetics Also see Matplotlib

5 Show or Save Plot Also see Matplotlib
>>> [Link]() Show the plot
Context Functions >>> [Link]("[Link]") Save the plot as a figure
>>> f, ax = [Link](figsize=(5,6)) Create a figure and one subplot >>> [Link]("[Link]", Save transparent figure
>>> sns.set_context("talk") Set context to "talk" transparent=True)
>>> sns.set_context("notebook", Set context to "notebook",
Seaborn styles font_scale=1.5, scale font elements and
>>> [Link]() (Re)set the seaborn default
rc={"[Link]":2.5}) override param mapping Close & Clear Also see Matplotlib
>>> sns.set_style("whitegrid") Set the matplotlib parameters Color Palette >>> [Link]() Clear an axis
>>> sns.set_style("ticks", Set the matplotlib parameters >>> [Link]() Clear an entire figure
{"[Link]":8, >>> sns.set_palette("husl",3) Define the color palette >>> [Link]() Close a window
"[Link]":8}) >>> sns.color_palette("husl") Use with with to temporarily set palette
>>> sns.axes_style("whitegrid") Return a dict of params or use with >>> flatui = ["#9b59b6","#3498db","#95a5a6","#e74c3c","#34495e","#2ecc71"]
with to temporarily set the style >>> sns.set_palette(flatui) Set your own color palette
Python For Data Science Cheat Sheet 3 Renderers & Visual Customizations
Bokeh Glyphs Grid Layout
Learn Bokeh Interactively at [Link], Scatter Markers >>> from [Link] import gridplot
taught by Bryan Van de Ven, core contributor >>> [Link]([Link]([1,2,3]), [Link]([3,2,1]), >>> row1 = [p1,p2]
fill_color='white') >>> row2 = [p3]
>>> [Link]([Link]([1.5,3.5,5.5]), [1,4,3], >>> layout = gridplot([[p1,p2],[p3]])
color='blue', size=1)
Plotting With Bokeh Line Glyphs Tabbed Layout
>>> [Link]([1,2,3,4], [3,4,5,6], line_width=2)
>>> from [Link] import Panel, Tabs
The Python interactive visualization library Bokeh >>> p2.multi_line([Link]([[1,2,3],[5,6,7]]),
>>> tab1 = Panel(child=p1, title="tab1")
[Link]([[3,4,5],[3,2,1]]),
enables high-performance visual presentation of color="blue") >>> tab2 = Panel(child=p2, title="tab2")
>>> layout = Tabs(tabs=[tab1, tab2])
large datasets in modern web browsers.
Customized Glyphs Also see Data
Linked Plots
Bokeh’s mid-level general purpose [Link] Selection and Non-Selection Glyphs
>>> p = figure(tools='box_select') Linked Axes
interface is centered around two main components: data >>> [Link]('mpg', 'cyl', source=cds_df, >>> p2.x_range = p1.x_range
and glyphs. selection_color='red', >>> p2.y_range = p1.y_range
nonselection_alpha=0.1) Linked Brushing
+ = Hover Glyphs >>> p4 = figure(plot_width = 100,
tools='box_select,lasso_select')
>>> from [Link] import HoverTool
>>> [Link]('mpg', 'cyl', source=cds_df)
data glyphs plot >>> hover = HoverTool(tooltips=None, mode='vline')
>>> p5 = figure(plot_width = 200,
>>> p3.add_tools(hover)
tools='box_select,lasso_select')
The basic steps to creating plots with the [Link] >>> [Link]('mpg', 'hp', source=cds_df)
interface are: US
Colormapping >>> layout = row(p4,p5)
1. Prepare some data:
Asia

>>> from [Link] import CategoricalColorMapper

Europe

Python lists, NumPy arrays, Pandas DataFrames and other sequences of values
2. Create a new plot
>>> color_mapper = CategoricalColorMapper(
factors=['US', 'Asia', 'Europe'],
palette=['blue', 'red', 'green'])
4 Output & Export
3. Add renderers for your data, with visual customizations >>> [Link]('mpg', 'cyl', source=cds_df, Notebook
color=dict(field='origin',
4. Specify where to generate the output transform=color_mapper), >>> from [Link] import output_notebook, show
5. Show or save the results legend='Origin') >>> output_notebook()
>>> from [Link] import figure
>>> from [Link] import output_file, show Legend Location HTML
>>> x = [1, 2, 3, 4, 5] Step 1
>>> y = [6, 7, 2, 4, 5] Inside Plot Area Standalone HTML
>>> p = figure(title="simple line example", Step 2 >>> [Link] = 'bottom_left' >>> from [Link] import file_html
>>> from [Link] import CDN
x_axis_label='x',
>>> html = file_html(p, CDN, "my_plot")
y_axis_label='y') Outside Plot Area
>>> [Link](x, y, legend="Temp.", line_width=2) Step 3 >>> from [Link] import Legend
>>> r1 = [Link]([Link]([1,2,3]), [Link]([3,2,1])) >>> from [Link] import output_file, show
>>> output_file("[Link]") Step 4 >>> r2 = [Link]([1,2,3,4], [3,4,5,6]) >>> output_file('my_bar_chart.html', mode='cdn')
>>> show(p) Step 5 >>> legend = Legend(items=[("One" ,[p1, r1]),("Two",[r2])],
location=(0, -30)) Components
1 Data Also see Lists, NumPy & Pandas
>>> p.add_layout(legend, 'right')
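The five Bokeh steps can be sketched as a self-contained script. This is a minimal example under stated assumptions: it uses `legend_label`, which newer Bokeh releases expect in place of the `legend` keyword shown in this sheet, and it produces embeddable markup with `components` rather than opening a browser with `show`.

```python
from bokeh.embed import components
from bokeh.plotting import figure

# Step 1: prepare some data
x = [1, 2, 3, 4, 5]
y = [6, 7, 2, 4, 5]

# Step 2: create a new plot
p = figure(title="simple line example",
           x_axis_label="x",
           y_axis_label="y")

# Step 3: add a renderer for the data, with visual customizations
r = p.line(x, y, legend_label="Temp.", line_width=2)

# Steps 4 and 5: generate output; here a <script>/<div> pair for embedding
script, div = components(p)
```

To open the plot in a browser instead, replace the last line with `output_file(...)` followed by `show(p)`, as in the steps above.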

Legend Orientation
>>> from [Link] import components
>>> script, div = components(p)
Under the hood, your data is converted to Column Data
Sources. You can also do this manually: >>> [Link] = "horizontal" PNG
>>> import numpy as np >>> [Link] = "vertical"
>>> from [Link] import export_png
>>> import pandas as pd >>> export_png(p, filename="[Link]")
>>> df = [Link]([Link]([[33.9,4,65, 'US'], Legend Background & Border
[32.4,4,66, 'Asia'],
[21.4,4,109, 'Europe']]), >>> [Link].border_line_color = "navy" SVG
columns=['mpg','cyl', 'hp', 'origin'], >>> [Link].background_fill_color = "white"
index=['Toyota', 'Fiat', 'Volvo']) >>> from [Link] import export_svgs
>>> from [Link] import ColumnDataSource Rows & Columns Layout >>> p.output_backend = "svg"
>>> export_svgs(p, filename="[Link]")
>>> cds_df = ColumnDataSource(df) Rows
>>> from [Link] import row
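The manual conversion to a Column Data Source can be sketched as follows. This is a minimal example, assuming the DataFrame is built from a dict of columns rather than from a mixed-type NumPy array: a NumPy array holding both numbers and strings stores every value as a string, which would make the numeric columns unusable as plot coordinates.

```python
import pandas as pd
from bokeh.models import ColumnDataSource
from bokeh.plotting import figure

# A dict of columns keeps mpg, cyl, and hp numeric
df = pd.DataFrame(
    {
        "mpg": [33.9, 32.4, 21.4],
        "cyl": [4, 4, 4],
        "hp": [65, 66, 109],
        "origin": ["US", "Asia", "Europe"],
    },
    index=["Toyota", "Fiat", "Volvo"],
)

# Each DataFrame column (and the index) becomes a named column of the source
cds_df = ColumnDataSource(df)

# Glyph methods can then refer to columns by name
p = figure()
renderer = p.scatter("mpg", "hp", source=cds_df)
```

Glyphs that share one `ColumnDataSource` also share selections, which is what the linked brushing example in this sheet relies on.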

2 Plotting >>> layout = row(p1,p2,p3)

Columns
5 Show or Save Your Plots
>>> from [Link] import figure >>> from [Link] import column >>> show(p1) >>> show(layout)
>>> p1 = figure(plot_width=300, tools='pan,box_zoom') >>> layout = column(p1,p2,p3) >>> save(p1) >>> save(layout)
>>> p2 = figure(plot_width=300, plot_height=300, Nesting Rows & Columns
x_range=(0, 8), y_range=(0, 8)) >>>layout = row(column(p1,p2), p3) DataCamp
>>> p3 = figure() Learn Python for Data Science Interactively
