
Python Tips & Tricks for Machine Learning
Predictive Analysis Guide

General Tips

Stop wasting time explicitly importing all the data science libraries you need in your dev environment. Instead, use Pyforest to lazily import them for you only when you need them:

pip install pyforest

Tired of writing for loops to join lists? Use the zip function instead:

Initial data    a = ("John", "Charles", "Mike")
                b = ("For", "Against", "For")
Zip function    x = zip(a, b)
Result          (('John', 'For'), ('Charles', 'Against'), ('Mike', 'For'))
Visualization Tips

Tired of rendering static plots in Jupyter with %matplotlib? Try the %matplotlib notebook magic instead, which gives you interactive plots you can resize and zoom in on.

Want to more easily spot patterns in tabulated data? Create a heatmap using Seaborn's gradient capabilities. For example:

hm = sns.light_palette('green', as_cmap=True)
style = df.style.background_gradient(cmap=hm)
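
A self-contained sketch of the gradient trick, assuming an illustrative numeric DataFrame:

import numpy as np
import pandas as pd
import seaborn as sns

# Illustrative data; any numeric DataFrame works.
df = pd.DataFrame(np.random.randn(5, 5), columns=list('ABCDE'))

# Shade each cell from white to green according to its value.
hm = sns.light_palette('green', as_cmap=True)
style = df.style.background_gradient(cmap=hm)
style  # in Jupyter, the styled table renders inline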

Pandas Tips

Shortcut your initial data analysis with pandas' profiling function, which generates a detailed report of your data (missing values, variable counts, etc.) in just one line of code. For example:

df = pd.read_csv('somedata.csv')
df.profile_report()
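
The profile_report method is not part of core pandas; it is registered on DataFrames by the pandas-profiling package (since renamed ydata-profiling), so a sketch of the full workflow looks like this:

import pandas as pd
import pandas_profiling  # registers the .profile_report() accessor

df = pd.read_csv('somedata.csv')   # path from the example above
report = df.profile_report()
report.to_file('report.html')      # or let it render inline in Jupyter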

Speed up pandas operations with pandarallel. For example, instead of df.progress_apply(), use df.parallel_apply() to run the process in parallel.
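
A minimal sketch, assuming pandarallel is installed (pip install pandarallel):

import pandas as pd
from pandarallel import pandarallel

# Start the worker pool once per session.
pandarallel.initialize(progress_bar=True)

df = pd.DataFrame({'x': range(100_000)})

# Drop-in parallel replacement for df.apply / df.progress_apply.
result = df.parallel_apply(lambda row: row['x'] ** 2, axis=1)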

Need to unstack a table? Use pandas, which can convert one level of an index into the columns of your data frame. For example:

Initial data (trimmed):

state  email_provider
AK     aol.com
       hotmail.cm
       cox.net
       kitty.com
AR     deleo.com
AZ     yahoo.com
       aol.com
       cox.net
       nikolozakes.org
       parvis.com
CA     gmail.com
       cox.net
       aol.com

Unstack code:

clients.groupby('state')['email_provider'].value_counts().unstack().fillna(0)

Result (trimmed):

email_provider  angalich.com  ankeny.org  aol.com
state
ar                       0.0         0.0      2.0
az                       0.0         0.0      0.0
ct                       0.0         0.0      0.0
dc                       0.0         0.0      1.0
fl                       0.0         0.0      0.0
ga                       0.0         0.0      4.0
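
A runnable sketch of the same pattern, with a hypothetical clients frame standing in for the real data:

import pandas as pd

# Hypothetical data mirroring the example above.
clients = pd.DataFrame({
    'state': ['AK', 'AK', 'AZ', 'CA', 'CA'],
    'email_provider': ['aol.com', 'cox.net', 'yahoo.com', 'gmail.com', 'aol.com'],
})

# Count providers per state, pivot providers into columns, fill gaps with 0.
counts = (clients.groupby('state')['email_provider']
                 .value_counts()
                 .unstack()
                 .fillna(0))
print(counts)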

Jupyter Notebook Tips

Need to debug your code in Jupyter notebooks? Just type %debug to launch an interactive debugger that takes you back to the point where the exception happened. Press q to exit.

Computational costs matter. To check the running time of a block of code in a Jupyter notebook, preface the code block with %%time.

Working with Python in a Jupyter Notebook, but wish you had access to R? Now you can run both of them together; just pip install rpy2.
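
A sketch of the typical rpy2 notebook workflow once the package is installed:

# Load the IPython extension in a notebook cell:
%load_ext rpy2.ipython

# Then run R code in any cell with the %%R magic;
# -i pushes a Python object (e.g. a pandas DataFrame) into R:
%%R -i df
summary(df)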
Scikit-Learn Tips

Always use the stratify parameter to ensure the train and test sets preserve the same class proportions, for better prediction and reproducibility of results. For example:

train_x, test_x, train_y, test_y = train_test_split(x, y, random_state=59, stratify=y)

Missing values in your dataset? Don't settle for univariate methods to impute the missing values when scikit-learn's multivariate, k-Nearest-Neighbors-based imputation can offer better accuracy:

impute = KNNImputer(n_neighbors=2)
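
A self-contained sketch using the toy matrix from the scikit-learn docs:

import numpy as np
from sklearn.impute import KNNImputer

X = np.array([[1.0, 2.0, np.nan],
              [3.0, 4.0, 3.0],
              [np.nan, 6.0, 5.0],
              [8.0, 8.0, 7.0]])

# Each missing entry is replaced by the mean of that feature
# across the 2 nearest rows (nan-aware Euclidean distance).
impute = KNNImputer(n_neighbors=2)
X_filled = impute.fit_transform(X)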
Regression

And finally, a simple rule guide for when to apply which regression technique when doing predictive analysis:

Linear       Used to predict the value of a variable (called the dependent variable) based on the value of another variable.
             Y' = bX + A

Stepwise     Used when you have many variables and want to identify a subset of predictors.
             b_j.std = b_j * (s_x / s_y)

Logistic     Used when the dependent variable is a binary value.
             logit(p) = ß0 + ß1x1 + ß2x2

Polynomial   Used for curvilinear data.
             y = ß0 + ß1x + ß2x^2 + ...

Lasso        Best used when you have a small number of significant parameters.
             min (1/N) Σ(i=1..N) ƒ(xi, yi, ß) + λ Σ|ßj|

Ridge        Best used when you have a large number of significant parameters.
             ß = (XᵀX + λI)⁻¹ Xᵀy

ElasticNet   The happy medium between Lasso and Ridge.
             penalty: λ1 Σ(j=1..P) |ßj| + λ2 Σ(j=1..P) ßj²

[Figure: fitted-slope comparison - LinearRegression m = 0.05, Ridge m = 0.02, Lasso m = 0.00, ElasticNet m = 0.00]