0% found this document useful (0 votes)

52 views16 pages

Improve Your Python Code Automatically

Uploaded by

Al Wikah

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

52 views16 pages

Improve Your Python Code Automatically

Uploaded by

Al Wikah

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

Efficient Python Tricks and Tools for Data

Scientists - By Khuyen Tran

Code Review
GitHub View on GitHub Book View Book

This section covers some tools to automatically review and improve your
code such as sorting imports, check for missing docstrings, etc.
isort: Automatically Sort your Python Imports in 1
Line of Code
As your codebase expands, you may find yourself importing numerous
libraries, which can become overwhelming to navigate. To avoid arranging
your imports manually, use isort.

isort is a Python library that automatically sorts imports alphabetically,

grouping them by section and type.

Consider the following example where your imports are unsorted:

from sklearn.metrics import confusion_matrix, fl_score,

classification_report, roc_curve
from sklearn.model_selection import train_test_split
from sklearn.model_selection import GridSearchCV,
StratifiedKFold
from sklearn import svm
from sklearn.naive_bayes import GaussianNB, MultinomialNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import TimeSeriesSplit
By running isort name_of_your_file.py, isort can sort your imports
automatically into the following:

from sklearn import svm

from sklearn.metrics import (classification_report,
confusion_matrix, fl_score,
roc_curve)
from sklearn.model_selection import (GridSearchCV,
StratifiedKFold,
TimeSeriesSplit,
train_test_split)
from sklearn.naive_bayes import GaussianNB, MultinomialNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

You can use isort with pre-commit by adding the following to your .pre-
commit-config.yaml file:

- repo: https://github.com/timothycrosley/isort
rev: 5.12.0
hooks:
- id: isort

Link to isort.
interrogate: Check your Python Code for Missing
Docstrings
!pip install interrogate

Sometimes, you might forget to include docstrings for classes and functions.
Instead of manually searching through all your functions and classes for
missing docstrings, use interrogate.

Consider the following example where there are missing docstrings:

# interrogate_example.py
class Math:
def __init__(self, num) -> None:
self.num = num

def plus_two(self):
"""Add 2"""
return self.num + 2

def multiply_three(self):
return self.num * 3
You can use interrogate to identify missing docstrings:

$ interrogate interrogate_example.py

Output:

RESULT: FAILED (minimum: 80.0%, actual: 20.0%)

You can use interrogate with pre-commit by adding the following to your
.pre-commit-config.yaml file:

- repo: https://github.com/pre-commit/mirrors-interrogate
rev: v1.4.0
hooks:
- id: interrogate

Link to interrogate.
mypy: Static Type Checker for Python

!pip install mypy

Type hinting in Python is useful for other developers to understand the

expected data types to be used in your functions. To automate type checking
in your code, use mypy.

Consider the following file that includes type hinting:

# mypyÏ_example.py
from typing import List, Union

def get_name_price(fruits: list) -> Union[list, tuple]:

return zip(*fruits)

fruits = [('apple', 2), ('orange', 3), ('grape', 2)]

names, prices = get_name_price(fruits)
print(names) # ('apple', 'orange', 'grape')
print(prices) # (2, 3, 2)
When typing the following command on your terminal:

$ mypy mypy_example.py

you will get the output similar to this:

mypy_example.py:4: error: Incompatible return value type

(got "zip[Any]", expected "Union[List[Any], Tuple[Any,
...]]")

You can use mypy with pre-commit by adding the following to your .pre-
commit-config.yaml file:

repos:
- repo: https://github.com/pre-commit/mirrors-mypy
rev: v0.910
hooks:
- id: mypy

Link to mypy.
Refurb: Refurbish and Modernize Python Codebases
If you want to have some guidelines to improve and optimize your code, try
Refurb.

For example, if you have a file like this:

# test_refurb.py
for n in [1, 2, 3, 4]:
if n == 2 or n == 4:
res = n/2

You can use Refurb to refurbish your code.

$ refurb test_refurb.py

test_refurb.py:1:10 [FURB109]: Replace `in [x, y, z]` with

`in (x, y, z)`
test_refurb.py:2:8 [FURB108]: Use `x in (y, z)` instead of
`x == y or x == z`

Run `refurb --explain ERR` to further explain an error.

Use `--quiet` to silence this message
$refurb test_refurb.py --explain FURB109

['Since tuple, list, and set literals can be used with the
`in` operator, it',
'is best to pick one and stick with it.',
'',
'Bad:',
'',
'```',
'for x in [1, 2, 3]:',
' pass',
'',
'nums = [str(x) for x in [1, 2, 3]]',
'```',
'',
'Good:',
'',
'```',
'for x in (1, 2, 3):',
' pass',
'',
'nums = [str(x) for x in (1, 2, 3)]',
'```']

Refurb only works with Python 3.10 and above.

You can use Refurb with pre-commit by adding the following to your .pre-
commit-config.yaml file:

repos:
- repo: https://github.com/dosisod/refurb
rev: REVISION
hooks:
- id: refurb

Link to Refurb.
Pydantic: Enforce Data Types on Your Function
Parameters at Runtime
!pip install pydantic

If you want to enforce data types on your function parameters and validate
their values at runtime, use Pydantic.

In the code below, since the value of test_size is a string, Pydantic raises a
ValidationError.

from pydantic import BaseModel

class ProcessConfig(BaseModel):
drop_columns: list = ["a", "b"]
target: str = "y"
test_size: float = 0.3
random_state: int = 1
shuffle: bool = True
def process(config: ProcessConfig = ProcessConfig()):
target = config.target
test_size = config.test_size
...

process(ProcessConfig(test_size="a"))

ValidationError: 1 validation error for ProcessConfig

test_size
value is not a valid float (type=type_error.float)

Link to Pydantic.

Build a full-stack ML application with Pydantic and Prefect.

perfplot: Performance Analysis for Python Snippets
!pip install perfplot

If you want to compare the performance between different snippets and plot
the results, use perfplot.

Consider the following file that includes three functions that create a list.

import perfplot

def append(n):
l = []
for i in range(n):
l.append(i)
return l

def comprehension(n):
return [i for i in range(n)]

def list_range(n):
return list(range(n))
To visualize the perfomance of these functions, use the perfplot.show
method.

perfplot.show(
setup=lambda n: n,
kernels=[
append,
comprehension,
list_range,
],
n_range=[2**k for k in range(25)],
)

Link to perfplot.
Analyze the Memory Usage of Your Python Code
!pip install memory_profiler

If you want to analyze the memory consumption of your Python code line-
by-line, use memory_profiler. This package allows you to generate a full
memory usage report of your executable and plot it.

$ mprof run memory_profiler_test.py

mprof: Sampling memory every 0.1s

Line # Mem usage Increment Line Contents

========================================================
4 41.9 MiB 41.9 MiB @profile
5 def func():
6 49.5 MiB 7.6 MiB a = [1]*(10**6)
7 202.1 MiB 152.6 MiB b = [2]*(2*10**7)
8 49.5 MiB -152.6 MiB del b
9 49.5 MiB 0.0 MiB return a
Plot the memory usage:

$ mprof plot

Link to memory_profiler.

System Adm. Question and Answer
No ratings yet
System Adm. Question and Answer
15 pages
The Art of Quantitative Finance Vol.2 Volatilities, Stochastic Analysis and Valuation Tools (Gerhard Larcher) (Z-Library)
No ratings yet
The Art of Quantitative Finance Vol.2 Volatilities, Stochastic Analysis and Valuation Tools (Gerhard Larcher) (Z-Library)
363 pages
Python: Learn Python in 24 Hours
From Everand
Python: Learn Python in 24 Hours
Alex Nordeen
4/5 (12)
Important (Python Built-In Methods) (CheatSheet)
No ratings yet
Important (Python Built-In Methods) (CheatSheet)
6 pages
Machine Learning in Microservices Productionizing Microservices Architecture For Machine Learning Solutions (Mohamed Abouahmed, Omar Ahmed) (Z-Library)
No ratings yet
Machine Learning in Microservices Productionizing Microservices Architecture For Machine Learning Solutions (Mohamed Abouahmed, Omar Ahmed) (Z-Library)
270 pages
Hazard Analysis and Critical Control Point (Haccp) : A Food Protection System
100% (2)
Hazard Analysis and Critical Control Point (Haccp) : A Food Protection System
13 pages
Hermes-Pjt Training Manual: RSJ1/RSH1
100% (4)
Hermes-Pjt Training Manual: RSJ1/RSH1
152 pages
POSManual
100% (2)
POSManual
85 pages
Income Taxation 2021 Rex Banggawan Answers Multiple Choice-Theory: General Concepts
100% (1)
Income Taxation 2021 Rex Banggawan Answers Multiple Choice-Theory: General Concepts
6 pages
30 Python Best Practices, Tips, and Tricks by Erik Van Baaren Python Land Medium
No ratings yet
30 Python Best Practices, Tips, and Tricks by Erik Van Baaren Python Land Medium
23 pages
Tips For Testing in Python 1646539645
No ratings yet
Tips For Testing in Python 1646539645
23 pages
Chapter1-Foundations For Efficiencies
No ratings yet
Chapter1-Foundations For Efficiencies
5 pages
Python's Built-In Utilities
No ratings yet
Python's Built-In Utilities
5 pages
Lecture 2
No ratings yet
Lecture 2
19 pages
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
From Everand
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
Marcus Richards
No ratings yet
Quick Python Guide
From Everand
Quick Python Guide
Coder1
No ratings yet
Python Best Practices Tips and Tricks
No ratings yet
Python Best Practices Tips and Tricks
12 pages
Data Analysis Python Read The Docs Io en Latest
No ratings yet
Data Analysis Python Read The Docs Io en Latest
79 pages
TOC Python
No ratings yet
TOC Python
10 pages
Python Indepth Live Session
No ratings yet
Python Indepth Live Session
8 pages
Python: Advanced Guide to Programming Code with Python
From Everand
Python: Advanced Guide to Programming Code with Python
Charlie Masterson
No ratings yet
Expanded List of Additional Built-In Functions
No ratings yet
Expanded List of Additional Built-In Functions
4 pages
Advanced Python Hacks
No ratings yet
Advanced Python Hacks
11 pages
Python: Advanced Guide to Programming Code with Python: Python Computer Programming, #4
From Everand
Python: Advanced Guide to Programming Code with Python: Python Computer Programming, #4
Charlie Masterson
No ratings yet
Functions Programs
No ratings yet
Functions Programs
9 pages
10 Python Built-In Functions That Will Simplify Your Code
No ratings yet
10 Python Built-In Functions That Will Simplify Your Code
8 pages
Pyhton Potential Interview Questions
No ratings yet
Pyhton Potential Interview Questions
34 pages
Intership Body
No ratings yet
Intership Body
31 pages
Datascienceusing Python Training
No ratings yet
Datascienceusing Python Training
11 pages
PYTHON
No ratings yet
PYTHON
164 pages
Pytest
No ratings yet
Pytest
5 pages
DS Final
No ratings yet
DS Final
46 pages
Common Python Data Science Interview Questions1
No ratings yet
Common Python Data Science Interview Questions1
5 pages
X Sa Xy CQitu ZXNBND
No ratings yet
X Sa Xy CQitu ZXNBND
16 pages
Logging & Debugging Python Tricks
No ratings yet
Logging & Debugging Python Tricks
22 pages
Built in Function
No ratings yet
Built in Function
16 pages
Python Mid Syllabus Solved
No ratings yet
Python Mid Syllabus Solved
10 pages
Python For Web Development Pre
No ratings yet
Python For Web Development Pre
15 pages
Python QA
No ratings yet
Python QA
97 pages
Pythonn SE
No ratings yet
Pythonn SE
18 pages
Simplifying Data Science With Python
From Everand
Simplifying Data Science With Python
Billy David millican
No ratings yet
Mastering Python: A Comprehensive Guide for Beginners and Experts
From Everand
Mastering Python: A Comprehensive Guide for Beginners and Experts
Rick Spair
No ratings yet
DS ML Python
No ratings yet
DS ML Python
4 pages
Ct3 QB Answers
No ratings yet
Ct3 QB Answers
8 pages
Python Lab PRG
No ratings yet
Python Lab PRG
20 pages
Python-Deprecated Library v1.1 Documentation
From Everand
Python-Deprecated Library v1.1 Documentation
Laurent LAPORTE
No ratings yet
Python Self Study Material
0% (1)
Python Self Study Material
9 pages
Python Built in Functions Notes
No ratings yet
Python Built in Functions Notes
2 pages
Easy Programming for Everyone
From Everand
Easy Programming for Everyone
Umar Asghar
No ratings yet
Part1 Cours Python
No ratings yet
Part1 Cours Python
62 pages
A Beginner's guide to Python
From Everand
A Beginner's guide to Python
Steven Mcananey
No ratings yet
Built-In Functions and Methods
No ratings yet
Built-In Functions and Methods
3 pages
Document
No ratings yet
Document
16 pages
Python Summary
No ratings yet
Python Summary
10 pages
MDA File
No ratings yet
MDA File
37 pages
Python Notes
No ratings yet
Python Notes
11 pages
Python Ultimate Guide
100% (1)
Python Ultimate Guide
10 pages
08 - Mixedprogramming: 1 Mixed Programming
No ratings yet
08 - Mixedprogramming: 1 Mixed Programming
41 pages
ML Lab File Vijay Kumar
No ratings yet
ML Lab File Vijay Kumar
27 pages
Manoj 5th Sem Project Report
No ratings yet
Manoj 5th Sem Project Report
20 pages
Python U-5 Combined Notes
No ratings yet
Python U-5 Combined Notes
76 pages
Machine Learning - Manual
No ratings yet
Machine Learning - Manual
32 pages
ML Lab File
No ratings yet
ML Lab File
33 pages
Data Science Online Training Course Content 1626830873
No ratings yet
Data Science Online Training Course Content 1626830873
26 pages
Chapter 1
No ratings yet
Chapter 1
28 pages
Prac1 AAM
No ratings yet
Prac1 AAM
6 pages
Packages in Python
No ratings yet
Packages in Python
54 pages
The Sovereign Individual PDF
No ratings yet
The Sovereign Individual PDF
25 pages
Vertex Ai Agent Builder
No ratings yet
Vertex Ai Agent Builder
3 pages
???: ??? ??? ?? ??????? ?? ?????????!
No ratings yet
???: ??? ??? ?? ??????? ?? ?????????!
6 pages
BBG Feb 23rd 2024 NF
No ratings yet
BBG Feb 23rd 2024 NF
14 pages
ThesisFinal - Predicting Forex Rates Using Sentiment
No ratings yet
ThesisFinal - Predicting Forex Rates Using Sentiment
49 pages
Manual ICT 2022
No ratings yet
Manual ICT 2022
75 pages
MIS602 - Assessment 3 - 20240603
No ratings yet
MIS602 - Assessment 3 - 20240603
5 pages
3314911-030 Int-Eng Lifepak 15 Operating Instructions
No ratings yet
3314911-030 Int-Eng Lifepak 15 Operating Instructions
318 pages
Lee Jenevieve Resume
No ratings yet
Lee Jenevieve Resume
2 pages
ECE Group4 FACEMASK
No ratings yet
ECE Group4 FACEMASK
17 pages
Khatoon Bibi Vs Abdul Wahab PDF
No ratings yet
Khatoon Bibi Vs Abdul Wahab PDF
8 pages
Theory Exams - How You Sit Your Exam
No ratings yet
Theory Exams - How You Sit Your Exam
9 pages
Enrolio Documentation
No ratings yet
Enrolio Documentation
48 pages
Plant Tissue Culture
No ratings yet
Plant Tissue Culture
3 pages
Venkatesh Resume
100% (1)
Venkatesh Resume
6 pages
List of Accepted Street Lighting August 2017 PDF
No ratings yet
List of Accepted Street Lighting August 2017 PDF
12 pages
Translation Application
No ratings yet
Translation Application
11 pages
Maths 06 2008 v1
No ratings yet
Maths 06 2008 v1
38 pages
PEO Memorandum (Changed Names)
No ratings yet
PEO Memorandum (Changed Names)
11 pages
Amazon Strategy
No ratings yet
Amazon Strategy
9 pages
Nielson Vs Lepanto
100% (1)
Nielson Vs Lepanto
16 pages
Akash Shukla POSTER
No ratings yet
Akash Shukla POSTER
1 page
Cost Driver
100% (1)
Cost Driver
37 pages
Rails Magazine - Issue #7: Field Day
100% (1)
Rails Magazine - Issue #7: Field Day
28 pages
6021-P2-Lembar Kerja 2023
No ratings yet
6021-P2-Lembar Kerja 2023
36 pages
Class 11 IP Ch-2, 3, 4 Python Programming Basics Notes
50% (2)
Class 11 IP Ch-2, 3, 4 Python Programming Basics Notes
37 pages
1 PPT Underwriting of Shares and Debentures - 95673
No ratings yet
1 PPT Underwriting of Shares and Debentures - 95673
14 pages
Nepal Telecom Intern of AtulLekhnathMukesh Prabin2017
60% (5)
Nepal Telecom Intern of AtulLekhnathMukesh Prabin2017
51 pages
C16-Chokes and Degasser
100% (1)
C16-Chokes and Degasser
68 pages
Mathematics A: 4MA1/2FR
No ratings yet
Mathematics A: 4MA1/2FR
24 pages

Improve Your Python Code Automatically

Uploaded by

Improve Your Python Code Automatically

Uploaded by

Efficient Python Tricks and Tools for Data

Scientists - By Khuyen Tran

isort is a Python library that automatically sorts imports alphabetically,

Consider the following example where your imports are unsorted:

from sklearn.metrics import confusion_matrix, fl_score,

from sklearn import svm

Consider the following example where there are missing docstrings:

RESULT: FAILED (minimum: 80.0%, actual: 20.0%)

!pip install mypy

Type hinting in Python is useful for other developers to understand the

Consider the following file that includes type hinting:

def get_name_price(fruits: list) -> Union[list, tuple]:

fruits = [('apple', 2), ('orange', 3), ('grape', 2)]

you will get the output similar to this:

mypy_example.py:4: error: Incompatible return value type

For example, if you have a file like this:

You can use Refurb to refurbish your code.

test_refurb.py:1:10 [FURB109]: Replace `in [x, y, z]` with

Run `refurb --explain ERR` to further explain an error.

Refurb only works with Python 3.10 and above.

from pydantic import BaseModel

ValidationError: 1 validation error for ProcessConfig

Build a full-stack ML application with Pydantic and Prefect.

$ mprof run memory_profiler_test.py

mprof: Sampling memory every 0.1s

Line # Mem usage Increment Line Contents

You might also like