0% found this document useful (0 votes)

32 views11 pages

Python Notes

Uploaded by

Niranjan Patidar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

32 views11 pages

Python Notes

Uploaded by

Niranjan Patidar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

[Link]

pdf

Smart Syntax

with_suffix
## Bad
new_filepath = str(Path("[Link]"))[:4] + ".md"

## Good
new_filepath = Path("[Link]").with_suffix(".md")

New Concepts
Walrus Operator

Make value assignment on the go!!

The walrus operator (:=) is an assignment expression introduced in Python 3.8. It allows you to

assign values to variables as part of an expression. Before its introduction, assignment was only

allowed in standalone statements, but the walrus operator enables assignment within

expressions like loops, conditions, and function calls.

The walrus operator reduces redundancy by combining assignment and evaluation in a single

step, which is especially useful in loops and conditional statements.

It enhances readability and efficiency, reducing the number of lines of code by eliminating the

need for separate assignment steps. Additionally, it helps streamline code when you need to

evaluate and assign a value that will be used multiple times.

while True:
user_input = input("Enter a valid string (non-empty): ")
if len(user_input) > 0:
break
print(f"Valid input received: {user_input}")

In the above code, the string is evaluated twice — once when reading the input and again in

the if condition to check its length.

while (user_input := input("Enter a valid string (non-empty): ")) and len(user_input) == 0:

print("Invalid input, try again.")
print(f"Valid input received: {user_input}")

Memory Slots

You knew saving memory is cool, but saving millions of bytes is even cooler.

In Python, the __slots__ feature is used to reduce memory usage by limiting the attributes an

object can have. Normally, Python uses a dynamic dictionary (__dict__) to store attributes of an

object, which allows for flexibility but consumes more memory.

By defining __slots__, you explicitly declare a fixed set of attributes, eliminating the use of

a dict and reducing the memory footprint.

 Memory Efficient: Objects with __slots__ use less memory because they avoid the overhead

of the attribute dictionary.

 Well Optimized: Access to attributes is faster since there’s no need to look them up in a

dynamic dictionary.

Imagine you’re creating millions of lightweight User objects in a system where each user only

needs a few attributes (name, email). Reducing memory usage can drastically improve

performance.

class User:
def __init__(self, name, email):
[Link] = name
[Link] = email

# Creating a million users without `slots`

users = [User(f"User{i}", f"@[Link]">user{i}@[Link]") for i in range(1000000)]

In this case, every User object stores attributes in a dynamic dictionary, which consumes more

memory. Using __slots__ significantly reduces the memory overhead since objects get stored in

predefined slots without the need for a dictionary making the system more efficient when

creating large numbers of objects.

class User:
__slots__ = ['name', 'email'] # Declare fixed attributes
def __init__(self, name, email):
[Link] = name
[Link] = email

# Creating a million users with `slots`

users = [User(f"User{i}", f"@[Link]">user{i}@[Link]") for i in range(1000000)]

functools.lru_cache

In Python, functools.lru_cache is a decorator that provides a simple yet effective way to add

memoization to a function. Memoization is a technique used to speed up programs by caching

the results of expensive function calls and returning the cached result when the same inputs

occur again.

This can drastically improve performance for functions that are computationally expensive or

frequently called with the same arguments.

The lru_cache stands for "Least Recently Used Cache", which means that if the cache reaches its

maximum size, it will discard the least recently used items first.

from functools import lru_cache

@lru_cache(maxsize=None) # maxsize=None means the cache can grow indefinitely
def fibonacci(n):
if n < 2:
return n
return fibonacci(n-1) + fibonacci(n-2)

# Calculate Fibonacci numbers

print(fibonacci(10)) # Output: 55
print(fibonacci(15)) # Output: 610

requests -> httpx

Requests (Yes, Really!)

Look, there is nothing inherently wrong with Requests. It’s intuitive, it has a great API, and it’s

practically the mascot of Python HTTP libraries. But it’s overkill for when you just need to make

simple GET/POST requests, and it will lag in environments where you want asynchronous

performance.

Why It’s Overrated:

Blocking IO: Requests is synchronous, which means each call waits for the previous call to finish.

This is less than ideal when working with I/O-bound programs.

Heavy: It’s got loads of convenience baked in, but it does have a cost in terms of speed and

memory footprint. Not a big deal on a simple script, but on larger systems this can be a resource

hog.
What You Should Instead Use: httpx

For parallel processing of requests, httpxprovides a similar API but with asynchronous support.

So, if you make many API calls, it’ll save you some time and resources because it will process

those requests concurrently.

import httpx

async def fetch_data(url):

async with [Link]() as client:
response = await [Link](url)
return [Link]()

# Simple and non-blocking

data = fetch_data("[Link]

Pro Tip: Asynchronous requests can reduce the processing time by a great amount if the task at

hand is web scraping or ingesting data from somewhere.

BeautifulSoup -> selectolax

2. BeautifulSoup (Yup, This One Too)

Alright, I know this is controversial. BeautifulSoup has been the standard library to tackle HTML

parsing for years, but it’s not really performing as well as it used to. Large or complex documents

have the tendency to make Beauti-fulSoup feel sluggish, and it hasn’t evolved to keep up with

Python’s async-first landscape.

Why It’s Overrated:

Speed: Not very fast, when the size of a document is very big.

Thread blocking: Much like Requests itself, it is not designed with async in mind, which certainly

makes it ill-suited for scraping dynamic websites.

Instead What you should use: selectolax

selectolax is a less famous library that uses libxml2 for better performance and with less memory

consumption.

from [Link] import HTMLParser

html_content = "<html><body><p>Test</p></body></html>"
tree = HTMLParser(html_content)
text = [Link]("p")[0].text()
print(text) # Output: Test

As it will turn out, by using Selectolax, you retain the same HTML parsing capabilities but with

much-enhanced speed, making it ideal for web scraping tasks that are quite data-intensive.

“Do not fall in love with the tool; rather, fall in love with the outcome.” Choosing the proper

tool is half the battle.

Pandas -> polaris

3. Pandas for All Data Manipulation Tasks

Now, listen up-the thing is, Pandas is great at data exploration and for middle-sized datasets. But

people just use it for everything, like it’s some magic solution that’s going to solve every problem

in data, and quite frankly, it isn’t. Working with Pandas on huge datasets can turn your machine

into a sputtering fan engine, and memory overhead just doesn’t make sense for some

workflows.
Why It Is Overrated:

Memory Usage: As Pandas operates mainly in-memory, any operation on a large dataset will

badly hit performance.

Limited Scalability: Scaling with Pandas isn’t easy. It was never designed for big data.

What You Should Use Instead: Polars

Polars is an ultra-fast DataFrame library in Rust using Apache Arrow. Optimized for memory

efficiency and multithreaded performance, this makes it perfect for when you want to crunch

data without heating up your CPU.

import polars as pl

df = pl.read_csv("big_data.csv")
filtered_df = [Link]([Link]("value") > 50)
print(filtered_df)

Why Polars? It will process data that would bring Pandas to its knees, and it handles operations

in a fraction of the time. Besides that, it also has lazy evaluation-meaning it is only computing

what’s needed.

Dictionary
- Hashing is calculated on the key and if keys have same has values then last key is kept as
it is but value is replaced with latest one. Hence for visually same key, last value is taken
in case of multiple key entries
- Key can only be added to the dictionary if hashing is possible for the key. Like list cannot
be added as key as it’s hashing is not possible. But behavior can be imposed see
example function listaskey ().
- whenever we add an object as a dictionary’s key, Python invokes the __𝐡𝐚𝐬𝐡__
function of that object’s class.
Hashing

my_dict = {'1': 'string', True: 'bool', 1: 'int', 1.0: float}

print(my_dict)

# o/p: {'1': 'string', True: <class 'float'>}

Reason
- In Python, dictionaries find a key based on the equivalence of hash (computed using
hash()), but not identity (computed using id()).
- Hash of True. 1 and 1.0 are same (1).
- Now as hash is same so key Is considered same and last kept value is considered.
- But key remains with initial value only because it overwrite values

This is because, at first, True is added as a key and its value is 'bool'. Next, while adding the
key 1, python recognizes it as an equivalence of the hash value.

Thus, the value corresponding to True is overwritten by 'int', while the key (True) is kept as
is.

Finally, while adding 1.0, another hash equivalence is encountered with an existing key of
True. Yet again, the value corresponding to True, which was updated to 'int' in the previous
step, is overwritten by 'float'.

__missing__
class customMissing(dict):
def __missing__(self,key):
self[key] = ls = []
return ls
def implementMissing(self):
d = customMissing()
d['1'] = 'a'
d['2'] = 'b'
print(d['fd'])

Date and Time

Refer
[Link]

Classes of DateTime Python Module

1. The date data type stores calendar date information, including the year, month, and
day. It allows you to represent a specific date on the calendar.
2. The time data type stores the time of day, including the hour, minute, second, and
microsecond. It allows you to represent a specific point in time each day.
3. The datetime data type combines the date and time data types to store both calendar
date and time of day information together. It allows you to represent a full timestamp,
specifying both when something happened and what day it occurred on.
4. The timedelta data type is used to compute the difference between two dates, times, or
datetimes. It allows you to calculate the amount of time between two points in time so
you can determine how much time has passed or how much time remains until a future
date.
5. The tzinfo data type is used to store timezone information. It allows you to specify the
timezone for a particular date, time, or datetime value so you know the local time
represented and can correctly handle daylight saving time and other timezone-related
adjustments.
import datetime

# Get current date and time

now = [Link]()

# Get date from now

date = [Link]()

# Get time from now

time = [Link]()

# Get datetime from now

datetime_now = [Link]()

# Find difference between two datetimes

difference = [Link]() - [Link](2017, 7, 1)

print("difference - ", difference)

# Create a timezone aware datetime

tz_aware_datetime = [Link](2017, 7, 1,
tzinfo=[Link])

print("tz_aware_datetime - ", tz_aware_datetime)

Python For Data Engineering
No ratings yet
Python For Data Engineering
18 pages
Python Cheat Sheet - The Basics CC
No ratings yet
Python Cheat Sheet - The Basics CC
2 pages
Python Cheat Sheet - The Basics Coursera
No ratings yet
Python Cheat Sheet - The Basics Coursera
2 pages
Pyhton Potential Interview Questions
No ratings yet
Pyhton Potential Interview Questions
34 pages
Python Cheat Sheet - The Basics Coursera
No ratings yet
Python Cheat Sheet - The Basics Coursera
2 pages
Python Cheat Sheet - The Basics Edx
No ratings yet
Python Cheat Sheet - The Basics Edx
2 pages
DS ML Python
No ratings yet
DS ML Python
4 pages
DeepSeek - Python Tutorial
No ratings yet
DeepSeek - Python Tutorial
8 pages
Isom3400 Cheatsheet
No ratings yet
Isom3400 Cheatsheet
29 pages
Ultimate Python Cheat Sheet Guide
No ratings yet
Ultimate Python Cheat Sheet Guide
94 pages
Python Self Study Material
0% (1)
Python Self Study Material
9 pages
Python Functions & Data Structures
No ratings yet
Python Functions & Data Structures
16 pages
Intership Body
No ratings yet
Intership Body
31 pages
Week 1: 1 The Python Programming Language: Functions
No ratings yet
Week 1: 1 The Python Programming Language: Functions
9 pages
Real Python Interview Questions American Express
No ratings yet
Real Python Interview Questions American Express
7 pages
Unit 2
No ratings yet
Unit 2
15 pages
30 Python Best Practices, Tips, and Tricks by Erik Van Baaren Python Land Medium
No ratings yet
30 Python Best Practices, Tips, and Tricks by Erik Van Baaren Python Land Medium
23 pages
120+ Py Interview Q&A Py
100% (1)
120+ Py Interview Q&A Py
137 pages
PythonProgrammingStudyGuide StudyGuide
No ratings yet
PythonProgrammingStudyGuide StudyGuide
8 pages
Python
No ratings yet
Python
35 pages
Rapti Lab Report Python (Nagendra)
No ratings yet
Rapti Lab Report Python (Nagendra)
17 pages
Comprehensive Python Guide
No ratings yet
Comprehensive Python Guide
6 pages
Cat 2 Python
No ratings yet
Cat 2 Python
20 pages
Python Cheat Sheet: Syntax & Functions
No ratings yet
Python Cheat Sheet: Syntax & Functions
19 pages
Python Best Practices Tips and Tricks
No ratings yet
Python Best Practices Tips and Tricks
12 pages
Data Analysis Python Read The Docs Io en Latest
No ratings yet
Data Analysis Python Read The Docs Io en Latest
79 pages
PythonVK Notes
No ratings yet
PythonVK Notes
219 pages
Intermediate Python by Yasoob, Muhammad Khalid, Ullah
No ratings yet
Intermediate Python by Yasoob, Muhammad Khalid, Ullah
93 pages
PY0101 - Python For Data Science, AI, & Development Cheat Sheet
No ratings yet
PY0101 - Python For Data Science, AI, & Development Cheat Sheet
2 pages
Comprehensive Guide To PYTHON
No ratings yet
Comprehensive Guide To PYTHON
17 pages
Python for Py-Curious Programmers
No ratings yet
Python for Py-Curious Programmers
131 pages
Python Project Tutorial for Programmers
No ratings yet
Python Project Tutorial for Programmers
131 pages
Python Dictionary Tips to Know Early
100% (1)
Python Dictionary Tips to Know Early
8 pages
Python Cheat Sheet
No ratings yet
Python Cheat Sheet
16 pages
Python Cheat Sheet: Topics
No ratings yet
Python Cheat Sheet: Topics
16 pages
Unit 3 Python Notes
No ratings yet
Unit 3 Python Notes
12 pages
Discrete Structures Lab 1 Python Basics: 1 Python Installation 2 Python Tutorial
No ratings yet
Discrete Structures Lab 1 Python Basics: 1 Python Installation 2 Python Tutorial
8 pages
Python For Data Science AI Development
No ratings yet
Python For Data Science AI Development
7 pages
Python Record Manual
No ratings yet
Python Record Manual
18 pages
Python for Programmers: Project Guide
No ratings yet
Python for Programmers: Project Guide
131 pages
Introduction to Python Programming
No ratings yet
Introduction to Python Programming
53 pages
Python PPR
No ratings yet
Python PPR
7 pages
Python Technical Interviews Questions
100% (2)
Python Technical Interviews Questions
15 pages
Introductiontocourse: 1 The Python Programming Language: Functions
No ratings yet
Introductiontocourse: 1 The Python Programming Language: Functions
11 pages
CPP To Python
No ratings yet
CPP To Python
4 pages
Python Unit1
No ratings yet
Python Unit1
46 pages
Python Cheat Sheet Overview
No ratings yet
Python Cheat Sheet Overview
7 pages
Chapter1 Getting Started
No ratings yet
Chapter1 Getting Started
5 pages
Short Notes On Python
No ratings yet
Short Notes On Python
12 pages
Python Cheat Sheet
No ratings yet
Python Cheat Sheet
11 pages
Java Certification - Questions and Answers
No ratings yet
Java Certification - Questions and Answers
37 pages
Principles of Programming Language Unit 1
No ratings yet
Principles of Programming Language Unit 1
27 pages
Infrastructure Penetration
No ratings yet
Infrastructure Penetration
6 pages
C# Database Programming Guide
100% (1)
C# Database Programming Guide
69 pages
SAP Security Role Change Request
No ratings yet
SAP Security Role Change Request
8 pages
Agile Development Overview
No ratings yet
Agile Development Overview
10 pages
Improving ETL Quality with ISO/IEC 25010
No ratings yet
Improving ETL Quality with ISO/IEC 25010
4 pages
100%FINALOOP
No ratings yet
100%FINALOOP
14 pages
Lab No 02
No ratings yet
Lab No 02
8 pages
Understanding Application Software Types
No ratings yet
Understanding Application Software Types
16 pages
Logbook
No ratings yet
Logbook
13 pages
7.x86 - Architecture
No ratings yet
7.x86 - Architecture
94 pages
Student Enrollment Database Schema
No ratings yet
Student Enrollment Database Schema
5 pages
Class 7 Computer Assignment Booklet
No ratings yet
Class 7 Computer Assignment Booklet
6 pages
OVR Instaaltion
No ratings yet
OVR Instaaltion
2 pages
How To Be A Coder - Learn To Think Like A Coder With Fun Activities
100% (12)
How To Be A Coder - Learn To Think Like A Coder With Fun Activities
146 pages
SAP Tester with 9+ Years Experience
100% (2)
SAP Tester with 9+ Years Experience
6 pages
Smarter Work Management System Overview
100% (1)
Smarter Work Management System Overview
31 pages
Linux Advanced Bash-Scripting Guide
No ratings yet
Linux Advanced Bash-Scripting Guide
978 pages
EBS - APEX Donald Clarke
No ratings yet
EBS - APEX Donald Clarke
14 pages
Test Automation Trends and Insights 2018
100% (1)
Test Automation Trends and Insights 2018
15 pages
Java Software Solutions For Ap Computer Science 3rd Edition Loftus Download
No ratings yet
Java Software Solutions For Ap Computer Science 3rd Edition Loftus Download
78 pages
Ewm Kittting
No ratings yet
Ewm Kittting
4 pages
SQL Command Execution in C#
No ratings yet
SQL Command Execution in C#
5 pages
Basic Controls
No ratings yet
Basic Controls
3 pages
Reading A Datadump
No ratings yet
Reading A Datadump
15 pages
Black Box Testing Overview and Examples
No ratings yet
Black Box Testing Overview and Examples
14 pages
IT Java Programming Lab Manual
No ratings yet
IT Java Programming Lab Manual
61 pages
Java Programs (Class 8)
No ratings yet
Java Programs (Class 8)
3 pages
Resume - Venkat (Backend Developer) - 03012023
No ratings yet
Resume - Venkat (Backend Developer) - 03012023
3 pages

Python Notes

Uploaded by

Python Notes

Uploaded by

[Link]

Make value assignment on the go!!

expressions like loops, conditions, and function calls.

step, which is especially useful in loops and conditional statements.

evaluate and assign a value that will be used multiple times.

the if condition to check its length.

while (user_input := input("Enter a valid string (non-empty): ")) and len(user_input) == 0:

object, which allows for flexibility but consumes more memory.

a __dict__ and reducing the memory footprint.

of the attribute dictionary.

# Creating a million users without `__slots__`

creating large numbers of objects.

# Creating a million users with `__slots__`

memoization to a function. Memoization is a technique used to speed up programs by caching

frequently called with the same arguments.

from functools import lru_cache

# Calculate Fibonacci numbers

requests -> httpx

Requests (Yes, Really!)

Why It’s Overrated:

This is less than ideal when working with I/O-bound programs.

those requests concurrently.

async def fetch_data(url):

# Simple and non-blocking

hand is web scraping or ingesting data from somewhere.

BeautifulSoup -> selectolax

2. BeautifulSoup (Yup, This One Too)

Python’s async-first landscape.

Why It’s Overrated:

makes it ill-suited for scraping dynamic websites.

from [Link] import HTMLParser

tool is half the battle.

3. Pandas for All Data Manipulation Tasks

badly hit performance.

What You Should Use Instead: Polars

data without heating up your CPU.

my_dict = {'1': 'string', True: 'bool', 1: 'int', 1.0: float}

# o/p: {'1': 'string', True: <class 'float'>}

Date and Time

Classes of DateTime Python Module

# Get current date and time

# Get date from now

# Get time from now

# Get datetime from now

# Find difference between two datetimes

print("difference - ", difference)

# Create a timezone aware datetime

print("tz_aware_datetime - ", tz_aware_datetime)

You might also like

a dict and reducing the memory footprint.

# Creating a million users without `slots`

# Creating a million users with `slots`