Python Packages for Web Data Access
API Path:
Send API Request → We send a request to an API server.
Receive Data (JSON/XML) → The server sends back structured data.
Store Data (CSV, JSON, TXT) → We save the API data in a file.
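The three steps above can be sketched with the standard library alone. This is a minimal sketch, not a definitive implementation: the function names (fetch_json, save_as_csv, save_as_json), the file names, and the sample records are hypothetical, and the sample string stands in for a real server response so the example runs offline.

```python
import csv
import json
import urllib.request


def fetch_json(url, timeout=10):
    """Step 1 and 2: send a GET request and decode the JSON reply."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


def save_as_csv(records, path):
    """Step 3: write a non-empty list of flat dictionaries to a CSV file."""
    fieldnames = list(records[0].keys())
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(records)


def save_as_json(records, path):
    """Step 3 (alternative): write the records to a JSON file."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(records, f, indent=2)


# Offline stand-in for the body an API server might return (made-up data).
sample_body = '[{"id": 1, "name": "Asha"}, {"id": 2, "name": "Ravi"}]'
records = json.loads(sample_body)

# With a real endpoint the flow would look like this (URL is a placeholder):
# records = fetch_json("https://example.com/api/users")
# save_as_csv(records, "users.csv")
# save_as_json(records, "users.json")
```

save_as_csv assumes every record has the same keys as the first one; nested JSON would need flattening before it fits a CSV layout.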
REGEX (Regular Expressions)
A regular expression (regex) is a special sequence of characters used to find, match, and manipulate
patterns in text. It acts like a smart filter that helps you search for specific words, numbers, or patterns
inside a large amount of text.
Methods
Examples:
1. re.findall()
2. re.sub()
3. re.search()
4. re.match()
5. re.split()
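The five methods listed above can be compared on one sample string. The text and patterns here are made up for illustration.

```python
import re

text = "Order 42 shipped on 2024-03-15; order 7 shipped on 2024-04-01."
date_pattern = r"\d{4}-\d{2}-\d{2}"

# re.findall(): every non-overlapping match, returned as a list of strings.
dates = re.findall(date_pattern, text)

# re.sub(): replace every match with a new string.
masked = re.sub(date_pattern, "<DATE>", text)

# re.search(): the first match anywhere in the string (or None).
first_order = re.search(r"order (\d+)", text, flags=re.IGNORECASE)
first_order_number = first_order.group(1)

# re.match(): a match only if the pattern fits at the start of the string.
starts_with_order = re.match(r"Order", text) is not None
starts_with_shipped = re.match(r"shipped", text) is not None

# re.split(): cut the string wherever the pattern matches.
parts = re.split(r";\s*", text)

print(dates)               # ['2024-03-15', '2024-04-01']
print(masked)              # Order 42 shipped on <DATE>; order 7 shipped on <DATE>.
print(first_order_number)  # 42
print(starts_with_order, starts_with_shipped)  # True False
print(parts)
```

The difference between re.search() and re.match() is the one most often confused: "shipped" occurs in the text, but re.match() still returns None because the text does not begin with it.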
Urllib
URL (Uniform Resource Locator) Library
urllib is a package in Python's standard library for opening, parsing, and handling URLs. It lets
Python interact with websites by sending requests, downloading data, and handling web-related
tasks such as encoding URLs and managing errors.
Key Features:
✅ Open and read web pages (urllib.request)
✅ Parse and manipulate URLs (urllib.parse)
✅ Handle HTTP errors (urllib.error)
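The three features above can be combined in one short sketch. The helper names (build_url, fetch_text) and the example.com address are hypothetical; the URL is only built and parsed here, not requested, so the example runs without network access.

```python
import urllib.error
import urllib.parse
import urllib.request


def build_url(base, params):
    """Encode a dictionary of parameters into a query string (urllib.parse)."""
    return base + "?" + urllib.parse.urlencode(params)


def fetch_text(url, timeout=10):
    """Open a page and return its text, or None on failure (urllib.request)."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            # Assumes the page is UTF-8; undecodable bytes are replaced.
            return response.read().decode("utf-8", errors="replace")
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status such as 404 or 500.
        print("HTTP error:", err.code, err.reason)
    except urllib.error.URLError as err:
        # The server could not be reached (bad host name, no connection, ...).
        print("Connection problem:", err.reason)
    return None


url = build_url("https://example.com/search", {"q": "web data", "page": 2})
parts = urllib.parse.urlparse(url)
query = urllib.parse.parse_qs(parts.query)

print(url)           # https://example.com/search?q=web+data&page=2
print(parts.netloc)  # example.com
print(parts.path)    # /search
print(query)         # {'q': ['web data'], 'page': ['2']}

# page = fetch_text(url)  # uncomment to send the request
```

HTTPError is caught before URLError because it is a subclass of URLError; with the order reversed, the more specific handler would never run.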
3. urllib.error
4. urllib.robotparser
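urllib.robotparser checks whether a site's robots.txt file allows a crawler to fetch a given URL. The sketch below parses a made-up robots.txt from a string so it runs offline; the crawler name "MyCrawler" and the example.com URLs are placeholders.

```python
import urllib.robotparser

# Made-up robots.txt content; a real one lives at https://<site>/robots.txt.
robots_txt = """\
User-agent: *
Disallow: /private/
Allow: /
"""

parser = urllib.robotparser.RobotFileParser()
parser.parse(robots_txt.splitlines())

public_ok = parser.can_fetch("MyCrawler", "https://example.com/index.html")
private_ok = parser.can_fetch("MyCrawler", "https://example.com/private/data.html")

print(public_ok)   # True
print(private_ok)  # False

# Against a live site, the file would be downloaded instead of parsed
# from a string:
# parser.set_url("https://example.com/robots.txt")
# parser.read()
```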
Submitted by:
CB.PS.I5MAT24004
CB.PS.I5MAT24006
CB.PS.I5MAT24009