0% found this document useful (0 votes)

57 views4 pages

Manipulating Text With Regular Expression in Python

This document provides an overview of using regular expressions in Python for text manipulation, detailing special characters, sequences, and quantifiers. It explains basic functions such as matching patterns, finding all matches, splitting strings, replacing substrings, and capturing groups with examples. Additionally, it includes practical examples for validating email addresses, extracting hashtags, and normalizing text spacing.

Uploaded by

RANJIT Yadav

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

57 views4 pages

Manipulating Text With Regular Expression in Python

Uploaded by

RANJIT Yadav

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

Manipulating Text with Regular Expression in python

Regular expressions (regex) in Python are a powerful tool for text manipulation. They allow
you to search, match, and manipulate text strings with complex patterns. The re module in
Python provides several functions to work with regular expressions.

Special Characters

 . (Dot): Matches any character except a newline.

 ^ (Caret): Matches the start of the string.

 $ (Dollar Sign): Matches the end of the string.

 [] (Square Brackets): Matches any one of the characters inside the brackets.

 \ (Backslash): Escapes special characters or signals a particular sequence.

Special Sequences

 \d: Matches any digit.

 \D: Matches any non-digit character.

 \s: Matches any whitespace character.

 \S: Matches any non-whitespace character.

 \w: Matches any alphanumeric character.

 \W: Matches any non-alphanumeric character.

Quantifiers

 *: Matches 0 or more repetitions of the preceding pattern.

 +: Matches 1 or more repetitions of the preceding pattern.

 ?: Matches 0 or 1 repetition of the preceding pattern.

 {n}: Matches exactly n repetitions of the preceding pattern.

 {n,}: Matches n or more repetitions of the preceding pattern.

 {n,m}: Matches between n and m repetitions of the preceding pattern.

Basic Functions

Matching Patterns

To check if a pattern exists within a string, you can use re.match() or re.search().

 re.match() checks for a match only at the beginning of the string.

 re.search() checks for a match anywhere in the string.

import re

text = "Hello, world!"

# Match at the beginning

match = re.match(r'Hello', text)

if match:

print("Match found:", match.group()) # Output: Match found: Hello

# Search anywhere in the string

search = re.search(r'world', text)

if search:

print("Search found:", search.group()) # Output: Search found: world

Finding All Matches

To find all occurrences of a pattern in a string, use re.findall().

text = "The rain in Spain stays mainly in the plain."

# Find all occurrences of 'ain'

matches = re.findall(r'ain', text)

print("Find all matches:", matches) # Output: Find all matches: ['ain', 'ain', 'ain']

Splitting Strings

To split a string by a pattern, use re.split().

text = "one1two2three3four4"

# Split by digits

split_result = re.split(r'\d', text)

print("Split result:", split_result) # Output: Split result: ['one', 'two', 'three', 'four', '']

Replacing Substrings

To replace substrings that match a pattern, use re.sub().

text = "The rain in Spain."

# Replace 'rain' with 'sun'

replace_result = re.sub(r'rain', 'sun', text)

print("Replace result:", replace_result) # Output: Replace result: The sun in Spain.

Capturing Groups

Capturing groups allow you to extract specific parts of a match.

text = "My phone number is 123-456-7890."

# Capture groups for area code, prefix, and line number

match = re.search(r'(\d{3})-(\d{3})-(\d{4})', text)

if match:
area_code, prefix, line_number = match.groups()

print("Area code:", area_code) # Output: Area code: 123

print("Prefix:", prefix) # Output: Prefix: 456

print("Line number:", line_number) # Output: Line number: 7890

Examples

Here are some more examples to illustrate the use of regular expressions for text
manipulation:

# Example 1: Validate an email address

email = "[email protected]"

is_valid = re.match(r'^[\w\.-]+@[\w\.-]+\.\w+$', email)

print("Valid email:", bool(is_valid)) # Output: Valid email: True

# Example 2: Extract all hashtags from a tweet

tweet = "Loving the new features in #Python3.9! #coding #programming"

hashtags = re.findall(r'#\w+', tweet)

print("Hashtags:", hashtags) # Output: Hashtags: ['#Python3', '#coding', '#programming']

# Example 3: Replace multiple spaces with a single space

text = "This is an example with irregular spacing."

normalized_text = re.sub(r'\s+', ' ', text)

print("Normalized text:", normalized_text) # Output: Normalized text: This is an example

with irregular spacing.

9 RegEx
No ratings yet
9 RegEx
57 pages
9 RegEx
No ratings yet
9 RegEx
57 pages
Unit - 4 Regex
No ratings yet
Unit - 4 Regex
28 pages
Regular Expressions in Python
No ratings yet
Regular Expressions in Python
4 pages
UNIT4
No ratings yet
UNIT4
67 pages
Regular Expression L
No ratings yet
Regular Expression L
20 pages
App Dev Using Python-Chapter 3
No ratings yet
App Dev Using Python-Chapter 3
16 pages
Python Regular Expressions Tutorial
No ratings yet
Python Regular Expressions Tutorial
27 pages
Module 24 Regular Expressions Revisited
No ratings yet
Module 24 Regular Expressions Revisited
15 pages
Unit7 RegularExpressionpdf 2023 10 17 09 16 29
No ratings yet
Unit7 RegularExpressionpdf 2023 10 17 09 16 29
17 pages
Mastering Regular Expressions in Python
No ratings yet
Mastering Regular Expressions in Python
4 pages
Unit 4 Regular Expression
No ratings yet
Unit 4 Regular Expression
16 pages
Regular
No ratings yet
Regular
9 pages
Python Course: Session 6b - Regular Expressions
No ratings yet
Python Course: Session 6b - Regular Expressions
11 pages
Python Regex Basics
No ratings yet
Python Regex Basics
16 pages
Regular Expression 01
No ratings yet
Regular Expression 01
48 pages
Regular Expression
No ratings yet
Regular Expression
21 pages
Reg Ex
No ratings yet
Reg Ex
3 pages
Python Regular Expressions
No ratings yet
Python Regular Expressions
14 pages
Python Re
No ratings yet
Python Re
18 pages
Regular Expression
No ratings yet
Regular Expression
39 pages
Lec 06 - Regular Expression
No ratings yet
Lec 06 - Regular Expression
19 pages
17 - Regular Expression
No ratings yet
17 - Regular Expression
20 pages
Learn Python Regular Expressions
No ratings yet
Learn Python Regular Expressions
50 pages
Unit-3 - Regular Expression
No ratings yet
Unit-3 - Regular Expression
15 pages
9python Simple Character Matches
No ratings yet
9python Simple Character Matches
19 pages
Regular Expressions in Python
No ratings yet
Regular Expressions in Python
12 pages
Python Regex Basics and Usage
No ratings yet
Python Regex Basics and Usage
12 pages
Python - Regular Expressions
No ratings yet
Python - Regular Expressions
13 pages
RegEx in Python
No ratings yet
RegEx in Python
5 pages
Assignment 3
No ratings yet
Assignment 3
3 pages
Python Regex: Match, Search, Findall
No ratings yet
Python Regex: Match, Search, Findall
10 pages
Regular Expressions - Regexes in Python (Part 1) - Real Python
No ratings yet
Regular Expressions - Regexes in Python (Part 1) - Real Python
44 pages
Python Regular Expression
100% (1)
Python Regular Expression
31 pages
Understanding Python's re Module
No ratings yet
Understanding Python's re Module
9 pages
Python Reg Expressions
No ratings yet
Python Reg Expressions
8 pages
Python Regular Expressions Tutorial
No ratings yet
Python Regular Expressions Tutorial
23 pages
Regex for Genomics & Programming
No ratings yet
Regex for Genomics & Programming
38 pages
Lecture 7 Re Part2 Split
No ratings yet
Lecture 7 Re Part2 Split
8 pages
Python Regex
No ratings yet
Python Regex
8 pages
PP - Module-3 Notes
No ratings yet
PP - Module-3 Notes
56 pages
Python Regex Cheatsheet With Examples: Re Module Functions
No ratings yet
Python Regex Cheatsheet With Examples: Re Module Functions
1 page
Howto Regex
No ratings yet
Howto Regex
19 pages
Regular Expression
No ratings yet
Regular Expression
20 pages
Full Python Regex Questions Detailed
No ratings yet
Full Python Regex Questions Detailed
4 pages
Lecture 9 Python
No ratings yet
Lecture 9 Python
8 pages
Python Reg Expressions PDF
No ratings yet
Python Reg Expressions PDF
8 pages
Python RegEx
No ratings yet
Python RegEx
11 pages
Howto Regex
No ratings yet
Howto Regex
17 pages
Python Unit 5
No ratings yet
Python Unit 5
143 pages
Python Regex & NLTK Guide
No ratings yet
Python Regex & NLTK Guide
53 pages
Module II
No ratings yet
Module II
17 pages
Untitled
No ratings yet
Untitled
53 pages
Python Regex Examples
No ratings yet
Python Regex Examples
24 pages
Unit 2
No ratings yet
Unit 2
69 pages
Pandas
No ratings yet
Pandas
8 pages
Python Complete Unit 3
No ratings yet
Python Complete Unit 3
40 pages
Transaction
No ratings yet
Transaction
3 pages
Index and Hashng
No ratings yet
Index and Hashng
2 pages
CN After Mid-2
No ratings yet
CN After Mid-2
13 pages
Movie Recommendation System with ML
No ratings yet
Movie Recommendation System with ML
21 pages
IT3101 - Object-Oriented Systems Development: University of Colombo, Sri Lanka
No ratings yet
IT3101 - Object-Oriented Systems Development: University of Colombo, Sri Lanka
12 pages
SDM Case Studies
No ratings yet
SDM Case Studies
193 pages
4021Q2 Specimen Computer Science
100% (1)
4021Q2 Specimen Computer Science
12 pages
Bsac 117 Computer Audit Week 4 Seatwork Aug 24-Student
No ratings yet
Bsac 117 Computer Audit Week 4 Seatwork Aug 24-Student
7 pages
AutoCAD Basics for Engineering Students
No ratings yet
AutoCAD Basics for Engineering Students
6 pages
API Testing - Jaikishan
No ratings yet
API Testing - Jaikishan
8 pages
IP Telephony Tender Notice
No ratings yet
IP Telephony Tender Notice
12 pages
Tom's E-Commerce Cart Structure
100% (1)
Tom's E-Commerce Cart Structure
267 pages
Unit III PHP
No ratings yet
Unit III PHP
31 pages
Lib M Bus Master
No ratings yet
Lib M Bus Master
283 pages
16 Channel Rail Rs485 Commamd
No ratings yet
16 Channel Rail Rs485 Commamd
8 pages
Assignment01 Generics Part 1
No ratings yet
Assignment01 Generics Part 1
2 pages
Communication Failure Alphasat in EMEA Region (FSN150007A)
No ratings yet
Communication Failure Alphasat in EMEA Region (FSN150007A)
12 pages
2 Marks Question Bank
No ratings yet
2 Marks Question Bank
7 pages
SN 74194
No ratings yet
SN 74194
13 pages
.PSA27 Radar System (New Version) - Manual
No ratings yet
.PSA27 Radar System (New Version) - Manual
31 pages
Schneider Electric Vijeo-Designer VJDBTPRO1P
No ratings yet
Schneider Electric Vijeo-Designer VJDBTPRO1P
2 pages
W 10 S H I: Indows Egment EAP Nternals
No ratings yet
W 10 S H I: Indows Egment EAP Nternals
54 pages
General Elements of A C Program
No ratings yet
General Elements of A C Program
6 pages
Microsoft Modern Desktop Admin Path
No ratings yet
Microsoft Modern Desktop Admin Path
1 page
RF07 Reopening of Enrollment and EOSY Finalization
No ratings yet
RF07 Reopening of Enrollment and EOSY Finalization
6 pages
M1 Formative
No ratings yet
M1 Formative
5 pages
Google Cloud Architect Exam Q&A Guide
No ratings yet
Google Cloud Architect Exam Q&A Guide
89 pages
CCNA 2 Practice Exam Answers
No ratings yet
CCNA 2 Practice Exam Answers
11 pages
NFT-Based Digital Ownership Solutions
No ratings yet
NFT-Based Digital Ownership Solutions
81 pages
Telco Edge Cloud Evolution to NaaS
No ratings yet
Telco Edge Cloud Evolution to NaaS
67 pages
Activity 1-2 Web Application Development: Instructions/Directions
No ratings yet
Activity 1-2 Web Application Development: Instructions/Directions
5 pages
OPT-5107C Digital TV Multiplexer
No ratings yet
OPT-5107C Digital TV Multiplexer
2 pages
Artificial Intelligence Tool in RPF
No ratings yet
Artificial Intelligence Tool in RPF
11 pages
77 Useful Linux Commands and Utilities
No ratings yet
77 Useful Linux Commands and Utilities
12 pages

Manipulating Text With Regular Expression in Python

Uploaded by

Manipulating Text With Regular Expression in Python

Uploaded by

Manipulating Text with Regular Expression in python

 . (Dot): Matches any character except a newline.

 ^ (Caret): Matches the start of the string.

 $ (Dollar Sign): Matches the end of the string.

 \ (Backslash): Escapes special characters or signals a particular sequence.

 \d: Matches any digit.

 \D: Matches any non-digit character.

 \s: Matches any whitespace character.

 \S: Matches any non-whitespace character.

 \w: Matches any alphanumeric character.

 \W: Matches any non-alphanumeric character.

 *: Matches 0 or more repetitions of the preceding pattern.

 +: Matches 1 or more repetitions of the preceding pattern.

 ?: Matches 0 or 1 repetition of the preceding pattern.

 {n}: Matches exactly n repetitions of the preceding pattern.

 {n,}: Matches n or more repetitions of the preceding pattern.

 {n,m}: Matches between n and m repetitions of the preceding pattern.

 re.match() checks for a match only at the beginning of the string.

text = "Hello, world!"

# Match at the beginning

match = re.match(r'Hello', text)

print("Match found:", match.group()) # Output: Match found: Hello

# Search anywhere in the string

search = re.search(r'world', text)

print("Search found:", search.group()) # Output: Search found: world

Finding All Matches

To find all occurrences of a pattern in a string, use re.findall().

text = "The rain in Spain stays mainly in the plain."

# Find all occurrences of 'ain'

matches = re.findall(r'ain', text)

To split a string by a pattern, use re.split().

split_result = re.split(r'\d', text)

To replace substrings that match a pattern, use re.sub().

text = "The rain in Spain."

# Replace 'rain' with 'sun'

replace_result = re.sub(r'rain', 'sun', text)

print("Replace result:", replace_result) # Output: Replace result: The sun in Spain.

Capturing groups allow you to extract specific parts of a match.

text = "My phone number is 123-456-7890."

# Capture groups for area code, prefix, and line number

match = re.search(r'(\d{3})-(\d{3})-(\d{4})', text)

print("Area code:", area_code) # Output: Area code: 123

print("Prefix:", prefix) # Output: Prefix: 456

print("Line number:", line_number) # Output: Line number: 7890

# Example 1: Validate an email address

is_valid = re.match(r'^[\w\.-]+@[\w\.-]+\.\w+$', email)

print("Valid email:", bool(is_valid)) # Output: Valid email: True

# Example 2: Extract all hashtags from a tweet

tweet = "Loving the new features in #Python3.9! #coding #programming"

hashtags = re.findall(r'#\w+', tweet)

print("Hashtags:", hashtags) # Output: Hashtags: ['#Python3', '#coding', '#programming']

# Example 3: Replace multiple spaces with a single space

text = "This is an example with irregular spacing."

normalized_text = re.sub(r'\s+', ' ', text)

print("Normalized text:", normalized_text) # Output: Normalized text: This is an example

You might also like