0% found this document useful (0 votes)

18 views67 pages

DOC4

Regular expressions (regex) are patterns used to search and manipulate text strings, enhancing text processing capabilities. They come in two types: Basic Regular Expressions (BRE) and Extended Regular Expressions (ERE), each with specific meta characters and functionalities. Regex is commonly utilized in Unix/Linux commands like grep, sed, and awk for text searching and manipulation.

Uploaded by

abtutorialkarthik

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views67 pages

DOC4

Uploaded by

abtutorialkarthik

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 67

Regular Expression

Palani Karthikeyan
[email protected]
What are regular expressions?
● A regular expression is a pattern that describes a
set of strings.
● Regular expressions are used to search and
manipulate the text, based on the patterns.
● A regular expression, often shortened to “regex” or
“regexp”.
● Regexes enhance the ability to meaningfully
process text content, especially when combined
with other commands.
grep ,sed,awk
● Usually, regular expressions are included in the
grep,sed and awk in the following format:
● grep [options] [regexp] [inputfile]
● In sed : sed [option] '/[regexp]action/' [inputfile]
● In awk: awk [option] '/[regexp]{Action}' [inputfile]
BRE & ERE
● Two types of regular expression feature in
unix/Linux shell
● Basic Regular Expression – BRE
● Extended Regular Expression – ERE
BRE
● BRE – following meta characters are used
● . (dot) Matches any single character.
● ^ match expression at the start of a line, as ^PATTERN
● $ match expression at the end of a line, as in PATTERN$.
● \ (Back Slash) = turn off the special meaning of the next character, as in \^
● [ ] (Brackets)=match any one of the enclosed characters
● [^ ]= match any one character except those enclosed in [ ]
● * (Asterisk) = match zero or more of the preceding character or expression
● ^PATTERN$ = match PATTERN only in single line
● [-]=Character ranges as [A-Z] [0-9] [a-z] [A-Za-z0-9]
ERE
● ERE – Following meta characters are used.
● ? means that the preceding item is optional, and if found, will be matched at the
most, once.
● + means the preceding item will be matched one or more times.
● {n} means the preceding item is matched exactly n times
●
{n,} means the item is matched n or more times.
●
{n,m} means that the preceding item is matched at least n times, but not more
than m times.
●
{,m} means that the preceding item is matched, at the most, m times.
● | (alternation) operator means that the pattern containing this operator separately
matches the parts on either side of it; if either one is found, the line containing it is
a match.
●
( ) Grouping means that ( ) to group several patterns to behave as one.
ERE
● In general ERE supports following operations
– Alternative Match Patterns
– Grouping Alternatives
– Quantifiers
Alternative Match Patterns

● Alternative Match Pattern means that you can

specify a series of alternatives for a pattern
using | to separate them.
● |(called alternation) is equivalent to an “or” in
regular expression.
● Alternatives are checked from left to right, so
the first alternative that matches is the one
that’s used.
Grouping Alternatives

● Grouping “( ) “ allows parts of a regular

expression to be treated as a single unit.
● Parts of a regular expression are grouped by
enclosing them in parentheses.
● Used to group similar terms by their common
characters and only specified the differences.
● The pairs of parentheses are numbered from left
to right by the positions of the left parentheses.
Quantifiers

● Quantifiers says how many times something

may match,instead of the default of matching
just once.
● You can use quantifier to specify that a pattern
must match a specific number of times.
● Quantifiers in a regular expression are like
loops in a program.
Quantifiers (Contd..)
character Description

* It indicates that the string Immediately to

match 0 or more times the left should
be matched zero or more times in order to
be evaluated
as a true.
Example:-
$var =~ /st*/ # Will match for the strings
like
“st”, ”sttr”, “ sts ”, “star”, “son “....
The regexp “a*” will search for a followed
by either “a” or any other
character.
It matches all strings which
contain the character “a”
Quantifiers (Contd..)
character Description

+ It indicates that the string Immediately to

the left should
match 1 or more times be matched one or more times in order to
be evaluated as
a true.

Example:-
$var =~ /st+/ # Will match for the strings
like “st”,”sttr”, “sts” ,”star “, but not “son”.
Quantifiers (Contd..)
character Description

? It indicates that the string Immediately to

the left should be matched zero or one
times in order to be evaluated as a true.
match 1 or 0 times Example : -

$var =~ /st?r/ # will match either “star” or

“sttr”.

$var =~ /comm?a/ # will match either

“coma” or “comma”
Quantifiers (Contd..)
character Description

{} It indicates that how many times the string

immediately to the left should be matched.

Example : -
{n} - should match exactly n times.
{n,} - should match at least n times
{n, m} - Should match at least n times but
not more than m times.
Example :
$var =~ /mn{2,4}p/ # will match “mnnp”,
“mnnnp”, ”mnnnnp” .
Making Quantifiers Less Greedy

● To make Quantifiers less greedy –that is ,to match the

minimum number of times possible –you follow the
quantifier with a ?
● *? Matches zero or more times.
● +? Matches one or more times.
● ?? Matches zero or one times.
● {n}? Matches n times.
● {n,}? Matches at least n times
● {m,n} Matches at least n times but more than m times.
BRE vs ERE
● In basic regular expressions the
metacharacters "?", "+", "{", "|", "(", and ")" lose
their special meaning; instead use the
backslashed versions "\?", "\+", "\{", "\|", "$",
and "$".
● In ERE options
● grep -E
● sed -r
Examples using grep
● we now exclusively want to display lines starting with
the string "root":
● grep ^root /etc/passwd
● root:x:0:0:root:/root:/bin/bash
● If we want to see which accounts have no shell
assigned whatsoever, we search for lines ending in ":"
● grep :$ /etc/passwd
● news:x:9:13:news:/var/spool/news:
Character classes
● grep [yf] /etc/group
● sys:x:3:root,bin,adm
● tty:x:5:
● mail:x:12:mail,postfix
● ftp:x:50:
● nobody:x:99:
● floppy:x:19:
● xfs:x:43:
● nfsnobody:x:65534:
● postfix:x:89:
●
●
dog matches the string "dog"
●
[dog]matches matches one character: a "d" an "o" or a "g"
●
[dog]* matches matches a string of zero or more characters from the set {"d" an "o" or a "g"}
●
(dog|cat) matches the string "dog" or the string "cat"
●
dog.*cat matches the string "dog" followed by the string "cat" somewhere later in the string
●
x(dog|cat)x matches the string "dog" or the string "cat" between two "x"s
●
xx* matches a string of one or more "x"s
●
x+matches a string of one or more "x"s
●
x(dog|cat)?x matches two "x"s with optionally the string "dog" or the string "cat" between the "x"'s
●
[aeiou] matches a single vowel
●
[A-Z]+ matches a string of one or more uppercase characters
●
[az-]+ matches a string of one characters from the set or three characters "a", "z", "-"
●
[^a-z]+ matches a string of one or more characters that are not lowercaase letters
●
"[a-z]" in flex matches exactly the five character string "[a-z]"
●
[a-zA-Z][a-zA-Z0-9]*matches a letter optionally followed by letters or digits
●
[1-9][0-9]*|0 matches a positive integer with no leading zero except when the number is zero
●
[+-]?[0-9]+ matches an integer with optional sign (note that leading zeroes are allowed
●
([0-9].)*matches an even number of characters where every odd numbered character is a digit
●
[+-]?[1-9][0-9]*|0 matches an integer with no leading zero except when the number is zero. The
number may have an optional sign
●
[\^\+\-\:\*\]] matches one of the 6 characters: "^", "+", "-", ":", "*", "]"
Regx Snaps
Thank you

D OSWP-Report
0% (2)
D OSWP-Report
18 pages
RARE WEDGE 100BF 32X Manual Installation Guide
No ratings yet
RARE WEDGE 100BF 32X Manual Installation Guide
32 pages
Cs 32 Final Notes
No ratings yet
Cs 32 Final Notes
21 pages
Robotics Resource Lego Wedo
No ratings yet
Robotics Resource Lego Wedo
7 pages
IDC: Predictive Analytics and ROI
No ratings yet
IDC: Predictive Analytics and ROI
10 pages
OpenText Media Management CE 22.2 - Integration Guide English (MEDMGT220200-AIN-EN-01)
No ratings yet
OpenText Media Management CE 22.2 - Integration Guide English (MEDMGT220200-AIN-EN-01)
170 pages
Lecture 9
No ratings yet
Lecture 9
26 pages
WT - Regular Expression
No ratings yet
WT - Regular Expression
22 pages
Regex Cheat Sheet
No ratings yet
Regex Cheat Sheet
10 pages
Regular Expressions
No ratings yet
Regular Expressions
5 pages
Python Regular Expressions Cheat Sheet PDF
No ratings yet
Python Regular Expressions Cheat Sheet PDF
1 page
Regex Slides PDF
No ratings yet
Regex Slides PDF
435 pages
Using Regular Expressions With PHP
No ratings yet
Using Regular Expressions With PHP
6 pages
Sys LW-08EN Regex-Filters
No ratings yet
Sys LW-08EN Regex-Filters
31 pages
Andrei's Regex Clinic - PHP Quebec 2009
100% (2)
Andrei's Regex Clinic - PHP Quebec 2009
209 pages
Regular Expression Syntax: Literals
No ratings yet
Regular Expression Syntax: Literals
5 pages
Regular Expression Overview
No ratings yet
Regular Expression Overview
5 pages
Unix Regular Expression
No ratings yet
Unix Regular Expression
7 pages
REGULAR EXPRESSIONS Workbook
No ratings yet
REGULAR EXPRESSIONS Workbook
8 pages
Regex
No ratings yet
Regex
30 pages
Jan Goyvaerts - All About Regular Expressions-Https - WWW - Regular-Expressions - Info - (2019)
No ratings yet
Jan Goyvaerts - All About Regular Expressions-Https - WWW - Regular-Expressions - Info - (2019)
206 pages
Chapter Two
No ratings yet
Chapter Two
72 pages
Regex
No ratings yet
Regex
24 pages
Regular Expressions: Exceptions in A Character Set
No ratings yet
Regular Expressions: Exceptions in A Character Set
10 pages
Regex in A Nutshell
No ratings yet
Regex in A Nutshell
2 pages
POSIX Regular Expressions: Brackets
No ratings yet
POSIX Regular Expressions: Brackets
5 pages
Matching This or That: ' - ' Dog Cat Dog - Cat Dog Dog Cat Cat
No ratings yet
Matching This or That: ' - ' Dog Cat Dog - Cat Dog Dog Cat Cat
7 pages
$address M/ (/D . ) /N ( (A-Z) (2) ) (/D (5) ) - ? (/D (0,5) )
No ratings yet
$address M/ (/D . ) /N ( (A-Z) (2) ) (/D (5) ) - ? (/D (0,5) )
98 pages
Lecture02 Scanning 1
No ratings yet
Lecture02 Scanning 1
72 pages
PHP - Regular Expressions
No ratings yet
PHP - Regular Expressions
14 pages
Bash Regex
No ratings yet
Bash Regex
53 pages
Re - Regular Expression Operations - Python 3.13.3 Documentation
No ratings yet
Re - Regular Expression Operations - Python 3.13.3 Documentation
28 pages
Python RegEx
No ratings yet
Python RegEx
8 pages
PHP - Regular Expressions
No ratings yet
PHP - Regular Expressions
7 pages
David Wang Computing Science and Information Technology: Info 1211 - Operating System'S Principles and Applications
No ratings yet
David Wang Computing Science and Information Technology: Info 1211 - Operating System'S Principles and Applications
73 pages
Perl Training Regex
No ratings yet
Perl Training Regex
27 pages
Regex Tutorial - A Quick Cheatsheet by Examples - by Jonny Fox - Factory Mind - Medium
No ratings yet
Regex Tutorial - A Quick Cheatsheet by Examples - by Jonny Fox - Factory Mind - Medium
7 pages
2 Regular Expression
No ratings yet
2 Regular Expression
23 pages
How To Write Regular Expressions?: What Is A Regular Expression and What Makes It So Important?
No ratings yet
How To Write Regular Expressions?: What Is A Regular Expression and What Makes It So Important?
2 pages
Regex Clinic
100% (1)
Regex Clinic
148 pages
CC 2
No ratings yet
CC 2
65 pages
Regex Tutorial-A Quick Cheatsheet by Examples: Anchors - and $
No ratings yet
Regex Tutorial-A Quick Cheatsheet by Examples: Anchors - and $
7 pages
Ayan Saha - 10700121101
No ratings yet
Ayan Saha - 10700121101
10 pages
Regular Expressions and Sed & Awk
No ratings yet
Regular Expressions and Sed & Awk
13 pages
Regular Expressions and Sed & Awk
No ratings yet
Regular Expressions and Sed & Awk
14 pages
Regular Expressions: SESSION - 14 - 15 - 16
No ratings yet
Regular Expressions: SESSION - 14 - 15 - 16
42 pages
Lecture19 12PM
No ratings yet
Lecture19 12PM
38 pages
Regexp
No ratings yet
Regexp
28 pages
Regular PDF
No ratings yet
Regular PDF
2 pages
Chapter 10
No ratings yet
Chapter 10
28 pages
Regex Cheatsheet
No ratings yet
Regex Cheatsheet
6 pages
CSS Unit 5
No ratings yet
CSS Unit 5
18 pages
Regular Expressions
No ratings yet
Regular Expressions
4 pages
Javascript Regexp Object
No ratings yet
Javascript Regexp Object
4 pages
Regex Cheat Sheet
No ratings yet
Regex Cheat Sheet
4 pages
Perl Re Quick
No ratings yet
Perl Re Quick
9 pages
Java Lect 17
No ratings yet
Java Lect 17
24 pages
Linux Regular Expression Tutorial - Grep Regex Example
No ratings yet
Linux Regular Expression Tutorial - Grep Regex Example
8 pages
Java Lect 17
No ratings yet
Java Lect 17
24 pages
3-Regular Expressions
No ratings yet
3-Regular Expressions
34 pages
Regular Expressions in Perl
No ratings yet
Regular Expressions in Perl
13 pages
Introduction To The Idirect Web Service Interface: Revision B
No ratings yet
Introduction To The Idirect Web Service Interface: Revision B
42 pages
AE306 Digital Signal Processing
No ratings yet
AE306 Digital Signal Processing
2 pages
Matlab Group 1 Project Report
No ratings yet
Matlab Group 1 Project Report
2 pages
2373 Programming With MS Visual Basic
No ratings yet
2373 Programming With MS Visual Basic
5 pages
Noodle Analytics in 2018 AI For The Enterprise
No ratings yet
Noodle Analytics in 2018 AI For The Enterprise
28 pages
Component Based Software Engineering
No ratings yet
Component Based Software Engineering
4 pages
KUKA-youBot UserManual v0.86.1
No ratings yet
KUKA-youBot UserManual v0.86.1
46 pages
Get The LUN ID at AIX
No ratings yet
Get The LUN ID at AIX
4 pages
System On Chip (SOC)
No ratings yet
System On Chip (SOC)
9 pages
The OPENGL Basic Graphics Primitives
No ratings yet
The OPENGL Basic Graphics Primitives
55 pages
Nov - 2016 - NEO - Firmware Upgrade v130
No ratings yet
Nov - 2016 - NEO - Firmware Upgrade v130
6 pages
Certificate of Training Jeraldin T. Bulat-Ag: Sibugay Technical Institute Incorporated Inc
No ratings yet
Certificate of Training Jeraldin T. Bulat-Ag: Sibugay Technical Institute Incorporated Inc
2 pages
Free - Proxy - List Russia Socks 5
No ratings yet
Free - Proxy - List Russia Socks 5
2 pages
Prasanth Resume 1
No ratings yet
Prasanth Resume 1
4 pages
LCP4809-Exam Oct Nov 2023
No ratings yet
LCP4809-Exam Oct Nov 2023
8 pages
Microcontroller Lab Manual
100% (1)
Microcontroller Lab Manual
38 pages
C++ Sockets TCP
No ratings yet
C++ Sockets TCP
28 pages
Recruiter's Handbook - Boolean Strings
100% (1)
Recruiter's Handbook - Boolean Strings
17 pages
Vikram Resume-2
No ratings yet
Vikram Resume-2
3 pages
Sai Construction
No ratings yet
Sai Construction
2 pages
RPAx User Manual Rev 9.6 Update 27 - 5 - 2017
No ratings yet
RPAx User Manual Rev 9.6 Update 27 - 5 - 2017
11 pages
Luci Cheltuieli Nov 24
No ratings yet
Luci Cheltuieli Nov 24
2 pages
Lab4 Top Level SSN Insertion
No ratings yet
Lab4 Top Level SSN Insertion
19 pages
Schemeof Work P4 Set
No ratings yet
Schemeof Work P4 Set
14 pages

DOC4

Uploaded by

DOC4

Uploaded by

Regular Expression

● Alternative Match Pattern means that you can

● Grouping “( ) “ allows parts of a regular

● Quantifiers says how many times something

* It indicates that the string Immediately to

+ It indicates that the string Immediately to

? It indicates that the string Immediately to

$var =~ /st?r/ # will match either “star” or

$var =~ /comm?a/ # will match either

{} It indicates that how many times the string

● To make Quantifiers less greedy –that is ,to match the

You might also like