Regular Expressions

This document provides an introduction to regular expressions (regexes). It explains that regexes are sequences of characters that define search patterns to match strings. Key points covered include:
- Regexes are written here between "/" delimiters and are used in programming languages to filter data.
- Common regex patterns include matching specific words, ranges of characters, numbers of repetitions, and starting/ending patterns.
- Metacharacters like "\d" have special meanings, and flags like "i" and "g" modify matching behavior.
- Braces {}, brackets [], and symbols like ?, *, +, | have specific regex functions around repetition, character sets, and alternation.

Uploaded by

edgar leiva

REGULAR EXPRESSIONS

WHAT IS A REGULAR
EXPRESSION (REGEX)?
- A sequence of characters that defines a search pattern.
- It provides a concise and powerful way to describe those patterns.
- It is used in various programming languages.
IMPORTANCE
When working with SQL commands, we sometimes need to filter data using
regular expressions, for example when handling e-mails, passwords, or aliases.
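BASICS
- In these slides, every regular expression is written between two “/” delimiters, a convention used by languages such as JavaScript and Perl. The slashes are not part of the pattern itself.
Example: /example/
Note: This regular expression matches every string that contains the exact text “example”.
Note: When a string satisfies the regex rule, we call that a “match”.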
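A minimal sketch of the /example/ pattern, assuming Python and its standard "re" module (the slides do not name a language; in Python the pattern is a string, so the surrounding slashes are dropped). The function name and sample sentences are illustrative only.

```python
import re

# The pattern from the slide, written without the "/" delimiters.
pattern = re.compile(r"example")

def contains_example(text):
    """Return True when the pattern occurs anywhere in text."""
    return pattern.search(text) is not None

print(contains_example("this is an example sentence"))
print(contains_example("nothing to see here"))
```

Here search() scans the whole string, which mirrors the "contains" behavior described above.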
REGEX FOR STRINGS THAT
CONTAIN A SPECIFIC WORD
If we need a regular expression that matches strings containing a specific word, we can use: /<our word>/
So, /watermelon/ will have this effect:
- “odfkvokkmwater-melongbbjofg”, result: no match
- “watermelondifbijingfi”, result: match
- “sfnvjdfslb”, result: no match
- “xzfdsewatermelongfi”, result: match
- “vfvxwatermelon”, result: match
- “vfvxwatermelonfdndjfdfwatermelon”, result: match
Note: This regex only matches the first appearance of the word “watermelon”, then returns.
REGEX FOR STRINGS THAT
CONTAIN A SPECIFIC WORD
We can also use a regex flag (an optional parameter that modifies how a regex searches) named “global” (g) to match all the appearances of the word “watermelon”.
Note: Regex flags are always typed after the final “/”.
So, /watermelon/g will have this effect:
- “odfkvokkmwater-melongbbjofg”, result: no match
- “watermelondifbijingfi”, result: 1 match
- “sfnvjdfslb”, result: no match
- “xzfdsewatermelongfi”, result: 1 match
- “vfvxwatermelon”, result: 1 match
- “vfvxwatermelonfdndjfdfwatermelon”, result: 2 matches
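Python has no "g" flag, so the sketch below assumes re.findall() as the closest equivalent: it returns every non-overlapping match instead of stopping at the first one. The helper name count_matches is illustrative.

```python
import re

samples = [
    "odfkvokkmwater-melongbbjofg",
    "watermelondifbijingfi",
    "sfnvjdfslb",
    "vfvxwatermelonfdndjfdfwatermelon",
]

def count_matches(text):
    """Count every non-overlapping occurrence, as the g flag would."""
    return len(re.findall(r"watermelon", text))

for s in samples:
    print(s, "->", count_matches(s))
```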
REGEX FOR STRINGS THAT
CONTAIN A SPECIFIC WORD
We can also use a regex flag named “insensitive” (i) to match the word “watermelon” even when it is written with capital letters. Combined with the g flag, it matches all such appearances.
So, /watermelon/gi will have this effect:
- “odfkvokkmwater-melongbbjofg”, result: no match
- “wAtErmelOndifbijingfi”, result: 1 match
- “sfnvjdfslb”, result: no match
- “xzfdseWATERMELONgfi”, result: 1 match
- “vfvxwatermElon”, result: 1 match
- “vfvxWateRmelonfdndjfdfwatErmelon”, result: 2 matches
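As a rough Python counterpart of /watermelon/gi, assuming the standard "re" module: re.IGNORECASE stands in for the "i" flag and findall() stands in for "g". The helper name find_all is illustrative.

```python
import re

# re.IGNORECASE plays the role of "i"; findall() plays the role of "g".
pattern = re.compile(r"watermelon", re.IGNORECASE)

def find_all(text):
    """Return every case-insensitive occurrence of the word."""
    return pattern.findall(text)

print(find_all("xzfdseWATERMELONgfi"))
print(find_all("vfvxWateRmelonfdndjfdfwatErmelon"))
print(find_all("odfkvokkmwater-melongbbjofg"))
```

The returned strings keep the capitalization found in the input, which makes it easy to see what was matched.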
REGEX FOR STRINGS THAT
CONTAIN A VARIETY OF
OPTIONS FOR A CHARACTER
If we need to validate a string in which one character can be any of several options, for example “a”, “b” or “c”, we can use square brackets [ ] and put the possibilities inside them.
So, the regular expression “/a[g45]bc/” has the following effects:
- “a4bc”, result: match
- “a45bc”, result: no match
- “a5bd”, result: no match
- “agbcdfvlkdfmlf”, result: match
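The character-set example can be checked with a short sketch, assuming Python's "re" module (pattern written without the slashes). The helper name has_match is illustrative.

```python
import re

# One character out of g, 4 or 5 must sit between "a" and "bc".
pattern = re.compile(r"a[g45]bc")

def has_match(text):
    return pattern.search(text) is not None

for s in ["a4bc", "a45bc", "a5bd", "agbcdfvlkdfmlf"]:
    print(s, "->", has_match(s))
```

The set consumes exactly one character, which is why “a45bc” fails: after “a4” the pattern expects “b”, not “5”.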
REGEX FOR STRINGS THAT
CONTAIN A VARIETY OF
OPTIONS FOR A CHARACTER
Just as we can list the options allowed for a character, we can also exclude options. In other words, we can list the characters that can’t be used. For that, we put a caret (^) right after the opening bracket, followed by the characters we don’t want to match.
So, the regular expression “/a[^123]h[12]/” has the following effects:
- “a4h1”, result: match
- “a1h3”, result: no match
- “a2h0”, result: no match
- “a2h2”, result: no match (the second character, “2”, is in the excluded set)
Note: We are using an exclude set (also called a negated set).
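A small sketch of the exclude set, assuming Python's "re" module. It runs /a[^123]h[12]/ against the sample strings; the extra string “a9h2” and the helper name has_match are illustrative additions.

```python
import re

# [^123] accepts any single character except 1, 2 or 3;
# [12] then requires a 1 or a 2 after the "h".
pattern = re.compile(r"a[^123]h[12]")

def has_match(text):
    return pattern.search(text) is not None

for s in ["a4h1", "a1h3", "a2h0", "a2h2", "a9h2"]:
    print(s, "->", has_match(s))
```

Running this shows that “a2h2” does not match, because its second character is one of the excluded digits.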
RANGES
Imagine we want the first character to be any lowercase letter of the alphabet. Our regular expression would look like this:
/[abcdefghijklmnopqrstuvwxyz]hello/
This regular expression takes a lot of space and is prone to human error. To make it shorter we can use ranges, which are written with a dash (-).
The equivalent of the previous expression is: /[a-z]hello/
“[a-z]” means the character can take any value from “a” to “z”.
Note: Ranges can be used in exclude sets too. Example: /[^a-d]hello/. As a consequence, “ahello”, “bhello”, “chello” and “dhello” won’t match.
In an earlier section, we used the insensitive flag (i) to include capital letters, but that flag affects the whole regular expression. What if we want this effect for only one character?
To solve this problem we don’t need the flag; we only need ranges.
Example: /[a-bA-B]oat/ matches these words: aoat, boat, Aoat, Boat.
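The three range patterns can be tried out with the following sketch, which assumes Python's "re" module. The test strings “xhello”, “Xhello”, “zhello” and “coat” are illustrative and not taken from the slides.

```python
import re

lower_then_hello = re.compile(r"[a-z]hello")    # any lowercase letter first
not_a_to_d = re.compile(r"[^a-d]hello")         # anything except a, b, c, d
mixed_case_oat = re.compile(r"[a-bA-B]oat")     # a, b, A or B, then "oat"

def has_match(pattern, text):
    return pattern.search(text) is not None

print(has_match(lower_then_hello, "xhello"))
print(has_match(lower_then_hello, "Xhello"))
print(has_match(not_a_to_d, "ahello"))
print(has_match(not_a_to_d, "zhello"))
print(has_match(mixed_case_oat, "Boat"))
print(has_match(mixed_case_oat, "coat"))
```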
REPETITIONS
REPEATING CHARACTERS
Now, imagine that we want to match a phone number. Here in Peru, phone numbers have 9 digits, each between 0 and 9, so our regular expression could look like this:
/[0-9][0-9][0-9][0-9][0-9][0-9][0-9][0-9][0-9]/
But this is too long; we can shorten it by using curly brackets (also called braces, {}).
Inside the curly brackets we put the number of repetitions for the preceding character.
So the equivalent would be: /[0-9]{9}/
The number of repetitions can vary too. If we want a word of between 4 and 5 letters, we can use a comma inside the braces to express the range of repetitions:
/[a-zA-Z]{4,5}/
If we don’t want an upper limit for the repetitions, we can leave it out, like this: /[a-zA-Z]{4,}/. With this, the minimum number of repetitions is 4, and the character can repeat as many more times as we want.
Although we can express a minimum of 1 character with /[a-zA-Z]{1,}/, there is an equivalent for that expression using the plus symbol (+):
/[a-zA-Z]+/
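The quantifiers above can be compared side by side in a short sketch, assuming Python's "re" module. The phone number and the letter strings are made-up sample data.

```python
import re

nine_digits = re.compile(r"[0-9]{9}")        # exactly 9 digits in a row
four_or_five = re.compile(r"[a-zA-Z]{4,5}")  # 4 to 5 letters
four_or_more = re.compile(r"[a-zA-Z]{4,}")   # 4 letters or more
one_or_more = re.compile(r"[a-zA-Z]+")       # same as {1,}

print(nine_digits.search("987654321") is not None)
print(nine_digits.search("12345678") is not None)

# Quantifiers are greedy: they take as many characters as allowed.
print(four_or_five.search("abcdefgh").group())
print(four_or_more.search("abcdefgh").group())

# findall() returns each run of letters separately.
print(one_or_more.findall("ab12cd"))
```

Because these patterns are not anchored, a longer string that merely contains nine digits in a row would also match the first pattern.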
METACHARACTERS
WHAT IS A METACHARACTER?

A character that has a special meaning during pattern processing.
“\d” METACHARACTER
This metacharacter matches digits (“d” for digit). For ASCII text it is the same as [0-9]; some engines also let it match digits from other scripts.
In other words, “/m[0-9]uch/” and “/m\duch/” do the same thing.

Note: It’s very important to use a backslash (\) to let the computer know that
you’re using the metacharacter “\d” and not the letter “d”. The same applies to the
rest of the metacharacters.
“\” is a special character like “+”, “.”, “[]”, “[^]” and “?”. We will see the rest of
them in the following section.
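A brief sketch comparing “\d” with “[0-9]”, assuming Python's "re" module. The pattern is written as a raw string (r"...") so that Python passes the backslash through to the regex engine unchanged; the sample words are illustrative.

```python
import re

with_class = re.compile(r"m[0-9]uch")
with_meta = re.compile(r"m\duch")

def same_result(text):
    """Report whether both patterns agree on this text, and the result."""
    a = with_class.search(text) is not None
    b = with_meta.search(text) is not None
    return a == b, a

print(same_result("m4uch"))   # a digit sits between "m" and "uch"
print(same_result("much"))    # no digit present
print(same_result("mduch"))   # the letter "d" is not a digit
```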
“\w” METACHARACTER
In this case, “w” refers to “word”, and this metacharacter matches any word character
(a-z, A-Z, 0-9, and the underscore “_”).
So “/m[0-9a-zA-Z_]uch/” and “/m\wuch/” do the same.
“\s” METACHARACTER
“s” comes from “space”; the “\s” metacharacter matches any single whitespace character, such as a space or a tab.
So, “/abadiel\s2014/” will match “abadiel 2014” whether the separator is a space, a tab, or another whitespace character.
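A minimal sketch of “\w” and “\s”, assuming Python's "re" module. The variants with a tab, with no separator and with two spaces are illustrative additions.

```python
import re

word_char = re.compile(r"m\wuch")
one_space = re.compile(r"abadiel\s2014")

def has_match(pattern, text):
    return pattern.search(text) is not None

print(has_match(word_char, "m_uch"))           # underscore is a word character
print(has_match(word_char, "m-uch"))           # hyphen is not
print(has_match(one_space, "abadiel 2014"))    # space
print(has_match(one_space, "abadiel\t2014"))   # tab
print(has_match(one_space, "abadiel2014"))     # no whitespace at all
print(has_match(one_space, "abadiel  2014"))   # two spaces: \s matches only one
```

To accept any amount of whitespace, “\s” can be combined with a quantifier, for example “\s+”.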
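SPECIAL CHARACTERS
“?” CHARACTER
If we use “?” after another character, it makes that character optional; in other
words, it can appear 0 or 1 times.
So, if we have /hello?/, then “hell” and “hello” will match. “helloo” also contains a match (its first five characters, “hello”); it is rejected only when the whole string must match, as in /^hello?$/ (the “^” and “$” anchors are covered in a later section).
“.” CHARACTER
The “.” character matches any single character except the newline character.
So, if we have /.+/, then “gjnnj51//-*+595_##/” will match.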
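The behavior of “?” and “.” can be sketched as follows, assuming Python's "re" module. re.fullmatch() is used where the whole string must match, and search() where a match anywhere is enough.

```python
import re

optional_o = re.compile(r"hello?")
any_chars = re.compile(r".+")

# Whole-string matching: the final "o" may appear 0 or 1 times.
print(optional_o.fullmatch("hell") is not None)
print(optional_o.fullmatch("hello") is not None)
print(optional_o.fullmatch("helloo") is not None)

# Searching inside "helloo" still finds the leading "hello".
print(optional_o.search("helloo").group())

# "." matches any character except a newline.
print(any_chars.fullmatch("gjnnj51//-*+595_##/") is not None)
print(any_chars.fullmatch("a\nb") is not None)
```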
KNOWN SPECIAL
CHARACTERS
“+” character
We explained it in an earlier section: it matches the preceding character 1 or more times.
“\” character
It’s also called the “escape character” and it enables us to use metacharacters. It also helps with matching characters like “/” and “.” that, as we saw, have different meanings and can’t be used literally.
“[]” character
We used it to create sets of possibilities for one character, like [a-zA-Z] for example.
“^” character
We used it to exclude options for a character in a set, like [^123].
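A short sketch of escaping, assuming Python's "re" module. It contrasts an unescaped “.” with an escaped one, and uses re.escape(), which builds a pattern that matches a piece of text literally. The sample strings are illustrative.

```python
import re

# Unescaped ".": any character may sit between "a" and "b".
print(re.search(r"a.b", "axb") is not None)

# Escaped "\.": only a literal dot is accepted.
print(re.search(r"a\.b", "axb") is not None)
print(re.search(r"a\.b", "a.b") is not None)

# "+" is special too, so the raw text "1+1=2" does not match itself
# unless it is escaped first.
print(re.search(r"1+1=2", "1+1=2") is not None)
print(re.search(re.escape("1+1=2"), "so 1+1=2 holds") is not None)
```

In Python the “/” character has no special meaning; it needs escaping only in languages that use slashes to delimit the pattern.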
“*” CHARACTER
Similar to “+”, but it also allows 0 repetitions of the preceding
character.
So, /a*/ matches “” (the empty string), “a”, and any longer run of “a” such as “aaaa”.
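A minimal sketch of “*” next to “+”, assuming Python's "re" module. re.fullmatch() is used so that the whole string has to fit the pattern; the sample strings are illustrative.

```python
import re

zero_or_more = re.compile(r"a*")
one_or_more = re.compile(r"a+")

for s in ["", "a", "aaaa", "ab"]:
    print(repr(s),
          zero_or_more.fullmatch(s) is not None,
          one_or_more.fullmatch(s) is not None)

# Because "*" accepts zero repetitions, an unanchored search always
# succeeds, possibly with an empty match.
print(repr(zero_or_more.search("xyz").group()))
```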
STARTING AND
ENDING PATTERNS
Sometimes we want our regular expression to apply only to the beginning of the text.
Until now, our regexes could match at any position inside the string.
For example: If we have /\w{4}/, then:
- “w_ter” matches
- “waternfkfkjngt” matches too
- “dfngijwaterfbjgfn” matches too
- “fvnkjdfnvjdfjnwater” matches too
But this can also be dangerous, because in the second example, even though only the first
four characters (“wate”) match the pattern, the whole string is accepted. When we need
the entire string to match, this becomes a problem.
STARTING PATTERN
That’s why it is necessary to establish starting and ending patterns, so we can mark where
the string must start and end.
To establish a starting pattern we use a caret (^) at the beginning of the expression. Outside square brackets the caret is an anchor; it means exclusion only as the first character inside [ ].
Then, the regular expression /^abcd/ will have these effects:
- “abcdfmvfdnivj”, result: match
- “dicifabcdnjf”, result: no match
- “cdbhjdbfdsabcd”, result: no match
Our regular expression now focuses on the beginning of the string.
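A small sketch of the starting pattern, assuming Python's "re" module. It runs /^abcd/ against the three sample strings; the helper name starts_with_abcd is illustrative.

```python
import re

starts_with = re.compile(r"^abcd")

def starts_with_abcd(text):
    return starts_with.search(text) is not None

for s in ["abcdfmvfdnivj", "dicifabcdnjf", "cdbhjdbfdsabcd"]:
    print(s, "->", starts_with_abcd(s))

# In Python, re.match() only tries the start of the string,
# so it behaves like a pattern that begins with "^".
print(re.match(r"abcd", "dicifabcdnjf") is not None)
```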
ENDING PATTERN
On the other hand, if we want our regular expression to focus on the end of the
string, we need an ending pattern. To establish it we use the dollar sign ($).
Then, /abcd$/ will have these effects:
- “abcdfmvfdnivj”, result: no match
- “dicifabcdnjf”, result: no match
- “cdbhjdbfdsabcd”, result: match
We can use both patterns at the same time, which forces the whole string to be matched
by our regular expression.
Then, /^abcd$/ will have these effects:
- “abcdfmvfdnivj”, result: no match
- “dicifabcdnjf”, result: no match
- “cdbhjdbfdsabcd”, result: no match
- “abcd”, result: match
Now our regular expression applies to the whole string.
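The ending pattern and the combined form can be sketched as follows, assuming Python's "re" module. The last lines show a Python-specific detail: “$” also matches just before a trailing newline, while re.fullmatch() does not allow it.

```python
import re

ends_with = re.compile(r"abcd$")
whole = re.compile(r"^abcd$")

for s in ["abcdfmvfdnivj", "dicifabcdnjf", "cdbhjdbfdsabcd", "abcd"]:
    print(s,
          ends_with.search(s) is not None,
          whole.search(s) is not None)

# Python detail: "$" tolerates one trailing newline, fullmatch() does not.
print(whole.search("abcd\n") is not None)
print(re.fullmatch(r"abcd", "abcd\n") is not None)
```

When an exact whole-string check is needed in Python, re.fullmatch() is one way to express it without writing the anchors.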
ALTERNATE
CHARACTERS
We learnt how to establish possibilities for a single character’s value; now we will see
how to establish possibilities for a substring or for the entire string.
Imagine that we are asking for either a phone number or a name. We can put those 2
possibilities in one regular expression by using a pipe (|), which means “or”.
It would look like this: /[a-zA-Z]{1,20}|[0-9]{9}/
The pipe separates the two possibilities. In this case, each possibility is a
complete pattern.
If we want to limit the options to one part of the pattern, parentheses are
useful.
Example: /^(orange|apple)-juice$/ matches both “orange-juice” and “apple-juice”.
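A minimal sketch of alternation, assuming Python's "re" module. The name “Maria”, the phone number and “grape-juice” are made-up sample data; re.fullmatch() is used for the first pattern so that the whole input must be either a name or a nine-digit number.

```python
import re

name_or_phone = re.compile(r"[a-zA-Z]{1,20}|[0-9]{9}")
juice = re.compile(r"^(orange|apple)-juice$")

# Whole-pattern alternation: either branch may match the full string.
print(name_or_phone.fullmatch("Maria") is not None)
print(name_or_phone.fullmatch("987654321") is not None)
print(name_or_phone.fullmatch("12345") is not None)

# Grouped alternation: only the fruit varies, "-juice" is fixed.
for s in ["orange-juice", "apple-juice", "grape-juice"]:
    m = juice.search(s)
    print(s, "->", m.group(1) if m else None)
```

The parentheses also capture the chosen alternative, which group(1) returns.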
