0% found this document useful (0 votes)

23 views36 pages

Regular Expressions and Languages

This document discusses regular expressions and finite automata. [1] It explains that regular expressions provide a declarative way to express string patterns, while finite automata are more machine-like in accepting or rejecting input strings. [2] It then reviews the basic operators of regular expressions: union, concatenation, and Kleene closure. [3] Examples are provided to illustrate how regular expressions define languages.

Uploaded by

Atik

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

23 views36 pages

Regular Expressions and Languages

Uploaded by

Atik

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 36

Regular Expressions and

Languages
1

4/11/2020
Regular Expressions vs. Finite Automata
2

⚫ Offers a declarative way to express the pattern of any string

we want to accept
⚪ E.g., 01*+ 10*

⚫ Automata => more machine-like

< input: string , output: [accept/reject] >
⚫ Regular expressions => more program syntax-like

⚫ Unix environments heavily use regular expressions

⚪ E.g., bash shell, grep, vi & other editors

⚫ Lexical analyzers such as Lex or Flex

4/11/2020
Regular Expressions
3

⚫ Operators of Regular Expressions

⚪ Review of three operations on languages L and M:
Union --- L∪M = {x | x∈L or x∈M}
Example: If L={001, 10, 111} and M={ε, 001} then
L∪M={ε, 10, 001, 111 }
Concatenation --- LM = {xy | x∈L, y∈M}
Example: If L={001, 10, 111} and M={ε, 001} then
LM={001, 10, 111, 001001, 10001, 111001 }

4/11/2020
Kleene Closure (the * operator)
4

⚫ Kleene Closure of a given language L:

⚪ L 0 = { ε}
⚪ L1= {w | for some w ∈ L}
⚪ L2= { w1w2 | w1 ∈ L, w2 ∈ L (duplicates allowed)}
⚪ Li= { w1w2…wi | all w’s chosen are ∈ L (duplicates allowed)}
⚪ (Note: the choice of each wi is independent)
⚪ L* = Ui≥0 Li (arbitrary number of concatenations)
Example:
⚫ Let L = { 1, 00}
0
⚪ L = { ε}
⚪ L1= {1,00}
⚪ L2= {11,100,001,0000}
⚪ L3= {111,1100,1001,10000,000000,00001,00100,0011}
⚪ L* = L0 U L1 U L2 U …

4/11/2020
Regular Expressions
5

Building Regular Expressions

⚪ Recursive definition of a regular expression
(RE) E and the language , L(E):
Basis:
Constants ε and φ are RE’s, defining languages {ε} and φ,
respectively ⇒ L(ε) = {ε}, L(φ) = φ.
If a is a symbol, then a is an RE, defining the language {a}
⇒ L(a) = {a}. (note: a is of bold face)

4/11/2020
Regular Expressions
6

Building Regular Expressions

⚪ Induction: given two RE’s E and F, then
E + F is an RE such that L(E + F) = L(E) ∪ L(F)
(union)
EF is an RE such that L(EF) = L(E)L(F)
(concatenation)
E * is an RE such that L(E*) = (L(E))* (closure)

(E) is an RE such that L((E)) = L(E)

(parenthesization).

4/11/2020
Regular Expressions
7

⚫ Examples
⚪ RE F = 1 “expresses” the language L(1) = {1}.

*
⚪ RE E = 1

Language expressed by E ---

L = L(E) = L(1*) = (L(1))* = ({1})*
(closure of language)
= {ε, 1, 11, 111, 1111, …}
= {1n | n ≥ 0}

4/11/2020
Regular Expressions
8

⚫ Examples
*
⚪ RE G = 01

Language expressed by G ---

L = L(G) = L(01*) = L(0)L(1*) (concatenation)
= {0}{ε, 1, 11, 111, 1111, …}
= {0, 01, 011, 0111, …}
= {01n | n ≥ 0}

4/11/2020
Regular Expressions
9

⚫ Examples
*
⚪ RE H = 1 + 01

Language expressed by H ---

L = L(H) = L(1 + 01*) = L(1) U L(01*)
= {1} U {0, 01, 011, 0111, …}
= {1, 0, 01, 011, 0111, …}
= {1}U{01n | n ≥ 0}

4/11/2020
Regular Expressions
10

⚫ Examples
⚪ RE K = ε + a*
Language expressed by K ---
L = L(K) = L(ε + a*) = L(ε ) U L(a*)
= {ε} U {ε, a, aa, aaa, …}
= {ε, a, aa, aaa, …}
= L(a*)
That is, we have the following RE equalities:

ε + a* = a* = a* + ε

4/11/2020
Regular Expressions
11
⚫ Example
⚪ A RE defining a language of strings of alternating 0’s and 1’s
(including none) is one of the two below:

(01)* + (10)* + 0(10)* + 1(01)*

(0…1 1…0 0…0 1…1)
(ε + 1)(01)*(ε + 0)

4/11/2020
Regular Expressions
12

Precedence of RE operators
⚪ Precedence
*
Highest --- (closure)
Next--- . (concatenation) (left to right)
Last--- + (union) (left to right)
Use parentheses anywhere to resolve ambiguity

4/11/2020
Regular Expressions
13

Precedence of RE operators
⚪ Example

Three ways to interpret 01* + 1:

(0(1*)) + 1 by precedence above (= 01* + 1)
(01)* + 1 (another meaning)
0(1* + 1) (a third meaning)

4/11/2020
FA’s & RE’s
14

⚫ Important Theorems
⚪ Every language defined by a DFA is also defined by an RE.

⚪ Every language defined by an RE is also defined by an ε-NFA.

4/11/2020
FA’s & RE’s
15

ε-NFA NFA

RE DFA

4/11/2020
FA’s & RE’s
16

From DFA’s to RE’s

If L = L(A) for some DFA A, then there is an RE
R such that L = L(R).
Prove by constructing progressively string sets
defined by a certain RE form Rij(k) until the entire set
of acceptable strings (i.e., language L(A)) is
obtained.
Assume the states are {1, 2, ..., n} (1 is the start
state).

4/11/2020
FA’s & RE’s
17

⚫ Meaning of Rij(k) ---

⚪ Rij(k) is a regular expression

Language is set of strings w
w is the label of a path from state i to state j of DFA
A
The path has no intermediate state greater than k

4/11/2020
FA’s & RE’s
18

⚫ Meaning of Rij(k) ---

⚪ T construct Rij(k), we use induction, starting at k = 0
and stop at k = n (the largest state number).

Then, when k = n, i =1, and j specifies an accepting

state, then Rij(k) defines a set of strings accepted by
DFA A, with each string forming a path starting
from the start state to the accepting state.

4/11/2020
FA’s & RE’s
19

(k)
⚫ Meaning of R ---
ij
Basis:
⚪ when k = 0, since all state numbers ≥ 1, and so there is no
intermediate state in path i to j;
2 cases to consider:
(1) an arc (a transition) from i to j;
(2) a path from i to i itself.

4/11/2020
FA’s & RE’s
20

(k)
⚫ Meaning of R ---
ij

Basis (cont’d):
⚪ If i ≠ j, only (case 1) is possible:
no symbol for such a transition ⇒ Rij(0) = φ
one symbol a for the transition ⇒ Rij(0) = a
multiple symbls a1, a2, ..., am for the transition,
• ⇒ Rij(0) = a1 + a2 + ... + am

4/11/2020
FA’s & RE’s
21
(k)
⚫ Meaning of R ---
ij

Basis (cont’d) i ≠ j: qi qj

Rij(0) = φ
a
qi qj
Rij(0) = a
a1+…+am
(0)
qi qj
Rij = a1 + a2 + ... + am

4/11/2020
FA’s & RE’s
22
(k)
⚫ Meaning of R ---
ij
⚪ If i = j, only (case 2) is possible, which means there
exists at least a path ε from i to i itself, in addition to
the 3 cases:
no symbol for such a transition ⇒ Rij(0)= ε
one symbol a for the transition ⇒ Rij(0)= ε + a
multiple symbls a1, a2, ..., am for the transition,
• ⇒ Rij(0) = ε + a1 + a2 + ... + am

4/11/2020
FA’s & RE’s
(k) 23
⚫ Meaning of R ---
ij ε

Basis (cont’d) i = j: qi

Rij(0) = ε ε+a

Rij(0) = ε + a qi

Rij(0) = ε + a1 + a2 + ... + am ε+a1+…+am

4/11/2020
FA’s & RE’s
24

R
Induction (to compute ij(k) ):
⚪ Suppose there is a path from i to j that goes through no state
numbered higher than k. Then, two cases should be
considered:
(1) the path does not go through k ⇒ Rij(k-1)
(2) the path goes through k at least once, then the path may be
broken into 3 pieces:
through i to k without passing k ⇒ Rik(k-1)
from k to k itself ⇒ (Rkk(k-1))* (recusive);
from k to j without passing k ⇒ Rkj(k-1).

4/11/2020
FA’s & RE’s
25

⚫ Illustration of paths represented by Rij(k) :

(Rkk(k-1))*
…… circulating zero
or more times

i … k … j
Rik (k-1)
Rkj(k-1)

4/11/2020
FA’s & RE’s
26

Induction (cont’d):

⚪ The three pieces are concatenated to be

Rik(k-1)(Rkk(k-1))*Rkj(k-1).

⚪ Combining (1) & (2), we get the RE defining “all the paths from
i to j that go through no state higher than k” as
Rij(k) = Rij(k-1) + Rik(k-1)(Rkk(k-1))*Rkj(k-1).

4/11/2020
FA’s & RE’s
27

⚫ Example
⚪ Convert the following DFA into an RE.
1 0, 1

0
start 1 2
⚪ Rij(0) may be constructed to be (details in the next page):

R11(0) ε+1
R12(0) 0
R21(0) φ
R22(0) (ε + 0 + 1)
4/11/2020
FA’s & RE’s
28

⚫ Example
1 0, 1

0
start 1 2

⚪ R11(0) = ε + 1 because δ(1, 1) = 1 & going back to itself

⚪ R12(0) = 0 because δ(1, 0) = 2 (going out to state 2)
⚪ R21(0) = φ because there is no path from state 2 to 1
⚪ R22(0) = (ε + 0 + 1) because δ(2, 0) = 2 & δ(2, 1) = 2 &
going back to itself

4/11/2020
FA’s & RE’s
29

⚫ Example (cont’d) 0, 1
1

0
start 1 2

⚪ We can then compute all Rij(k) for k=1 & k=2.

⚪ However, we may alternatively compute only necessary

terms of Rij(k) backward from the final states, to save time.

4/11/2020
FA’s & RE’s
30

⚫ Example (cont’d)
⚪ There is only one final state 2, so only have to compute
R12(2) = R12(1) + R12(1)(R22(1))*R22(1).

⚪ Only have to compute R12(1) and R22(1), without computing

R21(1) and R11(1).

⚪ To compute each of these terms, we need some RE equalities

to simplify intermediate results.

4/11/2020
FA’s & RE’s
31

⚫ Some equalities (R is an RE):

1. φR=Rφ=φ (φ=annihilator for concatenation)

2. φ + R = R + φ = R (φ=identity for union)

3. εR = Rε = R (ε = identity for concatenation)

4. (ε + a)* = a* = (a + ε)*

5. (ε + a)a* = (εa* + aa) = a + a+ = a*

6. a(ε + a) = (aε + aa) = a + a+ = a*

(all provable by easy deduction)

4/11/2020
FA’s & RE’s
32

⚫ To compute
R12(2) = R12(1) + R12(1)(R22(1))*R22(1)
⚪ R12(1) = R12(0) + R11(0)(R11(0))*R12(0)
= 0 + (ε + 1)(ε + 1)*0 (by substitutions)
= 0 + (ε + 1)1*0 (by 4. in last slide )
= 0 + 1* 0 (by 5.)
= (ε + 1*)0 (by distributive law)
= 1*0 (by 4.)
4/11/2020
FA’s & RE’s
33

⚫ To compute
R12(2) = R12(1) + R12(1)(R22(1))*R22(1)
⚪ R22(1) = R22(0) + R21(0)(R11(0))*R12(0)
= (ε + 0 + 1) + φ(ε + 1)*0 (by substitutions)
= (ε + 0 + 1) + φ (by 1.)
=ε+0+1 (by 2.)

4/11/2020
FA’s & RE’s
34

⚫ To compute
R12(2) = R12(1) + R12(1)(R22(1))*R22(1)
⚪ Finally, R12(2)
= 1*0 +1*0(ε + 0 + 1)*(ε + 0 + 1) (by subst.)
= 1*0 +1*0(0 + 1)*(ε + 0 + 1) (by 4.)
= 1*0 +1*0(0 + 1)* (by 6.)
=1*0(ε + (0 + 1)*) (by distributive law)
= 1*0(0 + 1)* (by 4.)

4/11/2020
FA’s & RE’s
35

⚫ Check the correctness of the final result

R12(2) = 1*0(0 + 1)*

1 0, 1

0
start 1 2

It is a language that begins with zero or more 1’s

than have a zero and then any string of zero’s
and 1’s
correct (by looking at the diagram directly)!
4/11/2020
Acknowledgement
36

⚫ Tania Akter Setu

⚫ Lecturer, Dept. of CSE , UITS

4/11/2020

Class 13 Rij Equation Method To Convert DFA To RE
No ratings yet
Class 13 Rij Equation Method To Convert DFA To RE
27 pages
FLAT-Regular Expression and Language
No ratings yet
FLAT-Regular Expression and Language
69 pages
Flat-Unit-2 Notes
No ratings yet
Flat-Unit-2 Notes
23 pages
C2 Onward - FA and RL
No ratings yet
C2 Onward - FA and RL
109 pages
Unit-2 RL FA - All Topics
No ratings yet
Unit-2 RL FA - All Topics
140 pages
Regular Expression
No ratings yet
Regular Expression
106 pages
Toc 2
No ratings yet
Toc 2
26 pages
Lecture 5 - Regular Expressions
No ratings yet
Lecture 5 - Regular Expressions
35 pages
Unit 2
No ratings yet
Unit 2
35 pages
Toc U2ppt
No ratings yet
Toc U2ppt
41 pages
Wa0014.
No ratings yet
Wa0014.
85 pages
Bengal College of Engineering & Technology: Regular Expressions
No ratings yet
Bengal College of Engineering & Technology: Regular Expressions
12 pages
Solved IA2 QP
No ratings yet
Solved IA2 QP
11 pages
ch2 Engineering
No ratings yet
ch2 Engineering
78 pages
Chapter 2 REGULAR EXPRESSION
No ratings yet
Chapter 2 REGULAR EXPRESSION
26 pages
FLAT Lec - 3
No ratings yet
FLAT Lec - 3
34 pages
Theory of Computation: Sathyabama
No ratings yet
Theory of Computation: Sathyabama
92 pages
Chapter 2 RegularExpressions
No ratings yet
Chapter 2 RegularExpressions
95 pages
Regular Expressions (Re) : Res: Formal Definition
No ratings yet
Regular Expressions (Re) : Res: Formal Definition
12 pages
4 Reg Ex
No ratings yet
4 Reg Ex
26 pages
Unit 2-Theory of Computation
No ratings yet
Unit 2-Theory of Computation
44 pages
Regular Expressions
No ratings yet
Regular Expressions
22 pages
Regular Expression
No ratings yet
Regular Expression
18 pages
UNIT-2 2024 Theory of Automata and Formal Languages AKTU University
No ratings yet
UNIT-2 2024 Theory of Automata and Formal Languages AKTU University
13 pages
Lesson 4
No ratings yet
Lesson 4
18 pages
07 RegLangClosureProperties
No ratings yet
07 RegLangClosureProperties
23 pages
FLAT - UNIT-2 - Question Bank - V2
No ratings yet
FLAT - UNIT-2 - Question Bank - V2
10 pages
Regular Expressions: Definitions Equivalence To Finite Automata
No ratings yet
Regular Expressions: Definitions Equivalence To Finite Automata
29 pages
Unit II Regular Expression
No ratings yet
Unit II Regular Expression
176 pages
Module 2flat
No ratings yet
Module 2flat
26 pages
MITWPU - Unit 2-Theory of Computation
No ratings yet
MITWPU - Unit 2-Theory of Computation
50 pages
Unit 4: Regular Expressions
No ratings yet
Unit 4: Regular Expressions
52 pages
TOC Module-2 Notes
No ratings yet
TOC Module-2 Notes
24 pages
Theory of Automata - Solved Assignments - Semester Spring 2010
74% (38)
Theory of Automata - Solved Assignments - Semester Spring 2010
33 pages
Flat Unit 2 - 17.9.20
No ratings yet
Flat Unit 2 - 17.9.20
22 pages
Regular Expressions: Reading: Chapter 3
No ratings yet
Regular Expressions: Reading: Chapter 3
39 pages
CH 3 - Regular Languages Amd Regular Grammars
No ratings yet
CH 3 - Regular Languages Amd Regular Grammars
67 pages
WINSEM2024-25 BCSE304L TH VL2024250501632 2025-02-12 Reference-Material-I
No ratings yet
WINSEM2024-25 BCSE304L TH VL2024250501632 2025-02-12 Reference-Material-I
13 pages
Toc Unit-2
No ratings yet
Toc Unit-2
109 pages
Regular Expressions
No ratings yet
Regular Expressions
34 pages
Regular Expression: Operations On Regular Language
No ratings yet
Regular Expression: Operations On Regular Language
33 pages
Regular-Expressions: LECT-2
No ratings yet
Regular-Expressions: LECT-2
12 pages
Formal Languages & Finite Theory of Automata: BS Course
No ratings yet
Formal Languages & Finite Theory of Automata: BS Course
33 pages
2.1regular Expression-UNIT - II
No ratings yet
2.1regular Expression-UNIT - II
31 pages
Unit Ii Regular Expressions and Languages: 2.1.1. Definition
No ratings yet
Unit Ii Regular Expressions and Languages: 2.1.1. Definition
31 pages
Automata
No ratings yet
Automata
11 pages
05 Handout 1
No ratings yet
05 Handout 1
3 pages
Automata Chapter 2
No ratings yet
Automata Chapter 2
15 pages
Module 2 Part 1
No ratings yet
Module 2 Part 1
7 pages
Regular Expressions (RE) 3.1
100% (3)
Regular Expressions (RE) 3.1
53 pages
Kleene
No ratings yet
Kleene
6 pages
Chapter 2 RegularExpressions
No ratings yet
Chapter 2 RegularExpressions
95 pages
CS351 Regular Expressions
No ratings yet
CS351 Regular Expressions
14 pages
CS402 Short Notes: For More Visit
No ratings yet
CS402 Short Notes: For More Visit
39 pages
Regular Expressions and Languages
No ratings yet
Regular Expressions and Languages
16 pages
2.4 E-Nfa To Nfa Conversion
No ratings yet
2.4 E-Nfa To Nfa Conversion
19 pages
Unit-Ii Regular Expressions and Languages Definition
No ratings yet
Unit-Ii Regular Expressions and Languages Definition
34 pages
R20-Atcd-Q.p - Model Paper.
100% (1)
R20-Atcd-Q.p - Model Paper.
3 pages
Unit 1 - Finite Automata
100% (1)
Unit 1 - Finite Automata
91 pages
Discrete Mathematics: Second Edition
No ratings yet
Discrete Mathematics: Second Edition
11 pages
Arid Agriculture University, Rawalpindi: (Theory)
No ratings yet
Arid Agriculture University, Rawalpindi: (Theory)
6 pages
Automata & Complexity Theory Cosc3025: Chapter One
100% (1)
Automata & Complexity Theory Cosc3025: Chapter One
27 pages
Theory of Computation: Course Outcomes: On Completion of The Course, The Students Will Be
No ratings yet
Theory of Computation: Course Outcomes: On Completion of The Course, The Students Will Be
9 pages
Regular Expressions
No ratings yet
Regular Expressions
30 pages
Kleene Closure
No ratings yet
Kleene Closure
6 pages
CS402 Final Term Solved SUBJECTIVE by JUNAID
No ratings yet
CS402 Final Term Solved SUBJECTIVE by JUNAID
59 pages
Act CH 1
No ratings yet
Act CH 1
23 pages
Theory of Computer Science
No ratings yet
Theory of Computer Science
3 pages
Automata and Complexity Theory AssignmentE
0% (1)
Automata and Complexity Theory AssignmentE
2 pages
Csci3255 HW 3
67% (3)
Csci3255 HW 3
5 pages
Unit 1
No ratings yet
Unit 1
45 pages
CPSC 388 - Compiler Design and Construction: Scanner - Regular Expressions To DFA
No ratings yet
CPSC 388 - Compiler Design and Construction: Scanner - Regular Expressions To DFA
23 pages
Lecture 07 Pushdown, CFG
No ratings yet
Lecture 07 Pushdown, CFG
28 pages
Automata 3 Slide
No ratings yet
Automata 3 Slide
30 pages
Semester III
No ratings yet
Semester III
25 pages
B.Tech II Year CSE R20 Syllabus
No ratings yet
B.Tech II Year CSE R20 Syllabus
35 pages
AY21 22 Computer Syllabus Final
No ratings yet
AY21 22 Computer Syllabus Final
220 pages
Pdf24 Merged
No ratings yet
Pdf24 Merged
54 pages
13-08-24 CoursePack - Theory - of - Computation R1UC501T
No ratings yet
13-08-24 CoursePack - Theory - of - Computation R1UC501T
11 pages
Unit I Flat LM Cse
No ratings yet
Unit I Flat LM Cse
31 pages
Regular Expressions To Finite Automata: - High-Level Sketch
No ratings yet
Regular Expressions To Finite Automata: - High-Level Sketch
32 pages
R18 B.tech 3-1 CSE Syllabus
No ratings yet
R18 B.tech 3-1 CSE Syllabus
34 pages
5th Sem Syllabus
No ratings yet
5th Sem Syllabus
13 pages
ALC Prev - 2023
No ratings yet
ALC Prev - 2023
2 pages
Syllabus (TOC)
No ratings yet
Syllabus (TOC)
1 page
Differential Calculus and Its Applications
From Everand
Differential Calculus and Its Applications
Michael J. Field
2.5/5 (6)
Laplace Transforms Essentials
From Everand
Laplace Transforms Essentials
Morteza Shafii-Mousavi
3.5/5 (3)
Multiple Integrals, A Collection of Solved Problems
From Everand
Multiple Integrals, A Collection of Solved Problems
Steven Tan
No ratings yet

Regular Expressions and Languages

Uploaded by

Regular Expressions and Languages

Uploaded by

Regular Expressions and

⚫ Offers a declarative way to express the pattern of any string

⚫ Automata => more machine-like

⚫ Unix environments heavily use regular expressions

⚫ Lexical analyzers such as Lex or Flex

⚫ Operators of Regular Expressions

⚫ Kleene Closure of a given language L:

Building Regular Expressions

Building Regular Expressions

(E) is an RE such that L((E)) = L(E)

Language expressed by E ---

Language expressed by G ---

Language expressed by H ---

(01)* + (10)* + 0(10)* + 1(01)*

Three ways to interpret 01* + 1:

⚪ Every language defined by an RE is also defined by an ε-NFA.

From DFA’s to RE’s

⚫ Meaning of Rij(k) ---

⚪ Rij(k) is a regular expression

⚫ Meaning of Rij(k) ---

Then, when k = n, i =1, and j specifies an accepting

Rij(0) = ε + a1 + a2 + ... + am ε+a1+…+am

⚫ Illustration of paths represented by Rij(k) :

⚪ The three pieces are concatenated to be

⚪ R11(0) = ε + 1 because δ(1, 1) = 1 & going back to itself

⚪ We can then compute all Rij(k) for k=1 & k=2.

⚪ However, we may alternatively compute only necessary

⚪ Only have to compute R12(1) and R22(1), without computing

⚪ To compute each of these terms, we need some RE equalities

⚫ Some equalities (R is an RE):

2. φ + R = R + φ = R (φ=identity for union)

3. εR = Rε = R (ε = identity for concatenation)

5. (ε + a)a* = (εa* + aa*) = a* + a+ = a*

6. a*(ε + a) = (a*ε + a*a) = a* + a+ = a*

(all provable by easy deduction)

⚫ Check the correctness of the final result

It is a language that begins with zero or more 1’s

⚫ Tania Akter Setu

You might also like

5. (ε + a)a* = (εa* + aa) = a + a+ = a*

6. a(ε + a) = (aε + aa) = a + a+ = a*