0% found this document useful (0 votes)

24 views4 pages

Regex - Regular Expression

1) The document summarizes a lecture on regular expressions and finite state automata. It discusses precedence rules for regular expressions, equivalence of regular expressions, and introduces nondeterministic finite state automata (NFA). 2) An NFA is presented that recognizes strings ending in "babb" using fewer states than a deterministic finite automaton. The subset construction algorithm is described to convert an NFA to an equivalent deterministic finite automaton (DFA). 3) Closure properties of regular languages are discussed. An example finite state automaton is constructed that recognizes strings containing "aaa" and an even number of b's.

Uploaded by

Sakib Jobaid

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

24 views4 pages

Regex - Regular Expression

Uploaded by

Sakib Jobaid

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

CSC 236 H1F Lecture Summary for Week 11 Fall 2015

Regular Expressions (Continued)

Precedencse Rules: The following conventions allow us to simplify our regexps considerably without introduc-
ing ambiguity:

• We leave out the outermost pair of parantheses. E.g., (0 + 1)(11)∗ is an abbreviation of ((0 + 1)(11)∗ ).

• Star operator has precedence over all operators. E.g., RS ∗ is an abbreviation of R(S ∗ )

• Concatenation has precedence over union. E.g., RS ∗ + T is an abbreviation of ((RS ∗ ) + T )

• When the same binary operator is applied several times in a row, we can leave out the parantheses and
assume the grouping is to the right. E.g., 11 + 01 + 10 + 11 is an abbreviation of (11 + (01 + (10 + 11)))

Equivalence of Regexps: Regexps R and S are equivalent (denoted R ≡ S) iff they represent the same
language (i.e., L(R) = L(S)), e.g., b∗ a(a + b)∗ ≡ (a + b)∗ ab∗ .

Theorem 1. The general regexps R, S and T , the following equivalences hold:

• Comutativity of union: R + S ≡ S + R

• Associativity of union: (R + S) + T ≡ R + (S + T )

• Associativity of concatenation: (RS)T ≡ R(ST )

• Left distributivity: R(S + T ) ≡ RS + RT

• Right distrbutivity: (S + T )R ≡ SR + T R

• Identity for union: R + {} ≡ R

• Identity for concatenation: R ≡ R ≡ R

• Annihilator for concatenation: {}R ≡ {} ≡ R{}

∗
• Idempotence of Kleene star: R∗ ≡ R∗

Example 1. We prove that L(b∗ a(a + b)∗ ) = L = {all strings of a’s and b’s that contain at least one a}, by
showing double inclusion (standard technique for proving set equality).
Intuition: Stating L(b∗ a(a + b)∗ ) = L amounts to making two separate claims.

1. Every string in L(b∗ a(a + b)∗ ) has at least one a (i.e., RE pattern does not include bad strings)

2. Every string with at least one a belongs to L(b∗ a(a + b)∗ ) (i.e., RE pattern includes every good string).

Proof. Now, let’s prove both parts.

1. (L(b∗ a(a + b)∗ ) subset of L): Let s be an arbitrary string in L(b∗ a(a + b)∗ ). This means s = t ◦ u ◦ v for some
strings t ∈ L(b∗ ), u ∈ L(a), and v ∈ L((a + b)∗ ). Since there is only one string a ∈ L(a), u = a so s = t ◦ a ◦ v and
s is a string that contains at least one a, so s ∈ L.
2. (L subset of L(b∗ a(a + b)∗ )): Let s be an arbitrary string in L. This means that s contains at least one a, so
it contains a first occurrence of a and can be broken up into three substrings: s = r ◦ a ◦ t, where r is some string
that contains no a (maybe empty), a is the first occurrence of a in s, and t is some string of a’s and b’s. But then,
r ∈ L(b∗ ), a ∈ L(a), and t ∈ L((a + b)∗ ) so by definition, s = r ◦ a ◦ t is in L(b∗ a(a + b)∗ ).

Remark. See textbook for other detailed examples.

Dept. of Computer Science, University of Toronto, St. George Campus Page 1 of 4

CSC 236 H1F Lecture Summary for Week 11 Fall 2015

Nondeterministic Finite State Automata (NFA or NFSA)

Assume that you want to construct a DFA that accepts the following language

L = {s ∈ {a, b}∗ : s ends with babb}

The DFA for this language must remember the last 4 symbols processed. As we saw in our tutorial, this requires all
possible combinations of the last 4 characters (16 of them). We should consider all possible combinations because
in a DFA, a given state and the current input symbol uniquely determines the next state of the automaton. It
is for this reason that such automata are called deterministic. But if we remove the determinism constraint, the
following FSA accepts L:

a, b

q0 b q1 a q2 a q3 a q4
start

Figure 1:

Remark. Note that if a string does not end with babb, then every attempt to follow transition out of q0 (in Figure
1) ends up in empty set of states (one of the transitions won’t work).
Notice the simplicity of FSA in Figure 1. Such properties have lead to the definition of a variant of finite state
automata, called nondeterministic finite state automata (NFA or NFSA). In these FSAs, given the current state,
when the automaton reads an input symbol a, there may be several states to which it may go next (hence the
nondeterminism).
NFA or NFSA: A nondeterministic finite state automaton is a quintuple (Q, Σ, q0 , F, δ), where Q is a fixed,
finite, non-empty set of states. Σ is a fixed (finite, non-empty) alphabet (Q ∩ Σ = {}). q0 ∈ Q is the initial state.
F ⊆ Q is the set of accepting (“final”) states. δ : Q × (Σ ∪ {}) → P(Q) is a transition function (i.e., δ(q, a) is the
set of next states of the NFA when processing symbol a from state q)
Note: P(Q) is the power set of Q.
We can see that the definition of the NFA contains transitions like δ(q, ). These transitions are called spon-
taneous state transition or -transition, in which the NFA makes a transition from the current state to the next
state without reading any input symbol. The NFA can be defined without the introduction of -transitions by
extending the defintion of initial state to a set of states rather than a state. However, -transitions will allow us
to simplify our notations and arguments in some cases (e.g., when we talk about closure properties).
Remark. The power of NFA is that, by definition, NFA accepts a string iff set of states reached at the end contains
at least one accepting state. It is like saying that NFA has unlimited parallelism.
Subset Construction: Given a NFA M = (Q, Σ, q0 , F, δ), we can construct a DFA M 0 = (Q0 , Σ, q00 , F 0 , δ 0 ) that
accepts the same language as M as follows:
• Q0 = P (Q)
• q00 = E(q0 ) (i.e., the set of all states reachable from the initial state of the given NFSA via -transitions only)
• F 0 = {q 0 ∈ Q0 : q 0 ∩ F 6= ∅} (i.e., all states that contain an accepting state of the given NFSA)
• For any q 0 ∈ Q and a ∈ Σ, δ 0 (q 0 , a) = ∪qx ∈q0 ∪qy ∈δ(qx ,a) E(qy ) where E(qy ) is the set of states reachable from

qy following any number of transitions.

This construction is called the subset construction, because each state of M 0 is a set of states of M

Dept. of Computer Science, University of Toronto, St. George Campus Page 2 of 4

CSC 236 H1F Lecture Summary for Week 11 Fall 2015

Example 2. Consider the following NFA:

a b

, a
start q0 q1

Figure 2: NFA corresponding to regexp a∗ b∗

The corresponding DFA using the subset construction is:

a b

q0 q1 b q1
start

Figure 3: Correponding DFA of NFA in Figure 2

Remark. Although NFA may introduce unlimited parallelism. But it is not a practical model!

Closure properties

Let’s construct a FSA that accepts the language

L = {s ∈ {a, b}∗ : s contains three a’s in a row and an even number of b’s }

Another way to express L is to say

L = {s : s contains aaa} ∩ {s : s contains even many b’s}

Each of these sub-languages correspond to a FSA as follows:

b a, b
a a
q0 q1 q2 a q3
start b

Figure 4: FSA for {s : s contains aaa} (FSA1)

Dept. of Computer Science, University of Toronto, St. George Campus Page 3 of 4

CSC 236 H1F Lecture Summary for Week 11 Fall 2015

a a
b
start q0 q1

Figure 5: FSA for {s : s contains even many b’s} (FSA2)

Now, let’s try to combine the states in FSA1 and FSA2 so that the resulting states can track the states in
both of the aforementioned FSAs at the same time. The resulting FSA will look like follows (qxy is a state that
represents state qx in FSA1 and state qy in FSA2):

q00 a q10 a q20 a q30

start
b b b
b b b b
b
q01 a q11 a q21 a q31

Figure 6: FSA for {s : s contains even many b’s} (FSA2)

It should be obvious now that the only accepting state in this FSA should be q30 in which we have seen three
a’s and even number of b’s.
The aforementioned example demonstrates a powerful design technique by which we can combine FSAs that
accept languages to obtain an FSA that accepts the resulting language of the combination.

Closure Property: Let R and S represent two languages that are accepted by FSAR and FSAS respectively. If
an operation that is applied to R and S results in a language T for which there exists a FSA (FSAT ) that decides
language T , we say that the class of languages accepted by FSA is closed under this operation

Theorem 2. The class of languages that are accepted by FSA is closed under complementation, union, in-
tersection, concatenation and the Kleene star operation. In other words, if L and L0 are languages that are
accepted by FSA, then so are all of the following: L̄, L ∩ L0 , L ∪ L0 , L ◦ L0 and L~ .

Regular Languages
Theorem 3. Let L be a language. The following statements are equivalent:

1. L = L(A) for some NFA A

2. L = L(A0 ) for some DFA A0

3. L = L(R) for some regexp R

We are not going to prove this theorem. However, we are going to talk about the main ideas of the proof. You
can look at Sections 7.4.2 and Sections 7.6 in the textbook for a foraml treatment of this theorem.

Dept. of Computer Science, University of Toronto, St. George Campus Page 4 of 4

Website Project Specification Template
100% (2)
Website Project Specification Template
6 pages
The Evolution of Internet Services PDF
No ratings yet
The Evolution of Internet Services PDF
12 pages
NEMO-Q Software Installation Manual
33% (3)
NEMO-Q Software Installation Manual
36 pages
Finite Automata: Part Three
No ratings yet
Finite Automata: Part Three
51 pages
Non Deterministic Finite Automata (NFA)
No ratings yet
Non Deterministic Finite Automata (NFA)
26 pages
ch2 Engineering
No ratings yet
ch2 Engineering
78 pages
Home Work For Automata Theory
No ratings yet
Home Work For Automata Theory
4 pages
Lex Analysis
No ratings yet
Lex Analysis
13 pages
Slides4week2 FA+REX
No ratings yet
Slides4week2 FA+REX
43 pages
Automata
No ratings yet
Automata
11 pages
1 PDF
No ratings yet
1 PDF
58 pages
CSC236 A6
No ratings yet
CSC236 A6
6 pages
Comp 416 L4 R
No ratings yet
Comp 416 L4 R
14 pages
CSCI 3313-10: Foundation of Computing: 1.1 Mathematical Notations and Terminologies
No ratings yet
CSCI 3313-10: Foundation of Computing: 1.1 Mathematical Notations and Terminologies
50 pages
Flat CH 2
No ratings yet
Flat CH 2
86 pages
Lecture 3 Lexical Analyzer
No ratings yet
Lecture 3 Lexical Analyzer
44 pages
TAFL Unit 1 - Basic Concepts and Automata Theory - Detailed Notes
No ratings yet
TAFL Unit 1 - Basic Concepts and Automata Theory - Detailed Notes
13 pages
02 Automata
No ratings yet
02 Automata
78 pages
Final Revision FLAT
No ratings yet
Final Revision FLAT
22 pages
Slides 3
No ratings yet
Slides 3
34 pages
Finite Automata
No ratings yet
Finite Automata
30 pages
Formal Languages, Automata and Computability
No ratings yet
Formal Languages, Automata and Computability
51 pages
Complierdesign Operatingsonlanguagesrefiniteautomata 240920162828 5f5b45f9
No ratings yet
Complierdesign Operatingsonlanguagesrefiniteautomata 240920162828 5f5b45f9
16 pages
Regular Languages and Finite State Automata
No ratings yet
Regular Languages and Finite State Automata
15 pages
Formal Languages, Automata and Computability: (For Next Time: Read Chapter 1.3 of The Book)
No ratings yet
Formal Languages, Automata and Computability: (For Next Time: Read Chapter 1.3 of The Book)
56 pages
Theory of Automata Notes
No ratings yet
Theory of Automata Notes
29 pages
Automata 5
No ratings yet
Automata 5
33 pages
Regular Expressions
No ratings yet
Regular Expressions
34 pages
CS372 Formal Languages & The Theory of Computation
No ratings yet
CS372 Formal Languages & The Theory of Computation
29 pages
Answer Fo Auomata
No ratings yet
Answer Fo Auomata
61 pages
Non Deterministic Finite Automata
No ratings yet
Non Deterministic Finite Automata
37 pages
Theory of Computation: Sathyabama
No ratings yet
Theory of Computation: Sathyabama
92 pages
FLAT - Ch.2
No ratings yet
FLAT - Ch.2
86 pages
Compilation Techniques
No ratings yet
Compilation Techniques
21 pages
03 Toc
No ratings yet
03 Toc
35 pages
Nondeterministic Finite Automata: Nondeterminism Subset Construction ε-Transitions
No ratings yet
Nondeterministic Finite Automata: Nondeterminism Subset Construction ε-Transitions
35 pages
3B-Formal Languages
No ratings yet
3B-Formal Languages
24 pages
Lecture 2
No ratings yet
Lecture 2
21 pages
CH 2 Part 2 - Non-Deterministic Finite Automata
No ratings yet
CH 2 Part 2 - Non-Deterministic Finite Automata
32 pages
Chapter 3 Regular Expression
No ratings yet
Chapter 3 Regular Expression
25 pages
Unit 4: Regular Expressions
No ratings yet
Unit 4: Regular Expressions
52 pages
Regular Anguage
No ratings yet
Regular Anguage
38 pages
Tcs Theory Notes by Kamal Sir
No ratings yet
Tcs Theory Notes by Kamal Sir
24 pages
Chapter 1&2
No ratings yet
Chapter 1&2
26 pages
Toc U2ppt
No ratings yet
Toc U2ppt
41 pages
105 SubsetConst
No ratings yet
105 SubsetConst
19 pages
Ch-2 Equivalence of FA & NFA With - Transition
No ratings yet
Ch-2 Equivalence of FA & NFA With - Transition
21 pages
Lecture 5-FSMs-NFA-2-DFA
No ratings yet
Lecture 5-FSMs-NFA-2-DFA
62 pages
Regular Expressions
No ratings yet
Regular Expressions
30 pages
Lecture 5 NFA Equivalence
No ratings yet
Lecture 5 NFA Equivalence
30 pages
Regular Expression
No ratings yet
Regular Expression
106 pages
Kleene
No ratings yet
Kleene
6 pages
CSC236
No ratings yet
CSC236
17 pages
DFA Construction Ideas
No ratings yet
DFA Construction Ideas
4 pages
Non-Deterministic Finite Automata
No ratings yet
Non-Deterministic Finite Automata
36 pages
Formal Languages & Finite Theory of Automata: BS Course
No ratings yet
Formal Languages & Finite Theory of Automata: BS Course
56 pages
CH 3 - Regular Languages Amd Regular Grammars
No ratings yet
CH 3 - Regular Languages Amd Regular Grammars
67 pages
Finite Automata: Anab Batool Kazmi
No ratings yet
Finite Automata: Anab Batool Kazmi
54 pages
Finite Automata Examples
No ratings yet
Finite Automata Examples
68 pages
Equivalence NFA To DFA
No ratings yet
Equivalence NFA To DFA
11 pages
Theory of Computer Science
No ratings yet
Theory of Computer Science
30 pages
Square Summable Power Series
From Everand
Square Summable Power Series
Louis de Branges
5/5 (1)
Laplace Transforms Essentials
From Everand
Laplace Transforms Essentials
Morteza Shafii-Mousavi
3.5/5 (3)
EPPLER 593 AIRFOIL (E593-Il)
No ratings yet
EPPLER 593 AIRFOIL (E593-Il)
1 page
Discuss The Importance of Information As A Resource
No ratings yet
Discuss The Importance of Information As A Resource
4 pages
Overview of Technology Skills
No ratings yet
Overview of Technology Skills
26 pages
Csat Planner Arun Sharma
No ratings yet
Csat Planner Arun Sharma
7 pages
Beyond Trust Capacity Planning Guide
100% (1)
Beyond Trust Capacity Planning Guide
56 pages
String Function
No ratings yet
String Function
3 pages
PID Controller by Matlab
No ratings yet
PID Controller by Matlab
36 pages
Babel
No ratings yet
Babel
217 pages
Cost Management Case Study
100% (2)
Cost Management Case Study
4 pages
Vlan Trunking
No ratings yet
Vlan Trunking
11 pages
LightWave Modeler
100% (2)
LightWave Modeler
372 pages
HHHH
No ratings yet
HHHH
6 pages
Amit - Kumar - Patel - Leave Management System - Project - Sem - V
No ratings yet
Amit - Kumar - Patel - Leave Management System - Project - Sem - V
62 pages
InTech-Real Time Robotic Hand Control Using Hand Gestures
No ratings yet
InTech-Real Time Robotic Hand Control Using Hand Gestures
16 pages
The Breadboard
No ratings yet
The Breadboard
18 pages
2G KPI Improvement
No ratings yet
2G KPI Improvement
4 pages
21CS743
100% (1)
21CS743
1 page
Npar Tests: Descriptive Statistics
No ratings yet
Npar Tests: Descriptive Statistics
58 pages
Internet Bill Format
100% (2)
Internet Bill Format
1 page
Multilevel Security System For Bank Locker
No ratings yet
Multilevel Security System For Bank Locker
6 pages
寶馬E SYS漢化
No ratings yet
寶馬E SYS漢化
23 pages
PC-G850VSEng V3 0
No ratings yet
PC-G850VSEng V3 0
321 pages
Tensor Decomp Presentation
No ratings yet
Tensor Decomp Presentation
9 pages
CRC Press - Computer-Aided Design Engineering and Manufacturing Vol-I, Systems Techniques and Com
No ratings yet
CRC Press - Computer-Aided Design Engineering and Manufacturing Vol-I, Systems Techniques and Com
342 pages
The Application of National Biometric Database System in Nigerian Electoral Process
No ratings yet
The Application of National Biometric Database System in Nigerian Electoral Process
15 pages
Ready Reckoner For Emails
No ratings yet
Ready Reckoner For Emails
6 pages
CABAL Online Starter Guide
100% (2)
CABAL Online Starter Guide
18 pages

Regex - Regular Expression

Uploaded by

Regex - Regular Expression

Uploaded by

CSC 236 H1F Lecture Summary for Week 11 Fall 2015

Regular Expressions (Continued)

• Concatenation has precedence over union. E.g., RS ∗ + T is an abbreviation of ((RS ∗ ) + T )

Theorem 1. The general regexps R, S and T , the following equivalences hold:

• Associativity of concatenation: (RS)T ≡ R(ST )

• Left distributivity: R(S + T ) ≡ RS + RT

• Identity for union: R + {} ≡ R

• Identity for concatenation: R ≡ R ≡ R

• Annihilator for concatenation: {}R ≡ {} ≡ R{}

Proof. Now, let’s prove both parts.

Remark. See textbook for other detailed examples.

Dept. of Computer Science, University of Toronto, St. George Campus Page 1 of 4

Nondeterministic Finite State Automata (NFA or NFSA)

L = {s ∈ {a, b}∗ : s ends with babb}

qy following any number of  transitions.

Dept. of Computer Science, University of Toronto, St. George Campus Page 2 of 4

Example 2. Consider the following NFA:

Figure 2: NFA corresponding to regexp a∗ b∗

The corresponding DFA using the subset construction is:

Figure 3: Correponding DFA of NFA in Figure 2

Let’s construct a FSA that accepts the language

Another way to express L is to say

L = {s : s contains aaa} ∩ {s : s contains even many b’s}

Each of these sub-languages correspond to a FSA as follows:

Figure 4: FSA for {s : s contains aaa} (FSA1)

Dept. of Computer Science, University of Toronto, St. George Campus Page 3 of 4

Figure 5: FSA for {s : s contains even many b’s} (FSA2)

q00 a q10 a q20 a q30

Figure 6: FSA for {s : s contains even many b’s} (FSA2)

1. L = L(A) for some NFA A

2. L = L(A0 ) for some DFA A0

3. L = L(R) for some regexp R

Dept. of Computer Science, University of Toronto, St. George Campus Page 4 of 4

You might also like

• Identity for concatenation: R ≡ R ≡ R

qy following any number of transitions.