03 RegularExpression

The document introduces regular expressions and their use in defining formal languages. Some key points: 1) Regular expressions use operations like concatenation, union, and Kleene star to concisely define languages over an alphabet. 2) Examples show how regular expressions can define languages of strings meeting certain criteria, like containing a specific number of a's and b's. 3) Equivalent regular expressions generate the same language, and examples demonstrate transforming expressions into equivalent ones.

Uploaded by

waqas khan

Theory of Computation

Defining Languages by Another New Method
Regular Expressions
• Defining Languages by Another New Method

• Formal Definition of Regular Expressions

• Languages Associated with Regular Expressions

• Finite Languages Are Regular

• How Hard It Is to Understand a Regular Expression

• Introducing EVEN-EVEN

Language-Defining Symbols
• We now introduce the use of the Kleene star, applied not
to a set, but directly to the letter x and written as a
superscript: x*.
• This simple expression indicates some sequence of x’s
(possibly none at all):
x* = Λ or x or x^2 or x^3 …
= x^n for some n = 0, 1, 2, 3, …
• The letter x is intentionally written in boldface type to
distinguish it from an alphabet character.
• We can think of the star as an unknown power. That is,
x* stands for a string of x’s, but we do not specify how
many, and it may be the null string Λ.

• The notation x* can be used to define languages
by writing, say, L4 = language(x*).
• Since x* is any string of x’s, L4 is then the
language of all possible strings of x’s of any
length (including Λ).
• We should not confuse x* (which is a language-defining
symbol) with L4 (which is the name we
have given to a certain language).

• Given the alphabet Σ = {a, b}, suppose we wish to define the
language L that contains all words of the form one a followed by
some number of b’s (maybe no b’s at all); that is,
L = {a, ab, abb, abbb, abbbb, …}
• Using the language-defining symbol, we may write
L = language(ab*)
• This equation means that L is the language in which the
words are the concatenation of an initial a with some or no b’s.
• From now on, for convenience, we will simply say some b’s to mean
some or no b’s. When we want to mean some positive number of
b’s, we will explicitly say so.

• We can apply the Kleene star to the whole string
ab if we want:
(ab)* = Λ or ab or abab or ababab …
• Observe that
(ab)* ≠ a*b*
because the language defined by the expression
on the left contains the word abab, whereas the
language defined by the expression on the right
does not.
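The difference between (ab)* and a*b* can be checked mechanically. The following is a minimal sketch, assuming Python's standard re module, whose star has the same meaning as the Kleene star used here. The helper name in_language is made up for this illustration.

```python
import re

# Python writes the Kleene star the same way as the slides: (ab)* and a*b*.
star_of_ab = re.compile(r"(ab)*")
a_then_b = re.compile(r"a*b*")

def in_language(pattern, word):
    """Return True when the whole word matches the pattern."""
    return pattern.fullmatch(word) is not None

# Print membership of a few sample words in each language.
for word in ["", "ab", "abab", "aabb"]:
    print(repr(word), in_language(star_of_ab, word), in_language(a_then_b, word))
```

The word abab is accepted by the first pattern and rejected by the second, which is exactly the witness used above.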

• If we want to define the language L1 = {x, xx, xxx, …}
using the language-defining symbol, we can write
L1 = language(xx*)
which means that each word of L1 must start with an x
followed by some (or no) x’s.
• Note that we can also define L1 using the notation + (as
an exponent) introduced in Chapter 2:
L1 = language(x^+)
which means that each word of L1 is a string of some
positive number of x’s.

Plus Sign
• Let us introduce another use of the plus sign. By
the expression
x + y
where x and y are strings of characters from an
alphabet, we mean either x or y.
• Care should be taken so as not to confuse this
notation with the notation + (as an exponent).

Example
• Consider the language T over the alphabet
Σ = {a, b, c}:
T = {a, c, ab, cb, abb, cbb, abbb, cbbb, abbbb,
cbbbb, …}
• In other words, all the words in T begin with
either an a or a c and then are followed by some
number of b’s.
• Using the above plus sign notation, we may
write this as
T = language((a + c)b*)

Example
• Consider a finite language L that contains all the
strings of a’s and b’s of length exactly three:
L = {aaa, aab, aba, abb, baa, bab, bba, bbb}
• Note that the first letter of each word in L is
either an a or a b; so are the second letter and
third letter of each word in L.
• Thus, we may write
L = language((a + b)(a + b)(a + b))
or, for short,
L = language((a + b)^3)
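As a small illustration, the eight words of this language can be generated by making one independent choice of letter per position. This is only a sketch using Python's itertools.product; the function name fixed_length_words is a hypothetical helper, not part of the lecture.

```python
from itertools import product

def fixed_length_words(alphabet, n):
    """All words of length exactly n over the alphabet, in dictionary order."""
    return ["".join(letters) for letters in product(sorted(alphabet), repeat=n)]

words = fixed_length_words({"a", "b"}, 3)
print(words)       # ['aaa', 'aab', 'aba', 'abb', 'baa', 'bab', 'bba', 'bbb']
print(len(words))  # 8, i.e. 2**3
```

Each of the three factors (a + b) corresponds to one position in the product, which is why the count is 2 × 2 × 2.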

Example
• In general, if we want to refer to the set of all possible
strings of a’s and b’s of any length whatsoever, we could
write
language((a + b)*)
• This is the set of all possible strings of letters from the
alphabet Σ = {a, b}, including the null string.
• This is powerful notation. For instance, we can describe
all the words that begin with an a, followed by
anything (i.e., as many choices as we want of either a or
b) as
a(a + b)*

Formal Definition of Regular Expressions
• The set of regular expressions is defined by the following rules:
• Rule 1: Every letter of the alphabet Σ can be made into a regular
expression by writing it in boldface; Λ itself is a regular expression.
• Rule 2: If r1 and r2 are regular expressions, then so are:
(i) (r1)
(ii) r1r2
(iii) r1 + r2
(iv) r1*
• Rule 3: Nothing else is a regular expression.
• Note: If r1 = aa + b, then when we write r1*, we really mean (r1)*,
that is, r1* = (r1)* = (aa + b)*

Difference
• It is important to be clear about the difference between
the following regular expressions:
• r1 = a* + b*
• r2 = (a + b)*
• Here r1 generates only strings made up entirely of a’s or
entirely of b’s; it does not generate any string that mixes
a’s and b’s (such as ab), while r2 generates such strings.
• The language generated by any regular
expression is called a regular language.

Equivalent Regular Expressions
• Two regular expressions are said to be
equivalent if they generate the same language.
• Example
• Consider the following regular expressions:
• r1 = (a + b)*(aa + bb)
• r2 = (a + b)*aa + (a + b)*bb
• Both regular expressions define the language of
strings ending in aa or bb.
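One way to gain confidence that two expressions are equivalent is to compare them on every word up to some length. The sketch below assumes Python's re module and translates the union sign + into Python's |; the names to_python, words_up_to and agree_up_to are hypothetical helpers. Agreement on short words is evidence, not a proof, and the translation does not handle Λ.

```python
import re
from itertools import product

def to_python(theory_regex):
    """Translate slide notation to Python: union '+' becomes '|', spaces are dropped."""
    return theory_regex.replace(" ", "").replace("+", "|")

def words_up_to(alphabet, max_len):
    """Yield every word over the alphabet of length 0..max_len."""
    for n in range(max_len + 1):
        for letters in product(alphabet, repeat=n):
            yield "".join(letters)

def agree_up_to(r1, r2, alphabet="ab", max_len=8):
    """True if r1 and r2 accept exactly the same words of length <= max_len."""
    p1, p2 = re.compile(to_python(r1)), re.compile(to_python(r2))
    return all(
        (p1.fullmatch(w) is None) == (p2.fullmatch(w) is None)
        for w in words_up_to(alphabet, max_len)
    )

print(agree_up_to("(a + b)*(aa + bb)", "(a + b)*aa + (a + b)*bb"))  # True
print(agree_up_to("(ab)*", "a*b*"))                                  # False
```

A result of False is conclusive, since it means some short word separates the two languages; a result of True only says that no such word exists up to max_len.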

Example
• Consider the language defined by the expression
(a + b)*a(a + b)*
• At the beginning of any word in this language we have
(a + b)*, which is any string of a’s and b’s, then comes an
a, then another arbitrary string.
• For example, the word abbaab can be considered to
come from this expression by 3 different choices:
(Λ)a(bbaab) or (abb)a(ab) or (abba)a(b)

Example contd.
• This language is the set of all words over the
alphabet Σ = {a, b} that have at least one a.
• The only words left out are those that have only
b’s and the word Λ.
These left-out words are exactly the language
defined by the expression b*.
• If we combine these two languages, we get
the language of all strings over the alphabet Σ =
{a, b}. That is,
(a + b)* = (a + b)*a(a + b)* + b*

Example
• The language of all words that have at least two a’s can
be defined by the expression:
(a + b)*a(a + b)*a(a + b)*
• Another expression that defines all the words with at
least two a’s is
b*ab*a(a + b)*
• Hence, we can write
(a + b)*a(a + b)*a(a + b)* = b*ab*a(a + b)*
where by the equal sign we mean that these two
expressions are equivalent in the sense that they
describe the same language.

Example
• The language of all words that have at least one a and at least one b
is somewhat trickier. If we write
(a + b)*a(a + b)*b(a + b)*
then we are requiring that an a must precede a b in the word. Such
words as ba and bbaaaa are not included in this language.
• Since we know that either the a comes before the b or the b comes
before the a, we can define the language by the expression
(a + b)*a(a + b)*b(a + b)* + (a + b)*b(a + b)*a(a + b)*
• Note that the only words that are omitted by the first term
(a + b)*a(a + b)*b(a + b)* are the words of the form some positive
number of b’s followed by some positive number of a’s. They are
defined by the expression bb*aa*

Example
• We can add these specific exceptions. So, the
language of all words over the alphabet Σ = {a,
b} that contain at least one a and at least one b
is defined by the expression:
(a + b)*a(a + b)*b(a + b)* + bb*aa*
• Thus, we have proved that
(a + b)*a(a + b)*b(a + b)* + (a + b)*b(a + b)*a(a + b)*
= (a + b)*a(a + b)*b(a + b)* + bb*aa*
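The expression (a + b)*a(a + b)*b(a + b)* + bb*aa* can also be compared directly against the plain-English property "at least one a and at least one b". The following is a hedged sketch in Python (union written as |), with the hypothetical helper names has_both_letters and mismatches. It tests only words up to a chosen length, so it supports the argument rather than replacing it.

```python
import re
from itertools import product

# (a + b)*a(a + b)*b(a + b)* + bb*aa*, with union written as '|'.
BOTH_LETTERS = re.compile(r"(a|b)*a(a|b)*b(a|b)*|bb*aa*")

def has_both_letters(word):
    """The property in words: at least one a and at least one b."""
    return "a" in word and "b" in word

def mismatches(max_len=10):
    """Words of length <= max_len where the expression and the property disagree."""
    bad = []
    for n in range(max_len + 1):
        for letters in product("ab", repeat=n):
            word = "".join(letters)
            if (BOTH_LETTERS.fullmatch(word) is not None) != has_both_letters(word):
                bad.append(word)
    return bad

print(mismatches())  # [] : no disagreement among words of length up to 10
```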

Example
• In the above example, the language of all words that
contain both an a and a b is defined by the expression
(a + b)*a(a + b)*b(a + b)* + bb*aa*
• The only words that do not contain both an a and a b are
the words of all a’s, all b’s, or Λ.
• When these are included, we get everything. Hence, the
expression
(a + b)*a(a + b)*b(a + b)* + bb*aa* + a* + b*
defines all possible strings of a’s and b’s, including Λ
(accounted for in both a* and b*).

• Thus
(a + b)* = (a + b)*a(a + b)*b(a + b)* + bb*aa* + a* + b*

Example
• The following equivalences show that we should not treat
expressions as algebraic polynomials:
(a + b)* = (a + b)* + (a + b)*
(a + b)* = (a + b)* + a*
(a + b)* = (a + b)*(a + b)*
(a + b)* = a(a + b)* + b(a + b)* + Λ
(a + b)* = (a + b)*ab(a + b)* + b*a*
• The last equivalence may need some explanation:
– The first term on the right-hand side, (a + b)*ab(a + b)*, describes all the
words that contain the substring ab.
– The second term, b*a*, describes all the words that do not contain the
substring ab (i.e., all a’s, all b’s, Λ, or some b’s followed by some a’s).

Example
• Let V be the language of all strings of a’s and b’s in
which the strings are either all b’s or else an a followed
by some b’s. Let V also contain the word Λ. Hence,
V = {Λ, a, b, ab, bb, abb, bbb, abbb, bbbb, …}
• We can define V by the expression
b* + ab*
where Λ is included in b*.
• Alternatively, we could define V by
(Λ + a)b*
which means that in front of the string of some b’s, we have
either an a or nothing.

Example contd.
• Hence,
(Λ + a)b* = b* + ab*
• Since b* = Λb*, we have
(Λ + a)b* = Λb* + ab*
which appears to be the distributive law at work.
• However, we must be extremely careful in
applying the distributive law. Sometimes, it is difficult
to determine whether the law is applicable.

Product Set
• If S and T are sets of strings of letters (whether
they are finite or infinite sets), we define the
product set of strings of letters to be
ST = {all combinations of a string from S
concatenated with a string from T in that order}

Example
• If S = {a, aa, aaa} and T = {bb, bbb} then
ST = {abb, abbb, aabb, aabbb, aaabb, aaabbb}
• Note that the words are not listed in lexicographic order.
• Using regular expressions, we can write this example as
(a + aa + aaa)(bb + bbb)
= abb + abbb + aabb + aabbb + aaabb + aaabbb
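For finite sets, the product set can be computed directly from its definition. This is a minimal sketch in Python; product_set is a hypothetical helper name, and the sets S and T are the ones from the example above.

```python
def product_set(S, T):
    """All concatenations st with s taken from S and t taken from T, in that order."""
    return {s + t for s in S for t in T}

S = {"a", "aa", "aaa"}
T = {"bb", "bbb"}
ST = product_set(S, T)

print(sorted(ST))  # ['aaabb', 'aaabbb', 'aabb', 'aabbb', 'abb', 'abbb']
print(len(ST))     # 6: here all 3 * 2 concatenations are distinct
```

Because the result is a set, two different pairs that happen to produce the same word are counted once, so in general ST may have fewer than |S| × |T| words.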

Example
• If M = {Λ, x, xx} and N = {Λ, y, yy, yyy, yyyy, …}
then
MN = {Λ, y, yy, yyy, yyyy, …, x, xy, xyy, xyyy,
xyyyy, …, xx, xxy, xxyy, xxyyy, xxyyyy, …}
• Using regular expressions,
(Λ + x + xx)(y*) = y* + xy* + xxy*

Languages Associated with Regular Expressions
Definition
• The following rules define the language associated with
any regular expression:
• Rule 1: The language associated with the regular
expression that is just a single letter is that one-letter
word alone, and the language associated with Λ is just
{Λ}, a one-word language.
• Rule 2: If r1 is a regular expression associated with the
language L1 and r2 is a regular expression associated
with the language L2, then:
(i) The regular expression (r1)(r2) is associated with the product
L1L2, that is, the language L1 times the language L2:
language(r1r2) = L1L2

Definition contd.
• Rule 2 (cont.):
(ii) The regular expression r1 + r2 is associated with the
language formed by the union of L1 and L2:
language(r1 + r2) = L1 + L2
(iii) The language associated with the regular
expression (r1)* is L1*, the Kleene closure of the set L1
as a set of words:
language(r1*) = L1*
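Rules 1 and 2 read like a recursive procedure, and they can be sketched as one. The code below is an illustration only: the nested-tuple encoding of expressions and the function name language are assumptions made for this example, and since L1* is usually infinite, the function returns only the words of length at most max_len.

```python
# Hypothetical encoding of a regular expression as nested tuples:
#   ("letter", "a"), ("null",), ("cat", r1, r2), ("union", r1, r2), ("star", r1)

def language(r, max_len):
    """Words of length <= max_len in the language associated with r."""
    kind = r[0]
    if kind == "letter":                       # Rule 1: a one-letter word
        return {r[1]} if max_len >= 1 else set()
    if kind == "null":                         # Rule 1: the language {Λ}
        return {""}
    if kind == "cat":                          # Rule 2(i): product L1L2
        L1, L2 = language(r[1], max_len), language(r[2], max_len)
        return {u + v for u in L1 for v in L2 if len(u + v) <= max_len}
    if kind == "union":                        # Rule 2(ii): union L1 + L2
        return language(r[1], max_len) | language(r[2], max_len)
    if kind == "star":                         # Rule 2(iii): Kleene closure L1*
        L1 = language(r[1], max_len)
        result, frontier = {""}, {""}
        while frontier:
            # Extend only the words found in the previous round; keep new ones.
            frontier = {u + v for u in frontier for v in L1
                        if len(u + v) <= max_len} - result
            result |= frontier
        return result
    raise ValueError(f"unknown node: {kind!r}")

a, b = ("letter", "a"), ("letter", "b")
ab_star = ("cat", a, ("star", b))              # the expression ab*
print(sorted(language(ab_star, 4)))            # ['a', 'ab', 'abb', 'abbb']
```

Truncating at max_len is safe for the product rule because any word of length at most max_len in L1L2 splits into two pieces that are each no longer than max_len.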

Finite Languages Are Regular
Theorem 5
• If L is a finite language (a language with only finitely many
words), then L can be defined by a regular expression. In other
words, all finite languages are regular.
• Proof
• Let L be a finite language. To make one regular expression that
defines L, we turn all the words in L into boldface type and insert
plus signs between them.
• For example, the regular expression that defines the language
L = {baa, abbba, bababa} is baa + abbba + bababa
• This algorithm only works for finite languages because an infinite
language would become a regular expression that is infinitely long,
which is forbidden.
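The construction in the proof is short enough to write out. The following sketch uses Python's re syntax, where the union sign is written | instead of +; the function name finite_language_to_regex is hypothetical, and the sketch assumes the language has at least one word.

```python
import re

def finite_language_to_regex(words):
    """Join the words with the union operator, as in the proof of Theorem 5."""
    # re.escape guards against characters that are special in Python patterns.
    return "|".join(re.escape(w) for w in words)

L = ["baa", "abbba", "bababa"]
pattern = finite_language_to_regex(L)
print(pattern)  # baa|abbba|bababa

# Every word of L matches, and a word outside L does not.
print(all(re.fullmatch(pattern, w) for w in L))  # True
print(re.fullmatch(pattern, "ba") is None)       # True
```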

How Hard It Is to Understand a Regular Expression
Let us examine some regular expressions and
see if we can understand something about the
languages they represent.
Example
• Consider the expression
(a + b)*(aa + bb)(a + b)* = (arbitrary)(double letter)(arbitrary)
• This is the set of strings of a’s and b’s that at
some point contain a double letter.
Let us ask, “What strings do not contain a
double letter?” Some examples are
Λ, a, b, ab, ba, aba, bab, abab, baba, …

Example contd.
• The expression (ab)* covers all of these except
those that begin with b or end with a. Adding
these choices gives us the expression:
(Λ + b)(ab)*(Λ + a)
• Combining the two expressions gives us one
that defines the set of all strings:
(a + b)*(aa + bb)(a + b)* + (Λ + b)(ab)*(Λ + a)

Examples
• Note that
(a + b*)* = (a + b)*
since the internal * adds nothing to the language.
However,
(aa + ab*)* ≠ (aa + ab)*
since the language on the left includes the word
abbabb, whereas the language on the right does
not. (The language on the right cannot contain
any word with a double b.)

Example
• Consider the regular expression (a*b*)*.
• The language defined by this expression is all strings
that can be made up of factors of the form a*b*.
• Since both the single letter a and the single letter b are
words of the form a*b*, this language contains all strings
of a’s and b’s. That is,
(a*b*)* = (a + b)*
• This equation casts serious doubt on the possibility of
finding a set of algebraic rules to reduce one regular
expression to another equivalent one.

Introducing EVEN-EVEN
• Consider the regular expression
E = [aa + bb + (ab + ba)(aa + bb)*(ab + ba)]*
• This expression represents all the words that are made up of
syllables of three types:
type1 = aa
type2 = bb
type3 = (ab + ba)(aa + bb)*(ab + ba)
• Every word of the language defined by E contains an even number
of a’s and an even number of b’s.
• Conversely, all strings with an even number of a’s and an even number of b’s
belong to the language defined by E.
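Both claims about E can be spot-checked by comparing the expression with a direct count of letters. This is a sketch under stated assumptions: E is transcribed into Python's re syntax with | for union, the helper names is_even_even and disagreements are made up for this illustration, and only words up to a chosen length are examined.

```python
import re
from itertools import product

# E = [aa + bb + (ab + ba)(aa + bb)*(ab + ba)]* in Python syntax.
E = re.compile(r"(aa|bb|(ab|ba)(aa|bb)*(ab|ba))*")

def is_even_even(word):
    """Direct definition: an even number of a's and an even number of b's."""
    return word.count("a") % 2 == 0 and word.count("b") % 2 == 0

def disagreements(max_len=10):
    """Words of length <= max_len on which E and the direct definition differ."""
    return [
        "".join(w)
        for n in range(max_len + 1)
        for w in product("ab", repeat=n)
        if (E.fullmatch("".join(w)) is not None) != is_even_even("".join(w))
    ]

print(disagreements())             # []
print(bool(E.fullmatch("abba")))   # True: a single type3 syllable, (ab)(ba)
```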

Algorithms for EVEN-EVEN
• We want to determine whether a long string of a’s and b’s has the
property that the number of a’s is even and the number of b’s is
even.
• Algorithm 1: Keep two binary flags, the a-flag and the b-flag.
Every time an a is read, the a-flag is reversed (0 to 1, or 1 to 0); and
every time a b is read, the b-flag is reversed. We start both flags at 0
and check to be sure they are both 0 at the end.
• Algorithm 2: Keep only one binary flag, called the type3-flag. We
read the letters two at a time. If they are the same, then we do not
touch the type3-flag, since we have a factor of type1 or type2. If,
however, the two letters do not match, we reverse the type3-flag. If
the flag starts at 0 and is also 0 at the end, then the input string
contains an even number of a’s and an even number of b’s.
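The two algorithms can be sketched in a few lines of Python. The function names are hypothetical. The slides do not say what Algorithm 2 does with a word of odd length, so rejecting it outright is an assumption made here (such a word cannot have two even counts). The sample input is the string of thirteen pairs used in the next example.

```python
def even_even_two_flags(word):
    """Algorithm 1: flip the a-flag on each a and the b-flag on each b."""
    a_flag = b_flag = 0
    for letter in word:
        if letter == "a":
            a_flag ^= 1
        elif letter == "b":
            b_flag ^= 1
        else:
            raise ValueError(f"not a letter of the alphabet: {letter!r}")
    return a_flag == 0 and b_flag == 0

def even_even_type3_flag(word):
    """Algorithm 2: read two letters at a time, flip the flag on a mismatched pair."""
    if len(word) % 2 == 1:
        # Assumption: an odd-length word cannot have two even counts, so reject it.
        return False
    type3_flag = 0
    for i in range(0, len(word), 2):
        if word[i] != word[i + 1]:
            type3_flag ^= 1
    return type3_flag == 0

pairs = ["aa", "ab", "bb", "ba", "ab", "bb", "bb",
         "bb", "ab", "ab", "bb", "ba", "aa"]
sample = "".join(pairs)
print(even_even_two_flags(sample), even_even_type3_flag(sample))  # True True
```

Algorithm 2 needs only one bit of memory instead of two, at the cost of reading the input in pairs.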

• If the input string is
(aa)(ab)(bb)(ba)(ab)(bb)(bb)(bb)(ab)(ab)(bb)(ba)(aa)
then, by Algorithm 2, the type3-flag is
reversed 6 times and ends at 0.
• We give this language the name EVEN-EVEN.
So, EVEN-EVEN = {Λ, aa, bb, aaaa, aabb, abab,
abba, baab, baba, bbaa, bbbb, aaaaaa, aaaabb,
aaabab, …}

• Useful Reading
Chapter 4 of Daniel I. A. Cohen’s book, Introduction to Computer Theory.
