Kleene's Theorem
Rance Cleaveland
Spring 2000
1. Introduction
So far in class, we have concentrated on two classes of languages: the regular languages, which are
those that may be defined using regular expressions, and the FA languages, which are those that
are accepted by finite automata. Kleene’s Theorem states that, in fact, these classes are the same:
every regular language may be recognized by some FA, and every FA language may be represented
using a regular expression. The book presents one proof of these statements; this handout offers
an alternative, and I hope simpler, argument for the former.
On the basis of what we have seen in class, to establish that every regular language can be
recognized by an FA it suffices to show how, given a regular expression r, we can build a NFA M
such that L (r) = L (M). We give such a construction in a couple of steps.
1. We first define a predicate √ on regular expressions; intuitively, r√ is intended to hold if ε ∈ L (r).

2. We then define a ternary relation −→ ⊆ R (Σ) × Σ × R (Σ). Intuitively, r −a→ r′ is true if the start state for a NFA for r can have an a-transition to the start state for a NFA for r′. Put differently, if r −a→ r′ then a NFA for r should be able to “process” symbol a and then accept all the strings in L (r′).

3. Using these relations, we then show how to build a NFA from r whose states are regular expressions, whose transitions are given by −→, and whose final states are defined using √.
2. Formal Definitions
We define √ recursively using the following rules.

Definition 2.1. Let Σ be an alphabet. Then r√, where r ∈ R (Σ), is defined as follows.

• ε√.

• (r∗)√.

• If r√ then (r + s)√ and (s + r)√.

• If r√ and s√ then (rs)√.

Intuitively, r√ holds if r is capable of “generating” the empty word, i.e. if ε ∈ L (r). Certainly ε ∈ L (ε), and ε ∈ L (r∗) regardless of what r is. The definition of L (r + s) ensures that ε is in the language of r + s if and only if it is in the language of r or of s, while in the case of rs, ε must be in the language of both. As examples, we have the following.
(εa∗)√, since ε√ and (a∗)√.

¬((a + b)√), since neither a√ nor b√.

(01 + (1 + 01)∗)√, since ((1 + 01)∗)√.
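To make the recursion concrete, √ can be transcribed directly into a small program. The following Python sketch is our own illustration, not part of the formal development: the class and function names are ours, regular expressions are represented as a small datatype, and nullable computes √ by structural recursion, following Definition 2.1.

    from dataclasses import dataclass

    # Regular expressions over single-character symbols.
    @dataclass(frozen=True)
    class Empty:              # the regular expression ∅ (empty language)
        pass

    @dataclass(frozen=True)
    class Eps:                # the regular expression ε
        pass

    @dataclass(frozen=True)
    class Sym:                # a single symbol a ∈ Σ
        a: str

    @dataclass(frozen=True)
    class Alt:                # r1 + r2
        r1: object
        r2: object

    @dataclass(frozen=True)
    class Cat:                # r1 r2
        r1: object
        r2: object

    @dataclass(frozen=True)
    class Star:               # r1*
        r1: object

    def nullable(r):
        """The √ predicate of Definition 2.1: nullable(r) iff ε ∈ L(r)."""
        if isinstance(r, (Eps, Star)):
            return True
        if isinstance(r, Alt):
            return nullable(r.r1) or nullable(r.r2)
        if isinstance(r, Cat):
            return nullable(r.r1) and nullable(r.r2)
        return False          # Empty and Sym cannot generate ε

For instance, nullable(Cat(Eps(), Star(Sym('a')))) returns True and nullable(Alt(Sym('a'), Sym('b'))) returns False, matching the first two examples above.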
We next define −→ recursively using the following rules; we write r −a→ r′ when ⟨r, a, r′⟩ is in the relation.

Definition 2.2. Let Σ be an alphabet. Then −→ ⊆ R (Σ) × Σ × R (Σ) is defined as follows.

• a −a→ ε for every a ∈ Σ.

• If r −a→ r′ then (r + s) −a→ r′ and (s + r) −a→ r′.

• If r −a→ r′ then (rs) −a→ r′s. If r√ and s −a→ s′ then (rs) −a→ s′.

• If r −a→ r′ then r∗ −a→ r′r∗.

As examples, we have a −a→ ε, abb −a→ εbb, and (abb + a)∗ −a→ εbb(abb + a)∗. In these latter examples, note that applying the rules literally requires that we include the ε in εbb and εbb(abb + a)∗. This is because the rule for a says that a −a→ ε, meaning that abb −a→ εbb, etc. However, when we have εs like this, we will often leave them out; thus we will write abb −a→ bb rather than abb −a→ εbb.
The following lemmas about √ and −→ formally establish the intuitive properties that we wish them to have.
Lemma 2.3. Let r be a regular expression. Then r√ if and only if ε ∈ L (r).
Lemma 2.4. Let r ∈ R (Σ) be a regular expression over Σ, a ∈ Σ, and x ∈ Σ∗. Then ax ∈ L (r) if and only if there is an r′ ∈ R (Σ) such that r −a→ r′ and x ∈ L (r′).
Both lemmas may be proved using strong induction on the size of regular expression r.
3. Building Automata using √ and −→
To see how √ and −→ may be used to build NFAs, first note how we can use them to determine whether a string is in the language of a regular expression. Consider the following sequence of “transitions” starting from the regular expression (abb + a)∗.
(abb + a)∗ −a→ bb(abb + a)∗ −b→ b(abb + a)∗ −b→ (abb + a)∗ −a→ (abb + a)∗
Using Lemma 2.4 (four times!), we can conclude that if x ∈ L ((abb + a)∗), then abbax ∈ L ((abb + a)∗) also. In addition, since ((abb + a)∗)√, we know from Lemma 2.3 that ε ∈ L ((abb + a)∗). Since abbaε = abba, it then follows that abba ∈ L ((abb + a)∗).
More generally, if there is a sequence of transitions r0 −a1→ r1 ··· −an→ rn and rn√, then we can assert that a1 ··· an ∈ L (r0), and vice versa. This observation suggests the following possible strategy for building a NFA from a regular expression r.
1. Let the states be all possible regular expressions that can be reached by some sequence of
−→-transitions from r.
2. Take r to be the start state.
3. Let the transitions be given by −→.
4. Let the accepting states be those regular expressions r′ for which r′√ holds.
Of course, this would only work if the set of “all possible regular expressions” mentioned in part 1
is finite, since a NFA is required to have a finite number of states.
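As an aside, this characterization already yields a direct membership test that does not construct the automaton first. The sketch below is our own illustration; it reuses nullable and the Regex classes from the earlier sketch, together with an out function returning the pairs ⟨a, r′⟩ with r −a→ r′ as specified in Figure 1 below.

    def accepts(r, w):
        """Test whether w ∈ L(r) by following −→ and checking √ at the end."""
        reachable = {r}                  # expressions reachable after the prefix read so far
        for a in w:
            reachable = {r2 for r1 in reachable
                            for (b, r2) in out(r1) if b == a}
        return any(nullable(r1) for r1 in reachable)

With r the expression (abb + a)∗ built from those classes, accepts(r, 'abba') returns True, mirroring the transition sequence displayed above.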
To make this construction precise, and to examine the issue of “finiteness” of state sets, we
need to define mathematically the set of “all possible regular expressions that can be reached by
some sequence”. We can do this as follows.
Definition 3.5. Let r ∈ R (Σ) be a regular expression. Then the set RS(r) ⊆ R (Σ) is defined
recursively as follows.
• r ∈ RS(r).
• If r1 ∈ RS(r) and r1 −a→ r2 for some a ∈ Σ, then r2 ∈ RS(r).
The RS stands for “reachability set.” As an example, note that
RS((abb + a)∗) = {(abb + a)∗, bb(abb + a)∗, b(abb + a)∗}.
The following result indicates that the number of reachable regular expressions is always finite.
Lemma 3.6. Let r be a regular expression. Then RS(r) is finite.
Proof. Follows from observations such as
• RS(r∗ ) = { r′ r∗ | r′ ∈ RS(r) }.
✷
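Computationally, RS(r) can be obtained by a straightforward closure computation. The sketch below is ours; it reuses the out function specified in Figure 1 (a Python version is sketched after Lemma 4.10) and keeps adding −→-successors until nothing new turns up. Lemma 3.6 is precisely what guarantees that the loop terminates.

    def rs(r):
        """Compute RS(r) of Definition 3.5 as a least fixed point."""
        result = {r}
        frontier = [r]
        while frontier:
            r1 = frontier.pop()
            for (_a, r2) in out(r1):     # every −→-successor of a reachable expression
                if r2 not in result:     # ... that has not been seen yet
                    result.add(r2)
                    frontier.append(r2)
        return result

With the out sketch given later (which drops leading ε factors, following the convention of Section 2), rs applied to (abb + a)∗ yields exactly the three expressions listed above.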
We can now define our NFA construction as follows.
Definition 3.7. Let r ∈ R (Σ) be a regular expression. Then NFA(r) = ⟨Q, Σ, q0, δ, A⟩ is the NFA defined as follows.
• Q = RS(r).
• q0 = r.
• δ(r1, a) = { r2 ∈ Q | r1 −a→ r2 }.
• A = { r′ ∈ Q | r′√ }.
The next theorem establishes that r and NFA(r) always have the same language.

Theorem 3.8. Let r ∈ R (Σ) be a regular expression. Then L (NFA(r)) = L (r).

4. Computing NFA(r)
It may not be apparent from the discussion up to now, but in fact the construction for NFA(r) given
above can be automated; that is, one can come up with a routine for building NFA(r), given r.
Before describing how this may be done, we first make precise the notion of “outgoing transitions”
from a regular expression and explain how they may be calculated.
Definition 4.9. Let r ∈ R (Σ) be a regular expression. Then the set of outgoing transitions from r is defined as the set { ⟨a, r′⟩ | r −a→ r′ }.
Intuitively, the outgoing transitions from r consist of the pairs ⟨a, r′⟩ that, when combined with r, constitute a valid “transition” r −a→ r′. Figure 1 contains a recursive procedure for computing outgoing transitions. The routine, out, uses the structure of r and the rules that define −→ to guide its computation. For regular expressions of the form ∅, ε and a ∈ Σ, the definition of −→ immediately gives all the transitions. For regular expressions built using +, · and ∗, one must first recursively compute the outgoing transitions of the subexpressions of r and then combine the results appropriately, based on the rules given in the definition of −→.
The next lemma states that out correctly computes the set of outgoing transitions.
out(r) =

    {}                                                  if r = ∅ or r = ε
    { ⟨a, ε⟩ }                                          if r = a ∈ Σ
    out(r1) ∪ out(r2)                                   if r = r1 + r2
    { ⟨a, r1′r2⟩ | ⟨a, r1′⟩ ∈ out(r1) }
        ∪ { ⟨a, r2′⟩ | ⟨a, r2′⟩ ∈ out(r2) ∧ r1√ }       if r = r1 r2
    { ⟨a, r1′r1∗⟩ | ⟨a, r1′⟩ ∈ out(r1) }                if r = r1∗

Figure 1: The routine out for computing outgoing transitions.
Lemma 4.10. Let r ∈ R (Σ) be a regular expression, and let out be as defined in Figure 1. Then out(r) = { ⟨a, r′⟩ | r −a→ r′ }.
Proof. The proof breaks into two pieces. The first requires us to show that every ⟨a, r′⟩ ∈ out(r) satisfies r −a→ r′. In the second, we establish that whenever r −a→ r′, then ⟨a, r′⟩ ∈ out(r). Both arguments can be carried out using induction, with the first being done on the structure of the definition of out and the second using the definition of −→.
✷
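As with √, the definition of out transcribes directly into code. The following sketch is ours, reusing the Regex classes and nullable from Section 2; it mirrors Figure 1 case by case, except that the helper cat drops leading ε factors, implementing the convention adopted after Definition 2.2 of writing bb rather than εbb.

    def cat(r1, r2):
        """Concatenate r1 and r2, dropping a leading ε (the convention of Section 2)."""
        return r2 if isinstance(r1, Eps) else Cat(r1, r2)

    def out(r):
        """The outgoing transitions of r: all pairs (a, r') with r −a→ r'."""
        if isinstance(r, (Empty, Eps)):
            return set()
        if isinstance(r, Sym):
            return {(r.a, Eps())}
        if isinstance(r, Alt):
            return out(r.r1) | out(r.r2)
        if isinstance(r, Cat):
            trans = {(a, cat(r1p, r.r2)) for (a, r1p) in out(r.r1)}
            if nullable(r.r1):           # r1√: the transitions of r2 are also available
                trans |= out(r.r2)
            return trans
        if isinstance(r, Star):
            return {(a, cat(r1p, r)) for (a, r1p) in out(r.r1)}
        raise TypeError("unknown regular expression form")

For example, out applied to the expression abb, built as Cat(Sym('a'), Cat(Sym('b'), Sym('b'))), returns the single pair ('a', bb), i.e. the transition abb −a→ bb.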
We now sketch a routine for computing NFA(r); it relies on maintaining three sets of regular expressions.

• Q, the set of states encountered so far.

• A, the set of accepting states found so far.

• toProc, a subset of Q containing states that have not yet had their transitions computed and thus require some “processing”.
The algorithm works as follows. Initially, Q and toProc contain only r. While there remains at least one regular expression to process, we remove one such expression from toProc and perform the following. First, we check to see if √ holds for the expression; if so, then we add the expression to the set of accepting states. Then we compute all the “outgoing transitions” from the given expression; the target expressions of these transitions that are not already in Q are added both to Q and to toProc, as they have not yet been encountered and thus need their transitions computed. The algorithm terminates when toProc is empty. Pseudocode for this procedure may be found in Figure 2, while Figure 3 gives the NFA resulting from applying the procedure to (abb + a)∗.
procedure NFA(r) =
begin
    Q := {r};
    A := ∅;
    set δ(r1, a) := ∅ for each r1 added to Q and each a ∈ Σ;
    toProc := {r};
    while toProc ≠ ∅ do
    begin
        choose r1 ∈ toProc;
        delete r1 from toProc;
        if r1√ then add r1 to A;
        compute T = out(r1);
        for each ⟨a, r1′⟩ ∈ T do
        begin
            add r1′ to δ(r1, a);
            if r1′ ∉ Q then add r1′ to Q and toProc;
        end
    end;
    return the NFA ⟨Q, Σ, r, δ, A⟩;
end

Figure 2: Pseudocode for computing NFA(r).
Figure 3: The NFA produced for (abb + a)∗. Its states are (abb + a)∗ (the start state and the only accepting state), bb(abb + a)∗, and b(abb + a)∗; its transitions are (abb + a)∗ −a→ (abb + a)∗, (abb + a)∗ −a→ bb(abb + a)∗, bb(abb + a)∗ −b→ b(abb + a)∗, and b(abb + a)∗ −b→ (abb + a)∗.
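For completeness, here is a Python rendering of the procedure in Figure 2. It is again our own sketch, reusing nullable and out from the earlier sketches, and it returns the components of NFA(r) as plain sets and dictionaries.

    from collections import defaultdict

    def build_nfa(r):
        """Worklist construction of NFA(r), following Figure 2."""
        Q = {r}                          # states constructed so far
        A = set()                        # accepting states found so far
        delta = defaultdict(set)         # delta[(state, symbol)] = set of successor states
        to_proc = [r]                    # states whose transitions are still to be computed
        while to_proc:
            r1 = to_proc.pop()
            if nullable(r1):             # the √ test
                A.add(r1)
            for (a, r2) in out(r1):      # the outgoing transitions of r1
                delta[(r1, a)].add(r2)
                if r2 not in Q:
                    Q.add(r2)
                    to_proc.append(r2)
        return Q, dict(delta), r, A

    # The running example (abb + a)*:
    r = Star(Alt(Cat(Sym('a'), Cat(Sym('b'), Sym('b'))), Sym('a')))
    Q, delta, q0, A = build_nfa(r)
    # Q consists of the three states of Figure 3; r itself is the start state
    # and the only accepting state.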