Lecture Week 03

Uploaded by

malikmalikayaan99

Compiler Construction
CS 322
Mr. Atif Ali
Lecture 6
How to Describe Tokens?
▪ Regular languages are the most popular formalism for specifying tokens because
• they are based on a simple and useful theory;
• they are easy to understand;
• efficient implementations exist for generating lexical analyzers from them.

Languages
▪ Let Σ be a set of characters. Σ is called the alphabet.
▪ A language over Σ is a set of strings of characters drawn from Σ.
Example of Languages
Alphabet = English characters
Language = English sentences
Alphabet = ASCII
Language = C++, Java, or C# programs
Notation
▪ Languages are sets of strings (finite sequences of characters).
▪ We need some notation for specifying which sets we want.
▪ For lexical analysis we care about regular languages.
▪ Regular languages can be described using regular expressions.
Regular Languages
▪ Each regular expression is a notation for a regular language (a set of words).
▪ If A is a regular expression, we write L(A) to refer to the language denoted by A.
▪ A regular expression (RE) is defined inductively:
a = an ordinary character from Σ
ε = the empty string
R|S = either R or S
RS = R followed by S (concatenation)
R* = concatenation of R zero or more times
(R* = ε | R | RR | RRR ...)
RE Extensions
Regular expression extensions are used as convenient notation for complex REs:

R? = ε | R (zero or one R)
R+ = RR* (one or more R)
(R) = R (grouping)
[abc] = a|b|c (any of the listed characters)
[a-z] = a|b|...|z (range)
[^ab] = c|d|... (anything but ‘a’ or ‘b’)
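The extensions above add no expressive power; each one abbreviates a core expression. As a rough check, the sketch below uses Python's `re` module (an assumption of this example, not something the lecture prescribes) to compare each extended form with its core form on a handful of sample strings. The names `EQUIVALENT`, `SAMPLES`, `language_on` and `all_equivalent` are made up for illustration, and agreement on a few samples is evidence, not a proof.

```python
import re

# Each pair is (extended form, core form built only from |, concatenation and *).
# "(?:|a)" is how this sketch spells "ε | a" in Python: an empty alternative.
EQUIVALENT = [
    ("a?", "(?:|a)"),           # R?    = ε | R
    ("a+", "aa*"),              # R+    = RR*
    ("[abc]", "(?:a|b|c)"),     # [abc] = a|b|c
    ("[a-d]", "(?:a|b|c|d)"),   # [a-d] = a|b|c|d (range)
]

SAMPLES = ["", "a", "aa", "aaa", "b", "c", "d", "e", "ab"]


def language_on(pattern, samples):
    """Return the samples that belong to L(pattern), i.e. match as a whole."""
    return {s for s in samples if re.fullmatch(pattern, s)}


def all_equivalent():
    """True if every extended form agrees with its core form on SAMPLES."""
    return all(
        language_on(ext, SAMPLES) == language_on(core, SAMPLES)
        for ext, core in EQUIVALENT
    )


print(all_equivalent())
print(sorted(language_on("[^ab]", SAMPLES)))
```

`re.fullmatch` is used instead of `re.match` because membership in L(R) concerns the whole string, not a prefix of it.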
Regular Expression
RE        Strings in L(R)
a         “a”
ab        “ab”
a|b       “a” “b”
(ab)*     “” “ab” “abab” ...
(a|ε)b    “ab” “b”
Here are examples of common tokens found in programming languages.
▪ integer: a non-empty string of digits
▪ digit = ‘0’|‘1’|‘2’|‘3’|‘4’|‘5’|‘6’|‘7’|‘8’|‘9’
▪ integer = digit digit*
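One way to get a feel for L(R) is to enumerate it up to a length bound. The following sketch does this by brute force with Python's `re` module; the function name `strings_in_language` and the length bounds are choices made for this example only. The expression with an empty alternative is written `(a|)b` here, and `digit digit*` is written `[0-9][0-9]*`.

```python
import re
from itertools import product


def strings_in_language(pattern, alphabet, max_len):
    """List every string over `alphabet` of length <= max_len that is in L(pattern)."""
    found = []
    for n in range(max_len + 1):
        for chars in product(alphabet, repeat=n):
            s = "".join(chars)
            if re.fullmatch(pattern, s):
                found.append(s)
    return found


print(strings_in_language("(ab)*", "ab", 4))        # ['', 'ab', 'abab']
print(strings_in_language("(a|)b", "ab", 4))        # ['b', 'ab']
print(strings_in_language("[0-9][0-9]*", "01", 2))  # ['0', '1', '00', '01', '10', '11']
```

The enumeration is exponential in `max_len`, so it is only useful for small illustrations like these.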
Example: identifiers
▪ identifier: a string of letters or digits, starting with a letter
▪ C identifier: [a-zA-Z_][a-zA-Z0-9_]*

How to Use REs

▪ We need a mechanism to determine whether an input string w belongs to L(R), the language denoted by regular expression R.
Acceptor
▪ Such a mechanism is called an acceptor.
input string w → acceptor (for language L) → yes, if w ∈ L; no, if w ∉ L
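Viewed as code, an acceptor is simply a function from strings to yes/no. The sketch below builds such functions from the token definitions given earlier, borrowing Python's `re` engine as a stand-in for the acceptor. The helper `make_acceptor` and the names `is_integer` and `is_c_identifier` are hypothetical, introduced only for this illustration.

```python
import re
from typing import Callable

# An acceptor for a language L answers "is w in L?" for any input string w.
Acceptor = Callable[[str], bool]


def make_acceptor(regex: str) -> Acceptor:
    """Build an acceptor for L(regex); the whole string must match."""
    compiled = re.compile(regex)

    def accept(w: str) -> bool:
        return compiled.fullmatch(w) is not None

    return accept


# integer = digit digit*
is_integer = make_acceptor(r"[0-9][0-9]*")
# C identifier: [a-zA-Z_][a-zA-Z0-9_]*
is_c_identifier = make_acceptor(r"[a-zA-Z_][a-zA-Z0-9_]*")

print(is_integer("2024"), is_integer("12a"))
print(is_c_identifier("_tmp1"), is_c_identifier("1abc"))
```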
Finite Automata (FA)
▪ Specification: Regular Expressions
▪ Implementation: Finite Automata

A finite automaton consists of

▪ An input alphabet (Σ)
▪ A set of states
▪ A start (initial) state
▪ A set of transitions
▪ A set of accepting (final) states
Finite Automaton
State Graphs
[Figure: state-graph notation, showing the symbols for a state, the start state, an accepting state, and a transition labelled a]
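Finite Automata
▪ A finite automaton accepts a string if we can follow transitions labelled with the characters of the string from the start state to some accepting state.

FA Example
An FA that accepts only “1”
[Figure: state graph with a single transition labelled 1 from the start state to an accepting state]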
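FA Example
▪ An FA that accepts any number of 1’s followed by a single 0
[Figure: state graph; the start state loops on 1 and moves on 0 to an accepting state]

▪ An FA that accepts ab*a

▪ Alphabet: {a,b}
[Figure: state graph; the start state moves on a to a middle state, which loops on b and moves on a to an accepting state]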
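Table Encoding of FA
▪ Transition table for the FA that accepts ab*a (start state 0, accepting state 2):
0 --a--> 1 --a--> 2, with a loop on b at state 1

state   a     b
0       1     err
1       2     1
2       err   err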
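A table like this one translates almost directly into a table-driven recognizer. The sketch below is one possible encoding in Python: the dictionary layout and the names `TABLE`, `ERR`, `START`, `ACCEPTING` and `accepts` are assumptions of this example, while the entries themselves are copied from the transition table for ab*a.

```python
ERR = None  # stands for the "err" entries of the table

# TABLE[state][symbol] gives the next state; rows are states 0, 1, 2.
TABLE = {
    0: {"a": 1, "b": ERR},
    1: {"a": 2, "b": 1},
    2: {"a": ERR, "b": ERR},
}
START = 0
ACCEPTING = {2}


def accepts(w):
    """Run the DFA on w; accept only if it ends in an accepting state."""
    state = START
    for ch in w:
        # Characters outside the alphabet {a, b} are treated as errors too.
        state = TABLE[state].get(ch, ERR)
        if state is ERR:
            return False
    return state in ACCEPTING


for w in ["aa", "aba", "abbba", "a", "ab", "aab", ""]:
    print(repr(w), accepts(w))
```

The loop does one table lookup per input character, which is why DFAs are attractive as the implementation of a lexical analyzer.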
RE → Finite Automata
▪ Can we build a finite automaton for every regular expression?
▪ Yes: build the FA inductively, based on the definition of regular expressions.
NFA
Nondeterministic Finite Automaton (NFA)
▪ Can have multiple transitions for one input in a given state
▪ Can have ε-moves

Epsilon Moves
▪ ε-moves: the machine can move from state A to state B without consuming input
A --ε--> B
NFA
The operation of the automaton is not completely defined by the input.
[Figure: NFA with states A, B, C and transitions labelled 1, 0, 1]
On input “11”, the automaton could be in either state.
Execution of FA
An NFA can choose
▪ whether to make ε-moves;
▪ which of multiple transitions to take for a single input.
Acceptance of NFA
▪ An NFA can get into multiple states.
▪ Rule: an NFA accepts if it can get into a final state.
[Figure: NFA with states A, B, C and transitions labelled 1, 0, 1, 0]
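Because an NFA can be in several states at once, a simulator can track the whole set of states it might be in and apply the acceptance rule at the end. The sketch below does this for a small NFA for a|b with states s0 to s5, written out by hand for this example. The dictionary encoding, the use of the empty string as the ε label, and the names `eps_closure` and `nfa_accepts` are all assumptions of this illustration.

```python
EPS = ""  # label used for ε-moves in this sketch

# NFA for a|b: NFA[(state, label)] is the set of possible next states.
NFA = {
    ("s0", EPS): {"s1", "s3"},
    ("s1", "a"): {"s2"},
    ("s3", "b"): {"s4"},
    ("s2", EPS): {"s5"},
    ("s4", EPS): {"s5"},
}
START, FINAL = "s0", {"s5"}


def eps_closure(states):
    """All states reachable from `states` using ε-moves only."""
    stack, closure = list(states), set(states)
    while stack:
        s = stack.pop()
        for t in NFA.get((s, EPS), ()):
            if t not in closure:
                closure.add(t)
                stack.append(t)
    return closure


def nfa_accepts(w):
    """Accept if at least one path through the NFA ends in a final state."""
    current = eps_closure({START})
    for ch in w:
        moved = set()
        for s in current:
            moved |= NFA.get((s, ch), set())
        current = eps_closure(moved)
    return bool(current & FINAL)


print(sorted(eps_closure({"s0"})))
print(nfa_accepts("a"), nfa_accepts("b"), nfa_accepts("ab"))
```

Tracking the set of reachable states removes the need to guess which transition to take, at the cost of more work per input character than a DFA needs.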
DFA and NFA
Deterministic Finite Automata (DFA)
▪ One transition per input per state
▪ No ε-moves
Execution of FA
A DFA
▪ can take only one path through the state graph;
▪ is completely determined by its input.

NFA vs DFA
▪ NFAs and DFAs recognize the same set of languages (the regular languages).
▪ DFAs are easier to implement: they are table driven.
▪ For a given language, the NFA can be simpler than the DFA.
▪ A DFA can be exponentially larger than the equivalent NFA.
▪ NFAs are the key to automating the RE → DFA construction.
RE → NFA Construction
Thompson’s construction (CACM 1968)
▪ Build an NFA for each RE term.
▪ Combine the NFAs with ε-moves.
Subset construction (NFA → DFA)
▪ Build the simulation.
▪ Then minimize the number of states in the DFA (Hopcroft’s algorithm).
Key idea:
▪ NFA pattern for each symbol and each operator.
▪ Join them with ε-moves in precedence order.
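"Build the simulation" can be read as follows: each DFA state is the set of NFA states the NFA could be in after the input read so far. The sketch below applies that idea to an NFA for a(b|c)* with states s0 to s9, written out by hand here as an assumption of the example. The encoding and the names `eps_closure`, `subset_construction` and `dfa_accepts` are illustrative, and the minimization step (Hopcroft's algorithm) is not shown.

```python
from collections import deque

EPS = ""  # label used for ε-moves in this sketch
ALPHABET = "abc"

# Hand-written NFA for a(b|c)*: NFA[(state, label)] is a set of next states.
NFA = {
    ("s0", "a"): {"s1"},
    ("s1", EPS): {"s2"},
    ("s2", EPS): {"s3", "s9"},
    ("s3", EPS): {"s4", "s6"},
    ("s4", "b"): {"s5"},
    ("s6", "c"): {"s7"},
    ("s5", EPS): {"s8"},
    ("s7", EPS): {"s8"},
    ("s8", EPS): {"s3", "s9"},
}
NFA_START, NFA_FINAL = "s0", {"s9"}


def eps_closure(states):
    """Set of NFA states reachable from `states` by ε-moves alone."""
    stack, closure = list(states), set(states)
    while stack:
        s = stack.pop()
        for t in NFA.get((s, EPS), ()):
            if t not in closure:
                closure.add(t)
                stack.append(t)
    return frozenset(closure)


def subset_construction():
    """Return (start, transitions, states, accepting) of the equivalent DFA."""
    start = eps_closure({NFA_START})
    dfa, seen, work = {}, {start}, deque([start])
    while work:
        S = work.popleft()
        for ch in ALPHABET:
            moved = set()
            for s in S:
                moved |= NFA.get((s, ch), set())
            if not moved:
                continue  # no entry means the DFA rejects (error state)
            T = eps_closure(moved)
            dfa[(S, ch)] = T
            if T not in seen:
                seen.add(T)
                work.append(T)
    accepting = {S for S in seen if S & NFA_FINAL}
    return start, dfa, seen, accepting


DFA_START, DFA, DFA_STATES, DFA_ACCEPTING = subset_construction()


def dfa_accepts(w):
    """Run the constructed DFA on w."""
    state = DFA_START
    for ch in w:
        state = DFA.get((state, ch))
        if state is None:
            return False
    return state in DFA_ACCEPTING


print(len(DFA_STATES), len(DFA_ACCEPTING))
print(dfa_accepts("abcb"), dfa_accepts("ba"))
```

For this NFA the construction yields four DFA states; in the worst case the number of subsets, and so of DFA states, grows exponentially with the number of NFA states.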
RE → NFA Construction
NFA for a:   s0 --a--> s1
NFA for b:   s3 --b--> s4
NFA for ab:  s0 --a--> s1 --ε--> s3 --b--> s4
RE → NFA Construction
NFA for a | b:
s0 --ε--> s1 --a--> s2 --ε--> s5
s0 --ε--> s3 --b--> s4 --ε--> s5
RE → NFA Construction
NFA for a*:
s0 --ε--> s1 --a--> s2 --ε--> s4
s2 --ε--> s1   (repeat a)
s0 --ε--> s4   (skip a)
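Example RE → NFA
NFA for a ( b|c )*:
s0 --a--> s1 --ε--> s2 --ε--> s3
s3 --ε--> s4 --b--> s5 --ε--> s8
s3 --ε--> s6 --c--> s7 --ε--> s8
s8 --ε--> s9
s8 --ε--> s3   (repeat b|c)
s2 --ε--> s9   (skip b|c)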
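The per-operator patterns can be written as small functions that each return an NFA fragment with one start and one final state. The sketch below follows the patterns for a symbol, concatenation, alternation and star, then builds the NFA for a(b|c)*. The class `NFA`, the helper names, the integer state numbering and the edge-list encoding are assumptions of this example, so the states will not carry the s0 to s9 labels used on the slide.

```python
from itertools import count

EPS = ""          # label used for ε-moves in this sketch
_ids = count()    # fresh state numbers


class NFA:
    """An NFA fragment: one start state, one final state, a list of edges."""

    def __init__(self, start, final, edges):
        self.start, self.final, self.edges = start, final, edges


def symbol(ch):
    s, f = next(_ids), next(_ids)
    return NFA(s, f, [(s, ch, f)])


def concat(m, n):
    # m's final state is joined to n's start state by an ε-move.
    return NFA(m.start, n.final, m.edges + n.edges + [(m.final, EPS, n.start)])


def alt(m, n):
    s, f = next(_ids), next(_ids)
    joins = [(s, EPS, m.start), (s, EPS, n.start),
             (m.final, EPS, f), (n.final, EPS, f)]
    return NFA(s, f, m.edges + n.edges + joins)


def star(m):
    s, f = next(_ids), next(_ids)
    joins = [(s, EPS, m.start), (m.final, EPS, f),
             (m.final, EPS, m.start),   # repeat
             (s, EPS, f)]               # skip
    return NFA(s, f, m.edges + joins)


def closure(nfa, states):
    """States reachable from `states` by ε-moves alone."""
    stack, seen = list(states), set(states)
    while stack:
        s = stack.pop()
        for src, label, dst in nfa.edges:
            if src == s and label == EPS and dst not in seen:
                seen.add(dst)
                stack.append(dst)
    return seen


def accepts(nfa, w):
    cur = closure(nfa, {nfa.start})
    for ch in w:
        step = {dst for src, label, dst in nfa.edges if src in cur and label == ch}
        cur = closure(nfa, step)
    return nfa.final in cur


# a ( b | c )*
example = concat(symbol("a"), star(alt(symbol("b"), symbol("c"))))
example_states = {s for src, _, dst in example.edges for s in (src, dst)}

print(len(example_states), len(example.edges))
print(accepts(example, "abcb"), accepts(example, "b"))
```

The fragment built this way has ten states and twelve edges, nine of them ε-moves, which matches the shape of the slide's NFA for a(b|c)*.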
Thank You!
