0% found this document useful (0 votes)

56 views

Compiler Design Assignment

The document discusses specifications of tokens including strings, languages, and regular expressions. It also covers recognition of tokens using finite automata. Strings are finite sequences of symbols from a fixed alphabet. A regular expression denotes a regular language that can be defined by the regular expression. Finite automata are used to recognize patterns in input and accept or reject based on whether the pattern occurs.

Uploaded by

Vikas Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

56 views

Compiler Design Assignment

Uploaded by

Vikas Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

Q 1.

Short note on specifications of tokens

Answer: There are 3 specifications of tokens:
1)Strings
2) Language
3)Regular expression

Strings and Languages

v An alphabet or character class is a finite set of symbols.
v A string over an alphabet is a finite sequence of symbols drawn from that
alphabet.
v A language is any countable set of strings over some fixed alphabet.
In language theory, the terms "sentence" and "word" are often used as
synonyms for

"string." The length of a string s, usually written |s|, is the number of

occurrences of symbols in s. For example, banana is a string of length six. The
empty string, denoted ε, is the string of length zero.

Operations on strings
The following string-related terms are commonly used:

1. A prefix of string s is any string obtained by removing zero or more

symbols from the end of string s. For example, ban is a prefix of banana.

2. A suffix of string s is any string obtained by removing zero or more

symbols from the beginning of s. For example, nana is a suffix of banana.

3. A substring of s is obtained by deleting any prefix and any suffix from s.

For example, nan is a substring of banana.

4. The proper prefixes, suffixes, and substrings of a string s are those

prefixes, suffixes, and substrings, respectively of s that are not ε or not equal to
s itself.
5. A subsequence of s is any string formed by deleting zero or more not
necessarily consecutive positions of s
6. For example, baan is a subsequence of banana.

Operations on languages:
The following are the operations that can be applied to languages:
1. Union
2. Concatenation
3. Kleene closure
4. Positive closure

The following example shows the operations on strings: Let L={0,1} and
S={a,b,c}

Regular Expressions
· Each regular expression r denotes a language L(r).

· Here are the rules that define the regular expressions over some alphabet
Σ and the languages that those expressions denote:

1.ε is a regular expression, and L(ε) is { ε }, that is, the language whose sole
member is the empty string.
2. If ‘a’ is a symbol in Σ, then ‘a’ is a regular expression, and L(a) = {a}, that is,
the language with one string, of length one, with ‘a’ in its one position.
3.Suppose r and s are regular expressions denoting the languages L(r) and L(s).
Then, a) (r)|(s) is a regular expression denoting the language L(r) U L(s).

b) (r)(s) is a regular expression denoting the language L(r)L(s). c) (r)* is a

regular expression denoting (L(r))*.
d) (r) is a regular expression denoting L(r).
4.The unary operator * has highest precedence and is left associative.
5.Concatenation has second highest precedence and is left associative.
6. | has lowest precedence and is left associative.

Regular set

A language that can be defined by a regular expression is called a regular set. If

two regular expressions r and s denote the same regular set, we say they are
equivalent and write r = s.

There are a number of algebraic laws for regular expressions that can be used to
manipulate into equivalent forms.
For instance, r|s = s|r is commutative; r|(s|t)=(r|s)|t is associative.

Regular Definitions
Giving names to regular expressions is referred to as a Regular definition. If Σ is
an alphabet of basic symbols, then a regular definition is a sequence of
definitions of the form
dl → r 1
d2 → r2

………
dn → rn
1.Each di is a distinct name.
2.Each ri is a regular expression over the alphabet Σ U {dl, d2,. . . , di-l}.

Example: Identifiers is the set of strings of letters and digits beginning with a
letter. Regular
definition for this set:

letter → A | B | …. | Z | a | b | …. | z | digit → 0 | 1 | …. | 9

id → letter ( letter | digit ) *

Shorthands

Certain constructs occur so frequently in regular expressions that it is

convenient to introduce notational short hands for them.
1. One or more instances (+):
- The unary postfix operator + means “ one or more instances of” .

- If r is a regular expression that denotes the language L(r), then ( r ) + is a regular

expression that denotes the language (L (r ))+

- Thus the regular expression a+ denotes the set of all strings of one or more a’s.
- The operator + has the same precedence and associativity as the operator *.

2. Zero or one instance ( ?):

- The unary postfix operator ? means “zero or one instance of”.

- The notation r? is a shorthand for r | ε.

- If ‘r’ is a regular expression, then ( r )? is a regular expression that denotes the
language

3. Character Classes:
- The notation [abc] where a, b and c are alphabet symbols denotes the regular
expression a | b | c.
- Character class such as [a – z] denotes the regular expression a | b | c | d | ….|z.
- We can describe identifiers as being strings generated by the regular
expression, [A–Za–z][A– Za–z0–9]*

Non-regular Set

A language which cannot be described by any regular expression is a

non-regular set. Example: The set of all strings of balanced parentheses and
repeating strings cannot be described by a regular expression. This set can be
specified by a context-free grammar.

Q 3. Short note on Recognition of tokens

Answer: Tokens can be recognized by Finite Automata
A Finite automaton(FA) is a simple idealized machine used to recognize
patterns within input taken from some character set(or Alphabet) C. The job of
FA is to accept or reject an input depending on whether the pattern defined by
the FA occurs in the input.
There are two notations for representing Finite Automata. They are
Transition Diagram
Transition Table
Transition diagram is a directed labeled graph in which it contains nodes and
edges
Nodes represents the states and edges represents the transition of a state
Every transition diagram is only one initial state represented by an arrow mark
(-->) and zero or more final states are represented by double circle
Example:

Where state "1" is initial state and state 3 is final state.

Finite Automata for recognizing identifiers
Finite Automata for recognizing keywords

Finite Automata for recognizing numbers

Finite Automata for relational operators

Finite Automata for recognizing white spaces

ioi

Zero Inflated Models and Generalized Linear Mixed Models With R PDF
80% (5)
Zero Inflated Models and Generalized Linear Mixed Models With R PDF
342 pages
Timoshenko Beam Theory
No ratings yet
Timoshenko Beam Theory
6 pages
UMN EE 2301 Exam 1
No ratings yet
UMN EE 2301 Exam 1
7 pages
Essay Transfer of Learning
No ratings yet
Essay Transfer of Learning
37 pages
SPECIFICATION OF TOKENS - Unit 1
No ratings yet
SPECIFICATION OF TOKENS - Unit 1
13 pages
Specification of Tokens
No ratings yet
Specification of Tokens
21 pages
Lexical Analyzer 1
No ratings yet
Lexical Analyzer 1
37 pages
Specification of Tokens
No ratings yet
Specification of Tokens
21 pages
Chapter Two (3) (Autosaved)
No ratings yet
Chapter Two (3) (Autosaved)
29 pages
Lec 4
No ratings yet
Lec 4
16 pages
Specification of Tokens
0% (1)
Specification of Tokens
17 pages
Specification of Tokens
No ratings yet
Specification of Tokens
17 pages
2_2Specification of Tokens
No ratings yet
2_2Specification of Tokens
17 pages
Unit22pdf 2021 03 13 13 38 11
No ratings yet
Unit22pdf 2021 03 13 13 38 11
114 pages
Lexi Cal a Analyzer
No ratings yet
Lexi Cal a Analyzer
38 pages
Specification of Tokens Using Regular Expressions
No ratings yet
Specification of Tokens Using Regular Expressions
8 pages
2. Regular Expressions
No ratings yet
2. Regular Expressions
4 pages
Chapter 3 - Regular Expression
No ratings yet
Chapter 3 - Regular Expression
16 pages
chapter 3
No ratings yet
chapter 3
10 pages
Regular Expressions and Regular Languages
No ratings yet
Regular Expressions and Regular Languages
5 pages
Chap-2 2 (RegularExpression)
No ratings yet
Chap-2 2 (RegularExpression)
46 pages
Lecture 3a and 3b
No ratings yet
Lecture 3a and 3b
21 pages
Compiler 2
No ratings yet
Compiler 2
10 pages
Chapter THREE
No ratings yet
Chapter THREE
24 pages
TPL lect 15 - 16
No ratings yet
TPL lect 15 - 16
5 pages
CC 2
No ratings yet
CC 2
65 pages
Regular - Expressions For FL & A
No ratings yet
Regular - Expressions For FL & A
34 pages
Lexical Analyzer 2023
No ratings yet
Lexical Analyzer 2023
38 pages
Lecture02 Scanning 1
No ratings yet
Lecture02 Scanning 1
72 pages
Lexical Analysis
No ratings yet
Lexical Analysis
41 pages
Unit I
No ratings yet
Unit I
37 pages
Regular Expression: Dept. of Computer Science Faculty of Science and Technology
No ratings yet
Regular Expression: Dept. of Computer Science Faculty of Science and Technology
16 pages
Lecture Slides Regular Expressions
No ratings yet
Lecture Slides Regular Expressions
138 pages
2 Lexical Analizer
No ratings yet
2 Lexical Analizer
56 pages
TOA Lecture 03
No ratings yet
TOA Lecture 03
63 pages
chapter two
No ratings yet
chapter two
59 pages
Acd Unit-2
No ratings yet
Acd Unit-2
16 pages
Regular expressions
No ratings yet
Regular expressions
21 pages
CS 346: Compilers: Lexical Analyzer Lexical Analyzer
No ratings yet
CS 346: Compilers: Lexical Analyzer Lexical Analyzer
52 pages
Lect2 Lexical
No ratings yet
Lect2 Lexical
9 pages
Unit Ii
No ratings yet
Unit Ii
25 pages
Chapter No.1
No ratings yet
Chapter No.1
31 pages
Regular Expressions and Languages
No ratings yet
Regular Expressions and Languages
16 pages
Regular Expressions (2)
No ratings yet
Regular Expressions (2)
17 pages
3 RegularExpressions
No ratings yet
3 RegularExpressions
25 pages
Regular expression
No ratings yet
Regular expression
89 pages
Theory of Computation: Dr. Krishnendu Rarhi E: Krishnendu.e9621@cumail - in
No ratings yet
Theory of Computation: Dr. Krishnendu Rarhi E: Krishnendu.e9621@cumail - in
44 pages
TOC Full Syllabus Notes
No ratings yet
TOC Full Syllabus Notes
145 pages
ch3 M.PPTX - 0
No ratings yet
ch3 M.PPTX - 0
46 pages
Compiler Lecture 7
No ratings yet
Compiler Lecture 7
18 pages
Regular Expressions
100% (2)
Regular Expressions
4 pages
Compiler Lecture 7
No ratings yet
Compiler Lecture 7
18 pages
Chapter 4
No ratings yet
Chapter 4
31 pages
Formal Languages and Automata Theory
No ratings yet
Formal Languages and Automata Theory
24 pages
Regular Expressions
No ratings yet
Regular Expressions
31 pages
Automata Lectuee3
No ratings yet
Automata Lectuee3
27 pages
Recognition of Tokens
No ratings yet
Recognition of Tokens
34 pages
Unit 3 - Regular Expression
No ratings yet
Unit 3 - Regular Expression
45 pages
Language About Complier Construction
No ratings yet
Language About Complier Construction
23 pages
Regular Grammars
100% (2)
Regular Grammars
46 pages
Automata Theory Computability - M2
No ratings yet
Automata Theory Computability - M2
68 pages
Formal Languages and Automata Theory - Regular Expressions and Finite Automata
No ratings yet
Formal Languages and Automata Theory - Regular Expressions and Finite Automata
17 pages
Introduction to Formal Languages
From Everand
Introduction to Formal Languages
György E. Révész
2/5 (1)
Ian Talks Regex A-Z
From Everand
Ian Talks Regex A-Z
Ian Eress
No ratings yet
Bunglu Ka Kaam Without Page Number
No ratings yet
Bunglu Ka Kaam Without Page Number
68 pages
Pharma Simmu Edt
No ratings yet
Pharma Simmu Edt
29 pages
Python Assignment 2
No ratings yet
Python Assignment 2
8 pages
Automata
No ratings yet
Automata
9 pages
PB22 Algebra RMT
No ratings yet
PB22 Algebra RMT
4 pages
Powerpoint On Permutation and Combination
No ratings yet
Powerpoint On Permutation and Combination
19 pages
Formulario CDI
No ratings yet
Formulario CDI
6 pages
Bicomplex Holomorphic Functions The Algebra Geometry And Analysis Of Bicomplex Numbers 1st Edition M Elena Lunaelizarrars download
No ratings yet
Bicomplex Holomorphic Functions The Algebra Geometry And Analysis Of Bicomplex Numbers 1st Edition M Elena Lunaelizarrars download
80 pages
Chapter 6F-PropCRV - W PDF
No ratings yet
Chapter 6F-PropCRV - W PDF
30 pages
anne of green gables
No ratings yet
anne of green gables
3 pages
Grey Kangaroo 2023 Solutions
No ratings yet
Grey Kangaroo 2023 Solutions
12 pages
Matlab For Dynamic Modeling
No ratings yet
Matlab For Dynamic Modeling
45 pages
EEE241 Lectureuctory Concepts
No ratings yet
EEE241 Lectureuctory Concepts
66 pages
Transportation Problem Using Vogel's Approximation Method Calculator - 2
No ratings yet
Transportation Problem Using Vogel's Approximation Method Calculator - 2
5 pages
Linear Model
No ratings yet
Linear Model
11 pages
ME-314 - Assignment - Root Locus
No ratings yet
ME-314 - Assignment - Root Locus
5 pages
Introduction To Machine Learning Prof. Anirban Santara Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur
No ratings yet
Introduction To Machine Learning Prof. Anirban Santara Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur
15 pages
3
No ratings yet
3
100 pages
Just For Fun
No ratings yet
Just For Fun
12 pages
MS 207 Management Science Notes Unit 1-4
No ratings yet
MS 207 Management Science Notes Unit 1-4
616 pages
Maths Activity 12 All PDF
50% (4)
Maths Activity 12 All PDF
77 pages
Physics From Symmetry Jakob Schwichtenberg All Chapter Instant Download
100% (4)
Physics From Symmetry Jakob Schwichtenberg All Chapter Instant Download
52 pages
How To Write Homework in Hindi
100% (1)
How To Write Homework in Hindi
8 pages
3 Differential Leveling
No ratings yet
3 Differential Leveling
4 pages
Lecture-23 - LQR PDF
No ratings yet
Lecture-23 - LQR PDF
15 pages
Azimuths Coordinates
No ratings yet
Azimuths Coordinates
13 pages
2012 Math Talent Quest Official Test PDF
No ratings yet
2012 Math Talent Quest Official Test PDF
5 pages
The in Uence of The Building Shape On The Costs of Its Construction
No ratings yet
The in Uence of The Building Shape On The Costs of Its Construction
14 pages
02c 1MA1 2F June 2024 mark scheme (pdf)
No ratings yet
02c 1MA1 2F June 2024 mark scheme (pdf)
22 pages
Laws of Indices
No ratings yet
Laws of Indices
52 pages

Compiler Design Assignment

Uploaded by

Compiler Design Assignment

Uploaded by

Q 1.

Short note on specifications of tokens

Strings and Languages

"string." The length of a string s, usually written |s|, is the number of

1. A prefix of string s is any string obtained by removing zero or more

2. A suffix of string s is any string obtained by removing zero or more

3. A substring of s is obtained by deleting any prefix and any suffix from s.

4. The proper prefixes, suffixes, and substrings of a string s are those

b) (r)(s) is a regular expression denoting the language L(r)L(s). c) (r)* is a

A language that can be defined by a regular expression is called a regular set. If

id → letter ( letter | digit ) *

Certain constructs occur so frequently in regular expressions that it is

- If r is a regular expression that denotes the language L(r), then ( r ) + is a regular

2. Zero or one instance ( ?):

- The notation r? is a shorthand for r | ε.

A language which cannot be described by any regular expression is a

Q 3. Short note on Recognition of tokens

Where state "1" is initial state and state 3 is final state.

Finite Automata for recognizing numbers

Finite Automata for relational operators

Finite Automata for recognizing white spaces

You might also like