0% found this document useful (0 votes)

127 views5 pages

Sri Vidya College of Engineering and Technology Question Bank

The document discusses lexical analysis and regular expressions. It contains 14 questions and answers about topics such as: 1) The role of a lexical analyzer is to read the source code and group characters into tokens. 2) A regular expression can be converted to a non-deterministic finite automaton and then to a deterministic finite automaton to recognize patterns in the source code. 3) Lexical analysis, parsing, and semantic analysis are the main tasks of a compiler that operate on the token stream from the lexical analyzer.

Uploaded by

Thiyagarajan Jayasankar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

127 views5 pages

Sri Vidya College of Engineering and Technology Question Bank

Uploaded by

Thiyagarajan Jayasankar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

SRI VIDYA COLLEGE OF ENGINEERING AND TECHNOLOGY QUESTION BANK

UNIT-II LEXICAL ANALYSIS

2 MARKS
1. What is Lexical Analysis?
The first phase of compiler is Lexical Analysis. This is also known as linear analysis
in which the stream of characters making up the source program is read from
left-to-right and grouped into tokens that are sequences of characters having a
collective meaning.
2. What is a lexeme? Define a regular set.
• A Lexeme is a sequence of characters in the source program that is matched by
the pattern for a token.
• A language denoted by a regular expression is said to be a regular set

3. What is a sentinel? What is its usage?

A Sentinel is a special character that cannot be part of the source program. Normally
weuse ‘eof’ as the sentinel. This is used for speeding-up the lexical analyzer.

4. What is a regular expression? State the rules, which define regular expression?
Regular expression is a method to describe regular language
Rules:
1) ε-is a regular expression that denotes {ε} that is the set containing the empty
string
2) If a is a symbol in ∑,then a is a regular expression that denotes {a}
3) Suppose r and s are regular expressions denoting the languages L(r ) and L(s)
Then,
a) (r )/(s) is a regular expression denoting L(r) U L(s).
b) (r )(s) is a regular expression denoting L(r )L(s)
c) (r )* is a regular expression denoting L(r)*.
d) (r) is a regular expression denoting L(r ).

5. What are the Error-recovery actions in a lexical analyzer?

1. Deleting an extraneous character
2. Inserting a missing character
3. Replacing an incorrect character by a correct character
4. Transposing two adjacent characters

6. Construct Regular expression for the language

L= {w ε{a,b}/w ends in abb}
Ans: {a/b}*abb.

7. What is recognizer?
Recognizers are machines. These are the machines which accept the strings belonging to
certain
language. If the valid strings of such language are accepted by the machine then it is said that
the corresponding language is accepted by that machine, otherwise it is rejected.

CS6660 COMPILER DESIGN UNIT-2

STUDENTSFOCUS.COM
SRI VIDYA COLLEGE OF ENGINEERING AND TECHNOLOGY QUESTION BANK

8. Differentiate compiler and interpreter.

Compiler produces a target program whereas an interpreter performs the
operations implied by the source program.

9. Write short notes on buffer pair.

Concerns with efficiency issues
Used with a lookahead on the input
It is a specialized buffering technique used to reduce the overhead required to
process an input character. Buffer is divided into two N-character halves. Use two
pointers. Used at times when the lexical analyzer needs to look ahead several characters
beyond the lexeme for a pattern before a match is announced.
10. Differentiate tokens, patterns, lexeme.

Tokens- Sequence of characters that have a collective meaning.

Patterns- There is a set of strings in the input for which the same token is produced as
output. This set of strings is described by a rule called a pattern associated with the
token
Lexeme- A sequence of characters in the source program that is matched by the
pattern for a token.

11. List the operations on languages.

Union - L U M ={s | s is in L or s is in M}
Concatenation – LM ={st | s is in L and t is in M}
Kleene Closure – L* (zero or more concatenations of L)
Positive Closure – L+ ( one or more concatenations of L)

12. Write a regular expression for an identifier.

An identifier is defined as a letter followed by zero or more letters or digits.The

regular expression for an identifier is given as letter (letter | digit)*

13. Mention the various notational shorthands for representing regular expressions.

One or more instances (+)

Zero or one instance (?)
Character classes ([abc] where a,b,c are alphabet symbols denotes the regular
expressions a | b | c.)
Non regular sets

14. What is the function of a hierarchical analysis?

Hierarchical analysis is one in which the tokens are grouped hierarchically into nested
collections with collective meaning. Also termed as Parsing.

CS6660 COMPILER DESIGN UNIT-2

STUDENTSFOCUS.COM
SRI VIDYA COLLEGE OF ENGINEERING AND TECHNOLOGY QUESTION BANK

15. What does a semantic analysis do?

Semantic analysis is one in which certain checks are performed to ensure that
components of a program fit together meaningfully. Mainly performs type checking.

16 MARKS

1)What are roles and tasks of a lexical analyzer?

Main Task: Take a token sequence from the scanner and verify that it is a syntactically correct
program.
Secondary Tasks:
Process declarations and set up symbol table information accordingly, in preparation for
semantic analysis.
Construct a syntax tree in preparation for intermediate code generation.

2. Converting a Regular Expression into a Deterministic Finite Automaton

The task of a scanner generator, such as JLex, is to generate the transition tables or to synthesize
the scanner program given a scanner specification (in the form of a set of REs). So it needs to
convert REs into a single DFA. This is accomplished in two steps: first it converts REs into a
non-deterministic finite automaton (NFA) and then it converts the NFA into a DFA.

An NFA is similar to a DFA but it also permits multiple transitions over the same character and
transitions over . In the case of multiple transitions from a state over the same character, when
we are at this state and we read this character, we have more than one choice; the NFA succeeds
if at least one of these choices succeeds. The transition doesn't consume any input characters,
so you may jump to another state for free.

Clearly DFAs are a subset of NFAs. But it turns out that DFAs and NFAs have the same
expressive power. The problem is that when converting a NFA to a DFA we may get an
exponential blowup in the number of states.

We will first learn how to convert a RE into a NFA. This is the easy part. There are only 5 rules,
one for each type of RE:

CS6660 COMPILER DESIGN UNIT-2

STUDENTSFOCUS.COM
SRI VIDYA COLLEGE OF ENGINEERING AND TECHNOLOGY QUESTION BANK

As it can been shown inductively, the above rules construct NFAs with only one final state. For
example, the third rule indicates that, to construct the NFA for the RE AB, we construct the
NFAs for A and B, which are represented as two boxes with one start state and one final state for
each box. Then the NFA for AB is constructed by connecting the final state of A to the start state
of B using an empty transition.

For example, the RE (a| b)c is mapped to the following NFA:

The next step is to convert a NFA to a DFA (called subset construction). Suppose that you assign
a number to each NFA state. The DFA states generated by subset construction have sets of
numbers, instead of just one number. For example, a DFA state may have been assigned the set
{5, 6, 8}. This indicates that arriving to the state labeled {5, 6, 8} in the DFA is the same as
arriving to the state 5, the state 6, or the state 8 in the NFA when parsing the same input. (Recall
that a particular input sequence when parsed by a DFA, leads to a unique state, while when
parsed by a NFA it may lead to multiple states.)

First we need to handle transitions that lead to other states for free (without consuming any
input). These are the transitions. We define the closure of a NFA node as the set of all the
nodes reachable by this node using zero, one, or more transitions. For example, The closure of
node 1 in the left figure below

CS6660 COMPILER DESIGN UNIT-2

STUDENTSFOCUS.COM
SRI VIDYA COLLEGE OF ENGINEERING AND TECHNOLOGY QUESTION BANK

is the set {1, 2}. The start state of the constructed DFA is labeled by the closure of the NFA start
state. For every DFA state labeled by some set {s1,..., sn} and for every character c in the
language alphabet, you find all the states reachable by s1, s2, ..., or sn using c arrows and you
union together the closures of these nodes. If this set is not the label of any other node in the
DFA constructed so far, you create a new DFA node with this label. For example, node {1, 2} in
the DFA above has an arrow to a {3, 4, 5} for the character a since the NFA node 3 can be
reached by 1 on a and nodes 4 and 5 can be reached by 2. The b arrow for node {1, 2} goes to
the error node which is associated with an empty set of NFA nodes.

The following NFA recognizes (a| b)*(abb | a+b), even though it wasn't constructed with the
above RE-to-NFA rules. It has the following DFA:

CS6660 COMPILER DESIGN UNIT-2

STUDENTSFOCUS.COM

Starplan Cabinet Vision Manual Basic
No ratings yet
Starplan Cabinet Vision Manual Basic
48 pages
Pembekalan MOS Excel Expert 2019 - Day 3
No ratings yet
Pembekalan MOS Excel Expert 2019 - Day 3
59 pages
RFI Completion Guide - DOC-59001 PDF
No ratings yet
RFI Completion Guide - DOC-59001 PDF
3 pages
Unit Ii
No ratings yet
Unit Ii
5 pages
CompilerD L3
No ratings yet
CompilerD L3
36 pages
Unit 2
No ratings yet
Unit 2
93 pages
CD - Unit II - Notes
No ratings yet
CD - Unit II - Notes
20 pages
Chapter 2 Lexical Analysis
No ratings yet
Chapter 2 Lexical Analysis
55 pages
Lec02 Lexicalanalyzer
100% (1)
Lec02 Lexicalanalyzer
50 pages
Lexical Analysis
No ratings yet
Lexical Analysis
36 pages
ECS-603 Put 13-14 Sol
No ratings yet
ECS-603 Put 13-14 Sol
24 pages
CS 346: Compilers: Lexical Analyzer Lexical Analyzer
No ratings yet
CS 346: Compilers: Lexical Analyzer Lexical Analyzer
52 pages
Chapter 2
No ratings yet
Chapter 2
56 pages
CD Mod 1 & 2
No ratings yet
CD Mod 1 & 2
32 pages
1) Role of Lexical Analysis and Its Issues
No ratings yet
1) Role of Lexical Analysis and Its Issues
10 pages
Chapter Two LexicalAnalysis
No ratings yet
Chapter Two LexicalAnalysis
16 pages
1st Phase Lexical Analyzer
No ratings yet
1st Phase Lexical Analyzer
33 pages
Lexical Analysis: Winter 2007 SEG2101 Chapter 8 1
No ratings yet
Lexical Analysis: Winter 2007 SEG2101 Chapter 8 1
50 pages
2nd Unti QN
No ratings yet
2nd Unti QN
6 pages
Compiler Design - Lexical Analysis
No ratings yet
Compiler Design - Lexical Analysis
16 pages
CS3501 CD Qb-Unit 1
No ratings yet
CS3501 CD Qb-Unit 1
6 pages
18CS61 SSC II IA Question Bank
No ratings yet
18CS61 SSC II IA Question Bank
4 pages
Question Bank Part A, Part B&C
No ratings yet
Question Bank Part A, Part B&C
15 pages
Unit II - Lexical Analysis-20-1-2021
No ratings yet
Unit II - Lexical Analysis-20-1-2021
49 pages
CSE302: Compiler Design
No ratings yet
CSE302: Compiler Design
18 pages
Chapter Two (3) (Autosaved)
No ratings yet
Chapter Two (3) (Autosaved)
29 pages
Compiler Design Lab Manual
No ratings yet
Compiler Design Lab Manual
32 pages
Lexical Analysis: Leonidas Fegaras
No ratings yet
Lexical Analysis: Leonidas Fegaras
28 pages
Compiler Design Part 2
No ratings yet
Compiler Design Part 2
20 pages
Compiler Construction Lecture 3-4
No ratings yet
Compiler Construction Lecture 3-4
78 pages
Compiler Design Two Marks
50% (2)
Compiler Design Two Marks
17 pages
Compiler Construction Week 6
No ratings yet
Compiler Construction Week 6
10 pages
Lexical Analysis All Token List and Diffence
No ratings yet
Lexical Analysis All Token List and Diffence
4 pages
2 Lexical
100% (1)
2 Lexical
7 pages
2.1 Constituents of Lexical Analysis
No ratings yet
2.1 Constituents of Lexical Analysis
10 pages
CC Unit 2
No ratings yet
CC Unit 2
80 pages
CD GTU Study Material Presentations Unit-2 27082020063553AM
No ratings yet
CD GTU Study Material Presentations Unit-2 27082020063553AM
84 pages
5CS4-02-CD - Guess Paper @zammers
No ratings yet
5CS4-02-CD - Guess Paper @zammers
97 pages
Compiler Assignment
No ratings yet
Compiler Assignment
12 pages
Slides CHP 3 and 4
No ratings yet
Slides CHP 3 and 4
21 pages
Chapter 2
No ratings yet
Chapter 2
99 pages
Compilers - Week 2
No ratings yet
Compilers - Week 2
14 pages
VMKV Engineering College Department of Computer Science & Engineering Principles of Compiler Design Unit I Part-A
No ratings yet
VMKV Engineering College Department of Computer Science & Engineering Principles of Compiler Design Unit I Part-A
80 pages
CS606 Midterm
No ratings yet
CS606 Midterm
11 pages
CP 324 Lexical Analysis l3
No ratings yet
CP 324 Lexical Analysis l3
27 pages
2 - Compilers (Lexical Analysis)
No ratings yet
2 - Compilers (Lexical Analysis)
60 pages
Lexical Analysis
No ratings yet
Lexical Analysis
47 pages
1 Question Bank
No ratings yet
1 Question Bank
43 pages
SLD 2
No ratings yet
SLD 2
67 pages
Lecture 3
No ratings yet
Lecture 3
31 pages
CT 1 - A Answer Key
No ratings yet
CT 1 - A Answer Key
6 pages
CD ppt1
No ratings yet
CD ppt1
62 pages
Cs2303 Theory of Computation 2marks
100% (1)
Cs2303 Theory of Computation 2marks
20 pages
Compiler Design
No ratings yet
Compiler Design
46 pages
Code Source Tokens Scanner Parser IR
No ratings yet
Code Source Tokens Scanner Parser IR
26 pages
Cse384 Compiler Design Laboratory Lab Manual
No ratings yet
Cse384 Compiler Design Laboratory Lab Manual
55 pages
IA3-CTCD QB - COE-General 20.9.24 New
No ratings yet
IA3-CTCD QB - COE-General 20.9.24 New
19 pages
Lect 03
No ratings yet
Lect 03
19 pages
Gujarati Barakhadi - PDF
No ratings yet
Gujarati Barakhadi - PDF
33 pages
Shameeluddin Medium Com Easy Ways To Hack Into Zkteco Biomet
No ratings yet
Shameeluddin Medium Com Easy Ways To Hack Into Zkteco Biomet
11 pages
Unicast vs. Multicast vs. Broadcast
No ratings yet
Unicast vs. Multicast vs. Broadcast
3 pages
FM 02-02technical Query Inwards Register
No ratings yet
FM 02-02technical Query Inwards Register
2 pages
Esoteric Magic by Slidesgo
No ratings yet
Esoteric Magic by Slidesgo
54 pages
PPT ch04
No ratings yet
PPT ch04
71 pages
CA5668
No ratings yet
CA5668
1 page
Công Nghệ Blockchain UET
No ratings yet
Công Nghệ Blockchain UET
7 pages
SPOM SET D P4 Digital Ecosystem and Controls
No ratings yet
SPOM SET D P4 Digital Ecosystem and Controls
6 pages
Tests Answer Key PDF
0% (1)
Tests Answer Key PDF
11 pages
Disassembly & Reassembly
No ratings yet
Disassembly & Reassembly
4 pages
ME3EI02 Operations Research
No ratings yet
ME3EI02 Operations Research
4 pages
Log
No ratings yet
Log
86 pages
Robotics TP
No ratings yet
Robotics TP
15 pages
MSC Course Information 2025
No ratings yet
MSC Course Information 2025
22 pages
Backup Procedure For RS232
100% (1)
Backup Procedure For RS232
36 pages
Train Ticket Reservation
No ratings yet
Train Ticket Reservation
14 pages
Tourism and Travel Management Formatted Paper
No ratings yet
Tourism and Travel Management Formatted Paper
10 pages
Suvigya Saxena - 20IM30022 - CV
No ratings yet
Suvigya Saxena - 20IM30022 - CV
1 page
2009 Batch Regular
No ratings yet
2009 Batch Regular
2 pages
Lab 4
No ratings yet
Lab 4
7 pages
Leica Cyclone REGISTER 360 QuickStartGuide - EN
No ratings yet
Leica Cyclone REGISTER 360 QuickStartGuide - EN
29 pages
Software Engineering Processes
No ratings yet
Software Engineering Processes
34 pages
Data Strategy Worksheet: Component Typical Questions
No ratings yet
Data Strategy Worksheet: Component Typical Questions
2 pages
SSH Key Implementation
No ratings yet
SSH Key Implementation
2 pages
SeeThru RE7 Touch - Datasheet
No ratings yet
SeeThru RE7 Touch - Datasheet
1 page
Montero Sport
No ratings yet
Montero Sport
56 pages

Sri Vidya College of Engineering and Technology Question Bank

Uploaded by

Sri Vidya College of Engineering and Technology Question Bank

Uploaded by

SRI VIDYA COLLEGE OF ENGINEERING AND TECHNOLOGY QUESTION BANK

UNIT-II LEXICAL ANALYSIS

3. What is a sentinel? What is its usage?

5. What are the Error-recovery actions in a lexical analyzer?

6. Construct Regular expression for the language

CS6660 COMPILER DESIGN UNIT-2

8. Differentiate compiler and interpreter.

9. Write short notes on buffer pair.

Tokens- Sequence of characters that have a collective meaning.

11. List the operations on languages.

12. Write a regular expression for an identifier.

An identifier is defined as a letter followed by zero or more letters or digits.The

One or more instances (+)

14. What is the function of a hierarchical analysis?

CS6660 COMPILER DESIGN UNIT-2

15. What does a semantic analysis do?

1)What are roles and tasks of a lexical analyzer?

2. Converting a Regular Expression into a Deterministic Finite Automaton

CS6660 COMPILER DESIGN UNIT-2

For example, the RE (a| b)c is mapped to the following NFA:

CS6660 COMPILER DESIGN UNIT-2

CS6660 COMPILER DESIGN UNIT-2

You might also like