0% found this document useful (0 votes)

997 views8 pages

Exercises For Section 3.3

The document discusses the key concepts in lexical analysis including: - Tokens are abstract symbols consisting of a name and attribute value. Patterns describe the form of lexemes which are sequences of characters matching a token. - Regular expressions are used to specify patterns of lexemes. They define languages as sets of strings over an alphabet. - Key components of regular expressions include the empty string, concatenation, union, closure, and parentheses for grouping.

Uploaded by

Mule Won Tege

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

997 views8 pages

Exercises For Section 3.3

Uploaded by

Mule Won Tege

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 8

a) L à [b-d f-h j-n p-t v-z] String à L(a|A)+ L(e|E)+ L(i|I ) + L(o|O)+ L(u|U)+L

b) A*B*...Z*
c) S1 : the set of all characters and "*/" S2 : S1-/,* Comment -> /* (/* ** (S2 S1*)*)* */
d) want à 0|A?0?1(A0?1|01)*A?0?|A0? A à 0?2(02)*
e) want à (FE*G|(aa)*b)(E|FE*G) E à b(aa)*b F à...

Exercises for Section 3.3

3.3.1
Consult the language reference manuals to determine

1. the sets of characters that form the input alphabet (excluding those that may
only appear in character strings or comments)
2. the lexical form of numerical constants, and
3. the lexical form of identifiers,

for each of the following languages:

1. C
2. C++
3. C#
4. Fortran
5. Java
6. Lisp
7. SQL

3.3.2
Describe the languages denoted by the following regular expressions:

1. a(a|b)*a
2. ((ε|a)b*)*
3. (a|b)*a(a|b)(a|b)
4. a*ba*ba*ba*
5. !! (aa|bb)*((ab|ba)(aa|bb)*(ab|ba)(aa|bb)*)*
Answer

1. String of a's and b's that start and end with a.

2. String of a's and b's.
3. String of a's and b's that the character third from the last is a.
4. String of a's and b's that only contains three b.
5. String of a's and b's that has a even number of a and b.

3.3.3
In a string of length n, how many of the following are there?

1. Prefixes.
2. Suffixes.
3. Proper prefixes.
4. ! Substrings.
5. ! Subsequences.

Answer

1. n + 1
2. n + 1
3. n - 1
4. C(n+1,2) + 1 (need to count epsilon in)
5. Σ(i=0,n) C(n, i)

3.3.4
Most languages are case sensitive, so keywords can be written only one way, and the
regular expressions describing their lexeme is very simple. However, some languages,
like SQL, are case insensitive, so a keyword can be written either in lowercase or in
uppercase, or in any mixture of cases. Thus, the SQL keyword SELECT can also be written
select, Select, or sElEcT, for instance. Show how to write a regular expression for a
keyword in a case insensitive language. Illustrate the idea by writing the expression for
"select" in SQL.

Answer
select -> [Ss][Ee][Ll][Ee][Cc][Tt]

3.3.5
！Write regular definitions for the following languages:

1. All strings of lowercase letters that contain the five vowels in order.
2. All strings of lowercase letters in which the letters are in ascending lexicographic
order.
3. Comments, consisting of a string surrounded by /* and */, without an intervening
*/, unless it is inside double-quotes (")
4. !! All strings of digits with no repeated digits. Hint: Try this problem first with a
few digits, such as {O, 1, 2}.
5. !! All strings of digits with at most one repeated digit.
6. !! All strings of a's and b's with an even number of a's and an odd number of b's.
7. The set of Chess moves,in the informal notation,such as p-k4 or kbp*qn.
8. !! All strings of a's and b's that do not contain the substring abb.
9. All strings of a's and b's that do not contain the subsequence abb.

Answer

1、

want -> other* a (other|a)* e (other|e)* i (other|i)* o (other|o)* u (other|u)*

other -> [bcdfghjklmnpqrstvwxyz]

2、

a* b* ... z*

3、

\/\*([^*"]*|".*"|\*+[^/])*\*\/

4、

want -> 0|A?0?1(A0?1|01)*A?0?|A0?

A -> 0?2(02)*

Steps:

step1. Transition diagram

step2. GNFA
step3. Remove node 0 and simplify

step4. Remove node 2 and simplify

step5. Remove node 1 and simplify

5、

want -> (FEG|(aa)b)(E|FE*G)

E -> b(aa)*b
F -> a(aa)*b
G -> b(aa)*ab|a
F -> ba(aa)*b

Steps:

step1. Transition diagram

step2. GNFA

step3. Remove node A and simplify

step4. Remove node D and simplify

step5. Remove node C and simplify

8、

b*(a+b?)*

9、

b* | b*a+ | b*a+ba*

3.3.6
Write character classes for the following sets of characters:

1. The first ten letters (up to "j") in either upper or lower case.
2. The lowercase consonants.
3. The "digits" in a hexadecimal number (choose either upper or lower case for the
"digits" above 9).
4. The characters that can appear at the end of alegitimate English sentence (e.g. ,
exclamation point) .
Answer

1. [A-Ja-j]
2. [bcdfghjklmnpqrstvwxzy]
3. [0-9a-f]
4. [.?!]

3.3.7
Note that these regular expressions give all of the following symbols (operator
characters) a special meaning:

\ " . ^ $ [ ] * + ? { } | /

Their special meaning must be turned off if they are needed to represent themselves in
a character string. We can do so by quoting the character within a string of length one
or more; e.g., the regular expression "**" matches the string ** . We can also get the
literal meaning of an operator character by preceding it by a backslash. Thus, the regular
expression \*\* also matches the string **. Write a regular expression that matches the
string "\.

Answer

\"\\

3.3.9 !
The regular expression r{m, n} matches from m to n occurrences of the pattern r. For
example, a [ 1 , 5] matches a string of one to five a's. Show that for every regular
expression containing repetition operators of this form, there is an equivalent regular
expression without repetition operators.

Answer

r{m,n} is equals to r.(m).r | r.(m + 1).r | ... | r.(n).r

3.3.10 !
The operator ^ matches the left end of a line, and $ matches the right end of a line. The
operator ^ is also used to introduce complemented character classes, but the context
always makes it clear which meaning is intended. For example, ^[^aeiou]*$ matches any
complete line that does not contain a lowercase vowel.

1. How do you tell which meaning of ^ is intended?

2. Can you always replace a regular expression using the ^ and $ operators by an
equivalent expression that does not use either of these operators?

Answer

1. if ^ is in a pair of brakets, and it is the first letter, it means complemented classes,

or it means the left end of a line.

Token: a two tuple abstract symbol <name, attribute

value>
Pattern: description of the form or representation of
lexemes.
Lexeme: sequence of characters that match with a
pattern of a token identified by lexical analyzer as
instance of token.

Eg- printf("Sum=%d\n",total);

printf and total are lexemes matching the pattern for

token id, "sum=%d\n" is lexeme matching literal.

Specification of Tokens
Regular expressions are used to specify lexeme patterns.

Strings and Languages

Alphabet - finite set of symbols. Ex - {0,1} is binary
alphabet, ASCII is a popular alphabet.
String - finite sequence of symbols belonging to
alphabet.
Language - finite set of strings over a specific alphabet.
Substring - a smaller set of consecutive elements of
string. Ex- pil from compiler.
Subsequence - a smaller set of elements in any order from
string obtained by deleting zero or more elements. Ex
- cpile from compiler.

Regular Expressions
Mathematically, a regular expression is defined as -

1. ε is a regular expression
L ( ε ) = { ε }
It is the language consisting of only the empty
string.

2. r = a is another regular expression for the

language -
L(a)={a}

3. ( a ) + ( b ) → L ( a ) U L ( b )
2.   ( a ) | ( b ) → L ( a ) U L ( b )
3.   ( a ) . ( b ) → L ( a ) . L ( b )
4.   ( a ) *  → ( L ( a ) ) *
5. ( ( a ) ) → L ( a )
6. Tokens- Sequence of characters that have a collective meaning.
7. · Patterns- There is a set of strings in the input for which the same token
is produced as output. This set of strings is described by a rule called a
pattern associated with the token
8. · Lexeme- A sequence of characters in the source program that is
matched by the pattern for a token

Definitions:
Translator
A device that changes a sentence from one language to
another without change of meaning.
Compiler
A program that translates between programming
languages.
Interpreter
A processor that compiles and executes programming
language statements one by one in an interleaved manner.
Syntax
An alphabet and a set of rules defining spatial relationships
between symbols and symbol sets in a language.
Semantics
The meanings assigned to symbols and symbol sets in a
language.
Pragmatics
The meanings perceived to be associated with symbols
and symbol sets in a language

Exercises of Compiler
75% (4)
Exercises of Compiler
51 pages
10th Remidial English
No ratings yet
10th Remidial English
53 pages
More More 5 Practice Book SkillsBook Dictionary 1
No ratings yet
More More 5 Practice Book SkillsBook Dictionary 1
30 pages
CompilerDesign Lab Manual
No ratings yet
CompilerDesign Lab Manual
66 pages
Compiler Design: Ambo University School of Informatics and Electrical Engineering Department of Computer Science
No ratings yet
Compiler Design: Ambo University School of Informatics and Electrical Engineering Department of Computer Science
35 pages
Distributed System - Question Bank.
100% (1)
Distributed System - Question Bank.
4 pages
Compiler Design Chapter-2
60% (5)
Compiler Design Chapter-2
105 pages
Compiler Design Chapter-4
100% (2)
Compiler Design Chapter-4
77 pages
Sincerity Striving Success
No ratings yet
Sincerity Striving Success
13 pages
Compiler Design Lex and Yacc
100% (3)
Compiler Design Lex and Yacc
48 pages
Advanced Database Systems: Chapter 3:query Processing and Evaluation
100% (1)
Advanced Database Systems: Chapter 3:query Processing and Evaluation
36 pages
Bài tập bổ trợ anh 8 Global UNIT 1 (PRACTICE TEST)
100% (1)
Bài tập bổ trợ anh 8 Global UNIT 1 (PRACTICE TEST)
7 pages
Lexical Analysis
No ratings yet
Lexical Analysis
57 pages
Compiler Design Chapter-3
0% (1)
Compiler Design Chapter-3
177 pages
ch3 M.PPTX - 0
No ratings yet
ch3 M.PPTX - 0
46 pages
100 Top Compiler Design Important Questions and Answers PDF
50% (2)
100 Top Compiler Design Important Questions and Answers PDF
20 pages
Primary Academic Calendar 25-26 - Tlm4all
No ratings yet
Primary Academic Calendar 25-26 - Tlm4all
30 pages
CD 2,3 Unit's Material
100% (1)
CD 2,3 Unit's Material
170 pages
3D Game Engine Design A Practical Approach To Real Time Computer Graphics 2nd Edition by David Eberly ISBN 0122290631 9780122290633
100% (7)
3D Game Engine Design A Practical Approach To Real Time Computer Graphics 2nd Edition by David Eberly ISBN 0122290631 9780122290633
40 pages
Rhetorical Devices Exercise
No ratings yet
Rhetorical Devices Exercise
3 pages
Proposal Terbaru
No ratings yet
Proposal Terbaru
21 pages
5.tokens, Patterns, and Lexemes
No ratings yet
5.tokens, Patterns, and Lexemes
7 pages
Cse-V-Formal Languages and Automata Theory (10cs56) - Notes
67% (3)
Cse-V-Formal Languages and Automata Theory (10cs56) - Notes
125 pages
Chapter 4 Automata
No ratings yet
Chapter 4 Automata
36 pages
Module-3 Syntax Analyzer
No ratings yet
Module-3 Syntax Analyzer
80 pages
Clearance Management System Updated
100% (2)
Clearance Management System Updated
25 pages
Tenses Workssheet Grade 9
No ratings yet
Tenses Workssheet Grade 9
4 pages
Exercises For Snbhection 3.3
50% (2)
Exercises For Snbhection 3.3
7 pages
Design and Analysis of Algorithms Important Questions - 2024
No ratings yet
Design and Analysis of Algorithms Important Questions - 2024
5 pages
Unit 4 Question Bank Solutions-1
No ratings yet
Unit 4 Question Bank Solutions-1
11 pages
Chapter 03 - Regular Expression and Language
No ratings yet
Chapter 03 - Regular Expression and Language
42 pages
Assignment 5
100% (1)
Assignment 5
2 pages
A Introduction To Computing Questions For 2016 Exit Exam
No ratings yet
A Introduction To Computing Questions For 2016 Exit Exam
27 pages
First and Follow Set
86% (7)
First and Follow Set
5 pages
WINSEM2023-24 CSI2005 TH VL2023240501823 2024-01-08 Reference-Material-I
No ratings yet
WINSEM2023-24 CSI2005 TH VL2023240501823 2024-01-08 Reference-Material-I
23 pages
Lutchman Katrina - Psyc 3706el 12 - Assignment 1
No ratings yet
Lutchman Katrina - Psyc 3706el 12 - Assignment 1
9 pages
Chapter 7: Kleene's Theorem: Regular Expressions, Finite Automata, Transition Graphs Are All The Same!!
No ratings yet
Chapter 7: Kleene's Theorem: Regular Expressions, Finite Automata, Transition Graphs Are All The Same!!
48 pages
SE Compiler Chapter 2
No ratings yet
SE Compiler Chapter 2
16 pages
Operator Precedence Grammar
100% (2)
Operator Precedence Grammar
5 pages
Numerical Question Based On Shift Reduce Parser
No ratings yet
Numerical Question Based On Shift Reduce Parser
8 pages
Compiler Design Important Questions
100% (1)
Compiler Design Important Questions
1 page
Compiler Design Unit-1 - 4
No ratings yet
Compiler Design Unit-1 - 4
4 pages
Compiler Design Unit 2
No ratings yet
Compiler Design Unit 2
117 pages
Compiler Design Chapter 2
No ratings yet
Compiler Design Chapter 2
14 pages
Chapter Two-Technical Report Writing
No ratings yet
Chapter Two-Technical Report Writing
8 pages
Recursive and Recursively Enumerable Languages
No ratings yet
Recursive and Recursively Enumerable Languages
2 pages
1 Types of Parsers in Compiler Design
100% (1)
1 Types of Parsers in Compiler Design
4 pages
Compiler Design: Syntactic Analysis Sample Exercises and Solutions
No ratings yet
Compiler Design: Syntactic Analysis Sample Exercises and Solutions
22 pages
Final Exam For Finite Automata
50% (2)
Final Exam For Finite Automata
4 pages
Final Year Project Documentation
No ratings yet
Final Year Project Documentation
90 pages
Compiler Construction Midterm Exam
100% (1)
Compiler Construction Midterm Exam
1 page
Final Examination For The Beginner Class
No ratings yet
Final Examination For The Beginner Class
3 pages
Contextualized Individual Summary Record
No ratings yet
Contextualized Individual Summary Record
4 pages
Comparisons - Practise (TV)
No ratings yet
Comparisons - Practise (TV)
7 pages
Prolog Lab Sheets
No ratings yet
Prolog Lab Sheets
36 pages
Passive Voice in English and Vietnamese
No ratings yet
Passive Voice in English and Vietnamese
46 pages
Liste Complète Des 200 Verbes Irréguliers en Anglais
No ratings yet
Liste Complète Des 200 Verbes Irréguliers en Anglais
9 pages
DFA To Regular Expression
No ratings yet
DFA To Regular Expression
38 pages
Csci3255 HW 3
67% (3)
Csci3255 HW 3
5 pages
He Reads The Newspaper Everyday
No ratings yet
He Reads The Newspaper Everyday
13 pages
Basic Medical Terminologies: Leniza Rae L. de Guzman, RMT, Mls (Ascpi)
No ratings yet
Basic Medical Terminologies: Leniza Rae L. de Guzman, RMT, Mls (Ascpi)
32 pages
TOC Unit 3 (CFG) Context Free Grammar
No ratings yet
TOC Unit 3 (CFG) Context Free Grammar
90 pages
Prolog Lab Exercise Assignment by Tolosa Tafese
No ratings yet
Prolog Lab Exercise Assignment by Tolosa Tafese
7 pages
Unit 9 - Lesson 3
No ratings yet
Unit 9 - Lesson 3
6 pages
Course Outline: Addis Ababa University Department of Computer Science
No ratings yet
Course Outline: Addis Ababa University Department of Computer Science
1 page
Test A: Unit 3
No ratings yet
Test A: Unit 3
29 pages
First and Follow-First Function - Rules For Calculating First Function
No ratings yet
First and Follow-First Function - Rules For Calculating First Function
20 pages
Metrics For Software Project Size Estimation
No ratings yet
Metrics For Software Project Size Estimation
3 pages
Lexical Analysis: (Section 3.3)
100% (1)
Lexical Analysis: (Section 3.3)
3 pages
Unit-II RE Question Bank
No ratings yet
Unit-II RE Question Bank
4 pages
Virtual Reality - Definition of Virtual Reality by Merriam ... Merriam ..
No ratings yet
Virtual Reality - Definition of Virtual Reality by Merriam ... Merriam ..
5 pages
Atcd Unit 1
No ratings yet
Atcd Unit 1
58 pages
Left Recursion
No ratings yet
Left Recursion
10 pages
Revisiongrammar
No ratings yet
Revisiongrammar
7 pages
Prctice Question On DAG
No ratings yet
Prctice Question On DAG
21 pages
Exp-4-Eliminating Ambiguity, Left Recursion and Left Factoring - 012
No ratings yet
Exp-4-Eliminating Ambiguity, Left Recursion and Left Factoring - 012
14 pages
A) Keeping The Central Values of The Centralized DFT
No ratings yet
A) Keeping The Central Values of The Centralized DFT
3 pages
Introduction Yo Python Programming
No ratings yet
Introduction Yo Python Programming
23 pages
Huffman Coding MCQ
No ratings yet
Huffman Coding MCQ
9 pages
Automata Theory Assignment 1
100% (1)
Automata Theory Assignment 1
8 pages
SMMM
No ratings yet
SMMM
1 page
Output
No ratings yet
Output
1 page
Exampl
No ratings yet
Exampl
1 page
Question Bank: Short Answer Type Questions
No ratings yet
Question Bank: Short Answer Type Questions
29 pages
Client Server Program Using Remote Procedure Call (RPC)
No ratings yet
Client Server Program Using Remote Procedure Call (RPC)
14 pages
Practice Final Exam For Natural Language Processing
No ratings yet
Practice Final Exam For Natural Language Processing
9 pages
REFERENCES
No ratings yet
REFERENCES
2 pages
Mata Pelajaran: BAHASA INGGRIS Sat. Pendidikan: SMA/MA Kelas / Program: X (SEPULUH)
No ratings yet
Mata Pelajaran: BAHASA INGGRIS Sat. Pendidikan: SMA/MA Kelas / Program: X (SEPULUH)
5 pages
Writing Continuum Checklist
No ratings yet
Writing Continuum Checklist
3 pages
Passive Voice em Vestibulares e Concursos
No ratings yet
Passive Voice em Vestibulares e Concursos
1 page
9.4 Going To For Plans
No ratings yet
9.4 Going To For Plans
1 page
Allama Iqbal Open University, Islamabad (Department of English)
No ratings yet
Allama Iqbal Open University, Islamabad (Department of English)
4 pages
What Do You Mean by Software Crisis? Explain With Examples. How Can Software Crisis Can Be Minimized?
No ratings yet
What Do You Mean by Software Crisis? Explain With Examples. How Can Software Crisis Can Be Minimized?
11 pages
Present Perfect
100% (1)
Present Perfect
3 pages
1 Heads and Modifiers
No ratings yet
1 Heads and Modifiers
7 pages
Tos-English G2 1Q 2022-2023
100% (1)
Tos-English G2 1Q 2022-2023
3 pages
Learning Module - English 9 - Module 3 - Q1 - WK3
No ratings yet
Learning Module - English 9 - Module 3 - Q1 - WK3
3 pages