Lexical Analyzer: Using Flex by Dr. S. M. Farhad

This document describes what a lexical analyzer does and how to build one with Flex. The lexical analyzer is the first phase of a compiler: it scans the character stream of the source program, groups the characters into meaningful tokens, and outputs a sequence of tokens. Flex is a tool that generates a lexical analyzer from a file of regular-expression rules. The document gives examples of token patterns and actions, and shows the overall structure of a Flex program.

Uploaded by

HM Mahmudul Hasan Hridoy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

116 views22 pages

Lexical Analyzer: Using Flex by Dr. S. M. Farhad

Uploaded by

HM Mahmudul Hasan Hridoy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 22

Lexical Analyzer Using Flex
By Dr. S. M. Farhad
Lexical Analysis
• First phase of a Compiler
• Also called Scanning
• Scans the character stream of the source program
• Groups the characters into meaningful sequences
– Output: a sequence of tokens
Role of Lexical Analyzer
• Identifies tokens
• Removes whitespace
• Installs lexemes in the symbol table
• Returns tokens to the parser

[Figure: Source Program → Lexical Analyzer → (token) → Parser → to semantic analysis. The parser requests each token with getNextToken; the Symbol Table sits alongside.]
Lexical Analyzer
• Performs those functions

[Figure: Source Program (a program in any language) → Lexical Analyzer → Tokens; lexemes are installed in the Symbol Table.]

Example source program:
do {
Study;
} while (t_CGPA < 3.90)
Lexical Analyzer

• No need to write the code

• Tools that produce the analyzer
– Lex
Lex Tool
• Lex source (lex.l) → Lex Compiler → lex.yy.c
• lex.yy.c → C Compiler → a.out
• Source program → a.out → Tokens
Token, Pattern, Lexeme
• Token: set of strings that represent a particular construct in the source language
• Pattern: rules that describe that string set
– It matches each string in the set
• Lexeme: sequence of characters that is matched by a pattern for a token
Example
Token        Sample Lexemes                     Pattern Description
WHILE        while                              while
RELOP        <, <=, >, >=, <>, ==               < or <= or > or >= or <> or ==
ID           count, account, flag2              letter followed by letters and digits
C comment    /* hubi jabi/* aro habi jabi */    anything between /* and */
NUM          3.14, 3.2E+5, 5.9E-2               sequence of digits having fraction and exponent
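The ID row of the table can be illustrated with a small hand-written matcher. This is only a sketch in C of what the pattern "letter followed by letters and digits" means. The function name match_id is hypothetical and is not part of Lex or of the slides.

```c
#include <assert.h>
#include <ctype.h>
#include <stddef.h>

/* Hypothetical helper: returns the length of the longest prefix of s that
 * fits the ID pattern "letter followed by letters and digits", or 0 if s
 * does not start with a letter. */
size_t match_id(const char *s)
{
    assert(s != NULL);
    if (!isalpha((unsigned char)s[0]))
        return 0;                       /* an ID must begin with a letter */
    size_t len = 1;
    while (isalnum((unsigned char)s[len]))
        len++;                          /* extend over letters and digits */
    return len;
}
```

For the input "flag2 = 1" the returned length is 5, so the lexeme is flag2 and the token is ID.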
Structure of Lex Programs
%{
#include <stdio.h>
// anything here is directly copied to lex.yy.c
int Word_count;
%}
Declarations // regular definitions
%%
Translation rules // token matching & actions
%%
Auxiliary functions // any other functions
Translation rules
• Pattern { Action }
– Pattern: a regular expression that matches the token
– Action: C code that does the work for that token
Regular Expressions
• Specify a set of strings to match
• One expression for each token pattern
• Some expressions
– [ \t\n] // a single delimiter (space, tab, or newline)
– [ \t\n]+ // white space: one or more delimiters
– a(b)* // a followed by zero or more occurrences of b: a, ab, abb, abbb, ...
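To make the meaning of a(b)* concrete, here is a minimal C sketch that checks whether a whole string belongs to that set. The function name matches_a_bstar is hypothetical. Lex builds the equivalent matcher automatically from the expression.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical helper: returns true when the whole string s matches a(b)*,
 * i.e. one 'a' followed by zero or more 'b' characters. */
bool matches_a_bstar(const char *s)
{
    assert(s != NULL);
    if (s[0] != 'a')
        return false;                   /* must start with exactly one a */
    size_t i = 1;
    while (s[i] == 'b')
        i++;                            /* consume zero or more b */
    return s[i] == '\0';                /* nothing else may follow */
}
```

With this sketch, "a" and "abbb" are accepted, while "b" and "aba" are rejected.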
Actions
• Specify what to do when a rule matches a token
• Basically C code
• Examples
%%
[a-zA-Z] {
    printf("I found a letter");
}
[0-9] {
    printf("I found a digit");
}
[ \t\n] {
    /* do nothing */
}
%%
Structure of Lex Programs
%{
#include <stdio.h>
int Word_count;
%}
Declarations // regular definitions
%%
[0-9] {
    printf("I found a digit");
}
%%
Auxiliary functions // any other functions
Regular Definitions
• Give symbolic name to regular expressions
• Examples

delim [ \t\n]
ws {delim}+
digit [0-9]
number {digit}+
Complete Lex Source
%{
#include <stdio.h>
int word_count = 0;
%}
delim [ \t\n]
digit [0-9]
%%
{delim}+ { /* no action */ }
{digit}+ { printf("Here I found a digit");
           word_count++; }
%%
int main(void)
{
    yylex(); /* scan the whole input */
    printf("Total Count: %d", word_count);
    return 0;
}
Assignment
• Write a lexical analyzer for a subset of Pascal.
– Ignore white space
– Match all identifiers (keywords, variables, etc.)
• Insert variables in the symbol table
• Keywords need not be inserted; just print them to the console
– Match all numbers and insert them in the symbol table
– Find all comments
– Find all double-quoted strings
– Count line numbers
Assignment
– Variables start with a letter or underscore (_)
• Ex: a, a9bc, _abc but not 8cde
– Numbers may contain an optional fraction or exponent
• Ex: 3, 3.056, 3.45E5, 3.45E-2, 3E+2
– Comments are anything between { and }; they may not contain a { and may appear after any token
– Relational operators are =, <>, <, <=, >=, >
• Insert the lexeme in the symbol table and print the token RELOP
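The number rule above (digits, then an optional fraction, then an optional exponent) can be sketched as a longest-prefix matcher in C. The names skip_digits and match_number are hypothetical, and the sketch assumes the exponent marker is an uppercase E, as in the examples. A fraction or exponent is taken only when at least one digit follows it.

```c
#include <assert.h>
#include <ctype.h>
#include <stddef.h>

/* Skips a run of decimal digits starting at s[i]; returns the new index. */
static size_t skip_digits(const char *s, size_t i)
{
    while (isdigit((unsigned char)s[i]))
        i++;
    return i;
}

/* Hypothetical helper: returns the length of the longest numeric prefix of
 * s (digits, optional fraction, optional exponent), or 0 if there is none. */
size_t match_number(const char *s)
{
    assert(s != NULL);
    size_t end = skip_digits(s, 0);
    if (end == 0)
        return 0;                       /* a number must start with a digit */

    /* Optional fraction: '.' followed by at least one digit. */
    if (s[end] == '.') {
        size_t frac_end = skip_digits(s, end + 1);
        if (frac_end > end + 1)
            end = frac_end;
    }

    /* Optional exponent: 'E', optional sign, at least one digit. */
    if (s[end] == 'E') {
        size_t i = end + 1;
        if (s[i] == '+' || s[i] == '-')
            i++;
        size_t exp_end = skip_digits(s, i);
        if (exp_end > i)
            end = exp_end;
    }
    return end;
}
```

Requiring a digit after the dot matters for this assignment: on the input "1..10" the sketch matches only "1", which leaves ".." free to be recognized as DOTDOT.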
Assignment
– Addition operators are "+", "-", and "or"
– Multiplication operators are "*", "/", "div", "mod", and "and"
• Insert the lexeme and print the token MULOP
– Other tokens to match
• := // assignment operator, token name is ASSIGNOP
• [ , ] , ( , )
• .. // token name is DOTDOT
• ,
• ; , :
Assignment
• Keywords to match
– program
– if
– not
– end
– begin
– else
– then
– do
– while
– function
– procedure
– integer
– real
– var
– of
– array
– write
• Print the corresponding token name and the line number where it occurs
• The token name for the parser is the keyword name in capital letters
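One way to tell keywords from variables is to match every identifier with a single rule and then look the lexeme up in a keyword table. The C sketch below assumes that approach. The names KEYWORDS and keyword_token are hypothetical, the table holds only a subset of the list, and the comparison is case-sensitive (an assumption; the slides do not say).

```c
#include <assert.h>
#include <ctype.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* A subset of the keyword list, for illustration only. */
static const char *const KEYWORDS[] = {
    "program", "if", "then", "else", "begin", "end", "while", "do"
};

/* Hypothetical helper: if lexeme is a keyword, writes its token name (the
 * keyword in capital letters) into out and returns true. Returns false if
 * lexeme is not a keyword or if out is too small. */
bool keyword_token(const char *lexeme, char *out, size_t out_size)
{
    assert(lexeme != NULL && out != NULL);
    size_t count = sizeof KEYWORDS / sizeof KEYWORDS[0];
    for (size_t k = 0; k < count; k++) {
        if (strcmp(lexeme, KEYWORDS[k]) != 0)
            continue;
        size_t len = strlen(lexeme);
        if (len + 1 > out_size)
            return false;               /* buffer too small */
        for (size_t i = 0; i < len; i++)
            out[i] = (char)toupper((unsigned char)lexeme[i]);
        out[len] = '\0';
        return true;
    }
    return false;                       /* an ordinary identifier */
}
```

When the lookup fails, the lexeme is an ordinary variable and would be inserted in the symbol table instead.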
Additional Requirement
• Identify multi-line comments in C
– /* abrabrabr*/
– /*abrabrabr/*abrabrabr*****abr**/
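The two examples show that a C comment ends at the first */ and that a /* inside it is ordinary text. A minimal C sketch of that rule follows. The function name match_c_comment is hypothetical, and a Lex solution would normally express the same rule as a pattern or with start conditions.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: returns the length of the C comment at the start of
 * s, delimiters included, or 0 if s does not begin with a complete
 * comment. Comments do not nest: the first star-slash pair closes it. */
size_t match_c_comment(const char *s)
{
    assert(s != NULL);
    if (s[0] != '/' || s[1] != '*')
        return 0;                       /* no opening delimiter */
    for (size_t i = 2; s[i] != '\0'; i++) {
        if (s[i] == '*' && s[i + 1] == '/')
            return i + 2;               /* include the closing delimiter */
    }
    return 0;                           /* unterminated comment */
}
```

Both sample comments above are matched in full. Newlines inside the matched span still need to be counted if line numbers are tracked.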
Compilation code
flex -t sample.l > sample.c
g++ -c -o sample.o sample.c
g++ -o samp sample.o -ll
./samp < in.txt > out.txt
