Lab 2

The document discusses using Flex to perform lexical analysis. It describes what Flex is and how it works, including how it uses regular expressions to identify tokens in input and generates C code. It also explains the structure of Flex specification files and some important Flex concepts and functions.

Compiler Design LAB

Introduction To Flex
Lesson objective
At the end of the lesson, students will be able to:
 be familiar with lexical analysis using Flex
and the process of creating tokens
Flex and lexical analysis
 From the area of compilers, we get a host of tools
for converting text files into programs. The first part of
that process is usually called lexical analysis,
particularly for languages such as C/C++.

 A good tool for creating a lexical analyzer is flex (or lex).

 It takes a specification file (file.l) and creates an
analyzer, usually called lex.yy.c; compiling that with
gcc/g++, etc., produces an application that can emit
tokens.
Lexical analysis terms
 A token is a group of characters having collective
meaning.
 A lexeme is an actual character sequence forming a
specific instance of a token, such as num.
 A pattern is a rule, expressed as a regular
expression, describing how a particular token
can be formed. For example,
[A-Za-z][A-Za-z_0-9]* is a pattern for identifiers.
 Note: characters between tokens are called
whitespace; these include spaces, tabs, and newlines.
Tools for lexical analysis
Use a lexical analyzer generator tool, such as lex /
flex.
lex = lexical analyzer generator (originally on Unix)
flex = fast lexical analyzer generator (a free
reimplementation, also available on Windows)
flex takes your specification code (regular expressions),
generates a combined NFA to recognize all your
patterns, converts it to an equivalent DFA,
minimizes the automaton as much as possible, and
generates C code that implements it.
How is it processed? With a few command lines:
flex name.l                  produces lex.yy.c
gcc/g++ lex.yy.c             produces a.exe
gcc/g++ lex.yy.c -o token    produces token.exe
flex source program (.l)  -->  flex          -->  lex.yy.c
lex.yy.c                  -->  C++ compiler  -->  a.exe
input                     -->  a.exe         -->  tokens
Flex file format
definition section
%%
rule section
%%
auxiliary procedures

flex input files (flex specifications) are structured as follows:
 The flex input file consists of three sections,
separated by a line containing just %%:
%{
declarations
%}
regular definitions
%%
translation rules
%%
auxiliary procedures (user subroutines)
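Assembled, a minimal complete specification might look like this (a sketch; the `words` counter and the word/whitespace rules are illustrative, not from the original):

```lex
%{
#include <stdio.h>
int words = 0;   /* global declared in the definitions section */
%}
%option noyywrap
WORD    [A-Za-z]+
%%
{WORD}     { words++; printf("word: %s\n", yytext); }
[ \t\n]    { /* skip whitespace */ }
.          { /* ignore anything else */ }
%%
int main(void) {
    yylex();
    printf("total words: %d\n", words);
    return 0;
}
```

Saved as, say, `wc.l`, this would be built with `flex wc.l` followed by `gcc lex.yy.c -o wc`.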
The definitions section is structured as follows:
%{
declarations    /* ordinary variables, constants, #include of header files, global variables */
%}
%option directives
regular definitions

Example:
%{
/* This is a comment inside the definitions section */
#include <math.h>     // may need headers
#include <iostream>   // for cout
%}

 Regular definitions that can be used in the rules section have the
syntax: name definition
Example: DIGIT [0-9]
         ID    [a-z][a-z0-9]*
flex Rules (Translation Rules Section)
The form of the rules is:
P1 action1
P2 action2
...
Pn actionn

where the Pi are regular-expression patterns and the
actioni are C/C++ program segments (actions).
The actions are C/C++ code; if an action takes more
than one line, enclose it in braces: { action }.
In specifying patterns, flex supports a fairly rich set
of conveniences (REs): character classes, repetition, etc.
Example rules:

[a-z]+       cout << "found word: " << yytext << "\n";

[A-Z][a-z]*  { cout << "found capitalized word: ";
               cout << yytext << "\n";
             }
Rules: Most modern lexical-analyzer
generators follow 3 rules
• Look for the longest token
  The longest initial substring that can match any
  regular expression is taken as the next token.
• Rule priority: look for the first-listed pattern
  that matches the longest token
  – For keywords and identifiers, the keyword patterns must be
    written first, then the identifier pattern
• List frequently occurring patterns first
  – e.g., whitespace
User Subroutines Section
• You can use your flex routines in the same way you
use routines in other programming languages.

int main()
{
    yylex();
    return 0;
}

Example
[ \t\n]                 { /* no action and no return */ }
if                      { cout << "keyword found"; }
else                    { cout << "keyword found"; }
[A-Za-z_][A-Za-z0-9_]*  { cout << "ID found"; }
[0-9]+                  { cout << "integer found"; }
"<="                    { cout << "relop found"; }
"=="                    { cout << "relop found"; }
...
%%

option directive
1. Maintaining the line number:
flex can maintain the number of the current line in
the global variable yylineno using the following option
mechanism:
%option yylineno
2. Removing the call to the routine yywrap():
%option noyywrap
- yywrap() is called whenever flex reaches an end-of-file (EOF)
- this option tells flex not to call yywrap()
- flex then treats the first EOF as the end of all input (no more files)

Note: we write these options in the first (definitions) section.


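A definitions section using both options might look like this (a sketch; the "error"-reporting rule is illustrative):

```lex
%{
#include <stdio.h>
%}
%option yylineno    /* maintain the current line number in yylineno */
%option noyywrap    /* treat the first EOF as the end of all input */
%%
"error"    { printf("'error' found on line %d\n", yylineno); }
.|\n       { /* ignore everything else */ }
```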
Some flex Predefined Variables
• yytext -- a string containing the lexeme
• yyleng -- the length of the lexeme
• etc.
• E.g.
[a-z]+      cout << yytext;
[a-zA-Z]+   { words++; chars += yyleng; }

flex Library Routines
• yylex()
– The default main() contains a call to yylex()
• yywrap()
– is called whenever flex reaches an end-of-file (EOF)
– The default yywrap() always returns 1
• yymore()
– appends the next matched token to the current contents of yytext
• yyless(n)
– retains the first n characters of yytext and returns the rest to the input
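For example, yymore() can glue two consecutive matches into one lexeme. This fragment (a sketch adapted from the behavior described above; the patterns are illustrative) would print "mega-kludge" as a single token:

```lex
%%
mega-     { yymore(); /* keep "mega-" and append the next match to it */ }
kludge    { printf("%s\n", yytext); /* yytext is "mega-kludge" here */ }
```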

yylex()
• Most programs with flex scanners use the
scanner to return a stream of tokens that are
handled by a parser
• Each time the program needs a token, it calls
yylex(), which reads a little input and returns the
token
yywrap()
• Used to continue reading from another file
• It is called at EOF
• You can then open another file and return 0, or
• You can return 1, indicating this is the end, or
• You can use %option noyywrap to not include it
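A sketch of the first case: overriding yywrap() in the user subroutines section to switch to a second file once, then stop (next_input.txt is a hypothetical file name):

```lex
%%
.|\n    { /* tokens as usual */ }
%%
int yywrap(void) {          /* called by the scanner at EOF */
    static int done = 0;
    if (!done) {
        done = 1;
        yyin = fopen("next_input.txt", "r");  /* hypothetical second file */
        if (yyin) return 0;  /* 0: continue scanning from the new file */
    }
    return 1;                /* 1: no more input */
}
```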
Reading from a file
• flex reads its input from a global pointer to a C
FILE variable called yyin
• yyin is set to stdin by default
• So all we have to do is set that pointer to our file
handle:

FILE* myfile = fopen("filename", "r");
if (!myfile) {
    cout << "Error opening file" << endl;
    return -1;
}
yyin = myfile;

yylex();
How the input is matched

• When the generated scanner is run, it analyzes


its input looking for strings which match any of
its patterns
• The text corresponding to the match is made
available in the global character pointer yytext
and its length in the global integer yyleng
• The action corresponding to the matched
pattern is then executed and then the
remaining input is scanned for another match
How the input is matched…
• yytext can be defined in two different ways
• You can control which definition flex uses by including
the ‘%pointer’ or ‘%array’ directive in the definitions section of
the flex program
– As a character pointer (%pointer, the default):
• Faster scanning and no buffer overflow when matching very large tokens
• Calls to the unput() function destroy the present contents of yytext
– As a character array (%array):
• Size is YYLMAX, but you can modify it with #define YYLMAX
• Calls to unput() do not destroy yytext
