0% found this document useful (0 votes)

119 views19 pages

1 Introduction To LEX: Input - File.l

The document provides an introduction and overview of LEX, a tool used to generate lexical analyzers. It discusses the following key points: 1. LEX translates regular expression specifications into a C program (lex.yy.c) that implements a lexical analyzer as a finite state machine. This C program can then be compiled to generate an executable lexical analyzer. 2. A LEX program has three sections - declarations, rules, and auxiliary functions. The rules section specifies patterns to match and actions to perform. 3. LEX provides variables like yyin, yytext, and yyleng to the programmer to aid in lexical analyzer development. Yylex() is the main function that scans the input and

Uploaded by

Samba Shiva Reddy.g Shiva

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

119 views19 pages

1 Introduction To LEX: Input - File.l

Uploaded by

Samba Shiva Reddy.g Shiva

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 19

1 Introduction to LEX

LEX is a tool used to generate a lexical analyzer. This document is a tutorial for the
use of LEX for SIL Compiler development. Technically, LEX translates a set of
regular expression specifications (given as input in input_file.l) into a C
implementation of a corresponding finite state machine (lex.yy.c). This C program,
when compiled, yields an executable lexical analyzer.

input_file.l LEX lex.yy.c

lex.yy.c C compiler a.out

e
Input a.out Output

The source SIL program is fed as the input to the the lexical analyzer which produces
a sequence of tokens as output. (Tokens are explained below).

Conceptually, a lexical analyzer scans a given source SIL program and produces an
output of tokens. A token is a single element of the SIL programming language that
is recognized by the compiler. For instance integer, boolean, begin, end,
if, while etc. are tokens in SIL.

Example:

“integer” {return ID_TYPE_INTEGER;}

This example demonstrates the specification of rule in LEX. This rule in this
example specifies that the lexical analyzer must return the token named
ID_TYPE_INTEGER when the pattern “integer” is found in the input file. A rule
in a LEX program comprises of a pattern part (specified by a regular expression) and
a corresponding (semantic) action part (a sequence of C statements). In the above
example, “integer” is the pattern and {return ID_TYPE_INTEGER;} is the
corresponding action. The statements in the action part will be executed when the
pattern is detected in the input.

2 The structure of LEX programs

A LEX program consists of three sections : Declarations, Rules and Auxiliary

functions.

DECLARATIONS

RULES

AUXILIARY FUNCTIONS

2.1 Declarations

The declarations section consists of two parts, regular definitions and auxiliary
declarations. LEX allows the use of short-hands and extensions to regular
expressions for the regular definitions. The auxiliary declarations are copied as such
by LEX to the output lex.yy.c file.
Example:

%{
#include<stdio.h>
int global_variable; //Auxiliary declarations
%}
number [0-9]+ //Regular definitions
op [-|+|*|/|^|=]

%%
/* Rules */
%%
/* Auxiliary functions */

A regular definition in LEX is of the form :

D R
where D is the symbol representing the regular expression R. The auxiliary
declarations (which are optional) are written in C language and are enclosed within
' %{ ' and ' %} ' . It is generally used to declare functions, include header files, or
define global variables and constants.

2.2 Rules

Rules in a LEX program consists of two parts :

i. The pattern to be matched
ii. The corresponding action to be executed

Example:

/* Declarations*/
%%

{number} {printf(“ number”);}

{op} {printf(“ operator”);}

%%
/* Auxiliary functions */

The pattern to be matched is specified as a regular expression .

Sample Input/Output for the above example:

I: 234
O: number

I: *
O: operator

I: 2+3
O: number operator number

LEX obtains the regular expressions of the symbols number and op from the
declarations section and generates code into a function yylex() in the lex.yy.c file.
This function checks the input stream for the first match to one of the patterns
specified and executes code in the action part corresponding to the pattern.

2.3 Auxiliary functions

LEX generates C code for the rules specified in the Rules section and places this
code into a single function called yylex(). (To be discussed in detail later). In
addition to this LEX generated code, the programmer may wish to add his own code
to the lex.yy.c file. The auxiliary functions section allows the programmer to
achieve this.

Example:

/* Declarations */
%%
/* Rules */
%%

int main()
{
yylex();
return 1;
}

The C code in the auxiliary section and the declarations in the declaration section are
copied as such to the lex.yy.c file.

-------------------exercises to be added here ------------------

3 The yyvariables

The following variables are offered by LEX to aid the programmer in designing
sophisticated lexical analyzers. These variables are accessible in the LEX program
and are automatically declared by LEX in lex.yy.c.

➢ yyin
➢ yytext
➢ yyleng

3.1 yyin

yyin is a variable of the type FILE* and points to the input file. yyin is defined by
LEX automatically. If the programmer assigns an input file to yyin in the auxiliary
functions section, then yyin is set to point to that file. Otherwise LEX assigns yyin
to stdin(console input).

Example:

/* Declarations */
%%
/* Rules */
%%
main(int argc, char* argv[])
{
if(argc > 1)
{
FILE *fp = fopen(argv[1], “r”)
if(fp)
yyin = fp;
}
yylex();
return 1;
}
Exercise:

In the generated lex.yy.c, the following code segment can be found under the
definition of yylex().

if( ! yyin )
yyin = stdin;

Try to locate this code segment in the lex.yy.c. What could be the consequences of
removing this code segment from lex.yy.c before compiling it for generating the
lexical analyzer?

The above statement indicates that if the programmer does not define yyin, then
yylex() by default sets yyin to the console input. Hence, any re-definition for
yyin must be made before invoking yylex(). (This will be explained in detail
later).

3.2 yytext

yytext is of the type char* and it contains the lexeme currently found. A lexeme
is a sequence of characters in the input stream that matches some pattern in the Rules
Section. (In fact, it is the first matching sequence in the input from the position
pointed to by yyin.)

Each invocation of the function yylex() results in yytext carrying a pointer to the
lexeme found in the input stream by yylex(). The value of yytext will be
overwritten after the next yylex() invocation.

Example:

{%
#include<stdlib.h>
#include<stdio.h>
%}

number [0-9]+
%%

{number} {printf("Found : %d",atoi(yytext));}

int main()
{
yylex();
return 1;
}

In the above example, if a lexeme is found for the pattern defined by number then
corresponding action is executed . Consider the following sample i/o,

Sample Input/Output:

I: 25
O: Found : 25

In this case when yylex() is called, the input is read from the location given by
yyin and a string “25” is found as a match to number. This location of this string in
the memory is pointed to by yytext. The corresponding action in the above rule
uses a built-in function atoi() to convert the string “25” (of type char*) to the
integer 25 (of the type int) and then prints the result on the screen. Note that the
header file “stdlib.h” is called in the auxiliary declarations section in order to
invoke atoi() in the actions part of the rule.

Exercise:

Suggest a modification in the above example to check whether a number found is

even or odd.

3.3 yyleng

yyleng is a variable of the type int and it stores the length of the lexeme pointed to
by yytext.
Example:

/* Declarations */
%%
/* Rules */
%%

{number} printf("Number of digits = %d",yyleng);

Sample Input/Output:

I: 1234
O: Number of digits = 4

4 The yyfunctions

LEX provides the following functions to the programmer:

.
➢ yylex()
➢ yywrap()

4.1 yylex()

yylex() is a function of return type int. LEX automatically defines yylex() in

lex.yy.c but does not call it. The programmer must call yylex() in the Auxiliary
functions section of the LEX program. LEX generates code for the definition of
yylex() according to the rules specified in the Rules section.

Example 1:

/* Declarations */
%%

{number} {return atoi(yytext);}

int main()
{
int num = yylex();
printf(“Found: %d”,num);
return 1;
}

Sample Input/Output:

I: 42
O: Found: 42

When yylex() is invoked, it reads the input as pointed to by yyin and scans
through the input looking for a matching pattern. When the input or a part of the
input matches one of the given patterns, yylex() executes the corresponding action
associated with the pattern as specified in the Rules section. In the above example,
since there is no explicit definition of yyin, the input is taken from the console. If a
match is found in the input for the pattern number, yylex() executes the
corresponding action , i.e. return atoi(yytext). As a result yylex() returns
the number matched. The value returned by yylex() is stored in the variable num.
The value stored in this variable is then printed on screen using printf().

yylex() continues scanning through the input till one of the actions
corresponding to a matched pattern executes a return statement or till the end of
input (input end-marker ) has been encountered. In case of the above example,
yylex() terminates immediately after executing the rule because it consists of a
return statement. Note that if none of the actions in the Rules section executes a
return statement, yylex() would:
• (In case of an input file) continue scanning for more matching patterns till it
finds EOF in the file
• (In case of console input) wait for more input through the console

In the latter case, since there is no input end-marker, the user will have to input
ctrl+d in the terminal to indicate the end of input. As a result, yylex() will
terminate.

NOTE: If yylex() is called more than once, it simply starts scanning from the
position in the input file where it had ceased in the previous call.

Exercise:

What would be the outputs of the lexical analyzer generated by the example LEX
programs under section 3.2 and 4.1 for the following input :

25
32
44

Would both the outputs be the same ? If not, explain why.

4.2 yywrap()

LEX declares the function yywrap() of return type int in lex.yy.c . LEX does not
provide any definition for yywrap(). yylex() makes makes a call to yywrap()
when it encounters the end of an input. If yywrap() returns zero (indicating false)
yylex() assumes there is more input and it continues scanning from the location
pointed to by yyin. If yywrap() returns a non-zero value (indicating true),
yylex() terminates the scanning process and returns 0 (i.e. “wraps up”).

If the programmer wishes to scan more than one input file using the generated lexical
analyzer, it can be simply done by setting yyin to a new input file in yywrap()'s
definition which returns zero.

As LEX does not define yywrap() in lex.yy.c file but makes a call to it under yylex(),
the programmer must define it in the Auxiliary functions section or provide %option
noyywrap in the declarations section. This options removes the call to yywrap() in
the lex.yy.c file. Note that, it is mandatory to either define yywrap() or indicate the
absence using the %option feature. If not, LEX will flag an error.

Example :
/* Declarations */
%%
/* Rules */
%%

int yywrap()
{
FILE *newfile_pointer;
newfile_pointer = fopen(“input_file_2.l”,”r”);
if(yyin != newfile_pointer)
{
yyin = newfile_pointer;
return 0;
}
else
return 1;
}

int main()
{
yyin=fopen(“input_file.l”,”r”);
yylex();
return 1;
}

When yylex() finishes scanning the first input file, input_file.l yylex() invokes
yywrap(). The above definition of yywrap() sets the input file pointer to
input_file_2.l and returns 0 . As a result, the scanner continues scanning in
input_file_2.l . When yylex() calls yywrap() on encountering EOF of
input_file_2.l, yywrap() returns 0 and thus yylex() ceases scanning.

Exercise:

Suggest a modification in the above example LEX program to make the generated
lexical analyzer read input
• Initially from the console and then from a file input_file.l
• Initially from a file input_file.l and then from the console
• Twice from the console
5 Even-Odd.l , a complete LEX program
%{
/*
1.Request input of an even and an odd number
2.indicate input characteristic : Even/Odd [digit_length]
3.check for input's correctness and print result
*/

#include<stdlib.h>
#include<stdio.h>

int number_1;
int number_2;
%}

number_sequence [0-9]*

{number_sequence}[0|2|4|6|8] {
printf("Even number [%d]",yyleng);
return atoi(yytext);
}

{number_sequence}[1|3|5|7|9] {
printf("Odd number [%d]",yyleng);
return atoi(yytext);
}
%%

int yywrap
{
return 1;
}

int main()
{
printf("\nInput an even number and an odd number\n");
number_1 = yylex();
number_2 = yylex();
int diff = number_1 - number_2;
if(diff%2!=0)
printf("\nYour inputs were checked for correctness,
\nResult : Correct\n");
else
printf("\nYour inputs were checked for correctness,
\nResult : You do not know how to read\n");
return 1;
}

6 Disambiguation rules

yylex() uses two important disambiguation rules in selecting the right action to
execute in case there is more than one pattern that matches a string in the given
input:
➢ Order of occurrence is assigned as the pattern's matching priority.
➢ “Longest match” is preferred.
Example :

“break” { return BREAK; }

[a-zA-Z][a-zA-Z0-9]* { return IDENTIFIER; }

Here, break is matched by both the regular expressions, but break is a keyword
and not an identifier hence it is ordered in such a manner that LEX uses its first
disambiguation rule to execute return BREAK;

Example :

/* Declarations section */
%%

“-” {return MINUS;}

“--” {return DECREMENT;}

%%
/* Auxiliary functions */

Assume that the function calling yylex() prints the name of the token.

Sample Input/Output:

I: -
O: MINUS

I: --
O: DECREMENT
I: ---
O: DECREMENT MINUS

Note that, in case of an –- input to the lexical analyzer, yylex() does not return two
MINUS tokens, but instead returns a DECREMENT token, by the second disambiguation
rule.

7 Pattern matching using LEX

Conceptually, LEX constructs a finite state machine to recognize all the regular
expression patterns specified in the LEX program file. The code written by the
programmer in the action part is executed when the machine is in accept state. The
lex.yy.c program stores information about the finite state machine in the form of a
decision table (transition table).
A transition(current_state,input_char) function is used to access
the decision table. LEX makes it's decision table visible if we compile the program
with the -T flag. The finite state machine used by LEX is deterministic finite state
automation. The lex.yy.c simulates the DFA.

8 The Token simulator program

%{
/* Scan and return a token for identifiers of the format :
(string)(number)
Note : strings are not case sensitive
examples : a0 , A1 , ab2 , AB4 , aBc5
*/
#include<stdio.h>

#define ID 1 //Identifier token

#define ER 2 //Error token

%}
low_case [a-z]
upp_case [A-Z]
number [0-9]

%option noyywrap

%%
({low_case}|{upp_case})({low_case}|{upp_case})*({number})
return ID;
(.)* return ER;

%%
int main()
{
int token = yylex();
if(token==ID)
printf("Acceptable\n");
else if(token==ER)
printf("Unacceptable\n");
return 1;
}
In this program, the main() function obtains the tokens returned by yylex() and
checks if the input contains a valid identifier.

Sample Input/Output:

I: Var9
O: Acceptable

When Var9 is provided as the input, the DFA constructed by LEX accepts the string,
and the corresponding action return ID executed. As a result yylex() returns the
token ID, and the main() function prints Acceptable on the screen.

9 Construction of a DFA from a regular expression

The construction of a DFA from a regular expression takes place in two steps.
➢ Constructing a syntax tree from the regular expression
➢ Converting the syntax tree into a DFA

9.1 The intermediate syntax tree

Consider the first rule in the token simulator program in section 9. It consists of the
following regular expression :
({low_case}|{upp_case})({low_case}|{upp_case})*({number})
For convenience in representation , let it be represented by :
( a | A ) ( a | A )* (N)
where, 'a' represents {low_case}, 'A' represents {upp_case} and 'N' represents
{number}.
The syntax tree constructed for the above regular expression would look like :

º
/ \
º N₅
/ \
| *
/ \ |
a₁ A₂ |
/ \
a₃ A₄

In the above figure º represents the 'cat' (concatenation) operator, * represents the
'star' operator (a unary operator) and | represents the 'or' operator. In the syntax tree
the inner nodes are operators while the leaves are the operands. The subscript
assigned to every leaf is called the position of the leaf. The position of a leaf plays a
vital role in the process of constructing states for the DFA.

NOTE:
This syntax tree is an intermediate data structure. There will be no traces of this in
lex.yy.c file, because it is only used in the construction of the DFA.

9.2 Constructing the DFA

Constructing the DFA involves two steps:
➢ Constructing the set of states of the DFA
➢ Constructing all the possible transitions made by the DFA from one state to
another on different inputs.

The language represented by the regular expression ( a | A ) ( a | A )* (N), can only

possibly start with an 'a' or 'A'. From the syntax tree we may infer that these could
only correspond to positions 1 or 2 . Let the set of these positions {1,2} be the start
state of the DFA. For convenience it has been named as state I.

Consider the position 1 ('a'), it could be followed by either of the positions 3,4
or 5. Let this be a new state {3,4,5} represented by II. The position 2 ('A') could be
possibly followed by either of the positions 3,4 or 5. But a new state is not required
as {3,4,5} has already been represented by II. Similarly the positions 3 and 4 could
be followed by the position 3 or 4 or 5. If followed by 5, the DFA must accept and
terminate (syntax tree ends at position 5). Hence let the final (accept) state be III.
Thus, the transitions maybe formulated as :

Current state -> Input symbol -> Next state

I a II

I A II

II a II

II A II

II N III

9.3 The constructed DFA

The DFA obtained for the above syntax tree would look like :
a|A

a|A N
I II III

This DFA represents the regular expression provided as a specification (i.e. pattern to
be matched) in the first rule of the token simulator program in section 9. When the
DFA is in the final state i.e. III, then the corresponding action is executed as
instructed in the lex.yy.c file. The constructed DFA is simulated using a simulation
algorithm.

10 The DFA simulation algorithm

The working of the constructed DFA is simulated using the following algorithm.

DFA_simulator()
current_state = start_state
c = get_next_char()
while(c != EOF)
current_sate = transition(current_state , c)
c = get_next_char()
if(current_state ∈ Final_states)
/*ACCEPT*/
else
/*REJECT*/

The information about all the transitions made by the DFA can be obtained from the
decision table (generally a two dimensional matrix) through the transition()
function.

11 Using the generated lexical analyzer

The generated lex.yy.c file is the code for the scanner. This is compiled using a C
compiler and the generated object file (the executable scanner) is executed. LEX is
just a tool which helps generate a smart scanner in C.
The lexical analyzer only checks for recognizable pattern sequences in the
source program and produces an output of tokens (fed to the parser) , it cannot be
used to check for syntax. Syntax analysis is done by the parser. The parser is built
using YACC , a parser generator. YACC obtains the tokens from LEX through
yylex() and the associated attributes from yylval, which it uses for parsing and rest
of the the compilation process. In this case, the programmer need not call yylex() in
the LEX program, it is automatically called by YACC to read the tokens. The
parser , when given an input source program obtains all the tokens from the lexical
analyzer and checks for correctness of the syntax. Hence, the next step towards
compiling a given source program would be to build a parser.

12 References

For further details on the topics covered in this document, the reader may refer to the
following :
 Compilers : Principles,Techniques and Tools by Alfred V.Aho, Monica S.
Lam, Ravi Sethi and Jeffrey D.Ulman .
 Modern Compiler Implementation in C by Andrew W.Appel
 Flex & Bison by John Levine
 http://dinosaur.compilertools.net/

SS Lab Manual
No ratings yet
SS Lab Manual
38 pages
Compiler Desing-Final ppt2
No ratings yet
Compiler Desing-Final ppt2
194 pages
Lex Programming Lab
No ratings yet
Lex Programming Lab
9 pages
Lexical Analysis & Lex Tool
No ratings yet
Lexical Analysis & Lex Tool
17 pages
SS & OS Final Lab Manual
No ratings yet
SS & OS Final Lab Manual
46 pages
Lab Manual2021 Regulation
No ratings yet
Lab Manual2021 Regulation
28 pages
Lab Manual - Compiler Lab CSL411 Manual 2022
No ratings yet
Lab Manual - Compiler Lab CSL411 Manual 2022
114 pages
CD Lab RECORD PRINT' 1
No ratings yet
CD Lab RECORD PRINT' 1
91 pages
Lex Yacc Tutorial
No ratings yet
Lex Yacc Tutorial
38 pages
CDLabmanual
No ratings yet
CDLabmanual
40 pages
Introduction To Lex
No ratings yet
Introduction To Lex
18 pages
LEX and YACC
No ratings yet
LEX and YACC
3 pages
Lex Material 1
No ratings yet
Lex Material 1
37 pages
Lex PDF
No ratings yet
Lex PDF
20 pages
SS Manual GEC 18CSL66
No ratings yet
SS Manual GEC 18CSL66
49 pages
LP IV Compiler Manual
No ratings yet
LP IV Compiler Manual
26 pages
Lex - A Lexical Analyzer Ge...
No ratings yet
Lex - A Lexical Analyzer Ge...
19 pages
Introduction To LEX
No ratings yet
Introduction To LEX
9 pages
9536 Exp5 Merged
No ratings yet
9536 Exp5 Merged
18 pages
Lecture 07 PDF
No ratings yet
Lecture 07 PDF
8 pages
Compiler Design Lab KCS552
No ratings yet
Compiler Design Lab KCS552
82 pages
Lex Yaac
No ratings yet
Lex Yaac
24 pages
Compiler Lab Manual
No ratings yet
Compiler Lab Manual
32 pages
Lab Session
No ratings yet
Lab Session
27 pages
CD LexProgram
No ratings yet
CD LexProgram
11 pages
Lex Yacc
No ratings yet
Lex Yacc
22 pages
Compiler Design Manual
No ratings yet
Compiler Design Manual
69 pages
Yaac and Lex
No ratings yet
Yaac and Lex
13 pages
The Function of Lex Is As Follows
No ratings yet
The Function of Lex Is As Follows
3 pages
Compiler 56
No ratings yet
Compiler 56
39 pages
System Programming Lab: LEX: Lexical Analyser Generator
No ratings yet
System Programming Lab: LEX: Lexical Analyser Generator
33 pages
Lab Manual
No ratings yet
Lab Manual
23 pages
Lab 05 - Lexical Analysis (Part 3) : Lab Objectives: Upon Successful Completion of This Topic, You Will Be Able To
No ratings yet
Lab 05 - Lexical Analysis (Part 3) : Lab Objectives: Upon Successful Completion of This Topic, You Will Be Able To
11 pages
Flex/Le X: Javeria Akram (276) Ifra Zahid
No ratings yet
Flex/Le X: Javeria Akram (276) Ifra Zahid
21 pages
Compiler Design Lab (CSP358) : Practical No. 1 (LEX)
No ratings yet
Compiler Design Lab (CSP358) : Practical No. 1 (LEX)
16 pages
Lex-Yacc For Exam
100% (1)
Lex-Yacc For Exam
17 pages
Lex Tool
No ratings yet
Lex Tool
7 pages
Theory:: Aim: Implement A Lexical Analyzer For A Subset of C Using LEX Implementation Should Support Error Handling
No ratings yet
Theory:: Aim: Implement A Lexical Analyzer For A Subset of C Using LEX Implementation Should Support Error Handling
5 pages
Unit I Introduction To Compilers: Lex - The Lexical-Analyzer Generator
No ratings yet
Unit I Introduction To Compilers: Lex - The Lexical-Analyzer Generator
19 pages
CD Lab
No ratings yet
CD Lab
26 pages
SPCC Exp7
No ratings yet
SPCC Exp7
8 pages
Lex and Yacc
No ratings yet
Lex and Yacc
8 pages
Assignment No.: Assignment To Understand Basic Syntax of LEX Specifications, Built-In Functions and Variables
No ratings yet
Assignment No.: Assignment To Understand Basic Syntax of LEX Specifications, Built-In Functions and Variables
32 pages
Lexnyacc
No ratings yet
Lexnyacc
15 pages
A3 47 Mushan Khan Practical1
No ratings yet
A3 47 Mushan Khan Practical1
13 pages
Lex
No ratings yet
Lex
4 pages
System Programming (2150708) : Topic: Implementation of Lexical Analyser Using LEX Utility Tool in UNIX
No ratings yet
System Programming (2150708) : Topic: Implementation of Lexical Analyser Using LEX Utility Tool in UNIX
5 pages
Lec5 LEX Lexical Analyzer Generator
No ratings yet
Lec5 LEX Lexical Analyzer Generator
12 pages
System Programming (BTHU-301A) : Bachelor of Technology
No ratings yet
System Programming (BTHU-301A) : Bachelor of Technology
22 pages
Lex1 Lab Manual TE Computer SPPU
No ratings yet
Lex1 Lab Manual TE Computer SPPU
6 pages
Compiler Construction
No ratings yet
Compiler Construction
5 pages
Lex Yacc
No ratings yet
Lex Yacc
17 pages
Study of Lex
No ratings yet
Study of Lex
3 pages
Compiler Design Practical List
No ratings yet
Compiler Design Practical List
5 pages
Byt BZB 1 8 Tsfi Eng PDF
No ratings yet
Byt BZB 1 8 Tsfi Eng PDF
51 pages
Machine Learning (15Cs73) : Text Book Tom M. Mitchell, Machine Learning, India Edition 2013, Mcgraw Hill
No ratings yet
Machine Learning (15Cs73) : Text Book Tom M. Mitchell, Machine Learning, India Edition 2013, Mcgraw Hill
78 pages
SCM Module1 Questions and Answers 1
No ratings yet
SCM Module1 Questions and Answers 1
11 pages
ASTM A249 Stainless Steel Tubes
No ratings yet
ASTM A249 Stainless Steel Tubes
10 pages
Machine Learning Module-3
No ratings yet
Machine Learning Module-3
23 pages
Technology-Plan BPP Frelyn
100% (2)
Technology-Plan BPP Frelyn
4 pages
D8.1M 2007PV PDF
No ratings yet
D8.1M 2007PV PDF
5 pages
Let's Create Local and Global Ads: Activity 2: Home Business! Lead in
100% (1)
Let's Create Local and Global Ads: Activity 2: Home Business! Lead in
3 pages
Prepare Installer Lesson
No ratings yet
Prepare Installer Lesson
25 pages
Skyjack 4740 Parts Manual
No ratings yet
Skyjack 4740 Parts Manual
148 pages
SCM Module2 Questions and Answers
100% (1)
SCM Module2 Questions and Answers
12 pages
Lab Manual Digital Marketing: Mr. Prince Vohra
No ratings yet
Lab Manual Digital Marketing: Mr. Prince Vohra
32 pages
M-Duino 21+Arduino-PLC
No ratings yet
M-Duino 21+Arduino-PLC
3 pages
Course Plan - Linux Lab
No ratings yet
Course Plan - Linux Lab
12 pages
A Queueing Model With Server Breakdowns Repairs Va
No ratings yet
A Queueing Model With Server Breakdowns Repairs Va
13 pages
Module Iv
0% (1)
Module Iv
3 pages
Usm Thesis Format Ips
100% (3)
Usm Thesis Format Ips
6 pages
Web-Question Bank
No ratings yet
Web-Question Bank
7 pages
Web-Module I
No ratings yet
Web-Module I
35 pages
JNCIS-SP Certification - Juniper Networks US
No ratings yet
JNCIS-SP Certification - Juniper Networks US
6 pages
T9 Assembly Modeling
No ratings yet
T9 Assembly Modeling
15 pages
Module V
No ratings yet
Module V
6 pages
Major Final Project - BW
No ratings yet
Major Final Project - BW
80 pages
Iot 220112132928
No ratings yet
Iot 220112132928
31 pages
Unit 1
No ratings yet
Unit 1
23 pages
Remote Servicing Suite Security Manual Version A
No ratings yet
Remote Servicing Suite Security Manual Version A
45 pages
Ns2-Vw00-p0uyq-174226 Vehicle Repair Shop Side Elevation Rev.0int1
No ratings yet
Ns2-Vw00-p0uyq-174226 Vehicle Repair Shop Side Elevation Rev.0int1
1 page
ASME B31 1 2016 Power Piping ASME Code For Pressure Piping B31 1st Edition Asme - Get The Ebook in PDF Format For A Complete Experience
No ratings yet
ASME B31 1 2016 Power Piping ASME Code For Pressure Piping B31 1st Edition Asme - Get The Ebook in PDF Format For A Complete Experience
41 pages
Module-2: Introduction, Lexical Analysis: Syllabus
No ratings yet
Module-2: Introduction, Lexical Analysis: Syllabus
28 pages
K039-Pic-Checklist For JCB
No ratings yet
K039-Pic-Checklist For JCB
1 page
ABHISHEK
No ratings yet
ABHISHEK
3 pages
Dda
No ratings yet
Dda
8 pages
59 Pages Aptitude Questions and Answers
No ratings yet
59 Pages Aptitude Questions and Answers
59 pages
3.1 Usage of Ajax and Json
No ratings yet
3.1 Usage of Ajax and Json
18 pages
Web-Module II - Chapter1
No ratings yet
Web-Module II - Chapter1
16 pages
ALL Boolean Algebra
No ratings yet
ALL Boolean Algebra
25 pages
EQUIP9-Operations-Use Case Challenge
No ratings yet
EQUIP9-Operations-Use Case Challenge
6 pages
SCM Module 4 New Questions and Answers
No ratings yet
SCM Module 4 New Questions and Answers
8 pages
Selective Repeat ARQ Protocol
No ratings yet
Selective Repeat ARQ Protocol
16 pages
Web-Module IV
No ratings yet
Web-Module IV
38 pages
Module Iii
No ratings yet
Module Iii
5 pages
FIRE FIGHTING TANK - MEP-Model
No ratings yet
FIRE FIGHTING TANK - MEP-Model
1 page
ThinkCentre M75q Gen 2 Spec
No ratings yet
ThinkCentre M75q Gen 2 Spec
9 pages
KOM-MICS, A "Tsunagaruka" System For Production Sites: Technical Paper
No ratings yet
KOM-MICS, A "Tsunagaruka" System For Production Sites: Technical Paper
6 pages
Valet Parking Management System - Requirement Specification - V1
No ratings yet
Valet Parking Management System - Requirement Specification - V1
14 pages
3.1 Critical Thinking Rubric
No ratings yet
3.1 Critical Thinking Rubric
1 page

1 Introduction To LEX: Input - File.l

Uploaded by

1 Introduction To LEX: Input - File.l

Uploaded by

1 Introduction to LEX

input_file.l LEX lex.yy.c

lex.yy.c C compiler a.out

“integer” {return ID_TYPE_INTEGER;}

2 The structure of LEX programs

A LEX program consists of three sections : Declarations, Rules and Auxiliary

A regular definition in LEX is of the form :

Rules in a LEX program consists of two parts :

{number} {printf(“ number”);}

The pattern to be matched is specified as a regular expression .

2.3 Auxiliary functions

-------------------exercises to be added here ------------------

{number} {printf("Found : %d",atoi(yytext));}

Suggest a modification in the above example to check whether a number found is

{number} printf("Number of digits = %d",yyleng);

LEX provides the following functions to the programmer:

yylex() is a function of return type int. LEX automatically defines yylex() in

{number} {return atoi(yytext);}

Would both the outputs be the same ? If not, explain why.

“break” { return BREAK; }

“-” {return MINUS;}

7 Pattern matching using LEX

8 The Token simulator program

#define ID 1 //Identifier token

9 Construction of a DFA from a regular expression

9.1 The intermediate syntax tree

9.2 Constructing the DFA

The language represented by the regular expression ( a | A ) ( a | A )* (N), can only

Current state -> Input symbol -> Next state

9.3 The constructed DFA

10 The DFA simulation algorithm

11 Using the generated lexical analyzer

You might also like