Lex Tool
Lex is a tool that reads a specification file (typically with the .l extension), containing
regular expressions, and generates C code that implements a lexical analyzer. The
lexical analyzer scans the input text, recognizing patterns and converting it into a
sequence of tokens.
A Lex program consists of three parts, separated by %% delimiters:
Declarations
%%
Translation rules
%%
Auxiliary procedures
Definitions Section: This section contains user-defined macros, global variables, and any
header files the analyzer needs. It is where you define constants, include libraries, and
name regular expressions that will be reused in the rules section.
Rules Section: Placed between the two %% delimiters, the rules section defines regular
expressions and the actions associated with them.
Each rule consists of a regular expression pattern and an action: the pattern specifies
the strings to be matched, and the action, usually written in C code, specifies the
operation to perform when the pattern is matched.
User Code Section: Placed after the second %% delimiter, this section contains C code
that is copied into the generated C file, typically the main() function and any other
necessary initialization or cleanup code. The function yylex() (generated by Lex) is
called in this section to start the lexical analysis.
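The "Explanation of the Example" below refers to an example file that does not appear in the text. A minimal sketch consistent with the bullets that follow might look like this (the exact messages printed by the actions are illustrative):

```lex
%{
#include <stdio.h>   /* needed by printf in the actions */
%}

%%
[0-9]+      { printf("Number: %s\n", yytext); }
[ \t\n]+    { /* whitespace: matched and ignored */ }
[a-zA-Z]+   { printf("Identifier: %s\n", yytext); }
"+"         { printf("Operator: +\n"); }
"="         { printf("Assignment: =\n"); }
%%

int main() {
    yylex();         /* start scanning the input */
    return 0;
}

int yywrap() { return 1; }
```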
Explanation of the Example:
● %{ ... %}: Code enclosed between these markers is copied directly into the generated C
file. Here, we include the stdio.h header.
● %%: Delimiters that separate the different sections of the Lex file.
● Pattern-Action Pairs:
○ [0-9]+: Matches one or more digits. When matched, it prints the number.
○ [ \t\n]+: Matches whitespace and ignores it.
○ [a-zA-Z]+: Matches identifiers (alphabetic strings).
○ "+" and "=": Match the + operator and = assignment, respectively,
printing the corresponding output.
● int main(): The main function calls yylex(), which is the generated function to
start scanning the input.
Processing of the Lex File
Once you write the Lex specification file, you can feed it into the Lex tool. The Lex tool
then processes the file and performs the following steps:
1. Lexical Analysis:
○ Lex reads the specification file and generates a C source file (lex.yy.c) that
implements a lexical analyzer.
○ The Lex tool internally creates a finite automaton for the regular
expressions and embeds this automaton in the generated C code.
2. Finite Automaton (DFA/NFA):
○ NFA Construction: Internally, Lex converts each regular expression into a
Non-deterministic Finite Automaton (NFA).
○ DFA Construction: Then, it converts the NFA into a Deterministic Finite
Automaton (DFA), which is used for efficient pattern matching. Lex
optimizes the DFA to improve scanning performance.
3. Generating C Code:
○ Lex generates a C source file (lex.yy.c) that contains the code for the
lexical analyzer.
○ This file includes:
■ yylex(): A function that performs the scanning of the input stream
and matches patterns against the defined regular expressions.
■ yytext: A global variable that holds the current text matched by a
pattern.
■ yyin and yyout: Input and output files, respectively, which are used
by Lex for reading and writing data.
■ State Machine: The core of the generated code, which implements
the DFA/NFA.
4. Compiling the Generated Code:
○ After generating the lex.yy.c file, the next step is to compile it into an
executable. This can be done using a C compiler.
Example Command:
lex example.l # Generate lex.yy.c
gcc lex.yy.c -o lexer -lfl # Compile and link with the Flex library
5. Running the Lexer:
○ Once compiled, the generated lexical analyzer can be executed, which
reads the input, matches the patterns, and executes the associated
actions (e.g., printing tokens, recognizing keywords, etc.).
Example Command:
./lexer < input.txt # Runs the lexer on input.txt
Key Predefined Variables in Lex
1. yytext
● Purpose: Contains the text of the current token matched by the regular
expression.
● Type: char* (a pointer to a string).
● Usage:
○ You can access yytext in the action part of a rule to refer to the
string that was matched.
○ It is automatically updated after each successful pattern match.
Example: [a-zA-Z]+ { printf("Identifier: %s\n", yytext); }
2. yyleng
● Purpose: Stores the length of the string in yytext.
● Type: int.
● Usage:
○ Indicates the number of characters matched by the regular
expression.
○ Useful for validating the length of tokens.
Example: [a-zA-Z]+ { printf("Identifier (%d characters): %s\n", yyleng, yytext); }
3. yylineno
● Purpose: Tracks the current line number in the input file being scanned.
● Type: int.
● Usage:
○ Helps in error reporting and debugging by providing the line number
where a token is found.
○ Line tracking is not necessarily on by default; in Flex you enable it with
%option yylineno in the definitions section.
Example: . { printf("Unrecognized character '%s' at line %d\n", yytext, yylineno); }
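A minimal specification fragment that enables line tracking in Flex and reports unrecognized characters might look like this (a sketch):

```lex
%option yylineno
%%
.   { fprintf(stderr, "Unrecognized character '%s' at line %d\n", yytext, yylineno); }
```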
4. yyin
● Purpose: Points to the input file being scanned.
● Type: FILE*.
● Default Value: stdin.
● Usage:
○ By default, Lex reads from the standard input. You can assign yyin
to another file pointer to scan from a file.
Example: yyin = fopen("input.txt", "r"); // Redirect input to "input.txt"
5. yyout
● Purpose: Points to the output file for token processing.
● Type: FILE*.
● Default Value: stdout.
● Usage:
○ You can redirect output to a specific file by changing the value of
yyout.
Example: yyout = fopen("output.txt", "w"); // Redirect output to "output.txt"
6. yywrap()
● Purpose: Determines what happens when the end of the input is reached.
● Type: Function.
● Default Behavior: Returns 1, signaling the end of input.
● Usage:
○ If you want to provide additional input after reaching the end of a
file, you can override yywrap() to return 0 and continue scanning.
Example: int yywrap() {
return 1; // Indicate end of input
}
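To continue scanning from a second file instead of stopping, yywrap() can be overridden along these lines. This is a sketch intended for the user code section of a Lex specification (the filename second_input.txt is hypothetical):

```c
int yywrap() {
    static int switched = 0;
    if (!switched) {
        FILE *next = fopen("second_input.txt", "r");  /* hypothetical second file */
        if (next) {
            yyin = next;    /* redirect the scanner to the new file */
            switched = 1;
            return 0;       /* 0 = more input available, keep scanning */
        }
    }
    return 1;               /* 1 = no more input, stop */
}
```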
7. YYSTATE
● Purpose: Represents the scanner's current start condition; in Flex, YYSTATE is an
alias for YY_START, kept for AT&T lex compatibility.
● Type: int.
● Usage:
○ Used in conjunction with start conditions to manage lexical analysis
in different contexts.
○ You can explicitly set or check the scanner's state.
Example:
%x COMMENT
%%
"/*" { BEGIN(COMMENT); }
<COMMENT>"*/" { BEGIN(INITIAL); }
8. YY_START
● Purpose: Gives the scanner's current start condition.
● Type: Integer-valued macro.
● Usage:
○ Represents the current state in terms of start conditions.
○ Can be used in actions to check or set the scanner’s state.
Example:
%x STRING
%%
"\"" { printf("Entering STRING mode\n"); BEGIN(STRING); }
<STRING>. { printf("STRING content: %s\n", yytext); }
<STRING>"\"" { printf("Exiting STRING mode\n"); BEGIN(INITIAL); }
9. YY_USER_ACTION
● Purpose: Allows you to insert user-defined code that will execute before
any action for a matched token.
● Type: Macro.
● Usage:
○ Can be used for debugging or tracking purposes.
○ Typically defined in the definitions section.
Example:
%{
#define YY_USER_ACTION printf("Matched token: %s\n", yytext);
%}
%%
[a-zA-Z]+ { /* Your action here */ }
10. YY_FATAL_ERROR()
● Purpose: Handles fatal errors during lexical analysis.
● Type: Macro (in Flex it can be redefined by the user).
● Usage:
○ You can redefine this macro to customize error handling in case of
scanner failures.
Example (in the definitions section):
%{
#define YY_FATAL_ERROR(msg) \
    do { fprintf(stderr, "Fatal Error: %s\n", msg); exit(1); } while (0)
%}