0% found this document useful (0 votes)

32 views31 pages

Lexical Analyzer

Uploaded by

most.maharin.khan.mithi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

32 views31 pages

Lexical Analyzer

Uploaded by

most.maharin.khan.mithi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 31

Lexical Analyzer

Using Lex
Lexical Analysis
• First phase of a Compiler
• Also called Scanning
• Scans the character stream of the Source
program
• Groups them into meaningful sequences
– Output: A sequence of token
Role of Lexical Analyzer
Identify Tokens
Remove Whitespace
Install lexme in symbol table
Returns token to parser

Token To semantic
Lexical analysis
Source
Parser
Program Analyzer

getNextToken

Symbol
Table
Lexical Analyzer

Performs those functions

Tokens
Source Program A Program in any
language

do { DO
do Install in Symbol Table
Study; ID
Study
}while (t_CGPA<
3.90)
Lexical Analyzer

• No need to write the code

• Tools that produce the analyzer
– Lex
Lex Tool

Lex source Lex lex.yy.c

lex.l Compiler

C a.out
lex.yy.c Compiler

Source a.out Tokens

program
Lex Tool
Token, Pattern, Lexeme
• Token: Set of strings that represent a
particular construct in source language.
• Pattern: Rules that describe that string set
– It match each string in the set

• Lexeme: sequence of characters that is

matched by a pattern for a token
Example
Token Sample Lexemes Pattern Description

WHILE while while

RELOP <, <=, >, >=, <>, == < or <= or > or >= or
<> or ==
ID count, account, flag2 letter followed by letters
and digits

C comment /* hubi jabi/* aro habi jabi / anything between / and

NUM 3.14, 3.2E+5, 5.9E-2 sequence of digits

having fraction and
exponent
Structure of Lex Programs
%{ #include<stdio.h>
// anything here is directly copied to lex.yy.c
int Word_count;
%}
Declarations
// regular definitions

%% // token matching & actions

Transition rules
%%
// any other functions
auxiliary functions/ User Subroutines
Transition rules
• Pattern { Action }

Regular expressions C code

to to
Match the token Do the functions
Regular Expressions
• Specifies a set of strings to match
• One expression for each token pattern
• Some expression
– [ \t\n] //for delimiter
– [ \t\n]+ // for white space
– a(b)* //a followed by zero or more occurrence of b
//a, ab, abb, abbb
Actions
• Specify what to do if a rule matches a token
• Basically C code
• Examples
%%
[a-zA-z] {
printf(“I found a letter”);
}
[0-9] {
printf(“I found a digit”)
}
[ \t\n] {
// actually I do nothing
}
%%
Structure of Lex Programs
%{
#include<stdio.h>

%} int Word_count;

Declarations // regular definitions

[0-9] {
printf(“I found a digit”);
}
%%
auxiliary functions // any other functions
regular definitions
• Give symbolic name to regular expressions
• ( declaration )
• Examples

delim [ \t\n]
ws {delim}+
digit [0-9]
number {digit}+
Complete Lex Source
%{
#include<stdio.h>
int word_count = 0;
%}
delim [ \t\n]
digit [0-9]
%%
{delim}+ { } //no action
{digit}+ { printf(“Here I found a digit”);
word_count++ }
%%
Printf(“Total Count: %d”,word_count);
Assignment
• Write a lexical analyzer for a C program that-
– Ignore white space
– Match all identifiers (keywords, variables etc )
• Insert variables in symbol table
• No need to insert keywords just show it in console

– Match all numbers and insert it in symbol table

– Find all comments
– Find all double quoted strings
– Count line numbers
Assignment
– Variables start with a letter or underscore (_)
• Ex : a, a9bc, _abc but not 8cde.
– Numbers may contain optional fraction or
exponent
• Ex: 3, 3.056, 3.45E5, 3.45E-2, 3E+2
– comments starts with // and ends with a
newline
– Relational operators are =, <>, < , <=, >=, >
• Insert the lexeme in symbol table and print the token RELOP
Assignment
– Addition operators are + - or
– Multiplier operators are * / div mod and
• Insert lexeme and print token ADMULOP
– Other tokens to match
• := // (assignment operator, token name is ASSIGNOP)
• [ , ] , ( , ) , .. // token name is DOTDOT
• ,
• ;,:
Assignment
• Keywords to match
– program
– if
– not
– end Print the corresponding token name and line no of occurring
– begin
– else
– then Token name for parser is keyword name with capital letter
– do
– while
– function
– Procedure
– integer
– real
– var
– oh
– array
– write
Compilation code
• flex example.l // lex source
• g++ lex.yy.c – o example –ll //object file
• ./example <file.txt> <target.txt>
// file.txt contains the source program

ServerAdmin v10.6
No ratings yet
ServerAdmin v10.6
197 pages
Campaign Checck Your Vocbulary For Military English
100% (7)
Campaign Checck Your Vocbulary For Military English
66 pages
Pmbok: Rajani Nair, Ramesh B, Upendra Bapat Group 3
33% (3)
Pmbok: Rajani Nair, Ramesh B, Upendra Bapat Group 3
31 pages
You Must Be Mad!: Warbirds RPG Mad Science Sourcebook
100% (2)
You Must Be Mad!: Warbirds RPG Mad Science Sourcebook
55 pages
Module 1 Rhyming Words (For Reading On-The-Air) (Final)
No ratings yet
Module 1 Rhyming Words (For Reading On-The-Air) (Final)
12 pages
Tribhuvan University Institute of Engineering Pulchwok Central Campus Pulchwok, Lalitpur
No ratings yet
Tribhuvan University Institute of Engineering Pulchwok Central Campus Pulchwok, Lalitpur
13 pages
5 6089131777291453670
100% (1)
5 6089131777291453670
70 pages
Compiler Desing-Final ppt2
No ratings yet
Compiler Desing-Final ppt2
194 pages
GMAT - 2018.PDF Version 1
No ratings yet
GMAT - 2018.PDF Version 1
21 pages
Pursue Lesson 1
No ratings yet
Pursue Lesson 1
10 pages
Compiler Lab Manual Final E-Content
75% (16)
Compiler Lab Manual Final E-Content
55 pages
Lactic Acid
No ratings yet
Lactic Acid
5 pages
CH 2 - Lexical Analysis
No ratings yet
CH 2 - Lexical Analysis
36 pages
NY2B21
No ratings yet
NY2B21
8 pages
Unit 2 Lexical Analyzer
No ratings yet
Unit 2 Lexical Analyzer
63 pages
Atitude of Fast-Food Worker
No ratings yet
Atitude of Fast-Food Worker
8 pages
Loading XL Sheet
No ratings yet
Loading XL Sheet
9 pages
Leica TCS SP5 II System Overview - EN
No ratings yet
Leica TCS SP5 II System Overview - EN
20 pages
Chapter 2 - Lexical Analyser
No ratings yet
Chapter 2 - Lexical Analyser
38 pages
Flex
No ratings yet
Flex
36 pages
02 Lexical Analysis
No ratings yet
02 Lexical Analysis
86 pages
Advisory: Region11.Davaodelsur@Tesda - Gov.Ph, Ftbarretejr@Tesda - Gov.Ph. Dz4Oxerkpthbyig-Kddmfjhdt4Iefefkhy/Edit#Gid 0
No ratings yet
Advisory: Region11.Davaodelsur@Tesda - Gov.Ph, Ftbarretejr@Tesda - Gov.Ph. Dz4Oxerkpthbyig-Kddmfjhdt4Iefefkhy/Edit#Gid 0
2 pages
The Function of Lex Is As Follows
No ratings yet
The Function of Lex Is As Follows
3 pages
Chapter 2 - Lexical Analyser
No ratings yet
Chapter 2 - Lexical Analyser
39 pages
A Tricky Joint Probability Density Problem - John Petrie's LifeBlag
No ratings yet
A Tricky Joint Probability Density Problem - John Petrie's LifeBlag
3 pages
Compiler Design Lab
No ratings yet
Compiler Design Lab
68 pages
For Placement
No ratings yet
For Placement
7 pages
A Study On Business Market Research On Croma To Release Their Own Products
No ratings yet
A Study On Business Market Research On Croma To Release Their Own Products
3 pages
Implementation of Symbol Table Using Flex On Unix Environment
No ratings yet
Implementation of Symbol Table Using Flex On Unix Environment
19 pages
Chapter 2 - Lexical Analyser
No ratings yet
Chapter 2 - Lexical Analyser
40 pages
Chapter 2 - Lexical Analysis
No ratings yet
Chapter 2 - Lexical Analysis
56 pages
Chapter 3 - Lexical Analysis and Lexical Analyzer Generators
No ratings yet
Chapter 3 - Lexical Analysis and Lexical Analyzer Generators
52 pages
2-Lexical Analysis
No ratings yet
2-Lexical Analysis
52 pages
CD - Ch.1
No ratings yet
CD - Ch.1
28 pages
ATCD Mod 3
No ratings yet
ATCD Mod 3
46 pages
CD Cse Record
No ratings yet
CD Cse Record
76 pages
Unit 1 (B)
No ratings yet
Unit 1 (B)
69 pages
Lexical Analyzer: Using Flex by Dr. S. M. Farhad
No ratings yet
Lexical Analyzer: Using Flex by Dr. S. M. Farhad
22 pages
Chapter 1
No ratings yet
Chapter 1
28 pages
Compiler Construction CS-4207: Lecture 4-5 Instructor Name: Atif Ishaq
100% (1)
Compiler Construction CS-4207: Lecture 4-5 Instructor Name: Atif Ishaq
37 pages
Basics of Share Allotement
No ratings yet
Basics of Share Allotement
3 pages
Compiler Design Lexical Analysis
No ratings yet
Compiler Design Lexical Analysis
24 pages
Chapter 2 - Lexical Analysis
No ratings yet
Chapter 2 - Lexical Analysis
74 pages
System Programming & Compiler Design Lab Manual
No ratings yet
System Programming & Compiler Design Lab Manual
41 pages
Lecture 07 PDF
No ratings yet
Lecture 07 PDF
8 pages
Chapter2-Lexical Analysis
No ratings yet
Chapter2-Lexical Analysis
64 pages
Compiler Design (CD) : Lab Assignment 1
No ratings yet
Compiler Design (CD) : Lab Assignment 1
36 pages
Embroidery Stitches
No ratings yet
Embroidery Stitches
16 pages
Pdf&rendition 1
No ratings yet
Pdf&rendition 1
14 pages
Compiler Construction: Department of Computer Science
No ratings yet
Compiler Construction: Department of Computer Science
17 pages
Lexical Analysis 2
No ratings yet
Lexical Analysis 2
24 pages
Unit 8
No ratings yet
Unit 8
62 pages
Chapter 2 - Lexical Analysis - Regular Expressions
No ratings yet
Chapter 2 - Lexical Analysis - Regular Expressions
27 pages
4-Intro To Flex and Bison-09!09!2024
No ratings yet
4-Intro To Flex and Bison-09!09!2024
28 pages
Chapter 3 Lexical Analysis
No ratings yet
Chapter 3 Lexical Analysis
5 pages
Daa 2
No ratings yet
Daa 2
4 pages
LP IV Compiler Manual
No ratings yet
LP IV Compiler Manual
26 pages
2 - Lexical Analysis
No ratings yet
2 - Lexical Analysis
52 pages
CD - Ch.1
No ratings yet
CD - Ch.1
28 pages
Ielts Listening Pretest
No ratings yet
Ielts Listening Pretest
5 pages
Day 2 - Lexial Analyzer
No ratings yet
Day 2 - Lexial Analyzer
37 pages
Cs3501 Compiler Design Lab Manual
No ratings yet
Cs3501 Compiler Design Lab Manual
54 pages
Vasilka
No ratings yet
Vasilka
4 pages
Chapter 2
No ratings yet
Chapter 2
36 pages
cs3501 Compiler Design Lab Manual
No ratings yet
cs3501 Compiler Design Lab Manual
56 pages
Lecture 2.76
No ratings yet
Lecture 2.76
31 pages
Chapter 2
No ratings yet
Chapter 2
41 pages
Studentsco: Computer Science
No ratings yet
Studentsco: Computer Science
6 pages
Chapter 2 - Lexical Analysis
No ratings yet
Chapter 2 - Lexical Analysis
10 pages
2 Lexing
No ratings yet
2 Lexing
73 pages
Lecture 3 - Lexical Analysis
No ratings yet
Lecture 3 - Lexical Analysis
42 pages
CD Lab Manual
No ratings yet
CD Lab Manual
52 pages
Chapter 2
No ratings yet
Chapter 2
77 pages
Chapter 2 Lexical Analysis
No ratings yet
Chapter 2 Lexical Analysis
14 pages
Lecture 2 10022025 035804pm
No ratings yet
Lecture 2 10022025 035804pm
27 pages
SPCC Exp7
No ratings yet
SPCC Exp7
8 pages
Jicnyaal Gnundeng
No ratings yet
Jicnyaal Gnundeng
65 pages
2024 CD-Ch02 Lexical Analysis
No ratings yet
2024 CD-Ch02 Lexical Analysis
25 pages
Ug II New Sem 2024 Time Table
No ratings yet
Ug II New Sem 2024 Time Table
4 pages
Unit I Introduction To Compilers: Lex - The Lexical-Analyzer Generator
No ratings yet
Unit I Introduction To Compilers: Lex - The Lexical-Analyzer Generator
19 pages
1 - Scanning Slides Sanyal Part1
No ratings yet
1 - Scanning Slides Sanyal Part1
22 pages
Compiler - Lexical Analyzer-2
No ratings yet
Compiler - Lexical Analyzer-2
16 pages
CD Chapter 1
No ratings yet
CD Chapter 1
28 pages
Compiler Design Lab KCS552
No ratings yet
Compiler Design Lab KCS552
82 pages
Tour de Samos 2025 Results Overall
No ratings yet
Tour de Samos 2025 Results Overall
1 page
Corrosion of Aluminium 2nd Edition Christian Vargel Instant Download
100% (1)
Corrosion of Aluminium 2nd Edition Christian Vargel Instant Download
62 pages
WHKF DWH Instructions
No ratings yet
WHKF DWH Instructions
11 pages
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)
Dive Into Sea of C
From Everand
Dive Into Sea of C
M Ashok
No ratings yet

Lexical Analyzer

Uploaded by

Lexical Analyzer

Uploaded by

Lexical Analyzer

Performs those functions

• No need to write the code

Lex source Lex lex.yy.c

Source a.out Tokens

• Lexeme: sequence of characters that is

WHILE while while

C comment /* hubi jabi/* aro habi jabi */ anything between /* and

NUM 3.14, 3.2E+5, 5.9E-2 sequence of digits

%% // token matching & actions

Regular expressions C code

Declarations // regular definitions

– Match all numbers and insert it in symbol table

You might also like

C comment /* hubi jabi/* aro habi jabi / anything between / and