0% found this document useful (0 votes)

63 views31 pages

CD Mini Project Lexical Analyzer

The document is a mini project report for a lexical analyzer for the C language. It was submitted by three students and guided by Dr. Kowsigan. The aim was to design a lexical analyzer for C using lex that takes C code as input and outputs a stream of tokens along with their type. The output also includes a symbol table containing the tokens and types. The lexical analyzer was implemented to recognize various tokens in C like keywords, identifiers, constants, operators, comments, and more. It displays the token, token type and line number to help debug errors.

Uploaded by

someoneswati123

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

63 views31 pages

CD Mini Project Lexical Analyzer

Uploaded by

someoneswati123

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 31

SRM INSTITUTE OF SCIENCE AND TECHNOLOGY

Kattankulathur, Chengalpattu District - 603203

18CSC304J/ COMPLIER DESIGN

MINI PROJECT REPORT

LEXICAL ANALYZER FOR C LANGUAGE

Gudied by:
DR.KOWSIGAN

Submitted By:
IKLASH KHAN(RA2011003011391)
SWATI ANAND(RA2011003011383)
SIDDHARTH SINGH(RA2011003011395)
AIM: LEXICAL ANALYZER FOR C LANGUAGE

ABSTRACT:-
A compiler is a special program that processes statements written in
a particular programming language and turns them into machine language
or code that a computer's processors use. The file used for writing a C-
language contains what are called the source statements. The programmer
then runs the appropriate language compiler, specifying the name of the
file that contains the source statements. When executing, the compiler first
parses all of the language statements syntactically one after the other and
then, in one or more successive stages, builds the output code, making sure
that statements that refer to other statements are referred to correctly in
the final code. The output of the compilation is called object code or
sometimes an object module. Lexical analysis is the first phase of a
compiler. It takes the modified source code from language preprocessors
that are written in the form of sentences. The lexical analyzer breaks these
syntaxes into a series of tokens, by removing any whitespace or comments in
the source code. Symbol table is an important data structure created and
maintained by compilers in order to store information about the
occurrence of various entities such as variable names, function names, etc.
Symbol table is used by both the analysis and the synthesis parts of a
compiler. We have designed a lexical analyzer for the C language using lex.
It takes as input a C code and outputs a stream of tokens. The tokens
displayed as part of the output include keywords, identifiers,
signed/unsigned integer/floating point constants, operators, special
characters, headers, data-type specifiers, array, single-line comment, multi-
line comment, preprocessor directive, pre-defined functions (printf and
scanf), user-defined functions and the main function. The token, the type of
token and the line number of the token in the C code are being displayed.
The line number is displayed so that it is easier to debug the code for
errors. Errors in single-line comments, multi-line comments are displayed
along with line numbers. The output also contains the symbol table which
contains tokens and their type. The symbol table is generated using the hash
organisation.
REQUIREMENT TO RUNS THE SCRIPT:
COMPILER DESIGN PHASES
A compiler is a special program that processes statements written in a
particular programming language and turns them into machine
language or code that a computer's processors use. A compiler can
broadly be divided into two phases based on the way theycompile.

1. Analysis phase: Known as the front-end of the compiler, the

analysis phase of the compiler reads the source program, divides it
into core parts and then checks for lexical, grammar and syntax
errors.The analysis phase generates an intermediate representation
of the source program and symbol table. This phase consists of:

➢ Lexical Analysis
➢ Syntax Analysis
➢ Semantic Analysis
➢ Intermediate Code Generation

2. Synthesis phase: Known as the back-end of the compiler, the

synthesis phase generates the target program with the help of
intermediate source code representation and symbol table. This
phase consists of:

➢ Code Optimization
➢ Code Generator
Lexical Analysis
Lexical analysis is the first phase of a compiler. It takes the modified source
code from language preprocessors that are written in the form of sentences.
The lexical analyzer breaks these syntaxes into a series of tokens, by
removing any whitespace or comments in the sourcecode.
If the lexical analyzer finds a token invalid, it generates an error. The
lexical analyzer works closely with the syntax analyzer. It reads character
streams from the source code, checks for legal tokens, and passes the data
to the syntax analyzer when it demands.
Syntax Analysis
Syntax analysis or parsing is the second phase of a compiler. It takes the
token produced by lexical analysis as input and generates a parse tree (or
syntax tree).

Semantic Analysis
Semantic analysis is the third phase of a compiler. Semantic analyzer
checks whether the parse tree constructed by the syntax analyzer follows
the rules of language.

Intermediate Code generation

After semantic analysis the compiler generates an intermediate code of
the source code for the target machine. It represents a program for some
abstract machine. It is in between the high-level language and the machine
language. This intermediate code should be generated in such a way that it
makes it easier to be translated into the target machine code.

Code Optimization
In this phase, code optimization of the intermediate code is done.
Optimization can be assumed as something that removes unnecessary code
lines, and arranges the sequence of statements in order to speed up the
program execution without wasting resources (CPU, memory).

Code Generation
In this phase, the code generator takes the optimized representation of the
intermediate code and maps it to the target machine language.
THE LEXICAL ANALYSIS:
In computer science, lexical analysis is the process of converting a sequence
of characters (such as in a computer program or web page) into a sequence
of tokens (strings with an identified "meaning"). A program that performs
lexical analysis may be called a lexer, tokenizer, or scanner (though
"scanner" is also used to refer to the first stage of a lexer). Such a lexer is
generally combined with a parser, which together analyze the syntax of
programming languages, web pages, and so forth.
The script written by us is a computer program called the “lex” program,
is the one that generates lexical analyzers ("scanners" or "lexers"). Lex
reads an input stream specifying the lexical analyzer and outputs source
code implementing the lexer in the C programming language.
The structure of the lex program consists of three sections:
{definition section}
%%
{rules section}
%%
{C code section}
The definition section defines macros and imports header files written in C.
It is also possible to write any C code here, which will be copied verbatim
into the generated source file.
The rules section associates regular expression patterns with C statements.
When the lexer sees text in the input matching a given pattern, it will
execute the associated C code.
The C code section contains C statements and functions that are copied
verbatim to generated source file. These statements presumably contain
code called by the rules in the rules section. In large programs it is more
convenient to place this code in a separate file linked in at compile time.
The lex program, when compiled using the lex command, generates a file
called lex.yy.c, which when executed recognizes the tokens present in the
input C program.
Lexical analysis only takes care of parsing the tokens and identifying their
type. The output of this phase is the stream of tokens as well as the symbol
table representing the tokens and their type.
Code:

%{
int lineno = 1;

#include<stdio.
h>
#include<stdlib.
h>
#include<string.
h>

#define AUTO 1
#define BREAK 2
#define CASE 3
#define CHAR 4
#define CONST 5
#define CONTINUE 6
#define DEFAULT 7
#define DO 8
#define DOUBLE 9
#define ELSE 10
#define ENUM 11
#define EXTERN 12
#define FLOAT 13
#define FOR 14
#define GOTO 15
#define IF 16
#define INT 17
#define LONG 18
#define REGISTER 19
#define RETURN 20
#define SHORT 21
#define SIGNED 22
#define SIZEOF 23
#define STATIC 24
#define STRUCT 25
#define SWITCH 26
#define TYPEDEF 27
#define UNION 28
#define UNSIGNED 29
#define VOID 30
#define VOLATILE 31
#define WHILE 32

#define IDENTIFIER 33
#define SLC 34
#define MLCS 35
#define MLCE 36

#define LEQ 37
#define GEQ 38
#define EQEQ 39
#define NEQ 40
#define LOR 41
#define LAND 42
#define ASSIGN 43
#define PLUS 44
#define SUB 45
#define MULT 46
#define DIV 47
#define MOD 48
#define LESSER 49
#define GREATER 50
#define INCR 51
#define DECR 52
#define SEMI 54

#define HEADER 55
#define MAIN 56

#define PRINTF 57
#define SCANF 58
#define DEFINE 59

#define INT_CONST 60
#define FLOAT_CONST 61

#define TYPE_SPEC 62

#define DQ 63

#define OBO 64
#define OBC 65
#define CBO 66
#define CBC 67
#define HASH 68

#define ARR 69
#define FUNC 70
#define NUM_ERR 71
#define UNKNOWN 72

#define CHAR_CONST 73
#define STRING_CONST 75
%}

alpha [A-
Za-z]digit
[0-9] und
[_]
space [
] tab [ ]
line
[\n]
char
\'.\' at
[@]
string \"(.^([%d]|[%f]|[%s]|[%c]))\"

%%
{space}* {}
{tab}* {}

{string} return STRING_CONST;

{char} return CHAR_CONST;
{line} {lineno++;}
auto return
AUTO; break
return BREAK;
case return
CASE; char

return CHAR;
const return
CONST;
continue return
CONTINUE;default
return DEFAULT;
do return DO;
double return
DOUBLE;else
return ELSE;

enum return
ENUM; extern
return EXTERN;
float return
FLOAT; for return
FOR;
goto return
GOTO;if return
IF;
int return INT;
long return
LONG;
register return
REGISTER;return
return RETURN;
short return SHORT;
signed return
SIGNED; sizeof
return SIZEOF; static
return STATIC; struct
return STRUCT;
switch return
SWITCH; typedef
return TYPEDEF;
union return UNION;
unsigned return
UNSIGNED;void return
VOID;

volatile return
VOLATILE;while
return WHILE;

printf return
PRINTF;scanf
return SCANF;

{alpha}({alpha}|{digit}|{und})* return IDENTIFIER;

[+-][0-9]{digit}*(\.{digit}+)? return SIGNED_CONST;

"//" return
SLC; "/*"
return MLCS;
"*/" return
MLCE;

"<=" return
LEQ; ">="
return GEQ;
"==" return
EQEQ; "!="
return NEQ;
"||" return
LOR; "&&"
return LAND;
"=" return
ASSIGN;"+"
return PLUS;
"-" return SUB;
"*" return
MULT; "/"
return DIV;
"%" return
MOD; "<"
return LESSER;
">" return
GREATER;"++"
return INCR;
"--" return DECR;
"," return COMMA;";" return SEMI;

"#include<stdio.h>" return
HEADER; "#include <stdio.h>"
return HEADER; "main()"
return MAIN;

{digit}+ return INT_CONST;

({digit}+)\.({digit}+) return
FLOAT_CONST;

"%d"|"%f"|"%u"|"%s" return
TYPE_SPEC;"\"" return DQ;
"(" return OBO;
")" return OBC;
"{" return CBO;
"}" return
CBC; "#"
return HASH;

{alpha}({alpha}|{digit}|{und})\[{digit}\] return ARR;

{alpha}({alpha}|{digit}|{und})*\(({alpha}|{digit}|{und}|{space})*\
) return FUNC;({digit}+)\.({digit}+)\.({digit}|\.)* return NUM_ERR;
({digit}|{at})+({alpha}|{digit}|{und}|{at})* return UNKNOWN;
%%

struct node
{

char token[100];
char
attr[100];struct
node *next;
};

struct hash
{

struct node

struct node * createNode(char token, char attr)

{

struct node *newnode;

newnode = (struct node *)
malloc(sizeof(struct node));strcpy(newnode-
>token, token);
strcpy(newnode->attr,
attr);newnode->next =
NULL; return newnode;
}

int hashIndex(char *token)

{

int
hi=0;
int l,i;
for(i=0;token[i]!='\0';i++)
{

hi = hi + (int)token[i];
}
hi = hi%eleCount;return hi;
}

void insertToHash(char token, char attr){

int
flag=0;
int hi;
hi = hashIndex(token);
struct node *newnode = createNode(token, attr);
/* head of list for the bucket with index "hashIndex" */
struct node
*myNode;int i,j,
k=1;
printf(" -------------------------------------------------------");
printf("\nSNo \t|\tToken \t\t|\tToken Type \t\n");
printf("------------------------------------------------------- \n");
for (i = 0; i < eleCount; i++)
{

if (hashTable[i].count
== 0)continue;
myNode =
hashTable[i].head;if
(!myNode)
continue;
while (myNode != NULL)
{
}
}
return;
}
printf("%d\t\t", k++); printf("%s\t\t\t",
myNode->token);printf("%s\t\n", myNode-
>attr); myNode = myNode->next;

if (hashTable[hi].head==NULL)
{

hashTable[hi].head =
newnode;
hashTable[hi].count = 1;
return;
}

struct node *myNode;

myNode =
hashTable[hi].head;while
(myNode != NULL)
{

if (strcmp(myNode->token, token)==0)
{

flag =
1;
break;
}

myNode = myNode->next;
}
if(!flag)
{
//adding new node to the list
newnode->next = (hashTable[hi].head);
//update the head of the list and no of nodes in the
current buckethashTable[hi].head = newnode;
hashTable[hi].count++;
}

return;
}

void display()
{

int scan, slcline=0, mlc=0, mlcline=0, dq=0,

dqline=0;yyin = fopen("isPrime.c","r");
printf("\n\
n"); scan =
yylex();
while(scan)

*head;int count;
};

struct hash
hashTable[1000];int
eleCount = 1000;

{
if(lineno == slcline)
{

scan =
yylex();
continue;
}

if(lineno!=dqline && dqline!=0)

{

if(dq%2!=0)
printf("\n******** ERROR!! INCOMPLETE STRING at Line
%d
********\n\n", dqline);
dq=0;

}
if((scan>=1 && scan<=32) && mlc==0)
{

printf("%s\t\t\tKEYWORD\t\t\t\tLine %d\n",
yytext, lineno);insertToHash(yytext,
"KEYWORD");
}

if(scan==33 && mlc==0)

{

printf("%s\t\t\tIDENTIFIER\t\t\tLine %d\n",
yytext, lineno);insertToHash(yytext,
"IDENTIFIER");
}

if(scan==34)
{

printf("%s\t\t\tSingleline Comment\t\tLine %d\n",

yytext, lineno);slcline = lineno;
}

if(scan==35 && mlc==0)

{

printf("%s\t\t\tMultiline Comment Start\t\tLine %d\n", yytext, lineno);

mlcline = lineno;
mlc = 1;
}

if(scan==36 && mlc==0)

{

printf("\n******** ERROR!! UNMATCHED MULTILINE

COMMENT END %s atLine %d ********\n\n", yytext, lineno);
}
if(scan==36 && mlc==1)
{

mlc = 0;
printf("%s\t\t\tMultiline Comment End\t\tLine %d\n", yytext,
lineno);
}

if((scan>=37 && scan<=52) && mlc==0)

{

printf("%s\t\t\tOPERATOR\t\t\tLine %d\n",
yytext, lineno);insertToHash(yytext,
"OPERATOR");
}

if((scan==53||scan==54||scan==63||(scan>=64 && scan<=68)) &&

mlc==0)
{

printf("%s\t\t\tSPECIAL SYMBOL\t\t\tLine %d\n",

yytext, lineno);if(scan==63)
{

dq++;
dqline = lineno;
}

insertToHash(yytext, "SPECIAL SYMBOL");

}

if(scan==55 && mlc==0)

{
printf("%s\tHEADER\t\t\t\tLine %d\n",yytext, lineno);
}

if(scan==56 && mlc==0)

{

printf("%s\t\t\tMAIN FUNCTION\t\t\tLine %d\n",

yytext, lineno);insertToHash(yytext, "IDENTIFIER");
}

if((scan==57 || scan==58) && mlc==0)

{

printf("%s\t\t\tPRE DEFINED FUNCTION\t\tLine %d\n",

yytext, lineno);insertToHash(yytext, "PRE DEFINED
FUNCTION");
}
if(scan==59 && mlc==0)
{

printf("%s\t\t\tPRE PROCESSOR DIRECTIVE\t\tLine %d\n", yytext,

lineno);
}

if(scan==60 && mlc==0)

{

printf("%s\t\t\tINTEGER CONSTANT\t\tLine %d\n",

yytext, lineno);insertToHash(yytext, "INTEGER
CONSTANT");
}

if(scan==61 && mlc==0)

{

printf("%s\t\t\tFLOATING POINT CONSTANT\t\tLine %d\n",

yytext, lineno);insertToHash(yytext, "FLOATING POINT
CONSTANT");
}

if(scan==62 && mlc==0)

{

printf("%s\t\t\tTYPE SPECIFIER\t\t\tLine %d\n", yytext, lineno);

}
if(scan==69 && mlc==0)
{

printf("%s\t\t\tARRAY\t\t\t\tLine %d\n",
yytext, lineno);insertToHash(yytext, "ARRAY");
}

if(scan==70 && mlc==0)

{

printf("%s\t\t\tUSER DEFINED FUNCTION\t\tLine %d\n",

yytext, lineno);insertToHash(yytext, "USER DEFINED
FUNCTION");
}

if(scan==71 && mlc==0)

{

printf("\n******** ERROR!! CONSTANT ERROR %s at Line %d

********\n\n", yytext, lineno);
}
if(scan==72 && mlc==0)
{

printf("\n******** ERROR!! UNKNOWN TOKEN %s

at Line %d
********\n\n", yytext, lineno);
}
if(scan==73 && mlc==0)
{

printf("%s\t\t\tCHARACTER CONSTANT\t\t\tLine %d\n",

yytext, lineno);insertToHash(yytext, "CHARACTER
CONSTANT");
}

if(scan==74 && mlc==0)

{

printf("%s\t\t\tSIGNED CONSTANT\t\t\tLine %d\n",

yytext, lineno);insertToHash(yytext, "SIGNED
CONSTANT");

}
if(scan==75 && mlc==0)
{

printf("%s\t\t\tSTRING CONSTANT\t\t\tLine %d\n",

yytext, lineno);insertToHash(yytext, "STRING
CONSTANT");
}

scan = yylex();
}

if(mlc==1)
printf("\n******** ERROR!! UNMATCHED COMMENT STARTING at Line
%d
********\n\n",mlcli
ne);printf("\n");
printf("\n\t******** SYMBOL TABLE
********\t\t\n");display();
printf("------------------------------------------------------- \n\n");

}
int yywrap()
{

return 1;
Output:
Input file (isPrime.c)
#include<stdio
.h>int main()
{

int a,i,flag=0;
printf("Input
no");
scanf("%d",&a)
; i=2;
while(i <= a/2)
{

if(a%i == 0)
{

flag=
1;
} break
i++ ;
;
}

if(flag==0)
printf("%d Prime", a);

return 0;
}
OUTPUT:
SYMBOL TABLE:
RESULT:

The task of the lexical program is to read characters one by one from the
program and analyse the character stream to distinguish words in the
program. The word refers to the set of characters which have compact logical
relationship and have collective meaning called token. This is the process is
tokenization. To adapt to the multi-core environment the process of
tokenization has to be parallelised. This is achieved by exploiting the parallel
constructs of the languages.

Compiler Design Unit 1 SRM 21 Regulation
100% (1)
Compiler Design Unit 1 SRM 21 Regulation
193 pages
Compiler Design Module
100% (1)
Compiler Design Module
120 pages
PCC All Units QuestionBank
No ratings yet
PCC All Units QuestionBank
121 pages
Acd 2.1
No ratings yet
Acd 2.1
20 pages
Acd Unit 4
No ratings yet
Acd Unit 4
52 pages
Linkedin Posts 2024 Blue
No ratings yet
Linkedin Posts 2024 Blue
368 pages
Unit 1
No ratings yet
Unit 1
109 pages
CT - Lecture 2
No ratings yet
CT - Lecture 2
23 pages
Compiler Design Note1
No ratings yet
Compiler Design Note1
111 pages
Smart Mini Compilar
No ratings yet
Smart Mini Compilar
40 pages
Chapter-1 Compiler Design
100% (1)
Chapter-1 Compiler Design
13 pages
Compiler Design
No ratings yet
Compiler Design
47 pages
Compiler Design
No ratings yet
Compiler Design
23 pages
Compiler Construction: Lecture 1 - An Overview
No ratings yet
Compiler Construction: Lecture 1 - An Overview
30 pages
Gate Compiler Design
No ratings yet
Gate Compiler Design
72 pages
Compiler Design Slide Chapter 1-6
No ratings yet
Compiler Design Slide Chapter 1-6
250 pages
Compiler Design
No ratings yet
Compiler Design
117 pages
Project Report
No ratings yet
Project Report
26 pages
System Programming Unit-2 by Arun Pratap Singh
100% (1)
System Programming Unit-2 by Arun Pratap Singh
82 pages
Muhammad Hamza BSCS-E3-22-23 Compiler
No ratings yet
Muhammad Hamza BSCS-E3-22-23 Compiler
11 pages
CD Laqs
No ratings yet
CD Laqs
29 pages
Unit 5 SP
No ratings yet
Unit 5 SP
28 pages
Cat 1
No ratings yet
Cat 1
150 pages
Automata Theory and Compiler Design
No ratings yet
Automata Theory and Compiler Design
55 pages
CD UNIT-1
No ratings yet
CD UNIT-1
60 pages
Compiler Constructer
No ratings yet
Compiler Constructer
17 pages
Compiler Design Unit-1
No ratings yet
Compiler Design Unit-1
25 pages
Compiler Design Quick Guide
No ratings yet
Compiler Design Quick Guide
45 pages
Unit 1
No ratings yet
Unit 1
9 pages
Compiler Notes
No ratings yet
Compiler Notes
68 pages
Mini Compiler: Submitted By: Tejash Niroula 16bce2292
No ratings yet
Mini Compiler: Submitted By: Tejash Niroula 16bce2292
14 pages
Introduction Compiler
No ratings yet
Introduction Compiler
47 pages
Compiler Unit1
No ratings yet
Compiler Unit1
23 pages
Unit 1 Compiler Design
No ratings yet
Unit 1 Compiler Design
70 pages
Introduction To Compiler Design-Unit I
No ratings yet
Introduction To Compiler Design-Unit I
30 pages
Language Translator
No ratings yet
Language Translator
71 pages
CSE353 Slides
No ratings yet
CSE353 Slides
76 pages
Compiler CH1
No ratings yet
Compiler CH1
24 pages
Compilers and Translators Assignment
No ratings yet
Compilers and Translators Assignment
3 pages
Unit 1
No ratings yet
Unit 1
50 pages
Compiler
No ratings yet
Compiler
17 pages
CH 1
No ratings yet
CH 1
23 pages
Cousins of Compiler
100% (1)
Cousins of Compiler
25 pages
CD Notes
No ratings yet
CD Notes
69 pages
Unit 1 Slides
No ratings yet
Unit 1 Slides
49 pages
Unit 1
No ratings yet
Unit 1
37 pages
Quick Book of Compiler
100% (1)
Quick Book of Compiler
66 pages
Compiler Design Chapter-1
No ratings yet
Compiler Design Chapter-1
41 pages
Control Statements in R
No ratings yet
Control Statements in R
34 pages
Unit I SRM
100% (1)
Unit I SRM
36 pages
Module - I: Introduction To Compiling: 1.1 Introduction of Language Processing System
No ratings yet
Module - I: Introduction To Compiling: 1.1 Introduction of Language Processing System
7 pages
Compiler Construction Notes
No ratings yet
Compiler Construction Notes
61 pages
CD Unit - 1 Lms Notes
No ratings yet
CD Unit - 1 Lms Notes
58 pages
Introduction To Compilation
No ratings yet
Introduction To Compilation
33 pages
1 Lexial Analysis
No ratings yet
1 Lexial Analysis
24 pages
Lexical Analyser Parser
No ratings yet
Lexical Analyser Parser
37 pages
Language Translation: Programming Tools
No ratings yet
Language Translation: Programming Tools
7 pages
Unit 1
No ratings yet
Unit 1
29 pages
CS5 Expire Fix - Readme - Amtlib - DLL
0% (1)
CS5 Expire Fix - Readme - Amtlib - DLL
1 page
Unit 1
No ratings yet
Unit 1
29 pages
Yet Another Insignificant Programming Notes
60% (5)
Yet Another Insignificant Programming Notes
5 pages
MAD - Micro-Project (1) )
No ratings yet
MAD - Micro-Project (1) )
23 pages
Innovus addEndCap
No ratings yet
Innovus addEndCap
2 pages
Bitlocker Windows 10
No ratings yet
Bitlocker Windows 10
24 pages
UM352P-1 r5
No ratings yet
UM352P-1 r5
307 pages
CSC 311: Intro. To Computer Organization and Architecture
No ratings yet
CSC 311: Intro. To Computer Organization and Architecture
25 pages
Internship Report
No ratings yet
Internship Report
42 pages
11i Uploading A Journal Using Web ADI - Step by Step - Oracle Apps Epicenter
No ratings yet
11i Uploading A Journal Using Web ADI - Step by Step - Oracle Apps Epicenter
12 pages
Introduction To MASM
100% (1)
Introduction To MASM
24 pages
Configuring Any Connect VPN Client Connections
No ratings yet
Configuring Any Connect VPN Client Connections
18 pages
Intel Architecture: 2.1. Brief History of The Ia-32 Architecture
No ratings yet
Intel Architecture: 2.1. Brief History of The Ia-32 Architecture
19 pages
Aim: Code:: Program 1
No ratings yet
Aim: Code:: Program 1
33 pages
SeaChest Format
No ratings yet
SeaChest Format
43 pages
Us 10089238
No ratings yet
Us 10089238
43 pages
DP Sound Creative 14060 Drivers
No ratings yet
DP Sound Creative 14060 Drivers
85 pages
MTS/Sintech Model 10/GL Specifications:: Frank Bacon Machinery Sales Co. 21251 Ryan Road Warren, MI 48091 586-756-4280
No ratings yet
MTS/Sintech Model 10/GL Specifications:: Frank Bacon Machinery Sales Co. 21251 Ryan Road Warren, MI 48091 586-756-4280
1 page
LECTURE 2 Operating Systems Functions
No ratings yet
LECTURE 2 Operating Systems Functions
25 pages
14 Must-Know Linux Commands
No ratings yet
14 Must-Know Linux Commands
26 pages
A Presentation On: RIDEX - Ruggedised Field Exchange
No ratings yet
A Presentation On: RIDEX - Ruggedised Field Exchange
21 pages
Geo Measure
No ratings yet
Geo Measure
10 pages
Instant Messages
No ratings yet
Instant Messages
7 pages
Covariant Return Type
No ratings yet
Covariant Return Type
2 pages
Using Backup4all - FAQ 8 - Backup4all
No ratings yet
Using Backup4all - FAQ 8 - Backup4all
11 pages
CISP Appa
No ratings yet
CISP Appa
4 pages
Name: Sandoval, Jane B. Section: BSOA 2 Integrated Software Application 1. Mainframe OS
No ratings yet
Name: Sandoval, Jane B. Section: BSOA 2 Integrated Software Application 1. Mainframe OS
5 pages
Root File System
No ratings yet
Root File System
2 pages
Preferences
No ratings yet
Preferences
2 pages
The 1 Page Python Book
From Everand
The 1 Page Python Book
Barani Kumar
2/5 (1)
Dive Into Sea of C
From Everand
Dive Into Sea of C
M Ashok
No ratings yet
Python Programming Concepts
From Everand
Python Programming Concepts
MRB
No ratings yet

CD Mini Project Lexical Analyzer

Uploaded by

CD Mini Project Lexical Analyzer

Uploaded by

SRM INSTITUTE OF SCIENCE AND TECHNOLOGY

Kattankulathur, Chengalpattu District - 603203

18CSC304J/ COMPLIER DESIGN

LEXICAL ANALYZER FOR C LANGUAGE

1. Analysis phase: Known as the front-end of the compiler, the

2. Synthesis phase: Known as the back-end of the compiler, the

Intermediate Code generation

{string} return STRING_CONST;

{alpha}({alpha}|{digit}|{und})* return IDENTIFIER;

[+-][0-9]{digit}*(\.{digit}+)? return SIGNED_CONST;

{digit}+ return INT_CONST;

{alpha}({alpha}|{digit}|{und})*\[{digit}*\] return ARR;

struct node * createNode(char *token, char *attr)

struct node *newnode;

int hashIndex(char *token)

void insertToHash(char *token, char *attr){

struct node *myNode;

int scan, slcline=0, mlc=0, mlcline=0, dq=0,

if(lineno!=dqline && dqline!=0)

if(scan==33 && mlc==0)

printf("%s\t\t\tSingleline Comment\t\tLine %d\n",

if(scan==35 && mlc==0)

printf("%s\t\t\tMultiline Comment Start\t\tLine %d\n", yytext, lineno);

if(scan==36 && mlc==0)

printf("\n******** ERROR!! UNMATCHED MULTILINE

if((scan>=37 && scan<=52) && mlc==0)

if((scan==53||scan==54||scan==63||(scan>=64 && scan<=68)) &&

printf("%s\t\t\tSPECIAL SYMBOL\t\t\tLine %d\n",

insertToHash(yytext, "SPECIAL SYMBOL");

if(scan==55 && mlc==0)

if(scan==56 && mlc==0)

printf("%s\t\t\tMAIN FUNCTION\t\t\tLine %d\n",

if((scan==57 || scan==58) && mlc==0)

printf("%s\t\t\tPRE DEFINED FUNCTION\t\tLine %d\n",

printf("%s\t\t\tPRE PROCESSOR DIRECTIVE\t\tLine %d\n", yytext,

if(scan==60 && mlc==0)

printf("%s\t\t\tINTEGER CONSTANT\t\tLine %d\n",

if(scan==61 && mlc==0)

printf("%s\t\t\tFLOATING POINT CONSTANT\t\tLine %d\n",

if(scan==62 && mlc==0)

printf("%s\t\t\tTYPE SPECIFIER\t\t\tLine %d\n", yytext, lineno);

if(scan==70 && mlc==0)

printf("%s\t\t\tUSER DEFINED FUNCTION\t\tLine %d\n",

if(scan==71 && mlc==0)

printf("\n******** ERROR!! CONSTANT ERROR %s at Line %d

printf("\n******** ERROR!! UNKNOWN TOKEN %s

printf("%s\t\t\tCHARACTER CONSTANT\t\t\tLine %d\n",

if(scan==74 && mlc==0)

printf("%s\t\t\tSIGNED CONSTANT\t\t\tLine %d\n",

printf("%s\t\t\tSTRING CONSTANT\t\t\tLine %d\n",

You might also like

{alpha}({alpha}|{digit}|{und})\[{digit}\] return ARR;

struct node * createNode(char token, char attr)

void insertToHash(char token, char attr){