Lex & Yacc

by
H. Altay Güvenir

A compiler or an interpreter performs its task in 3 stages:

1) Lexical Analysis:
Lexical analyzer: scans the input stream and converts sequences of
characters into tokens.
Token: a classification of groups of characters.
Examples:   Lexeme    Token
            Sum       ID
            for       FOR
            =         ASSIGN_OP
            ==        EQUAL_OP
            57        INTEGER_CONST
            "Abcd"    STRING_CONST
            *         MULT_OP
            ,         COMMA
            ;         SEMICOLON
            (         LEFT_PAREN
Lex is a tool for writing lexical analyzers.
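
As a rough illustration of the decision a lexical analyzer makes, the hand-written C sketch below classifies a single lexeme into one of the token names used in the examples. The function name classify_lexeme and the "UNKNOWN" fallback are assumptions made for this sketch only; lex generates a far more general routine from regular expressions.

```c
#include <ctype.h>
#include <string.h>

/* Hypothetical helper: returns the token name for one lexeme.
   Only some of the lexemes from the examples are handled. */
const char *classify_lexeme(const char *lexeme) {
    size_t i, n = strlen(lexeme);
    if (n == 0) return "UNKNOWN";
    if (strcmp(lexeme, "for") == 0) return "FOR";
    if (strcmp(lexeme, "==") == 0) return "EQUAL_OP";
    if (strcmp(lexeme, "=") == 0) return "ASSIGN_OP";
    if (strcmp(lexeme, "*") == 0) return "MULT_OP";
    if (strcmp(lexeme, ",") == 0) return "COMMA";
    if (strcmp(lexeme, "(") == 0) return "LEFT_PAREN";
    /* string constant: starts and ends with a double quote */
    if (n >= 2 && lexeme[0] == '"' && lexeme[n - 1] == '"')
        return "STRING_CONST";
    /* integer constant: digits only */
    for (i = 0; i < n && isdigit((unsigned char)lexeme[i]); i++)
        ;
    if (i == n) return "INTEGER_CONST";
    /* identifier: a letter followed by letters or digits */
    if (isalpha((unsigned char)lexeme[0])) {
        for (i = 1; i < n && isalnum((unsigned char)lexeme[i]); i++)
            ;
        if (i == n) return "ID";
    }
    return "UNKNOWN";
}
```

Calling classify_lexeme("Sum") yields "ID", while classify_lexeme("for") yields "FOR" because the keyword is tested before the identifier pattern.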

2) Syntactic Analysis (Parsing):

Parser: reads tokens and assembles them into language constructs using
the grammar rules of the language.
Yacc (Yet Another Compiler Compiler) is a tool for constructing parsers.

3) Actions:
Acting upon input is done by code supplied by the compiler writer.
Basic model of parsing for interpreters and compilers:

input stream -> Lexical analyzer -> stream of tokens -> Parser -> parse tree -> Actions -> output executable

Lex: reads a specification file containing regular expressions and generates
a C routine that performs lexical analysis.
Matches sequences that identify tokens.
Yacc: reads a specification file that codifies the grammar of a language and
generates a parsing routine.

Using lex and yacc tools:

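Lex specification (*.l)  -> lex  -> lex.yy.c, which defines yylex()
Yacc specification (*.y) -> yacc -> y.tab.c, which defines yyparse()
Custom C routines (*.c) and libraries can be compiled and linked with the generated files.

$gcc -o scanner lex.yy.c     (builds the executable scanner)
$gcc -o parser y.tab.c       (builds the executable parser)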
Lex
Regular Expressions in lex:
a matches a
abc matches abc
[abc] matches a, b or c
[a-f] matches a, b, c, d, e, or f
[0-9] matches any digit
X+ matches one or more of X
X* matches zero or more of X
[0-9]+ matches any natural number
(…) grouping an expression into a single unit
| alternation (or)
(a|b|c)* is equivalent to [a-c]*
X? X is optional (0 or 1 occurrence)
if(def)? matches if or ifdef (equivalent to if|ifdef)
[A-Za-z] matches any alphabetical character
. matches any character except newline character
\. matches the dot character
\n matches the newline character
\t matches the tab character
\\ matches the \ character
[ \t] matches either a space or tab character
[^a-d] matches any character other than a,b,c and d

Examples:
Real numbers, e.g., 0, 27, 2.10, .17
[0-9]+|[0-9]+\.[0-9]+|\.[0-9]+
[0-9]+(\.[0-9]+)?|\.[0-9]+
[0-9]*(\.)?[0-9]+
To include an optional preceding sign: [+-]?[0-9]*(\.)?[0-9]+
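
The signed pattern can be tried out without lex. The sketch below uses the POSIX regex.h interface with extended regular expressions, which is an assumption of this example and not what lex does internally. The function name is_signed_real is hypothetical, and the anchors ^ and $ are added so that the whole string has to match.

```c
#include <regex.h>
#include <stddef.h>

/* Returns 1 if the whole string s matches the signed-real pattern,
   0 otherwise. Uses POSIX extended regular expressions, not lex. */
int is_signed_real(const char *s) {
    regex_t re;
    int matched;
    /* ^ and $ anchor the pattern so that the entire string must match */
    if (regcomp(&re, "^[+-]?[0-9]*(\\.)?[0-9]+$",
                REG_EXTENDED | REG_NOSUB) != 0)
        return 0;
    matched = (regexec(&re, s, 0, NULL, 0) == 0);
    regfree(&re);
    return matched;
}
```

With this helper, the strings 0, 27, 2.10, .17 and -3.5 are accepted, while 2. and abc are rejected.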
Contents of a lex specification file:
definitions
%%
regular expressions and associated actions (rules)
%%
user routines

Example ($ is the unix prompt):

$emacs ex1.l
$ls
ex1.l
$cat ex1.l
%option main
%%
funny printf("I recognized FUNNY");
$lex ex1.l
$ls
ex1.l lex.yy.c
$gcc -o ex1 lex.yy.c
$ls
ex1 ex1.l lex.yy.c
$emacs test
$cat test
fun
funny
Ali is funny
this course is fun

$cat test | ./ex1        (or, equivalently: $./ex1 < test)

fun
I recognized FUNNY
Ali is I recognized FUNNY
this course is fun

During pattern matching, lex searches the set of patterns for the single longest
possible match.
$cat ex2.l
%option main
%%
fun printf("FUN");
funny printf("FUNNY");
$cat test | ./ex2
FUN
FUNNY
Ali is FUNNY
this course is FUN
Lex declares an external variable called yytext, which contains the matched
string.
$cat ex3.l
%option main
%%
tom|jerry printf(">%s<", yytext);
$cat test3
Did tom chase jerry?
$cat test3 | ./ex3
Did >tom< chase >jerry<?
Definitions:
/* float0.l */
%option main
%%
[+-]?[0-9]*(\.)?[0-9]+ printf("FLOAT");

input: ab7.3c--5.4.3+d++5-
output: abFLOATc-FLOATFLOAT+d+FLOAT-

The same lex specification can be written as:

/* float1.l */
%option main
digit [0-9]
%%
[+-]?{digit}*(\.)?{digit}+ printf("FLOAT");

Local variables can be defined:

/* float2.l */
%option main
digit [0-9]
sign [+-]
%%
{sign}?{digit}*(\.)?{digit}+ { float val;
sscanf(yytext, "%f", &val);
printf(">%f<", val);
}
Input            Output
ali-7.8veli      ali>-7.800000<veli
ali--07.8veli    ali->-7.800000<veli
+3.7.5           >3.700000<>0.500000<

Other examples
/* echo-upcase-words.l */
%option main
%%
[A-Z]+[ \t\n\.\,] printf("%s",yytext);
. ; /* no action specified */
The scanner with the specification above echoes all strings of capital letters,
followed by a space, tab (\t), newline (\n), dot (\.), or comma (\,) to stdout,
and all other characters will be ignored.
Input                  Output
Ali VELI A7, X. 12     VELI X.
HAMI BEY a             HAMI BEY
Definitions can be used in definitions:
/* def-in-def.l */
%option main
alphabetic [A-Za-z_$]
digit [0-9]
alphanumeric ({alphabetic}|{digit})
%%
{alphabetic}{alphanumeric}* printf("Java identifier");
\, printf("Comma");
\{ printf("Left brace");
\= printf("Assignment op");
\=\= printf("Equality op");

Among all of the rules that match the same number of characters, the rule given
first in the file will be chosen.
Example,
/* rule-order.l */
%option main
%%
for printf("FOR");
[a-z]+ printf("IDENTIFIER");
for input
for count = 1 to 10
the output would be
FOR IDENTIFIER = 1 IDENTIFIER 10

However, if we swap the two lines in the specification file:

%option main
%%
[a-z]+ printf("IDENTIFIER");
for printf("FOR");
for the same input
the output would be
IDENTIFIER IDENTIFIER = 1 IDENTIFIER 10

Note that lex issues a warning about this problem, since the second rule can never be matched.

Important Lex Rules:

1) At any point in the input stream, the rule that matches the longest string
is used.
2) If two or more rules match the same input string, the one given the
earliest in the specification file is used.
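
The two rules can be sketched in plain C as follows. This is only an illustration under stated assumptions: the matcher functions match_for and match_lower and the selector pick_rule are hypothetical names, and a real lex scanner uses a generated automaton instead of trying each rule in turn.

```c
#include <string.h>

/* Rule 0: the keyword "for". Returns the match length at s, or 0. */
static int match_for(const char *s) {
    return strncmp(s, "for", 3) == 0 ? 3 : 0;
}

/* Rule 1: [a-z]+ . Returns the match length at s, or 0. */
static int match_lower(const char *s) {
    int n = 0;
    while (s[n] >= 'a' && s[n] <= 'z')
        n++;
    return n;
}

/* Picks the rule to apply at s: the longest match wins, and on a tie
   the rule with the smaller index (given earlier) wins.
   Returns the rule index or -1, and stores the match length in *len. */
int pick_rule(const char *s, int *len) {
    int (*rules[])(const char *) = { match_for, match_lower };
    int best = -1, i;
    *len = 0;
    for (i = 0; i < 2; i++) {
        int n = rules[i](s);
        if (n > *len) {   /* strictly longer only: later rules never win ties */
            *len = n;
            best = i;
        }
    }
    return best;
}
```

For the input "for count" both rules match three characters, so rule 0 (the keyword) is chosen; for "format" the second rule matches six characters and wins by length.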

Important note:
Do not leave extra spaces and/or empty lines at the end of a lex specification
file.
Yacc
A yacc specification describes a context-free grammar (CFG) that can be used to generate a parser.
Elements of a CFG:
1. Terminals: tokens and literal characters,
2. Variables (nonterminals): syntactical elements,
3. Production rules, and
4. Start rule.

Format of a production rule:

symbol: definition
{action}
;
Example:
<a> ::= <b>c in BNF is written as
a: b 'c'; in yacc

Format of a yacc specification file:

declarations
%%
grammar rules and associated actions
%%
C programs

Declarations: To define tokens and their characteristics

%token: declare names of tokens
%left: define left-associative operators
%right: define right-associative operators
%nonassoc: define operators that may not associate with themselves
%type: declare the type of variables
%union: declare multiple data types for semantic values
%start: declare the start symbol (default is the first variable in rules)
%prec: assign precedence to a rule
%{
C declarations directly copied to the resulting C program
(e.g., variables, types, macros…)
%}
Example: A yacc specification to accept L = {a^n b^n | n > 0}.
/* anbn0.l */
%%
a return (A);
b return (B);
. return (yytext[0]);
\n return ('\n');
%%
int yywrap() { return 1; }

Function yywrap() is called by lex when input is exhausted. Return 1 if you are
done or 0 if more processing is required.

/*anbn0.y */
%token A B
%%
anbn: s '\n' {return 0;}
s: A B
| A s B
;
%%
#include "lex.yy.c"
int main() {
return yyparse();
}
int yyerror( char *s ) { fprintf(stderr, "%s\n", s); return 0; }

If the input stream cannot be derived from the start variable, the default
message of "syntax error" is printed and the program terminates.
However, customized error messages can be generated.
/*anbn1.y */
%token A B
%%
anbn: s '\n' { printf(" is in anbn\n");
return 0;}
s: A B
| A s B
;
%%
#include "lex.yy.c"
void yyerror(char *s) { printf("%s, it is not in anbn\n", s); }
int main() {
return yyparse();
}
$./anbn
aabb
is in anbn
$./anbn
acadbefbg
Syntax error, it is not in anbn
$
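
For comparison, the rule s: A B | A s B can also be checked by a small hand-written recursive function. The sketch below is an assumption-laden illustration: match_s and in_anbn are hypothetical names, and yacc does not generate code of this shape (it generates a table-driven parser).

```c
#include <stddef.h>

/* Tries to match  s: 'a' 'b' | 'a' s 'b'  starting at p.
   Returns a pointer just past the matched text, or NULL on failure. */
static const char *match_s(const char *p) {
    if (*p != 'a') return NULL;
    p++;
    if (*p == 'a') {              /* second alternative: a s b */
        p = match_s(p);
        if (p == NULL) return NULL;
    }
    if (*p != 'b') return NULL;
    return p + 1;
}

/* Returns 1 if line is a^n b^n (n > 0) followed by a newline, else 0. */
int in_anbn(const char *line) {
    const char *end = match_s(line);
    return end != NULL && end[0] == '\n' && end[1] == '\0';
}
```

Here in_anbn("aabb\n") returns 1, while in_anbn("acadbefbg\n") returns 0, mirroring the two runs shown above.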
A grammar to accept L = {a^n b^n | n ≥ 0}.
/*anbn_0.y */
%token A B
%%
anbn: s '\n' { printf(" is in anbn_0\n");
return 0;}
s: empty
| A s B
;
empty: ;
%%
#include "lex.yy.c"
void yyerror(char *s){ printf("%s, it is not in anbn_0\n", s); }
int main() {
return yyparse();
}

Positional assignment of values for items.

$$: left-hand side
$1: first item in the right-hand side
$n: nth item in the right-hand side
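
One way to picture these positional values is as entries on a value stack. The C sketch below is a simplified model, not the code yacc generates: the names value_stack, shift_value, rhs_value, reduce_with and add_two are all hypothetical, bounds checks are omitted, and a real parser keeps a state stack next to the value stack.

```c
#define MAX_DEPTH 64

static int value_stack[MAX_DEPTH];  /* semantic values of the symbols read so far */
static int depth = 0;

/* Shift: push the value the lexer stored for a token. */
void shift_value(int token_value) {
    value_stack[depth++] = token_value;
}

/* $k for a rule with n right-hand-side symbols, k = 1..n */
int rhs_value(int n, int k) {
    return value_stack[depth - n + (k - 1)];
}

/* Reduce: pop the n right-hand-side values and push $$ in their place. */
void reduce_with(int n, int result) {
    depth -= n;
    value_stack[depth++] = result;
}

/* Mimics a rule such as  sum: INT PLUS INT { $$ = $1 + $3; } */
int add_two(int left, int right) {
    depth = 0;
    shift_value(left);    /* INT  -> $1 */
    shift_value(0);       /* PLUS -> $2, its value is unused */
    shift_value(right);   /* INT  -> $3 */
    reduce_with(3, rhs_value(3, 1) + rhs_value(3, 3));
    return value_stack[0];  /* $$ of the reduced rule */
}
```

In this model add_two(3, 5) leaves 8 on the stack as the value of the left-hand side.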
Example: Simple adder
/* add.l */
digit [0-9]
%%
{digit}+ {sscanf(yytext, "%d", &yylval);
return(INT);
}
\+ return(PLUS);
\n return(NL);
. ;
%%
int yywrap() { return 1; }
/* add.y */
/* L = {INT PLUS INT NL} */
%token INT PLUS NL
%%
add: INT PLUS INT NL { printf(" = %d\n", $1 + $3);}
%%
#include "lex.yy.c"
void yyerror(char *s) { printf("%s\n", s); }
int main() {
return yyparse();
}

$ ./add
003 + 05
= 8
1+2
syntax error

Example: printing integers in a loop

/* print-int.l */
%%
[0-9]+ {sscanf(yytext, "%d", &yylval);
return(INTEGER);
}
\n return(NEWLINE);
. return(yytext[0]);
%%
int yywrap() { return 1; }

/* print-int.y */
%token INTEGER NEWLINE
%%
lines: /* empty */
| lines NEWLINE
| lines value NEWLINE {printf(" =%d\n", $2);}
| error NEWLINE {yyerror("! Reenter: "); yyerrok;}
;
value: INTEGER {$$ = $1;}
;
%%
#include "lex.yy.c"
void yyerror(char *s) { printf("%s", s); }
int main() {
return yyparse();
}

error is a special token provided by yacc for error recovery. The macro yyerrok
says, "the old error is finished."
Execution:
$./print-int
7
=7
007
=7
funny
syntax error! Reenter: 0007
=7
^D

Keeping track of line numbers in the source:

/* print-int-wln.l */
/* printing integers with line numbers */
%%
[0-9]+ { sscanf(yytext, "%d", &yylval);
return(INTEGER);
}
\n { extern int lineno; lineno++;
return(NEWLINE);
}
. return(yytext[0]);
%%
int yywrap() { return 1; }

/* print-int-wln.y */
/* prints integers with line numbers */
%token INTEGER NEWLINE
%%
lines: /* empty */
| lines NEWLINE
| lines line NEWLINE {printf("%d) %d\n", lineno, $2);}
| error NEWLINE { printf(" in line %d!\nReenter: ",lineno);
yyerrok;
}
;
line: INTEGER {$$ = $1;}
%%
#include "lex.yy.c"
int lineno=0;
void yyerror(char *s) { printf("%s", s); }
int main() {
return yyparse();
}
Execution:
$./print-int-wln
007
1) 7
jhg
syntax error in line 2!
Reenter: 66
3) 66
_

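Although right-recursive rules can be used in yacc, left-recursive rules are
preferred and, in general, generate more efficient parsers: a left-recursive
rule is reduced as each element is read, whereas a right-recursive rule keeps
every element on the parser stack until the last one has been read.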
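
The difference can be made concrete by counting how many symbols sit on the parser stack. The sketch below is a simplified model under stated assumptions: it counts grammar symbols only, the rules list: item | list item and list: item | item list are hypothetical examples, and the function names are made up for this illustration.

```c
/* Maximum stack depth needed to parse n items (n >= 1) with the
   left-recursive rule   list: item | list item   */
int max_depth_left(int n) {
    int depth = 0, max = 0, i;
    for (i = 0; i < n; i++) {
        depth++;                    /* shift item */
        if (depth > max) max = depth;
        if (i > 0) depth--;         /* reduce "list item" (2 symbols) to list (1) */
        /* for i == 0, "item" is reduced to list and the depth stays 1 */
    }
    return max;
}

/* Same for the right-recursive rule   list: item | item list   */
int max_depth_right(int n) {
    int depth = 0, max = 0, i;
    for (i = 0; i < n; i++) {
        depth++;                    /* shift item; nothing can be reduced yet */
        if (depth > max) max = depth;
    }
    /* only now: reduce the last item to list, then "item list" n - 1 times */
    for (i = 1; i < n; i++)
        depth--;
    return max;
}
```

For 100 items the left-recursive rule never needs more than 2 stack entries in this model, while the right-recursive rule needs 100.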
The type of yylval is int by default. To change the type of yylval use
macro YYSTYPE in the declarations section of a yacc specifications file.
%{
#define YYSTYPE double
%}
If token values can have more than one data type,
yylval is declared as a union.
Example with three possible types for yylval:
%union{
double real; /* real value */
int integer; /* integer value */
char str[30]; /* string value */
}
Example:
yytext = "0012", type of yylval: int, value of yylval.integer: 12
yytext = "+1.70", type of yylval: double, value of yylval.real: 1.7
The type of associated values of tokens can be specified by %token as
%token <real> REAL
%token <integer> INTEGER
%token <str> IDENTIFIER STRING
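
In plain C terms, the %union declaration above corresponds roughly to a union type for yylval. The sketch below illustrates this; the helper functions set_integer and set_real are hypothetical and only show how a lexical analyzer might fill the member that matches the token type.

```c
#include <stdio.h>

/* Roughly the C type that the %union declaration stands for. */
typedef union {
    double real;       /* real value */
    int    integer;    /* integer value */
    char   str[30];    /* string value */
} YYSTYPE;

/* Hypothetical helpers: fill the member that matches the token type.
   Each returns 1 on success and 0 if the text could not be converted. */
int set_integer(YYSTYPE *val, const char *text) {
    return sscanf(text, "%d", &val->integer) == 1;
}

int set_real(YYSTYPE *val, const char *text) {
    return sscanf(text, "%lf", &val->real) == 1;
}
```

Only one member of the union is valid at a time, which is why each token is declared with the member it uses.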

Values associated with tokens, yylval

Input stream -> Lexical analyzer yylex() -> tokens and literal strings -> Parser yyparse() -> 0 or 1

Unmatched strings are copied to stdout.
yyparse() is called from main() and returns 0 if the input is valid, 1 if the input is invalid.
To return values, associated with tokens, from a lexical analyzer:
/* types.l */
alphabetic [A-Za-z]
digit [0-9]
alphanumeric ({alphabetic}|{digit})
%%
[+-]?{digit}*(\.)?{digit}+ {sscanf(yytext, "%lf", &yylval.real);
return REAL;
}
{alphabetic}{alphanumeric}* {strcpy(yylval.str, yytext);
return IDENTIFIER;
}
\<\- return ASSIGNOP;
\n return NL;
%%
int yywrap() { return 1; }

The type of a variable (nonterminal) can be declared with %type:

%type <real> real_expr
%type <integer> integer_expr
/* types.y */
%union{
double real; /* real value */
int integer; /* integer value */
char str[30]; /* string value */
}
%token <real> REAL
%token <str> IDENTIFIER
%token ASSIGNOP NL
%type <real> assignment_stmt
%%
assignment_stmt: IDENTIFIER ASSIGNOP REAL NL {
$$ = $3;
printf("%s is assigned to %g\n", $1, $$);
}
%%
#include "lex.yy.c"
void yyerror(char *s) { printf("%s, it is not an assignment!\n", s); }
int main() {
return yyparse();
}

[guvenir@dijkstra types]$ ./types

total <- -01.57
total is assigned to -1.57
^D

Example: a yacc specification of a calculator is given on the web page of the course.
(http://www.cs.bilkent.edu.tr/~guvenir/courses/CS315/lex-yacc/calculator/)
Actions between rule elements:
/* actions.l */
%%
a return A;
b return B;
\n return NL;
. ;
%%
int yywrap() { return 1; }

/* actions.y */
%{
#include <stdio.h>
%}
%token A B NL
%%
s: {printf("1");}
a
{printf("2");}
b
{printf("3");}
NL
{return 0;}
;
a: {printf("4");}
A
{printf("5");}
;
b: {printf("6");}
B
{printf("7");}
;
%%
#include "lex.yy.c"
int yyerror(char *s) {
printf ("%s\n", s);
return 0;
}
int main(void){ return yyparse(); }

actions: 14ab
52673
actions 14aa
526syntax error
actions 14ba
syntax error
actions 14xyzafghbnm
52673
Conflicts
Pointer model: A pointer moves (right) on the RHS of a rule while input tokens
and variables are processed.

%token A B C
%%
start: A B C ; /* after reading A, the pointer is here: start: A . B C */

When all elements on the right-hand side are processed (the pointer reaches
the end of a rule), the rule is reduced.
If a rule reduces, the pointer then returns to the rule where it was called.

Conflict: There is a conflict if a rule is reduced when there is more than one
pointer. yacc looks one-token-ahead to see if the number of pointers
reduces to one before declaring a conflict.
Example:
%token A B C D E F
%%
start: x | y;
x: A B C D;
y: A B E F;
After tokens A and B, either one of the pointers, or both, will disappear. For
example, if the next token is E, the first pointer will disappear; if the next
token is C, the second pointer will disappear. If the next token is anything
other than C or E, both pointers will disappear. Therefore, there is no conflict.

The other way for pointers to disappear is to merge in a common subrule.

Example:
%token A B C D E F
%%
start: x | y;
x: A B z D E;
y: A B z D F;
z: C;
Initially, there are two pointers, one in x, the other in y rules. After reading
tokens A, and B, these two pointers shift. Then, these two pointers merge in
the z rule. The state after reading token C is shown below (the pointer position
is given in the comment, marked with a dot).
%token A B C D E F
%%
start: x | y ;
x: A B z D E ;
y: A B z D F ;
z: C ;           /* pointer: z: C . */
However, after reading A B C, the z rule reduces. There is only one pointer
when z reduces. Then, this pointer splits again into two pointers in x and y
rules.
%token A B C D E F
%%
start: x | y ;
x: A B z D E ;   /* pointer: x: A B z . D E */
y: A B z D F ;   /* pointer: y: A B z . D F */
z: C ;
No conflicts.

Conflict example:
%token A B
%%
start: x B | y B ;
x: A ; /* reduce */
y: A ; /* reduce: reduce/reduce conflict on B */
After A, there are two pointers. Both rules (x and y) want to reduce at the
same time. If the next token is B, there will still be two pointers. Such
conflicts are called reduce/reduce conflicts.

Note that yacc looks one-token-ahead before declaring any conflict.

%token A B C D E
%%
start: A x C D | A y C E ;
x: B ;
y: B ; /* reduce/reduce conflict on C */
The pointers in x and y rules will reduce on C, resulting in a reduce/reduce
conflict on C, although the grammar is not ambiguous. If yacc had looked two
tokens ahead, it would have realized that only one pointer would remain on
tokens D or E, and no pointer otherwise, so it would not declare any conflict.
Another type of conflict occurs when one rule reduces while the other shifts.
Such conflicts are called shift/reduce conflicts.
Example:
%token A R
%%
start: x | y R;
x: A R ; /* shift */
y: A ; /* reduce: shift/reduce conflict on R */
After A, y rule reduces, x rule shifts. The next token for both cases is R.
Example:
%token A
%%
start: x | y;
x: A; /* reduce */
y: A; /* reduce: reduce/reduce conflict on $end */
At the end of each string there is a $end token. Therefore, yacc declares
reduce/reduce conflict on $end for the grammar above.

Debugging:
$yacc -v filename.y
produces a file named y.output for debugging purposes.
Example:
%token A P
%%
s: x | y P;
x: A P; /* shifts on P */
y: A; /* reduces on P */
The y.output file for the grammar above is shown below. Explanatory notes follow
the // marks and are not part of the file.

   0  $accept : s $end

   1  s : x                 // s: x is called rule number 1
   2    | y P
   3  x : A P
   4  y : A

// Each state corresponds to a unique combination of possible pointers in
// the yacc specifications file.

state 0
    $accept : . s $end

    A  shift 1              // In state 0, if the lookahead token is A, then push the current
                            // state (0) onto the stack, shift the pointer, goto state 1.
    .  error                // Otherwise, call yyerror()

    s  goto 2               // When s rule is reduced goto state 2
    x  goto 3
    y  goto 4

1: shift/reduce conflict (shift 5, reduce 4) on P
                            // Shift and goto state 5, or reduce rule 4:
                            // shift/reduce conflict on P
state 1
    x : A . P  (3)          // One pointer is in rule 3 between tokens A and P
    y : A .  (4)            // The other pointer is in rule (4) after token A

    P  shift 5              // If the next token is P, the system will choose to shift
                            // and goto state 5.

state 2                     // State 2: input matched the start variable s,
    $accept : s . $end  (0) // if this is the end of string, accept it.

    $end  accept

state 3                     // State 3: rule (1) s: x is to be reduced on any next token
    s : x .  (1)

    .  reduce 1             // . stands for any character or token

state 4                     // State 4: pointer is in rule 2, after y rule is processed
    s : y . P  (2)

    P  shift 6              // If the look-ahead token is P, shift the pointer, go to state 6
    .  error                // If the look-ahead token is anything else, call yyerror()

state 5                     // State 5: token A and then token P are seen.
    x : A P .  (3)

    .  reduce 3             // Reduce rule (3) without consulting the look-ahead token

state 6
    s : y P .  (2)

    .  reduce 2             // Reduce rule (2) without consulting the look-ahead token

Rules never reduced:
    y : A  (4)

State 1 contains 1 shift/reduce conflict.

4 terminals, 4 nonterminals     // {$end, A, P, .} and {$accept, s, x, y}
5 grammar rules, 7 states

Recursive Rules:
Consider the following grammar:
/* recursive.y */
%token A
%%
s: A // L ={A, AAA, AAAAA, …}, Not ambiguous !
| A s A
;

y.output file (explanatory notes follow the // marks):

   0  $accept : s $end

   1  s : A
   2    | A s A

state 0
    $accept : . s $end  (0)

    A  shift 1
    .  error

    s  goto 2               // if the state machine pops back to this state and the
                            // lookahead symbol is s, the parser will go to state 2

1: shift/reduce conflict (shift 1, reduce 1) on A
state 1
    s : A .  (1)            // reduce rule (1)
    s : A . s A  (2)        // shift in rule (2)

    A  shift 1              // if A, shift to state 1, that is, stay in the same state
    $end  reduce 1          // if $end, reduce rule 1

    s  goto 3
...
However, the same language can also be represented by the following
grammar, which does not have any conflict.
/* recursive.y */
%token A
%%
s: A // L ={A, AAA, AAAAA, …}, Not ambiguous !
| s A A
;

Actions on a Rule:
Actions can appear anywhere in the RHS of a rule.
However, for technical reasons, it is convenient for yacc to transform the
grammar so that actions always appear at the very end.
For this reason, yacc introduces new variables, called marker variables (non-
terminals), so that all actions are at the end of the rules.
Example,
Rule
a: {action1} b {action2} c {action3};
is replaced by
a: $$1 b $$2 c {action3};
$$1: {action1}; // Empty rules
$$2: {action2};

Example:
%token A B NL
%%
start: x | y;
x: A A NL ;
y: A B NL ;

Internally:
0 $accept : start $end
1 start : x
2 | y
3 x : A A NL
4 y : A B NL

No Conflict.
However, the following equivalent grammar
%token A B NL
%%
start: x | y;
x: {printf("using x");} A A NL ;
y: {printf("using y");} A B NL ;

is converted into:
0 $accept : start $end
1 start : x
2 | y
3 $$1 :
4 x : $$1 A A NL
5 $$2 :
6 y : $$2 A B NL

Conflict:
reduce/reduce conflict (reduce 3, reduce 5) on A

Make utility
Using the make utility on Linux systems:
Contents of the file named Makefile (each command line must start with a tab character):
parser: lex.yy.c y.tab.c
	gcc -o parser y.tab.c
y.tab.c: parser.y
	yacc parser.y
lex.yy.c: scanner.l
	lex scanner.l

On the command prompt, just type

make
It automatically determines which source files (in this example, y.tab.c,
parser.y, lex.yy.c, scanner.l) of a program (parser in this
example) need to be recompiled and/or linked.

Bibliography
Saumya Debray, "A Quick Introduction to Handling Conflicts in Yacc Parsers",
https://www2.cs.arizona.edu/~debray/Teaching/CSc453/DOCS/conflicts.pdf
Tom Niemann, "LEX & YACC TUTORIAL",
https://www.epaperpress.com/lexandyacc/
