0% found this document useful (0 votes)

8 views39 pages

Complier Design Documentation

A compiler is a program that converts high-level source code into low-level machine language, involving several components like pre-processors, assemblers, linkers, and loaders. The compilation process consists of multiple phases including lexical analysis, syntax analysis, semantic analysis, intermediate code generation, code optimization, and code generation. Additionally, parsing techniques such as top-down and bottom-up parsing are utilized, with various types of parsers like recursive descent, predictive, and shift-reduce parsers being discussed.

Uploaded by

edla.laxman

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views39 pages

Complier Design Documentation

Uploaded by

edla.laxman

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 39

Compiler Design

What is Compiler ?
A compiler is a computer program which helps you transform source code
written in a high-level language into low-level language.

Steps for Language processing systems

Before knowing about the concept of compilers, you first need to understand a
few other tools which work with compilers.
Here we are drawing the language process system by using the compiler
Pre-processor
The pre-processor is considered as a part of the Compiler. It is a tool which
produces input for Compiler. It deals with macro processing, augmentation,
language extension, etc.

1
Compiler
A compiler is a computer program which helps you transform source code written
in a high-level language into low-level machine language.
Assembler:
It translates assembly language code into machine understandable language. The
output result of assembler is known as an object file which is a combination of
machine instruction as well as the data required to store these instructions in
memory.
• Linker:
The linker helps you to link and merge various object files to create an executable
file. All these files might have been compiled with separate assemblers.

2
Loader:
The loader is a part of the OS, which performs the tasks of loading executable
files into memory and run them. It also calculates the size of a program which
creates additional memory space.
Machine code
a computer programming language consisting of binary or hexadecimal
instructions which a computer can respond to directly.

Phases of Compiler
The compilation process contains the sequence of various phases. Each phase
takes source program in one representation and produces output in another
representation. Each phase takes input from its previous stage.

3
There are the various phases of compiler:

4
Lexical Analysis
The first phase of scanner works as a text scanner. This phase scans the source
code as a stream of characters and converts it into meaningful lexemes. Lexical
analyser represents these lexemes in the form of tokens.
<token-name, attribute-value>
Example:
Sum=old sum Rate*50
id1=id2+id3*50
Syntax Analysis
The next phase is called the syntax analysis or parsing.
the parser checks if the expression made by the tokens is syntactically correct.
id1=id2+id3*50

Example

5
Semantic Analysis
Semantic analysis is the third phase of compilation process. It checks whether
the parse tree follows the rules of language. Semantic analyser keeps track of
identifiers, their types and expressions. The output of semantic analysis phase is
the annotated tree syntax.
• Intermediate Code Generation
compiler generates the source code into the intermediate code. Intermediate
code is generated between the high-level language and the machine language.
Example
id1=id2+id3*50
temp1 = inttoreal(50)
temp2 = id3*temp1
temp3 = id2+temp2
id1 = temp3

Code Optimization
It removes the unnecessary lines of the code and arranges the sequence of
statements in order to speed up the program execution.
Example id1=id2+id3*50
temp1=id3*50.0
id1=id2+temp1
• Code Generation
Code generation is the final stage of the compilation process. It takes the
optimized intermediate code as input and maps it to the target machine
language. Example: id1=id2+id3*50

MOV R1, Id3

MUL R1#50.0
MOV R2, id2

6
ADD R1, R2
MOV Id1, R1

Syntax directed translation

In syntax directed translation, along with the grammar we associate some
informal notations and these notations are called as semantic rules.
So, we can say that
1. Grammar + semantic rule = SDT (syntax directed translation)
In syntax directed translation, every non-terminal can get one or more than one
attribute or sometimes 0 attribute depending on the type of the attribute. The
value of these attributes is evaluated by the semantic rules associated with the
production rule.
In the semantic rule, attribute is VAL and an attribute may hold anything like a
string, a number, a memory location and a complex record
• In Syntax directed translation, whenever a construct encounters in the
programming language then it is translated according to the semantic
rules define in that particular programming language.

7
Productions Semantic rule

Parser

A parser takes input in the form of sequence of tokens and produces output in
the form of parse tree.
Parsing is of two types: 1.top down parsing
2.bottom up parsing.

8
Top-down parsing

• The process of constructing the parse tree which starts from the root and
goes down to the leaf is Top-Down Parsing.
• Top-Down Parsers constructs from the Grammar which is free from
ambiguity and left recursion. Top-Down Parsers uses leftmost derivation
to construct a parse tree.

9
Example
S aABe
A bc
B d
Input string is abcde.

10
Recursive descent parser
• Recursive Descent Parser uses the technique of Top-Down Parsing
without backtracking.
• It can be defined as a Parser that uses the various recursive procedure to
process the input string with no backtracking. It can be simply performed
using a Recursive language.
Backtracking
Example1 − Consider the Grammar
S→aAd
A→bc|b
i/p=abd

11
Predictive Parser:

In this, we will cover the overview of Predictive Parser and mainly focus on
the role of Predictive Parser. And will also cover the algorithm for the
implementation of the Predictive parser algorithm and finally will discuss an
example by implementing the algorithm for precedence parsing.

Example :

Given grammar

E->E+T|T
T->T*F|F
F->(E)|id

After removing left recursion, left factoring

Computation of FIRST & FOLLOW

First(E)=First(T)=FIRST(F)={(,id}

First(

12
Bottom-Up Parsing
Bottom-up parsing parses the stream of tokens from the lexical analyzer. And
after parsing the input string it generates a parse tree.
The bottom-up parser builds a parse tree from the leaf nodes and proceeds
towards the root node of the tree. In this section, we will be discussing bottom-
up parsing along with its types.

Example:

S aABe
A Abc/b
B d
Input string “ abbcde ”
• abbcde
aAbcde(A b)
aAde(A Abc)
aABe(B d)
S(S aABe)

13
Example

E → E+T|T
T → T*F|F
F → (E)|id i/p=id*id

id*id
F*id (F id)
T*id (T F)
T*F (F id)
T (E T)
E

Example

E → E+T|T
T → T*F|F
F → (E)|id i/p=id*id
id*id
F*id (F id)
T*id (T F)
T*F (F id)
T (E T)
E

14
Shift reduce parser:

Shift Reduce Parser is a type of Bottom-Up Parser. It generates the

Parse Tree from Leaves to the Root. In Shift Reduce Parser, the input
string will be reduced to the starting symbol. This reduction can be
produced by handling the rightmost derivation in reverse, i.e., from
starting symbol to the input string.
Shift Reduce Parser requires two Data Structures
• Input Buffer
• Stack
There are the various steps of Shift Reduce Parsing which are as
follows −
There are the various steps of Shift Reduce Parsing which are as
follows −

• It uses a stack and an input buffer.

• Insert $ at the bottom of the stack and the right end of the input
string in Input Buffer.

• Shift − Parser shifts zero or more input symbols onto the stack
until the handle is on top of the stack.
• Reduce − Parser reduce or replace the handle on top of the stack
to the left side of production, i.e., R.H.S. of production is
popped, and L.H.S is pushed.

15
• Accept − Step 3 and Step 4 will be repeated until it has detected
an error or until the stack includes start symbol (S) and input
Buffer is empty, i.e., it contains $.

Example

Given grammar
E → E+T|T
T → T*F|F
F → (E)|id i/p=id*id

16
Canonical Collection of LR(0) items

Example
Given grammar:
1. S → AA
2. A → aA | b
Add Augment Production and insert '•' symbol at the first position for every
production in G
S` → •S
S → •AA
A → •aA
A → •b

Drawing DFA:

17
LR(0) Table

• If a state is going to some other state on a terminal then it

correspond to a shift move.
• If a state is going to some other state on a variable then it
correspond to go to move.
• If a state contain the final item in the particular row then write
the reduce node completely.

18
SLR(1) Parser

Steps for constructing the SLR parsing table :

1. Writing augmented grammar
2. LR(0) collection of items to be found
3. Find FOLLOW of LHS of production
4. Defining 2 functions:goto and action in the parsing table

Example
S–>AA
A–>aA|b

Solution:

STEP1 – Find augmented grammar

The augmented grammar of the given grammar is:-
S’–>.S [0th production]
S–>.AA [1st production]
A–>.aA [2nd production]

19
A–>.b [3rd production]

STEP2 – Find LR(0) collection of items

The terminals of this grammar are {a,b}.

The non-terminals of this grammar are {S,A}
STEP3 –
Find FOLLOW of LHS of production
FOLLOW(S)=$
FOLLOW(A)=a,b,$
20
Step4-Parsing table

CLR(1) Parser

Steps for constructing CLR parsing table :

1. Writing augmented grammar
2. LR(1) collection of items to be found
3. Defining 2 functions: goto and action in the CLR parsing table

EXAMPLE

S-->AA
A-->aA|b

Solution :

STEP 1 – Find augmented grammar

The augmented grammar of the given grammar is:-
21
S'-->.S ,$ [0th production]
S-->.AA ,$ [1st production]
A-->.aA ,a|b [2nd production]
A-->.b ,a|b [3rd production]

STEP 2 – Find LR(1) collection of items

22
STEP 3-

LALR (1) Parsing:

Steps for constructing the LALR parsing table :

1. Writing augmented grammar
2. LR(1) collection of items to be found
3. Defining 2 functions: goto and action in the LALR parsing table

EXAMPLE

S-->AA
A-->aA|b

Solution:

STEP1- Find augmented grammar

The augmented grammar of the given grammar is:-
S'-->.S ,$ [0th production]

23
S-->.AA ,$ [1st production]
A-->.aA ,a|b [2nd production]
A-->.b ,a|b [3rd production]

STEP2 – Find LR(1) collection of items

STEP 3 –

24
From step 2

• I3 and I6 are similar except their lookaheads.

• I4 and I7 are similar except their lookaheads.
• I8 and I9 are similar except their lookaheads.
• Wherever there is 3 or 6, make it 36(combined form) Wherever there is
4 or 7, make it 47(combined form) Wherever there is 8 or 9, make
it 89(combined form)

25
Three address code in Compiler

Three address code is a type of intermediate code which is easy to generate

and can be easily converted to machine code.It makes use of at most three
addresses and one operator to represent an expression and the value computed
at each instruction is stored in temporary variable generated by compiler.

Example – Consider expression a = b * – c + b * – c.

The three address code is:

t1 = uminus c
t2 = b * t1
t3 = uminus c
t4 = b * t3
t5 = t2 + t4
a = t5
Implementation of Three Address Code –

There are 3 representations of three address code namely

1. Quadruple
2. Triples
3. Indirect Triples

1. Quadruple –
It is structure with consist of 4 fields namely op, arg1, arg2 and result.
op denotes the operator and arg1 and arg2 denotes the two operands and
result is used to store the result of the expression.

Advantage –

• Easy to rearrange code for global optimization.

• One can quickly access value of temporary variables using symbol
table.
Disadvantage –

• Contain lot of temporaries.

• Temporary variable creation increases time and space complexity.

26
Example – Consider expression a = b * – c + b * – c.

The three address code is:

t1 = uminus c
t2 = b * t1
t3 = uminus c
t4 = b * t3
t5 = t2 + t4
a = t5

2. Triples –

This representation doesn’t make use of extra temporary variable to

represent a single operation instead when a reference to another triple’s
value is needed, a pointer to that triple is used. So, it consist of only
three fields namely op, arg1 and arg2.

Disadvantage –

• Temporaries are implicit and difficult to rearrange code.

• It is difficult to optimize because optimization involves moving
intermediate code. When a triple is moved, any other triple referring
to it must be updated also. With help of pointer one can directly
access symbol table entry.

27
Example – Consider expression a = b * – c + b * – c

3. Indirect Triples –

This representation makes use of pointer to the listing of all references

to computations which is made separately and stored. Its similar in
utility as compared to quadruple representation but requires less space
than it. Temporaries are implicit and easier to rearrange code.

Example – Consider expression a = b * – c + b * – c

28
LEX:

o Lex is a program that generates lexical analyzer. It is used with YACC

parser generator.
o The lexical analyzer is a program that transforms an input stream into a
sequence of tokens.
o It reads the input stream and produces the source code as output through
implementing the lexical analyzer in the C program.

The function of Lex is as follows:

o Firstly lexical analyzer creates a program lex.1 in the Lex language. Then
Lex compiler runs the lex.1 program and produces a C program lex.yy.c.
o Finally C compiler runs the lex.yy.c program and produces an object
program a.out.
o a.out is lexical analyzer that transforms an input stream into a sequence of
tokens.

29
Lex file format:
A Lex program is separated into three sections by %% delimiters. The formal of
Lex source is as follows:

1. { definitions }
2. %%
3. { rules }
4. %%
5. { user subroutines

Definitions include declarations of constant, variable and regular definitions.

Rules define the statement of form p1 {action1} p2 {action2}....pn {action}.

Where pi describes the regular expression and action1 describes the actions
what action the lexical analyzer should take when pattern pi matches a lexeme.

User subroutines are auxiliary procedures needed by the actions. The

subroutine can be loaded with the lexical analyzer and compiled separately.

Storage Organization:

o When the target program executes then it runs in its own logical address
space in which the value of each program has a location.
o The logical address space is shared among the compiler, operating system
and target machine for management and organization. The operating
system is used to map the logical address into physical address which is
usually spread throughout the memory.

Storage Allocation

The different ways to allocate memory are:

1. Static storage allocation

2. Stack storage allocation

30
3. Heap storage allocation

Static storage allocation

o In static allocation, names are bound to storage locations.
o If memory is created at compile time then the memory will be created in
static area and only once.
o Static allocation supports the dynamic data structure that means memory
is created only at compile time and deallocated after program completion.
o The drawback with static storage allocation is that the size and position of
data objects should be known at compile time.
o Another drawback is restriction of the recursion procedure.

Stack Storage Allocation

o In static storage allocation, storage is organized as a stack.
o An activation record is pushed into the stack when activation begins and it
is popped when the activation end.
o Activation record contains the locals so that they are bound to fresh storage
in each activation record. The value of locals is deleted when the activation
ends.
o It works on the basis of last-in-first-out (LIFO) and this allocation supports
the recursion process.

Heap Storage Allocation

o Heap allocation is the most flexible allocation scheme.
o Allocation and deallocation of memory can be done at any time and at any
place depending upon the user's requirement.
o Heap allocation is used to allocate memory to the variables dynamically
and when the variables are no more used then claim it back.
o Heap storage allocation supports the recursion process.

Activation Record

o Control stack is a run time stack which is used to keep track of the live
procedure activations i.e. it is used to find out the procedures whose
execution have not been completed.

31
When it is called (activation begins) then the procedure name will push on to
the stack and when it returns (activation ends) then it will popped.

Activation record is used to manage the information needed by a single

execution of a procedure.

An activation record is pushed into the stack when a procedure is called and it is
popped when the control returns to the caller function.

The diagram below shows the contents of activation records:

Return Value: It is used by calling procedure to return a value to calling

procedure.

Actual Parameter: It is used by calling procedures to supply parameters to the

called procedures.

Control Link: It points to activation record of the caller.

Access Link: It is used to refer to non-local data held in other activation records.

Saved Machine Status: It holds the information about status of machine before
the procedure is called.

32
Local Data: It holds the data that is local to the execution of the procedure.

Temporaries: It stores the value that arises in the evaluation of an expression.

Heap Management
The heap is the portion of the store that is used for data that lives indefinitely, or
until the program explicitly deletes it. While local variables typically become
inaccessible when their procedures end, many languages enable us to create
objects or other data whose existence is not tied to the procedure activation that
creates them. For example, both C + + and Java give the programmer new to
create objects that may be passed — or pointers to them may be passed — from
procedure to procedure, so they continue to exist long after the procedure that
created them is gone. Such objects are stored on a heap.

1 The Memory Manager

2 The Memory Hierarchy of a Computer
3 Locality in Programs

1 The Memory Manager

The memory manager keeps track of all the free space in heap storage at all times.
It performs two basic functions:

• Allocation. When a program requests memory for a variable or object,3 the

memory manager produces a chunk of contiguous heap memory of the requested
size. If possible, it satisfies an allocation request using free space in the heap; if
no chunk of the needed size is available, it seeks to increase the heap storage
space by getting consecutive bytes of virtual memory from the operating system.
If space is exhausted, the memory manager passes that information back to the
application program.

• Deallocation. The memory manager returns deallocated space to the pool of

free space, so it can reuse the space to satisfy other allocation requests. Memory
managers typically do not return memory to the operating sys-tem, even if the
program's heap usage drops.
33
2. The Memory Hierarchy of a Computer

3. Locality in Programs

Most programs exhibit a high degree of locality; that is, they spend most of their
time executing a relatively small fraction of the code and touching only a small
fraction of the data. We say that a program has temporal locality if the memory
locations it accesses are likely to be accessed again within a short period of time.
We say that a program has spatial locality if memory locations close to the
location accessed are likely also to be accessed within a short period of time.

34
Peephole Optimization :
Peephole optimization is a type of code Optimization performed on a small
part of the code. It is performed on a very small set of instructions in a
segment of code.
The small set of instructions or small part of code on which peephole
optimization is performed is known as peephole or window.
It basically works on the theory of replacement in which a part of code is
replaced by shorter and faster code without a change in output. The peephole
is machine-dependent optimization.

Characteristics of peephole optimizations:

Redundant-instructions elimination
Flow-of-control optimizations
Algebraic simplifications
Use of machine idioms
Unreachable

Redundant Loads And Stores:

If we see the instructions sequence

(1) MOV R0,a
(2) MOV a,R0

we can delete instructions (2) because whenever (2) is executed. (1) will
ensure that the value of a is already in register R0.If (2) had a label we could not
be sure that (1) was always executed immediately before (2) and so we could not
remove (2).

Unreachable Code:

Another opportunity for peephole optimizations is the removal of

unreachable instructions. An unlabeled instruction immediately following an
unconditional jump may be removed. This operation can be repeated to eliminate
a sequence of instructions. For example, for debugging purposes, a large program
may have within it certain segments that are executed only if a variable debug is
1. In C, the source code might look like:

35
#define debug 0
….

If ( debug ) {
Print debugging information

Flows-Of-Control Optimizations:

The unnecessary jumps can be eliminated in either the intermediate code

or the target code by the following types of peephole optimizations. We can
replace the jump sequence

goto L1
….

L1: gotoL2 (d)

by the sequence
goto L2
….

L1: goto L2

Algebraic Simplification:

There is no end to the amount of algebraic simplification that can be

attempted through peephole optimization. Only a few algebraic identities occur
frequently enough that it is worth considering implementing them. For example,
statements such as
x := x+0 or
x := x * 1

are often produced by straightforward intermediate code-generation algorithms,

and they can be eliminated easily through peephole optimization.

Use of Machine Idioms:

The target machine may have hardware instructions to implement certain

specific operations efficiently. For example, some machines have auto-increment
and auto-decrement addressing modes. These add or subtract one from an
36
operand before or after using its value. The use of these modes greatly improves
the quality of code when pushing or popping a stack, as in parameter passing.
These modes can also be used in code for statements like i : =i+1.

i:=i+1 → i++
i:=i-1 → i- -

Basic Block and Flow Graph:

Basic Block is a straight line code sequence that has no branches in and out
branches except to the entry and at the end respectively. Basic Block is a set of
statements that always executes one after other, in a sequence.
The first task is to partition a sequence of three-address code into basic blocks.
A new basic block is begun with the first instruction and instructions are added
until a jump or a label is met. In the absence of a jump, control moves further
consecutively from one instruction to another. The idea is standardized in the
algorithm below:
Algorithm:
Partitioning three-address code into basic blocks.

Rule 1 : Determining the Leader.

Rule 2: Determining the Basic Block

In Rule 1 we have 3 statement:

1. The first three-address instruction of the intermediate code is a

leader.

2. Instructions that are targets of unconditional or conditional

jump/goto statements are leaders.

3. Instructions that immediately follow unconditional or conditional

jump/goto statements are considered leaders.

Intermediate code to set a 10*10 matrix to an identity matrix:

1) i=1 //Leader 1 (First statement)

2) j=1 //Leader 2 (Target of 11th statement)
3) t1 = 10 * i //Leader 3 (Target of 9th statement)

37
4) t2 = t1 + j
5) t3 = 8 * t2
6) t4 = t3 - 88
7) a[t4] = 0.0
8) j = j + 1
9) if j <= goto (3)
10) i = i + 1 //Leader 4 (Immediately following Conditional goto
statement)
11) if i <= 10 goto (2)
12) i = 1 //Leader 5 (Immediately following Conditional goto
statement)
13) t5 = i - 1 //Leader 6 (Target of 17th statement)
14) t6 = 88 * t5
15) a[t6] = 1.0
16) i = i + 1
17) if i <= 10 goto (13)

The given algorithm is used to convert a matrix into identity matrix i.e. a
matrix with all diagonal elements 1 and all other elements as 0.
Steps (3)-(6) are used to make elements 0, step (14) is used to make an
element 1. These steps are used recursively by goto statements.
There are 6 Basic Blocks in the above code :

B1) Statement 1
B2) Statement 2
B3) Statement 3-9
B4) Statement 10-11
B5) Statement 12
B6) Statement 13-17

38
Issues in the design of a code generator:

Code generator converts the intermediate representation of source code

into a form that can be readily executed by the machine. A code generator
is expected to generate the correct code.

The following issue arises during the code generation phase:

• Input to code generator – The input to the code generator is the intermediate
code generated by the front end, along with information in the symbol table
that determines the run-time addresses of the data objects denoted by the
names in the intermediate representation.

• Target program: The target program is the output of the code generator. The
output may be absolute machine language, relocatable machine language, or
assembly language.

• Memory Management – Mapping the names in the source program to the

addresses of data objects is done by the front end and the code generator. A
name in the three address statements refers to the symbol table entry for the
name.

• Instruction selection – Selecting the best instructions will improve the

efficiency of the program. It includes the instructions that should be complete
and uniform. Instruction speeds and machine idioms also play a major role
when efficiency is considered.

Compiler Design 1
No ratings yet
Compiler Design 1
206 pages
CD Notes
No ratings yet
CD Notes
194 pages
Unit 3 Part 1
No ratings yet
Unit 3 Part 1
49 pages
Unit 3
No ratings yet
Unit 3
43 pages
Multimedia Application L4
No ratings yet
Multimedia Application L4
42 pages
Unit 1 CD
No ratings yet
Unit 1 CD
26 pages
Compiler Design Study Material Unit 1
No ratings yet
Compiler Design Study Material Unit 1
26 pages
Unit - 1 Compiler Design
No ratings yet
Unit - 1 Compiler Design
36 pages
All Units
No ratings yet
All Units
19 pages
CD Important Questions With Answers
No ratings yet
CD Important Questions With Answers
34 pages
Sem 5 Major
No ratings yet
Sem 5 Major
25 pages
Compler
No ratings yet
Compler
35 pages
SPCC2 Alok
No ratings yet
SPCC2 Alok
15 pages
Compiler Design
No ratings yet
Compiler Design
16 pages
66fe65b5746f9CCWeek 02lecture03
No ratings yet
66fe65b5746f9CCWeek 02lecture03
47 pages
CD Unit 2 RV
No ratings yet
CD Unit 2 RV
21 pages
CD Model Set-3 Answer Key
No ratings yet
CD Model Set-3 Answer Key
29 pages
SCS13033
No ratings yet
SCS13033
121 pages
1 QP
No ratings yet
1 QP
31 pages
Compailer Design Assignment
No ratings yet
Compailer Design Assignment
14 pages
Compiler Designnotes
No ratings yet
Compiler Designnotes
18 pages
Compiler Design Lab
No ratings yet
Compiler Design Lab
40 pages
CD Farre
No ratings yet
CD Farre
13 pages
Demonstrate The Phases of A Compiler With Example
No ratings yet
Demonstrate The Phases of A Compiler With Example
16 pages
Compiler Design Solved Question Paper
No ratings yet
Compiler Design Solved Question Paper
20 pages
Compilerchapter 4
No ratings yet
Compilerchapter 4
26 pages
Unit 2. The Phases of A Compiler
No ratings yet
Unit 2. The Phases of A Compiler
23 pages
Compiler 2 PDF
No ratings yet
Compiler 2 PDF
43 pages
Compiler 2
No ratings yet
Compiler 2
45 pages
CSC 409 Note 2
No ratings yet
CSC 409 Note 2
12 pages
Name: Gapkwi S. Reuel REG NO: U21DLCS10193 Course: Cosc 408: A. What Is Analytic Grammar?
No ratings yet
Name: Gapkwi S. Reuel REG NO: U21DLCS10193 Course: Cosc 408: A. What Is Analytic Grammar?
8 pages
Additional Note CSC 409
No ratings yet
Additional Note CSC 409
11 pages
SDT Material
No ratings yet
SDT Material
30 pages
Recap: Mooly Sagiv
No ratings yet
Recap: Mooly Sagiv
42 pages
MID 1 FLCD (Part 1)
No ratings yet
MID 1 FLCD (Part 1)
6 pages
CS3501 Compiler Design
No ratings yet
CS3501 Compiler Design
13 pages
Compiler Theory: (A Simple Syntax-Directed Translator)
No ratings yet
Compiler Theory: (A Simple Syntax-Directed Translator)
50 pages
Syntax Analysis Parsing
No ratings yet
Syntax Analysis Parsing
9 pages
Interview Questions Part 1
100% (1)
Interview Questions Part 1
123 pages
Complier Construction (Final)
No ratings yet
Complier Construction (Final)
8 pages
Compiler Construction Final
No ratings yet
Compiler Construction Final
6 pages
Compiler Design: Instructor: Mohammed O. Samara University
100% (1)
Compiler Design: Instructor: Mohammed O. Samara University
28 pages
Cheatsheet Generator
No ratings yet
Cheatsheet Generator
2 pages
Compiler Design
No ratings yet
Compiler Design
19 pages
Eex 6335 Tma 1
0% (1)
Eex 6335 Tma 1
12 pages
CSC 318 Class Notes
No ratings yet
CSC 318 Class Notes
21 pages
Principles of Compiler Design
100% (2)
Principles of Compiler Design
35 pages
Chapter 1
No ratings yet
Chapter 1
43 pages
Introduction To Compiling
100% (1)
Introduction To Compiling
26 pages
Overview of Compiler Environment Pass and Phase Phases of Compiler Regular Expression Lexical Analyzer LEX Tool Bootstrapping
No ratings yet
Overview of Compiler Environment Pass and Phase Phases of Compiler Regular Expression Lexical Analyzer LEX Tool Bootstrapping
35 pages
Compiler Design
No ratings yet
Compiler Design
19 pages
Compiler Design: Instructor: Mohammed O. Samara University
No ratings yet
Compiler Design: Instructor: Mohammed O. Samara University
28 pages
UNIT-I Compiler Design - SCS1303: School of Computing Department of Computer Science and Engineering
No ratings yet
UNIT-I Compiler Design - SCS1303: School of Computing Department of Computer Science and Engineering
27 pages
2021 Sec 4 A Maths Prelim Nan Chiau High (Worked Solutions)
No ratings yet
2021 Sec 4 A Maths Prelim Nan Chiau High (Worked Solutions)
37 pages
CD-30 Questions With Solution
No ratings yet
CD-30 Questions With Solution
43 pages
Slides 01 - Compiler Construction - UET CS - Introduction
No ratings yet
Slides 01 - Compiler Construction - UET CS - Introduction
37 pages
6th Sem Cs CD Ct1 11 Solution
No ratings yet
6th Sem Cs CD Ct1 11 Solution
20 pages
Phases of Compiler
No ratings yet
Phases of Compiler
9 pages
Syntax Analysis: Chapter - 4
No ratings yet
Syntax Analysis: Chapter - 4
41 pages
Subject: Compiler Design Code: T141 Unit-I
No ratings yet
Subject: Compiler Design Code: T141 Unit-I
6 pages
Haramaya University
No ratings yet
Haramaya University
29 pages
III Year-V Semester: B.Tech. Computer Science and Engineering 5CS4-02: Compiler Design UNIT-1
100% (1)
III Year-V Semester: B.Tech. Computer Science and Engineering 5CS4-02: Compiler Design UNIT-1
11 pages
FDS - Aids Complete Notes
No ratings yet
FDS - Aids Complete Notes
138 pages
Big o Solutions To Past Questions 1
No ratings yet
Big o Solutions To Past Questions 1
6 pages
DevOps Using Python
No ratings yet
DevOps Using Python
57 pages
Bda Unit 3
No ratings yet
Bda Unit 3
28 pages
BDA Unit-5
No ratings yet
BDA Unit-5
18 pages
DAA Record Niladri Ghoshal RA2011003040003
No ratings yet
DAA Record Niladri Ghoshal RA2011003040003
82 pages
OOP (Python) in DevOps
No ratings yet
OOP (Python) in DevOps
14 pages
Chapter 1 - Concept of Data Type
No ratings yet
Chapter 1 - Concept of Data Type
36 pages
CG Syllabus
No ratings yet
CG Syllabus
2 pages
CAIE-IGCSE-Computer Science - Practical
No ratings yet
CAIE-IGCSE-Computer Science - Practical
18 pages
Bsc20 Java e Content U Sample
No ratings yet
Bsc20 Java e Content U Sample
21 pages
JS Codes
No ratings yet
JS Codes
4 pages
C++ Identifiers, Data Types and Operators
No ratings yet
C++ Identifiers, Data Types and Operators
5 pages
15 Class Design
No ratings yet
15 Class Design
12 pages
Assignment 3 Java
No ratings yet
Assignment 3 Java
19 pages
Ranjith Shaganti Java FullStack
No ratings yet
Ranjith Shaganti Java FullStack
5 pages
محاضرات تطبيقات حاسبة مرحلة رابعة
No ratings yet
محاضرات تطبيقات حاسبة مرحلة رابعة
8 pages
单片机驱动LIS3DH
No ratings yet
单片机驱动LIS3DH
62 pages
10 D CS Paper 1
No ratings yet
10 D CS Paper 1
14 pages
Summer Training Report
No ratings yet
Summer Training Report
53 pages
Year 9CS - Revision Worksheet
No ratings yet
Year 9CS - Revision Worksheet
8 pages
Compiler Da2
No ratings yet
Compiler Da2
14 pages
Palak Agnihotri
No ratings yet
Palak Agnihotri
1 page
A New Approach For Solving Location Routing Problems With
No ratings yet
A New Approach For Solving Location Routing Problems With
4 pages
In The Fig Given Below, The Number of Zeroes of The Polynomial F (X) Is
No ratings yet
In The Fig Given Below, The Number of Zeroes of The Polynomial F (X) Is
4 pages
Exp 5
No ratings yet
Exp 5
3 pages
Practice Task DPD - Answers
No ratings yet
Practice Task DPD - Answers
2 pages
QB JP Unit 1 Unit 2
No ratings yet
QB JP Unit 1 Unit 2
2 pages
Report File Programs (1 To 15) Xii (CS) (2025-26)
No ratings yet
Report File Programs (1 To 15) Xii (CS) (2025-26)
1 page
Foundations of Computer Security: Lecture 40: Substitution Ciphers
No ratings yet
Foundations of Computer Security: Lecture 40: Substitution Ciphers
9 pages
1BBM01 - Basics of Mathematics
No ratings yet
1BBM01 - Basics of Mathematics
3 pages
CSCI 2400 - Exam 4
No ratings yet
CSCI 2400 - Exam 4
2 pages
Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)

Complier Design Documentation

Uploaded by

Complier Design Documentation

Uploaded by

Compiler Design

Steps for Language processing systems

MOV R1, Id3

Syntax directed translation

After removing left recursion, left factoring

Computation of FIRST & FOLLOW

Shift Reduce Parser is a type of Bottom-Up Parser. It generates the

• It uses a stack and an input buffer.

• If a state is going to some other state on a terminal then it

Steps for constructing the SLR parsing table :

STEP1 – Find augmented grammar

STEP2 – Find LR(0) collection of items

The terminals of this grammar are {a,b}.

Steps for constructing CLR parsing table :

STEP 1 – Find augmented grammar

STEP 2 – Find LR(1) collection of items

LALR (1) Parsing:

Steps for constructing the LALR parsing table :

STEP1- Find augmented grammar

STEP2 – Find LR(1) collection of items

• I3 and I6 are similar except their lookaheads.

Three address code is a type of intermediate code which is easy to generate

Example – Consider expression a = b * – c + b * – c.

The three address code is:

There are 3 representations of three address code namely

• Easy to rearrange code for global optimization.

• Contain lot of temporaries.

The three address code is:

This representation doesn’t make use of extra temporary variable to

• Temporaries are implicit and difficult to rearrange code.

This representation makes use of pointer to the listing of all references

Example – Consider expression a = b * – c + b * – c

o Lex is a program that generates lexical analyzer. It is used with YACC

The function of Lex is as follows:

Definitions include declarations of constant, variable and regular definitions.

Rules define the statement of form p1 {action1} p2 {action2}....pn {action}.

User subroutines are auxiliary procedures needed by the actions. The

The different ways to allocate memory are:

1. Static storage allocation

Static storage allocation

Stack Storage Allocation

Heap Storage Allocation

Activation record is used to manage the information needed by a single

The diagram below shows the contents of activation records:

Return Value: It is used by calling procedure to return a value to calling

Actual Parameter: It is used by calling procedures to supply parameters to the

Control Link: It points to activation record of the caller.

Temporaries: It stores the value that arises in the evaluation of an expression.

1 The Memory Manager

1 The Memory Manager

• Allocation. When a program requests memory for a variable or object,3 the

• Deallocation. The memory manager returns deallocated space to the pool of

Characteristics of peephole optimizations:

Redundant Loads And Stores:

If we see the instructions sequence

Another opportunity for peephole optimizations is the removal of

The unnecessary jumps can be eliminated in either the intermediate code

L1: gotoL2 (d)

There is no end to the amount of algebraic simplification that can be

are often produced by straightforward intermediate code-generation algorithms,

Use of Machine Idioms:

The target machine may have hardware instructions to implement certain

Basic Block and Flow Graph:

Rule 1 : Determining the Leader.

In Rule 1 we have 3 statement:

1. The first three-address instruction of the intermediate code is a

2. Instructions that are targets of unconditional or conditional

3. Instructions that immediately follow unconditional or conditional

Intermediate code to set a 10*10 matrix to an identity matrix:

1) i=1 //Leader 1 (First statement)

Code generator converts the intermediate representation of source code

The following issue arises during the code generation phase:

• Memory Management – Mapping the names in the source program to the

• Instruction selection – Selecting the best instructions will improve the

You might also like