0% found this document useful (0 votes)

154 views8 pages

Compiler Notes

The document discusses several topics related to compilers including: 1. Assembly language which uses symbols and abbreviations instead of binary and is slower than machine language. 2. Lexical analysis which identifies tokens by scanning the input string and maintains two pointers in a buffered input. 3. Type checking which verifies types in values at compile-time or run-time to catch errors.

Uploaded by

ABHISHEK KUMAR SAH

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

154 views8 pages

Compiler Notes

Uploaded by

ABHISHEK KUMAR SAH

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 8

Assembly language

is the more than low level and less than high-level

language so it is intermediary language. Assembly languages use numbers,
symbols, and abbreviations instead of 0s and 1s.For example: For addition,
subtraction and multiplications it uses symbols likes Add, sub and Mul, etc.
Execution is slow as compared to machine language.

Pattern is the set of rules that determines whether a given lexeme is a valid token or not.
Token is a sequence of characters that can be treated as a single logical entity.
Lexeme is a sequence of characters in the source program that is matched for a token.
Parse tree: it is a graphical representation of how the start symbol of grammar generates
the string.

Topdown parser cant accept left recursive grammar bcz it will fall in infinite loop, so we
have to remove left recursion. Also it can’t take ambiguous grammar and non deterministic
grammar. It uses leftmost derivation.
Bottom up parser will work on left recursive and non deterministic but not on ambiguous
grammar except operator precedence which will work on any grammar. It uses rightmost
derivation.
To convert ambiguous grammar to unambiguous we have to:
1.ensure that higher precedence operator remains at lower lever.
2.if operator is left associative make grammar left recursive, otherwise right recursive.

If RHS of more than one production starts with the same symbol, then such a grammar is
called as Grammar With Common Prefixes or non deterministic grammar.
 This kind of grammar creates a problematic situation for Top down parsers.
 Top down parsers can’t decide which production must be chosen to parse the string in
hand.
To remove this confusion, we use left factoring
Left factoring is the process to remove common prefix or converting non det. Grammar
to det. Grammar.

Predictive parser is a recursive descent parser, which has the capability to predict
which production is to be used to replace the input string. The predictive parser
does not suffer from backtracking.
To accomplish its tasks, the predictive parser uses a look-ahead pointer, which
points to the next input symbols. To make the parser back-tracking free, the
predictive parser puts some constraints on the grammar and accepts only a class of
grammar known as LL(k) grammar.

Predictive parsing uses a stack and a parsing table to parse the input and generate
a parse tree. Both the stack and the input contains an end symbol $ to denote that
the stack is empty and the input is consumed. The parser refers to the parsing table
to take any decision on the input and stack element combination.

LR PARSER
The LR parser is a non-recursive, shift-reduce, bottom-up parser. It uses a wide
class of context-free grammar which makes it the most efficient syntax analysis
technique. LR parsers are also known as LR(k) parsers, where L stands for left-to-
right scanning of the input stream; R stands for the construction of right-most
derivation in reverse, and k denotes the number of lookahead symbols to make
decisions.
There are three widely used algorithms available for constructing an LR parser:

 SLR(1) – Simple LR Parser:

o Works on smallest class of grammar
o Few number of states, hence very small table
o Simple and fast construction
 LR(1) – LR Parser:
o Works on complete set of LR(1) Grammar
o Generates large table and large number of states
o Slow construction
 LALR(1) – Look-Ahead LR Parser:
o Works on intermediate size of grammar
o Number of states are same as in SLR(1)

A three-address code has at most three address locations to calculate the

expression. A three-address code can be represented in two forms : quadruples
and triples.

Quadruples
Each instruction in quadruples presentation is divided into four fields: operator,
arg1, arg2, and result. The above example is represented below in quadruples
format:

Op arg1 arg2 result

* c d r1

+ b r1 r2

+ r2 r1 r3

= r3 a

Triples
Each instruction in triples presentation has three fields : op, arg1, and arg2.The
results of respective sub-expressions are denoted by the position of expression.
Triples represent similarity with DAG and syntax tree. They are equivalent to DAG
while representing expressions.

Op arg1 arg2

* c d

+ b (0)
+ (1) (0)

= (2)

Triples face the problem of code immovability while optimization, as the results are
positional and changing the order or position of an expression may cause problems.

Indirect Triples
This representation is an enhancement over triples representation. It uses pointers
instead of position to store results. This enables the optimizers to freely re-position
the sub-expression to produce an optimized code.

Symbol table is data structure created and maintained by compilers in order to

store information about the occurrence of various entities such as variable names,
function names, objects, classes, interfaces, etc. Symbol table is used by both the
analysis and the synthesis parts of a compiler.
A symbol table may serve the following purposes depending upon the language in
hand:
 To store the names of all entities in a structured form at one place.
 To verify if a variable has been declared.
 To implement type checking, by verifying assignments and expressions in the
source code are semantically correct.
 To determine the scope of a name (scope resolution).

Implementation
If a compiler is to handle a small amount of data, then the symbol table can be
implemented as an unordered list, which is easy to code, but it is only suitable for
small tables only. A symbol table can be implemented in one of the following ways:

 Linear (sorted or unsorted) list

 Binary Search Tree
 Hash table
INPUT BUFFERING: Lexical Analysis has to access secondary memory each time to
identify tokens. It is time-consuming and costly. So, the input strings are stored into a buffer
and then scanned by Lexical Analysis.
Lexical Analysis scans input string from left to right one character at a time to identify tokens.
It uses two pointers to scan tokens −
 Begin Pointer (bptr) − It points to the beginning of the string to be read.
 Look Ahead Pointer (lptr) − It moves ahead to search for the end of the token.
Example − For statement int a, b;
 Both pointers start at the beginning of the string, which is stored in the buffer.

 Look Ahead Pointer scans buffer until the token is found.

 The character ("blank space") beyond the token ("int") have to be examined before
the token ("int") will be determined.

 After processing token ("int") both pointers will set to the next token ('a'), & this
process will be repeated for the whole program.
A buffer can be divided into two halves. If the look Ahead pointer moves towards halfway in
First Half, the second half is filled with new characters to be read. If the look Ahead pointer
moves towards the right end of the buffer of the second half, the first half will be filled with
new characters, and it goes on.

Sentinels − Sentinels are used to making a check, each time when the forward pointer is
converted, a check is completed to provide that one half of the buffer has not converted off.
If it is completed, then the other half should be reloaded.
Buffer Pairs − A specialized buffering technique can decrease the amount of overhead,
which is needed to process an input character in transferring characters. It includes two
buffers, each includes N-character size which is reloaded alternatively.

Type Checking : Type checking is the process of verifying and enforcing

constraints of types in values. It checks the type of objects and reports a type
error in the case of a violation, and incorrect types are corrected
There are two kinds of type checking:
1. Static Type Checking.(done at compile time)
2. Dynamic Type Checking.(done at run time)
In C, C++, C# and other programming languages, an identifier is a name
that is assigned by the user for a program element such as variable,
type, template, class, function or namespace. It is usually limited to letters,
digits, and underscores. Certain words, such as "new," "int" and "break,"
are reserved keywords and cannot be used as identifiers

Storage allocation techniques:

Static Allocation
It is the simplest allocation scheme in which allocation of data objects is done at
compile time because the size of every data item can be determined by the compiler.

 In static allocation, the compiler can decide the amount of storage needed by each
data object. Thus, it becomes easy for a compiler to identify the address of these
data in the activation record. It is not possible to use variables whose size has to
be determined at run time.
 FORTRAN uses this

Stack Allocation: The stack allocation is a runtime storage management technique.

The activation records are pushed and popped as activations begin and end
respectively.
It can be determined the size of the variables at a run time & hence local variables
can have different storage locations & different values during various activations.
It allows recursive subprograms
If procedure A calls B, and then B calls C, then stack allocation will be

ALGOL language uses this strategy

Heap Storage Allocation

It enables the allocation of memory in a Non-nested design. Storage can be
allocated & freed arbitrarily from an area known as Heap.
Heap Allocation is helpful for executing data whose size varies as the program is
running.
Heap is maintained as a list of free space called free space list.
It creates the problem of fragmentation.

BackPatching: While generating three address codes for the given expression, it can
specify the address of the Label in goto statements. It is very difficult to assign
locations of these label statements in one pass so, two passes are used. In the first
pass, it can leave these addresses unspecified & in the next pass, and it can fill
these addresses. Therefore filling of incomplete transformation is called
Backpatching.

Question - Udvash Unmesh Online V-17.160.8
No ratings yet
Question - Udvash Unmesh Online V-17.160.8
9 pages
English Edge by Santosh Sir (1)
0% (1)
English Edge by Santosh Sir (1)
25 pages
Grammar Bank - Doc 2 T.P INGLES DENIS
100% (1)
Grammar Bank - Doc 2 T.P INGLES DENIS
2 pages
System Programming
100% (2)
System Programming
48 pages
创意写作练习册
100% (2)
创意写作练习册
7 pages
Complier Design SUMMER 2022 PAPER SOLUTION
No ratings yet
Complier Design SUMMER 2022 PAPER SOLUTION
17 pages
Breaking Sentences
No ratings yet
Breaking Sentences
11 pages
Complier Design WINTER 2021 PAPER SOLUTION
No ratings yet
Complier Design WINTER 2021 PAPER SOLUTION
19 pages
Compiler Design - Complete Study Notes
No ratings yet
Compiler Design - Complete Study Notes
14 pages
Compiler 76fddac7 42da 4b0e 985e 8cf8a92cd723
No ratings yet
Compiler 76fddac7 42da 4b0e 985e 8cf8a92cd723
20 pages
CD Final
No ratings yet
CD Final
27 pages
cd
No ratings yet
cd
4 pages
Compler
No ratings yet
Compler
35 pages
Y11 Spanish
No ratings yet
Y11 Spanish
4 pages
CSC 461 Final
No ratings yet
CSC 461 Final
170 pages
FLAT Course Plan
No ratings yet
FLAT Course Plan
10 pages
Tri 3 Grade 6
No ratings yet
Tri 3 Grade 6
9 pages
CD 22-23 Answers
No ratings yet
CD 22-23 Answers
28 pages
Sa Zicem
No ratings yet
Sa Zicem
3 pages
Compiler Sem
No ratings yet
Compiler Sem
8 pages
Demonstrate the Phases of a Compiler With Example
No ratings yet
Demonstrate the Phases of a Compiler With Example
16 pages
Descriptive Text X Utk Siswa-1
No ratings yet
Descriptive Text X Utk Siswa-1
10 pages
CD Final Nnotes for Semester Exam
No ratings yet
CD Final Nnotes for Semester Exam
13 pages
Compiler Constration Solve by Noman Tariq
No ratings yet
Compiler Constration Solve by Noman Tariq
35 pages
ACFrOgAqi55k6LP6EgLvKx21SSVNpjPzzziW4Wq5ayfVkryb OdY7 Z8cwkQdQCe6AyZYd6A25fygOglR7CnYrqBGRHhE4MVScv8tno2CgWmSWMjCqUS9SuWKNpzmZx1BPnGOIqq8y ZpcXLd To
No ratings yet
ACFrOgAqi55k6LP6EgLvKx21SSVNpjPzzziW4Wq5ayfVkryb OdY7 Z8cwkQdQCe6AyZYd6A25fygOglR7CnYrqBGRHhE4MVScv8tno2CgWmSWMjCqUS9SuWKNpzmZx1BPnGOIqq8y ZpcXLd To
16 pages
Cd Solutions
No ratings yet
Cd Solutions
9 pages
.Back Patching Is A Technique To Solve The Problem of Replacing Symbolic Names Into
No ratings yet
.Back Patching Is A Technique To Solve The Problem of Replacing Symbolic Names Into
8 pages
PPL Unit-1
No ratings yet
PPL Unit-1
9 pages
Unit 4 and 5
No ratings yet
Unit 4 and 5
31 pages
compiler notes
No ratings yet
compiler notes
12 pages
Pronoun / Hwät, HWƏT, Wät, WƏT/: What For
No ratings yet
Pronoun / Hwät, HWƏT, Wät, WƏT/: What For
3 pages
PCD UNIT II New
No ratings yet
PCD UNIT II New
63 pages
ALL UNITS
No ratings yet
ALL UNITS
19 pages
CD Question Bank
No ratings yet
CD Question Bank
7 pages
cdsem
No ratings yet
cdsem
14 pages
CD (1) Removed
No ratings yet
CD (1) Removed
1 page
Verbs Tenses
No ratings yet
Verbs Tenses
11 pages
CD Question Bank
100% (1)
CD Question Bank
16 pages
cd aat
No ratings yet
cd aat
8 pages
AS00001155
No ratings yet
AS00001155
28 pages
sem 5 major
No ratings yet
sem 5 major
25 pages
CD Previous QA 2010
No ratings yet
CD Previous QA 2010
64 pages
Test On Unit 2 Full Blast
No ratings yet
Test On Unit 2 Full Blast
2 pages
408
No ratings yet
408
8 pages
Comparative, Superlative and Equality
No ratings yet
Comparative, Superlative and Equality
1 page
Compiler Design KCS5
No ratings yet
Compiler Design KCS5
10 pages
Linking Words v2 SHIT 2
No ratings yet
Linking Words v2 SHIT 2
11 pages
Compiler-Group Assignment
No ratings yet
Compiler-Group Assignment
15 pages
imp
No ratings yet
imp
9 pages
Compiler Design Assignment(1)
No ratings yet
Compiler Design Assignment(1)
12 pages
Cd notes
No ratings yet
Cd notes
194 pages
Grammar Point
No ratings yet
Grammar Point
20 pages
CD Lexical
No ratings yet
CD Lexical
26 pages
Compailer Design Assignment (2)
No ratings yet
Compailer Design Assignment (2)
14 pages
Piapoco and Natural Morphology Theory
No ratings yet
Piapoco and Natural Morphology Theory
21 pages
Visvesvaraya Technological University: Artificial Intelligence & Data Science
No ratings yet
Visvesvaraya Technological University: Artificial Intelligence & Data Science
11 pages
CD Unit3,4
No ratings yet
CD Unit3,4
21 pages
(UPDATED) PBA Manual - Shark Tank Project
No ratings yet
(UPDATED) PBA Manual - Shark Tank Project
29 pages
Chapter 1: The Macro Skills: Listening
No ratings yet
Chapter 1: The Macro Skills: Listening
17 pages
Diagnostic Test in English Iv
No ratings yet
Diagnostic Test in English Iv
4 pages
Must Know+Korean+Slang+Words+&+Phrases
100% (3)
Must Know+Korean+Slang+Words+&+Phrases
103 pages
Compiler Contruction QB PDF
No ratings yet
Compiler Contruction QB PDF
7 pages
Grade 6-2022 Programme Leyla Musaqizi
No ratings yet
Grade 6-2022 Programme Leyla Musaqizi
12 pages
CD-compiler Designe Akash
No ratings yet
CD-compiler Designe Akash
70 pages
cc QB
No ratings yet
cc QB
7 pages
PCC-CS501
No ratings yet
PCC-CS501
10 pages
Subject Verb Agreement Worksheets 1
50% (2)
Subject Verb Agreement Worksheets 1
3 pages
System Programming Question List
No ratings yet
System Programming Question List
9 pages
Day 1 Japanese Grammar
No ratings yet
Day 1 Japanese Grammar
23 pages
Present Simple vs. Present Continuous (Test
No ratings yet
Present Simple vs. Present Continuous (Test
1 page
Lec 03 Syntax Analysis
No ratings yet
Lec 03 Syntax Analysis
19 pages
CD Unitwise Imp Questions
100% (1)
CD Unitwise Imp Questions
5 pages
CD Questions With Answers
100% (1)
CD Questions With Answers
36 pages
Compiler Design Study Material Unit 2nd
No ratings yet
Compiler Design Study Material Unit 2nd
28 pages
ITripleE Style - Trabajo CEMA
No ratings yet
ITripleE Style - Trabajo CEMA
4 pages
Complier Construction (Final)
No ratings yet
Complier Construction (Final)
8 pages
Recap: Mooly Sagiv
No ratings yet
Recap: Mooly Sagiv
42 pages
CS3501 Compiler Design
No ratings yet
CS3501 Compiler Design
13 pages
Language Q4 Reveiwer
No ratings yet
Language Q4 Reveiwer
5 pages
Practise Worksheet Class 7
No ratings yet
Practise Worksheet Class 7
5 pages
7 Cs
No ratings yet
7 Cs
35 pages
Overview of Compiler
No ratings yet
Overview of Compiler
56 pages
GW0_Test_9
No ratings yet
GW0_Test_9
7 pages
CS1601 Important Model III
No ratings yet
CS1601 Important Model III
5 pages
CS1352 May07
No ratings yet
CS1352 May07
19 pages
Engexam - info-CAE Reading and Use of English Practice Test 3
No ratings yet
Engexam - info-CAE Reading and Use of English Practice Test 3
5 pages
Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet

Compiler Notes

Uploaded by

Compiler Notes

Uploaded by

Assembly language

is the more than low level and less than high-level

 SLR(1) – Simple LR Parser:

A three-address code has at most three address locations to calculate the

Op arg1 arg2 result

Symbol table is data structure created and maintained by compilers in order to

 Linear (sorted or unsorted) list

 Look Ahead Pointer scans buffer until the token is found.

Type Checking : Type checking is the process of verifying and enforcing

Storage allocation techniques:

Stack Allocation: The stack allocation is a runtime storage management technique.

ALGOL language uses this strategy

Heap Storage Allocation

You might also like