0% found this document useful (0 votes)

62 views10 pages

Compiler Design - Webview

Uploaded by

Sneha R

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

62 views10 pages

Compiler Design - Webview

Uploaded by

Sneha R

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Compiler Design

For

Computer Science
&
Information Technology

www.thegateacademy.com
✆080-40611000
Syllabus

Syllabus for Compiler Design

Lexical Analysis, Parsing, Syntax-Directed Translation, Runtime Environments, Intermediate Code

Generation.

Previous Year GATE Papers and Analysis

GATE Papers with answer key

thegateacademy.com/gate-papers

Subject wise Weightage Analysis

thegateacademy.com/gate-syllabus

[email protected] ©Copyright reserved. Web:www.thegateacademy.com

Contents

Contents
Chapters Page No.
#1. Introduction to Compilers
 Compilers
 Analysis of the Source Program
 The Phases of a Compiler
 Lexical Analyzer
 Specification of Tokens

#2. Parsing
 Syntax Analysis
 The Role of the Parser
 Context-Free Grammar
 Writing a Grammar
 Top-Down Parsing
 Bottom-Up Parsing
 Operator-Precedence Parsing
 LR Parsers
 Parser Generators

#3. Syntax Directed Translation

 Syntax Directed Translation
 Syntax Directed Definitions (SD-Definitions)
 Construction of Syntax Trees
 Bottom-Up Evaluation of S-Attributed Definition
 L-Attributed Definitions
 Top-Down Translation
 Bottom-Up Evaluation of Inherited Attributes
 Run Time Environment

#4. Intermediate Code Generation

 Intermediate Code Generation
 Intermediate Language
 Issues in the Design of a Code Generator
 Target Machine
 Code Optimization
 The Principal Sources of Optimization

Reference Books

[email protected] ©Copyright reserved. Web:www.thegateacademy.com i

“The will to do springs from the

11
knowledge that we can do."
…. James Allen
CHAPTER

Introduction to Compilers

Learning Objectives
After reading this chapter, you will know:
1. Compiler
2. Analysis of the Source Program
3. The Phases of a Compiler
4. Lexical Analyzer
5. Specification of Tokens

Compiler
A compiler is a program that reads a program written in one language – the source language – and
translates it into an equivalent program in another language – the target language (see Fig. Shows
below. As an important part of this translation process, the compiler reports to its user the presence
of errors in the source program.

Source Compiler Target

Program Program

Error
Messages
A Compiler

Compilers are sometimes classified as single pass multi-pass, load and go, debugging or optimising
depending on how they have been constructed or on what function they are supposed to perform.
Despite this apparent complexity, the basic tasks that any compiler must perform are essentially the
same. By understanding these tasks, we can construct compilers for a variety of source languages
and target machines using the same basic techniques.

The Analysis-Synthesis Model of Compilation

There are two parts of compilation: Analysis and Synthesis. The analysis part breaks up the source
program into constituent pieces and creates an intermediate representation of the source program.
The synthesis part constructs the desired target program from the intermediate representation. Out
of the two parts, synthesis requires the most specialized techniques.

[email protected] ©Copyright reserved. Web:www.thegateacademy.com 1

Introduction to Compilers

During analysis, the operations implied by the source program are determined and recorded in a
hierarchical structure called as tree. Often, a special kind of tree called as Syntax tree is used, in
which each node represents an operation and the children of node represent the argument of the
operation. For example, a syntax tree for an assignment statement is shown in below.

position

initial

rate
Syntax Tree for Position: = Initial + Rate 60.

The Context of a Compiler

In addition to a compiler, several other programs may be required to create an executable target
program. A source program may be divided into modules stored in separate files. The task of
collecting the source program is sometimes entrusted to a distinct program, called as Preprocessor.
The preprocessor may also expand shorthands, called macros, into source language statements.

Following below figure shows a typical “compilation.” The target program created by the compiler
may require further processing before it can be run. The compiler in below figure creates assembly
code that is translated by an assembler into machine code and then linked together with some
library routines into the code that actually runs on the machine.
Skeletal Source Program

Preprocessor

Source Program

Compiler

Target Assembly Program

Assembler

Relocatable Machine Code

Loader/Link Editor Library,

Relocatable Object File
Absolute Machine Code
A Language-Processing System
[email protected] ©Copyright reserved. Web:www.thegateacademy.com 2
Introduction to Compilers

Analysis of the Source Program

1. Linear or Lexical analysis, in which stream of characters making up the source program is read
from left-to-right and grouped into tokens that are sequences of characters having a collective
meaning.
2. Hierarchical or Syntax analysis, in which characters or tokens are grouped hierarchically into
nested collections with collective meanings.
3. Semantic analysis, in which certain checks are performed to ensure that the components of a
program fit together meaningfully.

Lexical Analysis
A token is a string of characters, categorized according to the rules as a symbol (e.g. IDENTIFIER,
NUMBER, COMMA, etc.). The process of forming tokens from an input stream of characters is called
tokenization and the lexer categorizes them according to symbol type. A token can look like anything
that is useful for processing an input text stream or text file.
A lexical analyzer generally does nothing with combinations of tokens, a task left for a parser. For
example, a typical lexical analyzer recognizes parenthesis as tokens, but does nothing to ensure that
each '(' is matched with a ')'.
In a compiler, linear analysis is called lexical analysis or scanning. For example, in lexical analysis
the characters in the assignment statement
Position: = initial + rate 60
would be grouped into the following tokens:
1. The identifier position.
2. The assignment symbol : =
3. The identifier initial.
4. The plus sign +
5. The identifier .
6. The multiplication sign
7. The number 60
The blanks separating the characters of these tokens would normally be eliminated during
lexical analysis.

Syntax Analysis
Hierarchical analysis is called parsing or syntax analysis. It involves grouping the tokens of the
source program into grammatical phrases that are used by the compiler to synthesize output.
Usually, the grammatical phrases of the source program are represented by a parse tree such as the
one shown in figure below.

[email protected] ©Copyright reserved. Web:www.thegateacademy.com 3

Introduction to Compilers

Assignment
Statement

Identifier Expression

Position
Expression Expression

Identifier
Expression
Expression
Initial
Identifier
Number

Rate
60

Parse Tree for Position: = Initial + Rate 60

In the expression initial + rate 60, the phrase rate 60 is a logical unit because the usual
conventions of arithmetic expressions tell us that multiplication is performed before addition.
Because the expression initial + rate is followed by a it is not grouped into a single phrase by
itself in Fig. above.
The hierarchical structure of a program is usually expressed by recursive rules. For example, we
might have the following rules as part of the definition of expressions:
1. Any identifier is an expression.
2. Any number is an expression.
3. If expression and expression are expressions, then so are
expression expression
expression expression
(expression )
Rules (1) and (2) are non-recursive basic rules, while (3) defines expressions in terms of
operators applied to other expressions. Thus, by rule (1), initial and rate are expressions. By
rule (2), 60 is an expression, while by rule (3), we can first infer that rate 60 is an expression
and finally that initial + rate 60 is an expression.

[email protected] ©Copyright reserved. Web:www.thegateacademy.com 4

Introduction to Compilers

Position Position

Initial Initial

Rate Rate int to real

(a) (b)

Semantic Analysis Inserts a Conversion from Integer to Real

The parse tree for position describes the syntactic structure of the input. A more common internal
representation of this syntactic structure is given by the syntax in Fig. above (a). A syntax tree is a
compressed representation of the parse tree in which the operators appear as the interior nodes,
and the operands of an operator are the children of the node for that operator.

Semantic Analysis
The semantic analysis phase checks the source program for semantic errors and gathers type
information for the subsequent code-generation phase. It uses the hierarchical structure determined
by the syntax-analysis phase to identify the operators and operands of expressions and statements.
An important component of semantic analysis is type checking.
Here the compiler checks that each operator has operands that are permitted by the source
language specification. For example, many programming language definitions require a compiler to
report an error every time a real number is used to index an array. However, the language
specification may permit some operand corrections, for example, when binary arithmetic operator
is applied to an integer and real. In this case, the compiler may need to convert the integer to a real.

The Phases of a Compiler

Conceptually, a compiler operates in phases, each of which transforms the source program from one
representation to another. A typical decomposition of a compiler is shown in Fig. below. In practice,
some of the phases may be grouped together, and the intermediate representations between the
grouped phases need not be explicitly constructed.

Introduction to Compilers

Source Program

Lexical Analyzer

Syntax Analyzer

Semantic Analyzer

Symbol Table Error

Intermediate Code Generator
Manager Handler

Code Optimizer

Target Code Generator

Target Program
Phases of a Compiler

The first three forming the bulk of the analysis portion of a compiler, were introduced in the last
section. Two other activities, symbol-table management and error handling, are shown interacting
with the six phases of compilation lexical analysis, syntax analysis, semantic analysis, intermediate
code generation, code optimization, and code generation. Informally, we shall also call the symbol-
table manager and the error handler “phases.”
The 6 phases divided into 2 Groups
1. Front End: Depends on stream of tokens and parse tree
2. Back End: Dependent on Target, Independent of source code

Symbol-Table Management
A symbol table is a data structure containing a record for each identifier, with fields for the
attributes of the identifier. The data structure allows us to find the record for each identifier quickly
and to store or retrieve data from that record quickly.
Symbol table is a Data Structure in a Compiler used for Managing information about variables &
their attributes.

Error Detection and Reporting

Each phase can encounter errors. However, after detecting an error, a phase must somehow deal
with that error, so that compilation can proceed, allowing further errors in the source program to be
detected. A compiler that stops when it finds the first error is not as helpful as it could be.
The syntax and semantic analysis phases usually handle a large fraction of the errors detectable by
the compiler. The lexical phase can detect errors where the characters remaining in the input do not
form any token of the language. Errors where the token stream violates the structure rules (syntax)
of the language are determined by the syntax analysis phase.

Introduction to Compilers

The Analysis Phases

As translation progresses, the compiler’s internal representation of the source program changes. We
illustrate these representations by considering the translation of the statement.
Position: = initial + rate 60 -------------- (1.1)

Lexical Analyzer
1. The lexical analysis phase reads the characters in the source program and groups them into a
stream of tokens in which each token represents a logically cohesive sequence of characters,
such as an identifier, a keyword (if, while, etc.), a punctuation character, or a multi-character
operator like : = .The character sequence forming a token is called the lexeme for the token.
Certain tokens will be augmented by a “lexical value”. For example, when an identifier like
rate is found, the lexical analyzer not only generates a token, say id, but also enters the lexeme
rate into the symbol table, if it is not already there. The lexical value associated with this
occurrence of id points to the symbol-table entry for rate.
In this section, we shall use id1, id2, and id3 for position, initial, and rate, respectively, to
emphasize that the internal representation of an identifier is different from the character
sequence forming the identifier. The representation of assignment statement (1.1) after
lexical analysis is therefore suggested by:
id1 = id2 + id3 60 -------------- (1.2)

Syntax Analysis Phase

1. The syntax Analysis Phase: The syntax analysis phase imposes a hierarchical structure on the
token stream, shown below

id
2. The Semantic Analysis Phase: During the semantic analysis, it is considered that in our
example all identifiers have been declared to be reals and that 60 by itself is assumed to be an
integer. Type checking of syntax tree reveals that is applied to a real rate and an integer, 60.
The general approach is to convert the integer into a real. This has been achieved by creating
an integer into a real

id int to real

Fitnessappmarketingplan 170116233843 PDF
No ratings yet
Fitnessappmarketingplan 170116233843 PDF
40 pages
Cse Module 8
No ratings yet
Cse Module 8
32 pages
Part - 9: Complier Design: 9.1 Introduction To Compilers
No ratings yet
Part - 9: Complier Design: 9.1 Introduction To Compilers
6 pages
Part - 9: Complier Design: 9.1 Introduction To Compilers
No ratings yet
Part - 9: Complier Design: 9.1 Introduction To Compilers
6 pages
CD Unit 1
No ratings yet
CD Unit 1
63 pages
Compiler Design: Computer Science
No ratings yet
Compiler Design: Computer Science
117 pages
System Software Notes
No ratings yet
System Software Notes
81 pages
Compiler Design and Implementation
No ratings yet
Compiler Design and Implementation
5 pages
Compiler Design
No ratings yet
Compiler Design
118 pages
Unit 4 SS
No ratings yet
Unit 4 SS
13 pages
CD Unit - 1 Lms Notes
No ratings yet
CD Unit - 1 Lms Notes
58 pages
Module 1
No ratings yet
Module 1
86 pages
Unit I SRM
100% (1)
Unit I SRM
36 pages
Compiler Design: Instructor: Mohammed O. Samara University
No ratings yet
Compiler Design: Instructor: Mohammed O. Samara University
28 pages
Compiler Design: Instructor: Mohammed O. Samara University
100% (1)
Compiler Design: Instructor: Mohammed O. Samara University
28 pages
Cs133 Group A: Compiler Construction
No ratings yet
Cs133 Group A: Compiler Construction
24 pages
Introduction To Compiling
100% (1)
Introduction To Compiling
26 pages
Compiler Design Module
100% (1)
Compiler Design Module
120 pages
Compiler Design Slide Chapter 1-6
No ratings yet
Compiler Design Slide Chapter 1-6
250 pages
Compiler Unit 1
No ratings yet
Compiler Unit 1
36 pages
Principle of Compiler Design: Translator
No ratings yet
Principle of Compiler Design: Translator
20 pages
Compiler Notes
No ratings yet
Compiler Notes
68 pages
Compiler Design Note1
No ratings yet
Compiler Design Note1
111 pages
Compiler Construction CS-4207 Lecture - 01 - 02: Input Output Target Program
No ratings yet
Compiler Construction CS-4207 Lecture - 01 - 02: Input Output Target Program
8 pages
Lecture 1,2 Introduction
No ratings yet
Lecture 1,2 Introduction
40 pages
CD - 1
No ratings yet
CD - 1
22 pages
UNIT_1_CD
No ratings yet
UNIT_1_CD
17 pages
SCS13033
No ratings yet
SCS13033
121 pages
UNIT-I Compiler Design - SCS1303: School of Computing Department of Computer Science and Engineering
No ratings yet
UNIT-I Compiler Design - SCS1303: School of Computing Department of Computer Science and Engineering
27 pages
1.1 Compilers
No ratings yet
1.1 Compilers
129 pages
Compiler Lec-One
No ratings yet
Compiler Lec-One
46 pages
CSC 320 Notes - 1
No ratings yet
CSC 320 Notes - 1
67 pages
Lec#1
No ratings yet
Lec#1
36 pages
Overview of Compiler Environment Pass and Phase Phases of Compiler Regular Expression Lexical Analyzer LEX Tool Bootstrapping
No ratings yet
Overview of Compiler Environment Pass and Phase Phases of Compiler Regular Expression Lexical Analyzer LEX Tool Bootstrapping
35 pages
Indian Institute of Information Technology, Bhagalpur: Assignment - 1
No ratings yet
Indian Institute of Information Technology, Bhagalpur: Assignment - 1
26 pages
Introduction To Compilation
No ratings yet
Introduction To Compilation
33 pages
Module-1 1
No ratings yet
Module-1 1
53 pages
Compiler Design
No ratings yet
Compiler Design
47 pages
Compier Design - Unit I
No ratings yet
Compier Design - Unit I
97 pages
Lec00 Outline
No ratings yet
Lec00 Outline
27 pages
Compiler Design Mod 1
No ratings yet
Compiler Design Mod 1
75 pages
Intro To Compilers
No ratings yet
Intro To Compilers
77 pages
Unit 1
No ratings yet
Unit 1
109 pages
Unit 1
No ratings yet
Unit 1
37 pages
Unit 1
No ratings yet
Unit 1
29 pages
Unit 1
No ratings yet
Unit 1
29 pages
Unit 1 Compiler Design
No ratings yet
Unit 1 Compiler Design
124 pages
CD - Module 1
No ratings yet
CD - Module 1
22 pages
Automata Theory and Compiler Design
No ratings yet
Automata Theory and Compiler Design
55 pages
Chapter 1
No ratings yet
Chapter 1
43 pages
Compiler Design Chapter-1
No ratings yet
Compiler Design Chapter-1
41 pages
SSCDNotes PDF
100% (1)
SSCDNotes PDF
53 pages
Bedasa
No ratings yet
Bedasa
31 pages
Compiler Notes 1
No ratings yet
Compiler Notes 1
92 pages
CD Lec1
No ratings yet
CD Lec1
42 pages
CD - Unit 1
No ratings yet
CD - Unit 1
46 pages
Compiler Design - Module 1-Notes
No ratings yet
Compiler Design - Module 1-Notes
74 pages
Compiler Design
No ratings yet
Compiler Design
11 pages
COMPUTER PROGRAMMING FOR KIDS: An Easy Step-by-Step Guide For Young Programmers To Learn Coding Skills (2022 Crash Course for Newbies)
From Everand
COMPUTER PROGRAMMING FOR KIDS: An Easy Step-by-Step Guide For Young Programmers To Learn Coding Skills (2022 Crash Course for Newbies)
Dexter Rogers
No ratings yet
The 1 Page Python Book
From Everand
The 1 Page Python Book
Barani Kumar
2/5 (1)
Swift Programming Simplified: A Practical Guide with Examples
From Everand
Swift Programming Simplified: A Practical Guide with Examples
William E. Clark
No ratings yet
Kandula Sai Ganesh Automation Testing 2+ Yrs CTS Bengaluru
No ratings yet
Kandula Sai Ganesh Automation Testing 2+ Yrs CTS Bengaluru
1 page
Joel Abraham Automation Testing 2.5yrs CTS Chennai
No ratings yet
Joel Abraham Automation Testing 2.5yrs CTS Chennai
3 pages
Electronics 09 00435 PDF
No ratings yet
Electronics 09 00435 PDF
20 pages
Theory of Computation: Computer Science & Information Technology by
No ratings yet
Theory of Computation: Computer Science & Information Technology by
10 pages
About Project: Python
No ratings yet
About Project: Python
1 page
Discrete Mathematics: Computer Science
No ratings yet
Discrete Mathematics: Computer Science
10 pages
Design and Analysis of Algorithm - Webview
No ratings yet
Design and Analysis of Algorithm - Webview
10 pages
Computer Network - Webview
No ratings yet
Computer Network - Webview
10 pages
IC STMicroelectronics STM32L010F4P6 Eec
No ratings yet
IC STMicroelectronics STM32L010F4P6 Eec
91 pages
Job Portal
No ratings yet
Job Portal
16 pages
PCAN Driver Linux - UserManual
No ratings yet
PCAN Driver Linux - UserManual
67 pages
V3Applcore User Reference Guide
No ratings yet
V3Applcore User Reference Guide
17 pages
Ros2 Brochure LTR Web
No ratings yet
Ros2 Brochure LTR Web
2 pages
SodaPDF-converted-Si5351 RXTX VFO V3
No ratings yet
SodaPDF-converted-Si5351 RXTX VFO V3
12 pages
HX8394 F PDF
No ratings yet
HX8394 F PDF
272 pages
New OSY PPT - Unit 1.1
No ratings yet
New OSY PPT - Unit 1.1
14 pages
SplunkFundamentals1 Module4
100% (1)
SplunkFundamentals1 Module4
8 pages
APC UPS Fault
No ratings yet
APC UPS Fault
2 pages
X-Plane Installer Log
No ratings yet
X-Plane Installer Log
6 pages
Micrex-Sx SPB Leh984c
No ratings yet
Micrex-Sx SPB Leh984c
24 pages
k2000 Simm
No ratings yet
k2000 Simm
2 pages
Python VFP1
No ratings yet
Python VFP1
6 pages
MQTT Communication Protocol (PDFDrive)
No ratings yet
MQTT Communication Protocol (PDFDrive)
97 pages
Data Structures Homework 2 Queues
No ratings yet
Data Structures Homework 2 Queues
2 pages
Computer Studies Grade 4 Term 2 Revision
No ratings yet
Computer Studies Grade 4 Term 2 Revision
4 pages
CS, DSE-2, 2022 5th
No ratings yet
CS, DSE-2, 2022 5th
3 pages
USB Type D
No ratings yet
USB Type D
2 pages
EC2203 Digital Electronics Question Bank
No ratings yet
EC2203 Digital Electronics Question Bank
16 pages
PaperCut MF - HP Pro Fast Release Embedded Manual - 2020-05-15
No ratings yet
PaperCut MF - HP Pro Fast Release Embedded Manual - 2020-05-15
31 pages
Blessing Komponen 11 Desember 2023
No ratings yet
Blessing Komponen 11 Desember 2023
261 pages
Internet Technology & Programming With Java: Maharaja Ganga Singh University Bikaner
No ratings yet
Internet Technology & Programming With Java: Maharaja Ganga Singh University Bikaner
3 pages
Basic Computing Periods
No ratings yet
Basic Computing Periods
32 pages
Release Notes For Catalyst 6500 Series and Cisco 7600 Series Internet Router DFC3A ROMMON Software
No ratings yet
Release Notes For Catalyst 6500 Series and Cisco 7600 Series Internet Router DFC3A ROMMON Software
6 pages
BGC-8088 Microengineer Specification
No ratings yet
BGC-8088 Microengineer Specification
2 pages
Product Manual-MTTEK - 2021
No ratings yet
Product Manual-MTTEK - 2021
46 pages
13 - Implementation of LZW Algorithm For Binary Lossless Data Compression
No ratings yet
13 - Implementation of LZW Algorithm For Binary Lossless Data Compression
6 pages
Ge Fanuc 90 70 PLC Manual
No ratings yet
Ge Fanuc 90 70 PLC Manual
522 pages
Sports Store Management System Documentation
No ratings yet
Sports Store Management System Documentation
24 pages

Compiler Design - Webview

Uploaded by

Compiler Design - Webview

Uploaded by

Compiler Design

Syllabus for Compiler Design

Lexical Analysis, Parsing, Syntax-Directed Translation, Runtime Environments, Intermediate Code

Previous Year GATE Papers and Analysis

GATE Papers with answer key

Subject wise Weightage Analysis

[email protected] ©Copyright reserved. Web:www.thegateacademy.com

#3. Syntax Directed Translation

#4. Intermediate Code Generation

[email protected] ©Copyright reserved. Web:www.thegateacademy.com i

Source Compiler Target

The Analysis-Synthesis Model of Compilation

[email protected] ©Copyright reserved. Web:www.thegateacademy.com 1

The Context of a Compiler

Target Assembly Program

Relocatable Machine Code

Loader/Link Editor Library,

Analysis of the Source Program

[email protected] ©Copyright reserved. Web:www.thegateacademy.com 3

Parse Tree for Position: = Initial + Rate 60

[email protected] ©Copyright reserved. Web:www.thegateacademy.com 4

Rate Rate int to real

Semantic Analysis Inserts a Conversion from Integer to Real

The Phases of a Compiler

[email protected] ©Copyright reserved. Web:www.thegateacademy.com 5

Symbol Table Error

Target Code Generator

Error Detection and Reporting

[email protected] ©Copyright reserved. Web:www.thegateacademy.com 6

The Analysis Phases

Syntax Analysis Phase

[email protected] ©Copyright reserved. Web:www.thegateacademy.com 7

You might also like