0% found this document useful (0 votes)

52 views8 pages

Unit-1: Introduction To Compilers

The document discusses the structure and phases of compilers. Compilers translate programs written in a high-level language into machine-readable object code. The major phases are: 1) Lexical analysis breaks the source code into individual tokens. 2) Syntax analysis validates the syntax and generates a parse tree. 3) Semantic analysis validates meanings and generates symbol tables. 4) Intermediate code generation converts the parse trees into an intermediate representation for further optimization and translation to machine code.

Uploaded by

Ashok Madaan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

52 views8 pages

Unit-1: Introduction To Compilers

Uploaded by

Ashok Madaan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Compiler Design

Unit-1: Introduction to Compilers

1.1 Compilers and Translators
If you are unable to speak French and yet wish to communicate a French
speaker then you need someone to translate English into French. The same
happens with computers languages. We would like to communicate in
English but the computer only understands binary - so a translator is
required. Thus the basic function of a translator is to convert a SOURCE (or
original) program into an Object (or binary) program.

There are three main categories of translator:

• Assemblers,
• Interpreters and
• Compilers.

Under normal circumstances (ie unless great speed or compactness is

required) the source program will be written in a high level language which is
either interpreted or compiled.

A translator can be defined as: “A device that changes a sentence from one
language to another without change of meaning.”

1.1.1 Assemblers

In the early days of programming, machine code (binary) was the only option.
Unfortunately this was laborious, prone to error and difficult. A slight
improvement on this was the use of hexadecimal or octal which reduced the
number of errors and the time to enter the program. Eventually assembly
languages were developed which were easier and more productive to use
whilst preserving the speed and compactness of machine code.

Assembly languages vary from one type of computer to another (or more
correctly from processor to processor) which results in a difficulty in
transporting programs from one computer to another.

Assemblers are the simplest of all the translators to understand since the
majority of the statements in the source code are mnemonics (short words
that help you to remember something) representing specific binary patterns -
the others being labels, directives (or pseudo-ops) which give instructions to
the assembler.

So, for instance, rather than enter the binary pattern 01011100 which might
mean “Increment the contents of the Accumulator by 1” we could type in the
mnemonic “INC” which the assembler would translate into the appropriate
binary pattern.

1
Compiler Design

1.1.2 Interpreters

An interpreter is another common kind of language processor. Instead of

producing a target program as a translation, an interpreter appears to
directly execute the operations specified in the source program on inputs
supplied by the user, as shown in Fig.

Source program Output

Interpreter
Input

1.1.3 Compilers

A compiler is a program that can read a program in one language - the source
language - and translate it into an equivalent program in another language -
the target language. An important role of the compiler is to report any errors
in the source program that it detects during the translation process.
Source Program Input

Target
Compiler Program

Target Program Output

If the target program is an executable machine-language program, it can then

be called by the user to process inputs and produce outputs.

1.2 Need of Translators

A computer system can understand only a machine language. With machine
language, the user must communicate directly with computer in terms of
bits, registers, and very primitive machine operations. Since a machine
language program is nothing more than a sequence of 0’s and 1’s,
programming a complex algorithm in such a language is terribly tedious and
fraught with opportunities for mistakes. Perhaps the most serious
disadvantage of machine – language coding is that all operations and
operands must be specified in a numeric code.

2
Compiler Design

So it is required to convert the machine language into user – understandable

language and user – understandable language into machine language.

A computer cannot execute a program written in assembly language. That

program has to be first translated into machine language, which the
computer can understand.

Thus arises the need of translators.

1.3 Structure of Compilers

Mapping of a source program into a semantically equivalent target program is
divided into two parts: analysis and synthesis.

Character stream

Lexical Analysis
Token Stream

Syntax Analysis

Syntax Tree
Semantic Analysis
Syntax Tree
Symbol Intermediate Code Generator
Table
Intermediate Representation

Machine Independent Code - Optimizer

Intermediate Representation
Code Generator

Target Machine Code

Machine Dependent Code - Optimizer

Target Machine Code

Phases of a Compiler

3
Compiler Design

The analysis part breaks up the source program into constituent pieces and
imposes a grammatical structure on them. It then uses this structure to
create an intermediate representation of the source program. The analysis
part also collects information about the source program and stores it in a
data structure called a symbol table, which is passed along with the
intermediate representation to the synthesis part.
The synthesis part constructs the desired target program from the
intermediate representation and the information in the symbol table.

The analysis part is often called the front end of the compiler; the synthesis
part is the back end.

Compiler operates as a sequence of phases, each of which transforms one

representation of the source program to another. These phases are:

1.3.1 Lexical Analysis

The lexical analyzer is the interface between the source program and the
compiler. It reads the source program one character at a time, carving the
source program into a sequence of atomic units called tokens. In other words,
the main function of the lexical analyzer is to determine the tokens, that may
be identifiers, keywords, constants, operations, and punctuation symbols
such as commas and parentheses.

For example, suppose a source program contains the following statement:

IF ( 5 eq MAX ) GOTO 100

The characters in this statement are mapped into the following eight tokens
passed on to the syntax analyzer:

“IF”, “(”, “5”, “eq”, “MAX”, “)”, “GOTO”, and “100”

Blanks separating the lexemes would be discarded by the lexical analyzer.

1.3.2 Syntax Analysis

The second phase of the compiler is syntax analysis or parsing. A parser has
two functions as:

(i) To check whether the tokens occurring in the input are permitted by
the specification of the source language, and
(ii) To give the sequence of tokens, a tree like structure also called as
parse tree.

The parser uses the first components of the tokens produced by the lexical
analyzer to create a tree-like intermediate representation that depicts the
grammatical structure of the token stream. A typical representation is a

4
Compiler Design

syntax tree in which each interior node represents an operation and the
children of the node represent the arguments of the operation.

For example, consider an operation

A/B*C
Parser will generate two parse trees for the statement as:

/ *

A * / C

B C A B

(a) (b)

PARSER checks whether the output of lexical analyzer satisfies the context
free grammar (CFG).

1.3.3 Semantic Analysis

The semantic analyzer uses the syntax tree and the information in the symbol
table to check the source program for semantic consistency with the language
definition. It also gathers type information and saves it in either the syntax
tree or the symbol table, for subsequent use during intermediate-code
generation. An important part of semantic analysis is type checking, where
the compiler checks that each operator has matching operands. For example,
many programming language definitions require an array index to be an
integer; the compiler must report an error if a floating-point number is used
to index an array.

1.3.4 Intermediate Code Generation

In the process of translating a source program into target code, a compiler

may construct one or more intermediate representations, which can have a
variety of forms. Syntax trees are a form of intermediate representation; they
are commonly used during syntax and semantic analysis.

For example, the parse tree generated for the statement: A/B*C
/ *

A * / C

B C A B
(a) (b)

5
Compiler Design

T1 = B * C T1 = A/B
T2 = T1/A T2 = T1*C

(a) (b)

1.3.5 Code Optimization

The machine-independent code-optimization phase attempts to improve the

intermediate code so that better target code will result. Usually better means
faster, but other objectives may be desired, such as shorter code, or target
code that consumes less power.

Code optimization can be done in two ways:

Local Optimization: There are local transformations that can be applied to a

program to attempt an improvement.

For example, consider the following statements:

if A > B GOTO L2
GOTO L3
L2:
----------
----------
L3:
----------
----------
This sequence could be replaced by the single statement

if A <=B GOTO L3
L3:
----------
----------
Loop Optimization: A typical improvement is to move a computation that
produces the same result, each time around the loop to a point in the
program just before the loop is entered.

1.3.6 Code Generation

The code generator takes as input an intermediate representation of the

source program and maps it into the target language. If the target language is
machine code, registers or memory locations are selected for each of the
variables used by the program. Then, the intermediate instructions are
translated into sequences of machine instructions that perform the same
task. A crucial aspect of code generation is the judicious assignment of
registers to hold variables.

6
Compiler Design

For example, A statement A = B + C is mapped into the target language code

sequence as:
LOAD B
ADD C
STORE A

1.3.7 Symbol Table Management (BOOK KEEPING)

A compiler need to collect information about all the data objects that appear
in the source program. The information is collected by early phases of the
compiler – lexical and syntactic analysis – and entered into the symbol table.
The symbol table is a data structure containing a record for each variable
name, with fields for the attributes of the name. The data structure should be
designed to allow the compiler to find the record for each name quickly and to
store or retrieve data from that record quickly.

1.3.8 Grouping of Phases into Phases

Grouping of phases deals with the logical organization of a compiler. In an

implementation, activities from several phases may be grouped together into
a pass that reads an input file and writes an output file. For example, the
front-end phases of lexical analysis, syntax analysis, semantic analysis, and
intermediate code generation might be grouped together into one pass. Code
optimization might be an optional pass. Then there could be a back-end pass
consisting of code generation for a particular target machine.

1.4 Compiler Construction Tools

The compiler writer, like any software developer, can profitably use modern
software development environments containing tools such as language
editors, debuggers, version managers, profilers, test harnesses, and so on. In
addition to these general software-development tools, other more specialized
tools have been created to help implement various phases of a compiler.

The most successful tools are those that hide the details of the generation
algorithm and produce components that can be easily integrated into the
remainder of the compiler. Some commonly used compiler-construction tools
include:

1. Parser generators that automatically produce syntax analyzers from a

grammatical description of a programming language.
2. Scanner generators that produce lexical analyzers from a regular-
expression description of the tokens of a language.
3. Syntax-directed translation engines that produce collections of routines for
walking a parse tree and generating intermediate code.
4. Code-generator generators that produce a code generator from a collection
of rules for translating each operation of the intermediate language into
the machine language for a target machine.

7
Compiler Design

5. Data-flow analysis engines that facilitate the gathering of information

about how values are transmitted from one part of a program to each
other part. Data-flow analysis is a key part of code optimization.
6. Compiler-construction toolkits that provide an integrated set of routines
for constructing various phases of a compiler

1.4. Compiler Vs Interpreters

Compiler and Interpreter can be distinguished as follows:

Compiler Interpreter
1. Spends a lot of time analyzing and 1. Relatively little time is spent
processing the program. analyzing and processing the
program.
2. The resulting executable is some 2. The resulting code is some sort of
form of machine – specific binary intermediate code.
code.
3. The computer hardware interprets 3 The resulting code is interpreted
(executes) the resulting code. by another program.

4 Program execution is fast. 4. Program execution is relatively

slow.

Representation of Differences

Compiler Design Note1
No ratings yet
Compiler Design Note1
111 pages
Assembly Language
100% (2)
Assembly Language
132 pages
Introduction To Computing and Problem Solving Using Python 1nbsped 9352602587 9789352602582
100% (1)
Introduction To Computing and Problem Solving Using Python 1nbsped 9352602587 9789352602582
336 pages
Programming and Logic Slide 1
No ratings yet
Programming and Logic Slide 1
39 pages
Compiler Design Chapter-1
No ratings yet
Compiler Design Chapter-1
41 pages
FlexSim 17.1.0 Manual
No ratings yet
FlexSim 17.1.0 Manual
1,591 pages
Unit 1
No ratings yet
Unit 1
37 pages
Computer Organization & Architecture MCQs and Answers
No ratings yet
Computer Organization & Architecture MCQs and Answers
33 pages
Chapter 2 Problem Solving
100% (2)
Chapter 2 Problem Solving
110 pages
Memory Segmentation, Generating Memory Address: Mustafa Shakir
No ratings yet
Memory Segmentation, Generating Memory Address: Mustafa Shakir
94 pages
Compiler Design Short Notes
No ratings yet
Compiler Design Short Notes
133 pages
Quick Book of Compiler
100% (1)
Quick Book of Compiler
66 pages
Compiler Design: - Language Processor - Language Processing System - Phases of Compiler
No ratings yet
Compiler Design: - Language Processor - Language Processing System - Phases of Compiler
11 pages
Compiler Design
No ratings yet
Compiler Design
47 pages
Automata Theory and Compiler Design (AT&CD) Vtu Sce 5th Sem 21cs51
No ratings yet
Automata Theory and Compiler Design (AT&CD) Vtu Sce 5th Sem 21cs51
12 pages
Computer Programming and Data Structure
No ratings yet
Computer Programming and Data Structure
203 pages
Basic Concepts (2nd Class)
100% (1)
Basic Concepts (2nd Class)
22 pages
m433-نظرية المترجمات د عبدالباقي
No ratings yet
m433-نظرية المترجمات د عبدالباقي
146 pages
Lecture Notes of Compiler Design Lab
No ratings yet
Lecture Notes of Compiler Design Lab
170 pages
Assignment-1 Solution July 2019
No ratings yet
Assignment-1 Solution July 2019
5 pages
Computer Packages Notes
No ratings yet
Computer Packages Notes
147 pages
CD Part
No ratings yet
CD Part
159 pages
Compiler Design
100% (2)
Compiler Design
17 pages
COMPUTER PROGRAMMING FOR KIDS: An Easy Step-by-Step Guide For Young Programmers To Learn Coding Skills (2022 Crash Course for Newbies)
From Everand
COMPUTER PROGRAMMING FOR KIDS: An Easy Step-by-Step Guide For Young Programmers To Learn Coding Skills (2022 Crash Course for Newbies)
Dexter Rogers
No ratings yet
Computer Essentails Notes
No ratings yet
Computer Essentails Notes
65 pages
CD Unit 1
No ratings yet
CD Unit 1
63 pages
Assembly Langauge Experiment #1
100% (1)
Assembly Langauge Experiment #1
15 pages
Compiler Notes
No ratings yet
Compiler Notes
68 pages
CD Unit - 1 Lms Notes
No ratings yet
CD Unit - 1 Lms Notes
58 pages
Compiler Design
No ratings yet
Compiler Design
65 pages
AI Lec 1
No ratings yet
AI Lec 1
48 pages
CD Unit I Part I Introduction
No ratings yet
CD Unit I Part I Introduction
67 pages
Principles of Compiler Design: Million G/her
No ratings yet
Principles of Compiler Design: Million G/her
40 pages
Lec#1
No ratings yet
Lec#1
36 pages
Unit 1 Compiler Design
No ratings yet
Unit 1 Compiler Design
70 pages
Compiler 1
No ratings yet
Compiler 1
28 pages
ICT4
No ratings yet
ICT4
51 pages
ComProg Module - M5 Final
No ratings yet
ComProg Module - M5 Final
6 pages
COMPILER - DESIGN Unit 1
No ratings yet
COMPILER - DESIGN Unit 1
25 pages
15CS205J MCQ PDF
No ratings yet
15CS205J MCQ PDF
48 pages
Compiler Design - Quick Guide: Language Processing System
No ratings yet
Compiler Design - Quick Guide: Language Processing System
51 pages
1 Compiler Design Lect1
No ratings yet
1 Compiler Design Lect1
28 pages
CD Unit-I
No ratings yet
CD Unit-I
25 pages
Compiler
No ratings yet
Compiler
17 pages
CD Notes
No ratings yet
CD Notes
69 pages
Introduction To Compilers Complier: Ompiler Source Program Target Program Error Message
No ratings yet
Introduction To Compilers Complier: Ompiler Source Program Target Program Error Message
23 pages
Compiler Design Unit-1
No ratings yet
Compiler Design Unit-1
25 pages
Compiler Construction
No ratings yet
Compiler Construction
63 pages
MIT Unit 2 Notes
No ratings yet
MIT Unit 2 Notes
19 pages
Lecture 1 - Ch1. Introduction To Compiler
No ratings yet
Lecture 1 - Ch1. Introduction To Compiler
29 pages
Compiler Construction Notes
No ratings yet
Compiler Construction Notes
61 pages
CD 1
No ratings yet
CD 1
15 pages
Assembly Language: by - Prof. Prithi K.S
No ratings yet
Assembly Language: by - Prof. Prithi K.S
67 pages
Com 413 Compiler - Notes1-1
No ratings yet
Com 413 Compiler - Notes1-1
6 pages
Complier Design1
No ratings yet
Complier Design1
17 pages
Class Xi Computer Science Chapter 5.unlocked
100% (1)
Class Xi Computer Science Chapter 5.unlocked
6 pages
Unit-1 Notes CD OU
No ratings yet
Unit-1 Notes CD OU
19 pages
Computer Programming
No ratings yet
Computer Programming
57 pages
Programmable Logic Devices (PLDS)
No ratings yet
Programmable Logic Devices (PLDS)
36 pages
LESSON 2 What Is A Computer
No ratings yet
LESSON 2 What Is A Computer
19 pages
Structured Programming: Lecture 02: Introduction To Programming Languages
No ratings yet
Structured Programming: Lecture 02: Introduction To Programming Languages
47 pages
Compiler Design Quick Guide
No ratings yet
Compiler Design Quick Guide
45 pages
Compiler Design and Implementation
No ratings yet
Compiler Design and Implementation
5 pages
Chapter 1 - Introduction
No ratings yet
Chapter 1 - Introduction
13 pages
Assignment 5 CS
No ratings yet
Assignment 5 CS
15 pages
CD Unit 1
No ratings yet
CD Unit 1
11 pages
Chapter 2
No ratings yet
Chapter 2
11 pages
Compiler Design - Introduction
No ratings yet
Compiler Design - Introduction
6 pages
Compiler Unit - 1 PDF
No ratings yet
Compiler Unit - 1 PDF
16 pages
Unit 1
No ratings yet
Unit 1
9 pages
Set 01 Introduction
No ratings yet
Set 01 Introduction
6 pages
Compiler 2021 Module 1
No ratings yet
Compiler 2021 Module 1
15 pages
CD Unit1 Notes
No ratings yet
CD Unit1 Notes
28 pages
CD Finalized Notes
No ratings yet
CD Finalized Notes
6 pages
CD UNIT 1 Chapter 1
No ratings yet
CD UNIT 1 Chapter 1
9 pages
Compiler Design
No ratings yet
Compiler Design
11 pages
CD KCS502 Unit 1 A
No ratings yet
CD KCS502 Unit 1 A
8 pages
Module 1 - Programming Basics and Logic
No ratings yet
Module 1 - Programming Basics and Logic
13 pages
By Getachew Teshome: Addis Ababa University, Department of Electrical and Computer Engineering
No ratings yet
By Getachew Teshome: Addis Ababa University, Department of Electrical and Computer Engineering
23 pages
Parsers
No ratings yet
Parsers
11 pages
Assembly Language:Simple, Short, And Straightforward Way Of Learning Assembly Programming
From Everand
Assembly Language:Simple, Short, And Straightforward Way Of Learning Assembly Programming
Sherwyn Allibang
2/5 (1)
Microcontroller Basics Course
No ratings yet
Microcontroller Basics Course
5 pages
Module - I: Introduction To Compiling: 1.1 Introduction of Language Processing System
No ratings yet
Module - I: Introduction To Compiling: 1.1 Introduction of Language Processing System
7 pages
Principle of Compiler Design: Translator
No ratings yet
Principle of Compiler Design: Translator
20 pages
Syntax Analysis: Role of Parsers
No ratings yet
Syntax Analysis: Role of Parsers
6 pages
Compiler Construction and Phases
No ratings yet
Compiler Construction and Phases
8 pages
Compiler Construction: Language Processing System
No ratings yet
Compiler Construction: Language Processing System
8 pages
Exam 1: CS 447: Computer Organization and Assembly Language Programming Date: 10/18/01 Fall 2001 Jason D. Bakos
No ratings yet
Exam 1: CS 447: Computer Organization and Assembly Language Programming Date: 10/18/01 Fall 2001 Jason D. Bakos
8 pages
LR Parsers (SLR, LALR, and Canonical LR Parser)
No ratings yet
LR Parsers (SLR, LALR, and Canonical LR Parser)
4 pages
Language Processing System:-: Compiler
No ratings yet
Language Processing System:-: Compiler
6 pages
AT&FL Lab 11
No ratings yet
AT&FL Lab 11
6 pages
Null
No ratings yet
Null
1 page
Unit-1 Cao
No ratings yet
Unit-1 Cao
71 pages
Code Beneath the Surface: Mastering Assembly Programming
From Everand
Code Beneath the Surface: Mastering Assembly Programming
Kameron Hussain
No ratings yet

Unit-1: Introduction To Compilers

Uploaded by

Unit-1: Introduction To Compilers

Uploaded by

Compiler Design

Unit-1: Introduction to Compilers

There are three main categories of translator:

Under normal circumstances (ie unless great speed or compactness is

An interpreter is another common kind of language processor. Instead of

Source program Output

Target Program Output

If the target program is an executable machine-language program, it can then

1.2 Need of Translators

So it is required to convert the machine language into user – understandable

A computer cannot execute a program written in assembly language. That

Thus arises the need of translators.

1.3 Structure of Compilers

Machine Independent Code - Optimizer

Target Machine Code

Machine Dependent Code - Optimizer

Target Machine Code

Compiler operates as a sequence of phases, each of which transforms one

1.3.1 Lexical Analysis

For example, suppose a source program contains the following statement:

IF ( 5 eq MAX ) GOTO 100

“IF”, “(”, “5”, “eq”, “MAX”, “)”, “GOTO”, and “100”

Blanks separating the lexemes would be discarded by the lexical analyzer.

1.3.2 Syntax Analysis

For example, consider an operation

1.3.3 Semantic Analysis

1.3.4 Intermediate Code Generation

In the process of translating a source program into target code, a compiler

1.3.5 Code Optimization

The machine-independent code-optimization phase attempts to improve the

Code optimization can be done in two ways:

Local Optimization: There are local transformations that can be applied to a

For example, consider the following statements:

1.3.6 Code Generation

The code generator takes as input an intermediate representation of the

For example, A statement A = B + C is mapped into the target language code

1.3.7 Symbol Table Management (BOOK KEEPING)

1.3.8 Grouping of Phases into Phases

Grouping of phases deals with the logical organization of a compiler. In an

1.4 Compiler Construction Tools

1. Parser generators that automatically produce syntax analyzers from a

5. Data-flow analysis engines that facilitate the gathering of information

1.4. Compiler Vs Interpreters

4 Program execution is fast. 4. Program execution is relatively

You might also like