Compiler Design
May 2024
Addis Ababa,
Ethiopia
Objective of the Course
To learn basic techniques used in compiler construction such as lexical analysis, top-
down and bottom-up parsing, context-sensitive analysis, and intermediate code
generation.
To learn basic data structures used in compiler construction such as abstract syntax trees,
symbol tables, three-address code, and stack machines.
To learn software tools used in compiler construction such as lexical analyzer generators,
and parser generators.
Chapter One:
Introduction to Compiling
What is a Compiler?
A program that reads a program written in one language and translates it into an
equivalent program in another language.
Compiler vs Interpreter
Compiler: translates the entire source program into machine-readable instructions once,
before execution.
Interpreter: translates and executes the source instructions each time the program
is run.
Applications of compiler technology
⚫ Parsers for HTML in web browser
⚫ Machine code generation for high level languages
⚫ Software testing
⚫ Program optimization
⚫ Malicious code detection
⚫ Design of new computer architectures
Cousins of the Compiler
Preprocessor:
⚫ produces input for compiler
⚫ file inclusion, language extension, etc.
Assembler
⚫ assembly language into machine code
⚫ output of an assembler is called an object file
Linker
⚫ links and merges various object files to make an executable file.
⚫ determines the memory locations where the code will be loaded
Loader
⚫ loads executable files into memory and executes them.
⚫ It calculates the size of a program (instructions and data) and creates memory
space for it.
⚫ It initializes various registers to initiate execution.
Cross-Compiler
⚫ a compiler that runs on one platform (A) and generates executable code for another
platform (B).
Source-to-source Compiler
⚫ a compiler that translates the source code of one programming language into another.
Phases of a Compiler
Analysis
⚫ Machine Independent/Language Dependent
Synthesis
⚫ Machine Dependent/Language Independent
Analysis of the Source Program
1. Lexical / Linear Analysis (scanning)
⚫ Scans the source code as a stream of characters
⚫ Represent lexemes in the form of tokens as:
<token-name, attribute-value>
⚫ Token
smallest meaningful element that a compiler understands.
⚫ E.g. identifiers, keywords, literals, operators, and special symbols.
⚫ Blanks, new lines, comments will be removed from the source program.
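As a sketch, the scanning step can be mimicked with a small regex-based tokenizer; the token classes, keyword set, and regular expressions below are illustrative assumptions, not a full lexer.

```python
import re

# Each recognized lexeme is emitted as a <token-name, attribute-value> pair.
TOKEN_SPEC = [
    ("NUMBER", r"\d+"),           # numeric literal
    ("ID",     r"[A-Za-z_]\w*"),  # identifier (or keyword, resolved below)
    ("OP",     r"[+\-*/=]"),      # operator
    ("SKIP",   r"[ \t\n]+"),      # blanks and newlines are discarded
]
KEYWORDS = {"if", "while", "int", "float"}  # assumed toy keyword set

def tokenize(source):
    pattern = "|".join(f"(?P<{name}>{rx})" for name, rx in TOKEN_SPEC)
    tokens = []
    for m in re.finditer(pattern, source):
        kind, lexeme = m.lastgroup, m.group()
        if kind == "SKIP":
            continue              # strip whitespace, as the lexer does
        if kind == "ID" and lexeme in KEYWORDS:
            kind = "KEYWORD"      # keywords are reserved identifiers
        tokens.append((kind, lexeme))
    return tokens

print(tokenize("int x = 42 + y"))
```

A real scanner would also track line numbers and report invalid characters.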
3. Semantic Analysis
Certain checks are performed to make sure that the components of the program fit
together meaningfully.
Unlike parsing, this phase checks for semantic errors in the source program (e.g. type
mismatch)
- Type checking of various programming language constructs is one of the most
important tasks.
Stores type information in the symbol table or the syntax tree.
- Types of variables, function parameters, array dimensions, etc.
5. Code Optimization
Transforms the intermediate code by removing inefficiencies
Improves the code
a. Improvement may be in time, space, or power consumption.
It may change the structure of the program.
6. Code Generation
Converts intermediate code to machine code.
Must handle all aspects of machine architecture
Storage allocation decisions are made
a. Register allocation and assignment
Chapter 2:
Lexical Analysis
What is Lexical Analysis
The first phase of a compiler
The input is a high-level language program
The output is a sequence of tokens
Strips off blanks, tabs, newlines, and comments from the source program
Keeps track of line numbers
Classes of Tokens
⚫ Identifiers: names chosen by the programmer
⚫ Keywords: names already in the programming language
⚫ Separators: punctuation characters
⚫ Operators: symbols that operate on arguments and produce results
⚫ Literals: numeric, textual literals
Chapter 3
Syntax Analysis
Every language has rules for the syntactic structure of well-formed programs.
Takes the stream of tokens from the lexical analyzer and produces a parse tree.
Grammars
Every programming language has grammar rules
Parsers or syntax analyzers are generated for a particular grammar
CFGs are used for the syntax specification of programming languages
Context Free Grammar (CFG)
Is denoted as G = (N, T, P, S), where:
N : finite set of non-terminals
T : finite set of terminals
S ∈ N : the start symbol
P : finite set of productions, each of the form A → α, where A ∈ N and α ∈ (N ∪ T)*
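The four components can be sketched as plain data, here for the toy grammar E → E + E | id; the representation below is illustrative and not from any parser library.

```python
# G = (N, T, P, S) as plain Python data
N = {"E"}                              # non-terminals
T = {"id", "+"}                        # terminals
S = "E"                                # start symbol
P = {"E": [["E", "+", "E"], ["id"]]}   # productions: E -> E + E | id

# Apply one production: rewrite the first (leftmost) occurrence
# of the given non-terminal in a sentential form.
def apply(form, nonterminal, body):
    i = form.index(nonterminal)
    return form[:i] + body + form[i + 1:]

step1 = apply([S], "E", P["E"][0])     # E => E + E
step2 = apply(step1, "E", P["E"][1])   # => id + E
step3 = apply(step2, "E", P["E"][1])   # => id + id
print(step3)
```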
Derivations
Derivation of a terminal string from a non-terminal
A production is applied at each step of the derivation
Example: with the productions E → E + E and E → id, the derivation
E ⇒ E + E ⇒ id + E ⇒ id + id applies E → E + E, E → id, and E → id at steps 1, 2, and 3
respectively; it is read as "E derives id + id".
Derivation Trees
Derivations can be displayed as trees
Internal nodes of the tree are all non-terminals
Leaves are all terminals
The yield of a derivation tree is the list of the labels of all the leaves read from left to
right.
Leftmost and Rightmost Derivations
Leftmost Derivation
⚫ Apply a production only to the leftmost variable at every step
⚫ S → aAS | a | SS
⚫ A → SbA | ba
⚫ S ⇒ aAS ⇒ aSbAS ⇒ aabAS ⇒ aabbaS ⇒ aabbaa
Rightmost Derivation
⚫ Apply production to the rightmost variable at every step
⚫ S ⇒ aAS ⇒ aAa ⇒ aSbAa ⇒ aSbbaa ⇒ aabbaa
Parsing
The process of constructing a parse tree for a sentence generated by a given grammar.
2 types of parsers
⚫ Top down parsing (predictive parsers)
LL(1)
⚫ Bottom up parsing (shift-reduce parsers)
LR(1)
LL(1) Grammar
L – left-to-right scan of the input
L – leftmost derivation
1 – one token of lookahead
First( ) and Follow( )
⚫ First(α) is the set of terminals that can begin a string derived from α; Follow(A) is
the set of terminals that can appear immediately after the non-terminal A.
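One possible sketch of the First( ) computation, for a deliberately small grammar; the grammar, the "eps" marker for the empty string, and the lack of left recursion are illustrative assumptions.

```python
# Toy grammar: S -> A b | c ;  A -> a | eps
productions = {
    "S": [["A", "b"], ["c"]],
    "A": [["a"], ["eps"]],      # "eps" marks the empty string
}
terminals = {"a", "b", "c"}

def first(symbol, prods, terms):
    """Set of terminals that can begin a string derived from symbol."""
    if symbol in terms or symbol == "eps":
        return {symbol}
    result = set()
    for body in prods[symbol]:
        for sym in body:
            f = first(sym, prods, terms)
            result |= f - {"eps"}
            if "eps" not in f:  # this symbol cannot vanish, stop here
                break
        else:
            result.add("eps")   # every symbol in the body can vanish
    return result

print(sorted(first("S", productions, terminals)))
```

Since First(A) contains eps, First(S) also picks up b from the body A b.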
LR Parsing
LR(k) - Left to right scanning with Rightmost derivation in reverse, k being the number
of lookahead tokens.
Types of LR Parsers
LR(0), SLR(1), LALR(1), CLR(1)
LL vs LR
LL: starts with the root non-terminal on the stack; builds the parse tree top-down.
LR: ends with the root non-terminal on the stack; builds the parse tree bottom-up.
Chapter 4
Semantic Analysis
Syntax Directed Translation
Attaching actions to the grammar rules (productions).
Actions are executed during the compilation
⚫ Not during the generation of the compiler
Actions are executed according to the parsing mechanism.
Syntax Directed Definitions
Is a generalization of a context free grammar
Is a CFG with attributes and rules
Attributes are associated with grammar symbols and rules with productions
Attributes may be:
⚫ Numbers
⚫ Types
⚫ Strings etc
Syntax Directed Definition- Example
Production       Semantic Rules
L → E            print(E.val)
E → E1 + T       E.val = E1.val + T.val
E → T            E.val = T.val
T → T1 * F       T.val = T1.val * F.val
T → F            T.val = F.val
F → ( E )        F.val = E.val
F → digit        F.val = digit.lexval
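The semantic rules in this table can be mimicked by a tiny recursive-descent evaluator, where each function returns the .val attribute of its non-terminal; the single-character tokenization and grammar coverage are minimal, illustrative assumptions.

```python
def parse_E(tokens):
    val, tokens = parse_T(tokens)
    while tokens and tokens[0] == "+":
        rhs, tokens = parse_T(tokens[1:])
        val = val + rhs                 # E.val = E1.val + T.val
    return val, tokens                  # E.val = T.val when no "+"

def parse_T(tokens):
    val, tokens = parse_F(tokens)
    while tokens and tokens[0] == "*":
        rhs, tokens = parse_F(tokens[1:])
        val = val * rhs                 # T.val = T1.val * F.val
    return val, tokens                  # T.val = F.val when no "*"

def parse_F(tokens):
    if tokens[0] == "(":
        val, tokens = parse_E(tokens[1:])
        return val, tokens[1:]          # skip ")" ; F.val = E.val
    return int(tokens[0]), tokens[1:]   # F.val = digit.lexval

val, _ = parse_E(list("3*2+1"))
print(val)
```

Note how * binds tighter than + simply because T sits below E in the grammar.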
mknode(op, left, right)
⚫ Creates an operator node with label op &
⚫ Two fields containing pointers to left and right
mkleaf(id, entry)
⚫ Creates an identifier node with label id &
⚫ A field containing entry, ptr to symbol table entry for the identifier
mkleaf(num, val)
⚫ Create a number node with label num &
⚫ A field containing val, the value for the number
Chapter 5
Type Checking
What are Types ?
Types:
⚫ Describe the values computed during the execution of the program
Type Errors:
⚫ Improper or inconsistent operations during program execution
Type-safety:
⚫ Absence of type errors
Type Checking
Semantic checks to enforce the type safety of the program
Semantic Checks
⚫ Static – done during compilation
⚫ Dynamic – done during run-time
Examples
⚫ Unary and binary operators
⚫ Number and type of arguments
⚫ Return statement with return type
⚫ Compatible assignment
Static Checking
The compiler must check the semantic conventions of the source language
Static Checking: ensures that certain kinds of errors are detected and reported
Example
Type Checks: incompatible operands
Flow-of-Control Checks
Uniqueness Checks
Name-Related Checks
Type Checking of Expressions
E → literal { E.type = char }
E → num { E.type = int }
E → id { E.type = lookup(id.entry) }
E → E1 mod E2 { E.type = if E1.type = int and E2.type = int
then int
else type_error }
E → E1[E2] { E.type = if E2.type = int and
E1.type = array(s,t) then t else type_error }
Type Checking of Statements
S → id = E { S.type = if id.type = E.type then
void else type_error }
S → if E then S1 { S.type = if E.type = boolean then
S1.type else type_error }
S → while E do S1 { S.type = if E.type = boolean then
S1.type else type_error }
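A minimal sketch of these type rules; the symbol-table contents and the tuple encoding of expressions and statements are illustrative assumptions.

```python
# Assumed toy symbol table: arrays are encoded as ("array", element_type).
symbol_table = {"i": "int", "b": "boolean", "arr": ("array", "int")}

def type_of_expr(e):
    if isinstance(e, int):
        return "int"                               # E -> num
    if isinstance(e, str):
        return symbol_table.get(e, "type_error")   # E -> id (lookup)
    op, a, b = e
    ta, tb = type_of_expr(a), type_of_expr(b)
    if op == "mod":                                # E -> E1 mod E2
        return "int" if ta == tb == "int" else "type_error"
    if op == "index":                              # E -> E1[E2]
        if tb == "int" and isinstance(ta, tuple) and ta[0] == "array":
            return ta[1]                           # array(s, t) yields t
        return "type_error"
    return "type_error"

def type_of_stmt(s):
    kind = s[0]
    if kind == "assign":                           # S -> id = E
        _, name, e = s
        return "void" if symbol_table[name] == type_of_expr(e) else "type_error"
    if kind in ("if", "while"):                    # S -> if/while E then S1
        _, cond, body = s
        return type_of_stmt(body) if type_of_expr(cond) == "boolean" else "type_error"
    return "type_error"

print(type_of_stmt(("assign", "i", ("mod", 7, 2))))
print(type_of_stmt(("while", "b", ("assign", "i", 1))))
```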
Chapter 6
Three Address Code
Is a generic form and can be implemented as:
⚫ Quadruples
⚫ Triples
⚫ Indirect Triples
⚫ Tree
⚫ DAG
Example: a = b + c * d. (Exercise: a + b * c - d / (b * c) ?)
t1 = c * d
t2 = b + t1
a = t2
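The translation above can be sketched as a post-order walk over an expression tree, emitting one three-address instruction per operator; the tuple encoding of the tree and the temp-naming scheme are illustrative assumptions.

```python
counter = 0
def new_temp():
    """Return a fresh temporary name t1, t2, ..."""
    global counter
    counter += 1
    return f"t{counter}"

def gen(node, code):
    """Emit TAC for node into code; return the name holding its value."""
    if isinstance(node, str):        # a variable or constant leaf
        return node
    op, left, right = node
    l = gen(left, code)              # operands are generated first
    r = gen(right, code)
    t = new_temp()
    code.append(f"{t} = {l} {op} {r}")
    return t

# a = b + c * d, with the expression as ("+", "b", ("*", "c", "d"))
code = []
result = gen(("+", "b", ("*", "c", "d")), code)
code.append(f"a = {result}")
print("\n".join(code))
```

The deeper subtree c * d is emitted first, reproducing the t1, t2 sequence shown above.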
Declarations
Involves allocation of space in memory &
Entry of type and name in symbol table
An offset variable (initially offset = 0) tracks the next free address relative to the
base address
int a; float b;
Allocation process: { offset = 0 }
int a;
id.type = int
id.width = 2
offset = offset + id.width { offset = 2 }
float b;
id.type=float
id.width=4
offset = offset +id.width { offset = 6 }
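The allocation walk above can be sketched as follows, using the widths from the example (int = 2 bytes, float = 4 bytes); the table layout is an illustrative assumption.

```python
WIDTH = {"int": 2, "float": 4}   # widths as used in the example

def allocate(declarations):
    """Assign each declared name a type and an offset from the base address."""
    offset = 0
    table = {}
    for name, ty in declarations:
        table[name] = {"type": ty, "offset": offset}
        offset += WIDTH[ty]      # offset = offset + id.width
    return table, offset         # final offset = total space needed

table, total = allocate([("a", "int"), ("b", "float")])
print(table, total)
```

This reproduces the trace: a at offset 0, b at offset 2, total space 6.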
Chapter 8
Introduction to Code Optimization
Techniques
Common sub-expression elimination
⚫ Eliminates re-computation of an expression that was already computed previously
Strength reduction
⚫ Replacement of expensive expressions with simple ones
Code movement
⚫ Moving a block of code outside a loop
Dead code elimination
⚫ Eliminates code statements that are either never executed or unreachable
Register Allocation
Registers hold values
Example
⚫ a=c+d
⚫ e=a+b
⚫ f=e–1
Assuming a and e are dead (no longer used) after their last use:
the register holding a can be reused for e, and likewise for f
so a, e, and f can all be allocated to one register (r1)
⚫ r1 = r2 + r3
⚫ r1 = r1 + r4
⚫ r1 = r1 – 1
Peephole Optimization
Replacing short instruction sequences within a small sliding window (the peephole)
with faster or shorter equivalents
Common Techniques:
Elimination of redundant loads and stores
⚫ Eg.
⚫ r2 = r1 + 5
⚫ I = r2
⚫ r3 = I
⚫ r4 = r3 * 3
⚫ Here r3 = I reloads a value already in r2, so the last instruction can become r4 = r2 * 3.
Constant folding
⚫ Eg.
⚫ r2 = 3 * 2 (folded to r2 = 6 at compile time)
Constant Propagation
⚫ Eg.
⚫ r1 = 3
⚫ r2 = r1 * 2
Copy Propagation
⚫ Eg.
⚫ r2 = r1
⚫ r3 = r1 + r2
⚫ r2 = 5;
Elimination of useless instructions
⚫ Eg.
⚫ r1 = r1 + 0 and r1 = r1 * 1 have no effect and can be removed
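Two of the rules above, constant folding and elimination of useless instructions, can be sketched as a textual peephole pass; the instruction syntax and patterns are illustrative assumptions, and a real pass would match many more shapes.

```python
import re

def peephole(instrs):
    out = []
    for ins in instrs:
        # Constant folding: x = c1 * c2  ->  x = (c1*c2)
        m = re.fullmatch(r"(\w+) = (\d+) \* (\d+)", ins)
        if m:
            out.append(f"{m.group(1)} = {int(m.group(2)) * int(m.group(3))}")
            continue
        # Useless instruction: x = x + 0 or x = x * 1 is dropped
        m = re.fullmatch(r"(\w+) = (\w+) (\+ 0|\* 1)", ins)
        if m and m.group(1) == m.group(2):
            continue
        out.append(ins)          # anything else passes through unchanged
    return out

print(peephole(["r2 = 3 * 2", "r1 = r1 + 0", "r1 = r1 * 1"]))
```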