Chapter 1
Chapter 1
Prof Chung.
1
Outlines
1.1 Overview and History
1.2 What Do Compilers Do?
1.3 The Structure of a Compiler
1.4 The Syntax and Semantics of Programming
Languages
1.5 Compiler Design and Programming Language
Design
1.7 Computer Architecture and Compiler Design
1.8 Compiler Design Considerations
2
Overview and History (1)
Cause
Software for early computers was written in assembly
language
The benefits of reusing software on different CPUs started
to become significantly greater than the cost of writing a
compiler
3
Overview and History (2)
Compiler technology
is more broadly applicable and has been
employed in rather unexpected areas.
Text-formatting languages,
like nroff and troff; preprocessor packages like eqn, tbl,
pic
Silicon compiler for the creation of VLSI circuits
Command languages of OS
Query languages of Database systems
4
What Do Compilers Do (1)
Programming Machine
Language Compiler Language
(Source) (Target)
5
What Do Compilers Do (2)
Compilers may generate three types of code:
Pure Machine Code
Machine instruction set without assuming the existence
of any operating system or library.
Mostly being OS or embedded applications.
Augmented Machine Code
Code with OS routines and runtime support routines.
More often
Virtual Machine Code
Virtual instructions, can be run on any architecture with
a virtual machine interpreter or a just-in-time compiler
Ex. Java
6
What Do Compilers Do (3)
Another way that compilers
differ from one another is in the format of the
target machine code they generate:
Assembly or other source format
Relocatable binary
Relative address
A linkage step is required
Absolute binary
Absolute address
Can be executed directly
7
The Structure of a Compiler (1)
Analysis Synthesis
Source
Program Tokens SyntacticSemantic
Scanner Parser
(Character StructureRoutines
Stream)
Intermediate
Representation
Source
Program Tokens SyntacticSemantic
Scanner Parser
(Character StructureRoutines
Stream)
Intermediate
Scanner Representation
The scanner begins the analysis of the source
program by reading the input, character by
Symbol and Optimizer
character, and grouping characters into individual
Attribute
words and symbols (tokens)
Tables
RE ( Regular expression )
(Used
NFA ( Non-deterministic by Automata
Finite all )
DFA ( DeterministicPhases of
Finite Automata )
LEX The Compiler) Code
Generator
10
Target machine code
The Structure of a Compiler (4)
Source
Program Tokens SyntacticSemantic
Scanner Parser
(Character StructureRoutines
Stream)
Intermediate
Parser Representation
Given a formal syntax specification (typically as a
context-free grammar [CFG] ), the parse reads
Symbol and Optimizer
tokens and groups them into units as specified by
Attribute
the productions of the CFG being used.
As syntactic structureTables
is recognized, the parser
either calls corresponding semantic routines
(Used by all
directly or builds a syntax tree.
CFG ( Context-Free Phases
Grammarof )
BNF ( Backus-Naur The
Form Compiler)
) Code
GAA ( Grammar Analysis Algorithms ) Generator
LL, LR, SLR, LALR Parsers
11
YACC
Target machine code
The Structure of a Compiler (5)
Source
Program Tokens SyntacticSemantic
Scanner Parser
(Character StructureRoutines
Stream)
Intermediate
Semantic Routines Representation
Perform two functions
Check the static semantics of each construct
Do the actualSymbol and
translation Optimizer
The heart of a compiler
Attribute
Tables
Syntax Directed Translation
Semantic Processing Techniques
(Used by all
IR (Intermediate Representation)
Phases of
The Compiler) Code
Generator
12
Target machine code
The Structure of a Compiler (6)
Source
Program Tokens SyntacticSemantic
Scanner Parser
(Character StructureRoutines
Stream)
Intermediate
Optimizer Representation
The IR code generated by the semantic routines is
analyzed and transformed into functionally
Symbol and Optimizer
equivalent but improved IR code
This phase can beAttribute
very complex and slow
Tables
Peephole optimization
loop optimization, register allocation, code
(Used by all
scheduling
Phases of
The Compiler)
Register and Temporary Management Code
Peephole Optimization Generator
13
Target machine code
The Structure of a Compiler (7)
Source
Program Tokens SyntacticSemantic
Scanner Parser
(Character StructureRoutines
Stream)
Intermediate
Code Generator Representation
Interpretive Code Generation
Generating Code from Tree/Dag
Grammar-Based Code Generator
Optimizer
Code
Generator
14 Target machine code
The Structure of a Compiler (8)
Code Generator
[Intermediate Code Generator]
Non-optimized
Scanner
[Lexical Analyzer] Intermediate Code
Tokens
Code Optimizer
Parser
[Syntax Analyzer]
Optimized Intermediate Cod
Parse
tree
Code Optimizer
Semantic Process
[Semantic analyzer] Target machine code
15
The Structure of a Compiler (9)
Compiler writing tools
Compiler generators or compiler-
compilers
E.g. scanner and parser
generators
Examples : Yacc, Lex
16
The Syntax and Semantics of
Programming Language (1)
A programming language must include the
specification of syntax (structure) and
semantics (meaning).
Syntax typically means the context-free
syntax because of the almost universal use of
context-free-grammar (CFGs)
Ex.
a = b + c is syntactically legal
b + c = a is illegal
17
The Syntax and Semantics of
Programming Language (2)
The semantics of a programming language
are commonly divided into two classes:
Static semantics
Semantics rules that can be checked at compiled time.
Ex. The type and number of a function’s arguments
Runtime semantics
Semantics rules that can be checked only at run time
18
Compiler Design and Programming
Language Design (1)
19
Compiler Design and Programming
Language Design(2)
20
Computer Architecture and Compiler
Design
Compilers should exploit the hardware-specific
feature and computing capability to optimize
code.
The problems encountered in modern
computing platforms:
Instruction sets for some popular architectures are
highly nonuniform.
High-level programming language operations are
not always easy to support.
Ex. exceptions, threads, dynamic heap access …
Exploiting architectural features such as cache,
distributed processors and memory
Effective use of a large number of processors
21
Compiler Design Considerations
Debugging Compilers
Designed to aid in the development and
debugging of programs.
Optimizing Compilers
Designed to produce efficient target code
Retargetable Compilers
A compiler whose target architecture can be
changed without its machine-independent
components having to be rewritten.
22