0% found this document useful (0 votes)

3 views

Compiler_Design_Notes

The document outlines key concepts in compiler design, covering lexical and syntax analysis, parsing techniques, syntax-directed translation, code optimization, and runtime environments. It details the structure of compilers, the role of lexical analyzers and parsers, various parsing methods, and intermediate code generation. Additionally, it discusses optimization techniques and code generation issues, emphasizing the importance of understanding these components for effective compiler construction.

Uploaded by

nandinikook

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

Compiler_Design_Notes

Uploaded by

nandinikook

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

Compiler Design Notes

UNIT I: Lexical Analysis & Syntax Analysis

**Language Processors:** Systems that process programs to make them executable.
Examples: compilers, interpreters, assemblers.

Structure of a Compiler: Phases include lexical analysis, syntax analysis, semantic

analysis, intermediate code generation, code optimization, and code generation.

Lexical Analysis: Converts characters to tokens. Removes whitespace/comments.

Role of Lexical Analyzer:

 - Tokenizes input
 - Removes whitespace/comments
 - Passes tokens to parser

Bootstrapping: Writing a compiler in the source programming language it intends to

compile.

**Input Buffering:** Technique for efficient scanning using buffers with sentinel characters.

Specification of Tokens: Defined using regular expressions, e.g., identifier: `[a-zA-Z_][a-

zA-Z0-9_]*`

Recognition of Tokens: Finite Automata used to recognize token patterns.

**Lexical Analyzer Generator (LEX):** Tool that generates lexical analyzers. Example:

DIGIT [0-9]

{DIGIT}+ { printf("Number"); }

Finite Automata: DFA/NFA used to implement lexical analyzers.

**Regular Expressions and Finite Automata:** REs define languages recognized by FA.

**Design of Lexical Analyzer Generator:** Converts REs to NFA -> DFA -> minimized DFA ->
code.

Syntax Analysis: Checks token sequence against grammar rules.

**Role of the Parser:** Detects syntax errors, builds parse trees.

Context-Free Grammars (CFG): Consist of terminals, non-terminals, start symbol, and

productions.

**Derivations and Parse Trees:** Show how strings derive from grammar. Leftmost and
rightmost derivations.

**Ambiguity:** A grammar with multiple parse trees for the same string.

**Left Recursion:** Grammar with productions like A -> Aα. Must be removed for top-down
parsing.

Left Factoring: Removes common prefixes to aid predictive parsing.

---

UNIT II: Parsing Techniques

**Top Down Parsing:** Builds parse tree from top using CFG.

Preprocessing Steps: Remove left recursion, perform left factoring.

Backtracking: Tries multiple production rules. Inefficient.

**Recursive Descent Parsing:** Uses mutually recursive functions for grammar rules.

LL(1) Grammars: Can be parsed without backtracking. Use single lookahead.

Non-recursive Predictive Parsing: Uses parsing table and stack.

**Error Recovery in Predictive Parsing:** Techniques include panic mode and phrase-level
recovery.

Bottom Up Parsing: Builds tree from leaves up.

**Difference between LR and LL Parsers:** LR parsers are more powerful and can handle
left recursion.

Types of LR Parsers: SLR, CLR, LALR.

**Shift-Reduce Parsing:** Uses stack and input buffer. Shift moves input to stack; reduce
applies grammar.

SLR Parsers: Simplified LR parsers using FOLLOW sets.

**SLR Table Construction:** Compute FIRST, FOLLOW, item sets, ACTION/GOTO tables.

**CLR and LALR Parsers:** More powerful, use lookahead. LALR combines similar CLR
states.
**Dangling Else Ambiguity:** "else" may match multiple "if"s. Resolved via grammar.

Error Recovery in LR Parsing: Same as in LL but adapted for stack.

Handling Ambiguous Grammar: Use precedence and associativity rules.

---

UNIT III: Syntax Directed Translation & Intermediate Code

**Syntax Directed Definitions (SDD):** CFG + semantic rules.

**Evaluation Orders for SDDs:** Post-order traversal for bottom-up; pre-order for top-
down.

Applications of Syntax Directed Translation: Type checking, intermediate code

generation.

Syntax Directed Translation Schemes (SDTS): Grammar with semantic actions

embedded.

Implementing L-Attributed SDDs: Evaluate attributes during parsing.

Intermediate Code Generation: Converts source to intermediate representation (IR).

Variants of Syntax Trees: Abstract syntax trees, DAGs.

Three Address Code (TAC): IR using temporary variables. Example:

t1 = a + b

t2 = t1 * c

Types and Declarations: Managed with symbol table.

Translation of Expressions: Convert infix to postfix/TAC.

Type Checking: Ensures operands are type-compatible.

Control Flow & Backpatching: Used for jumps and branches.

Intermediate Code for Procedures: Includes prologue/epilogue, parameter passing.

---

UNIT IV: Code Optimization

**Sources of Optimization:** Redundant operations, dead code, loop inefficiencies.
**Basic Blocks:** Sequences of instructions with single entry/exit.

Optimization of Basic Blocks: Remove common sub-expressions, dead code elimination.

Structure Preserving Transformations: Maintain program structure while optimizing.

Flow Graphs: Represent control flow with nodes and edges.

Loop Optimization: Includes loop unrolling, invariant code motion.

Data-Flow Analysis: Gathers info on variable usage to optimize.

Peephole Optimization: Localized improvements like replacing instructions.

---

UNIT V: Run Time Environments & Code Generation

**Storage Organization:** Stack, heap, static, and code segments.

Run Time Storage Allocation: Memory assigned to variables/structures during

execution.

Activation Records: Store return address, parameters, local variables.

Procedure Calls: Manage control transfer and data passing.

Displays: Used for accessing non-local variables.

Code Generation Issues: Instruction selection, register allocation.

Object Code Forms: Final machine code forms.

Code Generation Algorithm: Converts IR to assembly.

**Register Allocation and Assignment:** Efficient use of CPU registers using graph coloring.

---

**Note:** Each unit's examples and key diagrams (like DFA for token recognition, parse
trees, TAC examples) should be practiced separately.

These notes aim to summarize core compiler design concepts with clarity.

#AVIRO GLOBAL TEKNOLOGI - Company Profile Rev12.3
No ratings yet
#AVIRO GLOBAL TEKNOLOGI - Company Profile Rev12.3
15 pages
SHORTS
No ratings yet
SHORTS
11 pages
CD Overview
No ratings yet
CD Overview
9 pages
SYSTEM SOFTWARE -WPS Office
No ratings yet
SYSTEM SOFTWARE -WPS Office
2 pages
compiler_design_syllabus
No ratings yet
compiler_design_syllabus
8 pages
cd2m
No ratings yet
cd2m
5 pages
Compiler Design CAT Answers
No ratings yet
Compiler Design CAT Answers
3 pages
Download
No ratings yet
Download
1 page
CD question bank (1)
No ratings yet
CD question bank (1)
7 pages
Compiler Design 1
No ratings yet
Compiler Design 1
206 pages
1 qp
No ratings yet
1 qp
31 pages
Compiler Design Imortant Questions
No ratings yet
Compiler Design Imortant Questions
28 pages
Ambiguous Grammars and Eliminating Ambiguity
No ratings yet
Ambiguous Grammars and Eliminating Ambiguity
2 pages
Document 7
No ratings yet
Document 7
13 pages
Compiler Design Note
No ratings yet
Compiler Design Note
313 pages
CSE313 - Compiler Design Syllabus
No ratings yet
CSE313 - Compiler Design Syllabus
2 pages
Compiler_Design_Solutions-1
No ratings yet
Compiler_Design_Solutions-1
4 pages
ALL UNITS
No ratings yet
ALL UNITS
19 pages
CD_Micro
No ratings yet
CD_Micro
5 pages
cd 10 marks
No ratings yet
cd 10 marks
29 pages
Compiler Design CAT
No ratings yet
Compiler Design CAT
6 pages
Compiler Design_ 2-Mark and 16-Mark Answers (1)
No ratings yet
Compiler Design_ 2-Mark and 16-Mark Answers (1)
19 pages
Compiler Design Syllabus
No ratings yet
Compiler Design Syllabus
2 pages
Cheatsheet Generator
No ratings yet
Cheatsheet Generator
2 pages
Compiler Design KCS5
No ratings yet
Compiler Design KCS5
10 pages
2-Introduction to Compilation and Lexical Analysis-19!07!2024
No ratings yet
2-Introduction to Compilation and Lexical Analysis-19!07!2024
135 pages
Cambridge: Computer Science Tripos Part Ib
No ratings yet
Cambridge: Computer Science Tripos Part Ib
82 pages
CD -2 Notes
No ratings yet
CD -2 Notes
34 pages
CS3501 Compiler Design
No ratings yet
CS3501 Compiler Design
13 pages
1-Introduction to programming language translators-13-12-2024
No ratings yet
1-Introduction to programming language translators-13-12-2024
38 pages
Compiler
No ratings yet
Compiler
5 pages
Compiler Key2
No ratings yet
Compiler Key2
18 pages
imp
No ratings yet
imp
9 pages
Syllabus: ECS-603: Compiler Design
No ratings yet
Syllabus: ECS-603: Compiler Design
2 pages
Overview of Compiler
No ratings yet
Overview of Compiler
56 pages
cdsem
No ratings yet
cdsem
14 pages
Chapter 1 - Introduction
No ratings yet
Chapter 1 - Introduction
27 pages
409.f
No ratings yet
409.f
13 pages
Principles of Compiler Design
100% (2)
Principles of Compiler Design
35 pages
Compiler Construction CHAPTER 3
No ratings yet
Compiler Construction CHAPTER 3
15 pages
Btcse 701-Compiler Design
No ratings yet
Btcse 701-Compiler Design
10 pages
Compiler Design
No ratings yet
Compiler Design
1 page
WINSEM2024-25_CSI2005_TH_VL2024250502429_2024-12-13_Reference-Material-I (1)
No ratings yet
WINSEM2024-25_CSI2005_TH_VL2024250502429_2024-12-13_Reference-Material-I (1)
42 pages
Compiler Design Unit1 Summary
No ratings yet
Compiler Design Unit1 Summary
2 pages
Langauage Processor
No ratings yet
Langauage Processor
11 pages
compiler basic question
No ratings yet
compiler basic question
3 pages
Markdown to PDF
No ratings yet
Markdown to PDF
2 pages
Chapter 1 - Introduction To Comp
No ratings yet
Chapter 1 - Introduction To Comp
27 pages
QB TT2(1)
No ratings yet
QB TT2(1)
2 pages
System Programming
100% (2)
System Programming
48 pages
System_Software_Syllabus_Hierarchy
No ratings yet
System_Software_Syllabus_Hierarchy
3 pages
Compiler Design
No ratings yet
Compiler Design
4 pages
Imp CS1352 APR08
No ratings yet
Imp CS1352 APR08
15 pages
IARE CD Lecture Notes
No ratings yet
IARE CD Lecture Notes
98 pages
Syllabus
No ratings yet
Syllabus
1 page
Compiler_Design_Roadmap
No ratings yet
Compiler_Design_Roadmap
2 pages
cd-important-questions
No ratings yet
cd-important-questions
2 pages
u2
No ratings yet
u2
129 pages
Problem Solving in C and Python: Programming Exercises and Solutions, Part 1
From Everand
Problem Solving in C and Python: Programming Exercises and Solutions, Part 1
Yana Kortsarts
4.5/5 (2)
C Programming
From Everand
C Programming
Netra
No ratings yet
Python Programming Concepts
From Everand
Python Programming Concepts
MRB
No ratings yet
Java Project
No ratings yet
Java Project
4 pages
email-p2presearch-2009-02-13-023120
No ratings yet
email-p2presearch-2009-02-13-023120
3 pages
GECKReadme
No ratings yet
GECKReadme
2 pages
Mca Syllabus
No ratings yet
Mca Syllabus
44 pages
672 Small Scale Design
No ratings yet
672 Small Scale Design
3 pages
Unit-5(Part-3)File Handling in C
No ratings yet
Unit-5(Part-3)File Handling in C
18 pages
RRC SCR GDCE JE Mechanical 16-08-2024 QP (1)
No ratings yet
RRC SCR GDCE JE Mechanical 16-08-2024 QP (1)
25 pages
J Matric Revision Notes
No ratings yet
J Matric Revision Notes
209 pages
PTL Whats New v3 80 Enu
No ratings yet
PTL Whats New v3 80 Enu
6 pages
Mold Building Standards: Revised Date
No ratings yet
Mold Building Standards: Revised Date
13 pages
Case Study 5-HP
No ratings yet
Case Study 5-HP
3 pages
Unit 4 - Student Spec
No ratings yet
Unit 4 - Student Spec
8 pages
Project Management - CPM-PERT
No ratings yet
Project Management - CPM-PERT
125 pages
Europower EP4000/EP2000: PA Amplifiers
No ratings yet
Europower EP4000/EP2000: PA Amplifiers
4 pages
MINILINK 6654-Assembling and Connecting Indoor Cables
No ratings yet
MINILINK 6654-Assembling and Connecting Indoor Cables
34 pages
ZTE ZXV10 W615 V3 Product
No ratings yet
ZTE ZXV10 W615 V3 Product
5 pages
Introduction To Basic Programming
No ratings yet
Introduction To Basic Programming
43 pages
01) System Configuration (IPECS-MG)
No ratings yet
01) System Configuration (IPECS-MG)
23 pages
How To Create A Read Only User For Tablespace DCO Monitoring in LogicMonitor
No ratings yet
How To Create A Read Only User For Tablespace DCO Monitoring in LogicMonitor
8 pages
AI Powered Threat Detecition
No ratings yet
AI Powered Threat Detecition
11 pages
Sony kdl-22cx520 kdl-32cx520 cx523 kdl-40cx520 523 Chassis Az2g SM
50% (2)
Sony kdl-22cx520 kdl-32cx520 cx523 kdl-40cx520 523 Chassis Az2g SM
43 pages
PPSC Lecturer of Computer Science Old Paper
No ratings yet
PPSC Lecturer of Computer Science Old Paper
13 pages
Expansion Joint Cover
No ratings yet
Expansion Joint Cover
2 pages
Coap 2024 Ece
No ratings yet
Coap 2024 Ece
24 pages
Automation: Getting The Most Out of A Plant
No ratings yet
Automation: Getting The Most Out of A Plant
10 pages
Dell Desktop RC With All Amendments
No ratings yet
Dell Desktop RC With All Amendments
53 pages
Syllabus Nimcet
No ratings yet
Syllabus Nimcet
4 pages
Stochastic Reserving - Stochastic Reserving - Mack and Bootstrapping Mack and Bootstrapping (Slides) - Dave Clark
No ratings yet
Stochastic Reserving - Stochastic Reserving - Mack and Bootstrapping Mack and Bootstrapping (Slides) - Dave Clark
12 pages
poweredge-rack-quick-reference-guide (12-3-2025) (002)
No ratings yet
poweredge-rack-quick-reference-guide (12-3-2025) (002)
6 pages

Compiler_Design_Notes

Uploaded by

Compiler_Design_Notes

Uploaded by

Compiler Design Notes

UNIT I: Lexical Analysis & Syntax Analysis

**Structure of a Compiler:** Phases include lexical analysis, syntax analysis, semantic

**Lexical Analysis:** Converts characters to tokens. Removes whitespace/comments.

**Role of Lexical Analyzer:**

**Bootstrapping:** Writing a compiler in the source programming language it intends to

**Specification of Tokens:** Defined using regular expressions, e.g., identifier: `[a-zA-Z_][a-

**Recognition of Tokens:** Finite Automata used to recognize token patterns.

**Finite Automata:** DFA/NFA used to implement lexical analyzers.

**Syntax Analysis:** Checks token sequence against grammar rules.

**Context-Free Grammars (CFG):** Consist of terminals, non-terminals, start symbol, and

**Left Factoring:** Removes common prefixes to aid predictive parsing.

UNIT II: Parsing Techniques

**Preprocessing Steps:** Remove left recursion, perform left factoring.

**Backtracking:** Tries multiple production rules. Inefficient.

**LL(1) Grammars:** Can be parsed without backtracking. Use single lookahead.

**Non-recursive Predictive Parsing:** Uses parsing table and stack.

**Bottom Up Parsing:** Builds tree from leaves up.

**Types of LR Parsers:** SLR, CLR, LALR.

**SLR Parsers:** Simplified LR parsers using FOLLOW sets.

**Error Recovery in LR Parsing:** Same as in LL but adapted for stack.

**Handling Ambiguous Grammar:** Use precedence and associativity rules.

UNIT III: Syntax Directed Translation & Intermediate Code

**Applications of Syntax Directed Translation:** Type checking, intermediate code

**Syntax Directed Translation Schemes (SDTS):** Grammar with semantic actions

**Implementing L-Attributed SDDs:** Evaluate attributes during parsing.

**Intermediate Code Generation:** Converts source to intermediate representation (IR).

**Variants of Syntax Trees:** Abstract syntax trees, DAGs.

**Three Address Code (TAC):** IR using temporary variables. Example:

**Types and Declarations:** Managed with symbol table.

**Translation of Expressions:** Convert infix to postfix/TAC.

**Type Checking:** Ensures operands are type-compatible.

**Control Flow & Backpatching:** Used for jumps and branches.

**Intermediate Code for Procedures:** Includes prologue/epilogue, parameter passing.

UNIT IV: Code Optimization

**Optimization of Basic Blocks:** Remove common sub-expressions, dead code elimination.

**Structure Preserving Transformations:** Maintain program structure while optimizing.

**Flow Graphs:** Represent control flow with nodes and edges.

**Loop Optimization:** Includes loop unrolling, invariant code motion.

**Data-Flow Analysis:** Gathers info on variable usage to optimize.

**Peephole Optimization:** Localized improvements like replacing instructions.

UNIT V: Run Time Environments & Code Generation

**Run Time Storage Allocation:** Memory assigned to variables/structures during

**Activation Records:** Store return address, parameters, local variables.

**Procedure Calls:** Manage control transfer and data passing.

**Displays:** Used for accessing non-local variables.

**Code Generation Issues:** Instruction selection, register allocation.

**Object Code Forms:** Final machine code forms.

**Code Generation Algorithm:** Converts IR to assembly.

You might also like

Structure of a Compiler: Phases include lexical analysis, syntax analysis, semantic

Lexical Analysis: Converts characters to tokens. Removes whitespace/comments.

Role of Lexical Analyzer:

Bootstrapping: Writing a compiler in the source programming language it intends to

Specification of Tokens: Defined using regular expressions, e.g., identifier: `[a-zA-Z_][a-

Recognition of Tokens: Finite Automata used to recognize token patterns.

Finite Automata: DFA/NFA used to implement lexical analyzers.

Syntax Analysis: Checks token sequence against grammar rules.

Context-Free Grammars (CFG): Consist of terminals, non-terminals, start symbol, and

Left Factoring: Removes common prefixes to aid predictive parsing.

Preprocessing Steps: Remove left recursion, perform left factoring.

Backtracking: Tries multiple production rules. Inefficient.

LL(1) Grammars: Can be parsed without backtracking. Use single lookahead.

Non-recursive Predictive Parsing: Uses parsing table and stack.

Bottom Up Parsing: Builds tree from leaves up.

Types of LR Parsers: SLR, CLR, LALR.

SLR Parsers: Simplified LR parsers using FOLLOW sets.

Error Recovery in LR Parsing: Same as in LL but adapted for stack.

Handling Ambiguous Grammar: Use precedence and associativity rules.

Applications of Syntax Directed Translation: Type checking, intermediate code

Syntax Directed Translation Schemes (SDTS): Grammar with semantic actions

Implementing L-Attributed SDDs: Evaluate attributes during parsing.

Intermediate Code Generation: Converts source to intermediate representation (IR).

Variants of Syntax Trees: Abstract syntax trees, DAGs.

Three Address Code (TAC): IR using temporary variables. Example:

Types and Declarations: Managed with symbol table.

Translation of Expressions: Convert infix to postfix/TAC.

Type Checking: Ensures operands are type-compatible.

Control Flow & Backpatching: Used for jumps and branches.

Intermediate Code for Procedures: Includes prologue/epilogue, parameter passing.

Optimization of Basic Blocks: Remove common sub-expressions, dead code elimination.

Structure Preserving Transformations: Maintain program structure while optimizing.

Flow Graphs: Represent control flow with nodes and edges.

Loop Optimization: Includes loop unrolling, invariant code motion.

Data-Flow Analysis: Gathers info on variable usage to optimize.

Peephole Optimization: Localized improvements like replacing instructions.

Run Time Storage Allocation: Memory assigned to variables/structures during

Activation Records: Store return address, parameters, local variables.

Procedure Calls: Manage control transfer and data passing.

Displays: Used for accessing non-local variables.

Code Generation Issues: Instruction selection, register allocation.

Object Code Forms: Final machine code forms.

Code Generation Algorithm: Converts IR to assembly.