PART I: Overview Material: 2 Language Processors (Tombstone Diagrams, Bootstrapping) 3 Architecture of A Compiler

This document summarizes the process of developing a recursive descent parser from a grammar. It discusses: 1. Expressing the grammar in EBNF and performing transformations like left factorization and eliminating left recursion. 2. Creating a parser class with methods for accepting tokens from a scanner and a public parse method. 3. Implementing private parsing methods corresponding to each grammar rule, using pattern matching on the current token to determine the parsing path. 4. The algorithm to automatically generate these parsing methods by rewriting EBNF rules is described. For the parser to work correctly, the grammar must be LL(1).

Uploaded by

anithasudha

Course Overview

PART I: overview material

1 Introduction
2 Language processors (tombstone diagrams, bootstrapping)
3 Architecture of a compiler
PART II: inside a compiler
4 Syntax analysis
5 Contextual analysis
6 Runtime organization
7 Code generation
PART III: conclusion
8 Interpretation
9 Review
Syntax Analysis (Chapter 4) 1
Systematic Development of a Recursive Descent Parser
(1) Express grammar in EBNF
(2) Grammar Transformations:
Left factorization and Left recursion elimination
(3) Create a parser class with
– private variable currentToken
– methods to call the scanner: accept and acceptIt
(4) Implement a public method for main function to call:
– public parse method that
• fetches the first token from the scanner
• calls parseS (where S is start symbol of the grammar)
• verifies that the scanner next produces the end-of-file token
(5) Implement private parsing methods:
– add private parseN method for each non terminal N
Developing RD Parser for Mini Triangle
Before we begin:
• The following non-terminals are recognized by the scanner
• They will be returned as tokens by the scanner
Identifier ::= Letter (Letter|Digit)*
Integer-Literal ::= Digit Digit*
Operator ::= + | - | * | / | < | > | =
Comment ::= ! Graphic* eol
Assume the scanner returns instances of this class:
public class Token {
    byte kind; String spelling;
    final static byte
        IDENTIFIER = 0,
        INTLITERAL = 1;
    ...
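The token definitions above can be sketched as a small hand-written scanner. This is only an illustration under stated assumptions: the class name MiniScanner, the method scanAll, and the kind values OPERATOR and EOT are hypothetical (the slide fixes only IDENTIFIER = 0 and INTLITERAL = 1), comments are assumed to be skipped rather than returned, and keywords and punctuation such as := or ; are left out.

```java
import java.util.ArrayList;
import java.util.List;

class MiniScanner {
    // IDENTIFIER and INTLITERAL follow the slide; OPERATOR and EOT values are assumed.
    static final byte IDENTIFIER = 0, INTLITERAL = 1, OPERATOR = 2, EOT = 3;

    static class Token {
        final byte kind;
        final String spelling;
        Token(byte kind, String spelling) { this.kind = kind; this.spelling = spelling; }
    }

    // Scans the whole source text and returns its tokens, ending with an EOT token.
    static List<Token> scanAll(String source) {
        List<Token> tokens = new ArrayList<>();
        int i = 0;
        while (i < source.length()) {
            char c = source.charAt(i);
            if (Character.isWhitespace(c)) {
                i++;
            } else if (c == '!') {                  // Comment ::= ! Graphic* eol (skipped)
                while (i < source.length() && source.charAt(i) != '\n') i++;
            } else if (Character.isLetter(c)) {     // Identifier ::= Letter (Letter|Digit)*
                int start = i;
                while (i < source.length() && Character.isLetterOrDigit(source.charAt(i))) i++;
                tokens.add(new Token(IDENTIFIER, source.substring(start, i)));
            } else if (Character.isDigit(c)) {      // Integer-Literal ::= Digit Digit*
                int start = i;
                while (i < source.length() && Character.isDigit(source.charAt(i))) i++;
                tokens.add(new Token(INTLITERAL, source.substring(start, i)));
            } else if ("+-*/<>=".indexOf(c) >= 0) { // Operator ::= + | - | * | / | < | > | =
                tokens.add(new Token(OPERATOR, String.valueOf(c)));
                i++;
            } else {
                throw new IllegalArgumentException("unexpected character '" + c + "' at offset " + i);
            }
        }
        tokens.add(new Token(EOT, ""));
        return tokens;
    }

    public static void main(String[] args) {
        for (Token t : scanAll("x1 + 42 ! a comment\ny")) {
            System.out.println(t.kind + " \"" + t.spelling + "\"");
        }
    }
}
```

A real scanner would hand out one token per scan( ) call; returning the full list just keeps the sketch short.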
(1)&(2) Developing RD Parser for Mini Triangle

Program ::= single-Command

Command ::= single-Command          (left recursion elimination needed)
| Command ; single-Command
single-Command                      (left factorization needed)
::= V-name := Expression
| Identifier ( Expression )
| if Expression then single-Command
else single-Command
| while Expression do single-Command
| let Declaration in single-Command
| begin Command end
V-name ::= Identifier
...


(1)&(2) Express grammar in EBNF and transform

After factorization etc. we get:

Program ::= single-Command
Command ::= single-Command (; single-Command)*
single-Command
::= Identifier
( := Expression | ( Expression ) )
| if Expression then single-Command
else single-Command
| while Expression do single-Command
| let Declaration in single-Command
| begin Command end
V-name ::= Identifier
...


(1)&(2) Developing RD Parser for Mini Triangle
Expression                          (left recursion elimination needed)
::= primary-Expression
| Expression Operator primary-Expression
primary-Expression
::= Integer-Literal
| V-name
| Operator primary-Expression
| ( Expression )
Declaration                         (left recursion elimination needed)
::= single-Declaration
| Declaration ; single-Declaration
single-Declaration
::= const Identifier ~ Expression
| var Identifier : Type-denoter
Type-denoter ::= Identifier
(1)&(2) Express grammar in EBNF and transform
After left factorization and left recursion elimination:
Expression
::= primary-Expression
( Operator primary-Expression )*
primary-Expression
::= Integer-Literal
| Identifier
| Operator primary-Expression
| ( Expression )
Declaration
::= single-Declaration (; single-Declaration)*
single-Declaration
::= const Identifier ~ Expression
| var Identifier : Type-denoter
Type-denoter ::= Identifier
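As an illustration, the transformed Expression and primary-Expression rules can be turned into a recognizer along these lines. This is a sketch under assumptions, not the course's code: tokens are pre-split strings instead of Token objects, the class name ExpressionRecognizer and the helper recognizes are hypothetical, and a token is classified only by its first character.

```java
import java.util.List;

class ExpressionRecognizer {
    private final List<String> tokens; // pre-split tokens stand in for the scanner
    private int pos = 0;

    ExpressionRecognizer(List<String> tokens) { this.tokens = tokens; }

    private String current() { return pos < tokens.size() ? tokens.get(pos) : "<eot>"; }
    private boolean isOperator() { return current().length() == 1 && "+-*/<>=".contains(current()); }
    private boolean isIntLiteral() { return Character.isDigit(current().charAt(0)); }
    private boolean isIdentifier() { return Character.isLetter(current().charAt(0)); }
    private void acceptIt() { pos++; }

    private void accept(String expected) {
        if (!current().equals(expected))
            throw new IllegalStateException("expected " + expected + " but found " + current());
        pos++;
    }

    // Expression ::= primary-Expression ( Operator primary-Expression )*
    void parseExpression() {
        parsePrimaryExpression();
        while (isOperator()) {
            acceptIt();
            parsePrimaryExpression();
        }
    }

    // primary-Expression ::= Integer-Literal | Identifier
    //                      | Operator primary-Expression | ( Expression )
    void parsePrimaryExpression() {
        if (isIntLiteral() || isIdentifier()) {
            acceptIt();
        } else if (isOperator()) {
            acceptIt();
            parsePrimaryExpression();
        } else if (current().equals("(")) {
            acceptIt();
            parseExpression();
            accept(")");
        } else {
            throw new IllegalStateException("syntax error at token " + pos);
        }
    }

    // True if the tokens form exactly one Expression.
    static boolean recognizes(String... toks) {
        ExpressionRecognizer p = new ExpressionRecognizer(List.of(toks));
        try {
            p.parseExpression();
            return p.pos == toks.length;
        } catch (IllegalStateException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(recognizes("a", "+", "(", "2", "*", "b", ")"));
        System.out.println(recognizes("a", "+"));
    }
}
```

The repetition ( ... )* becomes a while loop and each alternative becomes one branch, which is why the left-recursive form had to be rewritten first.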
(3)&(4) Create a parser class and public parse method
public class Parser {
    private Token currentToken;
    private void accept (byte expectedKind) {
        if (currentToken.kind == expectedKind)
            currentToken = scanner.scan( );
        else
            report syntax error
    }
    private void acceptIt( ) {
        currentToken = scanner.scan( );
    }
    public void parse( ) {
        acceptIt( ); // get the first token
        parseProgram( ); // Program is the start symbol
        if (currentToken.kind != Token.EOT)
            report syntax error
    }
    ...
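A compilable version of this skeleton might look as follows. It is a sketch under assumptions: the scanner is replaced by an array of token kinds, "report syntax error" is modelled as a hypothetical SyntaxError exception, the EOT value is made up, and the start symbol is reduced to the toy rule Program ::= Identifier so that the example stays small.

```java
class ParserSkeleton {
    // IDENTIFIER and INTLITERAL follow the Token slide; the EOT value is assumed.
    static final byte IDENTIFIER = 0, INTLITERAL = 1, EOT = 2;

    // Hypothetical stand-in for "report syntax error".
    static class SyntaxError extends RuntimeException {
        SyntaxError(String message) { super(message); }
    }

    private final byte[] input; // token kinds that a scanner would deliver
    private int next = 0;
    private byte currentKind;   // plays the role of currentToken.kind

    ParserSkeleton(byte... input) { this.input = input; }

    // Stand-in for scanner.scan(): yields EOT once the input is used up.
    private byte scan() { return next < input.length ? input[next++] : EOT; }

    private void accept(byte expectedKind) {
        if (currentKind == expectedKind)
            currentKind = scan();
        else
            throw new SyntaxError("expected kind " + expectedKind + " but found " + currentKind);
    }

    private void acceptIt() { currentKind = scan(); }

    // Toy start symbol: Program ::= Identifier
    private void parseProgram() { accept(IDENTIFIER); }

    public void parse() {
        acceptIt();      // get the first token
        parseProgram();  // Program is the start symbol
        if (currentKind != EOT)
            throw new SyntaxError("unexpected token after end of program");
    }

    static boolean accepts(byte... kinds) {
        try {
            new ParserSkeleton(kinds).parse();
            return true;
        } catch (SyntaxError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(accepts(IDENTIFIER));
        System.out.println(accepts(IDENTIFIER, IDENTIFIER));
    }
}
```

The final EOT check in parse( ) is what rejects trailing input such as a second identifier.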
(5) Implement private parsing methods
Program ::= single-Command

private void parseProgram( ) {
    parseSingleCommand( );
}


(5) Implement private parsing methods
single-Command
::= Identifier
( := Expression | ( Expression ) )
| if Expression then single-Command
else single-Command
| ... other alternatives ...

private void parseSingleCommand( ) {
    switch (currentToken.kind) {
    case Token.IDENTIFIER : ...
    case Token.IF : ...
    ... other cases ...
    default: report a syntax error
    }
}


Algorithm to convert EBNF into an RD parser
• The conversion of an EBNF specification into a Java or C++
implementation of a recursive descent parser is so “mechanical”
that it could easily be automated (such tools exist, but we won’t
use them in this course)
• We can describe the algorithm by a set of mechanical rewrite
rules
N ::= α
private void parseN( ) {
    parse α // as explained on next two slides
}


Algorithm to convert EBNF into an RD parser

parse t where t is a terminal

accept(t);

parse N where N is a non-terminal

parseN( );

parse ε
// a dummy statement

parse X Y

parse X
parse Y


Algorithm to convert EBNF into an RD parser
parse X*
while (currentToken.kind is in starters[X]) {
    parse X
}

parse X | Y
switch (currentToken.kind) {
cases in starters[X]:
    parse X
    break;
cases in starters[Y]:
    parse Y
    break;
default:
    if neither X nor Y generates ε then report syntax error
}
Example: “Generation” of parseCommand

Command ::= single-Command ( ; single-Command )*

private void parseCommand( ) {
    parseSingleCommand( );
    while (currentToken.kind == Token.SEMICOLON) {
        acceptIt( ); // because SEMICOLON has just been checked
        parseSingleCommand( );
    }
}


Example: Generation of parseSingleDeclaration
single-Declaration
::= const Identifier ~ Expression
| var Identifier : Type-denoter

private void parseSingleDeclaration( ) {
    switch (currentToken.kind) {
    case Token.CONST:
        acceptIt( );
        parseIdentifier( );
        accept(Token.IS);
        parseExpression( );
        break;
    case Token.VAR:
        acceptIt( );
        parseIdentifier( );
        accept(Token.COLON);
        parseTypeDenoter( );
        break;
    default: report syntax error
    }
}
LL(1) Grammars
• The presented algorithm to convert EBNF into a parser
does not work for all possible grammars.
• It only works for so-called “LL(1)” grammars.
• Basically, an LL(1) grammar is a grammar that can
be parsed by a top-down parser with a lookahead (in
the input stream of tokens) of one token.
• Which grammars are LL(1)?
How can we recognize that a grammar is (or is not) LL(1)?
=> We can deduce the necessary conditions from the
parser generation algorithm.


LL(1) Grammars
parse X*
while (currentToken.kind is in starters[X]) {
    parse X
}
Condition: starters[X] must be disjoint from the set of tokens that
can immediately follow X*

parse X | Y
switch (currentToken.kind) {
cases in starters[X]:
    parse X
    break;
cases in starters[Y]:
    parse Y
    break;
default: if neither X nor Y generates ε then report syntax error
}
Conditions: starters[X] and starters[Y] must be disjoint sets, and if
either X or Y generates ε then both must also be disjoint from the set
of tokens that can immediately follow X | Y
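The two conditions can be checked mechanically once the starter sets and follower sets are known. The sketch below assumes they are given as sets of token names; the class name Ll1Conditions, the method names, and the example sets in main are illustrative and not part of the slides.

```java
import java.util.Collections;
import java.util.Set;

class Ll1Conditions {
    // Condition for X*: starters[X] must not overlap the tokens that can follow X*.
    static boolean repetitionOk(Set<String> startersX, Set<String> followers) {
        return Collections.disjoint(startersX, followers);
    }

    // Conditions for X | Y: the starter sets must not overlap, and if either
    // alternative can generate the empty string, neither may overlap the followers.
    static boolean choiceOk(Set<String> startersX, Set<String> startersY,
                            boolean eitherGeneratesEmpty, Set<String> followers) {
        if (!Collections.disjoint(startersX, startersY)) return false;
        if (!eitherGeneratesEmpty) return true;
        return Collections.disjoint(startersX, followers)
            && Collections.disjoint(startersY, followers);
    }

    public static void main(String[] args) {
        // V-name := Expression | Identifier ( Expression ): both start with Identifier.
        System.out.println(choiceOk(Set.of("Identifier"), Set.of("Identifier"), false, Set.of()));
        // After left factorization: := Expression | ( Expression )
        System.out.println(choiceOk(Set.of(":="), Set.of("("), false, Set.of()));
        // ( ; single-Command )* inside "begin Command end": followed by "end".
        System.out.println(repetitionOk(Set.of(";"), Set.of("end")));
    }
}
```

Computing the starter and follower sets themselves is a separate step that this sketch leaves out.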


LL(1) grammars and left factorization

The original Mini-Triangle grammar is not LL(1):

For example:
single-Command
::= V-name := Expression
| Identifier ( Expression )
| ...
V-name ::= Identifier

starters[V-name := Expression]
= starters[V-name] = starters[Identifier]
starters[Identifier ( Expression )]
= starters[Identifier]
NOT DISJOINT!
LL(1) grammars: left factorization
What happens when we generate an RD parser from a non-LL(1) grammar?

single-Command
::= V-name := Expression
| Identifier ( Expression )
| ...

private void parseSingleCommand( ) {
    switch (currentToken.kind) {
    case Token.IDENTIFIER:          // wrong: overlapping cases
        parse V-name := Expression
    case Token.IDENTIFIER:
        parse Identifier ( Expression )
    ...other cases...
    default: report syntax error
    }
}
LL(1) grammars: left factorization

single-Command
::= V-name := Expression
| Identifier ( Expression )
| ...

Left factorization (and substitution of V-name)

single-Command
::= Identifier
( := Expression | ( Expression ) )
| ...
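After left factorization, the parser can consume the shared Identifier first and only then decide between the two continuations with one token of lookahead. The sketch below covers just this alternative and rests on simplifying assumptions: tokens are pre-split strings, Expression is cut down to a single identifier or integer literal, keywords are not distinguished from identifiers, and the class name LeftFactoredCommand is hypothetical.

```java
import java.util.List;

class LeftFactoredCommand {
    private final List<String> tokens; // pre-split tokens stand in for the scanner
    private int pos = 0;

    LeftFactoredCommand(List<String> tokens) { this.tokens = tokens; }

    private String current() { return pos < tokens.size() ? tokens.get(pos) : "<eot>"; }
    private void acceptIt() { pos++; }

    private void accept(String expected) {
        if (!current().equals(expected))
            throw new IllegalStateException("expected " + expected + " but found " + current());
        pos++;
    }

    // Simplified stand-in: Expression ::= Identifier | Integer-Literal
    private void parseExpression() {
        if (Character.isLetterOrDigit(current().charAt(0))) acceptIt();
        else throw new IllegalStateException("expression expected but found " + current());
    }

    // single-Command ::= Identifier ( := Expression | ( Expression ) )
    void parseSingleCommand() {
        if (!Character.isLetter(current().charAt(0)))
            throw new IllegalStateException("identifier expected but found " + current());
        acceptIt();                 // the common prefix Identifier
        switch (current()) {        // the next token now selects the alternative
            case ":=":
                acceptIt();
                parseExpression();
                break;
            case "(":
                acceptIt();
                parseExpression();
                accept(")");
                break;
            default:
                throw new IllegalStateException(":= or ( expected but found " + current());
        }
    }

    static boolean recognizes(String... toks) {
        LeftFactoredCommand p = new LeftFactoredCommand(List.of(toks));
        try {
            p.parseSingleCommand();
            return p.pos == toks.length;
        } catch (IllegalStateException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(recognizes("x", ":=", "1"));
        System.out.println(recognizes("putint", "(", "x", ")"));
    }
}
```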


LL(1) Grammars: left recursion elimination

Command ::= single-Command
| Command ; single-Command
What happens if we don’t perform left-recursion elimination?
public void parseCommand( ) {
    switch (currentToken.kind) {
    case in starters[single-Command]:   // wrong: overlapping cases
        parseSingleCommand( );
    case in starters[Command]:
        parseCommand( );    // if reached, recurses without consuming a token
        accept(Token.SEMICOLON);
        parseSingleCommand( );
    default: report syntax error
    }
}


LL(1) Grammars: left recursion elimination

Command ::= single-Command

| Command ; single-Command

Left recursion elimination

Command
::= single-Command (; single-Command)*
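With the left recursion replaced by a repetition, parseCommand becomes a plain loop. The sketch below assumes a toy rule single-Command ::= Identifier and pre-split string tokens; the class name CommandSequence and the helper countCommands (which returns -1 on a syntax error) are hypothetical.

```java
import java.util.List;

class CommandSequence {
    private final List<String> tokens; // pre-split tokens stand in for the scanner
    private int pos = 0;

    CommandSequence(List<String> tokens) { this.tokens = tokens; }

    private String current() { return pos < tokens.size() ? tokens.get(pos) : "<eot>"; }
    private void acceptIt() { pos++; }

    // Toy stand-in: single-Command ::= Identifier
    private void parseSingleCommand() {
        if (!Character.isLetter(current().charAt(0)))
            throw new IllegalStateException("command expected but found " + current());
        acceptIt();
    }

    // Command ::= single-Command (; single-Command)*
    // Returns how many single commands were parsed.
    int parseCommand() {
        int count = 0;
        parseSingleCommand();
        count++;
        while (current().equals(";")) {
            acceptIt(); // the semicolon has just been checked
            parseSingleCommand();
            count++;
        }
        return count;
    }

    // Number of single commands in the input, or -1 on a syntax error.
    static int countCommands(String... toks) {
        CommandSequence p = new CommandSequence(List.of(toks));
        try {
            int count = p.parseCommand();
            return p.pos == toks.length ? count : -1;
        } catch (IllegalStateException e) {
            return -1;
        }
    }

    public static void main(String[] args) {
        System.out.println(countCommands("a", ";", "b", ";", "c"));
        System.out.println(countCommands("a", ";"));
    }
}
```

Each loop iteration consumes at least the semicolon, so the method always makes progress, unlike the left-recursive version.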


Abstract Syntax Trees
• So far we have talked about how to build a recursive
descent parser that recognizes a given language
described by an LL(1) EBNF grammar.
• Next we will look at
– how to represent ASTs as data structures.
– how to modify the parser to construct an AST data structure.
• We make heavy use of Object-Oriented Programming!
(classes, inheritance, dynamic method binding)
