0% found this document useful (0 votes)

105 views48 pages

cs212 Lect05 63 Inter

Here are the steps to scan the input "x := (y+10) * z1;": 1. Start at state q0 2. Read 'x' and stay in q0, accumulating characters into the token 3. Read ':' and transition to state q1, returning token "x" 4. Read '=' and stay in q1, returning token ":=" 5. Read '(' and transition to state q2, returning token "=" 6. Read 'y' and stay in q2, accumulating characters into the token 7. Read '+' and stay in q2 8. Read '1' and stay in q2 9. Read '0' and stay in q2

Uploaded by

Leng Hour leng

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

105 views48 pages

cs212 Lect05 63 Inter

Uploaded by

Leng Hour leng

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 48

CS 212 LECTURE 05

PROGRAMMING LANGUAGES
BACKUS NAUR FORM

• Backus Naur Form (BNF): a standard notation for

expressing syntax as a set of grammar rules.
• BNF was developed by Noam Chomsky, John Backus,
and Peter Naur.
• First used to describe Algol.
• BNF can describe any context-free grammar.
• Fortunately, computer languages are mostly context-
free. 2
BACKUS NAUR FORM
• Grammar Rules or Productions: define symbols.
assignment_stmt ::= id = expression ;

The nonterminal The definition

symbol being defined. (production)

Nonterminal Symbols: anything that is defined on the left-

side of some production.
Terminal Symbols: things that are not defined by
productions. They can be literals, symbols, and other
3
lexemes of the language defined by lexical rules.
BACKUS NAUR FORM
• Different notations (same meaning):
assignment_stmt ::= id = expression + term
<assignment-stmt> ::= <id> = <expr> + <term>
<assignment-stmt> => <id> = <expr> + <term>
AssignmentStmt  id = expression + term
::=, =>,  mean "consists of" or "defined as”
• Null symbol : e or @
• Alternatives ( " | " ):
<expression> ::= <expression> + <term>
4
| <expression> - <term>
| <term>
PROBLEMS WITH BNF NOTATION

• BNF notation is too long.

• Must use recursion to specify repeated
occurrences
• Must use separate an alternative for every
option
5
EXTENDED BNF NOTATION
• EBNF adds notation for repetition and optional elements.
{…} means the contents can occur 0 or more times:
Use for repeating left recursive
<expr> ::= <expr> + <term> | <term>
This can repeat itself
and it on the left of other
becomes
non terminal symbol
<expr> ::= <term> { + <term> }
6
EXTENDED BNF NOTATION
[…] encloses an optional part:
1
<if-stmt> ::= if ( <expr> ) <stmt>
| if ( <expr> ) <stmt> else <stmt>
2
If there is no else <stmt> in (2) then (1) = (2). So
that else <stmt> is the optional part
Becomes
<if-stmt> ::=
if ( <expr> ) <stmt> [else <stmt>]
7
EXTENDED BNF NOTATION,
CONTINUED
• ( a | b | ... ) is a list of choices. Choose exactly one.
1 The different
<expr> ::= <expr> + <term>
between (1) and
2
| <expr> - <term> (2) is +,- sign. So
| <term> +,- are the list
choice
becomes
<expr> ::= <term> { (+|-) <term> }
Another example:
<term> ::= <factor> { (*|/|%)<factor> }
8
EBNF COMPARED TO BNF
BNF: <expression> ::=
<expression> <expression> ++ <term>
::= <expression> <term>
|| <expression>
<expression> -- <term>
<term>
|| <term>
<term>
<term> ::=
<term> ::= <term> ** <factor>
<term> <factor>
|| <term>
<term> // <factor>
<factor>
|| <factor>
<factor>
<factor> ::=
<factor> ( <expression>
::= ( <expression> ))
|| <id>
<id>
|| <number>
<number>

EBNF:
<expression> ::=
<expression> <term> {{ (+|-)
::= <term> (+|-) <term>
<term> }}
<term> ::=
<term> ::= <factor> {{ (*|/)
<factor> (*|/)
<factor> }}
<factor>
factor 
factor  '(' <expression>
'(' <expression> ')'
')'
|| <id>
<id> || <number>
<number> 9
NOTES ON USE OF EBNF

• Do not start a rule with {…}:

Right: <expr> ::= <term> { + <term> }
Wrong: <expr> ::= {<term> + } <term>
• For right recursive rules use [ ... ] instead:
<expr> ::= <term> + <expr> | <term>
EBNF: <expr> ::= <term> [ + <expr> ]
Square brackets can be used anywhere:
<expr> ::= <expr> + <term>|<term>|- <term>
EBNF:
10
<expr> ::= [ - ] <term> { + <term> }
TRY THIS

• Rewrite this grammar using Extended BNF.

<sentence> ::= <noun-phrase> <verb-phrase> .
<noun-phrase> ::= <article><noun>
| <noun>
<article> ::= a | the
<noun> ::= boy | girl | cat | dog
<verb-phrase> ::= <verb><noun-phrase>
| <verb>
<verb> ::= sees | pets | bites
11
SYNTAX AND SEMANTICS
• The syntax of a language defines the valid symbols and
grammar.
• Syntax defines the structure of a program, i.e., the
form that each program unit and each statement must
use.
• The semantics defines the meaning of the grammar
elements.
• Lexical structure is the form of lowest level syntactic 12
units (words or tokens) of a grammar.
SYNTAX AND SEMANTICS COMPARED

• Syntax: in Java, an assignment statement is:

identifier = expression { operator expression } ;
• Semantics: an assignment statement must use compatible
types, e.g.
int n1, n2;
n1 = 20*1024; // OK, int_var = int_expression
n2 = 3.50; // illegal, incompatible types

• Lexical elements (Lexemes):

13
"n2" "=" "3.50" ";"
HOW ARE THEY USED?
Program
Source Code Parts of a Compiler / Interpreter:
Tokenizer (Lexical Analysis)
Token stream

Parser (Syntax Analysis)

Parse tree

Semantic Analysis
Intermediate code

Optimization and Code Generation

14
Object code
SCANNING AND PARSING

source file sum = x1 + x2;

input stream sum

=
Scanner x1
+
tokens x2
;
Parser
sum
=
+
parse tree x1 x2 15
SCANNERS

• Recognize regular expressions

• Implemented as finite automata (finite state machines)
• Typically contain a loop that cycles through characters,
building tokens and associated values by repeated
operations
• scanner may be integrated as a function in the parser.
• Parser calls the Scanner to get the next token.
16
SCANNERS

17
LEXICAL STRUCTURE
• Lexemes are the smallest lexical unit of a language, grouped
according to syntactic usage. Some types of lexemes in
computer languages are:

identifiers: x, println, _INIT, ArrayList

numeric constants: 0, 10000, 2.98E+6
operators: =, +, -, ++, +=, *, /
separators: [ ] ; : . , ( )
string literals: "hello there" 18
LEXICAL STRUCTURE

• A token is a structure representating a lexeme that

explicitly indicates its categorization for the purpose
of parsing.

19
LEXICAL STRUCTURE

•Lexemes are recognized by the first phase of

a translator -- the scanner -- that deals
directly with the input. The scanner
separates the input into tokens.
•Scanners are also called lexers.
20
TYPES OF LEXEMES
• Common Lexemes (classes of tokens)
identifiers: x, println, _INIT, ArrayList
numeric constants: 0, 10000, 2.98E+6
assignment operators: =, +=, -=, *=, /=, %=
arithmetic operators: *, /, +, -, %
boolean operators: &&, ||, ^, !
separators: [ ] ; : . , ( )
string literals: "hello there“
• Reserved words: may be defined as a class, or simply treat as
21
identifiers at lexical level
TOKENS
• Tokens are the strings of syntactic units.
• Example: what are the tokens in this statement?
result = (sum - average)/count;
• Lexeme Tokens:
result identifier
= assignment operator
( expression delimiter
sum identifier
- arithmetic operator
average identifier
) expression delimiter
/ arithmetic operator
count identifier 22
; semi-colon (statement delimiter)
HERE IS AN FA THAT RECOGNIZES A SUBSET
OF TOKENS IN THE PASCAL LANGUAGE:

23
when scanning “ temp := temp + 1 ”
The first token should be temp.
From start state, then go to state q1 and loop in
state q1 until get “ : ” . It will stop and return the
first token “temp” then start to get the next token.
Try scanning “ x := (y+10) * z1; ”
24
PARSING ALGORITHMS
• Broadly divided into LL and LR.
• LL algorithms match input directly to left-side
symbols, then choose a right-side production that
matches the tokens. This is top-down parsing

• LR algorithms try to match tokens to the right-side

productions, then replace groups of tokens with the
left-side nonterminal. They continue until the entire
input has been "reduced" to the start symbol
25
PARSING ALGORITHMS

••Look
Lookahead:
ahead:
••algorithms
algorithmsmust
mustlook
lookatatnext
nexttoken(s)
token(s)totodecide
decide
between alternate productions for current tokens
between alternate productions for current tokens

••LL(1)
LL(1)means
meansLL
LLwith
with11token
tokenlook-ahead
look-ahead
••LL algorithms are simpler and easier to visualize.
LL algorithms are simpler and easier to visualize.
••LR
LRalgorithms
algorithmsare
aremore
morepowerful:
powerful:can
canparse
parsesome
some
grammars that LL cannot, such as left recursion.
grammars that LL cannot, such as left recursion. 26
TOP-DOWN PARSING EXAMPLE (LL)

• Grammar rule : Rule

1 Number
2
3
4
5
6
7
8
9
10

Input String : x – 2 * y
27
Tokens : id – number * id
28
29
30
31
32
33
34
35
36
37
38
39
TOP-DOWN PARSING EXAMPLE (LL)

• Grammar rule : Rule

1 Number
2
3
4
5
6
7
8
9
10

Input String : x – 2 * y
40
Tokens : id – number * id
41
42
ELIMINATION OF LEFT RECURSION

45
LL PARSING EXAMPLE

Let try input String : x – 2 * y

Tokens : id – number * id

46
Rul Sentential Form Input
e
- Goal x – 2 * y
Expr x – 2 * y
Term Expr x – 2 * y
Factor Term Expr x – 2 * y
<id,x> Term Expr x – 2 * y
<id,x>  Expr x – 2 * y
<id,x> +Term Expr x – 2 * y

47
Rule Sentential Form Input
- Goal x – 2 * y
Expr x – 2 * y
Term Expr x – 2 * y
Factor Term Expr x – 2 * y
<id,x> Term Expr x – 2 * y
<id,x>  Expr x – 2 * y
<id,x> - Term Expr x – 2 * y
<id,x> - Factor Term Expr x – 2 * y
<id,x> - <number,2> Term Expr x –2*y
<id,x> - <number,2> * Factor Term Expr x –2*y
<id,x> - <number,2> * <id,y> Term Expr x –2*y
<id,x> - <number,2> * <id,y>  Expr x –2*y
<id,x> - <number,2> * <id,y>  x –2*y
<id,x> - <number,2> * <id,y> x –2*y
48

Lecture03 Parsing 1
No ratings yet
Lecture03 Parsing 1
108 pages
Grammar and Parse Trees (Syntax) : What Makes A Good Programming Language?
100% (2)
Grammar and Parse Trees (Syntax) : What Makes A Good Programming Language?
50 pages
Unit 2
No ratings yet
Unit 2
67 pages
Chapter 5 Syntax Analysis
No ratings yet
Chapter 5 Syntax Analysis
43 pages
Compiler Rewind
No ratings yet
Compiler Rewind
52 pages
Formal Methods of Describing Syntax - PPL
No ratings yet
Formal Methods of Describing Syntax - PPL
15 pages
Principals of Programming Language 1.2
No ratings yet
Principals of Programming Language 1.2
86 pages
Chapter Three
No ratings yet
Chapter Three
70 pages
Ch02 Programming Language Syntax 4e 2
No ratings yet
Ch02 Programming Language Syntax 4e 2
64 pages
Lecture 11
No ratings yet
Lecture 11
56 pages
Business Blueprint in SAP Implementation
0% (1)
Business Blueprint in SAP Implementation
2 pages
Lec02 Programming Language Specification
No ratings yet
Lec02 Programming Language Specification
36 pages
Chapter 3 Syntax Analysis
No ratings yet
Chapter 3 Syntax Analysis
78 pages
4 Parsing
No ratings yet
4 Parsing
32 pages
Lexical and Syntax Analysis
No ratings yet
Lexical and Syntax Analysis
63 pages
Chapter 4
No ratings yet
Chapter 4
62 pages
Lecture 2
No ratings yet
Lecture 2
38 pages
Describing Syntax and Semantics: CSE 325/CSE 425: Concepts of Programming Language
No ratings yet
Describing Syntax and Semantics: CSE 325/CSE 425: Concepts of Programming Language
46 pages
Top Down
No ratings yet
Top Down
25 pages
Top - Down Parsing: EDA180: Compiler Construc6on
No ratings yet
Top - Down Parsing: EDA180: Compiler Construc6on
43 pages
Computer Science Project Topics and Mate
No ratings yet
Computer Science Project Topics and Mate
38 pages
Compiler Design 3
No ratings yet
Compiler Design 3
140 pages
Lecture 3
No ratings yet
Lecture 3
48 pages
BNF Ebnf
100% (1)
BNF Ebnf
25 pages
Lec 03 TPL
No ratings yet
Lec 03 TPL
28 pages
Syntax and Symentic
No ratings yet
Syntax and Symentic
52 pages
PL 10 CH 3
No ratings yet
PL 10 CH 3
61 pages
CH 03
No ratings yet
CH 03
54 pages
Projects HTML & Wabsite
100% (2)
Projects HTML & Wabsite
30 pages
cs3304 4
No ratings yet
cs3304 4
12 pages
Second Phase of The Compiler. Main Task:: Lexical Analyzer Rest of Front End Parser Source Tree Parse Req Token IR
No ratings yet
Second Phase of The Compiler. Main Task:: Lexical Analyzer Rest of Front End Parser Source Tree Parse Req Token IR
13 pages
CSC305 Chapter 2 (Part 1)
No ratings yet
CSC305 Chapter 2 (Part 1)
23 pages
03 Lexing Parsing
No ratings yet
03 Lexing Parsing
78 pages
15 Syntax Parsing
No ratings yet
15 Syntax Parsing
30 pages
Grammars: Definitions Grammars Backus-Naur Form Derivation
No ratings yet
Grammars: Definitions Grammars Backus-Naur Form Derivation
19 pages
Chapter - Three
No ratings yet
Chapter - Three
139 pages
CMP401 Ii
No ratings yet
CMP401 Ii
38 pages
Syntax Semantics
No ratings yet
Syntax Semantics
6 pages
Topic 2 - Syntax and Semantics Lecture Notes
No ratings yet
Topic 2 - Syntax and Semantics Lecture Notes
50 pages
03 Parsing
No ratings yet
03 Parsing
61 pages
Chapter 3
No ratings yet
Chapter 3
22 pages
(Week 4) Syntax Analysis (CFG)
No ratings yet
(Week 4) Syntax Analysis (CFG)
50 pages
BNF
No ratings yet
BNF
30 pages
Syntax & Semantics
No ratings yet
Syntax & Semantics
34 pages
Chapter 3 - Describing Syntax and Semantics: CS-4337 Organization of Programming Languages
No ratings yet
Chapter 3 - Describing Syntax and Semantics: CS-4337 Organization of Programming Languages
58 pages
CD Chapter 2
No ratings yet
CD Chapter 2
39 pages
1.describing Syntax and Semantics
No ratings yet
1.describing Syntax and Semantics
110 pages
Chapter - Three: Syntax Analysis
No ratings yet
Chapter - Three: Syntax Analysis
100 pages
Lecture02 Single Slide Handout
No ratings yet
Lecture02 Single Slide Handout
49 pages
CD Chapter-3
No ratings yet
CD Chapter-3
105 pages
UNIT-I Part 2 Describing Syntax and Semantics
No ratings yet
UNIT-I Part 2 Describing Syntax and Semantics
70 pages
KCA015 Unit2
No ratings yet
KCA015 Unit2
29 pages
Chapter-3 So Far
No ratings yet
Chapter-3 So Far
50 pages
Chapter 3 "Describing Syntax and Semantics"
No ratings yet
Chapter 3 "Describing Syntax and Semantics"
10 pages
What Characterizes A Language: A: BC Foo) A, B (
No ratings yet
What Characterizes A Language: A: BC Foo) A, B (
10 pages
Describing Syntax and Semantics: CS 350 Programming Language Design Indiana University - Purdue University Fort Wayne
No ratings yet
Describing Syntax and Semantics: CS 350 Programming Language Design Indiana University - Purdue University Fort Wayne
73 pages
Syntax Analyser
No ratings yet
Syntax Analyser
30 pages
Chapter No 3 Sytax and Semsetics
No ratings yet
Chapter No 3 Sytax and Semsetics
19 pages
Cse 3 1 PPL Unit 2
No ratings yet
Cse 3 1 PPL Unit 2
10 pages
PL Units1.2
No ratings yet
PL Units1.2
24 pages
Specifying Syntax: Components of A Grammar
No ratings yet
Specifying Syntax: Components of A Grammar
6 pages
Linker and Loader
100% (1)
Linker and Loader
2 pages
Unit - 9 System Construction and Implementation
No ratings yet
Unit - 9 System Construction and Implementation
20 pages
Case Study of Lexical Analyzer PDF
100% (1)
Case Study of Lexical Analyzer PDF
3 pages
Essential Skills For Public Servants-7 Public Policy Formulation and Analysis
No ratings yet
Essential Skills For Public Servants-7 Public Policy Formulation and Analysis
35 pages
Procurement Implemantation Guide
No ratings yet
Procurement Implemantation Guide
28 pages
Undergraduate Thesis and Project Guidelines - BSIT
100% (1)
Undergraduate Thesis and Project Guidelines - BSIT
4 pages
Cryptography: Information and Network Security
No ratings yet
Cryptography: Information and Network Security
44 pages
Cryptography: Information and Network Security
No ratings yet
Cryptography: Information and Network Security
44 pages
Compiler Notes Introduction
No ratings yet
Compiler Notes Introduction
8 pages
cs212 Lect02 63 Inter
No ratings yet
cs212 Lect02 63 Inter
39 pages
Lexical and Syntax Analysis
No ratings yet
Lexical and Syntax Analysis
34 pages
Gap fitAnalysisfromaBusinessProcessPerspective
No ratings yet
Gap fitAnalysisfromaBusinessProcessPerspective
11 pages
ICAO Universal Safety Oversight Audit Programme
No ratings yet
ICAO Universal Safety Oversight Audit Programme
32 pages
Final Essay: Topic: Write An Essay Comparing Thai Legal Procedure Vs Cambodia Legal Procedure
No ratings yet
Final Essay: Topic: Write An Essay Comparing Thai Legal Procedure Vs Cambodia Legal Procedure
3 pages
Mod Menu Log - Com - Ea.game - Pvzfree - Row
No ratings yet
Mod Menu Log - Com - Ea.game - Pvzfree - Row
84 pages
Exercises
No ratings yet
Exercises
2 pages
G. H. Raisoni College of Engineering: Sub-Langueage Processor
No ratings yet
G. H. Raisoni College of Engineering: Sub-Langueage Processor
8 pages
Human Resource Management System: A Case Study On An Information Management Design
No ratings yet
Human Resource Management System: A Case Study On An Information Management Design
6 pages
cs212 Lect04 63 Inter
No ratings yet
cs212 Lect04 63 Inter
23 pages
Compiler Design Question Bank-UNIT 1
No ratings yet
Compiler Design Question Bank-UNIT 1
12 pages
Amulya Report
No ratings yet
Amulya Report
39 pages
Difference Between Compiler and Interpreter
No ratings yet
Difference Between Compiler and Interpreter
2 pages
CD MCQ
No ratings yet
CD MCQ
63 pages
Compiler Construction
No ratings yet
Compiler Construction
26 pages
Assignment 1: 1/ What Research Will U Do Before Your Initial Meeting With The Executive Management Team?
No ratings yet
Assignment 1: 1/ What Research Will U Do Before Your Initial Meeting With The Executive Management Team?
3 pages
Institute of Engineering & Management: Department of Information Technology Workbook (IT605D)
No ratings yet
Institute of Engineering & Management: Department of Information Technology Workbook (IT605D)
74 pages
Of The Text Book: Code Optimization
No ratings yet
Of The Text Book: Code Optimization
19 pages
cs212 Lect03 63 Inter
No ratings yet
cs212 Lect03 63 Inter
23 pages
Stack Vs Heap
No ratings yet
Stack Vs Heap
3 pages
Chapter 7 12 Answer
No ratings yet
Chapter 7 12 Answer
4 pages
Proposal For The Security of The FRA Sheds
No ratings yet
Proposal For The Security of The FRA Sheds
9 pages
Compiler-Interpreter-Compiled and Interpreted Language
No ratings yet
Compiler-Interpreter-Compiled and Interpreted Language
3 pages
Digital Change
No ratings yet
Digital Change
14 pages
Assignment 2 PDF
No ratings yet
Assignment 2 PDF
6 pages
Presentation 3 PDF
No ratings yet
Presentation 3 PDF
6 pages
Assignment Programming Language - CSE 341
No ratings yet
Assignment Programming Language - CSE 341
2 pages
Introduction To System Analysis and Design:: 7
No ratings yet
Introduction To System Analysis and Design:: 7
3 pages
Assignment1 Leng
No ratings yet
Assignment1 Leng
1 page
Assignment 2. Leng
No ratings yet
Assignment 2. Leng
1 page
Assignment1 PDF
No ratings yet
Assignment1 PDF
1 page
RBT Levels Marks L1 10marks: Answer Any Three Full Questions
No ratings yet
RBT Levels Marks L1 10marks: Answer Any Three Full Questions
1 page
Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet

cs212 Lect05 63 Inter

Uploaded by

cs212 Lect05 63 Inter

Uploaded by

CS 212 LECTURE 05

• Backus Naur Form (BNF): a standard notation for

The nonterminal The definition

Nonterminal Symbols: anything that is defined on the left-

• BNF notation is too long.

• Do not start a rule with {…}:

• Rewrite this grammar using Extended BNF.

• Syntax: in Java, an assignment statement is:

• Lexical elements (Lexemes):

Parser (Syntax Analysis)

Optimization and Code Generation

source file sum = x1 + x2;

input stream sum

• Recognize regular expressions

identifiers: x, println, _INIT, ArrayList

• A token is a structure representating a lexeme that

•Lexemes are recognized by the first phase of

• LR algorithms try to match tokens to the right-side

• Grammar rule : Rule

• Grammar rule : Rule

Let try input String : x – 2 * y

You might also like