0% found this document useful (0 votes)

2 views

Syntax Analysis Parsing (1)

Wollo university

Uploaded by

milliyanmuhe

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

Syntax Analysis Parsing (1)

Wollo university

Uploaded by

milliyanmuhe

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Syntax Analysis

3.1 Parsing

Parsing involves analyzing a string of tokens to determine its grammatical structure. It

ensures the source code conforms to the language's syntax rules.

Types of Parsing:
1. Top-down Parsing: Constructs the parse tree from root to leaves (e.g., Recursive Descent
Parsing).
2. Bottom-up Parsing: Constructs the parse tree from leaves to root (e.g., LR Parsing).

Example:
Grammar:
E→E+T|T
T→T*F|F
F → (E) | id

Input: id + id * id

Parsing determines if the input matches the start symbol E.

3.2 Top-down Parsing

Top-down parsing starts at the root of the parse tree and expands non-terminals using
production rules to match the input string from left to right.

Example:
Grammar:
S → aA
A→b
Input: ab
Steps:
1. Start with S.
2. Expand S → aA.
3. Match a.
4. Expand A → b and match b.

Parse Tree:
S
/\
a A
|
b

Case study on parse tree construction (top-down)

In this case study, we'll explore the process of parsing a simple expression language
that supports basic arithmetic operations and parentheses. Parsing is a fundamental
concept in compiler design and language processing, and it involves breaking down
a string of symbols into its syntactic structure.

The Grammar
To define the structure of our expression language, we'll use a context-free
grammar:

expression -> expression '+' term | expression '-' term | term

term -> term '*' factor | term '/' factor | factor
factor -> '(' expression ')' | NUMBER
This grammar defines the following rules:
Expression:
An expression can be an expression added to a term.
An expression can be an expression subtracted from a term.
An expression can be a single term.

Term:
A term can be a term multiplied by a factor.
A term can be a term divided by a factor.
A term can be a single factor.

Factor:
A factor can be an expression enclosed in parentheses.
A factor can be a number.

Parsing Process
To parse an expression, we can use a parsing algorithm like recursive descent or
shift-reduce. These algorithms analyze the input string token by token, building a
parse tree that represents the syntactic structure of the expression.

Example:

Consider the expression 2 * (3 + 4). Here's how it would be parsed:

Match 2 : This matches the term -> term factor rule.

Match 2: This matches the factor -> NUMBER rule.
Match (3 + 4): This matches the factor -> ( expression ) rule.
Match 3 + 4: This matches the expression -> expression + term rule.
Match 3 and 4: Both match the factor -> NUMBER rule.
The resulting parse tree would look like this:

expression
/ \
term expression
/ \ / \
factor * factor + factor
| | | | |
NUMBER NUMBER NUMBER NUMBER NUMBER

Implementation
We can implement a parser using a programming language like C++, Java, or Python.
Parser generators like Yacc or Bison can automate the process of generating a
parser from a grammar specification.

Key Points:
- Tokenization: The input string is broken down into tokens (e.g., numbers,
operators, parentheses).
- Parsing: The tokens are analyzed to determine the syntactic structure of the
expression.
- Semantic Analysis: The meaning of the expression is checked, and type checking is
performed.
- Code Generation: The parsed expression is translated into machine code or
intermediate code.

Conclusion
Parsing is a fundamental technique in compiler design and language processing. By
understanding the grammar of a language and using appropriate parsing
algorithms, we can analyze and interpret input strings. This case study provides a
basic introduction to parsing and demonstrates how to construct a parse tree for a
simple expression language.

3.3.1 Predictive Parsing

Predictive parsing uses lookahead tokens to decide which production to use. It avoids
backtracking and requires the grammar to be LL(1).

Example:
Grammar:
E → T E'
E' → + T E' | ε
T → id

Input: id + id

Steps:
1. Start with E.
2. Expand E → T E'.
3. Match T → id, expand E' → + T E', and match tokens.

3.4.1 Top-down Parsing Principles of CFG

Context-Free Grammar (CFG) consists of terminals, non-terminals, a start symbol, and

production rules.

Principles:
- Always expand the leftmost non-terminal (Leftmost Derivation).
- Eliminate Left Recursion for top-down parsing.

Example:
Original Grammar: A → Aα | β
After Elimination: A → βA', A' → αA' | ε

3.5 Regular Expressions vs CFG

| Aspect | Regular Expressions | Context-Free Grammar (CFG) |

|-----------------|----------------------------|----------------------------|
| Expressiveness | Regular languages | Context-free languages |
| Applications | Token patterns | Programming constructs |
| Parsing | Finite automata | Parsers (Top-down/Bottom-up)|

Example:
Regex: [a-zA-Z_][a-zA-Z_0-9]*
CFG: E → T + E | T

3.6 Recursive Descent Parsing

Recursive Descent Parsing uses recursive functions corresponding to grammar non-

terminals.

Example Grammar:
E→T+E|T
T → id

Pseudo-code:
void E() {
T();
if (lookahead == '+') {
match('+'); E(); }}
3.7 Bottom-Up Parsing

Bottom-up parsing reduces input symbols to the start symbol.

Shift-Reduce Parsing Example:

Grammar:
E→E+T|T
T → id

Input: id + id
Steps:
1. Shift id, reduce to T.
2. Shift +, id, reduce to T.
3. Reduce T + T to E.

Example : Bottom-up parsing is a parsing technique that starts from the input tokens and
gradually reduces them to the start symbol of the grammar. It's like building a house from
the ground up, starting with the foundation and working your way to the roof.

Consider the following grammar:

S -> A B C
A -> a
B -> b
C -> c

And the input string: abc

Parsing Steps:

1. **Shift**:
- The parser reads the first input symbol 'a' and shifts it onto the stack.
- Stack: a
- Input: bc

2. **Reduce**:
- The top of the stack 'a' matches the right-hand side of the production A -> a.
- The parser reduces 'a' to 'A'.
- Stack: A
- Input: bc

3. **Shift**:
- The parser reads the next input symbol 'b' and shifts it onto the stack.
- Stack: Ab
- Input: c

4. **Reduce**:
- The top of the stack 'b' matches the right-hand side of the production B -> b.
- The parser reduces 'b' to 'B'.
- Stack: AB
- Input: c

5. **Shift**:
- The parser reads the next input symbol 'c' and shifts it onto the stack.
- Stack: ABC
- Input: (empty)

6. **Reduce**:
- The top three symbols on the stack 'ABC' match the right-hand side of the production S ->
ABC.
- The parser reduces 'ABC' to 'S'.
- Stack: S
- Input: (empty)

Now, the stack contains only the start symbol 'S', which means the input string has been
successfully parsed.

Key Points:
- **Shift-Reduce Actions**: The parser alternates between shifting input symbols onto the
stack and reducing sequences of symbols to non-terminals.
- **Handle Identification**: The parser must correctly identify the handle (the rightmost
substring that can be reduced) at each step.
- **Parse Table**: A parse table is used to determine the appropriate action (shift or reduce)
based on the current state and input symbol.

Common Bottom-Up Parsing Algorithms:

- **LR(0)**: Simple but limited in its ability to handle ambiguous grammars.
- **SLR(1)**: More powerful than LR(0), but still has limitations.
- **LR(1)**: More powerful than SLR(1), but can be complex to implement.
- **LALR(1)**: A compromise between SLR(1) and LR(1), providing a good balance of
power and simplicity.

NX 3.0.22.00 - NX 4.0.22.00 - Service Manual (Service Manual For Download) PDF
100% (4)
NX 3.0.22.00 - NX 4.0.22.00 - Service Manual (Service Manual For Download) PDF
772 pages
EMP-860-User Mnaual
100% (10)
EMP-860-User Mnaual
93 pages
FortiAnalyzer 7.2.1 Administration Guide
No ratings yet
FortiAnalyzer 7.2.1 Administration Guide
398 pages
CD Unit-Ii
No ratings yet
CD Unit-Ii
56 pages
Compiler Design - Syntax Analysis
No ratings yet
Compiler Design - Syntax Analysis
14 pages
Parser Final
No ratings yet
Parser Final
19 pages
Compiler Engineering
No ratings yet
Compiler Engineering
27 pages
Parser
No ratings yet
Parser
40 pages
What Is Parsing: Parsing Is The Process of Analyzing An Input Sequence in Order
No ratings yet
What Is Parsing: Parsing Is The Process of Analyzing An Input Sequence in Order
9 pages
Unit-2 2.1. Review of CFG Ambiguity of Grammars 2.1.1. Limitations of Regular Language
No ratings yet
Unit-2 2.1. Review of CFG Ambiguity of Grammars 2.1.1. Limitations of Regular Language
44 pages
Compiler Design Unit 2 By Dr. Choudhary Ravi Singh
No ratings yet
Compiler Design Unit 2 By Dr. Choudhary Ravi Singh
46 pages
CD UNIT 3
No ratings yet
CD UNIT 3
76 pages
CSC312 2.docx Updated
No ratings yet
CSC312 2.docx Updated
10 pages
Chapter 3 Syntax Analysis
No ratings yet
Chapter 3 Syntax Analysis
54 pages
CD UNIT-2
No ratings yet
CD UNIT-2
107 pages
Compiler Design Unit-2
No ratings yet
Compiler Design Unit-2
29 pages
Chapter – 3
No ratings yet
Chapter – 3
46 pages
5.ll-lr
No ratings yet
5.ll-lr
53 pages
CD Chapter-3
No ratings yet
CD Chapter-3
105 pages
Compiler 2
100% (1)
Compiler 2
45 pages
UNIT-2(CD)
No ratings yet
UNIT-2(CD)
12 pages
CH 4 Syntax Analysis - Part2
No ratings yet
CH 4 Syntax Analysis - Part2
31 pages
CH2 2
No ratings yet
CH2 2
30 pages
Chapter 3 Syntax Analyzer1
No ratings yet
Chapter 3 Syntax Analyzer1
58 pages
Compiler Theory: (A Simple Syntax-Directed Translator)
No ratings yet
Compiler Theory: (A Simple Syntax-Directed Translator)
50 pages
Compiler 2
No ratings yet
Compiler 2
45 pages
Parsing
No ratings yet
Parsing
33 pages
CD Unit 2 RV
No ratings yet
CD Unit 2 RV
21 pages
PCC-CS501
No ratings yet
PCC-CS501
10 pages
Chapter 3 Compiler Design
No ratings yet
Chapter 3 Compiler Design
42 pages
3 Syntax Analysis
No ratings yet
3 Syntax Analysis
42 pages
Topic #4: Syntactic Analysis (Parsing) : INF 524 Compiler Construction Spring 2011
No ratings yet
Topic #4: Syntactic Analysis (Parsing) : INF 524 Compiler Construction Spring 2011
44 pages
KCA015 Unit2
No ratings yet
KCA015 Unit2
29 pages
Comp Review: Compilers: Fall 1996 Textbook: "Compilers" by Aho, Sethi & Ullman
No ratings yet
Comp Review: Compilers: Fall 1996 Textbook: "Compilers" by Aho, Sethi & Ullman
10 pages
Unit - Ii Topdown Parsing 1. Context-Free Grammars: Definition
No ratings yet
Unit - Ii Topdown Parsing 1. Context-Free Grammars: Definition
26 pages
408
No ratings yet
408
8 pages
2-Role of Parser and Parse Tree-02!08!2024
No ratings yet
2-Role of Parser and Parse Tree-02!08!2024
69 pages
2.2 - Syntax Analysis (Upto Top-down Parsing)
No ratings yet
2.2 - Syntax Analysis (Upto Top-down Parsing)
91 pages
Compiler CH-3
No ratings yet
Compiler CH-3
6 pages
Lecture 7
No ratings yet
Lecture 7
24 pages
CH-3 Syntax Analyzer
No ratings yet
CH-3 Syntax Analyzer
41 pages
ACD-UNIT-4 Notes
No ratings yet
ACD-UNIT-4 Notes
32 pages
2024_CD-Ch03_Syntaxx_Analysis
No ratings yet
2024_CD-Ch03_Syntaxx_Analysis
28 pages
Syntax Analyzer
No ratings yet
Syntax Analyzer
38 pages
Chapter 3
No ratings yet
Chapter 3
180 pages
Grammars
No ratings yet
Grammars
34 pages
Module 2 C D Notes
No ratings yet
Module 2 C D Notes
21 pages
CD Unit 2
No ratings yet
CD Unit 2
19 pages
CD Unit2
No ratings yet
CD Unit2
73 pages
Chapter Four
No ratings yet
Chapter Four
54 pages
Syntax Analysis or Parsing
No ratings yet
Syntax Analysis or Parsing
11 pages
Chapter 3 Syntax Analysis
No ratings yet
Chapter 3 Syntax Analysis
78 pages
u2 (2)
No ratings yet
u2 (2)
18 pages
UNIT-4 Parsing Techniques
No ratings yet
UNIT-4 Parsing Techniques
20 pages
Class Three
No ratings yet
Class Three
74 pages
Group 4&5 Activity Syntax Analyzer
No ratings yet
Group 4&5 Activity Syntax Analyzer
6 pages
Compiler Design Study Material Unit 2nd
No ratings yet
Compiler Design Study Material Unit 2nd
28 pages
CH03
No ratings yet
CH03
57 pages
Syntax Analysis: CD: Compiler Design
No ratings yet
Syntax Analysis: CD: Compiler Design
90 pages
Lecture 09
No ratings yet
Lecture 09
22 pages
UNIT-3 CD Final
No ratings yet
UNIT-3 CD Final
94 pages
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet
MAX 185-253KTL3-X HV Datasheet
No ratings yet
MAX 185-253KTL3-X HV Datasheet
2 pages
CMC Electronics Quality Manual 9100-1001
No ratings yet
CMC Electronics Quality Manual 9100-1001
39 pages
Atomic Energy Research Paper
No ratings yet
Atomic Energy Research Paper
29 pages
Chapter 4 Annex B PDF
No ratings yet
Chapter 4 Annex B PDF
35 pages
A Última Livraria de Londres 1st Edition Madeline Martin instant download
100% (2)
A Última Livraria de Londres 1st Edition Madeline Martin instant download
24 pages
3. PRACTICE TEST_UNIT 11 (File HS)
No ratings yet
3. PRACTICE TEST_UNIT 11 (File HS)
10 pages
Lesson 1 Homework Practice Constant Rate of Change
100% (1)
Lesson 1 Homework Practice Constant Rate of Change
6 pages
UX Survival Guide v1
No ratings yet
UX Survival Guide v1
61 pages
Bca Syallabus i & II Sem (1)
No ratings yet
Bca Syallabus i & II Sem (1)
18 pages
Int 306
No ratings yet
Int 306
19 pages
Afriso Multilyzer STX DB en
No ratings yet
Afriso Multilyzer STX DB en
3 pages
Proakd Transfagarasan v0.8 & v1.2 Traffic Simulation Mod
No ratings yet
Proakd Transfagarasan v0.8 & v1.2 Traffic Simulation Mod
5 pages
Advanced Design and Analysis of Algorithms: Dr. Hajira Jabeen
No ratings yet
Advanced Design and Analysis of Algorithms: Dr. Hajira Jabeen
36 pages
Screenshot 2024-01-22 at 11.19.35
No ratings yet
Screenshot 2024-01-22 at 11.19.35
8 pages
CT050 3 2 WAPP - Assignment - Question
No ratings yet
CT050 3 2 WAPP - Assignment - Question
4 pages
Decision Trees Machine Learning
No ratings yet
Decision Trees Machine Learning
4 pages
A Hybrid Machine Learning Model For Grade Prediction in Online Engineering Education
No ratings yet
A Hybrid Machine Learning Model For Grade Prediction in Online Engineering Education
22 pages
0301-SI-004-02 - Engineering Management CORPORATE
No ratings yet
0301-SI-004-02 - Engineering Management CORPORATE
36 pages
Swarnim Rai
No ratings yet
Swarnim Rai
1 page
Presentation 1
No ratings yet
Presentation 1
27 pages
Chapter 2 PPT Num.I.pptxxxxxx New
No ratings yet
Chapter 2 PPT Num.I.pptxxxxxx New
107 pages
DoppelPaymer Ransomware and Dridex 2
No ratings yet
DoppelPaymer Ransomware and Dridex 2
33 pages
Download
No ratings yet
Download
1 page
ICX204AL 0.8MPix 6.0mm 450mV multiespectral Mono
No ratings yet
ICX204AL 0.8MPix 6.0mm 450mV multiespectral Mono
23 pages
API Gateway APISIX Integrates Keycloak For Authentication - Apache APISIX - Cloud-Native API Gateway
No ratings yet
API Gateway APISIX Integrates Keycloak For Authentication - Apache APISIX - Cloud-Native API Gateway
11 pages
Advanced Web Attacks and Exploitation: Figure 20: Burp Suite Repeater Previous Request and Response
No ratings yet
Advanced Web Attacks and Exploitation: Figure 20: Burp Suite Repeater Previous Request and Response
4 pages
SAPQA
No ratings yet
SAPQA
2 pages

Syntax Analysis Parsing (1)

Uploaded by

Syntax Analysis Parsing (1)

Uploaded by

Syntax Analysis

Parsing involves analyzing a string of tokens to determine its grammatical structure. It

Parsing determines if the input matches the start symbol E.

3.2 Top-down Parsing

Case study on parse tree construction (top-down)

expression -> expression '+' term | expression '-' term | term

Consider the expression 2 * (3 + 4). Here's how it would be parsed:

Match 2 *: This matches the term -> term * factor rule.

3.3.1 Predictive Parsing

3.4.1 Top-down Parsing Principles of CFG

Context-Free Grammar (CFG) consists of terminals, non-terminals, a start symbol, and

3.5 Regular Expressions vs CFG

| Aspect | Regular Expressions | Context-Free Grammar (CFG) |

3.6 Recursive Descent Parsing

Recursive Descent Parsing uses recursive functions corresponding to grammar non-

Bottom-up parsing reduces input symbols to the start symbol.

Shift-Reduce Parsing Example:

Consider the following grammar:

And the input string: abc

Common Bottom-Up Parsing Algorithms:

You might also like

Match 2 : This matches the term -> term factor rule.