CC Summary (Slides)
Chapter 1
Compilers viewed from many perspectives:
o Construction:
Single pass.
Multi pass.
Load & go.
o Functional:
Debugging.
Optimizing.
Compilers have two fundamental parts:
o Analysis: decompose source code into intermediate representation.
o Synthesis: generate target code from the intermediate representation.
Important software tools in analysis:
o Structure / syntax directed editors: force "syntactically" correct
code to be entered.
o Pretty printers: standardized version for program structure.
o Static checkers: a quick compilation to detect rudimentary errors.
o Interpreters: real time execution of code (line at a time).
o Text formatters: like LaTeX and TROFF.
o Silicon compilers: take input and generate circuit design.
Analysis task for compilation:
o Lexical Analysis:
Left-to-right scan to identify tokens.
Tokens: sequence of chars that have collective meaning.
Linear action (not recursive)
Identify only individual "words" that are the tokens of the
language.
o Hierarchical (Syntax) Analysis:
Grouping of tokens into meaningful collection
Verify that the "words" are correctly assembled into
"sentences"
Recursion is required to identify structure of an expression.
o Semantic Analysis:
Checking to ensure correctness of components.
Determine whether the sentences have a single, unambiguous
interpretation.
Perform type checking (legality of operands).
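The left-to-right token scan described above can be sketched in Python with the re module; the token names and patterns here are illustrative assumptions, not from the slides:

```python
import re

# Illustrative token classes and patterns (hypothetical, for demonstration).
TOKEN_SPEC = [
    ("NUMBER", r"\d+"),
    ("ID",     r"[A-Za-z_]\w*"),
    ("OP",     r"[+\-*/=]"),
    ("SKIP",   r"\s+"),
]
MASTER = re.compile("|".join(f"(?P<{n}>{p})" for n, p in TOKEN_SPEC))

def tokenize(source):
    """Left-to-right scan: return (token, lexeme) pairs, filtering whitespace."""
    tokens = []
    for m in MASTER.finditer(source):
        if m.lastgroup != "SKIP":
            tokens.append((m.lastgroup, m.group()))
    return tokens
```

Note the linear (non-recursive) action: one pass over the characters, no nesting tracked, which is exactly why recursion is deferred to the syntax-analysis phase.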
Supporting phases for analysis phase:
o Symbol table creation:
A data structure that contains info about tokens created by
the lexical analyzer.
Updated during analysis phase, and used during synthesis
phases.
o Error Handling:
Detection of different errors which correspond to all phases.
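One way to sketch the symbol table described above is a dictionary keyed by lexeme; the attribute names used here are assumptions for illustration:

```python
class SymbolTable:
    """Maps each lexeme to its attributes. Entries are created during the
    analysis phase and read/updated during the synthesis phases."""

    def __init__(self):
        self.entries = {}

    def insert(self, lexeme, **attrs):
        # Create the entry on first sight; later phases update it in place.
        self.entries.setdefault(lexeme, {}).update(attrs)

    def lookup(self, lexeme):
        # Returns None when the lexeme has never been entered.
        return self.entries.get(lexeme)
```
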
Synthesis task for compilation:
o Intermediate code generation:
Abstract machine version of code (independent of
architecture).
o Code optimization:
Find more efficient ways to execute code.
Replace code with more optimal statements.
Has two approaches: Peephole and High-level language.
o Final code generation:
Generate relocatable machine dependent code.
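The slides do not fix a specific intermediate representation; three-address code is one common machine-independent choice, sketched here (the temporary-naming scheme is an assumption):

```python
temp_count = 0
code = []

def new_temp():
    """Allocate a fresh compiler-generated temporary name (t1, t2, ...)."""
    global temp_count
    temp_count += 1
    return f"t{temp_count}"

def emit(op, arg1, arg2):
    """Emit one three-address instruction; return the temporary holding it."""
    t = new_temp()
    code.append(f"{t} = {arg1} {op} {arg2}")
    return t

# (a + b) * c  becomes:  t1 = a + b ; t2 = t1 * c
t1 = emit("+", "a", "b")
t2 = emit("*", t1, "c")
```

Each instruction has at most one operator on the right, which is what makes the form easy to optimize and to map onto real machine instructions later.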
Grammar: set of rules which govern the interdependencies & structure
among the tokens.
Assembly code: mnemonic names are used for instructions, and symbolic
names are used for memory addresses.
Loader: takes relocatable machine code, alters the addresses, and
places the altered instructions into memory.
Link-editor: takes many (relocatable) machine code programs (with cross-
references) and produces a single file.
o Need to keep track of correspondence between variable names and
corresponding addresses in each piece of code.
Compiler construction tools:
o Parser Generators : Produce Syntax Analyzers
o Scanner Generators (LEX) : Produce Lexical Analyzers
o Syntax-directed Translation Engines (YACC): Generate
Intermediate Code
o Automatic Code Generators : Generate Actual Code
o Data-Flow Engines : Support Optimization
Chapter 2
A Context-free Grammar is utilized to describe the syntactic structure of the
language.
A CFG is characterized by:
o A set of Tokens or Terminal symbols.
o A set of Non-Terminals.
o A set of Production rules.
o A Non-Terminal designated as the start symbol.
A parse tree for a CFG has the following properties:
o Root is labeled with the start symbol.
o Leaf node is a token or epsilon.
o Interior node is a Non-Terminal.
An ambiguous grammar does not enforce associativity.
o A non-ambiguous grammar enforcing left associativity has a parse
tree that grows to the left.
o A non-ambiguous grammar enforcing right associativity has a parse
tree that grows to the right.
Syntax-Directed Translation:
o Associate attributes with grammar rules & constructs and translate as
parsing occurs.
o Each production has a set of semantic rules.
o Each grammar symbol has a set of attributes.
The type of tree traversal that is being performed during semantic rules is
postorder depth-first traversal.
Semantic actions are added into the right sides of the productions.
o Example: expr → result | digit { print("action"); }
Parse tree / derivation of a token string occurs in a top down fashion.
o Uses a grammar to check structure of tokens.
o Can be recursive descent or predictive parsing.
o Parser operates by attempting to match tokens in the input stream.
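The recursive-descent parsing with embedded semantic actions described above can be sketched for a small digit/+/− grammar (the grammar choice is an assumption), translating infix input to postfix:

```python
def parse(source):
    """Recursive descent for:
       expr -> term rest
       rest -> '+' term {print '+'} rest | '-' term {print '-'} rest | ε
       term -> digit {print digit}
    Returns the postfix translation of `source`."""
    out = []
    pos = 0

    def lookahead():
        return source[pos] if pos < len(source) else None

    def term():
        nonlocal pos
        ch = lookahead()
        if ch is None or not ch.isdigit():
            raise SyntaxError(f"digit expected at position {pos}")
        out.append(ch)          # semantic action: emit the digit
        pos += 1

    def rest():
        nonlocal pos
        while lookahead() in ("+", "-"):   # tail recursion unrolled to a loop
            op = lookahead()
            pos += 1                        # match the operator token
            term()
            out.append(op)                  # semantic action: emit the operator

    term()
    rest()
    return "".join(out)
```

The actions fire after both operands are matched, which is the postorder depth-first traversal the slides mention.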
Lexical Analysis process functional responsibilities:
o Input token string is broken down.
o White spaces and comments are filtered out.
o Individual tokens with associated values are identified.
o Symbol table is initialized and entries are constructed for each
"appropriate" token.
Reserved words are placed into the symbol table for easy lookup.
Consider A → α:
o FIRST(α) = set of leftmost tokens that appear in α or in strings
generated by α.
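The FIRST computation above can be sketched as an iteration to a fixed point; the grammar encoding (a dict of tuple productions, with ε as the empty tuple) is an assumption:

```python
EPS = "ε"

def first_sets(grammar):
    """Iteratively compute FIRST for every non-terminal.
    grammar: dict mapping non-terminal -> list of productions (tuples of
    symbols). Symbols not in the dict are treated as terminals."""
    first = {nt: set() for nt in grammar}
    changed = True
    while changed:
        changed = False
        for nt, prods in grammar.items():
            for prod in prods:
                before = len(first[nt])
                if prod == ():                    # A -> ε
                    first[nt].add(EPS)
                for sym in prod:
                    if sym not in grammar:        # terminal starts the string
                        first[nt].add(sym)
                        break
                    first[nt] |= first[sym] - {EPS}
                    if EPS not in first[sym]:     # sym cannot vanish; stop
                        break
                else:
                    if prod:                      # every symbol was nullable
                        first[nt].add(EPS)
                if len(first[nt]) != before:
                    changed = True
    return first
```
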
Chapter 3
Separation of Lexical analysis from parsing presents a simpler conceptual
model as it emphasizes:
o High cohesion and low coupling
o Well suited to parallel implementation.
o Increase in compiler efficiency (I/O techniques to enhance lexical
analysis).
o Promoting portability.
Major terms in Lexical Analysis:
o Token:
A classification for a common set of strings.
Examples: <Identifier>, <number>.
o Pattern:
The rules which characterize the set of strings for a token.
Examples: recall files and OS wildcards ([A-Z]*.*).
o Lexeme:
Actual sequence of characters that matches pattern and is
classified by a token.
Examples: Identifiers: x, count, name, etc..
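The token/pattern/lexeme distinction can be seen in one small Python example (the identifier rule here is an illustrative C-style assumption):

```python
import re

# Pattern: the rule characterizing the set of strings for the <identifier> token.
ID_PATTERN = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")

# Each match below is a lexeme: an actual character sequence matching the
# pattern and therefore classified by the <identifier> token.
lexemes = ID_PATTERN.findall("count = old_count + 1")
```
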
Error handling in lexical analysis is very localized, with respect to input
source.
o Errors occur when prefix of remaining input doesn't match any defined
token.
o Possible error recovery actions:
Deleting or inserting input characters.
Replacing or transposing characters.
Lexical Analyzer construction techniques:
o Lexical analyzer generator.
o Hand-code / High-level Language (I/O facilitated by the language).
o Hand-code / Assembly Language (Explicitly manage I/O)
Language: any set of strings over a fixed alphabet.
Regular Expression: a set of rules/techniques used for constructing
sequences of symbols (strings) from an alphabet.
o For a fixed alphabet Σ:
ε is a regular expression denoting {ε}.
If a is in Σ, then a is a regular expression denoting {a}.
All operators are left-associative. Parentheses are dropped as allowed by
precedence rules.
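The precedence rules can be checked concretely with Python's re module (assuming its semantics for these three operators mirror the formal definition, which they do for *, concatenation, and |):

```python
import re

# Closure (*) binds tighter than concatenation, which binds tighter than
# union (|), so the pattern a|bc* groups as  a | (b(c*)).
pattern = re.compile(r"a|bc*")

matches_a = pattern.fullmatch("a") is not None       # left alternative
matches_bccc = pattern.fullmatch("bccc") is not None # b followed by c-closure
matches_ac = pattern.fullmatch("ac") is not None     # not (a|b)c* -- rejected
```
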
Transition Diagrams (TD): used to represent the tokens.
o Attempts to match lexeme to a pattern.
o Each TD has:
States: represented by circles.
Actions: represented by arrows between states.
Start state: beginning of a pattern (arrowhead)
Final state(s): end of pattern (concentric circles)
o Each TD is Deterministic.
Lexical Analyzer matches all keywords/reserved words as ids
o After the match, the symbol table or a special keyword table is
consulted
o Keyword table contains string versions of all keywords and associated
token values
o When a match is found, the token is returned, along with its symbolic
value
o If a match is not found, then it is assumed that an id has been
discovered.
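The match-as-id-then-consult-the-table technique above is short enough to sketch directly; the keyword set and token names are illustrative assumptions:

```python
# Keyword table: string versions of the reserved words and their token values
# (a small, hypothetical selection).
KEYWORDS = {"if": "IF", "else": "ELSE", "while": "WHILE"}

def classify(lexeme):
    """Every identifier-shaped lexeme is first matched as an id; the keyword
    table then decides whether it is really a reserved word."""
    return KEYWORDS.get(lexeme, "ID")
```
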
Finite Automata: a recognizer that takes an input string & determines
whether it's a valid sentence of the language.
o Deterministic: has at most one action for a given input symbol.
Complex but more precise.
o Non-Deterministic: has more than one alternative action for the same
input symbol.
Easy but less precise.
o Both types are used to recognize regular expressions
Each NFA consists of:
o S, a set of states.
o Σ, the symbols of the input alphabet.
o δ, a transition function.
o s0, the start state.
o F, a set of final or accepting states.
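The five components above map directly onto a small simulation; the example NFA (strings over {a, b} ending in "ab") is an illustrative assumption:

```python
# δ maps (state, symbol) to a SET of next states -- the multiple moves on
# (0, "a") are exactly what makes this automaton non-deterministic.
DELTA = {
    (0, "a"): {0, 1},
    (0, "b"): {0},
    (1, "b"): {2},
}
START, ACCEPT = {0}, {2}   # s0 and F; S and Σ are implicit in DELTA

def nfa_accepts(s):
    """Track the SET of states reachable so far; accept if any final state
    is reachable after the whole input is consumed."""
    states = set(START)
    for ch in s:
        states = set().union(*(DELTA.get((q, ch), set()) for q in states))
    return bool(states & ACCEPT)
```
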
Problems in NFA:
o Valid input might not be accepted.
o NFA may behave differently on the same input.
Relationship of NFAs to Compilation:
o Regular Expressions are "Recognized" by NFA
o Regular Expressions are "Patterns" for "Tokens"
o Tokens are building blocks for lexical analysis.
o Lexical analyzer can be described by a collection of NFAs. Each
NFA is for a language token.
Transition diagrams are the states (circles), arcs, and final states.
Transition tables are more suitable to representation within a computer.
Each state in DFA corresponds to a SET of states of the NFA. (same input can
have multiple paths in NFA)
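The set-of-NFA-states idea is the subset construction; a minimal sketch follows (ε-moves are omitted for brevity, so this only covers NFAs without them, such as the ends-in-"ab" example):

```python
def nfa_to_dfa(delta, start, alphabet):
    """Subset construction: each DFA state is a frozenset of NFA states.
    delta maps (state, symbol) -> set of next states."""
    start_set = frozenset(start)
    dfa = {}                 # frozenset -> {symbol: frozenset}
    worklist = [start_set]
    while worklist:
        S = worklist.pop()
        if S in dfa:
            continue
        dfa[S] = {}
        for ch in alphabet:
            # Union of all NFA moves from any state in S on ch.
            T = frozenset().union(*(delta.get((q, ch), set()) for q in S))
            dfa[S][ch] = T
            if T not in dfa:
                worklist.append(T)
    return dfa
```

Each resulting DFA state answers "which NFA states could we be in?", which is why the DFA never needs to guess.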
The syntax of a regular expression is the determining factor for NFA
construction and structure.
Let r be a regular expression with NFA N(r); then:
o N(r) has at most 2 × (#symbols + #operators) states for r.
o N(r) has exactly one start state and one accepting state.
o Each state of N(r) has at most one outgoing edge on some a ∈ Σ, or at
most two outgoing ε-edges.
o Each state must have a unique name.