
CT - Lecture 2


Compiler phases

LECTURE 2
Phases of Compiler:

Lexical Analysis
• The first phase of the compiler works as a text scanner. This phase scans the source code as a stream of characters and converts it into meaningful lexemes.

• The lexical analyzer represents these lexemes in the form of tokens as:

<token-name, attribute-value>
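As a minimal sketch (the type and token names here are illustrative, not from any particular compiler), such a token could be represented in C as:

    /* Illustrative token representation: token class plus attribute value. */
    enum TokenName { TOK_ID, TOK_NUMBER, TOK_ASSIGN, TOK_SEMI };

    struct Token {
        enum TokenName name;  /* the token class, e.g. TOK_ID                 */
        int attribute;        /* e.g. a symbol-table index or a literal value */
    };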
Phases of Compiler:
Syntax Analysis
• The next phase is called the syntax analysis or parsing.

• It takes the tokens produced by lexical analysis as input and generates a parse tree (or syntax tree).

• In this phase, token arrangements are checked against the grammar of the source language, i.e. the parser checks whether the expressions formed by the tokens are syntactically correct.
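As a minimal sketch of this check, here is a toy recursive-descent parser in C that accepts single-digit arithmetic expressions; the grammar, the input string, and all function names are assumptions made for this example, not part of any real compiler:

    #include <ctype.h>
    #include <stdio.h>
    #include <stdlib.h>

    /* Toy grammar:  expr   -> term { '+' term }
                     term   -> factor { '*' factor }
                     factor -> DIGIT | '(' expr ')'    */
    static const char *p;  /* current position in the input */

    static void expr(void);

    static void error(void) {
        fprintf(stderr, "syntax error near '%c'\n", *p);
        exit(1);
    }

    static void factor(void) {
        if (isdigit((unsigned char)*p)) p++;
        else if (*p == '(') { p++; expr(); if (*p == ')') p++; else error(); }
        else error();
    }

    static void term(void) {
        factor();
        while (*p == '*') { p++; factor(); }
    }

    static void expr(void) {
        term();
        while (*p == '+') { p++; term(); }
    }

    int main(void) {
        p = "1+2*(3+4)";
        expr();
        puts(*p == '\0' ? "syntactically correct" : "syntax error: trailing input");
        return 0;
    }

Each nonterminal of the grammar becomes one function, and a syntax error is reported as soon as the tokens fail to match any production.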
Phases of Compiler:
Semantic Analysis
• Semantic analysis checks whether the parse tree constructed follows the rules of the language.

• For example, it checks that assignments are between compatible data types and reports errors such as adding a string to an integer. The semantic analyzer also keeps track of identifiers, their types and expressions, and checks whether identifiers are declared before use. It produces an annotated syntax tree as output.
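For instance, in C the following declarations illustrate the kinds of checks the semantic analyzer performs:

    int count = 10;         /* accepted: int literal assigned to int          */
    int bad   = "hello";    /* rejected: incompatible types (string to int)   */
    int sum   = count + 5;  /* the analyzer verifies that count is declared
                               and that the operand types are compatible      */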
Phases of Compiler:
Intermediate Code Generation
• After semantic analysis, the compiler generates an intermediate representation of the source code for the target machine. It represents a program for some abstract machine and sits between the high-level language and the machine language. This intermediate code should be generated in such a way that it is easy to translate into the target machine code.
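A widely used intermediate representation is three-address code, where each instruction has at most one operator on its right-hand side. As an illustrative example, the statement a = b + c * 60 might be translated into:

    t1 = c * 60    /* t1 and t2 are compiler-generated temporaries */
    t2 = b + t1
    a  = t2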
Phases of Compiler:
Code Optimization
• The next phase is code optimization of the intermediate code. Optimization can be thought of as removing unnecessary code lines and arranging the sequence of statements so that the program executes faster without wasting resources (CPU, memory).
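Continuing the three-address example above, an optimizer might notice that the temporary t2 is only copied into a and eliminate it (an illustrative copy-propagation step):

    Before optimization:        After optimization:
    t1 = c * 60                 t1 = c * 60
    t2 = b + t1                 a  = b + t1
    a  = t2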
Phases of Compiler:
Code Generation
• In this phase, the code generator takes the optimized representation of the intermediate code and maps it to the target machine language. The code generator translates the intermediate code into a sequence of (generally) relocatable machine code. This sequence of machine instructions performs the same task as the intermediate code.
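As an illustrative final step, the optimized intermediate code above could be mapped to a register-based target; the pseudo-assembly instruction set here is invented for the example:

    LOAD  R1, c     ; R1 <- value of c
    MUL   R1, #60   ; R1 <- c * 60
    LOAD  R2, b     ; R2 <- value of b
    ADD   R2, R1    ; R2 <- b + (c * 60)
    STORE a, R2     ; a  <- R2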
Phases of Compiler:
Symbol Table
• It is a data structure maintained throughout all the phases of a compiler. All the identifiers' names along with their types are stored here. The symbol table makes it easier for the compiler to quickly search for an identifier's record and retrieve it. The symbol table is also used for scope management.
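A minimal sketch of a symbol table in C (a real compiler would more likely use a hash table together with a stack of scopes; all names here are illustrative):

    #include <string.h>

    struct Symbol {
        char name[32];   /* identifier name       */
        char type[16];   /* e.g. "int", "float"   */
        int  scope;      /* scope nesting level   */
    };

    static struct Symbol table[256];
    static int n_symbols = 0;

    /* Linear search from the most recent entry backward;
       returns the index, or -1 if the identifier is undeclared. */
    int lookup(const char *name) {
        for (int i = n_symbols - 1; i >= 0; i--)
            if (strcmp(table[i].name, name) == 0)
                return i;
        return -1;
    }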
Lexical Analysis
• Lexical analysis is a compiler’s first phase, also called linear analysis or scanning. It takes modified source code from language pre-processors, written in the form of sentences. The lexical analyzer reads the source program character by character and groups the characters into meaningful sequences called lexemes. For each lexeme, the lexical analyzer produces as output a token of the form

<token-name, attribute-value>
• If the lexical analyzer finds a token invalid, it generates an error. The lexical analyzer works closely with the syntax analyzer. It reads character streams from the source code, checks for legal tokens, and passes the data to the syntax analyzer on demand.
Lexical Analysis

• Upon receiving the “get next token” command from the parser, the lexical analyzer reads input characters until it can identify the next token.
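A minimal sketch of this pull model in C (the type and function names are illustrative):

    enum TokenName { TOK_ID, TOK_NUMBER, TOK_EOF };
    struct Token { enum TokenName name; int attribute; };

    /* Implemented by the lexical analyzer: consumes input characters
       until one complete lexeme has been recognized. */
    struct Token get_next_token(void);

    /* The parser drives the scanner, pulling one token at a time. */
    void parse(void) {
        struct Token tok = get_next_token();
        while (tok.name != TOK_EOF) {
            /* ... match tok against the grammar productions ... */
            tok = get_next_token();
        }
    }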
Lexical Analysis
• The lexical analyzer reads the source text and, thus, it may perform certain
secondary tasks:
• Eliminate comments and white space, such as blanks, tabs and newline characters.

• Correlate error messages from the compiler with the source program.
Lexical Analysis
• Token: A token is a group of characters having collective meaning: typically a word or punctuation mark, identified by the lexical analyzer and passed to the parser.
• Lexeme: A lexeme is an actual character sequence forming a specific instance of a token, such as num.
• Pattern: A rule describing the set of strings associated with a token, expressed as a regular expression that explains how a particular token can be formed.
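For example, the classic correspondence between the three terms (the entries are illustrative):

    Token     Sample lexemes      Informal pattern
    id        count, num, x       a letter followed by letters and digits
    number    100, 3.14           one or more digits, optional fraction
    relop     <, <=, ==           one of the relational operator symbols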
Lexical Analysis
• For example, in C language, the variable declaration
line
int value = 100;

contains the tokens:

<int, keyword> <value, identifier> < =, operator > <100, constant> <; , symbol>
Lexical Analysis
• What are Tokens?
• A token is the smallest individual element of a program that
is meaningful to the compiler. It cannot be further divided.
Identifiers, strings, keywords, etc., are examples of tokens. In the lexical analysis phase of the compiler, the program is converted into a stream of tokens.
Lexical Analysis
• Different Types of Tokens

• There can be multiple types of tokens. Some of them are-


1. Keywords
• Keywords are words reserved for particular purposes that carry a special meaning for the compiler. Keywords must not be used for naming a variable, function, etc.
Lexical Analysis
• Different Types of Tokens
2. Identifier
• The names given to various components in the program, like
the function's name or variable's name, etc., are called
identifiers. Keywords cannot be identifiers.
3. Operators
• Operators are different symbols used to perform different
operations in a programming language.
Lexical Analysis
• Different Types of Tokens
4. Punctuations
• Punctuations are special symbols that separate different
code elements in a programming language.
• Consider the following line of code in C++ language -
int x = 45;
• The above statement has multiple tokens, which are-
Lexical Analysis
• Different Types of Tokens
• Keywords: int
• Identifiers: x
• Constants: 45
• Operators: =
• Punctuators: ;
Lexical Analysis
• Specifications of Tokens
• Strings
• A string is a finite sequence of characters. These characters can be letters or digits. There is also the empty string, which is denoted by ε.
Lexical Analysis
• Specifications of Tokens
• Language
• A language is a set of strings over some finite alphabet. Because languages are sets, the usual mathematical set operations can be performed on them. Regular languages can be described by means of regular expressions.
Lexical Analysis
• Specifications of Tokens
• Regular Expressions
• The lexical analyzer needs to scan and identify only the valid strings/tokens/lexemes that belong to the language at hand. It searches for the patterns defined by the language rules.
• Regular expressions express such patterns for finite strings of symbols. The grammar defined by regular expressions is known as a regular grammar, and the language defined by a regular grammar is known as a regular language.
Lexical Analysis
• Specifications of Tokens
• Regular Expressions
• A regular expression is an important notation for specifying patterns. Each pattern matches a set of strings, so regular expressions serve as names for sets of strings. Programming language tokens can be described by regular languages.
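As an illustrative example, the patterns for identifiers and unsigned numbers are commonly written as regular definitions:

    letter -> [A-Za-z]
    digit  -> [0-9]
    id     -> letter ( letter | digit )*
    number -> digit+ ( "." digit+ )?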
Thank you for listening
