Module 2

The document discusses the definition and purpose of an assembler. It describes how an assembler works by translating assembly language code into machine code. It covers the different components and design of an assembler including data structures, analysis and synthesis phases, and handling of forward references.

Uploaded by

Khushi Sinha

Assemblers

- NEHA SURTI
Assembler: Definition

• An assembler translates source code written in assembly language into object code.
Language Levels
Machine code
Machine code:

◦ Set of commands directly executable via CPU

◦ Commands in numeric code
◦ Lowest semantic level
Machine code language
Structure: OpCode | OpAddress
◦ Operation code: defines the executable operation
◦ Operand address: specifies the operands (constants, register addresses, storage addresses)
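The opcode/operand-address split can be illustrated with a small encoder. This is only a sketch: the field widths (a 2-digit opcode followed by a 3-digit operand address) and the function names `encode` and `decode` are assumptions made for illustration, not something the slides specify.

```python
# Hypothetical layout: 2-digit opcode followed by a 3-digit operand address.
def encode(opcode: int, address: int) -> str:
    """Pack the two fields into a five-digit decimal machine word."""
    return f"{opcode:02d}{address:03d}"


def decode(word: str) -> tuple:
    """Split a five-digit machine word back into (opcode, address)."""
    return int(word[:2]), int(word[2:])


word = encode(4, 103)
print(word)          # 04103
print(decode(word))  # (4, 103)
```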

Elements of Assembly Language Programming
An Assembly language is a
◦ machine dependent,
◦ low level Programming language specific to a certain computer system.

Three features when compared with machine language are

1. Mnemonic Operation Codes
2. Symbolic operands
3. Data declarations
Elements of Assembly Language Programming
Mnemonic operation codes: eliminate the need to memorize numeric operation codes.

Symbolic operands: symbolic names can be associated with data or instructions and used as operands in assembly statements, so the programmer need not know the details of memory bindings.

Data declarations: data can be declared in a variety of notations, including decimal notation, which avoids manual conversion of constants into their internal representation.
Assembly language-structure

<Label> <Mnemonic> <Operand> <Comment>

Label
◦ Symbolic name for an assembler address (the command address at machine level)
Mnemonic
◦ Symbolic description of an operation
Operands
◦ Contain variables or addresses, if necessary
Comments
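Splitting a statement into these four fields can be sketched as follows. The mnemonic set, the use of `;` as the comment marker, and the function name `parse_statement` are assumptions for illustration; a real assembler would consult its mnemonics table to decide whether the first token is a label.

```python
# Mnemonics assumed for this sketch only.
MNEMONICS = {"START", "END", "MOVER", "MULT", "ADD", "STOP", "DS", "DC"}


def parse_statement(line: str) -> dict:
    """Split one statement into label, mnemonic, operands and comment."""
    code, _, comment = line.partition(";")   # assumed: ';' starts a comment
    tokens = code.replace(",", " ").split()
    label = None
    # If the first token is not a known mnemonic, treat it as a label.
    if tokens and tokens[0].rstrip(":") not in MNEMONICS:
        label = tokens.pop(0).rstrip(":")
    mnemonic = tokens.pop(0) if tokens else None
    return {
        "label": label,
        "mnemonic": mnemonic,
        "operands": tokens,
        "comment": comment.strip(),
    }


print(parse_statement("MOVER BREG, ONE"))
print(parse_statement("A DS 1 ; reserves one word"))
```

The first call yields no label and the operands `BREG` and `ONE`; the second yields the label `A`, the mnemonic `DS` and the operand `1`.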

Mnemonic Operation Codes
Each statement has two operands, first operand is always a register, and
second operand refers to a memory word using a symbolic name and
optional displacement.
Assembly Language Statements
An assembly program contains three kinds of statements:
Imperative Statements
Declaration Statements
Assembler Directives

Imperative Statements: They indicate an action to be performed during the execution of the assembled program. Each imperative statement is translated into one machine instruction.
Assembly Language Statements
Declaration Statements: syntax is as follows:
[Label] DS <constant>
[Label] DC '<value>'
◼ The DS (declare storage) statement reserves areas of memory and associates names with them.
◼ Ex:
A DS 1 ; reserves a memory area of 1 word and associates the name A with it
G DS 200 ; reserves a block of 200 words and associates the name G with the first word of the block (other words are accessed as G+6 etc.)
◼ The DC (declare constant) statement constructs memory words containing constants.
◼ Ex:
ONE DC '1' ; associates the name ONE with a memory word containing the value 1
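A minimal sketch of how DS and DC affect memory allocation, using the three declarations above. The start address 500 and the function name `allocate` are assumptions for illustration; the slides do not say where these declarations are placed.

```python
def allocate(declarations, start):
    """Assign addresses to DS/DC statements.

    Returns (symbol table, memory image of the constants).
    """
    symtab, memory, lc = {}, {}, start
    for label, op, operand in declarations:
        symtab[label] = lc                  # the name is bound to the first word
        if op == "DS":
            lc += int(operand)              # reserve words, contents undefined
        elif op == "DC":
            memory[lc] = int(operand.strip("'"))   # one word holding the constant
            lc += 1
        else:
            raise ValueError(f"not a declaration statement: {op}")
    return symtab, memory


symtab, memory = allocate(
    [("A", "DS", "1"), ("G", "DS", "200"), ("ONE", "DC", "'1'")],
    start=500,   # hypothetical address
)
print(symtab)           # {'A': 500, 'G': 501, 'ONE': 701}
print(symtab["G"] + 6)  # 507, the word referred to as G+6
print(memory)           # {701: 1}
```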
Assembly Language Statements
Assembler Directive
Assembler directives instruct the assembler to perform certain actions
during the assembly of a program.

Some assembler directives are described in the following:

1) START <constant>
This directive indicates that the first word of the target program generated
by the assembler should be placed in the memory word having address
<constant>.

2) END [<operand spec>]

This directive indicates the end of the source program. The optional <operand spec> indicates the address of the instruction where execution of the program should begin.
Advantages of Assembly Language
• The primary advantages of assembly language programming
over machine language programming are due to the use of
symbolic operand specifications.
(in comparison to machine language program)
• Assembly language programming holds an edge over HLL
programming in situations where it is desirable to use
architectural features of a computer.
(in comparison to high level language program)
Fundamentals of LP
Language processing = analysis of source program + synthesis of target
program
Analysis of the source program is based on the specification of the source language, which consists of:
◦ Lexical rules: formation of valid lexical units(tokens) in the source language
◦ Syntax rules : formation of valid statements in the source language
◦ Semantic rules: associate meaning with valid statements of the language
Fundamentals of LP
Synthesis of target program is construction of target language
statements
◦ Memory allocation : generation of data structures in the target program
◦ Code generation
A simple Assembly Scheme
There are two phases in specifying an assembler:
1. Analysis Phase
2. Synthesis Phase(the fundamental information
requirements will arise in this phase)
A simple Assembly Scheme
There are four steps involved to design the specification of
an assembler:
1. Identify information necessary to perform a task.
2. Design a suitable data structure to record info.
3. Determine processing necessary to obtain and maintain
the info.
4. Determine processing necessary to perform the task
Synthesis Phase: Example
Consider the following statement:
MOVER BREG, ONE
The following information is needed to synthesize the machine instruction for this statement:
1. Address of the memory word with which the name ONE is associated [depends on the source program, hence made available by the analysis phase].

2. Machine operation code corresponding to MOVER [does not depend on the source program but on the assembly language, hence the synthesis phase can determine this information for itself].
Note: Based on the above discussion, the two data structures required during the synthesis phase are described next.
Data structures in synthesis phase
Symbol Table --built by the analysis phase
◦ The two primary fields are name and address of the symbol
used to specify a value.
Mnemonics Table --already present
◦ The two primary fields are mnemonic and opcode, along with
length.
Synthesis phase uses these tables to obtain
◦ The machine address with which a name is associated.
◦ The machine op code corresponding to a mnemonic.
The tables have to be searched with the
◦ Symbol name and the mnemonic as keys
Analysis Phase
Primary function of the Analysis phase is to build the
symbol table.
◦ It must determine the addresses with which the symbolic names
used in a program are associated
◦ It is possible to determine some addresses directly, like the address of the first instruction in the program (i.e., the START address)
◦ Other addresses must be inferred
◦ To determine the addresses of the symbolic names we need to fix
the addresses of all program elements preceding it through
Memory Allocation.
To implement memory allocation a data structure called location counter is
introduced.
Analysis Phase – Implementing memory allocation
LC(location counter) :
◦ is always made to contain the address of the next memory word in the target program.
◦ It is initialized to the constant specified at the START statement.

When a LABEL (e.g. N, AGAIN, SUM) is encountered, the analysis phase
◦ enters the LABEL and the contents of LC in a new entry of the symbol table.
◦ then finds the number of memory words required by the assembly statement and updates the LC contents.

To update the contents of the LC, the analysis phase needs to know the lengths of the different instructions.
◦ This information is kept in the mnemonics table, which is extended with a field called length.

The processing involved in maintaining the LC is referred to as LC processing.

Example
START 100
MOVER BREG, N     LC = 100 (1 word)
MULT BREG, N      LC = 101 (1 word)
STOP              LC = 102 (1 word)
N DS 5            LC = 103

Symbol    Address
N         103

Since instructions can take different amounts of memory, the length of each instruction is stored in the "length" field of the mnemonics table.

Mnemonic    Opcode    Length
MOVER       04        1
MULT        03        1
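LC processing for the example above can be sketched in a few lines. The lengths of MOVER and MULT are taken from the mnemonics table; the length of 1 for STOP is inferred from the LC trace in the example, and the tuple representation of statements and the name `build_symbol_table` are assumptions for illustration.

```python
# Instruction lengths in words; STOP's length is inferred from the LC trace.
LENGTH = {"MOVER": 1, "MULT": 1, "STOP": 1}


def build_symbol_table(program):
    """Analysis phase: run LC processing and record (label, LC) pairs."""
    symtab, lc = {}, 0
    for label, mnemonic, operand in program:
        if mnemonic == "START":
            lc = int(operand)          # LC is initialized from the START constant
            continue
        if mnemonic == "END":
            break
        if label:
            symtab[label] = lc         # enter (symbol, <LC>) into the symbol table
        if mnemonic == "DS":
            lc += int(operand)         # DS reserves <constant> words
        else:
            lc += LENGTH[mnemonic]     # length comes from the mnemonics table
    return symtab, lc


program = [
    (None, "START", "100"),
    (None, "MOVER", "BREG, N"),
    (None, "MULT", "BREG, N"),
    (None, "STOP", ""),
    ("N", "DS", "5"),
]
symtab, lc = build_symbol_table(program)
print(symtab)  # {'N': 103}
print(lc)      # 108
```

The final LC value of 108 is the first free word after the five words reserved for N.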
Data structures of an assembler during analysis and synthesis phases
Data structures
Mnemonics table is a fixed table which is merely accessed by the
analysis and synthesis phases
Symbol table is constructed during analysis and used during
synthesis
Tasks Performed : Analysis Phase
Isolate the label, mnemonic opcode and operand fields of a statement.

If a label is present, enter (symbol, <LC>) into the symbol table.

Check validity of the mnemonic opcode using mnemonics table.

Update value of LC.

Tasks Performed : Synthesis Phase
Obtain machine opcode corresponding to the mnemonic from the
mnemonic table.

Obtain address of the memory operand from symbol table.

Synthesize a machine instruction or the machine form of a constant, depending on the statement.
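The three synthesis tasks can be sketched for a statement such as MOVER BREG, N. The opcodes and the address of N come from the earlier example; the register codes and the "opcode register address" output format are assumptions for illustration, as is the function name `synthesize`.

```python
MNEMONICS = {"MOVER": ("04", 1), "MULT": ("03", 1)}   # (opcode, length)
REGISTERS = {"AREG": 1, "BREG": 2, "CREG": 3}         # assumed register codes
SYMTAB = {"N": 103}                                    # built by the analysis phase


def synthesize(mnemonic: str, register: str, symbol: str) -> str:
    """Build one machine instruction from the two tables."""
    opcode, _length = MNEMONICS[mnemonic]   # task 1: opcode from the mnemonics table
    address = SYMTAB[symbol]                # task 2: operand address from the symbol table
    # task 3: put the fields together (hypothetical textual format)
    return f"{opcode} {REGISTERS[register]} {address:03d}"


print(synthesize("MOVER", "BREG", "N"))  # 04 2 103
print(synthesize("MULT", "BREG", "N"))   # 03 2 103
```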
Assembler’s functions
Convert mnemonic operation codes to their machine language
equivalents
Convert symbolic operands to their equivalent machine addresses
Build the machine instructions in the proper format
Convert the data constants to internal machine representations
Write the object program and the assembly listing
Assembler: Design
• Designing an assembler involves:
– Scanning (tokenizing)
– Parsing (validating the instructions)
– Creating the symbol table
– Resolving the forward references
– Converting into the machine language
Assembler Design
• Pass of a language processor – one complete scan of the source
program
• Assembler Design can be done in:
– Single pass
– Two pass
• Single pass assembler:
– Does everything in a single pass
– Cannot resolve forward references directly; it needs a technique such as backpatching
• Two pass assembler:
– Does the work in two passes
– Resolves forward references in the second pass
Backpatching
The problem of forward references is handled using a process
called backpatching
◦ Initially, the operand field of an instruction containing a forward
reference is left blank
◦ Ex: MOVER BREG, ONE can be only partially synthesized since
ONE is a forward reference
◦ The instruction opcode and address of BREG will be assembled
to reside in location 101
◦ To insert the second operand's address later, an entry is added to the Table of Incomplete Instructions (TII)
◦ A TII entry is a pair (<instruction address>, <symbol>), which is (101, ONE) here
Backpatching
◦ When the END statement is processed, the symbol table contains the addresses of all symbols defined in the source program
◦ The TII contains information about all forward references
◦ Now each entry in TII is processed to complete the instruction
◦ Ex: the entry (101, ONE) would be processed by obtaining the
address of ONE from symbol table and inserting it in the
operand field of the instruction with assembled address 101.
◦ Alternatively, when definition of some symbol L is encountered,
all forward references to L can be processed
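Backpatching with a TII can be sketched as a single scan followed by a fix-up loop. This is an illustration under stated assumptions: the register codes, the list representation of an instruction, the function name `assemble_single_pass`, and the start address 101 (chosen so that the TII entry matches the (101, ONE) example) are not given in the slides.

```python
OPCODES = {"MOVER": "04", "MULT": "03"}          # opcodes from the mnemonics table
REGISTERS = {"AREG": 1, "BREG": 2, "CREG": 3}    # assumed register codes


def assemble_single_pass(program, start):
    """One scan over (label, mnemonic, operands) triples, then backpatching."""
    symtab, tii, code = {}, [], {}
    lc = start
    for label, mnemonic, operands in program:
        if label:
            symtab[label] = lc
        if mnemonic == "DC":
            code[lc] = int(operands[0].strip("'"))
            lc += 1
        elif mnemonic == "DS":
            lc += int(operands[0])
        else:
            register, symbol = operands
            address = symtab.get(symbol)         # None marks a forward reference
            if address is None:
                tii.append((lc, symbol))         # remember the incomplete instruction
            code[lc] = [OPCODES[mnemonic], REGISTERS[register], address]
            lc += 1
    # End of program: every symbol is defined now, so fill in the blank operands.
    for instruction_address, symbol in tii:
        code[instruction_address][2] = symtab[symbol]
    return code, symtab, tii


program = [
    (None, "MOVER", ["BREG", "ONE"]),   # ONE is not defined yet
    ("ONE", "DC", ["'1'"]),
]
code, symtab, tii = assemble_single_pass(program, start=101)
print(tii)        # [(101, 'ONE')]
print(code[101])  # ['04', 2, 102]
```

The sketch raises `KeyError` during the fix-up loop if a symbol is never defined; a real assembler would report that as an error instead.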
Assembler Design
• Symbol Table:
– This is created during pass 1
– All the labels of the instructions are symbols
– Table has entry for symbol name, address value.
• Forward reference:
– A reference to a symbol that is defined later in the program is called a forward reference.
– The symbol table holds no address for such a symbol at the point in pass 1 where it is first used.
Assembler Design
• Assembler directives are pseudo instructions.
– They provide instructions to the assembler itself.
– They are not translated into machine operation
codes.
Assembler Design
• First pass:
– Scan the code by separating the symbol, mnemonic
op code and operand fields
– Build the symbol table
– Perform LC processing
– Construct intermediate representation
• Second Pass:
– Resolves forward references
– Converts the intermediate representation to machine code
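The division of work between the two passes might look like the sketch below. The shape of the intermediate representation (a list of `(address, mnemonic, operands)` tuples), the register codes, and the names `pass_one` and `pass_two` are assumptions for illustration; only MOVER and MULT are supported, with the opcodes and lengths from the mnemonics table.

```python
OPTAB = {"MOVER": ("04", 1), "MULT": ("03", 1)}   # (opcode, length)
REGISTERS = {"AREG": 1, "BREG": 2, "CREG": 3}     # assumed register codes


def pass_one(program):
    """Build the symbol table and an intermediate form of the instructions."""
    symtab, intermediate, lc = {}, [], 0
    for label, mnemonic, operands in program:
        if mnemonic == "START":
            lc = int(operands[0])
            continue
        if mnemonic == "END":
            break
        if label:
            symtab[label] = lc
        if mnemonic == "DS":
            lc += int(operands[0])
            continue
        intermediate.append((lc, mnemonic, operands))
        lc += OPTAB[mnemonic][1]
    return symtab, intermediate


def pass_two(symtab, intermediate):
    """All symbols are known now, so forward references resolve by lookup."""
    target = []
    for address, mnemonic, (register, symbol) in intermediate:
        opcode = OPTAB[mnemonic][0]
        word = f"{opcode} {REGISTERS[register]} {symtab[symbol]:03d}"
        target.append((address, word))
    return target


program = [
    (None, "START", ["100"]),
    (None, "MOVER", ["BREG", "N"]),   # N is a forward reference here
    (None, "MULT", ["BREG", "N"]),
    ("N", "DS", ["5"]),
    (None, "END", []),
]
symtab, intermediate = pass_one(program)
print(pass_two(symtab, intermediate))
# [(100, '04 2 102'), (101, '03 2 102')]
```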
Two Pass Assembler
Read from input line
◦ LABEL, OPCODE, OPERAND
Machine Operation Table (MOT)
Pseudo Operation Table (POT)
Symbol table & Literal table
Base table (BT)
Pass 1 Flowchart
Pass 2 Flowchart
Source Program Target Code
START 200
MOVER AREG, A
ADD BREG, B
LOOP: PRINT A
MUL CREG, ='5'
READ ='7'
LTORG
ADD CREG, X
A DC 2
B DS 3
X DC 3
END
Data Structures in Pass I
OPTAB – a table of mnemonic op codes
◦ Contains mnemonic op code, class and mnemonic info
◦ Class field indicates whether the op code corresponds to
◦ an imperative statement (IS),
◦ a declaration statement (DL) or
◦ an assembler Directive (AD)
◦ For IS, mnemonic info field contains the pair ( machine opcode,
instruction length)
◦ Else, it contains the id of the routine to handle the declaration or
a directive statement
◦ The routine processes the operand field of the statement to
determine the amount of memory required and updates LC and
the SYMTAB entry of the symbol defined
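One way to picture an OPTAB with a class field is a dictionary whose entries carry either an (opcode, length) pair or a handler routine. This is a sketch only: storing the Python function itself in place of a routine id, the handler names, and the restriction to LC updates (SYMTAB updates are left out) are simplifying assumptions.

```python
def handle_ds(operand, lc):
    """DS reserves <constant> words."""
    return lc + int(operand)


def handle_dc(operand, lc):
    """DC occupies one word holding the constant."""
    return lc + 1


def handle_start(operand, lc):
    """START sets LC to the given constant."""
    return int(operand)


# mnemonic -> (class, mnemonic info)
OPTAB = {
    "MOVER": ("IS", ("04", 1)),      # imperative: (machine opcode, length)
    "MULT":  ("IS", ("03", 1)),
    "DS":    ("DL", handle_ds),      # declaration: routine that handles it
    "DC":    ("DL", handle_dc),
    "START": ("AD", handle_start),   # assembler directive: routine that handles it
}


def next_lc(mnemonic, operand, lc):
    """Return the LC value after processing one statement."""
    statement_class, info = OPTAB[mnemonic]
    if statement_class == "IS":
        _opcode, length = info
        return lc + length
    return info(operand, lc)         # DL and AD entries delegate to their routine


lc = next_lc("START", "200", 0)
lc = next_lc("MOVER", "AREG, A", lc)
lc = next_lc("DS", "3", lc)
print(lc)  # 204
```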
Data Structures in Pass I
SYMTAB - Symbol Table
◦ Contains address and length
LOCCTR - Location Counter
LITTAB – a table of literals used in the program
◦ Contains literal and address
◦ Literals are allocated addresses starting with the current value in
LC and LC is incremented, appropriately
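Literal allocation can be sketched as a walk over the LITTAB entries that have no address yet, as would happen at an LTORG or END statement. The entry layout, the one-word size per literal, the function name `allocate_literals`, and the LC value 205 are assumptions for illustration; 205 is where LTORG would fall in the sample program only if every preceding instruction occupied one word, which the slides do not state.

```python
def allocate_literals(littab, lc):
    """Give each unallocated literal the next free word; return the new LC."""
    for entry in littab:
        if entry["address"] is None:
            entry["address"] = lc
            lc += 1                  # assumed: each literal occupies one word
    return lc


littab = [
    {"literal": "='5'", "address": None},
    {"literal": "='7'", "address": None},
]
lc = allocate_literals(littab, lc=205)   # hypothetical LC at the LTORG statement
print([(e["literal"], e["address"]) for e in littab])
# [("='5'", 205), ("='7'", 206)]
print(lc)  # 207
```

Calling the function again leaves already-allocated literals untouched, which is what allows a later END statement to allocate only the literals that appeared after the last LTORG.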
OPTAB (operation code table)
Content
◦ Mnemonic opcode, class and mnemonic info
Characteristic
◦ static table
Implementation
◦ array or hash table, easy for search
SYMTAB (symbol table)
Content
◦ label name, value, flag, (type, length) etc.
Characteristic
◦ dynamic table (insert, delete, search)
Implementation
◦ hash table; labels are non-random keys, so the hashing function needs care
