0% found this document useful (0 votes)

4K views27 pages

Assemblers II

machine independent features of assemblers

Uploaded by

srinivas

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4K views27 pages

Assemblers II

machine independent features of assemblers

Uploaded by

srinivas

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 27

Chapter 3 ASSEMBLERS-II

cks
Chapter - 3
ASSEMBLERS-II

ASSEMBLERS-2

1
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

3.2 Machine-Independent Assembler features:

The following are the list of features which do not depend on the architecture of the machine.
 Literals
 Expressions
 Program blocks
 Control sections

Literals
It is often convenient for the programmer to be able to write the value of a constant operand as a
part of the instruction that uses it. This avoids the defining the constants elsewhere in the
program and make up a label for it. Such a notation is called as literal.

Consider the following example

.
:
LDA FIVE
:
FIVE WORD 5
:
It is convenient to write the value of a constant operand as a part of instruction.

cks
:
LDA =X’ 05’
:
A literal is identified with the prefix =, followed by a specification of the literal value.

The example above shows a 3-byte operand whose value is a character string EOF. The
object code for the instruction is also mentioned. It shows the relative displacement value of the
location where this value is stored. In the example the value is at location (002D) and hence the
displacement value is (010).
As another example the given statement below shows a 1-byte literal with the
hexadecimal value ‘05’.

2
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

215 1062 WLOOP TD =X’05’ E32011

Literals vs. Immediate Operands

It is important to understand the difference between a literal and immediate operand.
1. With immediate addressing, the operand value is assembled as part of the machine instruction.
2. With a literal, the assembler generates the specified value as a constant at some other memory
location. The address of this generated constant is used as target address for the machine
instruction.

All of the literal operands used in a program are gathered together into one or more literal pools.
Normally literals are placed into a pool at the end of the program. In some cases, it is desirable to

cks
place literals into a pool at some other location in the object program.

When the assembler encounters a LTORG statement, it creates a literal pool that contains all of
the literal operands used since the previous LTORG (or the beginning of the program). This
literal pool is placed in the object program at the location where the LTORG directive was
encountered.

Of course, literals placed in a pool by LTORG will not be repeated in the pool at the end of the
program. If we had not used the LTORG statement, the literal =C’EOF’ would be placed in the
pool at the end of the program. Most assemblers recognize duplicate literals – that is, the same
literal used in more than one place in the program – and store only one copy of the specified data
value.

How to find the duplicate literals?

The easiest way to recognize duplicate literals is by comparison of the character strings defining
them (the string =X’05’). The basic data structure that assembler handles literal operands is
literal table LITTAB. For each literal used, this table contains the literal name, the operand value
and length, and the address assigned to the operand when it is placed in a literal pool. LITTAB
is often organized as a hash table, using the literal name or value as the key.

Format of LITTAB

3
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

NAME OPERAND VALUE LENGTH ADDRESS

=X ‘05’ 05 1 1056
=C’EOF’ 454F46 3 002D

During pass 1, the assembler searches LITTAB for the specified literal name (or value). If the
literal is already present in the table, no action is needed. If it is not present, the literal is added to
LITTAB (leaving the address unassigned).

During pass 2, the operand address is obtained by searching LITTAB for each literal operand
encountered. Generate Modification record for literals that represent an address in the program.

Symbol-Defining Statements
Most assemblers provide an assembler directive that allows the programmer to define symbols
and specify their values. The directive used for this EQU (Equate).
The general form of the statement is

Symbol EQU value

This statement defines the given symbol (i.e., entering in the SYMTAB) and assigning to it the
value specified. The value can be a constant or an expression involving constants and any other
symbol which is already defined. One common usage is to define symbolic names that can be

For example

+LDT #4096
cks
used to improve readability in place of numeric values.

This loads the register T with immediate value 4096, this does not clearly what exactly this value
indicates. If a statement is included as:

MAXLEN EQU 4096 and then

+LDT #MAXLEN

Then it clearly indicates that the value of MAXLEN is some maximum length value.
When the assembler encounters EQU statement, it enters the symbol MAXLEN along with its
value in the symbol table. During LDT the assembler searches the SYMTAB for its entry and its
equivalent value as the operand in the instruction. The object code generated is the same for both
the options discussed, but is easier to understand. If the maximum length is changed from 4096
to 1024, it is difficult to change if it is mentioned as an immediate value wherever required in the
instructions. We have to scan the whole program and make changes wherever 4096 is used. If we
mention this value in the instruction through the symbol defined by EQU, we may not have to
search the whole program but change only the value of MAXLENGTH in the EQU statement
(only once).

The user-defined symbols in assembler language programs appear as labels on instructions or

data areas. The value of such a label is the address assigned to the statement on which it appears.

4
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

Most assemblers provide an assembler directive that allows the programmer to define symbols
and specify their value. The assembler directive generally used is EQU.

Another common usage of EQU statement is for defining values for the general-purpose
registers. The assembler can use the mnemonics for register usage like a-register A , X – index
register and so on. But there are some instructions which require numbers in place of names in
the instructions. For example in the instruction RMO 0, 1 instead of RMO A,X. The
programmer can assign the numerical values to these registers using EQU directive.
A EQU 0
X EQU 1 and so on

These statements will cause the symbols A, X, L… to be entered into the symbol table
with their respective values. An instruction RMO A, X would then be allowed. As another usage
if in a machine that has many general purpose registers named as R1, R2,…, some may be used
as base register, some may be used as accumulator. Their usage may change from one program
to another. In this case we can define these requirement using EQU statements.

BASE EQU R1
INDEX EQU R2
COUNT EQU R3

One restriction with the usage of EQU is whatever symbol occurs in the right hand side of the

BETA
ALPHA
EQU
RESW
cks
EQU should be predefined. For example, the following statement is not valid:

ALPHA
1

As the symbol ALPHA is assigned to BETA before it is defined. The value of ALPHA is not
known.

ORG Statement:

This directive can be used to indirectly assign values to the symbols. The directive is
usually called ORG (for origin). Its general format is:

ORG value

Where value is a constant or an expression involving constants and previously defined symbols.
When this statement is encountered during assembly of a program, the assembler resets its
location counter (LOCCTR) to the specified value. Since the values of symbols used as labels are
taken from LOCCTR, the ORG statement will affect the values of all labels defined until the
next ORG is encountered. ORG is used to control assignment storage in the object program.
Sometimes altering the values may result in incorrect assembly.

5
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

ORG can be useful in label definition. Suppose we need to define a symbol table with the
following structure:
SYMBOL 6 Bytes
VALUE 3 Bytes
FLAG 2 Bytes

The table looks like the one given below.

SYMBOL VALUE FLAGS

STAB
(100 entries)

. . .
. . .
. . .

The symbol field contains a 6-byte user-defined symbol; VALUE is a one-word representation of
the value assigned to the symbol; FLAG is a 2-byte field specifies symbol type and other
information. The space for the table can be reserved by the statement:
STAB RESB 1100

cks
If we want to refer to the entries of the table using indexed addressing, place the offset value of
the desired entry from the beginning of the table in the index register. To refer to the fields
SYMBOL, VALUE, and FLAGS individually, we need to assign the values first as shown
below:

SYMBOL EQU STAB

VALUE EQU STAB+6
FLAGS EQU STAB+9

To retrieve the VALUE field from the table indicated by register X, we can write a statement:
LDA VALUE, X

Using Indexed Addressing:

Use LOCCTR to address fields

STAB RESB 1100
Refer to each field
SYMBOL EQU STAB
VALUE EQU STAB+6
FLAGS EQU STAB+9

Ex: To fetch the VALUE field

LDA VALUE, X (*Last ORG sets LOCCTR back)

6
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

The same thing can also be done using ORG statement in the following way:

Using ORG:

Reserve space

The first statement allocates 1100 bytes of memory assigned to label STAB. In the second
statement the ORG statement initializes the location counter to the value of STAB. Now the
LOCCTR points to STAB. The next three lines assign appropriate memory storage to each of
SYMBOL, VALUE and FLAG symbols. The last ORG statement reinitializes the LOCCTR to a
new value after skipping the required number of memory for the table STAB (i.e., STAB+1100).

Notice that two-pass assembler design requires that all symbols be defined during Pass 1.
Example:

ALPHA RESW 1 BETA EQU ALPHA

BETA EQU ALPHA

Another example:
cks ALPHA RESW 1

(*BETA cannot be assigned a value)

The sequence of statements cannot be resolved by an ordinary two-pass assembler regardless of

how the work is divided between passes.

ALPHA EQU BETA

BETA EQU DELTA
DELTA RESW 1

Expressions
Most assemblers allow the use of expressions. Each such expression must be evaluated by the
assembler to produce a single operand address or value. Assemblers generally arithmetic
expressions formed according to the normal rules using arithmetic operators +, - *, /. Division is
usually defined to produce an integer result. Individual terms may be constants, user-defined
symbols, or special terms. The only special term used is * ( the current value of location counter)
which indicates the value of the next unassigned memory location. Thus the statement

BUFFEND EQU *

7
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

Assigns a value to BUFFEND, which is the address of the next byte following the buffer area.
Some values in the object program are relative to the beginning of the program and some are
absolute (independent of the program location, like constants).

Hence, expressions are classified as either absolute expression or relative expressions depending
on the type of value they produce.

Relative: means relative to the beginning of the program. Labels on instructions and data areas,
and references to the location counter value, are relative terms.

Absolute: means independent of program location. A constant is an absolute term.

Absolute Expressions: The expression that uses only absolute terms is absolute expression.
Absolute expression may contain relative term provided the relative terms occur in pairs with
opposite signs for each pair. Example:

MAXLEN EQU BUFEND-BUFFER

In the above instruction the difference in the expression gives a value that does not depend on the
location of the program and hence gives an absolute immaterial o the relocation of the program.
The expression can have only absolute terms. Example:

MAXLEN EQU 1000

cks
Note: A symbol whose value is given by EQU (or some similar assembler directive) may be
either an absolute term or a relative term depending on the expression used to define its value. If
relative terms occur in pairs and the terms in each such pair have opposite signs, then the
resulting expressions are absolute expressions. None of the relative terms may enter into a
multiplication or division operation.

A relative expression is one in which all of the relative terms except one can be paired as
described above; the remaining unpaired relative term must have a positive sign.

Example: 107 MAXLEN EQU BUFEND-BUFFER

Both BUFEND and BUFFER are relative terms, each representing an address within the
program. However, the expression represents an absolute value: the difference between the two
addresses which is the length of the buffer area in bytes.

Example: BUFEND + BUFFER, 100 - BUFFER, or 3×BUFFER represent neither absolute

values nor locations within the program. Because such expressions are very unlikely to be of any
use, they are considered errors.
To determine the type of an expression, we must keep track of the types of all symbols defined
in the program. With this information, the assembler can easily determine the type of each
expression used as an operand and generate Modification records in the object program for
relative values.

8
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

SYMTAB
Symbol Type Value
Name Value
RETADR R 30 COPY 0
BUFFER R 36 FIRST 0
CLOOP 6
BUFEND R 1036
ENDFIL 1A
MAXLEN A 1000 RETADR 30
LENGTH 33
BUFFER 36
LITTAB BUFEND 1036
MAXLEN 1000
RDREC 1036
C'EOF' 454F46 3 002D RLOOP 1040
X'05' 05 1 1076 EXIT 1056
INPUT 105C
WREC 105D
Program Blocks WLOOP 1062

Program blocks are referred to be segments of code that are rearranged within a single object
program unit, Program blocks allow the generated machine instructions and data to appear in the
object program in a different order by Separating blocks for storing code, data, stack, and larger
data block.

cks
Assembler Directive USE:

USE [blockname]
At the beginning, statements are assumed to be part of the unnamed (default) block. If no USE
statements are included, the entire program belongs to this single block. Each program block
may actually contain several separate segments of the source program. Assemblers rearrange
these segments to gather together the pieces of each block and assign address. Separate the
program into blocks in a particular order. Large buffer area is moved to the end of the object
program. Program readability is better if data areas are placed in the source program close to the
statements that reference them.

In this case three blocks are used:

1. The first (unnamed) program block contains the executable instructions of the program.
2. The second (named CDATA) contains all data areas that are a few words or less in length.
3. The third (named CBLKS) contains all data areas that consist of larger blocks of memory.

Fig shows our example program, as it might be written using program blocks.

Block name Block number Address Length

default 0 0000 0066
CDATA 1 0066 000B
CBKLS 2 0071 1000
Arranging code into program blocks:

9
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

Pass 1
 A separate location counter for each program block is maintained.
 Save and restore LOCCTR when switching between blocks.
 At the beginning of a block, LOCCTR is set to 0.
 Assign each label an address relative to the start of the block.
 Store the block name or number in the SYMTAB along with the assigned relative address
of the label
 Indicate the block length as the latest value of LOCCTR for each block at the end of
Pass1
 Assign to each block a starting address in the object program by concatenating the
program blocks in a particular order

Pass 2
 Calculate the address for each symbol relative to the start of the object program by
adding
 The location of the symbol relative to the start of its block
 The starting address of this block

cks

10
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

cks

Fig 2.12

11
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

Fig 2.12 shows this process applied to our sample program. Notice that the symbol MAXLEN
(line 107) is shown without a block number. It is an absolute symbol.

Consider an Example:

0006 0 LDA LENGTH 032060

SYMTAB shows the value of the operand (LENGTH) as relative location 0003 within
program block 1 (CDATA). The starting address for CDATA is 0066. Thus the desired target
address for this instruction is 0003+0066=0069.

Example of Address Calculation

20 0006 0 LDA LENGTH 032060

The value of the operand (LENGTH)

Address 0003 relative to Block 1 (CDATA)
Address 0003+0066=0069 relative to program When this instruction is executed
PC = 0000 (starting addr. Of default block) + 0009
disp = 0069 – 0009 = 0060

cks
opcode n i x b p e disp
000000 1 1 0 0 1 0 060

Label name Block number Address Flag

Length 1 0003

Object Program
It is not necessary to physically rearrange the generated code in the object program. The
assemblers just simply insert the proper load address in each Text record. The loader will load
these codes into correct place.

H^COPY ^000000^001071
T^000000^1E^172063^4B2021^032060^290000^332006^4B203B^3F2FEE^032055^0F2056^01000
3
T^00001E^09^0F2048^4B2029^3E203F
T^000027^1D^B410^B400^B440^75101000Ê22038^332FFA^DB2032Â004^3320085^57A02FB
850
T^000044^09^3B2FEA^13201F^4F0000
T^000006^01^F1
T^00004D^19^B410^772017Ê32031B^332FFA^53A016^FD2012^B850^3B2FEE^4F0000
T^000006^04^454F46^05
E^000000

12
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

Program Blocks Loaded in Memory

Not present
in object program

cks
Control Sections:

A control section is a part of the program that maintains its identity after assembly; each
control section can be loaded and relocated independently of the others. Different control
sections are most often used for subroutines or other logical subdivisions. The programmer can
assemble, load, and manipulate each of these control sections separately.

Because of this, there should be some means for linking control sections together. For
example, instructions in one control section may refer to the data or instructions of other control
sections. Since control sections are independently loaded and relocated, the assembler is unable
to process these references in the usual way. Such references between different control sections
are called external references.

The assembler generates the information about each of the external references that will
allow the loader to perform the required linking. When a program is written using multiple
control sections, the beginning of each of the control section is indicated by an assembler
directive
– assembler directive: CSECT
The syntax

13
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

secname CSECT
– separate location counter for each control section

Control sections differ from program blocks in that they are handled separately by the assembler.
Symbols that are defined in one control section may not be used directly another control section;
they must be identified as external reference for the loader to handle. The external references are
indicated by two assembler directives:

EXTDEF (external Definition):

It is the statement in a control section, names symbols that are defined in this section but
may be used by other control sections. Control section names do not need to be named in the
EXTREF as they are automatically considered as external symbols.

EXTREF (external Reference):

It names symbols that are used in this section but are defined in some other control
section. The order in which these symbols are listed is not significant. The assembler must
include proper information about the external references in the object program that will cause the
loader to insert the proper value where they are required.

cks

14
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

cks

15
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

Handling External Reference

Case 1

15 0003 CLOOP +JSUB RDREC 4B100000

 The operand RDREC is an external reference.
o The assembler has no idea where RDREC is
o inserts an address of zero
o can only use extended format to provide enough room (that is, relative addressing
for external reference is invalid)
 The assembler generates information for each external reference that will allow the loader
to perform the required linking.

Case 2

190 0028 MAXLEN WORD BUFEND-BUFFER 000000

 There are two external references in the expression, BUFEND and BUFFER.
 The assembler inserts a value of zero
 passes information to the loader
 Add to this data area the address of BUFEND

cks
 Subtract from this data area the address of BUFFER

Case 3

On line 107, BUFEND and BUFFER are defined in the same control section and the expression
can be calculated immediately.

107 1000 MAXLEN EQU BUFEND-BUFFER

16
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

Object Code for the example program:

cks

17
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

The assembler must also include information in the object program that will cause the loader to
insert the proper value where they are required. The assembler maintains two new record in the
object code and a changed version of modification record.

A define record gives information about the external symbols that are defined in this control
section, i.e., symbols named by EXTDEF.

Define record (EXTDEF)

 Col. 1 D
 Col. 2-7
 Col. 8-13
 Col.14-73 cks
Name of external symbol defined in this control section
Relative address within this control section (hexadecimal)
Repeat information in Col. 2-13 for other external symbols

A refer record lists the symbols that are used as external references by the control section, i.e.,
symbols named by EXTREF.

Refer record (EXTREF)

 Col. 1 R
 Col. 2-7 Name of external symbol referred to in this control section
 Col. 8-73 Name of other external reference symbols

The new items in the modification record specify the modification to be performed: adding or
subtracting the value of some external symbol. The symbol used for modification my be defined
either in this control section or in another section.

Modification record
 Col. 1 M
 Col. 2-7 Starting address of the field to be modified, relative to the beginning of the
control section (hexadecimal)
 Col. 8-9 Length of the field to be modified, in half-bytes (hexadecimal)
 Col 10 Modification flag (+ or -)
 Col.11-16 External symbol whose value is to be added to or subtracted from
the indicated field.

18
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

The object program is shown below. There is a separate object program for each of the
control sections. In the Define Record and refer record the symbols named in EXTDEF and
EXTREF are included.

In the case of Define, the record also indicates the relative address of each external
symbol within the control section.

For EXTREF symbols, no address information is available. These symbols are simply
named in the Refer record.

cks

19
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

Handling Expressions in Multiple Control Sections:

The existence of multiple control sections that can be relocated independently of one
another makes the handling of expressions complicated. It is required that in an expression that
all the relative terms be paired (for absolute expression), or that all except one be paired (for
relative expressions).

When it comes in a program having multiple control sections then we have an extended
restriction that:

 Both terms in each pair of an expression must be within the same control section
o If two terms represent relative locations within the same control section, their
difference is an absolute value (regardless of where the control section is located.
 Legal: BUFEND-BUFFER (both are in the same control section)

o If the terms are located in different control sections, their difference has a value
that is unpredictable.
 Illegal: RDREC-COPY (both are of different control section) it is the
difference in the load addresses of the two control sections. This value
depends on the way run-time storage is allocated; it is unlikely to be of
any use.

 How to enforce this restriction

cks
o When an expression involves external references, the assembler cannot determine
whether or not the expression is legal.
o The assembler evaluates all of the terms it can, combines these to form an initial
expression value, and generates Modification records.
o The loader checks the expression for errors and finishes the evaluation.

ASSEMBLER DESIGN
Here we are discussing
o The structure and logic of one-pass assembler. These assemblers are used when it is
necessary or desirable to avoid a second pass over the source program.
o Notion of a multi-pass assembler, an extension of two-pass assembler that allows an
assembler to handle forward references during symbol definition.

One-Pass Assembler

The main problem in designing the assembler using single pass was to resolve forward
references. We can avoid to some extent the forward references by:
 Eliminating forward reference to data items, by defining all the storage reservation
statements at the beginning of the program rather at the end.
 Unfortunately, forward reference to labels on the instructions cannot be avoided.
(forward jumping)
 To provide some provision for handling forward references by prohibiting forward
references to data items.

20
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

There are two types of one-pass assemblers:

1. One that produces object code directly in memory for immediate execution (Load-and-go
assemblers).
2. The other type produces the usual kind of object code for later execution.

Load-and-Go Assembler

 Load-and-go assembler generates their object code in memory for immediate execution.
 No object program is written out, no loader is needed.
 It is useful in a system with frequent program development and testing
o The efficiency of the assembly process is an important consideration.
 Programs are re-assembled nearly every time they are run; efficiency of the assembly
process is an important consideration.

cks

Forward Reference in One-Pass Assemblers: In load-and-Go assemblers when a forward

reference is encountered:

 Omits the operand address if the symbol has not yet been defined
 Enters this undefined symbol into SYMTAB and indicates that it is undefined
 Adds the address of this operand address to a list of forward references associated with
the SYMTAB entry

21
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

 When the definition for the symbol is encountered, scans the reference list and inserts the
address.
 At the end of the program, reports the error if there are still SYMTAB entries indicated
undefined symbols.
 For Load-and-Go assembler
o Search SYMTAB for the symbol named in the END statement and jumps to this
location to begin execution if there is no error

After Scanning line 40 of the program:

40 2021 J` CLOOP 302012

The status is that upto this point the symbol RREC is referred once at location 2013, ENDFIL at
201C and WRREC at location 201F. None of these symbols are defined. The figure shows that
how the pending definitions along with their addresses are included in the symbol table.

cks

Fig : object code in memory and symbol table entries for the program after scanning line 40.

22
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

The status after scanning line 160, which has encountered the definition of RDREC and
ENDFIL, is as given below:

cks

If One-Pass needs to generate object code:

 If the operand contains an undefined symbol, use 0 as the address and write the Text
record to the object program.
 Forward references are entered into lists as in the load-and-go assembler.
 When the definition of a symbol is encountered, the assembler generates another Text
record with the correct operand address of each entry in the reference list.
 When loaded, the incorrect address 0 will be updated by the latter Text record containing
the symbol definition.

23
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

Object Code Generated by One-Pass Assembler:

cks
Multi_Pass Assembler:

 For a two pass assembler, forward references in symbol definition are not allowed:
ALPHA EQU BETA
BETA EQU DELTA
DELTA RESW 1
o Symbol definition must be completed in pass 1.
 Prohibiting forward references in symbol definition is not a serious inconvenience.
o Forward references tend to create difficulty for a person reading the program.

Implementation Issues for Modified Two-Pass Assembler:

Implementation Issues when forward referencing is encountered in Symbol Defining statements:

 For a forward reference in symbol definition, we store in the SYMTAB:
o The symbol name
o The defining expression
o The number of undefined symbols in the defining expression
 The undefined symbol (marked with a flag *) associated with a list of symbols depend on this
undefined symbol.
 When a symbol is defined, we can recursively evaluate the symbol expressions depending on
the newly defined symbol.

Multi-Pass Assembler Example Program

24
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

Multi-Pass Assembler (Figure 2.21 of Beck): Example for forward reference in Symbol Defining
Statements:

cks
1. HALFSZ EQU MAXLEN/2

MAXLEN has not yet been defined, so no value for HALFSZ can be computed. The defining
expression for HALFSZ is stored in the symbol table in place of its value. The entry &1 indicates that one
symbol in the defining expression is undefined. The SYMTAB would then simply contain a pointer to the
defining expression. The symbol MAXLEN is also entered in the symbol table, with the flag * identifying
it as undefined.

The same procedure is followed with the definition of MAXLEN. In this case there are two
undefined symbols involved in the definition: BUFEND and BUFFER. Both of these are entered

25
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

into SYMTAB with lists indicating the dependence of MAXLEN upon them. Similarly, the
definition of PREVBT causes this symbol to be added to the list of dependences on BUFFER.

Let us assume that when line 4 is read, the location counter contains the hexadecimal value 1034.
This is stored as the value of BUFFER. The assembler then examines the list of symbols that are
dependent on BUFFER. The symbol table entry for the first symbol in this list (MAXLEN)
shows that depends on two currently undefined symbols; therefore, MAXLEN cannot be
evaluated immediately. Instead the &2 is changed to &1 to show that only one symbol in the
definition (BUFEND) remains undefined. The other symbols in the list (PREVBT) can be
evaluated because it depends only on BUFFER. The value of the defining expression for
PREVBT is calculated and stored in SYMTAB. The result is shown in figure.

cks

26
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY
Chapter 3 ASSEMBLERS-II

Questions
Sl.No UNIT – 3 Assemblers-II Mark
s
1. Enlist the various assembler features that are m/c dependent and m/c independent. Explain 10
any one of them each.(Jan 2005)
2. In a two pass assembler, list the different data bases used in each pass. Explain the contents 10
and uses of each data base.(Jan 2005)
3. Compare a two pass assembler with a single pass assembler. How forward references are 10
handled in one pass assembler?(Dec 2007)
4. What is LITORG? When it is used? Explain with an example.(Dec 2007 June 2010) 06

5. When is multi-pass assembler required? Show step by step procedure to evaluate the 08
following statements. Show the symbol table after each scan.(Jan 2005, Dec09)
1. HALFSZ EQU MAXLEN/2
2. MAXLEN EQU BUFEND-BUFFER
3. PREVBT EQU BUFFER-1
4. BUFFER RESB 4096
5. BUFEND EQU *
OR . Write short notes on multi pass assemblers.
6. Explain the need for BASE and NOBASE directives with examples. 05

8.
2005, Dec 2007)
cks
Explain program relocation. Also explain how the problems of relocation are solved.?( Jan

What is a program block? How multiple program blocks are handled by assemblers?(Dec 2007)
10

9. What are the different ways of specifying an operand value in a source statement? Give their 12
formats.
10. Compare a two-pass assembler with a single pass assembler. How forward references are 10
handled in one-pass assembler?
11. What is the difference between literal and immediate operand. How does the assembler 04
handle the literal operands? (Dec09,Dec2011)
12. Explain the following assembler directives with example each: 05
(i) EQU (ii) BASE (iii) ORG (iv) USE (v) NOBASE
13. Give the difference between program blocks and control sections and explain in detail 10
processing of control sections.
08
What is control section? How are they processed? (June 2009)
14. With required data structures & processing logic, explain the implementation of literals within 10
an assembler.
15. Give the format for the following record necessary to obtain object code:.(Jan 2005,Dec2011) 12
i. Header record ii. Text record iii. Refer record
iv. Define record v. Modification record (revised )
v. End record
16. Explain absolute and relative expressions. How these are processed by an assembler.(June 06
2009)
17. Explain the structure of Load and Go assembler.(Dec09,June 2010) 08

27
C.K. SRINIVAS Asst.Prof. DEPT OF CSE BITM, BELLARY

LED Beam 90W
50% (2)
LED Beam 90W
2 pages
A++ and the Lambda Calculus: Principles of Functional Programming
From Everand
A++ and the Lambda Calculus: Principles of Functional Programming
Georg P. Loczewski
No ratings yet
Module 3 Notes
No ratings yet
Module 3 Notes
46 pages
M 3 Full
No ratings yet
M 3 Full
123 pages
Module 3
No ratings yet
Module 3
83 pages
4 K
No ratings yet
4 K
789 pages
Assembler Directives
No ratings yet
Assembler Directives
34 pages
Devops And: Cloud Computing
No ratings yet
Devops And: Cloud Computing
12 pages
CC372 Spring 2025 Tutorial 04-Extension
No ratings yet
CC372 Spring 2025 Tutorial 04-Extension
45 pages
Machine Independent Assembler 1
No ratings yet
Machine Independent Assembler 1
38 pages
I Pu Biology - Imp Questions
No ratings yet
I Pu Biology - Imp Questions
5 pages
AWS Security Architecture
No ratings yet
AWS Security Architecture
153 pages
Embedded Systems - Assembly Language
No ratings yet
Embedded Systems - Assembly Language
4 pages
Data Types and Derivatives
No ratings yet
Data Types and Derivatives
3 pages
Functional Programming Using F# PDF
No ratings yet
Functional Programming Using F# PDF
376 pages
Module 3
No ratings yet
Module 3
80 pages
Assembler Language IV
No ratings yet
Assembler Language IV
70 pages
05 Assimpler2
No ratings yet
05 Assimpler2
33 pages
Assembly Chapter3 PDF
No ratings yet
Assembly Chapter3 PDF
7 pages
Literal S
No ratings yet
Literal S
19 pages
Arduino Development Cookbook - Sample Chapter
100% (1)
Arduino Development Cookbook - Sample Chapter
35 pages
Module 3
No ratings yet
Module 3
19 pages
1.4 Assembler Directives: Rohini College of Engineering & Technology
No ratings yet
1.4 Assembler Directives: Rohini College of Engineering & Technology
7 pages
Module 3 - Complete Chapter
No ratings yet
Module 3 - Complete Chapter
78 pages
User Manual Dutch POCT 100
100% (1)
User Manual Dutch POCT 100
96 pages
Keypad Interfacing With ARM7 Slicker
100% (1)
Keypad Interfacing With ARM7 Slicker
15 pages
Assembler Directives
No ratings yet
Assembler Directives
9 pages
2024 Basic Elements of Assembly Language 2
No ratings yet
2024 Basic Elements of Assembly Language 2
13 pages
Base Station Subsystem
No ratings yet
Base Station Subsystem
65 pages
Mod 3 Class 4 - Machine Independent Assembler Features (Part 1)
No ratings yet
Mod 3 Class 4 - Machine Independent Assembler Features (Part 1)
23 pages
Document
No ratings yet
Document
78 pages
Directives DD
No ratings yet
Directives DD
19 pages
Brochure Big Data
No ratings yet
Brochure Big Data
3 pages
Gemalto - Introducing 5G Networks
No ratings yet
Gemalto - Introducing 5G Networks
9 pages
SP2.3 Assembler-Machine-Independent Assembler Features
No ratings yet
SP2.3 Assembler-Machine-Independent Assembler Features
23 pages
Assembler M/C Independent Features and Design Options: Chapter No. 3
0% (1)
Assembler M/C Independent Features and Design Options: Chapter No. 3
45 pages
Assembler Directives 8086
100% (1)
Assembler Directives 8086
18 pages
System Software
No ratings yet
System Software
34 pages
SS Mod 3.1
No ratings yet
SS Mod 3.1
83 pages
Module 2 Assemblers
No ratings yet
Module 2 Assemblers
24 pages
Isp500 July 2024
No ratings yet
Isp500 July 2024
5 pages
Assembler Directives and Basic Steps of ALP: Dr. Urvashi Singh
No ratings yet
Assembler Directives and Basic Steps of ALP: Dr. Urvashi Singh
20 pages
Systems Software U3
No ratings yet
Systems Software U3
22 pages
SBBJ Online
No ratings yet
SBBJ Online
3 pages
ABC Car Traders - User Manual
No ratings yet
ABC Car Traders - User Manual
15 pages
Four Digit Code With Error Detection
No ratings yet
Four Digit Code With Error Detection
2 pages
Regarding Genaretes in The Process of Estimate Regret Concern Despite Estimates Think Capture
No ratings yet
Regarding Genaretes in The Process of Estimate Regret Concern Despite Estimates Think Capture
3 pages
2 ProgrammingInAssembler
No ratings yet
2 ProgrammingInAssembler
31 pages
2.3 Machine-Independent Assembler Features
83% (6)
2.3 Machine-Independent Assembler Features
46 pages
Chapter 4
No ratings yet
Chapter 4
80 pages
UNIT-2-1 Notes For Ece
No ratings yet
UNIT-2-1 Notes For Ece
18 pages
CHAPTER 4 Robotics
No ratings yet
CHAPTER 4 Robotics
78 pages
Assembly Language Program Development With MASM
100% (1)
Assembly Language Program Development With MASM
9 pages
NoBrokerHood User Manual
No ratings yet
NoBrokerHood User Manual
8 pages
Module 3 (Part2)
No ratings yet
Module 3 (Part2)
68 pages
A.: Understanding PC Hardware by Jemma, Inc
No ratings yet
A.: Understanding PC Hardware by Jemma, Inc
2 pages
10 1
No ratings yet
10 1
26 pages
Rajath MB
No ratings yet
Rajath MB
1 page
System Software PP T
No ratings yet
System Software PP T
32 pages
Expressions and Program Blocks1
No ratings yet
Expressions and Program Blocks1
23 pages
VR CNC Milling For Windows Quickstart Guide
No ratings yet
VR CNC Milling For Windows Quickstart Guide
20 pages
Cyber Security Management
No ratings yet
Cyber Security Management
4 pages
SS Mod 2 Full
No ratings yet
SS Mod 2 Full
52 pages
DR 138 438
No ratings yet
DR 138 438
2 pages
M5-P2 Web Services
No ratings yet
M5-P2 Web Services
25 pages
Assembler: Jian-hua Yeh (葉建華) 真理大學資訊科學系助理教授
No ratings yet
Assembler: Jian-hua Yeh (葉建華) 真理大學資訊科學系助理教授
69 pages
Assembler Directives
100% (1)
Assembler Directives
18 pages
8086 Addressing Modes & Programming Introduction
No ratings yet
8086 Addressing Modes & Programming Introduction
22 pages
Assembler Directives
No ratings yet
Assembler Directives
29 pages
Chapter 3-4 PDF
No ratings yet
Chapter 3-4 PDF
31 pages
Module-1 Part 1 Assemblers
No ratings yet
Module-1 Part 1 Assemblers
22 pages
DX Diag
No ratings yet
DX Diag
32 pages
Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)
Pirelli Prg-Av4202n en
No ratings yet
Pirelli Prg-Av4202n en
3 pages
Speedrelay4000 1
No ratings yet
Speedrelay4000 1
2 pages
8086 Assembler Directives: Unit 1 Presented by Mrs.M.P.Sasirekha
No ratings yet
8086 Assembler Directives: Unit 1 Presented by Mrs.M.P.Sasirekha
16 pages
Assembler Directives Micro Processors 8086
No ratings yet
Assembler Directives Micro Processors 8086
4 pages
Chp02 Assembly Language Fundamentals
100% (2)
Chp02 Assembly Language Fundamentals
14 pages
Temam Mohammed AR2
No ratings yet
Temam Mohammed AR2
8 pages
Pseducode
No ratings yet
Pseducode
27 pages
Chapter 7: Assembler Directives and Data Definitions: Csect
No ratings yet
Chapter 7: Assembler Directives and Data Definitions: Csect
16 pages
Lecture-36 Assembler Directives
No ratings yet
Lecture-36 Assembler Directives
10 pages
Unit 2 Assemblers: 2.1 Machine Independent Assembler Features
100% (1)
Unit 2 Assemblers: 2.1 Machine Independent Assembler Features
26 pages
CHAPTER 3: Instruction Set and Programming of 8086: Compiled by Vishal Gaikwad, SIESGST
No ratings yet
CHAPTER 3: Instruction Set and Programming of 8086: Compiled by Vishal Gaikwad, SIESGST
11 pages
Assembler 4 Presentation
No ratings yet
Assembler 4 Presentation
59 pages
M4-P3 Class-Methods
No ratings yet
M4-P3 Class-Methods
12 pages
Assembler Directives (Cont..)
No ratings yet
Assembler Directives (Cont..)
18 pages
7 CS
No ratings yet
7 CS
4 pages
Documentation On Bank Management System
No ratings yet
Documentation On Bank Management System
43 pages
Generate The Complete Object Program For Each Statement
No ratings yet
Generate The Complete Object Program For Each Statement
7 pages
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
Syntax of 8086
No ratings yet
Syntax of 8086
10 pages
Log
No ratings yet
Log
2 pages
Machine Independent
No ratings yet
Machine Independent
10 pages
DBMS Lab Manual
From Everand
DBMS Lab Manual
Jitendra Patel
1.5/5 (3)

Assemblers II

Uploaded by

Assemblers II

Uploaded by

Chapter 3 ASSEMBLERS-II

3.2 Machine-Independent Assembler features:

Consider the following example

215 1062 WLOOP TD =X’05’ E32011

Literals vs. Immediate Operands

How to find the duplicate literals?

NAME OPERAND VALUE LENGTH ADDRESS

Symbol EQU value

MAXLEN EQU 4096 and then

The user-defined symbols in assembler language programs appear as labels on instructions or

The table looks like the one given below.

SYMBOL VALUE FLAGS

SYMBOL EQU STAB

Using Indexed Addressing:

Use LOCCTR to address fields

Ex: To fetch the VALUE field

LDA VALUE, X (*Last ORG sets LOCCTR back)

ALPHA RESW 1 BETA EQU ALPHA

(*BETA cannot be assigned a value)

The sequence of statements cannot be resolved by an ordinary two-pass assembler regardless of

ALPHA EQU BETA

Absolute: means independent of program location. A constant is an absolute term.

MAXLEN EQU BUFEND-BUFFER

MAXLEN EQU 1000

Example: 107 MAXLEN EQU BUFEND-BUFFER

Example: BUFEND + BUFFER, 100 - BUFFER, or 3×BUFFER represent neither absolute

In this case three blocks are used:

Block name Block number Address Length

0006 0 LDA LENGTH 032060

Example of Address Calculation

20 0006 0 LDA LENGTH 032060

The value of the operand (LENGTH)

Label name Block number Address Flag

Program Blocks Loaded in Memory

EXTDEF (external Definition):

EXTREF (external Reference):

Handling External Reference

15 0003 CLOOP +JSUB RDREC 4B100000

190 0028 MAXLEN WORD BUFEND-BUFFER 000000

107 1000 MAXLEN EQU BUFEND-BUFFER

Object Code for the example program:

Define record (EXTDEF)

Refer record (EXTREF)

Handling Expressions in Multiple Control Sections:

 How to enforce this restriction

There are two types of one-pass assemblers:

Forward Reference in One-Pass Assemblers: In load-and-Go assemblers when a forward

After Scanning line 40 of the program:

40 2021 J` CLOOP 302012

If One-Pass needs to generate object code:

Object Code Generated by One-Pass Assembler:

Implementation Issues for Modified Two-Pass Assembler:

Implementation Issues when forward referencing is encountered in Symbol Defining statements:

Multi-Pass Assembler Example Program

You might also like