

International Journal of Scientific Research in Engineering- IJSRE
Index: 20180208:003. IDSRJI Indexing, Impact Factor: 3.27

SIMULATION OF 32-BIT FLOATING POINT ARITHMETIC UNIT USING XILINX

1Ms. G. T. Bharathy*, 1Ms. T. Tamilselvi, 2Ms. S. Bhavanisankari, 2Ms. G. Bhargavi
1Sr. Assistant Professor, Department of ECE, Jerusalem College of Engineering, Chennai-100
2Associate Professor, Department of ECE, Jerusalem College of Engineering, Chennai-100

ABSTRACT
The paper proposes a Verilog design and verification of different operations in a pipelined floating point arithmetic unit (FPAU). The designed AU accepts two 32-bit floating point numbers and a code corresponding to the operation to be performed. The novelty of the AU is that it achieves high performance through pipelining, a technique in which the execution of multiple instructions is overlapped. In the top-down design approach, four arithmetic modules (Addition, Subtraction, Multiplication and Division) are combined to form the floating-point AU, and each module is further divided into smaller modules. A two-bit selection code determines which operation takes place at a particular time. The pipeline modules are independent of each other. Design functionalities were validated through simulation and compilation, and synthesis of the code was performed using Xilinx-ISE. Successful implementation of pipelining in a floating point AU using Verilog fulfills the need for high performance applications.

KEYWORDS: Floating Point, Pipeline Architecture, Arithmetic Unit, Xilinx.

INTRODUCTION

A. General

Floating-point numbers are widely adopted in many applications due to their dynamic representation capabilities. A floating point representation retains resolution and accuracy better than a fixed-point representation of the same width. Based on the IEEE standard, floating-point representation for digital systems is platform-independent, so data can be interchanged freely among different digital systems. The ALU is the block in a microprocessor that handles arithmetic operations; a floating point unit performs the computation of floating-point operations, and some CPUs, such as the AMD Athlon, have more than one floating point unit. The use of Verilog for modeling is especially appealing since it provides a formal description of the system and allows the use of specific description styles to cover the different abstraction levels (architectural, register transfer and logic level) employed in the design. In this method of computation, the problem is first divided into small pieces, each of which can be seen as a sub-module in Verilog.

Pipelining is one of the popular methods of realizing a high-performance computation platform. Implementing pipelining requires the various phases of a floating-point operation to be separated and pipelined into sequential stages. This increases the throughput and efficiency

IJSRE AUGUST, Vol-2 Issue-08 www.ijsre.in Page 1


of the overall operation. Hence, to realize an ALU design, this research proposes a pipelined floating point ALU design in Verilog, to ease description, verification and simulation.

Digital arithmetic operations are very important in the design of digital processors and application-specific systems, and arithmetic circuits play an important role in digital systems. With the vast development in very large scale integration (VLSI) circuit technology, many complex circuits have become easily implementable today, and algorithms that once seemed impossible to implement now have attractive implementation possibilities. This means that not only conventional computer arithmetic methods but also unconventional ones are worth investigating in new designs. The notion of real numbers in mathematics is convenient for hand computations and formula manipulations. However, real numbers are not well-suited for general purpose computation, because their numeric representation as a string of digits expressed in, say, base 10 can be very long or even infinitely long; examples include π, e, and 1/3. In practice, computers store numbers with finite precision. Numbers and arithmetic used in scientific computation should meet a few general criteria:

- Numbers should have minimum storage requirements.
- Arithmetic operations should be efficient to carry out.
- A level of standardization, or portability, is desirable: results obtained on one computer should closely match the results of the same computation on other computers.

Standardized methods to represent floating point numbers have been instituted by the IEEE-754 standard, through which floating point operations can be carried out efficiently with modest storage requirements. An arithmetic unit is the part of a computer processor (CPU) that carries out arithmetic operations on the operands in a computer's instruction words. Generally an arithmetic unit performs operations such as addition, subtraction, multiplication and division. Some processors contain more than one AU, for example one for fixed point operations and another for floating point operations. To represent very large or very small values, a large range is required, and integer representation is no longer appropriate. In most modern general purpose computer architectures, one or more FPUs are integrated with the CPU; however, many embedded processors, especially older designs, do not have hardware support for floating point operations.

Almost every language has a floating point data type; computers from PCs to supercomputers have floating point accelerators; most compilers will be called upon to compile floating point algorithms from time to time; and virtually every operating system must respond to floating point exceptions such as overflow.

B. Floating Point Number

The term floating point derives from the fact that there is no fixed number of digits before and after the decimal point; that is, the decimal point can float. In general, floating point representations are slower and less accurate than fixed-point representations, but they can handle a larger range of numbers. A floating point number contains a fractional part; for example, 35, -112.5, 1/2 and 4E-5 are floating point numbers. Almost every language supports a floating point data type. A number representation (called a numeral system in mathematics) specifies some way of storing a number that may be encoded as a string of digits. In computing, floating point describes a system for numerical representation in which a string of digits (or bits) represents a rational number. The term refers to the fact that the radix point (decimal point, or more commonly in computers, binary point) can "float"; that is, it can be placed anywhere relative to the significant digits of the number.

FLOATING POINT ARCHITECTURE

A. Floating Point Architecture

Floating point numbers are one possible way of representing real numbers in binary format; the IEEE 754 [11] standard presents two different floating point formats, the binary interchange format and the decimal interchange format. This paper focuses only on the single precision normalized binary interchange format. Figure 1 shows the IEEE 754 single precision binary format representation; it consists of a one-bit sign (S), an eight-bit exponent (E), and a twenty-three-bit fraction or mantissa (M).

32-bit single precision floating point numbers in the IEEE standard are stored as:

S EEEEEEEE MMMMMMMMMMMMMMMMMMMMMMM

S: Sign – 1 bit
E: Biased exponent – 8 bits
M: Mantissa (fraction/significand) – 23 bits

Fig.1 Single precision binary format representation

The value of number V:


If E=255 and F is nonzero, then V = NaN ("Not a Number")
If E=255 and F is zero and S is 1, then V = -Infinity
If E=255 and F is zero and S is 0, then V = +Infinity
If 0<E<255, then V = (-1)^S * 2^(E-127) * (1.F) (exponent range: -126 to +127)
If E=0 and F is nonzero, then V = (-1)^S * 2^(-126) * (0.F) (denormalized values)
If E=0 and F is zero and S is 1, then V = -0
If E=0 and F is zero and S is 0, then V = 0

An extra bit is added to the mantissa to form what is called the significand. If the exponent is greater than 0 and smaller than 255, and there is a 1 in the MSB of the significand, the number is said to be a normalized number; in this case the real number is represented by (1):

V = (-1)^S * 2^(E - Bias) * (1.M)                                 (1)

where M = m22*2^-1 + m21*2^-2 + m20*2^-3 + … + m1*2^-22 + m0*2^-23 and Bias = 127.
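Equation (1) and the field layout of Fig.1 can be illustrated with a small software sketch. Python stands in for the paper's Verilog here, and the helper names `decode_fp32` and `value` are ours, not part of the design:

```python
import struct

def decode_fp32(x):
    """Split a float into its IEEE 754 single-precision fields S, E, M."""
    bits = struct.unpack(">I", struct.pack(">f", x))[0]
    return bits >> 31, (bits >> 23) & 0xFF, bits & 0x7FFFFF

def value(s, e, m, bias=127):
    """Recombine normalized fields: V = (-1)^S * 2^(E-Bias) * (1.M)."""
    return (-1) ** s * 2.0 ** (e - bias) * (1 + m / 2 ** 23)

s, e, m = decode_fp32(-6.5)   # -6.5 = -1.625 * 2^2
print(s, e, m)                # 1 129 5242880  (0.625 * 2^23)
print(value(s, e, m))         # -6.5
```

Round-tripping a value through `decode_fp32` and `value` confirms that the biased exponent and hidden bit behave as described above.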

FLOATING POINT OPERATIONS

A. Floating Point Addition / Subtraction

The addition of two floating point numbers has two different cases. Case I: both numbers have the same sign, i.e. both are positive or both are negative; in this case the MSBs (sign bits) of the two numbers are either both 1 or both 0. Case II: the numbers have different signs, i.e. one number is positive and the other negative; in this case the MSB of one number is 1 and that of the other is 0.

Case I: When both numbers are of same sign

Step 1: Enter two numbers N1 and N2. E1, S1 and E2, S2 represent the exponent and significand of N1 and N2 respectively.
Step 2: Is E1 or E2 '0'? If yes, set the hidden bit of N1 or N2 to zero. If not, check whether E2 > E1; if yes, swap N1 and N2, and if E1 > E2 the contents of N1 and N2 need not be swapped.
Step 3: Calculate the difference in exponents d = E1 - E2. If d = '0' there is no need to shift the significand. If d is more than '0', say 'y', shift S2 to the right by 'y' and fill the leftmost bits with zeros. Shifting is done including the hidden bit.
Step 4: The amount of shifting 'y' is added to the exponent of N2: new E2 = (previous E2) + 'y'. The operands are now aligned, since E1 = E2.
Step 5: Check if N1 and N2 have different signs; if 'no':
Step 6: Add the significands (24 bits each, including the hidden bit): S = S1 + S2.
Step 7: Check if there is a carry out of the significand addition. If yes, add '1' to the exponent value (E1, or the new E2). After the addition, shift the result of the significand addition right by one, making the MSB of S '1' and dropping the LSB of the significand.
Step 8: If there is no carry out in step 6, the previous exponent is the real exponent.
Step 9: The sign of the result, i.e. its MSB, is the MSB of either N1 or N2.
Step 10: Assemble the result into 32-bit format, excluding the 24th bit of the significand, i.e. the hidden bit.
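The Case I steps can be sketched as a rough software model. The design itself is in Verilog; this Python helper, its name, and its truncating rounding are our assumptions, and special values (NaN, infinity, denormals) are not handled:

```python
import struct

def f2b(x):
    """Float -> 32-bit IEEE 754 single-precision pattern."""
    return struct.unpack(">I", struct.pack(">f", x))[0]

def b2f(b):
    """32-bit pattern -> float."""
    return struct.unpack(">f", struct.pack(">I", b))[0]

def fp32_add_same_sign(a, b):
    """Case I sketch: normalized inputs with equal signs, truncating rounding."""
    ba, bb = f2b(a), f2b(b)
    s1, e1, m1 = ba >> 31, (ba >> 23) & 0xFF, ba & 0x7FFFFF
    s2, e2, m2 = bb >> 31, (bb >> 23) & 0xFF, bb & 0x7FFFFF
    assert s1 == s2 and 0 < e1 < 255 and 0 < e2 < 255
    sig1 = (1 << 23) | m1              # restore the hidden bits
    sig2 = (1 << 23) | m2
    if e2 > e1:                        # Step 2: keep the larger exponent in N1
        e1, e2, sig1, sig2 = e2, e1, sig2, sig1
    sig2 >>= e1 - e2                   # Steps 3/4: align the significands
    s = sig1 + sig2                    # Step 6: 24-bit significand addition
    if s >> 24:                        # Step 7: carry out -> shift right,
        s >>= 1                        #         increment the exponent
        e1 += 1
    return b2f((s1 << 31) | (e1 << 23) | (s & 0x7FFFFF))  # Step 10

print(fp32_add_same_sign(1.5, 2.25))   # 3.75
```

For exactly representable operands this sketch matches ordinary float addition, which makes the alignment and carry-handling steps easy to check.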

Case II: - When both numbers are of different sign

Steps 1, 2, 3 and 4 are the same as in Case I.


Step 5: Check if N1 and N2 have different signs; if 'yes':
Step 6: Take the 2's complement of S2 and add it to S1, i.e. S = S1 + (2's complement of S2).
Step 7: Check if there is a carry out of the significand addition. If yes, discard the carry, then shift the result left until there is a '1' in the MSB, counting the amount of shifting, say 'z'.
Step 8: Subtract 'z' from the exponent value (E1 or E2); the resulting exponent is E1 - 'z'. Also append 'z' zeros at the LSB end.
Step 9: If there is no carry out in step 6, the MSB must be '1'; in this case simply replace 'S' by its 2's complement.
Step 10: The sign of the result, i.e. its MSB, is the sign of the larger number (the MSB of N1 or of N2).
Step 11: Assemble the result into 32-bit format, excluding the 24th bit of the significand, i.e. the hidden bit.
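The Case II steps can likewise be sketched in software. Again Python stands in for the paper's Verilog; the helper name, the magnitude-first swap, and the truncating rounding are our assumptions, and operands of equal magnitude (whose difference is zero) and special values are not handled:

```python
import struct

def fp32_add_diff_sign(a, b):
    """Case II sketch: normalized inputs of opposite sign and unequal
    magnitude, truncating rounding."""
    ba = struct.unpack(">I", struct.pack(">f", a))[0]
    bb = struct.unpack(">I", struct.pack(">f", b))[0]
    s1, e1, sig1 = ba >> 31, (ba >> 23) & 0xFF, (ba & 0x7FFFFF) | (1 << 23)
    s2, e2, sig2 = bb >> 31, (bb >> 23) & 0xFF, (bb & 0x7FFFFF) | (1 << 23)
    assert s1 != s2 and 0 < e1 < 255 and 0 < e2 < 255
    if (e2, sig2) > (e1, sig1):            # keep the larger magnitude in N1,
        s1, e1, e2 = s2, e2, e1            # so the result takes its sign (Step 10)
        sig1, sig2 = sig2, sig1
    sig2 >>= e1 - e2                       # align the smaller significand
    s = sig1 + ((-sig2) & 0xFFFFFF)        # Step 6: add 2's complement of S2
    if s >> 24:                            # Step 7: discard the carry
        s &= 0xFFFFFF
    assert s != 0, "equal magnitudes give zero, not handled here"
    z = 0
    while not (s >> 23):                   # shift left until the MSB is '1',
        s <<= 1                            # counting the shifts as 'z'
        z += 1
    e1 -= z                                # Step 8: subtract 'z' from the exponent
    return struct.unpack(">f", struct.pack(">I",
                         (s1 << 31) | (e1 << 23) | (s & 0x7FFFFF)))[0]

print(fp32_add_diff_sign(5.5, -1.5))   # 4.0
```

Because the sketch always puts the larger magnitude first, the carry of Step 7 always appears, so the no-carry branch of Step 9 never triggers here.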
Three 8-bit comparators, one 24-bit and two 8-bit adders, two 8-bit subtractors, two shift units
and one swap unit are required in the design.

The first 8-bit comparator is used to compare the exponents of the two numbers; if the exponents are equal, no shifting is needed. The second 8-bit comparator compares each exponent with zero; if the exponent of a number is zero, the hidden bit of that number is set to zero. The third comparator checks whether the exponent of number 2 is greater than that of number 1; if it is, the numbers are swapped.

One subtractor is required to compute the difference between the 8-bit exponents of the two numbers. The second subtractor is used when the two numbers are of different signs: if a carry appears after the addition of the significands, the resulting left-shift amount is subtracted from the exponent using the 8-bit subtractor.

One 24-bit adder is required to add the 24-bit significands of the two numbers. One 8-bit adder is required when both numbers are of the same sign: if a carry appears after the addition of the significands, it is added to the exponent using the 8-bit adder. The second 8-bit adder is used to add the amount of shifting to the exponent of the smaller number.

One swap unit is required to swap the numbers if N2 is greater than N1; swapping is normally done using a third variable. Two shift units are required, one shift-left and one shift-right.

B. Floating Point Multiplication

The algorithm for floating point multiplication is explained through the flow chart in Figure 2. Let N1 and N2 be normalized operands represented by S1, M1, E1 and S2, M2, E2 as their respective sign bit, mantissa (significand) and exponent. Basically, the following four steps are used for floating point multiplication.

1. Multiply significands, add exponents, and determine the sign:
   M = M1 * M2
   E = E1 + E2 - Bias
   S = S1 XOR S2
2. Normalize the mantissa M (shift left or right by 1) and update the exponent E.
3. Round the result to fit in the available bits.
4. Determine exception flags and special values for overflow and underflow.

Sign bit calculation: the result of the multiplication has a negative sign if exactly one of the multiplied numbers is negative, and it is obtained by XORing the signs of the two inputs. Exponent addition is done with an unsigned adder that adds the exponent of the first input to the exponent of the second input and then subtracts the bias (127) from the sum (i.e. E1 + E2 - Bias). The result of this stage can be called the intermediate exponent.

Significand multiplication multiplies the unsigned significands and places the decimal point in the multiplication product. The result can be called the intermediate product (IP). The unsigned significand multiplication is done on 24 bits.

The result of the significand multiplication (intermediate product) must be normalized to have a leading '1' just to the left of the decimal point (i.e. at bit 46 of the intermediate product). Since the inputs are normalized numbers, the intermediate product has its leading one at bit 46 or 47. If the leading one is at bit 46 (i.e. to the left of the decimal point), the intermediate product is already normalized and no shift is needed. If the leading one is at bit 47, the intermediate product is shifted to the right and the exponent is incremented by 1. Overflow/underflow means that the result's exponent is too large/small to be represented in the exponent field. The exponent of the result must be 8 bits in size and must lie between 1 and 254, otherwise the value is not normalized. An overflow may occur while adding the two exponents or during normalization.

Overflow due to exponent addition can be compensated during subtraction of the bias, resulting in a normal output value (normal operation). An underflow may occur while subtracting the bias to form the intermediate exponent. If the intermediate exponent is less than 0, the underflow can never be compensated; if the intermediate exponent equals 0, the underflow may be compensated during normalization by adding 1 to it. When an overflow occurs, an overflow flag signal goes high and the result turns to ±Infinity (sign determined according to the signs of the floating point multiplier inputs). When an underflow occurs, an underflow flag signal goes high and the result turns to ±Zero (sign determined likewise).
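The multiply steps above can be modeled compactly in software. Python stands in for the paper's Verilog; the helper name and the truncating rounding are our assumptions, and the sketch simply asserts rather than handles overflow/underflow and special values:

```python
import struct

def fp32_mul(a, b):
    """Sketch of the multiply steps: XOR signs, add exponents minus bias,
    multiply 24-bit significands, normalize. Normal, non-zero inputs assumed."""
    ba = struct.unpack(">I", struct.pack(">f", a))[0]
    bb = struct.unpack(">I", struct.pack(">f", b))[0]
    s = (ba >> 31) ^ (bb >> 31)                          # S = S1 XOR S2
    e = ((ba >> 23) & 0xFF) + ((bb >> 23) & 0xFF) - 127  # E = E1 + E2 - Bias
    ip = (((ba & 0x7FFFFF) | (1 << 23)) *
          ((bb & 0x7FFFFF) | (1 << 23)))                 # 48-bit intermediate product
    if ip >> 47:                  # leading one at bit 47: shift right,
        ip >>= 1                  # increment the exponent
        e += 1
    m = (ip >> 23) & 0x7FFFFF     # drop the hidden bit, truncate low bits
    assert 0 < e < 255, "overflow/underflow not handled in this sketch"
    return struct.unpack(">f", struct.pack(">I", (s << 31) | (e << 23) | m))[0]

print(fp32_mul(1.5, -2.0))   # -3.0
```

The two normalization cases (leading one at bit 46 or 47) map directly onto the `if ip >> 47` branch.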



International Journal of Scientific Research in Engineering- IJSRE
Index: 20180208:003. IDSRJI Indexing, Impact Factor: 3.27
C. Floating Point Division

Figure-1 Flow Chart for floating point Addition Subtraction



Figure-2 Flow chart for floating point Multiplication

The algorithm for floating point division is explained through the flow chart in Figure 4. Let N1 and N2 be normalized operands represented by S1, M1, E1 and S2, M2, E2 as their respective sign bit, mantissa (significand) and exponent. Letting x = N1 and d = N2, the final result q is taken as x/d. Again, the following four steps are used for floating point division.

1. Divide significands, subtract exponents, and determine the sign:
   M = M1 / M2
   E = E1 - E2
   S = S1 XOR S2
2. Normalize the mantissa M (shift left or right by 1) and update the exponent E.
3. Round the result to fit in the available bits.
4. Determine exception flags and special values.

The sign bit calculation, mantissa division, exponent subtraction (when working with biased exponents the bias is added back after subtracting, rather than subtracted as in multiplication), rounding the result to fit in the available bits, and normalization are done in a similar way as described for multiplication.
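The divide steps can be modeled the same way as the multiply. Python stands in for the paper's Verilog; the helper name, the fixed-point quotient, and the truncating rounding are our assumptions, the bias handling for biased exponents is made explicit, and overflow/underflow and special values are only asserted against:

```python
import struct

def fp32_div(a, b):
    """Sketch of the divide steps: XOR signs, subtract exponents (adding the
    bias back), divide 24-bit significands in fixed point, normalize."""
    ba = struct.unpack(">I", struct.pack(">f", a))[0]
    bb = struct.unpack(">I", struct.pack(">f", b))[0]
    s = (ba >> 31) ^ (bb >> 31)                          # S = S1 XOR S2
    e = ((ba >> 23) & 0xFF) - ((bb >> 23) & 0xFF) + 127  # biased E1 - E2 + Bias
    sig1 = (ba & 0x7FFFFF) | (1 << 23)
    sig2 = (bb & 0x7FFFFF) | (1 << 23)
    q = (sig1 << 23) // sig2      # fixed-point quotient of the significands
    if not (q >> 23):             # quotient below 1.0: shift left,
        q <<= 1                   # decrement the exponent
        e -= 1
    assert 0 < e < 255, "overflow/underflow not handled in this sketch"
    return struct.unpack(">f", struct.pack(">I",
                         (s << 31) | (e << 23) | (q & 0x7FFFFF)))[0]

print(fp32_div(-7.5, 2.5))   # -3.0
```

Since both significands are in [1, 2), the quotient lies in (0.5, 2), so at most one left shift is ever needed to normalize.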

SIMULATION RESULTS

The following figures show the inputs and outputs of the various operations of the floating point arithmetic unit.

Figure-3 Inputs of FPAU

Figure-4 Outputs of FPAU



CONCLUSION

The arithmetic unit is an important part of any system. Here, a 32-bit floating point arithmetic unit has been simulated using Verilog HDL. The functions of the arithmetic unit were designed using the pipelining technique and were found to be efficient, as the pipeline modules are smaller as well as independent. This design can be further used for higher performance applications.

REFERENCES

1. N. Shirazi, A. Walters, and P. Athanas, “Quantitative analysis of floating point arithmetic


on fpga based custom computing machines,” in Proceedings of the IEEE Symposium on
FPGAs for Custom Computing Machines, pp. 155–162, 1995.
2. P. Belanovic and M. Leeser, “A library of parameterized floating-point modules and their use,” in Proceedings of the International Conference on Field Programmable Logic and Applications, 2002.
3. J. Dido, N. Geraudie, L. Loiseau, O. Payeur, Y. Savaria, and D. Poirier, “A flexible
floating-point format for optimizing data-paths and operators in fpga based dsps,” in
Proceedings of the ACM International Symposium on Field Programmable Gate Arrays,
(Monterey, CA), February 2002.
4. A. A. Gaar, W. Luk, P. Y. Cheung, N. Shirazi, and J. Hwang, “Automating customisation
of floating-point designs,” in Proceedings of the International Conference on Field
Programmable Logic and Applications, 2002.
5. J. Liang, R. Tessier, and O. Mencer, “Floating point unit generation and evaluation for
fpgas,” in Proceedings of the IEEE Symposium on Field- Programmable Custom
Computing Machines, (Napa Valley, CA), pp. 185–194, April 2003.
6. IEEE Standards Board, “IEEE standard for binary floating-point arithmetic,” Tech. Rep.
ANSI/IEEE Std. 754-1985, The Institute of Electrical and Electronics Engineers, New
York, 1985.
7. B. Fagin and C. Renard, “Field programmable gate arrays and floating point arithmetic,” IEEE Transactions on VLSI, vol. 2, no. 3, pp. 365–367, 1994.
8. W. B. Ligon, S. P. McMillan, G. Monn, F. Stivers, K. Schoonover, and K. D. Underwood,
“A re-evaluation of the practicality of floating-point on FPGAs,” in Proceedings of the
IEEE Symposium on FPGAs for Custom Computing Machines, (Napa Valley, CA), pp.
206–215, April 1998.
9. Z. Luo and M. Martonosi, “Accelerating pipelined integer and floating-point
accumulations in configurable hardware with delayed addition techniques,” IEEE
Transactions on Computers, vol. 49, no. 3, pp. 208–218, 2000.
10. X. Wang and B. E. Nelson, “Tradeoffs of designing floating-point division and square root
on virtex fpgas,” in Proceedings of the IEEE Symposium on Field-Programmable Custom
Computing Machines, (Napa Valley, CA), pp. 195–203, April 2003.
11. IEEE 754-2008, IEEE Standard for Floating-Point Arithmetic, 2008.

