
VLSI Implementation of Bit Serial Architecture based Multiplier in Floating Point Arithmetic

Jitesh R Shinde
Research Scholar & IEEE Member, Nagpur, India

Suresh S Salankar
Professor, Electronics & Communication Department, G.H. Raisoni College of Engineering, Nagpur, India

Abstract— VLSI implementation of neural network processing or digital signal processing based applications comprises a large number of multiplication operations. A key design issue in such applications therefore depends on efficient realization of the multiplier block, which involves a trade-off between precision, dynamic range, area, speed and power consumption of the circuit.

The study in this paper investigates the performance of a VLSI implementation of a bit serial architecture based multiplier (Type III) in floating point arithmetic (IEEE 754 single precision format).

Results of implementation of a 32x32 bit multiplier on FPGA as well as on a backend VLSI design tool indicate that the bit serial architecture based multiplier design provides a good trade-off in terms of area, speed, power and precision over the array multiplier and other multiplier approaches proposed over the last decade. In other words, the bit serial architecture based multiplier (Type III) approach may provide a good multi-objective solution for VLSI circuits.

Keywords—Array Multiplier, bit serial architecture based multiplier, floating point arithmetic, Not a Number (NaN), underflow or de-normalized number, Ripple Carry Adder (RCA).

I. INTRODUCTION

Multiplication is the fundamental operation in neural network processing or digital signal processing (DSP) based applications. There are certain applications in this domain wherein not only should the design be area-power and speed efficient, but it also demands a high degree of precision and dynamic range. Therefore, in such cases implementation of the multiplier block in floating point arithmetic with adequately chosen parameters appears to be a good compromise [1].

For performing floating point multiplication the numbers are represented in the desired floating point format. The product is obtained by multiplying the mantissas and adding the exponents. The sign bits are combined separately to determine the sign of the product [2].
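To make these steps concrete, the following Python sketch models the procedure just described (combine the signs, add the exponents with the double bias removed, multiply the 24 bit significands and renormalize). It is a simplified software illustration that assumes normalized inputs and truncates instead of rounding; it is not the hardware design presented later in this paper.

import struct

def fp32_mul_sketch(a_bits, b_bits):
    """Multiply two IEEE 754 single precision words given as 32-bit integers.
    Simplified model: assumes normalized inputs, truncates instead of rounding."""
    sign = ((a_bits >> 31) ^ (b_bits >> 31)) & 1           # sign of the product
    exp_a = (a_bits >> 23) & 0xFF
    exp_b = (b_bits >> 23) & 0xFF
    man_a = (a_bits & 0x7FFFFF) | 0x800000                 # restore the hidden '1'
    man_b = (b_bits & 0x7FFFFF) | 0x800000

    exp = exp_a + exp_b - 127                              # add exponents, remove the double bias
    prod = man_a * man_b                                   # 24 x 24 -> 48 bit product

    if prod & (1 << 47):                                   # product in [2, 4): shift right, bump exponent
        prod >>= 1
        exp += 1
    man = (prod >> 23) & 0x7FFFFF                          # keep 23 fraction bits (truncated)

    return (sign << 31) | ((exp & 0xFF) << 23) | man

def to_bits(x):
    return struct.unpack('<I', struct.pack('<f', x))[0]

def from_bits(b):
    return struct.unpack('<f', struct.pack('<I', b))[0]

print(from_bits(fp32_mul_sketch(to_bits(1.5), to_bits(2.5))))   # 3.75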

II. PROBLEM STATEMENT

The demand of our research work, based on multi-objective optimization for VLSI implementation of a neural network, was that the multiplier used in the design should be area-speed-power efficient and should simultaneously have a high degree of precision and dynamic range.

The multiplier approaches presented in the last decade did not satisfy our goal, i.e. a VLSI implementation of a floating point multiplier which is area-power-speed efficient and has a high degree of precision and dynamic range.

There were three possible approaches to meet our design constraints, viz.

- array multiplier (MUL1) (fig. 2.1),
- multiplier based on the digit serial architecture approach (Type III) (MUL2) [3],
- multiplier based on the bit serial architecture approach (Type III) (MUL3) [4].

Fig.2.1: An example of 4×4 Array multiplier

Fig.2.2: Bit-serial type-III multiplier with word-length of 4 bits
Fig.2.3: Digit cell for type-III multiplier

Bit-serial arithmetic and communication is efficient for computational processes, allowing good communication within and between VLSI chips and tightly pipelined arithmetic structures. It is ideal for neural networks as it minimizes the interconnect requirement by eliminating multi-wire busses [15].

A comparative analysis of the N×N multipliers, with multiplicand data size A and multiplier data size N = 8 in both cases, is shown in table 2.1.

Table 2.1: Comparison of Multipliers

Parameters | MUL1 (Array) | MUL2 (Digit) | MUL3 (Bit Serial)
G1 (AND gates) | N² = 64 | W*(D*D) = 32 | N = 8
G2 (Adders) | N(N-1) = 12 | 2*(N/W) = 8 | N = 8
Pipelining | Absent | Present | Present
Speed | Low | Best due to unfolding concept | Better
Area | High | Better than array multiplier | Optimum
Dynamic Power Dissipation | Moderate | Higher than bit serial architecture due to unfolding concept | Optimum

where the notations G1, G2 used in the above table are as follows:

- G1 => approximate number of AND gates required for partial product implementation.
- G2 => approximate number of Full Adders required.
- Digit size D = N/W = 4. Number of foldings W = 2.
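As a quick numeric check, the short Python fragment below evaluates the G1 (AND gate) expressions of Table 2.1 for the parameter values stated above (N = 8, W = 2, D = N/W = 4); it simply restates the table's formulas and adds nothing to the design itself.

# Approximate AND-gate counts (G1) from Table 2.1 for N = 8, W = 2, D = N/W = 4.
N, W = 8, 2
D = N // W                      # digit size for the digit-serial design

g1_array  = N * N               # MUL1: one AND gate per partial-product bit -> 64
g1_digit  = W * (D * D)         # MUL2: digit-serial (Type III)              -> 32
g1_serial = N                   # MUL3: bit-serial (Type III)                -> 8
print(g1_array, g1_digit, g1_serial)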
Comparative analysis suggests that the bit serial architecture (Type III) provides a better trade-off to realize a multi-objective optimization approach for VLSI implementation of a digital neural network.

So, the research work in this paper presents two multipliers, viz. the array multiplier and the bit serial architecture based multiplier, implemented in floating point arithmetic (IEEE 754 single precision format).

III. DESIGN & IMPLEMENTATION

The generalized block schematic of the multiplier used is shown in figure 3.1. The major entities in the block diagram are unpackfp, multiplier block, exponent adder block, fpnormalize, fpround and packfp respectively. The description and working of each block is as follows:

Fig.3.1: Generalized block schematic structure of IEEE 754 single precision multiplier block

Function of unpackFP block, packFP block & logic block to predict nature of output: The unpack block unpacks the incoming data (31 down to 0) into three parts, viz. sign bit (MSB, bit 31), exponent (30 down to 23) and mantissa (22 down to 0). This block maps the 23 bit mantissa into 32 bits by appending zeros at the LSBs.

The packfp block packs the final result of the multiplication obtained after normalization & rounding, i.e. its mantissa, exponent and sign bit, into IEEE 754 single precision format.

The unpackfp and packfp blocks also check the exponent and mantissa parts of the inputs for the following conditions: underflow (de-normalized number), overflow, infinity and not a number (NaN), to verify that the 32 bit input data, and hence the output (via the FPpack block and the logic block to predict the nature of output), is a valid IEEE 754 single precision floating point number.
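As a software illustration of the field handling described above, the Python sketch below splits and reassembles an IEEE 754 single precision word. The function names are illustrative only and do not correspond to the HDL entities of the design; unlike the unpackFP block, this sketch does not widen the 23 bit mantissa to 32 bits.

def unpackfp(word):
    """Split a 32-bit IEEE 754 single precision word into sign, exponent, mantissa.
    Bit 31 is the sign, bits 30..23 the exponent, bits 22..0 the mantissa."""
    sign     = (word >> 31) & 0x1
    exponent = (word >> 23) & 0xFF
    mantissa =  word        & 0x7FFFFF
    return sign, exponent, mantissa

def packfp(sign, exponent, mantissa):
    """Reassemble the three fields into a 32-bit IEEE 754 single precision word."""
    return ((sign & 0x1) << 31) | ((exponent & 0xFF) << 23) | (mantissa & 0x7FFFFF)

# Round-trip example: 0x40490FDB is the single precision encoding of pi.
s, e, m = unpackfp(0x40490FDB)
assert packfp(s, e, m) == 0x40490FDB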

The summary of the conditions used in the unpackFP block and the logic block to predict the nature of output, to check that the 32 bit input data is a valid IEEE 754 single precision floating point number, is given in table 3.1 and in the flowchart of figure 3.2.

Table 3.1: Table listing conditions to check nature of input

Number | sign | exponent | mantissa
normalized number | 0/1 | 01 to FE | any value
de-normalized number (underflow) | 0/1 | 00 | any value
zero | 0/1 | 00 | 0
infinity (overflow) | 0/1 | FF | 0
NaN i.e. Not a Number (inf*0 or inf/inf or 0/0 form) | 0/1 | FF | any value but not 0

The IEEE 754 standard partially solves the problem of underflow by using de-normalized representations, in which a de-normalized representation is characterized by an exponent code of all 0's, interpreted as having the whole part of the significand be an implied 0 instead of an implied 1.
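A minimal Python rendering of the checks in Table 3.1, classifying a 32 bit word by its exponent and mantissa fields; this is only a software model that follows the table directly, not the logic block itself.

def classify_fp32(word):
    """Classify a 32-bit input word according to Table 3.1."""
    exponent = (word >> 23) & 0xFF
    mantissa =  word        & 0x7FFFFF
    if exponent == 0x00:
        return "zero" if mantissa == 0 else "de-normalized number (underflow)"
    if exponent == 0xFF:
        return "infinity (overflow)" if mantissa == 0 else "NaN"
    return "normalized number"            # exponent in 01..FE

print(classify_fp32(0x7F800000))   # infinity (overflow)
print(classify_fp32(0x00000001))   # de-normalized number (underflow)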

Fig.3.2: Flowchart representing logic used in program to check for underflow condition

Fig.3.3: RTL view of bit serial architecture based multiplier block

Fig.3.4: State diagram of fsm_31bsmt31 used in bit serial architecture based multiplier block

Fig.3.4: RTL view of bsmt3_241_test2 used in bit serial architecture based multiplier block
Multiplier block and exponent adder block: The multiplier block performs the multiplication of the two input data (mantissas) coming from the unpackFP0 & unpackFP1 blocks and generates the mantissa part of the final output. The output data bus of the multiplier block implemented with array multiplier logic (32 bit × 32 bit, figure 3.1) was 64 bits wide and the total number of partial products inferred was 64. The output data bus of the multiplier block implemented with bit serial logic was 24 bits wide and the number of AND gates inferred to implement the partial products was 24.

Bit serial architecture based multiplier block working: Initially, by applying rst = '1' in the first clock cycle, the entire circuit is reset, i.e. sel = "00000", load = '0'. In the next clock cycle, with rst = '0', the counter gets initialized and the count value gets loaded into sel. The counter value changes with respect to the external clock, i.e. every 2 cycles of the main clock. The counter increments till count = 25, after which it rolls back to zero.

As soon as the count becomes zero, load is set and the inputs A & B are loaded into bsmt3_241_test2 from the data_gen1 and data_gen2 blocks. The select line sel(4:0) of the multiplexer muxps241 is driven by the counter of the fsm_31bsmt31 block. The multiplexer applies bit b(i) to the d input of the bsmt3_241 block. The bsmt3_241 block performs the multiplication operation based on the bit serial Type III approach as shown in figure 2.2.

At count = 25, the 48 bit (47 down to 0) result of the multiplication is obtained. The stop241 block copies the 24 bits (47 down to 24) into the output port of the multiplier block, i.e. dout.

The RTL views and state diagram of the blocks used in the implementation of this block are shown in figures 3.3, 3.4 & 3.5 respectively.
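As a behavioural reference for the sequence described above (one bit of B applied per step, partial products accumulated into a 48 bit result, upper 24 bits retained), the Python sketch below models the 24 × 24 bit serial multiplication arithmetic. It does not model the clocking, the counter or the internal structure of bsmt3_241; it only reproduces the numerical result.

def bit_serial_mul24(a, b):
    """Behavioural model of the 24x24 bit serial multiplication:
    one partial product per step, selected by one bit of b, accumulated
    into a 48-bit result; the multiplier block keeps the upper 24 bits."""
    assert 0 <= a < (1 << 24) and 0 <= b < (1 << 24)
    acc = 0
    for i in range(24):                      # one multiplier bit per step
        if (b >> i) & 1:                     # bit b(i) selects whether A is added
            acc += a << i
    acc &= (1 << 48) - 1                     # full 48-bit product (bits 47..0)
    dout = acc >> 24                         # bits 47..24, as copied by stop241
    return acc, dout

full, dout = bit_serial_mul24(0xC00000, 0xA00000)   # 24-bit significands of 1.5 and 2.5
print(hex(full), hex(dout))                          # 0x780000000000 0x780000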
The exponent adder block adds the exponent parts of inputs A and B to generate the exponent of output Z.

FPnormalize block: The block schematic of the FPnormalize block is shown in figure 3.5. It first checks whether the MSB of the 23 bit mantissa is one or zero. If it is one, then the mantissa is in de-normalized form and the FPnormalize block converts the de-normalized mantissa into normalized form. The logic used to implement this block is represented in the form of a flowchart in figure 3.6.

Fig.3.5: Block schematic of FPnormalize block

Fig.3.6: Flowchart representing logic used to implement FPnormalize block
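The following Python sketch shows one common way of normalizing a mantissa (shift it left until its most significant bit is set, decrementing the exponent once per shift). It is a generic illustration only; the exact bit level convention used inside the FPnormalize block cannot be read off the figures and may differ.

def fpnormalize(mantissa, exponent, width=24):
    """Generic normalization sketch: shift the mantissa left until its MSB
    (bit width-1) is set, decreasing the exponent by one per shift."""
    if mantissa == 0:
        return mantissa, exponent            # nothing to normalize
    while not (mantissa >> (width - 1)) & 1:
        mantissa = (mantissa << 1) & ((1 << width) - 1)
        exponent -= 1
    return mantissa, exponent

m, e = fpnormalize(0x300000, 10, width=24)
print(hex(m), e)                             # 0xc00000 8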


FPround block: In floating point arithmetic the size of the result of an operation may exceed the size of the binary word used in the number system. In such cases the low order bits have to be eliminated in order to store the result. The method of eliminating these lower order bits is rounding [2].

The FPround block performs the function of rounding. If the third LSB bit of the mantissa is '1', then there is no need of data rounding and SIG_in is passed through unchanged. Otherwise, a '1' is added to SIG_in (22 down to 3) and the result is concatenated with "000" at the LSB end. The block schematic of the FPround block is shown in figure 3.7. The logic used to implement this block is represented in the form of a flowchart in figure 3.8.

Fig.3.7: Block schematic of FPround

Fig.3.8: Flowchart representing logic used to implement FPround block in program
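Read literally, the rounding rule described above can be transcribed into the following Python sketch. Whether this matches the RTL bit for bit is an assumption on our part, since only the textual description and the flowchart of figure 3.8 are available.

def fpround(sig_in):
    """Literal transcription of the rounding rule described above for a
    23-bit significand field sig_in (bits 22..0)."""
    if (sig_in >> 2) & 1:                    # third LSB is '1': no rounding needed
        return sig_in
    upper = (sig_in >> 3) + 1                # add 1 to bits 22..3
    return (upper << 3) & 0x7FFFFF           # concatenate "000" at the LSB end

print(bin(fpround(0b1000)))                  # bit 2 clear -> bits 22..3 incremented: 0b10000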
IV. RESULTS & COMPARISON

The code for the 32×32 bit array multiplier (MUL1) and the bit serial architecture based Type III multiplier (MUL3) was written in the Aldec Active-HDL tool, synthesized with Altera's Quartus tool, and targeted to the FPGA Cyclone II device EP2C5AF256A7. Later, the code was also tested at the backend with a Synopsys tool on 45 nm & 90 nm tech files. The experimental results were found to match the theoretical results.

The performance comparison of the implemented multipliers at the frontend and backend VLSI design levels is given in tables 4.1 and 4.2 respectively, and graphical representations of the total cell area and total dynamic power dissipation are shown in figures 4.1 and 4.2 respectively.
Table 4.1: Performance comparison of implemented multipliers at frontend level

Parameters | MUL1 32×32 bit | MUL3 32×32 bit
Total logic elements | 234 | 209
Total dynamic power dissipation (mW) | 6.11 | 0.54
Worst propagation delay (nsec) | 41.604 | 21.495

Fig 4.1: Total cell area graphical representation of array multiplier and bit serial architecture based multiplier
Table 4.2: Performance comparison of implemented multipliers at backend level

Parameters | MUL1 32×32 bit (90 nm tech file) | MUL3 32×32 bit (90 nm tech file) | MUL1 32×32 bit (45 nm tech file, no workload model) | MUL3 32×32 bit (45 nm tech file, no workload model)
Total area (nm square) | 20677.499757 | 5420.971736 | -- | --
Total cell area (nm square) | 20007.014559 | 5304.729626 | 19676.340759 | 2185.060771
Total dynamic power dissipation (mW) | 0.5913478 | 0.0275544 | 6.3856 | 0.0312965
Data arrival time (nsec) | 19.89 | 4.19 | 1.93 | 1.34

Fig 4.2: Total dynamic power dissipation graphical representation of array multiplier and bit serial architecture based multiplier
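For reference, the relative improvements quoted below follow from Tables 4.1 and 4.2 as 100 * (1 - MUL3/MUL1); a short Python check on a few of the entries, with the values taken from the tables above:

def improvement(mul1, mul3):
    """Percentage improvement of MUL3 over MUL1 for a smaller-is-better metric."""
    return 100.0 * (1.0 - mul3 / mul1)

print(improvement(234, 209))                  # frontend logic elements -> ~10.68 %
print(improvement(41.604, 21.495))            # frontend delay          -> ~48.33 %
print(improvement(0.5913478, 0.0275544))      # 90 nm dynamic power     -> ~95.34 %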
The experimental results at the frontend VLSI design level indicate that MUL3 is better than MUL1:

- in area by 10.6837 %
- in dynamic power dissipation by 83.88 %
- in delay by 48.3342 %

The experimental results at the backend VLSI design level indicate that MUL3 is better than MUL1:

- in total area by 73.7832 % in the 90 nm tech file.
- in total cell area by 73.4856 % in the 90 nm tech file and by 88.8894 % in the 45 nm tech file.
- in dynamic power dissipation by 95.3404 % in the 90 nm tech file and by 99.5099 % in the 45 nm tech file.
- in data arrival time by 79.133 % in the 90 nm tech file and by 30.5699 % in the 45 nm tech file.

V. CONCLUSIONS

The experimental results indicated that the bit serial architecture Type III based multiplier implemented in floating point arithmetic (IEEE 754 single precision format) leads to an area efficient, low power and high speed digital multiplier with a high degree of precision. It has also proven to be a better alternative to the array multiplier. In other words, the approach used by us, i.e. a bit serial architecture Type III based multiplier implemented in floating point arithmetic, provides a good multi-objective optimization solution.

System designers often face problems in realizing and optimizing area, power and speed simultaneously with a high degree of precision & dynamic range in the VLSI implementation of complicated digital circuits. The bit serial architecture Type III based multiplier approach suggested in this paper was found to give better performance than other promising findings available in the literature [5, 6, 7, 8, 9, 10, 11, 12, 13 & 14]. Many of these findings were single-objective optimization based rather than multi-objective optimization based. Thus, the approach suggested in this paper may provide a promising solution in realizing area, power as well as speed efficient optimized designs for VLSI circuits.

Future research work includes application of this module in high-end applications like image processing, neural networks and digital signal processing.

REFERENCES

[1] Jean-Michel Muller, Nicolas Brisebarre, Florent de Dinechin, Claude-Pierre Jeannerod, Vincent Lefèvre, Guillaume Melquiond, Nathalie Revol, Damien Stehlé, Serge Torres, "Handbook of Floating-Point Arithmetic", Birkhäuser Boston, part of Springer Science+Business Media.
[2] A. Nagoor Kani, "Digital Signal Processing", ch. 8, Tata McGraw-Hill.
[3] Yun-Nan Chang, Janardhan H. Satyanarayana, and Keshab K. Parhi, "Systematic Design of High-Speed and Low-Power Digit-Serial Multipliers", IEEE Transactions on Circuits and Systems II: Analog and Digital Signal Processing, Vol. 45, No. 12, December 1998.
[4] P. J. Tayade, A. A. Gurjar, "Systematic Design of High-Speed and Low-Power Digit-Serial Multipliers VLSI Based", International Journal of Management, IT and Engineering, Vol. 2, Issue 5, pp. 439-446, May 2012.
[5] Summit Vaidya, Deepak Dandekar, "Delay-Power Performance Comparison of Multipliers in VLSI Circuit Design", International Journal of Computer Networks & Communications (IJCNC), Vol. II, Issue IV, July 2010.
[6] M. K. Pavuluri, T. S. R. Krishna Prasad, Ch. Rambabu, "Design & Implementation of Complex Floating Point Processor using FPGA", International Journal of VLSI Design & Communication Systems (VLSICS), Vol. IV, Issue V, October 2013.
[7] Prashant Kumar Sahu, Nitin Meena, "Comparative Study of Different Multiplier Architectures", International Journal of Engineering Trends & Technology (IJETT), Vol. IV, Issue X, October 2013.
[8] Deepak Purohit, Himanshu Joshi, "Comparative Study & Analysis of Fast Multipliers", International Journal of Engineering & Technical Research (IJETR), Vol. II, Issue VII, July 2014.
[9] Anitha R, Alekhya Nelapati, L. Jesima W, V. Bagyaveereswaran, "Comparative Study of High Performance Braun's Multiplier using FPGAs", IOSR Journal of Electronics & Communication Engineering (IOSRJECE), Vol. I, Issue IV, pp. 33-37, May-June 2012.
[10] Kumar Mishra, V. Nandanwar, Eskinder Anteneh Ayele, S. B. Dhok, "FPGA Implementation of Single Precision Floating Point Multiplier Using High Speed Compressors", International Journal of Soft Computing & Engineering, Vol. IV, Issue II, May 2014.
[11] B. Jeevan, S. Narendra, C. V. Reddy, K. Sivani, "A High Speed Binary Floating Point Multiplier Using Dadda Algorithm", IEEE, 2013.
[12] Shaifali, Sakshi, "FPGA Design of Pipelined 32-bit Floating Point Multiplier", International Journal of Computational & Management, Vol. XVI, Issue V, September 2013.
[13] Chaitali V. Matey, S. D. Chede, S. M. Sakhare, "Design & Implementation of Floating Point Multiplier Using Wallace and Dadda Algorithm", International Journal of Application or Innovation in Engineering & Management, Vol. III, Issue VI, June 2014.
[14] R. Sai Siva Teja, A. Madhusudhan, "FPGA Implementation of Low Area Floating Point Multiplier Using Vedic Mathematics", International Journal of Engineering & Advanced Engineering, Vol. III, Issue XII, December 2013.
[15] Alan F. Murray, Anthony V. W. Smith, and Zoe F. Butler, "Bit-Serial Neural Networks", American Institute of Physics, 1988.
[16] Jitesh Shinde, Suresh Salankar, "VLSI Implementation of Neural Network", Current Trends in Technology & Science Journal, Vol. 4, Issue 03, April-May 2015.

