An Optimized Modified Parallel Implementation Design of Multiplier and Accumulator Operator
An Optimized Modified Parallel Implementation Design of Multiplier and Accumulator Operator
Nustulapur, Karimnagar.
Department of Electronics and Communication Engineering.
By
UNDER THE GUIDENCE
E. EVANGELINE,
J. RAMESH
M.Tech(VLSI Design),
18271D5701.
Agenda:
• Abstract
• Introduction
• Design Analysis
• Tools Used
• FPGA Implementation
• ASIC Implementation
• Simulation Results
• Conclusion
Abstract
• a new architecture of multiplier-and-accumulator (MAC) for high-speed
arithmetic. By combining multiplication with accumulation and devising a hybrid
type of carry save adder (CSA), the performance was improved.
• Since the accumulator that has the largest delay in MAC was merged into CSA,
the overall performance was elevated. The proposed CSA tree uses 1’s-
complement-based radix-2 modified Booth’s algorithm (MBA) and has the
modified array for the sign extension in order to increase the bit density of the
operands.
• The CSA propagates the carries to the least significant bits of the partial products
and generates the least significant bits in advance to decrease the number of the
input bits of the final adder.
Introduction :
• Types of multipliers:
• Binary serial multiplier
• Parallel multiplier
• Booth encoding
• Modified Booth encoding
Binary Serial Multiplier :
• The last adder in the multiplier has a carry chain.The earlier additions
are performed by full adders are used to reduce three one-bit inputs
to two one-bit outputs.
Disadvantage :
• Critical path will be more.
• Standard Design
• Elguibaly’s Architecture
• Proposed Architecture
Standard Design :
n n+1
Accumulation
Final addition
Z(2n+1 bits)
n+1
X 2n+1
CSA tree
2n
n n+1 C P
Y n+1
n+1
S
Drawbacks:
• There are two bottlenecks to be considered to increase the speed of
MAC :
Partial products reduction network
Accumulator
• Since the accumulation has the longest delay in MAC operation, the
independent accumulation operation has been removed and is
merged into the compression process of the partial products.
so that overall MAC performance has been improved.
One of the most advanced types of MAC for general-purpose
Digital Signal Processing has been proposed by Elguibaly.
n-1
n+1
n+1
n
X
Final Adder
n+2
n n+1
Y C P[2n:n-1]
n+1 n+2
n+1
S
HA 0 0 0 0 3n/2 12
2 bit CLA 0 0 ( n/2 -1) 3 n/2 4
4-bit CLA 0 0 0 - n/4 2
Accumulator (2n+1) bits 1 - - - -
CLA
Final adder 2n bits 16 ( n+ 2 ) bits 10 n-bits 8
Disadvantage:
2. Elguibaly’s Architecture
3. Proposed Architecture:
4. Elguibaly’s architecture with pipelining:
5. Proposed architecture with pipelining:
Conclusion: