0% found this document useful (0 votes)

47 views39 pages

An Optimized Modified Parallel Implementation Design of Multiplier and Accumulator Operator

The document describes a new optimized parallel implementation of a multiplier and accumulator (MAC) operator. It proposes combining multiplication and accumulation into a hybrid carry-save adder tree structure. This improves performance by merging the accumulator, which has the longest delay, into the partial product compression. The design uses a 1's complement Booth encoding and modified arrays to increase operand density and reduce final adder inputs. It analyzes the proposed design against standard and Elguibaly MAC architectures in terms of hardware resources and performance when pipelined. The MAC was implemented on FPGA using Xilinx tools and for ASIC using Cadence design suites.

Uploaded by

VigneshInfotech

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

47 views39 pages

An Optimized Modified Parallel Implementation Design of Multiplier and Accumulator Operator

Uploaded by

VigneshInfotech

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 39

JYOTHISHMATHI INSTITUTE OF TECHONOLGY AND SCIENCES

Nustulapur, Karimnagar.
Department of Electronics and Communication Engineering.

AN OPTIMIZED MODIFIED PARALLEL IMPLEMENTATION

DESIGN
OF MULTIPLIER AND ACCUMULATOR OPERATOR

By
UNDER THE GUIDENCE
E. EVANGELINE,
J. RAMESH
M.Tech(VLSI Design),
18271D5701.
Agenda:

• Abstract
• Introduction
• Design Analysis
• Tools Used
• FPGA Implementation
• ASIC Implementation
• Simulation Results
• Conclusion
Abstract
• a new architecture of multiplier-and-accumulator (MAC) for high-speed
arithmetic. By combining multiplication with accumulation and devising a hybrid
type of carry save adder (CSA), the performance was improved.
• Since the accumulator that has the largest delay in MAC was merged into CSA,
the overall performance was elevated. The proposed CSA tree uses 1’s-
complement-based radix-2 modified Booth’s algorithm (MBA) and has the
modified array for the sign extension in order to increase the bit density of the
operands.
• The CSA propagates the carries to the least significant bits of the partial products
and generates the least significant bits in advance to decrease the number of the
input bits of the final adder.
Introduction :

• In many DSP applications like filtering and convolution, multiplier and

multiplier and accumulator ( MAC ) are the most essential elements.

• The current trend in ALU design is to implement the addition and

multiplication operations using one hardware component i.e., MAC
Unit.

• MAC = Multiplication + Accumulation

Contd…

• Types of multipliers:
• Binary serial multiplier
• Parallel multiplier
• Booth encoding
• Modified Booth encoding
Binary Serial Multiplier :
• The last adder in the multiplier has a carry chain.The earlier additions
are performed by full adders are used to reduce three one-bit inputs
to two one-bit outputs.

Disadvantage :
• Critical path will be more.

To reduce the critical path, we will go for parallel multipliers

which uses Booth Encoding concept.
Parallel multipliers :
Booth encoding :
Steps involved in Booth encoding:
Step 1:
Determine the values of A and S, and the initial value of P. All of
these numbers should have a length equal to (x + y + 1).
• A: Fill the most significant (leftmost) bits with the value of m. Fill
the remaining (y + 1) bits with zeros.
• S: Fill the most significant bits with the value of (−m) in two's
complement notation. Fill the remaining (y + 1) bits with zeros.
• P: Fill the most significant x bits with zeros. To the right of this,
append the value of r. Fill the least significant (rightmost) bit with
a zero.
Step 2:
Determine the two least significant (rightmost) bits of P.
• If they are 01, find the value of P + A. Ignore any overflow.
• If they are 10, find the value of P + S. Ignore any overflow.
• If they are 00, do nothing. Use P directly in the next step.
• If they are 11, do nothing. Use P directly in the next step.
Step 3:
Arithmetically shift the value obtained in the 2nd step by a single
place to the right. Let P now equal this new value.
Step 4: Repeat steps 2 and 3 until they have been done y times.
Step 5: Drop the least significant (rightmost) bit from P. This is the
product of m and r.
Types of Adders:

• Ripple carry adder

• Carry Look Ahead adder
• Carry Save adder

By taking the advantages of both the multiplier and adder

architectures, a hybrid type of CSA structure is used in this
MAC design.
Design Analysis:
Overview of MAC :

Fig. Hardware architecture of general MAC.

Fig. Basic arithmetic steps of multiplication and accumulation.

Contd…

• In general, For N X N bit multiplication,

The required partial products are N.
• Execution time is proportional to N.
• For faster multiplication, the architecture uses Booth which reduces the
partial products to half.
• This architecture uses a Hybrid type of CSA to add the partial products.
Different types of Parallel MAC architectures are :

• Standard Design
• Elguibaly’s Architecture
• Proposed Architecture
Standard Design :

Fig. Standard design

Contd…

Hardware architecture for the standard design :

BBBooth Encoder

n n+1

Accumulation
Final addition

Z(2n+1 bits)
n+1
X 2n+1
CSA tree

2n
n n+1 C P
Y n+1
n+1
S
Drawbacks:
• There are two bottlenecks to be considered to increase the speed of
MAC :
Partial products reduction network
Accumulator

• Since the accumulation has the longest delay in MAC operation, the
independent accumulation operation has been removed and is
merged into the compression process of the partial products.
so that overall MAC performance has been improved.
One of the most advanced types of MAC for general-purpose
Digital Signal Processing has been proposed by Elguibaly.

• critical path was reduced.

• number of input bits to the final adder will be reduced.
• But the output rate will be poor.
Elguibaly’s Architecture:

Fig. Parallel MAC architecture proposed by Elguibaly.

Contd…

n-1

n+1

CSA & Accumulator

P[n-2:0]
Booth Encoder

n+1
n
X

Final Adder
n+2
n n+1
Y C P[2n:n-1]
n+1 n+2
n+1
S

Fig. Hardware structure proposed by Elguibaly

Limitations :

• Even though it has a better performance because of the reduced

critical path, output rate will be poor.

To improve the output rate and performance we will go for

proposed architecture.
Proposed parallel MAC architecture :

Fig. Proposed arithmetic operation of multiplication and accumulation.

Contd…

Fig. Hardware architecture of the proposed MAC.

Characteristics of CSA tree:

Standard Elguibaly’s Proposed

Design design design
Number System 2’s complement 1’s complement 1’s complement

Sign Extension Used Used Not Used

Accumulation Result Data of Result Data of Sum and Carry of

Final Addition Final Addition CSA

CSA Tree FA,HA FA,2-bits CLA FA, HA, 2-bit CLA

Final Adder 2n bits n+2 bits n bits

Table . Calculation of Hardware Resources

Component Standard Elguibaly’s Proposed

Design Design design

General 8-bits General 8-bits General 8-Bits

FA ( n2 / 2 + n ) 40 ( n2/2+2n+3 ) 51 ( n2/2+n/2) 36

HA 0 0 0 0 3n/2 12
2 bit CLA 0 0 ( n/2 -1) 3 n/2 4
4-bit CLA 0 0 0 - n/4 2
Accumulator (2n+1) bits 1 - - - -
CLA
Final adder 2n bits 16 ( n+ 2 ) bits 10 n-bits 8
Disadvantage:

• Delay is more compared to the previous Elguibaly’s architecture.

• But the overall performance is increased if the pipelining concept is

applied for both the Elguibaly’s architecture and proposed
architecture.
Pipelining Scheme:

Fig. Pipelined Hardware structure a) Elguibaly’s design b) proposed design

• LANGUAGE USED: VHDL
• TOOLS REQUIRED: Simulation: modelsim5.8c
• Synthesis: Xilinx 9.1
3.Proposed Architecture:

RTL schematic diagram

5. Proposed Architecture with pipelining:

RTL schematic diagram

ASIC Implementation:
RTL Synthesis diagrams:
1. Elguibaly’s architecture with pipelining:
Contd…

2. Proposed Architecture with pipelining:

1. standard design

2. Elguibaly’s Architecture
3. Proposed Architecture:
4. Elguibaly’s architecture with pipelining:
5. Proposed architecture with pipelining:
Conclusion:

• The MAC unit is proposed and designed by combining a hybrid type

CSA structure and Modified Booth’s Algorithm using Xilinx ISE Design
suite for FPGA implementation and Cadence Semi-Custom Design
Suite for ASIC Design for TSMC 180nm.

• The overall performance parameter of the proposed MAC unit with

pipelining is increased by 49.05 % compared to the Elguibaly’s MAC
unit with pipelining.
Future scope

• The MAC unit can be extended by replacing Booth-2 algorithm with

Booth-3 algorithm. Using Booth-2 algorithm number of partial
products is reduced to half. Similarly using Booth-3 algorithm number
of partial products are reduces to n/3, so that delay will be reduced.
The Booth-3 algorithm extension can be done with an additional cost
of hardware components.
Thank You

A New Vlsi Architecture of Parallel Multiplier-Accumulator Based On Radix-2 Modified Booth Algorithm
No ratings yet
A New Vlsi Architecture of Parallel Multiplier-Accumulator Based On Radix-2 Modified Booth Algorithm
6 pages
A New VLSI Architecture of Parallel Multiplier-Accumulator Based On Radix-2 Modified Booth Algorithm
No ratings yet
A New VLSI Architecture of Parallel Multiplier-Accumulator Based On Radix-2 Modified Booth Algorithm
8 pages
Thesis Phase 1 Report
No ratings yet
Thesis Phase 1 Report
7 pages
Parallel MAC
No ratings yet
Parallel MAC
6 pages
A New VLSI Architecture of Parallel Multiplier-Accumulator Based On Radix-2 Modified Booth Algorithm
No ratings yet
A New VLSI Architecture of Parallel Multiplier-Accumulator Based On Radix-2 Modified Booth Algorithm
5 pages
1.5. MAC 1.5.1 Block Diagram of MAC
No ratings yet
1.5. MAC 1.5.1 Block Diagram of MAC
11 pages
Integer Multiplication and Accumulation
No ratings yet
Integer Multiplication and Accumulation
5 pages
Implementation of Low Power and High Speed Multiplier-Accumulator Using SPST Adder and Verilog
No ratings yet
Implementation of Low Power and High Speed Multiplier-Accumulator Using SPST Adder and Verilog
8 pages
Implementation of Low Power and High Speed Multiplier-Accumulator Using SPST Adder and Verilog
No ratings yet
Implementation of Low Power and High Speed Multiplier-Accumulator Using SPST Adder and Verilog
8 pages
Design of High-Speed Area Efficient Mac Unit Using Reversible Logic
No ratings yet
Design of High-Speed Area Efficient Mac Unit Using Reversible Logic
6 pages
Mac
No ratings yet
Mac
20 pages
Implementation of MAC Unit Using Booth Multiplier & Ripple Carry Adder
No ratings yet
Implementation of MAC Unit Using Booth Multiplier & Ripple Carry Adder
3 pages
DSP Arch
No ratings yet
DSP Arch
10 pages
Ijarcet Vol 1 Issue 5 346 351
No ratings yet
Ijarcet Vol 1 Issue 5 346 351
6 pages
A Reconfigurable Architecture of A High Performance 32-Bit MAC Unit For Embedded DSP
No ratings yet
A Reconfigurable Architecture of A High Performance 32-Bit MAC Unit For Embedded DSP
4 pages
DSD ch-5 Building Blocks
No ratings yet
DSD ch-5 Building Blocks
85 pages
A Novel High Performance Implemance and Design of 64 Bit MAC Unit& Their Delay Comparision
No ratings yet
A Novel High Performance Implemance and Design of 64 Bit MAC Unit& Their Delay Comparision
17 pages
International Journal of Computational Engineering Research (IJCER)
No ratings yet
International Journal of Computational Engineering Research (IJCER)
6 pages
Imp 22
No ratings yet
Imp 22
31 pages
Priyanka - 50300 16 130
No ratings yet
Priyanka - 50300 16 130
4 pages
Review of MAC Unit For Complex Numbers
No ratings yet
Review of MAC Unit For Complex Numbers
3 pages
Vlsi Architecture of Parallel Multiplier - Accumulator Based
No ratings yet
Vlsi Architecture of Parallel Multiplier - Accumulator Based
8 pages
Design of High Performance 64 Bit MAC Unit
No ratings yet
Design of High Performance 64 Bit MAC Unit
5 pages
DSP Notes Unit1 and 2
No ratings yet
DSP Notes Unit1 and 2
45 pages
FPGA Implementation of Efficient Modifie
No ratings yet
FPGA Implementation of Efficient Modifie
4 pages
VLSI Designing of High Speed Parallel Multiplier - Accumulator Based On Radix4 Booths Multiplier
No ratings yet
VLSI Designing of High Speed Parallel Multiplier - Accumulator Based On Radix4 Booths Multiplier
7 pages
Ece-Vii-dsp Algorithms & Architecture U2
No ratings yet
Ece-Vii-dsp Algorithms & Architecture U2
21 pages
A New Vlsi Architecture For Modi Ed
No ratings yet
A New Vlsi Architecture For Modi Ed
6 pages
SP Unit 3 SB
No ratings yet
SP Unit 3 SB
72 pages
A High-Speed, Energy-Efficient Two-Cycle Multiply-Accumulate (MAC) Architecture and Its Application To A Double-Throughput MAC Unit
No ratings yet
A High-Speed, Energy-Efficient Two-Cycle Multiply-Accumulate (MAC) Architecture and Its Application To A Double-Throughput MAC Unit
9 pages
Optimization of Delay IIN Pipeline Mac Unit Using Wallace Tree Multiplier
No ratings yet
Optimization of Delay IIN Pipeline Mac Unit Using Wallace Tree Multiplier
9 pages
PXC 3878710
No ratings yet
PXC 3878710
4 pages
VLSI Implementation of Modified Booth Algorithm: Rasika Nigam, Jagdish Nagar
No ratings yet
VLSI Implementation of Modified Booth Algorithm: Rasika Nigam, Jagdish Nagar
4 pages
Advanced VLSI Design: Dr. Premananda B.S
No ratings yet
Advanced VLSI Design: Dr. Premananda B.S
42 pages
Low Power Efficient MAC Unit Using Proposed Carry Select Adder
No ratings yet
Low Power Efficient MAC Unit Using Proposed Carry Select Adder
6 pages
Unit-5 DSP Processor
No ratings yet
Unit-5 DSP Processor
28 pages
Implementation Methods
No ratings yet
Implementation Methods
30 pages
Project Review: by Vamsikrishna Chemudupati 14BEC0022
No ratings yet
Project Review: by Vamsikrishna Chemudupati 14BEC0022
35 pages
DSP R20 Unit V
No ratings yet
DSP R20 Unit V
23 pages
Module 2 Notes
No ratings yet
Module 2 Notes
28 pages
Architectures For Programmable Digital Signal Processing Devices
No ratings yet
Architectures For Programmable Digital Signal Processing Devices
24 pages
A Fast 16 &#x00D7 16 Bit Asynchronous CMOS Multiplier
No ratings yet
A Fast 16 &#x00D7 16 Bit Asynchronous CMOS Multiplier
3 pages
Unit 2 Architectures For Programmable Digital Signal-Processors
No ratings yet
Unit 2 Architectures For Programmable Digital Signal-Processors
57 pages
COMPUTER ORGANISATION (LONG ANSWERS 2 PM)
No ratings yet
COMPUTER ORGANISATION (LONG ANSWERS 2 PM)
5 pages
Pal Durai 2014
No ratings yet
Pal Durai 2014
5 pages
Design of Modulo 2 - 1 Multiplier Based On Radix-8 Booth Algorithm Using Residue Number System
No ratings yet
Design of Modulo 2 - 1 Multiplier Based On Radix-8 Booth Algorithm Using Residue Number System
8 pages
Implementation and Comparison of Radix-8 Booth Multiplier by Using 32-Bit Parallel Prefix Adders For High Speed Arithmetic Applications
No ratings yet
Implementation and Comparison of Radix-8 Booth Multiplier by Using 32-Bit Parallel Prefix Adders For High Speed Arithmetic Applications
11 pages
Cpe626 Multipliers
No ratings yet
Cpe626 Multipliers
37 pages
High Performance Multiply
No ratings yet
High Performance Multiply
11 pages
Adders and Multipliers
No ratings yet
Adders and Multipliers
59 pages
Lecture 35
No ratings yet
Lecture 35
34 pages
DSP-8 (DSP Processors)
No ratings yet
DSP-8 (DSP Processors)
8 pages
Radix-4 and Radix-8 Multiplier Using Verilog HDL
No ratings yet
Radix-4 and Radix-8 Multiplier Using Verilog HDL
6 pages
Radix-4 and Radix-8 Multiplier Using Verilog HDL: (Ijartet) Vol. 1, Issue 1, September 2014
No ratings yet
Radix-4 and Radix-8 Multiplier Using Verilog HDL: (Ijartet) Vol. 1, Issue 1, September 2014
6 pages
DSP - Presentation - Sumit 3
No ratings yet
DSP - Presentation - Sumit 3
63 pages
DSP - Presentation - Sumit 5
No ratings yet
DSP - Presentation - Sumit 5
45 pages
CV Saber
No ratings yet
CV Saber
3 pages
Saitej Resume Done
No ratings yet
Saitej Resume Done
2 pages
de-220121-180442-M.Tech - M.Pharm Supply Project Thesis Uploading Notification Jan-2022
No ratings yet
de-220121-180442-M.Tech - M.Pharm Supply Project Thesis Uploading Notification Jan-2022
3 pages
2 Mas
No ratings yet
2 Mas
29 pages
Project Documentation Guidelines - ECE - JITS2013
No ratings yet
Project Documentation Guidelines - ECE - JITS2013
4 pages
Discrete Wavelet Transform
No ratings yet
Discrete Wavelet Transform
10 pages
Numbers: 5 and 6 Digits: Answers
No ratings yet
Numbers: 5 and 6 Digits: Answers
37 pages
1 Eng
No ratings yet
1 Eng
32 pages
Date Employee Nam Accompanied by Visited Area School Name
No ratings yet
Date Employee Nam Accompanied by Visited Area School Name
4 pages
STUDENT Name:: Mundada Vinay Kumar Email
No ratings yet
STUDENT Name:: Mundada Vinay Kumar Email
2 pages
Implementation of 2D-DWT: 2 1 1 2 1 2 1 2 1 2 LL LH HL HH LL LH HL HH LH HL HH
No ratings yet
Implementation of 2D-DWT: 2 1 1 2 1 2 1 2 1 2 LL LH HL HH LL LH HL HH LH HL HH
4 pages
Performance Analysis of BPSK and DPSK Systems in The Presence of Nakagami-M
No ratings yet
Performance Analysis of BPSK and DPSK Systems in The Presence of Nakagami-M
5 pages
Matlab Code:: 'Zebra - JPG' 'Image With Salt and Pepper Noise'
No ratings yet
Matlab Code:: 'Zebra - JPG' 'Image With Salt and Pepper Noise'
1 page
2 Evs PDF
No ratings yet
2 Evs PDF
39 pages
High-Throughput Interpolator Architecture For Low-Complexity Chase Decoding of Rs Codes
No ratings yet
High-Throughput Interpolator Architecture For Low-Complexity Chase Decoding of Rs Codes
3 pages
Design of An Error Detection and Data Recovery Architecture For Motion Estimation Testing Applications
No ratings yet
Design of An Error Detection and Data Recovery Architecture For Motion Estimation Testing Applications
3 pages
Multi-Feature and Genetic Algorithm
No ratings yet
Multi-Feature and Genetic Algorithm
41 pages
VTVL13 With Error
No ratings yet
VTVL13 With Error
3 pages
Inner Bound On The Gdof of The K-User Mimo Gaussian Symmetric Interference Channel
No ratings yet
Inner Bound On The Gdof of The K-User Mimo Gaussian Symmetric Interference Channel
8 pages
Image Retrieval Using Multi-Feature and Genetic Algorithm: Under The Guidance of K.Venkatramana Sir Associate Professor
No ratings yet
Image Retrieval Using Multi-Feature and Genetic Algorithm: Under The Guidance of K.Venkatramana Sir Associate Professor
13 pages
Supporting Multi Data Stores Applications in Cloud Environments
No ratings yet
Supporting Multi Data Stores Applications in Cloud Environments
14 pages
Title NO.: Acknowledgement List of Figures
No ratings yet
Title NO.: Acknowledgement List of Figures
3 pages
Module Description
No ratings yet
Module Description
1 page
System Design
No ratings yet
System Design
6 pages
Supporting Multi Data Stores Applications in Cloud Environments
No ratings yet
Supporting Multi Data Stores Applications in Cloud Environments
6 pages
Q&A DLCO Modified
No ratings yet
Q&A DLCO Modified
92 pages
Researchpaper Design of ALU Using Reversible Logic Based Low Power Vedic Multiplier
No ratings yet
Researchpaper Design of ALU Using Reversible Logic Based Low Power Vedic Multiplier
5 pages
Basic Multiplier Circuit: Week 9
No ratings yet
Basic Multiplier Circuit: Week 9
9 pages
ETE Practice Questions Set - 1
No ratings yet
ETE Practice Questions Set - 1
2 pages
By B.Ravina 17HM1D5701: Dept. of E.C.E
No ratings yet
By B.Ravina 17HM1D5701: Dept. of E.C.E
31 pages
Multiplier
No ratings yet
Multiplier
23 pages
Coa Notes Unit-3
No ratings yet
Coa Notes Unit-3
10 pages
DSP Project
No ratings yet
DSP Project
7 pages
Booth's Algorithm (Ques Included in This)
No ratings yet
Booth's Algorithm (Ques Included in This)
7 pages
Co QB
No ratings yet
Co QB
6 pages
CH 5
No ratings yet
CH 5
56 pages
Lecture4 Multiplier
No ratings yet
Lecture4 Multiplier
60 pages
Design of High Performance Dynamically Truncated A-1
No ratings yet
Design of High Performance Dynamically Truncated A-1
7 pages
Design of Power Efficient Posit Multiplier
No ratings yet
Design of Power Efficient Posit Multiplier
5 pages
Array Multiplier and Carry Save Multiplier
No ratings yet
Array Multiplier and Carry Save Multiplier
28 pages
Analysis of 8 X 8 Bit
No ratings yet
Analysis of 8 X 8 Bit
4 pages
On The Design of The FFT Butterfly Units
No ratings yet
On The Design of The FFT Butterfly Units
1 page
PBL Topics For VLSI
No ratings yet
PBL Topics For VLSI
3 pages
8 Karatsuba Document
No ratings yet
8 Karatsuba Document
75 pages
Binary Multiplier
No ratings yet
Binary Multiplier
27 pages
Tuto 1
No ratings yet
Tuto 1
2 pages
Introduction To Processor Based Embedded System Design
No ratings yet
Introduction To Processor Based Embedded System Design
8 pages
Approximate Wallace Tree Multiplier
No ratings yet
Approximate Wallace Tree Multiplier
7 pages
DSP Lab Manual 15-11-2016 PDF
No ratings yet
DSP Lab Manual 15-11-2016 PDF
73 pages
Coa Unit3
No ratings yet
Coa Unit3
116 pages
Parallel Implementation of A 4 X 4-Bit Multiplier
No ratings yet
Parallel Implementation of A 4 X 4-Bit Multiplier
4 pages
Cs8491 - Computer Architecture Lession Notes Unit Ii Arithmetic Operations
No ratings yet
Cs8491 - Computer Architecture Lession Notes Unit Ii Arithmetic Operations
18 pages
JOEL
No ratings yet
JOEL
20 pages
Designing High Power Efficient Finite Impulse Response Filters With Three-Four Inexact Adder-Integrated Booth Multiplier
No ratings yet
Designing High Power Efficient Finite Impulse Response Filters With Three-Four Inexact Adder-Integrated Booth Multiplier
10 pages

An Optimized Modified Parallel Implementation Design of Multiplier and Accumulator Operator

Uploaded by

An Optimized Modified Parallel Implementation Design of Multiplier and Accumulator Operator

Uploaded by

JYOTHISHMATHI INSTITUTE OF TECHONOLGY AND SCIENCES

AN OPTIMIZED MODIFIED PARALLEL IMPLEMENTATION

• In many DSP applications like filtering and convolution, multiplier and

• The current trend in ALU design is to implement the addition and

• MAC = Multiplication + Accumulation

To reduce the critical path, we will go for parallel multipliers

• Ripple carry adder

By taking the advantages of both the multiplier and adder

Fig. Hardware architecture of general MAC.

Fig. Basic arithmetic steps of multiplication and accumulation.

• In general, For N X N bit multiplication,

Fig. Standard design

Hardware architecture for the standard design :

• critical path was reduced.

Fig. Parallel MAC architecture proposed by Elguibaly.

CSA & Accumulator

Fig. Hardware structure proposed by Elguibaly

• Even though it has a better performance because of the reduced

To improve the output rate and performance we will go for

Fig. Proposed arithmetic operation of multiplication and accumulation.

Fig. Hardware architecture of the proposed MAC.

Standard Elguibaly’s Proposed

Sign Extension Used Used Not Used

Accumulation Result Data of Result Data of Sum and Carry of

CSA Tree FA,HA FA,2-bits CLA FA, HA, 2-bit CLA

Final Adder 2n bits n+2 bits n bits

Component Standard Elguibaly’s Proposed

General 8-bits General 8-bits General 8-Bits

• Delay is more compared to the previous Elguibaly’s architecture.

• But the overall performance is increased if the pipelining concept is

Fig. Pipelined Hardware structure a) Elguibaly’s design b) proposed design

RTL schematic diagram

RTL schematic diagram

2. Proposed Architecture with pipelining:

• The MAC unit is proposed and designed by combining a hybrid type

• The overall performance parameter of the proposed MAC unit with

• The MAC unit can be extended by replacing Booth-2 algorithm with

You might also like