Majority Logic (ML) Based: Design of Approximate Full Adders
Majority Logic (ML) Based: Design of Approximate Full Adders
Abstract—As a new paradigm in the nanoscale technologies, A, B and C and the output is F. The logic expression of the 3-
approximate computing enables error tolerance in the input majority gate (voter) is given by:
computational process; it has also emerged as a low power design
methodology for arithmetic circuits. Majority logic (ML) is = , , = + + (1)
applicable to many emerging technologies and its basic building A
block (the 3-input majority voter) has been extensively used in B M F
C
digital circuit design. In this paper, we propose the design of a one-
bit approximate full adder based on majority logic. Furthermore, Fig. 1. Majority gate (3-input voter)
multi-bit approximate full adders are also proposed and studied;
Research on the design of ML approximate circuits has only
the application of these designs to quantum-dot cellular automata
(QCA) is also presented as an example. The designs are evaluated
recently been pursued; In [3] the authors proposed a one-bit full
using hardware metrics (including delay and area) as well as error adder circuit. In this paper, we propose a few ML-based
metrics. Compared with other circuits found in the technical approximate full adders; one-bit as well as multi-bit approximate
literature, the optimal designs are found to offer superior adders are considered. QCA technology is used as a case study
performance. to show the validity of the proposed designs. Hardware
evaluation and error analysis are also provided.
Keywords—Approximate computing; majority logic; full adder;
The rest of this paper is organized as follows. Section II
quantum-dot cellular automata
reviews the basic principles of QCA, the relevant contributions
I. INTRODUCTION and the error metrics used in this paper. Section III presents the
design of the proposed one-bit approximate full adder based on
Power dissipation is increasingly becoming a challenge for ML. Multi-bit designs are studied by combining the proposed
advanced integrated circuit design; although emerging and existing one-bit approximate full adders in Section IV.
nanoscale technologies have been proposed to replace CMOS at Section V concludes the paper.
the end of Moore’s Law, the issue of power consumption
remains unabated, because integration density of these II. BACKGROUND
nanoelectronic devices continues to increase at a high rate.
Approximate computing is a promising technique to reduce A. Quantum-dot Cellular Automata
power consumption and improve performance of circuits and QCA makes use of the polarization state of cells to encode
systems by allowing computational errors in error-tolerant binary information and Coulombic force interactions between
applications, such as multimedia signal processing, machine cells to achieve circuit functionality; these features make this
learning and pattern recognition [1-2]. technology substantially different from CMOS. As shown in
Approximate computer arithmetic circuits based on CMOS Fig. 2a, a QCA cell consists in its simplest form of four quantum
technology have been extensively studied. Designs of dots and two electrons that can tunnel between them. Due to
approximate adders, multipliers and dividers for both fixed- Coulombic repulsion, electrons are forced to occupy the
point and floating-point formats have been proposed [3-7]. Error opposite diagonal vertices (dots). This forms two different
metrics such as the mean error distance (MED), the normalized polarization states (i.e., -1, +1) for each cell, thus representing
MED (NMED) and the relative MED (RMED) [8] have been logic values of 0 and 1.
proposed to analyze the errors introduced in the operations of QCA requires a clocking scheme with four different
approximate arithmetic circuits. operational phases, i.e. Switch, Hold, Release, and Relax; each
However, the approximate designs of CMOS circuits adjacent so-called zone is shifted in phase by 90 degrees to
cannot be immediately applied to many emerging technologies control the flow of information. For a majority gate in QCA,
such as QCA [9-10], nanomagnetic logic (NML) [11], and spin- which can be seen in Fig. 2b, there is a 0.25 clock delay in each
wave devices (SWD) [12] due to the very different underlying clocking zone. As another basic gate in QCA, an inverter is
logic structure of these devices. Emerging devices rely on shown in Fig. 2c. More information on the clocking scheme and
majority logic (ML) which is a substantially different framework QCA technology can be found in [9-10].
from conventional Boolean logic. The majority gate performs a
multi-input logic operation and is shown in Fig. 1; the inputs are
Cout ಲಷಲమ
Cout = = 0.083 (12)
a 3 b3 M
for 14 of the 32 input combinations; the MED and NMED of the C4 M
s0
M C4 s1 s0
M
C4
four approximate adders are provided in Table III. The error M s1
M
s1 s0 M
s2
C4 M M
s0
s2
results show that by cascading two of the same type of one-bit s3 s3 s2 s3 s3 s2 s1
approximate full adder, the MED and NMED are larger than
(a) (b) (c) (d)
cascading two different types of one-bit approximate full adder;
however, AFA22 incurs in the smallest area and has less delay Fig. 9. Schematics of proposed 4-bit approximate full adders: (a) AFA1212, (b)
than AFA12. Considering the number of gates required in an AFA2121, (c) AFA2112 and (d) AFA1221.
implementation, AFA12 requires one less inverter than AFA21.
Fig. 11 shows that AFA2121 is the best design. The NMED (c) (d)
for AFA1221 is rather large and the values for NMED of
AFA1212, AFA2121, AFA2112 are very close, but AFA2112 Fig. 12. Schematics of proposed 8-bit approximate full adders: (a) AFA1212-
1212, (b) AFA2121-2121, (c) AFA2121-1212 and (d) AFA1212-2121.
and AFA1212 require more delay. For four-bit designs, the
schemes in which two of the same type of the proposed two-bit a7 b7 a6 b6 a5 b5 a4 b4 a3 b3 a2 b2 a1 b1 a0 b0 Cin a7 b7 a6 b6 a5 b5 a4 b4 a3 b3 a2 b2 a1 b1 a0 b0 Cin
Fig. 13. Layouts of proposed 8-bit approximate full adders in QCA: (a)
AFA1212-1212, (b) AFA2121-2121, (c) AFA2121-1212 and (d) AFA1212-
2121.
TABLE V. EIGHT-BIT APPROXIMATE ADDER COMPARISON
ADDER TYPE MV INV D A() MED NMED
CFA8[14] 24 16 2.5 948700 0 0
Fig. 11. Evaluation of proposed 4-bit approximate full adders (NMED vs delay- RCA8[15] 24 8 2.75 745200 0 0
area product). AFA1212-1212 8 4 1.25 184640 46.20 0.090
AFA2121-2121 8 5 1 206425 47.02 0.092
C. Proposed Eight-bit Approximate Adders AFA2121-1212 8 5 1.25 218257 46.40 0.091
Consider an eight-bit adder with inputs, A = AFA1212-2121 8 4 1 191126 46.82 0.092
a a aହ aସ aଷ aଶ aଵa , B = b b bହ bସ bଷ bଶ bଵ b , C୧୬ and outputs, Fig. 14 shows that AFA1212-1212 is the best design by
S = s s sହ sସ sଷ sଶ sଵ s , C଼ . We have designed eight-bit considering both the area-delay product and the NMED.
approximate adders by cascading two four-bit approximate
adders by using AFA1212 and AFA2121, as they show better
overall performance than the other two designs.
The proposed eight-bit approximate adders are shown in Fig.
12; their implementations in QCA are shown in Fig. 13. The
comparison results are detailed in Table V. The proposed
designs significantly reduce the number of gates and delay but
at the cost of a decrease in accuracy. In terms of gates,
AFA1212-1212 and AFA1212-2121 require one less inverter Fig. 14. Evaluation of proposed 8-bit approximate full adders (NMED vs delay-
area product).
than the other adders; AFA2121-2121 and AFA1212-2121
incur less delay than the other adders. V. CONCLUSION
a 0b0C in a1b1 a0b0Cin
a 2b2 a1b1 M
a3b3 a
The paper has proposed ML based one-bit and multi-bit
2b2 M
a 4b4
a 3b3 M
a5b5
M approximate full adders; these designs show considerable
M M
s0
a 6b6 a 5b5 M
s1
a4b4
M
s1 s0
savings in area, delay and number of gates while only incurring
M
s2 a7b7 a6b6 M
a 7b7 M
s3 M
a modest loss in accuracy. Compared with the accurate full
C8 M
s4 M s3 s2
C8 M
s5 s4
adder, the proposed designs result in an improvement of at least
M s5
s7
s6
s7 s6
up to 50% in delay and up to 67% in area for the 4-bit design.
An improvement of at least up to 50% in delay and up to 71%
(a) (b) in area is achieved for the 8-bit scheme.