0% found this document useful (0 votes)

21 views

An Efficient Floating Point Adder For Low-Power Devices

With an increasing demand for power hungry data intensive computing, design methodologies with low power consumption are increasingly gaining prominence in the industry. Most of the systems operate on critical and noncritical data both. An attempt to generate a precision result results in excessive power consumption and results in a slower system. An attempt to generate a precision result results in excessive power consumption and results in a slower system. For non-critical data, approximate c

Uploaded by

IJRES team

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views

An Efficient Floating Point Adder For Low-Power Devices

Uploaded by

IJRES team

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

International Journal of Reconfigurable and Embedded Systems (IJRES)

Vol. 13, No. 2, July 2024, pp. 253~261

ISSN: 2089-4864, DOI: 10.11591/ijres.v13.i2.pp253-261  253

An efficient floating point adder for low-power devices

Manjula Narayanappa1, Siva S. Yellampalli2

1
Department of Electronics and Communication, Dr. Ambedkar Institute of Technology, Bengaluru, India
2
Department of Electronics and Communication, SRM University, Amravati, India

Article Info ABSTRACT

Article history: With an increasing demand for power hungry data intensive computing,
design methodologies with low power consumption are increasingly gaining
Received Dec 23, 2022 prominence in the industry. Most of the systems operate on critical and non-
Revised Nov 17, 2023 critical data both. An attempt to generate a precision result results in
Accepted Dec 22, 2023 excessive power consumption and results in a slower system. An attempt to
generate a precision result results in excessive power consumption and
results in a slower system. For non-critical data, approximate computing
Keywords: circuits significantly reduce the circuit complexity and hence power
consumption. For non-critical data, approximate computing circuits
Approximate computing significantly reduce the circuit complexity and hence power consumption. In
Floating point adder this paper, a novel approximate single precision floating point adder is
Low power proposed with an approximate mantissa adder. The mantissa adder is
Mantissa adder designed with three 8-bit full adder blocks.
Single precision
This is an open access article under the CC BY-SA license.

Corresponding Author:
Manjula Narayanappa
Department of Electronic and Communication, Dr. Ambedkar Institute of Technology
Bengaluru, Karnataka, India
Email: [email protected]

1. INTRODUCTION
Battery-operated portable electronic devices have increasingly become an indispensable part of
everyday life. The key behind this is the scaling ability of metal oxide silicon field effect transistors
(MOSFETS) seen in very large-scale integration (VLSI) due to which functionality per unit area has
increased which has brought the price of the devices down leading to wide usage. Due to scaling and an
increase in functionality per unit area, the power consumption has increased. The increase in power
consumption of the VLSI devices has not been matched by the improvement in the capacity of the battery.
Therefore, operation time per charge has come down causing inconvenience to the users. For this reason,
reducing the power consumption of portable devices has become a compelling design constraint. A large
portion of energy consumption is dominated by two components: dynamic power and leakage power. To
extend the battery life various technology-based, architecture-based, and circuit-based solutions that reduce
the sum of the two power components without sacrificing the performance have to be developed. At the
technology level, feature size scaling has continuously brought lower power circuits by reducing the supply
voltages. To retain performance, the threshold voltages of these circuits have also been reduced with
technology scaling. However, in recent technologies, the benefits of constant-field scaling have been
compromised by an exponential increase in the leakage current. On the architectural level, pipelining and
parallelism have helped in lowering the power consumption of digital circuits.
In the current complementary metal-oxide semiconductor (CMOS) technology, the benefits of
device scaling are impeded by the reliability issues due to the process variations, ageing effects and soft
errors. Leakage current, static power are increasingly adding to the concerns towards achieving low power
consumption. Hence, the device scaling which once used to offer advantages for low power applications is no

Journal homepage: http://ijres.iaescore.com

254  ISSN: 2089-4864

more attractive and hence, new architectures need to be evolved to achieve low power consumption. Design
of approximate computation blocks is one such potential solution [1].
Most of the modern graphics processors for multimedia and other applications have dedicated
digital signal processing blocks. These applications output an image, video or an audio signal and the limited
perception of human senses allows for an approximation of the computations involved in the demanding
digital signal processing (DSP) algorithms for these applications [2]. Even an analog computation that yields
good enough results instead of accurate results is also acceptable [3]. Addition is the most fundamental and
significant mathematical operations used in all signal/image processing applications [4], [5]. Deterministic
approximate logic or probabilistic imprecise arithmetic are normally employed for soft adders [6].
Various low-power design approaches using approximate computing have been introduced, such as
algorithmic noise tolerance [7], [8], non-uniform voltage over scaling [9], and significance-driven
computation [10], [11]. Verma et al. [12] have presented an innovative adder design known as the almost
correct adder (ACA), which offers exponentially faster performance compared to traditional adders.
They also proposed the variable latency speculative adder (VLSA) with a slight area overhead. Additionally,
some adder configurations meet real-time energy requirements by reducing complexity at the algorithmic
level [13], [14]. The lower part OR adder [15] relies on approximate logic with a distinct truth table
compared to a standard adder. The probabilistic full adder (PFA) [16]-[20] is based on probabilistic CMOS
technology, which is a platform for modeling nano-scale designs and reducing power consumption [21]-[23].

2. BACKGROUND
2.1. IEEE-754 floating point format
Floating point representation offers a wider dynamic range in comparison to fixed-point
representation for real numbers. However, floating point hardware is known for its complexity and
substantial power consumption. The predominant standard for floating point formats is IEEE 754-2008 [24],
which encompasses various basic and extended types. These formats include half precision (16-bits), single
precision (32-bits), double precision (64-bits), extended precision (80-bits), and quad precision (128-bits).
The typical IEEE floating point format, as depicted in Figure 1, features an exponent part with a bias of
2^(E-1)-1, where E denotes the number of exponent bits. Single precision and double precision formats are
the most commonly used in contemporary computer systems. You can find details regarding the exponent
and mantissa bits for IEEE-754 basic and extended floating point types in Table 1.

Figure 1. General IEEE-754 floating point format

Table 1. Exponent and Mantissa bits for IEEE-754 basic and extended floating point types
Type Sign bit Exp. bits Mant. bits Total Mant. bits/total
Half 1 5 10 16 62.5%
Single 1 8 23 32 71.9%
Double 1 11 52 64 81.2%
Extended 1 15 64 80 80.0%
Quad 1 15 112 128 87.5%

2.2. Floating point adder architecture

A typical floating point adder architecture comprises distinct hardware components for tasks like
exponent comparison, mantissa alignment, mantissa addition, normalization, and rounding of the mantissa (as
depicted in Figure 2 and elaborated by Behrooz [25]). Initially, two operands are extracted from their floating
point formats, and each mantissa has the hidden ‘1’ bit added to it. The addition of floating point numbers
entails a series of operations, starting with comparing the exponents and adding the mantissas. The exponents
are first assessed to determine the larger of the two. Depending on the result of the exponent comparison, the
mantissas are swapped and then aligned to have the same exponent value before undergoing addition in the
mantissa adder. After the addition, normalization shifts are essential to bring the result back to the IEEE
standard format. Normalization is achieved by left-shifting with a count of leading zeros, making the

Int J Reconfigurable & Embedded Syst, Vol. 13, No. 2, July 2024: 253-261
Int J Reconfigurable & Embedded Syst ISSN: 2089-4864  255

detection of leading zeros a critical step in this process. Finally, rounding the normalized result is the last
operation before storing the result back. Special cases such as overflow, underflow, and not-a-number are
also detected and indicated by flags.

Figure 2. Floating point adder algorithm

3. APPROXIMATE FLOATING POINT ADDER

The approximate floating point adder design originates at the architecture level with the exponent
and mantissa adder/subtractor designed using approximate fixed-point adders. An N-bit adder consists of two
parts, i.e., an m-bit exact adder and an n-bit inexact adder as shown in Figure 3. The exact adder part can
have the exact implementation as a full adder circuit. The inexact adder will ignore the carry bits for
computation thereby reducing the critical path as well as the hardware utilization.

Figure 3. Approximate adder concept

The modified approximate adder concept can also be used for the mantissa adder for approximate
computation. The mantissa adder will provide a larger scope, as the number of bits in the mantissa are higher
than the exponent and at the same time, the approximate design in the mantissa adder has a lower impact on
the error, because the mantissa part is less significant than the exponent part. Therefore, an inexact design of
a mantissa adder is more appropriate.
An efficient floating point adder for low-power devices (Manjula Narayanappa)
256  ISSN: 2089-4864

3.1. Basic building block: 8-bit approximate adder

The carry equation for a conventional carry look ahead adder is given by (1):

𝐶𝑖+1 = 𝐺𝑖 + 𝐺𝑖−1 𝑃𝑖 + ⋯ + 𝐺0 ∏𝑖𝑗=1 𝑃𝑗 + 𝐶𝑖𝑛 ∏𝑖𝑗=0 𝑃𝑗 (1)

where, 𝐶𝑖𝑛 is the input carry and 𝑃𝑖 and 𝐺𝑖 are propagate and generate signals of the ith stage. If the carry
equation is split up into two segments, as in (2):

𝐶𝑖+1 = (∑𝑖𝑗=𝑖−𝑊+1 𝐺𝑗 (∏𝑖−1 𝑖−𝑊 𝑖−1 𝑖

𝑘=𝑗+1 𝑃𝑘 )) + (∑𝑗=0 𝐺𝑗 (∏𝑘=𝑗+1 𝑃𝑘 ) + 𝐶𝑖𝑛 ∏𝑗=0 𝑃𝑗 ) (2)

where, 𝑊 is the window size and, the first segment consists of W most significant (MS) bits and the second
segment consists of N-W least significant (LS) bits. The first part of the (2) is the approximate part, while the
second part is called the augmenting part. For approximate carry generation with a window size of W, the
output carry at the ith stage is compute using the approximate part only. Computing an approximate 𝐶𝑖+1 is
faster and consumes less hardware resources and hence lesser power as compared to computing precise carry.
An 8-bit adder is chosen as the basic building block for the floating point approximate adder in the proposed
design. Figure 4 shows the structure of conventional full adder. As shown in Figure 5, the 8 bits are
partitioned into two blocks; the MS block is of 4 bits, while the LS block is of 4 bits. The output carry of this
8-bit adder block is computed approximately using the approximate part in (2), 4-bit carry generator block as
shown in Figure 6 is used for generating the approximate carry for the 8-bit adder using the 4-MS bits.

Figure 4. Conventional full adder

Figure 5. W-bit window for approximate computation

Figure 6. Carry generator for 4-bit block

In the proposed adder, for the LS, W-bit window sum and carry are computed as per (3). The
schematic of 1-bit full adder in the proposed configuration is given by Figure 7. The sum and carry for most

Int J Reconfigurable & Embedded Syst, Vol. 13, No. 2, July 2024: 253-261
Int J Reconfigurable & Embedded Syst ISSN: 2089-4864  257

significant (8-W=4) bits are computed as for an exact adder are given in (4). The truth table for the proposed
sum and carry equations is given in Table 2.

𝐶𝑖+1 = 𝑎𝑖 𝑏𝑖 + 𝑐𝑖𝑛
𝑆𝑖 = 𝐶𝑖+1 (3)

𝐶𝑖+1 = 𝑎𝑖 𝑏𝑖 + 𝑐𝑖𝑛 (𝑎𝑖 ⊕ 𝑏𝑖 )

𝑆𝑖 = (𝑎𝑖 ⊕ 𝑏𝑖 ⊕ 𝑐𝑖𝑛 ) (4)

Figure 7. Schematic for carry and sum generator of proposed adder

Table 2. Truth table for proposed adder

Proposed Exact
A B 𝐶𝑖𝑛
S 𝐶𝑖+1 S 𝐶𝑖+1
0 0 0 1 0 0 0
0 0 1 0 1 1 1
0 1 0 1 0 1 1
0 1 1 0 1 0 0
1 0 0 1 0 1 0
1 0 1 0 1 0 0
1 1 0 0 1 0 0
1 1 1 0 1 1 1

Overall, 3 errors are introduced in sum computation and 1 error in carry computation. Assigning the
inverted carry out at each stage to the sum computed for that stage reduces the hardware for sum computation
block. This is a significant reduction in hardware requirements as compared to a conventional adder.
Utilizing the look ahead carry generation logic from 4 MS bits improves the timing performance of the
circuit by not depending on the sequential computation of carry at each bit. A total of 8 transistors are used
for 1-bit sum and carry generation.

3.2. Mantissa approximate adder

For realizing the 23-bit approximate adder, three 8-bit adders are used. The lower two 8-bit adders
are the proposed 8-bit approximate adders, while the MS byte is implemented using an exact 8-bit adder. The
proposed 23-bit mantissa adder is shown in Figure 8.

Figure 8. Proposed 23-bit mantissa adder

An efficient floating point adder for low-power devices (Manjula Narayanappa)
258  ISSN: 2089-4864

3.3. Exponent adder/subtractor

As the exponent for realizing the 23-bit approximate adder, is having the most impact on the
accuracy of the result. The exponent adder is proposed to be implemented using exact 8-bit adder. Further,
we discuss error metrics for the evaluation purpose.

4. ERROR METRICS
4.1. Error distance
The error distance (ED) between two binary numbers, 𝑎 (erroneous) and 𝑏 (correct), is defined as
the arithmetic distance between these two numbers. Where, 𝑖 and 𝑗 are the indices for the bits in 𝑎 and 𝑏,
respectively. Suppose for an 8-bit adder, the correct sum for a given set of operands is “1110 0101” and the
incorrect outputs are “11100100” and “11110101”. Then the two erroneous values “11100100” and
“11110101” have an ED of 1 and 16 respectively.

𝐸𝐷(𝑎, 𝑏) = |𝑎 − 𝑏| = |∑𝑖 𝑎(𝑖) × 2𝑖 − ∑𝑗 𝑏(𝑗) × 2𝑗 | (5)

For a non-deterministic implementation, the output is probabilistic and usually follows a distribution
for a given input 𝑎𝑖 . In this case, the ED of the output (denoted by 𝑑𝑖 ) is defined as the weighted average of
EDs of all possible outputs to the nominal output. Assume that for a given input, the output has a nominal
value b, but it can take any value given in a set of vectors 𝑏𝑗 (1 ≤ 𝑗 ≤ 𝑟). The ED of the output is then given
by (6). Where 𝑝𝑗 is the output probability of 𝑏𝑗 (1 ≤ 𝑗 ≤ 𝑟).

𝑑𝑖 = ∑𝑖 𝐸𝐷(𝑏𝑗 , 𝑏) × 𝑝𝑗 (6)

4.2. Mean error distance

Mean error distance (MED) 𝑑𝑚 of a circuit for non-deterministic inputs with a certain probability of
occurrence is defined as the mean value of all the EDs of all possible outputs for each input. Assuming that
the inputs are defined by 𝑎𝑖 . (1 ≤ 𝑖 ≤ 𝑠) and probability of occurrence of each vector is 𝑞𝑖 (1 ≤ 𝑖 ≤ 𝑠), then
MED is given by (7). Where, 𝑑𝑖 is the ED of the outputs for input 𝑎𝑖 . For a uniformly distributed system, all
the inputs have an equal probability of occurrence and hence, 𝑞𝑖 is same for all input vector.

𝑑𝑚 = ∑𝑖 𝑑𝑖 × 𝑞𝑖 (7)

5. SIMULATION AND RESULTS

The proposed adder circuit is simulated in Cadence environment for delay and power consumption
and error analysis. The results are presented hereby. The maximum ED for the 8-bit adder is 3. The proposed
8-bit adder with a window size of W=4 is simulated for all possible input combinations of a and b. For all the
256×256 combinations, the approximate and the exact sum and carries are computed, error distances
computed between the approximate and accurate outputs. The maximum ED for the 8-bit adder is 3.

5.1. Error metrics for proposed 8-bit adder

The proposed 8-bit adder with a window size of W=4 is simulated for all possible input
combinations of a and b. For all the 256×256 combinations, the approximate and the exact sum and carries
are computed, error distances computed between the approximate and accurate outputs. The maximum ED
for the 8-bit adder is 3.

5.2. Delay
Considering a conventional 8-bit adder, the delay in 8-bit computation is due to the ripple carry
effect, which takes 8 cycles. Assuming the delay in computation of 1-bit full adder result to be T, the delay in
generating the 8-bit adder is 8T. In the proposed adder, the total delay is equal to the delay for computation of
carry out from MS 4-bits, which is equal to 4T.

5.3. Power consumption tradeoff

The energy consumed by a probabilistic inverter experiences an exponential increase as the
probability of obtaining the correct output rises. When it comes to approximate implementations, power
consumption is generally viewed as being directly proportional to the number of gates involved. In the newly
proposed adder circuits, the reduction in the number of transistors enables a lower operating voltage,

Int J Reconfigurable & Embedded Syst, Vol. 13, No. 2, July 2024: 253-261
Int J Reconfigurable & Embedded Syst ISSN: 2089-4864  259

resulting in an overall reduction in power consumption. In the case of a traditional full adder, if the power
consumed for a 1-bit operation is normalized to 1, then the power consumed for a k-bit conventional full
adder amounts to k. However, in the context of the proposed 8-bit adder, the reduction in the number of
transistors leads to a decrease in the operating voltage, from 1.13 V in an accurate implementation to 1.04 V.
Consequently, this reduction in voltage contributes to an estimated decrease in power consumption.

1.042 1.042
𝐸8−𝑏𝑖𝑡 = 𝑊 × + (8 − 𝑊) = 4 × +4 (8)
1.132 1.132

Which is 7.5% lower than the conventional adder. Both the discrete cosine transform (DCT) and
inverse discrete cosine transform (IDCT) blocks operates at a lower supply voltage in case of approximate
adders than the exact mode. Here, DCT and IDCT operates at a supply voltage of 1.28 and 1.13 V in the
exact mode, respectively. The different supply operating voltages are demonstrated in Figures 9 and 10 for
different approximations and truncations considering varied bits. Table 3 demonstrates the percentage power
savings considering varied approximations and truncation against the base case. Approximation 3 saves the
maximum power.

Operating Voltages for DCT Technique

1.2
Power Saving

1.15

1.1

1.05
Truncation Approximation 1 Approximation 2 Approximation 3
Technique
7 LSB’s 8 LSB’s 9 LSB’s

Figure 9. Operating voltages considering different bits for DCT technique

Figure 10. Operating voltages considering different bits for IDCT technique

Table 3. Percentage power savings for approximations over the base case
Technique 7 LSB’s 8 LSB’s 9 LSB’s
Truncation 48.22 56.23 61.24
Approximation 1 37.86 50.85 55.26
Approximation 2 41.21 49.13 53.84
Approximation 3 42.46 52.64 59.23

An efficient floating point adder for low-power devices (Manjula Narayanappa)

260  ISSN: 2089-4864

6. CONCLUSION
A novel approximate adder topology for single point floating point adder is presented in the paper.
The proposed design takes advantage of the fact that the lower significant bit addition can be approximate
and this will not be affecting the solution to a great extent, at the same time the power savings due to the
approximate computation will be significant. The proposed configurations has a lower propagation delay and
comparable error performance as compared to other architectures. With the proposed mantissa adder, which
is a hybrid of look, ahead carry adder for the carry generation and that of the approximate adder for the sum
generation gives a distinct advantage in terms of the power consumption as compared to the conventional full
adder.

REFERENCES
[1] J. Han and M. Orshansky, “Approximate computing: An emerging paradigm for energy-efficient design,” in 2013 18th IEEE
European Test Symposium (ETS), May 2013, pp. 1–6, doi: 10.1109/ETS.2013.6569370.
[2] R. Hegde and N. R. Shanbhag, “Soft digital signal processing,” IEEE Transactions on Very Large Scale Integration (VLSI)
Systems, vol. 9, no. 6, pp. 813–823, Dec. 2001, doi: 10.1109/92.974895.
[3] M. A. Breuer, “Let’s think analog,” in IEEE Computer Society Annual Symposium on VLSI: New Frontiers in VLSI Design
(ISVLSI’05), 2005, pp. 2–5, doi: 10.1109/ISVLSI.2005.48.
[4] V. Beiu, S. Aunet, J. Nyathi, R. R. Rydberg, and W. Ibrahim, “Serial addition: locally connected architectures,” IEEE
Transactions on Circuits and Systems I: Regular Papers, vol. 54, no. 11, pp. 2564–2579, Nov. 2007, doi:
10.1109/TCSI.2007.907885.
[5] S. Cotofana, C. Lageweg, and S. Vassiliadis, “Addition related arithmetic operations via controlled transport of charge,” IEEE
Transactions on Computers, vol. 54, no. 3, pp. 243–256, Mar. 2005, doi: 10.1109/TC.2005.40.
[6] J. Huang and J. Lach, “Exploring the fidelity-efficiency design space using imprecise arithmetic,” in 16th Asia and South Pacific
Design Automation Conference (ASP-DAC 2011), Jan. 2011, pp. 579–584, doi: 10.1109/ASPDAC.2011.5722256.
[7] S. Byonghyo, S. R. Sridhara, and N. R. Shanbhag, “Reliable low-power digital signal processing via reduced precision
redundancy,” IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 12, no. 5, pp. 497–510, May 2004, doi:
10.1109/TVLSI.2004.826201.
[8] G. V. Varatkar and N. R. Shanbhag, “Energy-efficient motion estimation using error-tolerance,” in Proceedings of the 2006
international symposium on Low power electronics and design-ISLPED ’06, Oct. 2006, pp. 113–118, doi:
10.1145/1165573.1165599.
[9] L. N. B. Chakrapani, K. K. Muntimadugu, A. Lingamneni, J. George, and K. V. Palem, “Highly energy and performance efficient
embedded computing through approximately correct arithmetic,” in Proceedings of the 2008 international conference on
Compilers, architectures and synthesis for embedded systems, Oct. 2008, pp. 187–196, doi: 10.1145/1450095.1450124.
[10] D. Mohapatra, G. Karakonstantis, and K. Roy, “Significance driven computation,” in Proceedings of the 2009 ACM/IEEE
international symposium on Low power electronics and design, Aug. 2009, pp. 195–200, doi: 10.1145/1594233.1594282.
[11] N. Banerjee, G. Karakonstantis, and K. Roy, “Process variation tolerant low power DCT architecture,” in 2007 Design,
Automation and Test in Europe Conference and Exhibition, Apr. 2007, pp. 1–6, doi: 10.1109/DATE.2007.364664.
[12] A. K. Verma, P. Brisk, and P. Ienne, “Variable latency speculative addition: a new paradigm for arithmetic circuit design,” in
2008 Design, Automation and Test in Europe, Mar. 2008, pp. 1250–1255, doi: 10.1109/DATE.2008.4484850.
[13] Y. V. Ivanov and C. J. Bleakley, “Real-time H.264 video encoding in software with fast mode decision and dynamic complexity
control,” ACM Transactions on Multimedia Computing, Communications, and Applications, vol. 6, no. 1, pp. 1–21, Feb. 2010,
doi: 10.1145/1671954.1671959.
[14] M. Shafique, L. Bauer, and J. Henkel, “enBudget: a run-time adaptive predictive energy-budgeting scheme for energy-aware
motion estimation in H.264/MPEG-4 AVC video encoder,” in 2010 Design, Automation and Test in Europe Conference and
Exhibition (DATE 2010), Mar. 2010, pp. 1725–1730, doi: 10.1109/DATE.2010.5457093.
[15] H. R. Mahdiani, A. Ahmadi, S. M. Fakhraie, and C. Lucas, “Bio-inspired imprecise computational blocks for efficient VLSI
implementation of soft-computing applications,” IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 57, no. 4,
pp. 850–862, Apr. 2010, doi: 10.1109/TCSI.2009.2027626.
[16] S. H. Sreedhara, V. Kumar, and S. Salma, “Efficient big data clustering using adhoc fuzzy C means and auto-encoder CNN,” in
Inventive Computation and Information Technologies, 2023, pp. 353–368, doi: 10.1007/978-981-19-7402-1_25.
[17] M. S. K. Lau, L. Keck-Voon, C. Yun-Chung, and A. Bhanu, “A general mathematical model of probabilistic ripple-carry adders,”
in 2010 Design, Automation and Test in Europe Conference and Exhibition (DATE 2010), Mar. 2010, pp. 1100–1105, doi:
10.1109/DATE.2010.5456973.
[18] S. Borkar, T. Karnik, and V. De, “Design and reliability challenges in nanometer technologies,” in Proceedings of the 41st annual
Design Automation Conference, Jun. 2004, pp. 75–75, doi: 10.1145/996566.996588.
[19] W. Huan-Sheng and R. M. Mersereau, “Fast algorithms for the estimation of motion vectors,” IEEE Transactions on Image
Processing, vol. 8, no. 3, pp. 435–438, Mar. 1999, doi: 10.1109/83.748899.
[20] M. Shafique, L. Bauer, and J. Henkel, “3-tier dynamically adaptive power-aware motion estimator for h.264/AVC video
encoding,” in Proceeding of the thirteenth international symposium on Low power electronics and design-ISLPED ’08, 2008, pp.
147–152, doi: 10.1145/1393921.1393962.
[21] J. George, B. Marr, B. E. S. Akgul, and K. V. Palem, “Probabilistic arithmetic and energy efficient embedded signal processing,”
in Proceedings of the 2006 international conference on Compilers, architecture and synthesis for embedded systems, Oct. 2006,
pp. 158–168, doi: 10.1145/1176760.1176781.
[22] G. V. Varatkar and N. R. Shanbhag, “Energy-efficient motion estimation using error-tolerance,” in Proceedings of the 2006
international symposium on Low power electronics and design-ISLPED ’06, 2006, pp. 113–118, doi: 10.1145/1165573.1165599.
[23] D. Mohapatra, G. Karakonstantis, and K. Roy, “Low-power process-variation tolerant arithmetic units using input-based elastic
clocking,” in Proceedings of the 2007 international symposium on Low power electronics and design, Aug. 2007, pp. 74–79, doi:
10.1145/1283780.1283797.
[24] “IEEE standard for floating-point arithmetic,” IEEE Std 754-2008, Aug. 2008, doi: 10.1109/IEEESTD.2008.4610935.
[25] P. Behrooz, Computer arithmetic: algorithms and hardware designs, Oxford University Press, 2000.

Int J Reconfigurable & Embedded Syst, Vol. 13, No. 2, July 2024: 253-261
Int J Reconfigurable & Embedded Syst ISSN: 2089-4864  261

BIOGRAPHIES OF AUTHOR

Manjula Narayanappa working as assistant professor in electronics and

communication domain at Dr. Ambedkar Institute of Technology. She is pursuing Ph.D. VTU
Extension Centre, UTL Technologies Ltd., Bangalore. She has 13 years of working
experience. Her areas of interest are VLSI and embedded systems. She can be contacted at this
email: [email protected].

Siva S. Yellampalli obtained his M.S. and Ph.D. from Louisiana State University.
He is currently with VTU Extension Centre, UTL Technologies Ltd. He worked on a broad
range of research topics including VLSI, mixed signal circuits/systems development, micro-
electromechanical systems (MEMS), and integrated carbon nanotube based sensors. He has
published a book in the area of mixed signal design, and edited two books on carbon nano
tubes. He also published multiple journal papers and IEEE conference papers in these areas of
research. He can be contacted at this email: [email protected].

An efficient floating point adder for low-power devices (Manjula Narayanappa)

Approximate Single Precision Floating Point Adder For Low Power Applications
No ratings yet
Approximate Single Precision Floating Point Adder For Low Power Applications
15 pages
Full Adder Synopsis
No ratings yet
Full Adder Synopsis
10 pages
Design Stratigies of Low Power Voltage Level Shifter Circuits For Multi Supply Systems
No ratings yet
Design Stratigies of Low Power Voltage Level Shifter Circuits For Multi Supply Systems
6 pages
Design and Implementation of Low-Power Digital Signal Processing Using Approximate Adders
No ratings yet
Design and Implementation of Low-Power Digital Signal Processing Using Approximate Adders
7 pages
An Application-Oriented Analysis of Powerprecision
No ratings yet
An Application-Oriented Analysis of Powerprecision
7 pages
Design and Comparative Analysis of CMOS PDF
No ratings yet
Design and Comparative Analysis of CMOS PDF
9 pages
Wordlengthresuction
No ratings yet
Wordlengthresuction
18 pages
doc
No ratings yet
doc
73 pages
Efficient Design of Majority-Logic-Based Approximate Arithmetic Circuits
No ratings yet
Efficient Design of Majority-Logic-Based Approximate Arithmetic Circuits
13 pages
Vlsi
No ratings yet
Vlsi
32 pages
Wifi Based Smart Grid To Remotely Monitor and Control Renewable Energy Sources K. Sowmya P.Srinivas
No ratings yet
Wifi Based Smart Grid To Remotely Monitor and Control Renewable Energy Sources K. Sowmya P.Srinivas
6 pages
High-Performance Low-Power Carry Speculative Addition With Variable Latency
No ratings yet
High-Performance Low-Power Carry Speculative Addition With Variable Latency
13 pages
Siemens SW Accelerating Electrical Systems Design and Analysis With VeSys White Paper - tcm27 96932
No ratings yet
Siemens SW Accelerating Electrical Systems Design and Analysis With VeSys White Paper - tcm27 96932
12 pages
Toward_Designing_High-Speed_Cost-Efficient_Quantum_Reversible_Carry_Select_Adders
No ratings yet
Toward_Designing_High-Speed_Cost-Efficient_Quantum_Reversible_Carry_Select_Adders
15 pages
A1101010108
No ratings yet
A1101010108
8 pages
On-Chip Based Power Estimation For CMOS VLSI Circu
No ratings yet
On-Chip Based Power Estimation For CMOS VLSI Circu
8 pages
An On Chip Design For Prepaid Electricity Billing System: P .G, S.D P, T C S
No ratings yet
An On Chip Design For Prepaid Electricity Billing System: P .G, S.D P, T C S
7 pages
IET Computers Digital Tech - 2023 - Tavakolaee
No ratings yet
IET Computers Digital Tech - 2023 - Tavakolaee
14 pages
A Review of Low Power Processor Design
No ratings yet
A Review of Low Power Processor Design
9 pages
Design and Analysis
No ratings yet
Design and Analysis
8 pages
Design of Power-Efficient High-Speed 4-Bit Compara
No ratings yet
Design of Power-Efficient High-Speed 4-Bit Compara
8 pages
(IJCST-V9I2P10) :DR - Shine N Das
No ratings yet
(IJCST-V9I2P10) :DR - Shine N Das
6 pages
A New Vlsi Architecture For Modi Ed
No ratings yet
A New Vlsi Architecture For Modi Ed
6 pages
Design and Analysis of Low Power Full Adder For Portable and Wearable Applications
No ratings yet
Design and Analysis of Low Power Full Adder For Portable and Wearable Applications
5 pages
Low-Power SAR ADC Design Overview and Survey of State-of-the-Art Techniques
No ratings yet
Low-Power SAR ADC Design Overview and Survey of State-of-the-Art Techniques
14 pages
Design and Analysis of Direct and Post Truncated Adder Trees
No ratings yet
Design and Analysis of Direct and Post Truncated Adder Trees
8 pages
1 s2.0 S0026269219309346 Main
No ratings yet
1 s2.0 S0026269219309346 Main
9 pages
A Review Report On Multi-Voltage Rule Check - HBRP Publication
No ratings yet
A Review Report On Multi-Voltage Rule Check - HBRP Publication
11 pages
Energy Efficient Adder Circuits For Multiply and Accumulation Unit
No ratings yet
Energy Efficient Adder Circuits For Multiply and Accumulation Unit
18 pages
Implementation of Hardware and Energy Efficient Approximate Multiplier Architectures Using
No ratings yet
Implementation of Hardware and Energy Efficient Approximate Multiplier Architectures Using
9 pages
Low-Power_High-Speed_and_Area-Efficient_Multiplier_Based_on_the_PTL_Logic_Style
No ratings yet
Low-Power_High-Speed_and_Area-Efficient_Multiplier_Based_on_the_PTL_Logic_Style
5 pages
Manu 2019
No ratings yet
Manu 2019
6 pages
New Metrics For The Reliability of Approximate and Probabilistic Adders
No ratings yet
New Metrics For The Reliability of Approximate and Probabilistic Adders
12 pages
Layout Design of CMOS Buffer To Reduce Area and Power
No ratings yet
Layout Design of CMOS Buffer To Reduce Area and Power
4 pages
Design of CCTV System
No ratings yet
Design of CCTV System
5 pages
Power Optimization of Binary Division Based On FPG
No ratings yet
Power Optimization of Binary Division Based On FPG
13 pages
41
No ratings yet
41
5 pages
Constraint_generation_and_placement_for_automatic_layout_design_of_analog_integrated_circuits
No ratings yet
Constraint_generation_and_placement_for_automatic_layout_design_of_analog_integrated_circuits
4 pages
Itmconf I3cs2023 04001
No ratings yet
Itmconf I3cs2023 04001
12 pages
By Robert Prieto, Associate Professor, Universidad: Power Electronics Technology March 2005
No ratings yet
By Robert Prieto, Associate Professor, Universidad: Power Electronics Technology March 2005
5 pages
An Efficient Multiplier Based On Shift A
No ratings yet
An Efficient Multiplier Based On Shift A
6 pages
Katreepalli 2018
No ratings yet
Katreepalli 2018
13 pages
Design and Implementation of Power Estimation Technique For Digital Circuits IJERTV3IS041503
No ratings yet
Design and Implementation of Power Estimation Technique For Digital Circuits IJERTV3IS041503
10 pages
Design and Analysis of Approximate Redundant Binary Multipliers
No ratings yet
Design and Analysis of Approximate Redundant Binary Multipliers
15 pages
Advanced Low Power CMOS Design To Reduce Power Consumption in Deep CMOS Submicron Technologies
No ratings yet
Advanced Low Power CMOS Design To Reduce Power Consumption in Deep CMOS Submicron Technologies
11 pages
VLSI Design Fault Tolerant Network On Chip: IPASJ International Journal of Electronics & Communication (IIJEC)
No ratings yet
VLSI Design Fault Tolerant Network On Chip: IPASJ International Journal of Electronics & Communication (IIJEC)
3 pages
IET Circuits Devices Syst - 2020 - Folla - A Low‐Offset Low‐Power and High‐Speed Dynamic Latch Comparator With A
No ratings yet
IET Circuits Devices Syst - 2020 - Folla - A Low‐Offset Low‐Power and High‐Speed Dynamic Latch Comparator With A
13 pages
Design and Analysis of Different Circuits Using DCVSL & Static CMOS Technique
No ratings yet
Design and Analysis of Different Circuits Using DCVSL & Static CMOS Technique
7 pages
179462
No ratings yet
179462
7 pages
A Review of Low Power Processor Design
No ratings yet
A Review of Low Power Processor Design
10 pages
Verilog
No ratings yet
Verilog
9 pages
Vlsi Design: Introduction To Ic Technology
No ratings yet
Vlsi Design: Introduction To Ic Technology
59 pages
Block-Based Carry Speculative Approximate Adder For Energy-Efficient Applications
No ratings yet
Block-Based Carry Speculative Approximate Adder For Energy-Efficient Applications
5 pages
Low Power Add and Shift Multiplier Design Bzfad Architecture
No ratings yet
Low Power Add and Shift Multiplier Design Bzfad Architecture
14 pages
Analysis of Power Dissipation and Low Po
No ratings yet
Analysis of Power Dissipation and Low Po
9 pages
Final Year Report
No ratings yet
Final Year Report
16 pages
Performance Analysis of High Speed and Low Power B
No ratings yet
Performance Analysis of High Speed and Low Power B
13 pages
A High-Performance On-Chip Memory Module For Image Processing Applications
No ratings yet
A High-Performance On-Chip Memory Module For Image Processing Applications
13 pages
Performance Evaluation of Approximate Adders Case
No ratings yet
Performance Evaluation of Approximate Adders Case
8 pages
Analog Dialogue, Volume 47, Number 1: Analog Dialogue, #9
From Everand
Analog Dialogue, Volume 47, Number 1: Analog Dialogue, #9
Analog Dialogue
No ratings yet
FPGA implementation of artificial neural network for PUF modeling
No ratings yet
FPGA implementation of artificial neural network for PUF modeling
8 pages
Multimodal recognition with deep learning: audio, image, and text
No ratings yet
Multimodal recognition with deep learning: audio, image, and text
11 pages
Analysing feature selection: impacts towards forecasting electricity power consumption
No ratings yet
Analysing feature selection: impacts towards forecasting electricity power consumption
8 pages
Modeling of chimp optimization algorithm node localization scheme in wireless sensor networks
No ratings yet
Modeling of chimp optimization algorithm node localization scheme in wireless sensor networks
10 pages
Self-attention encoder-decoder with model adaptation for transliteration and translation tasks in regional language
No ratings yet
Self-attention encoder-decoder with model adaptation for transliteration and translation tasks in regional language
11 pages
A fast half-subtractor using 8T static random access memory for in-memory computation
No ratings yet
A fast half-subtractor using 8T static random access memory for in-memory computation
9 pages
Development and evaluation of robotic exoskeleton arm for enhanced human load carrying efficiency
No ratings yet
Development and evaluation of robotic exoskeleton arm for enhanced human load carrying efficiency
9 pages
Finite element analysis method as an alternative for furniture prototyping process and product testing
No ratings yet
Finite element analysis method as an alternative for furniture prototyping process and product testing
12 pages
Performance analysis of parallel prefix adders developed with field programmable gate array technology
No ratings yet
Performance analysis of parallel prefix adders developed with field programmable gate array technology
8 pages
Field-programmable gate array (FPGA) is a prominent device in developing the internet of things (IoT) application since it offers parallel computation, power efficiency, and scalability. The identification and authentication of these FPGAbased IoT applications are crucial to secure the user-sensitive data transmitted over IoT networks. Physical unclonable function (PUF) technology provides a great capability to be used as device identification and authentication for FPGAbased IoT applications. Nevertheless, conventional PUF-based authentication suffers a huge overhead in storing the challenge-response pairs (CRPs) in the verifier’s database. Therefore, in this paper, the FPGA implementation of the Arbiter-PUF model using an artificial neural network (ANN) is presented. The PUF model can generate the CRPs on-the-fly upon the authentication request (i.e., by a prover) to the verifier and eliminates huge storage of CRPs database in the verifier. The architecture of ANN (i.e., Arbiter-PUF
No ratings yet
Field-programmable gate array (FPGA) is a prominent device in developing the internet of things (IoT) application since it offers parallel computation, power efficiency, and scalability. The identification and authentication of these FPGAbased IoT applications are crucial to secure the user-sensitive data transmitted over IoT networks. Physical unclonable function (PUF) technology provides a great capability to be used as device identification and authentication for FPGAbased IoT applications. Nevertheless, conventional PUF-based authentication suffers a huge overhead in storing the challenge-response pairs (CRPs) in the verifier’s database. Therefore, in this paper, the FPGA implementation of the Arbiter-PUF model using an artificial neural network (ANN) is presented. The PUF model can generate the CRPs on-the-fly upon the authentication request (i.e., by a prover) to the verifier and eliminates huge storage of CRPs database in the verifier. The architecture of ANN (i.e., Arbiter-PUF
13 pages
An internet of things-driven smart key system with real-time alerts: innovations in hotel security
No ratings yet
An internet of things-driven smart key system with real-time alerts: innovations in hotel security
12 pages
Comparative analysis of ZigBee, LoRa, and NB-IoT in a smart building: advantages, limitations, and integration possibilities
No ratings yet
Comparative analysis of ZigBee, LoRa, and NB-IoT in a smart building: advantages, limitations, and integration possibilities
11 pages
20 21374 IJRES
No ratings yet
20 21374 IJRES
9 pages
Development of internet of vehicles and recurrent neural network enabled intelligent transportation system for smart cities
No ratings yet
Development of internet of vehicles and recurrent neural network enabled intelligent transportation system for smart cities
10 pages
Design of medium grain integrated clock gater for low power clock network
No ratings yet
Design of medium grain integrated clock gater for low power clock network
9 pages
TENS device for cervical pain during teleworking controlled remotely by mobile application
No ratings yet
TENS device for cervical pain during teleworking controlled remotely by mobile application
9 pages
A study of IoT based real-time monitoring of photovoltaic power plant
No ratings yet
A study of IoT based real-time monitoring of photovoltaic power plant
7 pages
Design of agrivoltaic system with internet of things control for chili fruit classification using the neural network method
No ratings yet
Design of agrivoltaic system with internet of things control for chili fruit classification using the neural network method
8 pages
Design and implementation of smart traffic light controller with emergency vehicle detection on FPGA
No ratings yet
Design and implementation of smart traffic light controller with emergency vehicle detection on FPGA
12 pages
Comparative analysis of feature descriptors and classifiers for real-time object detection
No ratings yet
Comparative analysis of feature descriptors and classifiers for real-time object detection
11 pages
Implementation of flexible axis photovoltaic system based on internet of things
No ratings yet
Implementation of flexible axis photovoltaic system based on internet of things
8 pages
Waste incinerator monitoring system based on remote communication with android interface
No ratings yet
Waste incinerator monitoring system based on remote communication with android interface
9 pages
Design of flood warning prototype using ESP32 module-based ultrasonic sensors
No ratings yet
Design of flood warning prototype using ESP32 module-based ultrasonic sensors
10 pages
Algorithm-driven development of a simulation tool for industrial manipulator stability analysis
No ratings yet
Algorithm-driven development of a simulation tool for industrial manipulator stability analysis
10 pages
Integration of K-Means and Silhouette score for energy efficiency of wireless sensor networks
No ratings yet
Integration of K-Means and Silhouette score for energy efficiency of wireless sensor networks
9 pages
Optimizing resource allocation in job shop production systems with seasonal demand patterns
No ratings yet
Optimizing resource allocation in job shop production systems with seasonal demand patterns
14 pages
Artificial intelligence driven robotic control system for personalized elderly care and foot massage
No ratings yet
Artificial intelligence driven robotic control system for personalized elderly care and foot massage
13 pages
Performance comparison of indoor navigation and obstacle avoidance methods for low-cost implementation in wheelchairs
No ratings yet
Performance comparison of indoor navigation and obstacle avoidance methods for low-cost implementation in wheelchairs
9 pages
Implementing a Very High-speed Secure Hash Algorithm 3 Accelerator Based on PCI-express
No ratings yet
Implementing a Very High-speed Secure Hash Algorithm 3 Accelerator Based on PCI-express
11 pages
Central processing unit load reduction through application code optimization and memory management
No ratings yet
Central processing unit load reduction through application code optimization and memory management
10 pages
Applied computing syllabus
No ratings yet
Applied computing syllabus
1 page
DLD Lecture Notes (Compatibility Mode)
No ratings yet
DLD Lecture Notes (Compatibility Mode)
149 pages
DL Unit 5 - Arithmetic Circuit
No ratings yet
DL Unit 5 - Arithmetic Circuit
53 pages
DELD Module 2 Notes
No ratings yet
DELD Module 2 Notes
82 pages
Chapter04 Answers at End
No ratings yet
Chapter04 Answers at End
47 pages
Arithmatic Circuit
No ratings yet
Arithmatic Circuit
7 pages
Boolean Algebra and Logic Gates.
No ratings yet
Boolean Algebra and Logic Gates.
50 pages
Raj Kamal - Switching Theory and Logic Design - For JNTUK-Pearson Education (2011)
No ratings yet
Raj Kamal - Switching Theory and Logic Design - For JNTUK-Pearson Education (2011)
407 pages
Assignment 1 Digital Logical Circuits
No ratings yet
Assignment 1 Digital Logical Circuits
22 pages
Experiment-1: Simulation Result For Half Adder and Full Adder Circuit
No ratings yet
Experiment-1: Simulation Result For Half Adder and Full Adder Circuit
7 pages
F23 DLD LAB06
No ratings yet
F23 DLD LAB06
3 pages
9. Half and Full Adder
No ratings yet
9. Half and Full Adder
36 pages
Rajagiri School of Engineering and Technology: Rajagiri Valley, Kakkanad. Third Semester
No ratings yet
Rajagiri School of Engineering and Technology: Rajagiri Valley, Kakkanad. Third Semester
37 pages
Bde Unit IV
No ratings yet
Bde Unit IV
21 pages
Primer - eBook-ENG PDF
No ratings yet
Primer - eBook-ENG PDF
370 pages
DELD - Short Answer Questions and Answers
No ratings yet
DELD - Short Answer Questions and Answers
23 pages
Low Power
No ratings yet
Low Power
65 pages
Chapter 6 Digital Arithmetic: Operations and Circuits
No ratings yet
Chapter 6 Digital Arithmetic: Operations and Circuits
63 pages
An Inverter I. CMOS Inverter Ii. Pseudo nMOS Inverter Iii. Tristate Inverter
No ratings yet
An Inverter I. CMOS Inverter Ii. Pseudo nMOS Inverter Iii. Tristate Inverter
53 pages
Experiment 8 Basic Combinatorial Circuits
No ratings yet
Experiment 8 Basic Combinatorial Circuits
4 pages
10 Compliment
No ratings yet
10 Compliment
5 pages
Assignment No: 3: Carry Save Adder
No ratings yet
Assignment No: 3: Carry Save Adder
7 pages
Pipelined Adders
No ratings yet
Pipelined Adders
9 pages
Combinational Assignment
No ratings yet
Combinational Assignment
91 pages
3QPG1 Cse It CS8351 DPSD QB1
No ratings yet
3QPG1 Cse It CS8351 DPSD QB1
2 pages
Speed Power Vlsi
No ratings yet
Speed Power Vlsi
20 pages
Unit I 4th Part
No ratings yet
Unit I 4th Part
99 pages
De Lab Manual
No ratings yet
De Lab Manual
54 pages
Classification of CMOS Digital Logic Circuit (Part - I)
No ratings yet
Classification of CMOS Digital Logic Circuit (Part - I)
35 pages
Lab 3
No ratings yet
Lab 3
4 pages

An Efficient Floating Point Adder For Low-Power Devices

Uploaded by

An Efficient Floating Point Adder For Low-Power Devices

Uploaded by

International Journal of Reconfigurable and Embedded Systems (IJRES)

Vol. 13, No. 2, July 2024, pp. 253~261

An efficient floating point adder for low-power devices

Manjula Narayanappa1, Siva S. Yellampalli2

Article Info ABSTRACT

Journal homepage: http://ijres.iaescore.com

Figure 1. General IEEE-754 floating point format

2.2. Floating point adder architecture

Figure 2. Floating point adder algorithm

3. APPROXIMATE FLOATING POINT ADDER

Figure 3. Approximate adder concept

3.1. Basic building block: 8-bit approximate adder

𝐶𝑖+1 = 𝐺𝑖 + 𝐺𝑖−1 𝑃𝑖 + ⋯ + 𝐺0 ∏𝑖𝑗=1 𝑃𝑗 + 𝐶𝑖𝑛 ∏𝑖𝑗=0 𝑃𝑗 (1)

𝐶𝑖+1 = (∑𝑖𝑗=𝑖−𝑊+1 𝐺𝑗 (∏𝑖−1 𝑖−𝑊 𝑖−1 𝑖

Figure 4. Conventional full adder

Figure 5. W-bit window for approximate computation

Figure 6. Carry generator for 4-bit block

𝐶𝑖+1 = 𝑎𝑖 𝑏𝑖 + 𝑐𝑖𝑛 (𝑎𝑖 ⊕ 𝑏𝑖 )

Figure 7. Schematic for carry and sum generator of proposed adder

Table 2. Truth table for proposed adder

3.2. Mantissa approximate adder

Figure 8. Proposed 23-bit mantissa adder

3.3. Exponent adder/subtractor

𝐸𝐷(𝑎, 𝑏) = |𝑎 − 𝑏| = |∑𝑖 𝑎(𝑖) × 2𝑖 − ∑𝑗 𝑏(𝑗) × 2𝑗 | (5)

4.2. Mean error distance

5. SIMULATION AND RESULTS

5.1. Error metrics for proposed 8-bit adder

5.3. Power consumption tradeoff

Operating Voltages for DCT Technique

Figure 9. Operating voltages considering different bits for DCT technique

An efficient floating point adder for low-power devices (Manjula Narayanappa)

Manjula Narayanappa working as assistant professor in electronics and

An efficient floating point adder for low-power devices (Manjula Narayanappa)

You might also like