Efficient Automated Repair of High Floating-Point Errors in Numerical Libraries
XIN YI, National University of Defense Technology, China
LIQIAN CHEN, National University of Defense Technology, China
XIAOGUANG MAO∗ , National University of Defense Technology, China
TAO JI, National University of Defense Technology, China
Floating-point computation is by nature inexact, and numerical libraries that intensively involve floating-point
computations may encounter high floating-point errors. Due to the wide use of numerical libraries, it is highly
desirable to reduce high floating-point errors in them. Using higher precision degrades performance and may
also introduce extra errors for certain precision-specific operations in numerical libraries. Mathematical
rewriting, which mostly focuses on rearranging floating-point expressions or taking Taylor expansions, may not
fit for reducing high floating-point errors evoked by ill-conditioned problems, which are inherent in the
mathematical features of many numerical programs in numerical libraries.
In this paper, we propose a novel approach for efficient automated repair of high floating-point errors in
numerical libraries. Our main idea is to make use of the mathematical feature of a numerical program for
detecting and reducing high floating-point errors. The key components include a detecting method based on
two algorithms for detecting high floating-point errors and a repair method for deriving an approximation of
a mathematical function to generate a patch that satisfies a given repair criterion. We implement our approach
by constructing a new tool called AutoRNP. Our experiments are conducted on 20 numerical programs in
GNU Scientific Library (GSL). Experimental results show that our approach can efficiently repair (with 100%
accuracy over all randomly sampled points) high floating-point errors for 19 of the 20 numerical programs.
Additional Key Words and Phrases: Floating-point errors, automated repair, numerical program
∗ Corresponding author
Authors’ addresses: Xin Yi, Laboratory of Software Engineering for Complex Systems, College of Computer, National
University of Defense Technology, China, [email protected]; Liqian Chen, Laboratory of Software Engineering for
Complex Systems, College of Computer, National University of Defense Technology, China, [email protected]; Xiaoguang
Mao, Laboratory of Software Engineering for Complex Systems, College of Computer, National University of Defense
Technology, China, [email protected]; Tao Ji, Laboratory of Software Engineering for Complex Systems, College of
Computer, National University of Defense Technology, China, [email protected].
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee
provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and
the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses,
contact the owner/author(s).
This work is licensed under a Creative Commons Attribution 4.0 International License.
© 2019 Copyright held by the owner/author(s).
2475-1421/2019/1-ART56
https://doi.org/10.1145/3290369
Proc. ACM Program. Lang., Vol. 3, No. POPL, Article 56. Publication date: January 2019.
1 INTRODUCTION
Using floating-point representations instead of real arithmetic in numerical programs aims to make
calculation fast. However, floating-point arithmetic, which is accompanied by roundoff and truncation
errors, cannot guarantee that the results of numerical programs always have sufficient accuracy.
Numerical programs in numerical libraries that intensively involve floating-point computations
may encounter high floating-point errors. Hence, it is highly desirable to reduce high floating-point
errors in widely used numerical libraries.
One method to reduce high floating-point errors is to use higher precision to perform floating-
point calculation of the original program. For example, one may replace a 32-bit single precision
with a 64-bit double precision to improve the accuracy of results. However, higher-precision
execution slows down the program, sometimes even a thousand times [Benz et al. 2012].
In addition, higher-precision execution may introduce extra errors and thus may not be able to
improve the accuracy of numerical programs in numerical libraries, which may involve certain
precision-specific operations [Wang et al. 2016].
Mathematical rewriting is another choice, which reduces floating-point errors by rearranging
floating-point expressions or taking Taylor expansions. This method does not need higher
precision but requires users to know the finer details of floating-point arithmetic. Along this direction,
tools like Herbie [Panchekha et al. 2015] and Salsa [Damouche and Martel 2018] were developed to
utilize mathematical rewriting to generate more accurate floating-point expressions automatically.
However, mathematical rewriting may also fail to find a more accurate expression within
a limited search space constrained by the number of laws of mathematical transformation. In
particular, mathematical rewriting may not fit for reducing a high floating-point error evoked
by an ill-conditioned problem (indicated by a large condition number, see §2) that is inherent
in the mathematical features of many numerical programs in numerical libraries.
This paper aims to provide an efficient approach for automated repair of high floating-point errors
in numerical libraries. To this end, several challenges need to be addressed. The first challenge
is to detect high floating-point errors efficiently in numerical programs. The set of 64-bit floating-point
inputs of a numerical program is a huge search space, which makes exhaustive search
impractical. Moreover, numerical programs in numerical libraries are supposed to be already carefully
designed for accuracy, and the inputs that can trigger the remaining high floating-point errors tend to be
localized in small parts of the whole input domain, which makes detection even more challenging.
An efficient detecting method is desired to search for inputs that can trigger high floating-point
errors in this huge search space. The second challenge, which is also the key challenge, is to reduce
high floating-point errors to satisfy a given repair criterion. As mentioned before, using higher
precision not only degrades performance but also may introduce extra errors, while mathematical
rewriting is constrained by a limited search space and cannot handle ill-conditioned problems. In
particular, high floating-point errors evoked by ill-conditioned problems are hard to repair
even for experienced developers of numerical libraries¹. The third challenge is to reduce the time
overhead of repaired programs compared with the original programs. Performance is important for
numerical libraries, and thus repaired programs should not introduce too much time overhead.
This paper addresses these challenges mainly by exploiting the mathematical feature of numerical
programs. We suppose that a numerical program in numerical libraries is used to simulate a
mathematical function. First, we make use of the condition number of the mathematical function
and combine it with search algorithms to help detect high floating-point errors in the numerical
¹ For example, Di Franco et al. [2017] report that a bug (#6368: https://github.com/scipy/scipy/pull/6368) in
SciPy was first repaired by mathematical rewriting, and developers recognized later that the bug is due to an ill-conditioned
problem and may still remain even after the repair.
program (the first challenge). Then we directly extract an approximation based on the inputs
and outputs of the mathematical function to generate a patch for repairing the numerical program
to satisfy a given repair criterion (the second challenge). During the above process, we use a
simple but efficient method to approximate the mathematical function and a search optimization
to improve the performance of the generated patch, both of which are useful for reducing time
overhead of repaired programs (the third challenge). The contributions of this paper are as
follows:
• We present a detecting method that includes two novel algorithms (namely DEMC and PTB)
for detecting high floating-point errors in numerical programs. More specifically, guided by
the condition number, we use two global optimization algorithms (the Differential
Evolution algorithm and the Markov Chain Monte Carlo (MCMC) algorithm) to find the
input that can trigger the possible maximum floating-point error. As far as we know, there is
no existing work combining condition number and MCMC for finding the input triggering
maximum errors. Moreover, our detecting method not only finds the input that can trigger
the maximum floating-point error (like existing detecting methods [Chiang et al. 2014] [Zou
et al. 2015] [Yi et al. 2017b]), but also searches for inputs that can trigger floating-point errors
higher than a given repair criterion. (See §4.1.)
• We present a repair method to automatically produce patches that reduce the floating-point errors in
a numerical program to satisfy a given repair criterion. We prove that our repair method
terminates for an arbitrary given repair criterion. Unlike existing methods
that search for repairs by modifying the implementation of the original numerical
program, our method uses a piecewise quadratic function to approximate the corresponding
mathematical function of the numerical program to generate patches. Therefore, the approximation
is independent of the implementation of the numerical program and can be applied
to a numerical program in other numerical libraries that simulates the same mathematical
function. Moreover, our method uses a search optimization to improve the performance of a
generated patch to reduce the time overhead of repaired programs. (See §4.2 and §4.3.)
• We develop a prototype tool called AutoRNP and evaluate our approach by conducting
experiments on 20 numerical programs of GSL. Experimental results show that our approach
can efficiently repair high floating-point errors in 19 of the 20 numerical programs of GSL (with
100% accuracy, i.e., after repair, 100% of our randomly sampled inputs yield outputs that
satisfy the given repair criterion). (See §5.)
The rest of the paper is organized as follows. We first introduce the basics of floating-point
representation, the definition of high floating-point error, and the ill-conditioned problem in §2. We
then give an overview of our approach through an example in §3 and detail our approach in §4. We
provide the details of implementation together with experimental results in §5. The limitations of
our work are discussed in §6. In §7 we summarize related work. We end the paper with conclusions
and future work (§8).
2 PRELIMINARIES
Floating-point representation According to the IEEE-754 Standard [Kahan 1996], a floating-point
number can be represented in scientific notation:

f = (−1)^S × M × 2^E    (1)

where S ∈ {0, 1} is the 1-bit sign of f, which represents that f is positive (when S = 0) or negative
(when S = 1); M = m0.m1m2...mp is called the significand, in which .m1m2...mp is a
p-bit fraction and m0 is the hidden bit that needs no storage; E = e − bias is called the exponent,
where e is a biased e-bit unsigned integer and bias = 2^(e−1) − 1. Taking the 64-bit double-precision
format as an example, e = 11 (and thus bias = 1023) and p = 52.
High floating-point error We give the definition of a high floating-point error as follows:

Definition 1. Let f(x) represent a mathematical function and fp(x) represent the corresponding
numerical program implementing f(x). Given an error threshold ε, for an input x0, if
ErrorFunction(f(x0), fp(x0)) > ε, we say the input x0 triggers a high floating-point error.
The ErrorFunction in Definition 1 is a function measuring the floating-point error between the
outputs of the mathematical function and the numerical program. In this paper, following
[Panchekha et al. 2015], we define the floating-point error as the number of floating-point values
between the mathematical function output Or and the numerical program output Of. This error
can be represented by Eq. 2 and characterized by Eq. 3.

FPNum{Or, Of} = |{ai ∈ F | min(Or, Of) ≤ ai ≤ max(Or, Of)}|    (2)

ErrBits{Or, Of} = log2(FPNum{Or, Of})    (3)
FPNum{Or, Of} in Eq. 2 represents the number of floating-point values between the mathematical
function output Or and the numerical program output Of, including themselves. ErrBits{Or, Of}
in Eq. 3 counts the number of most-significant bits that the approximate and the exact results
agree on². Compared with relative error and absolute error over the reals, FPNum{Or, Of} and
ErrBits{Or, Of} keep consistent over the input space and avoid special handling for infinite and
denormalized values.

Fig. 1. Backward error: the computed result fp(x) is mapped back onto the mathematical function f as f(x + Δx).
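As a concrete illustration of Eqs. 2 and 3 (a sketch of our own, not part of AutoRNP; all helper
names below are ours), both metrics can be computed by reinterpreting the bit patterns of doubles
as order-preserving 64-bit integers:

import math, struct

def ordered_int(d):
    # Map a double to an integer such that integer order matches
    # floating-point order (negative doubles are mirrored).
    i = struct.unpack('<q', struct.pack('<d', d))[0]
    return i if i >= 0 else -(1 << 63) - i

def fp_num(o_r, o_f):
    # Eq. 2: floating-point values between o_r and o_f, inclusive.
    return abs(ordered_int(o_r) - ordered_int(o_f)) + 1

def err_bits(o_r, o_f):
    # Eq. 3.
    return math.log2(fp_num(o_r, o_f))

print(fp_num(1.0, 2.0))    # 4503599627370497, matching footnote 2
print(err_bits(1.0, 2.0))  # 52.0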
Ill-conditioned problem For a mathematical function f(x), its condition number function can be
expressed as follows:

C(x) = f′(x) · x / f(x)    (4)

where f′(x) denotes the derivative of the mathematical function f(x).
Condition number is an important quantity for measuring how sensitive a function is to errors
in the input. It has been mainly used to investigate instability of numerical programs [Bao and
Zhang 2013] [Tang et al. 2017]. To illustrate the influence of the condition number on the floating-point
error, we first introduce the notion of backward error. If we map the value given by a numerical
program fp(x) to its corresponding mathematical function f(x), as shown in Fig. 1, we can get
fp(x) ≃ f(x + Δx)    (5)

We call Δx the backward error B, and let δ = Δx/x. We follow the assumption of [Fu et al. 2015]
that the mathematical function f is smooth in a neighborhood of x and the backward error is small.
Then, by Taylor expansion, we have

Forward_error = (f(x) − fp(x)) / f(x) = (f(x) − f(x + δ·x)) / f(x) ≈ |δ| · (f′(x) · x / f(x)) + Θ(δ²)    (6)
² For example, FPNum{1.0, 2.0} = 4503599627370497 means there are 4503599627370497 floating-point values
between 1.0 and 2.0 including themselves, and ErrBits{1.0, 2.0} = log2(FPNum{1.0, 2.0}) = 52 means that the number
2.0 has 52 bits of error compared to 1.0. The value range of the error threshold ε is limited to [1, 2^64) by Eq. 2 and [0, 64) by
Eq. 3.
Eq. 6 shows that the forward error is mainly influenced by the backward error B = |δ| and the
condition number f′(x) · x / f(x), i.e.,

Forward_error ≈ B · C(x)    (7)
The ill-conditioned problem happens when the value of C(x) is large. As shown in Eq. 7, if the
value of C(x) is large enough, even a small backward error B leads to a large forward error.
Note that the value of C(x) is inherent in the mathematical function f(x) (according to Eq. 4) and
independent of the implementation of the numerical program fp(x). Theoretically, it is therefore very
difficult to repair a high floating-point error evoked by an ill-conditioned problem.
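For instance, the following mpmath sketch (our own illustration; mpmath is the oracle used in §5)
evaluates Eq. 4 for the Legendre polynomial of the motivating example in §3, near the root where
the condition number blows up:

from mpmath import mp, mpf, fabs

mp.dps = 50                          # high-precision oracle
x0 = mpf(0.7745966692414834)         # exact value of the double input
f  = lambda x: mpf('0.5') * x * (5*x*x - 3)
df = lambda x: mpf('7.5') * x * x - mpf('1.5')   # f'(x)
print(f(x0))                         # ~8.17e-17: x0 sits next to a root of f
print(fabs(df(x0) * x0 / f(x0)))     # condition number on the order of 1e16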
Notations For a mathematical function f(x) and its corresponding numerical program fp(x), we
use X ⊆ F to denote their input domain. We use the rounding function fl : R → F to convert a
real number to a floating-point number. In this paper, we consider a floating-point input interval I
which represents a set of floating-point numbers, i.e., I = [x0, xn] = {x0, x1, x2, ..., xn}. We use
the default rounding mode "round to nearest even" of the IEEE 754 standard. We use
{+, −, ×, /} to denote real-valued operations and {⊕, ⊖, ⊗, ⊘} to denote floating-point operations.
We assume that each floating-point operation is left-associative. We use the same definition of the ulp
function (in double precision) as [Lee et al. 2017], which follows Goldberg's definition [Goldberg
1991]:
For r ∈ R,  ulp(r) = 2^(k−52)  if |r| ∈ [2^k, 2^(k+1)) where k ∈ [−1022, 1023] ∩ Z;
            ulp(r) = 2^(−1074) if |r| ∈ [0, 2^(−1022))    (8)

where ulp (unit in the last place) is the gap between the two floating-point numbers nearest to
r, even if r is one of them. Formally, if a and b are two adjacent floating-point numbers around r
satisfying (r ≥ 0.0 ∧ a ≤ r < b) ∨ (r < 0.0 ∧ a < r ≤ b), then we have ulp(r) = |a − b|.
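A direct Python transcription of Eq. 8 (our own sketch; Python ≥ 3.9 also provides math.ulp,
which appears to agree with this definition) reads:

import math

def ulp(r):
    # Eq. 8, for IEEE 754 double precision.
    if abs(r) < 2.0 ** -1022:      # zeros and subnormals
        return 2.0 ** -1074
    _, e = math.frexp(abs(r))      # abs(r) = m * 2^e with m in [0.5, 1)
    k = e - 1                      # hence |r| lies in [2^k, 2^(k+1))
    return 2.0 ** (k - 52)

print(ulp(1.0))                    # 2^-52 ≈ 2.22e-16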
3 OVERVIEW
int gsl_sf_legendre_P3_e(double x, gsl_sf_result * result)
{
  result->val = 0.5 * x * (5.0 * x * x - 3.0);
  result->err = GSL_DBL_EPSILON * (fabs(result->val)
                + 0.5 * fabs(x) * (fabs(5.0 * x * x) + 3.0));
  return GSL_SUCCESS;
}

Fig. 2. Source code of gsl_sf_legendre_P3_e in GSL.
In this section, we give an overview of our approach by illustrating how it repairs the
high floating-point errors in a motivating example. The example is the program gsl_sf_legendre_P3
in GSL, whose source code is shown in Fig. 2. The example implements
the Legendre function Pn(x) with n = 3. The code in Fig. 2 shows that the result of the program
is calculated by the polynomial "0.5 ∗ x ∗ (5.0 ∗ x ∗ x − 3.0)". The ill-conditioned problem exists
around the roots of the polynomial. If we let x = x0 = 0.7745966692414834, the polynomial returns
output zero, while the mathematical output of legendre_P3 at x0 in real-number arithmetic should be
8.1726185204e-17. In this case, the result of FPNum(0, 8.1726185204e-17) is around 4.36611485515e+18,
which implies around 61.9 bits of error. More specifically, the expression 5.0 ⊗ x ⊗ x has a roundoff
error of 2e-16 for the input x0; this roundoff error is small relative to the value of 5.0 ⊗ x ⊗ x (which is
rounded to 3.0), but huge relative to 5.0 ⊗ x ⊗ x ⊖ 3.0 (which is rounded to 0.0). Thus, the bad cancellation
comes from the subtraction in the expression 5.0 ⊗ x ⊗ x ⊖ 3.0, which turns the roundoff
error in 5.0 ⊗ x ⊗ x into a large relative error. The condition number at the input x0 is around
7.341588237629298e+16, so even the small rounding error introduced in 5.0 ⊗ x ⊗ x will be enlarged
and propagated to the output due to the large condition number (according to Eq. 7).
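The cancellation can be reproduced in a few lines (our own demonstration, reusing err_bits from
the sketch in §2):

from mpmath import mp, mpf

mp.dps = 40
x0 = 0.7745966692414834
approx = 0.5 * x0 * (5.0 * x0 * x0 - 3.0)             # 0.0 per the analysis above
exact  = mpf('0.5') * mpf(x0) * (5 * mpf(x0)**2 - 3)  # ~8.1726e-17
print(approx, float(exact))
print(err_bits(float(exact), approx))                 # ~61.9 bits of error (Eq. 3)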
The work-flow of our approach is shown in Fig. 3. We now introduce the repair process step by
step.
Fig. 3. Work-flow of our approach. fp: numerical program; f: the corresponding mathematical function of fp;
Iinit: input domain of fp; MeanErr: the mean error of numerical program fp in Iinit; MaxErr: the maximum
error of numerical program fp in Iinit; τ: a parameter given by users to adjust the error threshold in the interval
[MeanErr, MaxErr] (in Eq. 18)
Detecting high floating-point errors. In this step, we try to find an input interval that
includes inputs that can trigger floating-point errors higher than an error threshold ε. As shown
in Fig. 3, first, we use the DEMC algorithm (§4.1) to search for an input xm that can trigger a
possible maximum floating-point error MaxErr. Then, we apply the PTB algorithm (§4.1) to find an
input interval Ierr which includes inputs that can trigger floating-point errors larger than the error
threshold ε. Finally, the input interval Ierr and the error threshold ε are passed to the next step.
For the motivating example, we get the input x0 = 0.7745966692414834 that can trigger a possible
maximum floating-point error. Then, under the given threshold ε = 6.8, we get the input interval
Ierr = [0.7719792508475998, 0.7771878196880129], as shown in Fig. 4(a).
Deriving an approximation of the mathematical function. After getting the input interval
Ierr, we derive an approximation of the mathematical function f over Ierr that satisfies the error
threshold ε. We use a piecewise quadratic function (§4.2) to approximate the mathematical function
over Ierr. As shown in Fig. 3, we first use a linear function to approximate the mathematical
function f, then use a quadratic function to compensate for the error between the linear function and
the mathematical function f (§4.2), making the approximation closer to f.
Unfortunately, an approximation using only one linear function with the error compensation of
a quadratic function may not be accurate enough to satisfy the error threshold, so we adopt an
iterative refinement algorithm (§4.2) that iteratively uses more pieces of linear approximation with
error compensation to generate a piecewise quadratic function approximating the mathematical
function over Ierr.

Fig. 4. ErrBits over the interval Ierr = [0.7719792508475998, 0.7771878196880129] around x0 = 0.7745966692414834,
under the error threshold ε = 6.8: (a) before repair; (b) after repair.

At the end of the process, we get Σ(l, c), which represents the piecewise quadratic function.
For the motivating example, under the error threshold ε = 6.8, a piecewise quadratic function
with around 1000 pieces is produced after iterative refinement to approximate the mathematical
function within the input interval Ierr (which contains more than 4691 billion double-precision
floating-point inputs) so as to satisfy the error threshold ε.
int gsl_sf_legendre_P3_e(double x, gsl_sf_result * result)
{
  if ((x <= 0.7771878196880129) && (x >= 0.7719792508475998)) {
    result->val = accuracy_improve_patch_of_gsl_sf_legendre_P3(x);
    result->err = GSL_DBL_EPSILON * fabs(result->val);
    return GSL_SUCCESS;
  }
  result->val = 0.5 * x * (5.0 * x * x - 3.0);
  result->err = GSL_DBL_EPSILON * (fabs(result->val)
                + 0.5 * fabs(x) * (fabs(5.0 * x * x) + 3.0));
  return GSL_SUCCESS;
}

Fig. 5. gsl_sf_legendre_P3_e after repair.
Producing patch. Finally, we convert the piecewise quadratic function Σ(l, c) into a patch. The
piecewise quadratic function is implemented in the following manner: for an input x in the input
interval Ierr, we first find which piece of the piecewise quadratic function x belongs to, then we call
the function representing that piece to compute the output for x. To reduce the time overhead
of searching for the right piece for an input, we apply a search optimization (§4.3) to accelerate this
search. After optimization, we convert Σ(l, c) with the search optimization into a function in C code.
To affect the readability of the original code as little as possible, we package the details
of the piecewise quadratic function in a separate function. As a result, a patch includes two parts:
1) a function in C code, stored in a separate file, that implements the piecewise quadratic
function; 2) a branch code fragment that is inserted into the source code of the numerical program to
decide whether to call the added function.
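The piece-lookup step admits a simple logarithmic-time implementation; the sketch below is our
own illustration with hypothetical names (§4.3 describes the actual search optimization), storing
the piece boundaries in sorted order and binary-searching them:

import bisect

def eval_patch(x, bounds, pieces):
    # bounds[i] is the left endpoint of piece i (sorted ascending, with
    # bounds[0] <= x); pieces[i] = (a, b, lam, xs, xe) encodes
    # lc_i(x) = a*x + b + lam*(x - xs)*(x - xe), cf. Eq. 14.
    i = bisect.bisect_right(bounds, x) - 1   # binary search for the piece
    a, b, lam, xs, xe = pieces[i]
    return (a * x + b) + lam * (x - xs) * (x - xe)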
For the motivating example, the program after repair is shown in Fig. 5. From Fig. 5, we see that
the patch of gsl_sf_legendre_P3 is packaged in the function accuracy_improve_patch_of_gsl_sf_
legendre_P3. The ErrBits distribution of the program after repair is shown in Fig. 4(b).
In summary, under a given error threshold, our approach localizes inputs that can trigger
high floating-point errors, encloses them in a certain input interval, derives an approximation of
the mathematical function that satisfies the error threshold, and converts the approximation to a patch
which is then inserted into the source code to complete the repair.
Existing methods on the example To the best of our knowledge, there is no efficient way for
mathematical rewriting to improve the accuracy of the polynomial "0.5 ∗ x ∗ (5.0 ∗ x ∗ x − 3.0)" around
x0. Factoring is a possible mathematical-rewriting solution for reducing errors around a root of a
polynomial: when evaluating a polynomial poly(x) = (x ⊖ r) ⊗ Q(x) near a root r, the execution of
x ⊖ r in floating-point arithmetic is exact (according to Sterbenz's theorem [Sterbenz 1973], see
§4.3). However, factoring requires that the root r can be expressed exactly as a floating-point
number; otherwise, the small roundoff error of r will also lead to a large relative error (according
to Eq. 7). In the example, the roots of "0.5 ∗ x ∗ (5.0 ∗ x ∗ x − 3.0)" cannot be exactly expressed by
floating-point numbers, so the idea of factoring does not work for the example. Note that the
ill-conditioned nature of the problem also suggests that mathematical rewriting techniques (such
as Herbie) will fail to reduce errors in the example³.
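The exactness claim behind factoring is easy to check; the following is our own small demonstration
of Sterbenz's theorem (if b ∈ [a/2, 2a], then a ⊖ b is exact), using exact rational arithmetic as the
reference:

from fractions import Fraction

a, b = 0.7771878196880129, 0.7719792508475998  # b lies within [a/2, 2a]
print(a - b == Fraction(a) - Fraction(b))      # True: the subtraction is exact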
Using higher precision can reduce errors around x0 but degrades the performance of the basic
function gsl_sf_legendre_P3, which is also called by other functions in GSL. For example, if we use
128-bit precision to calculate the polynomial "0.5 ∗ x ∗ (5.0 ∗ x ∗ x − 3.0)", the execution is
around 3.5 times slower than the original execution under 64-bit precision.
4 APPROACH
In this section, we detail our approach in three parts, which correspond to the work-flow in Fig. 3.
Fig. 6. Logic flow for detecting high floating-point errors. fp: numerical program; f: corresponding mathematical
function of fp; Iinit: input domain of fp; MaxErr: the maximum error in Iinit − ⋃_{k=0}^{i} Ierr^(k); ε: error
threshold
As shown in Fig. 6, we first search for an input xm that can trigger the possible maximum
floating-point error MaxErr^(0); then we search for an input interval Ierr^(0) around xm that encloses
inputs that can trigger floating-point errors higher than ε. The detecting method terminates if the
current MaxErr^(i) found is less than ε; otherwise, it iteratively finds all Ierr^(i). We now introduce
two algorithms that help us find MaxErr^(i) and Ierr^(i).
Detecting the possible maximum floating-point error (MaxErr) We propose an algorithm
called DEMC, based on two search algorithms, namely the Differential Evolution algorithm [Storn and
Price 1997] and the Markov Chain Monte Carlo (MCMC) algorithm [Andrieu et al. 2003], to search for an
input that can trigger the possible maximum floating-point error. Differential evolution is
a simple and efficient global optimization algorithm over continuous spaces. The algorithm operates
on real numbers and naturally fits numerical optimization. We use this algorithm to find
the input triggering the possible maximum condition number over the whole input domain of a
numerical program. The MCMC algorithm is a sampling method that draws samples from a target
(usually unknown) distribution. MCMC has been used to search for the maximum backward error [Fu
et al. 2015] and to achieve high coverage for floating-point code [Fu and Su 2017], and has also been
applied in STOKE [Schkufza et al. 2014] for stochastic search in floating-point optimization. We
configure the MCMC sampling such that it tends to attain the inputs that may trigger maximum
floating-point errors with higher probability than other points. We use the MCMC algorithm
to avoid local maxima and to find the input triggering the maximum floating-point error in a relatively
small search space.
First, we formalize the problem of detecting the maximum floating-point error as follows:

Definition 2. Let f(x) represent a mathematical function, fp(x) represent the corresponding
numerical program, and D ⊆ F denote the valid input domain. The detecting problem is to search
for an input xm ∈ D such that ∀xi ∈ D, ErrBits(f(xm), fp(xm)) ≥ ErrBits(f(xi), fp(xi)).
Then, we use two fitness functions to guide our DEMC method to find such an input xm.
Fit1: ErrBits. In this paper, we use ErrBits (defined in Eq. 3) to evaluate the error associated with an
output of a numerical program. Hence ErrBits becomes a fitness function of our approach.
Fit2: Appro(C(x)). We use the approximate value of the condition number C(x) (Eq. 4) of the
mathematical function f(x) as our second fitness function, that is
We use the Fit2 function to guide the search algorithm toward the possible inputs that may trigger
the ill-conditioned problem (if it exists) in a numerical program. The approximation (from C(x) to
Appro(C(x))) frees us from the exact calculation of the mathematical function f(x), which
needs high precision and may lead to high time overhead.
The DEMC algorithm is shown in Algorithm 1. The inputs of the DEMC algorithm include the
numerical program fp together with its corresponding mathematical function f, the input domain
Iinit of fp, and two fitness functions (Fit1 and Fit2). In brief, we first partition the search space (the
whole input domain) into many smaller parts, then apply the differential evolution algorithm using
Fit2 (which can be calculated fast) as the guide to search in each part, and finally call MCMC using Fit1
as the guide to refine the search results. As shown in Algorithm 1, the input domain is first partitioned
into many smaller parts (subintervals) by the Partition function, which is designed according to the
distribution of floating-point numbers. After that, we call the differential evolution algorithm
to search for the input that can trigger a large condition number in each small input subinterval Ii.
Next, we perform the MCMC algorithm to find the possible maximum floating-point error max_errt
around each input xi found previously by the differential evolution algorithm. In other words,
we employ the MCMC algorithm to search for higher errors in the neighborhood of xi. Finally, the
DEMC algorithm returns the maximum floating-point error MaxErr and the corresponding input xm.
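The following condensed Python sketch conveys the DEMC flow; it is our reconstruction under
stated assumptions (the partitioning, proposal distribution, and parameters are ours, not AutoRNP's),
built on SciPy's differential evolution:

import math, random
from scipy.optimize import differential_evolution

def demc(fit1, fit2, subintervals, steps=500):
    # fit2(x): Appro(C(x)), the cheap condition-number guide (Fit2);
    # fit1(x): ErrBits between f(x) and fp(x) (Fit1).
    best_x, best_err = None, -1.0
    for lo, hi in subintervals:                    # partitioned input domain
        # Stage 1: differential evolution maximizes Fit2
        # (SciPy minimizes, hence the negation).
        xi = differential_evolution(lambda v: -fit2(v[0]), [(lo, hi)]).x[0]
        # Stage 2: Metropolis-style MCMC walk around xi, guided by Fit1.
        x, e = xi, fit1(xi)
        for _ in range(steps):
            cand = x + random.gauss(0.0, (hi - lo) * 1e-3)
            if not lo <= cand <= hi:
                continue
            ce = fit1(cand)
            # Accept uphill moves; accept downhill with probability e^(ce-e).
            if ce >= e or random.random() < math.exp(ce - e):
                x, e = cand, ce
        if e > best_err:
            best_x, best_err = x, e
    return best_x, best_err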
Generating the target input interval (Ierr). To generate the input interval Ierr enclosing the
neighbors of xm with respect to the given error threshold ε, we propose a so-called Point-to-Bound
(PTB) algorithm.
Intuitively, around the input xm producing the maximum floating-point error, there may exist
some other inputs causing high floating-point errors. According to the formula Forward_error ≈
B · C(x) (Eq. 7), a high floating-point error evoked by an ill-conditioned problem may decline as
the value of the condition number C(x) decreases. Moreover, if the mathematical function is
close to a linear function, the derivative |f′(x)| of the mathematical function is almost constant
in a small interval around xm, and thus the value of |f′(x) · x| is also almost constant in a
small interval around xm, which means that the value of |f(x)| becomes the main influential factor
of the condition number C(x) = f′(x) · x / f(x) (Eq. 4).
Based on the above analysis, we know that the floating-point errors triggered by inputs around xm
will decrease as the value of |f(x)| increases. Meanwhile, the formula (Eq. 2) we use to
evaluate the floating-point error is also implicitly related to the ulp value of |f(x)|. Thus we have
the intuition that the floating-point errors triggered by inputs around xm may exhibit a stepwise decline
with the value of ulp(f(x)). For example, as shown in Fig. 4(a), x0 is the input triggering the
maximum error and also the input (closest to the root of the function) making ulp(f(x)) smallest,
and the values of the errors exhibit a stepwise decline trend around x0.
We design the PTB algorithm mainly based on this possible stepwise decline trend of the floating-point
errors triggered by inputs around xm. The PTB algorithm is shown in Algorithm 2. The
inputs of the PTB algorithm include the numerical program fp together with its corresponding
mathematical function f, the input xm as the starting point, the error threshold ε, and an initial
step for the search bound. The output of the algorithm is the target Ierr.
To generate an Ierr as small as possible that includes all inputs around xm that can trigger
floating-point errors higher than the given error threshold ε, we first search for a tight upper
bound (Up_bound) of Ierr which is larger than xm (Line 1). The temporary value of Up_bound is
saved in the variable temp_bound (Line 6), and the variable step (Line 7) is used to refresh the value of
temp_bound. We use the variable temp_max_error to record the local maximum floating-point error
around temp_bound. Note that Max_error_find (Line 10) finds the maximal floating-point error in
a small interval (decided by the value of step) around temp_bound. The value of times is larger than
2.0, and the value of temp_max_error ⊘ ε is used to adjust (Line 12) the value of step. temp_bound
keeps changing by step ⊗ sign until temp_max_error ≤ ε (Lines 8 - 9). The value of step may
increase too much, which would lead temp_bound far away from the possible smallest Up_bound,
so we call the IterationBack function to refine the last step (last_step) to find a tighter Up_bound. Then,
we perform a similar process to find the possible largest lower bound Lower_bound of Ierr. After
finding the lower bound and upper bound of Ierr (Lines 1 - 2), the algorithm returns the target
input interval Ierr = [Lower_bound, Up_bound].
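To make the bound search concrete, here is a loose paraphrase of one direction of Algorithm 2
(our own sketch; the step-update rule and the bisection-style IterationBack are simplifications of ours,
and max_err_near is a hypothetical stand-in for Max_error_find):

def ptb_one_side(max_err_near, xm, eps, init_step, sign, times=2.5):
    # Push a bound away from xm until local errors fall below eps.
    # max_err_near(x, w): largest error found in a width-w interval around x.
    bound, step = xm, init_step
    while True:
        nxt = bound + sign * step
        e = max_err_near(nxt, step)
        if e <= eps:
            # IterationBack, sketched as bisection over the last step,
            # tightens the bound instead of accepting the overshoot.
            inner, outer = bound, nxt
            for _ in range(40):
                mid = 0.5 * (inner + outer)
                if max_err_near(mid, abs(outer - inner)) <= eps:
                    outer = mid
                else:
                    inner = mid
            return outer
        bound = nxt
        step *= times * max(e / eps, 1.0)   # grow faster while errors stay high

# Ierr = [ptb_one_side(me, xm, eps, s0, -1), ptb_one_side(me, xm, eps, s0, +1)]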
Definition 3. Given an error threshold ε and an input interval Ierr, for a mathematical
function f : R → R, if there exists a function f̂ : F → F such that for any floating-point input
xi ∈ Ierr, ErrBits(f̂(xi), f(xi)) ≤ ε holds, then f̂ is said to be an approximation of f in the
given Ierr that satisfies the given error threshold ε.
Theorem 4.2 shows that for a mathematical function (over floating-point inputs), there always
exists a piecewise linear function that can satisfy a given error threshold ε, e.g., Σ̲l(x) in the
worst case. Note that Σ̲l(x) is equivalent to Df (in Theorem 4.1). Based on Theorem 4.2, we can
continuously reduce the error at a point that does not yet satisfy the error threshold ε by creating
a new line segment. After a limited number of iterations, we can find a piecewise linear function Σl(x)
satisfying ErrBits(Σl(xi), f(xi)) ≤ ε, and in the worst case we have Σl(xi) = Σ̲l(xi). For example,
for the intermediate point (xk, Ok) in Fig. 7, a possible error εk exists between Ok^l and Ok. If εk is
smaller than a given error threshold ε, we say that we have already found a linear approximation
for f(x) at the three points; otherwise, a piecewise linear function Σl(x) consisting of two new line
segments (l1 and l2) replaces l(x), making ErrBits(Σl(xk), f(xk)) = 0 ≤ ε.
Error compensation Using purely linear approximation may require too many pieces of linear
functions to express an accurate enough approximation of a mathematical function, even in a small
input interval Ierr. Therefore, we add an error compensation to each piece of the piecewise linear
function to reduce the error between the linear approximation and the mathematical function.
To conduct the error compensation on each linear function l(x), we first introduce a function
AbsErr(x) : F → F to express the absolute error between a linear function and the original
mathematical function:

For any x ∈ Ierr,  AbsErr(x) = fl(f(x)) ⊖ l(x)    (12)

The calculation of the function AbsErr(x) needs to calculate the mathematical function f(x) (which
cannot be implemented exactly in our patch using finite precision), so we use an error compensation
function to approximate AbsErr(x). According to Eq. 11, for the two end points of a line segment,
we have AbsErr(xs) = AbsErr(xe) = 0. Considering the possible nonlinear feature of the mathematical
function in Ierr and the fact that we already know two roots (xs, xe) of a quadratic function, we use
the following quadratic function to approximate AbsErr(x):

For any x ∈ Ierr,  AbsErr(x) ≈ λ ⊗ (x ⊖ xs) ⊗ (x ⊖ xe)    (13)

We choose the middle point xm of xs and xe to calculate the value of λ. Then we have λ =
AbsErr(xm) ⊘ ((xm ⊖ xs) ⊗ (xm ⊖ xe)), wherein AbsErr(xm) can be computed through fl(f(xm)) ⊖
l(xm) following Eq. 12.
Finally, we define the error compensation function as c(x) = λ ⊗ (x ⊖ xs) ⊗ (x ⊖ xe) and add it
back to the linear function. We then have the following, possibly more accurate, approximation
of the mathematical function:

For any x ∈ Ierr,  lc(x) = l(x) ⊕ c(x)    (14)
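A compact sketch of one piece (our own illustration; unlike the real patch generation, it evaluates
the construction naively in double precision, uses mpmath as the oracle f, and the interval
endpoints below are ours):

from mpmath import mp, mpf, besselj

mp.dps = 40

def make_lc(f_exact, xs, xe):
    # One piece lc(x) = l(x) + c(x), per Eqs. 11-14.
    fs, fe = float(f_exact(xs)), float(f_exact(xe))
    slope = (fe - fs) / (xe - xs)
    l = lambda x: fs + slope * (x - xs)          # linear interpolation (Eq. 11)
    xmid = 0.5 * (xs + xe)                       # midpoint fixes lambda
    lam = (float(f_exact(xmid)) - l(xmid)) / ((xmid - xs) * (xmid - xe))
    return lambda x: l(x) + lam * (x - xs) * (x - xe)   # Eq. 14

# e.g., one piece for bessel_J1 on a tiny interval near its root (cf. Fig. 8(a)):
lc = make_lc(lambda x: besselj(1, mpf(x)), 3.8317056, 3.8317066)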
The error compensation helps us get a more accurate approximation of the mathematical
function. For example, as shown in Fig. 8(a), for the mathematical function bessel_J1, we compare
AbsErr(x) with lc(x) ⊖ l(x) to show the effectiveness of the error compensation, using
l(x) ⊖ l(x) as the baseline. From Fig. 8(a), we can see that for this example one line segment with
a single error compensation (red line) fits the original mathematical function (bold yellow
line) well.
Fig. 8. AbsErr(x), lc(x) ⊖ l(x), and the baseline l(x) ⊖ l(x) over small input intervals of (a) bessel_J1 and (b) airy_Ai.
However, not all functions can be approximated well enough to satisfy a given error threshold ε by a
line segment with a single error compensation. E.g., as shown in Fig. 8(b), even two
line segments with error compensations may still not be accurate enough for a small
error threshold ε, and thus more line segments with error compensations may be needed to satisfy
ε. Therefore, we propose an iterative refinement algorithm that calls the previous two steps (linear
approximation and error compensation) iteratively until a given error threshold ε is satisfied.
Iterative refinement The iterative refinement algorithm iteratively applies linear approximation
and error compensation to generate a piecewise quadratic function f̂ that satisfies the given
error threshold ε.
As shown in Algorithm 3, the inputs of the iterative refinement algorithm include the mathematical
function f, the given error threshold ε, and the input interval Ierr = [Lower_bound, Up_bound]
that is the output of Algorithm 2. In Algorithm 3, we maintain a global list Line_list to store all
pieces of linear functions with error compensations and return it as the output of the algorithm.
Algorithm 3 first calls the function LinearApproximate (Line 6) to generate a linear function l.
After getting l, the algorithm calls the function ErrorCompensation (Line 7) to generate the error
compensation c for l. After getting l and c, the algorithm generates the approximation function
f̂ (Line 8) for f within Ierr. Then, the maximum error MaxErr_f̂ between f̂ and f is found by the
function MaxErrorSearch (Line 9). The recursive function IterLineApprox exits if the value of
MaxErr_f̂ is smaller than the given threshold ε, in which case the linear approximation with
error compensation (i.e., (l, c)) is stored in the global list Line_list (Line 11). Otherwise, two
new input intervals [Lower_bound, xnext] and [xnext, Up_bound] are passed to the recursive
function for a new iteration on the new input intervals, where xnext is the input found to
trigger the possible maximum error MaxErr_l in l.
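In sketch form (ours, reusing make_lc from the previous sketch; max_err_search stands in for
MaxErrorSearch and is assumed to return an interior worst-case input and its ErrBits):

def iter_line_approx(f_exact, lo, hi, eps, pieces, max_err_search):
    # Recursive refinement in the spirit of Algorithm 3: accept the
    # piece if its worst error is within eps, otherwise split at the
    # worst input and recurse on both halves.
    lc = make_lc(f_exact, lo, hi)
    x_worst, err = max_err_search(lc, f_exact, lo, hi)
    if err <= eps:
        pieces.append((lo, hi, lc))          # plays the role of Line_list
    else:
        iter_line_approx(f_exact, lo, x_worst, eps, pieces, max_err_search)
        iter_line_approx(f_exact, x_worst, hi, eps, pieces, max_err_search)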
Effectiveness analysis. The effectiveness of the algorithm is mainly influenced by the mathematical
feature of the mathematical function f. For example, as shown in Fig. 8(a), bessel_J1 is
not a quadratic function, but it has a quadratic polynomial feature in a small input interval and
thus can be well approximated by the lc(x) function (Eq. 14), while the function airy_Ai (in Fig.
8(b)) may need more iterations to satisfy the same given error threshold ε.
⁵ https://docs.scipy.org/doc/scipy-1.0.0/reference/optimize.html
⁶ https://docs.scipy.org/doc/numpy/reference/generated/numpy.polyfit.html
To evaluate the repair results, we define the accuracy of repair using the following formula:

AccRepair = (PassNum / TestNum) × 100%    (19)

where PassNum = |{x | ErrBits(f(x), fp(x)) ≤ ε, x ∈ Ierr}| and TestNum = Min(|{x | x ∈ Ierr}|, 100000). As
shown in Eq. 19, we evaluate the accuracy of repair by checking, over TestNum randomly sampled
inputs, how many outputs of fp(x) have floating-point errors no larger than the given error
threshold ε.
Moreover, we also investigate the change of the maximum floating-point error and the average
floating-point error over the input interval Ierr after repairing.
and found that AutoRNP can complete the repair of P20 to satisfy the low error threshold Lε within
8 hours. Moreover, we find that GSL uses different formulas to implement the gsl_sf_psi_1 function.
As shown in Fig. 11, when the input x satisfies x ≤ −5, the GSL program implements a formula
(6.4.7 in [Abramowitz 1974]) different from the one used for x > −5.0, while the mathematical
function (in mpmath) uses the same formula (6.4.6 in [Abramowitz 1974]) for both cases. If
we change the implementation of gsl_sf_psi_1 according to formula 6.4.6 for x ≤ −5, the high
floating-point errors of gsl_sf_psi_1 in Ierr disappear.
What stands out in Table 2 is the "Accuracy of Repair", which is 100% for all three levels of error
thresholds and for all programs except P20. The results show that most subjects have a polynomial
feature in their small input interval Ierr and can be well approximated by our approach.
Fig. 12 shows the bits correct for the maximum error and the average error after repair under different
levels of error thresholds.
Fig. 12. (a) Bits correct for maximum error; (b) bits correct for average error (longer is better). Each row
represents the improvement in accuracy achieved by AutoRNP on a single benchmark. The thick arrow starts
at the accuracy of the program before repair and ends at the accuracy of the program after repair. A triangle
is drawn at the value of the error threshold for each subject. A pentagram is drawn at the value of the mean
error of each subject in its whole input domain.
Accuracy in Fig. 12(a) is measured by ErrBits, and the maximum error is found by our DEMC
algorithm (in §4). Accuracy in Fig. 12(b) is measured by ErrBits, averaged across 100 000 random
input points in the Ierr of a program⁸. As shown in Fig. 12, for all given
thresholds (Hε, Mε, Lε) and all subjects except P20, AutoRNP successfully improves the accuracy
such that the maximum error does not exceed the given error threshold and the resulting average
error decreases. Interestingly, for P11, after repairing, its maximum error gets almost as low as
the mean error, as shown in Fig. 12. The timings for repairing P11 are also quite small, actually
⁸ If the number of floating-point inputs in Ierr is less than 100 000, then we test all floating-point inputs in Ierr.
the smallest one for Hε, as shown in Table 2. In fact, gsl_sf_legendre_P2 (i.e., P11) is a quadratic
function, which is a natural fit for our approach.
Fig. 13. Repair time and size of Ierr (log2 values) for each subject under the three levels of error thresholds.
Fig. 13 shows the log2 values of the repair time and the size of Ierr for each subject under the three thresholds.
The size of Ierr is the number of floating-point values in Ierr (evaluated by Eq. 2). Note that we
label the number of pieces in the approximation function f̂ (i.e., the piecewise quadratic function)
on each dot of the three Ierr lines (the blue lines in the upper part of Fig. 13). As shown in Fig. 13, the
higher the level of error threshold, the more repair time our approach needs, the larger the size of
Ierr, and the more pieces in the piecewise quadratic function.
In summary, the above results suggest that our approach can efficiently repair high floating-point
errors in numerical libraries to satisfy a given error threshold.
Fig. 14. AutoRNP time overhead ratio (left is better) on the 19 successfully repaired programs, under Lε, Mε,
and Hε: (a) time overhead on Ierr; (b) time overhead on the whole input domain.
Storage overhead Extra storage is needed for storing the patch. We evaluate the storage
overhead by the size of the patch file. As shown in Table 4, the storage overhead increases with the
level of error thresholds. Note that there exist significant differences between the subjects in
storage overhead for the high-level error threshold Hε.
Table 4. Storage overhead (patch file sizes) under the three levels of error thresholds

Threshold    P1      P2      P3     P4      P5      P6      P7      P8      P9     P10
Lε         1.68    1.69    1.82   1.82    1.73    1.73    1.73    2.04    1.69    1.73
Mε         1.69    1.69    1.82   1.82    4.05    4.67    9.61   15.15    3.25    8.21
Hε         2.15    5.10    7.42   4.78   67.92  149.08  479.39  393.82  107.46  211.58

Threshold   P11     P12     P13    P14     P15     P16     P17     P18     P19
Lε         1.78    1.77    2.39   1.60    2.22    1.58    1.67    3.01    2.82
Mε         2.24    4.54   23.38   3.79   18.89    7.59    7.37    4.39    4.37
Hε        20.69  157.09  591.47  66.51  533.47  542.30  332.95   72.46   32.44
Readability The process (in §4.2) of deriving the approximation of the mathematical function
is based only on the inputs and outputs of f and does not involve the implementation of the numerical
program. The patch is also independent of the implementation of the original subject. Thus we can
package the main implementation of a patch in a function and store it in an individual file. In the
source code of the original program, we just need to add a new branch to decide whether to call the
function in the patch, as shown in Fig. 5. Our approach, based on simple linear approximation and
error compensation, also makes the patch code (in Fig. 9) easy to understand.
programs (Table 1) are also implemented by 16 numerical programs in SciPy. We also detect high
floating-point errors in 14 of those 16 numerical programs in SciPy on the same Ierr as in GSL. We
convert our approximation to Python code and successfully repair those 14 numerical programs in
SciPy.
6 DISCUSSION
Execution of mathematical function Higher-precision execution of the original numerical
programs may introduce extra errors for numerical programs in numerical libraries that involve
precision-specific operations [Wang et al. 2016], so we do not use higher-precision
execution to obtain mathematical results. In our experiments, we use the package
mpmath to supply the corresponding mathematical functions of the original numerical programs in
GSL. mpmath [Johansson et al. 2013] is an open-source library for real and complex floating-point
arithmetic with arbitrary precision, and many computer algebra systems (e.g., SageMath) use
mpmath as their underlying library. Since the corresponding functions supplied by mpmath use
arbitrary precision, we assume that they do not include precision-specific operations and thus
can supply the (nearly) mathematical results of the numerical programs in GSL.
Accuracy of repair A reasonable explanation for the 100% AccRepair for all subjects (except
P20) might be that the search algorithm is very accurate in finding the maximum floating-point error
in a small search space. A higher level of error threshold needs an approximation function with
more pieces to reduce the error. The search space is then partitioned in a more fine-grained manner, which
also decreases the difficulty of finding a maximum floating-point error in the smaller search space
and increases the accuracy of the search algorithm. We have also repeated our experiments many
times with different random seeds, and the accuracy of repair remains 100%. However, we cannot
guarantee that the accuracy of repair is always 100% for all programs.
Influence on functional correctness According to Theorem 4.1 and Theorem 4.2, in principle,
our approach can produce a repair semantically equivalent to the mathematical function by setting
the error threshold ε to zero (in the worst case by storing the mathematical result for
each floating-point input inside Ierr). Besides, for a numerical program involving ill-conditioned
problems, the original implementation has already compromised functional correctness, and rewrite-based
approaches are not fit for fixing it. However, since ours is a dynamic analysis method, we cannot
guarantee the soundness of our approach unless we perform an exhaustive search (i.e., testing all
inputs).
Form of approximation An interesting question is why we choose a piecewise quadratic
function to approximate a mathematical function. Accurate implementation is
critical when approximating mathematical functions. Using quadratic approximations, we can apply
strategies (in §4.3) that make the subtraction operations in the approximations exact and thus
keep high accuracy in the implementation (while other forms of approximation may not have such
properties). Moreover, we choose a simple way to solve the problem, and our experimental results
show that our approach is effective on most subjects.
7 RELATED WORK
Floating-point error detection Since the work of Benz et al. [2012], the study of dynamically
detecting floating-point errors has gained momentum. Benz et al. [2012] developed a tool called
FpDebug, which is built on MPFR [Fousse et al. 2007] and Valgrind [Nethercote and Seward
2007] and can perform a shadow execution of the original program in higher precision to detect
floating-point errors for every instruction. Based on FpDebug, Zou et al. [2015] proposed a detection
approach called LSGA to search for inputs that can trigger the possible maximum floating-point error
in a numerical program. Recently, Herbgrind, a tool similar to FpDebug, was developed by
Sanchez-Stern et al. [2018]. Besides supporting similar functionalities to FpDebug, Herbgrind can
find the root cause of a floating-point error and extract the corresponding floating-point expressions.
However, the above approaches and tools are all based on the assumption that the semantics of
floating-point code in higher precision is closer to the semantics of the mathematical function.
Hence they cannot deal with precision-specific operations [Wang et al. 2016] and may
introduce unexpected errors. Aware of the precision-specific operations in numerical programs, Yi
et al. [2017b] verified that many of the high floating-point errors in GSL reported by Zou et al. [2015]
are in fact false alarms. Yi et al. [2017b] also proposed a new search algorithm called EAGT with a
new fitness function from error analysis.
Compared with previous detecting methods, as far as we know, there is no existing work
combining condition number and MCMC for finding the input triggering the maximum error. Moreover,
our detecting method not only uses the DEMC algorithm to search for the input that can trigger the
maximum floating-point error (like existing detecting methods [Chiang et al. 2014] [Zou et al. 2015]
[Yi et al. 2017b]), but also uses the PTB algorithm to localize inputs that can trigger floating-point
errors higher than a given repair criterion.
Automatically improving the accuracy of floating-point expressions Tools for automatically
improving the accuracy of floating-point expressions can be classified by their underlying analysis
methods (static or dynamic). Static analysis of floating-point code tries to provide a sound bound on errors
but may have low precision due to overly conservative over-approximations and may have limited
scalability. In contrast, dynamic analysis can scale to large programs and find exact floating-point
errors during dynamic execution of numerical programs, but cannot guarantee a sound bound on
errors for a given input interval.
Static tools: Salsa [Damouche and Martel 2018] is a tool built on sound abstract interpretation
[Cousot and Cousot 1977] that uses mathematically equivalent transformations [Damouche
et al. 2017] to improve the accuracy of floating-point programs. Salsa can analyze numerical programs
with loops and functions, but cannot deal with complex data structures like arrays, function
pointers, library function calls, etc.
Compared with Salsa, our work is based on dynamic analysis. Thus, we cannot guarantee a
sound bound on errors for a given input interval like Salsa, but our experimental results show that
we achieve a high accuracy rate (100%) of repair, reducing high floating-point errors to satisfy a
given error threshold over a large number of floating-point inputs.
Dynamic tools: Herbie [Panchekha et al. 2015] is a tool for automatically improving the accuracy
of floating-point expressions. Like Salsa, Herbie uses mathematical rewriting to improve accuracy.
To apply Herbie to numerical programs, the authors of Herbie developed the tool Herbgrind
[Sanchez-Stern et al. 2018] to find the root causes of floating-point errors and extract the corresponding
floating-point expressions that can be analyzed by Herbie. The combination of Herbie and Herbgrind
is called HBG in our paper, and its detailed implementation is introduced in §5.2. AutoFP [Yi et al.
2017a] is a tool similar to HBG, but is built on FpDebug [Benz et al. 2012] and Herbie.
AutoFP tries to divide a program into blocks and improve the accuracy of every block to decrease the
complexity of the analysis. Although Yi et al. [2017a] first tried to use AutoFP to repair high floating-point
errors in numerical programs, AutoFP uses FpDebug with Herbie and is thus quite similar to HBG (Herbgrind
with Herbie). Moreover, AutoFP does not support numerical programs with complex data structures
in GSL.
Compared with the above work, our work targets a different goal and uses different methods. Concerning the goal, we focus on automatically repairing high floating-point errors down to a given threshold, rather than on improving accuracy without a given criterion or target. Concerning methodology, our approach uses linear approximation with error compensation to derive an approximation of the mathematical function and generate a patch that satisfies the given repair criterion.
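As a rough illustration of the flavor of such a patch, here is a minimal sketch of our own (a simplification, not AutoRNP's actual patch generator): around an ill-conditioned point x0, the mathematical function f is replaced by the first-order approximation f(x0) + f'(x0)(x - x0), whose coefficients are precomputed in high precision and each split into a leading double plus a compensating double, so the patch itself runs entirely in double precision:

from mpmath import mp, mpf, diff

mp.prec = 128

def two_part(v):
    """Split a high-precision value into a double and a compensation double."""
    hi = float(v)
    lo = float(v - mpf(hi))
    return hi, lo

def make_linear_patch(f, x0):
    """Precompute f(x0) and f'(x0) in high precision; return a double-only patch."""
    c0_hi, c0_lo = two_part(f(mpf(x0)))
    c1_hi, c1_lo = two_part(diff(f, mpf(x0)))
    def patch(x):
        dx = x - x0
        # evaluate c1*dx + c0 with the small compensation terms added separately
        return (c1_hi * dx + c0_hi) + (c1_lo * dx + c0_lo)
    return patch

# hypothetical usage: patch cos around the double closest to pi/2
patch = make_linear_patch(mp.cos, 1.5707963267948966)
print(patch(1.5707963267948966))

In a full repair, such a patch would be invoked only for inputs falling inside an interval detected as triggering errors above the repair criterion, with the original code used everywhere else.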
ACKNOWLEDGMENTS
We thank Pavel Panchekha for his discussions on using Herbie and Herbgrind, and Zhengfeng
Yang for his helpful suggestions on this work. We also thank the anonymous reviewers for their
valuable comments. This work is supported by the National Key R&D Program of China (No.
2017YFB1001802), and the National Natural Science Foundation of China (Nos. 61672529, 61872445,
61502015).
REFERENCES
Milton Abramowitz. 1974. Handbook of Mathematical Functions, With Formulas, Graphs, and Mathematical Tables. Dover
Publications, Inc., New York, NY, USA.
Christophe Andrieu, Nando de Freitas, Arnaud Doucet, and Michael I. Jordan. 2003. An Introduction to MCMC for Machine
Learning. Machine Learning 50, 1 (01 Jan 2003), 5–43. https://doi.org/10.1023/A:1020281327116
Tao Bao and Xiangyu Zhang. 2013. On-the-fly Detection of Instability Problems in Floating-point Program Execution. In
Proceedings of the 2013 ACM SIGPLAN International Conference on Object Oriented Programming Systems Languages &
Applications (OOPSLA ’13). ACM, New York, NY, USA, 817–832. https://doi.org/10.1145/2509136.2509526
Florian Benz, Andreas Hildebrandt, and Sebastian Hack. 2012. A Dynamic Program Analysis to Find Floating-point Accuracy
Problems. In Proceedings of the 33rd ACM SIGPLAN Conference on Programming Language Design and Implementation
(PLDI ’12). ACM, New York, NY, USA, 453–462. https://doi.org/10.1145/2254064.2254118
Florian Cajori. 1911. Horner’s method of approximation anticipated by Ruffini. Bull. Amer. Math. Soc. 17, 8 (1911), 409–414.
https://doi.org/10.1090/S0002-9904-1911-02072-9
Wei-Fan Chiang, Ganesh Gopalakrishnan, Zvonimir Rakamaric, and Alexey Solovyev. 2014. Efficient Search for Inputs
Causing High Floating-point Errors. SIGPLAN Not. 49, 8 (Feb. 2014), 43–52. https://doi.org/10.1145/2692916.2555265
Patrick Cousot and Radhia Cousot. 1977. Abstract Interpretation: A Unified Lattice Model for Static Analysis of Programs
by Construction or Approximation of Fixpoints. In Proceedings of the 4th ACM SIGACT-SIGPLAN Symposium on Principles
of Programming Languages (POPL ’77). ACM, New York, NY, USA, 238–252. https://doi.org/10.1145/512950.512973
Nasrine Damouche and Matthieu Martel. 2018. Salsa: An Automatic Tool to Improve the Numerical Accuracy of Programs.
In Automated Formal Methods (Kalpa Publications in Computing), Natarajan Shankar and Bruno Dutertre (Eds.), Vol. 5.
EasyChair, 63–76. https://doi.org/10.29007/j2fd
Nasrine Damouche, Matthieu Martel, and Alexandre Chapoutot. 2017. Numerical Accuracy Improvement by Interprocedural
Program Transformation. In Proceedings of the 20th International Workshop on Software and Compilers for Embedded
Systems (SCOPES ’17). ACM, New York, NY, USA, 1–10. https://doi.org/10.1145/3078659.3078662
Favio DeMarco, Jifeng Xuan, Daniel Le Berre, and Martin Monperrus. 2014. Automatic Repair of Buggy if Conditions and
Missing Preconditions with SMT. In Proceedings of the 6th International Workshop on Constraints in Software Testing,
Verification, and Analysis (CSTVA ’14). ACM, New York, NY, USA, 30–39. https://doi.org/10.1145/2593735.2593740
Anthony Di Franco, Hui Guo, and Cindy Rubio-González. 2017. A Comprehensive Study of Real-world Numerical Bug
Characteristics. In Proceedings of the 32nd IEEE/ACM International Conference on Automated Software Engineering (ASE
’17). IEEE Press, Piscataway, NJ, USA, 509–519. http://dl.acm.org/citation.cfm?id=3155562.3155627
Thomas Durieux and Martin Monperrus. 2016. DynaMoth: Dynamic Code Synthesis for Automatic Program Repair. In
Proceedings of the 11th International Workshop on Automation of Software Test (AST ’16). ACM, New York, NY, USA, 85–91.
https://doi.org/10.1145/2896921.2896931
Laurent Fousse, Guillaume Hanrot, Vincent Lefèvre, Patrick Pélissier, and Paul Zimmermann. 2007. MPFR: A Multiple-
precision Binary Floating-point Library with Correct Rounding. ACM Trans. Math. Softw. 33, 2, Article 13 (June 2007).
https://doi.org/10.1145/1236463.1236468
Zhoulai Fu, Zhaojun Bai, and Zhendong Su. 2015. Automated Backward Error Analysis for Numerical Code. In Proceedings
of the 2015 ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications
(OOPSLA ’15). ACM, New York, NY, USA, 639–654. https://doi.org/10.1145/2814270.2814317
Zhoulai Fu and Zhendong Su. 2017. Achieving High Coverage for Floating-point Code via Unconstrained Programming. In
Proceedings of the 38th ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI ’17). ACM,
New York, NY, USA, 306–319. https://doi.org/10.1145/3062341.3062383
David Goldberg. 1991. What Every Computer Scientist Should Know About Floating-point Arithmetic. ACM Comput. Surv.
23, 1 (March 1991), 5–48. https://doi.org/10.1145/103162.103163
Divya Gopinath, Muhammad Zubair Malik, and Sarfraz Khurshid. 2011. Specification-based Program Repair Using SAT. In
Proceedings of the 17th International Conference on Tools and Algorithms for the Construction and Analysis of Systems:
Part of the Joint European Conferences on Theory and Practice of Software (TACAS’11/ETAPS’11). Springer-Verlag, Berlin,
Heidelberg, 173–188. http://dl.acm.org/citation.cfm?id=1987389.1987408
Fredrik Johansson et al. 2013. mpmath: a Python library for arbitrary-precision floating-point arithmetic (version 0.18).
http://mpmath.org/.
William Kahan. 1996. IEEE standard 754 for binary floating-point arithmetic. Lecture Notes on the Status of IEEE 754,
94720-1776 (1996), 11.
Wonyeol Lee, Rahul Sharma, and Alex Aiken. 2017. On Automatically Proving the Correctness of math.h Implementations.
Proc. ACM Program. Lang. 2, POPL, Article 47 (Dec. 2017), 32 pages. https://doi.org/10.1145/3158135
Fan Long and Martin Rinard. 2015. Staged Program Repair with Condition Synthesis. In Proceedings of the 2015 10th
Joint Meeting on Foundations of Software Engineering (ESEC/FSE ’15). ACM, New York, NY, USA, 166–178.
https://doi.org/10.1145/2786805.2786811
Fan Long and Martin Rinard. 2016. Automatic Patch Generation by Learning Correct Code. SIGPLAN Not. 51, 1 (Jan. 2016),
298–312. https://doi.org/10.1145/2914770.2837617
Sebastian R. Lamelas Marcote and Martin Monperrus. 2015. Automatic Repair of Infinite Loops. CoRR abs/1504.05078 (2015).
arXiv:1504.05078 http://arxiv.org/abs/1504.05078
Sergey Mechtaev, Jooyong Yi, and Abhik Roychoudhury. 2015. DirectFix: Looking for Simple Program Repairs. In Proceedings
of the 37th International Conference on Software Engineering - Volume 1 (ICSE ’15). IEEE Press, Piscataway, NJ, USA,
448–458. http://dl.acm.org/citation.cfm?id=2818754.2818811
Sergey Mechtaev, Jooyong Yi, and Abhik Roychoudhury. 2016. Angelix: Scalable Multiline Program Patch Synthesis via
Symbolic Analysis. In Proceedings of the 38th International Conference on Software Engineering (ICSE ’16). ACM, New
York, NY, USA, 691–701. https://doi.org/10.1145/2884781.2884807
Nicholas Nethercote and Julian Seward. 2007. Valgrind: A Framework for Heavyweight Dynamic Binary Instrumentation.
SIGPLAN Not. 42, 6 (June 2007), 89–100. https://doi.org/10.1145/1273442.1250746
Hoang Duong Thien Nguyen, Dawei Qi, Abhik Roychoudhury, and Satish Chandra. 2013. SemFix: Program Repair via
Semantic Analysis. In Proceedings of the 2013 International Conference on Software Engineering (ICSE ’13). IEEE Press,
Piscataway, NJ, USA, 772–781. http://dl.acm.org/citation.cfm?id=2486788.2486890
Pavel Panchekha, Alex Sanchez-Stern, James R. Wilcox, and Zachary Tatlock. 2015. Automatically Improving Accuracy for
Floating Point Expressions. In Proceedings of the 36th ACM SIGPLAN Conference on Programming Language Design and
Implementation (PLDI ’15). ACM, New York, NY, USA, 1–11. https://doi.org/10.1145/2737924.2737959
Yuhua Qi, Xiaoguang Mao, Yan Lei, Ziying Dai, and Chengsong Wang. 2014. The Strength of Random Search on Automated
Program Repair. In Proceedings of the 36th International Conference on Software Engineering (ICSE ’14). ACM, New York,
NY, USA, 254–265. https://doi.org/10.1145/2568225.2568254
Alex Sanchez-Stern, Pavel Panchekha, Sorin Lerner, and Zachary Tatlock. 2018. Finding Root Causes of Floating Point Error.
In Proceedings of the 39th ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI ’18).
ACM, New York, NY, USA, 256–269. https://doi.org/10.1145/3192366.3192411
Eric Schkufza, Rahul Sharma, and Alex Aiken. 2014. Stochastic Optimization of Floating-point Programs with Tunable
Precision. In Proceedings of the 35th ACM SIGPLAN Conference on Programming Language Design and Implementation
(PLDI ’14). ACM, New York, NY, USA, 53–64. https://doi.org/10.1145/2594291.2594302
Pat H Sterbenz. 1973. Floating-point computation. Prentice Hall, Englewood Cliffs, N.J.
Rainer Storn and Kenneth Price. 1997. Differential Evolution – A Simple and Efficient Heuristic for Global Optimization over
Continuous Spaces. Journal of Global Optimization 11, 4 (01 Dec 1997), 341–359. https://doi.org/10.1023/A:1008202821328
Enyi Tang, Xiangyu Zhang, Norbert Th. Muller, Zhenyu Chen, and Xuandong Li. 2017. Software Numerical Instability
Detection and Diagnosis by Combining Stochastic and Infinite-Precision Testing. IEEE Transactions on Software Engineering
43, 10 (Oct 2017), 975–994. https://doi.org/10.1109/TSE.2016.2642956
David J. Wales and Jonathan P. K. Doye. 1997. Global Optimization by Basin-Hopping and the Lowest Energy Structures of
Lennard-Jones Clusters Containing up to 110 Atoms. The Journal of Physical Chemistry A 101, 28 (1997), 5111–5116.
https://doi.org/10.1021/jp970984n
Ran Wang, Daming Zou, Xinrui He, Yingfei Xiong, Lu Zhang, and Gang Huang. 2016. Detecting and Fixing Precision-specific
Operations for Measuring Floating-point Errors. In Proceedings of the 2016 24th ACM SIGSOFT International Symposium
on Foundations of Software Engineering (FSE ’16). ACM, New York, NY, USA, 619–630. https://doi.org/10.1145/2950290.2950355
Westley Weimer, ThanhVu Nguyen, Claire Le Goues, and Stephanie Forrest. 2009. Automatically Finding Patches Using
Genetic Programming. In Proceedings of the 31st International Conference on Software Engineering (ICSE ’09). IEEE
Computer Society, Washington, DC, USA, 364–374. https://doi.org/10.1109/ICSE.2009.5070536
Jifeng Xuan, Matias Martinez, Favio DeMarco, Maxime Clement, Sebastian Lamelas Marcote, Thomas Durieux, Daniel
Le Berre, and Martin Monperrus. 2017. Nopol: Automatic Repair of Conditional Statement Bugs in Java Programs. IEEE
Trans. Softw. Eng. 43, 1 (Jan. 2017), 34–55. https://doi.org/10.1109/TSE.2016.2560811
Xin Yi, Liqian Chen, Xiaoguang Mao, and Tao Ji. 2017a. Automated Repair of High Inaccuracies in Numerical Programs. In
2017 IEEE International Conference on Software Maintenance and Evolution (ICSME ’17). 514–518.
https://doi.org/10.1109/ICSME.2017.45
Xin Yi, Liqian Chen, Xiaoguang Mao, and Tao Ji. 2017b. Efficient Global Search for Inputs Triggering High Floating-Point
Inaccuracies. In 2017 24th Asia-Pacific Software Engineering Conference (APSEC ’17). 11–20.
https://doi.org/10.1109/APSEC.2017.7
Daming Zou, Ran Wang, Yingfei Xiong, Lu Zhang, Zhendong Su, and Hong Mei. 2015. A Genetic Algorithm for Detecting
Significant Floating-point Inaccuracies. In Proceedings of the 37th International Conference on Software Engineering -
Volume 1 (ICSE ’15). IEEE Press, Piscataway, NJ, USA, 529–539. http://dl.acm.org/citation.cfm?id=2818754.2818820