Eccentric self-forced inspirals into a rotating black hole

Philip Lynch; Maarten van de Meent; Niels Warburton

doi:10.1088/1361-6382/ac7507

1. Introduction

The detection of the first gravitational wave signal [1] ushered in a new era of astronomy, with ground-based observatories having now observed just over a hundred signals [2–4]. The next generation of space-based detectors, such as the laser interferometer space antenna (LISA), will probe the previously inaccessible millihertz band of the gravitational wave spectrum, allowing for the detection of hitherto unseen gravitational wave sources. Among these sources are extreme mass-ratio inspirals (EMRIs), which consist of a stellar mass compact object (such as a black hole or neutron star) spiralling into a supermassive black hole. These systems are characterised by their extremely small mass ratio, typically between 10⁻⁴ and 10⁻⁷. Unlike the signals detected by ground-based detectors, EMRIs will radiate in the LISA frequency band for up to hundreds of thousands of orbital cycles [5]. They are also expected to be eccentric and precessing, potentially resulting in multi-year long waveforms with rich and complex morphology [6]. These signals encode the spacetimes of supermassive black holes, promising exquisite parameter estimation and some of the most sensitive probes for new physics beyond general relativity [7].

The majority of EMRIs will have a very low instantaneous signal-to-noise ratio (SNR), and so the data must be processed with matched filtering techniques which will allow for the build up of the SNR over time [8]. Such techniques require the development of theoretical waveform templates to compare against the data. To achieve LISA's science objectives, these templates need their phase to be accurate to within a fraction of a radian, even after hundreds of thousands of orbital cycles. They also need to be extensive across the large parameter space of possible EMRI configurations. Moreover, since many template evaluations would be needed, they should also be fast to compute, ideally in less than a second.

To meet this challenge, several so-called 'kludge' models have been developed, which are both extensive and quick to compute [9–12]. However, they also make use of non-relativistic approximations which cause them to fall short of the accuracy requirement, though they may still be sufficient for the detection of loud EMRI signals [13]. Despite their shortcomings, these models are invaluable for testing data analysis techniques for LISA through the mock data challenges [14–16]. In order to detect more EMRI signals, relativistic, 'adiabatic' waveforms are required. The adiabatic trajectory can be calculated by balancing the fluxes of energy and angular momentum through null infinity the event horizon with the energy and angular momentum lost by the secondary throughout the inspiral. This has been implemented in a practical framework for non-spinning, eccentric EMRIs [17, 18]. While this represents a significant accuracy improvement over the kludge models, work remains to be done to extend this framework to generic inspirals into rotating black holes.

To detect all EMRI signals and enable precision parameter estimation, requires 'post-adiabatic' waveforms. Producing such waveforms requires calculating the local force experienced by the secondary due to the presence of its own gravitational field, known as the gravitational self-force (GSF). This can be calculated perturbatively by expanding the field equations in powers of the small mass ratio of the binary, which makes this approach ideally suited to EMRIs. Post-adiabatic EMRI waveforms will require not only full knowledge of the first-order self-force, but also the orbit averaged contribution of the second-order self-force [19, 20].

At each instant, the self-force is a functional of the past history of the secondary which can make it challenging to compute. One approach is to couple the field equations and the equations of motion and self-consistently solve both in a time-domain simulation. While this has been implemented for a toy model of a particle carrying a scalar charge orbiting a Schwarzschild black hole [21], numerical stability issues have so far stifled similar attempts for the gravitational case [22]. Moreover, this approach is computationally very expensive, making it impractical for generating large numbers of templates. However, it does promise waveforms against which more efficient schemes should be tested.

An alternative method is to compute the self-force for a body moving along fixed geodesics of the background spacetime and then use that force to move to another geodesic at a later timestep. The periodic nature of these geodesic orbits allows for calculations in frequency domain leading to many efficient calculations of the first-order self-force in both Schwarzschild [23–26] and Kerr [27, 28] spacetimes. Second-order calculations are also emerging, though at present these are restricted to quasi-circular inspirals into non-rotating black holes [29, 30]. These calculations can be repeated across the parameter space of bound geodesics and then interpolated in a preprocessing step, as has been done for eccentric inspirals into a Schwarzschild black hole [31, 32]. One of the goals of this work is to compute self-forced inspirals into rotating (Kerr) black holes.

With an interpolated model of the geodesic self-force in hand, one can restate the EMRI equations of motion in a more convenient form for numerical integration using the method of 'osculating orbital elements' (or 'osculating geodesics' when applied to the relativistic context). In this approach, the inspiral is described as a smooth evolution through neighbouring geodesics that are instantaneously tangent to the true inspiral. Formally, one identifies a set of constants of motion which uniquely identify a geodesic, known as 'orbital elements'. These constants are then promoted to functions of time which are governed by a set coupled ordinary differential equations that are derived from the 'osculating conditions'. These new equations of motion are then solved numerically to obtain the inspiral trajectory of the secondary. There are a number of equivalent formulations for these ODEs which have been derived for both Schwarzschild [33] and Kerr [34, 35] inspirals. In this work, we make use of a formulation based on action angles of the geodesic motion that was sketched out in reference [34].

However, for all of these formulations, the resulting equations of motion depend on the orbital phases. This means that the numerical integrator will have to resolve features on the orbital timescale, requiring the use of many small time steps. Since a typical EMRI undergoes $\sim 1{0}^{4}-1{0}^{6}$ orbital cycles during a radiation reaction timescale, this results in computational times of minutes to hours for a single inspiral, depending on the orbital configuration.

Following reference [36] (hereafter paper I), we overcome this problem by applying a near-identity (averaging) transformation (NIT) [37] to the self-forced equations of motion. These transformed equations have two important properties: (i) they no longer depend on the orbital phase, and (ii) they capture the long-term secular evolution of the original inspiral to the same order of approximation in the mass ratio as the original equations of motion. The first property means the transformed equation of motion can be numerically solved for any mass ratio in less than a second as the numerical integrator no longer needs to resolve the thousands of oscillations on the orbital timescale.

This approach has been applied to the case of eccentric inspirals into non-rotating black holes [36, 38]. In this work we apply the NITs to orbital motion in the Kerr spacetime. Combined with an interpolated model of first-order GSF data, these averaged equations of motion allow us to efficiently compute inspirals around a Kerr black hole which, for the first time, include all first order in the mass ratio effects. Since these inspirals are fast to compute, this approach can provide EMRI waveforms useful for practical data analysis when coupled to a fast waveform generation scheme, e.g., [17].

At this point, we emphasize that the inspirals we present do not reach the sub-radian accuracy required for EMRI data analysis as there are not yet any second order self-force calculations in this domain. To stress the importance of the second order contribution we examine the effects of driving the inspirals using first-order self-forces computed in two different gauges and demonstrate explicitly that without the inclusion of the second-order self-force the inspiral phase is not gauge invariant. Nonetheless, the framework we present can readily incorporate new self-force results as they become available.

This paper is laid out as follows. In section 2, we review geodesic motion and introduce our action angle formulation of the osculating geodesic equations for generic Kerr orbits. We end this section by specialising to equatorial (i.e., spin aligned) orbits for the rest of this work. Using these equations, along with a model for the GSF, allows us to calculate eccentric, self-forced, inspirals in Kerr spacetime for the first time. However, these inspirals, henceforth referred to as 'osculating geodesic' (OG) inspirals, are slow to compute, taking on the order of hours or days.

This motivates us to apply a near identity transformation, as developed in paper I, which we summarize in section 3.1. In section 3.2 we explicitly derive the averaged equations of motion for the case of eccentric Kerr inspirals. Inspirals calculated with these equations of motion can be evaluated in less than a second, and are henceforth referred to as 'NIT' inspirals.

For both OG and NIT inspirals, we require a model for the GSF. To this end, we review the calculation of the GSF in the radiation gauge in section 4.1 before outlining a procedure used to interpolate this data in section 4.2. Using this interpolated self force model we describe our numerical implementation for calculating the various terms in the NIT equations of motion in section 5.

We then present the results of this implementation in section 6. We start with a consistency check with energy and angular momentum fluxes and an interpretation of the various terms in the NIT equations of motion in section 6.1. We then compare OG and NIT inspirals to check that they agree at the appropriate order in the mass-ratio and assess the speed up the NIT provides. We then explore some post-adiabatic effects of the first-order self force by comparing NIT and adiabatic inspirals in section 6.3. To round off our results, in section 6.4 we make comparisons between inspiral trajectories around a Schwarzschild black hole calculated using two different first order self-force models: one calculated using outgoing radiation gauge self-force data and the other using Lorenz gauge self-force data. These comparisons indicate that trajectories calculated using only first order self-force data are gauge dependent. We end with some concluding remarks in section 7.

Throughout this paper we work in geometric units such that the gravitational constant and the speed of light are both equal to one (i.e., G = c = 1).

2. Forced motion near a rotating black hole

In this section we describe the motion of a non-spinning compact object of mass μ moving in the Kerr spacetime under the influence of some arbitrary force. Later in this work, we will take this to be the self-force experienced by the secondary due to its interaction with its own metric perturbation. We denote the mass of the primary by M and parameterize its spin by a = |J|/M where J is the angular momentum of the black hole. The Kerr metric can then be written inmodified Boyer–Lindquist coordinates, x^α = {t, r, z = cos θ, ϕ}, as

$\begin{equation}\begin{aligned}\hfill \mathrm{d}{s}^{2}& =-\left(1-\frac{2Mr}{{\Sigma}}\right)\mathrm{d}{t}^{2}+\frac{{\Sigma}}{{\Delta}}\mathrm{d}{r}^{2}+\frac{{\Sigma}}{1-{z}^{2}}\mathrm{d}{z}^{2}\hfill \\ \hfill & \quad +\frac{1-{z}^{2}}{{\Sigma}}(2{a}^{2}r(1-{z}^{2})+{\Sigma}{\varpi }^{2})\mathrm{d}{\phi }^{2}-\frac{4Mar(1-{z}^{2})}{{\Sigma}}\mathrm{d}t\mathrm{d}\phi ,\hfill \end{aligned}\end{equation} \tag{ 1 }$

where

$\begin{equation}{\Delta}(r){:=}{r}^{2}+{a}^{2}-2Mr,\qquad {\Sigma}(r,z){:=}{r}^{2}+{a}^{2}{z}^{2},\qquad \varpi (r){:=}\sqrt{{r}^{2}+{a}^{2}}.\end{equation} \tag{ 2a-c }$

If a force acts upon the secondary its motion can be described by the forced geodesic equation

$\begin{equation}{u}^{\beta }{\mathrm{\nabla }}_{\beta }{u}^{\alpha }={a}^{\alpha },\end{equation} \tag{ 3 }$

where u^α = dx^α/dτ is the secondary's four-velocity, ∇_β is the covariant derivative with respect to the Kerr metric, and a^α is the secondary's four-acceleration. We seek to recast equation (3) into a form useful for applying the near-identity transformations. Before considering the forced equation it is useful to first examine the geodesic limit.

2.1. Geodesic motion and orbital parameterization

In the absence of any perturbing force, the secondary will follow a geodesic, i.e.,

$\begin{equation}{u}^{\beta }{\nabla }_{\beta }{u}^{\alpha }=0.\end{equation} \tag{ 4 }$

The symmetries of the Kerr spacetime allow for the identification of integrals of motion $\vec{\mathcal{P}}=\left\{\mathcal{E},\mathcal{L},\mathcal{K}\right\}$ given by

$\begin{equation}\mathcal{E}=-{u}_{t},\qquad \mathcal{L}={u}_{\phi },\qquad \mathcal{K}={\mathcal{K}}^{\alpha \beta }{u}_{\alpha }{u}_{\beta },\end{equation} \tag{ 5a-c }$

where ${\mathcal{K}}^{\alpha \beta }$ is the Killing tensor, $\mathcal{E}$ is the orbital energy per unit rest mass μ, $\mathcal{L}$ is the z-component of the angular momentum divided by μ and $\mathcal{K}$ is the Carter constant divided by μ² [39]. This definition of the Carter constant is related to another common definition of the Carter constant, $\mathcal{Q}$ , by

$\begin{equation}\mathcal{Q}=\mathcal{K}\;-\;{(\mathcal{L}\;-a\mathcal{E})}^{2}.\end{equation} \tag{ 6 }$

The geodesic equation can be written explicitly in terms of these integrals of motion [40]:

$\begin{equation}\begin{aligned}\hfill {\left(\frac{\mathrm{d}r}{\mathrm{d}\lambda }\right)}^{2}& ={\left(\mathcal{E}{\varpi }^{2}-a\mathcal{L}\right)}^{2}-{\Delta}\left({r}^{2}+\mathcal{K}\right)\hfill \\ \hfill & =(1-{\mathcal{E}}^{2})({r}_{1}-r)(r-{r}_{2})(r-{r}_{3})(r-{r}_{4}){:=}{V}_{r},\hfill \end{aligned}\end{equation} \tag{ 7a }$

$\begin{equation}\begin{aligned}\hfill {\left(\frac{\mathrm{d}z}{\mathrm{d}\lambda }\right)}^{2}& =\mathcal{Q}-{z}^{2}\left({a}^{2}(1-{\mathcal{E}}^{2})(1-{z}^{2})+{\mathcal{L}}^{2}+\mathcal{Q}\right)\hfill \\ \hfill & =({z}^{2}-{z}_{-}^{2})\left({a}^{2}(1-{\mathcal{E}}^{2}){z}^{2}-{z}_{+}^{2}\right){:=}{V}_{z},\hfill \end{aligned}\end{equation} \tag{ 7b }$

$\begin{equation}\frac{\mathrm{d}t}{\mathrm{d}\lambda }=\frac{{\varpi }^{2}}{{\Delta}}\left(\mathcal{E}{\varpi }^{2}-a\mathcal{L}\right)-{a}^{2}\mathcal{E}(1-{z}^{2})+a\mathcal{L}{:=}{s}_{t},\end{equation} \tag{ 7c }$

$\begin{equation}\frac{\mathrm{d}\phi }{\mathrm{d}\lambda }=\frac{a}{{\Delta}}\left(\mathcal{E}{\varpi }^{2}-a\mathcal{L}\right)+\frac{\mathcal{L}}{1-{z}^{2}}-a\mathcal{E}{:=}{s}_{\phi },\end{equation} \tag{ 7d }$

where r₁ > r₂ > r₃ > r₄ are the roots of the radial potential V_r, z₊ > z₋ are the roots of the polar potential V_z, and λ is Mino(–Carter) time that decouples the radial and polar equations [41]. This time is related to the proper time of the particle, τ, by

$\begin{equation}\mathrm{d}\tau ={\Sigma}\mathrm{d}\lambda .\end{equation} \tag{ 8 }$

The two largest roots of V_r correspond to the apoapsis and periapsis of the orbit, respectively. Explicit expressions for the other roots are derived in reference [42] and given for completeness in appendix A. Rather than parameterize an orbit by the set $\left\{\mathcal{E},\mathcal{L},\mathcal{K}\right\}$ it is useful to instead use more geometric, quasi-Keplerian constants $\vec{P}=\left\{p,e,x\right\}$ . Here p is the semi-latus rectum, e is the orbital eccentricity and x measures the orbital inclination. These are related to the radial and polar roots via

$\begin{equation}{r}_{1}=\frac{pM}{1-e},\qquad {r}_{2}=\frac{pM}{1+e},\qquad {z}_{-}=\sqrt{1-{x}^{2}}.\end{equation} \tag{ 9a-c }$

The explicit relation between the integrals of motion $\left\{\mathcal{E},\mathcal{L},\mathcal{K}\right\}$ and {r₁, r₂, z₋} can be found in, e.g., appendix B of reference [43]. We note that one advantage of using x over other common choices for inclination is that x smoothly parameterizes orbits between prograde equatorial motion with x = 1 to retrograde equatorial motion with x = −1 orbits. Not all values of {p, e, x} correspond to bound geodesics and we denote the value of p at the last stable orbit by p_LSO(a, e, x) [44, 45].

In order to later apply the near-identity transformations it will be useful to employ action-angle formulation to parameterize the orbital motion [42, 43, 46]. In this description the orbital phases $\vec{q}=\left\{{q}_{r},{q}_{z}\right\}$ are such that the geodesic equations can written in the form

$\begin{equation}\dot{{P}_{j}}=0\quad \text{and}\quad \dot{{q}_{i}}={{\Upsilon}}_{i}(\vec{P}),\end{equation} \tag{ 10a-b }$

where an overdot denotes derivative with respect to Mino time and the ϒ_i are the Mino time fundamental frequencies [42]. Note that the right-hand side of the ${\dot{q}}_{i}$ equation depends only on $\vec{P}$ and not also on the orbital phases. Semi-analytic solutions to equation (7) in terms of $\vec{q}$ can be found in references [42, 46] and we present the key equations in appendix A.

At this point we note that it is also common in the literature [47, 48] to express the radial and polar motion in terms of quasi-Keplerian angles, ψ and χ, via

$\begin{equation}r(\psi )=\frac{pM}{1+e\mathrm{cos}(\psi )}\quad \text{and}\quad z(\chi )={z}_{-}\mathrm{c}\mathrm{o}\mathrm{s}(\chi ).\end{equation} \tag{ 11a-b }$

With this parameterization the evolution equations for ψ and χ depend on both $\vec{P}$ and $\vec{q}$ . This makes them inconvenient for deriving averaging transformations, though it is not an insurmountable challenge [35]. For this reason we prefer the action-angle formulation.

2.2. Osculating geodesics

We now wish to describe the forced motion of a body obeying equation (3). To do so, we make use of the method of osculating orbital elements, or OGs when applied to the relativistic context [33, 34]. We first identify a set of orbital elements that uniquely identify a given geodesic, such as the integrals of motion $\vec{P}$ along with the initial values of the orbital phases of the geodesic orbit ${\vec{q}}_{0}$ and designate them as a set of 'orbital elements' $\vec{I}=\left\{\vec{P},{\vec{q}}_{0}\right\}$ . For accelerated orbits, these orbital elements are promoted from constants to functions of Mino time. Note that now the orbital elements ${\vec{q}}_{0}(\lambda )$ are different quantities from the values of $\vec{q}$ evaluated at λ = 0, i.e., $\vec{q}(0)$ . We assume that the worldline and four-velocity of the secondary at each instant can be described as the worldline and four-velocity of a test body on a tangent geodesic, i.e., ${x}^{\alpha }(\lambda )={x}_{\mathrm{G}}^{\alpha }({I}^{A}(\lambda ),\lambda )$ and ${u}^{\alpha }(\lambda )={u}_{\mathrm{G}}^{\alpha }({I}^{A}(\lambda ),\lambda )$ . From these assumptions, one can derive the OG equations [33, 34]:

$\begin{equation}\frac{\partial {x}_{\mathrm{G}}^{\alpha }}{\partial {I}^{A}}{\dot{I}}^{A}=0\quad \text{and}\quad \frac{\partial {u}_{\alpha }^{\mathrm{G}}}{\partial {I}^{A}}{\dot{I}}^{A}={\Sigma}{a}_{\alpha }.\end{equation} \tag{ 12a-b }$

From these, evolution equations for each of the orbital elements can be calculated, which we have done in appendices B and C. This was first done for generic Kerr orbits in reference [34] which lays out four different formulations of the equations. Three of these formulations use the quasi-Keplerian angles ψ and χ as the orbital phases, and two of these were numerically implemented and shown to agree with each other. We make use of the final formulation where one instead uses the geodesic actions angels q_r and q_z as the orbital phases.

We first find the evolution equations for the integrals of motion $\vec{P}$ . We do so by finding evolution equations for $\vec{\mathcal{P}}$ and relating these to evolution equations for the roots r₁, r₂ and z₋. We derive these relations in appendix B. From here, one can invert equations (9) to find

$\begin{equation}\frac{\mathrm{d}p}{\mathrm{d}\lambda }=\frac{2}{M{({r}_{1}+{r}_{2})}^{2}}\left({r}_{2}^{2}\frac{\mathrm{d}{r}_{1}}{\mathrm{d}\lambda }+{r}_{1}^{2}\frac{\mathrm{d}{r}_{2}}{\mathrm{d}\lambda }\right){:=}{F}_{p},\end{equation} \tag{ 13a }$

$\begin{equation}\frac{\mathrm{d}e}{\mathrm{d}\lambda }=\frac{2}{{({r}_{1}+{r}_{2})}^{2}}\left({r}_{2}\frac{\mathrm{d}{r}_{1}}{\mathrm{d}\lambda }-{r}_{1}\frac{\mathrm{d}{r}_{2}}{\mathrm{d}\lambda }\right){:=}{F}_{e},\end{equation} \tag{ 13b }$

$\begin{equation}\frac{\mathrm{d}x}{\mathrm{d}\lambda }=-\frac{{z}_{-}}{x}\frac{\mathrm{d}{z}_{-}}{\mathrm{d}\lambda }\;{:=}{F}_{x}.\end{equation} \tag{ 13c }$

The orbital phases $\vec{q}$ still evolve with their respective Mino-time frequencies, but now pick up a correction due to the evolution of the initial values, i.e.,

$\begin{equation}\frac{\mathrm{d}{q}_{i}}{\mathrm{d}\lambda }={{\Upsilon}}_{i}+\frac{\mathrm{d}{q}_{i,0}}{\mathrm{d}\lambda }.\end{equation} \tag{ 14 }$

To find the evolution equations for the initial values for the orbital phases, we can re-arrange the first osculating condition (12a) and exploit the fact that the evolution of r is independent of q_z, and the evolution z is independent of q_r, to get

$\begin{equation}\frac{\mathrm{d}{q}_{i,0}}{\mathrm{d}\lambda }=-\frac{1}{\partial {x}_{\mathrm{G}}^{i}/\partial {q}_{i}}\left(\frac{\partial {x}_{\mathrm{G}}^{i}}{\partial {P}_{j}}\frac{\mathrm{d}{P}_{j}}{\mathrm{d}\lambda }\right){:=}{f}_{i}^{(1)},\end{equation} \tag{ 15 }$

where ${x}_{\mathrm{G}}^{i}$ is the geodesic expression for r or z given by equations (A3) and (A4) respectively. Unfortunately, this expression is difficult to evaluate numerically at the orbital turning points where both $\partial {x}_{\mathrm{G}}^{i}/\partial {q}_{i}$ and the term in the parentheses go to zero. However, one can derive an alternative expression for this quantity that is regular at the turning points [34]:

$\begin{equation}\frac{\mathrm{d}{q}_{i,0}}{\mathrm{d}\lambda }=\frac{2{{\Upsilon}}_{i}}{\partial {V}_{i}/\partial {x}_{\mathrm{G}}^{i}}\left({{\Sigma}}^{2}{a}^{i}-\left(\frac{\partial {\dot{x}}_{\mathrm{G}}^{i}}{\partial {P}_{j}}\dot{{P}_{j}}\right)\right){:=}{f}_{i}^{(1)}.\end{equation} \tag{ 16 }$

See appendix C for details on the derivation. This expression instead has a singularity whenever $\partial {V}_{i}/\partial {x}_{\mathrm{G}}^{i}=0$ . Thus, for our numerical implementation, we use equation (15) for the majority of the orbital cycle and switch to equation (16) in the vicinity of turning points.

Finally, we also require evolution equations for 'extrinsic quantities' that do not show up on the right-hand side of the equations of motion, but are necessary to compute the trajectory and the waveform. These are the time and azimuthal coordinates of the secondary which, as a set, we denote by $\vec{S}=\left\{t,\phi \right\}$ . Their evolution is given by the geodesic equations for t and ϕ, i.e., equations (7c) and (7d). Putting it all together, the equations of motion take the form:

$\begin{equation}\dot{{P}_{j}}={F}_{j}(\vec{P},\vec{q}),\end{equation} \tag{ 17a }$

$\begin{equation}\dot{{q}_{i}}={{\Upsilon}}_{i}(\vec{P})+{f}_{i}(\vec{P},\vec{q}),\end{equation} \tag{ 17b }$

$\begin{equation}\dot{{S}_{k}}={s}_{k}(\vec{P},\vec{q}).\end{equation} \tag{ 17c }$

These equations of motion are valid for generic inspirals under the influence of an unspecified perturbing force. We find that the action angle implementation produces inspiral trajectories that are identical to inspirals calculated using the 'null tetrad' formulation described in reference [34]. We have implemented both the action angle and null tetrad osculating element equations into a Mathematica package that will be publicly available on the Black Hole Perturbation Toolkit [49]. We find numerically that the null tetrad formulation is more computationally efficient as it does not have any singular equations that necessitate switching between different formulations twice during each orbital period. As such, for direct comparisons between OG and NIT inspirals and waveforms we make use of the null tetrad formulation, but use the action angle formulation as the starting point for our averaging procedure.

2.3. Specialising to equatorial motion

For the rest of this work, we specialize to the case of eccentric inspirals in the equatorial plane (i.e., spin aligned) under the influence of the first-order ratio gravitational self force (GSF). This corresponds to setting x = ±1 for prograde and retrograde orbits, respectively. Due to symmetry, motion in the equatorial plane will stay in the equatorial plane, and thus $\dot{x}=0$ . As such, we only need to track the evolution of $\vec{P}=\left\{p,e\right\}$ . Similarly, the equations of motion no longer depend on the polar phase q_z, and so we only need to evolve the radial phase $\vec{q}=\left\{{q}_{r}\right\}$ . The GSF scales with the small mass ratio = μ/M, meaning that the secondary's four-acceleration can be expressed as ${a}^{\alpha }={\epsilon}{a}_{\text{GSF}}^{\alpha }$ . Factoring out this scaling, the equations of motion for equatorial inspirals become

$\begin{equation}\frac{\mathrm{d}p}{\mathrm{d}\lambda }={\epsilon}{F}_{p}(a,p,e,{q}_{r}),\end{equation} \tag{ 18a }$

$\begin{equation}\frac{\mathrm{d}e}{\mathrm{d}\lambda }={\epsilon}{F}_{e}(a,p,e,{q}_{r}),\end{equation} \tag{ 18b }$

$\begin{equation}\frac{\mathrm{d}{q}_{r}}{\mathrm{d}\lambda }={{\Upsilon}}_{r}(a,p,e)+{\epsilon}{f}_{r}(a,p,e,{q}_{r}),\end{equation} \tag{ 18c }$

$\begin{equation}\frac{\mathrm{d}t}{\mathrm{d}\lambda }={s}_{t}(a,p,e,{q}_{r}),\end{equation} \tag{ 18d }$

$\begin{equation}\frac{\mathrm{d}\phi }{\mathrm{d}\lambda }={s}_{\phi }(a,p,e,{q}_{r}).\end{equation} \tag{ 18e }$

3. Near-identity transformations

Near identity (averaging) transformations. (NITs) are well known technique in applied mathematics and celestial mechanics [37]. This technique involves making small transformations to the equations of motion, such that the short timescale physics is averaged out, while retaining information about the long term evolution of a system. This is well suited to modelling EMRIs, where we do not require perfect knowledge of the trajectory on the orbital timescale, so long as we can accurately track its evolution on the radiation reaction timescale. We note that there can be a relationship between gauge transformations and NITs, as explored in reference [50], the NITs we perform in this work are distinct from the choice of gauge [51]. In paper I [36], these transformations were derived for EMRIs in general and then applied to the OG equations of motion around a Schwarzschild black hole. We briefly review the work of paper I for a general EMRI before applying these transformations to the specific case of eccentric self-forced inspirals in Kerr.

3.1. Near identity averaging transformations for generic EMRI systems

The NIT variables, ${\tilde{P}}_{j}$ , ${\tilde{q}}_{i}$ and ${\tilde{S}}_{k}$ , are related to the OG variables P_j, q_i and S_k via

$\begin{equation}{\tilde{P}}_{j}={P}_{j}+{\epsilon}{Y}_{j}^{(1)}(\vec{P},\vec{q})+{{\epsilon}}^{2}{Y}_{j}^{(2)}(\vec{P},\vec{q})+\mathcal{O}({{\epsilon}}^{3}),\end{equation} \tag{ 19a }$

$\begin{equation}{\tilde{q}}_{i}={q}_{i}+{\epsilon}{X}_{i}^{(1)}(\vec{P},\vec{q})+{{\epsilon}}^{2}{X}_{i}^{(2)}(\vec{P},\vec{q})+\mathcal{O}({{\epsilon}}^{3}),\end{equation} \tag{ 19b }$

$\begin{equation}{\tilde{S}}_{k}={S}_{k}+{Z}_{k}^{(0)}(\vec{P},\vec{q})+{\epsilon}{Z}_{i}^{(1)}(\vec{P},\vec{q})+\mathcal{O}({{\epsilon}}^{2}).\end{equation} \tag{ 19c }$

Here, the transformation functions ${Y}_{j}^{(n)}$ , ${X}_{i}^{(n)}$ , and ${Z}_{k}^{(n)}$ are required to be smooth, periodic functions of the orbital phases $\vec{q}$ . At leading order, equation (19) are identity transformations for P_k and q_i but not for S_k due to the presence of a zeroth order transformation term ${Z}_{k}^{(0)}$ . The inverse transformations can be found for P_k and q_i by requiring that their composition with the transformations in equation (19) must give the identity transformation. Expanding order by order in , this gives us

$\begin{equation}\begin{array}{c} {P}_{j}=\tilde{{P}_{j}}-{\epsilon}{Y}_{j}^{(1)}(\overrightarrow{\tilde{P}},\overrightarrow{\tilde{q}})\\ \quad -{{\epsilon}}^{2}\left({Y}_{j}^{(2)}(\overrightarrow{\tilde{P}},\overrightarrow{\tilde{q}})-\frac{\mathrm{\partial }{Y}_{j}^{(1)}(\overrightarrow{\tilde{P}},\overrightarrow{\tilde{q}})}{\mathrm{\partial }\tilde{{P}_{k}}}{Y}_{k}^{(1)}(\overrightarrow{\tilde{P}},\overrightarrow{\tilde{q}})-\frac{\mathrm{\partial }{Y}_{j}^{(1)}(\overrightarrow{\tilde{P}},\overrightarrow{\tilde{q}})}{\mathrm{\partial }\tilde{{q}_{k}}}{X}_{k}^{(1)}(\overrightarrow{\tilde{P}},\overrightarrow{\tilde{q}})\right)\\ \quad +\mathcal{O}({{\epsilon}}^{3}),\end{array}\end{equation} \tag{ 20a }$

$\begin{equation}\begin{array}{c} {q}_{i}=\tilde{{q}_{i}}-{\epsilon}{X}_{i}^{(1)}(\overrightarrow{\tilde{P}},\overrightarrow{\tilde{q}})\\ \quad -{{\epsilon}}^{2}\left({X}_{i}^{(2)}(\overrightarrow{\tilde{P}},\overrightarrow{\tilde{q}})-\frac{\mathrm{\partial }{X}_{i}^{(1)}(\overrightarrow{\tilde{P}},\overrightarrow{\tilde{q}})}{\mathrm{\partial }\tilde{{P}_{j}}}{Y}_{j}^{(1)}(\overrightarrow{\tilde{P}},\overrightarrow{\tilde{q}})-\frac{\mathrm{\partial }{X}_{i}^{(1)}(\overrightarrow{\tilde{P}},\overrightarrow{\tilde{q}})}{\mathrm{\partial }\tilde{{q}_{k}}}{X}_{k}^{(1)}(\overrightarrow{\tilde{P}},\overrightarrow{\tilde{q}})\right)\\ \quad +\mathcal{O}({{\epsilon}}^{3}).\end{array}\end{equation} \tag{ 20b }$

To proceed it is useful to decompose various functions into Fourier series where we use the convention:

$\begin{equation}A(\vec{P},\vec{q})=\sum\limits _{\vec{\kappa }\in {\mathbb{Z}}^{N}}{A}_{\vec{\kappa }}(\vec{P}){\text{e}}^{\text{i}\vec{\kappa }\cdot \vec{q}},\end{equation} \tag{ 21 }$

where N is the number of orbital phases. Based on this, we can split the function into an averaged piece

$\begin{equation}\left\langle A\right\rangle (\vec{P})={A}_{\vec{0}}(\vec{P})=\frac{1}{{(2\pi )}^{N}}\int \cdots {\int }_{\vec{q}}A(\vec{P},\vec{q})d{q}_{1}\dots d{q}_{N},\end{equation} \tag{ 22 }$

and an oscillating piece

$\begin{equation}\breve{A}(\vec{P},\vec{q})=A(\vec{P},\vec{q})-\left\langle A\right\rangle (\vec{P})=\sum\limits _{\vec{\kappa }\ne \vec{0}}{A}_{\vec{\kappa }}(\vec{P}){\text{e}}^{\text{i}\vec{\kappa }\cdot \vec{q}}.\end{equation} \tag{ 23 }$

Using the above transformations along with the equations of motion, and working order by order in , we can chose values for the transformation functions ${Y}_{j}^{(1)},{Y}_{j}^{(2)},{X}_{i}^{(1)},{X}_{i}^{(2)},{Z}_{k}^{(0)}$ and ${Z}_{k}^{(1)}$ such that the resulting equations of motion for $\tilde{{P}_{j}},\tilde{{q}_{i}}$ and $\tilde{{S}_{k}}$ take the following form

$\begin{equation}{\dot{\tilde{P}}}_{j}=0+{\epsilon}{\tilde{F}}_{j}^{(1)}(\overrightarrow{\tilde{P}})+{{\epsilon}}^{2}{\tilde{F}}_{j}^{(2)}(\overrightarrow{\tilde{P}})+\mathcal{O}({{\epsilon}}^{3}),\end{equation} \tag{ 24a }$

$\begin{equation}{\dot{\tilde{q}}}_{i}={{\Upsilon}}_{i}(\overrightarrow{\tilde{P}})+{\epsilon}{\tilde{f}}_{i}^{(1)}(\overrightarrow{\tilde{P}})+{{\epsilon}}^{2}{\tilde{f}}_{i}^{(2)}(\overrightarrow{\tilde{P}})+\mathcal{O}({{\epsilon}}^{3}),\end{equation} \tag{ 24b }$

$\begin{equation}{\dot{\tilde{S}}}_{k}={\tilde{s}}_{k}^{(0)}(\overrightarrow{\tilde{P}})+{\epsilon}{\tilde{s}}_{k}^{(1)}(\overrightarrow{\tilde{P}})+\mathcal{O}({{\epsilon}}^{2}).\end{equation} \tag{ 24c }$

Crucially, these equations of motion are now independent of the orbital phases $\vec{q}$ . Deriving the relationship between the transformed forcing functions ( ${\tilde{F}}_{j}^{(1{\backslash}2)},{\tilde{f}}_{i}^{(1{\backslash}2)}$ and ${\tilde{s}}_{k}^{(0{\backslash}1)}$ ) to the original forcing functions is quite an involved process with several freedoms and choices, each with their own merits and drawbacks. This is discussed at length in [36], so for brevity we will summarize the results and the particular choices we have made in this work.

The transformed forcing functions are related to the original functions by

$\begin{equation}{\tilde{F}}_{j}^{(1)}=\left\langle {F}_{j}^{(1)}\right\rangle ,\end{equation} \tag{ 25a }$

$\begin{equation}{\tilde{f}}_{i}^{(1)}=\left\langle {f}_{i}^{(1)}\right\rangle ,\end{equation} \tag{ 25b }$

$\begin{equation}{\tilde{s}}_{k}^{(0)}=\left\langle {s}_{k}^{(0)}\right\rangle ,\end{equation} \tag{ 25c }$

$\begin{equation}{\tilde{F}}_{j}^{(2)}=\left\langle {F}_{j}^{(2)}\right\rangle +\left\langle \frac{\partial {\breve{Y}}_{j}^{(1)}}{\partial {\tilde{q}}_{i}}{\breve{f}}_{i}^{(1)}\right\rangle +\left\langle \frac{\partial {\breve{Y}}_{j}^{(1)}}{\partial {\tilde{P}}_{k}}{\breve{F}}_{k}^{(1)}\right\rangle ,\end{equation} \tag{ 25d }$

$\begin{equation}{\tilde{f}}_{i}^{(2)}=0,\;\text{and}\end{equation} \tag{ 25e }$

$\begin{equation}{\tilde{s}}_{k}^{(1)}=\left\langle {s}_{k}^{(1)}\right\rangle -\left\langle \frac{\partial {\breve{s}}_{k}^{(0)}}{\partial {\tilde{P}}_{j}}{\breve{Y}}_{j}^{(1)}\right\rangle -\left\langle \frac{\partial {\breve{s}}_{k}^{(0)}}{\partial {\tilde{q}}_{i}}{\breve{X}}_{i}^{(1)}\right\rangle .\end{equation} \tag{ 25f }$

In deriving these equations of motion, we have constrained the oscillating pieces of the NIT transformation functions to be

$\begin{equation}{\breve{Y}}_{j}^{(1)}\equiv \sum\limits _{\vec{\kappa }\ne \vec{0}}\frac{i}{\vec{\kappa }\cdot \vec{{\Upsilon}}}{F}_{j,\vec{\kappa }}^{(1)}{\text{e}}^{\text{i}\vec{\kappa }\cdot \vec{q}},\end{equation} \tag{ 26 }$

$\begin{equation}{\breve{X}}_{i}^{(1)}\equiv \sum\limits _{\vec{\kappa }\ne \vec{0}}\left(\frac{i}{\vec{\kappa }\cdot \vec{{\Upsilon}}}{f}_{i,\vec{\kappa }}^{(1)}+\frac{1}{{(\vec{\kappa }\cdot \vec{{\Upsilon}})}^{2}}\frac{\partial {{\Upsilon}}_{i}}{\partial {P}_{j}}{F}_{j,\vec{\kappa }}^{(1)}\right){\text{e}}^{\text{i}\vec{\kappa }\cdot \vec{q}},\end{equation} \tag{ 27 }$

and ${\breve{Z}}_{k}^{(0)}$ is found by solving

$\begin{equation}{\breve{s}}_{k}^{(0)}+\frac{\partial {\breve{Z}}_{k}^{(0)}}{\partial {\tilde{q}}_{i}}{{\Upsilon}}_{i}=0.\end{equation} \tag{ 28 }$

This equation is satisfied by the oscillating pieces for the analytic solutions for the geodesic motion of t and ϕ in equation (A2), i.e.,

$\begin{equation}{\breve{Z}}_{k}^{(0)}=-{\breve{S}}_{k,r}({q}_{r})-{\breve{S}}_{k,z}({q}_{z}).\end{equation} \tag{ 29 }$

We have chosen the averaged pieces such that $\left\langle {Y}_{j}^{(1)}\right\rangle =\left\langle {Y}_{j}^{(2)}\right\rangle =\left\langle {X}_{i}^{(2)}\right\rangle =\left\langle {Z}_{k}^{(0)}\right\rangle =\left\langle {Z}_{k}^{(1)}\right\rangle =0$ but have used $\left\langle {X}_{i}^{(1)}\right\rangle$ to cancel out the contributions of ${\tilde{f}}_{i}^{(2)}$ . In order to generate waveforms, one only needs to know the transformations in equation (19) to zeroth order in the mass ratio, i.e.,

$\begin{equation}{P}_{j}=\tilde{{P}_{j}}+\mathcal{O}({\epsilon}),\end{equation} \tag{ 30a }$

$\begin{equation}{q}_{i}=\tilde{{q}_{i}}+\mathcal{O}({\epsilon}),\end{equation} \tag{ 30b }$

$\begin{equation}{S}_{k}=\tilde{{S}_{k}}-{Z}_{k}^{(0)}(\overrightarrow{\tilde{P}},\overrightarrow{\tilde{q}})+\mathcal{O}({\epsilon}).\end{equation} \tag{ 30c }$

Furthermore, to be able to directly compare between OG and NIT inspirals, we will need to match their initial conditions to sufficient accuracy. To maintain an overall phase difference of $\mathcal{O}({\epsilon})$ in the course of an inspiral, this requires known the transformation of the P_j's (19a) to linear order in , while it is sufficient to know the rest of equation (19) to zeroth order. In particular, we will never need an explicit expression for $\left\langle {X}_{i}^{(1)}\right\rangle$ , which would require solving a PDE to obtain.

3.2. Averaged equations of motion for eccentric equatorial Kerr inspirals

We now apply the near identity averaging transformation procedure to the equations of motion for equatorial Kerr inspirals to obtain:

$\begin{equation}\frac{\mathrm{d}\tilde{p}}{\mathrm{d}\lambda }={\epsilon}{\tilde{F}}_{p}^{(1)}(a,\tilde{p},\tilde{e})+{{\epsilon}}^{2}{\tilde{F}}_{p}^{(2)}(a,\tilde{p},\tilde{e}),\end{equation} \tag{ 31a }$

$\begin{equation}\frac{\mathrm{d}\tilde{e}}{\mathrm{d}\lambda }={\epsilon}{\tilde{F}}_{e}^{(1)}(a,\tilde{p},\tilde{e})+{{\epsilon}}^{2}{\tilde{F}}_{e}^{(2)}(a,\tilde{p},\tilde{e}),\end{equation} \tag{ 31b }$

$\begin{equation}\frac{\mathrm{d}\tilde{{q}_{r}}}{\mathrm{d}\lambda }={{\Upsilon}}_{r}(a,\tilde{p},\tilde{e})+{\epsilon}{\tilde{f}}_{r}^{(1)}(a,\tilde{p},\tilde{e}),\end{equation} \tag{ 31c }$

$\begin{equation}\frac{\mathrm{d}\tilde{t}}{\mathrm{d}\lambda }={\tilde{s}}_{t}^{(0)}(a,\tilde{p},\tilde{e})+{\epsilon}{\tilde{s}}_{t}^{(1)}(a,\tilde{p},\tilde{e}),\end{equation} \tag{ 31d }$

$\begin{equation}\frac{\mathrm{d}\tilde{\phi }}{\mathrm{d}\lambda }={\tilde{s}}_{\phi }^{(0)}(a,\tilde{p},\tilde{e})+{\epsilon}{\tilde{s}}_{\phi }^{(1)}(a,\tilde{p},\tilde{e}).\end{equation} \tag{ 31e }$

The leading order terms in each equation of motion are simply the original function averaged over a single geodesic orbit, i.e.,

$\begin{equation}{\tilde{F}}_{p}^{(1)}=\left\langle {F}_{p}\right\rangle ,\qquad {\tilde{F}}_{e}^{(1)}=\left\langle {F}_{e}\right\rangle ,\qquad {\tilde{f}}_{r}^{(1)}=\left\langle {f}_{r}\right\rangle ,\end{equation} \tag{ 32 }$

$\begin{equation}{\tilde{s}}_{t}^{(0)}=\left\langle {s}_{t}\right\rangle ={{\Upsilon}}_{t},\qquad {\tilde{s}}_{\phi }^{(0)}=\left\langle {s}_{\phi }\right\rangle ={{\Upsilon}}_{\phi },\end{equation} \tag{ 33 }$

where ϒ_t and ϒ_ϕ are the Mino-time t and ϕ fundamental frequencies. The remaining terms are more complicated and require Fourier decomposing the original functions and their derivatives with respect to the orbital elements (p, e). To express the result, we define the operator

$\begin{align}\hfill \mathcal{N}(A)& =\sum\limits _{\kappa \ne 0}\frac{-1}{{{\Upsilon}}_{r}}\left[{A}_{\kappa }{f}_{r,-\kappa }-\frac{i}{\kappa }\left(\frac{\partial {A}_{\kappa }}{\partial \tilde{p}}{F}_{p,-\kappa }+\frac{\partial {A}_{\kappa }}{\partial \tilde{e}}{F}_{e,-\kappa }\right.\right.\hfill \\ \hfill & \quad -\left.\left.\frac{{A}_{\kappa }}{{{\Upsilon}}_{r}}\left(\frac{\partial {{\Upsilon}}_{r}}{\partial \tilde{p}}{F}_{p,-\kappa }+\frac{\partial {{\Upsilon}}_{r}}{\partial \tilde{e}}{F}_{e,-\kappa }\right)\right)\right].\hfill \end{align} \tag{ 34 }$

With this in hand, the remaining terms in the equations of motion are found to be

$\begin{equation}{\tilde{F}}_{p}^{(2)}=\mathcal{N}({F}_{p}),\qquad {\tilde{F}}_{e}^{(2)}=\mathcal{N}({F}_{e}),\qquad {\tilde{s}}_{t}^{(1)}=\mathcal{N}({s}_{t}),\qquad {\tilde{s}}_{\phi }^{(1)}=\mathcal{N}({s}_{\phi }).\end{equation} \tag{ 35 }$

4. Gravitational self-force for eccentric Kerr inspirals

In order to drive the inspiral we need to rapidly evaluate the GSF given any values of (p, e, q_r). Codes that compute the GSF generally take minutes to hours to compute the force along a geodesic for a given (p, e) value and so it is not practical to directly couple the equations of motion to a GSF code. Instead, it is common practice to calculate the GSF on a discrete set of points across the parameter space and then build an interpolation or fitted model that smoothly connects the GSF data. The following subsections describe our approach.

4.1. Gravitational self-force

The GSF approach is reviewed extensively elsewhere [52, 53] and so we just give a brief overview of the calculations that we employ. The GSF approach starts by expanding the metric of the binary around the metric of the primary, i.e., ${g}_{\mu \nu }={\bar{g}}_{\mu \nu }+{\epsilon}{h}_{\mu \nu }^{(1)}+{{\epsilon}}^{2}{h}_{\mu \nu }^{(2)}+\dots \,$ where ${\bar{g}}_{\mu \nu }$ is the Kerr metric and the h⁽ⁿ⁾ are the nth order perturbations to the spacetime due to the presence of the secondary. The interaction between these metric perturbations and the motion of the secondary can be derived through a matched asymptotic expansion analysis [53]. In this work we use only first-order (in the mass ratio) results, as second order results are still emerging [30]. The first order metric perturbation generated by a compact object can be found by solving the linearized Einstein field equations with a point particle source moving on a geodesic of ${\bar{g}}_{\mu \nu }$ . The matched asymptotic expansion analysis identifies a regular part of this (divergent) metric perturbation that is responsible for the backreaction on the compact object [53–56]. The result is a (self-)force that appears on the right-hand side of equation (3) computed from derivatives of the regular metric perturbation [53].

Solving the perturbation equations requires picking a gauge, and the resulting self-force is gauge dependent [57]. The self-force was first computed in the Lorenz gauge where the procedure for obtaining the regular part was best understood. Numerical calculations of the Lorenz gauge self-force have been made in both the frequency—[25, 26, 58] and time-domains [22–24]. All these results have been for motion in Schwarzschild spacetime, with one exception in Kerr [59].

Calculating perturbations of the Kerr spacetime is hampered by the lack of separability of the linearized Einstein field equations on this background. This difficulty can be circumvented by using the Teukolsky formalism for describing perturbations to the Weyl scalars [60], which is fully separable in the frequency domain. From the Weyl scalars, the metric perturbation can be reconstructed in a radiation gauge [61–63]. There has also been recent progress understanding how to reconstruct the metric in the Lorenz gauge [64]. Regularization of the metric perturbation in radiation gauges is more subtle [65], but self-force calculations in the radiation gauge are now routine [27, 28, 66, 67].

In this work we primarily use the self-force computed using the code of references [27, 28, 67]. This code uses the Mano–Suzuki–Takasugi methods [68–71] to compute the perturbations to the Weyl scalars in the frequency domain. The metric is then reconstructed into an outgoing radiation gauge (including mass and angular momentum perturbations [72, 73] and gauge completion contributions [74]). The metric perturbation is then projected onto a basis of spherical harmonics before regularization is carried out using the mode-sum approach [65, 75]. Depending on the eccentricity of the orbit the code must compute the metric perturbation by summing over thousands to tens of thousands of Fourier harmonic modes. With the current Mathematica implementation, the self-force for an, e.g., a = 0.9M, p = 3.375, e = 0.5 orbit takes approximately 90 CPU hours to compute.

The self-force can be split into dissipative (time anti-symmetric) and conservative (time-symmetric) contributions [76].

The dissipative pieces causes the orbit to shrink until the secondary plunges into the primary. It also generally causes the orbit to circularize, with the exception being just before the transition to plunge where the orbit gains eccentricity [44, 77–79]. To produce adiabatic waveforms, we only require knowledge of the orbit averaged dissipative pieces of the first-order self-force. These can be related, via balance laws, to the fluxes of GWs to infinity and down the event horizon. Since calculating fluxes avoids regularization of the metric perturbation, adiabatic inspirals are typically calculated via flux balance laws [44, 78–83]. The conservative pieces have more subtle effects on the inspiral, such as altering the rate of periapsis advance and the location of the innermost stable circular orbit (ISCO) [50, 84–89].

To compute post-adiabatic inspirals requires knowledge of both the dissipative and conservatives pieces of the first-order self-force and the orbit average piece of the second-order self-force [19]. There are as yet no calculations of the latter so we will make do with just the first-order self-force information in this work. This will allow us to explore some of the effects of the conservative self-force on equatorial Kerr inspirals for the first time.

4.2. Interpolation method

In order to drive inspirals, we need the self-force to be rapidly computed across the EMRI parameter space. To achieve this we tile the parameter space with GSF data which we can then interpolate. This approach has been implemented for eccentric Schwarzschild inspirals [31, 32]. In these works the data was interpolated using standard cubic spline methods, which required computing the self-force at tens of thousands of points in the parameter space. While this might not pose much of a problem for the 2D parameter space of eccentric, Schwarzschild inspirals, these approaches would not scale well to the 4D parameter space for generic Kerr inspirals. Motivated by this, as well as the computational expense of the eccentric Kerr self-force code, we build an interpolation model based on Chebyshev polynomials that is accurate to percent level across a 2D slice of the EMRI parameter space using only a few hundred points.

We start by fixing the value of the spin parameter of the primary, which we choose to be a = 0.9M for Kerr inspirals or a = 0 for Schwarzschild inspirals and set the inclination x to be either 1 or −1 for prograde orbits or retrograde orbits respectively. This reduces our parameter space to two parameters; the semilatus rectum p and the eccentricity e. We then define a parameter y using the p and the position of the last stable orbit p_LSO. For Kerr orbits, we chose y to be

$\begin{equation}{y}_{\text{Kerr}}=\sqrt{\frac{{p}_{\text{LSO}}(a,e,x)}{p}}.\end{equation} \tag{ 36 }$

With this parameterization we found that the accuracy of the Chebyshev interpolation is limited by the appearance of cusps at the LSO in the data. To ameliorate their impact we instead used a parameter y given by

$\begin{equation}{y}_{\text{Schwarz}}=1-{\left(1-\frac{{p}_{\text{LSO}}(0,e,x)}{p}\right)}^{1/3}\end{equation} \tag{ 37 }$

for later runs in Schwarzschild spacetime. In either case, tiling the parameter space in y instead of p will concentrate more points near the separatrix where the self force varies the most.

We let y range from y_min = 0 (0.01 for Schwarzschild) to y_max = 1 and e range from e_min = 0 to e_max = 0.5 for Kerr and e_min = 0 to e_max = 0.3 for Schwarzschild. We define parameters u and v which cover this parameter space as they range from (−1, 1)

$\begin{equation}u{:=}\frac{y-({y}_{\mathrm{min}}+{y}_{\mathrm{max}})/2}{({y}_{\mathrm{min}}-{y}_{\mathrm{max}})/2}\quad \text{and}\quad v{:=}\frac{e-({e}_{\mathrm{min}}+{e}_{\mathrm{max}})/2}{({e}_{\mathrm{min}}-{e}_{\mathrm{max}})/2}.\end{equation} \tag{ 38a-b }$

This parameterization is convenient when using Chebyshev polynomials of the first kind, where the order n polynomial is defined by T_n(cos φ) := cos(nφ). The Chebyshev nodes are the roots these polynomials, and the location of the kth root of nth polynomial is given by

$\begin{equation}{N}_{k}=\mathrm{c}\mathrm{o}\mathrm{s}\left(\frac{2k-1}{2n}\pi \right).\end{equation} \tag{ 39 }$

We then calculate the GSF on a 15 × 7 grid of Chebyshev nodes, with the u values given by the roots of the 15th order polynomial and the v values given by the roots of the 7th order polynomial. At each point on our grid, we Fourier decompose each component of the force with respect to the radial action angle q_r. We then multiply the data for each Fourier coefficient by a factor of (1 − y)/(1 − e²), as we find that this smooths the behaviour of the force near the separatrix and improves the accuracy of our interpolation. Next, we use Chebyshev polynomials to interpolate each Fourier coefficient across the (u, v) grid. We then resum the modes to reconstruct our interpolated GSF model:

$\begin{equation}{a}_{\alpha }=\frac{1-{e}^{2}}{1-y}\sum\limits _{\kappa =0}^{15}{A}_{\alpha }^{\kappa }(y,e)\mathrm{cos}(\kappa {q}_{r})+{B}_{\alpha }^{\kappa }(y,e)\mathrm{sin}(\kappa {q}_{r}),\end{equation} \tag{ 40 }$

where

$\begin{align}\hfill {A}_{\alpha }^{\kappa }(y,e)& =\sum\limits _{i=0}^{14}\,\sum\limits _{j=0}^{6}\,{A}_{\alpha }^{\kappa ij}{T}_{i}\left(u\right){T}_{j}\left(v\right)\quad \text{and}\hfill \\ \hfill {B}_{\alpha }^{\kappa }(y,e)& =\sum\limits _{i=0}^{14}\,\sum\limits _{j=0}^{6}\,{B}_{\alpha }^{\kappa ij}{T}_{i}\left(u\right){T}_{j}\left(v\right).\hfill \end{align} \tag{ 41 }$

Using this procedure forces each component to become singular at the last stable orbit. While the GSF changes rapidly as one approaches the last stable orbit, we do not expect the components of the self force to diverge at the LSO. Understanding the analytic structure of the self-force in this region would likely improve future interpolation models.

We note that the GSF should satisfy the orthogonality condition with the geodesic four-velocity, i.e., a_α u^α = 0. Interpolation will bring with it a certain amount of error which can cause this condition to be violated. We find empirically that we can reduce this interpolation error by projecting the force so that this condition is always satisfied, i.e.,

$\begin{equation}{a}_{\alpha }^{\perp }={a}_{\alpha }+{a}_{\beta }{u}^{\beta }{u}_{\alpha }.\end{equation} \tag{ 42 }$

This procedure allows us to create a smooth, continuous model for the GSF with relative errors less than 5 × 10⁻³ in the strong field—see figure 1. The variation in the accuracy of the model is primarily a by-product of how close a given test point (green cross) is to the data points (white dots) used to create the model. We note that this level of precision would not be sufficient for production grade waveforms for LISA, as we would need the relative error of the orbit averaged dissipative self-force to be less than $\sim 1/{\epsilon}$ , whereas the oscillatory pieces of the self-force only need to be interpolated to an accuracy of a fraction of a percent [32]. Our present interpolation model already likely reaches the latter criteria and a future hybrid method that combines flux and self-force data, similar to the one constructed in reference [32], can likely reach the overall accuracy goal. Nonetheless, our present model is more than sufficient to test our averaging procedure and to explore the effects of the GSF for eccentric Kerr inspirals. This will now be treated as the underlying forcing model for both the OG and NIT inspirals.

5. Implementation

Combining the above model with our action angle formulation of the OG equations provides us with everything required to calculate the NIT equations of motion. We first evaluate and interpolate the various terms in the NIT equations of motion across the parameter space. This offline process is costly but it only needs to be completed once. By contrast, the online steps are computationally cheap, which allows us to rapidly compute eccentric self-forced inspirals into a Kerr black hole.

5.1. Offline steps

To make the offline calculation we complete the following steps.

(a)
We start by selecting a grid to evaluate the NIT functions upon. We chose y values between 0.2 and 0.998 in 320 equally spaced steps and e values from 0.001 to 0.5 in 500 equally spaced steps (160 000 points) in the case of Kerr, or use the same spacing in y but only grid in e from 0.001 to 0.3 in 300 equally spaced steps (96 000 points) in Schwarzschild.³
(b)
For each point in the parameter space (a, y, e) we evaluate the functions F_p\e, f_r and s_t\ϕ along with their derivatives with respect to p and e for 30 equally spaced values of q_r from 0 to 2π.
(c)
We then perform a fast Fourier transform on the output data to obtain the Fourier coefficients of the forcing functions and their derivatives.
(d)
With these, we then use equations (32)–(35) to construct ${\tilde{F}}_{p{\backslash}e}^{(1{\backslash}2)}$ , ${\tilde{f}}_{r}^{(1)}$ and ${s}_{t{\backslash}\phi }^{(1)}$ for that point in the parameter space.
(e)
We also use equations (26) and (27) to construct the Fourier coefficients of the first order transformation functions ${Y}_{p{\backslash}e}^{(1)}$ and ${X}_{r}^{(1)}$ .
(f)
We then repeat this procedure across the parameter space for each point in our grid.
(g)
Finally we interpolate the values for ${\tilde{F}}_{p{\backslash}e}^{(1{\backslash}2)},{\tilde{f}}_{r}^{(1)}$ and ${\tilde{s}}_{t{\backslash}\phi }^{(1)}$ along with the coefficients of ${Y}_{p{\backslash}e}^{(1)}$ and ${X}_{r}^{(1)}$ across this grid using Hermite interpolation and store the interpolants for future use.

We implemented the above algorithm in Mathematica 12.2 and find, parallelized across 20 CPU cores takes, the calculation takes about one day to complete. This is a small price to pay, since these offline steps need only be completed once.

5.2. Online steps

The online steps are required for every inspiral calculation, but are comparatively inexpensive. The online steps for computing an NIT inspiral are as follows.

(a)
We load in the interpolants for ${\tilde{F}}_{p{\backslash}e}^{(1{\backslash}2)}$ and ${\tilde{f}}_{r}^{(1)}$ and ${\tilde{s}}_{t{\backslash}\phi }^{(1)}$ , define the NIT equations of motion.
(b)
In order to make comparisons between NIT and OG inspirals, we also load interpolants of the Fourier coefficients of ${\breve{Y}}_{p/e}^{(1)}$ and ${\breve{X}}_{r}^{(1)}$ and equation (19) to construct first order near-identity transformations.⁴
(c)
We state the initial conditions of the inspiral (p₀, e₀, q_r,0) and use the NIT to leading order in the mass ratio to transform these into initial conditions for the NIT equations of motion, i.e., $({\tilde{p}}_{0},{\tilde{e}}_{0},{\tilde{q}}_{r0})$ .
(d)
We then evolve the NIT equations of motion using an ODE solver (in this case Mathematica's NDSolve).

As with the offline steps we implement the online steps in Mathematica. Note that steps (b) and (c) are only necessary because we want to make direct comparisons between NIT and OG inspirals with the same initial conditions. In general, the difference between the NIT and OG variables will always be $\mathcal{O}({\epsilon})$ , and so performing the NIT transformation or inverse transformation to greater than zeroth order in mass ratio will not be necessary when producing waveforms to post adiabatic order, i.e. with phases accurate to $\mathcal{O}({\epsilon})$ .

6. Results

In this section we present the results from the NIT equations of motion. We first perform some consistency checks in section 6.1. We then show that our NIT and OG inspirals agree to the relevant order in the mass ratio in section 6.2. Here we also compute, for the first time, self-forced inspirals in Kerr spacetime. With our fast NIT model we then explore the impact of the conservative effects of the first-order GSF as calculated in radiation gauge for Kerr inspirals in section 6.3. Finally, in section 6.4, we compare Schwarzschild inspirals calculated using a radiation gauge GSF model and a Lorenz gauge GSF model.

6.1. Consistency checks

Before computing inspirals, we perform a series of consistency checks on the NIT equations of motion. A useful feature of the NIT is how it separates adiabatic and post-adiabatic effects of the GSF. At first order in the mass ratio, this corresponds to the dissipative and conservative pieces respectively. We note that when we substitute ${a}^{\alpha }\to {a}_{\mathrm{d}\mathrm{i}\mathrm{s}\mathrm{s}}^{\alpha }$ , we find that ${\tilde{F}}_{p{\backslash}e}^{(2)}$ , ${\tilde{f}}_{r}^{(1)}$ and ${\tilde{s}}_{t{\backslash}\phi }^{(1)}$ are numerically consistent with zero, while ${\tilde{F}}_{p{\backslash}e}^{(1)}$ remains unchanged. Similarly, when we substitute ${a}^{\alpha }\to {a}_{\mathrm{c}\mathrm{o}\mathrm{n}\mathrm{s}}^{\alpha }$ , ${\tilde{F}}_{p{\backslash}e}^{(1)}$ and ${\tilde{F}}_{p{\backslash}e}^{(2)}$ become consistent with zero, while ${\tilde{f}}_{r}^{(1)}$ and ${\tilde{s}}_{t{\backslash}\phi }^{(1)}$ remain the same as before. The functions ${\tilde{F}}_{p{\backslash}e}^{(2)}$ only becomes non-zero when both dissipative and conservative effects of the first order self-force are present.

From ${\tilde{F}}_{p{\backslash}e}^{(1)}$ , one can calculate the average rate of change of energy and angular momentum via the following relation:

$\begin{equation}\left\langle \frac{\mathrm{d}\mathcal{E}}{\mathrm{d}t}\right\rangle =\frac{{\epsilon}}{{{\Upsilon}}_{t}}\left(\frac{\partial \mathcal{E}}{\partial p}{\tilde{F}}_{p}^{(1)}+\frac{\partial \mathcal{E}}{\partial e}{\tilde{F}}_{e}^{(1)}\right)\end{equation} \tag{ 43a }$

$\begin{equation}\left\langle \frac{\mathrm{d}\mathcal{L}}{\mathrm{d}t}\right\rangle =\frac{{\epsilon}}{{{\Upsilon}}_{t}}\left(\frac{\partial \mathcal{L}}{\partial p}{\tilde{F}}_{p}^{(1)}+\frac{\partial \mathcal{L}}{\partial e}{\tilde{F}}_{e}^{(1)}\right).\end{equation} \tag{ 43b }$

We compared these to the energy and angular momentum fluxes at infinity tabulated in the Black Hole Perturbation Toolkit [49] and generated with a variant of the Gremlin code [80, 81] and found that the balance laws were upheld up to relative errors $< 1{0}^{-3}$ throughout the parameter space which is consistent with the interpolation error of our self-force model.

From all of this, we can infer the significance of each of the terms in equation (31): ϒ_r, ϒ_t and ϒ_ϕ capture the background geodesic motion, ${\tilde{F}}_{p}^{(1)}$ and ${\tilde{F}}_{e}^{(1)}$ capture the adiabatic effects due to the first order dissipative self-force, ${\tilde{f}}_{r}^{(1)}$ , ${\tilde{s}}_{t}^{(1)}$ , and ${\tilde{s}}_{\phi }^{(1)}$ capture the post-adiabatic effects due to the first order conservative self-force, and ${\tilde{F}}_{p}^{(2)}$ , ${\tilde{F}}_{e}^{(2)}$ capture the interplay between the first order dissipative and conservative self-force, as well as the effect of the orbit averaged contribution from the second order self-force.

6.2. Comparison between OG and NIT inspirals

In order to test the accuracy of our implementation, we compare inspirals calculated using the OG equations of motion found in reference [34] to those calculated using the near-identity transformed equations of motion. To demonstrate these results, we choose a binary with a primary of mass M = 10⁶ M_⊙ and a secondary of mass μ = 10M_⊙ for a typical EMRI mass ratio of = 10⁻⁵. To push our procedure to the limit, we chose the initial conditions of our prograde inspiral to be deep in the strong field and highly eccentric with p₀ = 7.1 and e₀ = 0.48 such that the resulting inspiral would take approximately 1 year to plunge. We also set q_r,0 = t₀ = ϕ₀ = 0 for simplicity.

Figure 2 shows the evolution of p and e over time. The trajectories calculated with the OG equations of motion have order oscillations on the orbital timescale which requires the numerical integrator to take small time steps to accurately resolve. The NIT trajectory does not have these oscillations so the numerical integrator can take much larger steps and still faithfully track the averaged trajectory throughout the entire inspiral. The inverse NIT given in equation (20a) through $\mathcal{O}({\epsilon})$ can be used to add the oscillations back on to the NIT trajectory. We find that while this is unnecessary for computing accurate waveforms, it demonstrates that the NIT trajectory remains in phase with the OG trajectory—see the insets of figure 2.

epsilon — **Figure 2.** The trajectory through (p, e) space for an inspiral with = 10⁻⁵, a = 0.9M, and initial conditions (p₀ = 7.1, e₀ = 0.48). We show the inspiral computed using the OG equations, the NIT equations of motion and the inverse NIT to first order in . The insets zoom into the start and end of the inspiral to reveal the small orbital timescale oscillations. The NIT averages through these oscillations, and when using the inverse NIT to add the oscillations back on, we see that the NIT trajectory remains almost perfectly in phase with the OG trajectory throughout the inspiral.
Download figure:
Standard image High-resolution image

The accuracy of our NIT model is further demonstrated by figure 3 which shows the absolute difference in the orbital phase q_r and the extrinsic quantities t and ϕ between the NIT and OG evolutions. Over the course of the year long inspiral, $\vert t-(\tilde{t}-{Z}_{t}^{(0)})\vert \leqslant 5\times 1{0}^{-3},\vert \phi -(\tilde{\phi }-{Z}_{\phi }^{(0)})\vert \leqslant 1{0}^{-5}$ and $\vert {q}_{r}-{\tilde{q}}_{r}\vert \leqslant 1{0}^{-3}$ with the differences only spiking to $\leqslant 1{0}^{-2}$ just as the trajectories reach the separatrix where the adiabatic approximation breaks down.

Finally, we test the effect the NIT procedure has on the waveform. In principle, we could use our averaged equations of motion in conjunction with the FastEMRIWaveforms (FEW) framework to rapidly compute waveforms with relativistic amplitudes. However, currently, the FEW framework only has amplitude data for Schwarzschild inspirals. As such, we make use of the same procedure as the numerical kludge [10] by mapping the Boyer–Lindquist coordinates {t, r, θ, ϕ} to flat space coordinates and using the quadrupole formula to generate the waveform. The resulting waveforms are only an approximation to the true waveforms, but since both inspiral trajectories are being fed through the same waveform generation scheme this should not bias the results when finding the difference in the waveform as a result of using the NIT trajectory instead of the OG trajectory.

From figure 4, we can see that the waveforms generated by each evolution scheme, sampled every t = 1M ≈ 5s, are almost identical by eye. We can further quantify this by calculating the waveform mismatch using the WaveformMatch function from the SimulationTools [90] Mathematica package and assuming a flat noise curve. From figure 5, we see that the mismatch remains below 5 × 10⁻⁸ throughout the inspiral. At this level of mismatch the two waveforms would be completely indistinguishable for EMRIs with SNR of up to (at least) 3000 [91–93].

**Figure 5.** The mismatch between the semi-relativistic quadrupole waveforms between inspirals calculated using the OG equations with (a, , p₀, e₀) = (0.9M, 10⁻⁵, 7.1, 0.48) and the adiabatic EOM matched initial conditions, the adiabatic EOM calculated with matched initial frequencies, and the near-identity transformed EOM. We also mark the mismatch that would be indistinguishable for signals with SNR = 100.
Download figure:
Standard image High-resolution image

Next, the difference between the OG and NIT quantities should scale linearly with the mass ratio. This is illustrated in figure 6, where starting with initial conditions p₀ = 4 and e₀ = 0.2 we evolved the inspiral until it reached p = 3 for mass ratios ranging from 10⁻¹ to 10⁻⁵. While working with only machine precision arithmetic we found that for smaller mass ratios the numerical error of the solver of the OG inspiral became dominant over the difference with the NIT. To rectify this, we increased the working precision of our solver to 30 significant digits and found that the difference does, in fact, scale linearly with the mass ratio. This requirement for higher precision only affected the OG solver, the NIT equations of motion can be solved with machine precision arithmetic without introducing any significant error.

Since the difference between OG and NIT quantities scales with the mass ratio, it is natural to ask how large can the mass ratio be before the NIT and OG waveforms differ enough to affect data analysis. Following the procedure outlined in reference [38], we used our fast NIT inspiral code along with a root-finding algorithm to find the initial value of p that corresponds to a year long inspiral for a given value of the mass ratio and initial eccentricity, and assuming a primary mass of 10⁶ M_⊙. We use these initial conditions to calculate the overlap between year-long NIT and OG waveforms. This calculation is repeated with mass ratios = {1, 3, 5, 7, 9} × 10⁻³ and initial eccentricities e₀ ranging from 0.05 to 0.45 in equally spaced steps of 0.05. The result of this analysis can be seen in figure 7. This demonstrates that NIT and OG waveforms have overlaps larger than the benchmark of 0.97 [94] for mass ratios less than $\approx \;3\times 1{0}^{-3}$ , but these overlaps decrease substantially for mass ratios larger than this. We also see that the overlap generally decreases as the initial eccentricity increases, though this effect is not as strong as the effect demonstrated by a similar analysis in reference [38] for NITs applied to highly eccentric inspirals in Schwarzschild. They also found that the mismatch between NIT and OG waveforms became substantial for mass ratios larger than 2 × 10⁻⁴. These differences between the two analyses are most likely the result of our inspirals being deeper in the strong field and driven by a self-force computed in a different gauge (reference [38] uses the Lorenz gauge self-force). Such mismatches should not be an issue for EMRI data analysis as EMRIs have mass ratios that range from 10⁻⁷ to 10⁻⁴. However, these mismatches become significant for intermediate mass ratio inspirals, with mass ratios between 10⁻⁴ to 10⁻¹. Since both the OG and NIT equations of motion are formally valid to the same order in the mass ratio, it is not clear a priori which of the two would be closer to the true inspiral. When completed at one-post-adiabtatic (1PA) order the two sets of equations represent different resummations of the 1PA equations of motion, differing only in their higher order (2+) PA terms. The fact that we are seeing a significant difference between these two resummations for intermediate mass ratios suggests that such higher order PA terms might become relevant. However, in this case it might just be signalling the importance of the missing orbit-averaged dissipative self-force term at 1PA order.

**Figure 7.** The overlap between OG and NIT waveforms for year-long, prograde, a = 0.9M, equatorial Kerr inspirals as a function of the mass ratio and initial eccentricity. The difference between the two waveforms is less than the accuracy benchmark of 0.97 for mass ratios $\leqslant 3\times 1{0}^{-3}$ , but not for mass ratios larger than this. While increasing eccentricity does have an effect on the overlap, this effect is not as strong as the effect observed in figure 9 of reference [38].
Download figure:
Standard image High-resolution image

**Figure 7.** The overlap between OG and NIT waveforms for year-long, prograde, a = 0.9M, equatorial Kerr inspirals as a function of the mass ratio and initial eccentricity. The difference between the two waveforms is less than the accuracy benchmark of 0.97 for mass ratios $\leqslant 3\times 1{0}^{-3}$ , but not for mass ratios larger than this. While increasing eccentricity does have an effect on the overlap, this effect is not as strong as the effect observed in figure 9 of reference [38].
Download figure:
Standard image High-resolution image

Finally, we note that using the NIT equations of motion produces a substantial speed-up over using the OG equations. From table 1, we see the typical computation time for an inspiral starting at p₀ = 7.1 and e₀ = 0.48 and evolved until the inspiral reaches the last stable orbit for different values of the mass ratio. We see that as we decrease the mass ratio by an order of magnitude, the OG inspiral takes roughly an order of magnitude longer to compute, as it would have to resolve an order of magnitude more orbital cycles before reaching last stable orbit. The NIT inspirals all take roughly the same amount of time to evolve to the last stable orbit, regardless of the mass ratio. Using our current Mathematica implementation, the NIT inspirals can be computed in less than a second. This time could be further reduced tens of milliseconds if one uses a compiled language such as C/C++, as was done in paper I [36]. We see that using the NIT equations of motion is most advantageous for long inspirals with small mass ratios. Another benefit of using the NIT is that the inspiral requires taking fewer time steps, which results in less numerical error, making it easier achieve a given target accuracy.

Table 1. Computational time required to evolve an inspiral from its initial conditions of p₀ = 7.1 and e₀ = 0.48 to the last stable orbit for different values of the mass ratio, as calculated in Mathematica 12.2 on an Intel Core i7 @ 2.2 GHz. The computational time for the OG inspiral scales inversely with the mass ratio, whereas the computational time for NIT inspirals is independent of the mass ratio. This demonstrates how the smaller the mass ratio of the inspiral, the greater speed-up one obtains from using the NIT equations of motion.

	OG Inspiral	NIT Inspiral	Speed-up
10⁻²	44 s	0.85 s	$\sim 37$
10⁻³	6 m 48 s	0.78 s	$\sim 491$
10⁻⁴	54 m 12 s	0.81 s	$\sim 3782$
10⁻⁵	6 h 16 m	0.76 s	$\sim 29\,655$

The only disadvantage of our formulation is that our final trajectory is parameterized in terms Mino time λ, whereas LISA data analysis applications will need waveforms parameterized Boyer–Lindquist retarded time t. Since our formulation also outputs t(λ), we can numerically invert this to get λ(t) which allows us to resolve this issue at the cost of additional computation time. This was also a problem with the NIT formulation in Schwarzschild where the final trajectory is outputted as a function of the quasi-Keplerian angle χ [36, 38]. This problem might be circumvented entirely by performing an additional transformation to our NIT equations of motion which would produce averaged equations of motion parameterized by t as outlined in [35].

Since we are now satisfied that our formulation can produce fast and accurate self-force driven trajectories, we can now use this procedure to explore the phenomenology of eccentric, equatorial Kerr inspirals.

6.3. Impact of adiabatic and post-adiabatic effects

With the ability to generate fast and accurate inspirals, we can survey the physics of equatorial Kerr inspirals and examine how this differs from the Schwarzschild case. From figure 8(a), we see the familiar effect of gravitational radiation reaction on the semilatus rectum, $\tilde{p}$ , and eccentricity, $\tilde{e}$ , whereby $\tilde{p}$ and $\tilde{e}$ both decrease over the inspiral with $\tilde{e}$ growing a little as the last stable orbit is approached [44, 78, 79]. As the inspiral approaches the last stable orbit adiabaticity breaks down and the inspiral undergoes a transition to plunge [95–98]. As such, we stop our inspirals just before the last stable orbit. Our results are the first inspirals to include conservative self-force corrections to the equations of motion in Kerr spacetime. The initial phase q_r,0 only evolves secularly when conservative self-force corrections are present and so we use this as a measure of the influence of these corrections [31]. This is illustrated by the dashed orange curves in figure 8(a), which mark the number of radians ${\tilde{q}}_{r,0}$ will evolve from a given pair of initial conditions $({\tilde{p}}_{0},{\tilde{e}}_{0})$ until the last stable orbit. For retrograde Kerr (and Schwarzschild orbits in figure 10), we find that ${\tilde{q}}_{r,0}$ increases throughout the inspiral, whereas for prograde Kerr ${\tilde{q}}_{r,0}$ decreases during the inspiral before increasing slightly just before plunge. This is consistent with the change of sign in the correction to the rate of periapsis advance induced by the conservative self force as a function of spin in the circular orbit limit [88]—see appendix D for further details.

**Figure 8.** Sample trajectories through $(\tilde{p},\tilde{e})$ space for prograde and retrograde equatorial Kerr inspirals with = 10⁻⁵ and a = 0.9M. From these plots, we see the familiar behaviour of EMRIs losing eccentricity as the compact object approaches the primary and then gaining eccentricity just before crossing the separatrix (dashed black line). The dashed orange curves are contours that mark the number of radians ${\tilde{q}}_{r,0}$ will evolve from a given point until plunge. The conservative self-force for retrograde orbits has a similar effect to the non-spinning case as it causes ${\tilde{q}}_{r,0}$ to increase throughout the inspiral. In the prograde case, ${\tilde{q}}_{r,0}$ decreases for most of the inspiral and then slightly increases shortly before plunge.
Download figure:
Standard image High-resolution image

**Figure 8.** Sample trajectories through $(\tilde{p},\tilde{e})$ space for prograde and retrograde equatorial Kerr inspirals with = 10⁻⁵ and a = 0.9M. From these plots, we see the familiar behaviour of EMRIs losing eccentricity as the compact object approaches the primary and then gaining eccentricity just before crossing the separatrix (dashed black line). The dashed orange curves are contours that mark the number of radians ${\tilde{q}}_{r,0}$ will evolve from a given point until plunge. The conservative self-force for retrograde orbits has a similar effect to the non-spinning case as it causes ${\tilde{q}}_{r,0}$ to increase throughout the inspiral. In the prograde case, ${\tilde{q}}_{r,0}$ decreases for most of the inspiral and then slightly increases shortly before plunge.
Download figure:
Standard image High-resolution image

As discussed in section 6.1, one can readily calculate adiabatic inspirals using the NIT equations of motion by simply neglecting the post-adiabatic terms. However, when trying to determine how post-adiabatic corrections effect the inspiral, one must be mindful of how one matches up an adiabatic inspiral with its post-adiabatic counterpart. Following the argument found in references [31, 32], matching the initial conditions $({\tilde{p}}_{0},{\tilde{e}}_{0})$ results in an error in the orbital phases that grows linearly in t as the conservative self-force changes the orbital frequencies [86]. Instead, one should instead match the Boyer–Lindquist time fundamental frequencies Ω_r and Ω_ϕ. For an adiabatic inspiral, these are directly related to the Mino-time fundamental frequencies via ${{\Omega}}_{r{\backslash}\phi }^{\mathrm{A}\mathrm{d}}=\frac{{{\Upsilon}}_{r{\backslash}\phi }}{{{\Upsilon}}_{t}}$ [42]. To calculate these frequencies as perturbed by the conservative self-force, one can either follow the method outlined in reference [32], or one can calculate them directly from the NIT equations of motion:

$\begin{equation}{{\Omega}}_{r}^{\mathrm{S}\mathrm{F}}=\frac{{{\Upsilon}}_{r}+{\epsilon}{\tilde{f}}_{r}^{(1)}}{{{\Upsilon}}_{t}+{\epsilon}{\tilde{s}}_{t}^{(1)}}+\mathcal{O}({{\epsilon}}^{2})\quad \text{and}\quad {{\Omega}}_{\phi }^{\mathrm{S}\mathrm{F}}=\frac{{{\Upsilon}}_{\phi }+{\epsilon}{\tilde{s}}_{\phi }^{(1)}}{{{\Upsilon}}_{t}+{\epsilon}{\tilde{s}}_{t}^{(1)}}+\mathcal{O}({{\epsilon}}^{2}).\end{equation} \tag{ 44 }$

We find that both approaches give the same result up to an error that scales as ². With this in hand, we can now choose a value for our initial conditions $({\tilde{p}}_{0}^{\mathrm{S}\mathrm{F}},{\tilde{e}}_{0}^{\mathrm{S}\mathrm{F}})$ for our self-forced inspiral, and then root find for initial conditions $({\tilde{p}}_{0}^{\mathrm{A}\mathrm{d}},{\tilde{e}}_{0}^{\mathrm{A}\mathrm{d}})$ that satisfy the simultaneous equations

$\begin{equation}{{\Omega}}_{r}^{\mathrm{S}\mathrm{F}}({\tilde{p}}_{0}^{\mathrm{S}\mathrm{F}},{\tilde{e}}_{0}^{\mathrm{S}\mathrm{F}})-{{\Omega}}_{r}^{\mathrm{A}\mathrm{d}}({\tilde{p}}_{0}^{\mathrm{A}\mathrm{d}},{\tilde{e}}_{0}^{\mathrm{A}\mathrm{d}})=0,\end{equation} \tag{ 45a }$

$\begin{equation}{{\Omega}}_{\phi }^{\mathrm{S}\mathrm{F}}({\tilde{p}}_{0}^{\mathrm{S}\mathrm{F}},{\tilde{e}}_{0}^{\mathrm{S}\mathrm{F}})-{{\Omega}}_{\phi }^{\mathrm{A}\mathrm{d}}({\tilde{p}}_{0}^{\mathrm{A}\mathrm{d}},{\tilde{e}}_{0}^{\mathrm{A}\mathrm{d}})=0.\end{equation} \tag{ 45b }$

Using this procedure to match the initial frequencies we find that the linear-in-t growth of the difference in the orbital phases is removed and the phase difference grows quadratically in t as expected—see figure 9.

**Figure 9.** Difference in ϕ as a function of t between an adiabatic and a first order self-forced inspiral when either matching initial conditions or matching the initial Boyer–Lindquist frequencies. The self-forced inspiral has initial conditions $({\tilde{p}}_{0},{\tilde{e}}_{0})=(7.1,0.48)$ with mass ratio = 10⁻⁵. Matching initial conditions results in an error that grows linearly with t, while matching frequencies produces an error that is initially constant and then grows quadratically with t.
Download figure:
Standard image High-resolution image

**Figure 9.** Difference in ϕ as a function of t between an adiabatic and a first order self-forced inspiral when either matching initial conditions or matching the initial Boyer–Lindquist frequencies. The self-forced inspiral has initial conditions $({\tilde{p}}_{0},{\tilde{e}}_{0})=(7.1,0.48)$ with mass ratio = 10⁻⁵. Matching initial conditions results in an error that grows linearly with t, while matching frequencies produces an error that is initially constant and then grows quadratically with t.
Download figure:
Standard image High-resolution image

6.4. Comparing inspirals driven using radiation gauge and Lorenz gauge self-force in Schwarzschild spacetime

We now turn our attention to the special case of Schwarzschild (a = 0), where we now have interpolated GSF models calculated in two different gauges. In addition to our outgoing radiation gauge self-force model, we make use of an interpolated Lorenz gauge self-force from reference [31], which is valid in the domain 6 ⩽ p ⩽ 12 and 0 ⩽ e ⩽ 0.2. We apply the same NIT procedure to inspirals driven by this force model, and find agreement with inspirals calculated in paper I, up to the precision of the numerical solver.

To assess the accuracy of the dissipative self-force, we calculate the orbit averaged energy and angular momentum fluxes, and find that they agree with values from the literature with a relative error less than 10⁻³ for both models across the parameter space. To assess the accuracy of the conservative self-force, we calculate the periapsis advance in the circular orbit limit as outlined in [85] using the formula found in [88]. We find that both models show good agreement with the literature across the Lorenz gauge model's domain of validity, with the Lorenz gauge model producing errors less than 10⁻³ and the radiation gauge model producing relative errors less than 10⁻².

While we find good agreement between the two results for gauge invariant quantities, we see from figure 10 that the inspirals experience dramatically different conservative effects, depending on the gauge used. While in both cases, the conservative self-force acts against geodesic periapsis advance, we see that the evolution of q_r,0 depends heavily on the gauge involved, while the trajectories through p and e space are less affected. This is to be expected as the leading order averaged rates of change of p and e are related to the gauge invariant asymptotic fluxes, while the change in q_r,0 is induced entirely by the (gauge dependent) conservative self-force [27].

Just as when comparing adiabatic and self-forced inspirals it is important to match the initial frequencies (rather than the initial (p, e) values). We note that for the Lorenz gauge model, we must account for the fact that the perturbed time coordinate, $\hat{t}$ , is not asymptotically flat [99]. We can define an asymptotically flat time coordinate for Lorenz gauge inspirals via the following rescaling

$\begin{equation}t=(1+{\epsilon}\alpha )\hat{t},\end{equation} \tag{ 46 }$

where α is given by

$\begin{equation}\alpha (p,e)=-\frac{1}{2}{h}_{tt}^{(1)}(r\to \infty ).\end{equation} \tag{ 47 }$

We make use of a code provided to us by Akcay to numerically calculate this quantity for Lorenz gauge values of p and e [25, 100]. Equation (46) means the perturbed Boyer–Lindquist frequencies must also be rescaled by:

$\begin{equation}{{\Omega}}_{r}^{(\mathrm{L}\mathrm{G})}=(1-{\epsilon}\alpha )\frac{{{\Upsilon}}_{r}+{\epsilon}{\tilde{f}}_{r(LG)}^{(1)}}{{{\Upsilon}}_{t}+{\epsilon}{\tilde{s}}_{t(LG)}^{(1)}}\quad \text{and}\quad {{\Omega}}_{\phi }^{(\mathrm{L}\mathrm{G})}=(1-{\epsilon}\alpha )\frac{{{\Upsilon}}_{\phi }+{\epsilon}{\tilde{s}}_{\phi (LG)}^{(1)}}{{{\Upsilon}}_{t}+{\epsilon}{\tilde{s}}_{t(LG)}^{(1)}}.\end{equation} \tag{ 48 }$

In the radiation gauge model, the corresponding subtleties have been dealt with by including the gauge completion corrections, so the frequencies can be calculated using equation (44) as before. Thus, we can choose a value for ${\tilde{p}}_{0}^{(\mathrm{L}\mathrm{G})}$ and ${\tilde{e}}_{0}^{(\mathrm{L}\mathrm{G})}$ in Lorenz gauge and root find for values of ${\tilde{p}}_{0}^{(\mathrm{R}\mathrm{G})}$ and ${\tilde{e}}_{0}^{(\mathrm{R}\mathrm{G})}$ in radiation gauge that satisfy:

$\begin{equation}{{\Omega}}_{r}^{(\mathrm{R}\mathrm{G})}({\tilde{p}}_{0}^{(\mathrm{R}\mathrm{G})},{\tilde{e}}_{0}^{(\mathrm{R}\mathrm{G})})-{{\Omega}}_{r}^{(\mathrm{L}\mathrm{G})}({\tilde{p}}_{0}^{(\mathrm{L}\mathrm{G})},{\tilde{e}}_{0}^{(\mathrm{L}\mathrm{G})})=0,\end{equation} \tag{ 49a }$

$\begin{equation}{{\Omega}}_{\phi }^{(\mathrm{R}\mathrm{G})}({\tilde{p}}_{0}^{(\mathrm{R}\mathrm{G})},{\tilde{e}}_{0}^{(\mathrm{R}\mathrm{G})})-{{\Omega}}_{\phi }^{(\mathrm{L}\mathrm{G})}({\tilde{p}}_{0}^{(\mathrm{L}\mathrm{G})},{\tilde{e}}_{0}^{(\mathrm{L}\mathrm{G})})=0.\end{equation} \tag{ 49b }$

This allows us to make comparisons between inspirals driven by self-force models calculated in different gauges. We use an inspiral driven by the Lorenz-gauge force model with initial conditions (p₀, e₀) = (11, 0.18), mass ratio = 10⁻⁵ as our reference inspiral which should last just over two and a half years for a 10⁶ M_⊙ primary.

In figure 11, we see the difference in the phase of the waveform Φ as a function of time between the Lorenz gauge NIT inspiral, and a number of reference models. We make use of the relations between the NIT quantities and the waveform phases derived in reference [38] to find

$\begin{equation}{{\Phi}}_{r}={\tilde{q}}_{r}-{{\Omega}}_{r}^{\mathrm{S}\mathrm{F}}{Z}_{t}^{(0)}+\mathcal{O}({\epsilon})\quad \;\text{and}\quad {{\Phi}}_{\phi }=\tilde{\phi }-{{\Omega}}_{\phi }^{\mathrm{S}\mathrm{F}}{Z}_{t}^{(0)}+\mathcal{O}({\epsilon}).\end{equation} \tag{ 50 }$

**Figure 11.** The difference in the waveform phase Φ for various inspirals as a function of t when compared to NIT inspiral driven by a Lorenz gauge self-force model, with initial conditions (a, p₀, e₀) = (0, 11, 0.18), mass ratio = 10⁻⁵, viewing angles Θ = π/4 and Φ = 0, and sampled every Δt = 1M ∼ 5s. We also show the mismatch (MM) between the waveforms in each case. By matching the initial frequencies, we compare an inspiral calculated using a radiation gauge self-force model, an adiabatic inspiral, an inspiral with the adiabatic pieces of the Lorenz gauge model and conservative pieces from the radiation gauge model, and a Lorenz gauge model with a 10% relative error added to each conservative piece. In all cases the difference grows quadratically in time. This plot suggests that post-adiabatic waveforms calculated using only the first-order self-force differ significantly depending on the gauge used.
Download figure:
Standard image High-resolution image

We then feed the solutions for $\left\{\tilde{p}(t),\tilde{e}(t),{{\Phi}}_{r}(t),{{\Phi}}_{\phi }(t)\right\}$ into the FEW package to generate these eccentric Schwarzschild waveforms [17]. Finally, we make use of the $\mathtt{S}\mathtt{i}\mathtt{m}\mathtt{u}\mathtt{l}\mathtt{a}\mathtt{t}\mathtt{i}\mathtt{o}\mathtt{n}\mathtt{T}\mathtt{o}\mathtt{o}\mathtt{l}\mathtt{s}$ Mathematica package to calculate the mismatches and decompose the waveforms into a single evolving amplitude A(t) and phase Φ(t). This allows us to find the difference in the waveform phase ΔΦ(t) between the Lorenz gauge inspiral and the other inspiral calculations. We use this as our point of comparison as the waveform phase is an observable and thus a gauge invariant quantity.

We note that in each case, we see constant error which gives way to quadratic growth with t just as in figure 9. As we discussed in section 6.3, this shows that the initial frequencies were correctly matched. From the blue curve, we see that the NIT radiation gauge inspiral quickly goes out of phase with the Lorenz gauge NIT inspiral, resulting in a very large mismatch of 0.93. We found that the largest source of error here is due to interpolation error for in the adiabatic pieces of the NIT. Since these are related to the gauge invariant fluxes, these pieces should be identical in both models. As such, we can rectify this error by using the Lorenz gauge functions for the adiabatic pieces and continue to use the radiation gauge functions for the conservative pieces of the NIT equations of motion. The improvement is evident in the green curve, which shows much better agreement with the Lorenz gauge NIT inspiral, with the mismatch falling to 0.83. However, it is only slightly better than matching an adiabatic inspiral (orange curve) using equation (45) resulting in a mismatch of 0.86. Both radiation gauge and adiabatic inspirals go out of phase by almost 100 radians by the time they reach the last stable orbit.

In order to rule out the possibility of interpolation error of the conservative effects being the primary cause of this difference, we repeat the Lorenz gauge inspiral, but this time we manually add a relative error of δ = 0.1 to all of the conservative pieces of both the NIT equation of motion and our matching procedure for the initial conditions, e.g., $\dot{\tilde{{q}_{r}}}={{\Upsilon}}_{r}+{\epsilon}{\tilde{f}}_{r}^{(1)}\to {{\Upsilon}}_{r}+{\epsilon}\left({\tilde{f}}_{r}^{(1)}+\delta {\tilde{f}}_{r}^{(1)}\right)$ etc. We note that this is an order of magnitude larger than the 10⁻² error produced by the radiation gauge model when calculating the gauge invariant quasi-circular periapsis advance. From the red curve, we see that manually adding a constant 10% relative error results in phase difference and a mismatch (0.54) which is significantly smaller than what we observe between the two self-forced inspirals. This gives us confidence that this difference is not dominated by numerical error.

From these investigations, we infer that the trajectories driven using only the first order self-force are gauge dependent, and thus, so too are their waveforms. Since post-adiabatic waveforms are an observable quantity, this leads us to conclude that incorporating the orbit-averaged dissipative second-order self-force will be necessary to obtain gauge invariant, post-adiabatic waveforms. Moreover, since the difference between the radiation and Lorenz gauge self-forced inspirals is of the same magnitude as the difference with the adiabatic inspiral, we further conclude that the impact of the orbit-averaged dissipative second-order self-force must be of a similar magnitude in at least one of the two gauges.

7. Discussion

In this paper, we present the first self-forced inspirals in Kerr spacetime. We computed the self-force in the radiation gauge using the code of reference [27] and interpolated it over a region of the parameter space of eccentric, equatorial orbits using Chebyshev interpolation. Our model achieves sub-percent accuracy for the self-force across the two dimensional parameter space using only 105 points is a substantial improvement over cubic spline interpolation which would require $\mathcal{O}(1{0}^{3})$ points to achieve a comparable level of accuracy. So far we have applied our method to strong-field regions of the parameter space for three values of the primary's spin (a = 0, ±0.9M). It remains as future work to interpolate over the spin of the primary, however, the Chebyshev interpolation method appears to be a promising approach to tiling data from expensive GSF codes across the four-dimensional generic Kerr parameter space. This method could be further improved with the aid of a detailed of the study of the analytic structure of the GSF near the last stable orbit.

With an interpolated self-force model in hand, we computed inspirals using an action-angle formulation of the method of OG. This approach is sketched in reference [34] and in we implement these equations of motion for generic (eccentric and inclined) inspirals about a Kerr black hole. Our Mathematica implementation will be made publicly available on the Black Hole Perturbation Toolkit [49]. For a binary with (small) mass ratio, , numerically solving the OG equations takes minutes to hours due to the need to resolve the $\sim 1/{\epsilon}$ oscillations in the orbital elements. To overcome this, we follow paper I [36] and apply near-identity (averaging) transformations (NIT) which produce equations of motion that capture the correct long-term secular evolution of the binary but can also be rapidly numerically solved.

As a test of this formulation, we applied it to our eccentric, equatorial self-forced inspirals. We showed that our NIT'd quantities remain close to the original evolution variables throughout the inspiral at the expected order in the mass-ratio. When the mass ratio is greater than 1:300, we find the difference between year-long NIT and OG inspirals becomes significant for data analysis, reinforcing the findings of reference [38]. Note, however, that a priori it is not known which (the NIT or OG inspiral) is closer to the true inspiral, since both are accurate to the same order in the mass-ratio.

With our efficient NIT model of eccentric, equatorial inspirals we explored the effects of the GSF. We find that prograde inspirals around a rapidly rotating black hole generally experience an additional periastron advance on top of the periastron advance induced by geodesic motion. This is in contrast to the 'periastron retreat' experienced by retrograde inspirals and inspirals around non-rotating black holes [101].

The NIT equations of motion make it convenient to compare inspirals both with and without post-adiabatic effects included and we confirmed that without post-adiabatic effects, the orbital phases of a typical EMRI will incur an error of order $\mathcal{O}({{\epsilon}}^{0})$ . Moreover, by comparing inspirals under the influence of self-force models calculated in different gauges, we find that the resulting trajectories are gauge dependent. This difference due to gauge causes a de-phasing that is comparable in magnitude to not including any post-adiabatic effects. This suggests that in order to obtain gauge invariant post-adiabatic waveforms, one must also include second order self-force results. Second order self-force calculations are presently made using a two-timescale framework [30, 102]. For the equations of motion, this framework is related to the NITs [35, 37], but more work is required in order to explicitly transform the two-timescale results into forcing terms that could be used in the framework presented in this paper. Many of the averaging techniques developed in this work are also useful for the two-timescale approach [35].

For complete post-adiabatic waveforms, one would also need to include the spin of the secondary. Inspirals incorporating the leading conservative spin induced effects around a non-rotating primary have been calculated [103] and the effect of (anti-)aligned secondary spin on the energy and angular momentum fluxes have recently been computed for eccentric orbits [104]. These effects can readily be incorporated into the NIT framework.

Another natural extension of our work is to non-equatorial orbital motion. We already have results for spherical inspirals that will be the topic of an upcoming paper. After that we plan to tackle generic orbits, but there are two major barriers to this. The first is the larger parameter space which will make calculating the self-force extremely expensive [83]. Our Chebyshev interpolation method should help to reduce the number of points in the parameter space where the self-force needs to be calculated. The second barrier is the presence of orbital resonances [105–109].

Near these resonances the NITs break down and an alternative averaging procedure over the resonance timescale needs to be applied [35]. While a lot is known about the effects of resonances on EMRI trajectories [105–110], rapidly computing inspiral trajectories while incorporating all resonant effects remains an open challenge.

Finally, we note that in this paper we use the leading-order quadrupole formula to generate the waveforms from the OG and NIT inspirals. This is sufficient for our purposes where we wish to compare waveforms from OG and NIT inspirals but for LISA data analysis we will want to use relativistic waveform amplitudes. These were recently efficiently interpolated in reference [17] for Schwarzschild inspirals. That work used orbit-averaged fluxes to drive the inspirals but it would be straightforward to use a NIT inspiral instead. Once the waveform amplitudes have been interpolated for Kerr inspirals it could be combined immediately with the implementation presented in this work.

Acknowledgments

PL acknowledges support from the Irish Research Council under Grant GOIPG/2018/1978. NW acknowledges support from a Royal Society-Science Foundation Ireland Research Fellowship. This publication has emanated from research conducted with the financial support of Science Foundation Ireland under Grant Number 16/RS-URF/3428. We thank Ian Hinder and Barry Wardell for the SimulationTools analysis package. This work makes use of the Black Hole Perturbation Toolkit.

Data availability statement

The data that support the findings of this study are available upon reasonable request from the authors.

Appendix A.: Geodesic motion

We present here relations for quantities that were key to deriving our action-angle formulation for the method of OGs. Two of the radial roots the potential V_r and one root of the polar potential V_z are given in equations (9). The remaining roots are given by [42]

$\begin{equation}{r}_{3}=\frac{M}{1-{\mathcal{E}}^{2}}-\frac{{r}_{1}+{r}_{2}}{2}+\sqrt{{\left(\frac{{r}_{1}+{r}_{2}}{2}-\frac{M}{1-{\mathcal{E}}^{2}}\right)}^{2}-\frac{{a}^{2}\mathcal{Q}}{{r}_{1}{r}_{2}(1-{\mathcal{E}}^{2})}}\end{equation} \tag{ A1a }$

$\begin{equation}{r}_{4}=\frac{{a}^{2}\mathcal{Q}}{{r}_{1}{r}_{2}{r}_{3}(1-{\mathcal{E}}^{2})},\end{equation} \tag{ A1b }$

$\begin{equation}{z}_{+}=\sqrt{{a}^{2}(1-{\mathcal{E}}^{2})+\frac{{\mathcal{L}}^{2}}{1-{z}_{-}^{2}}}.\end{equation} \tag{ A1c }$

Using action angles also has the advantage of providing analytic solutions for the t and ϕ coordinates of the secondary, which take the form

$\begin{equation}t(\lambda )={{\Upsilon}}_{t}\lambda +{t}_{r}({q}_{r})+{t}_{z}({q}_{z})\quad \text{and}\quad \phi (\lambda )={{\Upsilon}}_{\phi }\lambda +{\phi }_{r}({q}_{r})+{\phi }_{z}({q}_{z}),\end{equation} \tag{ A2a-b }$

where ϒ_t and ϒ_ϕ are the Mino-time fundamental frequencies, t_r and ϕ_r are periodic functions of q_r, and t_z and ϕ_z are periodic functions of q_z. The explicit expressions for these functions, and the Mino-time frequencies, can be found in references [42, 46], and are implemented in KerrGeodesics Mathematica package of the Black Hole Perturbation Toolkit [49].

In this work we make use of analytic solutions to the geodesic equations written interms of the action angles for the orbital phases $\vec{q}=\left\{{q}_{r},{q}_{z}\right\}$ . These were first derived in reference [42] and then presented in a simplified form in reference [46]. The radial and polar solutions to the geodesic equations are given by

$\begin{equation}r({q}_{r})=\frac{{r}_{3}({r}_{1}-{r}_{2}){\mathrm{s}\mathrm{n}}^{2}\left(\frac{K({k}_{r})}{\pi }{q}_{r}\vert {k}_{r}\right)-{r}_{2}({r}_{1}-{r}_{3})}{({r}_{1}-{r}_{2}){\mathrm{s}\mathrm{n}}^{2}\left(\frac{K({k}_{r})}{\pi }{q}_{r}\vert {k}_{r}\right)-({r}_{1}-{r}_{3})}\end{equation} \tag{ A3 }$

and

$\begin{equation}z({q}_{z})={z}_{-}\mathrm{s}\mathrm{n}\left(K({k}_{z})\frac{2\left({q}_{z}+\frac{\pi }{2}\right)}{\pi }\vert {k}_{z}\right),\end{equation} \tag{ A4 }$

where sn is Jacobi elliptic sine function, K is complete elliptic integral of the first kind, and

$\begin{equation}{k}_{r}=\frac{({r}_{1}-{r}_{2})({r}_{3}-{r}_{4})}{({r}_{1}-{r}_{3})({r}_{2}-{r}_{4})},\quad \text{and}\quad {k}_{z}={a}^{2}(1-{\mathcal{E}}^{2})\frac{{z}_{-}^{2}}{{z}_{+}^{2}}.\end{equation} \tag{ A5a-b }$

Appendix B.: Evolution equations for the integrals of motion

Our goal for this section is to derive evolution equations for the integrals of motion $\vec{P}=\left\{p,e,x\right\}$ . To do so, we must first consider how a different set of integrals of motion $\vec{\mathcal{P}}=\left\{\mathcal{E},\mathcal{L},\mathcal{K}\right\}$ evolve in terms of the covariant components of the particle's four-acceleration {a_t, a_r, a_z, a_ϕ}.

Using the second OG equations (12b) along with definitions of $\mathcal{E}$ and $\mathcal{L}$ in equation (5), we can obtain the evolution equations for $\mathcal{E}$ and $\mathcal{L}$ :

$\begin{equation}\frac{\mathrm{d}\mathcal{E}}{\mathrm{d}\lambda }=-{\Sigma}{a}_{t},\quad \text{and}\quad \frac{\mathrm{d}\mathcal{L}}{\mathrm{d}\lambda }={\Sigma}{a}_{\phi }.\end{equation} \tag{ B1a-b }$

To find the evolution of $\mathcal{K}$ , we note that the contravariant components of the Killing tensor can be written as [34]

$\begin{equation}{\mathcal{K}}^{\alpha \beta }=2{\Sigma}{l}^{\left(\right.\alpha }{n}^{\beta \left.\right)}+{r}^{2}{g}^{\alpha \beta },\end{equation} \tag{ B2 }$

where $\vec{l}$ and $\vec{n}$ are null vectors with components

$\begin{equation}\vec{l}=\frac{{\varpi }^{2}}{{\Delta}}{\partial }_{t}+{\partial }_{r}+\frac{a}{{\Delta}}{\partial }_{\phi }\quad \text{and}\quad \vec{n}=\frac{{\varpi }^{2}}{2{\Sigma}}{\partial }_{t}-\frac{{\Delta}}{2{\Sigma}}{\partial }_{r}+\frac{a}{2{\Sigma}}{\partial }_{\phi }.\end{equation} \tag{ B3a-b }$

Taking the derivative of $\mathcal{K}$ from equations (5c) with respect to proper time gives us

$\begin{equation}\frac{\mathrm{d}\mathcal{K}}{\mathrm{d}\tau }={\mathcal{K}}^{\alpha \beta }{u}_{\alpha }{a}_{\beta }.\end{equation} \tag{ B4 }$

Expanding this out explicitly while making use of the orthogonality condition, g^αβ u_α a_β = 0 and converting to Mino time gives us:

$\begin{equation}\frac{\mathrm{d}\mathcal{K}}{\mathrm{d}\lambda }=\frac{\mathrm{d}\mathcal{E}}{\mathrm{d}\lambda }\frac{2}{{\Delta}}\left({\varpi }^{4}\mathcal{E}-a{\varpi }^{2}\mathcal{L}\right)+\frac{\mathrm{d}\mathcal{L}}{\mathrm{d}\lambda }\frac{2}{{\Delta}}\left({a}^{2}\mathcal{L}-a{\varpi }^{2}\mathcal{E}\right)-2{\Sigma}{\Delta}{u}_{r}{a}_{r}.\end{equation} \tag{ B5 }$

Using the above equations, we can express how the roots {r₁, r₂, z₋} evolve with Mino time by exploiting the same trick as in appendices A.2 and A.3 of [34]. First, we note that, using the chain rule, we can express the rate of change of r₁ or r₂ as

$\begin{equation}\frac{\mathrm{d}{r}_{1,2}}{\mathrm{d}\lambda }=\frac{\partial {r}_{1,2}}{\partial {\mathcal{P}}_{j}}\frac{\mathrm{d}{\mathcal{P}}_{j}}{\mathrm{d}\lambda }.\end{equation} \tag{ B6 }$

We then find expressions for $\partial {r}_{1,2}/\partial {\mathcal{P}}_{j}$ be differentiating V_r(r) with respect to ${\mathcal{P}}_{j}$ .

$\begin{equation}{\left.\frac{\partial {V}_{r}}{\partial {\mathcal{P}}_{j}}\right\vert }_{\begin{subarray}{c}r={r}_{1}\end{subarray}}=(1-{\mathcal{E}}^{2})({r}_{1}-{r}_{2})({r}_{1}-{r}_{3})({r}_{1}-{r}_{4})\frac{\partial {r}_{1}}{\partial {\mathcal{P}}_{j}},\end{equation} \tag{ B7a }$

$\begin{equation}{\left.\frac{\partial {V}_{r}}{\partial {\mathcal{P}}_{j}}\right\vert }_{\begin{subarray}{c}r={r}_{2}\end{subarray}}=-(1-{\mathcal{E}}^{2})({r}_{1}-{r}_{2})({r}_{2}-{r}_{3})({r}_{2}-{r}_{4})\frac{\partial {r}_{2}}{\partial {\mathcal{P}}_{j}}.\end{equation} \tag{ B7b }$

We then note that the coefficients proceeding $\partial {r}_{1,2}/\partial {\mathcal{P}}_{j}$ are also obtained by differentiating V_r with respect to r and then evaluating at r_1,2, i.e.,

$\begin{equation}{\left.\frac{\partial {V}_{r}}{\partial {\mathcal{P}}_{j}}\right\vert }_{\begin{subarray}{c}{r}_{1,2}\end{subarray}}=-\kappa ({r}_{1,2})\frac{\partial {r}_{1,2}}{\partial {\mathcal{P}}_{j}},\end{equation} \tag{ B8 }$

where we have defined

$\begin{equation}\kappa (r){:=}\frac{\mathrm{d}{V}_{r}}{\mathrm{d}r}=4\mathcal{E}F(r)r-2r{\Delta}(r)-2(r-M)({r}^{2}+\mathcal{K}),\end{equation} \tag{ B9 }$

$\begin{equation}F(r){:=}\varpi {(r)}^{2}\mathcal{E}-a\mathcal{L}.\end{equation} \tag{ B10 }$

Combining equations (B6) and (B8) and using the appropriate definition of V_r from equation (7a) to calculate the partial derivatives gives us our evolution equations for r₁ and r₂:

$\begin{equation}\frac{\mathrm{d}{r}_{1,2}}{\mathrm{d}\lambda }=-\frac{2F({r}_{1,2})}{\kappa ({r}_{1,2})}\left(\varpi {({r}_{1,2})}^{2}\frac{\mathrm{d}\mathcal{E}}{\mathrm{d}\lambda }-a\frac{\mathrm{d}\mathcal{L}}{\mathrm{d}\lambda }\right)+\frac{{\Delta}({r}_{1,2})}{\kappa ({r}_{1,2})}\frac{\mathrm{d}\mathcal{K}}{\mathrm{d}\lambda }.\end{equation} \tag{ B11 }$

We can use similar steps as above to find the evolution of z₋. Again, the chain rule tells us that the evolution of z₋ follows

$\begin{equation}\frac{\mathrm{d}{z}_{-}}{\mathrm{d}\lambda }=\frac{\partial {z}_{-}}{\partial {\mathcal{P}}_{j}}\frac{\mathrm{d}{\mathcal{P}}_{j}}{\mathrm{d}\lambda }.\end{equation} \tag{ B12 }$

We then use ${\left.\frac{\partial {V}_{z}}{\partial {\mathcal{P}}_{j}}\right\vert }_{\begin{subarray}{c}{z}_{-}\end{subarray}}$ along with the second definition V_z in equation (7b) to find an expression for $\frac{\partial {z}_{-}}{\partial {P}_{j}}$

$\begin{equation}{\left.\frac{\partial {V}_{z}}{\partial {\mathcal{P}}_{j}}\right\vert }_{\begin{subarray}{c}{z}_{-}\end{subarray}}=-2{z}_{-}(\beta {z}_{-}^{2}-{z}_{+}^{2})\frac{\partial {z}_{-}}{\partial {\mathcal{P}}_{j}}.\end{equation} \tag{ B13 }$

However, using the first definition of V_z in terms of $\left\{\mathcal{E},\mathcal{L},\mathcal{Q}\right\}$ gives us the following explicit expressions for ${\left.\frac{\partial {V}_{z}}{\partial {\mathcal{P}}_{j}}\right\vert }_{\begin{subarray}{c}{z}_{-}\end{subarray}}$

$\begin{equation}{\left.\frac{\partial {V}_{z}}{\partial \mathcal{E}}\right\vert }_{\begin{subarray}{c}{z}_{-}\end{subarray}}=2{a}^{2}\mathcal{E}{z}_{-}^{2}(1-{z}_{-}^{2}),\qquad {\left.\frac{\partial {V}_{z}}{\partial \mathcal{L}}\right\vert }_{\begin{subarray}{c}{z}_{-}\end{subarray}}=-2\mathcal{L}{z}_{-}^{2},\quad \text{and}\quad {\left.\frac{\partial {V}_{z}}{\partial \mathcal{Q}}\right\vert }_{\begin{subarray}{c}{z}_{-}\end{subarray}}=1-{z}_{-}^{2}.\end{equation} \tag{ B14a-c }$

Combining the results from equations (B12), (B13), (B14a–c) gives us

$\begin{equation}\frac{2{z}_{-}({z}_{+}-\beta {z}_{-})}{(1-{z}_{-}^{2})}\frac{\mathrm{d}{z}_{-}}{\mathrm{d}\lambda }=\frac{\mathrm{d}\mathcal{Q}}{\mathrm{d}\lambda }-2\mathcal{L}\left(\frac{{z}_{-}^{2}}{1-{z}_{-}^{2}}\right)\frac{\mathrm{d}\mathcal{L}}{\mathrm{d}\lambda }+2{a}^{2}\mathcal{E}{z}_{-}^{2}\frac{\mathrm{d}\mathcal{E}}{\mathrm{d}\lambda }.\end{equation} \tag{ B15 }$

Since we have expressions for the evolution of $\left\{\mathcal{E},\mathcal{L},\mathcal{K}\right\}$ , we can derive and expression for the evolution of $\mathcal{Q}$ by taking the derivative of equation (6) with respect to λ:

$\begin{equation}\frac{\mathrm{d}\mathcal{Q}}{\mathrm{d}\lambda }=\frac{\mathrm{d}\mathcal{K}}{\mathrm{d}\lambda }-2(\mathcal{L}\;-\;a\mathcal{E})\left(\frac{\mathrm{d}\mathcal{L}}{\mathrm{d}\lambda }-a\frac{\mathrm{d}\mathcal{E}}{\mathrm{d}\lambda }\right).\end{equation} \tag{ B16 }$

Combing these two results and simplifying yields our final expression for the evolution of z₋

$\begin{equation}\frac{\mathrm{d}{z}_{-}}{\mathrm{d}\lambda }=\frac{1}{2{z}_{-}({z}_{+}^{2}-\beta {z}_{-}^{2})}\left({x}^{2}\frac{\mathrm{d}\mathcal{K}}{\mathrm{d}\lambda }-2\left(\mathcal{L}\;-\;a{x}^{2}\mathcal{E}\right)\left(\frac{\mathrm{d}\mathcal{L}}{\mathrm{d}\lambda }-a{x}^{2}\frac{\mathrm{d}\mathcal{E}}{\mathrm{d}\lambda }\right)\right),\end{equation} \tag{ B17 }$

where we have used equations (9c) to tidy up the final expression.

Now that we know how {r₁, r₂, z₋} evolve, determining the evolution of {p, e, x} is straightforward since we can convert from one set to the other using the relations

$\begin{equation}p=\frac{2{r}_{1}{r}_{2}}{M({r}_{1}+{r}_{2})},\qquad e=\frac{{r}_{1}-{r}_{2}}{{r}_{1}+{r}_{2}},\quad \text{and}\quad x=\pm \sqrt{1-{z}_{-}^{2}}.\end{equation} \tag{ B18a-c }$

We can then take the derivative of these equations with respect to λ and use the chain rule to obtain equation (13).

Appendix C.: Evolution of the orbital phases

While it is straightforward to obtain equation (15) from the first OG equations (12a) this equation is difficult to evaluate numerically at the turning points of the orbit, i.e., when $\partial {x}_{\mathrm{G}}^{i}/\partial {q}_{i}=0$ . As such, following the procedure described in reference [34], we derive an equivalent evolution equation for the initial phases which is finite at the turning points.

We start by considering the definition of the geodesic potentials:

$\begin{equation}{V}_{i}({x}^{i},\vec{P})={\left(\frac{\mathrm{d}{x}^{i}}{\mathrm{d}\lambda }\right)}^{2}.\end{equation} \tag{ C1 }$

If we add together the derivative of both sides of this expression with respect to P_j and then multiply both sides by $\dot{{P}_{j}}$ , one obtains:

$\begin{equation}\frac{\partial {V}_{i}}{\partial {x}^{i}}\left(\frac{\partial {x}^{i}}{\partial {P}_{j}}\dot{{P}_{j}}\right)+\frac{\partial {V}_{i}}{\partial {P}_{j}}\dot{{P}_{j}}=2{\dot{x}}^{i}\left(\frac{\partial {\dot{x}}^{i}}{\partial {P}_{j}}\dot{{P}_{j}}\right).\end{equation} \tag{ C2 }$

Rearranging and plugging in equation (15) allows us to write

$\begin{equation}\frac{\partial V}{\partial {x}^{i}}\frac{\partial {x}^{i}}{\partial {q}_{i}}{\dot{q}}_{i0}=\frac{\partial {V}_{i}}{\partial {P}_{j}}\dot{{P}_{j}}-2{\dot{x}}^{i}\left(\frac{\partial {\dot{x}}^{i}}{\partial {P}_{j}}\dot{{P}_{j}}\right).\end{equation} \tag{ C3 }$

We also note that taking the derivative of (C1) with respect to Mino time λ and rearranging yields

$\begin{equation}\frac{\partial {V}_{i}}{\partial {P}_{j}}\dot{{P}_{j}}=2{\dot{x}}^{i}{\ddot{x}}^{i}-\frac{\partial {V}_{i}}{\partial {x}^{i}}{\dot{x}}^{i}.\end{equation} \tag{ C4 }$

Rearranging this and subbing into equation (C3) and simplifying gives us

$\begin{equation}\frac{\partial V}{\partial {x}_{i}}\frac{\partial {x}^{i}}{\partial {q}_{i}}{\dot{q}}_{i0}=2{\dot{x}}^{i}\left(\left[{\ddot{x}}^{i}-\frac{1}{2}\frac{\partial {V}_{i}}{\partial {x}^{i}}\right]-\left(\frac{\partial {\dot{x}}^{i}}{\partial {P}_{j}}\dot{{P}_{j}}\right)\right).\end{equation} \tag{ C5 }$

We note that the square bracket term will vanish for geodesics. For perturbed orbits this means that this term will be proportional to the component of the four-acceleration aⁱ scaled by a factor of Σ² to compensate for taking derivatives with respect to λ instead of τ. When evaluating this expression, we make use of the osculating condition ${x}^{i}(\lambda )={x}_{\mathrm{G}}^{i}(\lambda )$ . This leads is to the simplification

$\begin{equation}{\dot{x}}^{i}={\dot{x}}_{\mathrm{G}}^{i}=\frac{\partial {x}_{\mathrm{G}}^{i}}{\partial {q}_{i}}\frac{\mathrm{d}{q}_{i}}{\mathrm{d}\lambda }=\frac{\partial {x}_{\mathrm{G}}^{i}}{\partial {q}_{i}}{{\Upsilon}}_{i}.\end{equation} \tag{ C6 }$

Combining these results with equation (C5) gives us our final expression for the evolution of the initial phases which is regular at the turning points, as expressed in equation (16).

Appendix D.: Self-force corrections to the periapsis advance around a spinning black hole

The periapsis advance is a observable quantity that has been used to compare models of compact binary dynamics [101]. The effect of the GSF on this observable for quasi-circular EMRIs around a rotating primary was explored in reference [88]. One important insight, that is present in the supplemental material of that work, is the effect that the spin of the primary and orbital radius has on the self-force correction to the rate of periapsis advance. For completeness, we highlight this result in this appendix.

For quasi-circular inspirals the relation between the dimensionless quantity $W={{\Omega}}_{r}^{2}/{{\Omega}}_{\phi }^{2}$ and Ω_ϕ is an important benchmark for comparing between different calculational approaches to the two-body problem [88, 101, 111]. The linear in mass ratio correction to the quantity is defined via

$\begin{equation}W({\epsilon};a,{{\Omega}}_{\phi })=W(0;a,{{\Omega}}_{\phi })+{\epsilon}\rho (a,{{\Omega}}_{\phi })+\mathcal{O}({{\epsilon}}^{2}),\end{equation} \tag{ D1 }$

where W(0; a, Ω_ϕ) is the background value for the periapsis advance, and ρ(a, Ω_ϕ) is the correction induced by the first-order GSF.

Figure 12 demonstrates how ρ varies as a function of orbital radius r and the spin of the primary a. We plot the ratio r_ISCO/r, where r_ISCO is the radius of the ISCO. This ratio is convenient for plotting the results as goes from 1 at the ISCO for all spin values and asymptotically approaches zero as r grows large. As one would expect, the plot demonstrates that this correction grows larger as the radius of the inspiral approaches the ISCO. This correction is positive for all retrograde orbits and in the strong field for prograde orbits. This means that self-force typically acts against the periapsis advance caused by the background geodesic motion, resulting in a reduction of the observed periapsis advance of the binary. However, for positive spins and at large radii, there is a region of the parameter space (in blue) where this correction is negative, meaning that the self-force increases the observed rate of periapsis advance compared to the background geodesic motion. The larger the spin, the smaller the radii at which this effect occurs. As such, this effect is most prominent for prograde orbits around rapidly rotating black holes.

We find that the effect of the conservative self-force on the orbital phase for eccentric inspirals is consistent with the sign of the self-force induced rate of periastron advance, ρ, in the quasi-circular limit—see section 6.3.

Eccentric self-forced inspirals into a rotating black hole

Article metrics

Submit

Author e-mails

Author affiliations

Author notes

ORCID iDs

Dates

Peer review information

Abstract

1. Introduction

2. Forced motion near a rotating black hole

2.1. Geodesic motion and orbital parameterization

2.2. Osculating geodesics

2.3. Specialising to equatorial motion

3. Near-identity transformations

3.1. Near identity averaging transformations for generic EMRI systems

3.2. Averaged equations of motion for eccentric equatorial Kerr inspirals

4. Gravitational self-force for eccentric Kerr inspirals

4.1. Gravitational self-force

4.2. Interpolation method

5. Implementation

5.1. Offline steps

5.2. Online steps

6. Results

6.1. Consistency checks

6.2. Comparison between OG and NIT inspirals

6.3. Impact of adiabatic and post-adiabatic effects

6.4. Comparing inspirals driven using radiation gauge and Lorenz gauge self-force in Schwarzschild spacetime

7. Discussion

Acknowledgments

Data availability statement

Appendix A.: Geodesic motion

Appendix B.: Evolution equations for the integrals of motion

Appendix C.: Evolution of the orbital phases

Appendix D.: Self-force corrections to the periapsis advance around a spinning black hole

Footnotes

Eccentric self-forced inspirals into a rotating black hole

Article metrics

Submit

Share this article

Author e-mails

Author affiliations

Author notes

ORCID iDs

Dates

Peer review information

Abstract

1. Introduction

2. Forced motion near a rotating black hole

2.1. Geodesic motion and orbital parameterization

2.2. Osculating geodesics

2.3. Specialising to equatorial motion

3. Near-identity transformations

3.1. Near identity averaging transformations for generic EMRI systems

3.2. Averaged equations of motion for eccentric equatorial Kerr inspirals

4. Gravitational self-force for eccentric Kerr inspirals

4.1. Gravitational self-force

4.2. Interpolation method

5. Implementation

5.1. Offline steps

5.2. Online steps

6. Results

6.1. Consistency checks

6.2. Comparison between OG and NIT inspirals

6.3. Impact of adiabatic and post-adiabatic effects

6.4. Comparing inspirals driven using radiation gauge and Lorenz gauge self-force in Schwarzschild spacetime

7. Discussion

Acknowledgments

Data availability statement

Appendix A.: Geodesic motion

Appendix B.: Evolution equations for the integrals of motion

Appendix C.: Evolution of the orbital phases

Appendix D.: Self-force corrections to the periapsis advance around a spinning black hole

Footnotes