Topical Review

Black holes, gravitational waves and fundamental physics: a roadmap

, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and

Published 19 June 2019 © 2019 IOP Publishing Ltd
, , Citation Leor Barack et al 2019 Class. Quantum Grav. 36 143001 DOI 10.1088/1361-6382/ab0587

0264-9381/36/14/143001

Abstract

The grand challenges of contemporary fundamental physics—dark matter, dark energy, vacuum energy, inflation and early universe cosmology, singularities and the hierarchy problem—all involve gravity as a key component. And of all gravitational phenomena, black holes stand out in their elegant simplicity, while harbouring some of the most remarkable predictions of General Relativity: event horizons, singularities and ergoregions.

The hitherto invisible landscape of the gravitational Universe is being unveiled before our eyes: the historical direct detection of gravitational waves by the LIGO-Virgo collaboration marks the dawn of a new era of scientific exploration. Gravitational-wave astronomy will allow us to test models of black hole formation, growth and evolution, as well as models of gravitational-wave generation and propagation. It will provide evidence for event horizons and ergoregions, test the theory of General Relativity itself, and may reveal the existence of new fundamental fields. The synthesis of these results has the potential to radically reshape our understanding of the cosmos and of the laws of Nature.

The purpose of this work is to present a concise, yet comprehensive overview of the state of the art in the relevant fields of research, summarize important open problems, and lay out a roadmap for future progress. This write-up is an initiative taken within the framework of the European Action on 'Black holes, Gravitational waves and Fundamental Physics'.

Export citation and abstract BibTeX RIS

Glossary

Here we provide an overview of the acronyms used throughout this paper and also in common use in the literature.

BBHBinary black hole
BHBlack hole
BNSBinary neutron star
BSSNBaumgarte-Shapiro-Shibata-Nakamura
CBMCompact binary mergers
CMBCosmic microwave background
DMDark matter
ECOExotic Compact Object
EFTEffective Field theory
EMRIExtreme-mass-ratio inspiral
EOBEffective One Body model
EOSEquation of state
eVelectron Volt
GRGeneral Relativity
GSFGravitational self-force
GRBGamma-ray burst
GWGravitational Wave
HMNSHypermassive neutron star
IMBHIntermediate-mass black hole
IVPInitial Value Problem
LVCLIGO Scientific and Virgo Collaborations
MBHMassive black hole
NKNumerical kludge model
NSBNeutron star binary
NSNeutron star
NRNumerical Relativity
PBHPrimordial black hole
PNPost-Newtonian
PMPost-Minkowskian
QNMQuasinormal modes
sBHBlack hole of stellar origin
SGWBStochastic GW background
SMStandard Model
SMBBHSupermassive binary black hole
SOBBHStellar-origin binary black hole
SNRSignal-to-noise ratio
STScalar-tensor

Preface

L Barack, V Cardoso, S Nissanke, T Sotiriou (editors)

The long-held promise of gravitational-wave astronomy as a new window onto the universe has finally materialized with the dramatic discoveries of the LIGO-Virgo collaboration in the past few years. We have taken but the first steps along a new, exciting avenue of exploration that has now opened before us. The questions we will tackle in the process are cross-cutting and multidisciplinary, and the answers we will get will no doubt reshape our understanding of black-hole-powered phenomena, of structure formation in the universe, and of gravity itself, at all scales.

The harvesting of useful information from gravitational-wave (GW) signals and the understanding of its broader implications demand a cross-disciplinary effort. What exactly will GWs tell us about how, when and in which environment black holes were formed? How fast do black holes spin and how have some of them grown to become supermassive? GWs from merging black holes probe the environment in which they reside, potentially revealing the effect of dark matter or new fundamental degrees of freedom. The analysis of GWs will allow for precise tests of General Relativity, and of the black hole paradigm itself. However, to be able to collect and interpret the information encoded in the GWs, one has to be equipped with faithful and accurate theoretical models of the predicted waveforms. To accomplish the far-reaching goals of gravitational-wave science it is of paramount importance to bring together expertise over a very broad range of topics, from astrophysics and cosmology, through general-relativistic source modelling to particle physics and other areas of fundamental science.

In 2016, a short time before the announcement of the first gravitational-wave detection, a cross-disciplinary initiative in Europe led to the establishment of the new COST networking Action on 'Black holes, gravitational waves and fundamental physics' ('GWverse'). GWverse aims to maintain and consolidate leadership in black-hole physics and gravitational-wave science, linking three scientific communities that are currently largely disjoint: one specializing in gravitational-wave detection and analysis, another in black-hole modelling (in both astrophysical and general-relativistic contexts), and a third in strong-gravity tests of fundamental physics. The idea is to form a single, interdisciplinary exchange network, facilitating a common language and a framework for discussion, interaction and learning. The Action will support the training of the next generation of leaders in the field, and the very first 'native' GW/multi-messenger astronomers, ready to tackle the challenges of high-precision GW astronomy with ground and space-based detectors.

Chapter I:. The astrophysics of compact object mergers: prospects and challenges

Editor: S Nissanke

1. Introduction

In the last two years, strong-field gravity astrophysics research has been undergoing a momentous transformation thanks to the recent discoveries of five binary black hole (BBH) mergers that were observed in gravitational waves (GWs) by the LIGO and Virgo detectors. This was compounded last year by the multi-messenger discovery of a binary neutron star (BNS) merger measured in both GWs and detected in every part of the electromagnetic (EM) spectrum, allowing us to place compact object mergers in their full astrophysical context. These measurements have opened up an entirely new window onto the Universe, and given rise to a new rapidly growing and observationally-driven field of GW astrophysics.

Despite the multiple scientific breakthroughs and 'firsts' that these discoveries signify, the measured properties of the BBH and BNS mergers have immediately bought up accompanying challenges and pertinent questions to the wider astrophysics community as a whole. Here, we aim to provide an up-to-date and encompassing review of the astrophysics of compact object mergers and future prospects and challenges. Section 2 first introduces and briefly details the LIGO and Virgo observations of BBH and BNS mergers. Section 3 then discusses the astrophysics of BHs, in particular BBHs, from their genesis to archeology, for BHs that span more than ten decades in their mass range. In the case of stellar-mass BBH mergers, section 4 details the formation of compact binary mergers, in particular BBHs, through isolated stellar binary evolution. Section 5 then reviews how one could dynamically form such events in order to explain the observed merger rate and distribution of masses, mass ratios, and spins. Section 6 explores the intriguing possibility that at least a fraction of dark matter (DM) in the Universe is in the form of primordial BHs (PBHs), an area that has recently been invigorated by the recent LIGO and Virgo observations of BBH mergers. Section 7 presents an overview on the formation of supermassive BBHs through galaxy mergers, and section 8 introduces efforts underway to probe the astrophysics of such supermassive BBHs with pulsar timing arrays. Turning our attention to the mergers themselves as multi-messenger sources, section 9 reviews the state-of-the-art numerical modelling of compact object mergers, in particular, systems with NSs in which we have already observed accompanying EM radiation. For detailed and exhaustive discussion of the modelling of BBH mergers, we refer the reader to chapter II. Section 10 provides a summary of the observational efforts by a wide range of facilities and instruments in following up GW mergers in light of the first BNS merger discovery measured in both GWs and EM. Focusing entirely on EM observations, section 12 reviews observations of active galactic nuclei as probes of BBH systems and section 11 concludes by summarising recent advances in high-energy observations of x-ray binaries. Finally, section 13 provides an extensive review on how observations of GWs can impact the field of cosmology, that is, in our understanding of the origins, evolution and fate of the Universe.

2. LIGO and Virgo observations of binary black hole mergers and a binary neutron star

Contributors: E Porter, M Hendry, I S Heng

On September 15 2014 the discovery of GWs from the merger of two BHs during the first advanced detectors era run, commonly called O1, by the two LIGO observatories heralded the dawn of GW astronomy [1]. This event was quickly followed up by two other BBH mergers: one of lower significance on October 12, 2015, and another on December 26, 2015 [2, 3]; see table 1 for the source properties of the published GW mergers. These detections, as exemplified by this white paper, have had a major impact on the fields of astrophysics and fundamental physics [38].

Table 1. Source properties of the published BBH and BNS discoveries (June 2018) by the LIGO and Virgo detectors.

  GW150914 GW151226 LVT151012 GW170104 GW170608 GW170814 GW170817
$ \newcommand{\msun}{\,{\rm M_\odot}} m_1/\msun$ $36.2^{+5.2}_{-3.8}$ $14.2^{+8.3}_{-3.7}$ $23^{+18}_{-6}$ $31.2^{+8.4}_{-6.0}$ $12^{+7}_{-2}$ $30.5^{+5.7}_{-3.0}$ $(1.36,1.60)$
$ \newcommand{\msun}{\,{\rm M_\odot}} m_2/\msun$ $29.1^{+3.7}_{-4.4}$ $7.5^{+2.3}_{-2.3}$ $13^{+4}_{-5}$ $19.4^{+5.3}_{-5.9}$ $7^{+2}_{-2}$ $25.3^{+2.8}_{-4.2}$ $(1.16,1.36)$
$ \newcommand{\msun}{\,{\rm M_\odot}} {{\mathcal M}}/\msun$ $28.1^{+1.8}_{-1.5}$ $8.88^{+0.33}_{-0.28}$ $15.1^{+1.4}_{-1.1}$ $21.1^{+2.4}_{-2.7}$ $7.9^{+0.2}_{-0.2}$ $24.1^{+1.4}_{-1.1}$ $1.186^{+0.001}_{-0.001}$
q $0.81^{+0.17}_{-0.20}$ $0.52^{+0.40}_{-0.29}$ $0.57^{+0.38}_{-0.37}$ $0.62^{+}_{-}$ $0.6^{+0.3}_{-0.4}$ $0.83^{+}_{-}$ $(0.73,1)$
$ \newcommand{\msun}{\,{\rm M_\odot}} M_f/\msun$ $62.3^{+3.7}_{-3.1}$ $20.8^{+6.1}_{-1.7}$ $35^{+14}_{-4}$ $48.7^{+5.7}_{-4.6}$ $18.0^{+4.8}_{-0.9}$ $53.2^{+3.2}_{-2.5}$
$\chi_{eff}$ $-0.06^{+0.14}_{-0.29}$ $0.21^{+0.20}_{-0.10}$ $0.03^{+0.31}_{-0.20}$ $-0.12^{+0.21}_{-0.30}$ $0.07^{+0.23}_{-0.09}$ $0.06^{+0.12}_{-0.12}$ $0.00^{+0.02}_{-0.01}$
af $0.68^{+0.05}_{-0.06}$ $0.74^{+0.06}_{-0.06}$ $0.66^{+0.09}_{-0.10}$ $0.64^{+0.09}_{-0.20}$ $0.69^{+0.04}_{-0.05}$ $0.70^{+0.07}_{-0.05}$
DL / Mpc $420^{+150}_{-180}$ $440^{+180}_{-190}$ $1020^{+500}_{-490}$ $880^{+450}_{-390}$ $340^{+140}_{-140}$ $540^{+130}_{-210}$ 40
z $0.090^{+0.029}_{-0.036}$ $0.094^{+0.035}_{-0.039}$ $0.201^{+0.086}_{-0.091}$ $0.18^{+0.08}_{-0.07}$ $0.07^{+0.03}_{-0.03}$ $0.11^{+0.03}_{-0.04}$ 0.0099

The detection of GWs from only BBH mergers from all O1 detections has had significant ramifications on our understanding of astrophysical populations [3, 6, 9, 10]. The detected BHs were more massive than any BHs that had been previously detected in low mass x-ray binaries, requiring a re-evaluation of the models of stellar evolution in binary systems [5]. From just these three events, the LIGO Scientific and Virgo collaborations (LVC) constrained the rate of BBH mergers to between 9–240 Gpc−3 yr−1 [3] (see [11] for an updated BBH merger rate of 12–213 Gpc−3 yr−1). The non-detection of BNSs and NS-BH binaries allowed constraints of  <12 600 Gpc−3 yr−1 and  <3600 Gpc−3 yr−1 respectively [6]. At the time of this run, the LVC had over 60 MOUs signed with external telescopes, satellites and neutrino detectors. No EM counterparts were found relating to the BBH mergers [1214].

To detect and extract astrophysical information, GW astronomy uses the method of matched filtering [15]. This method is the optimal linear filter for signals buried in noise, and is very much dependent on the phase modelling of a GW template. Within the LVC, the GW templates are constructed using both analytical and numerical relativity [16]. In this case, the phase evolution of the template is a function of a number of frequency dependent coefficients. Alternative theories of gravity predict that these coefficients should be individually modified if general relativity (GR) is not the correct theory of gravity. While GR predicts specific values for these coefficients, one can treat each coefficient as a free variable and use Bayesian inference to test for deviations in the values of the parameters from the nominal GR value. All tests conducted by the LVC displayed no deviations from GR [3, 11, 17, 18]

In addition, searches for generic GW transients, or GW-bursts, typically do not require a well-known or accurate waveform model and are robust against uncertainties in the GW signature. GW-burst searches are designed to detect transients with durations between 10−3–10 s with minimal assumptions about the expected signal waveform. Such searches are, therefore, sensitive to GW transients from a wide range of progenitors, ranging from known sources such as BBH mergers to poorly-modeled signals such as core-collapse supernovae as well as transients that have yet to be discvered. An overview of GW-burst searches performed by LVC can be found here [19]. Both GW-burst and compact binary coalescences (CBC) searches detected the first GW signal from BBH mergers, GW150914.

In November 2016, the second Advanced Era Observation run, O2, began. Once again, in January and June 2017, two BBH mergers were observed by the two LIGO detectors [11, 18]. At the end of July 2017, the Advanced Virgo detector joined the global network of detectors. On August 14, all three detectors observed the merger of a BBH system. In previous detections, using only the two LIGO detectors, the sources were located to 1000s of square degrees in the sky. In this case, due to the addition of Advanced Virgo, this system was localised to within 60 square degrees. While not greatly advancing our understanding of the formation mechanisms of such systems, this detection did have a major effect in the field of fundamental physics. Due to the misalignment of the three detectors, for the first time we were able to test the tensorial nature of GWs. This event allowed the LVC to conclude that the GW signals were tensorial in nature, as is predicted by GR [18].

Burst searches were also used as an independent analysis to complement matched filtering analyses for the detection of GW170104 [11]. Burst searches further identified a coherent signal, corresponding to GW170608, with a false-alarm rate of 1 in  ∼30 years [20] and validated the detection of GW170814 with a false-alarm rate  <1 in 5900 years [18]. Note that, given the 'unmodelled' nature of burst searches, the estimated event significances from burst searches tend to be lower than matched-filtered searches for the same event, especially for lower-mass compact binary signals.

On August 17, the first BNS merger was observed by the LIGO and Virgo detectors [21]. This event was very quickly associated with a short gamma-ray burst (sGRB) detected by both the Fermi and Integral satellites [22]. Within 10 h, the host galaxy had been optically identified. Within 16 d, the source had been identified across all bands of the EM spectrum. This single event heralded the true beginning of multi-messenger astronomy, and raised as many questions as it answered.

While confirming the hypothetical link between BNS mergers and sGRBs, the delay between the gamma and x-ray signals (9 d) suggested that not all sGRBs are the same [23]. This fact generated a number of studies regarding equation of state models, and the possible remnant of such mergers. This one event also allowed the LVC to update the BNS event rate from  <12 600 Gpc−3 yr−1 in O1, to 320–4740 Gpc−3 yr−1 in O2 [21].

Perhaps, the most interesting results from this event concern fundamental physics. The delay between the detection of GWs and gamma-rays was 1.74 s. This places a bound on the difference between the speed of light and the speed of GWs of $3\times10^{-15}\leqslant|\Delta c/c|\leqslant 7\times 10^{-16}$ [23]. This single result has implications for certain alternative theories of gravity. For instance, the fact that GWs seem to travel at the same speed as that of strongly constrains the family of alternative theories of gravity that require $v_+, v_{\mathrm{\times}} \neq v_{{\rm light}}$ (e.g. beyond Horndeski, quartic/quintic Galileon, Gauss-Bonnet, if they are supposed to explain cosmology), as well as theories that predict a massive graviton. Furthermore, by investigating the Shapiro delay, the GW170817 detection also rules out MOND and DM emulator MOND-like theories (e.g. TeVeS), as according to these theories, the GWs would have arrived 1000 d after the gamma-ray detection.

The detection of GWs by the Advanced LIGO and Advanced Virgo detectors have had a major effect on our understanding of the Universe, sparking the fields of GW and multi-messenger astronomy and cosmology [22, 24]. It is becoming increasingly clear that combining EM and GW information will be the only way to better explain observed phenomena in our Universe. The third Advanced Detector Observation run (O3) will begin in early 2019, and will run for a year [14]. We expect the detected events to be dominated by BBH mergers at a rate of one per week. However, we also expect on the order of ten BNS events during this time, and possibly a NS-BH discovery (and potentially more than one such system). Given the effects of one GW detection on both astrophysics and fundamental physics, we expect O3 to fundamentally change our view of the Universe.

3. Black hole genesis and archaeology

Contributors: M Colpi and M Volonteri

3.1. Black hole genesis

Gravity around BHs is so extreme that gravitational energy is converted into EM and kinetic energy with high efficiency, when gas and/or stars skim the event horizon of astrophysical BHs. Black holes of stellar origin (sBHs) with masses close to those of known stars power galactic x-ray sources in binaries, while supermassive black holes (SMBHs) with masses up to billions of solar masses power luminous quasars and active nuclei at the centre of galaxies. BHs are key sources in EM in our cosmic landscape.

According to General Relativity (GR), Kerr BHs, described by their mass $M_{\rm BH}$ and spin vector ${\bf S}=\chi_{\rm spin} \, G \, M_{\rm BH}/c$ (with $-1\leqslant \chi_{\rm spin}\leqslant 1$ ) are the unique endstate of unhalted gravitational collapse. Thus understanding astrophysical BHs implies understanding the conditions under which gravitational equilibria lose their stability irreversibly. The chief and only example we know is the case of NSs which can exist up to a maximum mass $M_{\rm max}^{\rm NS}$ , around $ \newcommand{\msun}{\,{\rm M_\odot}} 2.2\msun$ $ \newcommand{\msun}{\,{\rm M_\odot}} 2.6\msun$ . No baryonic microphysical state emerges in nuclear matter, described by the standard model, capable to reverse the collapse to a BH state, during the contraction of the iron core of a supernova progenitor. The existence of $M_{\rm max}^{\rm NS}$ is due to the non linearity of gravity which is sourced not only by the mass 'charge' but also by pressure/energy density, according to the Oppenheimer-Volkoff equation. Thus, sBHs carry a mass exceeding $M_{\rm max}^{\rm NS}$ . Discovering sBHs lighter than this value (not known yet to high precision) would provide direct evidence of the existence of PBHs arising from phase transitions in the early universe.

As of today, we know formation scenarios in the mass range between $ \newcommand{\msun}{\,{\rm M_\odot}} 5\msun$ $ \newcommand{\msun}{\,{\rm M_\odot}} 40\msun$ , resulting from the core-collapse of very massive stars. The high masses of the sBHs revealed by the LVC, up to $ \newcommand{\msun}{\,{\rm M_\odot}} 36\msun$ , hint formation sites of low-metallicity, Z134, below $0.5\%$ of the solar value $Z_\odot=0.02$ [2527]. Theory extends this range up to about 40–$ \newcommand{\msun}{\,{\rm M_\odot}} 60\msun$ [28] and predicts the existence of a gap, between about $ \newcommand{\simless}{\lesssim} \newcommand{\msun}{\,{\rm M_\odot}} 60 \, \simless \, M_{\rm BH}/\msun \, \simless \, 150,$ since in this window pair instabilities during oxygen burning lead either to substantial mass losses or (in higher mass stellar progenitors) the complete disruption of the star [2931]. sBHs heavier than $ \newcommand{\msun}{\,{\rm M_\odot}} 150\msun$ can form at $Z \, < \, 1\% \, Z_\odot$ , if the initial mass function of stars extends further out, up to hundreds of solar masses.

The majestic discovery of BBHs, detected by LVC interferometers [1, 2, 5, 11, 18, 20], at the time of their coalescence further indicates, from an astrophysical standpoint, that in nature sBHs have the capability of pairing to form binary systems, contracted to such an extent that GW emission drives their slow inspiral and final merger, on observable cosmic timescales. As GWs carry exquisite information on the individual masses and spins of the BHs, and on the luminosity distance of the source, detecting a population of coalescing sBHs with LVC in their advanced configurations, and with the next-generation of ground-based detectors [32, 33], will let us reconstruct the mass spectrum and evolution of sBHs out to very large redshifts.

Observations teach us that astrophysical BHs interact with their environment, and that there are two ways to increase the mass: either through accretion, or through a merger, or both. These are the two fundamental processes that drive BH mass and spin evolution. Accreting gas or stars onto BHs carry angular momentum, either positive or negative, depending on the orientation of the disk angular momentum relative to the BH spin. As a consequence the spin changes in magnitude and direction [3436]. In a merger, the spin of the new BH is the sum of the individual and orbital angular momenta of the two BHs, prior to merging [37, 38]. An outstanding and unanswered question is can sequences of multiple accretion-coalescence events let sBHs grow, in some (rare) cases, up to the realm of SMBHs? If this were true, the 'only' collapse to a BH occurring in nature would be driven by the concept of instability of NSs at $M_{\rm max}^{\rm NS}$ .

SMBHs are observed as luminous quasars and active galactic nuclei, fed by accretion of gas [39], or as massive dark objects at the centre of quiescent galaxies which perturb the stellar and/or gas dynamics in the nuclear regions [40]. The SMBH mass spectrum currently observed extends from about $ \newcommand{\msun}{\,{\rm M_\odot}} 5\times 10^4\msun$ (the SMBH in the galaxy RGG118 [41]) up to about $ \newcommand{\msun}{\,{\rm M_\odot}} 1.2\times 10^{10}\msun$ (SDSS J0100+2802 [42]), as illustrated in figure 1. The bulk of active and quiescent SMBHs are nested at the centre of their host galaxies, where the potential well is the deepest. The correlation between the SMBH mass $M_\bullet$ and the stellar velocity dispersion $\sigma$ in nearby spheroids, and even in disk/dwarf galaxies [43] hints towards a concordant evolution which establishes in the centre-most region controlled by powerful AGN outflows. Extrapolated to lower mass disk or dwarf galaxies, this correlation predicts BH masses of $ \newcommand{\msun}{\,{\rm M_\odot}} M_\bullet\sim 10^3\msun$ at $\sigma$ as low as $ \newcommand{\kms}{\,{\rm km\,s^{-1}}} 10\kms$ , typical of nuclear star clusters (globular clusters) [44]. We remark that only BHs of mass in excess of $ \newcommand{\msun}{\,{\rm M_\odot}} 10^3\msun$ can grow a stellar cusp. The lighter BHs would random walk, and thus would have a gravitational sphere of influence smaller than the mean stellar separation and of the random walk mean pathlength.

Figure 1. Refer to the following caption and surrounding text.

Figure 1. Cartoon illustrating the BH mass spectrum encompassing the whole astrophysical relevant range, from sBHs to SMBHs, through the unexplored (light-green) zone where BH seeds are expected to form and grow. Vertical black-lines denote the two sBH masses in GW150914, the mass $M_\bullet$ of RGG118 (the lightest SMBH known as of today in the dwarf galaxy RG118), of SgrA* in the Milky Way, and of J0100+2802 (the heaviest SMBH ever recorded). The mass distribution of sBHs, drawn from the observations of the galactic sBH candidates, has been extended to account for the high-mass tail following the discovery of GW150914. The minimum (maximum) sBHs is set equal to $ \newcommand{\msun}{\,{\rm M_\odot}} 3\msun$ ($ \newcommand{\msun}{\,{\rm M_\odot}} 60\msun$ ), and the theoretically predicted pair-instability gap is depicted as a narrow darker-grey strip. The SMBH distribution has been drawn scaling their mass according to the local galaxy mass function and $M_\bullet$ -$\sigma$ correlation. The decline below  ∼$ \newcommand{\msun}{\,{\rm M_\odot}} 10^5\msun$ is set arbitrarily: BH of  ∼$ \newcommand{\msun}{\,{\rm M_\odot}} 10^{4-5}\msun$ may not be ubiquitous in low-mass galaxies as often a nuclear star cluster is in place in these galaxies, which may or may not host a central IMBH [54]. The black stars and dashed tracks illustrate the possibility that a SMBH at high redshift forms as sBH-only (born on the left side of the sBH gap) or as light seed (on the right of the gap) which then grows through phases of super-Eddington accretion [55]. The red circle and dotted track illustrates the possibility of a genetic divide between sBHs and SMBHs, and that a heavy seed forms through the direct collapse of a supermassive protostar in a metal free, atomic-hydrogen cooling, DM halo [48, 56]. The seed later grows via gas accretion and mergers with SMBHs in other black halos.

Standard image High-resolution image

Observations suggest that SMBHs have grown in mass through repeated episodes of gas accretion and (to a minor extent) through mergers with other BHs. This complex process initiates with the formation of a seed BH of yet unknown origin [45]. The concept of seed has emerged to explain the appearance of a large number of SMBHs of billion suns at $z\sim 6$ , shining when the universe was only 1 Gyr old [46]. Furthermore, the comparison between the local SMBH mass density, as inferred from the $M_\bullet-\sigma$ relation, with limits imposed by the cosmic x-ray background light, resulting from unresolved AGN powered by SMBHs in the mass interval between $ \newcommand{\msun}{\,{\rm M_\odot}} 10^{8-9}\msun$ , indicates that radiatively efficient accretion played a large part in the building of SMBHs below $z\sim 3,$ and that information is lost upon their initial mass spectrum [47]. Thus, SMBHs are believed to emerge from a population of seeds of yet unconstrained initial mass, in a mass range intermediate between those of sBHs and SMBHs, about $ \newcommand{\msun}{\,{\rm M_\odot}} 10^2\msun$ to $ \newcommand{\msun}{\,{\rm M_\odot}} 10^5\msun$ , and therefore they are sometimes dubbed Intermediate-mass BHs (IMBHs).

Seeds are IMBHs that form 'early' in cosmic history (at redshift $z\sim 20$ , when the universe was only 180 Myr old). They form in extreme environments, and grow over cosmic time by accretion and mergers. Different formation channels have been proposed for the seeds [45, 48, 49]. Light seeds refer to IMBHs of about $ \newcommand{\msun}{\,{\rm M_\odot}} 100\msun$ that form from the relativistic collapse of massive Pop III stars, but the concept extends to higher masses, up to  ∼$ \newcommand{\msun}{\,{\rm M_\odot}} 10^3\msun$ . These seeds likely arise from runaway collisions of massive stars in dense star clusters of low metallicity [50, 51], or from mergers of sBHs in star clusters subjected to gas-driven evolution [52]. The progenitors of light seeds are massive stars. However, they could be also the end result of repeated mergers among BHs [53]. Finding merging sBHs with the LVC detectors with masses in the pair instability gap would be a clear hint of a second generation of mergers resulting from close dynamical interactions.

Accretion on sBHs occurs in x-ray binaries, and there is no evidence of accretion from the interstellar medium onto isolated sBHs in the Milky Way. But, in gas-rich, dense environments characteristic of galaxy halos at high redshifts, single sBHs might accrete to grow sizably, despite their initial small gravitational sphere of influence, if specific dynamical conditions are met. For instance, in rare cases they may be captured in dense gas clouds within the galaxy [55]. Another possibility is that a sBH forms at the very center of the galaxy, where large inflows may temporarily deepen the potential well and allow it to grow significantly. This 'winning sBH' must be significantly more massive than all other sBHs in the vicinity to avoid being ejected by scatterings and to be retained at the center of the potential well by dynamical friction. Similar conditions can also be present in nuclear star clusters characterized by high escape velocities. After ejection of the bulk of the sBHs, the only (few) remaining isolated BH can grow by tidally disrupting stars and by gas accretion [57] sparking their growth to become an IMBH.

Heavy seeds refer instead to IMBHs of about $ \newcommand{\msun}{\,{\rm M_\odot}} 10^{4-5}\msun $ resulting from the monolithic collapse of massive gas clouds, forming in metal-free halos with virial temperatures $ \newcommand{\simgreat}{\gtrsim} T_{\rm vir} \, \simgreat \, 10^4$ K, which happen to be exposed to an intense H2 photodissociating ultraviolet flux [49, 56, 5860]. These gas clouds do not fragment and condense in a single massive proto-star which is constantly fueled by an influx of gas that lets the proto-star grow large and massive. Then, the star contracts sizably and may form a quasi-star [61], or it may encounter the GR instability that leads the whole star to collapse directly into a BH. Heavy seeds might also form in major gas-rich galaxy mergers over a wider range of redshifts, as mergers trigger massive nuclear inflows [62]. Figure 1 is a cartoon summarising the current knowledge of BHs in our Universe, and the link that may exist between sBHs and SMBHs, which is established by seed BHs along the course of cosmic evolution.

The seeds of the first SMBHs are still elusive to most instruments that exist today, preventing us to set constraints on their nature. Seed BHs are necessarily a transient population of objects and inferring their initial mass function and spin distribution from observations is possible only if they can be detected either through EM or GW observations at very high z, as high as  ∼20 (even $z\sim 40$ as discussed recently). Since, according to GR, BHs of any flavour captured in binaries are loud sources of GWs at the time of their merging, unveiling seeds and MBHs through cosmic ages via their GW emission at coalescence would provide unique and invaluable information on the BH genesis and evolution. The Gravitational Wave Universe is the universe we can sense using GWs as messengers [63, 64]. In this universe, BBHs are key sources carrying invaluable information on their masses, spins and luminosity distance that are encoded in the GW signal. There is one key condition that needs to be fulfilled: that the BHs we aim at detecting pair and form a binary with GW coalescence time smaller that the Hubble time, possibly close to the redshift of their formation. This condition, enabling the detection of seeds at very high redshifts, is extremely challenging to be fulfilled. BHs in binaries form at 'large' separation. Thus, nature has to provide additional dissipative processes leading to the contraction of the BBH down to the scale where GWs drive the inspiral. This requires a strong coupling of the two BHs with the environment, before and after forming a binary system. As we now discuss, understanding this coupling is a current challenge in contemporary astrophysics, cosmology and computational physics [65].

3.2. Black hole binaries: the difficulty of pairing

Due to the weakness of gravity, BBH inspirals driven by GW emission occur on a timescale:

Equation (1)

where $M_{\rm BBH}$ is the total mass of the BBH, a and e the semi-major axis and eccentricity respectively (${{\mathcal G}}(e)$ a weak function of e, and ${{\mathcal G}}(0)=1$ ) and $\nu=\mu/M_{\rm BBH}$ the symmetric mass ratio ($\nu=1/4$ for equal mass binaries), with $\mu$ the reduced mass of the binary. The values of a and e at the time of formation of the binary determine $ t_{\rm coal}$ , and this is the longest timescale. A (circular) binary hosting two equal-mass seed BHs of $ \newcommand{\msun}{\,{\rm M_\odot}} 10^3\msun$ ($ \newcommand{\msun}{\,{\rm M_\odot}} M_{\rm BBH}=10^5\msun$ ) would reach coalescence in 0.27 Gyrs, corresponding to the cosmic time at redshift $z\sim 15,$ if the two BHs are at an initial separation of $ a\sim \nu^{1/4} 4.84\times 10^4 R_{\rm G}$ ($ \nu^{1/4} \,1.5\times 10^4 R_{\rm G}$ ) corresponding to $a\sim \nu^{1/4}\,0.1$ AU, ($\nu^{1/4}\,30$ AU). For the case of two equal-mass MBHs of $ \newcommand{\msun}{\,{\rm M_\odot}} 10^6\msun$ coalescing at $z\sim 3$ (close to the peak of the star-formation rate and AGN rate of activity in the universe) $ a\sim \nu^{1/4} 4.84\times 10^6 R_{\rm G}$ corresponding to about one milli-parsec. These are tiny scales, and to reach these separations the binary needs to harden under a variety of dissipative processes. The quest for efficient mechanisms of binary shrinking, on AU-scales for sBHs and BH seeds, and sub-galactic scales for MBHs, make merger rate predictions extremely challenging, as Nature has to set rather fine-tuned conditions for a BBH to reach these critical separations. Only below these critical distances the binary contracts driven by GW emission. The merger occurs when the GW frequency (to leading order equal to twice the Keplerian frequency) reaches a maximum value,

Equation (2)

This frequency $f_{\rm GW}^{\rm max}$ scales with the inverse of the total mass of the binary, $M_{\rm BBH}$ as it is determined by the size of the horizon of the two BHs, at the time of coalescence.

Coalescing sBHs in binaries occur in galactic fields [25, 66], or/and in stellar systems such as globular clusters or/and nuclear star clusters [6771]. Thus, sBHs describe phenomena inside galaxies. Since there is a time delay between formation of the binary and its coalescence, dictated by the efficiency of the hardening processes, sBHs can merge in galaxies of all types, as in this lapse time that can be of the order of Gyrs, host galaxies undergo strong evolution. Instead, coalescing IMBHs, seeds of the SMBHs, formed in DM halos at high redshifts, and thus track a different environment. Forming in pristine gas clouds their pairing either requires in situ formation, e.g. from fissioning of rotating super-massive stars [72], or via halo-halo mergers on cosmological scales subjected to rapid evolution and embedded in a cosmic web that feed baryons through gaseous stream [73]. Coalescing MBHs refer exclusively to galaxy-galaxy mergers of different morphological types [65] occurring during the clustering of cosmic structures, which encompass a wide range of redshifts, from $z\sim 9$ to $z\sim 0$ passing through the era of cosmic reionization, and of cosmic high noon when the averaged star formation rate has its peak.

In the following sections we describe in detail the different channels proposed for the formation and pairing of BBHs at all scales. For each physical scenario we review the state of the art, challenges and unanswered questions and the most promising lines of research for the future. Ample space is devoted to stellar mass objects (BNS and BH-NS binaries and BBHs, with a particular focus on the latter), for which we discuss separately the three main formation channels: pairing of isolated binaries in the field, the various flavors of dynamical formation processes, relics from the early universe. We then move onto discuss the state of the art of our understanding of MBH binary pairing and evolution, the current theoretical and observational challenges, and the role of future surveys and pulsar timing arrays (PTAs) in unveiling the cosmic population of these elusive systems.

4. The formation of compact object mergers through classical binary stellar evolution

Contributors: K Belczynski, T Bulik, T M Tauris, G Nelemans

4.1. Stellar-origin black holes

The LIGO/Virgo detections of BBH mergers can be explained with stellar-origin BHs [27] or by primordial BHs that have formed from density fluctuations right after Big Bang [74], Stars of different ages and chemical compositions can form BHs and subsequently BBH mergers. In particular, the first metal-free (population III) stars could have produced BBH mergers in the early Universe ($z\approx10$ ), while the local ($z\approx 0$ –2) Universe is most likely dominated by mergers formed by subsequent generation of more metal-rich population II and I stars [75]. The majority of population I/II stars (hereafter: stars) are found in galactic fields (∼$99\%$ ) where they do not experience frequent or strong dynamical interactions with other stars. In contrast, some small fraction of stars (∼$1\%$ ) are found in dense structures like globular or nuclear clusters, in which stellar densities are high enough that stars interact dynamically with other stars. Here, we briefly summarize basic concepts of isolated (galactic fields) stellar and binary evolution that leads to the formation of BBH mergers.

4.1.1. Single star evolution.

Detailed evolutionary calculations with numerical stellar codes that include rotation (like BEC, MESA or the Geneva code, e.g. [7679]) allow us to calculate the evolution of massive stars. Note that these are not (detailed) hydrodynamic nor multi-dimensional calculations (as such computations are well beyond current computing capabilities), but they solve the basic equations of stellar structure/radiation, energy transport and element diffusion with corrections for effects of rotation. These calculations are burdened with uncertainties in treatment of various physical processes (nuclear reaction rates, convection and mixing, transport of angular momentum within a star, wind mass loss, pulsations and eruptions), yet progress is being made to improve on stellar modeling. Stellar models are used to predict the structure and physical properties of massive stars at the time of core-collapse, after nuclear fusion stops generating energy and (radiation) pressure that supports the star. This is also a point in which transition (at latest) to hydrodynamical calculations is being made to assess the fate of a collapsing star [8082].

For a star to form a BH, it is required that either the explosion engine is weak or delayed (so energy can leak from the center of the collapsing star) or that the infalling stellar layers are dense and massive enough to choke the explosion engine adopted in a given hydrodynamical simulation. In consequence, BHs form either in weak supernovae explosions (with some material that is initially ejected falling back and accreting onto the BH or without a supernova explosion at all (in a so-called direct collapse). Note that signatures of BH formation may already have been detected. For example, in SN 1987A there is no sign of a pulsar [83], although the pulsar may still appear when dust obscuration decreases or it simply beams in another direction. Further evidence is the disappearance with no sign of a supernova of a $ \newcommand{\msun}{\,{\rm M_\odot}} 25\msun$ supergiant star [84], although this can be a potentially long period pulsating Mira variable star that will re-emerge after a deep decline in luminosity.

Stellar evolution and core-collapse simulations favor the formation of BHs with masses $M_{\rm BH} \sim 5$ $ \newcommand{\msun}{\,{\rm M_\odot}} 50 \msun$ and possibly with very high masses $ \newcommand{\msun}{\,{\rm M_\odot}} M_{\rm BH} \gtrsim 135 \msun$ . The low-mass limit is set by the so-called 'first mass gap', coined after the scarcity of compact objects in mass range 2–$ \newcommand{\msun}{\,{\rm M_\odot}} 5\msun$ [85, 86]. However, this mass gap may be narrower than previously thought as potential objects that fill the gap are discovered [8789]. The second gap arises from the occurrence of pair-instability SNe (PISN) as discussed below.

The first mass gap may be explained either by an observational bias in determination of BH masses in galactic x-ray binaries [90] or in terms of a timescale of development of the supernova explosion engine: for short timescales (∼$100~{\rm ms}$ ) a mass gap is expected, while for longer timescales (∼$1~{\rm s}$ ) a mass gap does not appear and NSs and BHs should be present in the 2–$ \newcommand{\msun}{\,{\rm M_\odot}} 5\msun$ mass range [91]. The mass threshold between NSs and BHs is not yet established, but realistic equations-of-state indicate that this threshold lies somewhere in range 2.0–$ \newcommand{\msun}{\,{\rm M_\odot}} 2.6 \msun$ . The second limit at $ \newcommand{\msun}{\,{\rm M_\odot}} M_{\rm BH} \sim 50 \msun$ is caused by (pulsational) PISNe [92, 93]. Massive stars with He-cores in the mass range $ \newcommand{\msun}{\,{\rm M_\odot}} 45 \lesssim M_{\rm He} \lesssim 65 \msun$ are subject to pulsational PISNe before they undergo a core-collapse. These pulsations are predicted to remove the outer layers of a massive star (above the inner 40–$ \newcommand{\msun}{\,{\rm M_\odot}} 50\msun$ ) and therefore this process limits the BH mass to  ∼$ \newcommand{\msun}{\,{\rm M_\odot}} 50\msun$ . BHs within these two limits ($M_{\rm BH} \sim 5$ $ \newcommand{\msun}{\,{\rm M_\odot}} 50 \msun$ ) are the result of the evolution of stars with an initial mass $M_{\rm ZAMS} \approx 20$ $ \newcommand{\msun}{\,{\rm M_\odot}} 150\msun$ . For high-metallicity stars (typical of stars in the Milky Way disk, $ \newcommand{\zsun}{{\rm ~Z}_{\odot}} Z=\zsun=0.01$ –0.02; [94, 95]) BHs form up to  ∼$ \newcommand{\msun}{\,{\rm M_\odot}} 15\msun$ , for medium-metallicity stars ($ \newcommand{\zsun}{{\rm ~Z}_{\odot}} Z=10\% \zsun$ ) BHs form up to  ∼$ \newcommand{\msun}{\,{\rm M_\odot}} 30\msun$ , while for low-metallicity stars ($ \newcommand{\zsun}{{\rm ~Z}_{\odot}} Z=1\% \zsun$ ) BHs form up to  ∼$ \newcommand{\msun}{\,{\rm M_\odot}} 50\msun$ [96, 97].

The remaining question is whether stars can form BHs above  ∼$ \newcommand{\msun}{\,{\rm M_\odot}} 50\msun$ . Stars with He-cores in mass range: $ \newcommand{\msun}{\,{\rm M_\odot}} 65\lesssim M_{\rm He} \lesssim 135 \msun$ are subject to PISNe [92, 98, 99] that totally disrupts the star and therefore does not produce a BH. However, it is expected that stars with He cores as massive as $ \newcommand{\msun}{\,{\rm M_\odot}} M_{\rm He} \gtrsim 135 \msun$ , although subject to pair instability, are too massive to be disrupted and they could possibly form massive BHs ($ \newcommand{\msun}{\,{\rm M_\odot}} M_{\rm BH} \gtrsim 135 \msun$ ). If these massive BHs exist, then second mass gap will emerge with no BHs in the mass range $M_{\rm BH} \simeq 50$ $ \newcommand{\msun}{\,{\rm M_\odot}} 135 \msun$ [26, 100, 101, 102]. If these massive BHs exist, and if they find their way into merging BBH binaries then GW observatories will eventually find them [102, 103]. The existence of very massive BHs will constrain the extend of the stellar initial mass function (IMF) and wind mass-loss rates for the most massive stars ($ \newcommand{\msun}{\,{\rm M_\odot}} M_{\rm ZAMS}>300\msun$ ) that can produce these BHs. So far, there are no physical limitations for the existence of such massive stars [104, 105]. Note that the most massive stars known today are found in the LMC with current masses of  ∼$ \newcommand{\msun}{\,{\rm M_\odot}} 200\msun$ [106].

BH formation may be accompanied by a natal kick. Natal kicks are observed for galactic radio pulsars, that move significantly faster (with average 3-dimensional speeds of  ∼$ \newcommand{\kms}{\,{\rm km\,s^{-1}}} 400\kms$ , e.g. [107]) than their typical progenitor star (10–$ \newcommand{\kms}{\,{\rm km\,s^{-1}}} 20\kms$ ). These high velocities are argued to be associated with some supernova asymmetry: either asymmetric mass ejection [108110] or asymmetric neutrino emission [111113]. Note that neutrino kick models all require (possibly unrealistic) strong magnetic fields, and simulations of core collapse without magnetic fields are unable to produce significant neutrino kicks. Naturally, in these simulations the authors find the need for asymmetric mass ejection to explain natal kicks (e.g. [109]). Although BH natal kicks as high as observed for NSs cannot yet be observationally excluded, it is unlikely for BHs to receive such large natal kicks [114, 115]. It appears that some of the BHs may form without a natal kick [116, 117], while some may form with a kick of the order of  ∼$ \newcommand{\kms}{\,{\rm km\,s^{-1}}} 100 \kms$ [115].

The BH natal spin may simply depend on the angular momentum content of the progenitor star at the time of core collapse. Massive stars are known to rotate; with the majority of massive stars spinning at moderate surface velocities (about 90% at  ∼$ \newcommand{\kms}{\,{\rm km\,s^{-1}}} 100 \kms$ ) and with some stars spinning rather rapidly (10% at  ∼$ \newcommand{\kms}{\,{\rm km\,s^{-1}}} 400 \kms$ ). During its evolution, a star may transport angular momentum from its interior to its atmosphere. Then angular momentum is lost from the star when the outer star layers are removed. The envelope removal in massive stars that are progenitors of BBH mergers is easily accomplished either by stellar winds or by mass transfer to a close companion star. However, the efficiency of angular momentum transport is unknown. Two competitive models are currently considered in the literature: very effective angular momentum transport by a magnetic dynamo [118, 119] included in the MESA stellar evolution code that leads to solid body rotation of the entire star, and mild angular momentum transport through meridional currents [78, 120] included in the Geneva code that leads to differential rotation of the star. Asteroseismology that probes the internal rotation of stars has not yet provided any data on massive stars (i.e. progenitors of BHs). The available measurements for intermediate-mass stars (B type main-sequence stars) show that some stars are well described by solid body rotation and some by differential rotation [121]. Depending on an the adopted model, the angular momentum content of a star at core-collapse could be very different. During BH formation some angular momentum may be lost affecting the natal BH spin if material is ejected in a supernova explosion. Whether BH formation is accompanied by mass loss is not at all clear and estimates that use different assumptions on mass ejection in the core-collapse process are underway [122, 123]. At the moment, from the modeling perspective, the BH natal spin is mostly unconstrained.

4.1.2. Binary star evolution.

The majority ($\gtrsim$ 70%) of massive O/B stars, the potential progenitors of NSs and BHs, are found in close binary systems [124]. The evolution of massive stars in binaries deviates significantly from that of single stars [125128]. The main uncertainties affecting the calculation of BH merger rates are the metallicity, the common-envelope phase and the natal kick a BH receives at birth. These factors also determine the two main BH properties: mass and spin.

Two main scenarios were proposed for BBH merger formation from stars that evolve in galactic fields: classical isolated binary evolution similar to that developed for double neutron stars (e.g. [27, 129135]) and chemically homogeneous evolution (e.g. [93, 102, 136139]). Classical binary evolution starts with two massive stars in a wide orbit ($a \gtrsim 50$ $ \newcommand{\rsun}{{\rm ~R}_{\odot}} 1000 \rsun$ ), and then binary components interact with each other through mass transfers decreasing the orbit below  ∼$ \newcommand{\rsun}{{\rm ~R}_{\odot}} 50\rsun$ in common envelope (CE) evolution [140, 141]. Depending on their mass, both stars collapse to BHs, either with or without supernova explosion, forming a compact BBH binary. The orbital separation of two BHs which merge within a Hubble time is below  ∼$ \newcommand{\rsun}{{\rm ~R}_{\odot}} 50\rsun$ (for a circular orbit and two $ \newcommand{\msun}{\,{\rm M_\odot}} 30\msun$ BHs [142]). [143] highlight that for the massive stars that are expected to form BHs, the mass ratio in the second mass-transfer phase is much less extreme, which means a CE phase may be avoided.

In the chemically homogeneous evolution scenario, two massive stars in a low-metallicity environment form in a very close binary ($\lesssim $ 50$ \newcommand{\rsun}{{\rm ~R}_{\odot}} \rsun$ ) and interact strongly through tides [144, 145]. Tidal interactions lock the stars in rapid rotation and allow for the very effective mixing of elements in their stellar interior that inhibits radial expansion of the stars. Hence, these stars remain compact throughout their evolution and collapse to BHs without experiencing a CE phase [102]. This evolutionary scheme may well explain the most massive LIGO/Virgo BBH mergers, as the enhanced tidal mixing required in this channel only works for most massive stars ($\gtrsim$ 30$ \newcommand{\msun}{\,{\rm M_\odot}} \msun$ ). It also predicts that both binary components evolve while rotating fairly rapidly and this may produce rapidly spinning BHs, unless angular momentum is lost very efficiently in the last phases of stellar evolution or during BH formation.

4.1.3. Reconciling observations and theory.

There seems to be some confusion in the community as to what was expected and predicted by stellar/binary evolution models prior to LIGO/Virgo detections of the first sources. In particular, it is striking that often it is claimed that LIGO/Virgo detections of BBH mergers with very massive BHs were surprising or unexpected. Before 2010, in fact most models were indicating that BNS are dominant GW sources for ground-based detectors (however, see also [132], predicting LIGO detection rates strongly dominated by BBH binaries), and that stellar-origin BHs are formed with small masses of  ∼$ \newcommand{\msun}{\,{\rm M_\odot}} 10 \msun$ [146]. The models before 2010 were limited to calculations for stars with high metallicity (typical of the current Milky Way population) and this has introduced a dramatic bias in predictions. However, already around 2010 it was shown that stars at low metallicities can produce much more massive (30–$ \newcommand{\msun}{\,{\rm M_\odot}} 80 \msun$ ) BHs than observed in the Milky Way [96, 147, 148]. Additionally, it was demonstrated that binaries at low metallicities are much more likely to produce BBH mergers than high metallicity stars by one or two ordes of magnitude [25]. This led directly to the pre-detection predictions that (i) the first LIGO/Virgo detection was expected when the detector (BNS) sensitivity range reached about 50–100 Mpc (the first detection was made at 70 Mpc), that (ii) BBH mergers will be the first detected sources, and that (iii) the BBH merger chirp-mass distribution may reach $ \newcommand{\msun}{\,{\rm M_\odot}} 30\msun$ [25, 66, 149, 150]. Additionally studies of the future evolution of x-ray binaries like IC10 X-1 and NGC300 X-1 [151] suggested that there exists a large population of merging BH binaries with masses in excess of $ \newcommand{\msun}{\,{\rm M_\odot}} 20 \msun$ .

Post-detection binary evolution studies expanded on earlier work to show agreement of calculated BBH merger rates and BBH masses with LIGO/Virgo observations [27, 97, 134, 135]. The range of calculated merger rates (10–$ \newcommand{\gpy}{{\rm ~Gpc}^{-3} {\rm ~yr}^{-1}} 300\gpy$ ) comfortably coincides with the observed rate estimates (12–$ \newcommand{\gpy}{{\rm ~Gpc}^{-3} {\rm ~yr}^{-1}} 213 \gpy$ for the LIGO/Virgo $90\%$ credible interval). Note that these classical binary evolution rates are typically much higher than rates predicted for dynamical BBH formation channels (5–$ \newcommand{\gpy}{{\rm ~Gpc}^{-3} {\rm ~yr}^{-1}} 10 \gpy$ , [152, 153]). The most likely detection mass range that is predicted from classical isolated binary evolution is found in the total BBH merger mass range of 20–$ \newcommand{\msun}{\,{\rm M_\odot}} 100 \msun$ (e.g. [97]). Examples of merger rate and mass predictions for BBH mergers are given in figures 2 and 3. A similar match between observed LIGO/Virgo BH masses and model predictions is obtained from the dynamical formation channel [152, 153]. Note that this makes these two channels indistinguishable at the moment, although the merger rates are likely to be much smaller for the dynamical channel.

Figure 2. Refer to the following caption and surrounding text.

Figure 2. Left: Redshifted total merger mass distribution for two population synthesis models [122]: M10 (low BH natal kicks) and M23 (high BH natal kicks). The O2 LIGO sensitivity is marked; the most likely detections are expected when models are closest to the sensitivity curve. We also mark LIGO/Virgo BBH merger detections (vertical positions have no meaning), all of which fall within the most likely detection region between 20–$ \newcommand{\msun}{\,{\rm M_\odot}} 100\msun$ . Right: Source frame BBH merger-rate density of several population synthesis models for the local Universe (z  =  0). The current LIGO O1/O2 BBH merger rate is 12–$ \newcommand{\gpy}{{\rm ~Gpc}^{-3} {\rm ~yr}^{-1}} 213\gpy$ (blue double-headed arrow). Note that the models with fallback-attenuated BH natal kicks (M10, M20) are at the LIGO upper limit, while models with high BH natal kicks are at the LIGO lower limit (M13, M23). Models with small (M26) and intermediate (M25) BH kicks fall near the middle of the LIGO estimate.

Standard image High-resolution image
Figure 3. Refer to the following caption and surrounding text.

Figure 3. Distribution of simulated double compact object binaries in the total mass—chirp mass plane for a metallicity of Z  =  0.0002. Three islands of data are visible, corresponding to BBH, mixed BH-NS and BNS systems. The colour code indicates the merger rate per pixel for a Milky Way equivalent galaxy. The three solid grey lines indicate a constant mass ratio of 1, 3 and 10 (from top to bottom). Observed LIGO/Virgo sources are shown with black crosses and event names are given for the four most massive cases. The lowest mass BBH mergers can only be reproduced with a higher metallicity. Reproduced with permission from [97] by Oxford University Press on behalf of the Royal Astronomical Society.

Standard image High-resolution image

A caveat of concern for the prospects of LIGO/Virgo detecting BBH mergers with masses above the PISN gap is related to the relatively low GW frequencies of the such massive BBH binaries with chirp masses above $ \newcommand{\msun}{\,{\rm M_\odot}} 100\;\msun$ . During the in-spiral, the emitted frequencies are expected to peak approximately at the innermost stable circular orbit (ISCO), before the plunge-in phase and the actual merging. Hence, the emitted frequencies are most likely less than 100 Hz, and with redshift corrections the frequencies to be detected are easily lower by a factor of two or more. A frequency this low is close to the (seismic noise) edge of the detection window of LIGO/Virgo and may not be resolved.

The current LIGO/Virgo broad range of an empirically determined local BBH merger-rate density (12–$ \newcommand{\gpy}{{\rm ~Gpc}^{-3} {\rm ~yr}^{-1}} 213 \gpy$ ) can easily be explained by uncertainties in key input physics parameters of population synthesis modelling, such as the slope of the IMF or the efficiency of the CE ejection, see e.g. table 5 in [97]. Alternatively, it may be explained by altering BH natal kicks [27] from full NS natal kicks corresponding to a low rate estimate) to almost no BH kicks (high rate estimate). Once LIGO/Virgo narrows its empirical estimate it may be possible to use the merge-rate density to constrain the input physics applied in modelling, although it should be cautioned that there is a large degree of degeneracy [97, 132].

LIGO/Virgo provides an estimate of the effective spin parameter that measures the projected BH spin components ($a_1, a_2$ ) parallel to binary angular momentum, weighted by BH masses ($M_1, M_2$ ):

Equation (3)

where $\Theta_{1,2}$ are the angles between the BH spins and the orbital angular momentum vector. So far, for the six LIGO/Virgo BBH detections the effective spins cluster around $|\chi_{\rm ef\,\!f}| < 0.35$ (e.g. [11]). This defies the expectations for the main BBH formation channels. If spin magnitudes of stellar-origin BHs are significant, as estimated for several galactic and extra-galactic x-ray binaries [154], then the dynamical formation channel (random BH capture) predicts an isotropic $\chi_{\rm ef\,\!f}$ distribution, while the classical binary evolution channel mostly predicts aligned BH spins (aligned stellar spins that are only moderately misaligned by BH natal kicks). Hence, in the latter case one expects a distribution peaked at high values of $\chi_{\rm ef\,\!f}$ . On the one hand this tension is rather unfortunate, as it does not allow to distinguish between these two very different scenarios of BBH merger formation. On the other hand, this is a great opportunity to learn something new about stars and BHs that was so far not expected and is not easily understood in the framework of current knowledge.

There are five potential explanations of this apparent tension. First, there could be a mechanism that puts both BH spins in the plane of the binary orbit, producing $\chi_{\rm ef\,\!f}=0$ independent of BH masses and spin magnitudes. Such a mechanism is proposed to operate in triple stars [155]. Note that triple stars are a minority of stars (10–$20\%$ of all field stars) and that the proposed mechanism requires very specific tuning to operate, so it is not clear how likely it is that it worked for all LIGO/Virgo sources. Second, there could be a mechanism that forces the BH spins to be in opposite directions so that they cancel out. For approximately equal mass BBH binaries (typical of LIGO/Virgo sources) this would imply 180 degree flip of spins. No mechanism is known to produce such configurations. Third, both BH spin magnitudes may be very small reducing effective spin parameter to $\chi_{\rm ef\,\!f}=0$ , independent of other merger parameters. This was already proposed and is used in studies of angular momentum transport in stars [122]. Fourth, LIGO/Virgo BHs may not have been produced by stars, but for example they come from a primordial population for which small spins are naturally expected [74, 156]. Fifth, it may be the case that the spin of a BH (at least its direction) is not mainly determined by the angular momentum of the progenitor star, but a result of the physics of the collapse (spin tossing, e.g. [157]). In that case, there is no reason to assume the spins in mergers formed from isolated binary evolution are aligned and they may be isotropic. Note that with these five options, that need to be tested and developed further, one cannot determine the main formation channel of BBH mergers, as it was proposed in several recent studies [145, 158160].

The issue of spins is rather fundamental as the effective spin parameter most likely contains information on natal BH spin magnitudes and therefore information on stellar astrophysics regarding angular momentum transport in massive stars, which is still unconstrained by electromagnetic observations. Possibly the second-formed BH spin could have been increased in binary evolution by accretion of mass from a companion star. However, it is argued that BHs cannot accrete a significant amount of mass in the binary evolution leading to the BBH formation [27, 122, 161]. This is partly due to very low accretion rates during a CE of 1–$10\%$ of the Bondi-Hoyle accretion rate [162165]) and fast timescale Roche-lobe overflow (RLO) in massive progenitor binaries that leads to ejection of most of the exchanged matter from the binary system (due to super-Eddington mass-transfer rates). The amount of mass accreted by BHs in binary systems ($\lesssim $ 1–$ \newcommand{\msun}{\,{\rm M_\odot}} 3 \msun$ ) cannot significantly spin up massive BHs (10–$ \newcommand{\msun}{\,{\rm M_\odot}} 30\msun$ ) that are detected by LIGO/Virgo.

It is important to note the most challenging parts of the evolutionary predictions in the context of BBH formation. In the classical binary evolutionary channel, the two most uncertain aspects of input physics are related to the CE evolution and the natal BH kicks. Although some observational constraints on both processes exist, they are rather weak. Systems entering CE evolution have recently been reported. However, they are not as massive as stars that could produce NSs or BHs [166]. The search for CE traces as IR outburts has so far yielded no clear detection of emerging or receding x-ray binaries as expected in this scenario [167].

BH natal kicks are only measured indirectly by positions and motions of x-ray binaries hosting BHs, and usually only lower limits on natal kicks are derived [114116, 168]. On theoretical grounds, reliable models for CE [141] and supernovae [169] are missing. In the chemically homogeneous evolution channel the largest uncertainty is connected with the efficiency of the mixing, the number of massive binaries that can form in very close orbits, and the strength of tidal interactions in close binaries. Since initial orbital period distributions are measured only in the very local Universe [124], it is not clear whether they apply to the majority of all stars and thus it is not fully understood how many stars are potentially subject to this kind of evolution. Even a deeper problem exists with our understanding of tides and their effectiveness in close binaries [144, 145], and effective tides are the main component of input physics in chemically homogeneous evolution.

Astrophysical inferences from GW observations are currently limited. First, it is not known which formation channel (or what mixture of them) produces the known LIGO/Virgo BBH mergers. Since each channel is connected to specific set of conclusions (for example, the isolated binary channel informs about CE evolution and natal kicks; while the dynamical channel informs predominantly about stellar interactions in dense clusters) it is not clear which physics we are testing with GW observations. Second, within each channel there is degeneracy such that multiple model parameters are only weakly constrained by observations. As nobody so far was able to deliver a comprehensive study of the large multi-dimensional parameter space, the inferences on various model parameters (e.g. the strength of BH natal kicks or the CE efficiency) are hindered by various and untested model parameter degeneracies. However, it is already possible to test several aspects of stellar evolution as some processes leave unambiguous signatures in GW observations. For example, the existence of the first and the second mass gap, if confirmed by LIGO/Virgo, will constrain core-collapse supernovae and PISNe, respectively. Careful studies with detailed exposure of caveats are needed to transform future observations into astrophysical inferences.

It is expected that GW events resulting from the merger of stellar-mass BHs are unlikely to produce electromagnetic counterparts. Nevertheless, a (marginal) transient signal detected by the Fermi gamma-ray burst monitor, 0.4 s after GW150914, was reported [170]. This claim encouraged several theoretical speculations for a possible origin. It has been suggested [171] that a tiny fraction of a solar mass of baryonic material could be retained in a circumbinary disk around the progenitor binary which probably shed a total mass of  >$ \newcommand{\msun}{\,{\rm M_\odot}} 10\msun$ during its prior evolution. The sudden mass loss and recoil of the merged BH may then shock and heat this material, causing a transient electromagnetic signal. It will be interesting see if any further electromagnetic signals will be associated with BBH mergers in the near future.

4.2. BNS mergers

The formation of double NSs has been studied in detail following the discovery of the Hulse-Taylor pulsar [172] and currently more than 20 such BNS systems are known—see [157] for a details and a review on their formation and evolution. Whereas no BBH binaries had been detected prior to GW150914, detailed knowledge on BNS systems was known for many years from galactic radio pulsar observations [173].

LIGO/Virgo has currently only detected one BNS merger event (GW170817), located in the lenticular (S0) galaxy NGC 4993, and thus the local empirical BNS merger rate density still remains rather uncertain: $ \newcommand{\gpy}{{\rm ~Gpc}^{-3} {\rm ~yr}^{-1}} 1540^{+3200}_{-1220} \gpy$ ($90\%$ credible limits [21]). The study of double NSs is relevant for the study of BHs because it gives independent constraints on the evolution of similar massive binary populations from which binary BHs are formed. In particular the question which stars for NSs and which form BHs and if and how this depends on previous binary interactions is a question that likely only can be answered observationally by significant statistics on the relative abundance of double BHs, BNS and NS-BH binaries.

There are two major sites to produce BNS mergers: isolated binaries in galactic fields (the main contributor), and dense environments in globular and nuclear clusters. None of these sites (nor any combination of them) can easily reproduce the preliminary estimated LIGO/Virgo event rate, even if all elliptical host galaxies are included within the current LIGO/Virgo horizon [174]. The local supernova rate can be estimated to be about $ \newcommand{\gpy}{{\rm ~Gpc}^{-3} {\rm ~yr}^{-1}} 10^5 \gpy$ , so the current empirical BNS merger rate from LIGO/Virgo would imply a very high efficiency of BNS binary formation.

This apparent tension may be solved if BNS mergers are allowed to originate from a wide range of host galaxies and if the low-end of the LIGO/Virgo merger-rate estimate is used ($ \newcommand{\gpy}{{\rm ~Gpc}^{-3} {\rm ~yr}^{-1}} 320 \gpy$ ). Population synthesis studies seem to agree that rates as high as 200–$ \newcommand{\gpy}{{\rm ~Gpc}^{-3} {\rm ~yr}^{-1}} 600 \gpy$ can possibly be reached if favorable conditions are assumed for classical binary evolution [97, 175, 176]. In the coming years, the statistics of the empirical BNS merger rate will improve significantly and reveal whether current theoretical BNS merger rates need a revision. It is interesting to notice, however, that calibrations with the rates of observed short gamma-ray bursts and the rate of mergers required to reproduce the abundances of heavy r-process elements favor a merger-rate density significantly smaller than the current empirical rate announced by LIGO/Virgo [97].

The main uncertainties of the theoretically predicted merger rate of BNS binaries are also related to CE evolution and SNe (similar to the case of BBH mergers). A CE evolution is needed to efficiently remove orbital angular momentum to tighten the binary orbit and allow a merger event within a Hubble time. However, the onset criterion and the efficiency of the in-spiral in a CE remain uncertain [141, 177]. The kick velocities imparted on newborn NSs span a wide range from a few ${\rm km~s}^{-1}$ (almost symmetric SNe) to more than 1000 ${\rm km~s}^{-1}$ and are sometimes difficult to determine [178]. The kick magnitude seems to be related to the mass of the collapsing core, its density structure and the amount of surrounding envelope material [110]. Additional important factors for the predicted merger rates include the slope of the initial-mass function and the efficiency of mass accretion during RLO [97].

All taken together, the predicted merger rate of double NSs in a Milky Way equivalent galaxy vary by more than two orders of magnitude [146]. The empirical merger rate that LIGO/Virgo will detect at design sensitivity in a few years is of uttermost importance for constraining the input physics behind the rate predictions. Besides the detection rates, the mass spectrum and the spin rates of the double NS mergers will reveal important information about their origin. Although their precise values cannot be determined due to degeneracy, the overall distribution of estimated NS masses will reveal information on their formation process (electron capture versus iron-core collapse SNe), as well as constraining the nuclear-matter equation-of-state. The latter will also be constrained from tidal deformations of the NSs in their last few orbits [21].

An important observational signature of the merger event of BNS binaries is the detection of the ring-down signal of either a meta-stable highly massive NS or a BH remnant. Such information would set constraints on the fundamental equation-of-state of nuclear matter at high densities. Whereas LIGO/Virgo is not sensitive enough to detect a ring-down signal, it is the hope that third-generation GW detectors might be able to do so. Another important observational input is the distribution of mass ratios in BNS merger events. This distribution could provide important information about the formation of NSs and the nature of the supernovae (e.g. electron-capture versus iron core-collapse supernovae).

Optical follow-up will in many cases reveal the location of a double NS merger (e.g. [22]). This will provide information on their formation environments [179, 180] and kinematics [181], besides crucial information on heavy r-process nucleosynthesis [182].

4.3. Mixed BH-NS mergers

The formation of mixed BH/NS mergers is expected to follow similar scenarios as double NS or double BH [129, 132] with all the associated uncertainties.

It is perhaps somewhat surprising that LIGO/Virgo detected a double NS merger (GW170817) before a mixed BH/NS merger, since (at least some) population synthesis codes predict a detection rate of mixed BH/NS systems which is an order of magnitude larger that the expected detection rate of double NS systems (with large uncertainties, e.g. [97]). Hence, if these predictions are correct, GW170817 is a statistical rare event and detections of mixed BH/NS systems are expected already in the upcoming O3/O4 LIGO runs. The detection of mixed BH-NS mergers is interesting for two reasons: (i) a key question is whether BH-NS and NS-BH binaries may be distinguished from one another (i.e. the formation order of the two compact objects, which leads to a (mildly) recycled pulsar in the latter case), and (ii) the detected in-spiral of mixed BH-NS mergers may reveal interesting deviations from GR and pure quadrupole radiation given the difference in compactness between BHs and NSs [183].

5. Dynamical formation of Stellar-mass binary black holes

Contributors: B Kocsis and A Askar

5.1. Introduction

The recent GW observations from six BBH mergers (GW150914, LVT151012, GW151226, GW170104, GW170608, GW170814) and a BNS merger (GW170817) opened ways to test the astrophysical theories explaining the origin of these sources [1, 2, 11, 18, 20, 21]. As discussed earlier, the component masses of these merging sources span a range between 8–$ \newcommand{\e}{{\rm e}} \newcommand{\Msun}{ \mathrm{M}_{\odot}} 35\Msun$ [11], which is different from the distribution of BHs seen in x-ray binaries, 5–$ \newcommand{\e}{{\rm e}} \newcommand{\Msun}{ \mathrm{M}_{\odot}} 17\Msun$ [184] with two possible exceptions (NGC300X-1 and IC10X-1). The event rates of BBH mergers are estimated to be between 40–$ \newcommand{\e}{{\rm e}} \newcommand{\Gpc}{{\rm Gpc}} 213~\Gpc^{-3}~{\rm yr}^{-1}$ for a power-law BH mass function and between 12–$ \newcommand{\e}{{\rm e}} \newcommand{\Gpc}{{\rm Gpc}} 65~\Gpc^{-3}~{\rm yr}^{-1}$ for a uniform-in-log BH mass function [11], which is higher than previous theoretical expectations of dynamically formed mergers, for instance see [146]. The event rates of BNS mergers is currently based on a single measurement which suggests a very high value of $ \newcommand{\e}{{\rm e}} \newcommand{\Gpc}{{\rm Gpc}} 1540^{+3200}_{-1220}\,\Gpc^{-3}~{\rm yr}^{-1}$ [21] (c.f. [174]). How do we explain the observed event rates and the distribution of masses, mass ratios, and spins?

Several astrophysical merger channels have been proposed to explain observations. Here we review some of the recent findings related to dynamics, their limitations and directions for future development. These ideas represent alternatives to the classical binary evolution picture, in which the stars undergo poorly understood processes, such as common envelope evolution. In all of these models the separation between the compact objects is reduced dynamically to less than an AU, so that GWs may drive the objects to merge within a Hubble time, $t_{\rm Hubble}=10^{10}\,~{\rm yr}$ .

5.2. Merger rate estimates in dynamical channels

5.2.1. Dynamical formation and mergers in globular clusters.

Although about 0.25% of the stellar mass is currently locked in globular clusters (GCs) [185187], dynamical encounters greatly catalyze the probability of mergers compared to that in the field. Within the first few million years of GC evolution, BHs become the most massive objects. Due to dynamical friction, they will efficiently segregate to the cluster center [188] where they can dynamically interact and form binaries with other BHs [189, 190]. The dense environments of GCs can also lead to binary-single and binary-binary encounters involving BHs that could result in their merger. Collisional systems like GCs can also undergo core collapse, during which central densities can become very large leading to many strong dynamical interactions. The encounter rate density is proportional to $\mathcal{R}\sim \int {\rm d}V \,\langle n_{*}^2 \rangle \, \sigma_{\rm cs} v$ , where n* is the stellar number density, $\sigma_{\rm cs}\sim GMb/v^2$ is the capture cross section, M is the total mass, b is the impact parameter, $v$ is the typical velocity dispersion. Note the scaling with $\langle n_*^2\rangle$ , where $ \newcommand{\e}{{\rm e}} \newcommand{\pc}{{\rm pc}} \langle n_{*}^2\rangle^{1/2} \sim 10^5\pc^{-3}$ in GCs and  ∼$ \newcommand{\e}{{\rm e}} \newcommand{\pc}{{\rm pc}} 1\pc^{-3}$ in the field.

Estimates using Monte Carlo method to simulate realistic GCs yield merger rates of at least $ \newcommand{\e}{{\rm e}} \newcommand{\Gpc}{{\rm Gpc}} \mathcal{R}_{\rm GC}\sim 5 ~\Gpc^{-3}~{\rm yr}^{-1}$ [152, 153], falling below the current limits on the observed rates. Rate estimates from results of direct N–body simulations also yield a similar value of $ \newcommand{\e}{{\rm e}} \newcommand{\Gpc}{{\rm Gpc}} \mathcal{R}_{\rm GC}\sim 6.5 ~ \Gpc^{-3}~{\rm yr}^{-1}$ [191]. In particular, these papers have shown that the low-mass GCs below $10^5\mathrm{M}_{\odot}$ have a negligible contribution to the rates. However, they also show that initially more massive GCs (more massive than $10^6\mathrm{M}_{\odot}$ ) contribute significantly to the rates. [153] argue that actual merger rates from BHs originating in GCs could be 3–5 times larger than their estimated value of  ∼$ \newcommand{\e}{{\rm e}} \newcommand{\Gpc}{{\rm Gpc}} 5 ~\Gpc^{-3}~{\rm yr}^{-1}$ due to uncertainties in initial GC mass function, initial mass function of stars in GCs, maximum initial stellar mass and evolution of BH progenitors. Furthermore, BBH merger rates can be significant in young clusters with masses  ∼$ \newcommand{\e}{{\rm e}} \newcommand{\Msun}{ \mathrm{M}_{\odot}} 10^4\,\Msun$ [50, 192].

A simple robust upper limit may be derived by assuming that all BHs merge once in each GC in a Hubble time:

Equation (4)

where $f_{\rm BH}$ is the fraction of stars that turn into BHs from a given stellar initial mass function $f_{\rm IMF}$ , N* is the initial number of stars in a GC, and $n_{\rm GC}$ is the cosmic number density of GCs, and $ \newcommand{\e}{{\rm e}} \newcommand{\Msun}{ \mathrm{M}_{\odot}} f_{\rm IMF}(m) \propto (m/0.5\Msun){}^{-2.3}$ for $ \newcommand{\e}{{\rm e}} \newcommand{\Msun}{ \mathrm{M}_{\odot}} m>0.5\Msun$ and $ \newcommand{\e}{{\rm e}} \newcommand{\Msun}{ \mathrm{M}_{\odot}} (m/0.5\Msun){}^{-1.3}$ otherwise [193]. The result is not sensitive to the assumed upper bound on mass of the BH progenitor, which is set by the pair instability supernova. Recent estimates find 110–$ \newcommand{\e}{{\rm e}} \newcommand{\Msun}{ \mathrm{M}_{\odot}} 130\,\Msun$ for GC metallicities [31]. However, note that the mass in GCs may have been much higher than currently by a factor  ∼5, since many GCs evaporated or got tidally disrupted [194, 195]. This effect increases the rates by at most a factor two at z  <  0.3, but more than a factor 10 at z  >  2.5 [196].

5.2.2. Dynamical and relativistic mergers in galactic nuclei.

The densest regions of stellar BHs in the Universe are expected to be in the centers of nuclear stars clusters (NSC), whose mass-segregated inner regions around the SMBH exceed $ \newcommand{\e}{{\rm e}} \newcommand{\pc}{{\rm pc}} n_*\sim 10^{10}\pc^{-3}$ [197]. In contrast to GCs, the escape velocity of the SMBH in the central regions of NSCs is so high that compact objects are not expected to be ejected by dynamical encounters or supernova birth kicks. In these regions, close BH binaries may form due to GW emission in close single–single encounters [197]. Binary mergers may also be induced by the secular Kozai–Lidov effect of the SMBH [198203] and tidal effects [204]. Detailed estimates give from the $\mathcal{R}_{\rm NSC}\sim 5$ $ \newcommand{\e}{{\rm e}} \newcommand{\Gpc}{{\rm Gpc}} 15~\Gpc^{-3}~{\rm yr}$ [174, 197200], below the observed value. Higher values may be possible for top heavy BH mass functions and if black holes are distributed in disk configurations [197, 205].

These numbers are sensitive to the uncertain assumptions on the total supply of BHs in the NSCs, either formed in situ or delivered by infalling GCs [194, 206209]. If all BHs merged in galactic nuclei once, an upper limit similar to equation (4) is $ \newcommand{\e}{{\rm e}} \newcommand{\Gpc}{{\rm Gpc}} \mathcal{R}_{\rm NSC}< 30~\Gpc^{-3}~{\rm yr}^{-1}$ . Due to the high escape velocity from NSCs and a rate of infalling objects, this bound may be in principle exceeded.

5.2.3. Mergers facilitated by ambient gas.

Dynamical friction facilitates mergers in regions where a significant amount of gas embeds the binary, which may carry away angular momentum efficiently. Particularly, this may happen in a star forming regions [75, 210, 211], in accretion disks around SMBHs in active galactic nuclei (AGN) [212, 213] or if the stellar envelope is ejected in a stellar binary [214]. In AGN, the accretion disk may serve to capture stellar mass binaries from the NSC [215], or help to form them in its outskirts by fragmentation [216]. The rate estimates are at the order of $ \newcommand{\e}{{\rm e}} \newcommand{\Gpc}{{\rm Gpc}} 1~\Gpc^{-3}~{\rm yr}^{-1}$ , below the observed value. Nevertheless, mergers in this channel deserve attention as they have electromagnetic counterparts, the population of AGNs, which may be used in a statistical sense [217].

5.2.4. Isolated triples.

The stellar progenitors of BHs are massive stars, which mostly reside not only in binaries, but in many cases in triples. The gravity of the triple companion drives eccentricity oscillations through the Kozai–Lidov effect, which may lead to GW mergers after close encounters. However, the rate estimates are around or below $ \newcommand{\e}{{\rm e}} \newcommand{\Gpc}{{\rm Gpc}} 6 ~\Gpc^{-3}~{\rm yr}^{-1}$ , below the current observational range [218, 219]. The rates may be higher 2–$ \newcommand{\e}{{\rm e}} \newcommand{\Gpc}{{\rm Gpc}} 25~\Gpc^{-3}~{\rm yr}^{-1}$ for low metallicity triples [220] and further increased by non-hierarchical configurations [221, 222] and by quadrupole systems [223].

5.2.5. Mergers in dark matter halos.

The first metal-free stars (Pop III) may form binaries dynamically and merge in DM halos [224], but the expected rates are below the observed rates [75]. Futhermore, two dynamical channels have been suggested leading to a high number of BH mergers in DM halos. If PBHs constitute a significant fraction of DM, the merger rates following GW captures in single–single encounters match the observed rates [225]. However, this would also lead to the dispersion of weakly bound GCs in ultrafaint dwarf galaxies, contradicting an observed system [226]. More recent estimates show that the LIGO rates are matched by this channel even if only $1\%$ of the DM is in PBHs [227]. The second PBH channel requires only a $0.1\%$ —fraction of DM to be PBHs, given that they form binaries dynamically in the very early universe [228]. While these sources can match the observed rates, as discussed in the following section 6, we await further strong theoretical arguments or observational evidence for the existence of these PBHs [229].

5.3. Advances in numerical methods in dynamical modeling

Recent years have brought significant advances in modelling the dynamical environments leading to GW events.

5.3.1. Direct N-body integration.

State of the art direct N-body simulations have been used to model the dynamics of GCs to interpret GW observations. These methods now reach N  =  106, so that stars are represented close to 1:1 in the cluster [230]. Comparisons between the largest direct N-body and Monte Carlo simulations show an agreement [230, 231]. However, due to their high numerical costs, only a very low number of initial conditions have been examined to date. Further development is needed to account for a higher number of primordial binaries, larger initial densities, a realistic mass spectrum, and a nonzero level of rotation and anisotropy.

Large direct N-body simulation have also been used to study the dynamics in nuclear star clusters (NSC) in galactic nuclei with an SMBH and the formation of NSCs from the infall of GCs [208]. Recent simulations have reached N  =  106 in regions embedding NSCs [232]. Here, the number of simulated stars to real stars is 1:100. To interpret GW mergers in these systems, a 1:1 simulation of the innermost region of the NSC would be most valuable, even if the total simulated number is of the order 105–106. Including a mass distribution and primordial binaries would also be useful.

Direct N-body methods were also recently used to simulate the NSC in AGN with stellar captures by the accretion disk, to predict the rate of tidal disruption events and the formation of a nuclear stellar disk [233, 234]. The most important place for development is to relax the assumption of a rigid fixed disk in these simulations. Indeed, the disk may be expected to be significantly warped by the star cluster [235, 236], and the stars and BHs captured in the disk may grow into IMBHs and open gaps in the disk [213]. Furthermore, an initially nonzero number of binaries, the binary-disk interaction, and a mass spectrum would be important to incorporate to make GW predictions.

5.3.2. Monte-Carlo methods.

State of the art Monte-Carlo methods have also improved significantly during the past years, providing a numerically efficient way to model the evolution of GCs accurately. Recent developments showed that the BH subclusters do not necessarily decouple and evaporate at short time scales and that GCs with longer relaxation times can retain BHs up to a Hubble time [237239]. These methods have been used to predict the evolutionary pathways to some of the observed mergers [69, 240] and to interpret the distributions of masses and spins [152, 153]. These methods have been used to study the formation of IMBHs [241]. Most recently, 2.5-order post-Newtonian dissipative effects were incorporated in order to re-simulate binary-single interactions involving three BHs from results of these Monte Carlo codes which increased the rate of eccentric mergers by a factor of 100 [242244]. The implementation of post-Newtonian terms and tidal dissipation [245] for computing results of strong binary-single and binary-binary encounters involving at least two BHs could further increase merger rates for BBHs expected to originate from GCs. Moreover, implementation of two body gravitational and tidal capture within these codes could provide more insights into the role of dynamics in forming potentially merging BBHs.

Further development is needed to include resonant multibody effects. Moreover, simulations of galactic nuclei with Monte Carlo methods would be valuable to tracking the evolution of binaries, and accounting for long term global effects such as resonant relaxation.

5.3.3. Secular symplectic N-body integration.

Systems which are described by a spherical gravitational potential such as a galactic nucleus or a GC are affected by strong global resonances, in which the anisotropies of the system drive a rapid secular change in the angular momentum vectors of the objects, called resonant relaxation [246]. Vector and scalar resonant relaxation, which affect the distribution of orbital planes and the eccentricity, respectively, are expected to reach statistical equilibrium within a few Myr to a few 100 Myr, respectively. A secular symplectic N-body integration method was recently developed [247]. Preliminary results show that objects such as BHs, which are heavier than the average star in a GC or a nuclear star cluster tend to be distributed in a disk [205]. Since LVC mergers happen more easily in BH disks, if they exist, future studies are necessary to explore the formation, evolution, and the expected properties of such configurations.

The statistical equilibrium phase space distribution of resonant relaxation is known only for a limited number of idealized configurations [248251]. Interestingly resonant relaxation has strong similarities to other systems in condensed matter physics such as point vortices and liquid crystals [247249]. This multidisciplinary connection may be used to study these probable sites of BH mergers [249].

5.3.4. Semianalytical methods.

Semianalytical methods are developed to include the highest number of physical effects in an approximate way. The formation of the galactic bulge from the infall of GCs was examined in great detail with this technique [194, 195]. It was shown, that during this process, more than 95% of the initial GC population is destroyed. Thus, GCs were much more common at cosmologically early times. Since one way to form IMBHs in GCs is by runaway collisions of stars or BHs [252], their numbers may also expected to be higher than previously thought (however, this conclusion may be different in other IMBH formation channels [241]). GCs generate a high rate of mergers between stellar mass and 102$ \newcommand{\e}{{\rm e}} \newcommand{\Msun}{ \mathrm{M}_{\odot}} 10^3\Msun$ IMBHs detectable in the near future with advanced LIGO and Virgo detectors at design sensitivity at z  >  0.6 [201, 202, 253]. BH mergers with IMBHs with 103$ \newcommand{\e}{{\rm e}} \newcommand{\Msun}{ \mathrm{M}_{\odot}} 10^4\Msun$ may also be detected with LISA from the local Universe.

5.4. Astrophysical interpretation of dynamical sources

How can dynamical models test the astrophysical interpretation of GW sources? GW detectors can measure the binary component masses, the spin magnitudes and direction, the binary orbital plane orientation, the eccentricity, the distance to the source and the sky location. The observed distributions of these parameters may be compared statistically to the predicted distributions for each channel. Some smoking gun signatures of the astrophysical environment are also known for individual sources, discussed below.

5.4.1. Mass distribution.

The mass distribution of mergers depends on the theoretically poorly known BH mass function times the mass-dependent likelihood of mergers. Particularly, the mergers of massive objects are favored in GCs, due to the mass dependence of binary formation in triple encounters, binary exchange interactions, dynamical friction, and the Spitzer instability. Monte Carlo and N-body simulations show that the likelihood of merger is proportional to M4 in GCs [254]. The 2-dimensional total mass and mass ratio distribution of mergers may be used to test this prediction [255] and to determine the underlying BH mass function in these environments. The mass function of mergers may also vary with redshift as the most massive BHs are ejected earlier [152].

Recently, [256] introduced a parameter to discriminate among different astrophysical channels:

Equation (5)

This dimensionless number is $1 \pm 0.03$ for PBHs formed dynamically in the early universe (if the PBH mass function is assumed to be flat. For arbitrary PBH mass function the results can be substantially different [257]), 10/7 for BHs which form by GW emission in collisionless systems such as DM halos. For BHs which form by GW emission in collisional systems which exhibit mass segregation, this $\alpha$ varies for different component masses. In galactic nuclei it ranges between 10/7 for the low mass components in the population to  −5 for the highest mass component. It would be very useful to make predictions for $\alpha$ for all other merger channels.

5.4.2. Spin distribution.

Using the empirically measured rotation rate of Wolf-Rayet (WR) stars, the birth spin of the massive BHs is expected to be small [258]. If BHs have undergone previous mergers, their spin is distributed around 0.7 [53, 259]. Dynamical effects may also spin up the WR-descendent BH [158]. If the BH acquires a significant amount of mass due to accretion, it is expected to be highly spinning. Thus, second generation mergers are distinguishable in mass and spin from first generation mergers. Monte Carlo simulations predict that $10\%$ are second generation in GCs [242, 254]. Results from Monte Carlo simulations have also been utilized to investigate the role of spin in gravitational recoil kicks on merged BHs [260]. This has important implications for repeated mergers in dense environments like GCs, results from [260] suggest that that about 30% of merging BHs could be retained in GCs. According to a recent x-ray observing campaign, 7 out of the 22 active galactic nuclei analyzed are candidates for being high spin SMBHs (with spin $>98\%$ ), see table 1 of [261]. Some of the x-ray binaries show evidence of highly spinning stellar mass BHs [262]. However, with the exception of a single source, current LVC sources are consistent with zero effective spin [263]. If spinning, the relative direction of spins is expected to be uncorrelated (isotropic) for spherical dynamical systems. This is different from the standard common envelope channel, where spin alignment is generally expected with the angular momentum vector and counteralignment is not likely, in case the BHs are spinning [263, 264].

5.4.3. Eccentricity distribution.

Since GW emission circularizes binaries [142], they are expected to be nearly circular close to merger unless they form with a very small pericenter separation. Indeed, GW sources in GCs are expected to have a relatively small eccentricity in the LIGO band close to merger [265]. Moderate eccentricity of e  =  0.1 is expected from $10\%$ of GC sources at the low-frequency edge of the Advanced LIGO design sensitivity [242, 243]. However, they may have a high eccentricity in the LISA band [244, 265268] or the DeciHz band [269]. Field triples may also yield some eccentric LIGO mergers [218221]. Eccentricity may be much higher for sources forming by GW emission in close encounters [197].

The eccentricity distribution of merging BH binaries in GCs is expected to have three distinct peaks corresponding to binaries which merge outside of the cluster after ejection following a binary-single hardening interaction, binary mergers which happen within a cluster in between two binary-single interactions, and mergers which happen during a binary-single interactions [244, 267].

For GW capture binaries, the typical eccentricity at the last stable orbit (LSO, extrapolating [142]) is set by the velocity dispersion of the source population $\sigma$ as $ \newcommand{\e}{{\rm e}} e_{\rm LSO} \sim 0.03 (\sigma/1000{\rm km}{\rm s}^{-1}){}^{35/32}\,(4\eta){}^{-35/16}$ , where $ \newcommand{\e}{{\rm e}} \eta=m_1m_2/(m_1+m_2){}^2$ is the symmetric mass ratio [270]. The heavier stellar BHs and IMBHs are expected to merge with a higher eccentricity at around 0.1, while the lower mass BHs will have an eccentricity around 10−3 [270]. At design sensitivity Advanced LIGO and VIRGO may be more sensitive to eccentricity well before merger [271].

5.4.4. Sky location distribution.

Since GW sources are observed from beyond $ \newcommand{\e}{{\rm e}} \newcommand{\Mpc}{{\rm Mpc}} 100\,\Mpc$ , they are expected to be isotropically distributed. The sky location measurement accuracy is typically insufficient to identify a unique host galaxy counterpart for individual mergers. However, the power spectrum of sky location of all mergers may be measured. This may be useful to determine the typical galaxy types of mergers, particularly, whether the sources are in active galactic nuclei [217].

5.4.5. Smoking gun signatures.

Which other features may help to conclusively identify the host environment of individual GW sources?

Resolved clusters with electromagnetic counterparts.

Recently it was shown that several inspiraling stellar mass black hole binaries of Milky Way GCs are expected to be in the LISA band [272]. Therefore if LISA identifies these GW sources on the sky, it will constrain the event rates corresponding to the GC channel.

Modulation of the GW phase due to environmental effects.

In the case of LISA sources, extreme mass ratio inspirals may be significantly affected by an a SMBH companion or gas-driven migration [213, 273]. For stellar-mass BH mergers, the most important effect of a perturbing object is a Doppler phase shift, which accumulates mostly at low GW frequencies [274]. LIGO and LISA together will be able to measure the SMBH provided that the orbital period around the SMBH is less than $\mathcal{O}({\rm yr})$ [274, 275].

GW astrophysical echos.

The identification of a secondary lensed signal by the SMBH using LVC may confirm that a LVC merger takes place in a galactic nucleus. This manifests as a GW echo with a nearly identical chirp waveform as the primary signal but with a typically fainter amplitude depending on direction and distance from the SMBH [276]. If the distance between the source and the SMBH is less than $10^3 M_{\rm SMBH}$ , the relative amplitude of the echo is of the order of $10\%$ , and the time-delay is less than a few hours. Studies of the expected source parameters of sources with GW echos are underway.

Double mergers and mergers with electromagnetic counterparts.

Finally, a case is conceivable in which in which a binary-single or binary-binary encounter results in a double merger, i.e. two mergers from the same direction within a short timespan [277]. Such an observation would indicate a dense dynamical host environment. Furthermore, binary-single or binary-binary encounters involving at least 2 BHs and one or more stellar object may also lead to BBH mergers with with transient electromagnetic counterparts associated with the disruption or accretion of stellar matter on the BHs. Observation of such a counterpart would also indicate that the merging BHs originated in a dense dynamical environment. The inclusion of tidal and gravitational dissipation effects during the computation of such strong encounters [245] within simulations of GCs could help to constrain rates for BBH mergers in which an electromagnetic counterpart could be expected.

6. Primordial black holes and dark matter

Contributors: G Bertone, C T Byrnes, D Gaggero, J García-Bellido, B J Kavanagh

6.1. Motivation and formation scenarios

The nature of dark matter (DM) is one of the most pressing open problems of modern cosmology. The evidence for this mysterious form of matter started to grow in the early 20th century [278], and it is today firmly established thanks to a wide array of independent observations [279]. Its nature is still unknown, and the candidates that have been proposed to explain its nature span over 90 orders of magnitude in mass, from ultra-light bosons, to massive BHs [280282]. An intriguing possibility is that at least a fraction of DM is in the form of PBHs [283]. The recent LVC detections [225, 284], have revived the interest in these objects, and prompted a reanalysis of the existing bounds [283, 285] and new prospects for phenomenological signatures [156, 286].

PBHs might be produced from early universe phase transitions, topological defects (e.g. cosmic strings, domain walls), condensate fragmentation, bubble nucleation and large amplitude small scale fluctuations produced during inflation (see the reviews [229, 287] and references therein). Excluding the possibility of BH relics, non-evaporating PBHs could range in mass from 10−18 to 106 solar masses today, although the most interesting range today is that associated with the observed LIGO BBHs, around a few to tens of solar masses. For those PBHs, amongst the most promising production mechanisms was the one first proposed in [288] from high peaks in the matter power spectrum generated during inflation. When those fluctuations re-enter the horizon during the radiation era, their gradients induce a gravitational collapse that cannot be overcome even by the radiation pressure of the expanding plasma, producing BHs with a mass of order the horizon mass [289]. Most of these PBH survive until today, and may dominate the Universe expansion at matter-radiation equality.

One important question is what fraction $f_{\rm PBH}= \Omega_{\rm PBH}/\Omega_{\rm DM}$ of DM should be made of O(10M) PBHs in order to match the BBH merger rate inferred by LIGO/Virgo ($\mathcal{R} \simeq 10$ –200 Gpc−3 yr−1 [11]). If one considers PBH binaries that form within virialized structures, the corresponding rate is compatible with $f_{\rm PBH} = 1$ [225]. However, PBH binary systems can form in the early Universe (deep in the radiation era) as well [290, 291], if the orbital parameters of the pair allow the gravitational pull to overcome the Hubble flow and decouple from it. A recent calculation of the associated merger rate today [228]—significantly extended in [227]—provides a much larger estimate (compared to the former scenario): This result can be translated into a bound on $f_{\rm PBH}$ , which is potentially stronger than any other astrophysical and cosmological constraint in the same range. The constraint has been recently put on more solid grounds in [292] by taking into account the impact of the DM mini-halos around PBHs, which have a dramatic effect on the orbits of PBH binaries, but surprisingly subtle effect on their merger rate today. Several aspects of this calculation are still under debate, including the PBH mass distribution, the role of a circumbinary accretion disk of baryons, the impact of initial clustering of PBHs (see [293]) and the survival until present time of the binary systems that decoupled in the radiation era.

The critical collapse threshold to form a PBH is typically taken to be $ \newcommand{\e}{{\rm e}} \delta_c\equiv {\delta\rho_c/\rho}\sim0.5$ , with an exponential sensitivity on the background equation of state, the initial density profile and angular momentum. Because any initial angular momentum suppresses PBH formation, PBHs are expected to spin slowly, a potential way to discriminate between them and (rapidly rotating) astrophyical BHs [294].

Inflationary models which generate PBHs are required to generate a much larger amplitude of perturbations on small scales compared to those observed on CMB scales. This can be achieved either through multiple-field effects or an inflection point in single-field inflation [295, 296]. These typically generate a reasonably broad mass fraction of PBHs, in contrast to the monochromatic mass spectrum usually assumed when interpreting observational constraints. In addition, the softening of the equation of state during the QCD phase transition greatly enhances the formation probability of solar mass PBHs compared to the more massive merging BHs LIGO detected [297]. BHs below the Chandrasekhar mass limit would be a smoking gun of a primordial origin.

At non-linear order, scalar and tensor perturbations couple, implying that the large amplitude perturbations required to generate PBHs will also generate a stochastic background of GWs. For the LIGO mass range the corresponding GW frequency is constrained by pulsar timing arrays unless the scalar perturbations are non-Gaussian [298, 299].

6.2. Astrophysical probes

Constraints on the PBH abundance are typically phrased in terms of $f_{\rm PBH} = \Omega_{\rm PBH}/\Omega_{\rm DM}$ , the fraction of the total DM density which can be contributed by PBHs. Even in the case where $f_{\rm PBH} < 1$ , a future detection via astrophysical probes or GW searches remains hopeful [228]. We outline selected astrophysical constraints below, summarizing these in figure 4, where we focus on PBHs around the Solar mass range, most relevant for GW signals.

Figure 4. Refer to the following caption and surrounding text.

Figure 4. Summary of astrophysical constraints on PBHs in the mass range $M \in [10^{-2},\,10^5]\,M_\odot$ . Details of the constraints are given in the main text and we plot here the most conservative. We emphasize that astrophysical constraints may have substantial systematic uncertainties and that the constraints shown here apply only for monochromatic mass functions.

Standard image High-resolution image

Micro-lensing: The MACHO [300] and EROS [301] collaborations searched for micro-lensing events in the Magellanic Clouds in order to constrain the presence of Massive Compact Halo Objects (MACHOs) in the Milky Way Halo. Considering events on timescales of $\mathcal{O}$ (1–800) d constrains $f_{\rm PBH} < 1$ for masses in the range $M_{\rm PBH} \in [10^{-7}, 30]\,M_\odot$ (though these constraints come with a number of caveats, see e.g. [285, 302]). Another promising target is M31: an important fraction of the Andromeda and Milky Way DM halo can be probed by micro-lensing surveys, and an interesting hint comes from the observation of 56 events in the $\mathcal{O}$ (1–100) d range from this region of interest, which may suggest a MACHO population with $f \sim 0.5$ and mass $1 \,M_\odot$ or lighter [303, 304]. A recent search for lensing of Type Ia Supernovae obtained constraints on all PBH masses larger than around $10^{-2}\,M_\odot$ , although substantial PBH fractions $f_{\rm PBH} \lesssim 0.6$ are still compatible with the data [305]. For wide mass distributions the SN lensing constraints go away [74] and PBH could still constitute all of the DM.

Early Universe constraints: PBHs are expected to accrete gas in the Early Universe, emitting radiation and injecting energy into the primordial plasma. This in turn can lead to observable distortions of the CMB spectrum and can affect CMB anisotropies [306]. Early calculations over-estimated the effect [307, 308], with more recent studies finding weaker constraints [309, 310]. In spite of this, data from COBE/FIRAS [309, 311] and PLANCK [310] can still rule out PBHs with masses $M_{\rm PBH} \gtrsim 100 \,M_\odot$ as the dominant DM component. It should be noted that these constraints depend sensitively on the details of accretion onto PBHs in the Early Universe (see e.g. [312]).

Dynamical constraints: The presence of PBHs is also expected to disrupt the dynamics of stars. Wide halo binaries may be perturbed by PBHs and the actual observation of such systems constrains the PBHs abundance above $\simeq$ $ 20 \,M_\odot$ [313] (although significant fractions $f_{\rm PBH} \lesssim 0.2$ are still allowed). PBHs are also expected to dynamically heat and thereby deplete stars in the centre of dwarf galaxies. Observations of stellar clusters in Eridanus II [226] and Segue I [314] have been used to constrain PBHs heavier than $\mathcal{O}(1)\,M_\odot$ to be sub-dominant DM components, unless they have an IMBH at their center [315].

Radio and x-ray constraints: If PBHs exist in the inner part of the Galaxy, which contains high gas densities, a significant fraction of them would inevitably form an accretion disk and emit a broad-band spectrum of radiation. A comparison with existing catalogs of radio and x-ray sources in the galactic ridge region already rules out the possibility that PBHs constitute all of the DM in the Galaxy, even under conservative assumptions on the physics of accretion [316]. During the next decade, the SKA experiment will provide an unprecedented increase in sensitivity in the radio band; in particular, SKA1-MID will have the unique opportunity to probe the presence of a subdominant population of PBHs in the Galaxy in the $10 \div 100 \, M_\odot$ mass range, even if it amounts to a fraction as low as $\simeq$ 1% of the DM.

In figure 4, we show only the most conservative of these constraints, which suggest that PBHs may still constitute all of the DM for masses close to $10\,M_\odot$ , or a smaller fraction ($f_{\rm PBH} \gtrsim 0.1$ ) for masses up to $\mathcal{O}(100\,M_\odot)$ . We highlight that such astrophysical constraints may have large systematic uncertainties and that these constraints apply only to PBHs with a mono-chromatic mass function. Recent studies have begun re-evaluating these constraints for extended mass functions [285, 317319]. For physically motivated mass functions, it may be possible to achieve up to $f_{\rm PBH} \sim 0.1$ for PBHs in the mass range 25–$100 \,M_\odot$ [320]. Relaxing certain dynamical constraints (which typically have large uncertainties) or considering more general mass functions may accommodate an even larger PBH fraction [320, 321].

6.3. Discriminating PBHs from astrophysical BHs

A number of observations may help discriminating PBHs from ordinary astrophysical BHs. The detection of GWs from a binary system including a compact object lighter than standard stellar BHs, say below one solar mass, would point towards the existence of PBHs. This can be deduced from the highest frequency reached in the GW chirp signal, $f_{\rm ISCO} = 4400~{\rm Hz}\, (M_\odot/M)$ , and it is in principle possible already with Advanced LVC in the next run O3 [286]. The detection of GWs at redshift $z \gtrsim 40$ would imply a non-standard cosmology, or the existence of PBHs [322]. Further insights on the origin of BHs might be obtained through the analysis of 'environmental' effects, which are discussed in section 5, and through the analysis of the spatial distribution and mass function of x-ray and radio sources powered by BHs.

7. Formation of supermassive black hole binaries in galaxy mergers

Contributors: M Colpi, M Volonteri, A Sesana

When two galaxies, each hosting a central SMBH merge, the SMBHs start a long journey that should bring them from separations of tens of kpc (1 kpc $= 3.086\times 10^{21}$ cm) down to milli-parsec scales, below which they merge through GWs. The initial pairing of BHs in merging galaxies, their binding into a binary on parsec scales, and their crossing to enter the GW-driven inspiral are the three main stages of this dynamical journey, sketched in figure 5 [65]. In some cases, the two SMBHs become bound in the core of the merger remnant, cross to the gravitational-driven regime and eventually coalesce. However, there are cases in which the two SMBHs never form a binary: one of the SMBHs may remain stranded and unable to pair and bind [323].

Figure 5. Refer to the following caption and surrounding text.

Figure 5. Cartoon illustrating the journey travelled by SMBHs of masses in the range $ \newcommand{\msun}{\,{\rm M_\odot}} 10^{6-8}\msun$ during major galaxy-galaxy collisions. The x-axis informs on the SMBH separation given in various panels, while the y -informs on the timescale. The journey starts when two galaxies (embedded in their DM halos) collide on kpc scales (right-most plot). The inset shows a selected group of galaxies from the cosmological simulation described in [324]. The inset in the second panel (from right to left), from [325], depicts the merger of two disc galaxies and their embedded SMBHs. Pairing occurs when the two SMBHs are in the midst of the new galaxy that has formed, at separations of a few kpc. The SMBHs sink under the action of star-gas-dynamical friction. In this phase, SMBHs may find themselves embedded in star forming nuclear discs, so that their dynamics can be altered by the presence of massive gas-clouds. Scattering off the clouds makes the SMBH orbit stochastic, potentially broadening the distribution of sinking times during the pairing phase [326329]. Furthermore, feedback from supernovae and AGN triggering by one or both the SMBHs affect the dynamics as these processes alter the thermodynamics of the gas and its density distribution, which in turn affect the process of gas-dynamical friction on the massive BHs. It is expected that eventually the SMBHs form a Keplerian binary, on pc scales. Then, individual scattering off stars harden the binary down to the GW-driven domain. This is an efficient mechanism (and not a bottleneck) if the relic galaxy displays some degree of triaxiality and/or rotation. In this case, there exists a large enough number of stars on low-angular momentum orbits capable to interact with the binary and extract orbital energy [330]. The binary in this phase can also be surrounded by a (massive) circum-binary disc [331]. In a process reminiscent to type II planet migration, the two SMBHs can continue to decrease their separation and they eventually cross the GW boundary. Then, GW radiation controls the orbital decay down to coalescence.

Standard image High-resolution image

The cosmic context, the growth of DM halos through mergers with other halos gives us the backdrop upon which all the subsequent evolution develops. This is the first clock: the halo merger rate evolves over cosmic time and peaks at different redshift for different halo masses. Furthermore, the cosmic merger rate predicts that not all mergers occur with the same frequency: major mergers, of halos of comparable masses (from 1:1 down to  ∼1:4) are rare, while minor mergers (mass ratio  <1 : 10) are more common. However, the dynamical evolution is much faster in major mergers than in minor mergers: if the mass ratio of the merging halos and galaxies is too small, the satellite halo is tidally disrupted early on in its dynamical evolution and its central SMBH is left on a wide orbit, too far from the center of the larger galaxy to ever merge with its SMBH [332, 333]. A population of wandering SMBHs is therefore predicted to exist in galaxies [334336].

For mergers where the SMBHs can pair, i.e. find themselves in the newly formed core of the merger remnant, the second hurdle is to get close enough to form a gravitationally bound binary [337340]. For SMBHs embedded in a singular isothermal sphere, a binary forms at a separation of $ \newcommand{\msun}{\,{\rm M_\odot}} \newcommand{\kms}{\,{\rm km\,s^{-1}}} a = GM/2\sigma^2 \simeq 0.2 \, {\rm pc} \, (M_{\rm BBH}/10^6{\msun}) \, \left(\sigma/100 \kms\right){}^{-2}$ , where the mass of the SMBHs exceeds the enclosed mass in stars, gas, and DM. The journey from the beginning of the merger, at tens of kpc, to the formation of the binary on pc to subpc scales, takes between a few tens of Myr for compact galaxies at very high redshift, z  >  3–4 to several Gyr for larger, less dense galaxies at low redshift [337, 341, 342].

In the cases where the SMBHs form a bound binary within the Hubble time, the final crossing into the GW regime hinges on exchanges of energy through scattering with low angular momentum stars in the nucleus of the galaxy [343] or on extraction of angular momentum through gravitational torques coming from a gas disc that may result on the shrinking of its separation [344], or on a combination of the two processes. Recent results of direct N-body simulations, Monte Carlo methods and scattering experiments are giving an optimistic view of what has been considered to be the main bottleneck of the binary evolution for almost 40 years: the 'final parsec problem', i.e. running out of low-angular momentum stars [343]. The evolution of SMBH binaries through stellar scattering seems to continue at nearly a constant rate leading to merger in less than  ∼1 Gyr [330, 345, 346] once rotation, triaxiality and the granularity of the stellar distribution are taken into account.

If the binary environment is dominated by gas, rather than stars, the binary is expected to evolve through interaction with the accretion disk(s) surrounding the SMBHs, through the so called circumbinary disk phase. The disks are formed by the gas that inflows towards the SMBHs with an even small amount of angular momentum: in the single SMBH case these are referred to as 'accretion discs' [347]. Depending on the mass ratio q, the binary may or may not clear a gap within the circumbinary disk. In the former case (valid for $q \ll 1$ ) the less massive hole behaves as a fluid element within the accretion disk of the primary, thus experiencing what is known as Type I migration. In the latter case (generally valid for q  >  0.01), the formation of a central cavity slows down the evolution of the binary (Type II migration). The energy and angular momentum transfer between the binary and the circumbinary disk is mediated by streams leaking from the inner rim into the cavity. The pace of the supermassive binary BH (SMBBH) evolution depends on the detailed dynamics of the streams crossing the gap, which are partially accreted by the two SMBHs and partially flung back to the disk, extracting the binary energy. Simulations generally concur that accretion into the central cavity is efficient enough to bring the SMBBH into the GW-dominated regime (∼0.01 pc) within  ∼1–100 Myr [331, 348354]. The role of AGN feedback on this scales, however, is still largely unexplored and may bring surprises.

In summary, SMBHs in merging galaxies have a long journey, that takes between 1 and 10 Gyr depending on the cosmic time, mass and mass ratios of the host galaxies, galaxy morphologies, stellar and gas content, and on the masses of the SMBH themselves. Figure 6 is an illustration of the delay timescales in the dynamics of SMBH mergers, during the pairing process in major mergers. This is a simplified example for circular binaries with the primary SMBH of $ \newcommand{\msun}{\,{\rm M_\odot}} 10^8 \msun$ and mass ratios q  =  1, and 0.1. Here we included the time for halos to merge [355], and then added an additional timescale for BHs to pair, based on the formalism of [356], motivated by the recent results of [333] who find that the halo merger timescale is insufficient to estimate the pairing timescale. We assumed 100 Myr from binary formation to coalescence for the gas-driven case and the fit proposed by [346] for the stellar-driven case.

Figure 6. Refer to the following caption and surrounding text.

Figure 6. Relation between 'timestart', corresponding to the cosmic time of onset of a galaxy-galaxy collision hosting SMBHs, and 'timecoal' at which the SMBHs merge for circular binaries with the primary BH of $ \newcommand{\msun}{\,{\rm M_\odot}} 10^8 \msun$ and mass ratios q  =  1, and 0.1. Redshifts at start and coalescence are given in the same plot. Dotted and dashed lines refer to sinking times associated to the total of the merger time for the host halos and galaxies, the pairing of SMBH, and then the shrinking of the binary subject to either stellar and gaseous processes. The black solid line represents 'time$_{\rm coal}={\rm time}_{\rm start}$ '. The figure shows that galaxies colliding at $z\sim 1$ can display time delays as long as 3 Gyr.

Standard image High-resolution image

Theoretical studies and predictions, however, are still far from being complete: this is a severely multi-scale problem, intimately connected with the processes of galaxy clustering on cosmological scales, which involves a rich physics. Determining the distribution of merging times in SMBH mergers is critical, as only through the detailed knowledge of this distribution and associated processes, we are able to bracket the uncertainties in the estimates of the SMBH merger rates, relevant for the LISA mission, and to provide reliable estimate of the expected GW signal in the nHz band, relevant to PTAs.

8. Probing supermassive black hole binaries with pulsar timing arrays

Contributors: C Mingarelli, M Kramer, A Sesana

Millisecond pulsars are excellent clocks over long timespans, making them ideal tools to search for GWs [357364]. Indeed, an array of millisecond pulsars forms a galactic-scale nanohertz GW detector called a pulsar timing array (PTA, [365]). The GWs change the proper distance between the Earth and the pulsars, which induces a timing delay or advance in the pulsar pulses. The difference between the expected pulse arrival time and the actual arrival time, called the timing residual, is used to search for signatures of low-frequency GWs. The frequency band where PTAs operate is set by the length of pulsar observations, and the cadence of the observations. Briefly, the lower limit is set by $1/T_{\rm obs}$ , where $T_{\rm obs}$ is the total observation time, and the high-frequency limit is set by $1/\Delta t$ , where $\Delta t$ is the cadence of the pulsar observations. This sets the sensitivity band between 1 nHz and 100 nHz.

The most promising signals in the PTA frequencies are due to the expected cosmic population of SMBBHs. The systems of interest here have masses $>10^8~M_\odot$ , and can spend tens of millions of years emitting GWs in the relevant frequency band. The incoherent superposition of their signals gives rise to a stochastic GW background (GWB, see, e.g. [366370]). On top of it, particularly nearby and/or very massive SMBBHs might be individually resolved [371, 372], providing interesting targets for multimessenger observations. Moreover, the memory effect following the merger of those systems may also give rise to detectable bursts of GW radiation [373, 374]. Gravitational radiation may also originate from cosmic strings [375377], and primordial GWs from e.g. inflation [378].

Here we focus on the stochastic GWB and continuous GW sources, but it is worth mentioning that the correlation function present in GWB searches depends on both the underlying theory of gravity, the distribution of GW power on the sky, and the intra-pulsar distances. Indeed, additional GWB polarizations such as breathing modes can in principal be detected with PTA experiments, e.g. [379, 380], as can departures from GWB isotropy [381386]. Clustering of large-scale structure, resulting in an overdensity of merging SMBBHs, can lead to GWB anisotropy, as can nearby continuous GW sources which are individually unresolvable, but can contribute to GWB anisotropy at level of  ∼$ 20\%$ of the isotropic component, see [372]. Moreover, pulsars separated by less than  ∼$ 3 ^{\circ}$ violate the short-wavelength approximation [383, 412], used to write down the Hellings and Downs curve, and exhibit an enhanced response to GWs which may help in their detection.

World-wide, PTA experiments have been taking data for over a decade, with efforts in North American governed by the North American Nanohertz Observatory for GWs (NANOGrav, see e.g. [387]), in Europe by the European PTA (EPTA) [388], and in Australia by the Parkes PTA (PPTA) [389]. The union of these PTAs forms the International Pulsar Timing Array (IPTA) [390], where the PTAs share data in the effort to accelerate the first detection of nHz gravitational radiation, and to learn more about the individual millisecond pulsars. For a more comprehensive overview of PTA experiments, see e.g. [362364]

8.1. The gravitational-wave background

The cosmic merger history of SMBBHs is expected to form a low-frequency GWB, which may be detected by PTAs in the next few years [391393]. The time to detection depends strongly on the number of pulsars in the array, the total length of the dataset, and the underlying astrophysics which affects the SMBBH mergers, such as stellar hardening process, interactions with accretion disks, binary eccentricity, and potentially SMBH stalling. Searches for the GWB have resulted in evermore stringent limits on the amplitude A of the GWB reported at a reference frequency of 1 yr−1:

Equation (6)

where hc is the characteristic strain of the GWB [394]. This simple power-law scaling assumes the the SMBBH systems are circular when emitting GWs, and that they are fully decoupled from their environment. Binary eccentricity and interactions with gas and stars around the binary can deplete the GW signal at very low frequencies (equivalently at wide binary separations), causing the GW strain spectrum to turn over [395398]. On the other hand, the amplitude of the GWB is affected by the abundance and mass range of the cosmic population of SMBHs. Therefore, future detections of a stochastic GWB will allow to constrain both the overall population of SMBBHs and the physics driving their local dynamics [399405]. Current non-detections have been used in [406] to challenge the popular $M_{\rm BH}-M_{\rm bulge}$ from [40]. However, a full analysis taking into account uncertainties in the merger rate, SMBBH dynamics and the possibility of stalling is needed to draw firm conclusions [407].

The current upper limit on A from the various PTA experiments are similar and are improving with time. From the EPTA is $A<3\times 10^{-15}$ [408], from the PPTA this is $A<1\times 10^{-15}$ [389], from NANOGrav this is $A <1.45\times 10^{-15}$ [409], and the IPTA limit is $1.7\times 10^{-15}$ [390]. In order to improve those, further more sensitive observations are needed, which can and will be provided by improved instrumentation and new telescopes like FAST, MeerKAT and eventually the SKA [410]. Additionally, systematics needs to be addressed. For instance, solar system ephemeris errors can mimic a GWB signal, if the underlying data are sufficiently sensitive [411]. Here, mitigation techniques can be applied, as already done in the recent analysis of the NANOGrav 11 year data [409], while other effects, such as the interstellar weather may be best addressed with multi-frequency observations. Future IPTA results will take those and other effects into account.

8.2. Continuous gravitational waves

Individual nearby and very massive SMBBHs emitting nHz GWs can also be detected with PTA experiments [414417]. These SMBBHs are likely in giant elliptical galaxies, though the timescale between the galaxy merger and the subsequent SMBH merger is poorly understood. Unlike LIGO or LISA sources, these SMBBHs are in the PTA band for many millions of years, and will likely merge outside both the PTA and LISA band. The sky and polarization averaged strain of a continuous GW source is

Equation (7)

where $\mathcal{M}_c^{5/3} = [q/(1+q){}^2] M^{5/3}$ is the binary chirp mass, q  <  1 is the binary mass ratio, M is the total binary mass, f  is the GW frequency and DL is the luminosity distance to the binary.

The time to detection of continuous GW sources has been estimated in various ways: namely by simulating a GW source population based on cosmological simulations [371, 392, 418], or by using an underlying galaxy catalog to estimate which nearby galaxies can host SMBBH systems which are PTA targets [372, 419]. Both these approaches largely agree that at least one SMBBH system will be detected in the next decade or so, with the details depending on the amount of red noise in the pulsars. Detecting individual SMBBH systems is expected to shed light on the so-called final parsec problem, providing valuable insights into how SMBBHs merge. In fact, pulsar distances are well-measured, it may be possible to measure the SMBH spins using the pulsar terms [420].

Importantly, resolving an individual SMBBH with PTAs will open new avenues in multimessenger astronomy [421424]. The combined GW and electromagnetic signals will allow to pin down the properties of the binary and its environment, to study the dynamics of the SMBBH pairing and the coupling with the surrounding gaseous disc, if present [362, 399]. Even a single joint detection will allow to nail down the distinctive electromagnetic properties of accreting SMBBHs. Analog systems can then be searched in the archival data of large surveys, to quantify their overall cosmic population. A promising way forward is to check candidates mined systematically in time domain surveys for peculiar spectral signatures, hence producing credible targets for PTAs.

9. Numerical simulations of Stellar-mass compact object mergers

Contributor: A Perego

9.1. Motivations

Compact binary mergers (CBM) comprising at least one NS (i.e. BNS or BH-NS mergers) are unique cosmic laboratories for fundamental physics at the extreme. These powerful stellar collisions involve all fundamental interactions in a highly dynamical and intense regime (see, e.g. [425428] and references therein for more comprehensive overviews). Their study is extremely challenging and requires multidimensional, multiphysics and multiscale models. General relativistic hydrodynamical (GRHD) simulations, including a detailed, microphysical description of matter and radiation, are necessary to model the merger and to produce robust predictions for a large variety of observables. Their comparison with observations will provide an unprecedented test for our understanding of the fundamental laws of Nature, in regimes that will never be accessible in terrestrial laboratories.

CBMs are events of primary relevance in astrophysics and fundamental physics. They are intense sources of neutrinos [429] and GWs [142], and primary targets for the present generation of ground-based GW detectors. Both theoretical and observational arguments support CBMs as progenitors of sGRB [430, 431]. At the same time, they are the places where the heaviest elements in the Universe (including Gold and Uranium) are synthesized and ejected into space, via the so-called r-process nucleosynthesis [432434]. The radioactive decay of these freshly synthesized, neutron-rich elements in the ejecta powers a peculiar EM transient, called kilonova (or macronova), hours to days after the merger [435, 436]. Despite happening far away from the merger remnant, the $\gamma$ -ray emission, the r-process nucleosynthesis, and the kilonova transient are extremely sensitive to physical processes happening where the gravitational curvature is stronger. In particular, the equation of state (EOS) of NS matter is thought to play a central role in all the emission processes, since it determines the compactness of the merging NSs, their tidal deformations, the lifetime and spin frequency of the remnant, the amount and the properties of the ejected mass.

On August, 17 2017, the first detection of GWs from a CBM event compatible with a BNS merger marked the beginning of the multimessenger astronomy era [21, 22] and remarkably confirmed several years of intense and productive lines of research. The GW signal, GW170817, provided the link to associate a sGRB, GRB170817A [23], and the kilonova AT2017gfo transient [180, 437, 438] to a CBM localized in the NGC4993 galaxy. Moreover, this single event set constraints on the EOS of NS matter and gave an independent measure of the Hubble constant [24].

Many relevant aspects of the merger and of the emission processes are, however, not yet fully understood and relate to open questions in fundamental physics and astrophysics. The interpretation of the observational data strongly relies on the theoretical modeling of compact binary sources in GR. During the last years the unprecedented growth of computing power has allowed the development of increasingly sophisticated numerical models. The most advanced simulations in Numerical Relativity (NR) are set up using a first-principles and an ab initio approach. Neutrino radiation and magnetic fields are thought to play a key role during the merger and its aftermath. Their inclusion in detailed NR simulations, in association with microphysical, finite temperature EOS for the description of matter at and above nuclear saturation density is one of the present challenges in the field. Reliable predictions do not only require the inclusion of all the relevant physics, but also accurate numerical schemes and numerically converging results. Robust and computationally stable discretizations are also essential to perform long-term and high-resolution simulations.

9.2. Recent results for binary neutron star mergers

9.2.1. Gravitational waves and remnant properties.

A primary outcome of BNS simulations are accurate gravitational waveforms. Numerical models provide consistent and continuous GW signals for all three major phases: inspiral, merger, and post-merger/ring-down. As discussed earlier, key parameters encoded in the waveform are the masses and the spins of the coalescing objects, the quadrupolar tidal polarizability (depending on the NS EOS), and possibly the residual binary eccentricity. For the inspiral phase, state-of-the-art simulations cover at least 10 to 20 orbits before coalescence [439, 440]. Due to the cold character of this phase, zero-temperature EOSs for the NS matter are often used. High-order finite difference operators have been shown to be key to reach high-confidence numerical results [441]. Error control on the numerical results is essential for using NR waveforms for the analysis of data streams coming from GW detectors. State-of-art analysis of the error budget include the study of the numerical convergence, requiring multiple resolutions (4 or 5), and an estimate of the error due to the finite distance extraction of the GW signal within the computational domain. Only recently, initial conditions for rotating NSs have become available [442]. They were used to follow consistently, for the first time, the inspiral phase in numerical relativity, including spin precession as well as spin–spin and spin-orbit couplings [443445]. The high computational costs have limited the exploration of the BNS wide parameter space. However, thanks to the large set of presently available waveforms, detailed comparisons with Analytical Relativity results (including post-Newtonian and Effective One Body approaches) are nowadays possible [439, 446, 447]. Moreover, large databases of NR waveforms led to the production of purely NR-based waveform models [448]. These results are critical to improve the quality of semi-analytic waveforms employed in current GW data analysis.

The merger and its aftermath are highly dynamical and non-linear phases. A systematic study of the remnant properties and of its GW emission is presently made possible by large and extensive sets of simulations in NR (e.g. [449451]), sometimes extending for tens of ms after the merger. Detailed analysis of the post-merger gravitational waveforms revealed the presence of characteristic high-frequency peaks, which will be possibly accessible by third generation GW detectors. These features in the GW spectrum are associated with properties of the nuclear EOS [449, 452, 453], with the presence of a magnetised long-lived massive NS [454], with the development of one-arm spiral instabilities [455457] or convective excitation of inertial modes inside the remnant [458]. However, a firm understanding of these structures in the GW frequency spectrum is still in progress. The remnant fate depends primarily on the masses of the colliding NS and on the nuclear EOS. The collapse timescales of metastable remnants to BHs crucially relies on physical processes that are still not fully understood, including angular momentum redistribution (possibly of magnetic origin) and neutrino cooling. A prompt collapse to a BH, expected in the case of massive enough NSs and soft EOS, results in the most luminous GW emissions at merger. For stiffer EOS or lower NS masses, differential rotation and thermal support can temporarily prevent the collapse of an object whose mass is larger than the mass of a maximally rotating NS, forming a so-called hypermassive NS (HMNS). Mergers producing a HMNS emit the largest amount of energy in GWs, since an intense luminosity is sustained for several dynamical timescales. If the remnant mass is below the maximum mass of a maximally rotating NS or even of a non-rotating NS, a supramassive or massive NS can form, respectively. Gravitational waveforms for a supramassive or massive NS are similar to those of a HMNS case, but weaker. The detection of this part of the spectrum is beyond the capability of the present generation of GW detectors [459]. NR simulations take consistently into account the angular momentum emitted in GWs and in ejected mass, and predict the angular momentum of the final remnant to be $0.6 \lesssim J/M^2 \lesssim 0.85$ . In the HMNS and supramassive NS cases, the super-Keplerian value of J provides a large reservoir to power subsequent angular momentum ejections. Within the first dynamical timescale, this leads to the formation of a massive, thick disk around the central remnant.

9.2.2. Matter ejection and electromagnetic counterparts.

Numerical simulations are necessary to quantitatively model matter ejection from BNS. Different ejection mechanisms result in different ejecta properties. In addition to GW emission, BNS simulations predict the ejection of both tidal (cold) and shocked (hot) dynamic ejecta, a few ms after the GW peak [449, 450, 460464]. The most recent NR simulations employing microphysical, finite temperature EOS predict between 10−4 and a few 10−3 M$_{\odot}$ of dynamic ejecta, moving at $v \sim 0.3\,c$ . This range covers both intrinsic variability in the binary (e.g. NS masses) and uncertainties in the nuclear EOS. However, results for the same systems from different groups agree only within a factor of a few. Differences in the treatments of the NS surface and of the floor atmosphere, as well as in the microphysical content and in the extraction of matter properties at finite radii possibly account for these discrepancies. Differently from what was previously thought, state-of-the-art simulations including weak reactions show that neutrino emission and absorption decrease the neutron richness of the equatorial ejecta and, more significantly, of the high latitude ejecta. The former can still synthesize the heaviest r-process elements [465], while for the latter the formation of lanthanides can be inhibited. These results have been recently confirmed by parametric studies based on NR models [466, 467].

Long term (100 s of ms) simulations of the merger aftermath show wind ejecta coming from the remnant and the accretion disk. These winds are powered by neutrino absorption [468, 469] or, on the longer disk lifetime, by viscous processes of magnetic origin and nuclear recombination inside the disk [462, 470473]. If a HMNS has formed, the neutrino emission is expected to be more intense and the differential rotation can also power winds of magnetic origin [474]. The unbound mass can sum up to several 10−2 M$_\odot$ , depending on the disk mass, and moves at  ∼$ 0.1~c$ . Weak processes, including neutrino irradiation, decrease the neutron richness, particularly at high latitudes and for very long-lived remnants [475]. This implies the production of the first r-process peak in $\nu$ -driven winds and viscous ejecta. In the former case, the nucleosynthesis of lanthanides is highly suppressed, while in the latter the wider distribution of neutron richness leads to the production of possibly all r-process elements, from the first to the third peak (e.g. [471, 472, 475, 476]. Only a few simulations of the merger aftermath are presently available and only few of them were performed in a NR framework. While the study of the viscosity-driven ejecta led to a partial agreement between different groups, simulations of $\nu$ - and magnetically-driven winds are still very few and additional work is required.

The different compositions and kinematic properties of the different ejecta channels have implications for the magnitude, color, and duration of the kilonova (see [477] and [478] for recent reviews). The presence of lanthanides in the ejecta is expected to significantly increase the photon opacity due to the presence of millions of mostly unknowns absorption lines [479]. Detailed radiative transfer codes provide the most accurate models for the EM emission over timescales ranging from a few hours up to a few weeks [480482]. Due to the uncertainties in the opacity treatment and in the geometry of the ejecta, large differences could still arise between different models. More phenomenological approaches, characterized by significantly lower computational costs and accuracy, allow a more systematic and extensive exploration of the different parameters in the ejecta properties [483, 484]. Based on these models, it was suggested that, due to the absorption of neutrinos at high latitudes, the presence of lanthanide-free material (characterized by lower opacity) could result in a bluer and earlier (∼a few hours) emission, compared with the redder and dimmer emission powered by material enriched in lanthanides [468, 485, 486].

9.2.3. GW170817 and its counterparts.

The detection of GW170817 and of its counterparts represents an unprecedented chance to test our understanding of BNS. The analysis of the gravitational waveform and the subsequent parameter estimation made use of large sets of approximated waveform templates. The most recent results obtained in NR could potentially improve this analysis and set more stringent constraints on the intrinsic binary parameters and on the EOS of NS matter. The interpretation of AT2017gfo as a kilonova transient revealed the need to consider at least two different components to explain both the early blue and the subsequent red emission [438, 487]. More recently, the observed light curves have been traced back to the properties and to the geometry of the ejecta predicted by the most recent numerical models. This analysis has confirmed the multi-component and anisotropic character of the BNS ejecta, as well as the central role of weak interaction in setting the ejecta composition [488490]. Finally, the combination of the information extracted from the GW and from the EM signal enabled the possibility to set more stringent constraints on the nuclear EOS, in a genuine multimessenger approach [491, 492]. Both very stiff and very soft EoSs seem to be disfavored by this kind of analysis. A similar approach has been used to show that the formation of a HMNS is the most probable outcome of this event [493].

9.3. Recent results for black hole-neutron star mergers

Modelling of BH-NS mergers in GR shares many similarities with BNS merger modelling, but reveals also differences and peculiarities (see [426] for a review). The larger ranges in masses and spins expected for stellar mass BHs significantly increase the parameter space. The large mass ratio between the colliding objects makes NR simulations more expensive, since the inspiral requires larger initial distances to cover a sufficient number of orbits, and the dissimilar lengthscales require higher resolutions. These numerical challenges have limited the number of models and waveforms that are presently available in comparison with the BNS case. If the final remnant is always represented by a spinning BH, the presence of a disk around it depends on the mass ratio and BH spin [494496]. Larger BHs, with moderate spins, swallow the NS during the plunge dynamical phase, while the decrease of the last stable orbit in the case of aligned, fast spinning BHs leads more easily to the tidal disruption of the NS (in the form of mass shedding) and to a massive accretion disk formation (with masses possibly in excess of 0.1 M$_\odot$ ). Misalignment between the orbital and the BH angular momentum induces non-trivial precessions, encoded both in the GW signal and in the amount of mass outside the BH horizon. The GW spectrum shows a characteristic cutoff frequency directly related with the orbital frequency at the NS tidal disruption or at the last stable orbit. Since this frequency decreases with increasing BH spin and with decreasing compactness, its detection could provide direct constraints on the NS matter EOS. Dynamical mass ejection from BH-NS merging system is expected to be caused by tidal torque when the NS is disrupted, and to happen predominantly along the equatorial plane. The ejected mass can be significantly larger than in the BNS case and neutrino irradiation is expected to have only a minor effect in changing the ejecta composition [497]. Similarly to the BNS case, the viscous evolution of the disk drives significant matter outflows on the disk lifetime, while the absence of a central NS reduces the emission of neutrinos and the ejection of $\nu$ - or magnetically-driven winds [498]. The weaker effect of neutrino processes on the dynamic ejecta results always in robust r-process nucleosynthesis between the second and the third r-process peaks, while viscous disk winds can still synthesize all r-process nuclei. The high lanthanide content in the ejecta is expected to produce a redder kilonova transient a few days after the merger [499].

9.4. Perspectives and future developments

A complete and dense exploration of the wide parameter space, including a consistent treatment of NS and BH spins and of their evolution, is one of the major challenges in the study of CBMs in NR. The extraction of waveforms from long inspiral simulations at high resolution, employing accurate high-order numerical schemes, is necessary to build a robust NR waveform database. These templates, in combination with more analytical approaches, can guide the construction of complete and coherent semi-analytical waveforms for the GW data analysis, spanning many orbits from the inspiral to the actual coalescence. The large variety of properties in the colliding system potentially translates to a broad distribution of properties for the remnant and for the ejecta. Long term simulations of the merger and of its aftermath are the necessary tool to provide a complete and accurate description of the ejecta. The inclusion of the physics necessary to model the remnant and the matter ejection is still at its outset. In particular, the consistent inclusion of neutrino radiation and magnetic field evolution in NR model is extremely challenging. Leakage scheme for neutrino radiation are presently implemented in many CBM models (see, e.g. the introduction section of [500] for a detailed discussion). They provide a robust and physically motivated treatment for neutrino cooling. However, they are too inaccurate to model long term evolution and neutrino absorption in optically thin conditions. State-of-the-art simulations include gray moment scheme [463, 501]. Nevertheless, the significant dependence of neutrino cross-sections on particle energy requires spectral schemes. Large velocity gradients inside the computational domain make the accurate transformation of the neutrino energy spectrum between different observers and its transport highly non-trivial. Moreover, the application of moment schemes for colliding rays in free streaming conditions leads to closure artifacts in the funnel above the remnant. Monte Carlo radiation schemes represent an appealing alternative, but their computational cost is still far beyond our present capabilities. The usage of Monte Carlo techniques to provide more physical closures in moment scheme seems a more viable approach [502]. Neutrino masses have a potentially high impact on the propagation and flavor evolution of neutrinos from CBMs. New classes of neutrino oscillations, including the so-called matter neutrino resonances, can appear above the remnant of BNS and BH-NS mergers and influence the ejecta nucleosynthesis [503, 504]. However, numerical modelling of these phenomena is still at its dawn [476, 505, 506].

General relativistic magneto-hydrodynamics codes are presently available and they have started to access the spatial scales necessary to consistently resolve the magneto-rotational instabilities (MRIs, [507]). The latter appear in the presence of differential rotation and locally amplify the magnetic field. However, global simulations obtained with unprecedented spatial resolution (of the order of 12.5 m) do not show convergence yet, proving that we are still not able to resolve the most relevant scales for magnetic field amplification in a self-consistent way [508]. In the past, sub-grid models have been used as alternative approaches to ab initio treatments [454, 509, 510]. More recently, two different formulations of effective MHD-turbulent viscosity in General Relativity have been proposed and used in both global BNS and long term aftermath simulations [511, 512]. Moreover, the role of bulk viscosity in the post-merger phase has started being investigated [513].

Accurate modelling of the GW and EM signals for CBMs are key to set tight constraints on the EOS of NS matter, which still represents one of the major sources of uncertainty in both fundamental physics and in numerical models. If the connection between CBMs and short GRBs will be confirmed by future detections, the knowledge of the remnant fate, as well as of the environment around it, will be crucial to address the problem of short GRB engines, including jet formation, collimation, and break-out. The accurate modelling of small-scale magnetic field amplification, as well as of heat redistribution due to neutrino transport, is key to predict the lifetime of the remnant for cases of astrophysical interest. In fact, the presence of highly rotating and magnetized massive NSs [454, 514], or of fast spinning BHs, is anticipated to play an essential role in solving the puzzle of the short GRB central engine.

The high computational costs required by long-term, high-resolution, numerically accurate and multiphysics models of CBMs point to the need of developing a new generation of numerical schemes and codes for the new generations of large supercomputers. These codes will need, for example, to improve scalability and to employ more heavily vectorization in the hybrid (shared & distributed) parallelization paradigm. This is perhaps the greatest challenge in the NR field for the years to come.

10. Electromagnetic follow-up of gravitational wave mergers

Contributor: A Horesh

The year 2017 will be remembered as the year in which extraordinary achievements in observational astrophysics have been made. On 2017, August 17, the LIGO and Virgo detectors detected for the first time GWs from the merger of two NSs, dubbed GW 170817. Adding to the excitement was the detection of gamma-ray emission only two seconds after the merger event, by the Fermi satellite. The sensational discovery of a GW signal with a coincident EM emission led to one of the most comprehensive observational campaigns worldwide. A few hours after the GW detection, the LIGO and Virgo detectors managed to pinpoint the position of the GW event to an error circle of 34 sq. degrees in size (see figure 7 below). This area was small enough so that an international teams of astronomers encompassing more than a hundred instruments around the world on ground and in space could conduct an efficient search for EM counterparts.

Figure 7. Refer to the following caption and surrounding text.

Figure 7. Left panel: The multi-messanger sky localization of GW170817 and the identification of the host galaxy. Reprinted by permission from Springer Nature Customer Service Centre GmbH: Springer Nature, Nature, [24], Copyright 2017. Right panel: The posterior distribution of H0 compared to the recent CMB and SNIa constraints [635, 636]. Reproduced from [22]. © 2017. The American Astronomical Society. All rights reserved.

Standard image High-resolution image

Roughly 11 h after the GW event, an optical counterpart was announced by the 1M2M ('Swope') project team [180, 515]. Many other teams around the world independently identified and observed the counterpart in parallel to the SWOPE team discovery. The optical detection of the counterpart pinpointed the source to a precise location in the galaxy NGC4993 at a distance of 40 Mpc, (which is also consistent with the distance estimate from the GW measurements; see section 2). Overall, the counterpart (for which we will use the official IAU name AT 2017gfo) was detected across the spectrum with detections in the ultraviolet, optical, infrared (e.g. [180, 437, 438, 515524]) and later on also in the x-ray (e.g. [525527]) and in the radio (e.g. [528, 529]).

Below we briefly summarize the observational picture with respect to theoretical predictions.

10.1. The high-energy counterpart

For many years it was hypothesized that short gamma-ray bursts (GRBs) that are by now routinely observed, are originating from NS mergers (e.g. [530]). The theoretical model includes the launching of a relativistic jet during the short period of accretion onto the merger remnant. This energetic jet is believed to be responsible for the prompt gamma-ray emission. In addition, the interaction of the jet with the interstellar medium will produce an afterglow emission in the optical, x-ray and also radio wavelengths.

The short GRB ($T_{90} = 2.0 \pm 0.5$ s) that has been discovered about 1.7 s after the GW170817 merger event [531] was irregular compared to other short GRBs observed so far. Most notable is the low equivalent isotropic energy $E_{\rm iso} \approx 5 \times 10^{46}$ erg (in the 10–1000 kev band), which is  ∼4 orders of magnitude lower than a typical short GRB energy. In addition to the initial detection by Fermi-GBM, there were observations made by the Integral satellite. Integral also detected the short GRB with a flux of  ∼$ 1.4 \times 10^{-7}$ erg cm−2 (at 75–2000 kev [532]).

Following the initial detection and once the candidate counterpart had been localized, x-ray observations were made within less than a day of the merger event. The initial x-ray observations by both the Neil Gehrels Swift Observatory and by the Nuclear Spectroscopic Telescope Array (NuSTAR) resulted in null-detections. However, later on, on day  ∼9 after the merger, the Chandra telescope detected x-ray emission from the AT 2017gfo for the first time [525] with an initial isotropic luminosity of $\approx$ $ 9 \times 10^{38}$ erg s−1. In the following days, the x-ray luminosity appeared to continue to slowly rise [525, 526]. Late-time observations (109 d after the merger) by Chandra still show that the x-ray emission continues to slowly rise [533].

In order to reconcile the relatively faint prompt gamma-ray emission and the late onset of the x-ray emission, it was suggested that AT 2017gfo was a regular short GRB but one that is observed slightly off the main axis by  ∼10 degrees of the jet [525]. However, this interpretation has yet to be tested against observations in other wavelengths, such as the radio (see below).

10.2. The optical and infrared counterpart

Over four decades ago, a prediction was made by [429, 432] that neutron-rich material can be tidally ejected in a NS merger. About a decade later numerical simulations also showed that NS mergers will exhibit such mass ejections, where the mass ejected is expected to be in the mass range 10−5–10−2 M$_{\odot}$ with velocities of 0.1–0.3 c (e.g. [534]). It was also predicted that heavy elements would form in the neutron rich ejected material via r-processes. As discussed in detail in the previous section 9, additional material is also expected to be ejected by accretion disk winds and from the interface of the two merging stars (the latter material will be ejected mostly in the polar direction). In general, each ejected mass component may have a different neutron fraction leading to a different r-process element composition.

As the ejecta from the NS merger is radioactive, it can power transient emission, as proposed in [535]. The initial prediction was that the emission will be supernova-like and will be blue and peaking on a one-day timescale. Later on, more detailed calculations by [182] showed that the peak of the emission will be weaker at a level of 1041 erg s−1. The next theoretical developments were made by [479, 480, 483] who for the first time calculated the effect of the heavy r-process element opacities on the emission. They found that due to very high opacities, the emission is expected to be even weaker, with its spectral peak now in the infrared (instead of the optical) and with the peak timescale being delayed. Since the various ejecta mass components may have different compositions, they therefore may have different opacities. Thus in principle, one component may form a blue short-lived emission (the so-called 'blue component') while another may emit in the infrared with week-long timescales (the so-called 'red component').

The optical emission from the AT 2017gfo peaked at a faint absolute magnitude of $M_{V} \approx -16$ in one day and began to rapidly decline, while the infrared emisson had a somewhat fainter and later (at approximately 2 d) peak compared to the optical emission. The overall observed brightness of the source in the optical and infrared bands and its evolution over time have shown in general an astonishing agreement with the predicted properties of such an event.

The multiple photometric and spectroscopic measurements sets obtained by the various groups paints the following picture: at early times, at about 0.5 d after the merger, the ejecta temperature was high at $T\sim 11\,000$ K (e.g. [520]). A day later the spectral peak was at $\approx$ 6000 $\mathring{\rm A}$ , and the temperature decreased to $T\approx 5000$ K [437]. From this point onwards the spectral peak quickly moved into the infrared band. The combined optical and IR emission and its evolution were found to be consistent with an energy source powered by the radioactive decay of r-process elements (e.g. [438, 516, 517]). Based on both the photometric and spectroscopic analysis an estimate of the ejecta mass and its velocity were found to be ${\rm M}_{ej} \approx 0.02$ –0.05 M$_{\odot}$ with a velocity in the range 0.1–0.3 c [437, 516520, 522524].

One of the main claims with regards to this event is that the observations provide evidence for the formation of r-process elements. This conclusion is mainly driven based on the evolution of the infrared emission [516, 521]. Tanvir et al [438], for example, obtained late-time IR measurements using the Hubble space telescope and argue that the slower evolution of the IR emission compared to the optical require high opacity r-process heavy elements with atomic number  >195. The infrared spectrum shows broad features which are presumably comprised of blended r-process elements [438]. These broad spectral features were compared to existing model predictions and show a general agreement, albeit there are still inconsistencies that need to be explained (e.g. [478, 521, 536]). There is still an ongoing debate about the composition of the heavier elements. [517] claims that the observed infrared spectral features can be matched with CS and Te lines. Others estimate the overall fraction of the lanthanides in the ejecta and find it to be in the range ${\rm X}_{lan} \approx 10^{-4}$ –10−2. It seems that at early times (up to 3–5 d), the ejecta that dominates the emission has very low lanthanide fraction and that there are discrepancies between the predicted and observed optical light curves (e.g. [519, 523]. Several works (e.g. [437, 478, 517, 518, 521, 524, 536, 537]) explain the early versus late-time behaviour of the emission by having an ejecta with two components (as also predicted in the literature). The first component is the 'blue' kilonova, with a high electron fraction and thus a low fraction of heavy elements, which dominates the optical emission early only. A 'red' kilonova is the second component that is comprised by heavy r-process elements and that produces the slow IR evolution and the broad spectral features observed at late-times. Still there are some claims that even at late times, the emission can be originating from an ejecta with only low-mass elements [538].

10.3. The radio counterpart

In addition to the radio afterglow emission on the short time scale (days), radio emission is also expected on longer time scales (month to years). The latter is not a result of the relativistic jet but rather originating from the interaction of the slower dynamical ejecta with the instersteIlar medium (ISM) [539]. This emission is expected to be rather weak where the strength of the peak emission depends on the velocity of the dynamical ejecta, the ISM density and on some microphysical parameters. While in the past, radio afterglows of short GRBs (not accompanied by GWs) were detected (e.g. [540]), long-term radio emission was never observed until GW 170817 (including in the previous cases where kilonova candidates were discovered [541, 542]).

Similar to any other wavelength, radio observations were undertaken within the day of the GW170817 merger discovery and the following days. The early time observations performed by The Jansky Very Large Array (VLA), The Australian Compact Array (ATCA), the Giant Meterwave Radio Telescope (GMRT) and the Atacama Large Millimeter Array (ALMA), all resulted in null-detections [528, 543, 544]. Late-time VLA observations, however, finally revealed a radio counterpart $\approx$ 16 d after the merger [528]. The initial radio emission was weak at a level of a couple of tens of $\mu$ Jy only (at both 3 and 6 GHz). Follow-up ATCA observations confirmed the radio detection. Upon detection, a long-term radio monitoring campaign was initiated, and the results are reported in [529]. They report that the radio emission still continued to rise at  >100 d after the merger. The multiple frequency observations also show that the radio emission is optically thin with a spectral index of $\alpha = -0.6$ .

Hallinan et al [528] compared the radio observations to several predictions including an on-axis jet, a slightly off-axis jet, and being completely off-axis. In addition, they included in their comparison a model in which the jet forms a hot wide-angled mildly relativistic cocoon. This cocoon formed as the jet is working its way out of the dynamical ejecta, may also lead to gamma-ray, x-ray, and radio emission, but with different characteristics than the emission formed by either the highly relativistic jet or the slower dynamical ejecta. In fact, [528] find that both an on-axis jet model, or even a slightly off-axis jet one, are expected to produce bright radio emission in the first few days after the merger which by day 16 should start fading, a prediction which does not match the rising observed radio source. The model that is the most consistent with current data is the cocoon model with either a choked or successful or 'structured' jet (see e.g. [533, 545]). In fact, the choked-jet cocoon model may also explain the relatively low energy of the gamma ray emission [546].

If both the late-time radio and x-ray emission originate from interaction of the material (whether it is the dynamical ejecta or a cocoon) with the ISM, one can test whether the observed emission in both wavelengths is indeed connected as expected. For now, it seems that the observed x-ray emission fits the prediction which is based on extrapolating the observed radio emission into higher energies. Both the x-rays and the radio emission have also roughly the same spectral slope. This suggests that there is no additional power source in play at this time.

10.4. Many open questions

While vast amounts of data have been collected for this amazing merger event, and while a flood of papers report many types of analysis and conclusions (by no means are we attempting to cover all of them here), there are still many open questions remaining. For example, is this event connected to short GRBs or do we still have to prove this connection? Which r-process elements form and at what time? Do the observations really require producing heavy r-process elements or can they all be explained by lighter-element components? How many components does the ejecta have and which one dominates the emission and when? Are there any other emission power sources in play such as a magnetar (even if at short time)? Are we really seeing a cocoon with a choked or successful jet or is there some other scenario that can explain the radio emission combined with all the other evidence?

As scientists around the world are still working on analyzing all the data in hand for this event and are also still collecting new data, they are also gearing up for the future, and preparing for the next year when the LIGO and Virgo detectors switch back on. At this time, hopefully more events with EM signatures will be discovered and more answers (and surely more new questions) will present themselves.

11. X-ray and gamma-ray binaries

Contributor: M Chernyakova

The population of galactic x-ray sources above 2 keV is dominated by the x-ray binaries, see e.g. [547]. A typical x-ray binary contains either a NS or a BH accreting material from a companion star. Due to angular momentum in the system, accreted material does not flow directly onto the compact object, forming a differentially rotating disk around the BH known as an accretion disk. X-ray binaries can be further divided into two different classes, regardless the nature of the compact object, according to the mass of the companion star: high-mass x-ray binaries and low-mass x-ray binaries. The secondary of low-mass x-ray binary systems is a low-mass star, which transfers matter by Roche-lobe overflow. High-mass x-ray binaries comprise a compact object orbiting a massive OB class star. High-mass x-ray binaries systems are a strong x-ray emitter via the accretion of matter from the OB companion. At the moment 114 high-mass x-ray binaries [548] and 187 low-mass x-ray binaries [549] are known.

Black hole x-ray binaries are interacting binary systems where x-rays are produced by material accreting from a secondary companion star onto a BH primary [347]. While some material accretes onto the BH, a portion of this inward falling material may also be removed from the system via an outflow in the form of a relativistic plasma jet or an accretion disk wind, see e.g. [550] for a review. Currently, the known galactic BH x-ray population is made up of 19 dynamically confirmed BHs, and 60 BH candidates [551]. The vast majority of these galactic BH x-ray objects are low-mass x-ray binaries. Most of these systems are transient, cycling between periods of quiescence and outburst. This behaviour is associated with changing geometries of mass inflow and outflow, e.g. [550].

At higher energies, however the situation is drastically different. While current Cherenkov telescopes have detected around 80 galactic sources (see the TeVCat catalogue [552]), only seven binary systems are regularly observed at TeV energies. Properties of PSR B1259-63, LS 5039, LSI  +61 303, HESS J0632+057 and 1FGL J1018.6-5856 are reviewed in [553]. Since 2013 two more galactic binaries have been discovered at TeV sky, PSR J2032+4127 [554] and HESS J1832-093 [555], but still the number of binaries observed at TeV sky is extremely small, and the reason why these systems are able to accelerate particles so efficiently is not known yet. These systems are called gamma-ray-loud binaries (GRLB), as the peak of their spectral energy distribution lies at GeV–TeV energy range.

All GRLB systems host compact objects orbiting around massive young star of O or Be spectral type. This allows to suggest, that the observed $\gamma$ -ray emission is produced in the result of interaction of the relativistic outflow from the compact object with the non-relativistic wind and/or radiation field of the companion massive star. However, neither the nature of the compact object (BH or NS?) nor the geometry (isotropic or anisotropic?) of relativistic wind from the compact object are known in the most cases. Only in PSR B1259-63 and PSR J2032+4127 systems the compact object is known to be a young rotation powered pulsar which produces relativistic pulsar wind. Interaction of the pulsar wind with the wind of the Be star leads to the huge GeV flare, during which up to 80% of the spin-down luminosity is released [556, 557].

In all other cases the source of the high-energy activity of GRLBs is uncertain. It can be either accretion onto or dissipation of rotation energy of the compact object. In these systems the orbital period is much shorter than in PSR B1259-63 and PSR J2032+4127, and the compact object spend most of the time in the dense wind of the companion star. The optical depth of the wind to free-free absorption is big enough to suppress most of the radio emission within the orbit, including the pulsed signal of the rotating NS [558], making impossible direct detection of the possible pulsar.

In [559] authors tried to deduce the nature of the compact source in LSI  +61 303 studying the relation between x-ray luminosity and the photon index of its x-ray spectrum. It turned out that existing x-ray observations of the system follows the same anti-correlation trend as BH x-ray binaries [560]. The hypothesis on microquasar nature of LSI  +61 303 allowed to explain the observed radio morphology [561] and explains the observed superorbital period as a beat frequency between the orbital and jet-precession periods [562]. At the same time it was shown that the model in which the compact source is a pulsar allowed naturally explain the keV-TeV spectrum of LSI  +61 303 [558]. Authors argued, that the radio source has a complex, varying morphology, and the jet emission is unlikely to dominate the spectrum through the whole orbit. Within this model the superorbital period of the source is explained as timescale of the gradual build-up and decay of the disk of the Be star. This hypothesis is also supported by the optical observations confirming the superorbital variability of the Be-star disk [563]. A number of multi-wavelength campaigns are currently ongoing aiming to resolve the nature of these peculiar systems.

GeV observations revealed a few more binaries visible up to few GeV. Among them are Cyg X-1 [564, 565] and Cyg X-3 [566]. However, contrary to the GRLBs described above these systems are transients and seen only during the ares, or, in the case of Cyg X-1, during the hard state. In addition to this the peak of the spectral energy distribution of these systems happens at much lower energies than in the case of binaries visible at TeV energies.

However contrary to the GRLBs described above these systems are transients and seen only during the flares, or, in the case of Cyg X-1, during the hard state. In addition to this the peak of the spectral energy distribution of these system happens at much lower energies than in the case of binaries visible at TeV energies. From these observations it seems that wind collision can accelerate particles more efficient that the accretion, but more sensitive observations are needed to prove it and understand the reason. Hopefully CTA [567] observations will be able to shed light on the details of the physical processes taking place in these systems.

12. Supermassive black hole binaries in the cores of AGN

Contributors: E Bon, E M Rossi, A Sesana, A Stamerra

Following mergers it is expected that the galaxy cores should eventually end up close to each other. In this process, the term dual SMBHs refers to the stage where the two embedded SMBHs are still widely separated (gravitationally-bound to the surrounding gas and stars and not to one another), while SMBBHs denotes the evolutionary stage where they are gravitationally bounded in the close-orbiting system of SMBHs.

Bound SMBBHs on centi-pc scales are the most relevant to GW emission (and therefore to this roadmap). In the approximation of circular orbits, these systems emit GWs at twice their orbital frequency, i.e. $f_{\rm GW}=2/P_{\rm orb} \gtrsim 1$ nHz. As we discuss in the next section, this is the frequency at which PTAs are most sensitive [387, 389, 390, 408], having a concrete chance to make a direct detection of these systems within the next decade [391393]. The presence of a SMBBH with an orbital period of several years introduces a natural timescale in the system. In fact, numerical simulations of SMBBHs embedded in circumbinary accretion disks display consistent periodic behaviors of the gas leaking into the cavity [331, 349, 352, 353, 568, 569]. This led to the notion that SMBBHs might be detected via periodicity of their lightcurve. However, the diffusion time of the gas within the mini-disks surrounding the two holes is generally longer than the binary period, and it is not at all clear that the periodic supply of gas through the cavity will in turn result in periodicities in the binary lightcurves or their spectra [421]. Even though the cross-gap accretion rate is generally periodic, only light generated at the accretion stream/minidisk or outgoing stream/circumbinary disk shocks is guaranteed to follow this modulation [569, 570]. Since some periodicity seems inevitable, we focus on this signature in the following discussion. We notice however, that several spectral signatures of close SMBBHs have also been proposed, including a dimming at UV wavelengths [422], double K$\alpha$ lines [421], notches in the spectral continuum [570], steepening of the thermal spectrum compared to the standard thin disk model [571].

Among many mechanisms proposed to explain the emission variability of active galactic nuclei (AGN) besides outflows, jet precession, disk precession, disk warping, spiral arms, flares, and other kinds of accretion disk instabilities, one of the most intriguing possibilities involves the existence of a SMBBH system in their cores [572576], and the tidal disruption event (see [577] and references therein).

The light variability emitted from AGN was tracked much before they were recognized as active galaxies. There are light curves showing AGN variability of over 100 years of observations, see for example [578, 579], with variability timescale of over decades. In fact, many AGN show variability of different time scales depending on time scales of processes that drive the variability, such as speeds at which variations propagate, for example the speed of light $c \sim3$ · 105 km/s or the speed of sound $v_s\sim(kT/m){}^{1/2}$ , where $c > v_{\rm orb} \gg v_s$ , or the time scale of the orbital motion $v_{\rm orb}\sim(GM/R){}^{1/2}$ . The shortest timescale corresponds to the light crossing timescale, on which the reverberation mapping campaigns are based on. Orbital timescales are longer [580], and are dominant in the case of SMBBH systems. Recently, possible connection of AGN variability time scales and orbital radius is presented in [581], indicating that variability time scales may not be random, and that they might correspond to the orbiting time scales.

To identify possible candidates, we search for periodic variations in their light and radial velocity curves. We expect that periodic variability should correspond to orbital motion exclusively, while the other processes could produce only quasi periodic signals. Unfortunately, AGN were identified only about 70 years ago, so observing records are long for few decades only, which is of order of orbiting time scales, and therefore not long enough to trace many orbits in historical light curves, not to mention the radial velocity curves [343, 582585], which are harder to obtain because of their faintness, and even shorter records. Therefore, for AGN it is very hard to prove that the signal is actually periodic, especially if they are compared to the red noise like variability curves, that in fact, AGN light curves are very similar to [586, 587]. Therefore, standard methods like Fourier and Lomb-Scargle [588] may show peaks of high looking significance but the derived p-value may not be valid [584, 587, 589].

Keeping in mind the aforementioned difficulties, a number of AGN have been proposed to display a significant periodical variability in their light curves [575, 590595]. Notable examples include the blazar OJ287 (11.5 yr period, [590, 594], the quasar PG1302-102 (6 yr period, [575]), the blazar PG 1553+113 (2 yr period, [596]). Among those candidates, there are only few that could indicate periodic light and radial velocity curves in the same time [583585, 597], which therefore could be recognized as SMBBH candidates, like NGC4151 with a 15.9 year periodicity [583], NGC5548 with a $\approx$ 15 year periodicity [584, 585], and Akn120 with a $\approx$ 20 year periodicity [597]. We note that simulating the emission from such systems is very complex [569, 571, 598600], especially for the eccentric high-mass ratio systems [573, 601, 602].

Among AGN, the class of blazars is dominated by the emission from the jet due to beaming effects caused by the small angle of sight. Blazars show high variability in all wavebands from radio to gamma-rays. High energy emission is likely originated in small jet scales and therefore can be modulated by the orbital motion of the SMBH binary system. The modulation can be funneled through variations on the accretion rate induced by the perturbation on the disk by the companion SMBH, as suggested for OJ287, or in helical paths induced by precession [603] as suggested to explain the clear signature of a periodic modulation on the gamma-ray blazar PG 1553+113 [596]. More complex interplay among the different components in the jet, emitting at different wavelength is possible in the framework of a binary SMBH system ([604]). We mention that quasi-periodicities in the jet emission can be induced by intrinsic oscillatory disk instabilities that can mimic periodical behaviour. The continuous gamma-ray monitoring of blazars by the Fermi-LAT satellite is providing new possible candidates showing periodic or quasi-periodic emission (see e.g. [605, 606]). Similarly, a 14 year periodicity is found in the x-ray and optical variations of 3C 273, while in OJ 287, the optical variability may not always be consistent with radio. Even, a detection of periodic variations of spinning jet could indicate presence of SMBBH [607].

The time domain window has only been opened in the past decade with dedicated surveys such as CRTS, PTF and Pan-STARRS, and already produced several SMBBH candidates. Many of them, however, have been already severely constrained by PTA upper limits on the stochastic GW background they would imply [608]. This confirms our poor understanding of SMBBH appearance. More sophisticated numerical simulations, including 3D grids, radiative transport schemes, feedback from the accreting sources, etc, are needed to better understand the emission properties and peculiar signatures of SMBBHs. Under their guidance, future candidates should be then proposed based on systematic cross check of variability coupled with peculiar spectral features.

12.1. Modeling electromagnetic signatures of merging SMBBHs

Although the identification of compact SMBBHs might have an important impact on GW observations with PTAs in the near future, looking further ahead, LISA is expected to detect tens to hundreds of coalescing SMBBHs throughout the Universe. Electromagnetic observations related to the merger of SMBBHs are important both for cosmology and the astrophysics of galactic nuclei. Pinning down the host galaxy of the merger will allow us to combine redshift and distance from gravitational waves to constrain cosmological parameters ([609], see next section) and to study the large scale galactic environment of merging SMBBHs, adding to our understanding to the process galaxy formation. On the other hand, electromagnetic observations will give us access to the properties of matter in the relative close environment of a merger and to the gas an stars (hydro)-dynamics as they adjust in response to the merger.

Given the importance of such identification, there has been an extensive effort to predict observable electromagnetic signatures that can occur in nearly coincidence with the event ('prompt signals') or afterwards ('afterglows'). In the following few examples will be given; notably, some of them also inspired recent models for possible electromagnetic counterparts to SOBBH mergers, of the kind detected by LIGO and VIRGO. As mentioned previously, SMBBH mergers, especially at high redshifts, can happen in a gaseous environment that provides each SMBH with a 'minidisc', fed through streams leaking from a circumbinary disc. Those minidiscs are likely to be retained even after the orbital decay due to gravitational radiation dominates, providing distinctive modulation of the emerging luminosity as the binary spirals in [610, 611]. During the final orbits, the surviving gas between the black holes gets squeezed possibly producing super-Eddington outflows as discussed in [612, 613], but see [614]. Full GR simulations have also been employed to study the possible formation of precessing jets during the inspiral and merger, in the attempt of identifying distinctive signatures [615617]. General relativity predicts that a newly formed black hole suffers a recoil because GWs carry away a non-zero linear momentum (e.g. [618, 619]). This recoil affects the circumbinary disc, bound to the black hole: a kick is imparted that shocks the gas producing a slowly rising, ∼10 yr lasting afterglows [620624]. In stellar mass black hole merger, similar phenomena but on much shorter timescales can occur [171, 625]. Contrary to the SMBH case, however, providing a gas rich environment for the merger is a challenge. A possible venue involves cold relic discs, formed as a result of weak supernovae, where accretion is suppressed until either the 'squeezing' or the 'kicking' heat them up again in the same configuration envisaged for SMBHs.

As already mentioned, the next theoretical challenge for these dynamical models is to predict realistic lightcurves and spectra, which will require non-trivial radiative transfer calculations (see, e.g. [626]). With solid predictions in hand, appropriate strategies can be devised to coordinate electromagnetic follow-ups, to take full advantage of multimessenger astronomy in the LISA era.

13. Cosmology and cosmography with gravitational waves

Contributors: C Caprini, G Nardini, N Tamanini

The recent direct measurement of GWs by the Earth-based interferometers LIGO and Virgo opened up a new observational window onto the universe and, right from the first detection, led to the discovery of a new, unexpected source: fairly massive stellar-origin BBHs. This demonstrates the great potential of GW observations to improve our knowledge of the universe. Concerning cosmology, it is beyond doubt that the possible detection of a stochastic GW background (SGWB) from the early universe would be revolutionary from this point of view: similar to the discovery of the CMB, which constitutes a milestone in our understanding of the universe, rich of consequences that we are still investigating. Furthermore, the plethora of new GW detections expected in the next decades by both Earth and space-based interferometers will not only deliver fundamental information on the emitting astrophysical sources, but it will also bring complementary and independent data, with respect to standard EM observations, that can be used for cosmological purposes. In particular, by means of GW detections, we can probe the history of the universe both at early and late times, shedding new light on some of the most elusive cosmological mysteries, such as dark energy, DM and the origin of cosmic inhomogeneities. In this section, we overview how the observation of GWs can enhance our knowledge of the history of the universe.

13.1. Standard sirens as a probe of the late universe

Within the theory of GR, a binary system composed by two compact astrophysical objects orbiting around each other, emits a GW signal with the two polarizations [627]

Equation (8)

Equation (9)

where $h_{\times,+}(t)$ are the GW strains in the transverse-traceless gauge (we neglect here post-Newtonian contributions). In these expressions $\iota$ is the orientation of the orbital plane with respect to the detector, z is the redshift of the source, dL(z) is the luminosity distance, f  is the GW frequency at the observer, $\Phi(t)$ is the phase of the GW, and $\mathcal{M}_c(z)=(1+z)(m_1 m_2){}^{3/5}/(m_1+m_2){}^{1/5}$ is the redshifted chirp mass, with m1,2 being the masses of the two binary bodies. An accurate detection of the GW signal allows to reconstruct all the parameters in equations (8) and (9) within some (correlated) uncertainties [628]. In particular, thanks to the reconstruction of dL, binaries can be employed as reliable cosmological distance indicators.

Equations (8) and (9) highlight three key aspects of using inspiralling binaries as cosmological distance indicators: (i) The measurement of dL from GW signals is not affected by any systematic uncertainties in the modelling of the source, since the dynamics of compact binary systems is directly determined by GR. This is in contrast with supernovae type-Ia (SNIa), which require the cosmic distance ladder, i.e. cross calibration with local measurements of sources of known distance, to overcome unknown systematics in the determination of their luminosity distance. (ii) Due to the scaling $\propto$ $ d_L^{-1}$ , GW cosmological indicators are suitable even at large distances where EM sources, whose intensity scales as $d_L^{-2}$ , are too faint. This implies that given the same amount of emitted energy, a source producing GWs can be observed at higher distances with respect to a source emitting EM waves. (iii) The measurement of the quoted waveform does not allow to determine the redshift of the source: in fact equations (9) and (8) are invariant under the transformation $m_i\to m_i (1+z)$ plus $d_L \to d_L (1+z)$ . In other words the waveform detected from any system with masses mi at a distance dL will be equivalent to a waveform produced by a system with masses mi(1  +  z) at a distance dL(1  +  z).

The luminosity distance dL is tightly linked to the redshift z in a given cosmological setup. For a homogeneous and isotropic universe (at large scale), the luminosity distance is given by

Equation (10)

with $\Omega_k$ being the present value of the density parameter of the spatial curvature, $H(z)$ being the Hubble rate as a function of the redshift, and H0  =  H(z  =  0). Therefore, if besides dL (reconstructed from the waveform) one can independently establish the redshift z of a GW source, then one obtains a data point useful to constrain the relation (10). The parameters of any given cosmological model, which are implicitly included in $H(z)$ since its dynamics is determined by the Einstein equations, can then be statistically constrained by fitting equation (10) against a number of (z, dL(z)) data points. This can be done using observations from GW inspiralling binaries, if a determination of their redshift is available by some means. For this reason, well detectable binaries, for which the redshift is also known (or at least estimated), are dubbed standard sirens, in analogy with SNIa used as standard candles in cosmography [629].

Of course, to obtain a robust bound, it is crucial to fit equation (10) with as many standard sirens as possible at the most diverse redshifts, especially if the cosmological model at hand contains many parameters. The outcome of such an analysis can be remarkable. It constitutes the first robust cosmological test not using EM radiation as the only messenger of astronomical information, and it allows to probe the validity of the cosmic distance ladder up to far distances. As demonstrated by the recent analysis of the LVC detection GW170817, discussed in section 13.1.2 below, it also provides a measurement of H0 that is independent of the calibrations necessary to establish constraints using SNIa. In particular, if the current tension between the H0 measurement from SNIa and CMB (assuming $\Lambda$ CDM) will persist, standard sirens will be a very useful observable to decipher the origin of this tension.

13.1.1. Redshift information.

The cosmography procedure just described assumes that the redshift of the GW sources can be acquired. This is however not straightforward. Because of their intrinsic nature, coalescing BHs do not guarantee any signature besides GWs. Nevertheless, EM counterparts can be envisaged for BBHs surrounded by matter, or for compact binaries where at least one of the two bodies is not a BH. For this reason, an EM counterpart is expected for merging massive BHs at the centre of galaxies, which may be surrounded by an accretion disk, and for BNSs, whose merger produces distinctive EM emissions, including gamma ray bursts and kilonovae. On the other hand, extreme mass ratio inspiral (EMRI) systems and stellar-origin BBHs (SOBBHs) are not expected to produce significant EM radiation at merger, although their merging environment is still unclear. Of course, in order to fit equation (10) with as many data points (z, dL) as possible, it would help substantially to determine the redshift of all detectable standard sirens, independently of whether they do or do not exhibit an EM counterpart. There are mainly two different ways to obtain redshift information for a standard siren, depending on the observation or not of an EM counterpart:

  • Method with EM counterpart: This method relies on EM telescopes to recognize the galaxy hosting the GW source [628]. Reaching a good sky localization (${\mathcal O}$ (10 deg2) or below) as soon as possible after the detection of the GW signal, is essential to alert EM telescopes and point them towards the solid angle determining the direction of the GW event to look for EM transients. Once such a transient is detected, the GW event can be associated with the nearest galaxy whose redshift can be measured either spectroscopically or photometrically. It is important for this method to have GW detectors able to rapidly reach a well-beamed sky localization of the GW source. A network of GW interferometers not only improves the sensitivity to a standard siren signal (improving the reconstruction of dL) but also the identification of the sky solid angle containing the source thanks to spatial triangulation.
  • Method without EM counterpart: This method allows to determine the sky localization of the standard siren much after the GW event, with clear practical advantages, in particular, the presence of the source does not need to be recognized in real time, and moreover the SNR required for a good sky localization does not need to be reached before the stage at which the EM signal might be triggered). It adopts a statistical approach [628, 630]. Indeed, given a galaxy catalogue, the galaxy hosting the standard siren has to be one of those contained in the box given by the identified solid angle (with its experimental error) times a properly-guessed redshift range. This range is obtained by applying a reasonable prior to the redshift obtained inverting equation (10). In this procedure one must of course take into account the dependency upon the cosmological parameters of $H(z)$ , which will affect the final posterior on the parameters themselves. The redshift of the standard siren can be estimated as the weighted average of the redshifts of all the galaxies within the error box (an additional prior on the conformation of each galaxy may be also included). Due to the large uncertainty of this statistical approach, this method is effective only when a limited amount of galaxies can be identified within the volume error box and only if a sufficiently large amount of GW events is observed.

These procedures suffer from some major uncertainties. The redshift appearing in equation (10) is the one due to the Hubble flow, and consequently the contribution due to the peculiar velocity of the host galaxy to the measured redshift should be subtracted. Moreover, the real geodesic followed by the GW is not the one resulting from the homogeneous and isotropic metric assumed in equation (10), and in fact the lensing contribution to dL due to cosmic inhomogeneities should be removed. Although in principle very precise lensing maps and galaxy catalogues are helpful to estimate such source of uncertainty [631634], still the lensing and peculiar velocity effects have to be treated as a (large) systematic error that can be reduced only by means of numerous detections (the uncertainty due to peculiar velocities dominating at low redshift, and the lensing uncertainty dominating at large redshift).

13.1.2. Standard sirens with current GW data: GW170817.

The GW170817 event, corresponding to the coalescence of a BNS, is the exquisite progenitor of cosmography via standard sirens with EM counterpart. For that event, the LVC interferometer network recovered the luminosity distance $d_L= 43.8^{+2.9}_{-6.9}$ Mpc at 68% C.L., and a sky localization of 31 deg2 [21, 22], corresponding to the solid angle shown in the left panel of figure 7 (green area). The optical telescopes exploring this portion of the sky identified an EM transient in association to the galaxy NGC4993, which is known to be departing from us at the speed of $3327\pm 72$ km s−1. Although part of the peculiar velocity of the galaxy NGC4993 could be subtracted [22], remaining uncertainties on the peculiar motion eventually yield the estimate $v_H=3017\pm 166$ km s−1 for its Hubble flow velocity, corresponding to $z\simeq 10^{-2}$ (notice that at this redshift the lensing uncertainty is negligible). Using the Hubble law $d_L=z/H_0$ , which is the leading order term in the expansion of equation (10) at small z, one obtains the posterior distribution for H0 presented in figure 7 (right panel), providing the measurement $H_0 = 70.0^{+12.0}_{-8.0}~{\rm km~s^{-1}~Mpc^{-1}}$ at 68% C.L. [24]. The uncertainty on H0 is too large to make the measurement competitive with CMB [635] and SNIa constraints [636]. Nevertheless, this represents a local estimate of H0 that is not dependent on the cosmic distance ladder and the first cosmological measurement not relying only on EM radiation. Moreover, the large number of events similar to GW170817 expected to be observed in the future will eventually yield constraints on H0 at the level of both local and CMB measurements.

13.1.3. Cosmological forecasts with standard sirens.

The potential of current and forthcoming GW detectors for cosmology have been widely studied in the literature. To appreciate the capability of standard sirens to probe the late universe, here we report on the most recent forecasts (see e.g. [637640] (LVC) and [641646] (LISA) for previous studies). Such analyses are expected to continuously improve over the upcoming years of data taking.

  • Forecasts with ground-based interferometers: Ground-based interferometers can detect binary systems composed of NSs and/or stellar-origin BHs, with masses ranging from few solar masses up to tens of solar masses. By means of the planned LVC-KAGRA-LIGO India network, BNSs with counterpart will allow to put a tight constraint on H0. For example, assuming the $\Lambda$ CDM model, H0 is expected to be measured roughly with a  ∼1% error after  ∼100 detections with counterpart [647, 648]. The result would be even tighter if, for some SOBBHs, the host galaxy could be identified, or if BNSs with counterpart are considered instead, as only  ∼10 detections would yield the same level of accuracy [647, 649]. The forecasts drastically improve for third generation ground-based detectors, such as the Einstein Telescope (ET) [650652]. In addition, besides the two approaches mentioned above for the redshift measurement, a further method, feasible only with an ET-like device, will become available [653] (see also [654] for a similar idea). Thanks to the tidal effects of a BNS waveform, the redshift-mass degeneracy can be broken. Consequently, by measuring the tidal effect in the waveform and assuming a prior knowledge on the NS equation of state, the redshift z entering the redshifted chirp mass can be reconstructed from the waveform itself. In this way, with more than 1000 detected BNS events, the ET is expected to constrain the parameters H0 and $\Omega_m$ of the $\Lambda$ CDM model roughly with an uncertainty of  ∼$8\%$ and  ∼$ 65\%$ , respectively [655]. However, if the rate of BNS mergers in the universe will result to be in the higher limit of the currently allowed range, the ET will be able to see up to  ∼107 BNS mergers, ameliorating these constraints by two orders of magnitude.
  • Forecasts with space-based interferometers: Space-based GW interferometry will open the low-frequency window (mHz to Hz) in the GW landscape, which is complementary to Earth-based detectors (Hz to kHz) and PTA experiments (nHz). The Laser Interferometer Space Antenna (LISA) is currently the only planned space mission designed to detect GWs, as it has been selected by ESA [64]. Several new GW astrophysical sources will be observed by LISA, including SOBBHs, EMRIs and massive BBHs from 104 to 107 solar masses. These sources can not only be conveniently employed as standard sirens, but they will be detected at different redshift ranges, making LISA a unique cosmological probe, able to measure the expansion rate of the universe from local ($z\sim 0.01$ ) to very high ($z\sim 10$ ) redshift. The current forecasts, produced taking into account only massive BBHs [609] (for which an EM counterpart is expected) or SOBBHs [656, 657] (for which no EM counterpart is expected), estimate constraints on H0 down to a few percent. However, joining all possible GW sources that can be used as standard sirens with LISA in the same analysis, should not only provide better results for H0, which will likely be constrained to the sub-percent level, but it will open up the possibility to constrain other cosmological parameters. The massive BBH data points at high redshifts will, moreover, be useful to test alternative cosmological models, predicting deviations from the $\Lambda$ CDM expansion history at relatively early times [658, 659]. Finally, more advanced futuristic missions, such as DECIGO or BBO, which at the moment have only been proposed on paper, may be able to probe the cosmological parameters, including the equation of state of dark energy, with ultra-high precision [660663]. They might also be able to detect the effect of the expansion of the universe directly on the phase of the binary GW waveform [664, 665], although the contribution due to peculiar accelerations would complicate such a measurement [275, 666].

13.1.4. Future prospects.

The recent GW170817 event has triggered numerous studies about the use of standard sirens for cosmography, and the forthcoming experimental and theoretical developments might change the priorities in the field. As mentioned above, the network of Earth-based GW interferometers is expected to detect an increasing number of GW sources employable as standard sirens, with or without EM counterparts. This will eventually yield a measurement of the Hubble constant competitive with CMB and SNIa constraints, which might be used to alleviate the tension between these two datasets. On the other hand, higher redshift ($z \gtrsim 1$ ) standard sirens data will probably be obtained only with third generation detectors or space-based interferometers. With these high redshift data we will start probing the expansion of the universe at large distances, implying that a clean, and not exclusively EM-based, measurement of other cosmological parameters will become a reality. Furthermore, the precision of cosmography via standard sirens might be boosted by improvements in other astronomical observations. This is the case if e.g. galactic catalogues and lensing maps improve substantially in the future, as expected.

Concerning present forecasts, there is room for improvements, especially regarding third generation Earth-based interferometers and space-borne GW detectors. The full cosmological potential of future GW experiments such as ET and LISA has still to be assessed. For ET we still do not have a full cosmological analysis taking into account all possible GW sources that will be used as standard sirens, using both the method with counterpart (BNSs) and the method without counterpart (SOBBHs and BNSs). Moreover, we still need to fully understand up to what extent the information on the NS equation of state can be used to infer the redshift of BNSs and NS-BH binaries, with the latterpotentially representing a new interesting future source of standard sirens [638, 640, 667]. Regarding LISA, there are still several issues to be addressed in order to produce reliable forecasts. For instance, the EMRI detection rate is still largely unknown [668], although these sources might turn out to be excellent standard siren candidates at redshift $0.1\lesssim z \lesssim 1$ [669]. In addition, the prospects for massive BBH mergers as standard sirens would be more robust if an up-to-date model of the EM counterpart were implemented in the investigations. Performing these analyses and combining the results for all the different types of standard sirens observable by LISA will allow for a full assessment of the cosmological potential of the mission.

Most of the literature on standard sirens assumes GR and an homogeneous and isotropic universe at large scales. However, breakthroughs may occur by generalizing the aforementioned analyses to theories beyond GR, and in fact forecasts for scenarios not fulfilling these assumptions are flourishing topics with potentially revolutionary results. For example, some models of modified gravity, formulated to provide cosmic acceleration, have been strongly constrained by the measurement of the speed of GW (compared to the speed of light) with GW170817 [670673]. In general, in theories beyond GR or admitting extra dimensions, the cosmological propagation of GWs changes and a comparison between the values of dL measured from GWs and inferred from EM observations can provide a strong test of the validity of these theories (see [674677] for recent works).

Finally we mention that the large number of GW sources expected to be observed by third generation interferometers such as ET or by future space-borne detectors such as DECIGO/BBO, might even be used to test the cosmological principle [678680] and to extract cosmological information by cross-correlating their spatial distribution with galaxy catalogues or lensing maps [681683].

13.2. Interplay between GWs from binaries and from early-universe sources

The gravitational interaction is so weak that GWs propagate practically unperturbed along their path from the source to us. GWs produced in the early universe thus can carry a unique imprint of the pre-CMB era, in which the universe was not transparent to photons. In this sense, GW detection can provide for the first time direct, clean access to epochs that are very hard to probe by any other observational means. GWs of cosmological origin appear (pretty much as the CMB) to our detectors as a SGWB [684, 685].

Several pre-CMB phenomena sourcing GWs might have occurred along the cosmological history, from inflation to the epoch of the QCD phase transition. In such a case, the SGWB would be constituted by the sum of all single contributions, each of them potentially differing from the others in its spectral shape as a function of frequency, or because of other properties such as chirality and/or gaussianity. It is hence crucial to have precise predictions of all proposed cosmological sources to possibly isolate all components in the detected SGWB, with the aim of reconstructing the early time history of the universe.

On the other hand, binary systems can also be detected as a SGWB. This happens for those (independent) astrophysical events that are overall too weak to be individually resolved. The level of their 'contamination' to the cosmological SGWB thus depends on the sensitivity and on the resolution of the available detector. Crucially, at LIGO-like and LISA-like experiments the astrophysical component might be stronger than the cosmological one, so that the information hidden in the pre-CMB signal would be impossible to recover, unless the astrophysical contribution is known in great detail and methods are found to subtract it.

13.2.1. Status and future prospects.

No measurement of the SGWB has been done yet, but upper bounds have been inferred from the observations. These are usually expressed in terms of the SGWB energy density, which is given by $\Omega_{\rm GW}(\,f)=(\,f/\rho_c) ~\partial \rho_{\rm GW}/\partial f$ , with $\rho_c$ and $\rho_{\rm GW}$ being the critical and the SGWB energy densities, respectively. Specifically, by assuming the frequency shape $\Omega_{\rm GW}(\,f)=\Omega_\alpha (\,f/25~{\rm Hz}){}^\alpha$ , the LVC found the 95% C.L. limit $\Omega_{\alpha=0,2/3,3} < 17\times 10^{-7},13\times 10^{-7}$ and $1.7\times 10^{-8}$ in the ${\mathcal O} (10)$ ${\mathcal O} (100)$ Hz frequency band [686]. The latest Pulsar Timing Array analysis, done with the data of the NANOGrav collaboration, yields $h^2\Omega_{\alpha=0} < 3.4\times 10^{-10}$ at 95% C.L. at $3.17\times 10^{-8}$ Hz [409]. Since the astrophysical sources are not expected to produce a SGWB with higher amplitude than these bounds, the latter are particularly relevant only for the most powerful cosmological signals [378, 687].

However, it is probably only a matter of some years to achieve the first detection of the astrophysical SGWB by the LVC. Based on the current detection rates, the SOBBHs and BNSs lead to the power-law SGWBs $\Omega_{\rm BBH}(\,f)=1.2^{+1.9}_{-0.9}\times 10^{-9}\left(\,f/25\,{\rm Hz}\right){}^{2/3}$ and $\Omega_{\rm BBH+BNS}(\,f)=1.8^{+2.7}_{-1.3}\times 10^{-9}\left(\,f/25\,{\rm Hz}\right){}^{2/3}$ in the frequency band of both LVC and LISA [688]. These signals are expected to be measurable by the LIGO and Virgo detectors in around 40 months of observation time [688], while in LISA they will reach a Signal to Noise Ratio (SNR) of ${\mathcal O}(10)$ in around one month of data taking [689]. On the other hand, given the large uncertainties on EMRIs [668], it is not clear whether they can also give rise to an observable SGWB component in the LISA band.

Focusing exclusively on the SGWB contribution from astrophysical binaries, two general aspects about it are particularly worth investigating. The first regards the detailed prediction and characterization of this component. For the time being, the literature does not suggest any technique to disentangle the cosmological SGWB component from a generic astrophysical one. At present, it is believed that component separation will be (partially) feasible in the LISA data only for what concerns the SGWB signal due to galactic binaries. In this case, indeed, the anisotropy of the spatial distribution of the source (confined to the galactic plane), together with the motion of the LISA constellation, generates a SGWB signal modulated on a yearly basis, which allows to distinguish and subtract this component from the total SGWB [690]. Although a similar approach is not possible for the extra-galactic SGWB component, this example well illustrates that detailed predictions of the astrophysical signatures might highlight subtraction techniques allowing to perform component separation and hopefully isolate the information coming from the early universe (similarly to what done for foreground subtraction in CMB analyses).

The second aspect concerns the possibility of exploiting third generation detectors, such as the ET and cosmic explorer, to subtract the astrophysical SGWB component [691]. Their exquisite sensitivity may allow to resolve many of the SOBBHs and BNSs that give rise to the SGWB in present detectors, including the full LVC-KAGRA-LIGO India network. This would clean the access to the cosmological SGWB down to a level of $\Omega_{\rm GW}\simeq 10^{-13}$ after five years of observation [691]. This technique not only applies in the frequency bandwidth of the Earth-based devices, but it can also be used to clean the LISA data and reach a potential SGWB of cosmological origin in the LISA band. Note that LISA might as well help in beating down the level of the SGWB from SOBBHs by exploiting possible multi-band detections of the same source [692]. The SOBBH would be detected first by LISA, during its inspiral phase; some years later, when the binary has arrived to the merger stage, it would reappear in Earth-based interferometers. These latter can therefore be alerted in advance, possibly leading to an increase in the number of detected BBHs (depending on their LISA SNR).

Chapter II:. Modelling black-hole sources of gravitational waves: prospects and challenges

Editor: Leor Barack

1. Introduction

The detection and characterization of gravitational-wave (GW) sources rely heavily on accurate models of the expected waveforms. This is particularly true for black hole binaries (BBHs) and other compact objects, for which accurate models are both necessary and hard to obtain. To appreciate the state of affairs, consider the following three examples. (i) While GW150914, the first BH merger event detected by LIGO, had initially been identified using a template-free search algorithm, some of the subsequent events, which were not as bright, would likely have been altogether missed if template-based searches had not been performed. (ii) While the error bars placed on the extracted physical parameters of detected BH mergers have so far come primarily from instrumental noise statistics, systematic errors from the finite accuracy of available signal models are only marginally smaller and would actually dominate the total error budget for sources of some other spin configurations or greater mass disparity; in the case of the binary neutron star (BNS) GW170817, deficiencies in available models of tidal effects already restrict the quality of science extractable from the signal. (iii) Even with a perfectly accurate model at hand, analysis of GW170817 would not have been possible within the timescale of weeks in which it was carried out, without the availability of a suitable reduced-order representation of the model, necessary to make such an analysis computationally manageable.

Indeed, accurate and computationally efficient models underpin all of GW data analysis. They do so now, and will increasingly do so even more in the future as more sensitive and broader-band instruments go online. The response of detectors like ET, and especially LISA, will be source-dominated, with some binary GW signals occurring at high signal-to-noise ratio (SNR) and remaining visible in band through many more wave cycles. The accuracy standard of models needs to increase commensurably with detector sensitivity, or else modelling error would restrict our ability to fully exploit the detected signals. As a stark example, consider that, in a scenario that is not unlikely, LISA's output will be dominated by a bright massive BH (MBH) merger signal visible with SNR of several 100s. This signal would have to be carefully 'cleaned out' of the data in order to enable the extraction and analysis of any other sources buried underneath; any model inaccuracies would form a systematic noise residual, potentially hiding dimmer sources. In the case of Extreme Mass Ratio Inspirals (EMRIs), where O(105) wave cycles are expected in the LISA band at a low SNR, a precise model is a crucial prerequisite for both detection and parameter extraction.

This chapter reviews the current situation with regard to the modelling of GW sources within GR, identifying the major remaining challenges and drawing a roadmap for future progress. To make our task manageable, we focus mainly on sources involving a pair of BHs in a vacuum environment in GR, but, especially in section 5, we also touch upon various extensions beyond vacuum, GR and the standard model (SM) of particle physics.

Isolated vacuum BHs in GR are remarkably simple objects, described in exact form by the Kerr family of solutions to Einstein's field equations (see, however, section 8 of this chapter). But let two such BHs interact with each other, and the resulting system displays a remarkably complicated dynamics, with no known exact solutions. Even numerical solutions have for decades proven elusive, and despite much progress following the breakthrough of 2005 they remain computationally very expensive—prohibitively so for mass ratios smaller than  ∼$ 1:10$ —and problematic for certain astrophysically relevant BH spin configurations. Systematic analytical approximations are possible and have been developed based around expansions of the field equations in the weak-field or extreme mass-ratio regimes, and these may be combined with fully numerical solutions to inform waveform models across broader areas of the parameter space. To facilitate the fast production of such waveforms, suitable for GW search pipelines, effective and phenomenological models have been developed, which package together and interpolate results from systematic numerical simulations and analytical approximations. With the rapid progress in GW experiments, there is now, more than ever, a need for a concentrated community effort to improve existing models in fidelity, accuracy and parameter-space reach, as well as in computational efficiency.

The structure of this chapter is as follows. In sections 2 and 3 we review the two main systematic approximations to the BBH problem: perturbation theory including the gravitational self-force (GSF) and post-Newtonian (PN) approaches, respectively. Section 4 surveys progress and prospects in the Numerical Relativity (NR) modelling of inspiralling and merging BHs in astrophysical settings, and section 5 similarly reviews the role of NR in studying the dynamics of compact objects in the context of alternative theories of gravity and beyond the SM. Section 6 then reviews the effective one body (EOB) approach to the BBH problem, and the various phenomenological models that have been developed to facilitate fast production of waveform templates. Section 7 reviews the unique and highly involved challenge of data-analysis in GW astronomy, with particular emphasis on the role of source models; this data-analysis challenge sets the requirements and accuracy standards for such models. Finally, section 8 gives a mathematical relativist's point of view, commenting on a variety of (often overlooked) foundational questions that are yet to be resolved in order to enable a mathematically rigorous and unambiguous interpretation of GW observations.

2. Perturbation methods

Contributor: B Wardell

Exact models for GWs from BBHs can only be obtained by exactly solving the full Einstein field equations. However, there is an important regime in which a perturbative treatment yields a highly accurate approximation. For BBH systems in which one of the BHs is much less massive than the other, one may treat the mass ratio as a small perturbation parameter. Then, the Einstein equations are amenable to a perturbative expansion in powers of this parameter. Such an expansion is particularly suitable for EMRIs, systems in which the mass ratio may be as small as 10−6, or even smaller [693697]. In such cases, it has been established [698700] that it will be necessary to incorporate information at second-from-leading perturbative order to achieve the accuracy that will be required for optimal parameter estimation by the planned LISA mission [668, 701704]. Aside from EMRIs, a perturbative expansion is likely to also be useful as a model for Intermediate Mass Ratio Inspirals (IMRIs): systems where the mass ratio may be as large as  ∼10−2. Such systems, if they exist, are detectable in principle by Advanced LIGO and Virgo, and are indeed being looked for in the data of these experiments [705, 706].

The perturbative approach (often called the self-force135 approach) yields a set of equations for the motion of the smaller object about the larger one. For a detailed technical review of self-force physics, see [715], and for a most recent, pedagogical review, see [716]. At zeroth order in the expansion, one recovers the standard geodesic equations for a test particle in orbit around (i.e. moving in the background spacetime of) the larger BH. At first order, we obtain coupled equations for an accelerated worldline forced off a geodesic by the GSF, which itself arises from the metric perturbation to the background spacetime sourced by the stress-energy of the smaller BH. In the GSF approach, it can be convenient to treat the smaller BH as a 'point particle', with the GSF being computed from an effective regularized metric perturbation [717721]. Such an assumption is not strictly necessary, but has been validated by more careful treatments whereby both BHs are allowed to be extended bodies, and the point-particle-plus-regularization prescription is recovered by appropriately allowing the smaller BH to shrink down to zero size [722725]. These more careful treatments have also allowed the GSF prescription to be extended to second perturbative order [725731] and to the fully non-perturbative case [732734].

The goal of the GSF approach is to develop efficient methods for computing the motion of the smaller object and the emitted GWs in astrophysically-relevant scenarios. These involve, most generally, a spinning (Kerr) large BH, and a small (possibly spinning) compact object in a generic (possibly inclined and eccentric) inspiral orbit around it. The data analysis goals of the LISA mission (which demand that the phase of the extracted waveform be accurate to within a fraction of a radian over the entire inspiral) require all contributions to the metric perturbation at first order, along with the dissipative contributions at second order [698].

2.1. Recent developments

Focused work on the GSF problem has been ongoing for at least the last two decades, during which time there has been substantial progress. Early work developed much of the mathematical formalism, particularly in how one constructs a well-motivated and unambiguous regularized first-order metric perturbation [717723, 735]. More recent work has addressed the conceptual challenges around how these initial results can be extended to second perturbative order [725731], and on turning the formal mathematical prescriptions into practical numerical schemes [736741]. As a result, we are now at the point where first-order GSF calculations are possible for almost any orbital configuration in a Kerr background spacetime [742]. Indeed, we now have not one, but three practical schemes for computing the regularized first-order GSF [743]. Some of the most recent highlights from the substantial body of work created by the GSF community are discussed below.

2.1.1. First-order gravitational self-force for generic orbits in Kerr spacetime.

Progress in developing tools for GSF calculations has been incremental, starting out with the simplest toy model of a particle with scalar charge in a circular orbit about a Schwarzschild BH, and then extending to eccentric, inclined, and generic orbits about a spinning Kerr BH [744771]. Progress has also been made towards developing tools for the conceptually similar, but computationally more challenging GSF problem; again starting out with simple circular orbits in Schwarzschild spacetime before extending to eccentric equatorial orbits and, most recently, fully generic orbits in Kerr spacetime [739, 772789]. Along the way, there have been many necessary detours in order to establish the most appropriate choice of gauge [779, 785, 787, 790794], reformulations of the regularization procedure [765, 781, 785, 795802], and various numerical methods and computational optimisations [753, 754, 757, 777, 781, 784, 790, 791, 803809].

2.1.2. Extraction of gauge-invariant information.

Both the regularized metric perturbation and the GSF associated with it are themselves gauge-dependent [792, 810, 811], but their combination encapsulates gauge-invariant information. A series of works have derived a set of gauge-invariant quantities accessible from the regularized metric and GSF, which quantify conservative aspects of the dynamics in EMRI systems beyond the geodesic approximation. (Strictly speaking, they are only 'gauge invariant' within a particular, physically motivated class of gauge transformations. Nevertheless, even with this restriction the gauge invariance is very useful for comparisons with other results.) Examples include Detweiler's red-shift invariant [812], the frequency of the innermost stable circular orbit (ISCO) in Schwarzschild [813] and Kerr [786], the periastron advance of slightly eccentric orbits in Schwarzschild [814] and Kerr [815], spin (geodetic) precession [816819], and, most recently, quadrupolar and octupolar tidal invariants [820, 821]. The most important outcome from the development of these gauge invariants is the synergy it has enabled, both within the GSF programme [822] (e.g. by allowing for direct comparisons between results computed in different gauges) and, as described next, with other approaches to the two-body problem.

2.1.3. Synergy with PN approximations, EOB theory, and NR.

One of the most fruitful outcomes arising from the development of GSF gauge-invariants is the synergy it has enabled between the GSF and PN and EOB theories. With a gauge-invariant description of the physical problem available, it is possible to make direct connections between GSF, PN and EOB approximations. This synergy has worked in a bidirectional way: GSF calculations have been used to determine previously-unknown coefficients in both PN and EOB expansions [812, 814, 823837]; and EOB and PN calculations have been used to validate GSF results [826], and even to assess the region of validity of the perturbative approximation [838840]. More on this in section 3.

Mirroring the synergy between GSF and PN/EOB, there have emerged methods for making comparisons between GSF and NR. This started out with direct comparisons of the periastron advance of slightly eccentric orbits [838, 840]. More recently, a similar comparison was made possible for Detweiler's redshift [841, 842], facilitated by an emerging understating of the relation between Detweiler's red-shift and the horizon surface gravity of the small BH.

2.1.4. New and efficient calculational approaches.

Despite the significant progress in developing numerical tools for computing the GSF, it is still a computationally challenging problem, particularly in cases where high accuracy is required. This challenge has prompted the development of new and efficient calculational approaches to the problem.

Initial GSF results were obtained in the Lorenz gauge [773, 774, 777, 778, 780, 784, 813, 843], where the regularization procedure is best understood. Unfortunately, the details of a Lorenz-gauge calculation—in which one must solve coupled equations for the 10 components of the metric perturbation—are tedious and cumbersome, making it difficult to implement and even more difficult to achieve good accuracy. In the Schwarzschild case, other calculations based on variations of Regge-Wheeler gauge [788, 790, 804806, 808, 844, 845] were found to be much easier to implement and yielded much more accurate results. However, with regularization in Regge-Wheeler gauge less well understood, those calculations were restricted to the computation of gauge-invariant quantities. Perhaps more importantly, they are restricted to Schwarzschild spacetime, meaning they can not be used in astrophysically realistic cases where the larger BH is spinning. The radiation gauge—in which one solves the Teukolsky equation [846, 847] for a single complex pseudo-scalar $\psi_4$ (or, equivalently, $\psi_0$ )—retains much of the simplicity of the Regge-Wheeler gauge, but has the significant benefit of being capable of describing perturbations of a Kerr BH. Furthermore, recent progress has clarified subtle issues related to regularization [785] (including metric completion [848, 849]) in radiation gauge, paving the way for high-accuracy calculations of both gauge-invariant quantities and the GSF [742, 815, 850, 851].

Within these last two approaches (using Regge-Wheeler and radiation gauges) functional methods [852854] have emerged as a particularly efficient means of achieving high accuracy when computing the metric perturbation. Fundamentally, these methods rely on the fact that solutions of the Teukolsky (or Regge-Wheeler) equation can be written as a convergent series of hypergeometric functions. This essentially reduces the problem of computing the metric perturbation to the problem of evaluating hypergeometric functions. The approach has proved very successful, enabling both highly accurate numerical calculations [742, 855857] and even exact results in the low-frequency–large-radius (i.e. PN) regime [826, 828830, 833837, 858, 859]. A different type of analytic treatment is possible for modelling the radiation from the last stage of inspiral into a nearly extremal BH, thanks to the enhanced conformal symmetry in this scenario [860863]. This method, based on a scheme of matched asymptotic expansions, has so far been applied to equatorial orbits [769, 864], with the GSF neglected.

2.1.5. Cosmic censorship.

Independently of the goal of producing accurate waveforms for LISA data analysis, the GSF programme has also yielded several other important results. One particular area of interest has been in the relevance of the GSF to answering questions about cosmic censorship. Calculations based on test-particle motion made the surprising discovery that a test particle falling into a Kerr BH had the potential to increase the BH spin past the extremal limit, thus yielding a naked singularity [865]. (Analogous cases exist where an electric charge falling into a charged (Reissner–Nordström) BH may cause the charge on the BH to increase past the extremal limit [866868].) The intuitive expectation is that this is an artifact of the test-particle approximation, and that by including higher-order terms in this approximation the GSF may in effect act as a 'cosmic censor' by preventing over-charging and restoring cosmic censorship [869]. Several works have explored this issue in detail, studying the self-force on electric charges falling into a Reissner–Nordström BH [870, 871] and on a massive particle falling into a Kerr BH [872, 873]. These works demonstrated with explicit calculations how the overspinning or overcharging scenarios are averted once the full effect of the self-force is taken into account (a result later rigorously proven in a more general context [874]).

2.2. Remaining challenges and prospects

While the perturbative (self-force) approach has proven highly effective to date, there remain several important and challenging areas for further development. Here, we list a number of the most important future challenges and prospects for the GSF programme.

2.2.1. Efficient incorporation of self-force information into waveform models.

There are at least two key aspects to producing an EMRI model: (i) computing the GSF; and (ii) using the GSF to actually drive an inspiral. Unfortunately, despite the substantial advances in calculational approaches, GSF calculations are still much too slow to be useful on their own as a means for producing LISA gravitational waveforms. Existing work has been able to produce first-order GSF driven inspirals for a small number of cases [760, 780, 788, 875] but when one takes into account the large parameter space of EMRI systems it is clear that these existing methods are inadequate. It is therefore important to develop efficient methods for incorporating GSF information into EMRI models and waveforms. There has been some promising recent progress in this direction, with fast 'kludge' codes producing approximate (but not sufficiently accurate) inspirals [707, 709, 712, 713, 876878], and with the emergence of mathematical frameworks based on near-identity transformations [879], renormalization group methods [880] and two-timescale expansions [881].

To complicate matters, inspirals generically go through a number of transient resonances, when the momentary radial and polar frequencies of the orbit occur in a small rational ratio. During such resonances, approximations based on adiabaticity break down [882886]. Works so far have mapped the locations of resonances in the inspiral parameter space, studied how the orbital parameters (including energy and angular momentum) experience a 'jump' upon resonant crossing, and illustrated how the magnitude of the jump depends sensitively on the precise resonant phase (the relative phase between the radial and polar motions at resonance). The impact of resonances on the detectability of EMRIs with LISA was studied in [887, 888] using an approximate model of the resonant crossing. But so far there has been no actual calculation of the orbital evolution through a resonance. Now that GSF codes for generic orbits are finally at hand, such calculations become possible, in principle. There is a vital need to perform such calculations, in order to allow orbital evolution methods to safely pass through resonances without a significant loss in the accuracy of the inspiral model.

2.2.2. Producing accurate waveform models: self-consistent evolution and second-order gravitational self-force.

Possibly the most challenging outstanding obstacle to reaching the sub-radian phase accuracy required for LISA data analysis is the fact that the first-order GSF on its own is insufficient and one must also incorporate information at second perturbative order [698]. There are ongoing efforts to develop tools for computing the second order GSF [723, 730, 731, 765, 802, 881, 889892], but, despite significant progress, a full calculation of the second-order metric perturbation has yet to be completed.

One of the challenges of the GSF problem when considered through second order is that it is naturally formulated as a self-consistent problem, whereby the coupled equations for the metric perturbation and for the particle worldline are evolved simultaneously. Indeed, this self-consistent evolution has yet to be completed even for the first-order GSF (it has, however, been done for the toy-model scalar charge case [760]). Even when methods are developed for computing the second order GSF, it will remain a further challenge to incorporate this information into a self-consistent evolution scheme.

2.2.3. Gravitational Green function.

One of the first proposals for a practical method for computing the GSF was based on writing the regularized metric perturbation in terms of a convolution, integrating the Green function for the wave equation along the worldline of the particle [736, 739]. It took several years for this idea to be turned into a complete calculation of the GSF, and even then the results were restricted to the toy-model problem of a scalar charge moving in Schwarzschild spacetime [764, 767]. Despite this deficiency, the Green function approach has produced a novel perspective on the GSF problem.

In principle, the methods used for a scalar field in Schwarzschild spacetime should be applicable to the GSF problem in Kerr. The challenge is two-fold: (i) to actually adapt the methods to the Kerr problem and to the relevant wave equation, which is the linearized Einstein equation in the Lorenz gauge; and (ii) to explore whether and how one can instead work with the much simpler Teukolsky wave equation.

2.2.4. Internal-structure effects.

The vast majority of GSF calculations to date have been based on the assumption that the smaller object is spherically symmetric and non-spinning. This idealization ignores the possibility that the smaller BH may be spinning, or more generally that other internal-structure effects may be relevant. Unfortunately, this picture is inadequate; certain internal-structure effects can make important contributions to the equations of motion. For example, the coupling of the small body's spin to the larger BH's spacetime curvature (commonly referred to as the Mathisson–Papapetrou force) is expected to contribute to the phase evolution of a typical EMRI at the same order as the conservative piece of the first-order GSF [893]. Furthermore, finite internal-structure is likely to be even more important in the case that the smaller body is a NS.

While there has been some progress in assessing the contribution from the smaller body's spin to the motion [875, 893899], existing work has focused on flux-based calculations or on the PN regime. It remains an outstanding challenge to determine the influence of internal-structure effects on the GSF, especially at second order.

2.2.5. EMRIs in alternative theories of gravity.

In all of the discussion so far, we have made one overarching assumption: that BHs behave as described by General Relativity (GR). However, with EMRIs we have the exciting prospect of not simply assuming this fact, but of testing its validity with exquisite precision. There is initial work on the self-force in the context of scalar-tensor gravity [900], but much more remains to be done to establish exactly what EMRIs can do to test the validity of GR (and how) when pushed to its most extreme limits. Much more on this in chapter III (see, in particular, section 3.3 therein)

2.2.6. Open tools and datasets.

While there has been significant progress in developing tools for computing the GSF, much of it has been ad-hoc, with individual groups developing their own private tools and codes. Now that a clear picture has emerged of exactly which are the most useful methods and tools, the community has begun to combine their efforts. This has lead to the development of a number of initiatives, including (i) tabulated results for Kerr quasinormal modes and their excitation factors [901, 902]; (ii) open source 'kludge' codes for generating an approximate waveform for EMRIS [877, 903]; and (ii) online repositories of self-force results [904]. It is important for such efforts to continue, so that the results of the many years of development of GSF tools and methods are available to the widest possible user base. One promising initiative in this direction is the ongoing development of the Black Hole Perturbation Toolkit [905], a free and open source set of codes and results produced by the GSF community.

3. Post-Newtonian and post-Minkowskian methods

Contributor: A Le Tiec

3.1. Background

The PN formalism is an approximation method in GR that is well suited to describe the orbital motion and the GW emission from binary systems of compact objects, in a regime where the orbital velocity is small compared to the speed of light and the gravitational fields are weak. This approximation method has played a key role in the recent detections, by the LIGO and Virgo observatories, of GWs generated by inspiralling and merging BH and NS binaries [1, 2, 11, 18, 21], by providing accurate template waveforms to search for those signals and to interpret them. Here we give a brief overview of the application of the PN approximation to binary systems of compact objects, focusing on recent developments and future prospects. See the review articles [906913] and the textbooks [627, 914] for more information.

In PN theory, relativistic corrections to the Newtonian solution are incorporated in a systematic manner into the equations of motion (EOM) and the radiation field, order by order in the small parameter $v^2/c^2 \sim Gm/(c^2r)$ , where $v$ and r are the typical relative orbital velocity and binary separation, m is the sum of the component masses, and we used the fact that $v^2 \sim Gm/r$ for bound motion. (The most promising sources for current and future GW detectors are bound systems of compact objects.)

Another important approximation method is the post-Minkowskian (PM) approximation, or non-linearity expansion in Newton's gravitational constant G, which assumes weak fields ($Gm/c^2 r \ll 1$ ) but unrestricted speeds ($v^2 / c^2 \lesssim 1$ ), and perturbs about the limit of special relativity. In fact, the construction of accurate gravitational waveforms for inspiralling compact binaries requires a combination of PN and PM techniques in order to solve two coupled problems, namely the problem of motion and that of wave generation.

The two-body EOM have been derived in a PN framework using three well-developed sets of techniques in classical GR: (i) the PN iteration of the Einstein field equations in harmonic coordinates [915919], (ii) the Arnowitt–Deser–Misner (ADM) canonical Hamiltonian formalism [920922], and (iii) a surface integral approach pioneered by Einstein, Infeld and Hoffmann [923925]. By the early 2000s, each of these approaches has independently produced a computation of the EOM for binary systems of non-spinning compact objects through the 3rd PN order (3PN)136. More recently, the application of effective field theory (EFT) methods [926], inspired from quantum field theory, has provided an additional independent derivation of the 3PN EOM [927]. All of those results were shown to be in perfect agreement. Moreover, the 3.5PN terms—which constitute a 1PN relative correction to the leading radiation-reaction force—are also known [928933].

At the same time, the problem of radiation (i.e. computing the field in the far/wave zone) has been extensively investigated within the multipolar PM wave generation formalism of Blanchet and Damour [934936], using the 'direct integration of the relaxed Einstein equation' approach of Will, Wiseman and Pati [937, 938], and more recently with EFT techniques [939, 940]. The application of these formalisms to non-spinning compact binaries has, so far, resulted in the computation of the GW phase up to the relative 3.5PN (resp. 3PN) order for quasi-circular (resp. quasi-eccentric) orbits [941948], while amplitude corrections in the GW polarizations are known to 3PN order, and even to 3.5PN order for the quadrupolar mode [949954].

3.2. Recent developments

Over the last five years or so, significant progress on PN modelling of compact binary systems has been achieved on multiple fronts, including (i) the extension of the EOM to 4PN order for non-spinning bodies (with partial results also obtained for aspects of the two-body dynamics at the 5PN order [814, 823, 955]), (ii) the inclusion of spin effects in the binary dynamics and waveform, (iii) the comparison of several PN predictions to those from GSF theory, and (iv) the derivation of general laws controlling the mechanics of compact binaries.

3.2.1. 4PN equations of motion for non-spinning compact-object binaries.

Recently, the computation of the two-body EOM has been extended to 4PN order, by using both the canonical Hamiltonian framework in ADM-TT coordinates [956961] and a Fokker Lagrangian approach in harmonic coordinates [962966]. Partial results at 4PN order have also been obtained using EFT techniques [967969]. All of those high-order PN calculations resort to a point-particle model for the (non-spinning) compact objects, and rely on dimensional regularization to treat the local ultraviolet (UV) divergences that are associated with the use of point particles. The new 4PN results have been used to inform the EOB framework [970], a semi-analytic model of the binary dynamics and wave emission (see section 6 below).

The occurrence at the 4PN order of infrared (IR) divergences of spatial integrals led to the introduction of several ambiguity parameters; one in the ADM Hamiltonian approach and two in the Fokker Lagrangian approach. One of those IR ambiguity parameters was initially fixed by requiring agreement with an analytical GSF calculation [971] of the so-called Detweiler redshift along circular orbits [812, 822]. Recently, however, Marchand et al [965] gave the first complete (i.e. ambiguity-free) derivation of the 4PN EOM. The last remaining ambiguity parameter was determined from first principles, by resorting to a matching between the near-zone and far-zone fields, together with a computation of the conservative 4PN tail effect in d dimensions, allowing to treat both UV and IR divergences using dimensional regularization.

Another interesting (and related) feature of the binary dynamics at the 4PN order is that it becomes non-local in time [958, 962], because of the occurence of a GW tail effect at that order: gravitational radiation that gets scattered off the background spacetime curvature backreacts on the orbital motion at later times, such that the binary's dynamics at a given moment in time depends on its entire past history [972974].

3.2.2. Spin effects in the binary dynamics and gravitational waveform.

Since stellar-mass and/or supermassive BHs may carry significant spins [975, 976], much effort has recently been devoted to include spin effects in PN template waveforms. In particular, spin–orbit coupling terms linear in either of the two spins have been computed up to the next-to-next-to leading order, corresponding to 3.5PN order in the EOM, using the ADM Hamiltonian framework [977, 978], the PN iteration of the Einstein field equations in harmonic coordinates [979, 980], and EFT techniques [981]. Spin-spin coupling terms proportional to the product of the two spins have also been computed to the next-to-next-to leading order, corresponding to 4PN order in the EOM, using the ADM Hamiltonian and EFT formalisms [982986]. The leading order 3.5PN cubic-in-spin and 4PN quartic-in-spin contributions to the binary dynamics are also known for generic compact bodies [987991], as well as all higher-order-in-spin contributions for BBHs (to leading PN order) [992]. All these results are summarized in figure 8. 2PN BH binary spin precession was recently revisited using multi-timescale methods [993, 994], uncovering new phenomenology such as precessional instabilities [995] and nutational resonances [996].

Figure 8. Refer to the following caption and surrounding text.

Figure 8. Contributions to the two-body Hamiltonian in the PN spin expansion, for arbitrary-mass-ratio binaries with spin induced multipole moments. Contributions in red are yet to be calculated. LO stands for 'leading order', NLO for 'next-to-leading order', and so on. SO stands for 'spin–orbit'. Reprinted figure with permission from [992], Copyright 2018 by the American Physical Society.

Standard image High-resolution image

Spin-related effects on the far-zone field have also been computed to high orders, for compact binaries on quasi-circular orbits. To linear order in the spins, those effects are known up to the relative 4PN order in the GW energy flux and phasing [997, 998], and to 2PN in the wave polarizations [999, 1000]. At quadratic order in the spins, the contributions to the GW energy flux and phasing have been computed to 3PN order [1001, 1002], and partial results were derived for amplitude corrections to 2.5PN order [1003]. The leading 3.5PN cubic-in-spin effects in the GW energy flux and phasing are known as well [989].

Finally, some recent works have uncovered remarkable relationships between the PN [992] and PM [1004] dynamics of a binary system of spinning BHs with an arbritrary mass ratio on the one hand, and that of a test BH in a Kerr background spacetime on the other hand. Those results are especially relevant for the ongoing development of EOB models for spinning BH binaries (see section 6), and in fact give new insight into the energy map at the core of such models.

3.2.3. Comparisons to perturbative gravitational self-force calculations.

The GWs generated by a coalescing compact binary system are not the only observable of interest. As we have described in section 2, over recent years, several conservative effects on the orbital dynamics of compact-object binaries moving along quasi-circular orbits have been used to compare the predictions of the PN approximation to those of the GSF framework, by making use of gauge-invariant quantities such as (i) the Detweiler redshift [812, 823, 824, 827, 1005, 1006], (ii) the relativistic periastron advance [814, 815, 838, 840], (iii) the geodetic spin precession frequency [816], and (iv) various tidal invariants [820, 821], all computed as functions of the circular-orbit frequency of the binary. Some of these comparisons were extended to generic bound (eccentric) orbits [818, 832, 843]. All of those comparisons showed perfect agreement in the common domain of validity of the two approximation schemes, thus providing crucial tests for both methods. Building on recent progress on the second-order GSF problem [725, 730, 731, 792, 891, 892], we expect such comparisons to be extended to second order in the mass ratio, e.g. by using the redshift variable [889].

Independently, the BH perturbative techniques of Mano, Suzuki and Takasugi [853, 1007] have been applied to compute analytically, up to very high orders, the PN expansions of the GSF contributions to the redshift for circular [834, 858, 1008, 1009] and eccentric [828, 829, 833, 835, 836] orbits, the geodetic spin precession frequency [837, 859, 1010], and various tidal invariants [817, 826, 830]. Additionally, using similar techniques, some of those quantities have been computed numerically, with very high accuracy, allowing the extraction of the exact, analytical values of many PN coefficients [850, 855, 857].

3.2.4. First law of compact binary mechanics.

The conservative dynamics of a binary system of compact objects has a fundamental property now known as the first law of binary mechanics [1011]. Remarkably, this variational formula can be used to relate local physical quantities that characterize each body (e.g. the redshift) to global quantites that characterize the binary system (e.g. the binding energy). For point-particle binaries moving along circular orbits, this law is a particular case of a more general result, valid for systems of BHs and extended matter sources [1012].

Using the ADM Hamiltonian formalism, the first law of [1011] was generalized to spinning point particles, for spins (anti-)aligned with the orbital angular momentum [1013], and to non-spinning binaries moving along generic bound (eccentric) orbits [1014]. The derivation of the first law for eccentric motion was then extended to account for the non-locality in time of the orbital dynamics due to the occurence at the 4PN order of a GW tail effect [1015]. These various laws were derived on general grounds, assuming only that the conservative dynamics of the binary derives from an autonomous canonical Hamiltonian. (First-law-type relationships have also been derived in the context of linear BH perturbation theory and the GSF framework [10161018].) Moreover, they have been checked to hold true up to 3PN order, and even up to 5PN order for some logarithmic terms.

So far the first laws have been applied to (i) determine the numerical value of the aforementioned ambiguity parameter appearing in derivations of the 4PN two-body EOM [971], (ii) calculate the exact linear-in-the-mass-ratio contributions to the binary's binding energy and angular momentum for circular motion [839], (iii) compute the shift in the frequencies of the Schwarzschild and Kerr innermost stable circular orbits induced by the (conservative) GSF [778, 786, 813, 815, 825, 839, 1005], (iv) test the weak cosmic censorship conjecture in a scenario where a massive particle subject to the GSF falls into a nonrotating BH along unbound orbits [872, 873], (v) calibrate the effective potentials that enter the EOB model for circular [825, 1019] and mildly eccentric orbits [829, 831, 835], and spin–orbit couplings for spinning binaries [828], and (vi) define the analogue of the redshift of a particle for BHs in NR simulations, thus allowing further comparisons to PN and GSF calculations [841, 842].

3.3. Prospects

On the theoretical side, an important goal is to extend the knowledge of the GW phase to the relative 4.5PN accuracy, at least for non-spinning binaries on quasi-circular orbits. (Partial results for some specific tail-related effects were recently derived [1020].) This is essential in order to keep model systematics as a sub-dominant source of error when processing observed GW signals [1021]. Accomplishing that will require, in particular, the calculation of the mass-type quadrupole moment of the binary to 4PN order and the current-type quadrupole and mass-type octupole moments to 3PN order. Moreover, some spin contributions to the waveform—both in the phasing and amplitude corrections—still have to be computed to reach the 4PN level, especially at quadratic order in the spins.

Most of the PN results reviewed here have been established for circularized binaries, and often for spins aligned or anti-aligned with the orbital angular momentum. It is important to extend this large body of work to generic, eccentric, precessing systems. Progress on the two-body scattering problem would also be desirable [10221024]. Additionally, much of what has been achieved in the context of GR could be done as well for well-motivated alternative theories of gravitation, such as for scalar-tensor gravity (e.g. [10251029]) or in quadratic gravity [1030, 1031].

The first law of binary mechanics reviewed above could be extended to generic, precessing spinning systems, and the effects of higher-order spin-induced multipoles should be investigated. Recent work on the PM approximation applied to the gravitational dynamics of compact binaries has given new insight into the EOB model [1004, 10321034], and in particular into the energy map therein. These two lines of research may improve our physical understanding of the general relativistic two-body problem.

On the observational side, future GW detections from inspiralling compact-object binaries will allow testing GR in the strong-field/radiative regime, by constraining possible deviations from their GR values of the various PN coefficients that appear in the expression of the phase. This, in particular, can be used to test some important nonlinear features of GR such as the GW tail, tail-of-tail and nonlinear memory effects. Indeed, the first detections of GWs from inspiralling BH binaries have already been used to set bounds on these PN coefficients, including an $O(10\%)$ constraint on the leading tail effect at the 1.5PN order [3]; see figure 9 (or [18] for a more up-to-date version thereof). More detections with a wider network of increasingly senstive interferometric GW detectors will of course improve those bounds.

Figure 9. Refer to the following caption and surrounding text.

Figure 9. Two GW detections of inspiralling BH binaries were used to set bounds on possible deviations from their GR values of various PN coefficients that appear in the expression for the GW phase. Reproduced from [3]. CC BY 3.0.

Standard image High-resolution image

Finally, following the official selection of the LISA mission by the European Space Agency (ESA), with a launch planned for 2034, we foresee an increased level of activity in source modelling of binary systems of MBHs and EMRIs, two promising classes of sources for a mHz GW antenna in space. This will motivate more work at the interface between the PN approximation and GSF theory.

4. Numerical relativity and the astrophysics of black hole binaries

Contributor: P Schmidt

The year of 2005 marked a remarkable breakthrough: the first successful numerical simulation of—and the extraction of the GWs from—an inspiraling pair of BHs through their merger and final ringdown [10351037] (see e.g. [1038] for a review).

NR provides us with accurate gravitational waveforms as predicted by GR. BBHs cover an eight-dimensional parameter space spanned by the mass ratio $q=m_1/m_2$ , the spin angular momenta $\mathbf{S}_i$ and the eccentricity e. Simulations are computationally extremely expensive, thus the large BBH parameter space is still sparsely sampled. Nevertheless, NR waveforms already play a crucial part in the construction and verification of semi-analytic waveform models used in GW searches, which facilitated the first observations of GWs from BBH mergers [13, 11, 18]. Furthermore, they play a key role in the estimation of source properties and in facilitating important tests of GR in its most extreme dynamical regime.

Since the initial breakthrough, NR has made significant progress: from the first simulations of equal-mass non-spinning BBHs spanning only the last few orbits [10351037], to a realm of simulations exploring aligned-spin [10391041] as well as precessing quasi-circular binaries [10421044], eccentric-orbit binaries [1045, 1046], and evolutions long enough to reach into the early-inspiral regime where they can be matched onto PN models [1047].

Today, several codes are capable of stably evolving BBHs and extracting their GW signal. They can roughly be divided into two categories: finite-differencing codes including BAM [1048], the Einstein Toolkit [1049], LazEv [619, 1036, 1039, 1050], MAYA [1051], LEAN [1052] as well as the codes described in [1035, 1053, 1054]; and (pseudo-)spectral codes such as the Spectral Einstein Code (SpEC) [1055]. Other evolution codes currently under development include [1056] and [1057]. To date, these codes have together produced several thousands of BBH simulations [10581061].

4.1. Current status

4.1.1. Excision and puncture.

In BBH simulations one numerically solves the vacuum Einstein equations with initial conditions that approximate a pair of separate BHs at some initial moment. An obvious complication is the presence of spacetime singularities inside the BHs, where the solution diverges. There are two approaches to this problem. The first is excision [1062], whereby a region around the singularity is excised (removed) from the numerical domain. As no information can propagate outwards from the interior of the BH, the physical content of the numerical solution outside the BHs is unaffected. Since the excision boundary is spacelike (not timelike), one cannot and does not specify boundary conditions on it. Instead, the main technical challenge lies in ensuring that the excision boundary remains spacelike. The risk, for example, is that part of the boundary may become timelike if numerical noise is not properly controlled [10631065]. It must also be ensured that non-physical gauge modes do not lead to numerical instabilities. Excision is used in SpEC, for example.

The second common way to deal with the BH singularities is to choose singularity-avoiding coordinates. This is achieved by representing BHs as compactified topological wormholes [1066] or infinitely long cylinders ('trumpets') [1067], known as puncture initial data. Specific gauge conditions allow the punctures to move across the numerical grid, giving this approach the name moving punctures [1036, 1037, 10671069].

4.1.2. Initial data.

No exact solutions are known, in general, for the BBH metric on the initial spatial surface, so one resorts to approximate initial conditions. Two types of initial data are commonly used: conformally flat and conformally curved. Most simulations that incorporate moving punctures use conformally flat initial data. Under this assumption, three of the four constraint equations (themselves a subset of the full Einstein field equations) are given analytically in terms of the Bowen-York solutions [1070]. The maximal possible angular momentum in this approach is a/m  =  0.93, known as the Bowen-York limit [1071]. In order to go beyond this limit, conformally curved initial data have to be constructed. For codes that use excision, these can be obtained by solving the extended conformal thin sandwich (CTS) equations [1072] with quasi-elliptical boundary conditions [10731076]. The initial spatial metric is proportional to a superposition of the metrics of two boosted Kerr–Schild BHs [1077]. More recently, the first non-conformally flat initial data within the moving punctures framework have been constructed [1078, 1079] by superposing the metrics and extrinsic curvatures of two Lorentz-boosted, conformally Kerr BHs.

4.1.3. Evolution systems.

The successful evolution of a BBH spacetime further requires a numerically stable formulation of the Einstein field equations and appropriate gauge choices. Long-term stable evolutions today are most commonly performed with either a variant of the generalised harmonic [1035, 1080, 1081] or the Baumgarte–Shapiro–Shibata–Nakamura (BSSN) formulation [1082, 1083]. Another formulation, Z4, combines constraint preserving boundary conditions with an evolution system very close to BSSN [10841087]. More advanced Z4-type formulations, which are conformal and traceless, were developed in [10881090].

4.1.4. Gravitational wave extraction.

The GW signal emitted through the inspiral, merger and ringdown is usually extracted from the Newman–Penrose curvature scalar $\Psi_4$ [1091, 1092] associated with the computed metric. The extraction of the signal is typically performed on spheres of constant coordinate radius some distance from the binary, followed by extrapolation to infinity. The method of Cauchy-characteristic extraction (CCE) [1093, 1094] allows us to extract the observable GW signal directly at future null infinity by matching the Cauchy evolution onto a characteristic evolution that extends the simulation to null infinity. In the Cauchy-characteristic extraction, the gravitational waveform is most naturally extracted from the Bondi news function [1095, 1096].

4.2. Challenges

4.2.1. High spins.

Numerical simulations of close to maximally spinning BHs are still challenging to carry out due to difficulties in the construction of initial data as well as increasingly demanding accuracy requirements during the evolutions. This particularly affects binary configurations of unequal masses and arbitrary spin orientation, despite significant developments made in the past five years [10581061, 1097]. While spins up to 0.994 have been evolved stably [1098], extensive work is still underway to reach the proposed Novikov-Thorne limit of 0.998 [1099] and beyond.

4.2.2. High mass ratios.

The highest mass ratio BBH fully general-relativistic numerical simulation available to date is of q  =  100 [1100]. (However, this simulation follows a relatively small number of orbital cycles.) For spinning BHs, simulations of mass ratio q  =  18 have been performed [1097]. Both of these have been obtained in the moving punctures approach. For spectral methods, mass ratios higher than $q\sim 10$ are numerically very challenging. Generally, higher mass ratios are more computationally expensive as the disparate lengthscales demand higher spatial resolution, and because the number of orbital cycles increases in proportion to q. The choice of a Courant factor poses another problem in explicit evolution schemes, as it forces a small time step, which becomes increasingly challenging for high mass ratios. Implicit schemes could provide a potential solution to this [1101], but have not yet been applied to the fully relativistic binary problem. Improvements in numerical technique, and perhaps a synergy with the perturbative methods reviewed in section 2, will be crucial for overcoming this problem in the hope of extending the reach of NR towards the extreme-mass-ratio regime.

4.2.3. Long simulations.

It is crucial for simulations to track the binary evolution from the early inspiral, where it can be matched to a PN model, all the way down to the final merger and ringdown. However, such long evolutions are computationally very expensive, requiring many months of CPU time with existing codes. Note that the run time is not simply linear in the evolution time: a longer run would usually also require a larger spatial domain in order to keep spatial boundaries out of causal contact. To date, only one complete numerical simulation reaching well into the PN regime has been produced, using a modified version of SpEC [1047]. Significant modifications to existing codes would need to be made in order to be able to generate long simulations on a production scale.

4.2.4. Waveform accuracy.

As the sensitivity of GW detectors increases, further work will be required to further improve models and assess their systematic errors [1021]. Already, for large precessing spins, or large mass ratios, or non-negligible eccentricities, systemic modelling errors limit the accuracy with which LIGO and Virgo can extract source parameters (see [1102] for a more detailed discussion). This is due, in part, to relatively large errors in current models of high multipole-mode contributions, which we expect in the coming years to become resolvable in detected signals.

4.2.5. Beyond general relativity.

GW observations from merging compact binaries allow us to probe the strong-field regime and test GR (see chapter III). While theory-agnostic ways to test deviations from GR are commonly used, testing a selection of well-motivated alternative theories through direct waveform comparison is desirable. First steps have been taken to simulate BBHs in alternative theories of gravity that admit BH solutions [1103, 1104]. However, many issues remain to be addressed, not least of which the fact that some of these theories may not even possess a well-posed initial value formulation (see [1105, 1106] for examples).

4.3. Numerical relativity and GW observations

NR plays an important role in GW astrophysics and data analysis. The semi-analytical waveform models employed to search for BBHs (see section 6 below) model the complete radiative evolution from the early inspiral, through the merger and to the final ringdown stage, with the later stages calibrated to BBH simulations [1107, 1108]. These models and more sophisticated ones incorporating more physical effects, for example precession [1109, 1110], are used to determine the fundamental properties of the GW source, i.e. its masses and spin angular momenta as well as extrinsic quantities such as the orbital orientation relative to the observatories [1111, 1112].

Alternatively, pure NR waveforms may be used directly [1113, 1114] or by the means of surrogate models [11151117]. But due to the computational cost of simulations, these have only been attempted so far on a very restricted portion of the parameter space. Despite this restriction, pure NR surrogate models have the advantage of incorporating more physical effects that may be limited or entirely neglected in currently available semi-analytic models.

The ringdown phase after the merger may be described by perturbation theory (see e.g. [1118] for a review), but the amplitudes of the excited quasi-normal modes can only be obtained from numerical simulations. These are of particular interest as measuring the amplitudes of individual quasinormal modes would allow to map the final state of the merger to the properties of the progenitor BHs [1119, 1120] and to test the BH nature of the source [1121] (see also sections 4.1 and 4.2).

In order to estimate the mass and spin angular momentum of the remnant BH, fits to NR simulations are essential [10881090, 11221129]. Independent measurements of the binary properties from the inspiral portion of the GW signal and from the later stages of the binary evolution using fits from NR allow, when combined, to test the predictions from GR [17, 1130]. While the phenomenological fit formulae have seen much improvement in recent years, modelling the final state of precessing BH binaries from numerical simulations still remains an open challenge.

Generically, when BHs merge, anisotropic emission of GWs leads to the build up of a net linear momentum. Due to the conservation of momentum, once the GW emission subsides, the remnant BH recoils ('kick'). While the recoil builds up during the entire binary evolution, it is largest during the non-linear merger phase. Kick velocities can be as high as several thousands ${\rm km}/{\rm s}$ , with astrophysical consequences: a BH whose recoil velocity is larger than the escape velocity of its galaxy may leave its host. Numerical simulations are necessary to predict the recoil velocities [11311134] (a convenient surrogate model for these velocities was constructed recently in [1135]). It has been suggested that it may be possible to directly measure the recoil speed from the GW signal by observing the induced differential Doppler shift throughout the inspiral, merger and ringdown [1136].

Numerical simulations of the strong field regime are also crucial for exploring other spin-related phenomena such as 'spin-flips' [1137] caused by spin–spin coupling, whose signatures may be observable. Understanding the spin evolution and correctly modelling spin effects is crucial for mapping out the spin distribution of astrophysical BHs from GW observations.

Waveform models as well as fitting formulae for remnant properties are prone to systematic modelling errors. For GW observations with low SNR, the statistical uncertainty dominates over the systematic modelling error in the parameter measurement accuracy. Improved sensitivities for current and future GW observatories (including, especially, LISA [1138, 1139]) will allow for high SNR observations reaching into the regime where systematic errors are accuracy limiting. This includes strongly inclined systems, higher-order modes, eccentricity, precession and kicks, all of which can be modelled more accurately through the inclusion of results from NR (see [1021] for a detailed discussion in the context of the first BBH observation GW150914).

5. Numerical relativity in fundamental physics

Contributor: U Sperhake

The standard model of particle physics and Einstein's theory of GR provide us with an exquisite theoretical framework to understand much of what we observe in the Universe. From high-energy collisions at particle colliders to planetary motion, the GW symphony of NS and BH binaries and the cosmological evolution of the Universe at large, the theoretical models give us remarkably accurate descriptions. And yet, there are gaps in this picture that prompt us to believe that something is wrong or incomplete. Galactic rotation curves, strong gravitational lensing effects, X-ray observations of galactic halos and the cosmic microwave background cannot be explained in terms of the expected gravitational effects of the visible matter [281]. Either we are prepared to accept the need to modify the laws of gravity, or there exists a form of dark matter (DM) that at present we can not explain satisfactorily with the SM (or both). Different DM candidates and their status are reviewed in section 5.1. Likewise, the accelerated expansion of the Universe [1140] calls for an exotic form of matter dubbed dark energy or (mathematically equivalently) the introduction of a cosmological constant with a value many orders of magnitude below the zero-point energy estimated by quantum field theory—the cosmological constant problem. Further chinks in the armor of the SM+GR model of the universe include the hierarchy problem, i.e. the extreme weakness of gravity relative to the other forces, and the seeming irreconcilability of GR with quantum theory.

Clearly, gravity is at the center of some of the most profound contemporary puzzles in physics. But it has now also given us a new observational handle on these puzzles, in the form of GWs [1]. Furthermore, the aforementioned 2005 breakthrough in NR [10351037] has given us the tools needed to systematically explore the non-linear strong-field regime of gravity. For much of its history, NR was motivated by the modeling of astrophysical sources of GWs, as we have described in section 4. As early as 1992, however, Choptuik's milestone discovery of critical behaviour in gravitational collapse [1141] has demonstrated the enormous potential of NR as a tool for exploring a much wider range of gravitational phenomena. In this section we review the discoveries made in this field and highlight the key challenges and goals for future work.

5.1. Particle laboratories in outer space

DM, by its very definition, generates little if any electromagnetic radiation; rather, it interacts with its environment through gravity. Many DM candidates have been suggested (see also section 5), ranging from primordial BH clouds to weakly interacting massive particles (WIMPs) and ultralight bosonic fields [281, 282, 1142]. The latter are a particularly attractive candidate in the context of BH and GW physics due to their specific interaction with BHs. Ultralight fields, also referred to as weakly interacting slim particles (WISPs), typically have mass parameters orders of magnitude below an electron volt and arise in extensions of the SM of particle physics or extra dimensions in string theory. These include axions or axion-like particles, dark photons, non-topological solitons (so-called Q-balls) and condensations of bosonic states [281, 11431145]. For reference, we note that a mass $10^{-10}~{\rm eV}$ ($10^{-20}~{\rm eV}$ ) corresponds to a Compton wavelength of $\mathcal{O}(1)~{\rm km}$ ($\mathcal{O}(10^2)~{\rm AU}$ ) which covers the range of Schwarzschild radii of astrophysical BHs.

At first glance, one might think the interaction between such fundamental fields and BHs is simple; stationary states are constrained by the no-hair theorems and the field either falls into the BH or disperses away. In practice, however, the situation is more complex—and more exciting. NR simulations have illustrated the existence of long-lived, nearly periodic states of single, massive scalar or Proca (i.e. vector) fields around BHs [11461148]. Intriguingly, these configurations are able to extract rotational energy from spinning BHs through superradiance [11491152], an effect akin to the Penrose process [1152, 1153]. Further details on this process are provided in section 5.7. In the presence of a confining mechanism that prevents the field from escaping to infinity, this may even lead to a runaway instability dubbed the BH bomb [1154, 1155]. Another peculiar consequence arising in the same context is the possibility of floating orbits where dissipation of energy through GW emission is compensated by energy gain through superradiance [1156, 1157].

Naturally, non-linear effects will limit the growth of the field amplitude or the lifetime of floating orbits, and recent years have seen the first numerical studies to explore the role of non-linearities in superradiance. These simulations have shown that massive, real scalar fields around BHs can become trapped inside a potential barrier outside the horizon, form a bound state and may grow due to superradiance [1146]. Furthermore, beating phenomena result in a more complex structure in the evolution of the scalar field. These findings were confirmed in [1147], which also demonstrated that the scalar clouds can source GW emission over long timescales. The amplification of GWs through superradiance around a BH spinning close to extremality has been modeled in [1158] and found to be maximal if the peak frequency of the wave is above the superradiant threshold but close to the dominant quasi-normal mode frequency of the BH. The generation of GW templates for the inspiral and merger of hairy BHs and the identification of possible smoking gun effects distinguishing them from their vacuum counterparts remains a key challenge for future NR simulations.

Numerical studies of the non-linear saturation of superradiance are very challenging due to the long time scales involved. A particularly convenient example of superradiance in spherical symmetry arises through the interaction of charged, massless scalar fields with Reissner–Nordström BHs in asymptotically anti-de Sitter spacetime or in a cavity; here energy is extracted from the BH charge rather than its rotation. The scalar field initially grows in accordance with superradiance, but eventually saturates, leaving behind a stable hairy BH [1159, 1160]. Recently the instability of spinning BHs in AdS backgrounds was studied in full generality [1161]. The system displays extremely rich dynamics and it is unclear if there is a stationary final state.

The superradiant growth of a complex, massive vector field around a near-extremal Kerr BH has recently been modeled in axisymmetry [1162]. Over $9\%$ of the BH mass can be extracted until the process gradually saturates. Massive scalar fields can also pile up at the center of 'normal' stars. Due to gravitational cooling, this pile-up does not lead to BH formation but stable configurations composed of the star with a 'breathing' scalar field [1163].

The hairy BH configurations considered so far are long-lived but not strictly stationary. A class of genuinely stationary hairy BH spacetimes with scalar or vector fields has been identified in a series of papers [11641168]. The main characteristic of these systems is that the scalar field is not stationary—thus bypassing the no-hair theorems—but the spacetime metric and energy-momentum are. A subclass of these solutions smoothly interpolates between the Kerr metric and boson stars. A major challenge for numerical explorations is to evaluate whether these solutions are non-linearly stable and might thus be astrophysically viable. A recent study of linearized perturbations around the hairy BH solution of [1164] has found unstable modes with characteristic growth rates similar to or larger than those of a massive scalar field on a fixed Kerr background. However, such solution may still be of some astrophysical relevance [1169]. See also section 4.2.1 of chapter III for a related discussion.

5.2. Boson stars

The idea of stationary localized, soliton-like configurations made up of the type of fundamental fields discussed in the previous section goes back to Wheeler's 'gravitational-electromagnetic entities' or geons of the 1950s [1170]. While Wheeler's solutions turn out to be unstable, replacing the electromagnetic field with a complex scalar field leads to 'Klein–Gordon geons' first discovered by Kaup [1171] and now more commonly referred to as boson stars. The simplest type of boson stars, i.e. stationary solutions to the Einstein-complex-Klein–Gordon system, is obtained for a non-self-interacting field with harmonic time dependence $\phi \propto {\rm e}^{{\rm i}\omega t}$ where the potential contains only a mass term $V(\phi)= m^2 |\phi|^2$ . The resulting one-parameter family of these so-called mini boson star solutions is characterized by the central amplitude of the scalar field and leads to a mass-radius diagram qualitatively similar to that of static NSs; a maximum mass value of $M_{\rm max}=0.633\,M_{\rm Planck}^2/m$ separates the stable und unstable branches [11721174]. For a particle mass $m=30\,{\rm GeV}$ , for instance, one obtains $M_{\rm max}\sim 10^{10}\,{\rm kg}$ with radius $R\sim 6\times 10^{-18}$ m and a density 1048 times that of a NS [1175].

A wider range of boson star models is obtained by adding self-interaction terms to the potential $V(\phi)$ which result in more astrophysically relevant bulk properties of the stars. For a quartic term $\lambda |\phi|^4/4$ , for example, the maximal mass is given by $M_{\rm Planck} (0.1\,{\rm GeV}^2)\,M_{\odot}\,\lambda^{1/2}/m^2$ [1176]; for further types of boson stars with different potential terms see the reviews [1174, 1175, 1177] and references therein. A particularly intriguing feature of rotating boson stars exemplifies their macroscopic quantum-like nature: the ratio of angular momentum to the conserved particle number must be of integer value which prevents a continuous transition from rotating to non-rotating configurations [1175, 1178]. More recently, stationary, soliton-like configurations have also been found for complex, massive Proca fields [1179]. For real scalar fields, in contrast, stationary solutions do not exist, but localized, periodically oscillating solutions dubbed oscillatons have been identified in [1180].

Boson stars are natural candidates for DM [1174], but may also act as BH mimickers [1181, 1182]. In the new era of GW observations, it is vital to understand the GW generation in boson-star binaries and search for specific signatures that may enable us to distinguish them from BH or NS systems. Recent perturbative calculations of the tidal deformation of boson stars demonstrate that the inspiral part of the waveform may allow us to discriminate boson stars, at least with third-generation detectors [1183]. Numerical studies of dynamic boson stars have so far mostly focused on the stability properties of single stars and confirmed the stable and unstable nature of the branches either side of the maximum mass configuration; see e.g. [11841187]. The modeling of head-on collisions of boson stars [1188, 1189] reveals rich structure in the scalar radiation and that the merger leads to the formation of another boson star. Head-on collisions have also served as a testbed for confirming the validity of the hoop conjecture in high-energy collisions [1190]. Inspiralling configurations result either in BH formation, dispersion of the scalar field to infinity or non-rotating stars [1191, 1192], possibly a consequence of the quantized nature of the angular momentum that makes it difficult to form spinning boson stars instead.

Binary boson star systems thus remain largely uncharted territory, especially regarding the calculation of waveform templates for use in GW data analysis and the quantized nature of spinning boson stars and their potential constraints on forming rotating stars through mergers.

5.3. Compact objects in modified theories of gravity

New physical phenomena and the signature of new 'modified' theories are typically encountered when probing extreme regimes not accessible to previous experiments and observations. Quantum effects, for instance, become prominent on microscopic scales and their observation led to the formulation of quantum theory while classical physics still provides an accurate description of macroscopic systems. Likewise, Galilean invariance and Newtonian theory accurately describe slow motion and weakly gravitating systems but break down at velocities comparable to the speed of light or in the regime of strong gravity. We therefore expect modifications of GR, if present, to reveal themselves in the study of extreme scales such as the large-scale dynamics of the universe or the strong curvature regime near compact objects.

The dawn of GW observations provides us with unprecedented opportunities to probe such effects. The modeling of compact objects in alternative theories of gravity represents one of the most important challenges for present and future NR studies, so that theoretical predictions can be confronted with observations. This challenge faces additional mathematical, numerical and conceptual challenges as compared to the GR case; a more detailed discussion of these is given in section 3.4 below. A convenient way to classify the numerous modified theories of gravity is provided by the assumptions underlying Lovelock's theorem and, more specifically, which of these assumptions are dropped [1193]. Unfortunately, for most of these candidate theories, well-posed formulations are not known (see table 1 in [1193]) or presently available only in the form of a continuous limit to GR or linearization around some background [1104, 1106]. Prominent exceptions are (single- or multi-) scalar-tensor (ST) theories of gravity [1194] which includes Brans–Dicke theory [1195] and, through mathematical equivalence, a subset of $f(R)$ theories [1196]. ST theories inherit the well-posedness of GR through the Einstein frame formulation [1197]; see also [11981200]. Furthermore, ST physics would present the most conspicuous strong-field deviation from GR discovered so far, the spontaneous scalarization of NSs [1201] (for a similar effect in vector-tensor theory see [1202]). For these reasons, almost all NR studies have focused on this class of theories, even though its parameter space is significantly constrained by solar system tests and binary pulsar observations [1203].

The structure of equilibrium models of NSs in ST theories has been studied extensively in the literature (e.g. [1201, 12041210]) and leads to a mass-radius diagram that contains one branch of GR or GR-like models plus possible additional branches of strongly scalarized stars. The presence or absence of these additional branches depends on the coupling between the scalar and tensor sector of the theory.

Early numerical studies considered the collapse of dust in spherical symmetry [12111214] which leads to a hairless BH in agreement with the no-hair theorems, even though departures from GR are possible during the dynamic stages of the collapse. In a sequence of papers, Novak et al studied the collapse of NSs into a BH [1215], the transition of NSs between stable branches [1216] and the formation of NSs through gravitational collapse [1217]. In all cases, strong scalar radiation is generated for that part of the ST theory's parameter space that admits spontaneously scalarized equilibrium models. In [1218], the collapse of stellar cores to BHs was found to be the most promising scenario to generate detectable scalar radiation for parameters allowed by the Cassini and pulsar observations; galactic sources at a distance $D=10\,{\rm kpc}$ may be detected with present and future GW observatories or used to further constrain the theory's parameters. All these simulations, however, consider massless ST theory. For massive fields, low frequency interactions decay exponentially with distance, so that the pulsar and Cassini constraints may no longer apply [1219]. In consequence, massive ST theory still allows for very strongly scalarized equilibrium stars if $m\gtrsim 10^{-15}~{\rm eV}$ [1209, 1210]. This has dramatic consequences for the GW signals that can be generated in massive ST theory as compared with its massless counterpart: GW amplitudes can be orders of magnitude larger and the waves are spread out into a nearly monochromatic signal over years or even centuries due to the dispersion of the massive field. GW searches may therefore be directed at historic supernovae such as SN1987A and either observe a signal or constrain the theory's parameter space [1220].

Numerical studies of binary systems in ST theory are rather scarce. The no-hair theorems strongly constrain possible deviations of pure BH spacetimes from GR. They can be bypassed, however, through non-trivial potentials [1221] or boundary conditions [1103] which leads to scalar wave emission. Nevertheless, NS systems appear to be the more natural candidate to search for imprints of ST theory. Dynamical scalarization has indeed been observed in simulations of the merger of two NSs with initially vanishing scalar charge [1222, 1223]. Beyond GR and ST theory, we are only aware of one numerical study [1104], which simulated the evolution of BH binaries in the dynamical Chern-Simons (dCS) theory linearized around GR. LIGO observations may then measure or constrain the dCS length scale to $\lesssim $ $ \mathcal{O}(10)~{\rm km}$ .

With the dawn of GW astronomy [1], the topics discussed so far in this section are becoming important subjects of observational studies with LIGO, Virgo and future GW detectors. These studies are still in an early stage and the generation of precision waveforms for scenarios involving modifications of gravity, fundamental fields or more exotic compact objects will be a key requirement for fully exploiting the scientific potential of this new channel to observing the universe.

The analysis of GW events has so far concentrated on testing the consistency of the observed signals with GR predictions, establishing bounds on phenomenological parametrizations of deviations from GR and obtaining constraints from the propagation of the GW signal. An extended study of GW150914 demonstrated consistency between the merger remnant's mass and spins obtained separately from the low-frequency inspiral and the high-frequency postinspiral signal [17]. The Compton wavelength of the graviton was constrained with a 90% lower bound of 1013 km (corresponding to an upper bound for the mass of  ∼$ 10^{-22}\,{\rm eV}$ ) and parametrizations of violations of GR using high-order PN terms have been constrained; see also [11, 20]. The first NSB observation [21], combined with electromagnetic observations, limited the difference between the speed of propagation of GWs and that of light to within $-3\times 10^{-15}$ and $7\times 10^{-16}$ times the speed of light [23]. Analysis of the polarization of the first triple coincidence detection GW170814 found Bayes' factors of 200 (1000) favoring a purely tensor polarization against purely vector (purely scalar) radiation. In summary, the GW observations have as yet not identified an inconsistency with the predictions of vacuum GR for BBH signals or GR predictions for NS systems.

Chapter III contains detailed discussions on many of the topics raised here, including motivation and brief description of certain broad classes of alternative to GR (section 2.1), numerics beyond GR (section 3.4), and the nature of compact objects beyond GR (section 4).

5.4. High-energy collisions of black holes

The hierarchy problem of physics consists in the vast discrepancy between the weak coupling scale ($\approx$ 246 ${\rm GeV}$ ) and the Planck scale $1.31\times 10^{19}~{\rm GeV}$ or, equivalently, the relative weakness of gravity compared with the other interactions. A possible explanation has been suggested in the form of 'large' extra spatial dimensions [1224, 1225] or extra dimensions with a warp factor [1226, 1227]. On short lengthscales $\lesssim $ $10^{-4}~{\rm m}$ , gravity is at present poorly constrained by experiment and would, according to these models, be diluted due to the steeper fall off in higher dimensions. All other interactions, on the other hand, would be constrained to a 3+1 dimensional brane and, hence, be unaffected. In these braneworld scenarios, the fundamental Planck scale would be much smaller than the four-dimensional value quoted above, possibly as low as $\mathcal{O}({\rm TeV})$ which inspired the name TeV gravity. This fundamental Planck scale determines the energy regime where gravity starts dominating the interaction, leading to the exciting possibility that BHs may be formed in particle collisions at the LHC or in high-energy cosmic rays hitting the atmosphere [12281230].

The analysis of experimental data employs so-called Monte-Carlo event generators [1231] which require as input the cross section for BH formation and the loss of energy and momentum through GW emission. In the ultrarelativistic limit, the particles may be modeled as pointlike or, in GR, as BHs.

In D  =  4 spacetime dimensions, high-energy collisions of BHs are by now well understood. Head-on collisions near the speed of light radiate about $14\%$ of the center-of-mass energy M in GWs [1232, 1233]. The impact parameter separating merging from scattering collisions with boost velocity $v$ is $b/M=(2.50\pm0.05)/v$ [1234]. Grazing collisions exhibit zoom-whirl behaviour [1235, 1236] and can radiate up to  ∼$ 50\%$ of the total energy in GWs [1237]. The collision dynamics furthermore become insensitive to the structure of the colliding objects—BHs or matter balls—at high velocities [1190, 12371240] as expected when kinetic energy dominates the budget. Finally, NR simulations of hyperbolic BH encounters are in good agreement with predictions by the EOB method [1022].

The key challenges are to generalize these results to the higher D scenarios relevant for TeV gravity (there are partial, perturbative results [12411243]). Considering the symmetries of the collision experiments, it appears plausible to employ rotational symmetry in higher D numerics which is vital to keep the computational costs under control and underlies all NR studies performed so far [1054, 12441247]. Nonetheless, NR in higher D faces new challenges. The extraction of GWs is more complex, but now tractable either with perturbative methods [12481250], using the Landau-Lifshitz pseudotensor [1251] or projections of the Weyl tensor [1252, 1253]. Likewise, initial data can be obtained by generalizing four-dimensional techniques [1254]. Studies performed so far, however, indicate that, for reasons not yet fully understood, obtaining numerically stable evolutions is harder than in D  =  4 [1255, 1256]. Results for the scattering threshold are limited to $v\lesssim 0.5~c$ [1255] and the emission of GWs has only been computed for non-boosted collisions of equal and unequal mass BH binaries [12561258]. These simulations show a strong suppression of the radiated energy with D beyond its peak value $E_{\rm rad}/M \approx 9\times 10^{-4}$ at D  =  5 for equal-mass systems, but reveal more complex behaviour for low mass ratios. A remarkable outcome of BH grazing collisions in D  =  5 is the possibility of super-Planckian curvature in a region outside the BH horizons [1255].

5.5. Fundamental properties of black holes and non-asymptotically flat spacetimes

Recent years have seen a surge of NR applications to non-asymptotically flat spacetimes in the context of the gauge-gravity duality, cosmological settings and for the exploration of fundamental properties of BH spacetimes. We list here a brief selection of some results and open questions; more details can be found in the reviews [1259, 1260].

Cosmic censorship has for a long time been a topic of interest in NR, but to date no generic, regular and asymptotically flat class of initial data are known to result in the formation of naked singularities in four spacetime dimensions. Higher-dimensional BHs, however, have a much richer phenomenology [1261], including in particular black rings which may be subject to the Gregory–Laflamme instability [1262]. Thin black rings have indeed been found to cascade to a chain of nearly circular BHs connected by ever thinner segments in finite time [1263] in the same way as infinite black strings [1264]. Similarly, ultraspinning topologically spherical BHs in $D\geqslant 6$ dimensions are unstable [1265] and ultimately form ever thinner rings in violation of the weak cosmic censorship conjecture [1266].

We have already mentioned Choptuik's discovery of critical phenomena in the collapse of spherical scalar fields [1141]. In asymptotically anti-de Sitter (AdS) spacetimes, the dynamics change through the confining mechanism of the AdS boundary, allowing the scalar field to recollapse again and again until a BH forms [12671271]; see also [1272] for non-spherically symmetric configurations. NR simulations of asymptotically AdS spacetimes are very challenging due to the complex outer boundary conditions, in particular away from spherical symmetry, but recent years have seen the first simulations of BH collisions in AdS [1273, 1274] which, assuming gauge-gravity duality, would imply a far-from hydrodynamic behaviour in heavy-ion collisions during the early time after merger.

Using a cosmological constant with an opposite sign leads to asymptotically de Sitter (dS) spacetimes widely believed to describe the Universe we live in. NR studies in dS have explored the possible impact of local structures on the cosmological expansion [12751279] and found such inhomogeneities to not significantly affect the global expansion. Further work has explored the robustness of inflation under inhomogeneities [1280], the propagation of light in an expanding universe [1281] and the impact of extreme values of the cosmological constant on the physics of BH collisions [1282].

6. Effective-one-body and phenomenological models

Contributor: T Hinderer

This section surveys the status of two main classes of models that are currently used in GW data analysis: (i) the EOB approach, which describes both the dynamics and waveforms in the time domain for generic spinning binaries, and (ii) the phenomenological approach (Phenom), which provides a closed-form description of the waveform in the frequency domain and includes the dominant spin effects. The discussion below reviews mainly the current state-of-the-art models; a more comprehensive overview of prior work can be found in review articles such as [12831285].

6.1. Effective-one-body models

The EOB approach was introduced in [1286, 1287] as a method to combine information from the test-particle limit with PN results (for early ideas in this spirit, see [1288]). The model comprises a Hamiltonian for the inspiral dynamics, a prescription for computing the GWs and corresponding radiation reaction forces, and a smooth transition to the ringdown signals from a perturbed final BH. The idea is to map the conservative dynamics of the relative motion of two bodies, with masses m1,2, spins ${\boldsymbol S}_{1,2}$ , orbital separation $\boldsymbol x$ and relative momentum $\boldsymbol p$ , onto an auxiliary Hamiltonian description of an effective particle of mass $\mu =m_1\,m_2/(m_1+m_2)$ and effective spin $\boldsymbol{S}_*(\boldsymbol{S}_1,\boldsymbol{S}_2,\boldsymbol{x},\boldsymbol{p})$ moving in an effective spacetime $g_{\mu \nu}^{\rm ef\,\!f}(M,\boldsymbol{S}_{\rm Kerr};\nu)$ that is characterized by the total mass $M =m_1+m_2$ , symmetric mass ratio $\nu= \mu/M$ , and total spin $\boldsymbol{S}_{\rm Kerr}(\boldsymbol{S}_1,\boldsymbol{S}_2)$ . The basic requirements on this mapping are that (i) the test-particle limit reduces to a particle in Kerr spacetime, and (ii) in the weak-field, slow-motion limit the EOB Hamiltonian reduces to the PN Hamiltonian, up to canonical transformations. These considerations, together with insights from results in quantum-electrodynamics [1289], scattering theory [1004, 1032], and explicit comparisons of the action variables [1286] all lead to the 'energy map' given by

Equation (11)

where H and $H_{\rm ef\,\!f}$ are the Hamiltonians describing the physical binary system and the effective particle respectively.

The details of the effective Hamiltonian $H_{\rm ef\,\!f}$ are less constrained by theoretical considerations, and different descriptions have been proposed that differ in the structure of the Hamiltonian, the form of the potentials characterizing the effective spacetime, and the spin mapping between the physical and effective systems. The differences between these choices reflect current limitations of theoretical knowledge; they all agree in the PN and nonspinning test-particle limit. For the structure of $H_{\rm ef\,\!f}$ , the incarnation of the EOB model of [1108, 12901293] imposes that the limiting case of a spinning test-particle in Kerr spacetime must be recovered, to linear order in the test-spin. The version of the model of [12941298] does not include test-particle spin effects, which enables a more compact description. These different choices also result in different spin mappings, for both the Kerr parameter and the spin of the effective particle. Finally, these two branches of EOB models also employ different ways to re-write the potentials that are calculated as a Taylor series in a PN expansion in a 're-summed' form. This means that an empirically motivated non-analytic representation is used that consists either of a Pade-resummation [12941296] or has a logarithmic form [1291, 1292]. In addition to the above choices for describing the strong-field, comparable-mass regime, the models also include parameterized terms whose coefficients are functions of the mass and spin parameters that are fixed by comparisons to NR results. In the model of [1108, 12911293], these calibration parameters are constrained by the requirement that the model must reproduce the GSF results for the ISCO shift in Schwarzschild [813].

The radiative sector in the EOB model is described by so-called factorized waveforms (instead of a PN Taylor series expansion) that are motivated from the structure of waveforms in the test-particle limit and have the form [1299, 1300]

Equation (12)

Here, $ \newcommand{\e}{{\rm e}} h_{\ell m}^{(N,\epsilon)}$ is the Newtonian contribution, and $ \newcommand{\e}{{\rm e}} \hat{S}_{\rm ef\,\!f}^{(\epsilon)}$ is a certain effective 'source term' that, depending on the parity $ \newcommand{\e}{{\rm e}} \epsilon$ of the mode, is related to either the energy or the angular momentum. The factor $ \newcommand{\e}{{\rm e}} T_{\ell m}$ contains the leading order logarithms arising from tail effects, the term $ \newcommand{\e}{{\rm e}} {\rm e}^{{\rm i}\delta_{\ell m}}$ is a phase correction due to sub-leading order logarithms in hereditary contributions, while the function $ \newcommand{\e}{{\rm e}} f_{\ell m}$ collects the remaining PN terms. Finally, the factor $ \newcommand{\e}{{\rm e}} N_{\ell m}$ is a phenomenological correction that accounts for deviations from quasi-circular motion [1301] and is calibrated to NR results. The modes from equation (12) are used to construct dissipative forces $\boldsymbol{\mathcal{F}}$ , in terms of which the equations of motion are given by

Equation (13)

Equation (14)

This EOB description of the inspiral-plunge dynamics is smoothly connected to the merger-ringdown signal in the vicinity of the peak in the amplitude |h22|. To perform this matching, initial nonspinning models and the aligned-spin models SEOBNRv1 [1292] and SEOBNRv2 [1293, 1302] (as well as in the models of the IHES group [1294, 1303]) used a superposition of damped sinusoids similar to quasi-normal modes [1287]. This method is inspired by results for the infall of a test-particle into a BH and the close-limit approximation [1304]. More recently, a simpler phenomenological fit of the amplitude and phase inspired by the rotating source approximation [1305], which was adapted to the EOB context in [1306], has become standard in SEOBNRv4 [1108]. This method provides a more stable and controlled way to connect the inspiral-plunge to the ringdown. A further key input into the merger-ringdown model is the frequency of the least-damped quadrupolar quasinormal mode $\sigma_{220}$ of the remnant based on a fitting formula from NR for the mass and spin of the final object given the initial parameters, with the currently most up-to-date fit from [1108].

Generic spin precession effects are also included in the model, by starting from a calibrated spin-aligned model and transforming to the precessing frame as dictated by the precession equations derived from the EOB Hamiltonian [1307, 1308], without further calibrations. The most recent refinement for generic precessing binaries from [1308] is known as SEOBNRv3.

For non-vacuum binaries involving e.g. neutron stars, exotic compact objects, or condensates of fundamental fields around BHs, several effects of matter change the GWs from the inspiral relative to those from a BH binary, as described in chapter III, section 2.4, where also the signatures from such systems generated during the merger and ringdown are covered. Effects during the inspiral include spin-induced deformations of the objects, tidal effects, the presence of a surface instead of an event horizon, tidal excitation of internal oscillation modes of the objects, and more complex spin-tidal couplings. Two classes of EOB models for such systems are currently available, corresponding to the two different baseline EOB models for BHs. One is known as TEOBRESUMS [1309] and incorporates the effects of rotational deformation and adiabatic tidal effects [1310, 1311] in re-summed form that was inspired by [826] and augmented as described in [439]. The other model is known as SEOBNRv4T and includes the spin-induced quadrupole as well as dynamical tides from the objects' fundamental oscillation modes [447, 1312].

6.2. Phenomenological (Phenom) models

The aim of the Phenom models is to provide a simple, efficient, closed-form expression for the GW signal in the frequency domain [1313] by assuming the schematic form

Equation (15)

where $\vec {\alpha}$ and $\vec {\beta}$ are amplitude and phase parameters in the model. Phenomenological ('Phenom') models were first developed for nonspinning binaries in [1313, 1314] and subsequently refined to include aligned spins [1315] known as 'PhenomB'. This model employed only a single weighted combination of the individual BH spins characterizing the dominant spin effect in the GWs, and was further refined in [1316] ('PhenomC'), and in [1097, 1107] ('PhenomD'). The latter has been calibrated using NR data for mass ratios up to 1:18 and dimensionless spins up to 0.85 (with a larger spin range for equal masses). An effective description of the dominant precession effects has also been developed [1111] ('PhenomP') [1317, 1318]. The PhenomP model provides an approximate mapping for obtaining a precessing waveform from any non-precessing model, with PhenomD being currently used as the baseline.

To construct these state-of-the-art Phenom models, the GW signal is divided into three main regimes: an inspiral, intermediate region, and merger-ringdown. The ansatz for the inspiral model is the PN result for the frequency-domain phasing obtained from energy balance in the stationary phase approximation ('TaylorF2'), accurate to 3.5PN in the nonspinning and linear-in-spin sector, and to 2PN in spin–spin terms. This has the form

Equation (16)

where ci are PN coefficients. The Phenom models add to this auxiliary terms so that the inspiral phase becomes

Equation (17)

where $\sigma_i$ are phenomenological coefficient with $\sigma_{1}=\sigma_2=0$ .

A different ansatz is made for the late inspiral and merger-ringdown signals that are likewise closed-form expressions involving the frequency and phenomenological parameters; in total the model involves 17 phenomenological parameters. For the late inspiral portion, these are mapped to the set of two physical parameters $(\nu, \chi_{\rm PN})$ , where $\chi_{\rm PN}$ is defined by

Equation (18)

Equation (19)

where $\hat L_N$ is the direction of the Newtonian angular momentum and $\chi_i=S_i/m_i^2$ . $\chi_{\rm PN}$ describes the dominant spin effect (the dependence on M is only a rescaling). The mapping is done by assuming a polynomial dependence $\sigma_i=\sum b_{ijk}\nu^{\,j} \chi_{\rm ef\,\!f}^k$ to quadratic order in $\nu$ and third order in $\chi_{\rm PN}$ so that the coefficients vary smoothly across the parameter space. Finally, the coefficients are calibrated to a large set of hybrid waveforms, which themselves are formed using the uncalibrated SEOBNRv2 model.

Spin effects in the Phenom models are described by several different combinations of parameters. In the aligned-spin baseline model, the description of the early inspiral depends on both spin parameters $\chi_1,\chi_2$ from PN. In the later inspiral regime, spin effects are described by the effective combination $\chi_{\rm PN}$ described above, while the merger-ringdown model is expressed in terms of the total spin. For generic spin orientations, an additional parameter $\chi_p$ that characterized the most readily measurable effects of precession is included in the model. The defnition of the precessional parameter $\chi_p$ is motivated by the observation that waveforms are simpler in the coprecessing frame that is aligned with the direction of the dominant radiation emission [1044]; it is given by [1318]

Equation (20)

where $B_1=2+3 m_2/(2 m_1)$ and $B_2=2+3 m_1/(2 m_2)$ assuming the convention $m_1>m_2$ . Starting from an aligned-spin frequency-domain waveform model, an approximate precessing waveform is constructed by 'twisting up' the non-precessing waveform with the precessional motion based on a single-spin PN description for the precession angles [1111, 1317]. While there are some broad similarities of PhenomP with the way the precessing EOB model is constructed, the two approaches differ in several details, as explained in section IV of [1308].

For non-BH objects, the effects of rotational and tidal deformations are included in the phasing using either PN information [13191321] or, for the case of binary neutron stars, using a model calibrated to NR [1322, 1323].

6.3. Remaining challenges

Important advances that all EOB and Phenom models aim to address in the near-future are higher modes, parameter space coverage (especially in the mass ratio) and inclusion of all available theoretical knowledge, reduction of systematic errors, and going beyond circular orbits.

To date, most of the effort in calibrating the EOB and Phenom models has focused on the $(2,2)$ mode, although for the special case of nonspinning binaries an EOB multipolar approximant is available [1324]. An accurate model of higher modes is important for robust data analysis, especially for spinning binaries. Work is ongoing to address this within the EOB model [1325] and with studies that will inform the Phenom approach [1326]. The parameter space over which current models have been calibrated and tested is limited by available NR simulations of sufficient accuracy and length (see section 4), and systematic uncertainties remain a concern.

Extending the range in parameter space ties into the pressing issue that many available results from GSF calculations are not currently incorporated in these models. One of the obstacles is that GSF data for circular orbits only make sense above the 'light ring' radius and cannot directly inform the EOB model across and below that radius. This issue is a further subject of ongoing work. Efforts are also underway to include effects of eccentricity [13271329]. Effects such as the motion of the center-of-mass, BH radiation absorption, and radiation reaction for spins are also not yet included in current models.

For EOB models, efficiency for data analysis is a further concern, because computing EOB waveforms is relatively time-consuming. To overcome this issue, reduced-order models for EOB waveforms in the frequency domain have been developed for aligned-spin binaries [1108, 1330], with work being underway on the larger parameter space of precessing binaries. Also ongoing is work on theoretical aspects of spin effects in the EOB model, aimed at devising an improved description that is more robust and computationally efficient.

6.4. Tests of GR with EOB and Phenom models

The effective models described above have been used for tests of GR and exotic objects, e.g. in [17]. One of the parameterized tests makes use of the general framework 'TIGER' [1331] that has been implemented for data analysis, where parameterized fractional deviations from the GR coefficients are included in the frequency-domain phase evolution from equation (16). In recent studies, e.g. [17, 18], this framework was used on top of the PhenomD inspiral model [1097, 1107] to obtain bounds on non-GR effects, where the tests also included parameterized deviations in the intermediate and merger-ringdown regime. Work is ongoing to implement tests of GR within the EOB approach [1332]. Finally, work is also underway to include in these models the option to test for exotic physics manifest in spin-quadrupole and tidal effects during the inspiral. Despite this recent progress on frameworks to test GR and the nature of BHs, substantial further work will be necessary to refine the level of sophistication and physical realism of such tests.

7. Source modelling and the data-analysis challenge

Contributor: J R Gair

Models of GW sources form an essential component of data analysis for GW detectors. Indeed, many of the scientific objectives of current and future GW detectors cannot be realised without having readily available accurate waveform models. In this section we first provide a brief overview of the primary techniques used for GW data analysis, and follow this with a discussion of the additional challenges that data analysis for future GW detectors will pose and the corresponding requirements that this will place on waveform models.

7.1. Overview of current data analysis methods

The LIGO-Virgo Collaboration (LVC) uses a variety of techniques to first identify and then characterise GW transients identified in their data. The majority of these make use of signal models in some way. We briefly describe these methods in the following.

7.1.1. Unmodelled searches.

A number of LVC analysis pipelines are targeted towards 'burst' sources for which the waveforms are not well modelled. These included Coherent Wave Burst [1333], X-pipeline [1334, 1335] and BayesWave [1336]. All three algorithms make use of the fact that there are multiple detectors in the LVC network, which allows signals (common between different detectors) to be distinguished from noise (generally uncorrelated between detectors). The algorithms differ in their implementation. Coherent Wave Burst and X-pipeline operate on spectrograms (time-frequency) maps of the data, computed using wavelet transforms in the first case and a Fourier transform in the second. Both algorithms then identify 'hot' pixels that have particularly large power, carry out clustering to identify candidate events, i.e. continuous tracks of excess power, in individual detectors, and then impose consistency in the properties of the tracks identified in the different detectors. BayesWave takes a slightly different approach, using Bayesian techniques to construct a model for the noise in each detector and any signal present. The signal is constructed as a superposition of wavelets, and reversible jump Markov Chain Monte Carlo is used to add or remove components from the signal and noise models.

These model-free searches are powerful tools for source identification. Indeed, the first algorithm to find GW150914, the first GW event detected by LIGO, was the Coherent Wave Burst [1, 19], because it was the only online search pipeline running at the time of the event. However, these algorithms are not as sensitive as searches based on models and the second clear GW event, GW151226, was only found with high significance by the template-based searches [2]. In addition, parameter estimation can only be done using models. Moreover, of the three algorithms, only BayesWave is truly independent of waveform models. Both Coherent Wave Burst and X-pipeline uses signal injections in order to determine the optimal choice of the various tunable thresholds in the algorithms that maximizes distinction between signal and noise. The injections use realistic GW signal waveforms, for which models are needed. BayesWave does not do tuning in this way, although the development of that algorithm and demonstration of its performance was done using signal injections.

7.1.2. Template based searches.

The primary search pipelines within the LVC compact binary coalescence (CBC) group use matched filtering. This involves using a precomputed bank of templates of possible GW signals and computing their overlap, i.e. inner product, with the data. The template bank needs to fully cover the parameter space so that if a signal is present, at least one of the templates will recover it confidently. There are two primary matched filtering based searches used by the LVC—pyCBC [1337, 1338] and gstLAL [1339, 1340]. The two searches differ in a number of ways, such as the choice of template bank and placement, and in the various consistency checks that are used to compare the signals identified in the different detectors and to compare the post-signal extraction residual to the expected noise distribution. We refer the interested reader to the previous references for more details. All the BBH systems identified by LIGO to date were found by both of these pipelines with very high significance, and several of them would not have been identified using template-free methods only. Matched filtering is only possible if models of GW signals are available that have sufficient fidelity to true signals.

7.1.3. Parameter estimation.

The other primary area where GW signal models are essential is in parameter estimation. Once a potential signal has been identified by one of the search pipelines described above, the signal is characterized using the separate LALInference pipeline [1341] (though other parameter inference methods are also used [1342, 1343]). LALInference constructs a posterior probability distribution for the source parameters using Bayes theorem. This relies on a model for the likelihood which is taken to be the likelihood of the noise (assumed Gaussian on the short stretches of data around each signal). The noise is computed as observed data minus signal, and is therefore a function of the signal parameters and requires an accurate model of potential signals. LALInference includes two different algorithms—LALInferenceNest [1344] and LALInferenceMCMC [1345]. The first uses nested sampling to determine the posterior and associated model evidence, while the latter uses Markov Chain Monte Carlo techniques based on the Metropolis-Hastings algorithm.

Parameter estimation is essential to extract physical information from identified GW events, and the resulting posterior distributions summarise all our information about the properties of the source. Any physical effect that we wish to probe using GW observations must be included in the signal model used in parameter estimation. The recent NSB event observed by LIGO, GW170817 [21], showed some evidence for tidal effects in the signal, which highlighted deficiencies in signal models with tides and the need for further work in that area [1346].

7.2. Challenges posed by future detectors

The next LIGO-Virgo observing run, O3, is scheduled to start in early 2019, and is expected to have a factor of  ∼2 improvement in sensitivity over the O2 run that finished in August 2017. LIGO/Virgo are expected to complete their first science runs at design sensitivity in the early 2020s [14]. Based on the earlier science runs, several tens of BBH systems and a small number of NSB events [9, 21] are likely to be detected in O3. These events will be similar to sources previously identified and so the primary challenge will be computational—the LVC will need the capability to process multiple events simultaneously, which implies a need for accurate waveform models that are as fast to generate as possible. Further in the future there are plans for third generation ground based detectors, such as the Einstein telescope [33] and cosmic explorer [32], and a space-based GW detector, the Laser Interferometer Space Antenna (LISA) [64]. These new detectors will observe new types of source which will pose new modelling challenges.

7.2.1. Third-generation ground-based detectors.

Planned third-generation ground-based detectors, like the Einstein telescope (ET) [33] or the cosmic explorer [32], aim to improve on advanced detectors in two ways. Firstly, they aim to have an increase in sensitivity by an order of magnitude. Secondly, they aim to improve low-frequency sensitivity, with the ultimate goal of sensitivity as low as 1 Hz, compared to  ∼30 Hz for the LIGO/Virgo detectors in O2, and 10 Hz at design sensitivity. For the ET, these aims will be achieved by increasing the arm-length to 10 km, sitting the detector underground and using cryogenic cooling of the mirrors. ET will also have three arms in a triangular configuration, so that it is equivalent to having two detectors at the same site.

The increase in sensitivity will increase the number of sources detected by about a factor of a thousand. Data analysis must be able to keep pace with such a high source volume, which means algorithms must run quickly. This places constraints on the computational cost of waveform models, which is discussed further below. The improvement in low frequency sensitivity significantly increases the duration of any given signal in band. At leading order, the time to coalescence of a binary scales like f−8/3 [142]. The NSB event GW170817 was in the LIGO band for 40 s, starting from 30 Hz, and generated 3000 waveform cycles [21]. For a detector with low frequency cut-off at 3 Hz, the same source would be in band for  ∼5 h and generate  ∼105 cycles. This longer duration places more stringent requirements on waveform models, since fractional errors in the waveforms will need to be small enough that the templates are accurate to within one cycle of the 105 in band. This is mitigated partially by the fact that most of the additional cycles are generated in the weak field where analytic models are well understood.

Third-generation detectors also offer the prospect of detection of new classes of sources. These include higher-mass BH systems, made possible by the improved low-frequency sensitivity [33], and possibly intermediate mass ratio inspirals (the inspirals of stellar origin BHs into intermediate mass BHs with mass  ∼$ 100M_\odot$ ) [1347, 1348]. The former do not pose additional modelling challenges, as these signals will be well represented by rescaling templates for lower mass systems. The latter, however, lie in a regime where both finite mass-ratio effects and higher-order PN effects become important. The latter require numerical techniques, but these are limited in terms of the number of cycles that can be modelled, while the former requires perturbative techniques, but are then limited by the size of mass ratio corrections. Any IMRIs observed are likely to be at very low SNR and so will only be identifiable in the data if accurate templates are available. Initial attempts to construct hybrid IMRI models have been made [13491352], but considerable work is still required.

It is also hoped that it will be possible to test GR to high precision with third-generation ground-based detectors [33]. This requires development of models in alternative theories. This challenge is common to space-based detectors and is discussed further in the next section.

7.2.2. The Laser Interferometer Space Antenna.

LISA is a space-based GW detector that has been selected as the third large mission that ESA will launch in its current programme, with a provisional launch date of 2034. LISA will comprise three satellites, arranged in an approximately equilateral triangular formation with 2.5Mkm long arms and with laser links passing in both directions along each arm. By precisely measuring the phase of the outgoing and incoming laser light in each arm LISA can do interferometry and detect GWs. It will operate at lower frequency than the LIGO/Virgo detectors, in the range 0.1 mHz–0.1 Hz, with peak sensitivity around a few mHz. The lower frequency sensitivity means that the typical systems that LISA will observe have higher mass, $M\sim 10^4$ $10^7M_\odot$ . Such massive BHs (MBHs) are observed to reside in the centres of lower mass galaxies. LISA is expected to observe mergers of binaries comprised of two such MBHs, which are expected to occur following mergers between the BH host galaxies. MBHs are typically surrounded by clusters of stars, which include BHs similar to those observed by LIGO that were formed as the endpoint of stellar evolution. LISA is also expected to observe the EMRIs of such stellar origin BHs into MBHs. In addition to these MBH sources, LISA will also observe stellar compact binaries in the Milky Way, it could detect some of the stellar origin BH binaries that LIGO will observe and may detect sources of cosmological origin [64]. The latter sources do not pose particular modelling challenges, but the BH sources do.

In contrast to the LIGO-Virgo network, there will be only one LISA constellation. While the three LISA arms allow, in principle, the construction of two independent data streams, there will inevitably be some correlation between noise in these channels. In addition, LISA sources are long-lived, lasting months or years in the data set, and so there will be hundreds of sources overlapping in the data. These properties make it much more difficult to construct unmodelled source pipelines like those used in LIGO and so LISA will rely even more heavily on having models of potential signals in order to identify them in the data. Typical MBH binary signals will have SNR in the tens to hundreds, with a few as high as a thousand. This allows much more precise estimates of parameters, but places much higher demands on the fidelity of waveform models. A template accurate to a few percent is fine for characterising a source that has SNR of a few tens as is typical for LIGO/Virgo, but for an SNR of one thousand, the residual after extracting that source will have SNR in the several tens, biasing parameter estimation and contaminating the extraction of subsequent sources. Templates need to be two orders of magnitude more accurate for use in LISA. This accuracy comes coupled to the need for longer duration templates, as for the third-generation ground based detectors, since the signals are present in the data stream for months. MBH binaries detected by LISA are additionally expected to have high spins [1353], in contrast to the observed LIGO/Virgo sources which are all consistent with low or zero spin [3, 11, 18, 20, 1354, 1355]. Finally, MBH binaries are more likely to show precession. The likely presence of these physical effects in observed signals, coupled with the necessity of model-based searches for LISA places strong requirements on the MBH binary waveform models that must be available by the time LISA flies.

For EMRIs, expected SNRs are in the tens [668, 887], but this SNR is accumulated over  ∼105 cycles. This makes unmodelled EMRI searches impossible and imposes the requirement on modelled searches that the EMRI templates match the true signals to better than one cycle over 105. In addition, all of these cycles are generated in the strong-field where accurate modelling of the signals is more challenging. This drives the requirement for GSF EMRI models accurate to second order in the mass ratio described above. These models must include the effect of high spin in the central MBH, and eccentricity and inclination of the orbit of the smaller BH [695, 1356]. EMRI models will also have to be computationally efficient. Naively, to match 105 cycles in a parameter space with eight intrinsic (and six extrinsic) parameters, requires $(10^5){}^8 \sim 10^{40}$ templates. This is a crude overestimate, but illustrates the complexity of a template-based EMRI search, and the need to be able to generate large numbers of templates in a small computational time. Semi-coherent methods have been proposed [694] that have less stringent requirements, but still need templates accurate for 103 or more cycles.

Finally, one of LISA's primary science objectives is to carry out high precision tests of GR, including both tests of strong-field gravity and tests of the nature of compact objects by using EMRIs to probe the gravitational field structure in the vicinity of BHs. Many different tests have been proposed and we refer the reader to [1357] for a comprehensive review. Several methods exist for phenomenological tests, which assess consistency of the observed signal with the predictions of GR. However, understanding the significance of any constraints that LISA places, and interpreting any deviations that are identified requires models for deviations from the predictions of GR in alternate theories. We require strong-field predictions, most likely relying on numerical simulations, to compare to the merger signals from MBH binaries, which will be observable with high significance. We also require predictions for the sorts of deviations that might be present in the inspiral phase of EMRIs. The latter need to be accurate to a part in 105 and must allow for confusion between gravitational effects and effects of astrophysical origin, e.g. the presence of matter in the system, perturbations from distant objects, etc.

7.3. Computationally efficient models

As described above, a significant obstacle to data analysis for future detectors is computational. In order to analyse a large number of potential signals and search large parameter spaces, we require models that capture all the main physics reliably but can be evaluated rapidly. We describe the current status and outstanding challenges here.

7.3.1. Reduced-order models.

In the context of LIGO/Virgo, interest in developing computationally efficient representations of waveform models arose due to the high cost of parameter estimation algorithms [1341]. This led to the development of reduced-order or surrogate models. The gstLAL search algorithm [1339, 1340] uses a singular-value decomposition (SVD) of the signal and noise parameter spaces to efficiently identify candidate signals. However, for parameter estimation we also need to quickly map such a representation onto physical parameters. One approach is to take Fourier representations of gravitational waveforms, construct an SVD of both of these and finally construct a fit to the dependence of the SVD coefficients on source parameters [1330, 1358]. An alternative approach is to find a reduced-order representation of the waveform parameter space. A set of templates covering the whole parameter space is constructed and then waveforms are added sequentially to the reduced-basis set using a greedy algorithm. At each stage the template that is least well represented by the current reduced basis is added to the set [1359, 1360]. This procedure is terminated once the representation error of the basis is below a specified threshold, and typically the number of templates in the final basis is one or more orders of magnitude less than the size of the original set. Representing an arbitrary template with the basis is then achieved by requiring the basis representation to match the waveform at a specific set of points, i.e. interpolation rather than projection. These points are chosen by another greedy algorithm — picking the points at which the difference between the next basis waveform and its interpolation on the current basis is biggest. This method of representation then allows the likelihood of the data to be written as a quadrature rule sum over the values of the desired waveform at the interpolation points [1361]. This is typically a far smaller number of points than the original time series and so is much cheaper to evaluate than the full waveform. For numerical waveforms an additional step is needed in which the values of the waveform at the desired points is interpolated across parameter space [1362].

The first GW application of this reduced-order quadrature method used a simple sine-Gaussian burst model [1361] and the first implementation within the LIGO/Virgo analysis infrastructure was for a NS waveform model [1363]. Reduced-order models and associated quadrature rules have subsequently been developed for a number of other waveform models, including NR simulations for higher mass BH binaries [11151117] and phenomenological inspiral-merger-ringdown waveform models [1364]. Parameter estimation for GW170817 [21] would arguably not have been possible within the timescale on which that paper was written if the reduced order model waveform had not been available.

Some of the existing reduced-order models and reduced-order quadratures will be useful for analysis of data from future detectors. However, the increased duration of signals will typically increase the size of the reduced basis and so work will be needed to optimize the models. In addition, new types of waveform models including higher spins, lower mass ratios or eccentricities do not yet have available reduced-order models, and so these will have to be developed for application to LISA searches for massive BHs and EMRIs. It is possible that some eventual advanced phenomenological models may on their own provide a competitive alternative to Reduced-order models.

7.3.2. Kludges.

To model EMRIs, 'kludge' models have been developed that capture the main qualitative features of true inspiral signals. There are two main kludge approaches. The analytic kludge (AK [702]) starts from GW emission from Keplerian orbits, as described by [142, 1365], and then adds various strong-field effects on top—evolution of the orbit through GW emission, perihelion precession and orbital-plane precession. The model is semi-analytic and hence cheap to generate, which means it has been used extensively to scope out LISA data analysis [694, 1366, 1367]. However, it rapidly dephases from true EMRI signals. Versions of the AK that include deviations from GR arising from an excess quadrupole moment [1368] and generic changes in the spacetime structure [1369] have also been developed. The numerical kludge model (NK) is built around Kerr geodesic trajectories, with the parameters of the geodesic evolved as the object inspirals, using a combination of PN results and fits to perturbative computations [1370]. A waveform is generated from the trajectory by identifying the Boyer-Lindquist coordinates in the Kerr spacetime with spherical polar coordinates and using a flat-space GW emission formula [1371]. The NK model is quite accurate when compared to trajectories computed using BH perturbation theory [1371]. Versions of the NK model including conservative GSF corrections have also been developed [1372].

Current kludge models are fast but not sufficiently accurate to be used in LISA data analysis. The NK model can be improved using fits to improved perturbative results as these become available. An osculating element formalism has been developed [1373, 1374] that can be used to compute the contribution appropriate corrections to the phase and orbital parameter evolution for an arbitrary perturbing force (see [780] for an example using the GSF). This model is likely to be sufficiently reliable for use in preliminary data analysis, but will have to be continually improved as the GSF calculations described in section 2 are further developed. Although much faster than GSF models, the NK model is likely to be too expensive computationally for the first stages of data analysis, in which source candidates are identified using a large number of templates. The AK model in its current form is potentially fast enough, but is not sufficiently accurate. However, recently a hybrid model called the 'augmented analytic kludge' was proposed [877, 1375] that corrects the AK phase and orbital evolution equations so that they match numerical kludge trajectories. The 'augmented analytic kludge' model is almost as fast as the AK model and is almost as accurate as the NK model, so it is a promising approach to data analysis, though much work is needed to develop the preliminary model to include the full range of physical effects.

Computationally efficient models will be essential for identifying sources in data from current and future GW detectors. However, they can only be constructed if accurate physical waveform models have been developed, and the final stage of any parameter estimation pipeline must use the most accurate physical model available in order to best extract the source parameters. Accurate source modelling underpins all GW data analysis.

8. The view from mathematical GR

Contributor: Piotr T Chruściel

The detection of gravitational waves presents formidable challenges to the mathematical relativist: can one invoke mathematical GR to provide an unambiguous interpretation of observations, and a rigorous underpinning for some of the interpretations already made? Here we briefly discuss some of the issues arising, focusing on questions that are tractable using available methods in mathematical GR. We shall sidestep a host of other important problems, such as that of the global well-posedness of the Cauchy problem for binary BHs, or the related problem of cosmic censorship, which are currently completely out of reach to mathematical GR.

8.1. Quasi-Kerr remnants?

A working axiom in the GW community is that Kerr BHs exhaust the collection of BH end states. There is strong evidence for this, based on the 'no-hair' theorem, which asserts that suitably regular stationary, analytic, non-degenerate, connected, asymptotically flat vacuum BHs belong to the Kerr family; see [1376] for precise definitions and a list of many contributors (see also sections 4.1 and 4.2 of chapter III for a discussion on no-hair theorems in the context of beyond-GR phenomenology). The analyticity assumption is highly undesirable, being rather curious: it implies that knowledge of a metric in a small space-time region determines the metric throughout the universe. The analyticity assumption was relaxed in [1377] for near-Kerr configurations, but the general case remains open. Next, the non-degeneracy assumption is essentially that of non-vanishing surface gravity: the maximally spinning Kerr BHs are degenerate in this sense. One often discards the extremal-Kerr case by declaring that it is unstable. This might well be the case, but it is perplexing that the spin range of observed SMBH candidates is clearly biased towards high spin, with a possible peak at extremality [261, 1378]. (Note, however, that these studies have an a priori assumption built-in, that the BHs are Kerr.) Finally, while the connectedness assumption is irrelevant when talking about an isolated BH, it is quite unsatisfactory to have it around: what mechanism could keep a many-BH vacuum configuration in a stationary state? All this makes it clear that removing the undesirable assumptions of analyticity, non-degeneracy, and connectedness in the uniqueness theorems should be a high priority to mathematically-minded relativists.

8.2. Quasi-normal ringing?

The current paradigm is that, after a merger, the final BH settles to a stationary one by emitting radiation which, at the initial stage, has a characteristic profile determined by quasi-normal modes. This characteristic radiation field provides a very useful tool for determining the final mass and angular momentum of the solution. (Note that the extraction of these parameters from the ringdown profile assumes the final state to be Kerr, taking us back to the issues raised in the previous paragraph...) All this seems to be well supported by numerics. But we are still far behind by way of mathematical proof. Work by Dyatlov [1379, 1380] has rigorously established that quasi-normal modes are part of an asymptotic expansion of the time-behaviour of linear waves on the Kerr-de Sitter background with a nonzero cosmological constant. The asymptotically flat Kerr case turns out to be much more difficult to tackle, and so far no rigorous mathematical statements are available for it in the literature. The whole problem becomes even more difficult when non-linearities are introduced, with no results available so far. The recently announced proofs of nonlinear stability of the Schwarzschild solution within specific families of initial data [1381, 1382] might serve as a starting point for further studies of this important problem.

8.3. Quasi-local masses, momenta and angular momenta?

It was mind-blowing, and still is, when LIGO detected two BHs of about 35 and 30 solar masses merging into one, of about 60 solar masses. But the question arises, what do these numbers mean? Assuming the end state to be Kerr, the last part of the statement is clear. But how can the pre-merger masses be determined, or even defined, given the non-stationarity of the BBH configuration? Each NR group uses its own method for assigning mass and spin parameters to the initial datasets used in their simulations, and it is not a priori clear how these numbers relate to each other. EOB models likewise employ their own definitions of mass and spin. Any differences are most likely irrelevant at the current level of detection accuracy. But one hopes that it will become observationally important at some stage, and is a fundamental issue in our formulation of the problem in any case.

To address the issue requires choosing benchmark definitions of quasi-local mass, momentum, and angular momentum that can be blindly calculated on numerical initial datasets, without knowing whether the data were obtained, e.g. by solving the spacelike constraints with an Ansatz, or by evolving some other Ansatz, or by matching spacelike and characteristic data. One can imagine a strategy whereby one first locates the apparent horizons within a unique preferred time-slicing (e.g. maximal, keeping in mind that apparent horizons are slicing-dependent), and then calculates the chosen quasi-local quantities for those. There exists a long list of candidates for quasi-local mass, with various degrees of computational difficulty, which could be used after extensive testing, all to be found in [1383]; alphabetically: Bartnik's, Brown-York's, Hawking's (directly readable from the area), Kijowski's, Penrose's, with the currently-most-sophisticated one due to Chen, Wang and Yau (see [1384] and references therein). The issue is not whether there is an optimal definition of quasi-local quantities, which is an interesting theoretical question on its own, unlikely to find a universally agreed answer, but whether there is a well-defined and computationally convenient one that can be used for scientific communication. Such a definition would need to have an unambiguous Newtonian limit, necessary for making contact with non-GR astrophysical observations.

8.4. Quasi-mathematical numerical relativity?

To an outsider, NR looks like a heroic struggle with a quasi-impossible task, which, after years of inspired attempts, resulted in a maze of incredibly complicated codes that manage to perform the calculations related to the problem at hand. It is conceivable that there is no way to control the whole construction in a coherent way. However, some mathematically minded outsiders would like to get convinced that the numerical calculations are doing what they are supposed to do; compare [1385, 1386]. In other words, is it possible to show that, at least some, if not most, if not all of the current numerical approximations to Einstein equations would converge to a real solution of the problem at hand if the numerical accuracy could be increased without limit? Standard convergence tests, or checks that the constraints are preserved, are of course very important, but they cannot on their own settle the point. A more rigorous proof of convergence is desirable.

Chapter III:. Gravitational waves and fundamental physics

Editor: Thomas P Sotiriou

1. Introduction

Gravity is arguably the most poorly understood fundamental interaction. It is clear that General Relativity (GR) is not sufficient for describing the final stages of gravitational collapse or the very early universe. Additionally, a deeper understanding of the role of gravitation seems to be a necessary ingredient for solving almost any other major challenge in fundamental physics, cosmology, and astrophysics, such as the hierarchy problem, or the dark matter (DM) and dark energy problems. Gravitational waves (GWs) promise to turn gravity research into a data-driven field and potentially provide much needed insights. However, one needs to overcome a major obstacle first: that of extracting useful information about fundamental physics from GW observations. The reason this is a challenge should have become apparent in the previous chapter. The signal is buried inside noise and extracting it requires precise modelling. Doing so in GR is already a major feat and it only gets harder when one tries to add new ingredients to the problem. Nonetheless there is very strong motivation. Black hole binary (BBH) mergers are among the most violent events in the universe and some of the most interesting and exotic phenomena are expected to take place in the vicinity of BHs.

This chapter focusses on how BHs can be used to probe fundamental physics through GWs. The next section sets the background by discussing some examples of beyond-GR scenarios. Section 3 summarises the techniques that are being used or developed in order to probe new fundamental physics through GWs. Section 4 is devoted to the efforts to probe the nature and structure of the compact objects that are involved in binary mergers. Finally, section 5 provide a thorough discussion on what GW observation might reveal on the nature of DM.

2. Beyond GR and the standard model

2.1. Alternative theories as an interface between new physics and observations

The most straightforward way to test new fundamental physics with GWs from BBH coalescences would be the following: select one's favourite scenario and model the system in full detail, extract a waveform (or better a template bank), and then look for the prediction in the data. The technical difficulties of doing so will become apparent below, where it will also be apparent that the required tools are at best incomplete and require further development. However, there is a clear non-technical drawback in this approach as well: it assumes that one knows what new fundamental physics to expect and how to model it to the required precision. One should contrast this with the fact that most quantum gravity candidates, for example, are not developed enough to give unique and precise predictions for the dynamics of a binary system. Moreover, ideally one would want to obtain the maximal amount of information from the data, instead of just looking for very specific predictions, in order not to miss unexpected new physics. Hence, the question one needs to ask is what is the optimal way to extract new physics from the data?

GW observations test gravity itself and the way matter interacts through gravity. Hence, at the theoretical level, they can test GR and the way GR couples to the Standard Model (SM) and its extensions. This suggests clearly that the new fundamental physics that can leave an imprint on GW observations can most likely be effectively modelled using an alternative theory of gravity. Recall that alternative theories of gravity generically contain extra degrees of freedom that can be nonminimally coupled to gravity.

The advantages of this approach are: (i) alternative theories of gravity can act as effective field theories for describing certain effects and phenomena (e.g. violations of Lorentz symmetry or parity) or be eventually linked to a specific more fundamental theory; (ii) they can provide a complete framework for obtaining predictions for binary evolutions and waveforms, as they come with fully nonlinear field equations; (iii) their range of validity is broad, so they allow one to combine constraints coming from the strong gravity regime with many other bounds from e.g. the weak field regime, cosmology, astrophysics, laboratory tests, etc. The major drawback of this approach is that it requires theory-dependent modelling, which can be tedious and requires one to focus on specific alternative theories of gravity.

In this section we will give a brief overview of some alternative theories that can be used to model new physics in GW observations. This is not meant to be a comprehensive list nor is it our intent to pinpoint interesting candidates. We simply focus on theories that have received significant attention in the literature in what regards their properties in the strong gravity regime and to which we plan to refer to in the coming sections.

It is worth mentioning that a complementary approach is to use theory-agnostic strong field parametrisations. Their clear advantage is that they simplify the modelling drastically and they render it theory-independent. However, they fall short in points (i)–(ii) above. In specific, it is no always straightforward to physical interpret them, they typically describe only part of the waveform, and they do not allow one to combine constraints with other observations unless interpreted in the framework of a theory. Strong field parametrisations and their advantanges and disadvantages will be discussed in sections 3.13.3.

2.2. Scalar-tensor theories

Contributors: T P Sotiriou, K Yagi

One of the simplest ways to modify GR is to introduce a scalar field that is nonminimally coupled to gravity. In many cases, the term scalar-tensor theories refers to theories described by the action

Equation (21)

where g is the determinant of the spacetime metric $g_{\mu\nu}$ , R is the Ricci scalar of the metric, and Sm is the matter action. $\psi$ is used to collectively denote the matter fields, which are coupled minimally to $g_{\mu\nu}$ . The functions $\omega(\varphi)$ and $V(\varphi)$ need to be specified to identify a specific theory within the class. Brans–Dicke theory is a special case with $\omega=\omega_0=$ constant and $V=0$ [1195].

It is common in the literature to rewrite the action in terms of a different set of variables. In particular, the conformal transformation $\hat{g}_{\mu\nu}=G \varphi \, g_{\mu\nu}$ , where G is Newton's constant, and the scalar field redefinition $2\varphi {\rm d}\phi=\sqrt{2\omega(\varphi)+3} \, {\rm d}\varphi$ , can be employed to bring the action (21) to the form

Equation (22)

where $U(\phi)=V(\varphi)/(G\varphi){}^2$ . The set of variables $(\hat{g}_{\mu\nu}, \phi)$ is known as the Einstein frame, whereas the original set of variables $(g_{\mu\nu},\varphi)$ is known as the Jordan frame. Quantities with a hat are defined with $\hat{g}_{\mu\nu}$ . In the Einstein frame, scalar-tensor theories seemingly take the form of Einstein's theory with a minimally coupled scalar field (hence the name). However, this comes at a price: $\phi$ now couples to the matter field $\psi$ , as can be seen if Sm is written in terms of $\hat{g}_{\mu\nu}$ , $\phi$ , and $\psi$ . Hence, if one sticks to the Einstein frame variables, one would infer that matter experiences an interaction with $\phi$ (fifth force), and hence it cannot follow geodesics of $\hat{g}_{\mu\nu}$ . In the Jordan frame instead, there is no interaction between $\varphi$ and $\psi$ (by definition), and this implies that matter follows geodesics of $g_{\mu\nu}$ . This line of thinking allows one to ascribe a specific physical meaning to the Jordan frame metric, but it does not change the fact that the two frames are equivalent descriptions of the same theory (see [1387] and references therein for a more detailed discussion).

Scalar-tensor theories are well-studied but generically disfavoured by weak field observations, e.g. [1203, 1388]. In brief, weak field tests dictate that the scalar field, if it exists, should be in a trivial configuration around the Sun and around weakly gravitating objects, such as test masses in laboratory experiments. These precision tests translate to very strong constraints on scalar-tensor theories; strong enough to make it unlikely that there can be any interesting phenomenology in the strong gravity regime, which is our focus here. There is one notable known exception [1201]. To introduce these models it is best to first set $V=U=0$ and then consider the equation of motion for $\phi$ , as obtained from varying action (22). It reads

Equation (23)

where $ \newcommand{\e}{{\rm e}} T\equiv T^\mu_{\phantom{a}\mu}$ , $ \newcommand{\e}{{\rm e}} T_{\mu\nu}\equiv -2 (-g){}^{-1/2} \delta S_m/\delta g^{\mu\nu}$ is the Einstein frame stress energy tensor, and

Equation (24)

Solution with $\phi=\phi_0=$ constant are admissible only if $\alpha(\phi_0)=0$ . This condition identifies a special class of theories and translates to $\omega(\varphi_0)\to \infty$ in the Jordan frame, where $\varphi_0$ is the corresponding value of $\varphi$ . For $\phi=\phi_0$ solutions the corresponding metric is identical to the GR solution for the same system. However, these solutions are not unique. It turns out that they are preferable for objects that are less compact than a certain threshold, controlled by the value of $\alpha'(\phi_0)$ . When compactness exceeds the threshold, a nontrivial scalar configuration is preferable, leading to deviations from GR [1197, 1201, 1389]. The phenomenon is called spontaneous scalarization and it is known to happen for neutron stars (NSs).

A NSB endowed with non-trivial scalar monopolar configurations would lose energy through dipole emission [1390] and this would affect the orbital dynamics. More specifically, this would be an additional, lower-order, contribution to the usual quadrupolar GW emission that changes the period of binary pulsars. Hence, binary pulsar constraints are extremely efficient in constraining spontaneous scalarization and the original scenario is almost ruled out [1391, 1392], with the exception of rapidly rotating stars. However, dipolar emission can be avoided if the scalar-scalar interaction between the stars is suppressed. This is the case is the scalar is massive [1209, 1219]. The mass defines a characteristic distance, beyond which the fall-off of the scalar profile is exponential (as opposed to 1/r), so if the value of the mass is in the right range, the stars can be scalarized and yet the binary will not exhibit any appreciable dipolar emission above a given separation. One can then avoid binary pulsar constraints and still hope to see some effect in GW emission from the late inspiral, merger, and ringdown. There is also the possibility that NSs will dynamically scalarize once they come close enough to each other [1222, 1223, 13931395].

It should be noted that the known scalarization scenario faces the following issue. Usually, calculations assume flat asymptotics with a vanishing gradient for the scalar. However, realistically the value of the scalar far away from the star changes at cosmological timescales and it turns out that this change is sufficient to create conflict with observations [1396, 1397]. This problem can be pushed into the future by tuning initial data, so it can technically be easily avoided. Nonetheless, one would eventually need to do away with the tuning, presumably by improving the model.

Black holes in scalar-tensor theories have received considerable attention, most notably in the context of no-hair theorems (e.g. [13981400]) and their evasions (see also [1401] for a discussion). Sections 4.1 and 4.2 are entirely devoted to this topic, so we will not discuss it further here.

The action of equation (21) is the most general one can write that is quadratic in the first derivatives of the scalar, and hence it presents itself as a sensible effective field theory for a scalar field nonminimally coupled to the metric. However, this action can be generalized further if one is willing to include more derivatives. Imposing the requirement that variation leads to field equations that are second order in derivatives of both the metric and the scalar leads to the action

Equation (25)

where

Equation (26)

Equation (27)

Equation (28)

Equation (29)

where $ \newcommand{\e}{{\rm e}} X\equiv -\frac{1}{2} \nabla^\mu \phi \nabla_\mu \phi$ , Gi need to be specified to pin down a model within the class, and $ \newcommand{\e}{{\rm e}} G_{iX}\equiv \partial G_i/\partial X$ . Horndeski was the first to write down this action in an equivalent form [1402] and it was rediscovered fairly recently in [1403]. Theories described by this action are referred to as generalized scalar-tensor theories, Horndeski theories, generalized galileons, or simply scalar-tensor theories. We will reserve the last term here for the action (21) in order to avoid confusion. The term generalized galileons comes from the fact that these theories were (re)discovered as curved space generalization of a class of flat space scalar theories that are symmetric under $\phi \to \phi + c_\mu x^\mu +c$ , where $c_\mu$ is a constant one-form and c is a constant (Galilean symmetry) [1404]. In fact, the numbering of the Li terms in the Lagrangian is a remnant of the original Galileons, where the i index indicates the number of copies of the field. This is no longer true in action (25), but now the Li term contains i  −  2 second derivatives of the scalar.

In what regards strong gravity phenomenology, a lot of attention has been given to the shift-symmetric version of action (25). If $G_i=G_i(X)$ , i.e. $\partial G_i/\partial \phi=0$ , then (25) is invariant under $\phi \to \phi+$ constant. This symmetry protects the scalar from acquiring a mass from quantum corrections. It has been shown in [1405] that asymptotically flat, static, spherically symmetric BHs in shift-symmetric Horndeski theories have trivial (constant) scalar configuration and are hence described by the Schwarzschild solution. This proof can be extended to slowly-rotating BHs, in which case the solution is the slowly rotating limit of the Kerr spacetime [1406]. However, there is a specific, unique term in (25) that circumvents the no-hair theorem and leads to BHs that differ from those of GR and have nontrivial scalar configurations (hairy BHs) [1406]: $\phi {{\mathcal G}}$ , where $ \newcommand{\e}{{\rm e}} {{\mathcal G}}\equiv R^2 - 4 R_{\mu\nu} R^{\mu\nu} + R_{\mu\nu\rho\sigma}R^{\mu\nu\rho\sigma}$ is the the Gauss–Bonnet invariant137. This singles out

Equation (30)

as the simplest action within the shift-symmetric Horndeski class that has hairy BHs. $\alpha$ is a coupling constant with dimension of a length squared. Hairy BHs in this theory [1408, 1409] are very similar to those found earlier in the (non-shift-symmetric) theory with exponential coupling [1410, 1411], i.e. $e^\phi {{\mathcal G}}$ . The theory with exponential coupling is known as Einstein-dilaton Gauss–Bonnet theory and it arises as a low-energy effective theory of heterotic strings [1412, 1413].

The subclass of theories described by the action

Equation (31)

seems to have interesting BH phenomenology in general. This theory admits GR solutions if $f'(\phi_0)=0$ for some constant $\phi_0$ . Assuming this is the case, it has recently been proven in [1414] that stationary, asymptotically flat BH solutions will be identical to those of GR, provided $f''(\phi_0){{\mathcal G}}<0$ . A similar proof, but restricted to spherical symmetry, can be found in [1415]. Theories that do not satisfy the $f''(\phi_0){{\mathcal G}}<0$ exhibit an interesting phenomenon [1414, 1416]: BH scalarization, similar to the NS scalarization discussed above. See also [1417] for a study of hairy BH solution in this class of theories and [1418] for a very recent exploration of linear stability.

The same class of theories can exhibit spontaneous scalarization in NSs [1414]. On the other hand, in shift-symmetric Horndeski theories it is known that for NSs the scalar configuration will be trivial [14191421], provided that the $\phi {{\mathcal G}}$ terms is absent. Even if it is there, there will be a faster than 1/r fall-off [1030, 1422]. Hence, for shift-symmetric theories BH binaries are probably the prime strong gravity system of interest.

It should be noted that the speed of GWs in generalized scalar tensor theories can differ from the speed of light under certain circumstances and this has been used to obtain constraint recently [670673, 1423, 1424]. These will be discussed in section 3.1.

As mentioned above, action (25) is the most general one that leads to second order equations upon variation. Nonetheless, it has also been shown that there exist theories outside this class that contain no other degrees of freedom than the metric and the scalar field [14251430]. These models are often referred to as beyond-Horndeski theories.

Last but not least, dynamical Chern–Simons (dCS) gravity [1431, 1432] is a scalar-tensor theory that introduces gravitational parity violation at the level of the field equations. It draws motivation from the standard model [1433], heterotic superstring theory [1434], loop quantum gravity [14351439] and effective field theories for inflation [1440]. dCS gravity is not a member of the Horndeski class. The scalar is actually a pseudo-scalar, i.e. it changes sign under a parity transformation. Moreover, it couples to the Pontryagin density $*R^{\alpha\phantom{a}\gamma\delta}_{\phantom{a}\beta} R_{\phantom{a}\alpha\gamma\delta}^{\beta}$ , where $R_{\phantom{a}\beta\gamma\delta}^{\alpha}$ is the Riemann tensor, $ \newcommand{\e}{{\rm e}} *R^{\alpha\phantom{a}\gamma\delta}_{\phantom{a}\beta} \equiv {1\over2} \epsilon^{\gamma\delta\mu\nu}R_{\phantom{a}\beta\mu\nu}^{\alpha}$ and $ \newcommand{\e}{{\rm e}} \epsilon^{\gamma\delta\mu\nu}$ is the Levi-Civita tensor. This coupling term gives rise to third-order derivatives in the field equations. The fact that the theory has higher-order equations implies that it is unlikely it is actually predictive as it stands [1106] (see also [1441]). Hence, most work has used a specific approach to circumvent that problem that is inspired by effective field theory and relies on allowing the coupling term to introduce only perturbative correction. This issue will be discussed in some detail in section 3.4.

dCS gravity has received a lot of attention in the strong gravity regime. Non-rotating BH and NS solutions are the same as in GR, as such solutions do not break parity. On the other hand, slowly-rotating [14421444], rapidly-rotating [1445] and extremal [1446, 1447] BHs and slowly-rotating NSs [1031] all differ from the corresponding GR solutions. In particular, such rotating compact objects acquire a scalar dipole hair and a correction to the quadrupole moment, which modify gravitational waveforms from compact binary inspirals [14481451] and mergers [1104]. Future GW observations will allow us to place bounds on the theory that are orders of magnitude stronger than the current bounds from Solar System [1452] and table-top experiments [1444]. Another interesting phenomenon in dCS gravity is gravitational amplitude birefringence, where the right-handed (left-handed) circularly-polarized GWs are either enhanced or suppressed (suppressed or enhanced) during their propagation from a source to Earth, which can be used to probe the evolution of the pseudo-scalar field [14531455]. Such GW observations are complementary [1456] to Solar System [1457] and binary pulsar [1458, 1459] probes.

For a general introduction to standard scalar-tensor theories see [1194, 1460] and for generalized scalar-tensor theories see [1461], and for a recent brief review on gravity and scalar fields see [1462].

2.3. Lorentz violations

Contributor: T P Sotiriou

Lorentz symmetry is a fundamental symmetry for the standard model of particle physics, hence it is important to question whether it plays an equally important role in gravity. To address this question one needs to study Lorentz-violating (LV) theories of gravity for two distinct reasons: (i) to quantify how much one is allowed to deviated from GR without contradicting combined observations that are compatible with Lorentz symmetry one needs to model the deviations in the framework of a consistent theory; (ii) studying the properties of such theories can provide theoretical insights. We will briefly review two examples of LV theories here.

Einstein-æther theory (æ-theory) [1463] is described by the action

Equation (32)

where

Equation (33)

ci are dimensionless coupling constants and the æ ther $u_\mu$ is forced to satisfy the constraint $ \newcommand{\e}{{\rm e}} u^2\equiv u^\mu u_\mu=1$ . This constraint can be enforced by adding to the action the Lagrange multiplier term $\lambda (u^2-1)$ or by restricting the variation of the æ ther so as to respect the constraint. Due to the constraint, the æ ther has to be timelike, thereby defining a preferred threading of spacetime by timelike curves, exactly like an observer would. It cannot vanish in any configuration, including flat space. Hence, local Lorentz symmetry, and more precisely boost invariance, is violated. (32) is the most general action that is quadratic in the first derivatives of a vector field that couples to GR and satisfies the unit constraint. Indeed, æ -theory is a good effective field theory for Lorentz-symmetry breaking with this field content [1464]. It should be emphasised that the æ ther brings three extra degrees of freedom into the theory, two vector modes and a scalar (longitudinal) mode. The speed of any mode, including the usual spin-2 mode that corresponds to GWs, can differ from the speed of light and is controlled by a certain combination of the ci parameters.

One can straightforwardly obtain an LV theory with a smaller field content from action (32) by restricting the æ ther to be hypersurface orthogonal [1465]. This amounts to imposing

Equation (34)

where T is a scalar and the condition is imposed before the variation (i.e. one varies with respect to T). This condition incorporates already the unit constraint u2  =  1 and removes the vector modes from the theory. It can be used to rewrite the action in terms of the metric and T only. T cannot vanish in any regular configuration that solves the field equations and its gradient will always be timelike. Hence, the æ ther is always orthogonal to some spacelike hypersurfaces over which T is constant. That is, T defines a preferred foliation and acts as a preferred time coordinate. Note that it appears in the action only through $u_\mu$ and the latter is invariant under $T\to f(T)$ , so this is a symmetry of the theory and the preferred foliation can be relabelled freely.

One can actually use the freedom to choose the time coordinate and write this theory directly in the preferred foliation [1465]. Indeed, if T is selected as the time coordinate, the T  =  constant surfaces define a foliation by spacelike hypersurfaces with extrinsic curvature Kij and N and Ni are the lapse function and shift vector of this foliation. Then $u_\mu=N \delta^0_\mu$ , and action (32) takes the form

Equation (35)

where $ \newcommand{\e}{{\rm e}} a_i\equiv \partial_i \ln N$ and

Equation (36)

Since T was chosen as a time coordinate, action (35) is no longer invariant under general diffeomorphisms. It is, however, invariant under the subset of diffeomorphisms that preserve the folation, or else the transformations $T\to T'=f(T)$ and $x^i\to x'^i=x'^i(T,x^i)$ , which are now the defining symmetry of the theory from a EFT perspective. In fact, this action can be thought of as the infrared (low-energy) limit of a larger theory with the same symmetry, called Hořava gravity [1466]. The most general action of Hořava gravity is [1467]

Equation (37)

where L2 is precisely the Lagrangian of action (35). L4 and L6 collectively denote all the operators that are compatible with this symmetry and contain four and six spatial derivatives respectively (in the T foliation). The existence of higher order spatial derivatives implies that perturbation will satisfy higher order dispersion relations and $M_\star$ is a characteristic energy scale that suppresses the corresponding terms. Hořava gravity has been argued to be power-counting renormalizable and hence a quantum gravity candidate [1466]. The basic principle is that the higher-order spatial derivative operators modify the behaviour of the propagator in the ultraviolet and remove/regulate divergences (see [1468, 1469] for a discussion in the context of a simple scalar model). The projectable version of the theory [1466, 1470], in which the lapse N is restricted to have vanishing spatial derivatives, has actually been shown to be renormalizable (beyond power-counting) in four dimensions [1471], whereas the 3-dimensional version [1472] is asymptotically [1473]. This restricted version does however suffer from serious viability issues at low energies [1466, 14741477].

We will not discuss the precise form of L4 and L6 any further, as it not important for low-energy physics. It should be stressed though that such terms lead to higher order corrections to the dispersion relation of GWs and that of the extra scalar excitation Hořava gravity propagates. This implies that the speeds of GW polarizations can differ from the speed of light and acquire a frequency dependence. The relevant constraints from GW observations are discussed in section 3.1 (see also [1478] for a critical discussion). At large wavenumber k, the dispersion relations scale as $\omega^2 \propto k^6$ (since by construction the leading order ultraviolet operators in the action contain two temporal and six spatial derivatives) and this implies that an excitation can reach any speed, provided it has short enough wavelength. More surprisingly, it has been shown that Hořava gravity exhibits instantaneous propagation even in the infrared limit, described by action (35). In this limit, dispersion relations are linear but, both at perturbative [1479] and nonperturbative level [1480], there exists an extra degree of freedom that satisfies an elliptic equation on constant preferred time hypersurfaces. This equation is not preserved by time evolution and hence it transmits information instantaneously within these hypersurfaces. For a detailed discussion see [1481].

It should be clear from the discussion above that both æ -theory and Hořava gravity can exhibit superluminal propagation and this makes BHs particularly interesting in these theories. For both theories BHs differ from their GR counterparts and have very interesting feature that will be discussed further in section 4.2. For a review see [1482].

For early reviews on æ -theory and Hořava gravity see [1483] and [1484], respectively. For the most recent combined constraints see [1485, 1486] and references therein.

2.4. Massive gravity and tensor-tensor theories

Contributor: S F Hassan

In the appropriate limit GR reproduces Newton's famous inverse square law of gravitation. At the same time, linear excitations in GR (GWs) satisfy a linear dispersion relation without a mass term. In particle theory terminology, both of these statements imply that the graviton—the boson that mediates the gravitational interaction—is massless. Extensions of GR that attempt to give the graviton a mass are commonly referred to as massive gravity, bimetric gravity, and multimetric gravity theories. One of the original motivations for the study of these theories was the hope that they could provide a resolution to the cosmological constant problem by changing the gravitational interaction beyond some length scale (controlled by the mass of the graviton). However, a more fundamental motivation emerges from comparing the structure of GR with that of the standard model of particle physics that successfully accounts for all known particles interactions. This is elaborated below.

The standard model describes particle physics in terms of fields of spin 0 (the Higgs field), spin 1/2 (leptons and quarks) and spin 1 (gauge bosons). At the most basic level, these fields are governed by the respective field equation, namely, the Klein–Gordon equation, the Dirac equation and the Maxwell/Yang–Mills/Proca equations. The structures of these equations are uniquely fixed by the spin and mass of the field along with basic symmetry and consistency requirements, in particular, the absence of ghost instabilities, for example, in spin-1 theories. From this point of view, GR is the unique ghost-free theory of a massless spin-2 field; the metric $g_{\mu\nu}$ . Then the Einstein equation, $R_{\mu\nu} - \frac{1}{2} g_{\mu\nu} R=0$ , with all its intricacies is simply the counterpart for a spin 2 field of the massless Klein–Gordon equation $\Box \phi=0$ , or of the Maxwell equation $\partial_\mu F^{\mu\nu}=0$ . However, to describe physics, the standard model contains a second level of structure: At the fundamental level, all fields appear in multiplets allowing them to transform under symmetries, in particular under gauge symmetries which are in turn responsible for the renormalizability and hence, for the quantum consistency of the theory. The observed fields are related to the fundamental ones through spontaneous symmetry breaking and through mixings. For example, to obtain a single Higgs field, one needs to start with four scalar fields arranged in a complex $SU(2)$ doublet, rather than the one scalar field of the final theory. Similarly, although electrodynamics is very well described by Maxwell's equations as a standalone theory, its origin in the standard model is much more intricate, requiring the spontaneous breaking of a larger gauge symmetry and a specific mixing of the original gauge fields into the vector potential of electromagnetism. Now, one may contrast these intricate features of the standard model with GR which is the simplest possible theory of a single massless spin 2 field with no room for any further structure. The comparison suggests embedding GR in setups with more than one spin 2 field and investigating the resulting models. The aim would be to see if the new structures could address some of the issues raised above, either in analogy with the standard model, or through new mechanisms. This will shed light on a corner of the theory space that has remained unexplored so far.

However, implementing this program is not as straightforward as it may seem. It was known since the early 70's that adding other spin 2 fields to GR generically leads to ghost instabilities. The instability was first noted in the context of massive gravity, which contains an interaction potential between the gravitational metric $g_{\mu\nu}$ and a nondynamical predetermined metric $f_{\mu\nu}$ . However it also plagues dynamical theories of two or more spin 2 fields. Recent developments since 2010 have shown that one can indeed construct ghost free theories of massive gravity (with a fixed auxiliary metric) [1487, 1488], or fully dynamical ghost free bimetric theories [1489]. In particular, the bimetric theory is the theory of gravitational metric that interacts with an extra spin-2 field in specific ways and contains several parameters. It propagates one massless and one massive spin-2 mode which are combinations of the two metrics. Consequently, it contains massive gravity and GR as two different limits. For a review of bimetric theory see [1490].

In the massive gravity limit, as the name suggests, the theory has a single massive spin-2 field that mediates gravitational interactions in the background of a fixed auxiliary metric. The most relevant parameter then is the graviton mass which is constrained by different observations (for a review see, [1491]). In particular, consistency with the detection of GWs by LIGO restricts the graviton mass to $m < 7.7\times 10^{-23}~{\rm eV}$ [11, 17]. However, in this limit, it becomes difficult to describe both large scale and small scale phenomena consistently with the same fixed auxiliary metric.

Of course, the bimetric theory is more relevant phenomenologically close to the GR limit, where gravity is described by a mostly massless metric, interacting with a massive spin-2 field. Now the two most relevant parameters are the spin-2 mass m and the Planck mass M of the second spin-2 field, $M\ll M_P$ , where MP is the usual Planck mass of the gravitational metric. The presence of the second parameter M significantly relaxes the bounds on the spin-2 mass m. In the most commonly assumed scenarios, the stability of cosmological perturbations in the very early Universe requires $M \lesssim 100~ {\rm GeV}$ [1492]. In this case, GWs will contain a very small admixture of the massive mode, hence, m is not strongly constrained by LIGO and can be large. A new feature is that now the massive spin-2 field can be a DM candidate. Depending on the specific assumptions, the relevant allowed mass ranges are estimated as $10^{-4}~ {\rm eV}\lesssim m \lesssim 10^7~ {\rm eV}$ [1493], or the higher range, $1 ~{\rm TeV} \lesssim m \lesssim 100 ~{\rm TeV}$ [1494].

While some studies of cosmology in bimetric theory have been carried out, many important features of the theory are yet to be fully explored. One of these is the problem of causality and the well-posedness of the initial value problem. In particular, bigravity provides a framework for investigating models where the causal metric need not always correspond to the gravitational metric [1495, 1496]. Another interesting question is if the bimetric interaction potential can be generated through some kind of Higgs mechanism, in analogy with the Proca mass term for spin 1 fields. From a phenomenological point of view, it is interesting to investigate the massive spin 2 field present in the theory as a DM candidate. Strong field effects in bimetric gravity are expected to show larger deviations from GR as compared to weak field effects.

While ghost free bimetric models have interesting properties, they do not admit enlarged symmetries, since the two spin 2 fields have only nonderivative interactions. Constructing ghost free theories of spin 2 multiplets with gauge or global symmetries is a more difficult problem which requires constructing ghost free derivative interactions. So far, this is an unresolved problem which needs to be further investigated. If such theories exist, they will provide the closest analogues of the standard model in the spin 2 sector.

3. Detecting new fields in the strong gravity regime

As has been discussed in some detail in section 2, if new physics makes an appearance in the strong gravity regime, it can most likely be modelled as a new field that couples to gravity. In order to detect this field with GW observations, one has to model its effects on the dynamics of the binary system and produce waveforms that can be compared with observations. The three distinct phases of a binary coalescence—inspiral, merger, and ringdown—require different modelling techniques. The inspiral and the ringdown are susceptible to different types of perturbative treatment.

During most of the inspiral, the two members of the binary are well separated and, even though they are strongly gravitating objects individually, they interact weakly with each other. Hence, one can model this interaction perturbatively and this modeling can be used to produce a partial waveform for the inspiral phase. It should be stressed that this gives rise to a perturbation expansion whose parameters depend on the nature and structure of the members of the binary. Computing these parameters, known as sensitivities, generally requires modelling the objects composing the binary in a nonperturbative fashion. Moreover, if the objects are endowed with extra field configurations, then one needs to model them as translating in an ambient field (in principle created by the companion), in order to faithfully capture the extra interaction. This is the same calculation one performs to obtain constraints from binary pulsars [1396, 1497]. An alternative is to try to prepare theory-agnostic, parametrised templates. Three different ways to achieve this will be discussed in detail in section 3.2.

The ringdown can be modelled with linear perturbation theory around a known configuration for a compact object, but there are significant challenges. The most obvious ones are: (i) one needs to first have fully determined the structure of the quiescent object that will be used as background; (ii) the equations describing linear perturbations around such a background are not easy to solve (e.g. separability is an issue); (iii) both the background and the perturbation equations depend crucially on the scenario one is considering. An in-depth discussion in perturbation theory around BHs, and its use to describe the ringdown phase, will be given in section 3.3.

The merger phase is highly nonlinear and modeling it requires full-blown numerical simulations. A prerequisite for performing such simulations is to have an initial value formulation and to show that the initial value problem (IVP) is well-posed (in an appropriate sense, as will be discussed in more detail below). This has not been addressed for the vast majority of gravity theories that are currently being considered. Section 3.4 focuses on the current state of the art in modeling the merger phase. It contains a critical discussion of the IVP in alternative theories of gravity and a summary of known numerical results.

Last but not least, one could attempt to detect new fields (polarisations) by the effect they have on GW propagation, as opposed to generation. This amounts to parametrizing the dispersion relation of the GWs and constraining the various parameters that enter this parameterisation, such as the mass, the speed, and the parameters of terms that lead to dispersive effects. The next section is devoted to propagation constraints.

3.1. Gravitational wave propagation

Contributor: T P Sotiriou

According to GR, GWs are massless and propagate at the speed of light, c. Hence, they satisfy the dispersion relation $E^2=c^2 p^2$ , where E is energy and p  is momentum. Deviations from GR can introduce modifications to this dispersion relation. LIGO uses the following parametrisation to model such deviations: $E^2=c^2 p^2+A c^\alpha p^\alpha$ , $\alpha>0$ [1498]138. When $\alpha=0$ , A represents a mass term in the dispersion relation. The current upper bound on the the mass from GW observations is $7.7\times 10^{-23}~{\rm eV}/c^2$ [11]. When $\alpha=2$ , A represents a correction to the speed of GWs. The fact that GWs from the NSB merger GW170817 almost coincided with the gamma ray burst GRB 170817A has given a spectacular double-sided constraint on the speed of GWs, which should agree with the speed of light to less than a part in 1015 [21, 23].

All other corrections introduce dispersive effects which accumulate with the distance the wave had to travel to reach the detector. This enhances the constraints in principle. However, it should be noted that A has dimensions $[A]=[E]^{2-\alpha}$ , which implies that corrections that have $\alpha>2$ are suppressed by positive powers of $c p/M_\star$ . $M_\star$ denotes some characteristic energy scale and it is useful to think of the constraints on A as constraints on $M_\star$ . GWs have long wavelengths, so one expects $\alpha>2$ corrections to be heavily suppressed unless the aforementioned energy scale is unexpectedly small. For instance, for $\alpha=3$ the current upper bound on $M_\star$ is in the $10^9~{\rm GeV}$ range, whereas for $\alpha=4$ it drops well below the eV range [11, 1478, 1498, 1500].

It is important to stress that all of the aforementioned constraints need to be interpreted within the framework of a theory (as is the case for any parametrisation). The mass bound can be turned to a constraint on the mass of the graviton in massive gravity theories, discussed in section 2.4. Interestingly, it is not the strongest constraint one can obtain [1491, 1501]. All other bounds can in principle be interpreted as constraints on Lorentz symmetry. However, terms with odd powers of $\alpha$ require spatial parity violations as well and the bounds on terms with $\alpha>2$ are probably too weak to give interesting constraints on Lorentz-violating gravity theories [1478]. The double-sided bound on the speed of GWs certainly yields a very strong constraint on Lorentz-violating theories of gravity. However, one has to take into account that such models generically have a multidimensional parameter space even at low energies and generically exhibit extra polarisations, the existence/absence of which can potentially give stronger or additional constraints [1478]. For details on the implications of this bound for Hořava gravity and for Einstein-æther theory, discussed in section 2.3, see [1485] and [1486] respectively.

Lorentz symmetry breaking is not the only scenario in which GWs propagate at a different speed than light. Lorentz invariant theories can exhibit such behaviour as well if they satisfy two conditions: (i) they have extra fields that are in a nontrivial configuration between the emission and the detection locations; (ii) these fields couple to metric perturbations in a way that modifies their propagation. The second conditions generally requires specific types of nonminimal coupling between the extra fields and gravity. It is satisfied by certain generalised scalar tensor theories [1424], discussed in section 2.2. Indeed, the bound on the speed has been used to obtain strong constraints on such theories as models of dark energy [670673, 1423, 1424]. It should be stressed, however, that, due to condition (i), such constraints are inapplicable if one does not require that the scalar field be the dominant cosmological contribution that drives the cosmic speed-up. Hence, they should not be interpreted as viability constraints on the theories themselves, but only on the ability of such models to account for dark energy.

In summary, modifications in the dispersion relation of GWs can be turned into constraints on deviations from GR. How strong these constraints are depends on the theory or scenario one is considering. The clear advantage of propagation constraints is that they are straightforward to obtain and they do not require modeling of the sources in some alternative scenario. This reveals a hidden caveat: they are intrinsically conservative, as they assume that there is no deviation from the GR waveform at the source. Moreover, as should be clear from the discussion above, propagation constraints have other intrinsic limitations and can only provide significant constraints for models that lead to deviations in the dispersion relation of GWs for relatively long wavelengths. Hence, one generally expects to get significantly stronger or additional constraints by considering the deviation from the GR waveform at the source.

3.2. Inspiral and parametrized templates

Contributor: K Yagi

3.2.1. Post-Newtonian parametrisation.

Rather than comparing GW data to template waveforms in non-GR theories to probe each theory one at a time, sometimes it is more efficient to prepare parameterized templates that capture deviations away from GR in a generic way so that one can carry out model-independent tests of gravity with GWs. We will mainly focus on the parameterized modification in the inspiral part of the waveform and discuss what needs to be done to construct parameterized waveforms in the merger-ringdown part of the waveform.

One way to prepare such parameterized templates is to treat coefficients of each post-Newtonian (PN) term in the waveform phase as an independent coefficient [15021504]. For gravitational waveforms of non-spinning BBH mergers in GR, such coefficients only depend on the masses of individual BHs. Thus, if GR is correct, any two measurements of these coefficients give us the masses, and the measurement of a third coefficient allows us to carry out a consistency test of GR. This idea is similar to what has been done with binary pulsar observations with measurements of post-Keplerian parameters [1505, 1506]. Though, it is nontrivial how to perform such tests for spinning BBHs. Moreover, this formalism does not allow us to probe non-GR corrections entering at negative PN orders due to e.g. scalar dipole emission.

3.2.2. Parameterized post-Einsteinian (ppE) formalism.

In this formalism the modified waveform in the Fourier domain is given by $\tilde h (\,f) = \tilde h_{{\rm GR}}(\,f)\, {\rm e}^{{\rm i} \beta v^{2n-5}}$ [1507]. Here $\tilde h_{{\rm GR}}$ is the GR waveform, $\beta$ is the ppE parameter representing the magnitude of the non-GR deviation while $v$ is the relative velocity of the binary components. The above correction enters at nth PN order. Since n is arbitrary, this formalism can capture deviations from GR that enter at negative PN orders. It has been used in [1500] to probe strong and dynamical field gravity with GW150914 [1] and GW151226 [2]. Bounds on $\beta$ at each PN order using a Fisher analysis are shown in figure 10. Here, the authors included non-GR corrections only in the inspiral part of the waveform. Having these bounds on generic parameters at hand, one can map them to bounds on violations in fundamental pillars of GR. For example, a  −4PN deviation maps to time variation of the gravitational constant G, that allows us to check the strong equivalence principle. 0PN (2PN) corrections can be used to probe violations in Lorentz (parity) invariance.

Figure 10. Refer to the following caption and surrounding text.

Figure 10. Upper bounds on the ppE parameter $\beta$ with GW150914 and GW151226 entering at different PN orders. For comparison, we also present bounds from binary pulsar observations and Solar System experiments. Reprinted figure with permission from [1500], Copyright 2016 by the American Physical Society.

Standard image High-resolution image

3.2.3. Phenomenological waveforms.

The LIGO/Virgo Collaboration (LVC) used a generalized IMRPhenom waveform model [17] by promoting phenomenological parameters $\vec p$ in the GR waveform to $\vec p(1 + \delta p)$ , where $\delta p$ corresponds to the fractional deviation from GR. For the inspiral part of the waveform, one can show that these waveforms are equivalent to the ppE waveform. LVC's bounds on non-GR parameters using a Bayesian analysis on the real data set are consistent with the Fisher bound in figure 10.

One important future direction to pursue is to come up with a meaningful parameterization of the waveform in the merger and ringdown phases. The generalized IMRPhenom waveform model used by LVC does contain non-GR parameters in the merger-ringdown part of the waveform, though it is not clear what they mean physically or how one can map such parameters to e.g. coupling constants in each non-GR theory. To achieve the above goal, one needs to carry out as many BBH merger simulations as possible in theories beyond GR [1103, 1104, 1221, 1508, 1509] to give us some insight on how we should modify the waveform from GR in the merger-ringdown phase.

3.3. Ringdown and black hole perturbations beyond general relativity

Contributor: P Pani

Depending on the physics being tested, the post-merger 'ringdown' phase of a compact-binary coalescence might be better suited than the inspiral or the merger. For instance, in the inspiral phase the nature of the binary components shows up only at high PN order (though corrections due to extra polarizations can show up at low PN order) whereas the merger phase requires time-consuming and theory-dependent numerical simulations. On the other hand, the post-merger phase—during which the remnant BH settles down to a stationary configuration—can be appropriately described by perturbation theory and it is therefore relatively simple to model beyond GR. If the remnant is a BH in GR, the ringdown phase is well-described by a superposition of exponentially damped sinusoids,

Equation (38)

called quasinormal modes (QNMs) [1118, 15101512]. Here, $\omega_{lmn}$ is the characteristic oscillation frequency of the final BH, $\tau_{lmn}$ is the damping time, Almn is the amplitude at a distance r, $\phi_{lmn}$ is the phase, l and m ($|m|\leqslant l$ ) are angular indices describing how radiation is distributed on the final BH's sky, and n is an overtone index labeling the (infinite, countable) number of modes.

Equation (38) sets the ground for GW spectroscopy: QNMs are the fingerprints of the final BH and extracting their frequency and damping time allows for a consistency check of the GW templates [16], tests of gravity [17, 1193, 1500], and of the no-hair properties of the Kerr solution [1513]. As a corollary of the GR uniqueness theorem [1514, 1515], the entire QNM spectrum of a Kerr BH eventually depends only on its mass and spin. Thus, detecting several modes allows for multiple null-hypothesis tests of the Kerr metric [15161518] and, in turn, of one of the pillars of GR.

3.3.1. Background.

BH perturbation theory beyond GR is well established in the case of nonspinning background solutions. Extending seminal work done in GR [15191522], the BH metric is approximated as $g_{\mu\nu} = g_{\mu\nu}^{(0)}+h_{\mu\nu}$ , where $g_{\mu\nu}^{(0)}$ is the background solution, namely the static BH metric that solves the modified Einstein equations in a given theory, whereas $h_{\mu\nu}\ll1$ are the metric perturbations. Likewise, all possible extra fields (e.g. the scalar field in a scalar-tensor theory) are linearized around their own background value. Perturbations of spherically symmetric objects can be conveniently decomposed in a basis of spherical harmonics, and modes with different harmonic indices $(l,m)$ and different parity decouple from each other.

At linear order, the dynamics is described by two systems of ordinary differential equations: the axial (or odd parity) sector and the polar (or even parity) sector. When supplemented by physical boundary conditions at the horizon and at infinity [1118, 1511], each resulting linear system defines an eigenvalue problem whose solutions are the (complex) QNMs, $\omega_{lmn}-{\rm i}/\tau_{lmn}$ . Remarkably, for a Schwarzschild BH within GR the odd and even parity sectors are isospectral [1522], whereas this property is generically broken in other gravity theories.

Perturbations of spinning BHs beyond GR are much more involved. In GR, the gravitational perturbations of a Kerr BH can be separated using the Newman-Penrose formalism [847, 1522] and the corresponding QNMs are described by a single master equation. This property does not generically hold beyond GR or for other classes of modes. To treat more generic cases, one could perform a perturbative analysis in the spin (the so-called slow-rotation approximation [1523, 1524]) or instead solve the corresponding set of partial differential equations with spectral methods and other elliptic solvers [1525]. In general, the spectrum of spinning BHs beyond GR is richer and more difficult to study. This fact has limited the development of parametrized approaches to ringdown tests [1526], which clearly require taking into account the spin of the final object.

Remarkably, there is a tight relation between the BH QNMs in the eikonal limit ($l\gg1$ ) and some geodesic properties associated to the spherical photon orbit (the photon sphere) [15271529]. This 'null geodesic correspondence' is useful as it requires only manipulation of background quantities which are easy to obtain. Furthermore, it provides a clear physical insight into the BH QNMs in terms of waves trapped within the photon sphere, slowly leaking out on a timescale given by the geodesic instability time scale. Within this approximation, the QNMs of BHs beyond GR have been recently studied in [1530] and a parametrized approach has been proposed in [1531]. There are however two important limitations. First, the correspondence is strictly valid only in the eikonal limit, $l\gg1$ , whereas the GW signal is typically dominated by the lowest-l modes. More importantly, the geodesic properties are 'kinematical' and do not take into account dynamical aspects, e.g. those related to extra degrees of freedom [1530] or nonminimal couplings [1532]. For example, Schwarzschild BHs in two different theories of gravity will share the same geodesics but will have in general different QNM spectra [1530, 1533].

3.3.2. Signatures.

There are several distinctive features of linearized BH dynamics that can be used to perform tests of gravity in the strong-field regime.

Ringdown tests & the no-hair theorem.

For BBH mergers within GR, the QNMs excited to larger amplitudes are usually the $(2,2,0)$ and $(3,3,0)$ gravitational modes [15341537]. Because the Kerr BH depends only on two parameters, extracting $\omega_{220}$ and $\tau_{220}$ allows to estimate the mass and spin of the final object, whereas extracting further subleading modes provides multiple independent consistency checks of the Kerr metric, since the QNMs are generically different in extensions of GR. Since QNMs probe a highly dynamical aspect of the theory, the latter statement is true even for those theories in which the Kerr metric is the only stationary vacuum solution [1533]. These tests requires high SNR in the ringdown phase and will be best performed with next generation detectors, especially with LISA [1121, 1517, 1538, 1539], although coherent mode-stacking methods may be feasible also with advanced ground-based detectors [1540]. Furthermore, the ability to perform accurate tests will rely on understanding theoretical issues such the starting time of the ringdown [1537, 1541] and on the modelling of higher harmonics [1332]. Such tests are based on the prompt ringdown response and implicitly assume that the remnant object has an horizon. If an ultracompact exotic compact object rather than a BH forms as a results of beyond-GR corrections, the ringdown signal depends strongly on the final object's compactness. If the compactness is comparable or slightly larger than that of a neutron star, the prompt ringdown will show distinctive signatures [15421544]. On the other hand, for objects as compact as BHs, the prompt ringdown is identical to that of a Kerr BH, but the signal is characterized by 'echoes' at late times [15451547] (see also section 4.3).

Extra ringdown modes.

In addition to the shift of gravitational modes discussed above, virtually any extension of GR predicts extra degrees of freedom [1193] which may be excited during the merger [1104, 1530, 1548, 1549]. Although the amplitude of such modes is still poorly estimated, this type of tests calls for novel ringdown searches including two modes of different nature [1550].

Isospectrality.

Isospectrality of Schwarzschild BHs in GR has a bearing also on Kerr BHs. Since this property is generically broken in alternative theories, a clear signature of beyond-GR physics would be the appearance of a 'mode doublet' (i.e. two modes with very similar frequency and damping time) in the waveform [1526, 1530].

Instabilities.

BHs in Einstein's theory are (linearly mode) stable. A notable exception relates to the superradiant instability triggered by minimally-coupled light bosonic fields [1152] (see also section 5 below). For astrophysical BHs, this instability is ineffective except for ultralight bosons with masses below $10^{-10}~{\rm eV}$ , in which case it has important phenomenological consequences (see section 5.7). The situation is different in modified gravity, for example certain BH solutions are known to be unstable in scalar-tensor theories in the presence of matter [1551, 1552], in theories with higher-order curvature couplings [1414, 1418], and in massive (bi)-gravity [1553, 1554]. These instabilities are not necessarily pathological, but simply indicate the existence of a different, stable, 'ground state' in the spectrum of BH solutions to a given theory. Finally, as discussed in section 4.3, extensions of GR might predict exotic compact objects without an event horizon. These objects are typically unstable and their instability can lead to peculiar signatures also in the GW signal [1547, 1555].

3.3.3. A parametrised ringdown approach?

It should be clear from the previous discussion that the Teukolsky formalism is insufficient to study BH perturbation theory in alternative theories of gravity. Extending it is hence a major open problem. One might be even more ambitious and attempt to develop a theory-agnostic parametrisation for the ringdown, similar in spirit to that discussed in section 3.2 for the inspiral phase. This would generally be a two-faceted problem, since the quiescent BH that acts as the endpoint of evolution is not a Kerr BH, in general. Hence, one first needs to parametrise deviations from the Kerr spacetime and then propose a meaningful and accurate parametrisation of the perturbations around such non-Kerr background.

There have been several attempts to parametrise stationary deviations from the Kerr spacetime. Bumpy metrics were constructed in [15561558] within GR, starting from a perturbation around the Schwarzschild metric and applying the Newman–Janis transformation [1559] to construct a rotating configuration. The quasi-Kerr metric uses the Hartle–Thorne metric [1560], valid for slowly-rotating configurations, and modifies the quadrupole moment [1561]. Non-Kerr metrics can also be found, with the extra requirement that a Carter-like, second-order Killing tensor exists [1562]. This approach and metric were later simplified in [1563]. Note however that rotating BH solutions in alternative theories of gravity do not possess such a Killing tensor in general. Johannsen and Psaltis [1564] proposed the modified Kerr metric by starting from a spherically symmetric and static metric (that does not necessarily satisfy the Einstein equations) and applying the Newman–Janis transformation. This approach was later extended in [1565]. Finally, modified axisymmetric spacetime geometries can be found by adopting a continued-fraction expansion to obtain a rapid convergence in the series [1566, 1567]. See e.g. [1568, 1569] for other parameterizations for deviations from Kerr. Properties of various parameterized Kerr metrics have been studied in [1570]. See also [1571] for more details on such generic parameterization of the Kerr metric.

It should be stressed that a shortfall of all of the aforementioned parametrisation is that they do not have a clear physical interpretation and this makes them rather ad hoc. Moreover, they only address half of the problem of parametrizing the ringdown. So far their use in modelling waveforms has been limited, but see [1369, 1572] and [1531] for applications to extreme-mass ratio inspirals and BH ringdown respectively. At this stage there has been very little progress on how one can take the second step, i.e. parametrise linear perturbations around such spacetimes in a theory-agnostic way. Along these lines, a general theory of gravitational perturbations of a Schwarzschild metric has been developed in [1548] and applied to Horndeski theory in [1549].

3.4. Merger, numerics, and complete waveforms beyond general relativity

Contributors: C Palenzuela, T P Sotiriou

The merger of binary compact objects will test the highly dynamical and strongly non-linear regime of gravity, that can only be modelled by using numerical simulations. As already discussed above, the recent direct detection of GWs from BBHs [1, 2, 11, 18, 20], and of the NSB merger [21] with an associated short GRB [23] and a plethora of concurrent electromagnetic signals from the same source [22] has already provided constraints on deviations from GR in various forms and contexts (e.g. [17, 670673, 1485, 1486, 1500] respectively. However, most of these constraints use either partial, potentially parametrised, waveforms or rely on propagation effects. In fact, the true potential of testing GR is currently limited by the lack of knowledge of GW emission during the merger phase in alternatives to GR [1500]. This problem is particularly acute for heavier BH mergers, where only a short part of the inspiral is detected.

This suggests that constraints could be strengthened significantly in most cases if one had complete, theory-specific, waveforms. However, performing stable and accurate numerical simulations that would produce such waveforms requires an understanding of several complex issues. Probably the first among them is the well-posedness of the system of equations that describe evolution of a given alternative scenario to GR. In the next section we briefly describe the issue of well-posedness and discuss some techniques that have been used so far to obtain a well-posed evolution system in alternative theories. In section 3.4.3 we overview some recent numerical studies of nonlinear evolution beyond GR.

3.4.1. Initial value formulation and predictivity beyond GR.

Modelling the evolution of a binary system for given initial data is a type of initial value problem (IVP). An IVP is well-posed if the solution exists, is unique and does not change abruptly for small changes in the data. A theory with an ill-posed IVP cannot make predictions. The IVP is well-posed in GR [1573] but it is not clear if this is true for most of its contenders. This is a vastly overlooked issue and a systematic exploration of the IVP in many interesting alternative theories, such as those discussed in section 2.1, is pending.

A class of alternative theories of gravity that are known to be well-posed is scalar-tensor theories described by action (21). As discussed in section 2.2, after suitable field redefinitions, these theories take the form of GR plus a scalar field with a canonical kinetic term and potential non-minimal couplings between the scalar and standard model fields, see action (22). Since these couplings do not contain more than two derivatives, one can argue for well-posedness using the known results for GR and the fact that lower-order derivative terms are not relevant for this discussion. Interestingly, most alternative theories actually include modifications to the leading order derivative terms in the modified Einstein's equations, so a theory-specific study is necessary. This has been attempted in very few cases and results are mostly very preliminary. In particular, there is evidence that dynamical Chern–Simons gravity is ill-posed [1106]. Certain generalised scalar-tensor theories appear to be ill-posed in a generalised harmonic gauge when linearised over a generic, weak field background [1574]; however, note that this result is gauge-dependent, and hence not conclusive. Finally, in certain Lorentz-violating theories that exhibit instantaneous propagation, describing evolution might require solving a mixture of hyperbolic and elliptic equations (where the latter are not constraints as in GR) [1480, 1481].

3.4.2. Well-posedness and effective field theories.

One is tempted to use well-posedness as a selection criterion for alternative theories of gravity, as a physical theory certainly needs to be predictive (in an appropriate sense). However, alternatives to GR can be thought of as effective field theories—truncations of a larger theory and hence inherently limited in their range of validity. This complicates the question of what one should do when a given theory turns out to be ill-posed. In fact, it even affects how one should view its field equations and the dynamics they describe in the first place. Effective field theories (EFTs) can often contain spurious degrees of freedom (e.g. ghosts) that lead to pathological dynamics. In linearised theories it is easy to remove such degrees of freedom and the corresponding pathologies, but there is no unique prescription to doing so in general. Hence, instead of setting aside theories that appear to be ill-posed when taken at face value, perhaps one should be looking for a way to 'cure' them and render them predictive at nonlinear level.

A very well-known EFT, viscous relativistic hydrodynamics, requires such 'curing' to control undesirable effects of higher order derivatives and a prescription for doing so has been given long ago [15751577]. A similar prescription applicable to gravity theories has been given recently [1578]. Roughly speaking, this approach treats the theory as a gradient expansion and, hence, it considers as the cause of the pathologies runaway energy transfer to the ultraviolet, that in turn renders the gradient expansion inapplicable. As a results, it attempts to modify the equations so as to prevent such transfer and to ensure that the system remains within the regime of validity of the effective descriptions throughout the evolution.

Another approach is to consider the theory as arising from a perturbative expansion is a certain parameter, say $\lambda$ . If that were the case, then higher order corrections in $\lambda$ have been neglected and, consequently, the solutions can only be trusted only up to a certain order in $\lambda$ . Moreover, they have to be perturbatively close (in $\lambda$ ) to solutions with $\lambda=0$ , which are solution of GR. Hence, one can iteratively generate fully dynamical solutions order-by-order in $\lambda$ . This process is effectively an order-reduction algorithm and yields a well-posed system of equations. It has a long history in gravity theories in different contexts (e.g. [15791581]), but it has only recently been used for nonlinear, dynamical evolution in alternative theories [1104, 1582, 1583].

These two 'cures' do not necessarily give the same results. Moreover, there is no reason to believe that there cannot be other ways to address this problem, so this remains a crucial open question. In principle, the way that an EFT is obtained from a more fundamental theory should strongly suggest which is the way forward when considering nonlinear evolution. However, in practice one often starts with an EFT and hopes to eventually relate it to some fundamental theory. Hence, it seems wise to consider all possible approaches. In principle different theories might require different approaches.

3.4.3. Numerical simulations in alternative theories.

As mentioned above, one can straightforwardly argue that the IVP is well-posed in standard scalar-tensor theories described by action (21). However, as will be discussed in detail in section 4.1, BHs in this class of theories are generically identical to GR and carry no scalar configuration. Hence, even though gravitational radiation in these theories can in principle contain a longitudinal component, it is highly unlikely it will get excited in a BBH. In section 4.2 we outline the conditions under which BHs can differ from their GR counterparts in standard scalar-tensor theories139. When these conditions are satisfied there should be an imprint in GWs from BH binaries. It is worth highlighting here a couple of cases where numerical simulations have to be used to address this question. The first has to do with the role of asymptotics. It has been shown that time-dependent asymptotic for the scalar could lead to scalar radiation during the coalescence [1103], though this emission would probably be undetectable for realistic asymptotic values of the scalar field gradient. The second case has to do with whether matter in the vicinity of a BH can force it to develop a non-trivial configuration. This has been shown in idealised setups only in [1551, 1552]. However, numerical simulations for the same phenomenon in NSs have been performed in [1222, 1223, 1393].

$f(R)$ theories of gravity (see [1584] for a review), which are dynamically equivalent to a specific subclass of scalar-tensor theories, are also well-posed [1584, 1585]. A comparative study of NSB mergers in GR with those of a one-parameter model of f (R)  =  R  +  aR2 gravity is performed in [1586].

A well-posed extension of scalar-tensor theories is Einstein–Maxwell–Dilaton gravity. It has its origin in low energy approximations of string theory and it includes, apart from the metric and a scalar, a U(1) gauge field. The scalar field couples exponentially to the gauge field, allowing for deviations from GR even for BHs with asymptotically constant scalar field. BBH simulations have shown that these deviations are rather small for reasonable values of the hidden charge [1509], leading to weak constraints on the free parameters of the theory.

Finally, some first numerical results have recently appeared in scalar-Gauss–Bonnet gravity, and specifically the theory described by action (30), and in Chern–Simons gravity (see section 2.2 for more details on the theories). [1582, 1583] studied scalar evolution in scalar-Gauss–Bonnet gravity and [1104] performed the first binary evolution in Chern–Simons gravity. These results are notable for using a perturbative expansion in the free parameter of the theory, described in the previous section, in order to circumvent potential issues with well-posedness (as discussed above, if the field equation are taken at face value, well-posedness is known to be an issue for Chern–Simons gravity [1106] and it might be an issue for scalar-Gauss–Bonnet gravity [1574]).

4. The nature of compact objects

4.1. No-hair theorems

Contributors: C Herdeiro, T P Sotiriou

The discovery of the Schwarzschild solution in 1915–16 [1587], shortly after Einstein's presentation of GR [1588], triggered a half-a-century long quest for its rotating counterpart, which ended with the discovery of the Kerr solution in 1963 [1589]—see [1590] for a historical account of this discovery. At this time, unlike in Schwarzschild's epoch, it was already considered plausible that these metrics could represent the endpoint of the gravitational collapse of stars, even though the name black hole to describe them only became widespread after being used by Wheeler in 1967–68 [1591].

The extraordinary significance of the Kerr metric became clear with the establishment of the uniqueness theorems for BHs in GR—see [1376] for a review. The first such theorem was that of Israel's for the static case [1592]. Then, Carter [1514] and Robinson [1593] constructed a theorem stating: An asymptotically flat stationary and axisymmetric vacuum spacetime that is non-singular on and outside a connected event horizon is a member of the two-parameter Kerr family. The axial symmetry assumption was subsequently shown to be unnecessary (assuming analyticity, nondegenerary, and causality [1594]); for BHs, stationarity actually implies axial symmetry, via the 'rigidity theorem', which relates the teleologically defined 'event horizon' to the locally defined 'Killing horizon' [1595]. The upshot of these theorems is that a stationary vacuum BH, despite possessing an infinite number of non-trivial, appropriately defined multipolar moments [1596], has only two degrees of freedom: its mass and angular momentum. This simplicity was written in stone by Wheeler's dictum ' BHs have no hair' [1597].

In parallel to these developments the astrophysical evidence for BHs piled up over the last half century—see e.g. [1598]. In fact, the very conference where the Kerr solution was first presented (the first Texas symposium on relativistic astrophysics) held in December 1963, was largely motivated by the discovery of quasars in the 1950s and the growing belief that relativistic phenomena related to highly compact objects should be at the source of the extraordinary energies emitted by such very distant objects [1599]. Gravitational collapse was envisaged to play a key role in the formation of these highly compact objects and such collapse starts from matter distributions, rather than vacuum. Thus, one may ask if BHs in the presence of matter, rather than in vacuum, still have no-hair. In this respect, Wheeler's dictum was a conjecture, rather than a synthetic expression of the uniqueness theorems. It hypothesised a much more general and ambitious conclusion than that allowed solely by these theorems. The conjecture stated that: gravitational collapse leads to equilibrium BHs uniquely determined by mass, angular momentum and electric charge -asymptotically measured quantities subject to a Gauss law and no other independent characteristics (hair) [1600]. Here, the Gauss law plays a key role to single out the physical properties that survive gravitational collapse, because they are anchored to a fundamental (gauged) symmetry, whose conserved charge cannot be cloaked by the event horizon. The uniqueness theorems (which can be generalised to electro-vacuum [1376]), together with the no-hair conjecture, lay down the foundation for the Kerr hypothesis, that the myriad of BHs that exist in the Cosmos are well described, when near equilibrium, by the Kerr metric. This astonishing realisation was elegantly summarised by S. Chandrasekhar [1601]: In my entire scientific life, extending over forty-five years, the most shattering experience has been the realisation that an exact solution of Einstein's field equations of GR, discovered by the New Zealand mathematician, Roy Kerr, provides the absolutely exact representation of untold numbers of massive BHs that populate the Universe.

The Kerr hypothesis can be tested both theoretically and by confrontation with observations. On the theory front, the key question is whether BHs that are not described by the Kerr metric can constitute solutions of reasonable field equations that describe gravity and potentially other fields. Additionally, one can ask whether such BH solutions can form dynamically and are (sufficiently) stable as to be astrophysically relevant. On the observational front, the question is to which extent the BHs we observe are compatible with the Kerr hypothesis and what constraints can one extract from that.

A set of results on the non-existence of non-Kerr BHs in specific models, either including different matter terms in Einstein's GR or by considering theories of gravity beyond Einstein's GR (or both) have been established since the 1970s and are known as no-hair theorems—see e.g. the reviews [1401, 16021604]140. No-hair theorems apply to specific models or classes of models and rely on assumptions regarding symmetries, asymptotics, etc. As will become evident in the next section, dropping one or more of these assumptions in order to circumvent the corresponding no-hair theorem can be a good tactic for finding non-Kerr BHs (see also [1401] for a discussion). More generally, understanding the various sort of theorems, their assumptions and limitations is instructive.

A large body of work was dedicated to the case of scalar hair, partly due to the conceptual and technical simplicity of scalar fields. Indeed, one of the earliest examples of a no-hair theorem was provided by Chase [1605], following [1606]. Chase inquired if a static, regular BH spacetime could be found in GR minimally coupled to a real, massless scalar field. Without assuming spherical symmetry, he could show that the scalar field necessarily becomes singular at a simply-connected event horizon. Thus, a static BH cannot support scalar hair, in this model. The 1970s witnessed the formulation of influential no-hair theorems, notably, by Bekenstein, who developed a different approach for establishing the non-existence of scalar hair, applicable to higher spin fields as well [16071609]. An elegant no-hair theorem was given by Hawking [1398] for minimally coupled scalar fields without self interactions and for Brans–Dicke theory [1195]. Hawking's theorem assumes stationarity (as opposed to staticity) and no other symmetry and its proof is particularly succinct. This theorem has been later generalised by Sotiriou and Faraoni [1400] to scalars that exhibit self interactions and to the class of scalar-tensor theories described by action (21). Hui and Nicolis [1405] have proven a no-hair theorem for shift-symmetric (and hence massless) scalar fields that belong to the Horndeski class discussed in section 2.2. This proof assumes staticity and spherical symmetry, though it can be easily generalized to slowly-rotating BHs [1406]. It is also subject to a notable exception, as pointed out in [1406]: a linear coupling between the scalar and the Gauss–Bonnet invariant is shift symmetric and yet circumvents the theorem (see section 2.2 and action (30)). Finally, a no-hair theorem has been given in [1414] for stationary BH solutions for the theory described by action (31), provided $f''(\phi_0){{\mathcal G}}<0$ ; a similar proof, but restricted to spherical symmetry, can be found in [1415].

4.2. Non-Kerr black holes

Contributors: C Herdeiro, T P Sotiriou

Despite the many no-hair theorems discussed above, hairy BH solutions are known to exist in many different contexts. In GR, the 1980s and 1990s saw the construction of a variety of hairy BH solutions, typically in theories with non-linear matter sources—see e.g. the reviews [1602, 1610, 1611]. The paradigmatic example are the coloured BHs, found in Einstein-Yang-Mills theory [16121615]. However, these BHs are unstable and have no known formation mechanism. Hence, they should not be considered as counter examples to the weak no-hair conjecture, and it is unlikely that they play a role in the astrophysical scene.

Below we will review some more recent attempt to find hairy BH solutions that are potentially astrophysically relevant. We first focus on cases that are covered by no-hair theorems and discuss how dropping certain assumptions can lead to hairy BHs. We then move on to various cases of hairy BHs in theories and models that are simply not covered by no-hair theorems. This is not meant to be an exhaustive review but merely a discussion of some illuminating examples in order to highlight the interesting BH physics that GWs could potentially probe.

4.2.1. Circumventing no-hair theorems.

No-hair theorems rely on assumptions, such as: (i) the asymptotic flatness of the metric and other fields; (ii) stationarity (or more restrictive symmetries); (iii) the absence of matter; (iv) stability, energy conditions and other theorem-specific conditions. There is no doubt that removing any one of these assumptions leads to BH hair. For example, standard scalar-tensor theories described by the action (21) are covered by the no-hair theorems discussed in the previous section [1398, 1400, 1605, 16071609]. Nonetheless, BHs with scalar hair exist in these theories if one allows for anti-de Sitter (AdS) asymptotics (e.g. [1616]), or attempts to 'embed' the BHs in an evolving universe [1617], or allows for matter to be in their vicinity [1551]. The case of AdS asymptotics, though interesting theoretically, is not relevant for astrophysics. The fact that cosmic evolution could potentially endow astrophysical BHs with scalar hair is certainly enticing but the effect should be very small [1618]. Finally, it is not yet clear if the presence of matter in the vicinity of a BH could induce a scalar charge that is detectable with the current observations. See [1401] for a more detailed discussion. Circumventing the theorems by violating stability arguments will be discussed in section 4.2.3.

A perhaps less obvious way to obtain hairy solutions is to strictly impose the symmetry assumptions on the metric only and relax them for other fields. Even though the field equations relate the fields their symmetries do not need to match exactly in order to be compatible (see, for instance, [1619] for a recent discussion). A well-studied case is that of a complex, massive scalar field minimally coupled to gravity [1164, 1620]. The scalar has a time-dependent phase, but its stress-energy tensor remains time-independent (thanks to some tuning) and the metric can remain stationary. See also [1621, 1622] for generalisations and [1168] for an existence proof of the solutions. BHs with Proca (massive vector) hair have also been found using the same approach [1167].

Whether or not these hairy BH are relevant for astrophysical phenomena depends on three main factors: (1) the existence of (ultra-light) massive bosonic fields in Nature; (2) the existence of a formation mechanism and (3) their stability properties. The first factor is an open issue. Ultra-light bosonic fields of the sort necessary for the existence of these BHs with astrophysical masses do not occur in the standard model of particle physics. Some beyond the standard model scenarios, however, motivate the existence of this type of 'matter', notably the axiverse scenario in string theory [1623]. The second point has been positively answered by fully non-linear numerical simulations [1162, 1624]. There is at least one formation channel for these BHs, via the superradiant instability of Kerr BHs, in the presence of ultralight bosonic fields—see [1152] for a review of this instability. The hairy BHs that result from this dynamical process, however, are never very hairy, being always rather Kerr-like. Concerning the third point, it has been recognised since their discovery, that these BHs could be afflicted by the superradiant instability themselves [1620, 1625]. Recent work [1626] succeeded in computing the corresponding timescales, and reported that the timescale of the strongest superradiant instability that can afflict hairy BH is roughly 1000 times longer than that of the strongest instability that can afflict the Kerr BH. This study, albeit based on the analysis of a very small sample of solutions, leaves room for the possibility that some of these BHs can form dynamically and be sufficiently stable to play a role in astrophysical processes [1169]. However, a more detailed analysis needs to be performed before definite conclusions can be drawn.

Hairy BHs have also been found in certain shift-symmetric scalar tensor theories by allowing the scalar to depend linearly on time while requiring that the metric be static [1627, 1628]. This is possible because shift symmetry implies that the scalar only appears in the equations through its gradient and the linear time dependence renders the gradient time independent. The linear stability on such solutions has been explored in [16291631]. It remains largely unexplored whether these BHs can form dynamically and hence whether they are relevant for astrophysics.

4.2.2. Black holes with scalar hair.

The most straightforward way to find hairy BH solutions is to consider theories that are not covered by no-hair theorems. In the case of scalar-tensor theories, it is known that couplings between the scalar (or pseudoescalar) and quadratic curvature invariants, such as the Gauss–Bonnet invariant $ \newcommand{\e}{{\rm e}} {{\mathcal G}}\equiv R^2 - 4 R_{\mu\nu} R^{\mu\nu} + R_{\mu\nu\rho\sigma}R^{\mu\nu\rho\sigma}$ or the Pontryagin density $*R^{\alpha\phantom{a}\gamma\delta}_{\phantom{a}\beta} R_{\phantom{a}\alpha\gamma\delta}^{\beta}$ , can lead to scalar hair, see e.g. [1406, 14081411]. Indeed, the existence of hairy BHs singles out specific class of theories, such as dynamical Chern–Simons (dCS) gravity [1431, 1432] and the scalar-Gauss–Bonnet theories described by the action (31) of section 2.2 (which have hairy BH solutions provided that $f'(\phi_0)\neq 0$ for any constant $\phi_0$ [1406, 1414]). Section 2.2 already contains a discussion about these theories and their BH phenomenology, so we refer the reader there for details. See also section 3.4.3 for a discussion on the first attempts to simulate dynamical evolution of hairy BHs and to produce waveforms for binaries that contain them.

4.2.3. Black hole scalarization.

In the previous section the focus was on theories that are not covered by no hair theorems and have hairy BH for all masses. An interesting alternative was recently pointed out in [1414, 1416]: that certain theories that marginally manage to escape no-hair theorems [1414, 1415] can have hairy BHs only in certain mass ranges, outside which BHs are identical to those of GR. The theories belong in the class describe by action (31) and the phenomenon resembles the'spontaneous scalarization' of stars that happens in certain models of the standard scalar-tensor theories of action (21); see section 2.2 for more details. In the mass ranges where hairy BHs exist the GR BHs of the same mass are expected to be unstable and this instability is supposed to gives rise to the scalar hair. However, depending on the model, the hairy solutions can also be unstable [1418] and this issue requires further investigation. It is also not clear if other theories can also exhibit BH scalarization and the astrophysical significance of the effect has not yet be explored.

4.2.4. Black holes in theories with vector fields and massive/bimetric theories.

In analogy to the generalised scalar-tensor theories discussed in section 2.2 and described by action (25), one can construct the most general vector-tensor theories with second-order equations of motion. These theories can be considered as generalised Proca theories and they contain new vector interactions [1632, 1633]. Their phenomenology can be distinct from that of Horndeski scalars and BH solutions with vector hair are known to exist [16341640]. Depending on the order of derivative interactions, these BH solutions can have primary (i.e. the new charge is independent) or secondary (i.e. the new charge is not independent) Proca hair. The presence of a temporal vector component significantly enlarges the possibility for the existence of hairy BH solutions without the need of tuning the models [1636, 1637].

Horndeski scalar-tensor and the generalised Proca theories can be unified into scalar-vector-tensor theories for both the gauge invariant and the gauge broken cases [1641]. In the $U(1)$ gauge invariant and shift symmetric theories the presence of cubic scalar-vector-tensor interactions is crucial for obtaining scalar hair, which manifests itself around the event horizon. The inclusion of the quartic order scalar-vector-tensor interactions enables the presence of regular BH solutions endowed with scalar and vector hairs [1642]. It is worth mentioning that this new type of hairy BH solutions are stable against odd-parity perturbations [1643].

In massive and bimetric gravity theories the properties of the BH solutions depend heavily on choices one makes for the fiducial or dynamical second metric [16441649]. If the two metrics are forced to be diagonal in the same coordinate system (bidiagonal solutions) their horizons need to coincide if they are not singular [1650]. It should be stressed that generically there is not enough coordinate freedom to enforce the metrics to be bidiagonal. The analysis of linear perturbations reveals also an unstable mode with a Gregory–Laflamme instability [1553, 1554], which remains in bi-Kerr geometry. Perturbations of the non-bidiagonal geometry is better behaved [1651]. Non-singular and time dependent numerical solutions were recently found in [1652].

4.2.5. Black holes in Lorentz-violating theories.

Lorentz symmetry is central to the definition of a BH thanks to the assumption that the speed of light is the maximum attainable speed. Superluminal propagation is typical of Lorentz violating theories141 and GW observations have already given a spectacular constraint: the detection of a NSB merger (GW170817) with coincident gamma ray emission has constrained the speed of GWs to a part in 1015 [23]. However, Lorentz-violating theories generically exhibit extra polarizations [1478], whose speed remains virtually unconstrained [1485]. See also section 3.1. Moreover, BHs in such theories are expected to always be hairy. The reason has essentially been explained in the discussion of section 2.3: Lorentz-violating theories can generally be written in a covariant way (potentially after restoring diffeomorphism invariance through the introduction of a Stueckelberg field) and in this setup Lorentz symmetry breaking can be attributed to some extra field that has the property of being nontrivial in any solution to the field equation. That same property will then endow BHs with hair. Indeed, this is known to be true for the theories reviewed in section 2.3. The structure of such BHs depends on the causal structure of the corresponding theory.

In particular, in Einstein-æther theory (æ-theory), though speeds of linear perturbations can exceed the speed of light, they satisfy an upper bound set by the choice of the ci parameters. Indeed, massless excitations travel along null cones of effective metrics composed by the metric that defines light propagation and the æ ther [1653]. This makes the causal structure quasi-relativistic, when one adopts the perspective of the effective metric with the widest null cone [1481]. Stationary BHs turn out to have multiple nested horizons, each of which is a Killing horizon of one of the effective metrics and acts as a causal boundary of a specific excitation [1653, 1654].

In Hořava gravity there is no maximum speed, as already explained is section 2.3. There is a preferred foliation and all propagating modes have dispersion relations that scale as $\omega^2\propto k^6$ for large wave numbers k and there is arbitrarily fast propagation. Even at low momenta, where the dispersion relations are truncated to be linear, there is also an instantaneous (elliptic) mode, both at perturbative [1479] and nonperturbative level [1480]. It would be tempting to conclude that BHs cannot exist in Hořava gravity. Nonetheless, there is a new type of causal horizon that allows one to properly define a BH, dubbed the universal horizon [1479, 1654]. It is not a null surface of any metric, but a spacelike leaf of the preferred foliation that cloaks the singularity. Future-directed signals can only cross it in one direction and this shields the exterior from receiving any signal, even an instantaneous one, from the interior. Universal horizons persist in rotating BHs in Hořava gravity [1481, 1482, 16551657] and have been rigorously defined without resorting to symmetries in [1481].

Even though BHs in Lorentz-violating theories and their GR counterparts have very different causal structure, their exteriors can be very similar and hard to tell apart with electromagnetic observations [1482, 1654, 1657]. Nevertheless, the fact that Lorentz-violating theories have additional polarisations and their BHs are have hair suggests very strongly that binary systems will emit differently than in GR. Hence, confronting model-dependent waveforms with observations is likely to yields strong constraints for Lorentz violation in gravity.

4.3. Horizonless exotic compact objects

Contributors: V Cardoso, V Ferrari, P Pani, F H Vincent

BHs are the most dramatic prediction of GR. They are defined by an event horizon, a null-like surface that causally disconnects the interior from our outside world. While initially considered only as curious mathematical solutions to GR, BHs have by now acquired a central role in astrophysics: they are commonly accepted to be the outcome of gravitational collapse, to power active galactic nuclei, and to grow hierarchically through mergers during galaxy evolution. The BH paradigm explains a variety of astrophysical phenomena and is so far the simplest explanation for all observations. It is, however, useful to remember that the only evidence for BHs in our universe is the existence of dark, compact and massive objects. In addition, BHs come hand in hand with singularities and unresolved quantum effects. Hence, it is useful to propose and to study exotic compact objects (ECOs)—dark objects more compact than NSs but without an event horizon.

Perhaps the strongest theoretical motivation to study ECO's follows from quantum gravity. Quantum modifications to GR are expected to shed light on theoretical issues such as the pathological inner structure of Kerr BHs and information loss in BH evaporation. While there is a host of recent research on the quantum structure of BHs, it is not clear whether these quantum effects are of no consequence on physics outside the horizon or will rather lead to new physics that resolves singularities and does away with horizons altogether. The consequence of any such detection, however small the chance, would no less than shake the foundations of physics. As there is no complete quantum gravity approach available yet, the study of ECOs, in various degrees of phenomenology, is absolutely crucial to connect to current GW observations.

In summary, ECOs should be used both as a testing ground to quantify the observational evidence for BHs, but also to understand the signatures of alternative proposals and to search for them in GW and electromagnetic data [1547].

Classification: compact objects can be conveniently classified according to their compactness, M/R, where M and R are the mass and (effective) radius of the object, respectively. NSs have $M/R\approx 0.1-0.2$ , a Schwarzschild BH has M/R  =  1/2, whereas a nearly extremal Kerr BH has $M/R\approx1$ . The study of the dynamics of light rays and of GWs shows that the ability of ECOs to mimic BHs is tied to the existence of an unstable light ring in the geometry [1547, 1658]. Accordingly, two important categories of this classification are [1547]:

  • (i)  
    Ultracompact objects (UCOs), whose exterior spacetime has a photon sphere. In the static case, this requires M/R  >  1/3. UCOs are expected to be very similar to BHs in terms of geodesic motion. NSs are not in this category, unless they are unrealistically compact and rapidly spinning.
  • (ii)  
    Clean-photon-sphere objects (ClePhOs), so compact that the light travel time from the photon sphere to the surface and back is longer than the characteristic time scale, $\tau_{\rm geo}\sim M$ , associated with null geodesics along the photon sphere. These objects are therefore expected to be very similar to BHs at least over dynamical time scales  ∼M. In the static case, this requires M/R  >  0.492 [1547].

Some models of ECOs have been proposed in attempts to challenge the BH paradigm. Others are motivated by (semiclassical) quantum gravity scenarios that suggest that BHs would either not form at all [1547, 16591665], or that new physics should drastically modify the structure of the horizon [16661668].

In the presence of exotic matter fields (e.g. ultralight scalars), even classical GR can lead to ECOs. The most characteristic case is perhaps that of boson stars [1171, 1669, 1670]: self-gravitating solutions of the Einstein–Klein–Gordon theory. They have no event horizon, no hard surface, and are regular in their interior. Depending on the scalar self-interactions, they can be as massive as supermassive compact objects and—when rapidly spinning—as compact as UCOs. Further details about these objects can be found elsewhere [1671]. Though boson stars are solutions of a well defined theory, most ECO-candidates currently arise from phenomenological considerations and employ some level of speculation. For instance, certain attempts of including quantum gravity effects in the physics of gravitational collapse postulate corrections in the geometry at a Planck distance away from the horizon, regardless of the mass (and, hence, of the curvature scale) of the object. Since $l_{\rm Planck}\ll M$ , these objects naturally classify as ClePhOs. One example are fuzzballs, ensembles of horizonless microstates that emerge as rather generic solutions to several string theories [1660, 1661]. In these solutions the classical horizon arises as a coarse-grained description of the (horizonless) microstate geometries. Another example are so-called gravitational-vacuum stars, or gravastars, ultracompact objects supported by a negative-pressure fluid [1659, 16721674], which might arise from one-loop effects in curved spacetime in the hydrodynamical limit [1675]. Other examples of ClePhOs inspired by quantum corrections include black stars [1676], superspinars [1677], collapsed polymers [1678, 1679], 2  −  2-holes [1680], etc [1547]. We should highlight that for most of these models, $1-2M/R\sim 10^{-39}$ –10−46 for stellar to supermassive dark objects. Thus, their phenomenology is very different from that of boson stars, for which $1-2M/R\sim {{\mathcal O}}(0.1)$ at most.

The speculative nature of most ECOs implies that their formation process has not been consistently explored (with notable exceptions [1671]). If they exist they are likely subject to the so-called ergoregion instability. This was first found in [1681] for scalar and electromagnetic perturbations (see also [1682, 1683]), then shown to affect gravitational perturbations as well in [1684], and recently proved rigorously in [1685]). The instability affects any spinning compact object with an ergoregion but without a horizon and its time scale may be shorter than the age of the Universe [1686, 1687]. The endpoint of the instability is unknown but a possibility is that the instability removes angular momentum [1688], leading to a slowly-spinning ECO [1152], or perhaps it is quenched by dissipation within the object [1689] (although the effects of viscosity in ECOs are practically unknown). An interesting question is also the outcome of a potential ECO coalescence. All known equilibrium configurations of ECOs have a maximum mass above which the object is unstable against radial perturbations and is expected to collapse. The configuration with maximum mass is also the most compact (stable) one. Therefore, the more an ECO mimics a BH, the closer it is to its own maximum mass. Thus, if two ECOs are compact enough as to reproduce the inspiral of a BH coalescence (small enough deviations in the multipole moments, tidal heating, and tidal Love numbers relative to the BH case; see below), their merger would likely produce an ECO with mass above the critical mass of the model. Hence the final state could be expected to be a BH. In other words, one might face a 'short blanket' problem: ECOs can mimic well either the post-merger ringdown phase of a BH or the pre-merger inspiral of two BHs, but they may find it difficult to mimic the entire inspiral-merger-ringdown signal of a BH coalescence with an ECO+ECO $\to$ ECO process. The only way out of this problem is to have a mass-radius diagram that resembles that of BHs for a wide range of masses. No classical model is known that behaves this way, yet.

It is worth noting that, even if the BH paradigm is correct, ECOs might be lurking in the universe along with BHs and NSs. Modelling the EM and GW signatures of these exotic sources is necessary to detect them, and may even provide clues for other outstanding puzzles in physics, such as DM (see also section 5 below).

4.4. Testing the nature of compact objects

Contributors: V Cardoso, V Ferrari, M Kramer, P Pani, F H Vincent, N Wex

4.4.1. EM diagnostics.

EM radiation emitted from the vicinity of BH candidates can probe the properties of spacetime in the strong-field region and may lead to constraining the nature of the compact object. We focus our discussion on three popular EM probes, namely shadows, x-ray spectra, and quasi-periodic oscillations. We do not discuss here polarization, nor effects on stellar trajectories. Because all EM tests can be eventually traced back to geodesic motion, EM probes may distinguish between BHs and ECOs with M/R  <  0.492, whereas it is much more challenging to tell a ClePhO from a BH through EM measurements.

Shadows.

BHs appear on a bright background as a dark patch on the sky, due to photons captured by the horizon. This feature is known as the BH shadow [1690, 1691]. The event horizon telescope [1692], aiming at obtaining sub-millimeter images of the shadow of the supermassive object Sgr A* at the center of the Milky Way and of the supermassive object at the center of the elliptical galaxy M87, is a strong motivation for these studies.

A rigorous and more technical definition of the BH shadow is the following [1693]: it is the set of directions on an observer's local sky that asymptotically approach the event horizon when photons are ray traced backwards in time along them. Thus, by this definition, shadows are intrinsically linked with the existence of a horizon and, strictly speaking, an ECO cannot have a shadow. However, in practice ultracompact horizonless objects (in particular UCOs and ClePhOs) might be very efficient in mimicking the exterior spacetime of a Kerr BH. It is therefore interesting to study photon trajectories in such spacetimes and to contrast them with those occurring in the Kerr case.

In the analysis of shadows, one generally either considers parametrized spacetimes [1564, 16941696] (that allow to tune the departure from Kerr but might not map to known solutions of some theory), or takes into account a specific alternative theoretical framework [16971704] or a particular compact object within GR [17051708]. Most of these studies obtain differences with respect to the standard Kerr spacetime that are smaller than the current instrumental resolution (<10 $\mu$ as). Some studies report more noticeable differences in the case of naked singularities [1705], exotic matter that violates some energy condition [1706], or in models that allow for large values of the non-Kerrness parameters [1701, 1702]. Moreover, demonstrating a clear difference in the shadow is not enough to infer that a test can be made, since such difference might be degenerate with the mass, spin, distance, and inclination of the source. Finally, the fact that horizonless objects lead to shadow-like regions that can share some clear resemblance with Kerr [1708] shows the extreme difficulty of an unambiguous test based on such observables (for perspective tests with current observations see, e.g. [1709, 1710]).

Kα iron line and continuum-fitting.

The x-ray spectra of BHs in x-ray binaries and AGNs are routinely used to constrain the spin parameter of BHs, assuming a Kerr metric [1711, 1712]. In particular, the iron K$\alpha$ emission line, that is the strongest observed and is strongly affected by relativistic effects, is an interesting probe.

Many authors have recently investigated the x-ray spectral observables associated to non-Kerr spacetimes [1713]. The same distinction presented above for the tests of shadows can be made between parametric studies [17141716], specific alternative theoretical frameworks [1700, 1715, 17171719], and alternative objects within classical GR [1720, 1721].

Although differences with respect to Kerr are commonly found in various frameworks, these are generally degenerate with other parameters, such as mass, spin and inclination of the source. Moreover, the precise shape of the iron line depends on the subtle radiative transfer in the accretion disk, which is ignored in theoretical studies that generally favor simple analytic models. This is a great advantage of the shadow method, which is probably the EM probe least affected by astrophysical systematics.

QPOs.

Quasi-periodic oscillations (QPOs) are narrow peaks in the power spectra, routinely observed in the x-ray light curves of binaries. QPOs are of the order of Keplerian frequencies in the innermost regions of the accretion flow [1722].

A series of recent works were devoted to studying the QPO observables associated to alternative compact objects, be it in the context of parametric spacetimes [1723, 1724], alternative theoretical frameworks [1700, 1725, 1726], or alternative objects within GR [1727]. Although the QPO frequencies can be measured with great accuracy, QPO diagnostics suffer from the same limitations as the x-ray spectrum: degeneracies and astrophysical uncertainties. The non-Kerr parameters are often degenerate with the object's spin. Moreover, it is currently not even clear what the correct model for QPOs is.

Pulsar-BH systems.

High-precision timing observations of radio pulsars provide very sensitive probes of the spacetime in the vicinity of compact objects. Indeed, the first evidence for the existence of gravitational waves were obtained from observations of binary pulsars [1728]; light propagation in strong gravitational effects can be precisely tested with Shapiro delay experiments [1729]; relativistic spin-precession can be studied by examining pulse shape and polarisation properties of pulsars [1730, 1731]. The very same techniques can be applied for pulsars to be found around BHs [1732].

While shadows, accretion-disk spectra, and QPOs are probing the near field of the BH, i.e. on a scale of a few Schwarzschild radii, it is not expected to find a pulsar that close to a BH. The lifetime of such a pulsar-BH system due to gravitational wave damping is very small, which makes a discovery of such a system extremely unlikely. This is also true for a pulsar around Sgr A*, although observational evidence stills point towards an observable pulsar population in the Galactic Centre [1733]. Consequently, a pulsar-BH system, once discovered, is expected to provide a far field test, i.e. a test of only the leading multipole moments of the BH spacetime, in particular mass $M_\bullet$ , spin $S_\bullet$ and—for IMBHs and Sgr A*—the quadrupole moment $Q_\bullet$ [17341736]. On the one hand, the measurement of $M_\bullet$ , $S_\bullet$ and $Q_\bullet$ can be used to test the Kerr hypothesis. On the other hand, a pulsar can provide a complementary test to the near field test, and break potential degeneracies of, for instance, a test based on the shadow of the BH [1695, 1710, 1737].

A pulsar in a sufficiently tight orbit (period $\lesssim $ few days) about a stellar-mass BH is expected to show a measurable amount of orbital shrinkage due to the emission of gravitational waves. Alternatives to GR generally show the existence of 'gravitational charges' associated with additional long-range fields, like e.g. the scalar field in scalar-tensor theories. Any asymmetry in such 'gravitational charge' generally leads to the emission of dipolar radiation, which is a particular strong source of energy loss, as it appears at the 1.5 post-Newtonian level in the equations of motion. Of particular interest here are theories that give rise to extra gravitational charges only for BHs and therefore evade any present binary pulsar tests. Certain shift-symmetric Horndeski theories known to have such properties [1406, 1408, 1409, 1422], where a star that collapses to a BH suddenly acquires a scalar charge in a nontrivial manner [1582, 1583] (see sections 2.2 and 4.2). Based on mock timing data, Liu et al [1735] have demonstrated the capability of utilizing a suitable pulsar-BH system to put tight constraints on any form of dipolar radiation.

Finally, there are interesting considerations how a pulsar-BH system could be used to constrain quantum effects related to BHs. For instance, there could be a change in the orbital period caused by the mass loss of an enhanced evaporation of the BH, for instance due to an extra spatial dimension. An absence of any such change in the timing data of the pulsar would lead to constraints on the effective length scale of the extra spatial dimension [1738]. If quantum fluctuations of the spacetime geometry outside a BH do occur on macroscopic scales, the radio signals of a pulsar viewed in an (nearly) edge-on orbit around a BH could be used to look for such metric fluctuations. Such fluctuations are expected to modify the propagation of the radio signals and therefore lead to characteristic residuals in the timing data [1739].

Given the prospects and scientific rewards promised by PSR-BH systems, searches are on-going to discover these elusive objects. Pulsars orbiting stellar-mass BHs are expected to be found in or near the Galactic plane. Since binary evolution requires such system to survive two supernova explosions, this implies a low systemic velocity, placing it close to its original birth place. On-going deep Galactic plane surveys, like those as part in the High Resolution Universe Survey [1740] or upcoming surveys using the MeerKAT or FAST telescopes, clearly have the potential to uncover such systems. Looking at regions of high stellar density, one can expect even to find millisecond pulsars around BHs due to binary exchange interactions, making globular clusters [1741] and the Galactic Centre [1742] prime targets for current and future surveys. As discussed, finding a pulsar orbiting Sgr A* would be particularly rewarding. Past surveys are likely to have been limited by sensitivity and scattering effects due to the turbulent interstellar medium although the discovery of a radio magnetar in the Galactic Center [1743] indicates that the situation may be more complicated than anticipated [1744]. Searches with sensitive high-frequency telescopes will ultimate provide the answer [1745]. Meanwhile, sensitive timing observations of pulsars in Globular cluster can also probe the proposed existence of Intermediate Mass Black Holes (IMBHs) in the cluster centres by sensing how the pulsars 'fall' in the cluster potential. In some clusters, prominent claims [1746] can be safely refuted [1747], while in other clusters IMBHs may still exist in the centre [1748, 1749] or their potential mass can at least be constrained meaningfully [1750].

In summary, current and future radio pulsars observations have the potential to study BH properties over a wide mass range, from stellar-mass to super-massive BHs providing important complementary observations presented in this chapter;

4.4.2. GW diagnostics.

In contrast to EM probes, GW tests are able to probe also the interior of compact objects and are much less affected by the astrophysical environment [1526]. Thus, they are best suited to constraining all classes of ECOs.

GW tests based on the ringdown phase.

The remnant of a binary merger is a highly distorted object that approaches a stationary configuration by emitting GWs during the 'ringdown' phase. If the remnant is a BH in GR, the ringdown can be modeled as a superposition of damped sinusoids described by the quasinormal modes (QNMs) of the BH [1118, 1510, 1511] (see section 3.3). If the remnant is an ECO, the ringdown signal is different:

  • For UCOs, the ringdown signal can be qualitatively similar to that of a BH, but the QNMs are different from their Kerr counterpart [15421544]. The rates of binary mergers that allow for QNM spectroscopic tests depends on the astrophysical models of BH population [1517]. The estimated rates are lower than $1/{\rm yr}$ for current detectors even at design sensitivity. On the other hand, rates are higher for third-generation, Earth-based detectors and range between a few to 100 events per year for LISA, depending on the astrophysical model [1517]. Even if the ringdown frequencies of a single source are hard to measure with current detectors, coherent mode stacking methods using multiple sources may enhance the ability of aLIGO/aVirgo to perform BH spectroscopy [1540]. Such procedure is dependent upon careful control of the dependence of ringdown in alternative theories on the parameters of the system (mass, spin, etc).
  • For ClePhOs, the prompt post-merger ringdown signal is identical to that of a BH, because it excited at the light ring [1545, 1547]. However, ClePhOs generically support quasi-bound trapped modes [1751, 1752] which produce a modulated train of pulses at late times. The frequency and damping time of this sequence of pulses is described by the QNMs of the ClePhO (which are usually completely different from those of the corresponding BH) [1753, 1754]. These modes were dubbed 'GW echoes' and appear after a delay time that, in many models, scales as $\tau_{\rm echo}\sim M\log(1-2M/R)$ [1546]. Such logarithmic dependence is key to allow for tests of Planckian corrections at the horizon scale [1546, 1547, 1755]. Models of ultracompact stars provide GW echoes with a different scaling [1547, 1756], the latter being a possible smoking gun of exotic state of matter in the merger remnant.
  • In addition to gravitational modes, matter modes can be excited in ECOs [1500]. So far, this problem has been studied only for boson stars [1189, 1192] and it is unclear whether matter QNMs would be highly redshifted for more compact ECOs [1547].
GW tests based on the inspiral phase.

The nature of the binary components has a bearing also on the GW inspiral phase, chiefly through three effects:

  • Multipolar structure. As a by-product of the BH uniqueness and no-hair theorems [1515], the mass and current multipole moments $ \newcommand{\e}{{\rm e}} (M_{\ell}, {S}_\ell)$ of any stationary, isolated BH can be written only in terms of mass $ \newcommand{\e}{{\rm e}} M\equiv M_0$ and spin $ \newcommand{\e}{{\rm e}} \chi\equiv {S_1}/{M^2}$ . The quadrupole moment of the binary components enters the GW phase at 2PN relative order, whereas higher multipoles enter at higher PN order [911]. The multipole moments of an ECO are generically different, e.g. ${M}_2^{\rm ECO}={M}_2^{\rm Kerr}+\delta q(\chi, M/R)M^3$ , and it is therefore possible to constrain the dimensionless deviation $\delta q$ by measuring the 2PN coefficient of the inspiral waveform. This was recently used to constrain ${{\mathcal O}}(\chi^2)$ parametrized deviations in $\delta q$ [1319]. It should be mentioned that ECOs will generically display higher-order spin corrections in $\delta q$ and that—at least for the known models of rotating ultracompact objects [17571759]—the multipole moments approach those of a Kerr BH in the high-compactness limit. Moreover, the quadrupole PN correction is degenerate with the spin–spin coupling. Such degeneracy can be broken using the I-Love-Q relations [1760, 1761] for ECOs, as computed for instance in the case of gravastars [1757, 1758].
  • Tidal heating. In a binary coalescence the inspiral proceeds through energy and angular momentum loss in the form of GWs emitted to infinity. If the binary components are BHs, a fraction of the energy loss is due to absorption at the horizon [17621767]. This effect introduces a 2.5PN $\times\log v$ correction to the GW phase of spinning binaries, where $v$ is the orbital velocity. The sign of this correction depends on the spin [17631765], since highly spinning BHs can amplify radiation due to superradiance [1152]. In the absence of a horizon, GWs are not expected to be absorbed significantly, and tidal heating is negligible [1689, 1768]. Highly spinning supermassive binaries detectable by LISA will provide a golden test of this effect [1768].
  • Tidal deformability. The gravitational field of the companion induces tidal deformations, which produce an effect at 5PN relative order, proportional to the so-called tidal Love numbers of the object [914]. Remarkably, the tidal Love numbers of a BH are identically zero for static BHs [17691772], and for spinning BHs to first order in the spin [17731775], and to second order in the axisymmetric perturbations [1774]. On the other hand, the tidal Love numbers of ECOs are small but finite [1757, 1758, 1776, 1777]. Thus, the nature of ECOs can be probed by measuring the tidal deformability, similarly to what is done to infer the nuclear equation of state in NSBs [21, 1778, 1779]. Analysis of the LIGO data shows that interesting bounds on the tidal deformability can be imposed already, to the level that some boson star models (approximated through a polytropic fluid) can be excluded [1780]. The tidal Love numbers of a ClePhO vanish logarithmically in the BH limit [1777], providing a way to probe horizon scales. For Planckian corrections near the horizon, the tidal Love numbers are about 4 orders of magnitude smaller than those of a NS. It is therefore expected that current and future ground-based detectors will not be sensitive enough to detect such small effect, while LISA might be able to measure the tidal deformability of highly-spinning supermassive binaries [1768].

Finally, it is possible that ECOs' low-frequency modes are excited during the inspiral, leading to resonances in the emitted GW flux [1543, 1781, 1782]. Low-frequency modes certainly arise in the gravitational sector, as we discussed already. In addition, fluid modes at low frequency can also be excited, although this issue is poorly studied.

Challenges in modeling ECO coalescence waveforms.

With the notable exception of boson stars [1671], very little is known about the dynamical formation of isolated ECOs or about their coalescence. While the early inspiral and post-merger phases can be modelled within a PN expansion and perturbation theory, respectively, searches for ECO coalescence require a full inspiral-merger-ringdown waveform. Some combination of numerical and semianalytical techniques—analog to what is done to model precisely the waveform of BH binaries [1097, 1107, 1293]—is missing.

Concerning the post-merger ringdown part alone, it is important to develop accurate templates of GW echoes. While considerable progress has been recently done [1753, 17831789], a complete template which is both accurate and practical is missing. This is crucial to improve current searches for echoes in LIGO/Virgo data [1755, 17901796]. Model-independent burst searches have recently been reported, and will be instrumental in the absence of compelling models [1797].

5. The dark matter connection

Contributors: G Bertone, D Blas, R Brito, C Burrage, V Cardoso, L. Urena-López

The nature and properties of DM and dark energy in the Universe are among the outstanding open issues of modern Cosmology. They seem to be the responsible agents for the formation of large scale structure and the present accelerated phase of the cosmic expansion. Quite surprisingly, there is a concordance model that fits all available set of observations at hand. This model is dubbed $\Lambda$ CDM because it assumes that the main matter components at late times are in the form of a cosmological constant for the dark energy [1798, 1799] and a pressureless component known as cold DM [1800]. These two assumptions, together with the theoretical basis for GR, make up a consistent physical view of the Universe [635].

The nature of the missing mass in the Universe has proven difficult to determine, because it interacts very feebly with ordinary matter. Very little is known about the fundamental nature of DM, and models range from ultralight bosons with masses $\sim $ $10^{-22} ~{\rm eV}$ to BHs with masses of order $10\, M_{\odot}$ . Looking for matter with unknown properties is extremely challenging, and explains to a good extent why DM has never been directly detected in any experiment so far. However, the equivalence principle upon which GR stands—tested to remarkable accuracy with known matter—offers a solid starting point. All forms of energy gravitate and fall similarly in an external gravitational field. Thus, gravitational physics can help us unlock the mystery of DM: even if blind to other interactions, it should still 'see' gravity. The feebleness with which DM interacts with baryonic matter, along with its small density in local astrophysical environments poses the question of how to look for DM signals with GWs.

5.1. BHs as DM

In light of the LIGO discoveries there has been a revival of interest in the possibility that DM could be composed of BHs with masses in the range 1–$100 M_{\odot}$ [225, 228, 284, 1801, 1802]. To generate enough such BHs to be DM, they would need to be produced from the collapse of large primordial density fluctuations [283, 1803, 1804]. The distribution of the BH masses that form depends on the model of inflation [1805]. Such BHs can be produced with sufficiently large masses that they would not have evaporated by the current epoch. Alternatively, DM could be composed of ultracompact horizonless objects for which Hawking radiation is suppressed [1806]. Different formation scenarios and constraints on such objects were reviewed in chapter I, section 6.

If all of the DM is composed of such heavy compact objects a key signature is the frequency of microlensing events [1807]. Microlensing is the amplification, for a short period of time, of the light from a star when a compact object passes close to the line of sight between us and the star. How frequent and how strong these events will be, depends on the distribution of BH masses [18081810]. It has been claimed that DM composed entirely of primordial BHs in this mass range is excluded entirely by microlensing [283]; however, such study assumed that the BH mass distribution was a delta function. If the mass distribution is broadened the tension with the microlensing data weakens [317, 318, 1811], although whether realistic models can be compatible with the data remains a subject of debate [156]. Further observational signatures include the dynamical heating of dwarf galaxies, through two body interactions between BHs and stars [226, 314], electromagnetic signatures if regions of high BH density also have high gas densities (such as the center of the galaxy) [316, 1812], constraints from the CMB due to energy injection into the plasma in the early universe [310, 312] and from the (absence of) a stochastic background of GWs [1802].

At the very least primordial BHs in the LIGO mass range can be a component of the DM in our universe. Future GW observations determining the mass and spatial distribution of BHs in our galaxy will be key to testing this hypothesis.

5.2. The DM Zoo and GWs

Despite the accumulation of evidence for the existence of DM, the determination of its fundamental properties remains elusive. The observations related to GWs may change this. GWs and their progenitors 'live' in a DM environment to which they are sensitive (at least gravitationally though other interactions may occur). Furthermore, the DM sector may include new particles and forces that can change the nature and dynamics of the sources.

Before making any classification of DM candidates, we remind that the preferred models are those that: (i) explain all the DM, (ii) include a natural generation mechanism, (iii) can be added to the SM to solve other (astrophysical or fundamental) puzzles, (iv) can be tested beyond the common DM gravitational effects. None of these properties is really necessary, but their advantages are clear.

A first distinction one can make when classifying DM models is whether the state of DM is a distribution of fundamental particles or of compact objects. The masses of compact objects are determined by astrophysical evolution from initial overdensities in a matter field. As a consequence, the range and distribution of masses is quite broad. These models and their bounds are discussed in section 5.1 (see also section 3 for more details about astrophysical BHs).

Fundamental particles have fixed mass, which constrains the mass to be $m_\chi \lesssim M_P$ to treat gravity within an EFT formalism (see [1813] for a recent study of the top range). Furthermore, there is no candidate within the SM for these fundamental particles. Particles with typical energy scales in the electroweak range (WIMPs) have been preferred in the past because they satisfy the four points above [281]. Other popular candidates include axions and sterile neutrinos (see e.g. [282]).

The first classification of particle models is based on the DM spin. Fermionic DM can only exist with masses above142 $m_\chi \gtrsim$ keV. This is because the phase-space available for fermions in virial equilibrium in dwarf spheroidals is limited [1814]. The limit $m_\chi \gtrsim$ keV always applies for models where the DM was generated thermally. In this case, the distribution of DM particles is too 'hot' for lighter masses and there is a suppression of the growth of cosmological structures at small scales, in tension with observations [1815]. Other 'out-of-equilibrium' production mechanisms allow for 'cold' distributions for all possible masses, see e.g. [1816] and references therein. The extreme situation are models where a bosonic candidate is generated 'at rest' as happens in the cases of misalignment or inflationary generation, see [1817]. The fundamental limitation is then $m_\chi \gtrsim 10^{-22}$ eV, which comes from the wiping of structure at scales smaller than the de Broglie wavelength of the candidate. This scale should allow for the existence of dwarf spheroidals. Ultra-light candidates incorporate the existence of oscillating modes with coherence times long enough to generate new phenomena as compared to standard candidates (see below).

The complexity of the DM sector is also unknown. Models that incorporate natural candidates for DM (e.g. supersymmetric models or axion models) provide concrete proposals, but one should keep an open mind when thinking about DM phenomenology. The existence of a coupling DM-SM beyond the gravitational one may mean an extra interaction of gravitating bodies among themselves or with the DM background. This can modify the orbital properties and open new emission channels, if the DM candidate is light enough. Besides, different DM models may admit different DM distributions, depending on their production mechanism and type of interactions. The common feature is the existence of spherical DM halos, but more complex structures (mini-halos, DM disks, breathing modes,....) may occur.

Finally, the distribution of DM close to the galactic center is also quite uncertain [1818]. This is because baryonic effects are more important. Still, one may be able to distinguish certain features, as the existence of huge overdensities, or solitons, characteristic of some DM models [1819]. For instance, the prediction of solitons in center of galaxies for masses $m\sim (10^{-22}\div 10^{-21})$ eV has been recently used in [1820] to claim that this mass range is in tension with data.

5.3. The cosmological perspective on DM: scalar field dark matter

Requiring a consistent cosmology limits some of the DM candidates, if they are to solve both the dark energy and DM puzzles. An intense experimental search in the last years has successfully imposed stringent limits on the interactions between DM and 'ordinary' matter [18211823]. The weakly interacting massive particle (WIMP) hypothesis [1824], while appealing, is under extreme pressure. Thus, alternative models need to be seriously considered [1825]. Among the vast number of possibilities, scalar fields have become increasingly tempting [1799, 1826]. Scalar field DM is a model where the properties of the DM can be represented by a relativistic scalar field $\phi$ endowed with an appropriate scalar potential $V(\phi)$ [1817, 18271830]. In the relativistic regime, the equation of motion for the scalar is the Klein–Gordon equation $\partial_\mu \partial^\mu \phi - \partial_\phi V=0$ , whereas the non-relativistic regime leads to a Schrödinger-type equation for the wave function. At the quantum level, the scalar represents the mean field value of a coherent state of boson particles with a large occupation number.

Although many possibilities exist for the potential $V(\phi)$ in DM models, it should possess a minimum at some critical value $\phi_c$ around which we can define a mass scale m for the boson particle, $ \newcommand{\e}{{\rm e}} m\equiv \partial^2_\phi V(\phi_c)$ . The simplest possibility is the parabolic potential $V(\phi) = (1/2) m^2 \phi^2$ (with $\phi_c =0$ ), but higher order terms could play a role at higher energies. One representative example of the scalar field hypothesis is the axion from the Peccei-Quinn mechanism to solve the strong CP problem, for which the potential has the trigonometric form $V(\phi) = m^2 f^2 [1-\cos(\phi/f)]$ , where f is the so-called decay constant of the axion particle [1817]. In contrast to the parabolic potential, the axion potential is periodic, has an upper energy limit $V(\phi) \leqslant m^2 f^2$ and includes contributions from higher-order terms $\phi^4,\phi^6,\ldots$ [1831, 1832]. For simplicity, we present below a summary of results obtained for the parabolic potential and their constraints arising from cosmological and astrophysical observations.

5.3.1. Cosmological background dynamics.

Because of the resemblance of the KG equation to that of the harmonic oscillator, there are two main stages in the evolution of the scalar field: a damped phase with the Hubble constant $H \gg m$ during which $\phi \simeq {\rm const.}$ , and a stage of rapid field oscillations around the minimum of the potential that corresponds to $H \ll m$ . A piece-wise solution for the two regimes can be envisaged from semi-analytical studies, but the real challenge arises in numerical simulations for which one desires a continuum solution for all the field variables. The two stages can be put together smoothly in numerical simulations [1833]. The evolution of the energy density $\rho_\phi = (1/2) \dot{\phi}^2 + (1/2) m^2\phi^2$ should transit from $\rho_\phi = {\rm const}.$ to $\rho_\phi \propto a^{-3}$ (the expected one for a DM component) before the time of radiation-matter equality to obtain the correct amount of DM at the present time. A lower bound on the boson mass $m \geqslant 10^{-26} {\rm eV}$ arises from such a requirement [1833].

5.3.2. Cosmological linear perturbations.

For this to be a successful model, the scalar field DM must allow the formation of structure. Generically, it is found that small density fluctuations $\delta_\rho/\rho$ grow (decay) for wavelengths $k^2 < 2 a^2 m H$ ($k^2 > 2 a^2 m H$ ), which indicates the existence of an effective time-dependent Jeans wavenumber defined by $k^2_J = 2 a^2 m H$ , where a is the scale factor. The most noticeable aspect about the growth of linear perturbations is that the power spectrum has the same amplitude as that of CDM for large enough scales (ie k  <  kJ), and that a sharp-cut-off appears for small enough scales (ie k  >  kJ). The straightforward interpretation is that the SFDM models naturally predicts a dearth of small scale structure in the Universe, and that such dearth is directly related to the boson mass. This time the lower bound on the boson mass is somehow improved to $m \geqslant 10^{-24} {\rm eV}$ [1833, 1834], but one is limited by the non-linear effects on smaller scales to impose a better constraint.

5.3.3. Cosmological non-linear perturbations.

Although N-body simulations have been adapted to the field description required for SFDM, they cannot capture the full field dynamics and the best option remains to solve directly the so-called Schrödinger–Poisson system. Some studies suggest that the non-linear process of structure formation proceeds as in CDM for large enough scales [1819, 1835, 1836].

The gravitationally bound objects that one could identify with DM galaxy halos all have a common structure: a central soliton surrounded by a NFW-like envelope created by the interference of the Schrödinger wave functions [1819, 1837]. In terms of the standard nomenclature, the scalar-field DM model belongs to the so-called non-cusp, or cored, types of DM models (towards which evidence is marginally pointing). Simulations suggest a close relationship between the soliton Ms and the total halo mass Mh of the form $M_s = \gamma(a) M^{1/3}_h$ , so that massive halos also have massive central solitons [1819, 1835, 1837]. Here $\gamma(a)$ is a time-dependent coefficient of proportionality.

These results then predict a minimum halo mass $M_{h,{\rm min}} = \gamma^{3/2}(1) \,\simeq $ $ (m/10^{-22}\, {\rm eV}){}^{-3/2} \, 10^7 M_\odot$ when the galaxy halo is just the central soliton. But the soliton scale radius rs is related to its mass via $M_s r_ s \simeq 2 (m_{\rm Pl}/m){}^2$ , where $m_{\rm Pl} = G^{-1/2}$ is the Planck mass. For the aforementioned values of the soliton mass, $r_s = 300 \, {\rm pc}$ if $M_s=10^7 \, M_\odot$ , in remarkable agreement with observations [1838] (see, however, [1839]). Furthermore, the previous relation between the soliton mass and the host halo mass can be used to predict the presence of a second peak in the velocity of rotation curves, closer to the galactic center [1820]. This prediction is in tension with data for masses in the range $m\sim (10^{-22}\div 10^{-21})$ eV [1820].

Finally, non-linear results also constrain the scalar mass through comparison with observations inferred from the Lyman-$\alpha$ forest [1840]. These constraints require dedicated N-body simulations with gas and stars, so that the flux of quasars can be calculated. The constraints are obtained indirectly and subjected to uncertainties arising from our limited knowledge on the formation of realistic galaxies. In a first approximation we can estimate the power spectrum of the transmitted flux of the Lyman-$\alpha$ forest at linear order, and obtain a lower limit on the boson mass $m > 10^{-21}~{\rm eV}$ if SFDM is to fulfill the DM budget completely [1841].

In conclusion, scalar-field models of dark energy can account for the whole of DM as well, provided the scalar has a mass of order $10^{-21}~{\rm eV}$ . This model explains also the seemingly cored density profile in the central parts of the galaxies.

5.4. The DM environment

In the following we provide a (hopefully agnostic) view on how different DM models may affect GW emission by compact objects. The environment in which compact objects live—be it the interstellar medium plasma, accretion disks or DM—can influence the GW signal in different ways:

  • By modifying the structure of the compact objects themselves. Accretion disks, for example, alter the spacetime multipolar structure relative to the standard Kerr geometry. Accretion disks are also known to limit the parameter space of Kerr BHs, preventing them from spinning beyond $cJ/(GM^2)\sim 0.998$ [1099]. Likewise, if DM behaves as heavy particles, its effects parallel that of baryonic matter either in accretion disks or in the interstellar medium.In models where a particle description is insufficient (for example, if DM assumes the form of fundamental light bosons), Kerr BHs can either be unstable, or not be an equilibrium solution of the field equations at all [1401, 1513, 1603] (see also discussion in section 4). Some of the simplest forms of axionic DM have a more mild effect instead, contributing to the spinning down of astrophysical BHs [1143, 1842, 1843].Independently of the nature of DM, compact stars evolving in DM-rich environments may accrete a significant amount of DM: DM is captured by the star due to gravitational deflection and a non-vanishing cross-section for collision with the star material [1163, 18441847]. The DM material eventually thermalizes with the star, and accumulates inside a finite-size core [1163, 1845, 1846, 1848].
  • By altering the way that GWs are generated. The different equilibrium configuration of the compact objects, together with the possible self-gravity effects of DM and different structure of the field equations gives rise, generically, to a different GW output from that predicted in vacuum GR.
  • The environment—interstellar dust, a cosmological constant or DM—must of course affect also the way that GWs and electromagnetic waves propagate. It is a common-day experience that light can be significantly affected by even low-density matter. For example, low-frequency light in a plasma travels at the group velocity $v_g=c\sqrt{1-\omega_p^2/\omega^2}$ , with $\omega_p$ the plasma frequency $ \newcommand{\e}{{\rm e}} \omega_p^2=\frac{n_0 q^2}{4\pi\epsilon_0 \,m}$ . Here, n0 is the particle number density, q the electron charge and m its mass. Thus, light is delayed with respect to GWs, by an amount $\delta t$ directly proportional to the total distance they both traversed
    Equation (39)
    where in the last equality we substituted numbers for the very latest observation of GWs with an electromagnetic counterpart [22]. These numbers could be interesting: given the observed time delay of several days for radio waves, one may get constraints for models where DM consists of mili-charged matter (i.e. models where q and m may be a fraction of the electron charge and mass).Cold DM affects the propagation of GWs by inducing a small frequency dependent modification of the propagation speed [1849]. Furthermore, the effect of viscous fluids or oscillators on the passage of GWs are discussed in [1850, 1851]. These calculations indicate that the effect may be too small to be measurable in the near future.

These different effects have, so far, been explored and quantified in the context of a few possible mechanisms where DM plays a role.

5.5. Accretion and gravitational drag

The formation and growth of compact objects may lead to large overdensities of DM around them [18521854]. Whatever the nature and amount of DM surrounding them, a binary of two BHs or compact stars evolving in a DM-rich environment will be subjected to at least two effects, depicted in figure 11. As it moves through the medium it accretes the surrounding material, but also exherts a gravitational pull (known as 'gravitational drag') on all the medium, which affects the inspiral dynamics. To quantify these effects, it is important to know how DM behaves. Collisionless DM causes, generically, a gravitational drag different from that of normal fluids [1543, 1782]. The gravitational drag caused by DM which is coherent on large scales may be suppressed [1829], but further work is necessary to understand this quantitatively.

Figure 11. Refer to the following caption and surrounding text.

Figure 11. Depiction of a BBH evolving in a possible interstellar or DM environment. Each individual BH accretes and exherts a gravitational pull on the surrounding matter. Both effects contribute to decelerate the BHs, leading to a faster inspiral. Credit: Ana Sousa.

Standard image High-resolution image

The phase evolution of a pair of BHs, taking gravitational radiation, accretion and drag as well as the DM self-gravity was studied recently [1526, 1543, 1782, 18551857]. These estimates are all Newtonian, and the results depend crucially on the local DM density. Perhaps more importantly, back-reaction on the DM distribution was not taken into account in a systematic manner. The only studies available including backreaction, look only at the very last orbits of the binary, and are unlikely to capture well the full physics of DM dispersal [1103].

5.6. The merger of two dark stars

Gravitational drag and accretion have a cumulative effect: even though their magnitude is small, the effect piles up over a large number of orbits. By contrast, drag and accretion have a tiny effect during merger. The merger signal carries, mostly, an imprint of the configuration of the colliding objects. For BHs, the no-hair results strongly suggest that the colliding objects belong to the Kerr family, and it seems hopeless to find any DM imprints here. However, the no-hair results can be by-passed, as they assume stationarity of the spacetime or certain matter properties, as explained in section 4 [1401, 1513, 1603]. Detailed investigations of BHs other than Kerr are still to be done. Smoking-gun effects arising from the merger of such objects are, likewise, unknown.

As we discussed, compact stars can grow DM cores at their center. It is conceivable that the DM core might imprint a signature on different phases of the coalescence of two of these objects. The effects of tidal deformations within the gravitational wave signal produced by binary DM stars have been investigated in [1858]. Moreover, the imprint on the post-merger spectrum, so far only for analog-mechanical models, has been studied in [1859].

5.7. Non-perturbative effects: superradiance and spin-down

Spinning BHs are huge energy reservoirs from which all rotational energy can ultimately be extracted. In particular, due to the absence of a Pauli exclusion principle for bosons, bosonic waves can effectively extract energy from spinning BHs through a process known as rotational superradiance [1152]. The superradiant energy extraction might be used to rule out (or detect) ultralight bosons that might be a significant component of DM, such as the QCD axion or axion-like particles predicted by the string axiverse scenario [1143], dark photons [1523, 1860] and even massive spin-2 fields [1554]. For boson masses in the range 10−21–10−11 eV, their Compton wavelength is of the order of the horizon of typical astrophysical BHs, the gravitational coupling of these two objects is strongest, and long-lived quasi-bound states of bosonic particles around BHs can form. For rotating BHs these states are typically superradiant, thus becoming an effective tap of rotational energy. The extracted energy condenses as a bosonic cloud around the BH producing 'gravitational atoms' [1143, 1162, 1842]. The evolution of these systems leads to several observational channels. For complex bosonic fields, GW emission is suppressed and the end-state of the superradiant instability might be a Kerr BH with bosonic hair [1164, 1167]. These solutions are themselves unstable [1626] which, over the age of the Universe, would likely lead to a slowly-rotating BH surrounded by a bosonic cloud populating the most unstable modes. On the other hand for real bosonic fields, this cloud would disperse over long timescales though the emission of nearly-monochromatic GWs, either observable individually or as a very strong stochastic GW background [1843, 18611865]. These are appealing GW emitters with appropriate amplitude and frequency range for Earth- and space-based detectors. Although estimates from [1843, 1865] suggest that current LIGO data can already be used constrain some range of scalar field masses, an analysis on real data has not yet been performed.

These models also predict that BHs affected by the instability would spin-down in the process, and therefore the mere observation of rotating BHs can place severe limits on these particles, thereby strongly constraining the theory. In fact, observations of spinning BHs can already be used to bound the photon [1523, 1860] and graviton masses [1554]. These studies demonstrate that BHs have an enormous potential as theoretical laboratories, where particle physics can be put to the test using current observations.

All these studies neglect the possible decay of ultralight bosonic fields into Standard Model particles which is justified given the current constraints on ultralight bosons in this mass range (see e.g. [1817, 1866]). However, the superradiant instability leads to the generation of boson condensates around BHs with a very large particle number, which could enhance their decay through channels other than gravitational. For the coupling typical of the QCD axion with photons, stimulated decay into photon pairs becomes significant for axion masses above $\gtrsim$ 10−8 eV, and consequently for BHs with masses $\lesssim $ $ 0.01 M_{\odot}$ [1867]. If such systems exist, they would lead to extremely bright lasers with frequencies in the radio band. For the expected mass of the QCD axion $\sim $ 10−5 eV, such lasing effect would occur around primordial BHs with masses around $\sim $ $10^{24}~{\rm Kg}$ and lead to millisecond radio-bursts, which could suggest a link to the observed fast radio bursts [1867].

In addition to these effects, the coupling of light DM bosons with Standard Model particles may also turn NSs unstable against superradiant amplification. Rotational superradiance is in fact a generic process and can happen in any rotating object, as long as some sort of dissipation mechanism is at work [1152]. Thus, even NSs can be superradiantly unstable against some models of DM, such as dark photons coupled to charged matter, which can already be used to put constraints on these models [1868].

An important question that remains to be fully investigated is: if ultralight bosons do exist and we detect them through one or several of these channels, will we be able to start using compact objects to do precision particle physics, in particular can one infer fundamental properties such as their mass, spin and fundamental interactions? This will require a better understanding of higher-spin fields around rotating BHs.

Although some progress has been made to deal with massive vector fields using time-domain simulations, either in full GR [1148, 1162] or by fixing the background geometry to be a Kerr BH [1146, 1869], frequency-domain computations, which are more appropriate to span the parameter space, have mainly been dealt with by employing a small-spin approximation [1523, 1554, 1860], using an analytical matched asymptotics approach [1864] or using a field theory approach [1870], these last two being only valid in the Newtonian limit. Computations of the instability rates without employing any of these approximations have only recently started to be available [1871, 1872]. An estimate of the stochastic background coming from vector or tensor fields has also not been computed. Finally, besides some some estimates on the GWs from a bosenova collapse due to axion self-interations [1873], a detailed analysis of how self-interactions or interactions with other fields could affect the GW emission and its detectability is still missing.

5.8. Pulsar timing

The gravitational drag from particle DM candidates in the orbital motion of binary pulsars has been studied in [1874, 1875]. This may be important to constrain DM models that generate a thick DM disk [1875] or to study the DM density closer to the galactic center, if a pulsar binary is discovered there and can be timed to high precision.

DM, specially if under the form of light bosons, can also produce other peculiar effects, accessible through EM observations. In models where DM comes in the form of an ultralight boson, the DM configuration is able to support itself through pressure, which is in turn caused by a time-dependence of the field. This time-periodicity gives rise to an oscillating gravitational potential which is able to produce tiny, but potentially measurable, differences in the time-of-arrival signal from precise clocks, such as pulsars [1876, 1877].

5.9. Mini-galaxies around BHs

In addition, in these theories, BHs can grow (permanent or long-lived) time-oscillating, non-axisymmetric bosonic structures [1152]. Planets or stars in the vicinity of these BH are subjected to a periodic forcing, leading to Lindblad and co-rotation resonances. This phenomenon is akin to the forcing that spiral-armed galaxies exhert on its planets and stars, and it is conceivable that the same type of pattern appears, on smaller-scales, around supermassive hairy BHs [1857, 1878].

5.10. GW detectors as DM probes

Finally, there are interesting proposals to use the GW detectors themselves as probes of DM, when used as extremely sensitive accelerometers [18791881].

Postscript

Gravity sits at the heart of some of the most important open problems in astrophysics, cosmology and fundamental physics, making it a subject of strong interdisciplinarity. Black holes are, in some ways, the 'atoms' of gravity, the 'simplest' astrophysical objects, yet they harbour some of the most remarkable predictions of GR, including that of its own ultimate failure. Gravitational-wave astronomy will allow us to test models of BH formation, growth and evolution, as well as models of GW generation and propagation. It will provide evidence for event horizons and ergoregions, test GR itself and may reveal the existence of new fundamental fields. The synthesis of these results has the potential to shed light on some of the most enigmatic issues in contemporary physics. This write-up summarized our present knowledge and key open challenges. We hope that it can serve as a guide for the exciting road ahead.

Acknowledgments

This article is based upon work from COST Action CA16104 'GWverse', supported by COST (European Cooperation in Science and Technology). We would like to thank Walter del Pozzo for useful comments. AA acknowledges partial support from the Polish National Science Center (NCN) through the grant UMO-2016/23/B/ST9/02732 and is currently supported by the Carl Tryggers Foundation through the grant CTS 17:113. LB acknowledges support from STFC through Grant No. ST/R00045X/1. VC acknowledges financial support provided under the European Union's H2020 ERC Consolidator Grant 'Matter and strong-field gravity: New frontiers in Einstein's theory' grant agreement no. MaGRaTh–646597. TPS acknowledges partial support from Science and Technology Facilities Council Consolidated Grant ST/P000703/1. KB acknowledges support from the Polish National Science Center (NCN) grant: Sonata Bis 2 (DEC-2012/07/E/ST9/01360). EB acknowledges financial support from projects 176003 and 176001 by the Ministry of Education and Science of the Republic of Serbia. TB was supported by the TEAM/2016-3/19 grant from FNP. PC acknowledges support from the Austrian Research Fund (FWF), Project P 29517-N16, and by the Polish National Center of Science (NCN) under grant 2016/21/B/ST1/00940. BK acknowledges support from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme ERC-2014-STG under grant agreement No 638435 (GalNUC) and from the Hungarian National Research, Development, and Innovation Office grant NKFIH KH-125675. GN is supported by the Swiss National Science Foundation (SNF) under grant 200020-168988. PP acknowledges financial support provided under the European Union's H2020 ERC, Starting Grant agreement no. DarkGRA–757480. US acknowledges H2020-MSCA-RISE-2015 Grant No. 690904, NSF Grant No. PHY-090003 and PRACE Tier-0 Grant No. PPFPWG. The Flatiron Institute is supported by the Simons Foundation. PAS acknowledges support from the Ramón y Cajal Programme of the Ministry of Economy, Industry and Competitiveness of Spain. EB supported by NSF Grants No. PHY-1607130 and AST-1716715. KB, AC, AG, NI, TP and GZ acknowledge financial support from the Slovenian Research Agency. SB acknowledges support by the EU H2020 under ERC Starting Grant, no. BinGraSp-714626. PC-D is supported by the Spanish MINECO (grants AYA2015-66899- C2-1-P and RYC-2015-19074) and the Generalitat Valenciana (PROMETEOII-2014-069). DG is supported by NASA through Einstein Postdoctoral Fellowship Grant No. PF6-170152 by the Chandra X-ray Center, operated by the Smithsonian Astrophysical Observatory for NASA under Contract NAS8-03060. RE acknowledges financial support provided by the Scientific and Technical Research Council of Turkey (TUBITAK) under the grant no. 117F296. JAF is supported by the Spanish MINECO (AYA2015-66899-C2-1-P), by the Generalitat Valenciana (PROMETEOII-2014-069), and by the H2020-MSCA-RISE-2017 Grant No. FunFiCO-777740. FMR is supported by the Scientific and Technological Research Council of Turkey (TÜBİTAK) project 117F295. IR was supported by the POLONEZ programme of the National Science Centre of Poland which has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No. 665778. AS thanks the Spanish Ministry of Economy, Industry and Competitiveness, the Spanish Agencia Estatal de Investigación, the Vicepresidència i Conselleria d'Innovació, Recerca i Turisme del Govern de les Illes Balears, the European Commission, the European Regional Development Funds (ERDF). NS acknowledges support from DAAD Germany-Greece Grant (ID 57340132) and GWAVES (pr002022) of ARIS-GRNET(Athens). HW acknowledges financial support by the Royal Society UK under the University Research Fellowship UF160547-BHbeyGR and the Research Grant RGF  −  R1  −  180073. KY is supported by TÜBİTAK-117F188.

Footnotes

  • 134 

    In astrophysics 'metallicity' refers to the global content of heavy elements above those produced by primordial nucleosynthesis.

  • 135 

    Strictly speaking, the term self-force refers to the case where local information about the perturbation in the vicinity of the smaller object is used; other calculations of, e.g. the flux of GW energy far from the binary also rely on a perturbative expansion [707714], but are not referred to as GSF calculations.

  • 136 

    By convention, 'nPN' refers to EOM terms that are O(1/c2n) smaller than the Newtonian acceleration, or, in the radiation field, smaller by that factor relative to the standard quadrupolar field.

  • 137 

    Though this term does not seem to be part of action (25), it actually corresponds to the choice $G_5\propto \ln|X|$ [1407].

  • 138 

    A more general parametrisation [1499] has been used in [23].

  • 139 

    It might hence be preferable for readers that are not familiar with the literature on hairy BHs to return to this section after going through sections 4.1 and 4.2.

  • 140 

    Often, the uniqueness theorems are also called no-hair theorems, under the rationale that they show that in vacuum (or electro-vacuum), BHs have no independent multipolar hair, and only two (or three, ignoring magnetic charge) degrees of freedom. Here we use no-hair theorems to refer to no-go results for non-Kerr BHs in models beyond electro-vacuum GR.

  • 141 

    It might be worth stressing that superluminal propagation does not necessarily lead to causal conundrums in Lorentz-violating theories.

  • 142 

    All the limits of this section apply to candidates that represent all the DM. The bounds are less stringent for fractional components.

Please wait… references are loading.