1 Introduction

Before turning to the main subject of this article, namely the Hamiltonian treatment of the dynamics of compact binary systems within general relativity (GR), we give some historical background. Additional history can be found, e.g., in Damour (1983a, 1987b), Futamase and Itoh (2007), Blanchet (2014), Porto (2016), Levi (2020).

1.1 Early history (1916–1960)

The problem of motion of many-body systems is an important issue in GR (see, e.g., Damour 1983a, 1987b). The earliest computations were performed by Droste, de Sitter, and Lorentz in the years 1916–1917, at the first post-Newtonian (1PN) order of approximation of the Einstein field equations, i.e., at the order \(n=1\), where \((1/c^2)^n\) corresponds to the nth post-Newtonian (PN) order, with \(n=0\) being the Newtonian level. Already in the very first paper, in which Droste calculated the 1PN gravitational field for a many-body system (Droste 1916), a flaw occurred in the definition of the rest mass m of a self-gravitating body of volume V (we follow the Dutch version; the English version contains an additional misprint). In the rest frame of the body, indicated in the following by \(\dot{=}\), this definition reads

$$\begin{aligned} m\quad {\mathop {=}\limits ^{\text {{Droste 1916}}}} \int _V \text {d}^3 x\,\varrho \,\dot{=} \int _V \text {d}^3 x\,\varrho _*\left( 1-\frac{3U}{c^2}\right) , \end{aligned}$$
(1.1)

where the “Newtonian” mass density \(\varrho _*=\sqrt{-g}\varrho u^0/c\) [\(g=\det (g_{\mu \nu })\), \(u^0\) is the time component of the four-velocity field \(u^{\mu }\), \(u^{\mu }u_{\mu }=-c^2\)] fulfills the metric-free continuity equation

$$\begin{aligned} \partial _t \varrho _* + \text{div}(\varrho _*\textbf{v}) = 0, \end{aligned}$$
(1.2)

where \(\textbf{v} = (v^i)\) is the Newtonian velocity field (with \(v^i = cu^i/u^0\)). The Newtonian potential U is defined by

$$\begin{aligned} \varDelta U = -4\pi G \varrho _*, \end{aligned}$$
(1.3)

with the usual boundary condition for U at infinity: \(\lim _{|\textbf{r}|\rightarrow \infty }U(\textbf{r},t)=0\). Let us stress again that the definition (1.1) is not correct. At the 1PN level, the correct expression for the rest mass instead reads

$$\begin{aligned} m \,\dot{=} \int _V \text {d}^3x\, \varrho _*\left( 1 + \frac{1}{c^2}\left( \varPi -\frac{U}{2}\right) \right) , \end{aligned}$$
(1.4)

with specific internal energy \(\varPi \). For pressureless (dust-like) matter one has \(\varPi =0\), but then the potential term U has to disappear too, because of the internal pressure-gravity balance: a pressureless body cannot support internal gravity. The correct 1PN expression is then given by

$$\begin{aligned} m = \int _V \text {d}^3x\, \varrho _* \,\dot{=} \int _V \text {d}^3x \sqrt{\det (g_{ij})}\,\varrho = \int _V \text {d}V \varrho , \end{aligned}$$
(1.5)

where \(\text {d}V\equiv \sqrt{\det (g_{ij})}\,\text {d}^3x\).
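To get a feeling for the size of the discrepancy between Eqs. (1.1) and (1.4), one may consider, purely as an illustration, a homogeneous ball of radius R with constant \(\varrho _*\) and \(M\equiv \int _V \text {d}^3x\,\varrho _*\). The model and the symbols M and R are assumptions introduced here and are not taken from the historical papers. Inside such a ball, the solution of Eq. (1.3) is \(U(r)=GM(3R^2-r^2)/(2R^3)\), so that

$$\begin{aligned} \int _V \text {d}^3x\,\varrho _*U = \frac{6}{5}\frac{GM^2}{R}. \end{aligned}$$

Under these assumptions, Eq. (1.1) would give \(m=M\left( 1-\frac{18}{5}\frac{GM}{Rc^2}\right) \), whereas Eq. (1.4) gives

$$\begin{aligned} m \,\dot{=}\, M + \frac{1}{c^2}\left( \int _V \text {d}^3x\,\varrho _*\varPi - \frac{3}{5}\frac{GM^2}{R}\right) , \end{aligned}$$

i.e., the internal energy plus the familiar Newtonian gravitational binding energy \(-3GM^2/(5R)\) of a homogeneous ball. The potential term of Eq. (1.1) is thus six times larger than the correct one.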

The error in question crept into the second of two sequential papers by de Sitter (1916a, b, 1917), in which the 1PN equations of motion for a many-body system were calculated. Luckily, that error had no influence on the de Sitter precession of the Moon's orbit around the Earth in the gravitational field of the Sun. The error was identified (at least for dusty matter) by Eddington and Clark (1938). On the other hand, Levi-Civita (1937b) used the correct rest mass formula for dusty bodies. Einstein criticized the calculations by Levi-Civita because they lacked the pressure needed to stabilize the bodies. In response, Levi-Civita argued with the “effacing principle”, inaugurated by Brillouin, that the internal structure should have no influence on the external motion. The 1PN gravitational field was obtained correctly by Levi-Civita, but errors occurred in the equations of motion, including a self-acceleration and a wrong periastron advance (Levi-Civita 1937a; Damour and Schäfer 1988). Full clarification was achieved by Eddington and Clark (1938), leaving aside the unstable interior of their dusty balls. Interestingly, in a 1917 paper by Lorentz and Droste (in Dutch), the correct 1PN Lagrangian of a self-gravitating many-body system of fluid balls was obtained but never properly recognized. It was translated into English only in 1937, for the edition of the collected works of Lorentz (Lorentz and Droste 1937). A full-fledged calculation by Einstein et al. (1938), posed in the spirit of Hermann Weyl by making use of surface integrals around field singularities, convincingly achieved the 1PN equations of motion, nowadays called the Einstein–Infeld–Hoffmann (EIH) equations of motion. In the publication immediately following Einstein et al. (1938), Robertson (1938) derived the 1PN periastron advance based on the EIH equations of motion. Some further refining work by Einstein and Infeld appeared in the 1940s. Fichtenholz (1950) computed the Lagrangian and Hamiltonian from the EIH equations. A consistent fluid ball derivation of the EIH equations was achieved by Fock (1939), Petrova (1949) (delayed by World War II), and Papapetrou (1951a) (see also Fock 1959).

In the 1950s, Infeld and Plebański rederived the EIH equations of motion with the aid of Dirac \(\delta \)-functions as field sources by postulating the properties of Infeld’s “good” \(\delta \)-function (Infeld 1954, 1957; Infeld and Plebański 1960; see Sect. 4.2 of our review for more details). Also in the 1950s, the Dirac \(\delta \)-function was applied to the post-Newtonian problem of motion of spinning bodies by Tulczyjew (1959), based on the seminal work by Mathisson (1937, 2010), which formulated a general relativistic gravitational skeleton structure of extended bodies. Equations of motion for spinning test particles had been obtained before by Papapetrou (1951b) and Corinaldesi and Papapetrou (1951). Further in the 1950s, another approach to the equations-of-motion problem, called the fast-motion or post-Minkowskian (PM) approximation, which is particularly useful for the treatment of high-speed scattering problems, was developed and elaborated by Bertotti (1956) and Kerr (1959a, b, c), at the 1PM level. First results at the 2PM level were obtained by Bertotti and Plebański (1960).

1.2 History on Hamiltonian results

Hamiltonian frameworks are powerful tools in theoretical physics because they allow a full-fledged exploration of the structure of a theory and an efficient application of mathematical methods (see, e.g., Holm 1985; Alexander 1987; Vinti 1998; Boccaletti and Pucacco 2004, 2002). Most importantly, Hamiltonians generate the time evolution of all quantities in a physical theory. For closed systems, the total Hamiltonian is conserved in time. Together with the other conserved quantities, namely the total linear momentum and total angular momentum, which are given by very simple universal expressions, and the boost vector, which is connected with the Hamiltonian density (which defines the “centre-of-energy vector”) and the total linear momentum, the total Hamiltonian is one of the generators of the globally operating Poincaré or inhomogeneous Lorentz group. A natural ingredient of a Hamiltonian formalism is the (3+1)-splitting of spacetime into space and time. Consequently, Hamiltonian formalisms allow transparent treatments of both initial value problems and Newtonian limits. Finally, for solving equations of motion, particularly in approximation schemes, Hamiltonian frameworks naturally fit into the powerful Lie-transform technique based on action-angle variables (Hori 1966; Kinoshita 1978; Vinti 1998; Boccaletti and Pucacco 2004, 2002; Tessmer et al. 2013). Lie series are also very useful when treating canonical transformations with usual canonical variables (see, e.g., Blümlein et al. 2020a, c, 2021b).

Additionally, we refer to an important offspring of the Hamiltonian framework, the effective-one-body (EOB) approach, which will be presented in an upcoming Living Reviews article by Thibault Damour. The references to EOB in the present article are, in particular, Buonanno and Damour (1999, 2000), Damour et al. (2000a), Damour (2001), Damour et al. (2008b), Damour et al. (2015), Damour (2016).

The focus of the present article is on the Hamiltonian formalism of GR as developed by Arnowitt, Deser, and Misner (ADM) (Arnowitt et al. 1959, 1960a, b), with its Routhian modification (Jaranowski and Schäfer 1998, 2000c), in which the matter is treated in Hamiltonian form and the field in Lagrangian form, and its classical-spin generalization (Steinhoff and Schäfer 2009a; Steinhoff 2011). The formalism is applied to the problem of motion of binary systems with compact components, including proper rotation (spin) and rotational deformation (quadratic in the spin variables); for other approaches to the problem of motion in GR, see the reviews by Futamase and Itoh (2007), Blanchet (2014), Porto (2016). The review article by Arnowitt et al. (1962) gives a thorough account of the ADM formalism (see also Regge and Teitelboim 1974 for the discussion about asymptotics). In this formalism, the final Hamiltonian, nowadays called the ADM Hamiltonian, is given in the form of a volume integral of the divergence of a vector over a three-dimensional spacelike hypersurface, which can also naturally be represented as a surface integral at flat spatial infinity \(i^0\).

It is also interesting to give some insight into other Hamiltonian formulations of GR, because they are closely related to the ADM approach but differently posed. Slightly ahead of ADM, Dirac (1958, 1959) had developed a Hamiltonian formalism for GR, and slightly afterwards, so did Schwinger (1963a, b). Schwinger’s approach starts from the tetrad representation of GR and ends up with a different set of canonical variables and, related to this, different coordinate conditions. Dirac developed his approach with some loose ends toward the final Hamiltonian (see Sect. 2.1 below and also, e.g., Deser 2004), but the coordinate conditions introduced by him, nowadays called the Dirac gauge, are often used, mainly in numerical relativity. A subtle problem in all Hamiltonian formulations of GR is the correct treatment of the surface terms at spacelike infinity which appear in asymptotically flat spacetimes. In 1967, this problem was clearly addressed by DeWitt (1967), and later, in 1974, full clarification was achieved by Regge and Teitelboim (1974). For a short comparison of the three canonical formalisms in question, the Dirac, ADM, and Schwinger ones, see Schäfer (2014).

The first authors to give the Hamiltonian as a two-dimensional surface integral at \(i^0\) on three-dimensional spacelike hypersurfaces were ADM. Of course, the representation of the total energy as a surface integral was known before, particularly through the Landau–Lifshitz gravitational stress-energy-pseudotensor approach. Schwinger followed the spirit of ADM. He was fully aware that his specific calculations were correct only modulo surface terms, which were finally fixed by asymptotic Lorentz invariance considerations. He presented the Hamiltonian (as well as the other generators of the Lorentz group) as two-dimensional surface integrals. Only one application of the Schwinger approach by someone other than Schwinger himself is known to the authors (apart from Faddeev 1982, who presented Einstein’s theory of gravitation in the Schwinger canonical variables). It is the paper by Kibble in 1963 in which the Dirac spin-1/2 field found a canonical treatment within GR (Kibble 1963). This paper played a crucial role in the implementation of classical spin into the ADM framework by Steinhoff and Schäfer (2009a) and Steinhoff (2011) (details can be found in Sect. 7 of the present article).

The ADM formalism is the most often used Hamiltonian framework in the analytical treatment of the problem of motion of gravitating compact objects. The main reason for this is surely the coordinate conditions introduced by Arnowitt et al. (1960c), which are very well adapted to explicit calculations (generalized isotropic coordinates; nowadays, for short, often called ADMTT coordinates, although the other coordinates introduced by Arnowitt et al. 1962 are ADMTT too), though similarly efficient coordinate conditions could also have been introduced in Schwinger’s approach (Schäfer 2014). Kimura (1961) already began applying the ADM formalism to gravitating point masses at the 1PN level. In 1974, that research activity culminated in a 2PN Hamiltonian for binary point masses obtained by Ohta et al. (1974a, b), based on earlier work by Hiida and Okamura (1972). However, one coefficient of their Hamiltonian was not correctly calculated and the Hamiltonian as such was not clearly identified, i.e., it was not clear to which coordinate system it referred. In 1985, full clarification was achieved in a paper by Damour and Schäfer (1985), relying on the observation by Schäfer (1984) that the perturbative use of the equations of motion on the action level implies that coordinate transformations have been applied; also see Barker and O’Connell (1984, 1986). In addition, Damour and Schäfer (1985) showed how to correctly compute the delicate integral (\(U^{\text{TT}}\)) which had been incorrectly evaluated by Hiida and Okamura (1972), Ohta et al. (1974a, b), and made contact with the first fully correct calculation of the 2PN dynamics of binary systems (in harmonic coordinates) by Damour and Deruelle (1981), Damour (1982) in 1981–1982. The 2PN periastron advance for binary systems was obtained for the first time by Damour and Schäfer (1987); it was generalized in 1988 by adding to it the effect of the leading-order spin-orbit coupling (Damour and Schäfer 1988).

In Schäfer (1983b), the leading-order 2.5PN radiation reaction force for n-body systems was derived by using the ADM formalism. The same force expression had already been obtained earlier by Schäfer (1982) within coordinate conditions closely related to the ADM ones (actually identical with the ADM conditions through 1PN and at 2.5PN order) and then again by Schäfer (1983a), as quoted in Poisson and Will (2014), based on a different approach but in coordinates identical to the ADM ones at 2.5PN order. The 2PN Hamiltonian shown by Schäfer (1982) and taken from Ohta et al. (1974b), apart from the erroneous coefficient mentioned above, is the ADM one as discussed above (the factor 7 in the static part therein has to be replaced by 5), and in the definition of the reaction force in the centre-of-mass system, a factor 2 is missing owing to a misprint, i.e. \(2\textbf{F}=\textbf{F}_1-\textbf{F}_2\). The detailed calculations were presented in Schäfer (1985), and in Schäfer (1986), a further ADM-based derivation by use of a PM approximation scheme was performed. At 2PN level, the genuine 3-body potential was derived by Schäfer (1987). However, in the reduction of a 4-body potential derived by Ohta et al. (1973, 1974a, b) to three bodies made by Schäfer (1987), some combinatorial errors crept in, which were identified and corrected by Lousto and Nakano (2008), and later by Galaviz and Brügmann (2011) in different form. The n-body 3.5PN non-autonomous radiation reaction Hamiltonian was obtained by the authors in Jaranowski and Schäfer (1997), confirming energy balance results in Blanchet and Schäfer (1989), and the equations of motion following from it were derived by Königsdörffer et al. (2003).

Additionally, within the ADM formalism, the conservative 3PN dynamics for compact binaries was fully obtained for the first time in 2001 by Damour and the authors, who also for the first time made extensive use of the dimensional regularization technique (Damour et al. 2001) (for an earlier mention of the application of dimensional regularization to classical point particles, see Damour 1980, 1983a; and for an earlier n-body static result, i.e. a result valid for vanishing particle momenta and vanishing reduced canonical variables of the gravitational field, not based on dimensional regularization, see Kimura and Toiya 1972). Only by performing all calculations in a d-dimensional space did the regularization work out fully consistently in the limit \(d\rightarrow 3\) (later on, a d-dimensional Riesz kernel calculation was performed too, Damour et al. 2008a). In purely 3-dimensional space computations, two coefficients, denoted by \(\omega _{\text{kinetic}}\) and \(\omega _{\text{static}}\), could not be determined by analytical three-dimensional regularization. The coefficient \(\omega _{\text{kinetic}}\) was shown to be fixable by insisting on global Lorentz invariance and thus became calculable with the aid of the Poincaré algebra (with value 41/24) (Damour et al. 2000c, d). The first evaluation of the value of \(\omega _{\text{static}}\) (namely \(\omega _{\text{static}}=0\)) was obtained by Jaranowski and Schäfer (1999, 2000b) by assuming a matching with the Brill–Lindquist initial-value configuration of two black holes. The correctness of this value (and thereby the usefulness of considering that the Brill–Lindquist initial-value data represent a relevant configuration of two black holes) was later confirmed by dimensional regularization (Damour et al. 2001). Explicit analytical solutions for the motion of compact binaries through 2PN order were derived by Damour and Schäfer (1988) and Schäfer and Wex (1993b, c), and through 3PN order by Memmesheimer et al. (2005), extending the seminal 1PN post-Keplerian parametrization proposed by Damour and Deruelle (1985).

Quite recently, the 4PN binary dynamics has been successfully derived, using dimensional regularization and sophisticated far-zone matching (Jaranowski and Schäfer 2012, 2013; Damour et al. 2014; Jaranowski and Schäfer 2015). Let us remark in this respect that the part linear in G (the Newtonian gravitational constant) can be deduced to all PN orders from the 1PM Hamiltonian derived by Ledvinka et al. (2008). The contributions to the 4PN Hamiltonian were obtained for the first time by the authors in Jaranowski and Schäfer (2012) through \(G^2\) order, including additionally all log-terms at 4PN going up to the order \(G^5\). The related energy along circular orbits was also obtained as a function of orbital frequency. The application of the Poincaré algebra by Jaranowski and Schäfer (2012) clearly needed the non-centre-of-mass Hamiltonian, though only the centre-of-mass one was published. In Jaranowski and Schäfer (2013), all terms were calculated with the exception of terms in the Hamiltonian linear in the symmetric mass ratio \(\nu \equiv m_1m_2/(m_1+m_2)^2\) (where \(m_1\) and \(m_2\) denote the masses of binary system components) and of the orders \(G^3\), \(G^4\), and \(G^5\). Those terms just add up to the log-terms mentioned above. However, by taking a numerical self-force solution for circular orbits in the Schwarzschild metric into account, the innermost (or last) stable circular orbit could already be determined numerically through 4PN order by Jaranowski and Schäfer (2013).

The computations by Jaranowski and Schäfer (2012, 2013, 2015) are all based on a straightforward use of the PN expansion, and are thereby a priori only valid in the near zone. The formal extension of the 4PN-level near-zone computation to the full space implies the appearance of infrared (IR) divergences (linked to the formal limit \(r\rightarrow \infty \)). The regularization of these IR divergences is unambiguous, except for a single 4PN-level ambiguity coefficient, denoted by C in Damour et al. (2014), linked to the arbitrariness in the IR regulator scale s entering within a logarithm (see Eq. (3.7) in Damour et al. 2014). The value of C (\(C=-1681/1536\)) was, however, uniquely determined in Damour et al. (2014) by combining several other previous results: (1) the understanding that the IR effect responsible for this logarithmic ambiguity was in precise agreement with a nonlocal 4PN tail effect discovered long ago by Blanchet and Damour (1988), and recovered within the ADM formalism by Damour et al. (2016); (2) the “first law of binary black-hole mechanics” by Le Tiec et al. (2012) allowing one to link the energy-angular-momentum function \(E(j,\nu )\) to the redshift along circular orbits; and, most importantly from the conceptual point of view, (3) a computation, at first order in the symmetric mass ratio \(\nu \), of the redshift by Bini and Damour (2013), obtained by using an analytical representation of the (linear in \(\nu \)) metric perturbation in terms of series of hypergeometric functions (Mano et al. 1996). The crucial point is that the latter analytical representation incorporated a precise matching between the near-zone metric and the far-zone one, thereby providing the “beyond-PN” information needed for the analytical determination of the value of C. Previous results obtained by Le Tiec et al. (2012) and Barausse et al. (2012a), based on numerical self-force computations (Blanchet et al. 2010b), had given an approximate numerical knowledge of a PN expansion coefficient equivalent to the knowledge of C. Applications of the 4PN Hamiltonian dynamics for bound and unbound orbits were performed by Damour et al. (2015), Bini and Damour (2017).

For spinning bodies, counting spin as a 0.5PN effect, the 1.5PN spin-orbit and 2PN spin-spin Hamiltonians were derived by Barker and O’Connell (1975, 1979), where the given quadrupole-moment-dependent part can be regarded as representing spin-squared terms for extended bodies (notice the presence of the tensor product of two unit vectors, each pointing in the spin direction, in the quadrupole-moment-dependent Hamiltonians). For an observationally important application of the spin-orbit dynamics, see Damour and Schäfer (1988). In 2008, the 2.5PN spin-orbit Hamiltonian was successfully calculated by Damour et al. (2008c), and the 3PN spin1-spin2 and spin1-spin1 binary black-hole Hamiltonians by Steinhoff et al. (2008a, b, c). The 3PN spin1-spin1 Hamiltonian for binary neutron stars was obtained by Hergt et al. (2010). The 3.5PN spin-orbit and 4PN spin1-spin2 Hamiltonians were obtained by Hartung and Steinhoff (2011a, b) (also see Hartung et al. 2013 and Levi and Steinhoff 2014). The 4PN spin1-spin1 Hamiltonian was presented in Levi and Steinhoff (2021). Based on the Dirac approach, the Hamiltonian of a spinning test-particle in the Kerr metric was obtained by Barausse et al. (2009, 2012b). The canonical Hamiltonian for an extended test body in curved spacetime, to quadratic order in spin, was derived by Vines et al. (2016). Finally, the radiation-reaction Hamiltonians from the leading-order spin-orbit and spin1-spin2 couplings were derived by Steinhoff and Wang (2010) and Wang et al. (2011).

1.3 More recent history on non-Hamiltonian results

At the 2PN level of the equations of motion, the Polish school founded by Infeld succeeded in obtaining many expressions, the most advanced result being that of Ryteń (1961) in her MSc thesis, which used Infeld’s “good \(\delta \)-function” as the model for the source of the gravitational field. Using the same source model as applied by Fock and Petrova, Kopeikin (1985) and Grishchuk and Kopeikin (1986) derived the 2PN and 2.5PN equations of motion for compact binaries. However, already in 1982, Damour and Deruelle had obtained the 2PN and 2.5PN equations of motion for compact binaries, using analytic regularization techniques (Damour 1982, 1983a, b) (for another such derivation see Blanchet et al. 1998, who additionally obtained the metric coefficients at 2.5PN accuracy). Ohta and Kimura (1988) should also be mentioned for a Fokker action derivation of the 2PN dynamics. Regarding the coordinate conditions used in the papers quoted in the present subsection that treat spinless particles, all are based on the harmonic gauge, with the exceptions of those with a Hamiltonian background and those by Ryteń or Ohta and Kimura.

The two-point-mass equations of motion at 3PN order in harmonic coordinates were obtained completely, except for one parameter called \(\lambda \) (equivalent to \(\omega _{\text{static}}\), see above), by Blanchet and Faye (2000a, b) (see also de Andrade et al. 2001 and Blanchet and Iyer 2003). The derivation used a modified version of the Hadamard regularization called the extended Hadamard regularization (Blanchet and Faye 2001a, b, see Sect. 4.3 of our review for more details). This regularization was not able to resolve the problem of the ambiguity parameter \(\lambda \), but gives a final result physically equivalent to that of dimensional regularization, except for the unknown value of this parameter. Using the technique of Einstein, Infeld, and Hoffmann (EIH), Itoh and Futamase (2003) and Itoh (2004) succeeded in deriving the 3PN equations of motion for compact binaries, and Blanchet et al. (2004) derived the same 3PN equations of motion based on dimensional regularization.

The 3.5PN equations of motion were derived within several independent approaches: by Pati and Will (2002) using the method of direct integration of the relaxed Einstein equations (DIRE) developed by Pati and Will (2000), by Nissanke and Blanchet (2005) applying Hadamard self-field regularization, by Itoh (2009) using the EIH technique, and by Galley and Leibovich (2012) within the effective field theory (EFT) approach. Radiation recoil effects, starting at 3.5PN order, have been discussed by Bekenstein (1973), Fitchett (1983), Junker and Schäfer (1992), Kidder (1995), Blanchet et al. (2005).

Bernard et al. (2016) calculated the 4PN Fokker action for binary point-mass systems and found a nonlocal-in-time Lagrangian inequivalent to the Hamiltonian obtained by Damour et al. (2014). On the one hand, the local part of the result of Bernard et al. (2016) differed from the local part of the Hamiltonian of Damour et al. (2014) only in a few terms. On the other hand, though the nonlocal-in-time part of the action in Bernard et al. (2016) was the same as the one in Damour et al. (2014, 2015), Bernard et al. (2016) advocated treating it (notably for deriving the conserved energy, and deriving its link with the orbital frequency) in a way which was inequivalent to the one in Damour et al. (2014, 2015). It was then shown by Damour et al. (2016) that: (i) the treatment of the nonlocal-in-time part in Bernard et al. (2016) was not correct, and that (ii) the difference in local-in-time terms was composed of a combination of gauge terms and of a new ambiguity structure which could be fixed either by matching to Damour et al. (2014, 2015) or by using the results of self-force calculations in the Schwarzschild metric. In their recent articles (Bernard et al. 2017a, b) Blanchet and collaborators have recognized that the criticisms of Damour et al. (2016) were founded, and, after correcting their previous claims and using results on periastron precession first derived by Damour et al. (2015, 2016), have obtained full equivalence with the earlier derived ADM results. Let us emphasize that Marchand et al. (2018) (also see Bernard et al. 2017a) have presented the first self-contained calculation of the full 4PN dynamics (not making any use of self-force results), which confirms again the correctness of the 4PN dynamics first obtained by Damour et al. (2014). That calculation is based on an asymptotic expansion of the radiative gravitational field in d dimensions, with matching equations to be regularized first analytically and then dimensionally. An application of the 4PN dynamics for bound orbits was performed by Bernard et al. (2017b).

The application of the EFT approach to PN calculations, devised by Goldberger and Rothstein (2006a, b), has also resulted in PN equations of motion for spinless particles up to the 3PN order (Gilmore and Ross 2008; Kol and Smolkin 2009; Foffa and Sturani 2011). At the 4PN level, Foffa and Sturani (2013a) calculated a quadratic-in-G higher-order Lagrangian, the published version of which was found to be in agreement with Jaranowski and Schäfer (2012). The quintic-in-G part of the 4PN Lagrangian was derived within the EFT approach by Foffa et al. (2017) (with its 2016 arXiv version corrected by Damour and Jaranowski 2017). Galley et al. (2016) obtained the 4PN nonlocal-in-time tail part. Then Porto and Rothstein (2017) and Porto (2017) performed a deeper analysis of IR divergences in PN expansions. Recently, Foffa and Sturani (2019) and Foffa et al. (2019b) succeeded for the first time with a purely d-dimensional derivation of the 4PN dynamics, without use of any additional regularizations. This again shows the power of dimensional regularization in PN calculations, which was established for the first time at 3PN order by Damour et al. (2001).

The 1.5PN spin-orbit dynamics was derived in Lagrangian form by Tulczyjew (1959) and Damour (1982). The 2PN spin-spin equations of motion for rotating black holes were derived by D’Eath (1975a, b) and by Thorne and Hartle (1985). The 2.5PN spin-orbit dynamics was successfully tackled by Tagoshi et al. (2001) and Faye et al. (2006), using the harmonic-coordinates approach. Within the EFT approach, Porto (2010) and Levi (2010a) succeeded in determining the same coupling (also see Perrodin 2011). The 3PN spin1-spin2 dynamics was successfully tackled by Porto and Rothstein (2008b, 2010b) (based on Porto 2006; Porto and Rothstein 2006) and by Levi (2010b), and the 3PN spin1-spin1 one, again by Porto and Rothstein (2008a), but given in fully correct form only in 2010 (Porto and Rothstein 2010a). For the 3PN spin1-spin1 dynamics, also see Bohé et al. (2015). The most advanced results for spinning binaries can be found in Levi (2012), Marsat et al. (2013), Bohé et al. (2013), Marsat (2015), Levi and Steinhoff (2016a, b, 2021), reaching 3.5PN and 4PN levels (also see Steinhoff 2017). Finally, the radiation-reaction dynamics of the leading-order spin-orbit and spin1-spin2 couplings have been obtained by Wang and Will (2007) and Zeng and Will (2007), based on the DIRE method (Will 2005) (see also Maia et al. 2017a, b, where the EFT method was applied). For a review of spin effects in the radiation field, see Blanchet (2014).

1.4 Most recent history since 2019

The year 2019 can be regarded as the beginning of the epoch of the calculation of conservative PN approximations beyond 4PN. These calculations have been dominated by the EFT approach in the treatment of the gravitational field, working with Lagrangians and action functionals based on harmonic coordinates. Only at the end of the field calculations, having at hand effective Lagrangians and actions for the matter sources, does the transition to effective Hamiltonians for the particles take place. From these, effective EOB Hamiltonians can be constructed, which are extremely useful objects for applications and for comparisons of different approaches. Bound binary systems were the first to be addressed at 5PN, with calculations of static potential contributions by Foffa et al. (2019a) and Blümlein et al. (2020a). Blümlein et al. (2020b) checked their approach by calculating the complete 4PN Hamiltonian for the binary dynamics.

For the calculation of binary dynamics at 5PN and beyond, a new strategy was devised by Bini et al. (2019), later coined the “tutti frutti” (TF) approach (Bini et al. 2021). This strategy combines various analytical approximation methods: PN (post-Newtonian), PM (post-Minkowskian), MPM (multipolar post-Minkowskian), EFT (effective field theory), SF (gravitational self-force), EOB (effective one body), and Delaunay averaging. Binary Hamiltonians at 5PN order have been derived by Bini et al. (2020a) and by Blümlein et al. (2021b, 2022b). Up to three rational numbers, the results agree. Details are given in Sect. 6.3.3. The TF approach has taken the lead through 6PN order, yielding an almost complete (with 4 coefficients still unknown) 6PN effective EOB Hamiltonian (Bini et al. 2020b, c, 2021); also see Blümlein et al. (2021a, 2020c). The PN knowledge through 6PN order is summarized in Sect. 6.3.5.

Based on the PM approach, scattering calculations became more and more important in the determination of the binary Hamiltonian. Here, a powerful new approach entered, based on advanced calculations of scattering amplitudes using generalized unitarity, double-copy construction, eikonal resummation, and advanced multiloop integration methods. It first led directly to an ordinary centre-of-mass 2PM binary Hamiltonian in isotropic gauge (isotropic coordinates for the canonical momentum) (Cheung et al. 2018), followed by the first computation of the 3PM two-body Hamiltonian in Bern et al. (2019a, b); also see Kälin et al. (2020a), using standard EFT techniques. Quite recently, the 4PM binary Hamiltonian became available, see Bern et al. (2021a, 2022); also see Dlapa et al. (2022a, b). Evidently, the nPM-order level controls all terms in the corresponding PN approximation through \((n-1)\)PN order. Binary scattering is usually treated in the action language, so Hamiltonians are close at hand. The difficulty is that, because of the different boundary conditions, it has to be made sure that the PN parts of the straightforwardly obtained PM Hamiltonians are applicable to bound binary systems, see, e.g., Kälin et al. (2020a).

Recently, the NNNLO quadratic-in-spin (Mandal et al. 2023a; Kim et al. 2023a), the NLO cubic-in-spin (Levi et al. 2021b, 2023), as well as the quartic-in-spin NLO (Levi and Teng 2021; Levi and Yin 2023) Hamiltonians were derived; the spin-orbit gravitational couplings were also obtained through the NNNLO level (Antonelli et al. 2020; Levi et al. 2021a; Mandal et al. 2023b; Kim et al. 2023b), all based on EFT methods. The complete Hamiltonian for spinning binary systems at 1PM order, exact to all orders in momentum and spin expansions, was derived in Chung et al. (2020) (also see Lee and Lee 2023 for comparison of Chung et al. 2020 with other results). At the 2PM order, binary dynamics through the fifth power of spin was considered in Bern et al. (2023).

Regarding tidal interactions, Hamiltonians through NNLO post-Newtonian (Henry et al. 2020a, b) and NLO post-Minkowskian (Cheung and Solon 2020; Kälin et al. 2020b) order corrections are available, again based on EFT (see also Bern et al. 2021b). The Wilson coefficients for rotational deformations, our \(C_{Q_a}\), are called \(C^{(0)}_{\text{E}\text{S}^2}\) by Mandal et al. (2023a), and those for tidal deformations are called \(C^{(2)}_{\text{E}^2}\) and \(C^{(0)}_{\text{E}^2\text{S}^2}\). The rotational coefficient starts at the 2PN level [i.e. at \(\mathcal {O}(c^{-2}c^{-1}c^{-1})=\mathcal {O}(c^{-4})\), where spins are counted of order \(\mathcal {O}(c^{-1})\)], whereas tidal coefficients enter from NNNLO on [i.e. at \(\mathcal {O}\big ((c^{-2})^3c^{-2}c^{-1}c^{-1}\big )=\mathcal {O}(c^{-10})\), which corresponds to the 5PN level]. The relativistic theory of tidal Love numbers was presented in Binnington and Poisson (2009); in a post-Newtonian setting, including Hamiltonian constructions, the leading-order relativistic theory of tides has been developed by Vines and Flanagan (2013). An effective one-body description of tidal effects was given in Damour and Nagar (2010); dynamical tides in general relativity were treated in Steinhoff et al. (2016). More details on tidal interactions can be found in Sect. 8.

1.5 Notation and conventions

In this article, Latin indices from the middle of the alphabet run from 1 to 3 (or d for an arbitrary number of space dimensions), Greek indices run from 0 to 3 (or d for arbitrary space dimensions), whereby \(x^0=ct\). We denote by \(\textbf{x}=(x^i)\) (\(i\in \{1,\ldots ,d\}\)) a point in the d-dimensional Euclidean space \(\mathbb {R}^d\) endowed with a standard Euclidean metric defining a scalar product (denoted by a dot). For any spatial d-dimensional vector \(\textbf{w}=(w^i)\) we define \(|\textbf{w}|\equiv \sqrt{\textbf{w}\cdot \textbf{w}}\equiv \sqrt{\delta _{ij}w^iw^j}\), so \(|\cdot |\) stands here for the Euclidean length of a vector, and \(\delta _{ij}=\delta ^i_j\) denotes the Kronecker delta. The partial differentiation with respect to \(x^\mu \) is denoted by \(\partial _\mu \) or by a comma, i.e., \(\partial _\mu \phi \equiv \phi _{,\mu }\), and the partial derivative with respect to the time coordinate t is denoted by \(\partial _t\) or by an overdot, \(\partial _t\phi \equiv \dot{\phi }\). The covariant differentiation is generally denoted by \(\nabla \), but we may also write \(\nabla _\alpha (\cdot )\equiv (\cdot )_{||\alpha }\) for spacetime or \(\nabla _i(\cdot )\equiv (\cdot )_{;i}\) for space variables, respectively. The signature of the \((d+1)\)-dimensional metric \(g_{\mu \nu }\) is \(+(d-1)\). The Einstein summation convention is adopted. The speed of light is denoted by c and G is the Newtonian gravitational constant.

We use the notion of a tensor density. The components of a tensor density of weight w, k times contravariant and l times covariant, transform, when one changes from one coordinate system to another, by the law [see, e.g., p. 501 in Misner et al. (1973) or, for a more general case, Sects. 3.7–3.9 and 4.5 in Plebański and Krasiński (2006), where, however, the definition of the density weight differs by sign from the convention used by us; note the primed notation is on the indices, not on the main symbol]

$$\begin{aligned} \mathcal {T}^{\alpha '_1\ldots \alpha '_k}_{\beta '_1\ldots \beta '_l} = \left( \frac{\partial x'}{\partial x}\right) ^{-w} {x^{\alpha '_1}}_{,\alpha _1}\ldots {x^{\alpha '_k}}_{,\alpha _k} {x^{\beta _1}}_{,\beta '_1}\ldots {x^{\beta _l}}_{,\beta '_l} \mathcal {T}^{\alpha _1\ldots \alpha _k}_{\beta _1\ldots \beta _l}, \end{aligned}$$
(1.6)

where \(({\partial x'}/{\partial x})\) is the Jacobian of the transformation \(x\rightarrow x'(x)\). For example, the determinant of the metric \(g\equiv \det (g_{\mu \nu })\) is a scalar density of weight \(+2\). The covariant derivative of a tensor density of weight w, k times contravariant and l times covariant, is computed according to the rule

$$\begin{aligned} \nabla _\gamma \mathcal {T}^{\alpha _1\ldots \alpha _k}_{\beta _1\ldots \beta _l}&= \partial _\gamma \mathcal {T}^{\alpha _1\ldots \alpha _k}_{\beta _1\ldots \beta _l} - w \varGamma ^\rho _{\rho \gamma } \mathcal {T}^{\alpha _1\ldots \alpha _k}_{\beta _1\ldots \beta _l} \\&\quad + \sum _{i=1}^k \varGamma ^{\alpha _i}_{\rho _i\gamma } \mathcal {T}^{\alpha _1\ldots \rho _i\ldots \alpha _k}_{\beta _1\ldots \beta _l} - \sum _{j=1}^l \varGamma ^{\rho _j}_{\beta _j\gamma } \mathcal {T}^{\alpha _1\ldots \alpha _k}_{\beta _1\ldots \rho _j\ldots \beta _l}. \end{aligned}$$
(1.7)

For the often used case when \(\mathcal {T}^{\alpha _1\ldots \alpha _k}_{\beta _1\ldots \beta _l}=|g|^{w/2}T^{\alpha _1\ldots \alpha _k}_{\beta _1\ldots \beta _l}\) (where \(T^{\alpha _1\ldots \alpha _k}_{\beta _1\ldots \beta _l}\) is a tensor k times contravariant and l times covariant), Eq. (1.7) implies that the covariant derivative of \(\mathcal {T}^{\alpha _1\ldots \alpha _k}_{\beta _1\ldots \beta _l}\) can be computed by means of the rule,

$$\begin{aligned} \nabla _{\gamma }\mathcal {T}^{\alpha _1\ldots \alpha _k}_{\beta _1\ldots \beta _l} = T^{\alpha _1\ldots \alpha _k}_{\beta _1\ldots \beta _l} \nabla _{\gamma }|g|^{w/2} + |g|^{w/2} \nabla _{\gamma } T^{\alpha _1\ldots \alpha _k}_{\beta _1\ldots \beta _l} = |g|^{w/2} \nabla _{\gamma } T^{\alpha _1\ldots \alpha _k}_{\beta _1\ldots \beta _l}, \end{aligned}$$
(1.8)

because

$$\begin{aligned} \nabla _{\gamma }|g|^{w/2} = \partial _{\gamma }|g|^{w/2} - w \varGamma ^{\rho }_{\rho \gamma }|g|^{w/2} = 0. \end{aligned}$$
(1.9)
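The vanishing of the right-hand side of Eq. (1.9) rests on a standard identity that is not spelled out above. As a brief sketch, assuming only the Levi-Civita connection of \(g_{\mu \nu }\) and Jacobi's formula for the derivative of a determinant, one has \(\varGamma ^{\rho }_{\rho \gamma }=\frac{1}{2}g^{\rho \sigma }\partial _{\gamma }g_{\rho \sigma }\) and therefore

$$\begin{aligned} \partial _{\gamma }|g|^{w/2} = \frac{w}{2}|g|^{w/2}\,\partial _{\gamma }\ln |g| = \frac{w}{2}|g|^{w/2}\, g^{\mu \nu }\partial _{\gamma }g_{\mu \nu } = w\,\varGamma ^{\rho }_{\rho \gamma }|g|^{w/2}, \end{aligned}$$

which cancels the second term of Eq. (1.9) exactly.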

Letters a, b (\(a,b=1,2\)) are particle labels, so \(\textbf{x}_a=(x_a^i)\in \mathbb {R}^d\) denotes the position of the ath point mass. We also define \(\textbf{r}_a\equiv \textbf{x}-\textbf{x}_a\), \(r_a\equiv |\textbf{r}_a|\), \(\textbf{n}_a\equiv \textbf{r}_a/r_a\); and for \(a\ne b\), \(\textbf{r}_{ab}\equiv \textbf{x}_a-\textbf{x}_b\), \(r_{ab}\equiv |\textbf{r}_{ab}|\), \(\textbf{n}_{ab}\equiv \textbf{r}_{ab}/r_{ab}\). The linear momentum vector of the ath particle is denoted by \(\textbf{p}_a=(p_{ai})\), and \(m_a\) denotes its mass parameter. We abbreviate the Dirac delta distribution \(\delta (\textbf{x}-\textbf{x}_a)\) by \(\delta _a\) (both in d and in 3 dimensions); it fulfills the condition \(\int \text {d}^dx\,\delta _a=1\).

Thinking in terms of dimensions of space, d has to be an integer, but whenever integrals within dimensional regularization are performed, we allow d to become an arbitrary complex number [like in the analytic continuation of factorial \(n!=\varGamma (n+1)\) to \(\varGamma (z)\)]. A thorough introduction to dimensional regularization can be found in Chapter 4 of Collins (1984).

2 Hamiltonian formalisms of GR

The presented Hamiltonian formalisms all rely on a \((3+1)\) splitting of the spacetime metric \(g_{\mu \nu }\) in the following form:

$$\begin{aligned} \text {d}s^2 = g_{\mu \nu }\text {d}x^{\mu }\text {d}x^{\nu } = -(Nc\,\text {d}t)^2 + \gamma _{ij}(\text {d}x^i + N^ic\,\text {d}t)(\text {d}x^j + N^jc\,\text {d}t), \end{aligned}$$
(2.1)

where

$$\begin{aligned} \gamma _{ij} \equiv g_{ij}, \quad N \equiv (-g^{00})^{-1/2}, \quad N^i = \gamma ^{ij}N_j \quad \text {with}\quad N_i \equiv g_{0i}, \end{aligned}$$
(2.2)

here \(\gamma ^{ij}\) is the inverse metric of \(\gamma _{ij}\) (\(\gamma _{ik}\gamma ^{kj}=\delta _i^j\)), \(\gamma \equiv \text{det}(\gamma _{ij})\); lowering and raising of spatial indices is with \(\gamma _{ij}\). The splitting (2.1), and the associated explicit 3+1 decomposition of Einstein’s equations, was first introduced by Fourès-Bruhat (1956). The notations N and \(N^i\) are due to Arnowitt et al. (1962) and their names, respectively “lapse” and “shift” functions, are due to Wheeler (1964). Let us note the useful relation between the determinants \(g\equiv \det (g_{\mu \nu })\) and \(\gamma \):

$$\begin{aligned} g = -N^2 \gamma . \end{aligned}$$
(2.3)
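The relation (2.3) can be checked numerically. The following is a minimal sketch, assuming the components \(g_{00}=-N^2+\gamma _{ij}N^iN^j\), \(g_{0i}=\gamma _{ij}N^j\), \(g_{ij}=\gamma _{ij}\) read off from Eq. (2.1) in coordinates \((x^0=ct,x^i)\). The helper names and the numerical values are illustrative choices, not part of the original text.

```python
# Sketch: assemble g_{mu nu} from lapse N, shift N^i and three-metric gamma_ij
# as in Eq. (2.1), then compare det(g) with -N^2 det(gamma), Eq. (2.3).
# All numbers are arbitrary test data (assumption, not from the article).

def det(m):
    """Determinant by Laplace expansion along the first row (fine for 3x3, 4x4)."""
    n = len(m)
    if n == 1:
        return m[0][0]
    total = 0.0
    for col in range(n):
        minor = [row[:col] + row[col + 1:] for row in m[1:]]
        total += (-1) ** col * m[0][col] * det(minor)
    return total


def spacetime_metric(lapse, shift_up, gamma):
    """Return the 4x4 matrix g_{mu nu} in coordinates (x^0 = ct, x^i)."""
    # N_i = gamma_ij N^j, cf. Eq. (2.2)
    shift_down = [sum(gamma[i][j] * shift_up[j] for j in range(3)) for i in range(3)]
    g00 = -lapse ** 2 + sum(shift_down[i] * shift_up[i] for i in range(3))
    g = [[g00] + shift_down]
    for i in range(3):
        g.append([shift_down[i]] + list(gamma[i]))
    return g


lapse = 0.9
shift_up = [0.10, -0.20, 0.05]
gamma = [[1.2, 0.1, 0.0],
         [0.1, 0.9, 0.2],
         [0.0, 0.2, 1.1]]

g = spacetime_metric(lapse, shift_up, gamma)
det_g = det(g)
det_gamma = det(gamma)

# g^{00} is the (0,0) cofactor over det(g); the cofactor is det(gamma).
g_inv_00 = det_gamma / det_g
lapse_from_metric = (-g_inv_00) ** -0.5

print(det_g, -lapse ** 2 * det_gamma, lapse_from_metric)
```

The last quantity also reproduces the definition \(N\equiv (-g^{00})^{-1/2}\) of Eq. (2.2), since the \((0,0)\) cofactor of \(g_{\mu \nu }\) is \(\gamma \).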

We restrict ourselves to consider only asymptotically flat spacetimes and we employ quasi-Cartesian coordinate systems \((t,x^i)\) which are characterized by the following asymptotic spacelike behaviour (i.e., in the limit \(r\rightarrow \infty \) with \(r\equiv \sqrt{x^ix^i}\) and t = const) of the metric coefficients:

$$\begin{aligned} N&= 1+O(1/r), \quad N^i=O(1/r), \quad \gamma _{ij}=\delta _{ij}+O(1/r),\end{aligned}$$
(2.4)
$$\begin{aligned} N_{,i}&= O(1/r^2), \quad N^i_{,j}=O(1/r^2), \quad \gamma _{ij,k}=O(1/r^2). \end{aligned}$$
(2.5)

DeWitt (1967) and later, in a more refined way, Regge and Teitelboim (1974) explicitly showed that the Hamiltonian which generates all Einsteinian field equations can be put into the form,

$$\begin{aligned} H[\gamma _{ij},\pi ^{ij},N,N^i;q^A,\pi _A]&= \int \text {d}^3x\, (N{\mathcal {H}} - c N^i {\mathcal {H}}_i) \\&\quad + \frac{c^4}{16\pi G}\oint _{i^0} \text {d}S_i\,\partial _j (\gamma _{ij} - \delta _{ij} \gamma _{kk}), \end{aligned}$$
(2.6)

wherein N and \(N^i\) operate as Lagrange multipliers and where \(\mathcal {H}\) and \({\mathcal {H}}_i\) are the Hamiltonian and momentum densities, respectively; \(i^0\) denotes spacelike flat infinity. The densities depend on the matter canonical variables \(q^A,\pi _A\) (through the matter Hamiltonian density \({\mathcal {H}}_{\text {m}}\) and the matter momentum density \({\mathcal {H}}_{\text {m}i}\)) and read

$$\begin{aligned} {\mathcal {H}}\equiv & {} \frac{c^4}{16\pi G}\left[ -\gamma ^{1/2} R + \frac{1}{\gamma ^{1/2}}\left( \gamma _{ik}\gamma _{jl}\pi ^{ij}\pi ^{kl}-\frac{1}{2}\pi ^2\right) \right] + {\mathcal {H}}_{\text {m}}, \end{aligned}$$
(2.7)
$$\begin{aligned} {\mathcal {H}}_i\equiv & {} \frac{c^3}{8\pi G}\gamma _{ij}\nabla _k\pi ^{jk} + {\mathcal {H}}_{\text {m}i}, \end{aligned}$$
(2.8)

where R is the intrinsic curvature scalar of the spacelike hypersurfaces of constant-in-time slices \(t=x^0/c\) = const; the ADM canonical field momentum is given by the density \( \frac{c^3}{16\pi G}\pi ^{ij}\), where

$$\begin{aligned} \pi _{ij} \equiv -\gamma ^{1/2} (K_{ij}-K\gamma _{ij}), \end{aligned}$$
(2.9)

with \(K\equiv \gamma ^{ij}K_{ij}\), where \(K_{ij}=-N \varGamma ^0_{ij}\) is the extrinsic curvature of t = const slices, \(\varGamma ^0_{ij}\) denote Christoffel symbols; \(\pi \equiv \gamma _{ij}\pi ^{ij}\); \(\nabla _k\) denotes the three-dimensional covariant derivative (with respect to \(\gamma _{ij}\)). The given densities are densities of weight one with respect to three-dimensional coordinate transformations. Let us note the useful formula for the density of the three-dimensional scalar curvature of the surface t = const:

$$\begin{aligned} \sqrt{\gamma } R&= \frac{1}{4} \sqrt{\gamma } \Big (\big (\gamma ^{ij}\gamma ^{lm}-\gamma ^{il}\gamma ^{jm}\big )\gamma ^{kn} + 2\big (\gamma ^{il}\gamma ^{km}-\gamma ^{ik}\gamma ^{lm}\big )\gamma ^{jn}\Big )\gamma _{ij,k}\gamma _{lm,n} \\&\quad + \partial _i\big (\gamma ^{-1/2}\partial _j (\gamma \gamma ^{ij})\big ). \end{aligned}$$
(2.10)

The matter densities \({\mathcal {H}}_{\text {m}}\) and \({\mathcal {H}}_{\text {m}i}\) are computed from components of the matter energy-momentum tensor \(T^{\mu \nu }\) by means of formulae

$$\begin{aligned} {\mathcal {H}}_{\text {m}}&= \sqrt{\gamma }\,T^{\mu \nu }n_\mu n_\nu = \sqrt{\gamma }\, N^2 T^{00}, \end{aligned}$$
(2.11)
$$\begin{aligned} {\mathcal {H}}_{\text {m}i}&= -\sqrt{\gamma }\,T^{\mu }_i n_\mu = \sqrt{\gamma }\, N T^0_i, \end{aligned}$$
(2.12)

where \(n_\mu =(-N,0,0,0)\) is the timelike unit covector orthogonal to the spacelike hypersurfaces t = const. Contrary to what the right-hand sides of Eqs. (2.11)–(2.12) seem to suggest, the matter densities must be independent of the lapse N and shift \(N^i\) and expressible in terms of the dynamical matter and field variables \(q^A\), \(\pi _A\), \(\gamma _{ij}\) only (\(\pi ^{ij}\) does not show up for matter which is minimally coupled to the gravitational field). The variation of (2.6) with respect to N and \(N^i\) yields the constraint equations

$$\begin{aligned} {\mathcal {H}} = 0 \quad \text {and} \quad {\mathcal {H}}_i = 0. \end{aligned}$$
(2.13)

The most often applied Hamiltonian formalism employs the following coordinate choice made by ADM (which we call ADMTT gauge),

$$\begin{aligned} \pi ^{ii} = 0, \qquad 3\partial _j\gamma _{ij} - \partial _i\gamma _{jj} = 0 \quad \text {or} \quad \gamma _{ij} = \psi \delta _{ij} + h_{ij}^{\text{TT}}, \end{aligned}$$
(2.14)

where the TT piece \(h_{ij}^{\text{TT}}\) is transverse and traceless, i.e., it satisfies \(\partial _jh_{ij}^{\text{TT}}=0\) and \(h_{ii}^{\text{TT}}=0\). The TT piece of any field function can be computed by means of the TT projection operator defined as follows

$$\begin{aligned} \delta ^{\text{TT}kl}_{ij} \equiv \frac{1}{2}(P_{il}P_{jk} + P_{ik}P_{jl} - P_{kl}P_{ij}), \quad P_{ij} \equiv \delta _{ij} - \partial _i\partial _j\varDelta ^{-1}, \end{aligned}$$
(2.15)

where \(\varDelta ^{-1}\) denotes the inverse of the flat space Laplacian, which is taken without homogeneous solutions for source terms decaying fast enough at infinity (in 3-dimensional or, if not, then in generalized d-dimensional space). The nonlocality of the TT-operator \(\delta ^{\text{TT}kl}_{ij}\) is just the gravitational analogue of the well-known nonlocality of the Coulomb gauge in electrodynamics.
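The action of the TT projector (2.15) is easiest to see on a single Fourier mode. The sketch below assumes a plane wave \(\exp (\text {i}\,\textbf{k}\cdot \textbf{x})\), for which \(\partial _i\rightarrow \text {i}k_i\) and \(\varDelta ^{-1}\rightarrow -1/\textbf{k}^2\), so that \(P_{ij}=\delta _{ij}-k_ik_j/\textbf{k}^2\). The function names and test values are illustrative assumptions.

```python
# Sketch: the TT projector of Eq. (2.15) acting on one Fourier mode.
# Assumption: for exp(i k.x), d_i -> i k_i and Laplacian^{-1} -> -1/k^2,
# hence P_ij = delta_ij - k_i k_j / k^2. Test data are arbitrary.

def projector(k):
    """Transverse projector P_ij for wave vector k."""
    k2 = sum(c * c for c in k)
    return [[(1.0 if i == j else 0.0) - k[i] * k[j] / k2 for j in range(3)]
            for i in range(3)]


def tt_part(a, k):
    """Apply delta^{TT kl}_{ij} to the symmetric 3x3 amplitude a_kl."""
    p = projector(k)
    out = [[0.0] * 3 for _ in range(3)]
    for i in range(3):
        for j in range(3):
            for m in range(3):      # m plays the role of index k in Eq. (2.15)
                for n in range(3):  # n plays the role of index l in Eq. (2.15)
                    out[i][j] += 0.5 * (p[i][n] * p[j][m] + p[i][m] * p[j][n]
                                        - p[m][n] * p[i][j]) * a[m][n]
    return out


k = [0.3, -1.1, 0.7]
a = [[1.0, 0.4, -0.2],
     [0.4, -0.5, 0.9],
     [-0.2, 0.9, 2.0]]

h = tt_part(a, k)
trace = sum(h[i][i] for i in range(3))
divergence = [sum(k[j] * h[i][j] for j in range(3)) for i in range(3)]
h_again = tt_part(h, k)  # projecting twice should change nothing

print(trace, divergence)
```

Up to rounding, the trace and the contraction with \(k_j\) vanish, and a second application of the projector leaves the result unchanged, as expected of a projection.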

Taking into account its gauge condition as given in Eq. (2.14), the field momentum \( \frac{c^3}{16\pi G}\pi ^{ij}\) can be split into its longitudinal and TT parts, respectively,

$$\begin{aligned} \pi ^{ij} = \tilde{\pi }^{ij} + \pi ^{ij}_{\text{TT}}, \end{aligned}$$
(2.16)

where the TT part \(\pi ^{ij}_{\text{TT}}\) fulfills the conditions \(\partial _j\pi ^{ij}_{\text{TT}}=0\) and \(\pi ^{ii}_{\text{TT}}=0\) and where the longitudinal part \(\tilde{\pi }^{ij}\) can be expressed in terms of a vectorial function \(V^i\),

$$\begin{aligned} \tilde{\pi }^{ij} = \partial _i V^j +\partial _j V^i - \frac{2}{3}\delta ^{ij}\partial _k V^k. \end{aligned}$$
(2.17)

It is also convenient to parametrize the field function \(\psi \) from Eq. (2.14) in the following way

$$\begin{aligned} \psi = \left( 1 + \frac{1}{8}\phi \right) ^4. \end{aligned}$$
(2.18)

The independent field variables are \(\pi ^{ij}_{\text{TT}}\) and \(h_{ij}^{\text{TT}}\). Already Kimura (1961) used just this presentation for applications. The Poisson bracket for the independent degrees of freedom reads

$$\begin{aligned} \{F(\textbf{x}),G(\textbf{y})\}&\equiv \frac{16\pi G}{c^3} \int \text {d}^3z\, \Bigg \{ \frac{\delta F(\textbf{x})}{\delta h^{\text{TT}}_{ij}(\textbf{z})} \bigg (\delta ^{\text{TT}kl}_{ij}(\textbf{z})\frac{\delta G(\textbf{y})}{\delta \pi ^{kl}_{\text{TT}}(\textbf{z})}\bigg ) \\&\quad - \frac{\delta G(\textbf{y})}{\delta h^{\text{TT}}_{ij}(\textbf{z})} \bigg (\delta ^{\text{TT}kl}_{ij}(\textbf{z})\frac{\delta F(\textbf{x})}{\delta \pi ^{kl}_{\text{TT}}(\textbf{z})}\bigg ) \Bigg \}, \end{aligned}$$
(2.19)

where \(\delta F(\textbf{x})/(\delta f(\textbf{z}))\) denotes the functional (or Fréchet) derivative. ADM gave the Hamiltonian in fully reduced form, i.e., after having applied (four) constraint equations (2.13) and (four) coordinate conditions (2.14). It reads

$$\begin{aligned} H_{\rm {red}}[h_{ij}^{\text{TT}},\pi ^{ij}_{\text{TT}};q^A,\pi _A]&= \frac{c^4}{16\pi G}\oint _{i^0} \text {d}S_i\, \partial _j (\gamma _{ij} - \delta _{ij} \gamma _{kk}) \\&= \frac{c^4}{16\pi G}\int \text {d}^3x\, \partial _i\partial _j (\gamma _{ij} - \delta _{ij} \gamma _{kk}). \end{aligned}$$
(2.20)

The reduced Hamiltonian generates the field equations of the two remaining metric coefficients (eight metric coefficients are determined by the four constraint equations and four coordinate conditions combined with four otherwise degenerate field equations for the lapse and shift functions). By making use of (2.18) the reduced Hamiltonian (2.20) can be written as

$$\begin{aligned} H_{\rm {red}}[h_{ij}^{\text{TT}},\pi ^{ij}_{\text{TT}};q^A,\pi _A] = -\frac{c^4}{16\pi G} \int \text {d}^3x\, \varDelta \phi [h_{ij}^{\text{TT}},\pi ^{ij}_{\text{TT}};q^A,\pi _A]. \end{aligned}$$
(2.21)
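As an illustration of Eqs. (2.20) and (2.21), the surface integral can be evaluated for a simple conformally flat three-metric. The sketch below assumes, as a test case not taken from the text above, the Schwarzschild metric in isotropic coordinates, \(\psi =(1+GM/(2c^2r))^4\), i.e., \(\phi =4GM/(c^2r)\) and \(h^{\text{TT}}_{ij}=0\). It uses units with \(G=c=1\). Function names are hypothetical.

```python
import math

# Sketch: surface integral of Eq. (2.20) for gamma_ij = psi delta_ij on a
# sphere of radius R. Assumption (not from the article's text): isotropic
# Schwarzschild, psi = (1 + M / (2 r))^4, in units G = c = 1.

def psi(r, mass):
    return (1.0 + mass / (2.0 * r)) ** 4


def surface_energy(radius, mass):
    """Return (1/16 pi) * surface integral of d_j(gamma_ij - delta_ij gamma_kk)."""
    # For gamma_ij = psi delta_ij one has d_j(gamma_ij - delta_ij gamma_kk) = -2 d_i psi,
    # and on a sphere the flux of d_i psi is 4 pi R^2 dpsi/dr.
    step = 1e-4 * radius
    dpsi_dr = (psi(radius + step, mass) - psi(radius - step, mass)) / (2.0 * step)
    return (1.0 / (16.0 * math.pi)) * (-2.0 * dpsi_dr) * 4.0 * math.pi * radius ** 2


mass = 1.0
for radius in (1e1, 1e2, 1e3, 1e4):
    print(radius, surface_energy(radius, mass))
```

Under these assumptions the integral equals \(M(1+M/(2R))^3\) at finite radius R and tends to M as \(R\rightarrow \infty \), i.e., \(H_{\rm {red}}=Mc^2\) after restoring units. The same value follows from Eq. (2.21) with \(\varDelta \phi =-16\pi GM\,\delta (\textbf{x})/c^2\).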

2.1 Hamiltonian formalisms of Dirac and Schwinger

Dirac had chosen the following coordinate system, called “maximal slicing” because of the field momentum condition,

$$\begin{aligned} \pi \equiv \gamma _{ij}\pi ^{ij} = 0, \quad \partial _j(\gamma ^{1/3}\gamma ^{ij}) = 0. \end{aligned}$$
(2.22)

The condition \(\pi =2K\gamma ^{1/2}=0\) is called “maximal slicing” because the congruence of the timelike unit vectors \(n^{\mu }\) normal to the t = const hypersurfaces (slices), which as such is irrotational, is free of expansion (notice that \(\nabla _\mu n^{\mu }=-K\)). From this it immediately follows that a finite volume in any slice remains unchanged under a small timelike deformation of the slice which vanishes on the boundary of the volume, i.e., an extremum principle holds (see, e.g., York 1979). The corresponding independent field variables are (no implementation of the three differential conditions!)

$$\begin{aligned} \tilde{\pi }^{ij} = \Big (\pi ^{ij}-\frac{1}{3}\gamma ^{ij}\pi \Big )\gamma ^{1/3}, \quad \tilde{g}_{ij} = \gamma ^{-1/3}\gamma _{ij}, \end{aligned}$$
(2.23)

with the algebraic properties \(\gamma _{ij}\tilde{\pi }^{ij}=0\) and \(\det (\tilde{g}_{ij})=1\). To leading order linear in the metric functions, the Dirac gauge coincides with the ADM gauge. The reduction of the Dirac form of dynamics to the independent tilded degrees of freedom has been performed by Regge and Teitelboim (1974), including a fully satisfactory derivation of the Hamiltonian introduced by Dirac. The Poisson bracket for the Dirac variables reads

$$\begin{aligned} \{F(\textbf{x}),G(\textbf{y})\}&\equiv \frac{16\pi G}{c^3}\int \text {d}^3z\,\Bigg \{ \tilde{\delta }^{kl}_{ij}(\textbf{z}) \left( \frac{\delta F(\textbf{x})}{\delta \tilde{g}_{ij}(\textbf{z})}\frac{\delta G(\textbf{y})}{\delta \tilde{\pi }^{kl}(\textbf{z})} - \frac{\delta G(\textbf{y})}{\delta \tilde{g}_{ij}(\textbf{z})}\frac{\delta F(\textbf{x})}{\delta \tilde{\pi }^{kl}(\textbf{z})}\right) \\&\quad + \frac{1}{3}\Big (\tilde{\pi }^{ij}(\textbf{z})\tilde{g}^{kl}(\textbf{z}) - \tilde{\pi }^{kl}(\textbf{z})\tilde{g}^{ij}(\textbf{z})\Big ) \frac{\delta F(\textbf{x})}{\delta \tilde{\pi }^{ij}(\textbf{z})}\frac{\delta G(\textbf{y})}{\delta \tilde{\pi }^{kl}(\textbf{z})} \Bigg \}, \end{aligned}$$
(2.24)

with

$$\begin{aligned} \tilde{\delta }^{kl}_{ij} \equiv \frac{1}{2}(\delta ^k_i\delta ^l_j + \delta ^l_i\delta ^k_j) - \frac{1}{3}\tilde{g}_{ij}\tilde{g}^{kl}, \quad \tilde{g}^{ij} = \gamma ^{1/3}\gamma ^{ij}, \quad \tilde{g}_{ij}\tilde{g}^{jk} = \delta ^k_i. \end{aligned}$$
(2.25)
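The algebraic properties of the Dirac variables stated with Eqs. (2.23) and (2.25) can be verified on arbitrary data. The following minimal sketch uses hypothetical helper functions and test values; it only checks \(\det (\tilde{g}_{ij})=1\), \(\gamma _{ij}\tilde{\pi }^{ij}=0\), and \(\tilde{g}_{ij}\tilde{g}^{jk}=\delta ^k_i\).

```python
# Sketch: algebraic properties of the Dirac variables, Eqs. (2.23) and (2.25).
# gamma_ij and pi^ij below are arbitrary symmetric test data (assumption).

def det3(m):
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def inverse3(m):
    """Inverse of a 3x3 matrix via cofactors."""
    d = det3(m)
    inv = [[0.0] * 3 for _ in range(3)]
    for i in range(3):
        for j in range(3):
            rows = [x for x in range(3) if x != j]
            cols = [x for x in range(3) if x != i]
            minor = (m[rows[0]][cols[0]] * m[rows[1]][cols[1]]
                     - m[rows[0]][cols[1]] * m[rows[1]][cols[0]])
            inv[i][j] = (-1) ** (i + j) * minor / d
    return inv


gamma = [[1.2, 0.1, 0.0], [0.1, 0.9, 0.2], [0.0, 0.2, 1.1]]
pi_up = [[0.3, -0.1, 0.2], [-0.1, 0.5, 0.0], [0.2, 0.0, -0.4]]

g3 = det3(gamma)
gamma_inv = inverse3(gamma)
trace_pi = sum(gamma[i][j] * pi_up[i][j] for i in range(3) for j in range(3))

# Eq. (2.23): conformally rescaled metric and trace-free momentum
g_tilde = [[g3 ** (-1.0 / 3.0) * gamma[i][j] for j in range(3)] for i in range(3)]
pi_tilde = [[g3 ** (1.0 / 3.0) * (pi_up[i][j] - gamma_inv[i][j] * trace_pi / 3.0)
             for j in range(3)] for i in range(3)]
# Eq. (2.25): inverse conformal metric
g_tilde_inv = [[g3 ** (1.0 / 3.0) * gamma_inv[i][j] for j in range(3)] for i in range(3)]

det_g_tilde = det3(g_tilde)
trace_pi_tilde = sum(gamma[i][j] * pi_tilde[i][j] for i in range(3) for j in range(3))
identity = [[sum(g_tilde[i][j] * g_tilde_inv[j][k] for j in range(3)) for k in range(3)]
            for i in range(3)]

print(det_g_tilde, trace_pi_tilde)
```

Because \(\tilde{g}_{ij}\) has unit determinant and \(\tilde{\pi }^{ij}\) is trace-free, each carries five algebraically independent components, before the three differential coordinate conditions are imposed.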

The Hamiltonian proposed by Dirac results from the expression

$$\begin{aligned} H_{\text{D}}[\tilde{g}_{ij},\tilde{\pi }^{ij},q^A,\pi _A] = -\int \text {d}^3x\,c\,N^i \mathcal {H}_i - \frac{c^4}{16\pi G} \int \text {d}^3x\,\partial _i\big (\gamma ^{-1/2}\partial _j (\gamma \gamma ^{ij})\big ), \end{aligned}$$
(2.26)

which itself results from Eq. (2.6) by imposing the Hamiltonian constraint \({\mathcal {H}}=0\) [see Eq. (2.13)] as an identity, replacing in (2.6) the surface term with another but equivalent surface term, and implementing the Dirac variables from Eq. (2.23), which are the independent variables under the maximal slicing condition. The further reduction, which implements the coordinate conditions on the hypersurfaces, goes via the Dirac brackets as follows.

The fixation of the coordinates in the hypersurface through \(\partial _j\tilde{g}^{ij}=0\) results in Dirac brackets in phase space of the form (Dirac 1959)

$$\begin{aligned}&\{F(\textbf{x}),G(\textbf{y})\}_{\text{D}} \equiv \{F(\textbf{x}),G(\textbf{y})\} + \int \text {d}^3z\int \text {d}^3z'\,C_i^j(\textbf{z},\textbf{z}') \\&\quad \times \Big (\big \{F(\textbf{x}),\partial _k\tilde{g}^{ik}(\textbf{z})\big \}\big \{{\mathcal{H}}_j(\textbf{z}'),G(\textbf{y})\big \} - \big \{F(\textbf{x}),{\mathcal{H}}_j(\textbf{z}')\big \}\big \{\partial _k\tilde{g}^{ik}(\textbf{z}),G(\textbf{y})\big \}\Big ), \end{aligned}$$
(2.27)

where the matrix \(C_m^l(\textbf{z}'',\textbf{z}')\) is defined by

$$\begin{aligned} \int \text {d}^3z'\, C_m^l(\textbf{z}'',\textbf{z}') \{{\mathcal{H}}_l(\textbf{z}'),\partial _k\tilde{g}^{nk}(\textbf{z})\} = \delta _m^n\delta (\textbf{z}-\textbf{z}''). \end{aligned}$$
(2.28)

It obeys the differential equation

$$\begin{aligned} \tilde{g}^{ij}(\textbf{x})\partial _i\partial _jC_m^n(\textbf{x}',\textbf{x}) + \frac{1}{3}\tilde{g}^{nk}(\textbf{x})\partial _k\partial _lC_m^l(\textbf{x}',\textbf{x}) = \delta _m^n\delta (\textbf{x}-\textbf{x}'). \end{aligned}$$
(2.29)

When using Dirac brackets the momentum constraint reads [see Eq. (2.13)]

$$\begin{aligned} {\mathcal{H}}_i = \frac{c^3}{8\pi G}\big (\tilde{\pi }^{jk}\partial _i\tilde{g}_{jk} - 2\partial _k(\tilde{\pi }^{jk}\tilde{g}_{ji})\big ) + {\mathcal{H}}_{\text{m} i} = 0, \end{aligned}$$
(2.30)

and the corresponding coordinate conditions \(\partial _j\tilde{g}^{ij} = 0\) can be treated as strong equations, because for an arbitrary functional F

$$\begin{aligned} \{F,{\mathcal{H}}_i\}_{\text{D}} = 0, \quad \{F,\partial _j\tilde{g}^{ij}\}_{\text{D}} = 0. \end{aligned}$$
(2.31)

Thus, applying Dirac brackets,

$$\begin{aligned} H_{\text{D}}[\tilde{g}_{ij},\tilde{\pi }^{ij},q^A,\pi _A] = -\frac{c^4}{16\pi G} \int \text {d}^3x\, \partial _i\big (\gamma ^{-1/2}\partial _j(\gamma ^{2/3} \tilde{g}^{ij})\big ) \end{aligned}$$
(2.32)

holds.

For the determination of the surface term in Eq. (2.32) only the determinant \(\gamma \) of the metric must be expressed by independent field variables (2.23). This can be done through the differential equation

$$\begin{aligned} -\frac{c^4}{4\pi G} \tilde{g}^{ij}\partial _i\partial _j\kappa = \frac{c^4}{16\pi G} \Big ( \frac{1}{\kappa ^3}\tilde{g}_{ij}\tilde{g}_{kl} \tilde{\pi }^{ik}\tilde{\pi }^{jl} + B \Big ) + {\mathcal{H}}_{\text{m}}, \qquad \kappa ^6 = \gamma , \end{aligned}$$
(2.33)

resulting from the Hamiltonian constraint, first equation in Eq. (2.13), with

$$\begin{aligned} B = \frac{1}{4}\kappa (\partial _i\tilde{g}_{jk})(\partial _l\tilde{g}_{mn}) \tilde{g}^{jm}(\tilde{g}^{kn}\tilde{g}^{il}- 2\tilde{g}^{in}\tilde{g}^{lk}) - \frac{2}{\kappa }(\partial _i\kappa )(\partial _j\kappa )\tilde{g}^{ij}. \end{aligned}$$
(2.34)

Schwinger proposed still another set of canonical field variables \((q^{ij},\varPi _{ij})\), for which the Hamiltonian and momentum densities have the form

$$\begin{aligned} \mathcal {H}&\equiv \frac{c^4}{16\pi G}\gamma ^{-1/2} \Big (-\frac{1}{4}q^{mn}\partial _m q^{kl}\partial _n q^{kl} - \frac{1}{2}q_{ln}\partial _m q^{kl}\partial _k q^{mn} \\&\quad - \frac{1}{2}q^{kl}\partial _k\,{\text{ln}}(q^{1/2})\partial _l \,{\text{ln}}(q^{1/2}) + \partial _i\partial _j q^{ij} + q^{ik}q^{jl}\varPi _{ij}\varPi _{kl} - (q^{ij}\varPi _{ij})^2 \Big ) + {\mathcal {H}}_{\text {m}}, \end{aligned}$$
(2.35)
$$\begin{aligned} {\mathcal {H}}_i&\equiv \frac{c^3}{16\pi G}\Big (-\varPi _{lm}\partial _iq^{lm} + \partial _i(2\varPi _{lm}q^{lm}) - \partial _l(2\varPi _{im}q^{lm})\Big ) + {\mathcal {H}}_{\text {m}i}, \end{aligned}$$
(2.36)

where \(\varPi _{ij}\equiv -\gamma ^{-1}(\pi _{ij} - \frac{1}{2}\pi \gamma _{ij})\), \(q^{ij}\equiv \gamma \gamma ^{ij}\), \(q\equiv \gamma ^2\); Schwinger’s canonical field momentum \( \frac{c^3}{16\pi G}\varPi _{ij}\) is just \( \frac{c^3}{16\pi G}\gamma ^{-1/2} K_{ij}\). The Poisson bracket for the Schwinger variables has the same structure as the one for the ADM variables. Schwinger’s reduced Hamiltonian has the form

$$\begin{aligned} H_{\text{S}} = -\frac{c^4}{16\pi G}\oint _{i^0} \text {d}S_i\, \partial _j q^{ij} = -\frac{c^4}{16\pi G}\int \text {d}^3x\, \partial _i\partial _j q^{ij}. \end{aligned}$$
(2.37)
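The statement that Schwinger's momentum variable \(\varPi _{ij}\) equals \(\gamma ^{-1/2}K_{ij}\) follows from Eq. (2.9) and can be checked numerically. The sketch below uses hypothetical helper names and arbitrary test data; it also confirms \(\pi =2K\gamma ^{1/2}\) and that \(\det (q^{ij})=\gamma ^2\), which matches \(q\equiv \gamma ^2\) if q is read as the determinant of \(q^{ij}\) (our reading, stated here as an assumption).

```python
import math

# Sketch: Schwinger variables built from gamma_ij and K_ij via Eq. (2.9).
# gamma_ij and K_ij are arbitrary symmetric test data (assumption).

def det3(m):
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def inverse3(m):
    """Inverse of a 3x3 matrix via cofactors."""
    d = det3(m)
    inv = [[0.0] * 3 for _ in range(3)]
    for i in range(3):
        for j in range(3):
            rows = [x for x in range(3) if x != j]
            cols = [x for x in range(3) if x != i]
            minor = (m[rows[0]][cols[0]] * m[rows[1]][cols[1]]
                     - m[rows[0]][cols[1]] * m[rows[1]][cols[0]])
            inv[i][j] = (-1) ** (i + j) * minor / d
    return inv


gamma = [[1.2, 0.1, 0.0], [0.1, 0.9, 0.2], [0.0, 0.2, 1.1]]
k_ext = [[0.3, -0.1, 0.2], [-0.1, 0.5, 0.0], [0.2, 0.0, -0.4]]

g3 = det3(gamma)
root = math.sqrt(g3)
gamma_inv = inverse3(gamma)
idx = [(i, j) for i in range(3) for j in range(3)]

trace_k = sum(gamma_inv[i][j] * k_ext[i][j] for i, j in idx)
# Eq. (2.9): pi_ij = -gamma^{1/2} (K_ij - K gamma_ij)
pi_low = [[-root * (k_ext[i][j] - trace_k * gamma[i][j]) for j in range(3)]
          for i in range(3)]
trace_pi = sum(gamma_inv[i][j] * pi_low[i][j] for i, j in idx)
# Schwinger: Pi_ij = -gamma^{-1} (pi_ij - pi gamma_ij / 2)
big_pi = [[-(pi_low[i][j] - 0.5 * trace_pi * gamma[i][j]) / g3 for j in range(3)]
          for i in range(3)]
q_up = [[g3 * gamma_inv[i][j] for j in range(3)] for i in range(3)]

max_diff = max(abs(big_pi[i][j] - k_ext[i][j] / root) for i, j in idx)
det_q = det3(q_up)

print(max_diff, trace_pi - 2.0 * trace_k * root, det_q - g3 ** 2)
```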

If Schwinger had chosen coordinate conditions corresponding to those introduced above in Eq. (2.14) (ADM also introduced another set of coordinate conditions to which Schwinger adjusted), namely

$$\begin{aligned} \varPi _{ii} = 0, \quad q^{ij} = \varphi \delta _{ij} + f^{ij}_{\text{TT}}, \end{aligned}$$
(2.38)

a similarly simple technical formalism convenient for practical calculations would have resulted with the independent field variables \(\varPi _{ij}^{\text{TT}}\) and \(f^{ij}_{\text{TT}}\). To the best of our knowledge, apart from Schwinger himself, only the paper by Kibble (1963) delivers an application of Schwinger’s formalism, namely a Hamiltonian formulation of the Dirac spinor field in gravity. Much later, Nelson and Teitelboim (1978) completed the same task within the tetrad-generalized Dirac formalism (Dirac 1962).

Notice that, once the Hamiltonian constraint is plugged in, the Dirac Hamiltonian (2.32) contains first derivatives of the metric coefficients only. The same holds for the Hamiltonian proposed by Schwinger, see Eq. (2.37) and Eq. (2.35) on-shell, i.e., after application of the Hamiltonian constraint. The Hamiltonians (2.20), (2.32), and (2.37) are identical as global objects because their integrands differ by total divergences which vanish after integration.

2.2 Derivation of the ADM Hamiltonian

The ADM Hamiltonian was derived via the generator of field and spacetime-coordinates variations. Let the generator of general field variations be defined as (it corresponds to the generator \(G\equiv p_i\,\delta x^i\) of the point-particle dynamics in classical mechanics with the particle’s canonical momentum \(p_i\) and position \(x^i\))

$$\begin{aligned} G_{\text{field}} \equiv \frac{c^3}{16\pi G} \int \text {d}^3x\, \pi ^{ij} \delta \gamma _{ij}. \end{aligned}$$
(2.39)

Let the coefficients of the three-space metric \(\gamma _{ij}\) be fixed by the relations (2.14), then the only free variations left are

$$\begin{aligned} G_{\text{field}} = \frac{c^3}{16\pi G} \int \text {d}^3x\, \pi ^{ij}_{\text{TT}} \delta h_{ij}^{\text{TT}} + \frac{c^3}{16\pi G} \int \text {d}^3x\, \pi ^{jj} \delta \psi \end{aligned}$$
(2.40)

or, modulo a total variation,

$$\begin{aligned} G_{\text{field}} = \frac{c^3}{16\pi G} \int \text {d}^3x\, \pi ^{ij}_{\text{TT}} \delta h_{ij}^{\text{TT}} - \frac{c^3}{16\pi G} \int \text {d}^3x\, \psi \delta \pi ^{jj}. \end{aligned}$$
(2.41)

It is consistent with the Einstein field equations in space-asymptotically flat space-time with quasi-Cartesian coordinates to put [the mathematically precise meaning of this equation is detailed in the Appendix B of Arnowitt et al. (1960a)]

$$\begin{aligned} ct = - \frac{1}{2}\varDelta ^{-1} \pi ^{jj}, \end{aligned}$$
(2.42)

which results in, dropping total space derivatives,

$$\begin{aligned} G_{\text{field}} = \frac{c^3}{16\pi G} \int \text {d}^3x\, \pi ^{ij}_{\text{TT}} \delta h_{ij}^{\text{TT}} + \frac{c^4}{8\pi G} \int \text {d}^3x\, \varDelta \psi \, \delta t. \end{aligned}$$
(2.43)

From this the Hamiltonian easily follows in the form

$$\begin{aligned} H = -\frac{c^4}{8\pi G} \int \text {d}^3x\,\varDelta \psi , \end{aligned}$$
(2.44)

which can also be written, using the form of the three-metric from Eq. (2.14),

$$\begin{aligned} H = \frac{c^4}{16\pi G} \int \text {d}^3x\, \partial _i\partial _j (\gamma _{ij} - \delta _{ij}\gamma _{kk}). \end{aligned}$$
(2.45)

This expression is valid also in case of other coordinate conditions (Arnowitt et al. 1962). For the derivation of the generator of space translations, the reader is referred to Arnowitt et al. (1962) or, equivalently, to Schwinger (1963a).
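The intermediate steps between Eqs. (2.41) and (2.45) can be sketched as follows. The reading of the generator as \(p_i\,\delta x^i-H\,\delta t\), by analogy with the point-particle generator quoted before Eq. (2.39), is our assumption about the intended comparison. From Eq. (2.42) one has \(\pi ^{jj}=-2c\,\varDelta t\), so that two integrations by parts give

$$\begin{aligned} -\frac{c^3}{16\pi G} \int \text {d}^3x\, \psi \,\delta \pi ^{jj} = \frac{c^4}{8\pi G} \int \text {d}^3x\, \psi \,\varDelta (\delta t) = \frac{c^4}{8\pi G} \int \text {d}^3x\, \varDelta \psi \,\delta t + (\text {total space derivatives}), \end{aligned}$$

which is the second term of Eq. (2.43). Identifying the coefficient of \(\delta t\) with \(-H\) yields Eq. (2.44). Finally, inserting \(\gamma _{ij}=\psi \delta _{ij}+h_{ij}^{\text{TT}}\) from Eq. (2.14) and using \(\partial _jh_{ij}^{\text{TT}}=0\), \(h_{ii}^{\text{TT}}=0\),

$$\begin{aligned} \partial _i\partial _j (\gamma _{ij} - \delta _{ij}\gamma _{kk}) = \varDelta \psi - 3\varDelta \psi = -2\varDelta \psi , \end{aligned}$$

so that Eq. (2.45) reproduces Eq. (2.44).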

3 The ADM formalism for point-mass systems

3.1 Reduced Hamiltonian for point-mass systems

In this section we consider the ADM canonical formalism applied to a system of self-gravitating nonrotating point masses (particles). The energy-momentum tensor of such a system reads

$$\begin{aligned} T^{\alpha \beta }(x^\gamma ) = \sum _a m_a c \int _{-\infty }^\infty \frac{u_a^\alpha u_a^\beta }{\sqrt{-g}} \delta ^{(4)}\big (x^\mu -x_a^\mu (\tau _a)\big )\text {d}\tau _a, \end{aligned}$$
(3.1)

where \(m_a\) is the mass parameter of the ath point mass (\(a=1,2,\ldots \) labels the point masses), and \(u_a^\alpha \equiv \text {d}x_a^\alpha /\text {d}\tau _a\) (with \(c\,\text {d}\tau _a=\sqrt{-g_{\mu \nu }\text {d}x_a^\mu \text {d}x_a^\nu }\)) is the four-velocity along the worldline \(x^\mu =x_a^\mu (\tau _a)\) of the ath particle. After performing the integration in (3.1) one gets

$$\begin{aligned} T^{\alpha \beta }(\textbf{x},t) = \sum _a m_a c \frac{u_a^\alpha u_a^\beta }{u_a^0 \sqrt{-g}} \delta ^{(3)}\big (\textbf{x}-\textbf{x}_a(t)\big ), \end{aligned}$$
(3.2)

where \(\textbf{x}_a=(x_a^i)\) is the position three-vector of the ath particle. The linear four-momentum of the ath particle equals \(p_a^\alpha \equiv m_a u_a^\alpha \), and the three-momentum canonically conjugate to the position \(\textbf{x}_a\) comes out to be \(\textbf{p}_a=(p_{ai})\), where \(p_{ai}=m_a u_{ai}\).

The action functional describing the particles-plus-field system reads

$$\begin{aligned} S = \int \text {d}t \left( \frac{c^3}{16\pi G} \int \text {d}^3x\, \pi ^{ij} \partial _t \gamma _{ij} + \sum _a p_{ai} \dot{x}_a^i - H_0\right) , \end{aligned}$$
(3.3)

where \(\dot{x}_a^i\equiv \text {d}x_a^i/\text {d}t\). The asymptotic value 1 of the lapse function enters as prefactor of the surface integral in the Hamiltonian \(H_0\), which takes the form

$$\begin{aligned} H_0 = \int \text {d}^3x\, (N{\mathcal {H}} - cN^i {\mathcal {H}}_i) + \frac{c^4}{16\pi G} \oint _{i^0}\text {d}S_i\,\partial _j (\gamma _{ij} - \delta _{ij} \gamma _{kk}), \end{aligned}$$
(3.4)

where the so-called super-Hamiltonian density \(\mathcal {H}\) and super-momentum density \( {\mathcal {H}}_i\) can be computed by means of Eqs. (2.7)–(2.8), (2.11)–(2.12), and (3.2). They read [here we use the abbreviation \(\delta _a\) for \(\delta ^{(3)}(\textbf{x}-\textbf{x}_a)\)]

$$\begin{aligned} \mathcal {H}&= \frac{c^4}{16\pi G} \left[ \frac{1}{\gamma ^{1/2}}\left( \pi ^i_j\pi ^j_i-\frac{1}{2}\pi ^2\right) - \gamma ^{1/2} R\right] + \sum _a c \left( m_a^2c^2 + \gamma _a^{ij}p_{ai}p_{aj}\right) ^{1/2} \delta _a, \end{aligned}$$
(3.5)
$$\begin{aligned} \mathcal {H}_i&= \frac{c^3}{8\pi G} \nabla _j \pi ^j_i + \sum _a p_{ai}\delta _a, \end{aligned}$$
(3.6)

where \(\gamma _a^{ij}\equiv \gamma _{\text {reg}}^{ij}(\textbf{x}_a)\) is the finite part of the inverse metric evaluated at the particle position, which can be perturbatively and, using dimensional regularization, unambiguously defined (see Sects. 4.2, 4.4 below and Appendix A 4 of Jaranowski and Schäfer 2015).

The evolutionary part of the field equations is obtained by varying the action functional (3.3) with respect to the field variables \(\gamma _{ij}\) and \(\pi ^{ij}\). The resulting equations read

$$\begin{aligned} \gamma _{ij,0}&= 2N \gamma ^{-1/2} \left( \pi _{ij} - \frac{1}{2} \pi \gamma _{ij} \right) + \nabla _iN_j + \nabla _jN_i, \end{aligned}$$
(3.7)
$$\begin{aligned} \pi ^{ij}_{\,,0}&= -N \gamma ^{1/2} \left( R^{ij} - \frac{1}{2}\gamma ^{ij}R\right) + \frac{1}{2}N\gamma ^{-1/2}\gamma ^{ij}\left( \pi ^{mn}\pi _{mn} - \frac{1}{2}\pi ^2\right) \\&\quad - 2 N \gamma ^{-1/2}\left( \pi ^{im}\pi ^j_m - \frac{1}{2}\pi \pi ^{ij} \right) + \nabla _m(\pi ^{ij}N^m) - (\nabla _m N^i)\pi ^{mj} \\&\quad - (\nabla _mN^j)\pi ^{mi} + \frac{1}{2} \sum _a N_a \gamma ^{ik}_a p_{ak} \gamma ^{jl}_a p_{al} \left( \gamma ^{mn}_ap_{am}p_{an} + m^2_ac^2\right) ^{-1/2}\delta _a. \end{aligned}$$
(3.8)

The constraint part of the field equations results from varying the action (3.3) with respect to N and \(N^i\). It has the form

$$\begin{aligned} {\mathcal {H}} = 0, \qquad {\mathcal {H}}_i = 0. \end{aligned}$$
(3.9)

The variation of the action (3.3) with respect to \(\textbf{x}_a\) and \(\textbf{p}_a\) leads to equations of motion for the particles,

$$\begin{aligned} \dot{p}_{ai}&= -\frac{\partial }{\partial x_a^i} \int \text {d}^3x\, (N{\mathcal {H}} - cN^k {\mathcal {H}}_k) \\&= c p_{aj} \frac{\partial N^j_a}{\partial x_a^i} - c \left( m_a^2c^2 + \gamma ^{kl}_a p_{ak}p_{al} \right) ^{1/2}\, \frac{\partial N_a}{\partial x_a^i} \\ {}&\quad - \frac{c N_a}{2\left( m_a^2c^2 + \gamma ^{mn}_a p_{am}p_{an} \right) ^{1/2}}\frac{\partial \gamma ^{kl}_a}{\partial x_a^i}p_{ak}p_{al}, \end{aligned}$$
(3.10)
$$\begin{aligned} \dot{x}_a^i&= \frac{\partial }{\partial p_{ai}} \int \text {d}^3x\, \left( N{\mathcal {H}} - cN^k {\mathcal {H}}_k\right) \\&= \frac{c N_a \gamma _a^{ij}p_{aj}}{\left( m_a^2c^2 + \gamma ^{kl}_a p_{ak}p_{al}\right) ^{1/2}} - cN^i_a. \end{aligned}$$
(3.11)

Notice the involvement of lapse and shift functions in the equations of motion. Both the lapse and shift functions, four functions in total, get determined by the application of the four coordinate conditions (2.14) to the field equations (3.7) and (3.8).
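
As a quick plausibility check of Eq. (3.11), one can differentiate the particle part of \(\int \text {d}^3x\,(N{\mathcal {H}} - cN^k{\mathcal {H}}_k)\) numerically with respect to \(p_{ai}\) and compare with the closed-form right-hand side. The following is a minimal sketch only: the lapse, shift, and inverse metric at the particle position are treated as given constants, and all numerical values as well as the function names are hypothetical choices made for illustration.

```python
import math

def particle_energy_term(p, m, c, lapse, shift, gamma_inv):
    """Particle part of int d^3x (N*H - c*N^k*H_k) after integrating out delta_a."""
    p_sq = sum(gamma_inv[i][j] * p[i] * p[j] for i in range(3) for j in range(3))
    kinetic = c * lapse * math.sqrt(m * m * c * c + p_sq)
    return kinetic - c * sum(shift[k] * p[k] for k in range(3))

def velocity_closed_form(p, m, c, lapse, shift, gamma_inv):
    """Right-hand side of Eq. (3.11); gamma_inv is assumed to be symmetric."""
    p_sq = sum(gamma_inv[i][j] * p[i] * p[j] for i in range(3) for j in range(3))
    root = math.sqrt(m * m * c * c + p_sq)
    return [c * lapse * sum(gamma_inv[i][j] * p[j] for j in range(3)) / root
            - c * shift[i] for i in range(3)]

def velocity_numeric(p, m, c, lapse, shift, gamma_inv, step=1e-6):
    """Central-difference derivative of the particle term with respect to p_i."""
    result = []
    for i in range(3):
        up = list(p)
        up[i] += step
        dn = list(p)
        dn[i] -= step
        diff = (particle_energy_term(up, m, c, lapse, shift, gamma_inv)
                - particle_energy_term(dn, m, c, lapse, shift, gamma_inv))
        result.append(diff / (2 * step))
    return result

# Hypothetical sample values in arbitrary units, chosen only for illustration.
SAMPLE = dict(
    p=[0.3, -0.5, 0.7], m=2.0, c=3.0, lapse=0.9, shift=[0.01, -0.02, 0.03],
    gamma_inv=[[0.8, 0.05, 0.0], [0.05, 0.9, 0.02], [0.0, 0.02, 0.85]],
)
```

In the flat limit (lapse equal to 1, vanishing shift, Euclidean inverse metric) the closed form reduces to the special-relativistic velocity \(c\,p_{ai}/(m_a^2c^2+\textbf{p}_a^2)^{1/2}\).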

The reduced action, which is fully sufficient for the derivation of the dynamics of the particles and the gravitational field, reads (only the asymptotic value 1 of the lapse function survives)

$$\begin{aligned} S = \int \text {d}t \left[ \frac{c^3}{16\pi G} \int \text {d}^3x\, \pi _{\text{TT}}^{ij} \partial _t h^{\text{TT}}_{ij} + \sum _a p_{ai} \dot{x}_a^i - H_{\text {red}} \right] , \end{aligned}$$
(3.12)

where both the constraint equations (3.9) and the coordinate conditions (2.14) are taken to hold. The reduced Hamilton functional \(H_{\text {red}}\) is given by

$$\begin{aligned} H_{\text {red}}[\textbf{x}_a,\textbf{p}_a,h^{\text{TT}}_{ij},\pi _{\text{TT}}^{ij}] = -\frac{c^4}{16\pi G} \int \text {d}^3x\,\varDelta \phi [\textbf{x}_a,\textbf{p}_a,h^{\text{TT}}_{ij},\pi _{\text{TT}}^{ij}]. \end{aligned}$$
(3.13)

The remaining field equations read

$$\begin{aligned} \frac{c^3}{16\pi G}\partial _t\pi _{\text {TT}}^{ij} = -\delta ^{{\text{TT}} {ij}}_{kl} \frac{\delta H_{\text {red}}}{\delta h_{kl}^{\text {TT}}}, \quad \frac{c^3}{16\pi G}\partial _t h_{ij}^{\text {TT}} = \delta ^{{\text {TT}} {kl}}_{ij} \frac{\delta H_{\text {red}}}{\delta \pi _{\text {TT}}^{kl}}, \end{aligned}$$
(3.14)

and the equations of motion for the point masses take the form

$$\begin{aligned} \dot{p}_{ai} = -\frac{\partial H_{\text {red}}}{\partial x_a^i}, \quad \quad \dot{x}_a^i = \frac{\partial H_{\text {red}}}{\partial p_{ai}}. \end{aligned}$$
(3.15)

Evidently, there is no involvement of lapse and shift functions in the equations of motion and in the field equations for the independent degrees of freedom (Arnowitt et al. 1960b; Kimura 1961).

3.2 Routh functional

The Routh functional (or Routhian) of the system is defined by

$$\begin{aligned} R\left[ \textbf{x}_a,\textbf{p}_a,h^{\text{TT}}_{ij},\partial _t h^{\text{TT}}_{ij}\right] \equiv H_{\text {red}} - \frac{c^3}{16\pi G} \int \text {d}^3x\,\pi ^{ij}_{\text{TT}}\,\partial _t h^{\text{TT}}_{ij}. \end{aligned}$$
(3.16)

This functional is a Hamiltonian for the point-mass degrees of freedom, and a Lagrangian for the independent gravitational field degrees of freedom. Within the post-Newtonian framework it was first introduced by Jaranowski and Schäfer (1998, 2000c). The evolution equation for the gravitational field degrees of freedom reads

$$\begin{aligned} \frac{ \delta }{\delta h^{\text{TT}}_{ij}(\textbf{x},t)}\int R(t')\,\text {d}t' = 0. \end{aligned}$$
(3.17)

The Hamilton equations of motion for the two point masses take the form

$$\begin{aligned} \dot{p}_{ai} = - \frac{\partial R}{\partial x^i_a}, \quad \dot{x}^i_a = \frac{\partial R}{\partial p_{ai}}. \end{aligned}$$
(3.18)

To prepare the treatment of the conservative part of the dynamics, we now make a short model calculation that reveals the structure and logic behind it. Let us take a Routhian of the form \(R(q,p;\xi ,{\dot{\xi }})\). Then the action reads

$$\begin{aligned} S[q,p;\xi ] = \int \big (p\dot{q} - R(q,p;\xi ,{\dot{\xi }})\big )\text {d}t. \end{aligned}$$
(3.19)

Its variation with respect to the independent variables gives

$$\begin{aligned} \delta S&= \int \bigg [\frac{\text {d}}{\text {d}t}(p\delta q) + \left( \dot{q}-\frac{\partial R}{\partial p}\right) \delta p + \left( -\dot{p}-\frac{\partial R}{\partial q}\right) \delta q \\&\quad - \left( \frac{\partial R}{\partial \xi }-\frac{\text {d}}{\text {d}t}\frac{\partial R}{\partial {\dot{\xi }}}\right) \delta \xi - \frac{\text {d}}{\text {d}t}\left( \frac{\partial R}{\partial {\dot{\xi }}}\delta \xi \right) \bigg ]\text {d}t. \end{aligned}$$
(3.20)

Going on-shell with the \(\xi \)-dynamics yields

$$\begin{aligned} \delta S = \int \left[ \frac{\text {d}}{\text {d}t}(p\delta q) + \left( \dot{q}-\frac{\partial R}{\partial p}\right) \delta p + \left( -\dot{p} -\frac{\partial R}{\partial q}\right) \delta q\right] \text {d}t - \left( \frac{\partial R}{\partial {\dot{\xi }}}\delta \xi \right) ^{+\infty }_{-\infty }. \end{aligned}$$
(3.21)

The vanishing of the last term means that as much incoming as outgoing radiation has to be present, or that time-symmetric boundary conditions have to be applied. Here one thinks in terms of \(h^{\text{TT}}_{ij}\) and \(\dot{h}^{\text{TT}}_{ij}\), i.e., one considers the term \((\int \text {d}^3x\,\pi ^{ij}_{\text{TT}}\,\delta h^{\text{TT}}_{ij})^{+\infty }_{-\infty }\) on the solution space of the field equations (“on-field-shell”). Thus in the Fokker-type procedure no dissipation shows up. Assuming a leading-order-type prolongation (allowing additions of only first time derivatives of q and p) of the form \(R=R(q,p,\dot{q},\dot{p})\), the autonomous dynamics can be deduced from the variation

$$\begin{aligned} \delta S = \int \left[ \frac{\text {d}}{\text {d}t}(p\delta q) + \left( \dot{q}-\frac{\delta R}{\delta p}\right) \delta p + \left( -\dot{p}-\frac{\delta R}{\delta q}\right) \delta q \right] \text {d}t, \end{aligned}$$
(3.22)

where the Euler–Lagrange derivative \(\delta A/\delta z\equiv \partial A /\partial z-\text {d}(\partial A/\partial \dot{z})/\text {d}t\) has been introduced.
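
The Euler–Lagrange derivative just introduced can be illustrated on a discretized action. The sketch below assumes a toy function \(A(z,\dot{z})=\dot{z}^2/2-z^2/2\) and the trajectory \(z(t)=\sin 2t\) (both hypothetical choices, unrelated to the actual Routhian) and approximates \(\delta A/\delta z\) at a grid point by differentiating the discrete action with respect to the value of z there, divided by the time step.

```python
import math

def model_function(z, zdot):
    """Toy A(z, zdot) = zdot^2/2 - z^2/2 (a hypothetical choice for illustration)."""
    return 0.5 * zdot ** 2 - 0.5 * z ** 2

def discrete_action(zs, dt):
    """Sum of A(z_k, (z_{k+1} - z_k)/dt) * dt over the time grid."""
    return sum(model_function(zs[k], (zs[k + 1] - zs[k]) / dt) * dt
               for k in range(len(zs) - 1))

def functional_derivative(zs, dt, k, eps=1e-3):
    """(1/dt) * dS/dz_k by central differences; approximates delta A / delta z at t_k."""
    up = list(zs)
    up[k] += eps
    dn = list(zs)
    dn[k] -= eps
    return (discrete_action(up, dt) - discrete_action(dn, dt)) / (2 * eps * dt)

def euler_lagrange_exact(t):
    """For z(t) = sin(2t): dA/dz - d/dt(dA/dzdot) = -z - zddot = 3*sin(2t)."""
    return 3.0 * math.sin(2.0 * t)
```

With a grid spacing of 0.005 the discrete and exact values agree up to the expected discretization error, which is quadratic in the grid spacing.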

With this understood, the conservative part of the binary dynamics is given by the higher-order Hamiltonian equal to the on-field-shell Routhian,

$$\begin{aligned}&H_{\text {con}}[\textbf{x}_a,\textbf{p}_a,\dot{\textbf{x}}_a,\dot{\textbf{p}}_a,\ldots ] \\&\quad \equiv R\big [\textbf{x}_a,\textbf{p}_a,h^{\text{TT}}_{ij}(\textbf{x}_a,\textbf{p}_a,\dot{\textbf{x}}_a,\dot{\textbf{p}}_a,\ldots ),\dot{h}^{\text{TT}}_{ij}(\textbf{x}_a,\textbf{p}_a,\dot{\textbf{x}}_a,\dot{\textbf{p}}_a,\ldots )\big ], \end{aligned}$$
(3.23)

where the field variables \(h^{\text{TT}}_{ij}\), \(\dot{h}^{\text{TT}}_{ij}\) were “integrated out”, i.e., replaced by their solutions as functionals of particle variables. The conservative equations of motion defined by the higher-order Hamiltonian (3.23) read

$$\begin{aligned} \dot{p}_{ai}(t) = -\frac{\delta }{\delta x^i_a(t)} \int H_{\text {con}}(t')\,\text {d}t', \quad \dot{x}^i_a(t) = \frac{\delta }{\delta p_{ai}(t)} \int H_{\text {con}}(t')\,\text {d}t', \end{aligned}$$
(3.24)

where the functional derivative is given by

$$\begin{aligned} \frac{\delta }{\delta z(t)} \int H_{\text {con}}(t')\,\text {d}t' = \frac{\partial H_{\text {con}}}{\partial z(t)} - \frac{\text {d}}{\text {d}t}\frac{\partial H_{\text {con}}}{\partial \dot{z}(t)} + \cdots , \end{aligned}$$
(3.25)

with \(z=x^i_a\) or \(z=p_{ai}\). Schäfer (1984) and Damour and Schäfer (1991) show that time derivatives of \(\textbf{x}_a\) and \(\textbf{p}_a\) in the higher-order Hamiltonian (3.23) can be eliminated by the use of lower-order equations of motion, leading to an ordinary Hamiltonian,

$$\begin{aligned} H_{\text {con}}^{\text {ord}}[\textbf{x}_a,\textbf{p}_a] = H_{\text {con}}[\textbf{x}_a,\textbf{p}_a,\dot{\textbf{x}}_a(\textbf{x}_a,\textbf{p}_a),\dot{\textbf{p}}_a(\textbf{x}_a,\textbf{p}_a),\ldots ]. \end{aligned}$$
(3.26)

Notice the important point that the two Hamiltonians \(H_{\text {con}}\) and \(H_{\text {con}}^{\text {ord}}\) do not belong to the same coordinate system. Strictly speaking, the two Hamiltonians and their variables should therefore carry distinct (say, primed and unprimed) notations; by a slight abuse of notation, this is usually not done in the literature.

A formal PN expansion of the Routh functional in powers of \(1/c^2\) is feasible to all PN orders. With the aid of the definition \( h^{\text{TT}}_{ij}\equiv \frac{16\pi G}{c^4}\hat{h}^{\text{TT}}_{ij}\), we may write

$$\begin{aligned} R\left[ \textbf{x}_a,\textbf{p}_a, h^{\text{TT}}_{ij}, \partial _th^{\text{TT}}_{ij}\right] - \sum _a m_a c^2 = \sum _{n=0}^{\infty } \frac{1}{c^{2n}} R_n\big [\textbf{x}_a,\textbf{p}_a, \hat{h}^{\text{TT}}_{ij}, \partial _t{\hat{h}}^{\text{TT}}_{ij}\big ]. \end{aligned}$$
(3.27)

From this, the field equation for \(h^{\text{TT}}_{ij}\) follows in PN-series form,

$$\begin{aligned} \left( \varDelta -\frac{1}{c^2}\partial _t^2\right) \hat{h}^{\text{TT}}_{ij} = \sum _{n=0}^{\infty } \frac{1}{c^{2n}} D^{\text{TT}}_{(n)ij}\big [\textbf{x},\textbf{x}_a,\textbf{p}_a, \hat{h}^{\text{TT}}_{kl}, \partial _t{\hat{h}}^{\text{TT}}_{kl}\big ]. \end{aligned}$$
(3.28)

This equation must now be solved step by step, using either retarded integrals to obtain the whole dynamics or time-symmetric ones to obtain only the conservative dynamics defined by \(H_{\text {con}}\); the integrals themselves have to be expanded in powers of 1/c. At higher orders, however, log-terms that are non-analytic in 1/c do show up (see, e.g., Damour et al. 2014, 2016).

To calculate the reduced Hamiltonian of Eq. (2.21) for a many-particle system one has to solve the constraint equations \(\mathcal {H}=0\) and \(\mathcal {H}_i=0\), with the densities \(\mathcal {H}\), \(\mathcal {H}_i\) defined in Eqs. (3.5)–(3.6), perturbatively for \(\phi \) and \(\tilde{\pi }^ {ij}\). Then the transition to the Routhian of Eq. (3.16) is straightforward using the second equation in (3.14). The expansion of the Hamiltonian constraint equation up to \(c^{-10}\) leads to the following equation [in this equation and in the next one we use units \(c=1\), \(G=1/(16\pi )\); see Footnote 4]:

$$\begin{aligned} -\varDelta \phi&= \sum _a \bigg [ 1-\frac{1}{8}\phi +\frac{1}{64}\phi ^2-\frac{1}{512}\phi ^3+\frac{1}{4096}\phi ^4 \\&\quad + \left( \frac{1}{2}-\frac{5}{16}\phi +\frac{15}{128}\phi ^2-\frac{35}{1024}\phi ^3\right) \frac{\textbf{p}_a^2}{m_a^2} \\&\quad + \left( -\frac{1}{8}+\frac{9}{64}\phi -\frac{45}{512}\phi ^2\right) \frac{(\textbf{p}_a^2)^2}{m_a^4} +\left( \frac{1}{16}-\frac{13}{128}\phi \right) \frac{(\textbf{p}_a^2)^3}{m_a^6} -\frac{5}{128}\frac{(\textbf{p}_a^2)^4}{m_a^8} \\&\quad +\left( -\frac{1}{2}+\frac{9}{16}\phi +\frac{1}{4}\frac{\textbf{p}_a^2}{m_a^2}\right) \frac{p_{ai}p_{aj}}{m_a^2}{h^{\text{TT}}_{ij}} -\frac{1}{16}\left( {h^{\text{TT}}_{ij}}\right) ^2 \bigg ] m_a\delta _a \\&\quad +\left( 1+\frac{1}{8}\phi \right) \left( {\widetilde{\pi }^{ij}}\right) ^2 +\left( 2+\frac{1}{4}\phi \right) {\widetilde{\pi }^{ij}}{\pi ^{ij}_{\text{TT}}} +\left( {\pi ^{ij}_{\text{TT}}}\right) ^2 \\&\quad +\left[ \left( -\frac{1}{2}+\frac{1}{4}\phi -\frac{5}{64}\phi ^2\right) \phi _{,{ij}} +\left( \frac{3}{16}-\frac{15}{128}\phi \right) \phi _{,i}\phi _{,j} +2{\widetilde{\pi }^{ik}}{\widetilde{\pi }^{jk}}\right] {h^{\text{TT}}_{ij}} \\&\quad +\left( \frac{1}{4}-\frac{7}{32}\phi \right) \left( {h^{\text{TT}}_{ij,k}}\right) ^2 +\left( \frac{1}{2}+\frac{1}{16}\phi \right) {h^{\text{TT}}_{ij,k}}{h^{\text{TT}}_{ik,j}} \\&\quad + \varDelta \left[ \left( -\frac{1}{2}+\frac{7}{16}\phi \right) \left( {h^{\text{TT}}_{ij}}\right) ^2\right] -\left[ \frac{1}{2}\phi {h^{\text{TT}}_{ij}}{h^{\text{TT}}_{ik,j}} +\frac{1}{4}\phi _{,k}\left( {h^{\text{TT}}_{ij}}\right) ^2\right] _{,k} \\&\quad + \mathcal {O}(c^{-12}). \end{aligned}$$
(3.29)

The expansion of the momentum constraint equation up to \(c^{-7}\) reads

$$\begin{aligned} {\widetilde{\pi }^{ij}}_{,j}&= \left( -\frac{1}{2}+\frac{1}{4}\phi -\frac{5}{64}\phi ^2\right) \sum _a p_{ai}\delta _a + \left( -\frac{1}{2}+\frac{1}{16}\phi \right) \phi _{,j}{\widetilde{\pi }^{ij}} \\&\quad - \frac{1}{2}\phi _{,j}{\pi ^{ij}_{\text{TT}}} - {\widetilde{\pi }^{jk}}_{,k}{h^{\text{TT}}_{ij}} + {\widetilde{\pi }^{jk}}\left( \frac{1}{2}{h^{\text{TT}}_{jk,i}}-{h^{\text{TT}}_{ij,k}}\right) + \mathcal {O}(c^{-8}). \end{aligned}$$
(3.30)

In Eqs. (3.29) and (3.30) the dynamical field variables \(h^{\text{TT}}_{ij}\) and \(\pi ^{ij}_{\text{TT}}\) are counted as being of the orders \(1/c^4\) and \(1/c^5\), respectively [cf. Eq. (3.28)].
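
The rational coefficients in the first three lines of Eq. (3.29) can be cross-checked with exact arithmetic. The sketch below rests on an assumption that is our reading of those coefficients rather than a statement made above: in the units \(c=1\), \(G=1/(16\pi )\), and with the field variables \(h^{\text{TT}}_{ij}\) switched off, the point-mass source is taken to be \(m_a\psi ^{-1}(1+\psi ^{-4}\textbf{p}_a^2/m_a^2)^{1/2}\delta _a\) with \(\psi =1+\phi /8\). The function names are hypothetical.

```python
from fractions import Fraction

def binomial_series(exponent, order):
    """Exact Taylor coefficients of (1 + x)**exponent up to x**order."""
    coeffs = [Fraction(1)]
    for k in range(1, order + 1):
        coeffs.append(coeffs[-1] * (Fraction(exponent) - (k - 1)) / k)
    return coeffs

def source_coefficients(momentum_power, phi_order):
    """Coefficients of phi^j multiplying (p^2/m^2)^n in psi^(-1)*sqrt(1 + psi^(-4) p^2/m^2).

    Assumes psi = 1 + phi/8; n = momentum_power, j = 0..phi_order.
    """
    n = momentum_power
    # Coefficient of y^n in sqrt(1 + y), with y = psi^(-4) p^2/m^2.
    sqrt_coeff = binomial_series(Fraction(1, 2), n)[n]
    # The n-th term carries the overall conformal factor psi^(-(1 + 4n)).
    psi_series = binomial_series(-(1 + 4 * n), phi_order)
    return [sqrt_coeff * psi_series[j] * Fraction(1, 8) ** j
            for j in range(phi_order + 1)]
```

For example, `source_coefficients(1, 3)` reproduces the bracket multiplying \(\textbf{p}_a^2/m_a^2\) in Eq. (3.29).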

3.3 Poincaré invariance

In asymptotically flat spacetimes the Poincaré group is a global symmetry group. Its generators \(P^{\mu }\) and \(J^{\mu \nu }\) are realized as functions \(P^\mu (\textbf{x}_a,\textbf{p}_a)\) and \(J^{\mu \nu }(\textbf{x}_a,\textbf{p}_a)\) on the many-body phase-space. They are conserved on shell and fulfill the Poincaré algebra relations for the Poisson bracket product (see, e.g., Regge and Teitelboim 1974),

$$\begin{aligned} \{ P^{\mu }, P^{\nu } \}&= 0, \end{aligned}$$
(3.31)
$$\begin{aligned} \{ P^{\mu }, J^{\rho \sigma } \}&= -\eta ^{\mu \rho } P^{\sigma } + \eta ^{\mu \sigma } P^{\rho },\end{aligned}$$
(3.32)
$$\begin{aligned} \{ J^{\mu \nu }, J^{\rho \sigma } \}&= -\eta ^{\nu \rho } J^{\mu \sigma } + \eta ^{\mu \rho } J^{\nu \sigma } + \eta ^{\sigma \mu } J^{\rho \nu } - \eta ^{\sigma \nu } J^{\rho \mu }, \end{aligned}$$
(3.33)

where the Poisson brackets are defined in the usual way,

$$\begin{aligned} \{A,B\} \equiv \sum _a \left( \frac{\partial A}{\partial x_a^i}\frac{\partial B}{\partial p_{ai}} - \frac{\partial A}{\partial p_{ai}}\frac{\partial B}{\partial x_a^i} \right) . \end{aligned}$$
(3.34)

The meaning of the components of \(P^{\mu }\) and \(J^{\mu \nu }\) is as follows: the time component \(P^0\) (i.e., the total energy) is realized as the Hamiltonian \(H\equiv c P^0\), \(P^i=P_i\) is the linear momentum, \(J^i\equiv \frac{1}{2}\varepsilon ^{ikl}J_{kl}\) [with \(\varepsilon ^{ijk}\equiv \varepsilon _{ijk}\equiv \frac{1}{2}(i-j)(j-k)(k-i)\), \(J_{kl}=J^{kl}\), and \(J_{ij}=\varepsilon _{ijk}J^k\)] is the angular momentum, and the Lorentz boost vector is \(K^i\equiv J^{i0}/c\). The boost vector represents the constant of motion associated with the centre-of-mass theorem and can further be decomposed as \(K^i=G^i-t\,P^i\) (with \(G_i=G^i\)). In terms of three-dimensional quantities the Poincaré algebra relations read (see, e.g., Damour et al. 2000c, d)

$$\begin{aligned} \{ P_i, H \}&= 0, \quad \{ J_i, H \} = 0,\end{aligned}$$
(3.35)
$$\begin{aligned} \{ J_i, P_j \}&= \varepsilon _{ijk} \, P_k, \quad \{ J_i, J_j \} = \varepsilon _{ijk} \, J_k,\end{aligned}$$
(3.36)
$$\begin{aligned} \{ J_i, G_j \}&= \varepsilon _{ijk} \, G_k, \end{aligned}$$
(3.37)
$$\begin{aligned} \{ G_i, H \}&= P_i,\end{aligned}$$
(3.38)
$$\begin{aligned} \{ G_i, P_j \}&= \frac{1}{c^2}\,H\,\delta _{ij}, \end{aligned}$$
(3.39)
$$\begin{aligned} \{ G_i, G_j \}&= -\frac{1}{c^2}\,\varepsilon _{ijk}\,J_k. \end{aligned}$$
(3.40)
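
The three-dimensional relations (3.35)–(3.40) can be checked numerically in the simplest setting. The sketch below assumes two non-interacting point masses, for which \(H=\sum _a c(m_a^2c^2+\textbf{p}_a^2)^{1/2}\), \(\textbf{P}=\sum _a\textbf{p}_a\), \(\textbf{J}=\sum _a\textbf{x}_a\times \textbf{p}_a\), and \(\textbf{G}=\sum _a (H_a/c^2)\,\textbf{x}_a\); this free-particle realization, the sample phase-space point, and all names are assumptions made for illustration, and the Poisson bracket (3.34) is evaluated by central differences.

```python
import math

C = 2.0               # hypothetical value of c in arbitrary units
MASSES = (1.0, 1.5)   # hypothetical rest masses of two free particles
# Flat phase-space point [x_1, p_1, x_2, p_2], chosen arbitrarily.
STATE = [0.3, -0.2, 0.5, 0.4, 0.1, -0.3, -0.6, 0.7, 0.1, -0.2, 0.5, 0.25]

def particles(state):
    """Return (m, x, p) for each particle from the flat list [x_1, p_1, x_2, p_2]."""
    return [(m, state[6 * a:6 * a + 3], state[6 * a + 3:6 * a + 6])
            for a, m in enumerate(MASSES)]

def energy(m, p):
    return C * math.sqrt(m * m * C * C + sum(q * q for q in p))

def hamiltonian(state):
    return sum(energy(m, p) for m, x, p in particles(state))

def momentum(i):
    return lambda s: sum(p[i] for m, x, p in particles(s))

def angular_momentum(i):
    j, k = (i + 1) % 3, (i + 2) % 3
    return lambda s: sum(x[j] * p[k] - x[k] * p[j] for m, x, p in particles(s))

def centre_of_mass(i):
    return lambda s: sum(energy(m, p) * x[i] for m, x, p in particles(s)) / C ** 2

def bracket(A, B, state, h=1e-5):
    """Poisson bracket of Eq. (3.34), with partial derivatives by central differences."""
    def d(f, idx):
        up = list(state)
        up[idx] += h
        dn = list(state)
        dn[idx] -= h
        return (f(up) - f(dn)) / (2 * h)
    total = 0.0
    for a in range(len(MASSES)):
        for i in range(3):
            xi, pi = 6 * a + i, 6 * a + 3 + i
            total += d(A, xi) * d(B, pi) - d(A, pi) * d(B, xi)
    return total
```

For instance, `bracket(centre_of_mass(0), centre_of_mass(1), STATE)` should agree with `-angular_momentum(2)(STATE) / C**2` up to finite-difference error, in line with Eq. (3.40).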

The Hamiltonian H and the centre-of-mass vector \(G^i\) have the integral representations

$$\begin{aligned} H&= -\frac{c^4}{16 \pi G} \int \text {d}^3x\,\varDelta \phi = -\frac{c^4}{16 \pi G} \oint _{i^0} r^2 \text {d}\varOmega \,\textbf{n}\cdot \nabla \phi , \end{aligned}$$
(3.41)
$$\begin{aligned} G^i&= -\frac{c^2}{16 \pi G} \int \text {d}^3x\, x^i \varDelta \phi = -\frac{c^2}{16 \pi G} \oint _{i^0} r^2\text {d}\varOmega \, n^j (x^i \partial _j - \delta _{ij})\phi , \end{aligned}$$
(3.42)

where \(\textbf{n}\,r^2\text {d}\varOmega \) (\(\textbf{n}\) is the outward radial unit vector) is the two-dimensional surface-area element at \(i^0\). The two quantities H and \(G^i\) are the most involved ones of those entering the Poincaré algebra.

The Poincaré algebra has been extensively used in the calculations of PN Hamiltonians for spinning binaries (Hergt and Schäfer 2008a, b). Here the most useful equation was (3.38), which states that the total linear momentum has to be a total time derivative. This equation was also used by Damour et al. (2000c, d) to fix the so-called “kinetic ambiguity” in the 3PN ADM two-point-mass Hamiltonian without using dimensional regularization. In harmonic coordinates, the kinetic ambiguity was fixed by a Lorentzian version of the Hadamard regularization based on the Fock–de Donder approach (Blanchet and Faye 2001b).

The explicit form of the generators \(P^\mu (\textbf{x}_a,\textbf{p}_a)\) and \(J^{\mu \nu }(\textbf{x}_a,\textbf{p}_a)\) (i.e., \(\textbf{P}\), \(\textbf{J}\), \(\textbf{G}\), and H) for two-point-mass systems is given in Appendix C with 4PN accuracy.

The global Lorentz invariance results in the following useful expressions (see, e.g., Rothe and Schäfer 2010; Georg and Schäfer 2015). Let us define the quantity \({\mathcal {M}}\) through the relation

$$\begin{aligned} \mathcal {M}c^2 \equiv \sqrt{H^{2}-\mathbf{{P}}^{2}c^{2}} \quad \text{ or } \quad H = \sqrt{\mathcal {M}^{2}c^{4}+\mathbf{{P}}^{2}c^{2}}, \end{aligned}$$
(3.43)

and let us introduce the canonical centre vector \(\textbf{X}\) of the system (with components \(X^i = X_i\)),

$$\begin{aligned} \textbf{X} \equiv \frac{\textbf{G}c^2}{H} + \frac{1}{{\mathcal {M}}\left( H+{\mathcal {M}}c^{2}\right) }\left( \textbf{J} - \left( \frac{\textbf{G}c^2}{H}\times \textbf{P}\right) \right) \times \textbf{P}. \end{aligned}$$
(3.44)

Then the following commutation relations are fulfilled:

$$\begin{aligned} \left\{ X_i,\, P_j \right\}&= \delta _{ij}, \quad \left\{ X_i,\, X_j \right\} = 0, \quad \left\{ P_i,\, P_j \right\} = 0, \end{aligned}$$
(3.45)
$$\begin{aligned} \left\{ {\mathcal {M}},\, P_i \right\}&= 0, \quad \left\{ {\mathcal {M}},\, X_i \right\} = 0,\end{aligned}$$
(3.46)
$$\begin{aligned} \left\{ {\mathcal {M}},\, H \right\}&= 0, \quad \left\{ P_i,\, H \right\} = 0, \quad \frac{H}{c^2}\left\{ X_i,\, H \right\} = P_i. \end{aligned}$$
(3.47)

The commutation relations clearly show that, in terms of the canonical variables, the internal dynamics decouples completely from the external one. Equations (3.43) additionally indicate that \({\mathcal {M}}^2\) is simpler (or, more primitive) than \({\mathcal {M}}\), cf. Georg and Schäfer (2015). A centre-of-energy vector can be defined by \(X_E^i = X_{Ei} = c^2 G^i / H = c^2 G_i / H\). This vector, however, is not a canonical position vector, see, e.g., Hanson and Regge (1974).
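
The canonical character of \(\textbf{X}\), and the non-canonical character of the centre-of-energy vector, can be illustrated numerically. The sketch below again assumes two non-interacting point masses with the free-particle generators \(H=\sum _a c(m_a^2c^2+\textbf{p}_a^2)^{1/2}\), \(\textbf{P}=\sum _a\textbf{p}_a\), \(\textbf{J}=\sum _a\textbf{x}_a\times \textbf{p}_a\), \(\textbf{G}=\sum _a(H_a/c^2)\,\textbf{x}_a\), builds \(\textbf{X}\) from Eq. (3.44), and evaluates Poisson brackets by central differences. The sample phase-space point and all names are hypothetical.

```python
import math

C = 2.0               # hypothetical value of c in arbitrary units
MASSES = (1.0, 1.5)   # hypothetical rest masses of two free particles
# Flat phase-space point [x_1, p_1, x_2, p_2], chosen arbitrarily.
STATE = [0.3, -0.2, 0.5, 0.4, 0.1, -0.3, -0.6, 0.7, 0.1, -0.2, 0.5, 0.25]

def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1], a[2] * b[0] - a[0] * b[2], a[0] * b[1] - a[1] * b[0]]

def generators(state):
    """Return (H, P, J, G) for free particles (assumed realization of the generators)."""
    h_tot, p_tot, j_tot, g_tot = 0.0, [0.0] * 3, [0.0] * 3, [0.0] * 3
    for a, m in enumerate(MASSES):
        x, p = state[6 * a:6 * a + 3], state[6 * a + 3:6 * a + 6]
        e = C * math.sqrt(m * m * C * C + sum(q * q for q in p))
        orbital = cross(x, p)
        h_tot += e
        for i in range(3):
            p_tot[i] += p[i]
            j_tot[i] += orbital[i]
            g_tot[i] += e * x[i] / C ** 2
    return h_tot, p_tot, j_tot, g_tot

def centre_of_energy(state):
    h_tot, p_tot, j_tot, g_tot = generators(state)
    return [g * C * C / h_tot for g in g_tot]

def canonical_centre(state):
    """Canonical centre X of Eq. (3.44), with M defined through Eq. (3.43)."""
    h_tot, p_tot, j_tot, g_tot = generators(state)
    mass = math.sqrt(h_tot ** 2 - sum(q * q for q in p_tot) * C * C) / C ** 2
    x_e = [g * C * C / h_tot for g in g_tot]
    orbital = cross(x_e, p_tot)
    internal = [j_tot[i] - orbital[i] for i in range(3)]
    correction = cross(internal, p_tot)
    return [x_e[i] + correction[i] / (mass * (h_tot + mass * C * C)) for i in range(3)]

def hamiltonian(state):
    return generators(state)[0]

def total_momentum(i):
    return lambda s: generators(s)[1][i]

def X(i):
    return lambda s: canonical_centre(s)[i]

def XE(i):
    return lambda s: centre_of_energy(s)[i]

def bracket(A, B, state, h=1e-5):
    """Poisson bracket of Eq. (3.34), with partial derivatives by central differences."""
    def d(f, idx):
        up = list(state)
        up[idx] += h
        dn = list(state)
        dn[idx] -= h
        return (f(up) - f(dn)) / (2 * h)
    total = 0.0
    for a in range(len(MASSES)):
        for i in range(3):
            xi, pi = 6 * a + i, 6 * a + 3 + i
            total += d(A, xi) * d(B, pi) - d(A, pi) * d(B, xi)
    return total
```

At the sample point the brackets of the components of \(\textbf{X}\) with each other vanish within finite-difference error, whereas the corresponding bracket of two centre-of-energy components does not, because the two particles carry a non-zero internal angular momentum.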

In view of our later treatment of particles with spin, let us decompose the total angular momentum \(J^{\mu \nu }\) of a single object into orbital angular momentum \(L^{\mu \nu }\) and spin \(S^{\mu \nu }\), both of them being anti-symmetric tensors,

$$\begin{aligned} J^{\mu \nu } = L^{\mu \nu } + S^{\mu \nu }. \end{aligned}$$
(3.48)

The orbital angular momentum tensor is given by

$$\begin{aligned} L^{\mu \nu } = Z^{\mu }P^{\nu } - Z^{\nu }P^{\mu }, \end{aligned}$$
(3.49)

where \(Z^{\mu }\) denotes the four-dimensional position vector (with \(Z^0 = ct\)). The splitting in space and time results in

$$\begin{aligned} J^{ij} = Z^iP^j - Z^jP^i + S^{ij}, \quad J^{i0} = Z^iH/c - P^ict + S^{i0}. \end{aligned}$$
(3.50)

Remarkably, relativity tells us that any object with mass \({\mathcal {M}}\), spin length S, and positive energy density must have an extension orthogonal to its spin vector with a radius of at least \(S/({\mathcal {M}}c)\) (see, e.g., Misner et al. 1973). Clearly then, the position vector of such an object is not given a priori but must be defined. As the total angular momentum should not depend on the choice of the position vector, the notion of spin must depend on that choice, and vice versa. Thus, imposing a spin supplementary condition (SSC) fixes the position vector. We enumerate here the most often used SSCs (see, e.g., Fleming 1965; Hanson and Regge 1974; Barker and O’Connell 1979).

  1. (i) Covariant SSC (also called Tulczyjew-Dixon SSC):

    $$\begin{aligned} P_{\nu }S^{\mu \nu } = 0. \end{aligned}$$
    (3.51)

    The variables corresponding to this SSC are denoted in Sect. 7 by \(Z^i = z^i\), \(S^{ij}\), and \(P^i = p^i\).

  2. (ii) Canonical SSC (also called Newton-Wigner SSC):

    $$\begin{aligned} (P_{\nu } + {\mathcal {M}}c\,n_{\nu })S^{\mu \nu } = 0, \quad {\mathcal {M}}c = \sqrt{-P_{\mu }P^{\mu }}, \end{aligned}$$
    (3.52)

    where \(n_{\mu } = (-1,0,0,0)\), \(n_{\mu }n^{\mu } = -1\). The variables corresponding to this SSC are denoted in Sect. 7 by \(\hat{z}^i\), \(\hat{S}^{ij}\), and \(P^i\).

  3. (iii) Centre-of-energy SSC (also called Corinaldesi-Papapetrou SSC):

    $$\begin{aligned} n_{\nu }S^{\mu \nu } = 0. \end{aligned}$$
    (3.53)

    Here the boost vector takes the form of a spinless object, \(K^i = Z^iH/c^2 - P^it = G^i - P^it\).
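
The interplay between the choice of position and the notion of spin in Eq. (3.50) can be made concrete with a few lines of arithmetic. The sketch below is purely illustrative: it solves the first relation in (3.50) for \(S^{ij}\) at a given position, and it solves the second relation with \(S^{i0}=0\) (the centre-of-energy SSC) for the position \(Z^i\). The function names and the value of c are hypothetical.

```python
C = 2.0  # hypothetical value of c in arbitrary units

def orbital_part(z, p):
    """Spatial components Z^i P^j - Z^j P^i of the orbital tensor in Eq. (3.49)."""
    return [[z[i] * p[j] - z[j] * p[i] for j in range(3)] for i in range(3)]

def spin_for_position(j_spatial, z, p):
    """S^{ij} = J^{ij} - (Z^i P^j - Z^j P^i), i.e. Eq. (3.50) solved for the spin."""
    orbital = orbital_part(z, p)
    return [[j_spatial[i][j] - orbital[i][j] for j in range(3)] for i in range(3)]

def centre_of_energy_position(j_i0, p, energy, t):
    """Position Z^i that makes S^{i0} vanish in Eq. (3.50) (centre-of-energy SSC)."""
    return [C * (j_i0[i] + p[i] * C * t) / energy for i in range(3)]

def mixed_spin_components(j_i0, z, p, energy, t):
    """S^{i0} = J^{i0} - (Z^i H/c - P^i c t), i.e. Eq. (3.50) solved for S^{i0}."""
    return [j_i0[i] - (z[i] * energy / C - p[i] * C * t) for i in range(3)]
```

Two different positions give two different spin tensors for one and the same \(J^{ij}\), and the position returned by `centre_of_energy_position` makes all \(S^{i0}\) vanish, so that \(K^i = J^{i0}/c\) takes the spinless form quoted above.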

3.4 Poynting theorem of GR

Let us start with the following local identity, which has the structure of a Poynting theorem for GR in local form,

$$\begin{aligned} -\dot{h}^{\text {TT}}_{ij} \Box h^{\text {TT}}_{ij} = -\partial _k\left( \dot{h}^{\text {TT}}_{ij}h^{\text {TT}}_{ij,k}\right) + \frac{1}{2}\partial _t \left[ (\dot{h}^{\text {TT}}_{ij}/c)^2 + (h^{\text {TT}}_{ij,k})^2\right] , \end{aligned}$$
(3.54)

where \(\Box \equiv -\partial _t^2/c^2+\varDelta \) denotes the d’Alembertian. Integrating this equation over the whole space gives, assuming past stationarity,

$$\begin{aligned} -\int _{V_\infty }\text {d}^3x\, \dot{h}^{\text {TT}}_{ij} \Box h^{\text {TT}}_{ij} = \frac{1}{2} \int _{V_\infty }\text {d}^3x\,\partial _t \left[ (\dot{h}^{\text {TT}}_{ij}/c)^2 + (h^{\text {TT}}_{ij,k})^2\right] , \end{aligned}$$
(3.55)

where \(V_\infty \) is just another expression for \({{\mathbb {R}}}^3\). Notice that the far or wave zone (see Footnote 5) is understood as the region of the t = const slice where gravitational waves are decoupled from their source and propagate freely outwards, which means that the relation \(h^{\text {TT}}_{ij,k}=-(n^k/c)\dot{h}^{\text {TT}}_{ij}+\mathcal {O}(r^{-2})\) (r being the radial coordinate) is fulfilled in the far zone at distances \(r\gg \lambda /(2\pi )\) from the source, where \(\lambda \) is the characteristic wavelength of the gravitational waves emitted by the source. We always use the t = const slices on which our Hamiltonians are defined, and explore physical processes by going from one slice to another one located in the close-by future. This suffices to discriminate radiation from non-radiation for any given approximation. Spacelike infinity (\(i^0\)) is enough for posing reliable boundary conditions; timelike infinity is needed neither for the future nor for the past (past stationarity simply replaces past infinity). Integration of Eq. (3.54) over the volume \(V_{\text{fz}}\), whose outer boundary is located in the far zone (fz), with \(\text {d}s_k = n^k r^2 \text {d}\varOmega \) the surface-area element of the two-surface of integration and \(\text {d}\varOmega \) the solid-angle element, yields

$$\begin{aligned} -\int _{V_{\text{fz}}}\text {d}^3x\, \dot{h}^{\text {TT}}_{ij} \Box h^{\text {TT}}_{ij} = -\oint _{\text{fz}} \text {d}s_k\,\dot{h}^{\text {TT}}_{ij}h^{\text {TT}}_{ij,k} + \frac{1}{2} \int _{V_{\text{fz}}}\text {d}^3x\, \partial _t \left[ (\dot{h}^{\text {TT}}_{ij}/c)^2 + (h^{\text {TT}}_{ij,k})^2\right] . \end{aligned}$$
(3.56)

To make sure that the surface integral (say over a sphere of radius \(r_{\text {fz}}\)) in the above equation is not zero, we have to assume that \(r_{\text {fz}}\) is located in the far zone, where real wave propagation happens, i.e. behind the wave front of the out-propagating wave. Of course, as the system is stationary in the remote past, the wave front is still infinitely far from \(i^0\).

Combining Eqs. (3.55) and (3.56) together, one gets

$$\begin{aligned}&-\int _{(V_\infty - V_{\text{fz}})}\text {d}^3x\, \dot{h}^{\text {TT}}_{ij} \Box h^{\text {TT}}_{ij} = \oint _{\text{fz}}\text {d}s_k\,\dot{h}^{\text {TT}}_{ij}h^{\text {TT}}_{ij,k} \\&\quad + \frac{1}{2} \int _{(V_\infty -V_{\text{fz}})}\text {d}^3x\, \partial _t \left[ (\dot{h}^{\text {TT}}_{ij}/c)^2 + (h^{\text {TT}}_{ij,k})^2\right] . \end{aligned}$$
(3.57)

The volume \((V_\infty -V_{\text{fz}})\) is meant for t = const and thus reaches \(i^0\); it embraces the radial coordinates \(r_{\rm{bfz}}\lesssim r \lesssim +\infty \), where \(r_{\rm{bfz}}\) denotes the beginning of the far zone. In the following we drop the left side of this equation as negligibly small [of the relative order \(\lambda /(2\pi r_{\text {fz}})\), where \(r_{\text {fz}}\) is located in the far zone]. Indeed, we can assume that the source term for \(\Box h^{\text {TT}}_{ij}\), which follows from the Routhian field equation (3.17), decays at least as \(1/r^3\) for \(r\rightarrow \infty \) (for isolated systems, all source terms for \(\Box h^{\text {TT}}_{ij}\) decay at least as \(1/r^4\) if not TT-projected; the TT-projection may weaken the decay to \(1/r^3\), as for the TT-projection of the Dirac delta function). Additionally, \(\dot{h}^{\text {TT}}_{ij}\) decays as 1/r, so the integrand on the left side decays in total as \(1/r^4\). This results in

$$\begin{aligned} \frac{c^3}{32\pi G}\oint _{\text{fz}} \text {d}\varOmega \, r^2 (\dot{h}_{ij}^{\text{TT}})^2 = \frac{c^2}{32\pi G}\frac{\text {d}}{\text {d}t} \int _{(V_\infty - V_{\text{fz}})}\text {d}^3x\, (\dot{h}_{ij}^{\text{TT}})^2, \end{aligned}$$
(3.58)

which means that the energy flux through a surface in the far zone equals the growth of gravitational energy beyond that surface.
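
The balance (3.58) and the far-zone relation between spatial and time derivatives can be illustrated with a scalar toy model. The sketch below assumes a single outgoing spherical wave \(h=f(t-r/c)/r\) with the pulse profile \(f(u)=\text {e}^{-u^2}\), which mimics past stationarity; the tensor indices, the TT projection, and the common prefactor \(1/(32\pi G)\) are dropped, and all names and numerical values are hypothetical.

```python
import math

C = 2.0  # hypothetical value of c in arbitrary units

def profile(u):
    """Hypothetical pulse f(u) = exp(-u^2); it vanishes in the remote past."""
    return math.exp(-u * u)

def profile_derivative(u):
    return -2.0 * u * math.exp(-u * u)

def h(t, r):
    """Outgoing spherical wave h = f(t - r/c) / r."""
    return profile(t - r / C) / r

def hdot(t, r):
    return profile_derivative(t - r / C) / r

def far_zone_residual(t, r, dr=1e-5):
    """h_{,r} + hdot/c, which should be of order r^(-2) (here equal to -f/r^2)."""
    h_r = (h(t, r + dr) - h(t, r - dr)) / (2 * dr)
    return h_r + hdot(t, r) / C

def flux_integral(t, r_fz):
    """Angular integral of r^2 * hdot^2 over the sphere of radius r_fz."""
    return 4 * math.pi * r_fz ** 2 * hdot(t, r_fz) ** 2

def energy_beyond(t, r_fz, r_max=40.0, steps=20000):
    """Trapezoidal volume integral of hdot^2 over r_fz < r < r_max."""
    dr = (r_max - r_fz) / steps
    total = 0.0
    for k in range(steps + 1):
        r = r_fz + k * dr
        weight = 0.5 if k in (0, steps) else 1.0
        total += weight * 4 * math.pi * r * r * hdot(t, r) ** 2
    return total * dr

def balance(t, r_fz, dt=1e-4):
    """Both sides of Eq. (3.58) without the common factor 1/(32 pi G)."""
    lhs = C ** 3 * flux_integral(t, r_fz)
    rhs = C ** 2 * (energy_beyond(t + dt, r_fz) - energy_beyond(t - dt, r_fz)) / (2 * dt)
    return lhs, rhs
```

Calling `balance(6.0, 10.0)` evaluates both sides while the pulse is crossing the sphere; in this toy model they agree up to the discretization error of the integral and of the time derivative.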

3.5 Near-zone energy loss and far-zone energy flux

The change in time of the matter Routhian reads, assuming \(\mathcal {R}\) to be local in the gravitational field,

$$\begin{aligned} \frac{\text {d}R}{\text {d}t} = \frac{\partial R}{\partial t} = \int \text {d}^3x\, \frac{\partial \mathcal {R}}{\partial h^{\text {TT}}_{ij}} \dot{h}^{\text {TT}}_{ij} + \int \text {d}^3x\, \frac{\partial \mathcal {R}}{\partial h^{\text {TT}}_{ij,k}} \partial _k \dot{h}^{\text {TT}}_{ij} + \int \text {d}^3x\, \frac{\partial \mathcal {R}}{\partial \dot{h}^{\text {TT}}_{ij}} \ddot{h}^{\text {TT}}_{ij}, \end{aligned}$$
(3.59)

where

$$\begin{aligned} R(\textbf{x}_a,\textbf{p}_a,t) \equiv \int \text {d}^3x\, \mathcal {R}(\textbf{x}_a,\textbf{p}_a,h^{\text {TT}}_{ij}(t),h^{\text {TT}}_{ij,k}(t),\dot{h}^{\text {TT}}_{ij}(t)). \end{aligned}$$
(3.60)

The equation for \(\text {d}R/\text {d}t\) is valid provided the equations of motion

$$\begin{aligned} \dot{p}_{ai} = -\frac{\partial R}{\partial x_a^i}, \quad \dot{x}_a^i = \frac{\partial R}{\partial p_{ai}} \end{aligned}$$
(3.61)

hold. Furthermore, we have

$$\begin{aligned}&\int \text {d}^3x\, \frac{\partial \mathcal {R}}{\partial h^{\text {TT}}_{ij,k}} \partial _k \dot{h}^{\text {TT}}_{ij} + \int \text {d}^3x\, \frac{\partial \mathcal {R}}{\partial \dot{h}^{\text {TT}}_{ij}} \ddot{h}^{\text {TT}}_{ij} = \int \text {d}^3x\, \partial _k\left( \frac{\partial \mathcal {R}}{\partial h^{\text {TT}}_{ij,k}} \dot{h}^{\text {TT}}_{ij}\right) \\&\quad + \frac{\text {d}}{\text {d}t} \int \text {d}^3x\, \frac{\partial \mathcal {R}}{\partial \dot{h}^{\text {TT}}_{ij}} \dot{h}^{\text {TT}}_{ij} - \int \text {d}^3x\, \partial _k\left( \frac{\partial \mathcal {R}}{\partial h^{\text {TT}}_{ij,k}}\right) \dot{h}^{\text {TT}}_{ij} - \int \text {d}^3x\, \frac{\text {d}}{\text {d}t}\left( \frac{\partial \mathcal {R}}{\partial \dot{h}^{\text {TT}}_{ij}}\right) \dot{h}^{\text {TT}}_{ij}. \end{aligned}$$
(3.62)

The canonical field momentum is given by

$$\begin{aligned} \frac{c^3}{16\pi G} \pi ^{ij}_{\text {TT}} = -\delta ^{\text {TT}ij}_{kl}\frac{\partial \mathcal {R}}{\partial \dot{h}^{\text {TT}}_{kl}}. \end{aligned}$$
(3.63)

Performing the Legendre transformation

$$\begin{aligned} H = R + \frac{c^3}{16\pi G} \int \text {d}^3x\,\pi ^{ij}_{\text {TT}}\dot{h}^{\text {TT}}_{ij}, \quad \text{ or } \quad R = H - \frac{c^3}{16\pi G} \int \text {d}^3x\,\pi ^{ij}_{\text {TT}}\dot{h}^{\text {TT}}_{ij}, \end{aligned}$$
(3.64)

the energy loss equation takes the form [using Eq. (3.59) together with (3.62) and (3.63)]

$$\begin{aligned} \frac{\text {d}H}{\text {d}t}&= \int \text {d}^3x\, \partial _k\left( \frac{\partial \mathcal {R}}{\partial h^{\text {TT}}_{ij,k}} \dot{h}^{\text {TT}}_{ij}\right) + \int \text {d}^3x\, \frac{\partial \mathcal {R}}{\partial h^{\text {TT}}_{ij}} \dot{h}^{\text {TT}}_{ij} \\&\quad - \int \text {d}^3x\, \partial _k\left( \frac{\partial \mathcal {R}}{\partial h^{\text {TT}}_{ij,k}}\right) \dot{h}^{\text {TT}}_{ij} - \int \text {d}^3x\, \frac{\text {d}}{\text {d}t}\left( \frac{\partial \mathcal {R}}{\partial \dot{h}^{\text {TT}}_{ij}}\right) \dot{h}^{\text {TT}}_{ij}. \end{aligned}$$
(3.65)

Application of the field equations

$$\begin{aligned} \frac{\partial \mathcal {R}}{\partial h^{\text {TT}}_{ij}} - \partial _k\left( \frac{\partial \mathcal {R}}{\partial h^{\text {TT}}_{ij,k}}\right) - \frac{\text {d}}{\text {d}t}\left( \frac{\partial \mathcal {R}}{\partial \dot{h}^{\text {TT}}_{ij}}\right) = 0 \end{aligned}$$
(3.66)

yields, assuming past stationarity [meaning that at any finite time t no radiation can have reached spacelike infinity, so the first (surface) term in the right-hand side of Eq. (3.65) vanishes],

$$\begin{aligned} \frac{\text {d}H}{\text {d}t} = 0. \end{aligned}$$
(3.67)

Equation (3.58) shows that the conservation law (3.67), together with the Legendre transformation (3.64), implies, employing the leading-order quadratic field structure of \(\mathcal {R}\) [\(\mathcal {R}=-(1/4)(c^2/(16\pi G))(\dot{h}^{\text {TT}}_{ij})^2+\cdots \); see Eq. (F.3)],

$$\begin{aligned} \frac{\text {d}}{\text {d}t}\left( R - \int _{V_{\rm {fz}}}\text {d}^3x\,\frac{\partial \mathcal {R}}{\partial \dot{h}^{\text {TT}}_{ij}}\dot{h}^{\text {TT}}_{ij}\right) = -\mathcal{L}, \end{aligned}$$
(3.68)

where

$$\begin{aligned} \mathcal {L} = -\frac{c^4}{32\pi G} \oint _{\text{fz}} \text {d}s_k h^{\text {TT}}_{ij,k} \dot{h}^{\text {TT}}_{ij} = \frac{c^3}{32\pi G} \oint _{\text{fz}} \text {d}\varOmega \, r^2 (\dot{h}^{\text {TT}}_{ij})^2 \end{aligned}$$
(3.69)

is the well-known total energy flux (or luminosity) of gravitational waves. Equation (3.68) can be put into the energy form, again employing the leading-order quadratic field structure of \(\mathcal {R}\),

$$\begin{aligned} \frac{\text {d}}{\text {d}t}\left( H - \frac{c^2}{32\pi G}\int _{(V_\infty -V_{\rm {fz}})}\text {d}^3x\,(\dot{h}^{\text {TT}}_{ij})^2\right) = -\mathcal{L}. \end{aligned}$$
(3.70)

Note that the integral over \(V_\infty -V_{\rm {fz}}\) changes with time for radiating sources because more and more radiation enters the volume \(V_\infty -V_{\rm{fz}}\), whereas the integral over \(V_{\rm {fz}}\) changes only on secular damping time scales, because for stationary time sections the volume \(V_{\rm {fz}}\) is filled with a constant amount of radiation energy.

Taking into account Eqs. (3.29) and (3.41), we find that the second term in the parentheses on the left side of Eq. (3.70) exactly subtracts the corresponding terms from the pure \((h^{\text{TT}}_{ij,k})^2\) and \((\pi _{\text{TT}}^{ij})^2\) expressions therein. This improves, by one order in radial distance, the large-distance decay of the integrand of the integral on the whole left side of Eq. (3.70), which runs over the whole hypersurface t = const. We may now perform near- and far-zone PN expansions of the left and right sides of Eq. (3.70), respectively. Though the two series are defined differently (on the left side, an expansion in powers of 1/c around fixed time t of an energy expression which is time differentiated; on the right side, an expansion in powers of 1/c around fixed retarded time \(t-r/c\)), the expansions cannot contradict each other as long as they are not related term by term. For the latter relation we must keep in mind that PN expansions are instantaneous expansions, so that the two times, t and \(t-r/c\), are not allowed to be located too far apart from each other. This means that we have to read off the radiation right when it enters the far zone. Time-averaging of the expressions on both sides of Eq. (3.70) over several wave periods [see text below Eq. (3.77)] makes the difference between the two times negligible, as it should be if one is interested in a one-to-one correspondence between the terms on both sides. The Newtonian and 1PN wave generation processes were explicitly shown to fit into this scheme by Königsdörffer et al. (2003).

3.6 Radiation field

In the far zone, the multipole expansion of the transverse-traceless (TT) part of the gravitational field, obtained by algebraic projection with

$$\begin{aligned} P_{ijkl}(\textbf{n})&\equiv \frac{1}{2}\Big (P_{ik}(\textbf{n})P_{jl}(\textbf{n}) + P_{il}(\textbf{n})P_{jk}(\textbf{n}) - P_{ij}(\textbf{n})P_{kl}(\textbf{n})\Big ), \end{aligned}$$
(3.71)
$$\begin{aligned} P_{ij}(\textbf{n})&\equiv \delta _{ij} - n_i n_j, \end{aligned}$$
(3.72)

where \(\textbf{n}\equiv \textbf{x}/r\) (\(r\equiv |\textbf{x}|\)) is the unit vector in the direction from the source to the far away observer, reads (see, e.g., Thorne 1980; Blanchet 2014)

$$\begin{aligned} h_{ij}^{\text{TT\,fz}}(\textbf{x},t)&= \frac{G}{c^4} \frac{P_{ijkm}(\textbf{n})}{r} \sum _{l=2}^{\infty } \left\{ \left( \frac{1}{c^2}\right) ^{\frac{l-2}{2}} \frac{4}{l!}\, \text{ M}^{(l)}_{kmi_3 \ldots i_l}\left( t-\frac{r_*}{{c}}\right) \,N_{i_3 \ldots i_l} \right. \\&\quad + \left. \left( \frac{1}{{c^2}}\right) ^{\frac{l-1}{2}} \frac{8l}{(l+1)!}\, \varepsilon _{pq(k} \text{ S}^{(l)}_{m)pi_3 \ldots i_l}\left( t-\frac{r_*}{{c}}\right) \,n_q\,N_{i_3 \dots i_l}\right\} , \end{aligned}$$
(3.73)

where \(N_{i_3 \dots i_l}\equiv n^{i_3}\ldots n^{i_l}\) and where \(\text{ M}^{(l)}_{i_1i_2i_3\ldots i_l}\) and \(\text{ S}^{(l)}_{i_1i_2i_3\ldots i_l}\) denote the lth time derivatives of the symmetric and tracefree (STF) radiative mass-type and current-type multipole moments, respectively. The term with the leading mass-quadrupole tensor takes the form (see, e.g., Schäfer 1990)

$$\begin{aligned} \text{ M}_{ij}^{(2)}\left( t-\frac{r_*}{c}\right)&= \hat{\text{ M }}_{ij}^{(2)}\left( t - \frac{r_*}{c}\right) \\&\quad + \frac{2Gm}{c^3} \int _0^{\infty } {\text {d}}v \left[ \ln \left( \frac{v}{2b}\right) + \kappa \right] \hat{\text{ M }}^{(4)}_{ij}\left( t-\frac{r_*}{c}-v\right) + \mathcal {O}\left( \frac{1}{c^4}\right) , \end{aligned}$$
(3.74)

with

$$\begin{aligned} r_* = r + \frac{2Gm}{c^2}\ln \left( \frac{r}{cb}\right) + \mathcal {O}\left( \frac{1}{c^3}\right) \end{aligned}$$
(3.75)

showing the leading-order tail term of the quadrupole radiation (the gauge-dependent relative phase constant \(\kappa \) between the direct and tail terms was not explored by Schäfer 1990; for more details see, e.g., Blanchet and Schäfer 1993 and Blanchet 2014). Notice the modification of the standard PN expansion through tail terms. This expression nicely shows that multipole expansions in the far zone also induce PN expansions. The mass-quadrupole tensor \(\hat{\text{ M }}_{ij}\) is just the standard Newtonian one. Higher-order tail terms up to “tails-of-tails-of-tails” can be found in Marchand et al. (2016). Leading-order tail terms result from the backscattering of the leading-order outgoing radiation, the “tails-of-tails” from their second backscattering, and so on.

Through 1.5PN order, the luminosity expression (3.69) takes the form

$$\begin{aligned} {\mathcal {L}}(t) = \frac{G}{5c^5}\left\{ \text{ M}^{(3)}_{ij}\text{ M}^{(3)}_{ij} + \frac{1}{c^2}\left[ \frac{5}{189}\text{ M}^{(4)}_{ijk}\text{ M}^{(4)}_{ijk} + \frac{16}{9}\text{ S}^{(3)}_{ij}\text{ S}^{(3)}_{ij}\right] \right\} . \end{aligned}$$
(3.76)

For reasons of energy balance in asymptotically flat space, for any coordinate or variable representation of the Einstein theory, the time-averaged energy loss has to fulfill a relation of the form

$$\begin{aligned} -\left\langle \frac{ \text {d}{\mathcal {E}}\left( t-{r_*}/{c}\right) }{\text {d}t}\right\rangle = \left\langle {\mathcal {L}}(t)\right\rangle , \end{aligned}$$
(3.77)

where the time averaging procedure takes into account typical periods of the system (i.e., it averages over several periods of the lowest frequency mode, usually called “averaging over several wavelengths”; see, e.g., Thorne 1980). Generalizing our considerations after Eq. (3.70), we may take the observation time t much larger than the time, say \(t_{\text{bfz}}\), at which the radiation enters the far (or wave) zone, even larger than the damping time of the radiating system, by just freely transporting the radiation power along the null cone, tacitly assuming \(\langle \mathcal{L}(t,r)\rangle =\langle \mathcal{L}(t_{{\rm{bfz}}},r_{\rm{bfz}})\rangle \), where \(t-t_{\text{bfz}}=(r-r_{\text{bfz}})/c>0\). Coming back to Eq. (3.70), time averaging on its left side eliminates total time derivatives of higher PN order, so-called Schott terms, and transforms them into much higher PN orders. Both sides of Eq. (3.77) are gauge (or coordinate) invariant. We stress that Eq. (3.77) is valid for bound systems. In the case of scattering processes, a coordinate-invariant quantity is the total emitted energy.

The energy flux to nPN order in the far zone implies energy loss to \((n+2.5)\)PN order in the near zone. The leading-order 2.5PN energy loss is usually called “Newtonian” because only the Newtonian source dynamics contributes; corresponding notions are applied to the higher-order PN fluxes. It follows that energy-loss calculations are quite efficient via energy-flux calculations (Blanchet 2014). In general, the two expressions coincide only after averaging over orbital periods. In the case of circular orbits, however, this averaging procedure is not needed.
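As an illustration of the leading term of Eq. (3.76), the following minimal sketch evaluates \((G/5c^5)\,\text{M}^{(3)}_{ij}\text{M}^{(3)}_{ij}\) for a Newtonian circular binary and compares it with the familiar closed form \(32G\mu ^2r^4\omega ^6/(5c^5)\). The function names, the parametrization by reduced mass `mu`, separation `r` and orbital frequency `omega`, and the finite-difference step `h` are assumptions made for this example only; the tail corrections of Eq. (3.74) are ignored.

```python
import math

def stf_quadrupole(t, mu, r, omega):
    """Newtonian STF quadrupole mu*(x_i x_j - delta_ij r^2/3) of a circular orbit
    in the x-y plane (assumed centre-of-mass parametrization)."""
    x = (r * math.cos(omega * t), r * math.sin(omega * t), 0.0)
    return [[mu * (x[i] * x[j] - (r * r / 3.0 if i == j else 0.0))
             for j in range(3)] for i in range(3)]

def third_derivative(f, t, h):
    """Second-order central difference for the third time derivative of a 3x3 matrix."""
    fp2, fp1, fm1, fm2 = f(t + 2 * h), f(t + h), f(t - h), f(t - 2 * h)
    return [[(fp2[i][j] - 2 * fp1[i][j] + 2 * fm1[i][j] - fm2[i][j]) / (2 * h**3)
             for j in range(3)] for i in range(3)]

def quadrupole_luminosity(t, mu, r, omega, G=1.0, c=1.0, h=1e-2):
    """Leading term of Eq. (3.76); the time step is h/omega (a fraction of the orbital period)."""
    m3 = third_derivative(lambda s: stf_quadrupole(s, mu, r, omega), t, h / omega)
    contraction = sum(m3[i][j] ** 2 for i in range(3) for j in range(3))
    return G / (5.0 * c**5) * contraction

def circular_closed_form(mu, r, omega, G=1.0, c=1.0):
    """Closed-form quadrupole luminosity of a circular orbit."""
    return 32.0 * G * mu**2 * r**4 * omega**6 / (5.0 * c**5)
```

Evaluating `quadrupole_luminosity` at different times t gives the same value up to discretization error, in line with the remark that no orbital averaging is needed for circular orbits.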

4 Applied regularization techniques

The most efficient source model for analytical computations of many-body dynamics in general relativity is that of point masses (or particles) represented through Dirac delta functions. If internal degrees of freedom come into play, derivatives of the delta functions must be incorporated into the source. Clearly, point-particle sources in field theories introduce field singularities, which must be regularized in computations. Two aspects are important: (i) the differentiation of singular functions (i.e., functions which are not infinitely differentiable), and (ii) the integration of singular functions, either to new (usually also singular) functions or to the final Routhian/Hamiltonian. Item (ii) relates to the integration of the field equations and item (i) to the differentiation of their (approximate) solutions. For consistency reasons, differentiation and integration must commute.

The most efficient strategy developed for the computation of higher-order PN point-particle Hamiltonians relies on first performing a full 3-dimensional computation (using the Riesz-implemented Hadamard regularization defined later in this section) and then correcting it by a d-dimensional one around the singular points, both the local ones (UV divergences) and the one at infinity (IR divergences). A full d-dimensional computation is not needed. Beyond the 2PN level, 3-dimensional computations with analytical Hadamard and Riesz regularizations exhibit ambiguities which require a more powerful treatment. The latter is dimensional regularization. The first time this strategy was successfully applied in the context of general relativity was in the 3PN dynamics of binary point particles (Damour et al. 2001); IR divergences did not appear therein, as they enter only from the 4PN level on, the same as the nonlocal-in-time tail terms to which they are connected. At 4PN order, using different regularization methods for the treatment of IR divergences (Jaranowski and Schäfer 2015), an ambiguity parameter was left which, however, got fixed by matching to self-force calculations in the Schwarzschild metric (Le Tiec et al. 2012; Bini and Damour 2013; Damour et al. 2014).

The regularization techniques needed to perform PN calculations up to (and including) 4PN order are described in detail in Appendix A of Jaranowski and Schäfer (2015).

4.1 Distributional differentiation of homogeneous functions

Besides the appearance of UV divergences, another consequence of employing Dirac-delta sources is the necessity to differentiate homogeneous functions using an enhanced (or distributional) derivative, which comes from standard distribution theory (see, e.g., Sect. 3.3 in Chapter III of Gel’fand and Shilov 1964).

Let f be a real-valued function defined in a neighbourhood of the origin of \(\mathbb {R}^3\). f is said to be a positively homogeneous function of degree \(\lambda \), if for any number \(a>0\)

$$\begin{aligned} f(a\,\textbf{x}) = a^\lambda \,f(\textbf{x}). \end{aligned}$$
(4.1)

Let \(k:=-\lambda -2\). If \(\lambda \) is an integer and if \(\lambda \le -2\) (i.e., k is a nonnegative integer), then the partial derivative of f with respect to the coordinate \(x^i\) has to be calculated by means of the formula

$$\begin{aligned} \partial _i \,f(\textbf{x}) = \partial _{{\underline{i}}}\,f(\textbf{x}) + \frac{(-1)^k}{k!} \frac{\partial ^k\delta (\textbf{x})}{\partial x^{i_1}\cdots \partial x^{i_k}} \times \oint _\varSigma \text {d}\sigma _i\,f(\textbf{x}')\,x'^{i_1}\cdots x'^{i_k}, \end{aligned}$$
(4.2)

where \(\partial _i f\) on the lhs denotes the derivative of f considered as a distribution, while \(\partial _{{\underline{i}}}f\) on the rhs denotes the derivative of f considered as a function (which is computed using the standard rules of differentiation), \(\varSigma \) is any smooth closed surface surrounding the origin and \(\text {d}\sigma _i\) is the surface element on \(\varSigma \).

The distributional derivative does not obey the Leibniz rule. This can easily be seen by considering the distributional partial derivative of the product of \(1/r_a\) and \(1/r_a^2\). Let us suppose that the Leibniz rule is applicable here:

$$\begin{aligned} \partial _i{\frac{1}{r_a^3}} = \partial _i{\left( \frac{1}{r_a}\frac{1}{r_a^2}\right) } = \frac{1}{r_a^2}\, \partial _i{\frac{1}{r_a}} + \frac{1}{r_a}\, \partial _i{\frac{1}{r_a^2}}. \end{aligned}$$
(4.3)

The right-hand side of this equation can be computed using standard differential calculus (no terms with Dirac deltas), whereas computing the left-hand side one obtains some term proportional to \(\partial _i\delta _a\). The distributional differentiation is necessary when one differentiates homogeneous functions under the integral sign. For more details, see Appendix A 5 in Jaranowski and Schäfer (2015).
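To make the example (4.3) concrete, the sketch below evaluates numerically the surface integral entering Eq. (4.2) for \(f=r^\lambda \), taking \(\varSigma \) to be a sphere centred at the origin. The helper names and the midpoint quadrature grid are assumptions of this illustration. For \(f=1/r^2\) (\(k=0\)) the integral vanishes, so no delta term arises on the right-hand side of Eq. (4.3), whereas for \(f=1/r^3\) (\(k=1\)) the integral equals \((4\pi /3)\delta _{ii_1}\), which produces the term \(-(4\pi /3)\partial _i\delta \) on the left-hand side.

```python
import math

def surface_moment(lam, i, k_indices=(), radius=1.0, n_theta=200, n_phi=200):
    """Midpoint-rule estimate of the surface integral in Eq. (4.2),
    oint dsigma_i f(x') x'^{i_1}...x'^{i_k}, for f = r**lam on a sphere of given radius."""
    dth = math.pi / n_theta
    dph = 2.0 * math.pi / n_phi
    total = 0.0
    for a in range(n_theta):
        th = (a + 0.5) * dth
        st, ct = math.sin(th), math.cos(th)
        for b in range(n_phi):
            ph = (b + 0.5) * dph
            n = (st * math.cos(ph), st * math.sin(ph), ct)
            value = radius ** lam              # f(x') on the sphere
            for j in k_indices:
                value *= radius * n[j]         # factors x'^{i_1} ... x'^{i_k}
            # dsigma_i = n_i * radius^2 * sin(theta) dtheta dphi
            total += n[i] * value * radius**2 * st * dth * dph
    return total

def delta_coefficient(lam, i, k_indices=()):
    """Coefficient multiplying the k-th derivative of delta(x) in Eq. (4.2)."""
    k = -lam - 2
    if k < 0 or k != len(k_indices):
        raise ValueError("Eq. (4.2) needs integer lam <= -2 and k = -lam - 2 indices")
    return (-1) ** k / math.factorial(k) * surface_moment(lam, i, k_indices)
```

Because the integrand is homogeneous of degree zero in the radius (\(r^2\cdot r^\lambda \cdot r^k\) with \(k=-\lambda -2\)), the result does not depend on the `radius` argument, as required by the statement that \(\varSigma \) may be any smooth closed surface surrounding the origin.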

4.2 Riesz-implemented Hadamard regularization

The usage of Dirac \(\delta \)-functions to model point-mass sources of the gravitational field leads to the occurrence of UV divergences, i.e., divergences near the particle locations \(\textbf{x}_a\), as \(r_a\equiv |\textbf{x}-\textbf{x}_a|\rightarrow 0\). To deal with them, Infeld (1954, 1957), Infeld and Plebański (1960) introduced “good” \(\delta \)-functions, which, besides having the properties of ordinary Dirac \(\delta \)-functions, also satisfy the condition

$$\begin{aligned} \frac{1}{|\textbf{x} - \textbf{x}_0|^k}\delta (\textbf{x}-\textbf{x}_0) = 0, \quad k = 1,\ldots ,p, \end{aligned}$$
(4.4)

for some positive integer p (in practical calculations one takes p large enough to take all singularities appearing in the calculation into account). They also assumed that the “tweedling of products” property is always satisfied

$$\begin{aligned} \int \text {d}^3x\, f_1(\textbf{x})f_2(\textbf{x})\delta (\textbf{x} - \textbf{x}_0) = f_{1\,\text {reg}}(\textbf{x}_0) f_{2\,\text {reg}}(\textbf{x}_0), \end{aligned}$$
(4.5)

where “reg” means regularized value of the function at its singular point (i.e., \(\textbf{x}_0\) in the equation above) evaluated by means of the rule (4.4).

A natural generalization of the rule (4.4) is the concept of the “partie finie” value of a function at its singular point, defined as

$$\begin{aligned} f_{\text{reg}}(\textbf{x}_0) \equiv \frac{1}{4\pi } \int \text {d}\varOmega \, a_0(\textbf{n}), \end{aligned}$$
(4.6)

with (here M is some non-negative integer)

$$\begin{aligned} f(\textbf{x} = \textbf{x}_0 + \epsilon \textbf{n}) = \sum _{m=-M}^{\infty } a_m(\textbf{n})\epsilon ^m, \quad \textbf{n}\equiv \frac{\textbf{x}-\textbf{x}_0}{|\textbf{x}-\textbf{x}_0|}. \end{aligned}$$
(4.7)

Defining, for a function f singular at \(\textbf{x}=\textbf{x}_0\),

$$\begin{aligned} \int \text {d}^3x f(\textbf{x})\delta (\textbf{x}-\textbf{x}_0) \equiv f_{\rm{reg}}(\textbf{x}_0), \end{aligned}$$
(4.8)

the “tweedling of products” property (4.5) can be written as

$$\begin{aligned} (f_1 f_2)_{\text{reg}}(\textbf{x}_0) = f_{1\,\text {reg}}(\textbf{x}_0) f_{2\,\text {reg}}(\textbf{x}_0). \end{aligned}$$
(4.9)

The above property is generally wrong for arbitrary singular functions \(f_1\) and \(f_2\). In the PN calculations problems with fulfilling this property begin at the 3PN order. This is one of the reasons why one should use dimensional regularization.
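A simple two-body illustration of the failure of Eq. (4.9), chosen here only as an example, is \(f_1=1/r_1^2\) and \(f_2=1/r_2^2\) regularized at \(\textbf{x}_1\). By Eqs. (4.6)–(4.7), \(f_{1\,\text {reg}}(\textbf{x}_1)=0\) and \(f_{2\,\text {reg}}(\textbf{x}_1)=1/r_{12}^2\), whereas the \(\epsilon ^0\) coefficient of the product has the angle average \(1/(3r_{12}^4)\). The sketch below estimates the latter numerically; the helper names, the quadrature in \(\mu =\cos \theta \) and the small parameter `eps` are assumptions of the example.

```python
import math

def angle_average_inv_r2_sq(eps, r12, n_mu=2000):
    """Average of 1/r_2^2 over the sphere |x - x_1| = eps
    (midpoint rule in mu = cos(theta), theta measured from the x_1 - x_2 direction)."""
    dmu = 2.0 / n_mu
    total = 0.0
    for a in range(n_mu):
        mu = -1.0 + (a + 0.5) * dmu
        r2_sq = r12**2 + 2.0 * eps * r12 * mu + eps**2   # |x - x_2|^2 on that sphere
        total += dmu / r2_sq
    return 0.5 * total

def product_reg(r12, eps=0.02):
    """Estimate of (f1 f2)_reg at x_1 for f1 = 1/r_1^2, f2 = 1/r_2^2.
    On the sphere f1 = 1/radius^2, so the eps^0 coefficient of the averaged product
    is the radius^2 coefficient of the averaged 1/r_2^2 (an even function of radius)."""
    radius = eps * r12
    return (angle_average_inv_r2_sq(radius, r12) - 1.0 / r12**2) / radius**2

def factorized_reg(r12):
    """Right-hand side of Eq. (4.9) for the same pair of functions."""
    f1_reg = 0.0              # 1/r_1^2 has no eps^0 term in its expansion around x_1
    f2_reg = 1.0 / r12**2     # 1/r_2^2 is regular at x_1
    return f1_reg * f2_reg
```

The nonzero value of `product_reg` against the vanishing `factorized_reg` shows that the two sides of Eq. (4.9) differ for this pair of functions.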

The Riesz-implemented Hadamard (RH) regularization was developed in the context of deriving PN equations of motion of binary systems by Jaranowski and Schäfer (1997, 1998, 2000c) to deal with locally divergent integrals computed in three dimensions. The method is based on the Hadamard “partie finie” and the Riesz analytic continuation procedures.

The RH regularization relies on multiplying the full integrand, say \(i(\textbf{x})\), of the divergent integral by a regularization factor,

$$\begin{aligned} i(\textbf{x}) \longrightarrow i(\textbf{x})\Big (\frac{r_1}{s_1}\Big )^{\epsilon _1} \Big (\frac{r_2}{s_2}\Big )^{\epsilon _2}, \end{aligned}$$
(4.10)

and, after integration, studying the double limit \(\epsilon _1\rightarrow 0\), \(\epsilon _2\rightarrow 0\) within analytic continuation in the complex \(\epsilon _1\) and \(\epsilon _2\) planes (here \(s_1\) and \(s_2\) are arbitrary three-dimensional UV regularization scales). Let us thus consider such an integral performed over the whole space \({\mathbb R}^3\) and let us assume that it develops only local poles (so it is convergent at spatial infinity). The value of the integral, after performing the RH regularization in three dimensions, has the structure (this is the most general structure in the calculation of conservative Hamiltonians up to and including 4PN order)

$$\begin{aligned} I^{\text {RH}}(3;\epsilon _1,\epsilon _2)&\equiv \int _{{{\mathbb {R}}}^3} i({{\textbf{x}}}) \Big (\frac{r_1}{s_1}\Big )^{\epsilon _1} \Big (\frac{r_2}{s_2}\Big )^{\epsilon _2}\,\text {d}^3x \\&= A + c_1 \Big (\frac{1}{\epsilon _1} + \ln \frac{r_{12}}{s_1} \Big ) + c_2 \Big (\frac{1}{\epsilon _2} + \ln \frac{r_{12}}{s_2} \Big ) + \mathcal {O}(\epsilon _1,\epsilon _2). \end{aligned}$$
(4.11)

Let us mention that in the PN calculations regularized integrands \(i({{\textbf{x}}})(r_1/s_1)^{\epsilon _1}(r_2/s_2)^{\epsilon _2}\) depend on \(\textbf{x}\) only through \(\textbf{x}-\textbf{x}_1\) and \(\textbf{x}-\textbf{x}_2\), so they are translationally invariant. This explains why the regularization result (4.11) depends on \(\textbf{x}_1\) and \(\textbf{x}_2\) only through \(\textbf{x}_1-\textbf{x}_2\).

In the case of an integral over \({{\mathbb {R}}}^3\) developing poles only at spatial infinity (so it is locally integrable) it would be enough to use a regularization factor of the form \((r/r_0)^\epsilon \) (where \(r_0\) is an IR regularization scale), but it is more convenient to use the factor

$$\begin{aligned} \Big (\frac{r_1}{r_0}\Big )^{a\epsilon } \Big (\frac{r_2}{r_0}\Big )^{b\epsilon } \end{aligned}$$
(4.12)

and, after integration, study the limit \(\epsilon \rightarrow 0\). Let us denote the integrand again by \(i(\textbf{x})\). The integral, after performing the RH regularization in three dimensions, has the structure

$$\begin{aligned} I^{\text {RH}}(3;a,b,\epsilon ) \equiv \int _{{{\mathbb {R}}}^3} i({\textbf{x}}) \Big (\frac{r_1}{r_0}\Big )^{a\epsilon } \Big (\frac{r_2}{r_0}\Big )^{b\epsilon }\,\text {d}^3x = A - c_\infty \bigg (\frac{1}{(a+b)\epsilon } + \ln \frac{r_{12}}{r_0} \bigg ) + \mathcal {O}(\epsilon ). \end{aligned}$$
(4.13)

Many integrals appearing in PN calculations were computed using a famous formula derived in Riesz (1949) in d dimensions. It reads

$$\begin{aligned} \int \text {d}^dx\, r^{\alpha }_1r^{\beta }_2 = \pi ^{d/2} \frac{\varGamma (\frac{\alpha +d}{2}) \varGamma (\frac{\beta +d}{2}) \varGamma (-\frac{\alpha +\beta +d}{2}) }{\varGamma (-\frac{\alpha }{2})\varGamma (-\frac{\beta }{2})\varGamma (\frac{\alpha +\beta +2d}{2})}r^{\alpha + \beta + d}_{12}. \end{aligned}$$
(4.14)

To compute the 4PN-accurate two-point-mass Hamiltonian one needs to employ a generalization of the three-dimensional version of this formula for integrands of the form \(r^{\alpha }_1r^{\beta }_2(r_1+r_2+r_{12})^\gamma \). Such a formula was derived by Jaranowski and Schäfer (1998, 2000c), where an efficient way of implementing both formulae to regularize divergent integrals was also proposed (it employs prolate spheroidal coordinates in three dimensions). See Appendix A 1 of Jaranowski and Schäfer (2015) for details and Appendix A of Hartung et al. (2013) for the generalization of this procedure to d space dimensions.
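As a hedged numerical check of the Riesz formula (4.14), the sketch below evaluates its right-hand side with the standard-library gamma function and compares it, for the convergent case \(d=3\), \(\alpha =\beta =-2\), with a direct quadrature in prolate spheroidal coordinates. The function names and the reduction of the integral to one dimension are assumptions of this example; it is not the implementation used in the cited papers.

```python
import math

def riesz_integral(alpha, beta, r12, d=3):
    """Right-hand side of Eq. (4.14); usable away from the poles of the gamma functions."""
    g = math.gamma
    num = g((alpha + d) / 2) * g((beta + d) / 2) * g(-(alpha + beta + d) / 2)
    den = g(-alpha / 2) * g(-beta / 2) * g((alpha + beta + 2 * d) / 2)
    return math.pi ** (d / 2) * num / den * r12 ** (alpha + beta + d)

def prolate_quadrature(r12, n=200000):
    """Direct d = 3 evaluation of the alpha = beta = -2 integral.
    With r1 = (r12/2)(xi + eta), r2 = (r12/2)(xi - eta) and
    d^3x = (r12/2)^3 (xi^2 - eta^2) dxi deta dphi, the eta and phi integrals are
    elementary; substituting u = 1/xi leaves
    (4 pi / r12) * int_0^1 ln((1 + u)/(1 - u)) / u du,
    which is done here by the midpoint rule."""
    h = 1.0 / n
    total = 0.0
    for a in range(n):
        u = (a + 0.5) * h
        total += math.log((1.0 + u) / (1.0 - u)) / u * h
    return 4.0 * math.pi / r12 * total
```

For this choice of exponents all gamma functions in the denominator of Eq. (4.14) equal one and the formula reduces to \(\pi ^3/r_{12}\), which the quadrature reproduces up to its discretization error.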

4.3 Extended Hadamard regularization

A specific variant of 3-dimensional Hadamard regularization called the extended Hadamard regularization (EHR) was devised by Blanchet and Faye (2000a, 2001b). It was used by Blanchet and Faye (2000b, 2001a) at the 3PN-level computations of two-point-mass equations of motion in harmonic coordinates.

The basic idea of EHR is to associate to any function \(F\in \mathcal {F}\), where the set \(\mathcal {F}\) comprises functions which are smooth on \(\mathbb {R}^3\) except for the two points (around which they admit a power-like singular expansion), a partie-finie pseudo-function \(\text {Pf}F\), which is a linear form acting on functions from \(\mathcal {F}\):

$$\begin{aligned} \langle \text {Pf}F,G\rangle := \text {Pf}_{s_1,s_2}\int \text {d}^3x\,FG, \quad \hbox { for any}\ G\in \mathcal {F}, \end{aligned}$$
(4.15)

where \(\text {Pf}_{s_1,s_2}\) on the right-hand side means the partie finie of the divergent integral [see Eq. (3.1) in Blanchet and Faye (2000a) and the text around it for the definition]; it depends on two arbitrary regularization scales \(s_1\) and \(s_2\), one per singularity. The Dirac \(\delta \)-functions \(\delta _a\) are represented by the pseudo-functions \(\text {Pf}\delta _a\) defined by

$$\begin{aligned} \langle \text {Pf}\delta _a,G\rangle := G_{\text{reg}}(\textbf{x}_a), \quad \hbox { for any}\ G\in \mathcal {F}, \end{aligned}$$
(4.16)

where the regularized value \(G_{\text{reg}}(\textbf{x}_a)\) of a function at its singular point is defined in Eqs. (4.6)–(4.7) above. The product \(F\delta _a\) is represented by another pseudo-function \(\text {Pf}(F\delta _a)\) such that

$$\begin{aligned} \langle \text {Pf}(F\delta _a),G\rangle := (FG)_{\text{reg}}(\textbf{x}_a), \quad \hbox { for any}\ G\in \mathcal {F}. \end{aligned}$$
(4.17)

As a consequence, in general

$$\begin{aligned} \text {Pf}(F\delta _a) \ne F_{\text {reg}}(\textbf{x}_a)\text {Pf}\delta _a. \end{aligned}$$
(4.18)

Another ingredient of the EHR relies on the specific treatment of partial derivatives of singular functions. To ensure the possibility of integration by parts, one requires that \(\langle \partial _i(\text {Pf}F),G\rangle =-\langle \partial _i(\text {Pf}G),F\rangle \) for any functions \(F,G\in \mathcal {F}\). This requirement leads to the following definition of the partial derivative of the pseudo-function:

$$\begin{aligned} \partial _i(\text {Pf}F) = \text {Pf}(\partial _i F) + \text {D}_i[F], \end{aligned}$$
(4.19)

where \(\text {Pf}(\partial _i F)\) denotes the ordinary derivative of F viewed as a pseudo-function, and \(\text {D}_i[F]\) is the purely distributional part with support concentrated on \(\textbf{x}_1\) or \(\textbf{x}_2\) [see Sects. VII–IX of Blanchet and Faye (2000a) for more details]. The derivative \(\text {D}_i[F]\) is an extended distributional derivative which differs in general from the usual Schwartz derivative introduced in Eq. (4.2) above. Let us quote the results

$$\begin{aligned} \text {D}_i\Big [\frac{1}{r_1}\Big ] = 2\pi \text {Pf}(r_1 n_1^i \delta _1), \quad \text {D}_{ij}\Big [\frac{1}{r_1}\Big ] = -\frac{4\pi }{3} \text {Pf}\Big (\delta ^{ij}+\frac{15}{2}\hat{n}_1^{ij}\Big )\delta _1, \end{aligned}$$
(4.20)

where \(\hat{n}_1^{ij}\equiv n_1^in_1^j-\frac{1}{3}\delta ^{ij}\). The Schwartz derivative (4.2) of \(\partial _i(1/r_1)\) contains no distributional part, whereas the distributional part of \(\partial _i\partial _j(1/r_1)\) equals \(-(4\pi /3)\delta ^{ij}\delta _1\).

There is no known generalization of the EHR definitions (4.17) and (4.19) to the generic d-dimensional case. Moreover, these definitions disagree with the dimensional-regularization rules.

  1. (i)

    In generic d dimensions one can always use

    $$\begin{aligned} F^{(d)}(\textbf{x})\delta ^{(d)}(\textbf{x}-\textbf{x}_a) = F^{(d)}_{\text {reg}}(\textbf{x}_a)\delta ^{(d)}(\textbf{x}-\textbf{x}_a), \end{aligned}$$
    (4.21)

    where \(F^{(d)}\) is the d-dimensional version of F. This leads to the following dimensional-regularization rule [see Sect. III A in Blanchet et al. (2004)]:

    $$\begin{aligned} \big [F(\textbf{x})\delta ^{(3)}(\textbf{x}-\textbf{x}_a)\big ]_{\text {reg}} := \big (\lim _{d\rightarrow 3}F^{(d)}_{\text {reg}}(\textbf{x}_a)\big )\delta ^{(3)}(\textbf{x}-\textbf{x}_a). \end{aligned}$$
    (4.22)

    The property (4.18) disagrees with this.

  2. (ii)

    The extended differentiation (4.19), when applied to smooth functions of compact support, coincides with Schwartz differentiation (4.2). However, in the 3PN-level computations performed by Blanchet and Faye (2000b, 2001a) it operated on other, singular functions and gave results different from those obtained by applying Schwartz differentiation. The definition (4.2) of Schwartz differentiation is valid in d dimensions (see Sect. 4.4.3 below), which supports the use of this definition also in the limit of three dimensions.

The computation using the EHR constitutes an approach very different from dimensional regularization, following a different route which could not be combined with the latter. This can be clearly seen in the paper by Blanchet et al. (2004) on the dimensional-regularization completion of the 3PN equations of motion in harmonic coordinates [see the paragraph containing Eq. (1.8) and Sect. III D there]. Before applying dimensional regularization, the authors of Blanchet et al. (2004) had to subtract from the 3-dimensional results of Blanchet and Faye (2000b, 2001a) all contributions that were direct consequences of the use of EHR. However, Blanchet and Faye (2000b, 2001a) have shown that at the 3PN level the difference between the final results of EHR and dimensional regularization computations of two-point-mass equations of motion can be described in terms of one constant ambiguity parameter (which they called \(\lambda \)).

Yang and Estrada (2013) have recently developed the theory of “thick distributions” in higher dimensions n (where n is an integer larger than 1). This theory is connected with the extended Hadamard regularization, but is not equivalent to the latter.

4.4 Dimensional regularization

It was first shown by Damour et al. (2001) that the unambiguous treatment of UV divergences in the current context requires the use of dimensional regularization (see, e.g., Collins 1984). It was used both in the Hamiltonian approach and in the one using the Einstein field equations in harmonic coordinates (Damour et al. 2001; Blanchet et al. 2004; Jaranowski and Schäfer 2013; Damour et al. 2014; Jaranowski and Schäfer 2015; Bernard et al. 2016, 2017a; Marchand et al. 2018; Foffa and Sturani 2019; Foffa et al. 2019b). Dimensional regularization preserves the law of “tweedling of products” (4.9) and gives all involved integrals, particularly the inverse Laplacians, a unique definition.

4.4.1 D-dimensional ADM formalism

Dimensional regularization (DR) needs the representation of the Einstein field equations for arbitrary space dimensions, say d for the dimension of space and \(D=d+1\) for the spacetime dimension. In the following, \(G_D = G_{\text{N}} \ell _0^{d-3}\) will denote the gravitational constant in D-dimensional spacetime and \(G_{\text{N}}\) the standard Newtonian one; \(\ell _0\) is the DR scale relating both constants.

The unconstrained Hamiltonian takes the form

$$\begin{aligned} H = \int \text {d}^dx\,(N {\mathcal{H}} - c N^i {\mathcal{H}}_i) + \frac{c^4}{16\pi G_D}\oint _{i^0}\text {d}^{d-1}S_i\,\partial _j(\gamma _{ij}-\delta _{ij}\gamma _{kk}), \end{aligned}$$
(4.23)

where \(\text {d}^{d-1}S_i\) denotes the \((d-1)\)-dimensional surface element. The Hamiltonian and the momentum constraint equations written for many-point-particle systems are given by

$$\begin{aligned} \sqrt{\gamma }\,R&= \frac{1}{\sqrt{\gamma }} \left( \gamma _{ik}\gamma _{j\ell }\pi ^{ij}\pi ^{k\ell } - \frac{1}{d-1}(\gamma _{ij}\pi ^{ij})^2 \right) \\&\quad + \frac{16\pi G_D}{c^3}\sum _a (m_a^2c^4 + \gamma _a^{ij}p_{ai}p_{aj})^{\frac{1}{2}}\delta _a, \end{aligned}$$
(4.24)
$$\begin{aligned} -\nabla _j\pi ^{ij}&= \frac{8\pi G_D}{c^3}\sum _a \gamma _a^{ij}p_{aj}\delta _a. \end{aligned}$$
(4.25)

The gauge (or coordinate) ADMTT conditions read

$$\begin{aligned} \gamma _{ij} = \left( 1 + \frac{d-2}{4(d-1)}\phi \right) ^{4/(d-2)} \delta _{ij} + h^{\text{TT}}_{ij}, \quad \pi ^{ii}=0, \end{aligned}$$
(4.26)

where

$$\begin{aligned} h^{\text{TT}}_{ii} = 0, \quad \partial _j h^{\text{TT}}_{ij} = 0. \end{aligned}$$
(4.27)

The field momentum \(\pi ^{ij}\) splits into its longitudinal and TT parts, respectively,

$$\begin{aligned} \pi ^{ij} = \tilde{\pi }^{ij} + \pi _{\text{TT}}^{ij}\,, \end{aligned}$$
(4.28)

where the longitudinal part \(\tilde{\pi }^{ij}\) can be expressed in terms of a vectorial function \(V^i\),

$$\begin{aligned} \tilde{\pi }^{ij} = \partial _i V^j +\partial _j V^i - \frac{2}{d}\delta ^{ij}\partial _k V^k, \end{aligned}$$
(4.29)

and where the TT part satisfies the conditions,

$$\begin{aligned} \pi _{\text{TT}}^{ii} = 0, \quad \partial _j \pi _{\text{TT}}^{ij} = 0. \end{aligned}$$
(4.30)

The reduced Hamiltonian of the particles-plus-field system takes the form

$$\begin{aligned} H_{\text{red}}\big [\textbf{x}_a,\textbf{p}_a,h^{\text{TT}}_{ij},\pi _{\text{TT}}^{ij}\big ] = -\frac{c^4}{16\pi G_D}\int \text {d}^dx\,\varDelta \phi \big [\textbf{x}_a,\textbf{p}_a,h^{\text{TT}}_{ij},\pi _{\text{TT}}^{ij}\big ]. \end{aligned}$$
(4.31)

The equations of motion for the particles read

$$\begin{aligned} \dot{\textbf{x}}_a = \frac{\partial H_{\text{red}}}{\partial {{\textbf{p}}_a}}, \quad \dot{\textbf{p}}_a = -\frac{\partial H_{\text{red}}}{\partial {{\textbf{x}}_a}}, \end{aligned}$$
(4.32)

and the field equations for the independent degrees of freedom are given by

$$\begin{aligned} \frac{\partial }{\partial t}h^{\text{TT}}_{ij} = \frac{16\pi G_D}{c^3}\,\delta ^{\text {TT}kl}_{ij}\frac{\delta H_{\text{red}}}{\delta \pi _{\text{TT}}^{kl}}, \qquad \frac{\partial }{\partial t}\pi _{\text{TT}}^{ij} = -\frac{16\pi G_D}{c^3}\,\delta ^{\text {TT}ij}_{kl}\frac{\delta H_{\text{red}}}{\delta h^{\text{TT}}_{kl}}, \end{aligned}$$
(4.33)

where the d-dimensional TT-projection operator is defined by

$$\begin{aligned} \delta ^{\text {TT}ij}_{kl}&\equiv \frac{1}{2}(\delta _{ik}\delta _{jl}+\delta _{il}\delta _{jk}) - \frac{1}{d-1}\delta _{ij}\delta _{kl} \\&\quad - \frac{1}{2}(\delta _{ik}\partial _{jl}+\delta _{jl}\partial _{ik}+\delta _{il}\partial _{jk}+\delta _{jk}\partial _{il})\varDelta ^{-1} \\&\quad + \frac{1}{d-1}(\delta _{ij}\partial _{kl}+\delta _{kl}\partial _{ij})\varDelta ^{-1} + \frac{d-2}{d-1}\partial _{ijkl}\varDelta ^{-2}. \end{aligned}$$
(4.34)
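The defining properties of the operator (4.34) can be checked in Fourier space, where \(\partial _i\partial _j\varDelta ^{-1}\) acts as multiplication by \(n_in_j\), with \(\textbf{n}\) the unit vector along the wave vector. The following sketch, in which the function names and this Fourier-space substitution are assumptions of the illustration, builds the resulting algebraic tensor for arbitrary d and measures how well it is traceless, transverse and idempotent. For \(d=3\) the tensor has the same algebraic form as the projector of Eq. (3.71).

```python
import itertools

def tt_projector(n, d):
    """Fourier-space form of Eq. (4.34) for a unit vector n in d dimensions,
    returned as a dictionary indexed by (i, j, k, l)."""
    delta = lambda a, b: 1.0 if a == b else 0.0
    P = {}
    for i, j, k, l in itertools.product(range(d), repeat=4):
        P[i, j, k, l] = (
            0.5 * (delta(i, k) * delta(j, l) + delta(i, l) * delta(j, k))
            - delta(i, j) * delta(k, l) / (d - 1)
            - 0.5 * (delta(i, k) * n[j] * n[l] + delta(j, l) * n[i] * n[k]
                     + delta(i, l) * n[j] * n[k] + delta(j, k) * n[i] * n[l])
            + (delta(i, j) * n[k] * n[l] + delta(k, l) * n[i] * n[j]) / (d - 1)
            + (d - 2) / (d - 1) * n[i] * n[j] * n[k] * n[l]
        )
    return P

def max_violation(n, d):
    """Largest deviation from tracelessness, transversality and idempotency."""
    P = tt_projector(n, d)
    R = range(d)
    trace = max(abs(sum(P[i, i, k, l] for i in R)) for k in R for l in R)
    transverse = max(abs(sum(n[i] * P[i, j, k, l] for i in R))
                     for j in R for k in R for l in R)
    idempotent = max(
        abs(sum(P[i, j, k, l] * P[k, l, a, b] for k in R for l in R) - P[i, j, a, b])
        for i in R for j in R for a in R for b in R)
    return max(trace, transverse, idempotent)
```

The d-dependent coefficients \(1/(d-1)\) and \((d-2)/(d-1)\) are exactly what is needed for the trace to vanish in any dimension; with the three-dimensional values used at \(d\ne 3\) the trace check would fail.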

Finally, the Routh functional is defined as

$$\begin{aligned} R\big [\textbf{x}_a,\textbf{p}_a,h^{\text{TT}}_{ij},\dot{h}^{\text{TT}}_{ij}\big ] \equiv H_{\text{red}}\big [\textbf{x}_a,\textbf{p}_a, h^{\text{TT}}_{ij}, \pi _{\text{TT}}^{ij}\big ] -\frac{c^3}{16\pi G_D}\int \text {d}^dx\,\pi _{\text{TT}}^{ij}\dot{h}^{\text{TT}}_{ij}, \end{aligned}$$
(4.35)

and the fully reduced matter Hamiltonian for the conservative dynamics reads

$$\begin{aligned} H[\textbf{x}_a,\textbf{p}_a] \equiv R\big [\textbf{x}_a,\textbf{p}_a,h^{\text{TT}}_{ij}(\textbf{x}_a,\textbf{p}_a),\dot{h}^{\text{TT}}_{ij}(\textbf{x}_a,\textbf{p}_a)\big ]. \end{aligned}$$
(4.36)

4.4.2 Local and asymptotic dimensional regularization

The technique developed by Damour et al. (2001) to control local (or UV) divergences boils down to the computation of the difference

$$\begin{aligned} \lim _{d\rightarrow 3} H^{\text{loc}}(d) - H^{\text{RH loc}}(3), \end{aligned}$$
(4.37)

where \(H^{\text{RH loc}}(3)\) is the “local part” of the Hamiltonian obtained by means of the three-dimensional RH regularization [it is the sum of all integrals of the type \(I^{\text {RH}}(3;\epsilon _1,\epsilon _2)\) introduced in Eq. (4.11)], and \(H^{\text{loc}}(d)\) is its d-dimensional counterpart.

Damour et al. (2001) showed that to find the DR correction to the integral \(I^{\text {RH}}(3;\epsilon _1,\epsilon _2)\) of Eq. (4.11) related to the local pole at, say, \(\textbf{x}=\textbf{x}_1\), it is enough to consider only that part of the integrand \(i(\textbf{x})\) which develops logarithmic singularities in three dimensions, i.e., which locally behaves like \(1/r_1^3\),

$$\begin{aligned} i(\textbf{x}) = \cdots + \tilde{c}_1(\textbf{n}_1)\,r_1^{-3} + \cdots , \quad \text {when}\ \textbf{x}\rightarrow \textbf{x}_1. \end{aligned}$$
(4.38)

Then the pole part of the integral (4.11) related with the singularity at \(\textbf{x}=\textbf{x}_1\) can be recovered by RH regularization of the integral of \(\tilde{c}_1(\textbf{n}_1)\,r_1^{-3}\) over the ball \(\mathbb {B}(\textbf{x}_1,{\ell _1})\) of radius \(\ell _1\) surrounding the particle \(\textbf{x}_1\). The RH regularized value of this integral reads

$$\begin{aligned} I_1^{\text {RH}}(3;\epsilon _1) \equiv \int _{\mathbb {B}({\mathbf {x}}_1,{\ell _1})} \tilde{c}_1(\textbf{n}_1) \, r_1^{-3} \Big (\frac{r_1}{s_1}\Big )^{\epsilon _1} \, \text {d}^3 {\textbf{r}}_1 = c_1 \int _0^{\ell _1} r_1^{-1} \Big (\frac{r_1}{s_1}\Big )^{\epsilon _1}\,\text {d}r_1, \end{aligned}$$
(4.39)

where \(c_1/(4\pi )\) is the angle-averaged value of the coefficient \(\tilde{c}_1(\textbf{n}_1)\). The expansion of the integral \(I_1^{\text {RH}}(3;\epsilon _1)\) around \(\epsilon _1=0\) equals

$$\begin{aligned} I_1^{\text {RH}}(3;\epsilon _1) = c_1\Big (\frac{1}{\epsilon _1} + \ln \frac{\ell _1}{s_1}\Big ) + \mathcal {O}(\epsilon _1). \end{aligned}$$
(4.40)
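As a quick numerical sanity check of Eq. (4.40), one can compare the closed form of the radial integral in Eq. (4.39), \(c_1(\ell _1/s_1)^{\epsilon _1}/\epsilon _1\), with its truncated expansion. The following Python sketch does this; the numerical values of \(c_1\), \(\ell _1\), and \(s_1\) are arbitrary assumptions chosen only for illustration.

```python
import math

def rh_integral_exact(c1, ell1, s1, eps1):
    """Closed form of c1 * int_0^ell1 r^(-1) (r/s1)^eps1 dr, valid for eps1 > 0."""
    return c1 * (ell1 / s1) ** eps1 / eps1

def rh_integral_expanded(c1, ell1, s1, eps1):
    """Pole plus finite part of Eq. (4.40); the O(eps1) remainder is dropped."""
    return c1 * (1.0 / eps1 + math.log(ell1 / s1))

# Illustrative (assumed) values; any positive ell1 and s1 would do.
c1, ell1, s1 = 2.5, 0.7, 1.3
for eps1 in (1e-2, 1e-3, 1e-4):
    diff = rh_integral_exact(c1, ell1, s1, eps1) - rh_integral_expanded(c1, ell1, s1, eps1)
    print(f"eps1={eps1:g}  exact - expansion = {diff:.3e}")
```

The difference shrinks linearly with \(\epsilon _1\), as expected for an \(\mathcal {O}(\epsilon _1)\) remainder.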

The idea of the technique developed by Damour et al. (2001) relies on replacing the RH-regularized value of the three-dimensional integral \(I_1^{\text {RH}}(3;\epsilon _1)\) by the value of its d-dimensional version \(I_1(d)\). One thus considers the d-dimensional counterpart of the expansion (4.38). It reads

$$\begin{aligned} i(\textbf{x}) = \cdots + \ell _0^{k(d-3)}\tilde{\mathfrak {c}}_1(d;{{\textbf{n}}}_1)\,r_1^{6-3d} + \cdots , \quad \text {when}\ \textbf{x}\rightarrow \textbf{x}_1. \end{aligned}$$
(4.41)

Let us note that the specific exponent \(6-3d\) of \(r_1\) visible here follows from the \(r_1\rightarrow 0\) behaviour of the (perturbative) solutions of the d-dimensional constraint equations (4.24)–(4.25). The number k in the exponent of \(\ell _0^{k(d-3)}\) is related with the momentum-order of the considered term [e.g., at the 4PN level the term with k is of the order of \(\mathcal {O}(p^{10-2k})\), for \(k=1,\ldots ,5\); such term is proportional to \(G_D^k\)]. The integral \(I_1(d)\) is defined as

$$\begin{aligned} I_1(d) \equiv \ell _0^{k(d-3)} \int _{\mathbb {B}({\mathbf{x}}_1,{\ell _1})} \tilde{\mathfrak {c}}_1(d;{{\textbf{n}}}_1) \, r_1^{6-3d}\, \text {d}^d {{\textbf{r}}}_1 = \mathfrak {c}_1(d) \int _0^{\ell _1} r_1^{5-2d}\,\text {d}r_1, \end{aligned}$$
(4.42)

where \(\mathfrak {c}_1(d)/\big (\varOmega _{d-1}\ell _0^{k(d-3)}\big )\) (\(\varOmega _{d-1}\) stands for the area of the unit sphere in \(\mathbb {R}^d\)) is the angle-averaged value of the coefficient \(\tilde{\mathfrak {c}}_1(d;{{\textbf{n}}}_1)\),

$$\begin{aligned} \mathfrak {c}_1(d) \equiv \ell _0^{k(d-3)} \oint _{\mathbb {S}^{d-1}({\mathbf{0}},1)} \tilde{\mathfrak {c}}_1(d;{{\textbf{n}}}_1)\,\text {d}\varOmega _{d-1}. \end{aligned}$$
(4.43)

One checks that there is always a smooth connection between \(\mathfrak {c}_1(d)\) and its three-dimensional counterpart \(c_1\),

$$\begin{aligned} \lim _{d\rightarrow 3} \mathfrak {c}_1(d) = \mathfrak {c}_1(3) = c_1. \end{aligned}$$
(4.44)

The radial integral in Eq. (4.42) is convergent if the real part \(\Re (d)\) of d fulfills the condition \(\Re (d)<3\). Making use of the expansion \(\mathfrak {c}_1(d) = \mathfrak {c}_1(3+\varepsilon ) = c_1 + \mathfrak {c}'_1(3)\varepsilon + \mathcal {O}(\varepsilon ^2)\), where \(\varepsilon \equiv d-3\), the expansion of the integral \(I_1(d)\) around \(\varepsilon =0\) reads

$$\begin{aligned} I_1(d) = -\frac{\ell _1^{-2\varepsilon }}{2\varepsilon }\mathfrak {c}_1(3+\varepsilon ) = -\frac{c_1}{2\varepsilon } -\frac{1}{2} \mathfrak {c}'_1(3) + c_1 \ln \ell _1 + \mathcal {O}(\varepsilon ). \end{aligned}$$
(4.45)
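The expansion (4.45) can be checked numerically in the same spirit. In the Python sketch below the coefficient function \(\mathfrak {c}_1(d)\) is replaced by a hypothetical linear model \(c_1 + \mathfrak {c}'_1(3)(d-3)\); this model and all numerical values are assumptions made only for illustration, not quantities taken from the actual 4PN computation.

```python
import math

def dr_integral_exact(c_of_d, ell1, eps):
    """I_1(d) of Eq. (4.45) for d = 3 + eps: -ell1^(-2 eps) c(3 + eps) / (2 eps)."""
    return -ell1 ** (-2.0 * eps) * c_of_d(3.0 + eps) / (2.0 * eps)

def dr_integral_expanded(c1, c1_prime, ell1, eps):
    """Pole and finite part of Eq. (4.45); O(eps) terms are dropped."""
    return -c1 / (2.0 * eps) - 0.5 * c1_prime + c1 * math.log(ell1)

# Hypothetical coefficient function, linear in d (an assumption for illustration).
c1, c1_prime = 2.5, -0.8

def c_of_d(d):
    return c1 + c1_prime * (d - 3.0)

ell1 = 0.7
for eps in (-1e-2, -1e-3, -1e-4):   # Re(d) < 3 is where the radial integral converges
    diff = dr_integral_exact(c_of_d, ell1, eps) - dr_integral_expanded(c1, c1_prime, ell1, eps)
    print(f"eps={eps:g}  exact - expansion = {diff:.3e}")
```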

Let us note that the coefficient \(\mathfrak {c}'_1(3)\) usually depends on \(\ln r_{12}\) and it has the structure

$$\begin{aligned} \mathfrak {c}'_1(3) = \mathfrak {c}'_{11}(3) + \mathfrak {c}'_{12}(3) \ln \frac{r_{12}}{\ell _0} + 2c_1 \ln \ell _0, \end{aligned}$$
(4.46)

where \(\mathfrak {c}'_{12}(3)=(2 - k)c_1\) [which can be inferred from the dependence of \(\mathfrak {c}_1(d)\) on \(\ell _0\) given in Eq. (4.43)]. Therefore the DR correction also changes the terms \(\propto \ln {r_{12}}\).

The DR correction to the RH-regularized value of the integral \(I^{\text {RH}}(3;\epsilon _1,\epsilon _2)\) relies on replacing this integral by

$$\begin{aligned} I^{\text {RH}}(3;\epsilon _1,\epsilon _2) + \varDelta I_1 + \varDelta I_2, \end{aligned}$$
(4.47)

where

$$\begin{aligned} \varDelta I_a \equiv I_a(d) - I_a^{\text {RH}}(3;\epsilon _a), \quad a=1,2. \end{aligned}$$
(4.48)

Then one computes the double limit

$$\begin{aligned} \mathop{\lim}\limits _{\begin{array}{c} \epsilon _1\to 0 \\ \epsilon _2\to 0 \end{array}} \Big (I^{\text {RH}}&(3;\epsilon _1,\epsilon _2) + \varDelta I_1 + \varDelta I_2\Big ) \\&= A - \frac{1}{2} \big (\mathfrak {c}'_{11}(3) + \mathfrak {c}'_{21}(3)\big ) - \frac{1}{2}\big (\mathfrak {c}'_{12}(3) + \mathfrak {c}'_{22}(3)\big ) \ln \frac{r_{12}}{\ell _0} \\&\quad + \big (c_1 + c_2\big )\left( - \frac{1}{2\varepsilon } + \ln \frac{r_{12}}{\ell _0}\right) + \mathcal {O}(\varepsilon ). \end{aligned}$$
(4.49)

Note that all poles \(\propto 1/\epsilon _1,1/\epsilon _2\) and all terms depending on radii \(\ell _1\), \(\ell _2\) or scales \(s_1\), \(s_2\) cancel each other. The result (4.49) is as if all computations were fully done in d dimensions.
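The cancellation of the radius \(\ell _1\) can be illustrated numerically. The Python sketch below evaluates \(\varDelta I_1\) of Eq. (4.48) from the closed forms of the two radial integrals, subtracts both poles by hand, and shows that the remaining finite part does not depend on \(\ell _1\). As before, the linear model for \(\mathfrak {c}_1(d)\) and all numbers are assumptions for illustration only; the scale \(s_1\) survives in \(\varDelta I_1\) alone and drops out only after adding \(I^{\text {RH}}(3;\epsilon _1,\epsilon _2)\).

```python
import math

def delta_I(c1, c1_prime, ell1, s1, eps, eps1):
    """Delta I_1 of Eq. (4.48) built from the closed forms of Eqs. (4.39) and (4.42).

    c_1(d) is modelled (assumption) by c1 + c1_prime * (d - 3), with d = 3 + eps.
    """
    i_dr = -ell1 ** (-2.0 * eps) * (c1 + c1_prime * eps) / (2.0 * eps)
    i_rh = c1 * (ell1 / s1) ** eps1 / eps1
    return i_dr - i_rh

def finite_part(c1, c1_prime, ell1, s1, eps=-1e-6, eps1=1e-6):
    """Delta I_1 with the poles in eps and eps1 subtracted by hand."""
    poles = -c1 / (2.0 * eps) - c1 / eps1
    return delta_I(c1, c1_prime, ell1, s1, eps, eps1) - poles

c1, c1_prime, s1 = 2.5, -0.8, 1.3
for ell1 in (0.1, 0.7, 5.0):
    print(ell1, finite_part(c1, c1_prime, ell1, s1))
# Value expected from Eqs. (4.40) and (4.45): -c1_prime / 2 + c1 * ln(s1)
print(-0.5 * c1_prime + c1 * math.log(s1))
```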

In the DR correction of the UV divergences of the 3PN two-point-mass Hamiltonian performed by Damour et al. (2001), all poles \(\propto 1/(d-3)\) cancel each other after all terms of the type (4.49) are collected. This is not the case for the UV divergences of the 4PN two-point-mass Hamiltonian derived by Jaranowski and Schäfer (2015). As explained in Sect. VIII D of Jaranowski and Schäfer (2015), after collecting all terms of the type (4.49), one has to add to the Hamiltonian a unique total time derivative to eliminate all poles \(\propto 1/(d-3)\) (together with \(\ell _0\)-dependent logarithms).

The technique described above for the DR correction of UV divergences can easily be adapted to control IR divergences. This is done by replacing the integrals

$$\begin{aligned} \int _{\mathbb {B}({\mathbf{x}}_a,{\ell _a})}\text {d}^dx\,i(\mathbf{x}) \end{aligned}$$
(4.50)

by the integral

$$\begin{aligned} \int _{\mathbb {R}^d\setminus \mathbb {B}({\mathbf{0}},R)}\text {d}^dx\,i({\mathbf{x}}), \end{aligned}$$
(4.51)

where \(\mathbb {B}(\textbf{0},R)\) denotes a large ball of radius R (centred at the origin \(\textbf{0}\) of the coordinate system), and by studying the expansion of the integrand \(i(\textbf{x})\) for \(r\rightarrow \infty \). This technique was not used to regularize IR divergences in the computation of the 4PN two-point-mass Hamiltonian by Damour et al. (2014) and Jaranowski and Schäfer (2015), because applying it only to the instantaneous part of the 4PN Hamiltonian is not enough to get rid of the IR poles in the limit \(d\rightarrow 3\). To resolve the IR poles it was necessary to observe that they have to cancel with the UV poles from the tail part of the Hamiltonian (which can be achieved, e.g., after implementing the so-called zero-bin subtraction in the EFT framework, see Porto and Rothstein 2017).

Two other approaches were employed by Damour et al. (2014) and Jaranowski and Schäfer (2015) to regularize IR divergences in the instantaneous part of the 4PN Hamiltonian (see Appendix A 3 in Jaranowski and Schäfer 2015): (i) modifying the behavior of the function \(h^{\text{TT}}_{(6)ij}\) at infinity (see footnote 6), and (ii) implementing a d-dimensional version of Riesz–Hadamard regularization. Both approaches were developed in d dimensions, but the final results of using either of them in the limit \(d\rightarrow 3\) turned out to be identical to the results of computations performed in \(d=3\) dimensions. Moreover, the results of the two approaches differed from each other in the limit \(d\rightarrow 3\), which indicated the ambiguity of IR regularization, discussed in detail by Jaranowski and Schäfer (2015) and fixed by Damour et al. (2014). This IR ambiguity can be expressed in terms of only one unknown parameter, because the results of the two regularization approaches, albeit different, have exactly the same structure and differ only in a numerical prefactor. This prefactor can be treated as the ambiguity parameter. The full 4PN Hamiltonian was thus computed up to a single ambiguity parameter and it was used to calculate, in a gauge-invariant form, the energy of the two-body system along circular orbits as a function of frequency. The ambiguity parameter was fixed by comparing the part of this formula linear in the symmetric mass ratio \(\nu \) [see Eq. (6.3) below for the definition] with the analogous 4PN-accurate formula for a particle in the Schwarzschild metric which included self-force corrections.

An analogous ambiguity was discovered in the 4PN-accurate calculations of two-body equations of motion done by Bernard et al. (2016) in harmonic coordinates, where analytic regularization (see footnote 7) of the IR divergences of the instantaneous part of the dynamics was also performed. However, the computations made by Bernard et al. (2016) also faced a second ambiguity (Damour et al. 2016; Bernard et al. 2017b), which must come from their different (harmonic instead of ADMTT) gauge condition and the potential of analytic regularization not to preserve the gauge (in contrast to dimensional regularization). The first method of analytic regularization applied by Damour et al. (2014) and Jaranowski and Schäfer (2015) manifestly preserves the ADMTT gauge. Finally, Marchand et al. (2018) and Bernard et al. (2017a) successfully applied d-dimensional regularization throughout the harmonic-coordinates approach. However, it is worth emphasizing that in intermediate steps their derivation makes crucial use of an auxiliary regulator parameter \(\eta \), entering as a factor \(r^\eta \) multiplying the formal expansions of the source. The confidence in the procedure stems from the fact that the poles in \(\eta \) that occur cancel each other in d dimensions. On the other hand, the crucial rational number obtained in the tail action, 41/60 or 41/30 depending on representation, was already derived within pure d-dimensional calculations by Foffa and Sturani (2013b) and Galley et al. (2016) based on the EFT formalism. Yet only quite recently has a complete pure dimensional-regularization calculation been achieved by Foffa and Sturani (2019) and Foffa et al. (2019b), where use has been made of the zero-bin subtraction method for interrelated UV and IR poles, as discussed in view of the 4PN approximation by Porto (2017) and Porto and Rothstein (2017).

4.4.3 Distributional differentiation in d dimensions

One can show that the formula (4.2) for distributional differentiation of homogeneous functions is also valid (without any change) in the d-dimensional case. It leads, e.g., to the equality

$$\begin{aligned} \partial _i \partial _j r^{2-d} = (d-2)\frac{d\,n^i n^j-\delta _{ij}}{r^d} - \frac{4\pi ^{d/2}}{d\,\varGamma (d/2 -1)}\delta _{ij}\delta . \end{aligned}$$
(4.52)

To avoid the need for distributional differentiation, it is possible to replace the Dirac \(\delta \)-function by the class of analytic functions introduced in Riesz (1949),

$$\begin{aligned} \delta _{\epsilon }(\textbf{x}) \equiv \frac{\varGamma ((d-\epsilon )/2)}{\pi ^{d/2}2^{\epsilon }\varGamma (\epsilon /2)}r^{\epsilon - d}, \end{aligned}$$
(4.53)

resulting in the Dirac \(\delta \)-function in the limit

$$\begin{aligned} \delta = \lim _{\epsilon \rightarrow 0}\delta _{\epsilon }. \end{aligned}$$
(4.54)

On this class of functions, the inverse Laplacian operates as

$$\begin{aligned} \varDelta ^{-1} \delta _{\epsilon } = -\delta _{\epsilon + 2}, \end{aligned}$$
(4.55)

and instead of (4.52) one gets

$$\begin{aligned} \partial _i \partial _j r^{\epsilon + 2 - d} = (d-2-\epsilon )\frac{(d-\epsilon )n^i n^j-\delta _{ij}}{r^{d-\epsilon }}. \end{aligned}$$
(4.56)

There is no need to use distributional differentiation here, so no \(\delta \)-functions are involved.
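Both statements can be verified numerically away from the origin, where the Riesz kernel is an ordinary function. The Python sketch below checks, for \(d=3\) and an arbitrarily chosen \(\epsilon \), (i) that applying the Laplacian to \(-\delta _{\epsilon +2}\) reproduces the prefactor of \(\delta _{\epsilon }\), as required by Eq. (4.55), and (ii) that Eq. (4.56) agrees with a finite-difference Hessian of \(r^{\epsilon +2-d}\). The helper names, the evaluation point, and the step size are assumptions made for illustration.

```python
import math

def riesz_coeff(eps, d):
    """Prefactor of r^(eps - d) in the Riesz kernel delta_eps of Eq. (4.53)."""
    return math.gamma((d - eps) / 2.0) / (
        math.pi ** (d / 2.0) * 2.0 ** eps * math.gamma(eps / 2.0))

def laplacian_power_factor(a, d):
    """For r > 0 the d-dimensional Laplacian of r^a equals a (a + d - 2) r^(a - 2)."""
    return a * (a + d - 2.0)

def hessian_fd(f, x, i, j, h=1e-3):
    """Central finite-difference estimate of d^2 f / dx_i dx_j at the point x."""
    def shifted(si, sj):
        y = list(x)
        y[i] += si * h
        y[j] += sj * h
        return f(y)
    return (shifted(1, 1) - shifted(1, -1) - shifted(-1, 1) + shifted(-1, -1)) / (4.0 * h * h)

def hessian_formula(x, i, j, eps, d=3):
    """Right-hand side of Eq. (4.56)."""
    r = math.sqrt(sum(c * c for c in x))
    n = [c / r for c in x]
    delta_ij = 1.0 if i == j else 0.0
    return (d - 2.0 - eps) * ((d - eps) * n[i] * n[j] - delta_ij) / r ** (d - eps)

d, eps = 3, 0.3          # illustrative (assumed) values
x = [0.4, -0.9, 1.1]     # arbitrary point away from the origin

# Eq. (4.55): the Laplacian of -delta_{eps+2} must give back delta_eps.
lhs = -riesz_coeff(eps + 2.0, d) * laplacian_power_factor(eps + 2.0 - d, d)
print(lhs, riesz_coeff(eps, d))

# Eq. (4.56): closed form versus finite differences of r^(eps + 2 - d).
f = lambda y: math.sqrt(sum(c * c for c in y)) ** (eps + 2.0 - d)
for i, j in ((0, 0), (0, 1), (1, 2)):
    print(i, j, hessian_fd(f, x, i, j), hessian_formula(x, i, j, eps, d))
```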

Though replacing \(\delta _a\) by \(\delta _{\epsilon _a}\) (with \(a=1,2\)) in the stress-energy tensor density destroys the divergence freeness of the stress-energy tensor and thus the integrability conditions of the Einstein theory, the relaxed Einstein field equations (the ones which result after imposing coordinate conditions) do not force the stress-energy tensor to be divergence free and can thus be solved without problems. The solutions one gets do not fulfill the complete Einstein field equations, but in the final limits \(\epsilon _a\rightarrow 0\) the general coordinate covariance of the theory is manifestly recovered. This property, however, only holds if these limits are taken before the limit \(d\rightarrow 3\) is performed (Damour et al. 2008a).

5 Point-mass representations of spinless black holes

This section is devoted to showing how black holes, the most compact objects in GR, can be represented by point masses. Conversely, the developments in the present section show that point masses, interpreted as fictitious point masses (analogously to image charges in electrostatics), allow one to represent black holes. Later on, in the section on approximate Hamiltonians for spinning binaries, neutron stars will also be considered, taking into account their different rotational deformation. Tidal deformations are considered in Sect. 8.

The simplest black hole is a Schwarzschildian one which is isolated and non-rotating. Its metric is a static solution of the vacuum Einstein field equations. In isotropic coordinates, the Schwarzschild metric reads (see, e.g., Misner et al. 1973)

$$\begin{aligned} \text {d}s^2 = -\left( \frac{ 1-\frac{GM}{2rc^2}}{ 1+\frac{GM}{2rc^2}}\right) ^2 c^2 \text {d}t^2 + \left( 1+\frac{GM}{2rc^2}\right) ^4 \text {d}\textbf{x}^2, \end{aligned}$$
(5.1)

where M is the gravitating mass of the black hole and \((x^1,x^2,x^3)\) are Cartesian coordinates in \(\mathbb {R}^3\) with \(r^2 = (x^1)^2 + (x^2)^2 + (x^3)^2\) and \(\text {d}\textbf{x}^2 = (\text {d}x^1)^2 + (\text {d}x^2)^2 + (\text {d}x^3)^2\). The origin of the coordinate system \(r=0\) is not located where the Schwarzschild singularity \(R=0\), with R the radial Schwarzschild coordinate, is located; rather, it is located on the other side of the Einstein–Rosen bridge, at infinity, where space is flat. The point \(r=0\) does not belong to the three-dimensional spacelike curved manifold, so we have an open manifold excluding the point \(r=0\), a so-called “puncture” manifold (see, e.g., Brandt and Brügmann 1997; Cook 2005). However, as we shall see below, the Schwarzschild metric can be constructed with the aid of a Dirac \(\delta \) function with support at \(r=0\), located in a conformally related flat space of dimension smaller than three. Distributional sources with support at the Schwarzschild singularity are summarized and treated by Pantoja and Rago (2002) and Heinzle and Steinbauer (2002).

A two black hole initial value solution of the vacuum Einstein field equations is the time-symmetric Brill–Lindquist one (Brill and Lindquist 1963; Lindquist 1963),

$$\begin{aligned} \text {d}s^2 = -\left( \frac{ 1-\frac{\beta _1G}{2r_1c^2}-\frac{\beta _2G}{2r_2c^2}}{ 1+\frac{\alpha _1G}{2r_1c^2}+\frac{\alpha _2G}{2r_2c^2}}\right) ^2 c^2\text {d}t^2 + \left( 1+\frac{\alpha _1G}{2r_1c^2}+\frac{\alpha _2G}{2r_2c^2}\right) ^4\text {d}\textbf{x}^2, \end{aligned}$$
(5.2)

where \(\textbf{r}_a\equiv \textbf{x}-\textbf{x}_a\) and \(r_a\equiv |\textbf{r}_a|\) (\(a=1,2\)); the coefficients \(\alpha _a\) and \(\beta _a\) can be found in Jaranowski and Schäfer (2002) (notice that \(h^{\text{TT}}_{ij}=0\), \(\pi ^{ij}=0\), and, initially, \(\partial _t r_a=0\)). Its total energy results from the ADM surface integral [this is the reduced ADM Hamiltonian from Eq. (2.20) written for the metric (5.2)]

$$\begin{aligned} E_{\text {ADM}} = -\frac{c^4}{2\pi G} \oint _{i_0}\text {d}S_i\, \partial _i\varPsi = (\alpha _1 + \alpha _2)c^2, \end{aligned}$$
(5.3)

where \(\text {d}S_i=n^ir^2\text {d}\varOmega \) is a two-dimensional surface-area element (with unit radial vector \(n^i\equiv {x^i/r}\) and solid angle element \(\text {d}\varOmega \)) and

$$\begin{aligned} \varPsi \equiv 1 + \frac{\alpha _1G}{2r_1c^2} + \frac{\alpha _2G}{2r_2c^2}. \end{aligned}$$
(5.4)

Introducing the inversion map \(\textbf{x}\rightarrow \textbf{x}'\) defined by Brill and Lindquist (1963)

$$\begin{aligned} \textbf{r}'_1 \equiv \textbf{r}_1 \frac{\alpha _1^2G^2}{4c^4r_1^2} \quad \Longrightarrow \quad \textbf{r}_1 = \textbf{r}'_1 \frac{\alpha _1^2G^2}{4c^4r_1'^{2}}, \end{aligned}$$
(5.5)

where \(\textbf{r}'_1\equiv \textbf{x}'-\textbf{x}_1\), \(r'_1\equiv |\textbf{x}'-\textbf{x}_1|\), the three-metric \(\text {d}l^2 = \varPsi ^4 \text {d}\textbf{x}^2\) transforms into

$$\begin{aligned} \text {d}l^2 = \varPsi '^4 \text {d}\textbf{x}'^2, \quad \text {with}\quad \varPsi ' \equiv 1+\frac{\alpha _1G}{2r'_1c^2}+\frac{\alpha _1\alpha _2G^2}{4r_2 r'_1 c^4}, \end{aligned}$$
(5.6)

where \(\textbf{r}_2 = \textbf{r}'_1 \alpha _1^2\,G^2/(4c^4r_1'^{2}) + \textbf{r}_{12}\) with \(\textbf{r}_{12}\equiv \textbf{x}_{1} - \textbf{x}_{2}\). From the new metric function \(\varPsi '\) the proper mass of throat 1 follows as

$$\begin{aligned} m_1 \equiv -\frac{c^2}{2\pi G}\oint _{i_0^1} \text {d}S'_i\, \partial '_i \varPsi ' = \alpha _1 + \frac{\alpha _1\alpha _2G}{2r_{12}c^2}, \end{aligned}$$
(5.7)

where \(i_0^1\) denotes the own spacelike infinity of black hole 1. From this, the ADM energy comes out in the form

$$\begin{aligned} E_{\text {ADM}} = (m_1 + m_2)c^2 - G \frac{\alpha _1\alpha _2}{r_{12}}, \end{aligned}$$
(5.8)

where

$$\begin{aligned} \alpha _a = \frac{m_a-m_b}{2} + \frac{c^2r_{ab}}{G}\left( \sqrt{1+\frac{m_a+m_b}{c^2r_{ab}/G} + \left( \frac{m_a-m_b}{2c^2r_{ab}/G}\right) ^2} - 1 \right) . \end{aligned}$$
(5.9)
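The closed form (5.9) can be checked against Eqs. (5.3), (5.7), and (5.8) with a few lines of code. The following Python sketch works in units \(G=c=1\) by default; the function names and the sample masses and separation are assumptions chosen for illustration.

```python
import math

def alpha_closed_form(m_a, m_b, r_ab, G=1.0, c=1.0):
    """Eq. (5.9): parameter alpha_a expressed through the proper masses m_a, m_b."""
    rho = c * c * r_ab / G          # separation expressed in units of mass
    half_diff = 0.5 * (m_a - m_b)
    root = math.sqrt(1.0 + (m_a + m_b) / rho + (half_diff / rho) ** 2)
    return half_diff + rho * (root - 1.0)

def proper_mass(alpha_a, alpha_b, r_ab, G=1.0, c=1.0):
    """Eq. (5.7): proper mass of throat a computed from the two alpha parameters."""
    return alpha_a + alpha_a * alpha_b * G / (2.0 * r_ab * c * c)

m1, m2, r12 = 1.0, 0.4, 3.0         # illustrative (assumed) values
a1 = alpha_closed_form(m1, m2, r12)
a2 = alpha_closed_form(m2, m1, r12)

# Inserting the alphas back into Eq. (5.7) should reproduce m1 and m2.
print(proper_mass(a1, a2, r12), proper_mass(a2, a1, r12))

# The two forms of the ADM energy, Eqs. (5.3) and (5.8), with G = c = 1.
e_53 = a1 + a2
e_58 = (m1 + m2) - a1 * a2 / r12
print(e_53, e_58)
```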

This construction, as performed by Brill and Lindquist (1963), is a purely geometrical (or vacuum) one without touching singularities. Recall that this energy belongs to an initial value solution of the Einstein constraint equations in which \(h^{\text{TT}}_{ij}\), the particle momenta, and the field momenta all vanish. These initial conditions include spurious gravitational waves.

In the following we will show how the vacuum Brill–Lindquist solution can be obtained with Dirac \(\delta \)-function source terms located at \(r_1=0\) and \(r_2=0\) in a conformally related three-dimensional flat space. To do this we will formulate the problem in d space dimensions and make analytical continuation in d of the results down to \(d=3\). The insertion of the stress-energy density for point masses into the Hamiltonian constraint equation yields, for \(p_{ai}=0\), \(h^{\text{TT}}_{ij}=0\), and \(\pi ^{ij}=0\),

$$\begin{aligned} -\varPsi \varDelta \phi = \frac{16\pi G}{c^2} \sum _a m_a \delta _a, \end{aligned}$$
(5.10)

where \(\varPsi \) and \(\phi \) parametrize the space metric,

$$\begin{aligned} \gamma _{ij} = \varPsi ^{4/(d-2)}\delta _{ij}, \quad \varPsi \equiv 1 + \frac{d-2}{4(d-1)}\phi . \end{aligned}$$
(5.11)

If the lapse function N is represented by

$$\begin{aligned} N \equiv \frac{\chi }{\varPsi }, \end{aligned}$$
(5.12)

an equation for \(\chi \) results of the form (using the initial-data conditions \(p_{ai}=0\), \(h^{\text{TT}}_{ij}=0\), \(\pi ^{ij}=0\)),

$$\begin{aligned} \varPsi ^2 \varDelta \chi =\frac{4\pi G}{c^2} \frac{d-2}{d-1}\chi \sum _a m_a\delta _a. \end{aligned}$$
(5.13)

With the aid of the relation

$$\begin{aligned} \varDelta \frac{1}{r_a^{d-2}} = -\frac{4\pi ^{d/2}}{\varGamma (d/2-1)}\delta _a \end{aligned}$$
(5.14)

it is easy to show that for \(1<d<2\) the equations for \(\varPsi \) and \(\chi \) do have well-defined solutions. To obtain these solutions we employ the ansatz

$$\begin{aligned} \phi = \frac{4G}{c^2}\frac{\varGamma (d/2-1)}{\pi ^{d/2-1}}\left( \frac{\alpha _1}{r^{d-2}_1}+\frac{\alpha _2}{r^{d-2}_2}\right) , \end{aligned}$$
(5.15)

where \(\alpha _1\) and \(\alpha _2\) are some constants. After plugging the ansatz (5.15) into Eq. (5.10) we compare the coefficients of the Dirac \(\delta \)-functions on both sides of the equation. For point mass 1 we get

$$\begin{aligned} \bigg (1 + \frac{G(d-2)\varGamma (d/2-1)}{c^2(d-1)\pi ^{d/2-1}} \Big (\frac{\alpha _1}{r^{d-2}_1}+\frac{\alpha _2}{r^{d-2}_2}\Big )\bigg ) \alpha _1 \delta _1 = m_1 \delta _1. \end{aligned}$$
(5.16)

After taking \(1<d<2\), one can perform the limit \(r_1 \rightarrow 0\) for the coefficient of \(\delta _1\) on the left-hand side of the above equation,

$$\begin{aligned} \bigg (1 + \frac{G(d-2)\varGamma (d/2-1)}{c^2(d-1)\pi ^{d/2-1}} \frac{\alpha _2}{r^{d-2}_{12}}\bigg ) \alpha _1 \delta _1 = m_1 \delta _1. \end{aligned}$$
(5.17)

Going over to \(d=3\) by arguing that the solution is analytic in d results in the relation

$$\begin{aligned} \alpha _a = \frac{m_a}{ 1+ \frac{G}{2c^2}\frac{\alpha _b}{r_{ab}}}, \end{aligned}$$
(5.18)

where \(b \ne a\) and \(a,b = 1,2\). In the limit \(d=3\), the ADM energy is again given by

$$\begin{aligned} E_{\text {ADM}} = (\alpha _1 + \alpha _2) c^2. \end{aligned}$$
(5.19)
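The coupled relations (5.18) can also be solved directly, without the closed form. The Python sketch below uses a simple fixed-point iteration (our choice for illustration, not a method taken from the cited papers) and compares the resulting binding energy, \(E_{\text {ADM}}-(m_1+m_2)c^2\), with the Newtonian value \(-Gm_1m_2/r_{12}\) at large separations. Units \(G=c=1\) and all sample values are assumptions.

```python
def solve_alphas(m1, m2, r12, G=1.0, c=1.0, tol=1e-14, max_iter=200):
    """Solve the coupled relations (5.18) for (alpha_1, alpha_2) by fixed-point iteration."""
    k = G / (2.0 * c * c * r12)
    a1, a2 = m1, m2                      # Newtonian starting guess
    for _ in range(max_iter):
        new_a1 = m1 / (1.0 + k * a2)
        new_a2 = m2 / (1.0 + k * new_a1)
        if abs(new_a1 - a1) + abs(new_a2 - a2) < tol:
            return new_a1, new_a2
        a1, a2 = new_a1, new_a2
    raise RuntimeError("fixed-point iteration did not converge")

def binding_energy(m1, m2, r12, G=1.0, c=1.0):
    """E_ADM of Eq. (5.19) minus the rest-mass energy (m1 + m2) c^2."""
    a1, a2 = solve_alphas(m1, m2, r12, G, c)
    return (a1 + a2 - m1 - m2) * c * c

for r12 in (10.0, 100.0, 1000.0):
    e = binding_energy(1.0, 0.4, r12)
    print(r12, e, -1.0 * 0.4 / r12)      # second number: Newtonian value -G m1 m2 / r12
```

The iteration converges quickly for separations large compared with \(Gm_a/c^2\); for very close configurations a more robust root finder may be preferable.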

Here we recognize the important aspect that although the metric may describe close binary black holes with strongly deformed apparent horizons, both black holes can still be generated by point masses in conformally related flat space. This is the justification for taking our particle model as a model for orbiting black holes. Obviously, black holes generated by point masses are orbiting black holes without spin, i.e., Schwarzschild-type black holes. The representation of a Schwarzschild-type black hole in binary–black-hole systems with one Dirac \(\delta \)-function seems not to be the only possibility. As shown by Jaranowski and Schäfer (2000a), binary–black-hole configurations defined through isometry conditions at the apparent horizons (Misner 1963) need infinitely many Dirac \(\delta \)-functions for each of the black holes. Whether or not those black holes are more physical is not known. It has been found by Jaranowski and Schäfer (1999) that the expressions for the ADM energy of the two kinds of binary black holes agree through 2PN order, and that at the 3PN level the energy of the Brill–Lindquist binary black holes is additively higher by \(G^4m_1^2m_2^2(m_1+m_2)/(8c^6r^4_{12})\), i.e., the Misner configuration seems to be more strongly bound (see footnote 8). The same paper has shown that the spatial metrics of both binary–black-hole configurations coincide through 3PN order, and that at least through 5PN order they can be made to coincide by shifts of black-hole position variables.

6 Post-Newtonian Hamilton dynamics of nonspinning compact binaries

In this section we collect explicit results on Hamilton dynamics of binaries made of compact and nonspinning bodies. Up to the 4PN order the Hamiltonian of binary point-mass systems is explicitly known and it can be written as the sum

$$\begin{aligned} H[\textbf{x}_a,\textbf{p}_a,t]&= \sum _a m_a c^2 + H_{\text {N}}(\textbf{x}_a,\textbf{p}_a) + \frac{1}{c^2} H_{\text {1PN}}(\textbf{x}_a,\textbf{p}_a) + \frac{1}{c^4} H_{\text {2PN}}(\textbf{x}_a,\textbf{p}_a) \\&\quad + \frac{1}{c^5} H_{\text {2.5PN}}(\textbf{x}_a,\textbf{p}_a,t) + \frac{1}{c^6} H_{\text {3PN}}(\textbf{x}_a,\textbf{p}_a) + \frac{1}{c^7} H_{\text {3.5PN}}(\textbf{x}_a,\textbf{p}_a,t) \\&\quad + \frac{1}{c^8} H_{\text {4PN}}[\textbf{x}_a,\textbf{p}_a] + \mathcal {O}(c^{-9}). \end{aligned}$$
(6.1)

This Hamiltonian is the PN-expanded reduced ADM Hamiltonian of point-masses plus field system; the nontrivial procedure of reduction is described in Sects. 3.1 and 3.2 of this review. The non-autonomous dissipative Hamiltonians \(H_{\text {2.5PN}}(\textbf{x}_a,\textbf{p}_a,t)\) and \(H_{\text {3.5PN}}(\textbf{x}_a,\textbf{p}_a,t)\) are written as explicitly depending on time because they depend on the gravitational field variables (see Sect. 6.5 for more details). The dependence of the 4PN Hamiltonian \(H_{\text {4PN}}\) on \(\textbf{x}_a\) and \(\textbf{p}_a\) is both pointwise and functional (and this is why we have used square brackets for arguments of \(H_{\text {4PN}}\)).

We will display here the conservative Hamiltonians \(H_{\text {N}}\) to \(H_{\text {4PN}}\) in the centre-of-mass reference frame, relegating their generic, noncentre-of-mass forms, to Appendix C. In the ADM formalism the centre-of-mass reference frame is defined by the simple requirement

$$\begin{aligned} \textbf{p}_1 + \textbf{p}_2 = \textbf{0}. \end{aligned}$$
(6.2)

Here we should point out that recoil arises for the first time at the 3.5PN order, hence the conservation of linear momentum is violated [see, e.g., Fitchett 1983 (derivation based on wave solutions of linearized field equations) and Junker and Schäfer 1992 (derivation based on wave solutions of non-linear field equations)]. This, however, has no influence on the energy through 6.5PN order if \(\textbf{P}\equiv \textbf{p}_1+\textbf{p}_2=\textbf{0}\) holds initially, because Eq. (3.43) is valid up to 3PN order and the change of the Hamiltonian H caused by nonconservation of \(\textbf{P}\) equals \((\text {d}H/\text {d}t)|_{\mathcal {M}=\text {const}} =\big ((c^2/H)\textbf{P}\big )_{\text{3PN}}\cdot (\text {d}\textbf{P}/\text {d}t)_{\text{3.5PN}}=0\) [where \(\mathcal {M}\) is defined in Eq. (3.43)] through 6.5PN order.

Let us define

$$\begin{aligned} M \equiv m_1 + m_2, \quad \mu \equiv \frac{m_1m_2}{M}, \quad \nu \equiv \frac{\mu }{M}, \end{aligned}$$
(6.3)

where the symmetric mass ratio \(\nu \) satisfies \(0 \le \nu \le 1/4\), with \(\nu = 0\) being the test-body case and \(\nu = 1/4\) corresponding to equal-mass binaries. It is convenient to introduce reduced (or rescaled) variables \(\textbf{r}\) and \(\textbf{p}\) (together with the rescaled time variable \(\hat{t}\)),

$$\begin{aligned} \textbf{r}\equiv \frac{\textbf{x}_1 - \textbf{x}_2}{GM}, \quad \textbf{n}\equiv \frac{\textbf{r}}{|\textbf{r}|}, \quad \textbf{p} \equiv \frac{\textbf{p}_1}{\mu } = -\frac{\textbf{p}_2}{\mu }, \quad p_r \equiv \textbf{n}\cdot \textbf{p}, \quad \hat{t} \equiv \frac{t}{GM}, \end{aligned}$$
(6.4)

as well as the reduced Hamiltonian [note that \(H=\mathcal {M}c^2\), see Eq. (3.43)]

$$\begin{aligned} \hat{H} \equiv \frac{H-Mc^2}{\mu }. \end{aligned}$$
(6.5)

6.1 Conservative Hamiltonians through 4PN order

The conservative reduced 4PN-accurate two-point-mass Hamiltonian in the centre-of-mass frame reads

$$\begin{aligned} \hat{H}[\textbf{r},\textbf{p}]&= \hat{H}_{\text {N}}(\textbf{r},\textbf{p}) + \frac{1}{c^2} \hat{H}_{\text {1PN}}(\textbf{r},\textbf{p}) + \frac{1}{c^4} \hat{H}_{\text {2PN}}(\textbf{r},\textbf{p}) \\&\quad + \frac{1}{c^6} \hat{H}_{\text {3PN}}(\textbf{r},\textbf{p}) + \frac{1}{c^8} \hat{H}_{\text {4PN}}[\textbf{r},\textbf{p}]. \end{aligned}$$
(6.6)

The Hamiltonians \(\hat{H}_{\text {N}}\) through \(\hat{H}_{\text {3PN}}\) are local in time. They explicitly read

$$\begin{aligned} \hat{H}_{\text {N}}(\textbf{r},\textbf{p})= & {} \frac{p^2}{2} - \frac{1}{r}, \end{aligned}$$
(6.7)
$$\begin{aligned} \hat{H}_{\text {1PN}}(\textbf{r},\textbf{p})= & {} \frac{1}{8} (3{\nu } -1) p^4 - \frac{1}{2}\left[ (3+{\nu }) p^2 + {\nu } p^2_r\right] \frac{1}{r} + \frac{1}{2r^2}, \end{aligned}$$
(6.8)
$$\begin{aligned} \hat{H}_{\text {2PN}}(\textbf{r},\textbf{p})= & {} \frac{1}{16}(1-5{\nu }+5\nu ^2)p^6 \\{} & {} + \frac{1}{8}\big [(5-20{\nu } - 3\nu ^2)p^4 - 2\nu ^2p^2_rp^2 - 3\nu ^2 p^4_r\big ]\frac{1}{r} \\{} & {} + \frac{1}{2}[(5+8{\nu } )p^2 + 3{\nu } p_r^2]\frac{1}{r^2} - \frac{1}{4}(1+3{\nu })\frac{1}{r^3}, \end{aligned}$$
(6.9)
$$\begin{aligned} \hat{H}_{\text {3PN}}(\textbf{r},\textbf{p})= & {} \frac{1}{128}(-5+35{\nu } - 70\nu ^2 + 35 \nu ^3) p^8 \\{} & {} + \frac{1}{16}\Big [(-7+42{\nu } - 53\nu ^2 - 5\nu ^3)p^6 +(2-3\nu )\nu ^2p_r^2p^4 \\{} & {} + 3(1-\nu )\nu ^2p_r^4p^2 - 5\nu ^3p_r^6\Big ]\frac{1}{r} + \Big [\frac{1}{16}(-27 + 136{\nu } + 109\nu ^2)p^4 \\{} & {} + \frac{1}{16}(17+30\nu ){\nu } p_r^2p^2 + \frac{1}{12}(5+43\nu ){\nu } p_r^4\Big ]\frac{1}{r^2} \\{} & {} + \Bigg [\left( -\frac{25}{8} + \left( \frac{1}{64} \pi ^2 - \frac{335}{48}\right) {\nu } - \frac{23}{8} \nu ^2\right) p^2 \\{} & {} + \left( -\frac{85}{16} - \frac{3}{64}\pi ^2 - \frac{7}{4} \nu \right) {\nu } p^2_r\Bigg ] \frac{1}{r^3} \\{} & {} + \left[ \frac{1}{8} + \left( \frac{109}{12} - \frac{21}{32} \pi ^2\right) {\nu }\right] \frac{1}{r^4}. \end{aligned}$$
(6.10)
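For numerical work, the lower-order Hamiltonians translate directly into code. The Python sketch below transcribes Eqs. (6.7)–(6.9) as functions of \(r\), \(p^2\), \(p_r^2\), and \(\nu \) and sums them as in Eq. (6.6), truncated after the 2PN term; the 3PN and 4PN pieces are omitted only to keep the example short. The function names and the sample phase-space point are assumptions for illustration.

```python
def h_newton(r, p2):
    """Eq. (6.7); p2 stands for p^2."""
    return 0.5 * p2 - 1.0 / r

def h_1pn(r, p2, pr2, nu):
    """Eq. (6.8); pr2 stands for p_r^2."""
    return (0.125 * (3.0 * nu - 1.0) * p2 ** 2
            - 0.5 * ((3.0 + nu) * p2 + nu * pr2) / r
            + 0.5 / r ** 2)

def h_2pn(r, p2, pr2, nu):
    """Eq. (6.9)."""
    return ((1.0 - 5.0 * nu + 5.0 * nu ** 2) * p2 ** 3 / 16.0
            + ((5.0 - 20.0 * nu - 3.0 * nu ** 2) * p2 ** 2
               - 2.0 * nu ** 2 * pr2 * p2
               - 3.0 * nu ** 2 * pr2 ** 2) / (8.0 * r)
            + 0.5 * ((5.0 + 8.0 * nu) * p2 + 3.0 * nu * pr2) / r ** 2
            - 0.25 * (1.0 + 3.0 * nu) / r ** 3)

def h_reduced_2pn(r, p2, pr2, nu, c=1.0):
    """Eq. (6.6) truncated after the 2PN term (3PN and 4PN pieces omitted here)."""
    return (h_newton(r, p2)
            + h_1pn(r, p2, pr2, nu) / c ** 2
            + h_2pn(r, p2, pr2, nu) / c ** 4)

# Equal-mass binary at reduced separation r = 20 with the Newtonian
# circular-orbit momentum p^2 = 1/r and p_r = 0 (assumed sample point, c = 1).
r = 20.0
print(h_newton(r, 1.0 / r), h_reduced_2pn(r, 1.0 / r, 0.0, 0.25))
```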

The total 4PN Hamiltonian \(\hat{H}_{\text {4PN}}[\textbf{r},\textbf{p}]\) is the sum of the local-in-time piece \(\hat{H}_{\text {4PN}}^{\text {local}}(\textbf{r},\textbf{p})\) and the piece \(\hat{H}_{\text {4PN}}^{\text {nonlocal}}[\textbf{r},\textbf{p}]\) which is nonlocal in time:

$$\begin{aligned} \hat{H}_{\text {4PN}}[\textbf{r},\textbf{p}] = \hat{H}_{\text {4PN}}^{\text {local}}(\textbf{r},\textbf{p}) + \hat{H}_{\text {4PN}}^{\text {nonlocal}}[\textbf{r},\textbf{p}]. \end{aligned}$$
(6.11)

The local-in-time 4PN Hamiltonian \(\hat{H}_{\text {4PN}}^{\text {local}}(\textbf{r},\textbf{p})\) reads

$$\begin{aligned} \hat{H}_{\text {4PN}}^{\text {local}}(\textbf{r},\textbf{p})&= \left( \frac{7}{256} -\frac{63}{256}\nu +\frac{189}{256}\nu ^2 -\frac{105}{128}\nu ^3 +\frac{63}{256}\nu ^4 \right) p^{10} \\&\quad + \Bigg \{ \frac{45}{128} p^8 -\frac{45}{16} p^8\nu +\left( \frac{423}{64} p^8 -\frac{3}{32} p_r^2 p^6 -\frac{9}{64} p_r^4 p^4 \right) \nu ^2 \\&\quad + \left( -\frac{1013}{256} p^8 +\frac{23}{64} p_r^2 p^6 +\frac{69}{128} p_r^4 p^4 -\frac{5}{64} p_r^6 p^2 +\frac{35}{256} p_r^8 \right) \nu ^3 \\&\quad + \left( -\frac{35}{128} p^8 -\frac{5}{32} p_r^2 p^6 -\frac{9}{64} p_r^4 p^4 -\frac{5}{32} p_r^6 p^2 -\frac{35}{128} p_r^8 \right) \nu ^4 \Bigg \}\frac{1}{r} \\&\quad + \Bigg \{ \frac{13}{8} p^6 + \left( -\frac{791}{64}p^6 +\frac{49}{16} p_r^2 p^4 -\frac{889}{192} p_r^4 p^2 +\frac{369}{160} p_r^6 \right) \nu \\&\quad + \left( \frac{4857}{256} p^6 -\frac{545}{64} p_r^2 p^4 +\frac{9475}{768} p_r^4 p^2 -\frac{1151}{128} p_r^6 \right) \nu ^2 \\&\quad + \left( \frac{2335}{256} p^6 +\frac{1135}{256} p_r^2 p^4 -\frac{1649}{768} p_r^4 p^2 +\frac{10353}{1280} p_r^6 \right) \nu ^3 \Bigg \}\frac{1}{r^2} \\&\quad + \Bigg \{ \frac{105}{32} p^4 + \left[ \left( \frac{2749}{8192}\pi ^2-\frac{589189}{19200}\right) p^4 + \left( \frac{63347}{1600} - \frac{1059}{1024}\pi ^2\right) p_r^2 p^2 \right. \\&\quad \left. 
+ \left( \frac{375}{8192}\pi ^2-\frac{23533}{1280}\right) p_r^4 \right] \nu + \bigg [ \left( \frac{18491}{16384}\pi ^2 - \frac{1189789}{28800}\right) p^4 \\&\quad - \left( \frac{127}{3} + \frac{4035}{2048}\pi ^2\right) p_r^2 p^2 + \left( \frac{57563}{1920} - \frac{38655}{16384}\pi ^2 \right) p_r^4 \bigg ]\nu ^2 \\&\quad + \bigg ( -\frac{553}{128} p^4 -\frac{225}{64} p_r^2 p^2 -\frac{381}{128} p_r^4 \bigg )\nu ^3 \Bigg \}\frac{1}{r^3} \\&\quad + \Bigg \{ \frac{105}{32}p^2 + \left[ \left( \frac{185761}{19200} - \frac{21837}{8192}\pi ^2\right) p^2 + \left( \frac{3401779}{57600} - \frac{28691}{24576}\pi ^2\right) p_r^2 \right] \nu \\&\quad + \left[ \left( \frac{672811}{19200} - \frac{158177}{49152}\pi ^2\right) p^2 + \left( -\frac{21827}{3840} + \frac{110099}{49152}\pi ^2\right) p_r^2 \right] \nu ^2 \Bigg \}\frac{1}{r^4} \\&\quad + \Bigg \{ -\frac{1}{16} + \left( {-\frac{169199}{2400} + \frac{6237}{1024}\pi ^2}\right) \, \nu + \left( -\frac{1256}{45} + \frac{7403}{3072}\pi ^2\right) \,\nu ^2 \Bigg \}\frac{1}{r^5}. \end{aligned}$$
(6.12)
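A simple consistency check on the coefficients is available for the terms that do not depend on \(r\): in the limit of vanishing interaction they must reproduce the PN expansion of the kinetic energy of two free relativistic particles, \(\big (\sum _a \sqrt{m_a^2c^4+\textbf{p}_a^2c^2} - Mc^2\big )/\mu \), evaluated in the centre-of-mass frame. The Python sketch below compares the two; the table of coefficients is copied from the leading terms of Eqs. (6.7)–(6.12), while the function names and the sample masses and momentum are assumptions.

```python
import math
from fractions import Fraction as F

# Coefficients of nu^k multiplying p^(2n+2) / c^(2n) in the r-independent
# parts of Eqs. (6.7), (6.8), (6.9), (6.10), and (6.12), for n = 0, ..., 4.
KINETIC = [
    [F(1, 2)],
    [F(-1, 8), F(3, 8)],
    [F(1, 16), F(-5, 16), F(5, 16)],
    [F(-5, 128), F(35, 128), F(-70, 128), F(35, 128)],
    [F(7, 256), F(-63, 256), F(189, 256), F(-105, 128), F(63, 256)],
]

def kinetic_series(p, nu, c=1.0):
    """Sum of the r-independent terms of the reduced Hamiltonian through 4PN."""
    total = 0.0
    for n, coeffs in enumerate(KINETIC):
        poly = sum(float(a) * nu ** k for k, a in enumerate(coeffs))
        total += poly * p ** (2 * n + 2) / c ** (2 * n)
    return total

def kinetic_exact(p, m1, m2, c=1.0):
    """Exact reduced kinetic energy of two free particles with p_1 = -p_2 = mu p."""
    M = m1 + m2
    mu = m1 * m2 / M
    pp = mu * p
    energy = sum(math.sqrt(m * m * c ** 4 + pp * pp * c * c) for m in (m1, m2))
    return (energy - M * c * c) / mu

m1, m2, p = 1.0, 0.4, 0.2           # illustrative (assumed) values, c = 1
nu = m1 * m2 / (m1 + m2) ** 2
print(kinetic_series(p, nu), kinetic_exact(p, m1, m2))
```

The residual difference is of the order of the first neglected (5PN) kinetic term.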

The time-symmetric but nonlocal-in-time Hamiltonian \(\hat{H}_{\text {4PN}}^{\text {nonlocal}}[\textbf{r},\textbf{p}]\) is related with the leading-order tail effects (Damour et al. 2014). It equals

$$\begin{aligned} \hat{H}_{\text {4PN}}^{\text {nonlocal}}[\textbf{r},\textbf{p}] = -\frac{1}{5}\frac{G^2}{\nu c^8} {\dddot{I}_{ij}}(t) \times {\text {Pf}}_{2r_{12}/c} \int _{-\infty }^{+\infty } \frac{\text {d}\tau }{\vert \tau \vert } {\dddot{I}_{ij}}(t+\tau ), \end{aligned}$$
(6.13)

where \({\text {Pf}}_T\) is a Hadamard partie finie with time scale \(T\equiv 2r_{12}/c\) and where \(\dddot{I}_{\!ij}\) denotes a third time derivative of the Newtonian quadrupole moment \(I_{ij}\) of the binary system,

$$\begin{aligned} I_{ij} \equiv \sum _a m_a \left( x_a^i x_a^j - \frac{1}{3}\delta ^{ij}\textbf{x}_a^2\right) . \end{aligned}$$
(6.14)

The Hadamard partie finie operation is defined as (Damour et al. 2014)

$$\begin{aligned} {\text {Pf}}_T \int _0^{+\infty } \frac{\text {d}v}{v}g(v) \equiv \int _0^T \frac{\text {d}v}{v}[g(v)-g(0)] + \int _T^{+\infty } \frac{\text {d}v}{v}g(v). \end{aligned}$$
(6.15)

Let us also note that in reduced variables the quadrupole moment \(I_{ij}\) and its third time derivative \(\dddot{I}_{\!ij}\) read

$$\begin{aligned} I_{ij} = (GM)^2 \mu \left( r^ir^j-\frac{1}{3}\textbf{r}^2\delta ^{ij}\right) , \quad \dddot{I}_{ij} = -\frac{\nu }{Gr^2} \left( 4 n^{\langle i} p_{j\rangle } - 3(\textbf{n}\cdot \textbf{p})n^{\langle i}n^{j\rangle } \right) , \end{aligned}$$
(6.16)

where \(\langle \cdots \rangle \) denotes a symmetric tracefree projection and where in \(\dddot{I}_{\!ij}\) the time derivatives \(\dot{\textbf{r}}\), \(\ddot{\textbf{r}}\), and \(\dddot{\textbf{r}}\) were eliminated by means of Newtonian equations of motion.

From the reduced conservative Hamiltonians displayed above, where a factor of \(1/\nu \) is factored out [through the definition (6.5) of the reduced Hamiltonian], the standard test-body dynamics is obtained simply by putting \(\nu =0\). The conservative Hamiltonians \(\hat{H}_{\text {N}}\) through \(\hat{H}_{\text {4PN}}\) serve as the basis of the EOB approach, where with the aid of a canonical transformation the two-body dynamics is put into the test-body form of an effective particle moving in a deformed Schwarzschild metric, with \(\nu \) being the deformation parameter (Buonanno and Damour 1999, 2000; Damour et al. 2000a, 2015). These Hamiltonians, both directly and through the EOB approach, constitute an important element in the construction of templates needed to detect gravitational waves emitted by coalescing compact binaries. Let us stress again that the complete 4PN Hamiltonian was obtained only in 2014 (Damour et al. 2014), based on earlier calculations (Blanchet and Damour 1988; Bini and Damour 2013; Jaranowski and Schäfer 2013) and a work published later (Jaranowski and Schäfer 2015).

6.2 Nonlocal-in-time tail Hamiltonian at 4PN order

The nonlocal-in-time tail Hamiltonian at the 4PN level (derived by Damour et al. 2014 and applied by Damour et al. 2015) is the most subtle part of the 4PN Hamiltonian and deserves some discussion. Let us remark that, although the tail Hamiltonian derived by Bernard et al. (2016) was identical to the one given in Damour et al. (2014), the equations of motion and the conserved energy were derived incorrectly there, as detailed by Damour et al. (2016) and later confirmed by Bernard et al. (2017b).

The 4PN-level tail-related contribution to the action reads

$$\begin{aligned} S^{\text{tail}}_{\rm {4PN}} = -\int {H}^{\text{tail}}_\text {4PN}(t)\, \text {d}t, \end{aligned}$$
(6.17)

where the 4PN tail Hamiltonian equals

$$\begin{aligned} {H}^{\text{tail}}_{\rm {4PN}}(t) = -\frac{G^2M}{5c^8} \dddot{I}_{ij}(t)\, {\text{Pf}}_{2r(t)/c}\int _{-\infty }^{\infty }\frac{\text {d}v}{|v|}\dddot{I}_{ij}(t+v). \end{aligned}$$
(6.18)

Because formally

$$\begin{aligned} \dddot{I}_{ij}(t+v) = \exp \left( v\frac{\text {d}}{\text {d}t}\right) \dddot{I}_{ij}(t), \end{aligned}$$
(6.19)

the tail Hamiltonian can also be written as

$$\begin{aligned} {H}^{\text{tail}}_{\rm {4PN}}(t)&= - \frac{G^2M}{5c^8} \dddot{I}_{ij}(t)\, {\text{Pf}}_{2r(t)/c}\int _0^{\infty }\frac{\text {d}v}{v}\left[ \dddot{I}_{ij}(t+v)+\dddot{I}_{ij}(t-v)\right] \\&= - \frac{2G^2M}{5c^8} \dddot{I}_{ij}(t)\, {\text{Pf}}_{2r(t)/c}\int _0^{\infty }\frac{\text {d}v}{v} \cosh \left( v\frac{\text {d}}{\text {d}t}\right) \dddot{I}_{ij}(t). \end{aligned}$$
(6.20)

Another way of writing the tail Hamiltonian is

$$\begin{aligned} H^{\text{tail}}_{\rm {4PN}}(t) = -\frac{2G^2M}{5c^8} \dddot{I}_{ij}(t)\, {\text{Pf}}_{2r(t)/c}\int _0^{\infty }\frac{\text {d}v}{v} \cosh \left( vX(H_0)\right) \dddot{I}_{ij}(t) \end{aligned}$$
(6.21)

with

$$\begin{aligned} X(H_0) \equiv \sum _i \left( \frac{\partial H_0}{\partial p_i(t)}\frac{\partial }{\partial x^i(t)} - \frac{\partial H_0}{\partial x^i(t)}\frac{\partial }{\partial p_i(t)}\right) , \quad H_0 = \frac{(\textbf{p}(t))^2}{2\mu } - \frac{GM\mu }{r(t)}. \end{aligned}$$
(6.22)

This presentation shows that \(H^{\text{tail}}_\text {4PN}\) can be constructed from positions and momenta at time t.

For circular orbits, \(\dddot{I}_{ij}(t)\) is an eigenfunction of \(\cosh \left( v\frac{\text {d}}{\text {d}t}\right) \), namely

$$\begin{aligned} \cosh \left( v\frac{\text {d}}{\text {d}t}\right) \dddot{I}_{ij}(t) = \cos \left( 2v\varOmega (t)\right) \dddot{I}_{ij}(t), \end{aligned}$$
(6.23)

where \(\varOmega \) is the angular frequency along a circular orbit (\(p_r=0\)),

$$\begin{aligned} \varOmega (t) \equiv \dot{\varphi } = \frac{\partial H_0(p_{\varphi },r)}{\partial p_{\varphi }} = \frac{p_{\varphi }(t)}{\mu r^2(t)}, \quad H_0(p_{\varphi },r) = \frac{p_\varphi ^2}{2\mu r^2} - \frac{GM\mu }{r}. \end{aligned}$$
(6.24)

Notice that \(\varOmega (t)\) is represented here as a function of the canonical variables \(p_\varphi (t)\) and r(t), which are still independent because the dynamical equation \(\dot{p}_r=-\partial H_0/\partial r\) has not yet been used (Damour et al. 2014, 2016 applied a more concise representation for circular orbits, with the orbital angular momentum as the only variable). The somewhat complicated structure of Eq. (6.23) can be made plausible by writing \( v\frac{\text {d}}{\text {d}t}=v\,\varOmega (p_\varphi ,r)\frac{\text {d}}{\text {d}\varphi }\), see Eq. (6.24), and parametrizing Eq. (6.16) for circular orbits (\(p_r=0\)) by the orbital angle \(\varphi \): the components of \(\dddot{I}_{ij}\) then depend on \(\varphi \) only through \(\cos 2\varphi \) and \(\sin 2\varphi \), so that \(\text {d}^2/\text {d}\varphi ^2\) acts on them as multiplication by \(-4\) and the \(\cosh \) series sums to a cosine. The 4PN tail Hamiltonian for circular orbits can thus be written as

$$\begin{aligned} H^{\text{tail \, circ}}_{\rm {4PN}}(t)&= -\frac{2G^2M}{5c^8} \left( \dddot{I}_{ij}(t)\right) ^2\, {\text{Pf}}_{2r(t)/c}\int _0^{\infty }\frac{\text {d}v}{v}\cos \left( \frac{2p_{\varphi }(t)}{\mu r^2(t)}v\right) \\&= \frac{2G^2M}{5c^8} \left( \dddot{I}_{ij}(t)\right) ^2\, \left[ \ln \left( \frac{4p_{\varphi }(t)}{\mu c r(t)}\right) + \gamma _{\rm {E}} \right] , \end{aligned}$$
(6.25)

where \(\gamma _\text {E}=0.577\ldots \) denotes Euler’s constant. This representation has been quoted and used by Bernard et al. (2016), see Eq. (5.32) therein, for a straightforward comparison of their tail results with the tail results presented by Damour et al. (2014).
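The step from the first to the second line of Eq. (6.25) rests on the identity \({\text {Pf}}_T\int _0^\infty \frac{\text {d}v}{v}\cos (\omega v) = -[\ln (\omega T)+\gamma _{\text {E}}]\), applied with \(\omega =2\varOmega \) and \(T=2r/c\). The following minimal Python sketch checks this identity numerically, directly from the definition (6.15). The function names `simpson` and `pf_cos_integral`, the truncation of the outer integral, and the sample values of \(\omega \) and T are illustrative assumptions, not part of the original derivation.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler's constant gamma_E


def simpson(f, a, b, n):
    """Composite Simpson rule on [a, b] with n subintervals (n rounded up to even)."""
    n += n % 2
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3


def pf_cos_integral(omega, T, periods=400, pts_per_period=100):
    """Pf_T int_0^inf dv/v cos(omega*v), evaluated from the definition (6.15)."""
    # Inner piece: g(v) - g(0) = cos(omega*v) - 1 vanishes like v^2,
    # so the integrand is regular at v = 0 (its limit there is 0).
    inner = simpson(
        lambda v: (math.cos(omega * v) - 1.0) / v if v > 0.0 else 0.0,
        0.0, T, 2000,
    )
    # Outer piece: truncated at a zero of sin(omega*v), where the neglected
    # tail is of order 1/(omega*v)^2 (assumption: this accuracy suffices here).
    m = math.ceil(omega * T / math.pi) + 2 * periods
    upper = m * math.pi / omega
    outer = simpson(lambda v: math.cos(omega * v) / v, T, upper,
                    periods * pts_per_period)
    return inner + outer


if __name__ == "__main__":
    omega, T = 2.0, 0.75
    print(pf_cos_integral(omega, T), -(math.log(omega * T) + EULER_GAMMA))
```

The two printed numbers should agree to within the quadrature error. Inserting the identity into the first line of Eq. (6.25) flips the overall sign and produces the bracket \(\ln [4p_\varphi /(\mu c r)]+\gamma _{\text {E}}\).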

6.3 Dynamical invariants of two-body conservative dynamics

The observables of two-body systems that can be measured from infinity by, say, gravitational-wave observations, are describable in terms of dynamical invariants, i.e., functions which do not depend on the choice of phase-space coordinates. Dynamical invariants are easily obtained within a Hamiltonian framework of integrable systems.

We start from the reduced conservative Hamiltonian \(\hat{H}(\textbf{r},\textbf{p})\) in the centre-of-mass frame (we are thus considering here a local-in-time Hamiltonian; for the local reduction of a nonlocal-in-time 4PN-level Hamiltonian see Sect. 6.3.2 below) and we employ reduced variables \((\textbf{r},\textbf{p})\). The invariance of \(\hat{H}(\textbf{r},\textbf{p})\) under time translations and spatial rotations leads to the conserved quantities

$$\begin{aligned} E \equiv \hat{H}(\textbf{r},\textbf{p}), \quad \textbf{j} \equiv \frac{\textbf{J}}{\mu GM} = \textbf{r}\times \textbf{p}, \end{aligned}$$
(6.26)

where E is the total energy and \(\textbf{J}\) is the total orbital angular momentum of the binary system in the centre-of-mass frame. We further restrict considerations to the plane of the relative trajectory endowed with polar coordinates \((r,\phi )\) and we use the Hamilton–Jacobi approach to obtain the motion. To do this we separate the variables \(\hat{t}\equiv t/(GM)\) and \(\phi \) in the reduced planar action \(\hat{S}\equiv S/(G\mu M)\), which takes the form

$$\begin{aligned} \hat{S} = -E \hat{t} + j \phi + \int \sqrt{R(r,E,j)}\,\text {d}r. \end{aligned}$$
(6.27)

Here \(j\equiv |\textbf{j}|\) and the effective radial potential \(R(r,E,j)\) is obtained by solving the equation \(E=\hat{H}(\textbf{r},\textbf{p})\) with respect to \(p_r\equiv \textbf{n}\cdot \textbf{p}\), after making use of the relation

$$\begin{aligned} \textbf{p}^2 = (\textbf{n}\cdot \textbf{p})^2 + (\textbf{n}\times \textbf{p})^2 = p_r^2 + \frac{j^2}{r^2}. \end{aligned}$$
(6.28)

The Hamilton–Jacobi theory shows that the observables of the two-body dynamics can be deduced from the (reduced) radial action integral

$$\begin{aligned} i_r(E,j) \equiv \frac{2}{2\pi } \int _{r_{\rm {min}}}^{r_{\rm {max}}} \sqrt{R(r,E,j)}\, \text {d}r, \end{aligned}$$
(6.29)

where the integration is defined from minimal to maximal radial distance. The dimensionless parameter \(k\equiv \varDelta \varPhi /(2\pi )\) (with \(\varDelta \varPhi \equiv \varPhi -2\pi \)) measuring the fractional periastron advance per orbit and the periastron-to-periastron period P are obtained by differentiating the radial action integral:

$$\begin{aligned} k&= -\frac{\partial i_r(E,j)}{\partial j} - 1, \end{aligned}$$
(6.30)
$$\begin{aligned} P&= 2\pi GM\,\frac{\partial i_r(E,j)}{\partial E}. \end{aligned}$$
(6.31)
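As a simple illustration of Eqs. (6.29)–(6.31), consider the Newtonian limit, where \(\hat{H}_{\text {N}}=\textbf{p}^2/2-1/r\) together with Eq. (6.28) gives \(R(r,E,j)=2E+2/r-j^2/r^2\). The radial action is then known in closed form, \(i_r=(-2E)^{-1/2}-j\), so that \(k=0\) and \(P=2\pi GM(-2E)^{-3/2}\). The sketch below evaluates the integral (6.29) numerically and takes the E-derivative of Eq. (6.31) by a finite difference. The function names `radial_action_newton` and `reduced_period`, the angular substitution, and the step sizes are assumptions made for this illustration only.

```python
import math


def radial_action_newton(E, j, n_steps=2000):
    """Reduced radial action (6.29) for R = 2E + 2/r - j^2/r^2 (bound orbit, E < 0)."""
    disc = math.sqrt(1.0 + 2.0 * E * j * j)
    r_min = (1.0 - disc) / (-2.0 * E)   # turning points: roots of R(r) = 0
    r_max = (1.0 + disc) / (-2.0 * E)
    mid, half = 0.5 * (r_max + r_min), 0.5 * (r_max - r_min)

    # Substituting r = mid - half*cos(theta) removes the square-root endpoint
    # behaviour: sqrt(R) dr = sqrt(-2E) * (half*sin(theta))^2 / r dtheta.
    def integrand(theta):
        r = mid - half * math.cos(theta)
        return math.sqrt(-2.0 * E) * (half * math.sin(theta)) ** 2 / r

    h = math.pi / n_steps               # Simpson rule on [0, pi], n_steps even
    total = integrand(0.0) + integrand(math.pi)
    for k in range(1, n_steps):
        total += (4 if k % 2 else 2) * integrand(k * h)
    return (2.0 / (2.0 * math.pi)) * total * h / 3.0


def reduced_period(E, j, h=1e-5):
    """P/(GM) from Eq. (6.31), with the E-derivative taken by a central difference."""
    return 2.0 * math.pi * (
        radial_action_newton(E + h, j) - radial_action_newton(E - h, j)
    ) / (2.0 * h)


if __name__ == "__main__":
    E, j = -0.125, 1.5
    print(radial_action_newton(E, j), 1.0 / math.sqrt(-2.0 * E) - j)
    print(reduced_period(E, j), 2.0 * math.pi * (-2.0 * E) ** -1.5)
```

Each printed pair should coincide up to numerical error. The same two functions can be applied to a post-Newtonian \(R(r,E,j)\) once the turning points are found numerically; that extension is not attempted here.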

It is useful to express the Hamiltonian as a function of the Delaunay (reduced) action variables (see, e.g., Goldstein 1981) defined by

$$\begin{aligned} n \equiv i_r + j = \frac{\mathcal{N}}{\mu GM}, \quad j = \frac{J}{\mu GM}, \quad m \equiv j_z = \frac{J_z}{\mu GM}. \end{aligned}$$
(6.32)

The angle variables conjugate to n, j, and m are, respectively: the mean anomaly, the argument of the periastron, and the longitude of the ascending node. In the quantum language, \(\mathcal{N}/\hbar \) is the principal quantum number, \(J/\hbar \) the total angular-momentum quantum number, and \(J_z/\hbar \) the magnetic quantum number. They are adiabatic invariants of the dynamics and they are, according to the Bohr–Sommerfeld rules of the old quantum theory, (approximately) quantized in integers. Knowing the Delaunay Hamiltonian \(\hat{H}(n,j,m)\) one computes the angular frequencies of the (generic) rosette motion of the binary system by differentiating \(\hat{H}\) with respect to the action variables. Namely,

$$\begin{aligned} \omega _{\text {radial}}&= \frac{2\pi }{P} = \frac{1}{GM} \frac{\partial \hat{H}(n,j,m)}{\partial n}, \end{aligned}$$
(6.33)
$$\begin{aligned} \omega _{\text {periastron}}&= \frac{\varDelta \varPhi }{P} = \frac{2\pi k}{P} = \frac{1}{GM} \frac{\partial \hat{H}(n,j,m)}{\partial j}. \end{aligned}$$
(6.34)

Here, \(\omega _{\text {radial}}\) is the angular frequency of the radial motion, i.e., the angular frequency of the return to the periastron, while \(\omega _{\text {periastron}}\) is the average angular frequency with which the major axis advances in space.

6.3.1 3PN-accurate results

The dynamical invariants of two-body dynamics were computed by Damour and Schäfer (1988) at the 2PN level and then generalized to the 3PN level of accuracy by Damour et al. (2000b). We display here the 3PN-accurate formulae. The periastron advance parameter k reads

$$\begin{aligned} k&= \frac{3}{c^2j^2} \Bigg \{ 1 + \frac{1}{c^2} \left[ \frac{5}{4}(7-2\nu )\frac{1}{j^2} + \frac{1}{2}(5-2\nu )\,E \right] \\&\quad + \frac{1}{c^4} \Bigg [ \frac{5}{2} \Bigg (\frac{77}{2} + \left( \frac{41}{64}\pi ^2-\frac{125}{3}\right) \nu + \frac{7}{4}\nu ^2\Bigg )\frac{1}{j^4} \\&\quad + \Bigg (\frac{105}{2} + \left( \frac{41}{64}\pi ^2-\frac{218}{3}\right) \nu + \frac{45}{6}\nu ^2\Bigg )\frac{E}{j^2} \\&\quad + \frac{1}{4}(5-5\nu +4\nu ^2)\,E^2 \Bigg ] + \mathcal {O}(c^{-6}) \Bigg \}. \end{aligned}$$
(6.35)

The 3PN-accurate formula for the orbital period reads

$$\begin{aligned} P&= \frac{2\pi GM}{(-2E)^{3/2}} \Bigg \{1 - \frac{1}{c^2} \frac{1}{4}(15-\nu ) E \\&\quad + \frac{1}{c^4} \left[ \frac{3}{2}(5-2\nu )\frac{(-2E)^{3/2}}{j} -\frac{3}{32}(35+30\nu +3\nu ^2)\,E^2 \right] \\&\quad + \frac{1}{c^6} \Bigg [ \Bigg (\frac{105}{2} + \left( \frac{41}{64}\pi ^2-\frac{218}{3}\right) \nu + \frac{45}{6}\nu ^2\Bigg )\frac{(-2E)^{3/2}}{j^3} \\&\quad - \frac{3}{4}(5-5\nu +4\nu ^2) \frac{(-2E)^{5/2}}{j} \\&\quad + \frac{5}{128}(21-105\nu +15\nu ^2+5\nu ^3)\,E^3 \Bigg ] + \mathcal {O}(c^{-8}) \Bigg \}. \end{aligned}$$
(6.36)

These expressions have direct applications to binary pulsars (Damour and Schäfer 1988). Explicit analytic orbit solutions of the conservative dynamics through 3PN order are given by Memmesheimer et al. (2005). The 4PN periastron advance was first derived by Damour et al. (2015, 2016), with confirmation provided in a later rederivation (Bernard et al. 2017b); also see Blanchet and Le Tiec (2017).

All conservative two-body Hamiltonians respect rotational symmetry, therefore the Delaunay variable m does not enter these Hamiltonians. The 3PN-accurate Delaunay Hamiltonian reads (Damour et al. 2000b)

$$\begin{aligned} \widehat{H}(n,j,m)&= -\frac{1}{2n^2} \bigg \{ 1 + \frac{1}{c^2} \bigg ( \frac{6}{j n}-\frac{1}{4}(15-\nu )\frac{1}{n^2} \bigg ) \\&\quad + \frac{1}{c^4} \bigg ( \frac{5}{2}(7-2\nu )\frac{1}{j^3 n} + \frac{27}{j^2 n^2} - \frac{3}{2}(35-4\nu )\frac{1}{j n^3} + \frac{1}{8}(145-15\nu +\nu ^2)\frac{1}{n^4} \bigg ) \\&\quad + \frac{1}{c^6} \bigg [ \bigg (\frac{231}{2}+\Big (\frac{123}{64}\pi ^2-125\Big )\nu +\frac{21}{4}\nu ^2\bigg )\frac{1}{j^5 n} + \frac{45}{2}(7-2\nu )\frac{1}{j^4 n^2} \\&\quad +\bigg (-\frac{303}{4}+\Big (\frac{1427}{12}-\frac{41}{64}\pi ^2\Big )\nu -10\nu ^2\bigg )\frac{1}{j^3 n^3} - \frac{45}{2}(20-3\nu )\frac{1}{j^2 n^4} \\&\quad + \frac{3}{2}(275-50\nu +4\nu ^2)\frac{1}{j n^5} - \frac{1}{64}(6363-805\nu +90\nu ^2-5\nu ^3)\frac{1}{n^6} \bigg ] \\&\quad + \mathcal {O}(c^{-8}) \bigg \}. \end{aligned}$$
(6.37)
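Equations (6.33) and (6.34) imply \(k=(\partial \hat{H}/\partial j)/(\partial \hat{H}/\partial n)\), so the Delaunay Hamiltonian (6.37) must reproduce the periastron advance (6.35) once E is identified with \(\hat{H}(n,j)\). The following Python sketch checks this consistency numerically; the two truncated expressions should differ only at relative order \(c^{-6}\), i.e., beyond the 3PN accuracy of Eq. (6.35). The names `delaunay_terms`, `delaunay_hamiltonian`, and `k_formula`, the parameter `eps` standing for \(1/c^2\), and the sample values of n, j, and \(\nu \) are assumptions introduced for this illustration.

```python
import math


def delaunay_terms(nu):
    """Terms inside the braces of Eq. (6.37) as tuples
    (PN order, coefficient, power of 1/j, power of 1/n)."""
    pi2 = math.pi ** 2
    return [
        (0, 1.0, 0, 0),
        (1, 6.0, 1, 1),
        (1, -(15 - nu) / 4, 0, 2),
        (2, 2.5 * (7 - 2 * nu), 3, 1),
        (2, 27.0, 2, 2),
        (2, -1.5 * (35 - 4 * nu), 1, 3),
        (2, (145 - 15 * nu + nu ** 2) / 8, 0, 4),
        (3, 231 / 2 + (123 / 64 * pi2 - 125) * nu + 21 / 4 * nu ** 2, 5, 1),
        (3, 22.5 * (7 - 2 * nu), 4, 2),
        (3, -303 / 4 + (1427 / 12 - 41 / 64 * pi2) * nu - 10 * nu ** 2, 3, 3),
        (3, -22.5 * (20 - 3 * nu), 2, 4),
        (3, 1.5 * (275 - 50 * nu + 4 * nu ** 2), 1, 5),
        (3, -(6363 - 805 * nu + 90 * nu ** 2 - 5 * nu ** 3) / 64, 0, 6),
    ]


def delaunay_hamiltonian(n, j, eps, nu):
    """Return (H, dH/dn, dH/dj) for Eq. (6.37), with eps = 1/c^2."""
    H = dH_dn = dH_dj = 0.0
    for order, coeff, a, b in delaunay_terms(nu):
        term = eps ** order * coeff * j ** (-a) * n ** (-b - 2)
        H += -0.5 * term
        dH_dn += 0.5 * (b + 2) * term / n   # analytic derivative w.r.t. n
        dH_dj += 0.5 * a * term / j         # analytic derivative w.r.t. j
    return H, dH_dn, dH_dj


def k_formula(E, j, eps, nu):
    """Periastron advance parameter k of Eq. (6.35), with eps = 1/c^2."""
    pi2 = math.pi ** 2
    pn1 = 1.25 * (7 - 2 * nu) / j ** 2 + 0.5 * (5 - 2 * nu) * E
    pn2 = (2.5 * (77 / 2 + (41 / 64 * pi2 - 125 / 3) * nu + 7 / 4 * nu ** 2) / j ** 4
           + (105 / 2 + (41 / 64 * pi2 - 218 / 3) * nu + 45 / 6 * nu ** 2) * E / j ** 2
           + 0.25 * (5 - 5 * nu + 4 * nu ** 2) * E ** 2)
    return 3 * eps / j ** 2 * (1 + eps * pn1 + eps ** 2 * pn2)


if __name__ == "__main__":
    eps, nu, n, j = 1e-4, 0.25, 2.0, 1.5
    E, dH_dn, dH_dj = delaunay_hamiltonian(n, j, eps, nu)
    print(dH_dj / dH_dn, k_formula(E, j, eps, nu))
```

The derivatives are taken analytically term by term, which avoids finite-difference noise in a quantity that is itself of order \(1/c^2\).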

Additional insight into the 3PN dynamics can be found in a paper by Le Tiec (2015), where the first law of mechanics for binary systems of point masses (Le Tiec et al. 2012) was generalized to generic eccentric orbits.

6.3.2 Results at 4PN order

The reduced 4PN Hamiltonian \(\hat{H}_{\text {4PN}}[\textbf{r},\textbf{p}]\) can be decomposed in two parts in a way slightly different from the splitting shown in Eq. (6.11). Namely,

$$\begin{aligned} \hat{H}_{\text {4PN}}[\textbf{r},\textbf{p}] = \hat{H}^{\text {I}}_{\text {4PN}}(\textbf{r},\textbf{p};s) + \hat{H}^{\text {II}}_{\text {4PN}}[\textbf{r},\textbf{p};s], \end{aligned}$$
(6.38)

where the first part is local in time while the second part is nonlocal in time; \(s\equiv {s_{\text{phys}}}/(GM)\) is a reduced scale with dimension of 1/velocity\(^2\), where \(s_{\text{phys}}\) is a scale with dimension of a length. The Hamiltonian \(\hat{H}^{\text {I}}_{\text {4PN}}\) is a function of phase-space variables \((\textbf{r},\textbf{p})\) of the form

$$\begin{aligned} \hat{H}^{\text {I}}_{\text {4PN}}(\textbf{r},\textbf{p};s) = \hat{H}^{\text {loc}}_{\text {4PN}}(\textbf{r},\textbf{p}) + F(\textbf{r},\textbf{p})\ln \frac{r}{s}, \quad F(\textbf{r},\textbf{p}) \equiv \frac{2}{5}\frac{G^2}{\nu }(\dddot{I}_{ij})^2, \end{aligned}$$
(6.39)

where the Hamiltonian \(\hat{H}^{\text {loc}}_{\text {4PN}}\) is given in Eq. (6.12) above. The Hamiltonian \(\hat{H}^{\text {II}}_{\text {4PN}}\) is a functional of phase-space trajectories \((\textbf{r}(t),\textbf{p}(t))\),

$$\begin{aligned} \hat{H}^{\text {II}}_{\text {4PN}}[\textbf{r},\textbf{p};s] = -\frac{1}{5}\frac{G^2}{\nu } \dddot{I}_{ij}(t) \times {\text {Pf}}_{2s_{\text{phys}}/c} \int _{-\infty }^{+\infty } \frac{\text {d}\tau }{\vert \tau \vert } \dddot{I}_{ij}(t+\tau ). \end{aligned}$$
(6.40)

The nonlocal Hamiltonian \(\hat{H}^{\text {II}}_{\text {4PN}}[\textbf{r},\textbf{p};s]\) differs from what is displayed in Eq. (6.13) as the nonlocal part of the 4PN Hamiltonian. There the nonlocal piece of \(\hat{H}_{\text {4PN}}\) is defined by taking as regularization scale in the partie finie operation entering Eq. (6.13) the time \(2r_{12}/c\) instead of \(2s_{\text{phys}}/c\) appearing in (6.40). Thus the arbitrary scale \(s_{\text{phys}}\) enters both parts \(\hat{H}^{\text {I}}_{\text {4PN}}\) and \(\hat{H}^{\text {II}}_{\text {4PN}}\) of \(\hat{H}_{\text {4PN}}\), though it cancels out in the total Hamiltonian. Damour et al. (2015) have shown that, modulo some nonlocal-in-time shift of the phase-space coordinates, one can reduce a nonlocal dynamics defined by the Hamiltonian \(\hat{H}[\textbf{r},\textbf{p};s]\equiv \hat{H}_{\text {N}}(\textbf{r},\textbf{p})+\hat{H}^{\text {II}}_{\text {4PN}}[\textbf{r},\textbf{p};s]\) to an ordinary (i.e., local-in-time) one. We will sketch here this reduction procedure, which employs the Delaunay form of the Newtonian equations of motion. In the circular-motion case things are much simpler and one can directly perform the integral in the nonlocal Hamiltonian, see Eq. (6.25).

It is enough to consider the planar case. In that case the action-angle variables are \((\mathcal {L},\ell ;\mathcal {G},g)\), using the standard notation of Brouwer and Clemence (1961) (with \(\mathcal {L}\equiv n\) and \(\mathcal {G}\equiv j\)). The variable \(\mathcal {L}\) is conjugate to the “mean anomaly” \(\ell \), while \(\mathcal {G}\) is conjugate to the argument of the periastron \(g=\omega \). The variables \(\mathcal {L}\) and \(\mathcal {G}\) are related to the usual Keplerian variables a (semimajor axis) and e (eccentricity) via

$$\begin{aligned} \mathcal {L}\equiv \sqrt{a}, \quad \mathcal {G}\equiv \sqrt{a(1-e^2)}. \end{aligned}$$
(6.41)

By inverting (6.41) one can express a and e as functions of \(\mathcal {L}\) and \(\mathcal {G}\):

$$\begin{aligned} a = \mathcal {L}^2, \quad e = \sqrt{1 - \left( \frac{\mathcal {G}}{\mathcal {L}}\right) ^2}. \end{aligned}$$
(6.42)

We use here rescaled variables: in particular, a denotes the rescaled semimajor axis \(a\equiv a_{\text{phys}}/(GM)\). We also use the rescaled time variable \(\hat{t}\equiv t_{\text{phys}}/(GM)\) appropriate for the rescaled Newtonian Hamiltonian

$$\begin{aligned} \hat{H}_{\text {N}}(\mathcal {L}) = \frac{1}{2} \, \textbf{p}^2 - \frac{1}{r} = -\frac{1}{2 \mathcal {L}^2}. \end{aligned}$$
(6.43)

The explicit expressions of the Cartesian coordinates \((x,y)\) of a Newtonian motion in terms of action-angle variables are given by

$$\begin{aligned} x (\mathcal {L}, \ell ; \mathcal {G}, g)&= \cos g \, x_0 - \sin g \, y_0, \quad y (\mathcal {L}, \ell ; \mathcal {G}, g) = \sin g \, x_0 + \cos g \, y_0, \end{aligned}$$
(6.44)
$$\begin{aligned} x_0&= a (\cos u - e), \quad y_0 = a \sqrt{1-e^2} \sin u, \end{aligned}$$
(6.45)

where the “eccentric anomaly” u is the function of \(\ell \) and e defined by solving Kepler’s equation

$$\begin{aligned} u - e \sin u = \ell . \end{aligned}$$
(6.46)

The solution of Kepler’s equation can be written in terms of Bessel functions:

$$\begin{aligned} u = \ell + \sum _{n=1}^\infty \frac{2}{n} J_n(ne) \sin (n \, \ell ). \end{aligned}$$
(6.47)
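The Bessel series (6.47) can be compared with a direct numerical solution of Kepler's equation (6.46). The short Python sketch below does this, computing \(J_n\) from Bessel's integral representation so that only the standard library is needed. The function names, the truncation order `n_max`, and the number of quadrature samples are illustrative assumptions; the series converges quickly only for moderate eccentricities.

```python
import math


def bessel_j(n, x, samples=400):
    """J_n(x) for integer n from Bessel's integral
    (1/pi) * int_0^pi cos(n*tau - x*sin(tau)) dtau, via the trapezoidal rule."""
    h = math.pi / samples
    # Half-weighted endpoint values at tau = 0 and tau = pi.
    total = 0.5 * (1.0 + math.cos(n * math.pi))
    for k in range(1, samples):
        tau = k * h
        total += math.cos(n * tau - x * math.sin(tau))
    return total * h / math.pi


def eccentric_anomaly_series(ell, e, n_max=40):
    """Eccentric anomaly u from the truncated Bessel series (6.47)."""
    return ell + sum(
        2.0 / n * bessel_j(n, n * e) * math.sin(n * ell)
        for n in range(1, n_max + 1)
    )


def eccentric_anomaly_newton(ell, e, tol=1e-14):
    """Eccentric anomaly u from Newton iteration on Kepler's equation (6.46)."""
    u = ell
    for _ in range(50):
        du = (u - e * math.sin(u) - ell) / (1.0 - e * math.cos(u))
        u -= du
        if abs(du) < tol:
            break
    return u


if __name__ == "__main__":
    ell, e = 1.0, 0.3
    print(eccentric_anomaly_series(ell, e), eccentric_anomaly_newton(ell, e))
```

With u in hand, the orbit follows from Eqs. (6.44) and (6.45), e.g., \(x_0=a(\cos u-e)\) and \(y_0=a\sqrt{1-e^2}\sin u\).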

Note also the following Bessel-Fourier expansions of \(\cos u\) and \(\sin u\) [which directly enter \((x_0, y_0)\) and thereby \((x,y)\)]

$$\begin{aligned} \cos u&= -\frac{e}{2} + \sum _{n=1}^{\infty } \frac{1}{n} [J_{n-1} (ne) - J_{n+1} (ne)] \cos n \, \ell , \end{aligned}$$
(6.48)
$$\begin{aligned} \sin u&= \sum _{n=1}^{\infty } \frac{1}{n} [J_{n-1} (ne) + J_{n+1} (ne)] \sin n \, \ell . \end{aligned}$$
(6.49)

For completeness, we also recall the expressions involving the “true anomaly” f (polar angle from the periastron) and the radius vector r:

$$\begin{aligned} r&= a (1-e \cos u) = \frac{a (1-e^2)}{1+e \cos f}, \end{aligned}$$
(6.50)
$$\begin{aligned} \frac{x_0}{r}&= \cos f = \frac{\cos u -e}{1-e \cos u}, \quad \frac{y_0}{r} = \sin f = \frac{\sqrt{1-e^2} \sin u}{1-e \cos u}. \end{aligned}$$
(6.51)

The above expressions allow one to evaluate the expansions of x, y, and therefrom the components of the quadrupole tensor \(I_{ij}\), as power series in e and Fourier series in \(\ell \).

Let us then consider the expression

$$\begin{aligned} \mathcal {F}(t,\tau ) \equiv \dddot{I}_{ij}(t)\dddot{I}_{ij}(t+\tau ), \end{aligned}$$
(6.52)

which enters the nonlocal-in-time piece (6.40) of the Hamiltonian. In order to evaluate the order-reduced value of \(\mathcal {F}(t,\tau )\) one needs to use the equations of motion, both for computing the third time derivatives of \(I_{ij}\), and for expressing the phase-space variables at time \(t+\tau \) in terms of the phase-space variables at time t. One employs the zeroth-order equations of motion following from the Newtonian Hamiltonian (6.43),

$$\begin{aligned} \frac{\text{d}\ell }{\text{d} {\hat{t}}}&= \frac{\partial \hat{H}_{\text {N}}}{\partial \mathcal {L}} = \frac{1}{\mathcal {L}^3} \equiv \varOmega (\mathcal {L}), \quad \frac{\text{d} g}{\text{d} {\hat{t}}} = \frac{\partial \hat{H}_{\text {N}}}{\partial \mathcal {G}} = 0, \end{aligned}$$
(6.53)
$$\begin{aligned} \frac{\text{d}\mathcal {L}}{\text{d} {\hat{t}}}&= - \frac{\partial \hat{H}_{\text {N}}}{\partial \ell } = 0, \quad \frac{\text{d} \mathcal {G}}{\text{d}{\hat{t}}} = - \frac{\partial \hat{H}_{\text {N}}}{\partial g} = 0, \end{aligned}$$
(6.54)

where \(\varOmega (\mathcal {L}) \equiv \mathcal {L}^{-3}\) is the (\(\hat{t}\)-time) rescaled Newtonian (anomalistic) orbital frequency \(\varOmega = G M \varOmega _{\text{phys}}\) (it satisfies the rescaled form of Kepler’s third law: \(\varOmega = a^{-3/2}\)). The fact that g, \(\mathcal {L}\), and \(\mathcal {G}\) are constant and that \(\ell \) varies linearly with time makes it easy to compute \({\dddot{I}_{ij}} (t+\tau )\) in terms of the values of \((\ell ,g,\mathcal {L},\mathcal {G})\) at time t. It suffices to use (denoting by a prime the values at time \(t' \equiv t+\tau \))

$$\begin{aligned} \ell ' \equiv \ell (t+\tau ) = \ell (t) + \varOmega (\mathcal {L}) {\hat{\tau }}, \end{aligned}$$
(6.55)

where \({\hat{\tau }} \equiv \tau /(GM)\), together with \(g' = g\), \(\mathcal {L}' = \mathcal {L}\), and \(\mathcal {G}' = \mathcal {G}\). The order-reduced value of \(\mathcal {F}(t,\tau )\) is given by (using \(\text {d}/\text {d}{\hat{t}} = \varOmega \,\text {d}/\text {d}\ell \))

$$\begin{aligned} \mathcal {F}(\ell ,{\hat{\tau }}) = \bigg (\frac{\varOmega (\mathcal {L})}{GM}\bigg )^6 \, \frac{\text {d}^3 I_{ij}}{\text {d}\ell ^3}(\ell ) \frac{\text {d}^3 I_{ij}}{\text {d}\ell ^3}(\ell +\varOmega (\mathcal {L}) {\hat{\tau }}). \end{aligned}$$
(6.56)

Inserting the expansion of \(I_{ij} (\ell )\) in powers of e and in trigonometric functions of \(\ell \) and g, yields \(\mathcal {F}\) in the form of a series of monomials of the type

$$\begin{aligned} \mathcal {F}(\ell ,{\hat{\tau }}) = \sum _{n_1 , n_2 , \pm n_3} C_{n_1 n_2 n_3}^{\pm } \, e^{n_1} \cos (n_2 \, \ell \pm n_3 \, \varOmega \, {\hat{\tau }}), \end{aligned}$$
(6.57)

where \(n_1\), \(n_2\), \(n_3\) are natural numbers. (Because of rotational invariance, and of the result \(g' = g\), there is no dependence of \(\mathcal {F}\) on g.)

All the terms in the expansion (6.57) containing a nonzero value of \(n_2\) will, after integrating over \({\hat{\tau }}\) with the measure \(\text {d}{\hat{\tau }}/\vert {\hat{\tau }}\vert \) as indicated in Eq. (6.40), generate a corresponding contribution to \(\hat{H}^{\text {II}}_{\text {4PN}}\) which varies with \(\ell \) proportionally to \(\cos (n_2\,\ell )\). One now employs the standard Delaunay technique: any term of the type \(A(\mathcal {L})\cos (n\ell )\) in a first-order perturbation \(\varepsilon H_1(\mathcal {L},\ell )\equiv \hat{H}^{\text {II}}_{\text {4PN}}(\mathcal {L},\ell )\) of the leading-order Hamiltonian \(H_0(\mathcal {L})\equiv H_{\text {N}}(\mathcal {L})\) can be eliminated by a canonical transformation with generating function of the type \(\varepsilon \mathfrak {g}(\mathcal {L},\ell )\equiv \varepsilon B(\mathcal {L}) \sin (n\ell )\). Indeed,

$$\begin{aligned} \delta _\mathfrak {g} H_1 = \{ H_0 (\mathcal {L}) , \mathfrak {g} \} = - \frac{\partial H_0 (\mathcal {L})}{\partial \mathcal {L}} \, \frac{\partial \mathfrak {g}}{\partial \ell } = - n \, \varOmega (\mathcal {L}) \, B(\mathcal {L}) \cos (n\ell ), \end{aligned}$$
(6.58)

so that the choice \(B = A/(n \, \varOmega )\) eliminates the term \(A \cos (n\ell )\) in \(H_1\). This shows that all the periodically varying terms (with \(n_2 \ne 0\)) in the expansion (6.57) of \(\mathcal {F}\) can be eliminated by a canonical transformation. Consequently one can simplify the nonlocal part \(\hat{H}^{\text {II}}_{\text {4PN}}\) of the 4PN Hamiltonian by replacing it by its \(\ell \)-averaged value,

$$\begin{aligned} \hat{\bar{H}}^{\text {II}}_{\text {4PN}}(\mathcal {L},\mathcal {G};s) \equiv \frac{1}{2\pi }\int _0^{2\pi }\text {d}\ell \,\hat{H}^{\text {II}}_{\text {4PN}}[\textbf{r},\textbf{p};s] = -\frac{1}{5} \, \frac{G^2}{\nu c^8} \, {\text{Pf}}_{2s/c} \int _{-\infty }^{+\infty } \frac{\text{d} {\hat{\tau }}}{\vert {\hat{\tau }} \vert } \, {\bar{\mathcal {F}}} , \end{aligned}$$
(6.59)

where \({\bar{\mathcal {F}}}\) denotes the \(\ell \)-average of \(\mathcal {F}(\ell , {\hat{\tau }})\) [which is simply obtained by dropping all the terms with \(n_2 \ne 0\) in the expansion (6.57)]. This procedure yields an averaged Hamiltonian \(\hat{\bar{H}}^{\text {II}}_{\text {4PN}}\) which depends only on \(\mathcal {L}\), \(\mathcal {G}\) (and s) and which is given as an expansion in powers of e (because of the averaging this expansion contains only even powers of e). Damour et al. (2015) derived the \(\ell \)-averaged Hamiltonian as a power series of the form

$$\begin{aligned} \hat{\bar{H}}^{\text {II}}_{\text {4PN}}(\mathcal {L},\mathcal {G};s) = \frac{4}{5}\frac{\nu }{c^8\mathcal {L}^{10}} \sum _{p=1}^\infty p^6 |\hat{I}^p_{ij}(e)|^2 \ln \left( 2p\frac{\text {e}^{\gamma _{\text{E}}}s}{c\mathcal {L}^3}\right) , \end{aligned}$$
(6.60)

where \(\hat{I}^p_{ij}(e)\) are coefficients in the Bessel-Fourier expansion of the dimensionless reduced quadrupole moment \(\hat{I}_{ij}\equiv I_{ij}/[(GM)^2\mu a^2]\),

$$\begin{aligned} \hat{I}_{ij}(\ell ,e) = \sum _{p=-\infty }^{+\infty } \hat{I}_{ij}^p(e) \text {e}^{\text {i}p \ell }. \end{aligned}$$
(6.61)

Equation (6.60) is the basic expression for the transition of the tail-related part of the 4PN dynamics to the EOB approach (Damour et al. 2015).

For another approach to the occurrence and treatment of the \((\ell ,\ell ')\)-structure in nonlocal-in-time Hamiltonians the reader is referred to Damour et al. (2016) (therein, \(\ell \) is called \(\lambda \)). A generalized quasi-Keplerian parametrization for eccentric orbits at 4PN order was studied by Cho et al. (2022) (ignoring certain oscillatory terms arising from 4PN tail effects).

6.3.3 Results at 5PN order

To compactify the expressions for higher-order PN Hamiltonians it is most convenient to go over to the canonically equivalent Hamiltonians of the EOB formalism (Buonanno and Damour 1999, 2000) (let us recall that the EOB approach is outside the scope of this review). Within this formalism the nPN-accurate Hamiltonian \(H_{\le n\text {PN}}(x,p)\) of the two-body system, in the centre-of-mass frame, is replaced by the real (i.e., giving the evolution equations with respect to the real ADM time coordinate \(t_{\text {ADM}}\) and the real two-body energy) and improved (i.e., representing a nonperturbative resummed estimate of the PN Hamiltonian) Hamiltonian \(H^{\text{improved}}_{\rm {real}}(x'(x,p),p'(x,p))\) (Buonanno and Damour 2000). The Hamiltonian \(H^{\text{improved}}_{\rm {real}}\) is related to the effective EOB Hamiltonian \(H^{\text{EOB}}_{\text{eff}}\) through the equation (Damour et al. 2000a)

$$\begin{aligned} \frac{H^{\text{EOB}}_{\text{eff}}}{\mu c^2} = \frac{(H^{\text{improved}}_{\rm {real}})^2 -m_1^2c^4 - m_2^2c^4}{2m_1m_2c^4}, \end{aligned}$$
(6.62)

resulting in the useful representation of \(H^{\text{improved}}_{\rm {real}}\) in terms of \(H^{\text{EOB}}_{\text{eff}}\),

$$\begin{aligned} H^{\text{improved}}_{\rm {real}} = Mc^2\sqrt{1 + 2\nu \left( \hat{H}^{\text{EOB}}_{\text{eff}}- 1\right) }, \end{aligned}$$
(6.63)

where \(\hat{H}^{\text{EOB}}_{\text{eff}}:=H^{\text{EOB}}_{\text{eff}}/(\mu c^2)\) denotes the reduced effective EOB Hamiltonian. In turn, the EOB effective Hamiltonian is defined as \(H^{\text{EOB}}_{\text{eff}}:=-c\,p'_0\), where \(p'_0\) is the solution of a general mass-shell condition of the form

$$\begin{aligned} g^{\mu \nu }_{\text{eff}}(x')p'_{\mu }p'_{\nu } + Q(x',p'_r) = -\mu ^2c^2, \end{aligned}$$
(6.64)

where the scalar Q denotes contributions which are at least quartic in momenta; one can reduce the dependence of Q on momenta to a dependence on the sole radial momentum \(p'_r\). The spherically symmetric effective metric \(g_{\mu \nu }^{\text{eff}}\) is a \(\nu \)-dependent deformation of the Schwarzschild metric,

$$\begin{aligned} g_{\mu \nu }^{\text{eff}}\text {d}x'^\mu \text {d}x'^\nu&= -A(r';\nu ) c^2 \text {d}t'^2 + \big (A(r';\nu )\bar{D}(r';\nu )\big )^{-1}\text {d}r'^2 \\&\quad + r'^2 (\text {d}\theta '^2 + \sin ^2\theta '\,\text {d}\phi '^2). \end{aligned}$$
(6.65)

Solving Eq. (6.64) [with the metric (6.65)] with respect to \(p'_0\) gives the reduced effective EOB Hamiltonian of the form

$$\begin{aligned} \hat{H}^{\text{EOB}}_{\text{eff}}(x',p';\nu ) = \sqrt{A(u;\nu )\Big (1 + \hat{p}'^2 + \Big (A(u;\nu )\bar{D}(u;\nu )-1\Big )\hat{p}'^2_r + \hat{Q}(u,\hat{p}'_r;\nu )\Big )}, \end{aligned}$$
(6.66)

where \(\hat{Q}=Q/(\mu c^2)\), \(u:=GM/(r'c^2)\), \(\hat{p}'_r:=p'_r/(\mu c)\), \(\hat{p}':=p'/(\mu c)\) with \(p':=\sqrt{p'^2_r+p'^2_\theta /r'^2+p'^2_\phi /(r'^2\sin ^2\theta ')}\).

The 5PN-accurate PN expansions of the potentials A, \(\bar{D}\), and \(\hat{Q}\) read [let us note that \(u=\mathcal {O}(c^{-2})\) and \(p'_r=\mathcal {O}(c^{-1})\)]

$$\begin{aligned} A(u;\nu )&= 1 + \sum _{k=1}^4 a_k(\nu ) u^k + \sum _{k=5}^6 \big ( a_k^{\text{c}}(\nu )+a_k^{\text{ln}}(\nu )\ln u \big ) u^k, \end{aligned}$$
(6.67a)
$$\begin{aligned} \bar{D}(u;\nu )&= 1 + \sum _{k=2}^3 \bar{d}_k(\nu ) u^k + \sum _{k=4}^5 \big ( \bar{d}_k^{\text{c}}(\nu )+{\bar{d}}_k^{\text{ln}}(\nu )\ln u \big ) u^k, \end{aligned}$$
(6.67b)
$$\begin{aligned} Q(u,p'_r;\nu )&= \Big (q_{42}(\nu )u^2 + q_{43}(\nu )u^3 + \big (q_{44}^{\text {c}}(\nu )+q_{44}^{\text {ln}}(\nu ) \ln u\big )u^4\Big ) p'^4_r \\&\quad + \Big (q_{62}(\nu )u^2 + \big (q_{63}^{\text {c}}(\nu )+q_{63}^{\text {ln}}(\nu ) \ln u\big )u^3\Big ) p'^6_r \\&\quad + \Big (q_{81}(\nu )u + q_{82}(\nu )u^2 \Big ) p'^8_r. \end{aligned}$$
(6.67c)

Up to the 3PN level, the coefficients read as follows (Buonanno and Damour 1999; Damour et al. 2000a):

$$\begin{aligned}&\text {At 0PN:}\quad a_1(\nu ) = -2, \end{aligned}$$
(6.68a)
$$\begin{aligned}&\text {at 1PN:}\quad a_2(\nu ) = 0,\end{aligned}$$
(6.68b)
$$\begin{aligned}&\text {at 2PN:}\quad a_3(\nu ) = 2\nu , \quad \bar{d}_2(\nu ) = 6\nu ,\end{aligned}$$
(6.68c)
$$\begin{aligned}&\text {at 3PN:}\quad a_4(\nu ) = \left( \frac{94}{3}-\frac{41}{32}\pi ^2\right) \nu , \quad \bar{d}_3(\nu ) = 52\nu - 6\nu ^2, \\&q_{42}(\nu ) = 8\nu - 6\nu ^2. \end{aligned}$$
(6.68d)
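To make the structure of Eqs. (6.63) and (6.66) concrete, the following Python sketch evaluates the reduced effective Hamiltonian with the potentials truncated at the 3PN level, using only the coefficients (6.68). It is restricted to equatorial motion and uses the dimensionless variables \(u=GM/(r'c^2)\), \(\hat{p}'_r=p'_r/(\mu c)\), and \(\hat{\jmath }=c\,p'_\phi /(GM\mu )\), so that \(\hat{p}'^2=\hat{p}'^2_r+\hat{\jmath }^2u^2\). Taking \(\hat{Q}=q_{42}(\nu )\,u^2\hat{p}'^4_r\) in these dimensionless variables is an assumption about normalization made here, and all function names are hypothetical.

```python
import math


def eob_potentials_3pn(u, nu):
    """3PN-truncated A, D-bar and the coefficient q42, from Eqs. (6.67) and (6.68)."""
    a4 = (94 / 3 - 41 / 32 * math.pi ** 2) * nu
    A = 1 - 2 * u + 2 * nu * u ** 3 + a4 * u ** 4
    D_bar = 1 + 6 * nu * u ** 2 + (52 * nu - 6 * nu ** 2) * u ** 3
    q42 = 8 * nu - 6 * nu ** 2
    return A, D_bar, q42


def h_eff_hat(u, p_r, j, nu):
    """Reduced effective EOB Hamiltonian, Eq. (6.66), for equatorial motion.
    p_r and j are the dimensionless radial and angular momenta defined above."""
    A, D_bar, q42 = eob_potentials_3pn(u, nu)
    p2 = p_r ** 2 + (j * u) ** 2
    Q_hat = q42 * u ** 2 * p_r ** 4     # assumed dimensionless form of Q
    return math.sqrt(A * (1 + p2 + (A * D_bar - 1) * p_r ** 2 + Q_hat))


def h_real_over_Mc2(u, p_r, j, nu):
    """Real two-body Hamiltonian in units of M c^2, Eq. (6.63)."""
    return math.sqrt(1 + 2 * nu * (h_eff_hat(u, p_r, j, nu) - 1))


if __name__ == "__main__":
    # Test-body limit on the Schwarzschild last stable circular orbit:
    # u = 1/6, j^2 = 12, where the specific energy is sqrt(8/9).
    print(h_eff_hat(1 / 6, 0.0, math.sqrt(12.0), 0.0), math.sqrt(8 / 9))
    # Equal-mass example at larger separation.
    print(h_real_over_Mc2(0.05, 0.0, 4.0, 0.25))
```

For \(\nu =0\) the sketch reduces to geodesic motion in the Schwarzschild metric, in line with the role of \(\nu \) as deformation parameter. The 4PN and 5PN coefficients would enter in the same way through additional terms in `eob_potentials_3pn`.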

At the 4PN level, the coefficients read (Damour et al. 2015; Bini et al. 2020a)

$$\begin{aligned} a_5^{\text{c}}(\nu )&= \left( \frac{2275}{512}\pi ^2 - \frac{4237}{60} + \frac{128}{5}{\gamma _{\text{E}}}+ \frac{256}{5}\ln 2 \right) \nu + \left( \frac{41}{32}\pi ^2 - \frac{221}{6} \right) \nu ^2, \end{aligned}$$
(6.69a)
$$\begin{aligned} a_5^{\text{ln}}(\nu )&= \frac{64}{5}\nu ,\end{aligned}$$
(6.69b)
$$\begin{aligned} \bar{d}_4^{\text{c}}(\nu )&= \left( -\frac{533}{45} - \frac{23761}{1536}\pi ^2 + \frac{1184}{15}{\gamma _{\text{E}}}- \frac{6496}{15}\ln 2 + \frac{2916}{5}\ln 3 \right) \nu \\&\quad + \left( \frac{123}{16}\pi ^2 - 260\right) \nu ^2, \end{aligned}$$
(6.69c)
$$\begin{aligned} \bar{d}_4^{\text{ln}}(\nu )&= \frac{592}{15}\nu ,\end{aligned}$$
(6.69d)
$$\begin{aligned} q_{43}(\nu )&= \left( -\frac{5308}{15} + \frac{496256}{45}\ln 2 - \frac{33048}{5}\ln 3 \right) \nu - 83 \nu ^2 + 10 \nu ^3,\end{aligned}$$
(6.69e)
$$\begin{aligned} q_{62}(\nu )&= \left( -\frac{827}{3} - \frac{2358912}{25}\ln 2 + \frac{1399437}{50}\ln 3 + \frac{390625}{18}\ln 5 \right) \nu \\&\quad - \frac{27}{5}\nu ^2 + 6\nu ^3,\end{aligned}$$
(6.69f)
$$\begin{aligned} q_{81}(\nu )&= \left( -\frac{35772}{175} + \frac{21668992}{45}\ln 2 + \frac{6591861}{350}\ln 3 - \frac{27734375}{126}\ln 5 \right) \nu . \end{aligned}$$
(6.69g)
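As a quick numerical illustration, the coefficients (6.68) and (6.69a)–(6.69b) can be assembled into the 4PN-accurate potential \(A(u;\nu )\). The sketch below assumes the Taylor form \(A = 1 + a_1u + a_2u^2 + a_3u^3 + a_4u^4 + (a_5^{\text{c}} + a_5^{\text{ln}}\ln u)u^5\) of Eq. (6.67a), which is stated outside this passage; the function name `a_potential_4pn` is hypothetical.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant gamma_E


def a_potential_4pn(u, nu):
    """4PN-accurate Taylor form of the EOB potential A(u; nu).

    Assumed structure (Eq. (6.67a), not reproduced in this passage):
    A = 1 + a1 u + a2 u^2 + a3 u^3 + a4 u^4 + (a5c + a5ln ln u) u^5.
    """
    pi2 = math.pi ** 2
    a1 = -2.0                                    # Eq. (6.68a)
    a2 = 0.0                                     # Eq. (6.68b)
    a3 = 2.0 * nu                                # Eq. (6.68c)
    a4 = (94.0 / 3.0 - 41.0 / 32.0 * pi2) * nu   # Eq. (6.68d)
    a5c = ((2275.0 / 512.0 * pi2 - 4237.0 / 60.0
            + 128.0 / 5.0 * EULER_GAMMA
            + 256.0 / 5.0 * math.log(2.0)) * nu
           + (41.0 / 32.0 * pi2 - 221.0 / 6.0) * nu ** 2)  # Eq. (6.69a)
    a5ln = 64.0 / 5.0 * nu                       # Eq. (6.69b)
    return (1.0 + a1 * u + a2 * u ** 2 + a3 * u ** 3 + a4 * u ** 4
            + (a5c + a5ln * math.log(u)) * u ** 5)


# Test-mass limit: all nu-dependent terms vanish and A reduces to 1 - 2u.
schwarzschild_value = a_potential_4pn(0.1, 0.0)
equal_mass_value = a_potential_4pn(0.1, 0.25)
```

For \(\nu =0\) the function returns \(1-2u\), the Schwarzschild value, which is a useful sanity check on the transcription of the coefficients.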

At the 5PN level, a solution with unique numerical prefactors is not available. The TF approach yields all 5PN-order coefficients of the EOB potentials (6.67) except for the numerical prefactors of two terms proportional to \(\nu ^2\) entering the coefficients \(a_6^{\text{c}}(\nu )\) and \(\bar{d}_5^{\text{c}}(\nu )\). Also, Blümlein et al. (2022a, b) disagree with the local contribution, obtained by Bini et al. (2019, 2020a), to a term proportional to \(\nu ^2\) in the coefficient \(q_{44}^{\text{c}}(\nu )\). The coefficients of the 5PN-order EOB potentials read (Bini et al. 2019, 2020a)

$$\begin{aligned} a_6^{\text{c}}(\nu )&= \left( -\frac{1066621}{1575} + \frac{246367}{3072}\pi ^2 - \frac{14008}{105}{\gamma _{\text{E}}}- \frac{31736}{105}\ln 2 + \frac{243}{7}\ln 3 \right) {\nu } \\&\quad + a_{62}\nu ^2 + 4\nu ^3,\end{aligned}$$
(6.70a)
$$\begin{aligned} a_6^{\text{ln}}(\nu )&= -\frac{7004}{105}\nu - \frac{144}{5}\nu ^2,\end{aligned}$$
(6.70b)
$$\begin{aligned} \bar{d}_5^{\text{c}}(\nu )&= \left( \frac{294464}{175} - \frac{63707}{512}\pi ^2 - \frac{2840}{7}{\gamma _{\text{E}}}+ \frac{120648}{35}\ln 2 - \frac{19683}{7}\ln 3\right) {\nu } \\&\quad + \bar{d}_{52}\nu ^2 + \left( \frac{1069}{3} - \frac{205}{16}\pi ^2\right) {\nu ^3},\end{aligned}$$
(6.70c)
$$\begin{aligned} \bar{d}_5^{\text{ln}}(\nu )&= -\frac{1420}{7}\nu - \frac{3392}{15}\nu ^2,\end{aligned}$$
(6.70d)
$$\begin{aligned} q_{44}^{\text{c}}(\nu )&= \bigg ( \frac{1295219}{350} - \frac{93031}{1536}\pi ^2 + \frac{10856}{105}{\gamma _{\text{E}}}- \frac{40979464}{315}\ln 2 + \frac{14203593}{280}\ln 3 \\&\quad + \frac{9765625}{504}\ln 5 \bigg ){\nu } + q_{442}\nu ^2 + \left( 640 - \frac{615}{32}\pi ^2\right) \nu ^3,\end{aligned}$$
(6.70e)
$$\begin{aligned} q_{44}^{\text{ln}}(\nu )&= \frac{5428}{105}\nu - \frac{592}{5}\nu ^2,\end{aligned}$$
(6.70f)
$$\begin{aligned} q_{63}^{\text{c}}(\nu )&= \bigg ( \frac{2613083}{1050} + \frac{6875745536}{4725}\ln 2 - \frac{23132628}{175}\ln 3 - \frac{101687500}{189}\ln 5 \bigg )\nu \\&\quad + \bigg ( \frac{159089}{75} - \frac{4998308864}{1575}\ln 2 - \frac{45409167}{350}\ln 3 + \frac{26171875}{18}\ln 5 \bigg ){\nu ^2} \\&\quad + 116\nu ^3 - 14\nu ^4,\end{aligned}$$
(6.70g)
$$\begin{aligned} q_{63}^{\text{ln}}(\nu )&= 0,\end{aligned}$$
(6.70h)
$$\begin{aligned} q_{82}(\nu )&= \bigg ( \frac{5790381}{2450} - \frac{16175693888}{1575}\ln 2 - \frac{393786545409}{156800}\ln 3 \\&\quad + \frac{875090984375}{169344}\ln 5 + \frac{13841287201}{17280}\ln 7 \bigg )\nu \\&\quad + \bigg ( \frac{870976}{525} + \frac{703189497728}{33075}\ln 2 + \frac{332067403089}{39200}\ln 3 \\&\quad - \frac{468490234375}{42336}\ln 5 - \frac{13\,841\,287\,201}{4\,320}\ln 7 \bigg )\nu ^2 + \frac{24}{7}{\nu ^3} - 6\nu ^4. \end{aligned}$$
(6.70i)

The nonlocal part of the potential \(q_{82}\) was computed in Appendix G of Bini et al. (2020c).

The prefactors \(a_{62}\) and \(\bar{d}_{52}\), which were not computed in Bini et al. (2019, 2020a), enter the local-in-time parts of the EOB potentials,

$$\begin{aligned} a_{62} = a_{62}^{{{\text{nloc}}}} + a_{62}^{\text{loc}}, \quad \bar{d}_{52} = \bar{d}_{52}^{{{\text{nloc}}}} + \bar{d}_{52}^{\text{loc}}, \end{aligned}$$
(6.71)

where the prefactors \(a_{62}^{{{\text{nloc}}}}\) and \(\bar{d}_{52}^{{{\text{nloc}}}}\), related to the nonlocal-in-time parts, are well confirmed and read [see Table IV in Bini et al. (2020a)]

$$\begin{aligned} a_{62}^{{{\text{nloc}}}}&= \frac{64}{5} - \frac{288}{5}{\gamma _{\text{E}}}+ \frac{928}{35}\ln 2 - \frac{972}{7}\ln 3,\end{aligned}$$
(6.72a)
$$\begin{aligned} \bar{d}_{52}^{{{\text{nloc}}}}&= \frac{67\,736}{105} -\frac{6\,784}{15}{\gamma _{\text{E}}}- \frac{326\,656}{21}\ln 2 + \frac{58\,320}{7}\ln 3. \end{aligned}$$
(6.72b)

The EFT approach by Blümlein et al. (2021b, 2022b) gives,

$$\begin{aligned} a_{62}^{\text{loc}}&= a_{62(\text {rat})}^{\text{loc}} + a_{62(\pi ^2)}^{\text{loc}}, \quad a_{62(\text {rat})}^{\text{loc}} = -\frac{584881}{525}, \quad a_{62(\pi ^2)}^{\text{loc}} = \frac{25911}{256}\pi ^2,\end{aligned}$$
(6.73a)
$$\begin{aligned} \bar{d}_{52}^{\text{loc}}&= \bar{d}_{52(\text {rat})}^{\text{loc}} + \bar{d}_{52(\pi ^2)}^{\text{loc}}, \quad \bar{d}_{52(\text {rat})}^{\text{loc}} = -\frac{10442728}{1575}, \quad \bar{d}_{52(\pi ^2)}^{\text{loc}} = \frac{306545}{512}\pi ^2. \end{aligned}$$
(6.73b)
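To get a feeling for the magnitudes involved, the pieces (6.72) and (6.73) can be summed numerically according to Eq. (6.71). The following is only a sketch: it takes the local parts from the EFT values (6.73), whose rational pieces are described in this section as still controversial, and the helper `combine` is a hypothetical name introduced for illustration.

```python
import math
from fractions import Fraction as F

GAMMA_E = 0.5772156649015329  # Euler-Mascheroni constant


def combine(rational, gamma=0, ln2=0, ln3=0, pi2=0):
    """Evaluate rational + gamma*gamma_E + ln2*ln(2) + ln3*ln(3) + pi2*pi^2."""
    return (float(rational)
            + float(gamma) * GAMMA_E
            + float(ln2) * math.log(2.0)
            + float(ln3) * math.log(3.0)
            + float(pi2) * math.pi ** 2)


# Nonlocal-in-time parts, Eqs. (6.72a) and (6.72b).
a62_nloc = combine(F(64, 5), gamma=F(-288, 5), ln2=F(928, 35), ln3=F(-972, 7))
d52_nloc = combine(F(67736, 105), gamma=F(-6784, 15),
                   ln2=F(-326656, 21), ln3=F(58320, 7))

# Local-in-time parts as given by the EFT values, Eqs. (6.73a) and (6.73b).
a62_loc = combine(F(-584881, 525), pi2=F(25911, 256))
d52_loc = combine(F(-10442728, 1575), pi2=F(306545, 512))

# Totals according to Eq. (6.71).
a62 = a62_nloc + a62_loc
d52 = d52_nloc + d52_loc
```

Keeping the exact fractions until the final conversion to floating point makes it easy to swap in alternative rational parts if other values are to be compared.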

The coefficients \(a_{62(\pi ^2)}^{\text{loc}}\) and \(\bar{d}_{52(\pi ^2)}^{\text{loc}}\) are confirmed by TF.

The prefactor \(q_{442}\), computed in Bini et al. (2019, 2020a), is the sum of the local-in-time and the nonlocal-in-time parts,

$$\begin{aligned} q_{442} = q_{442}^{{{\text{nloc}}}} + q_{442}^{\text{loc}}, \end{aligned}$$
(6.74)

where the nonlocal-in-time part \(q_{442}^{{{\text{nloc}}}}\) reads [see Table IV in Bini et al. (2020a)]

$$\begin{aligned} q_{442}^{{{\text{nloc}}}} = \frac{74\,436}{35} -\frac{1\,184}{5}{\gamma _{\text{E}}}+ \frac{33\,693\,536}{105}\ln 2 - \frac{6\,396\,489}{70}\ln 3 - \frac{9\,765\,625}{126}\ln 5. \end{aligned}$$
(6.75)

The local-in-time part \(q_{442}^{\text{loc}}\) equals

$$\begin{aligned} q_{442}^{\text{loc}} = q_{442(\text {rat})}^{\text{loc}} + q_{442(\pi ^2)}^{\text{loc}}, \quad q_{442(\pi ^2)}^{\text{loc}} = \frac{31\,633}{512}\pi ^2, \end{aligned}$$
(6.76)

where the transcendental part \(q_{442(\pi ^2)}^{\text{loc}}\) is confirmed by both Bini et al. (2019, 2020a) and Blümlein et al. (2022a, b). However, the rational part \(q_{442(\text {rat})}^{\text{loc}}\) has incompatible values according to Bini et al. (2019, 2020a) (TF) and Blümlein et al. (2022a, b) (BMMS),

$$\begin{aligned} q_{442(\text {rat})}^{\text{loc TF}} = -\frac{9\,367}{15}, \quad q_{442(\text {rat})}^{\text{loc BMMS}} = -\frac{1\,252\,924}{1\,575}. \end{aligned}$$
(6.77)

Agreement between the TF and BMMS results could be achieved by including a possibly missing conservative quadratic radiation-reaction (anti-symmetric)\(^2\) term mentioned in Bini et al. (2021), which could lead to the following change of the TF Hamiltonian (Blümlein et al. 2022a, b),

$$\begin{aligned} \delta H^{{\text{(reac)}}^2}_{\text{rad}} = a\,\nu ^2 p'^4_r u^4, \quad a\in \mathbb {R}. \end{aligned}$$
(6.78)

The agreement would be achieved for \(a=-168/5\) (Blümlein et al. 2022a, b).

The genuine (i.e., not the 1PN corrections coming from 4PN level) local and nonlocal tail Hamiltonians at the 5PN order are (Foffa and Sturani 2020; Bini et al. 2021; Almeida et al. 2021; Blümlein et al. 2021b, 2022b)

$$\begin{aligned} H^{\text{tail, nloc}}_{5{\text{PN}}}&= -\frac{G\mathcal {M}}{c^3} {\text{Pf}}_{2r_{12}/c} \int _{-\infty }^{\infty }\frac{\text {d}\tau }{|\tau |}{\mathcal{F}}_{\text{1PN}}^{\text{split}}(t,t + \tau ),\end{aligned}$$
(6.79a)
$$\begin{aligned} H^{\text{tail, loc}}_{\text{5PN}}&= -\frac{G\mathcal {M}}{c^3} \left( R_{\text{oct,e}}{\mathcal{F}}^{\text{split}, MQ^2}_{\text{1PN}}(t,t) + R_{\text{quad,m}} {\mathcal{F}}^{\text{split}, MJ^2}_{\text{1PN}}(t,t) \right) . \end{aligned}$$
(6.79b)

Here, \(\mathcal {M}\) denotes the total ADM conserved mass-energy of the binary system [\(\mathcal {M}=M+\mathcal {O}(c^{-2})\)], the indices \(MQ^2\) and \(MJ^2\) denote the mass-type (or electric-type) octupole-moment (\(Q_{ijk}\)) and the spin-type (or magnetic-type) quadrupole-moment (\(J_{ij}\)) contributions, respectively, and

$$\begin{aligned} {\mathcal{F}}_{\text{1PN}}^{\text{split}}(t,t')&= \frac{G}{c^5}\frac{1}{c^2} \left( \frac{1}{189}Q^{(4)}_{ijk}(t)Q^{(4)}_{ijk}(t') + \frac{16}{45}J^{(3)}_{ij}(t)J^{(3)}_{ij}(t') \right) ,\end{aligned}$$
(6.80a)
$$\begin{aligned} R^{\text{TF}}_{\text{oct,e}}&= R^{\text{FS}}_{\text{oct,e}} = R^{\text{BMMS}}_{\text{oct,e}} = \frac{82}{35},\end{aligned}$$
(6.80b)
$$\begin{aligned} R^{\text{TF}}_{\text{quad,m}}&= R^{\text{AFS}}_{\text{quad,m}} = R^{\text{BMMS}}_{\text{quad,m}} = \frac{49}{20}. \end{aligned}$$
(6.80c)

We have used here the notation \(f^{(n)}(t)\equiv \text {d}^nf(t)/\text {d}t^n\) to denote the n-th derivative with respect to time t.

The magnetic-type quadrupole moment \(J_{ij}=J_{ji}\) comes in via the most subtle form \(\frac{1}{2}R_{0iab}\epsilon _{abj}J_{ij}\), valid in 3 dimensions only. Its d-dimensional generalization needs the avatar \(J_{i|ab}\), antisymmetric with respect to i and b, \(J_{i|ab}=-J_{b|ai}\), that satisfies the cyclic identity \(J_{i|ab}+J_{a|bi}+J_{b|ia}=0\). It reads (Henry et al. 2021; Bini et al. 2021)

$$\begin{aligned} J_{i|ab}&= \nu (m_2-m_1) \bigg (\Big (x^ix^a - \frac{\textbf{x} \cdot \textbf{x}}{d-1}\delta ^{ia}\Big )v_b - \Big (x^ax^b-\frac{\textbf{x} \cdot \textbf{x}}{d-1}\delta ^{ab}\Big )v_i \\&\quad - \frac{\textbf{x} \cdot \textbf{v}}{d-1}(x^i\delta ^{ab} - x^b\delta ^{ia})\bigg ). \end{aligned}$$
(6.81)

Then \(\epsilon _{abj}J_{ij} \equiv J_{b|ia}\),

$$\begin{aligned} J^{(3)}_{ij} J^{(3)}_{ij} \rightarrow \frac{1}{2}J^{(3)}_{i|ab} J^{(3)}_{i|ab}. \end{aligned}$$
(6.82)

The following relations have been derived within TF (Bini et al. 2021), using \(R_{\text{oct,e}}\) and \(R_{\text{quad,m}}\),

$$\begin{aligned} a_{62}^{\text{loc}}&= \frac{25\,911}{256}\pi ^2 + R_{a_6}(C_{QQL},C_{QQQ_1},C_{QQQ_2}), \end{aligned}$$
(6.83a)
$$\begin{aligned} \bar{d}_{52}^{\text{loc}}&= \frac{306\,545}{512}\pi ^2 + R_{d_5}(C_{QQL},C_{QQQ_1},C_{QQQ_2}), \end{aligned}$$
(6.83b)

where \(R_{a_6}\) and \(R_{d_5}\) are given rational-valued functions of the three numerical constants \(C_{QQA}\) (\(A = L,Q_1,Q_2\)) which are defined by specific terms in the effective action for the radiation-type graviton exchange:

$$\begin{aligned} S_{QQL}&= C_{QQL}G^2 \int \text {d}t\, Q^{(4)}_{il}Q^{(3)}_{jl}\epsilon _{ijk}L_k, \end{aligned}$$
(6.84a)
$$\begin{aligned} S_{QQQ_1}&= C_{QQQ_1}G^2 \int \text {d}t\, Q^{(4)}_{il}Q^{(4)}_{jl}Q_{ij}, \end{aligned}$$
(6.84b)
$$\begin{aligned} S_{QQQ_2}&= C_{QQQ_2}G^2 \int \text {d}t\, Q^{(3)}_{il}Q^{(3)}_{jl}Q^{(2)}_{ij}, \end{aligned}$$
(6.84c)

whose values have all been calculated by Foffa and Sturani (2020, 2021), Blümlein et al. (2022a, 2022b), and Almeida et al. (2023b) using in-out and in-in (or, closed-time) formalisms, respectively (see Footnote 11),

$$\begin{aligned} C^{\text{AFS}}_{QQL}&= -\frac{1}{30} = \frac{1}{16} C^{\text{BMMS}}_{QQL}, \end{aligned}$$
(6.85a)
$$\begin{aligned} C^{\text{mem FS}}_{QQQ_1}&= -\frac{1}{15} = \frac{4}{3}C^{\text{mem BMMS}}_{QQQ_1},\end{aligned}$$
(6.85b)
$$\begin{aligned} C^{\text{cont BMMS}}_{QQQ_1}&= \frac{1}{8},\end{aligned}$$
(6.85c)
$$\begin{aligned} C^{\text{mem FS}}_{QQQ_2}&= -\frac{4}{105} = \frac{4}{3}C^{\text{mem BMMS}}_{QQQ_2}. \end{aligned}$$
(6.85d)

The abbreviations “mem” and “cont” denote so-called memory and contact terms, respectively.

In terms of doubled in-in position variables, \(x^i_{a,1}\) (moving forward in time) and \(x^i_{a,2}\) (moving backward in time), with then \(x^i_{a,-} = (x^i_{a,1} - x^i_{a,2})/\sqrt{2}\) and \(x^i_{a,+} = (x^i_{a,1} + x^i_{a,2})/\sqrt{2}\) or, alternatively, \(x^i_{a,-} = x^i_{a,1} - x^i_{a,2}\) and \(x^i_{a,+} = (x^i_{a,1} + x^i_{a,2})/2\), the action functionals obtained in Blümlein et al. (2022a, b) and Almeida et al. (2023a), respectively, coincide. The classical limit reads \(x^i_{a,1} = x^i_{a,2} = x^i_a\). In extracting the classical dynamics, however, Blümlein et al. (2022a, b) and Almeida et al. (2023a) obtained different results.

Within TF (Bini et al. 2021), the following constraint equation is derived from the condition \(\chi ^{\text{cons,EFT}}_4 - \chi ^{\text{cons,TF}}_4 = 0\) on the scattering angles of the conservative dynamics, where \(\chi ^{\text{cons,TF}}_4\) is based on a general rule of mass polynomiality (Damour 2020), according to which terms proportional to \(\nu ^2\) are not present,

$$\begin{aligned} 0 = \frac{2973}{350} - \frac{69}{2}C_{QQL} + \frac{253}{18} C_{QQQ_1} + \frac{85}{9} C_{QQQ_2}, \end{aligned}$$
(6.86)

where the pure rational number is obtained for a specific value of \(q_{44}\). That condition is fulfilled neither by the values from Foffa and Sturani (2020, 2021) nor by those from Blümlein et al. (2022a, b). The results of Almeida et al. (2023a) do not agree with it either.
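The failure of the constraint (6.86) can be made explicit with exact rational arithmetic. The sketch below evaluates the right-hand side of Eq. (6.86) for the values labelled AFS and "mem FS" in Eqs. (6.85a), (6.85b), and (6.85d); treating these three numbers as one consistent set is an assumption made here for illustration. The BMMS set is not evaluated, since the way its memory and contact pieces combine into \(C_{QQQ_1}\) is not spelled out in this section. The function name `constraint_residual` is hypothetical.

```python
from fractions import Fraction as F


def constraint_residual(c_qql, c_qqq1, c_qqq2):
    """Right-hand side of Eq. (6.86); a zero value means the condition holds."""
    return (F(2973, 350)
            - F(69, 2) * c_qql
            + F(253, 18) * c_qqq1
            + F(85, 9) * c_qqq2)


# Values labelled AFS / "mem FS" in Eqs. (6.85a), (6.85b), (6.85d),
# assumed here to form one set (an assumption of this sketch).
residual_fs = constraint_residual(F(-1, 30), F(-1, 15), F(-4, 105))
residual_fs_float = float(residual_fs)
```

With these inputs the residual is a nonzero rational number of order ten, so the mismatch is not a rounding issue.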

To sum up: on the local-in-time level, the 5PN EOB numerical coefficients \(a_{62(\text {rat})}^{\text{loc}}\), \(\bar{d}_{52(\text {rat})}^{\text{loc}}\), and \(q_{442(\text {rat})}^{\text{loc}}\) are still controversial.

6.3.4 Results at 5.5PN order

Half-integer-power PN contributions to conservative two-body dynamics start at the 5.5PN order (Shah et al. 2014; Blanchet et al. 2014). The complete 5.5PN conservative Hamiltonian comes from the second-order tail (i.e., tail-of-tail or tail\(^2\)) effects and it reads (Damour et al. 2015; Bini et al. 2020a)

$$\begin{aligned} H_{5.5 {\text{PN}}}^{\text{tail}^2,{\text{nloc}}} = -\frac{107}{210}\frac{G^2\mathcal {M}^2}{c^6} \int _{-\infty }^{\infty }\frac{\text {d}\tau }{\tau }[{\mathcal{G}}^{\text{split}}(t,t + \tau ) - {\mathcal{G}}^{\text{split}}(t,t - \tau )], \end{aligned}$$
(6.87)

where

$$\begin{aligned} {\mathcal{G}}^{\text{split}}(t,t') := \frac{G}{5c^5} Q^{(3)}_{ij}(t) Q^{(4)}_{ij}(t'). \end{aligned}$$
(6.88)

The contribution of the 5.5PN Hamiltonian \(H_{5.5 {\text{PN}}}^{\text{tail}^2,{\text{nloc}}}\) to an effective EOB dynamics was computed by Damour et al. (2015) and Bini et al. (2020a).

6.3.5 Results at 6PN order

The TF approach has succeeded at the 6PN level to a remarkable extent (Bini et al. 2020b, c), and the EFT approach in part (Blümlein et al. 2020c, 2021a). Only four numerical coefficients of the EOB representation of the 6PN dynamics are unknown [two of them are prefactors of terms proportional to \(\nu ^2\) and \(\nu ^3\) in the potential \(A(u;\nu )\), the remaining two are prefactors of terms proportional to \(\nu ^2\) entering the \(\bar{D}(u;\nu )\) and \(Q(u,p'_r;\nu )\) potentials]. Each of these coefficients is predicted to be the sum of a rational number and a transcendental number.

The nonlocal-in-time 6PN Hamiltonian is known explicitly and reads

$$\begin{aligned} H^{\text{tail, nloc}}_{6{\text{PN}}}&= -\frac{G\mathcal {M}}{c^3}{\text{Pf}}_{2r_{12}/c} \int _{-\infty }^{\infty }\frac{\text {d}\tau }{|\tau |}{\mathcal{F}}_{\text{2PN}}^{\text{split}}(t,t + \tau ),\end{aligned}$$
(6.89)
$$\begin{aligned} {\mathcal{F}}_{\text{2PN}}^{\text{split}}(t,t')&= \frac{G}{c^5}\frac{1}{c^4} \left( \frac{1}{9072} Q^{(5)}_{ijkl}(t) Q^{(5)}_{ijkl}(t') + \frac{1}{84} J^{(4)}_{ijk}(t) J^{(4)}_{ijk}(t') \right) , \end{aligned}$$
(6.90)

where \(Q_{ijkl}\) and \(J_{ijk}\) denote the mass-type hexadecapole and magnetic-type octupole moments, respectively. The R-coefficients, cf. (6.79b), of the corresponding local-in-time part are known, even through all PN orders, see Almeida et al. (2021). Many other local-in-time expressions are not known. All these expressions contribute to the four coefficients listed at the beginning of this subsection.

6.4 The innermost stable circular orbit

The innermost stable circular orbit (ISCO) of a test body orbiting in the Schwarzschild metric is located at \(R=6MG/c^2\), in Schwarzschild coordinates. Within a Hamiltonian formalism the calculation of the ISCO for systems made of bodies of comparable masses is rather straightforward. It is useful to start with a discussion of the dynamics of a two-body system along circular orbits.

The centre-of-mass conservative Hamiltonian \(\hat{H}(\textbf{r},\textbf{p})\) can be reduced to circular orbits by setting \(p_r = \textbf{n}\cdot \textbf{p}= 0\) and \(\textbf{p}^2 = j^2/r^2\), then \(\hat{H}=\hat{H}(r,j)\). Moreover, \(\partial \hat{H}(r,j)/\partial r = 0\) along circular orbits, which gives the link between r and j, \(r=r(j)\). Finally, the energy \(\hat{E}^{\rm {circ}}\) along circular orbits can be expressed as a function of j only, \(\hat{E}^{\rm {circ}}(j)\equiv \hat{H}(r(j),j)\). The link between the (reduced) centre-of-mass energy \(\hat{E}^{\rm {circ}}\) and the (reduced) angular momentum j is explicitly known up to the 4PN order. It reads (Bini and Damour 2013; Damour et al. 2014)

$$\begin{aligned} \hat{E}^{\text{circ}}(j;\nu )&= -\frac{1}{2j^2} \Bigg \{ 1 + \bigg (\frac{9}{4}+\frac{1}{4}\nu \bigg )\frac{1}{j^2} + \bigg (\frac{81}{8} - \frac{7}{8}\nu + \frac{1}{8}\nu ^2\bigg )\frac{1}{j^4} \\&\quad + \bigg [ \frac{3861}{64} + \bigg (\frac{41\pi ^2}{32}-\frac{8833}{192}\bigg ) \nu - \frac{5}{32}\nu ^2 + \frac{5}{64}\nu ^3 \bigg ] \frac{1}{j^6} \\&\quad + \bigg [ \frac{53703}{128} + \bigg (\frac{6581\pi ^2}{512}-\frac{989911}{1920}-\frac{64}{5}\bigg (2\gamma _\text {E}+\ln \frac{16}{j^2}\bigg )\bigg )\nu \\&\quad + \bigg (\frac{8875}{384}-\frac{41\pi ^2}{64}\bigg )\nu ^2-\frac{3}{64}\nu ^3+\frac{7}{128}\nu ^4\bigg ] \frac{1}{j^8} + \mathcal {O}(j^{-10}) \Bigg \}. \end{aligned}$$
(6.91)

An important observational quantity is the angular frequency of circular orbits, \(\omega _{\rm {circ}}\). It can be computed as

$$\begin{aligned} \omega _{\rm {circ}} = \frac{1}{GM} \frac{\text {d}\hat{E}^{\rm {circ}}}{\text {d}j}. \end{aligned}$$
(6.92)

It is convenient to introduce the coordinate-invariant dimensionless variable (which can also serve as small PN expansion parameter)

$$\begin{aligned} x \equiv \left( \frac{GM\omega _{\rm {circ}}}{c^3}\right) ^{2/3}. \end{aligned}$$
(6.93)

Making use of Eqs. (6.92) and (6.93) it is not difficult to translate the link of Eq. (6.91) into the dependence of the energy \(\hat{E}^{\rm {circ}}\) on the parameter x. The 4PN-accurate formula reads (Bini and Damour 2013; Damour et al. 2014)

$$\begin{aligned} \hat{E}^{\text{circ}}(x;\nu )&= -\frac{x}{2} \Bigg \{ 1 - \bigg (\frac{3}{4} + \frac{\nu }{12} \bigg ) x + \bigg (-\frac{27}{8} + \frac{19\nu }{8} - \frac{\nu ^2}{24}\bigg ) x^2 \\&\quad + \bigg [ -\frac{675}{64} + \left( \frac{34445}{576}-\frac{205\pi ^2}{96}\right) \nu -\frac{155\nu ^2}{96} - \frac{35\nu ^3}{5184} \bigg ] x^3 \\&\quad + \bigg [ -\frac{3969}{128} + \bigg (\frac{9037 \pi ^2}{1536}-\frac{123671}{5760}+\frac{448}{15}\big (2\gamma _{\rm {E}}+\ln (16 x)\big )\bigg )\nu \\&\quad + \left( \frac{3157\pi ^2}{576} -\frac{498449}{3456}\right) \nu ^2 + \frac{301\nu ^3}{1728} + \frac{77\nu ^4}{31104} \bigg ] x^4 + \mathcal {O}(x^5) \Bigg \}. \end{aligned}$$
(6.94)

In the test-mass limit \(\nu \rightarrow 0\) (describing motion of a test particle on a circular orbit in the Schwarzschild spacetime) the link \(\hat{E}^{\text{circ}}(x;\nu )\) is exactly known,

$$\begin{aligned} \hat{E}^{\text{circ}}(x;0) = \frac{1-2x}{\sqrt{1-3x}} - 1. \end{aligned}$$
(6.95)

The location \(x_{\text {ISCO}}=1/6\) of the ISCO in the test-mass limit corresponds to the minimum of the function \(\hat{E}^{\text{circ}}(x;0)\), i.e.

$$\begin{aligned} \frac{\text {d}\hat{E}^{\text{circ}}(x;0)}{\text {d}x}\bigg |_{x=x_{\text {ISCO}}} = 0. \end{aligned}$$
(6.96)

Therefore the most straightforward way of locating the ISCO for \(\nu >0\) relies on looking for the minimum of the function \(\hat{E}^{\text{circ}}(x;\nu )\), i.e., for a given value of \(\nu \), the location of the ISCO is obtained by (usually numerically) solving the equation \(\text {d}\hat{E}^{\text{circ}}(x;\nu )/(\text {d}x)=0\) (Blanchet 2002). Equivalently, the location of the ISCO can be defined as a solution of the set of simultaneous equations \(\partial \hat{H}(r,j)/\partial r = 0\) and \(\partial ^2\hat{H}(r,j)/\partial r^2 = 0\). The two approaches are equivalent only for the exact Hamiltonian \(\hat{H}(r,j)\); see, however, Sect. IV A 2 in Buonanno et al. (2003, 2006) for subtleties related to the equivalence of the two approaches when using post-Newtonian-accurate Hamiltonians. With the aid of the latter method, Schäfer and Wex (1993a) computed the nPN-accurate ISCO of the test mass in the Schwarzschild metric through 9PN order in three different coordinate systems, obtaining three different results. Clearly, only the application of the first method results in an nPN-accurate ISCO described by parameters which are coordinate invariant.

Let us consider the 4PN-accurate expansion of the exact test-mass-limit formula (6.95),

$$\begin{aligned} \hat{E}^{\text{circ}}(x;0) = -\frac{x}{2} \bigg ( 1 - \frac{3}{4} x - \frac{27}{8} x^2 -\frac{675}{64} x^3 -\frac{3969}{128}x^4 +\mathcal {O}(x^5) \bigg ). \end{aligned}$$
(6.97)

Let us compute the successive PN estimations of the exact ISCO frequency parameter \(x_{\text {ISCO}}=1/6\cong 0.166667\) in the test-mass limit, by solving the equations \(\text {d}\hat{E}_{n{{\text{PN}}}}^{\text{circ}}(x;0)/(\text {d}x)=0\) for \(n=1,\ldots ,4\), where the function \(\hat{E}_{n{{\text{PN}}}}^{\text{circ}}(x;0)\) is defined as the \(\mathcal {O}(x^{n+1})\)-accurate truncation of the right-hand side of Eq. (6.97). They read: 0.666667 (1PN), 0.248807 (2PN), 0.195941 (3PN), 0.179467 (4PN). One sees that the 4PN prediction for the ISCO frequency parameter is still \(\sim \)8% larger than the exact result. This suggests that the straightforward Taylor approximants of the energy function \(\hat{E}^{\text{circ}}(x;\nu )\) do not converge fast enough to determine satisfactorily the frequency parameter of the ISCO in the \(\nu >0\) case either, at least for sufficiently small values of \(\nu \). The extrapolation of this statement to larger \(\nu \) is supported by the values of the ISCO locations in the equal-mass case (\(\nu =1/4\)), obtained by solving the equations \(\text {d}\hat{E}_{n{{\text{PN}}}}^{\text{circ}}(x;1/4)/(\text {d}x)=0\) for \(n=1,\ldots ,4\), where the function \(\hat{E}_{n{{\text{PN}}}}^{\text{circ}}(x;\nu )\) is now defined as the \(\mathcal {O}(x^{n+1})\)-accurate truncation of the right-hand side of Eq. (6.94). For the approximations from 1PN up to 4PN the ISCO locations read (Damour et al. 2000a; Blanchet 2002; Jaranowski and Schäfer 2013): 0.648649 (1PN), 0.265832 (2PN), 0.254954 (3PN), and 0.236599 (4PN) (see Footnote 12).
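The Taylor-based ISCO estimates quoted above can be reproduced with a few lines of code. The sketch below transcribes the coefficients of Eq. (6.94) and looks for the smallest positive zero of \(\text {d}\hat{E}_{n{\text{PN}}}^{\text{circ}}/\text {d}x\) by a grid scan followed by bisection. The root-finding strategy and all function names are choices made here for illustration, not the procedure of the cited papers.

```python
import math

GAMMA_E = 0.5772156649015329  # Euler-Mascheroni constant


def energy_coefficients(nu):
    """Coefficients of Eq. (6.94), written as
    E = -(x/2) [1 + c1 x + c2 x^2 + c3 x^3 + (c4 + c4ln ln x) x^4]."""
    pi2 = math.pi ** 2
    c1 = -(3 / 4 + nu / 12)
    c2 = -27 / 8 + 19 * nu / 8 - nu ** 2 / 24
    c3 = (-675 / 64 + (34445 / 576 - 205 * pi2 / 96) * nu
          - 155 * nu ** 2 / 96 - 35 * nu ** 3 / 5184)
    c4 = (-3969 / 128
          + (9037 * pi2 / 1536 - 123671 / 5760
             + 448 / 15 * (2 * GAMMA_E + math.log(16))) * nu
          + (3157 * pi2 / 576 - 498449 / 3456) * nu ** 2
          + 301 * nu ** 3 / 1728 + 77 * nu ** 4 / 31104)
    c4ln = 448 / 15 * nu  # coefficient of ln x, from ln(16 x) = ln 16 + ln x
    return [c1, c2, c3, c4], c4ln


def denergy_dx(x, nu, n_pn):
    """Returns -2 dE/dx for the nPN truncation (same zeros as dE/dx)."""
    coeffs, c4ln = energy_coefficients(nu)
    total = 1.0
    for k, c in enumerate(coeffs[:n_pn], start=1):
        if k == 4:
            c = c + c4ln * math.log(x)
        total += (k + 1) * c * x ** k
    if n_pn >= 4:
        total += c4ln * x ** 4  # extra term from differentiating ln x
    return total


def taylor_isco(nu, n_pn, step=1e-3):
    """Smallest positive zero of dE/dx: grid scan, then bisection."""
    lo = 1e-6
    while lo < 1.0:
        hi = lo + step
        if denergy_dx(lo, nu, n_pn) > 0 >= denergy_dx(hi, nu, n_pn):
            for _ in range(60):
                mid = 0.5 * (lo + hi)
                if denergy_dx(mid, nu, n_pn) > 0:
                    lo = mid
                else:
                    hi = mid
            return 0.5 * (lo + hi)
        lo = hi
    raise ValueError("no zero of dE/dx found for 0 < x < 1")


test_mass = [taylor_isco(0.0, n) for n in range(1, 5)]
equal_mass = [taylor_isco(0.25, n) for n in range(1, 5)]
```

A grid scan is used before bisecting because, for \(\nu >0\) at 4PN, the derivative changes sign more than once on \(0<x<1\), so bracketing the whole interval would not isolate the first zero.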

To overcome the problem of the slow convergence of PN expansions, several new methods of determining the ISCO for comparable-mass binaries were devised by Damour et al. (2000a). They use different “resummation” techniques and are based on the consideration of gauge-invariant functions. One of the methods, called the “j-method” by Damour et al. (2000a), employs the invariant function linking the angular momentum and the angular frequency along circular orbits and uses Padé approximants. The ISCO is defined in this method as the minimum, for a fixed value of \(\nu \), of the function \(j^2(x;\nu )\), where j is the reduced angular momentum [introduced in Eq. (6.26)]. The function \(j^2(x;\nu )\) is known in the test-mass limit,

$$\begin{aligned} j^2(x;0) = \frac{1}{x(1-3x)}, \end{aligned}$$
(6.98)

and its minimum coincides with the exact “location” \(x_{\text {ISCO}}=1/6\) of the test-mass ISCO. The form of this function suggests using Padé approximants instead of direct Taylor expansions. It also suggests requiring that all approximants used have a pole at some \(x_{\rm {pole}}\), which is related to the test-mass “light-ring” orbit occurring for \(x_{\rm {lr}}=1/3\) in the sense that \(x_{\rm {pole}}(\nu )\rightarrow 1/3\) when \(\nu \rightarrow 0\). The 4PN-accurate function \(j^2(x;\nu )\) has the symbolic structure \((1/x)(1+x+\ldots +x^4+x^4\ln x)\). In the j-method the Taylor expansion at the 1PN level with symbolic form \(1+x\) is replaced by a Padé approximant of type (0,1), at the 2PN level \(1+x+x^2\) is replaced by a (1,1) approximant, at the 3PN level \(1+x+x^2+x^3\) is replaced by a (2,1) approximant, and finally at the 4PN level \(1+x+x^2+x^3+x^4\) is replaced by a (3,1) Padé approximant [the explicit form of the (0,1), (1,1), and (2,1) approximants can be found in Eqs. (4.16) of Damour et al. 2000a]. At all PN levels the test-mass result is recovered exactly, and Jaranowski and Schäfer (2013) showed that the ISCO locations resulting from 3PN-accurate and 4PN-accurate calculations almost coincide for all values of \(\nu \), \(0\le \nu \le \frac{1}{4}\). The ISCO locations in the equal-mass case \(\nu =1/4\) for the approximations from 1PN up to 4PN are as follows (Jaranowski and Schäfer 2013): 0.162162 (1PN), 0.185351 (2PN), 0.244276 (3PN), 0.242967 (4PN).
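The two lowest-order j-method values can be checked with a short sketch. It is restricted to the 1PN and 2PN levels and rests on one step not written out in the text: the Taylor coefficients of \(x\,j^2(x;\nu )=1+a\,x+b\,x^2+\cdots \) are derived here from Eq. (6.94) and the dimensionless form \(\text {d}\hat{E}/\text {d}j=x^{3/2}\) of Eqs. (6.92)–(6.93), giving \(a=-4c_1\) and \(b=4c_1^2-2c_2\), with \(c_1\), \(c_2\) the 1PN and 2PN bracket coefficients of (6.94). Function names are hypothetical.

```python
import math


def j2_taylor_coefficients(nu):
    """a, b in x*j^2 = 1 + a x + b x^2 + ..., derived (assumption of this
    sketch) from Eq. (6.94) together with dE/dj = x^(3/2)."""
    c1 = -(3 / 4 + nu / 12)
    c2 = -27 / 8 + 19 * nu / 8 - nu ** 2 / 24
    a = -4 * c1                # equals 3 + nu/3
    b = 4 * c1 ** 2 - 2 * c2   # equals 9 - 17 nu/4 + nu^2/9
    return a, b


def pade_isco_1pn(nu):
    """(0,1) approximant: x*j^2 = 1/(1 - a x); j^2 is minimal at x = 1/(2a)."""
    a, _ = j2_taylor_coefficients(nu)
    return 1 / (2 * a)


def pade_isco_2pn(nu):
    """(1,1) approximant: x*j^2 = (1 + n1 x)/(1 - d x) with d = b/a, n1 = a - d.
    The minimum of j^2 solves n1*d*x^2 + 2*d*x - 1 = 0."""
    a, b = j2_taylor_coefficients(nu)
    d = b / a
    n1 = a - d
    if abs(n1) < 1e-14:        # test-mass limit: numerator reduces to 1
        return 1 / (2 * d)
    return (-d + math.sqrt(d * d + n1 * d)) / (n1 * d)


isco_1pn_equal_mass = pade_isco_1pn(0.25)
isco_2pn_equal_mass = pade_isco_2pn(0.25)
isco_2pn_test_mass = pade_isco_2pn(0.0)
```

In the test-mass limit both approximants collapse to \(1/[x(1-3x)]\), so the exact value \(x_{\text {ISCO}}=1/6\) is returned, in line with the statement that the test-mass result is recovered at all PN levels.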

6.5 Dissipative Hamiltonians

To discuss dissipative Hamiltonians it is convenient to use the toy model from Sect. 3.2 with the Routhian \(R(q,p;\xi ,{\dot{\xi }})\) and its corresponding Hamiltonian \(H(q,p;\xi ,\pi )=R+\pi {\dot{\xi }}\). The Hamilton equations of motion for the \((q,p)\) variables read

$$\begin{aligned} \dot{p} = -\frac{\partial H}{\partial q} = -\frac{\partial R}{\partial q}, \quad \dot{q} = \frac{\partial H}{\partial p} = \frac{\partial R}{\partial p}, \end{aligned}$$
(6.99)

and the Euler–Lagrange equation for the \(\xi \) variable is

$$\begin{aligned} \frac{\partial R}{\partial \xi } - \frac{\text {d}}{\text {d}t}\frac{\partial R}{\partial {\dot{\xi }}} = 0. \end{aligned}$$
(6.100)

Alternatively, the Hamilton equations of motion for the \((\xi ,\pi )\) variables can be used. Solutions of the Euler–Lagrange equation are functions \(\xi =\xi (q,p)\). Under those solutions, the Hamilton equations of motion for the \((q,p)\) variables become

$$\begin{aligned} \dot{p} = -\frac{\partial R}{\partial q}\bigg |_{\xi =\xi (q,p)}, \quad \dot{q} = \frac{\partial R}{\partial p}\bigg |_{\xi =\xi (q,p)}. \end{aligned}$$
(6.101)

These autonomous equations in the \((q,p)\) variables contain the full conservative and dissipative content of the \((q,p)\) dynamics. The time-symmetric part of R yields the conservative equations of motion and the time-antisymmetric part the dissipative ones. The conservative equations of motion agree with the Fokker-type ones showing the same boundary conditions for the \((\xi ,{\dot{\xi }})\) variables. When going from the \((\xi ,{\dot{\xi }})\) variables to the field variables \(h^{\text{TT}}\) and \(\dot{h}^{\text{TT}}\), those time-symmetric boundary conditions mean as much incoming as outgoing radiation.

To describe astrophysical systems one should use the physical boundary conditions of no incoming radiation and past stationarity. Clearly, radiative dissipation now takes place, and the time-symmetric part of the whole dynamics constitutes the conservative part. In linear theories the conservative part just results from the symmetric Green function \(G_\text {s}\), whereas the dissipative one comes from the antisymmetric Green function \(G_\text {a}\), which is a homogeneous solution of the wave equation. Together they combine into the retarded Green function \(G_{\text{ret}}=G_{\text{s}}+G_{\text{a}}\), with \(G_{\text{s}} = (1/2)(G_{\text{ret}} + G_{\text{adv}})\) and \(G_{\text{a}} = (1/2)(G_{\text{ret}} - G_{\text{adv}})\), where \(G_{\text{adv}}\) denotes the advanced Green function. In non-linear theories time-symmetric effects can also result from homogeneous solutions, e.g., the tail contributions.

For a binary system, the leading-order direct and tail radiation reaction enter the Routhian in the form

$$\begin{aligned} R^{\text {rr}}(\textbf{x}_a,\textbf{p}_a,t) = -\frac{1}{2}\,h^{\text {TT}\,\text {rr}}_{ij}(t)\, \left( \frac{p_{1i}p_{1j}}{m_1} + \frac{p_{2i}p_{2j}}{m_2} - \frac{Gm_1m_2}{r_{12}} n_{12}^i n_{12}^j\right) , \end{aligned}$$
(6.102)

where \(h^{\text {TT}\,\text {rr}}_{ij}(t)\) decomposes into a direct radiation-reaction term and a tail one (Damour et al. 2016),

$$\begin{aligned} h^{\text {TT}\,\text {rr}}_{ij}(t) = -\frac{4G}{5c^5} \left( {I}^{(3)}_{\!ij}(t) + \frac{4GM}{c^3} \int ^\infty _0\text {d}\tau \,\text{ln}\left( \frac{c\tau }{2s_{\rm{phys}}}\right) {I}^{(5)}_{\!ij}(t-\tau )\right) . \end{aligned}$$
(6.103)

The last term on the right side results in a Routhian, which reproduces the corresponding tail effects in Blanchet (1993) and Galley et al. (2016).

The conservative (time-symmetric) part in \(h^{\text {TT}\,\text {rr}}_{ij}\) reads

$$\begin{aligned} h^{\text {TT}\,\text {rr}\,\text {con}}_{ij}(t) = -\frac{8G^2M}{5c^8} {\text{Pf}}_{2s_{\text{phys}}/c} \int ^\infty _{-\infty } \frac{\text {d}t'}{|t-t'|}\,{I}^{(4)}_{\!ij}(t'), \end{aligned}$$
(6.104)

and the dissipative (time-antisymmetric) one equals

$$\begin{aligned} h^{\text {TT}\,\text {rr}\,\text {dis}}_{ij}(t) = -\frac{4G}{5c^5} {I}^{(3)}_{\!ij}(t) - \frac{8G^2M}{5c^8} {\text{Pf}}_{2s_{\text{phys}}/c} \int ^\infty _{-\infty }\frac{\text {d}t'}{t-t'}\,{I}^{(4)}_{\!ij}(t'), \end{aligned}$$
(6.105)

where use has been made of the relations

$$\begin{aligned} {\text{Pf}}_{\tau _0} \int ^\infty _{-\infty } \frac{\text {d}t' f(t')}{|t-t'|}&= \int ^{\infty }_0 \text {d}\tau \, \text{ln}\left( \frac{\tau }{\tau _0}\right) [f^{(1)}(t-\tau ) - f^{(1)}(t+\tau )], \end{aligned}$$
(6.106)
$$\begin{aligned} {\text{Pf}}_{\tau _0} \int ^\infty _{-\infty } \frac{\text {d}t' f(t')}{t-t'}&= \int ^{\infty }_0 \text {d}\tau \,\text{ln}\left( \frac{\tau }{\tau _0}\right) [f^{(1)}(t-\tau ) + f^{(1)}(t+\tau )]. \end{aligned}$$
(6.107)
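The relation (6.106) follows from an integration by parts. The short derivation below assumes the standard definition of the Hadamard partie finie with time scale \(\tau _0\), in which the value at \(\tau =0\) is subtracted on the interval \(0<\tau <\tau _0\), and it assumes that \(f\) decays fast enough for the boundary terms at infinity to vanish. With the auxiliary function \(g(\tau )\equiv f(t-\tau )+f(t+\tau )\), introduced here for convenience,

$$\begin{aligned} {\text{Pf}}_{\tau _0} \int ^\infty _{-\infty } \frac{\text {d}t' f(t')}{|t-t'|}&= \int _0^{\tau _0} \frac{\text {d}\tau }{\tau }\,[g(\tau ) - g(0)] + \int _{\tau _0}^{\infty } \frac{\text {d}\tau }{\tau }\,g(\tau ) \\&= -\int _0^{\infty } \text {d}\tau \,\text{ln}\left( \frac{\tau }{\tau _0}\right) \frac{\text {d}g(\tau )}{\text {d}\tau } \\&= \int ^{\infty }_0 \text {d}\tau \, \text{ln}\left( \frac{\tau }{\tau _0}\right) [f^{(1)}(t-\tau ) - f^{(1)}(t+\tau )]. \end{aligned}$$

The boundary terms drop out because \(g(\tau )-g(0)\) vanishes at \(\tau =0\), the logarithm vanishes at \(\tau =\tau _0\), and \(g(\tau )\ln \tau \rightarrow 0\) for \(\tau \rightarrow \infty \) by assumption. The relation (6.107) is obtained in the same way with \(g(\tau )\) replaced by \(f(t-\tau )-f(t+\tau )\), for which no subtraction at \(\tau =0\) is needed.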

The leading-order 2.5PN dissipative binary orbital dynamics is described by the non-autonomous Hamiltonian (Schäfer 1995),

$$\begin{aligned} H_{\text {2.5PN}}(\textbf{x}_a,\textbf{p}_a,t) = \frac{2G}{5c^5}\,\dddot{I}_{ij}\big (x'^k_a(t)\big )\, \left( \frac{p_{1i}p_{1j}}{m_1} + \frac{p_{2i}p_{2j}}{m_2} - \frac{Gm_1m_2}{r_{12}} n_{12}^i n_{12}^j\right) , \end{aligned}$$
(6.108)

where \(I_{ij}\) is the Newtonian mass-quadrupole tensor,

$$\begin{aligned} I_{ij}\big (x'^k_a(t)\big ) \equiv \sum _a m_a \big (x'^i_a(t)x'^j_a(t) - \frac{1}{3}{} \textbf{x}'^2_a(t)\delta _{ij}\big ). \end{aligned}$$
(6.109)
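For concreteness, the tensor (6.109) can be evaluated for given masses and positions. The following minimal sketch uses plain nested lists; the function name `mass_quadrupole` and the sample configuration are illustrative choices only.

```python
def mass_quadrupole(masses, positions):
    """Newtonian mass-quadrupole tensor I_ij of Eq. (6.109) as a 3x3 list."""
    quad = [[0.0] * 3 for _ in range(3)]
    for m, x in zip(masses, positions):
        r2 = sum(c * c for c in x)  # squared distance from the origin
        for i in range(3):
            for j in range(3):
                delta = 1.0 if i == j else 0.0
                quad[i][j] += m * (x[i] * x[j] - r2 * delta / 3.0)
    return quad


# Two bodies on the x axis with the centre of mass at the origin.
quad = mass_quadrupole([1.0, 2.0], [(2.0, 0.0, 0.0), (-1.0, 0.0, 0.0)])
trace = quad[0][0] + quad[1][1] + quad[2][2]
```

By construction the tensor is symmetric and trace-free, which the sample configuration confirms.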

Only after the Hamilton equations of motion have been obtained may the primed position and momentum variables coming from \(\dddot{I}_{ij}\) be identified with the unprimed position and momentum variables; see also Galley (2013). Generally, the treatment of dissipation with Hamiltonians or Lagrangians necessarily needs a doubling of variables (Bateman 1931). In quantum mechanics, that treatment was introduced by Schwinger (1961) and Keldysh (1965). In the EFT approach, too, a doubling of variables is needed if one wants to treat dissipative systems in a full-fledged manner at the action level (see, e.g., Galley and Leibovich 2012 and Galley et al. 2016). However, one should keep in mind that in quantum mechanics damping can also be treated without doubling of variables by making use of the fact that the Feynman Green function \(G_{\text{F}}\), the analogue of the retarded Green function of classical physics, decomposes into real and imaginary parts, \(G_{\text{F}} = G_{\text{s}} + (i/2)G^{(1)}\), where both \(G_{\text{s}}\) from above and \(G^{(1)}\), Hadamard’s elementary function, are symmetric Green functions, with \(G^{(1)}\) solving the homogeneous wave equation as \(G_{\text{a}}\) does. The imaginary part in, e.g., Eq. (8.7.57) in the book by Brown (1992) yields nothing but the dipole radiation loss formula, and this without any doubling of variables (see also Sect. 9–4 in Feynman and Hibbs 1965). Note, however, that the statement concerning the Feynman propagator applies only to the calculation of the energy flux, not to that of the gravitational-wave amplitude.

Applications of the 2.5PN Hamiltonian can be found in, e.g., Kokkotas and Schäfer (1995), Ruffert et al. (1996), Buonanno and Damour (1999), Gopakumar and Schäfer (2008), where in Gopakumar and Schäfer (2008) a transformation to the Burke-Thorne gauge (coordinate conditions) is performed. More information on the 2.5PN dissipation can be found in Damour (1987a). The 3.5PN Hamiltonian for many point-mass systems is known too; it is displayed in Appendix E (Jaranowski and Schäfer 1997; Königsdörffer et al. 2003). Recently the 4.5PN radiation-reaction acceleration for nonspinning binaries was computed using the EFT approach (Leibovich et al. 2023). Regarding gravitational spin interaction (see the next section), radiation-reaction Hamiltonians have also been derived for this case through leading-order spin-orbit and spin-spin couplings (Steinhoff and Wang 2010; Wang et al. 2011). Recent related developments within the EFT formalism are found in Maia et al. (2017a, 2017b).

Let us mention that the already cited article Galley et al. (2016) contains two interesting results improving upon and correcting an earlier article by Foffa and Sturani (2013b): on the one hand, it confirms the conservative part of the tail action, particularly the additional rational constant 41/30 which corresponds to the famous 5/6 in the Lamb shift (see, e.g., Brown 2000), and on the other hand, it correctly delivers the dissipative part of the tail interaction. It is worth noting that in both articles the calculations involved were performed in harmonic coordinates.

7 Generalized ADM formalism for spinning objects

In this section we review the relatively recent generalization of the ADM formalism describing the dynamics of systems made of spinning point masses or, more precisely, pole-dipole particles. We start by reviewing the generalization which is of fully reduced form (i.e., without unresolved constraints, spin supplementary and coordinate conditions) and which is valid to linear order in spin variables; our presentation of linear-in-spins dynamics closely follows that of Steinhoff and Schäfer (2009a).

7.1 Dynamics linear in spins

In this section Latin indices from the middle of the alphabet i, j, k, \(\ldots \) are running through \(\{1,2,3\}\). We utilize three different reference frames here, denoted by different indices. Greek indices refer to the coordinate frame \((x^\mu )\) and have the values \(\mu =0,1,2,3\). Lower case Latin indices from the beginning of the alphabet refer to the local Lorentz frame with its associated tetrad fields \(\big (e_a^\mu (x^\nu )\big )\) (\(e_a^\mu \) denotes thus the \(\mu \) coordinate-frame component of the tetrad vector of label a), while upper case ones denote the so-called body-fixed Lorentz frame with its associated “tetrad” \(\big (\varLambda ^{\,a}_A(z^\mu )\big )\), where \((z^\mu )\) denotes coordinate-frame components of the body’s position (so \(\varLambda ^{\,a}_A\) is the a local-Lorentz-frame component of the tetrad vector of label A). The values of these Lorentz indices are marked by round and square brackets as \(a=(0),(i)\) and \(A=[0],[i]\), respectively, e.g., \(A=[0],[1],[2],[3]\). The basics of the tetrad formalism in GR can be found in, e.g., Sect. 12.5 of Weinberg (1972).

In GR, the coupling of a spinning object to a gravitational field, in terms of a Lagrangian density, reads

$$\begin{aligned} \mathcal {L}_M = \int \text {d}\tau \left[ \left( p_{\mu } - \frac{1}{2} S_{ab} \,\omega _{\mu }^{\,ab}\right) \frac{\text {d}z^{\mu }}{\text {d}\tau } + \frac{1}{2} S_{ab} \frac{\delta \theta ^{ab}}{\text {d}\tau } \right] \delta ^{(4)}(x^{\nu }-z^{\nu }(\tau )). \end{aligned}$$
(7.1)

The linear momentum variable is \(p_{\mu }\) and the spin tensor is denoted by \(S_{ab}\). The object’s affine time variable is \(\tau \) and \(\delta ^{(4)}(x^{\nu }-z^{\nu }(\tau ))\) is the 4-dimensional Dirac delta function (from now on we will abbreviate it to \(\delta ^{(4)}\)). The angle variables are represented by some Lorentz matrix satisfying \(\varLambda ^{Aa} \varLambda ^{Bb} \eta _{AB} = \eta ^{ab}\) or \(\varLambda _{Aa} \varLambda _{Bb} \eta ^{ab} = \eta _{AB}\), where \(\eta _{AB} = \text{ diag }(-1,1,1,1) = \eta ^{ab}\), which must be respected upon infinitesimal Lorentz transformations (see Hanson and Regge 1974), so \(\delta \theta ^{ab}\equiv \varLambda _{C}^{\,a}\text {d}\varLambda ^{Cb}=-\delta \theta ^{ba}\). The Ricci rotation coefficients \(\omega _{\mu }^{\,ab}\) are given by \(\omega _{\mu \alpha \beta }=e_{a\alpha }e_{b\beta }\omega _{\mu }^{\, a b}=-\varGamma _{\beta \alpha \mu }^{(4)} + e_{\alpha ,\mu }^c e_{c\beta }\), with \(\varGamma _{\beta \alpha \mu }^{(4)} = \frac{1}{2} (g_{\beta \alpha ,\mu } + g_{\beta \mu ,\alpha } - g_{\alpha \mu ,\beta })\) as the 4-dimensional Christoffel symbols of the first kind with \(g_{\mu \nu } = e_{a\mu } e_{b\nu } \eta ^{ab}\) the 4-dimensional metric. As in Hanson and Regge (1974), the matrix \(\varLambda ^{Ca}\) can be subjected to right (or left) Lorentz transformations, which correspond to transformations of the local Lorentz reference frame (or the body-fixed frame, respectively). In the action (7.1) only a minimal coupling between spin variables and gravitational field is employed; for more general (than minimal) couplings, the reader is referred to Bailey and Israel (1975).
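The two statements about the angle variables, the Lorentz condition \(\varLambda ^{Aa} \varLambda ^{Bb} \eta _{AB} = \eta ^{ab}\) and the antisymmetry of \(\delta \theta ^{ab}\), can be checked numerically. The following is only a minimal sketch in flat spacetime: the one-parameter family `lorentz(t)` (a boost along x times a rotation about z) and all function names are hypothetical choices made for illustration, and the differential is approximated by central differences.

```python
import math

# Minkowski metric eta_AB = diag(-1, 1, 1, 1); numerically equal to eta^{ab}.
ETA = [[-1.0, 0.0, 0.0, 0.0],
       [0.0, 1.0, 0.0, 0.0],
       [0.0, 0.0, 1.0, 0.0],
       [0.0, 0.0, 0.0, 1.0]]


def matmul(a, b):
    """Product of two 4x4 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def transpose(a):
    return [[a[j][i] for j in range(4)] for i in range(4)]


def lorentz(t):
    """Hypothetical family Lambda^{Ca}(t): row = body label C, column = local label a."""
    ch, sh = math.cosh(0.3 * t), math.sinh(0.3 * t)
    co, si = math.cos(0.7 * t), math.sin(0.7 * t)
    boost = [[ch, sh, 0.0, 0.0], [sh, ch, 0.0, 0.0],
             [0.0, 0.0, 1.0, 0.0], [0.0, 0.0, 0.0, 1.0]]
    rotation = [[1.0, 0.0, 0.0, 0.0], [0.0, co, -si, 0.0],
                [0.0, si, co, 0.0], [0.0, 0.0, 0.0, 1.0]]
    return matmul(boost, rotation)


def orthogonality_defect(lam):
    """Largest entry of Lambda^{Aa} Lambda^{Bb} eta_AB - eta^{ab}."""
    prod = matmul(transpose(lam), matmul(ETA, lam))
    return max(abs(prod[a][b] - ETA[a][b]) for a in range(4) for b in range(4))


def delta_theta_rate(t, h=1e-4):
    """Central-difference estimate of Lambda_C^a dLambda^{Cb}/dt."""
    lam, plus, minus = lorentz(t), lorentz(t + h), lorentz(t - h)
    dlam = [[(plus[i][j] - minus[i][j]) / (2 * h) for j in range(4)]
            for i in range(4)]
    # Lambda_C^a = eta_CD Lambda^{Da}, hence the factor ETA in the middle.
    return matmul(transpose(lam), matmul(ETA, dlam))
```

Differentiating the Lorentz condition shows why the result of `delta_theta_rate` must be antisymmetric; the numerical check only confirms this up to the finite-difference error.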

The matter constraints are given by, also in terms of a Lagrangian density,

$$\begin{aligned} \mathcal {L}_C = \int \text {d}\tau \left[ \lambda _1^a p^b S_{ab} + \lambda _{2[i]} \varLambda ^{[i] a} p_a - \frac{\lambda _3}{2} (p^2 + m^2c^2) \right] \delta ^{(4)}, \end{aligned}$$
(7.2)

where m is the constant mass of the object, \(p^2\equiv p_{\mu }p^{\mu }\), and \(\lambda _1^a\), \(\lambda _{2[i]}\), \(\lambda _3\) are the Lagrange multipliers. The constraint

$$\begin{aligned} p^b S_{ab} = 0 \end{aligned}$$
(7.3)

is called the spin supplementary condition (SSC); it states that in the rest frame the spin tensor contains the 3-dimensional spin \(S_{(i)(j)}\) only (i.e., the mass-dipole part \(S_{(0)(i)}\) vanishes).Footnote 13 The conjugate constraint \(\varLambda ^{[i] a} p_a = 0\) ensures that \(\varLambda ^{C a}\) is a pure 3-dimensional rotation matrix in the rest frame (no Lorentz boosts), see Hanson and Regge (1974). Finally, the gravitational part is given by the usual Einstein-Hilbert Lagrangian density

$$\begin{aligned} \mathcal {L}_G = \frac{c^4}{16\pi G} \sqrt{-g} R^{(4)}, \end{aligned}$$
(7.4)

where g is the determinant of the 4-dimensional metric and \(R^{(4)}\) is the 4-dimensional Ricci scalar. Using a second-order form of the gravitational action, i.e., not varying the connection independently, ensures that the torsion tensor vanishes, see, e.g., Nelson and Teitelboim (1978). The complete Lagrangian density is the sum

$$\begin{aligned} \mathcal {L} = \mathcal {L}_G + \mathcal {L}_M + \mathcal {L}_C. \end{aligned}$$
(7.5)

We assume space-asymptotic flatness as a boundary condition of the spacetime. The total action is given in a second-order form, where the Ricci rotation coefficients are not independent field degrees of freedom and where no torsion of spacetime shows up. It reads

$$\begin{aligned} W[e_{a\mu },z^{\mu },p_{\mu },\varLambda ^{Ca},S_{ab},\lambda _1^a,\lambda _{2[i]},\lambda _3] = \int \text {d}t\,\text {d}^3x\, \mathcal {L}, \end{aligned}$$
(7.6)

and must be varied with respect to the tetrad field \(e_{a\mu }\), the Lagrange multipliers \(\lambda _1^a\), \(\lambda _{2[i]}\), \(\lambda _3\), position \(z^{\mu }\) and linear momentum \(p_{\mu }\) of the object, as well as with respect to angle-type variables \(\varLambda ^{Ca}\) and spin tensor \(S_{ab}\) associated with the object.

Variation of the action \(\delta W=0\) leads to the equations of motion for the matter variables (here \(\text {d}\) and \(\text {D}\) denote ordinary and covariant total derivatives, respectivelyFootnote 14)

$$\begin{aligned} \frac{\text {D}S_{ab}}{\text {D}\tau }&= 0, \quad \frac{\text {D}\varLambda ^{Ca}}{\text {D}\tau } = 0, \quad u^{\mu } \equiv \frac{\text {d}z^\mu }{\text {d}\tau } = \lambda _3 p^{\mu }, \end{aligned}$$
(7.7)
$$\begin{aligned} \frac{\text {D}p_{\mu }}{\text {D}\tau }&= -\frac{1}{2} R_{\mu \rho ab}^{(4)} u^{\rho } S^{ab}, \end{aligned}$$
(7.8)

as well as to the usual Einstein equations with the stress-energy tensor (cf. Tulczyjew 1957 and Sect. 12.5 in Weinberg 1972Footnote 15)

$$\begin{aligned} T^{\mu \nu }&= \frac{e^{\mu }_a}{\sqrt{-g}} \frac{\delta ( \mathcal {L}_M + \mathcal {L}_C )}{\delta e_{a \nu }} \\&= \int \text {d}\tau \left[ \lambda _3 p^{\mu } p^{\nu } \frac{\delta ^{(4)}}{\sqrt{-g}} + \bigg ( u^{(\mu } S^{\nu )\alpha } \frac{\delta ^{(4)}}{\sqrt{-g}} \bigg )_{||\alpha } \right] , \end{aligned}$$
(7.9)

where \(R_{\mu \rho ab}^{(4)}\) is the 4-dimensional Riemann tensor in mixed indices and \(_{||\alpha }\) denotes the 4-dimensional covariant derivative. Here it was already used that preservation of the constraints in time requires \(\lambda _1^a\) to be proportional to \(p^a\) and \(\lambda _{2[i]}\) to be zero, so that \(\lambda _1^a\) and \(\lambda _{2[i]}\) drop out of the matter equations of motion and the stress-energy tensor. The Lagrange multiplier \(\lambda _3 = \lambda _3(\tau )\) represents the reparametrization invariance of the action [notice that \(u^{\mu } = \lambda _3 p^{\mu }\) together with the mass-shell constraint \(p^2 = -m^2c^2\) gives \(\lambda _3=\sqrt{-u^2}/(mc)\)]. Further, an antisymmetric part of the stress-energy tensor vanishes,

$$\begin{aligned} \frac{1}{2} \int \text {d}\tau \left( S^{\mu \nu } u^{\rho } \frac{\delta ^{(4)}}{\sqrt{-g}} \right) _{||\rho } = \frac{1}{2} \int \text {d}\tau \frac{\text {D}S^{\mu \nu }}{\text {D}\tau } \frac{\delta ^{(4)}}{\sqrt{-g}} = 0, \end{aligned}$$
(7.10)

and \({T^{\mu \nu }}_{||\nu }=0\) holds by virtue of the matter equations of motion. Obviously, the spin length s as defined by \(2\,s^2\equiv S_{ab}S^{ab}\) is conserved.

A fully reduced action is obtained by the elimination of all constraints and gauge degrees of freedom. However, after that the action has still to be transformed into canonical form by certain variable transformations. To perform this reduction we employ 3+1 splitting of spacetime by spacelike hypersurfaces \(t=\text {const}\). The timelike unit covector orthogonal to these hypersurfaces reads \(n_{\mu }=(-N,0,0,0)\) or \(n^{\mu }=(1,-N^i)/N\). The three matter constraints can then be solved in terms of \(p_i\), \(S_{ij}\), and \(\varLambda ^{[i](k)}\) as

$$\begin{aligned} np&\equiv n^{\mu } p_{\mu } = -\sqrt{m^2c^2 + \gamma ^{ij} p_{i} p_{j}}, \end{aligned}$$
(7.11)
$$\begin{aligned} nS_{i}&\equiv n^{\mu } S_{\mu i} = \frac{p_{k} \gamma ^{kj} S_{ji}}{np} = \gamma _{ij} nS^j,\end{aligned}$$
(7.12)
$$\begin{aligned} \varLambda ^{[j](0)}&= \varLambda ^{[j](i)} \frac{p_{(i)}}{p^{(0)}}, \quad \varLambda ^{[0]a} = -\frac{p^a}{mc}. \end{aligned}$$
(7.13)
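As a plausibility check of Eqs. (7.11) and (7.12), one can verify in flat space that the solved components indeed satisfy the mass-shell constraint and the covariant SSC (7.3). The sketch below assumes \(\gamma _{ij}=\delta _{ij}\), \(N=1\), \(N^i=0\), so that \(np = p_0\) and \(nS_i = S_{0i}\); the function names are hypothetical.

```python
import math


def solve_matter_constraints(p, S, m, c=1.0):
    """Flat-space version of Eqs. (7.11)-(7.12).

    p : covariant spatial momentum p_i
    S : antisymmetric 3x3 spin tensor S_ij
    Returns (np, nS_i).
    """
    n_p = -math.sqrt(m * m * c * c + sum(q * q for q in p))
    n_s = [sum(p[k] * S[k][i] for k in range(3)) / n_p for i in range(3)]
    return n_p, n_s


def ssc_residual(p, S, m, c=1.0):
    """Return the four components of p^b S_ab, which should all vanish."""
    n_p, n_s = solve_matter_constraints(p, S, m, c)
    # Contravariant momentum: raising the time index flips the sign.
    p_up = [-n_p] + list(p)
    # Assemble the antisymmetric 4x4 tensor S_ab with S_0i = nS_i.
    S4 = [[0.0] * 4 for _ in range(4)]
    for i in range(3):
        S4[0][i + 1] = n_s[i]
        S4[i + 1][0] = -n_s[i]
        for j in range(3):
            S4[i + 1][j + 1] = S[i][j]
    return [sum(p_up[b] * S4[a][b] for b in range(4)) for a in range(4)]
```

The time component of the residual vanishes because \(p_j p_k S_{kj}=0\) for antisymmetric \(S_{kj}\); the spatial components vanish by construction of \(nS_i\).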

We take \(\mathcal {L}_C=0\) from now on. A split of the Ricci rotation coefficients results in

$$\begin{aligned} \omega _{kij}&= -\varGamma _{jik} + e_{i,k}^a e_{aj},\end{aligned}$$
(7.14)
$$\begin{aligned} n^{\mu } \omega _{k \mu i}&= K_{ki} - g_{ij} \frac{N^j_{,k}}{N} + \frac{e_{ai}}{N} (e^a_{0,k} - e^a_{l,k} N^l),\end{aligned}$$
(7.15)
$$\begin{aligned} \omega _{0ij}&= N K_{ij} - N_{j;i} + e_{i,0}^a e_{aj},\end{aligned}$$
(7.16)
$$\begin{aligned} n^{\mu } \omega _{0 \mu i}&= K_{ij} N^j - N_{;i} - \gamma _{ij} \frac{N^j_{,0}}{N} + \frac{e_{ai}}{N} (e^a_{0,0} - e^a_{l,0} N^l), \end{aligned}$$
(7.17)

where \(_{;i}\) denotes the 3-dimensional covariant derivative, \(\varGamma _{jik}\) the 3-dimensional Christoffel symbols, and the extrinsic curvature \(K_{ij}\) is given by \(2NK_{ij}=-\gamma _{ij,0}+2N_{(i;j)}\), where \(_{(\cdots )}\) denotes symmetrization.

It is convenient to employ here the time gauge (see Schwinger 1963a and also Dirac 1962; Kibble 1963; Nelson and Teitelboim 1978),

$$\begin{aligned} e^{\mu }_{(0)} = n^{\mu }. \end{aligned}$$
(7.18)

Then lapse and shift turn into Lagrange multipliers in the matter action, like in the ADM formalism for nonspinning matter points. The condition (7.18) leads to the following relations:

$$\begin{aligned} e_{i}^{(0)}&= 0 = e_{(i)}^0, \quad e^{(0)}_0 = N = 1/e_{(0)}^0,\end{aligned}$$
(7.19)
$$\begin{aligned} N^i&= -N e^{i}_{(0)}, \quad e^{(i)}_0 = N^j e^{(i)}_j,\end{aligned}$$
(7.20)
$$\begin{aligned} \gamma _{ij}&= e_i^{(m)} e_{(m)j}, \quad \gamma ^{ij} = e^i_{(m)} e^{(m)j}, \end{aligned}$$
(7.21)

which effectively reduce the tetrad \(e^{a\mu }\) to a triad \(e^{(i)j}\).
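The time-gauge relations (7.19)–(7.21) can be cross-checked by rebuilding the 4-dimensional metric from the tetrad and comparing with the usual 3+1 form, \(g_{00}=-N^2+\gamma _{ij}N^iN^j\), \(g_{0i}=\gamma _{ij}N^j\), \(g_{ij}=\gamma _{ij}\). The following sketch assumes an arbitrary (not necessarily symmetric) triad stored as `triad[i][j]` \(= e^{(i)}_j\); the function names are hypothetical.

```python
def tetrad_from_adm(lapse, shift, triad):
    """Build e^{(a)}_mu in the time gauge from N, N^i and the triad e^{(i)}_j."""
    e = [[0.0] * 4 for _ in range(4)]
    e[0][0] = lapse                      # e^{(0)}_0 = N, e^{(0)}_i = 0
    for i in range(3):
        # e^{(i)}_0 = N^j e^{(i)}_j
        e[i + 1][0] = sum(shift[j] * triad[i][j] for j in range(3))
        for j in range(3):
            e[i + 1][j + 1] = triad[i][j]
    return e


def metric_from_tetrad(e):
    """g_{mu nu} = eta_{ab} e^{(a)}_mu e^{(b)}_nu with eta = diag(-1, 1, 1, 1)."""
    eta = [-1.0, 1.0, 1.0, 1.0]
    return [[sum(eta[a] * e[a][mu] * e[a][nu] for a in range(4))
             for nu in range(4)] for mu in range(4)]


def three_metric(triad):
    """gamma_ij = e^{(m)}_i e^{(m)}_j, cf. Eq. (7.21)."""
    return [[sum(triad[m][i] * triad[m][j] for m in range(3))
             for j in range(3)] for i in range(3)]
```

With these helpers, lapse and shift enter the metric only through \(g_{00}\) and \(g_{0i}\), which is why they act as Lagrange multipliers once the triad is taken as the field variable.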

The matter part of the Lagrangian density, after making use of the covariant SSC (7.3), turns into

$$\begin{aligned} \mathcal {L}_M = \mathcal {L}_{MK} + \mathcal {L}_{MC} + \mathcal {L}_{GK} + (\text{ td}), \end{aligned}$$
(7.22)

where \((\text{ td})\) denotes an irrelevant total divergence. After fixing the yet arbitrary parameter \(\tau \) by choosing \(\tau =z^0=ct\), where t is the time coordinate, the terms attributed to the kinetic matter part are given by

$$\begin{aligned} \mathcal {L}_{MK}&= \bigg [ p_{i} + K_{ij} nS^j + A^{kl} e_{(j)k} e_{l,i}^{(j)} -\bigg ( \frac{1}{2} S_{kj} + \frac{p_{(k} nS_{j)}}{np} \bigg ) \varGamma ^{kj}_{\,\,i} \bigg ] \dot{z}^{i} \delta + \frac{nS^i}{2 np} \dot{p}_i \delta \\&\quad +\bigg [ S_{(i)(j)} + \frac{nS_{(i)} p_{(j)} - nS_{(j)} p_{(i)}}{np} \bigg ] \frac{\varLambda _{[k]}^{(i)} \dot{\varLambda }^{[k](j)}}{2} \delta , \end{aligned}$$
(7.23)

where \(\delta \equiv \delta (x^i - z^i(t))\) and \(A^{ij}\) is defined by

$$\begin{aligned} \gamma _{ik} \gamma _{jl} A^{kl} = \frac{1}{2} S_{ij} + \frac{nS_i p_j}{2 np}. \end{aligned}$$
(7.24)

The matter parts of the gravitational constraints result from

$$\begin{aligned} \mathcal {L}_{MC} = -N\mathcal {H}^{\text{matter}} + N^i \mathcal {H}^{\text{matter}}_i, \end{aligned}$$
(7.25)

where the densities \(\mathcal {H}^{\text{matter}}\) and \(\mathcal {H}^{\text{matter}}_i\) are computed from Eqs. (2.11)–(2.12) and (7.9). After employing the covariant SSC one gets (Steinhoff et al. 2008c)

$$\begin{aligned} \mathcal {H}^{\text{matter}}&= \sqrt{\gamma }T_{\mu \nu } n^{\mu }n^{\nu } = -np \delta - K^{ij} \frac{p_i nS_j}{np} \delta - ( nS^k \delta )_{;k}, \end{aligned}$$
(7.26)
$$\begin{aligned} \mathcal {H}^{\text{matter}}_i&= -\sqrt{\gamma }T_{i \nu }n^{\nu } = (p_i + K_{ij} nS^j ) \delta + \bigg ( \frac{1}{2} \gamma ^{mk} S_{ik} \delta + \delta _i^{(k} \gamma ^{l)m} \frac{p_k nS_l}{np} \delta \bigg )_{;m}. \end{aligned}$$
(7.27)

Further, some terms attributed to the kinetic part of the gravitational field appear as

$$\begin{aligned} \mathcal {L}_{GK} = A^{ij} e_{(k)i} \dot{e}_{j}^{(k)} \delta . \end{aligned}$$
(7.28)

Now we proceed to Newton-Wigner (NW) variables \(\hat{z}^i\), \(P_i\), \(\hat{S}_{(i)(j)}\), and \(\hat{\varLambda }^{[i](j)}\), which turn the kinetic matter part \(\mathcal {L}_{MK}\) into canonical form. The variable transformations read

$$\begin{aligned} z^i&= \hat{z}^i - \frac{nS^i}{mc - np}, \quad nS_i = -\frac{p_k\gamma ^{kj}\hat{S}_{ji}}{mc}, \end{aligned}$$
(7.29)
$$\begin{aligned} S_{ij}&= \hat{S}_{ij} - \frac{p_i nS_{j}}{mc-np} + \frac{p_j nS_i}{mc - np}, \end{aligned}$$
(7.30)
$$\begin{aligned} \varLambda ^{[i](j)}&= \hat{\varLambda }^{[i](k)} \bigg ( \delta _{kj} + \frac{p_{(k)}p^{(j)}}{mc(mc-np)} \bigg ), \end{aligned}$$
(7.31)
$$\begin{aligned} P_i&= p_i + K_{ij} nS^j + \hat{A}^{kl} e_{(j)k} e_{l,i}^{(j)} - \bigg ( \frac{1}{2} S_{kj} + \frac{p_{(k} nS_{j)}}{np} \bigg ) \varGamma ^{kj}_{\,\,i}, \end{aligned}$$
(7.32)

where \(\hat{A}^{ij}\) is given by

$$\begin{aligned} \gamma _{ik} \gamma _{jl} \hat{A}^{kl} = \frac{1}{2} \hat{S}_{ij} + \frac{mc p_{(i} nS_{j)}}{np (mc-np)}. \end{aligned}$$
(7.33)

The NW variables have the important properties \(\hat{S}_{(i)(j)} \hat{S}_{(i)(j)}=2\,s^2=\text{ const }\) and \(\hat{\varLambda }_{[k]}^{(i)}\hat{\varLambda }^{[k](j)} = \delta _{ij}\), which implies that \(\delta \hat{\theta }^{(i)(j)}\equiv \hat{\varLambda }_{[k]}^{(i)}\text {d}\hat{\varLambda }^{[k](j)}\) is antisymmetric. The redefinitions of position, spin tensor, and angle-type variables are actually quite natural generalizations of their Minkowski space versions to curved spacetime, cf., Hanson and Regge (1974) and Fleming (1965). However, there is no difference between linear momentum \(p_i\) and canonical momentum \(P_i\) in the Minkowski case. In these NW variables, one has

$$\begin{aligned} \mathcal {L}_{GK} + \mathcal {L}_{MK} = \hat{\mathcal {L}}_{GK} + \hat{\mathcal {L}}_{MK} + (\text{td}), \end{aligned}$$
(7.34)

with [from now on \(\delta = \delta (x^i - \hat{z}^i(t))\)]

$$\begin{aligned} \hat{\mathcal {L}}_{MK}&= P_i \dot{\hat{z}}^i \delta + \frac{1}{2} \hat{S}_{(i)(j)} \dot{\hat{\theta }}^{(i)(j)} \delta , \end{aligned}$$
(7.35)
$$\begin{aligned} \hat{\mathcal {L}}_{GK}&= \hat{A}^{ij} e_{(k)i} e_{j,0}^{(k)} \delta . \end{aligned}$$
(7.36)

Notice that all \(\dot{p}_i\) terms in the action have been canceled by the redefinition of the position, and also all \(K_{ij}\) terms were eliminated from \(\mathcal {L}_{MC}\) and \(\mathcal {L}_{MK}\) by the redefinition of the linear momentum. If the terms explicitly depending on the triad \(e^{(i)}_j\) are neglected, the known source terms of the Hamiltonian and momentum constraints in canonical variables are obtained [cf. Eqs. (4.23) and (4.25) in Steinhoff et al. (2008c)].
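The spin part of the NW transformation, Eqs. (7.29) and (7.30), can be tested in the Minkowski limit: starting from \(\hat{S}_{ij}\) and \(p_i\), the reconstructed covariant components should satisfy the solved SSC (7.12), and the invariant \(S_{ab}S^{ab}\) should equal \(\hat{S}_{ij}\hat{S}_{ij}=2s^2\). The sketch below assumes \(\gamma _{ij}=\delta _{ij}\) and uses hypothetical function names; the position shift in Eq. (7.29) is not covered.

```python
import math


def nw_to_covariant_spin(p, S_hat, m, c=1.0):
    """Flat-space Eqs. (7.29)-(7.30): return (np, nS_i, S_ij) from NW spin."""
    n_p = -math.sqrt(m * m * c * c + sum(q * q for q in p))
    # nS_i = -p_k S_hat_ki / (m c)
    n_s = [-sum(p[k] * S_hat[k][i] for k in range(3)) / (m * c)
           for i in range(3)]
    denom = m * c - n_p
    # S_ij = S_hat_ij - (p_i nS_j - p_j nS_i) / (m c - np)
    S = [[S_hat[i][j] - (p[i] * n_s[j] - p[j] * n_s[i]) / denom
          for j in range(3)] for i in range(3)]
    return n_p, n_s, S


def covariant_spin_invariant(n_s, S):
    """S_ab S^ab = S_ij S_ij - 2 nS_i nS_i in flat space (equals 2 s^2)."""
    spatial = sum(S[i][j] ** 2 for i in range(3) for j in range(3))
    return spatial - 2.0 * sum(q * q for q in n_s)
```

The minus sign in `covariant_spin_invariant` comes from raising one time index of \(S_{0i}\) with the Minkowski metric.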

The final step goes with the ADM action functional of the gravitational field (Arnowitt et al. 1962; DeWitt 1967; Regge and Teitelboim 1974), but in tetrad form as derived by Deser and Isham (1976). The canonical momentum conjugate to \(e_{(k)j}\) is given by

$$\begin{aligned} \bar{\pi }^{(k)j} = \frac{8\pi G}{c^3}\frac{\partial \mathcal {L}}{\partial e_{(k)j,0}} = e_i^{(k)} \pi ^{ij} + e_i^{(k)} \frac{8\pi G}{c^3}\hat{A}^{ij} \delta , \end{aligned}$$
(7.37)

where the momentum \(\pi ^{ij}\) is given by

$$\begin{aligned} \pi ^{ij} = \sqrt{\gamma } (\gamma ^{ij}\gamma ^{kl} - \gamma ^{ik}\gamma ^{jl})K_{kl}. \end{aligned}$$
(7.38)

Legendre transformation leads to

$$\begin{aligned} \hat{\mathcal {L}}_{GK} + \mathcal {L}_G = \frac{c^3}{8\pi G} \bar{\pi }^{(k)j} e_{(k)j,0} - \frac{c^4}{16\pi G} \mathcal {E}_{i,i} + \mathcal {L}_{GC} + (\text{td}). \end{aligned}$$
(7.39)

In asymptotically flat spacetimes the quantity \(\mathcal {E}_i\) is given by [cf. Eq. (2.6)]

$$\begin{aligned} \mathcal {E}_i = \gamma _{ij,j} - \gamma _{jj,i}. \end{aligned}$$
(7.40)

The total energy then reads

$$\begin{aligned} E = \frac{c^4}{16\pi G}\oint \text {d}^2s_i\,\mathcal {E}_i. \end{aligned}$$
(7.41)
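A simple illustration of Eqs. (7.40) and (7.41) is a conformally flat metric \(\gamma _{ij}=\varPsi ^4\delta _{ij}\) with \(\varPsi = 1 + GM/(2c^2r)\), the isotropic-coordinate form of a single nonspinning mass at rest (an example chosen here, not taken from the text above). Then \(\mathcal {E}_i = -2\,\partial _i\varPsi ^4\) is radial, and the surface integral over a sphere of radius \(R\) gives \(Mc^2\varPsi (R)^3\), which tends to \(Mc^2\) as \(R\rightarrow \infty \). The function names in this sketch are hypothetical.

```python
import math


def psi(r, mass, G=1.0, c=1.0):
    """Conformal factor of the assumed example, Psi = 1 + G M / (2 c^2 r)."""
    return 1.0 + G * mass / (2.0 * c ** 2 * r)


def adm_energy_on_sphere(radius, mass, G=1.0, c=1.0):
    """Evaluate (c^4 / 16 pi G) * surface integral of E_i over a sphere."""
    # Analytic radial derivative of Psi^4.
    dpsi4_dr = 4.0 * psi(radius, mass, G, c) ** 3 * (
        -G * mass / (2.0 * c ** 2 * radius ** 2))
    # For gamma_ij = Psi^4 delta_ij: E_i = gamma_ij,j - gamma_jj,i = -2 d_i Psi^4.
    radial_component = -2.0 * dpsi4_dr
    flux = 4.0 * math.pi * radius ** 2 * radial_component
    return c ** 4 / (16.0 * math.pi * G) * flux
```

Evaluating `adm_energy_on_sphere` at increasing radii shows the approach to the asymptotic value; only the limit of infinite radius is the total energy of Eq. (7.41).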

The constraint part of the gravitational Lagrangian density takes the form

$$\begin{aligned} \mathcal {L}_{GC} = - N \mathcal {H}^{\text{field}} + N^i \mathcal {H}^{\text{field}}_i, \end{aligned}$$
(7.42)

with

$$\begin{aligned} \mathcal {H}^{\text{field}}&= - \frac{c^4}{16\pi G\sqrt{\gamma }} \left[ \gamma R + \frac{1}{2} \left( \gamma _{ij} \pi ^{ij} \right) ^2 - \gamma _{ij} \gamma _{k l} \pi ^{ik} \pi ^{jl}\right] , \end{aligned}$$
(7.43)
$$\begin{aligned} \mathcal {H}^{\text{field}}_i&= \frac{c^3}{8\pi G} \gamma _{ij} \pi ^{jk}_{\,\, ; k} \,, \end{aligned}$$
(7.44)

where R is the 3-dimensional Ricci scalar. Due to the symmetry of \(\pi ^{ij}\), not all components of \(\bar{\pi }^{(k)j}\) are independent variables (i.e., the Legendre map is not invertible), leading to the additional constraint (\([\ldots ]\) denotes anti-symmetrization)

$$\begin{aligned} \bar{\pi }^{[ij]} = \frac{8\pi G}{c^3} \hat{A}^{[ij]} \delta . \end{aligned}$$
(7.45)

This constraint will be eliminated by going to the spatial symmetric gauge (for the frame \(e_{(i)j}\))

$$\begin{aligned} e_{(i)j} = e_{ij} = e_{ji}, \quad e^{(i)j} = e^{ij} = e^{ji}. \end{aligned}$$
(7.46)

Then the triad is fixed as the matrix square-root of the 3-dimensional metric, \(e_{ij} e_{jk} = \gamma _{ik}\), or, in matrix notation,

$$\begin{aligned} (e_{ij}) = \sqrt{(\gamma _{ij})}. \end{aligned}$$
(7.47)
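For a metric close to the flat one, \(\gamma _{ij} = \delta _{ij} + h_{ij}\), the matrix square root of Eq. (7.47) can be approximated by the binomial series \(e = 1 + h/2 - h^2/8 + h^3/16 + \mathcal {O}(h^4)\). The sketch below is merely one way to illustrate the symmetric gauge numerically; the truncation order and the function names are choices made for this example.

```python
def matmul3(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def triad_from_metric_series(gamma):
    """Symmetric triad e_ij with e e = gamma, truncated after the cubic term."""
    h = [[gamma[i][j] - (1.0 if i == j else 0.0) for j in range(3)]
         for i in range(3)]
    h2 = matmul3(h, h)
    h3 = matmul3(h2, h)
    # sqrt(1 + h) = 1 + h/2 - h^2/8 + h^3/16 + O(h^4)
    return [[(1.0 if i == j else 0.0) + h[i][j] / 2.0
             - h2[i][j] / 8.0 + h3[i][j] / 16.0
             for j in range(3)] for i in range(3)]


def reconstruction_error(gamma):
    """Largest entry of e_ij e_jk - gamma_ik for the truncated triad."""
    e = triad_from_metric_series(gamma)
    ee = matmul3(e, e)
    return max(abs(ee[i][k] - gamma[i][k]) for i in range(3) for k in range(3))
```

Because every term is a power of the symmetric matrix \(h_{ij}\), the resulting triad is automatically symmetric, as required by Eq. (7.46).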

Therefore, we can define a quantity \(B^{kl}_{ij}\) as

$$\begin{aligned} e_{k[i} e_{j]k,\mu } = B^{kl}_{ij} \gamma _{kl,\mu }, \end{aligned}$$
(7.48)

or, in explicit form,

$$\begin{aligned} 2 B^{kl}_{ij} = e_{mi} \frac{\partial e_{mj}}{\partial g_{kl}} - e_{mj} \frac{\partial e_{mi}}{\partial g_{kl}}. \end{aligned}$$
(7.49)

This expression may be evaluated perturbatively, cf., Steinhoff et al. (2008c). One also has \(B^{kl}_{ij} \delta _{kl} = 0\). Furthermore,

$$\begin{aligned} e_{(k)i} e_{j,\mu }^{(k)} = B^{kl}_{ij} \gamma _{kl,\mu } + \frac{1}{2} \gamma _{ij,\mu }, \end{aligned}$$
(7.50)

which yields

$$\begin{aligned} \bar{\pi }^{(k)j} e_{(k)j,0} = \frac{1}{2} \pi _{\text{can}}^{ij} \gamma _{ij,0}, \end{aligned}$$
(7.51)

with the new canonical field momentum

$$\begin{aligned} \pi _{\text{can}}^{ij} = \pi ^{ij} + \frac{8\pi G}{c^3} \hat{A}^{(ij)} \delta + \frac{16\pi G}{c^3} B^{ij}_{kl} \hat{A}^{[kl]} \delta . \end{aligned}$$
(7.52)

The gravitational constraints arising from the variations \(\delta N\) and \(\delta N^i\) read,

$$\begin{aligned} \mathcal {H}^{\text{field}} + \mathcal {H}^{\text{matter}} = 0, \quad \mathcal {H}^{\text{field}}_i + \mathcal {H}^{\text{matter}}_i = 0. \end{aligned}$$
(7.53)

They are eliminated by imposing the gauge conditions

$$\begin{aligned} 3 \gamma _{ij,j} - \gamma _{jj,i} = 0, \quad \pi ^{ii}_{\text{can}} = 0, \end{aligned}$$
(7.54)

which allow for the decompositions

$$\begin{aligned} \gamma _{ij} = \varPsi ^4 \delta _{ij} + h^{\text{TT}}_{ij}, \quad \pi ^{ij}_{\text{can}} = \tilde{\pi }^{ij}_{\text{can}} + \pi ^{ij\text{TT}}_{\text{can}}, \end{aligned}$$
(7.55)

where \(h^{\text{TT}}_{ij}\) and \(\pi ^{ij\text{TT}}_{\text{can}}\) are transverse and traceless quantities, and the longitudinal part \(\tilde{\pi }^{ij}_{\text{can}}\) is related to a vector potential \(V^i_{\text{can}}\) by

$$\begin{aligned} \tilde{\pi }^{ij}_{\text{can}} = V^i_{\text{can},j} + V^j_{\text{can},i} - \frac{2}{3} \delta _{ij} V^k_{\text{can},k}. \end{aligned}$$
(7.56)

Let us note that in the construction of \(V^i_{\text{can}}\) the operator \(\varDelta ^{-1}\) is employed [see the text below Eq. (2.15)].

The gravitational constraints can now be solved for \(\varPsi \) and \(\tilde{\pi }^{ij}_{\text{can}}\), leaving \(h^{\text{TT}}_{ij}\) and \(\pi ^{ij\text{TT}}_{\text{can}}\) as the final degrees of freedom of the gravitational field. Notice that our gauge condition \(\pi ^{ii}_{\text{can}} = 0\) deviates from the original ADM one \(\pi ^{ii} = 0\) by spin corrections (which enter at 5PN order). The final fully reduced action reads,

$$\begin{aligned} W = \frac{c^4}{16\pi G}\int \text {d}^4 x\, \pi ^{ij\text{TT}}_{\text{can}} h^{\text{TT}}_{ij,0} + \int \text {d}t \bigg [ P_i \dot{\hat{z}}^i + \frac{1}{2} \hat{S}_{(i)(j)} \dot{\hat{\theta }}^{(i)(j)} - E \bigg ]. \end{aligned}$$
(7.57)

The dynamics is completely described by the ADM energy E, which is the total Hamiltonian (\(E=H\)) once it is expressed in terms of the canonical variables. This Hamiltonian can be written as the volume integral

$$\begin{aligned} H[\hat{z}^i, P_i, \hat{S}_{(i)(j)}, h^{\text{TT}}_{ij}, \pi ^{ij\text{TT}}_{\text{can}}] = -\frac{c^4}{2\pi G} \int \text {d}^3 x\, \varDelta \varPsi [\hat{z}^i, P_i, \hat{S}_{(i)(j)}, h^{\text{TT}}_{ij}, \pi ^{ij\text{TT}}_{\text{can}}]. \end{aligned}$$
(7.58)

The equal-time Poisson bracket relations take the standard form,

$$\begin{aligned} \{ \hat{z}^i, P_j\} = \delta _{ij}, \quad \{\hat{S}_{(i)}, \hat{S}_{(j)}\} = \epsilon _{ijk} \hat{S}_{(k)}, \end{aligned}$$
(7.59)
$$\begin{aligned} \{h^{\text{TT}}_{ij}(\textbf{x},t), \pi ^{kl\text{TT}}_{\text{can}}(\textbf{x}',t)\} = \frac{16\pi G}{c^3} \delta ^{\text{TT}kl}_{ij}\delta (\textbf{x} - \textbf{x}'), \end{aligned}$$
(7.60)

zero otherwise, where \(\hat{S}_{(i)}=\frac{1}{2}\epsilon _{(i)(j)(k)}\hat{S}_{(j)(k)}\), \(\epsilon _{(i)(j)(k)}=\epsilon _{ijk}=(i-j)(j-k)(k-i)/2\), and \(\delta ^{\text{TT}ij}_{mn}\) is the TT-projection operator, see, e.g., Steinhoff et al. (2008c). Though the commutation relations (7.59) and (7.60) are sufficient for the variables on which the Hamiltonian (7.58) depends, for completeness we add the non-trivial ones needed when a Hamiltonian, besides \(\hat{S}_{(i)(j)}\), also depends on the 3-dimensional rotation matrix \(\hat{\varLambda }_{[i](j)}\) (“angle” variables). They read

$$\begin{aligned} \{\hat{\varLambda }_{[i](j)}, \hat{S}_{(k)(l)}\} = \hat{\varLambda }_{[i](k)} \delta _{lj} - \hat{\varLambda }_{[i](l)}\delta _{kj}. \end{aligned}$$
(7.61)

The angular velocity tensor \(\hat{\varOmega }^{(i)(j)}\), the Legendre dual to \(\hat{S}_{(i)(j)}\), i.e. \(\hat{\varOmega }^{(i)(j)} = 2 \partial H/\partial \hat{S}_{(i)(j)}\), is defined by \(\hat{\varOmega }^{(i)(j)} = \delta \hat{\theta }^{(i)(j)}/\text {d}t = \hat{\varLambda }_{[k]}^{\,(i)}\dot{\hat{\varLambda }}^{[k](j)}\), and the time derivative of the spin tensor thus reads

$$\begin{aligned} \dot{\hat{S}}_{(i)(j)} = 2\hat{S}_{(k)[(i)}\hat{\varOmega }_{(j)](k)} + \hat{\varLambda }^{[k](j)}\frac{\partial H}{\partial \hat{\varLambda }^{[k](i)}} - \hat{\varLambda }^{[k](i)}\frac{\partial H}{\partial \hat{\varLambda }^{[k](j)}}. \end{aligned}$$
(7.62)
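The spin bracket in Eq. (7.59) and the closed formula for \(\epsilon _{ijk}\) lend themselves to a short numerical illustration. The sketch below assumes a Hamiltonian that depends on the spin only through the vector \(\hat{S}_{(i)}\) (no dependence on the angle variables), and writes \(\varOmega _j \equiv \partial H/\partial \hat{S}_{(j)}\); under these assumptions the bracket gives \(\dot{\hat{\textbf{S}}} = \boldsymbol{\varOmega }\times \hat{\textbf{S}}\), so the spin length is conserved. Function names are hypothetical.

```python
def eps(i, j, k):
    """Levi-Civita symbol for indices in {1, 2, 3} via (i-j)(j-k)(k-i)/2."""
    return (i - j) * (j - k) * (k - i) // 2


def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def spin_bracket(grad_f, grad_g, S):
    """{f, g} = eps_ijk S_k (df/dS_i) (dg/dS_j), following from Eq. (7.59)."""
    return sum(eps(i, j, k) * S[k - 1] * grad_f[i - 1] * grad_g[j - 1]
               for i in (1, 2, 3) for j in (1, 2, 3) for k in (1, 2, 3))


def spin_rate(S, omega):
    """dS_i/dt = {S_i, H} with omega_j = dH/dS_j (assumed spin-only dependence)."""
    unit = [[1.0 if a == b else 0.0 for b in range(3)] for a in range(3)]
    return [spin_bracket(unit[i], omega, S) for i in range(3)]
```

Since the gradient of \(\hat{\textbf{S}}^2\) is \(2\hat{\textbf{S}}\), the call `spin_bracket([2 * s for s in S], omega, S)` evaluates \(\{\hat{\textbf{S}}^2, H\}\), which vanishes for any `omega`.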

The Hamiltonian H of Eq. (7.58) generates the time evolution in the reduced matter+field phase space. Generalization and application to many-body systems is quite straightforward, see Steinhoff et al. (2008c). The total linear (\(P_i^{\text{tot}}\)) and angular (\(J_{ij}^{\text{tot}}\)) momenta take the forms (particle labels are denoted by a),

$$\begin{aligned} P_i^{\text{tot}}&= \sum _a P_{ai} - \frac{c^3}{16\pi G} \int \text {d}^3x \, \pi _{\text{can}}^{kl\text{TT}} h^{\text{TT}}_{kl,i}, \end{aligned}$$
(7.63)
$$\begin{aligned} J_{ij}^{\text{tot}}&= \sum _a ( \hat{z}_a^i P_{aj} - \hat{z}_a^j P_{ai} + \hat{S}_{a(i)(j)}) - \frac{c^3}{8\pi G} \int \text {d}^3x\, ( \pi _{\text{can}}^{ik\text{TT}} h^{\text{TT}}_{kj}- \pi _{\text{can}}^{jk\text{TT}} h^{\text{TT}}_{ki} ) \\&\quad - \frac{c^3}{16\pi G} \int \text {d}^3x\, ( x^i \pi _{\text{can}}^{kl\text{TT}} h^{\text{TT}}_{kl,j} - x^j \pi _{\text{can}}^{kl\text{TT}} h^{\text{TT}}_{kl,i}), \end{aligned}$$
(7.64)

and are obtained from the reduced action in the standard Noether manner.

7.2 Spin-squared dynamics

For the construction of the spin-squared terms we resort to the well-known stress-energy tensor for pole-dipole particles but augmented by quadrupolar terms. The stress-energy tensor density then reads (Steinhoff et al. 2008b)

$$\begin{aligned} \sqrt{-g}\,T^{\mu \nu } = \int \text {d}\tau \bigg [ t^{\mu \nu } \delta ^{(4)} + ( t^{\mu \nu \alpha } \delta ^{(4)} )_{||\alpha } + ( t^{\mu \nu \alpha \beta } \delta ^{(4)} )_{||\alpha \beta } \bigg ]. \end{aligned}$$
(7.65)

The quantities \(t^{\mu \nu \dots }=t^{\nu \mu \dots }\) only depend on the four-velocity \(u^{\mu }\equiv \text {d}z^{\mu }/{\text {d}\tau }\), where \(z^{\mu }(\tau )\) is the parametrization of the worldline in terms of its proper time \(\tau \), and on the spin and quadrupole tensors. Notice that, in general, the quadrupole expressions include not only the mass-quadrupole moment, but also the current-quadrupole moment and the stress-quadrupole moment (see, e.g., Steinhoff and Puetzfeld 2010). For the pole-dipole particle \(t^{\mu \nu \alpha \beta }\) is zero. In contrast to the stress-energy tensor of pole-dipole particles, the Riemann tensor shows up at the quadrupolar level. However, the source terms of the constraints,

$$\begin{aligned} {\gamma }^\frac{1}{2} T^{\mu \nu }n_\mu n_\nu = {\mathcal {H}}^{\text{matter}}, \quad -{\gamma }^\frac{1}{2}T^{\;\mu }_i n_\mu = {\mathcal {H}}^{\text{matter}}_i, \end{aligned}$$
(7.66)

at the approximation considered here, do not include the Riemann tensor.

Regarding rotating black holes, the mass-quadrupole tensor \(Q^{ij}_1\) of object 1 is given by Steinhoff et al. (2008b) (also see, e.g., Thorne 1980 and Damour 2001)

$$\begin{aligned} m_1 c^2 Q^{ij}_1 \equiv \gamma ^{ik}\gamma ^{jl}\gamma ^{mn}\hat{S}_{1km}\hat{S}_{1nl} + \frac{2}{3}{} \textbf{S}^2_1\gamma ^{ij} = e^i_{(k)}e^j_{(l)}\big (S_{1(k)}S_{1(l)} - \frac{1}{3}\textbf{S}^2_1\delta _{(k)(l)}\big ), \end{aligned}$$
(7.67)

where \(\textbf{S}_1=(S_{1(i)})\) is the three-dimensional Euclidean spin vector related to a spin tensor \(\hat{S}_{1ij}\) with the help of a dreibein \(e_{i(j)}\) by \(\hat{S}_{1ij}=e_{i(k)}e_{j(l)}\epsilon _{klm}S_{1(m)}\). The quantity \(\textbf{S}_1^2\) is conserved in time,

$$\begin{aligned} 2\textbf{S}_1^2 = \gamma ^{ik} \gamma ^{jl} \hat{S}_{1ij} \hat{S}_{1kl} = \text{ const }. \end{aligned}$$
(7.68)
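The two forms of the mass-quadrupole tensor in Eq. (7.67), and the normalization in Eq. (7.68), can be compared directly in flat space, where \(\gamma ^{ij}=\delta _{ij}\) and the dreibein is the unit matrix. The following sketch computes \(m_1c^2Q^{ij}_1\) both from the spin tensor and from the spin vector; the function names are hypothetical.

```python
def spin_tensor(S):
    """S_hat_ij = eps_ijk S_k for a Euclidean spin vector S."""
    return [[0.0, S[2], -S[1]],
            [-S[2], 0.0, S[0]],
            [S[1], -S[0], 0.0]]


def quadrupole_from_tensor(S_hat):
    """m c^2 Q_ij = S_hat_im S_hat_mj + (2/3) S^2 delta_ij, flat-space Eq. (7.67)."""
    # Eq. (7.68): 2 S^2 = S_hat_ij S_hat_ij
    s2 = 0.5 * sum(S_hat[i][j] ** 2 for i in range(3) for j in range(3))
    return [[sum(S_hat[i][m] * S_hat[m][j] for m in range(3))
             + (2.0 / 3.0) * s2 * (1.0 if i == j else 0.0)
             for j in range(3)] for i in range(3)]


def quadrupole_from_vector(S):
    """m c^2 Q_ij = S_i S_j - (1/3) S^2 delta_ij."""
    s2 = sum(q * q for q in S)
    return [[S[i] * S[j] - s2 / 3.0 * (1.0 if i == j else 0.0)
             for j in range(3)] for i in range(3)]
```

Both functions return a symmetric, trace-free matrix, as expected for a mass-quadrupole tensor.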

The source terms of the constraints in the static case (independent of the linear momenta \(P_i\) of the objects, which means taking \(P_i=0\), but \(p_i\ne 0\)) read

$$\begin{aligned} \mathcal {H}^{\text{matter}}_{S_1^2,\,{\text{static}}}&= c_1 \left( c^2 Q^{ij}_1 \delta _1 \right) _{; ij} + \frac{1}{8 m_1} \gamma _{mn} \gamma ^{pj} \gamma ^{ql} {\gamma ^{mi}}_{,p} {\gamma ^{nk}}_{,q} \hat{S}_{1 ij} \hat{S}_{1 kl} \delta _1 \\&\quad + \frac{1}{4m_1} \left( \gamma ^{ij} \gamma ^{mn} \gamma ^{kl}_{\,\,,m} \hat{S}_{1 ln} \hat{S}_{1 jk} \delta _1 \right) _{,i}, \end{aligned}$$
(7.69)
$$\begin{aligned} \mathcal {H}_{i \,\text{static}}^{\text{matter}}&= \frac{1}{2} \left( \gamma ^{mk}\hat{S}_{ik} \delta \right) _{,m} + \mathcal {O}{(\hat{S}^3)}. \end{aligned}$$
(7.70)

Here \(c_1\) is a constant that must be fixed by additional considerations, such as matching to the Kerr metric. The noncovariant terms are due to the transition from three-dimensional covariant linear momentum \(p_i\) to canonical linear momentum \(P_i\) given by [cf. Eq. (4.24) in Steinhoff et al. 2008c or Eq. (7.32) above]

$$\begin{aligned} p_i = P_i - \frac{1}{2} \gamma _{ij} \gamma ^{lm} \gamma ^{jk}_{\,\,,m} \hat{S}_{kl} + \mathcal {O}(P^2) + \mathcal {O}(\hat{S}^2). \end{aligned}$$
(7.71)

Thus the source terms are indeed covariant when the point-mass and linear-in-spin terms depending on the (noncovariant) canonical linear momentum are added, cf. Eqs. (7.26) and (7.27).

The simple structure of the \(Q_1^{ij}\) term in Eq. (7.69) is just the structure of minimal coupling of the Minkowski space mass-quadrupole term to gravity. As shown by Steinhoff et al. (2008b), the most general ansatz for the spin-squared coupling including the three-dimensional Ricci tensor reduces to the shown term. Here we may argue that two facts together make the ansatz unique: on the one hand, the requirement of the correct limit to flat space, and on the other hand, the undefined multiplication with a second delta-function that would result in that limit from the Ricci tensor of the spinning “point” particle. A deeper analysis of the structure of nonlinear-in-spin couplings can be found in, e.g., Levi and Steinhoff (2015).

7.3 Approximate Hamiltonians for spinning binaries

All the approximate Hamiltonians presented in this subsection have been derived or rederived in recent papers by one of the authors and his collaborators employing the canonical formalisms presented in Sects. 7.1 and 7.2 (Damour et al. 2008c; Steinhoff et al. 2008b, c). They are two-point-particle Hamiltonians, which can be used to approximately model binaries made of spinning black holes. For the rest of this section, canonical variables (which are arguments of displayed Hamiltonians) are not hatted any further. We use \(a,b=1,2\) as the body labels, and for \(a \ne b\) we define \(r_{ab}{} \textbf{n}_{ab}\equiv \textbf{x}_{a}-\textbf{x}_{b}\) with \(\textbf{n}_{ab}^2=1\).

The Hamiltonian of leading-order (LO) spin-orbit coupling reads (let us note that in the following \(\textbf{p}_a\) will denote the canonical linear momenta)

$$\begin{aligned} H_{\text {SO}}^{\text {LO}} = \sum _a \sum _{b \ne a} \frac{G}{c^2r_{a b}^2} (\textbf{S}_a \times \textbf{n}_{ab}) \cdot \left( \frac{3 m_b}{2 m_a} \textbf{p}_a - 2 \textbf{p}_b \right) , \end{aligned}$$
(7.72)

and the one of leading-order spin(1)-spin(2) coupling is given by

$$\begin{aligned} H_{{S_1S_2}}^{\text {LO}} = \sum _a \sum _{b \ne a} \frac{G}{2 c^2 r_{ab}^3} \big ( 3 (\textbf{S}_a\cdot \textbf{n}_{ab})(\textbf{S}_b\cdot \textbf{n}_{ab}) - (\textbf{S}_a\cdot \textbf{S}_b) \big ). \end{aligned}$$
(7.73)

More complicated is the Hamiltonian with spin-squared terms, because it relates to the rotational deformation of spinning black holes. To leading order, say for spin(1), it reads

$$\begin{aligned} H_{{S_1^2}}^{\text {LO}} = \frac{G m_2}{2 c^2 m_1 r_{12}^3} \big ( 3 (\textbf{S}_1\cdot \textbf{n}_{12})^2 - \textbf{S}_1^2 \big ). \end{aligned}$$
(7.74)
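The three leading-order spin Hamiltonians (7.72)–(7.74) are simple enough to be transcribed directly. The sketch below does so for a two-body system; the function names, the argument layout (lists indexed by body), and the default units \(G=c=1\) are choices made for this illustration only.

```python
import math


def dot(a, b):
    return sum(a[i] * b[i] for i in range(3))


def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def separation(x, a, b):
    """Return r_ab and the unit vector n_ab pointing from body b to body a."""
    r_vec = [x[a][i] - x[b][i] for i in range(3)]
    r = math.sqrt(dot(r_vec, r_vec))
    return r, [q / r for q in r_vec]


def h_so_lo(m, x, p, S, G=1.0, c=1.0):
    """Leading-order spin-orbit Hamiltonian, Eq. (7.72)."""
    total = 0.0
    for a, b in ((0, 1), (1, 0)):
        r, n = separation(x, a, b)
        combo = [1.5 * m[b] / m[a] * p[a][i] - 2.0 * p[b][i] for i in range(3)]
        total += G / (c ** 2 * r ** 2) * dot(cross(S[a], n), combo)
    return total


def h_s1s2_lo(x, S, G=1.0, c=1.0):
    """Leading-order spin(1)-spin(2) Hamiltonian, Eq. (7.73)."""
    total = 0.0
    for a, b in ((0, 1), (1, 0)):
        r, n = separation(x, a, b)
        total += G / (2.0 * c ** 2 * r ** 3) * (
            3.0 * dot(S[a], n) * dot(S[b], n) - dot(S[a], S[b]))
    return total


def h_s1sq_lo(m, x, S1, G=1.0, c=1.0):
    """Leading-order spin(1)-squared Hamiltonian for black holes, Eq. (7.74)."""
    r, n = separation(x, 0, 1)
    return G * m[1] / (2.0 * c ** 2 * m[0] * r ** 3) * (
        3.0 * dot(S1, n) ** 2 - dot(S1, S1))
```

In the centre-of-mass frame, with \(\textbf{p}_1=-\textbf{p}_2\), `h_so_lo` reduces to a sum of terms proportional to \(\textbf{S}_a\cdot (\textbf{n}_{12}\times \textbf{p}_1)\), which is a convenient hand check of the transcription.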

The LO spin-orbit and spin(a)-spin(b) centre-of-mass vectors take the form

$$\begin{aligned} \textbf{G}_{\text {SO}}^{\text {LO}} = \sum _a \frac{1}{2c^2 m_a} (\textbf{p}_a \times \textbf{S}_a), \quad \textbf{G}_{{S_1S_2}}^{\text {LO}} =0, \quad \textbf{G}_{{S^2_1}}^{\text {LO}} = 0. \end{aligned}$$
(7.75)

The LO spin Hamiltonians have been applied to studies of binary pulsar and solar system dynamics, including satellites on orbits around the Earth (see, e.g., Barker and O’Connell 1979 and Schäfer 2004). Another application to the coalescence of spinning binary black holes via the effective-one-body approach is given in Damour (2001). The LO spin dynamics was analysed for black holes and other extended objects in external fields by D’Eath (1975a) and Thorne and Hartle (1985), and for binary black holes in the slow-motion limit by D’Eath (1975b). In Barausse et al. (2009, 2012b) the spinning test-particle dynamics in the Kerr metric has been explored at LO within a Hamiltonian formalism based on Dirac brackets. In the article Kidder (1995) the LO spin-orbit and spin(1)-spin(2) dynamics for compact binaries is treated in full detail, even including their influence on the gravitational waves and the related gravitational damping; in particular, the quasi-circular inspiral and the recoil of the linear momentum from the LO spin coupling were obtained.

The Hamiltonian of the next-to-leading-order (NLO) spin-orbit coupling reads

$$\begin{aligned}&H_{\text {SO}}^{\text {NLO}} = -G\frac{((\textbf{p}_1 \times \textbf{S}_1) \cdot \textbf{n}_{1 2})}{c^4r_{12}^2} \Bigg ( \frac{5 m_2 \textbf{p}_1^2}{8 m_1^3} + \frac{3 ((\textbf{p}_1\cdot \textbf{p}_2)+(\textbf{n}_{12}\cdot \textbf{p}_1)(\textbf{n}_{12}\cdot \textbf{p}_2))}{4 m_1^2} \\&\quad - \frac{3(\textbf{p}_2^2- 2(\textbf{n}_{12}\cdot \textbf{p}_2)^2)}{4 m_1 m_2} \Bigg ) + G \frac{((\textbf{p}_1 \times \textbf{S}_1) \cdot \textbf{p}_2)}{c^4r_{12}^2} \left( \frac{2 (\textbf{n}_{12}\cdot \textbf{p}_2)}{m_1 m_2} - \frac{3 (\textbf{n}_{12}\cdot \textbf{p}_1)}{4 m_1^2} \right) \\&\quad + G\frac{((\textbf{p}_2 \times \textbf{S}_1) \cdot \textbf{n}_{1 2})}{c^4r_{12}^2} \frac{(\textbf{p}_1\cdot \textbf{p}_2)+ 3 (\textbf{n}_{12}\cdot \textbf{p}_1)(\textbf{n}_{12}\cdot \textbf{p}_2)}{m_1 m_2} \\&\quad - G^2\frac{((\textbf{p}_1 \times \textbf{S}_1) \cdot \textbf{n}_{1 2})}{c^4r_{12}^3} \left( \frac{11 m_2}{2} + \frac{5 m_2^2}{m_1}\right) \\&\quad + G^2 \frac{((\textbf{p}_2 \times \textbf{S}_1) \cdot \textbf{n}_{1 2})}{c^4r_{12}^3} \left( 6 m_1 + \frac{15 m_2}{2}\right) + (1 \leftrightarrow 2). \end{aligned}$$
(7.76)

This Hamiltonian was derived by Damour et al. (2008c). An equivalent derivation of the NLO spin-orbit effects in the two-body equations of motion was carried out in harmonic coordinates by Blanchet et al. (2006, 2007, 2010a).

The NLO spin(1)-spin(2) Hamiltonian is given by

$$\begin{aligned} H_{{S_1S_2}}^{\text {NLO}}&= \frac{G}{2c^4 m_1 m_2 r_{12}^3} \Big [6 ((\textbf{p}_2 \times \textbf{S}_1) \cdot \textbf{n}_{1 2})((\textbf{p}_1 \times \textbf{S}_2) \cdot \textbf{n}_{1 2}) \\&\quad + \frac{3}{2} ((\textbf{p}_1 \times \textbf{S}_1) \cdot \textbf{n}_{1 2})((\textbf{p}_2 \times \textbf{S}_2) \cdot \textbf{n}_{1 2}) \\ {}&\quad - 15 (\textbf{S}_1 \cdot \textbf{n}_{1 2})(\textbf{S}_2 \cdot \textbf{n}_{1 2})(\textbf{n}_{12}\cdot \textbf{p}_1)(\textbf{n}_{12}\cdot \textbf{p}_2) \\&\quad - 3 (\textbf{S}_1 \cdot \textbf{n}_{1 2})(\textbf{S}_2 \cdot \textbf{n}_{1 2})(\textbf{p}_1\cdot \textbf{p}_2)+ 3 (\textbf{S}_1 \cdot \textbf{p}_2)(\textbf{S}_2 \cdot \textbf{n}_{1 2})(\textbf{n}_{12}\cdot \textbf{p}_1) \\&\quad + 3 (\textbf{S}_2 \cdot \textbf{p}_1)(\textbf{S}_1 \cdot \textbf{n}_{1 2})(\textbf{n}_{12}\cdot \textbf{p}_2)+ 3 (\textbf{S}_1 \cdot \textbf{p}_1)(\textbf{S}_2 \cdot \textbf{n}_{1 2})(\textbf{n}_{12}\cdot \textbf{p}_2) \\&\quad + 3 (\textbf{S}_2 \cdot \textbf{p}_2)(\textbf{S}_1 \cdot \textbf{n}_{1 2})(\textbf{n}_{12}\cdot \textbf{p}_1)- 3 (\textbf{S}_1 \cdot \textbf{S}_2)(\textbf{n}_{12}\cdot \textbf{p}_1)(\textbf{n}_{12}\cdot \textbf{p}_2) \\&\quad + (\textbf{S}_1 \cdot \textbf{p}_1)(\textbf{S}_2 \cdot \textbf{p}_2)- \frac{1}{2} (\textbf{S}_1 \cdot \textbf{p}_2)(\textbf{S}_2 \cdot \textbf{p}_1)+ \frac{1}{2} (\textbf{S}_1 \cdot \textbf{S}_2)(\textbf{p}_1\cdot \textbf{p}_2)\Big ] \\&\quad + \frac{3G}{2c^4 m_1^2 r_{12}^3} \Big [-((\textbf{p}_1 \times \textbf{S}_1) \cdot \textbf{n}_{1 2})((\textbf{p}_1 \times \textbf{S}_2) \cdot \textbf{n}_{1 2}) \\&\quad + (\textbf{S}_1 \cdot \textbf{S}_2)(\textbf{n}_{12}\cdot \textbf{p}_1)^2 - (\textbf{S}_1 \cdot \textbf{n}_{1 2})(\textbf{S}_2 \cdot \textbf{p}_1)(\textbf{n}_{12}\cdot \textbf{p}_1)\Big ] \\&\quad + \frac{3G}{2c^4 m_2^2 r_{12}^3} \Big [-((\textbf{p}_2 \times \textbf{S}_2) \cdot \textbf{n}_{1 2})((\textbf{p}_2 \times \textbf{S}_1) \cdot \textbf{n}_{1 2}) \\&\quad + (\textbf{S}_1 \cdot 
\textbf{S}_2)(\textbf{n}_{12}\cdot \textbf{p}_2)^2 - (\textbf{S}_2 \cdot \textbf{n}_{1 2})(\textbf{S}_1 \cdot \textbf{p}_2)(\textbf{n}_{12}\cdot \textbf{p}_2)\Big ] \\&\quad + \frac{6G^2(m_1 + m_2)}{c^4 r_{12}^4} [(\textbf{S}_1 \cdot \textbf{S}_2)- 2(\textbf{S}_1 \cdot \textbf{n}_{1 2})(\textbf{S}_2 \cdot \textbf{n}_{1 2})]. \end{aligned}$$
(7.77)

The calculation of the LO and NLO \(S_1^2\)-Hamiltonians requires the source terms (7.69)–(7.70). In the case of polar-dipolar-quadrupolar particles, which are to model spinning black holes, \(Q^{ij}_1\) is the quadrupole tensor of black hole 1 resulting from its rotational deformation, and the value of the constant \(c_1\) is fixed by matching to the test-body Hamiltonian in a Kerr background: \(c_1=-1/2\). Additionally, one has to use the Poincaré algebra to fix uniquely all coefficients in the momentum-dependent part of the Hamiltonian. The NLO \(S_1^2\)-Hamiltonian was presented for the first time by Steinhoff et al. (2008b) (see footnote 16). It reads

$$\begin{aligned} H_{S_1^2}^{\text {NLO}}&= \frac{G}{c^4r_{12}^3}\bigg \{ \frac{m_{2}}{m_{1}^3} \bigg [ \frac{1}{4}\left( {\textbf{p}}_{1}\cdot {{\textbf{S}}}_{1}\right) ^2 + \frac{3}{8}\left( {\textbf{p}}_{1}\cdot {{\textbf{n}}}_{12}\right) ^{2}{{\textbf{S}}}_{1}^{2} - \frac{3}{8} {{\textbf{p}}}_{1}^{2}\left( {{\textbf{S}}}_{1}\cdot {\textbf{n}}_{12}\right) ^2 \\&\quad - \frac{3}{4} \left( {\textbf{p}}_{1}\cdot {{\textbf{n}}}_{12}\right) \left( {{\textbf{S}}}_{1}\cdot {\textbf{n}}_{12}\right) \left( {{\textbf{p}}}_{1}\cdot {{\textbf{S}}}_{1}\right) \bigg ] + \frac{3}{4m_{1}m_{2}}\Big [3{\textbf{p}}_{2}^{2}\left( {{\textbf{S}}}_{1}\cdot {{\textbf{n}}}_{12}\right) ^2 \\&\quad - {{\textbf{p}}}_{2}^{2}{{\textbf{S}}}_{1}^{2}\Big ] + \frac{1}{m_1^2} \bigg [ \frac{3}{4}\left( {{\textbf{p}}}_{1}\cdot {\textbf{p}}_{2}\right) {{\textbf{S}}}_{1}^2 -\frac{9}{4}\left( {\textbf{p}}_{1}\cdot {{\textbf{p}}}_{2}\right) \left( {{\textbf{S}}}_{1} \cdot {{\textbf{n}}}_{12}\right) ^2 \\&\quad - \frac{3}{2}\left( {{\textbf{p}}}_{1}\cdot {\textbf{n}}_{12}\right) \left( {{\textbf{p}}}_{2}\cdot {{\textbf{S}}}_{1}\right) \left( {{\textbf{S}}}_{1}\cdot {{\textbf{n}}}_{12}\right) +3\left( {\textbf{p}}_{2}\cdot {{\textbf{n}}}_{12}\right) \left( {{\textbf{p}}}_{1} \cdot {{\textbf{S}}}_{1}\right) \left( {{\textbf{S}}}_{1}\cdot {\textbf{n}}_{12}\right) \\&\quad + \frac{3}{4}\left( {\textbf{p}}_{1}\cdot {{\textbf{n}}}_{12}\right) \left( {{\textbf{p}}}_{2} \cdot {{\textbf{n}}}_{12}\right) {{\textbf{S}}}_{1}^2 -\frac{15}{4}\left( {{\textbf{p}}}_{1}\cdot {\textbf{n}}_{12}\right) \left( {{\textbf{p}}}_{2}\cdot {\textbf{n}}_{12}\right) \left( {{\textbf{S}}}_{1}\cdot {{\textbf{n}}}_{12}\right) ^2 \bigg ] \bigg \} \\&\quad - \frac{G^2 m_2}{2 c^4r_{12}^4} \bigg [9({{\textbf{S}}}_1 \cdot {{\textbf{n}}}_{12})^2 - 5 {{\textbf{S}}}_1^2 + \frac{14 m_2}{m_1} ({{\textbf{S}}}_1 \cdot {{\textbf{n}}}_{12})^2 - \frac{6 m_2}{m_1}{{\textbf{S}}}_1^2 \bigg ]. \end{aligned}$$
(7.78)

The spin precession equations corresponding to the Hamiltonians \(H^{\text {NLO}}_{S_1S_2}\) and \(H_{S_1^2}^{\text {NLO}}\) have also been calculated by Porto and Rothstein (2008b) (see footnote 17) and Porto and Rothstein (2008a) (see footnote 18), respectively.

The NLO spin-orbit and spin(a)-spin(b) centre-of-mass vectors take the form

$$\begin{aligned} \textbf{G}_{\text {SO}}^{\text {NLO}}&= -\sum _a \frac{\textbf{p}_a^2}{8 c^4m_a^3} (\textbf{p}_a \times \textbf{S}_a) \\&\quad + \sum _a \sum _{b \ne a} \frac{G m_b}{4 c^4m_a r_{ab}} \bigg \{ [(\textbf{p}_a\times \textbf{S}_a)\cdot \textbf{n}_{ab}] \frac{5\textbf{x}_a+\textbf{x}_b}{r_{ab}} - 5 (\textbf{p}_a \times \textbf{S}_a) \bigg \} \\&\quad + \sum _a \sum _{b \ne a} \frac{G}{c^4 r_{ab}} \bigg \{ \frac{3}{2} (\textbf{p}_b \times \textbf{S}_a) - \frac{1}{2} (\textbf{n}_{ab} \times \textbf{S}_a) (\textbf{p}_b \cdot \textbf{n}_{ab}) \\&\quad - [(\textbf{p}_a \times \textbf{S}_a) \cdot \textbf{n}_{ab}] \frac{\textbf{x}_a+\textbf{x}_b}{r_{ab}} \bigg \}, \end{aligned}$$
(7.79)
$$\begin{aligned} \textbf{G}_{{S_1S_2}}^{\text {NLO}}&= \frac{G}{2c^4} \sum _a \sum _{b \ne a} \bigg \{ \left[ 3(\textbf{S}_{a}\cdot \textbf{n}_{ab})(\textbf{S}_{b}\cdot \textbf{n}_{ab}) -(\textbf{S}_{a}\cdot \textbf{S}_{b})\right] \frac{\textbf{x}_{a}}{r_{ab}^3} + (\textbf{S}_{b}\cdot \textbf{n}_{ab}) \frac{\textbf{S}_{a}}{r_{ab}^2} \bigg \}\,,\end{aligned}$$
(7.80)
$$\begin{aligned} \textbf{G}_{S_1^2}^{\text {NLO}}&= \frac{2Gm_2}{c^4m_{1}} \bigg \{ \frac{3\left( \textbf{S}_{1}\cdot \textbf{n}_{12}\right) ^2}{8r_{12}^3}\left( \textbf{x}_{1}+\textbf{x}_{2}\right) +\frac{\textbf{S}_{1}^2}{8r_{12}^3} \left( 3\textbf{x}_{1}-5\textbf{x}_{2}\right) -\frac{\left( \textbf{S}_{1}\cdot \textbf{n}_{12}\right) \textbf{S}_1}{r_{12}^2}\bigg \}. \end{aligned}$$
(7.81)

We can sum up all centre-of-mass vectors displayed in this subsection in the following equation:

$$\begin{aligned} \textbf{G} = \textbf{G}_{\text {N}} + \textbf{G}_{\text {1PN}} + \textbf{G}_{\text {2PN}} + \textbf{G}_{\text {3PN}} + \textbf{G}_{\text {4PN}} + \textbf{G}_{\text {SO}}^{\text {LO}} + \textbf{G}_{\text {SO}}^{\text {NLO}} + \textbf{G}_{S_{1}S_{2}}^{\text {NLO}} + \textbf{G}_{S_{1}^2}^{\text {NLO}} + \textbf{G}_{S_{2}^2}^{\text {NLO}}, \end{aligned}$$
(7.82)

where \(\textbf{G}_{\text {N}}\) up to \(\textbf{G}_{\text {4PN}}\) represent the pure orbital contributions, which do not depend on spin variables (explicit formulae for them can be found in Jaranowski and Schäfer 2015). The last term in Eq. (7.82) can be obtained from the second-to-last one by means of the exchange \((1 \leftrightarrow 2)\) of the bodies’ labels.

The conservative binary Hamiltonians given explicitly above and in Appendices C and D, which model binaries made of spinning black holes, can be summarized as follows:

$$\begin{aligned} H&= H_{\text {N}} + H_{\text {1PN}} + H_{\text {2PN}} + H_{\text {3PN}} + H_{\text {4PN}} \\&\quad + H_{\text {SO}}^{\text {LO}} + H^{\text {LO}}_{S_1^2} + H^{\text {LO}}_{S_{1}S_{2}} + H^{\text {LO}}_{S_2^2} \\&\quad + H_{\text {SO}}^{\text {NLO}} + H^{\text {NLO}}_{S_{1}^2} + H^{\text {NLO}}_{S_{1}S_{2}} + H^{\text {NLO}}_{S_{2}^2} \\&\quad + H_{\text {SO}}^{\text {NNLO}} + H^{\text {NNLO}}_{S_{1}^2} + H^{\text {NNLO}}_{S_{1}S_{2}} + H^{\text {NNLO}}_{S_{2}^2} \\&\quad + H^{\text {LO}}_{S_1^3} + H^{\text {LO}}_{S_1^2 S_2} + H^{\text {LO}}_{S_1 S_2^2}+ H^{\text {LO}}_{S_2^3} \\&\quad + H^{\text {LO}}_{S_{1}^4} + H^{\text {LO}}_{S_{1}^3S_{2}} + H^{\text {LO}}_{S_{1}^2S_{2}^2} + H^{\text {LO}}_{S_{1}S_{2}^3} + H^{\text {LO}}_{S_{2}^4}, \end{aligned}$$
(7.83)

where the first line comprises pure orbital, i.e., spin-independent, Hamiltonians. The Hamiltonians from the second and the third line are explicitly given above. The NNLO spin-orbit \(H_{\text {SO}}^{\text {NNLO}}\) and spin1-spin2 \(H^{\text {NNLO}}_{S_{1}S_{2}}\) Hamiltonians were obtained by Hartung et al. (2013); their explicit forms can be found in Appendix D. Levi and Steinhoff (2021) derived the NNLO spin-squared Hamiltonians \(H^{\text {NNLO}}_{S_{1}^2}\) and \(H^{\text {NNLO}}_{S_{2}^2}\) by applying the EFT method to extended bodies. All the Hamiltonians cubic and quartic in the spins were derived by Hergt and Schäfer (2008a, b) with the aid of approximate ADMTT coordinates of the Kerr metric and application of the Poincaré algebra (see footnote 19). Their generalizations to general extended objects were achieved by Levi and Steinhoff (2015), where the Hamiltonians \(H^{\text {LO}}_{S_1^4}\) and \(H^{\text {LO}}_{S_2^4}\) were also obtained for the first time (correcting Hergt and Schäfer 2008a). All the Hamiltonians cubic and quartic in the spins and displayed in Eq. (7.83) are explicitly given in Appendix D. Notice that not all Hamiltonians from Eq. (7.83) are necessarily given in the ADM gauge, because any use of the equations of motion in their derivation changes the gauge. For example, for spinless particles the highest-order conservative Hamiltonian in ADM gauge is \(H_{\text {2PN}}\).

For completeness we also give the spin-squared Hamiltonians for neutron stars through next-to-leading order (Porto and Rothstein 2008a, 2010a; Hergt et al. 2010). They depend on the quantity \(C_Q\), which parametrizes quadrupolar deformation effects induced by spins. The LO Hamiltonian reads (cf., e.g., Barker and O’Connell 1979)

$$\begin{aligned} H_{S_{1}^2\text {(NS)}}^{\text {LO}} = \frac{Gm_{1}m_{2}}{2c^2 r_{12}^3} C_{Q_1} \left( 3\frac{(\textbf{S}_1 \cdot \textbf{n}_{1 2})^2}{m_{1}^2}-\frac{\textbf{S}_1^2}{m_{1}^2}\right) . \end{aligned}$$
(7.84)

The NLO Hamiltonian equals

$$\begin{aligned} H_{S_{1}^2\text {(NS)}}^{\text {NLO}}&= \frac{G}{c^4 r_{12}^3} \Bigg [\frac{m_{2}}{m_{1}^3}\Bigg ( \left( -\frac{21}{8}+\frac{9}{4}C_{Q_1}\right) \textbf{p}_1^2({\textbf{S}}_1 \cdot {\textbf{n}}_{1 2})^2 +\left( \frac{3}{2}C_{Q_1}-\frac{5}{4}\right) ({\textbf{S}}_1 \cdot \textbf{p}_1)^2 \\&\quad +\left( \frac{15}{4}-\frac{9}{2}C_{Q_1}\right) (\textbf{p}_1 \cdot {\textbf{n}}_{1 2})({\textbf{S}}_1 \cdot {\textbf{n}}_{1 2})({\textbf{S}}_1 \cdot \textbf{p}_1) \\&\quad +\left( -\frac{9}{8}+\frac{3}{2}C_{Q_1}\right) (\textbf{p}_1 \cdot {\textbf{n}}_{1 2})^2{\textbf{S}}_1^2+\left( \frac{5}{4}-\frac{5}{4}C_{Q_1}\right) \textbf{p}_1^2{\textbf{S}}_1^2\Bigg ) \\&\quad +\frac{1}{m_{1}^2} \Bigg (-\frac{15}{4}C_{Q_1}(\textbf{p}_1 \cdot {\textbf{n}}_{1 2})(\textbf{p}_2 \cdot {\textbf{n}}_{1 2})({\textbf{S}}_1 \cdot {\textbf{n}}_{1 2})^2 \\&\quad +\left( 3-\frac{21}{4}C_{Q_1}\right) (\textbf{p}_1\cdot \textbf{p}_2)({\textbf{S}}_1 \cdot {\textbf{n}}_{1 2})^2 \\&\quad +\left( -\frac{3}{2}+\frac{9}{2}C_{Q_1}\right) (\textbf{p}_2 \cdot {\textbf{n}}_{1 2})({\textbf{S}}_1 \cdot {\textbf{n}}_{1 2})({\textbf{S}}_1 \cdot \textbf{p}_1) \\&\quad +\left( -3+\frac{3}{2}C_{Q_1}\right) (\textbf{p}_1 \cdot {\textbf{n}}_{1 2})({\textbf{S}}_1 \cdot {\textbf{n}}_{1 2})({\textbf{S}}_1 \cdot \textbf{p}_2) \\&\quad +\left( \frac{3}{2}-\frac{3}{2}C_{Q_1}\right) ({\textbf{S}}_1 \cdot \textbf{p}_1)({\textbf{S}}_1 \cdot \textbf{p}_2) \\&\quad +\left( \frac{3}{2}-\frac{3}{4}C_{Q_1}\right) (\textbf{p}_1 \cdot {\textbf{n}}_{1 2})(\textbf{p}_2 \cdot {\textbf{n}}_{1 2}){\textbf{S}}_1^2 \\&\quad +\left( -\frac{3}{2}+\frac{9}{4}C_{Q_1}\right) (\textbf{p}_1\cdot \textbf{p}_2){\textbf{S}}_1^2\Bigg ) \\&\quad +\frac{C_{Q_1}}{m_{1}m_{2}}\Big (\frac{9}{4}\textbf{p}_2^2({\textbf{S}}_1 \cdot {\textbf{n}}_{1 2})^2-\frac{3}{4}\textbf{p}_2^2{\textbf{S}}_1^2\Big )\Bigg ] \\&\quad + \frac{G^2 m_{2}}{c^4 r_{12}^4} \Bigg [\left( 2+\frac{1}{2}C_{Q_1}+\frac{m_{2}}{m_{1}}\big (1+2C_{Q_1}\big )\right) 
{\textbf{S}}_1^2 \\&\quad +\left( -3-\frac{3}{2}C_{Q_1}-\frac{m_{2}}{m_{1}}\big (1+6C_{Q_1}\big )\right) ({\textbf{S}}_1 \cdot {\textbf{n}}_{1 2})^2\Bigg ]. \end{aligned}$$
(7.85)

For \(C_{Q_1}=1\) this Hamiltonian agrees with that given in Eq. (7.78) describing black-hole binaries (for neutron stars, \(C_{Q_1}\) takes values in the range 2–8; see, e.g., Mandal et al. 2023a). It was derived fully correctly for the first time by Porto and Rothstein (2010a) using the EFT method. Shortly afterwards, an independent calculation by Hergt et al. (2010), based in part on Eqs. (7.69) and (7.70) including (7.67), confirmed the result.
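The reduction of Eq. (7.85) to Eq. (7.78) at \(C_{Q_1}=1\) can be checked term by term with exact rational arithmetic. The sketch below transcribes the coefficients of both equations into tables; the string labels and function names are hypothetical bookkeeping devices, with each label naming the prefactor group and the monomial it multiplies. The common prefactors \(G/(c^4r_{12}^3)\) and \(G^2/(c^4r_{12}^4)\) are left out because they are identical in both equations.

```python
from fractions import Fraction as F

# Eq. (7.85): each coefficient has the form a + b*C_Q1, stored as (a, b).
NS = {
    "m2/m1^3: p1^2 (S1.n)^2":        (F(-21, 8), F(9, 4)),
    "m2/m1^3: (S1.p1)^2":            (F(-5, 4),  F(3, 2)),
    "m2/m1^3: (p1.n)(S1.n)(S1.p1)":  (F(15, 4),  F(-9, 2)),
    "m2/m1^3: (p1.n)^2 S1^2":        (F(-9, 8),  F(3, 2)),
    "m2/m1^3: p1^2 S1^2":            (F(5, 4),   F(-5, 4)),
    "1/m1^2: (p1.n)(p2.n)(S1.n)^2":  (F(0),      F(-15, 4)),
    "1/m1^2: (p1.p2)(S1.n)^2":       (F(3),      F(-21, 4)),
    "1/m1^2: (p2.n)(S1.n)(S1.p1)":   (F(-3, 2),  F(9, 2)),
    "1/m1^2: (p1.n)(S1.n)(S1.p2)":   (F(-3),     F(3, 2)),
    "1/m1^2: (S1.p1)(S1.p2)":        (F(3, 2),   F(-3, 2)),
    "1/m1^2: (p1.n)(p2.n) S1^2":     (F(3, 2),   F(-3, 4)),
    "1/m1^2: (p1.p2) S1^2":          (F(-3, 2),  F(9, 4)),
    "1/(m1 m2): p2^2 (S1.n)^2":      (F(0),      F(9, 4)),
    "1/(m1 m2): p2^2 S1^2":          (F(0),      F(-3, 4)),
    "G^2 m2: S1^2":                  (F(2),      F(1, 2)),
    "G^2 m2: (S1.n)^2":              (F(-3),     F(-3, 2)),
    "G^2 m2^2/m1: S1^2":             (F(1),      F(2)),
    "G^2 m2^2/m1: (S1.n)^2":         (F(-1),     F(-6)),
}

# Eq. (7.78): non-vanishing black-hole coefficients in the same basis.
BH = {
    "m2/m1^3: (S1.p1)^2":            F(1, 4),
    "m2/m1^3: (p1.n)^2 S1^2":        F(3, 8),
    "m2/m1^3: p1^2 (S1.n)^2":        F(-3, 8),
    "m2/m1^3: (p1.n)(S1.n)(S1.p1)":  F(-3, 4),
    "1/(m1 m2): p2^2 (S1.n)^2":      F(9, 4),
    "1/(m1 m2): p2^2 S1^2":          F(-3, 4),
    "1/m1^2: (p1.p2) S1^2":          F(3, 4),
    "1/m1^2: (p1.p2)(S1.n)^2":       F(-9, 4),
    "1/m1^2: (p1.n)(S1.n)(S1.p2)":   F(-3, 2),
    "1/m1^2: (p2.n)(S1.n)(S1.p1)":   F(3),
    "1/m1^2: (p1.n)(p2.n) S1^2":     F(3, 4),
    "1/m1^2: (p1.n)(p2.n)(S1.n)^2":  F(-15, 4),
    "G^2 m2: (S1.n)^2":              F(-9, 2),
    "G^2 m2: S1^2":                  F(5, 2),
    "G^2 m2^2/m1: (S1.n)^2":         F(-7),
    "G^2 m2^2/m1: S1^2":             F(3),
}

def mismatches(c_q):
    """Return the labels whose coefficient in Eq. (7.85) at C_Q1 = c_q differs from Eq. (7.78)."""
    keys = set(NS) | set(BH)
    out = {}
    for k in keys:
        a, b = NS.get(k, (F(0), F(0)))
        ns_val = a + b * c_q
        bh_val = BH.get(k, F(0))
        if ns_val != bh_val:
            out[k] = (ns_val, bh_val)
    return out
```

Calling `mismatches(F(1))` returns an empty dictionary for the tables above, whereas any other value of `c_q` produces differences; in particular, the \(\textbf{p}_1^2\textbf{S}_1^2\) and \((\textbf{S}_1\cdot \textbf{p}_1)(\textbf{S}_1\cdot \textbf{p}_2)\) terms, absent from Eq. (7.78), vanish only at \(C_{Q_1}=1\).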

The radiation-reaction (or dissipative) Hamiltonians for leading-order spin-orbit and spin1-spin2 couplings were derived by Steinhoff and Wang (2010) and Wang et al. (2011). All the known dissipative Hamiltonians can thus be summarized as

$$\begin{aligned} H^{\text{diss}} = H_{\text{2.5PN}} + H_{\text{3.5PN}} + H^{\text {LO}\,\text {diss}}_{\text{SO}} + H^{\text {LO}\,\text {diss}}_{S_1S_2}, \end{aligned}$$
(7.86)

where \(H_{\text{2.5PN}}\) and \(H_{\text{3.5PN}}\) are spin-independent (purely orbital) dissipative Hamiltonians. The leading-order Hamiltonian \(H_{\text{2.5PN}}\) is given in Eq. (6.108) for two-point-mass systems and in Appendix E for many-point-mass systems, and the next-to-leading-order Hamiltonian \(H_{\text{3.5PN}}\) is explicitly given in Appendix E (also for many-point-mass systems). The spin-dependent dissipative Hamiltonians \(H^{\text {LO}\,\text {diss}}_{\text{SO}}\) and \(H^{\text {LO}\,\text {diss}}_{S_1S_2}\) can be read off from the Hamiltonian \(H^{\text {spin}}_{\text {3.5PN}}\) given in Appendix E (we keep here the notation of Wang et al. 2011, which indicates spin corrections to the spinless 3.5PN dynamics).

8 Tidal interactions

The work done in this field through higher PN orders relies on the effective Fokker action with non-minimal matter couplings. The Hamiltonians are obtained from higher-order Lagrangians in harmonic coordinates via order reduction and Legendre transforms. Here we closely follow Henry et al. (2020a, b); see also Damour and Nagar (2010), Bini et al. (2012), Steinhoff et al. (2016).

The action for the gravitational field is given in harmonic gauge through

$$\begin{aligned} S_g = \frac{c^3}{16\pi G} \int \text {d}^4x \sqrt{-g} \left( R-\frac{1}{2}g_{\mu \nu }\varGamma ^\mu \varGamma ^\nu \right) , \end{aligned}$$
(8.1)

where \(\varGamma ^\mu :=g^{\rho \sigma }\varGamma ^\mu _{\rho \sigma }\). The ansatz for the matter action, in sufficient approximation for our intended presentation, is given by

$$\begin{aligned} S_m&= \sum _a \int \text {d}\tau _a \bigg (-m_ac^2 + \frac{\mu _a^{(2)}}{4}G^a_{\mu \nu } G_a^{\mu \nu } + \frac{\sigma ^{(2)}_a}{6c^2}H^a_{\mu \nu }H_a^{\mu \nu } \\&\quad + \frac{\mu _a^{(3)}}{12}G^a_{\lambda \mu \nu } G_a^{\lambda \mu \nu } + {\mathcal{O}}\left( \frac{\epsilon _{\text{tidal}}}{c^6}\right) \bigg ), \end{aligned}$$
(8.2)

where, for the body labeled a, \(G_a^{\mu \nu }\) is the tidal mass quadrupole moment, \(H_a^{\mu \nu }\) the tidal current quadrupole moment, and \(G_a^{\lambda \mu \nu }\) the tidal mass octupole moment; \(\epsilon _{\rm{tidal}}\sim 1/c^{10}\) denotes the order of the dominant tidal effect. The static (equilibrium) deformability coefficients are denoted, including their orders, by \(\mu _a^{(2)} = {\mathcal{O}}(\epsilon _{\rm{tidal}})\), \(\sigma _a^{(2)} = {\mathcal{O}}(\epsilon _{\rm{tidal}})\), and \(\mu _a^{(3)} = {\mathcal{O}}(\epsilon _{\rm{tidal}}/c^4)\). The first tidal term contributes at LO, NLO, and NNLO, the second at NLO and NNLO, and the third only at NNLO.

The tidal moments are related to the Weyl or Riemann tensor, evaluated at the point masses (particles), in the forms

$$\begin{aligned} G^a_{\mu \nu }&= -c^2 [R_{\mu \rho \nu \sigma }]_a u_a^\rho u_a^\sigma , \end{aligned}$$
(8.3a)
$$\begin{aligned} H^a_{\mu \nu }&= 2c^3 [R^*_{(\mu \underline{\rho }\nu )\sigma }]_a u_a^\rho u_a^\sigma ,\end{aligned}$$
(8.3b)
$$\begin{aligned} G^a_{\lambda \mu \nu }&= -c^2 [\nabla ^\bot _{(\lambda } R_{\mu \underline{\rho }\nu )\sigma }]_a u_a^\rho u_a^\sigma , \end{aligned}$$
(8.3c)

with the underlined index \(\rho \) being excluded from symmetrization and \(\nabla ^\bot _\mu :=(\delta ^\nu _\mu + u_\mu u^\nu )\nabla _\nu \).

We make use of the gothic metric deviation \(h^{\mu \nu } = \sqrt{-g} g^{\mu \nu } - \eta ^{\mu \nu }\), define the vector variable \(h\equiv (h^{00ii},h^{0i},h^{ij})\) with \(h^{00ii}\equiv h^{00}+\delta _{ij}h^{ij}\), and decompose \(h = h_{\text{pp}} + h_{\text{tidal}}\), where \(h_{\text{pp}}\) comes from the metric generated by structureless point particles (pp) and \(h_{\text{tidal}}=(\epsilon _{\text{tidal}}/c^2,\epsilon _{\text{tidal}}/c^3,\epsilon _{\text{tidal}}/c^4)\). The Fokker action \(S_{\text{F}}[\text{MV}]\), and likewise the Fokker Lagrangian \(L_{\text{F}}\) with \(\int \text {d}t\,L_{\text{F}}(\text{MV}) = S_{\text{F}}[\text{MV}]\), where MV denotes the matter variables, then takes, similarly to our Routhian procedure, the form

$$\begin{aligned} S_{\text{F}}[\text{MV}] = S_{\text{total}}[\text{MV},h_{\text{pp}}], \end{aligned}$$
(8.4)

where we have used

$$\begin{aligned} S_{\text{total}}[\text{MV}, h]&= S_{\text{total}}[\text{MV},h_{\rm{pp}}] + \int \text {d}^4x \frac{\delta S_{\text{total}}}{\delta h}[\text{MV}, h_{\rm{pp}}]h_{\text{tidal}} + {\mathcal{O}}(h^2_{\text{tidal}}) \\&= S_{\text{total}} [\text{MV}, h_{\rm{pp}}] + {\mathcal{O}}(\epsilon ^2_{\text{tidal}}); \end{aligned}$$
(8.5)

also see Appendix C in Damour and Schäfer (1985).
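The second equality in Eq. (8.5) can be made plausible by the following sketch. It assumes the split \(S_{\text{total}} = S_{\text{pp}} + S_{\text{tidal}}\), where \(S_{\text{pp}}\) and \(S_{\text{tidal}}\) are labels introduced here for illustration: \(S_{\text{pp}}\) collects \(S_g\) and the point-mass term of \(S_m\), and \(S_{\text{tidal}}\) collects the terms of Eq. (8.2) proportional to the deformability coefficients. Since \(h_{\text{pp}}\) solves the field equations following from \(S_{\text{pp}}\), one has

$$\begin{aligned} \frac{\delta S_{\text{total}}}{\delta h}[\text{MV}, h_{\text{pp}}] = \underbrace{\frac{\delta S_{\text{pp}}}{\delta h}[\text{MV}, h_{\text{pp}}]}_{=\,0} + \frac{\delta S_{\text{tidal}}}{\delta h}[\text{MV}, h_{\text{pp}}] = {\mathcal{O}}(\epsilon _{\text{tidal}}), \end{aligned}$$

so that the term linear in \(h_{\text{tidal}}\), itself of order \(\epsilon _{\text{tidal}}\), contributes only at \({\mathcal{O}}(\epsilon ^2_{\text{tidal}})\).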

The explicit form of the NNLO tidal Hamiltonian can be found in Henry et al. (2020a). It reads

$$\begin{aligned} H_{\rm {tidal}}&= \frac{G^2 m_2^2}{r_{12}^6}\Bigg \{ -\frac{3}{2}\mu _1^{(2)} + \frac{1}{c^2} w_{\text {NLO}} + \frac{1}{c^4} w_{\text {NNLO}} - \frac{15\mu _1^{(3)}}{2r_{12}^2} \Bigg \} + (1\leftrightarrow 2) \\&\quad + \mathcal {O}\left( \frac{\epsilon _\text {tidal}}{c^{6}}\right) , \end{aligned}$$
(8.6)

where

$$\begin{aligned} w_{\text {NLO}}&= -\frac{12 \sigma _1^{(2)} \textbf{p}_2^2}{m_{2}^2} + \frac{(\textbf{n}_{12}\cdot \textbf{p}_2)^2}{m_{2}^2} \Bigl (-18 \mu _1^{(2)} + 12 \sigma _1^{(2)}\Bigr ) \\&\quad + \frac{(\textbf{n}_{12}\cdot \textbf{p}_1)(\textbf{n}_{12}\cdot \textbf{p}_2)}{m_{1} m_{2}} \Bigl (18 \mu _1^{(2)} - 24 \sigma _1^{(2)}\Bigr ) + \frac{(\textbf{p}_1\cdot \textbf{p}_2)}{m_{1} m_{2}} \Bigl (\frac{9}{2} \mu _1^{(2)} + 24 \sigma _1^{(2)}\Bigr ) \\&\quad + \frac{(\textbf{n}_{12}\cdot \textbf{p}_1)^2}{m_{1}^2} \Bigl (\frac{9}{2} \mu _1^{(2)} + 12 \sigma _1^{(2)}\Bigr ) + \frac{\textbf{p}_1^2}{m_{1}^2} \Bigl (- \frac{15}{4} \mu _1^{(2)} - 12 \sigma _1^{(2)}\Bigr ) \\&\quad + \frac{G}{r_{12}} \Bigl (3m_{1} + \frac{21}{2} m_{2}\Bigr ) \mu _1^{(2)}, \end{aligned}$$
(8.7a)
$$\begin{aligned} w_{\text {NNLO}}&= \frac{(\textbf{n}_{12}\cdot \textbf{p}_2)^4}{m_{2}^4} \Bigl (- \frac{63}{2} \mu _1^{(2)} - 60 \sigma _1^{(2)}\Bigr ) + \frac{(\textbf{p}_2^2)^2}{m_{2}^4} \Bigl (-9 \mu _1^{(2)} - 12 \sigma _1^{(2)}\Bigr ) \\&\quad + \frac{(\textbf{p}_1\cdot \textbf{p}_2)\textbf{p}_2^2}{m_{1} m_{2}^3} \Bigl (\frac{99}{4} \mu _1^{(2)} + 60 \sigma _1^{(2)}\Bigr ) + \frac{(\textbf{n}_{12}\cdot \textbf{p}_2)^2}{m_{2}^2} \biggl [\frac{\textbf{p}_2^2}{m_{2}^2} \Bigl (54 \mu _1^{(2)} + 72 \sigma _1^{(2)}\Bigr ) \\&\quad + \frac{(\textbf{p}_1\cdot \textbf{p}_2)}{m_{1} m_{2}} \Bigl (-54 \mu _1^{(2)} - 144 \sigma _1^{(2)}\Bigr )\biggl ] + \frac{(\textbf{p}_1\cdot \textbf{p}_2)^2}{m_{1}^2 m_{2}^2} \Bigl (- \frac{45}{2} \mu _1^{(2)} - 60 \sigma _1^{(2)}\Bigr ) \\&\quad + \frac{(\textbf{n}_{12}\cdot \textbf{p}_1)^3 (\textbf{n}_{12}\cdot \textbf{p}_2)}{m_{1}^3 m_{2}} \Bigl (18 \mu _1^{(2)} + 48 \sigma _1^{(2)}\Bigr ) \\&\quad + \frac{\textbf{p}_1^2}{m_{1}^2} \biggl [\frac{(\textbf{n}_{12}\cdot \textbf{p}_2)^2}{m_{2}^2} \Bigl (\frac{45}{2} \mu _1^{(2)} + 66 \sigma _1^{(2)}\Bigr ) + \frac{\textbf{p}_2^2}{m_{2}^2} \Bigl (- \frac{45}{4} \mu _1^{(2)} - 30 \sigma _1^{(2)}\Bigr ) \\&\quad + \frac{(\textbf{p}_1\cdot \textbf{p}_2)}{m_{1} m_{2}} \Bigl (\frac{81}{4} \mu _1^{(2)} + 48 \sigma _1^{(2)}\Bigr )\biggl ] + \frac{(\textbf{n}_{12}\cdot \textbf{p}_1)}{m_{1}} \biggl (\frac{(\textbf{n}_{12}\cdot \textbf{p}_2)^3}{m_{2}^3} \Bigl (54 \mu _1^{(2)} + 144 \sigma _1^{(2)}\Bigr ) \\&\quad + \frac{(\textbf{n}_{12}\cdot \textbf{p}_2)\textbf{p}_1^2}{m_{1}^2 m_{2}} \Bigl (- \frac{63}{2} \mu _1^{(2)} - 48 \sigma _1^{(2)}\Bigr ) \\&\quad + \frac{(\textbf{n}_{12}\cdot \textbf{p}_2)}{m_{2}} \biggl [\frac{\textbf{p}_2^2}{m_{2}^2} \Bigl (-36 \mu _1^{(2)} - 60 \sigma _1^{(2)}\Bigr ) + \frac{(\textbf{p}_1\cdot \textbf{p}_2)}{m_{1} m_{2}} \Bigl (45 \mu _1^{(2)} + 120 \sigma _1^{(2)}\Bigr )\biggl ]\biggl ) \\&\quad + \frac{(\textbf{n}_{12}\cdot \textbf{p}_1)^4}{m_{1}^4} \Bigl (- 
\frac{9}{2} \mu _1^{(2)} - 12 \sigma _1^{(2)}\Bigr ) + \frac{(\textbf{p}_1^2)^2}{m_{1}^4} \Bigl (- \frac{45}{16} \mu _1^{(2)} - 6 \sigma _1^{(2)}\Bigr ) \\&\quad + \frac{(\textbf{n}_{12}\cdot \textbf{p}_1)^2}{m_{1}^2} \biggl [\frac{(\textbf{n}_{12}\cdot \textbf{p}_2)^2}{m_{2}^2} \Bigl (-45 \mu _1^{(2)} - 120 \sigma _1^{(2)}\Bigr ) + \frac{\textbf{p}_2^2}{m_{2}^2} \Bigl (9 \mu _1^{(2)} + 24 \sigma _1^{(2)}\Bigr ) \\&\quad + \frac{(\textbf{p}_1\cdot \textbf{p}_2)}{m_{1} m_{2}} \Bigl (-18 \mu _1^{(2)} - 48 \sigma _1^{(2)}\Bigr ) + \frac{\textbf{p}_1^2}{m_{1}^2} \Bigl (\frac{27}{4} \mu _1^{(2)} + 18 \sigma _1^{(2)}\Bigr )\biggl ] \\&\quad + \frac{G}{r_{12}} \biggl (m_{1} \biggl [\frac{(\textbf{n}_{12}\cdot \textbf{p}_2)^2}{m_{2}^2} \Bigl (207 \mu _1^{(2)} - 80 \sigma _1^{(2)}\Bigr ) + \frac{\textbf{p}_2^2}{m_{2}^2} \Bigl (- \frac{45}{2} \mu _1^{(2)} + 80 \sigma _1^{(2)}\Bigr ) \\&\quad + \frac{(\textbf{n}_{12}\cdot \textbf{p}_1)(\textbf{n}_{12}\cdot \textbf{p}_2)}{m_{1} m_{2}} \Bigl (- \frac{1341}{8} \mu _1^{(2)} + 172 \sigma _1^{(2)}\Bigr ) + \frac{(\textbf{p}_1\cdot \textbf{p}_2)}{m_{1} m_{2}} \Bigl (\frac{3}{8} \mu _1^{(2)} - 172 \sigma _1^{(2)}\Bigr ) \\&\quad + \frac{(\textbf{n}_{12}\cdot \textbf{p}_1)^2}{m_{1}^2} \Bigl (- \frac{183}{2} \mu _1^{(2)} - 92 \sigma _1^{(2)}\Bigr ) + \frac{\textbf{p}_1^2}{m_{1}^2} \Bigl (\frac{123}{4} \mu _1^{(2)} + 92 \sigma _1^{(2)}\Bigr )\biggl ] \\&\quad + m_{2} \biggl [\frac{(\textbf{n}_{12}\cdot \textbf{p}_2)^2}{m_{2}^2} \Bigl (\frac{331}{2} \mu _1^{(2)} - 120 \sigma _1^{(2)}\Bigr ) + \frac{\textbf{p}_2^2}{m_{2}^2} \Bigl (\frac{61}{4} \mu _1^{(2)} + 120 \sigma _1^{(2)}\Bigr ) \\&\quad + \frac{(\textbf{n}_{12}\cdot \textbf{p}_1)(\textbf{n}_{12}\cdot \textbf{p}_2)}{m_{1} m_{2}} \Bigl (- \frac{1189}{8} \mu _1^{(2)} + 228 \sigma _1^{(2)}\Bigr ) + \frac{(\textbf{p}_1\cdot \textbf{p}_2)}{m_{1} m_{2}} \Bigl (- \frac{401}{8} \mu _1^{(2)} - 228 \sigma _1^{(2)}\Bigr ) \\&\quad + \frac{(\textbf{n}_{12}\cdot \textbf{p}_1)^2}{m_{1}^2} \Bigl 
(- \frac{81}{2} \mu _1^{(2)} - 108 \sigma _1^{(2)}\Bigr ) + \frac{\textbf{p}_1^2}{m_{1}^2} \Bigl (\frac{135}{4} \mu _1^{(2)} + 108 \sigma _1^{(2)}\Bigr )\biggl ]\biggl ) \\&\quad + \frac{G^2}{r_{12}^2} \Bigl (\frac{303}{28}m_{1}^2 - \frac{455}{8}m_{1}m_{2} - 39m_{2}^2\Bigr )\mu _1^{(2)}. \end{aligned}$$
(8.7b)
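As a hedged illustration of how Eqs. (8.6) and (8.7a) are assembled, the sketch below evaluates the tidal Hamiltonian truncated at NLO, i.e., with \(w_{\text {NNLO}}\) and the octupole term dropped; this truncation is a simplification made here, not part of the source. Function names and the tuple layout of the deformability coefficients are hypothetical, \(\textbf{n}_{12}=(\textbf{x}_1-\textbf{x}_2)/r_{12}\) is assumed, and the \((1\leftrightarrow 2)\) contribution is obtained by swapping all body labels.

```python
import numpy as np

def w_nlo(n12, p1, p2, m1, m2, mu1, sigma1, r12, G=1.0):
    """NLO coefficient w_NLO of Eq. (8.7a) for the deformation of body 1."""
    n1, n2 = n12 @ p1, n12 @ p2
    p1p2, p1sq, p2sq = p1 @ p2, p1 @ p1, p2 @ p2
    return (
        -12.0 * sigma1 * p2sq / m2**2
        + n2**2 / m2**2 * (-18.0 * mu1 + 12.0 * sigma1)
        + n1 * n2 / (m1 * m2) * (18.0 * mu1 - 24.0 * sigma1)
        + p1p2 / (m1 * m2) * (4.5 * mu1 + 24.0 * sigma1)
        + n1**2 / m1**2 * (4.5 * mu1 + 12.0 * sigma1)
        + p1sq / m1**2 * (-3.75 * mu1 - 12.0 * sigma1)
        + G / r12 * (3.0 * m1 + 10.5 * m2) * mu1
    )

def h_tidal_nlo(x1, x2, p1, p2, m1, m2, mu, sigma, G=1.0, c=1.0):
    """Eq. (8.6) truncated at NLO (w_NNLO and the octupole term are omitted).

    mu = (mu_1^(2), mu_2^(2)) and sigma = (sigma_1^(2), sigma_2^(2)).
    """
    x1, x2, p1, p2 = (np.asarray(v, dtype=float) for v in (x1, x2, p1, p2))
    sep = x1 - x2
    r12 = np.linalg.norm(sep)
    n12 = sep / r12  # assumed sign convention

    def one_side(n, pa, pb, ma, mb, mu_a, sigma_a):
        # Contribution from the tidal deformation of body a in the field of body b.
        prefactor = G**2 * mb**2 / r12**6
        return prefactor * (-1.5 * mu_a
                            + w_nlo(n, pa, pb, ma, mb, mu_a, sigma_a, r12, G) / c**2)

    # Second call implements the (1 <-> 2) exchange, including n12 -> n21 = -n12.
    return (one_side(n12, p1, p2, m1, m2, mu[0], sigma[0])
            + one_side(-n12, p2, p1, m2, m1, mu[1], sigma[1]))
```

In the static limit \(\textbf{p}_1=\textbf{p}_2=0\) only the Newtonian-like term \(-\frac{3}{2}\mu _1^{(2)}\) and the \(G/r_{12}\) term of \(w_{\text {NLO}}\) survive, which provides a simple consistency check.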

The NNNLO tidal effects were recently computed in Mandal et al. (2023).