Metric Theories of Gravity and the PPN Formalism

3 Metric Theories of Gravity and the PPN Formalism

3.1 Metric theories of gravity and the strong equivalence principle

3.1.1 Universal coupling and the metric postulates

The empirical evidence supporting the Einstein equivalence principle, discussed in Section 2, supports the conclusion that the only theories of gravity that have a hope of being viable are metric theories, or possibly theories that are metric apart from very weak or short-range non-metric couplings (as in string theory). Therefore for the remainder of this review, we shall turn our attention exclusively to metric theories of gravity, which assume that

there exists a symmetric metric,
test bodies follow geodesics of the metric, and
in local Lorentz frames, the non-gravitational laws of physics are those of special relativity.

The property that all non-gravitational fields should couple in the same manner to a single gravitational field is sometimes called “universal coupling”. Because of it, one can discuss the metric as a property of spacetime itself rather than as a field over spacetime. This is because its properties may be measured and studied using a variety of different experimental devices, composed of different non-gravitational fields and particles, and, because of universal coupling, the results will be independent of the device. Thus, for instance, the proper time between two events is a characteristic of spacetime and of the location of the events, not of the clocks used to measure it.

Consequently, if EEP is valid, the non-gravitational laws of physics may be formulated by taking their special relativistic forms in terms of the Minkowski metric $η$ and simply “going over” to new forms in terms of the curved spacetime metric $g$ , using the mathematics of differential geometry. The details of this “going over” can be found in standard textbooks (see [289 *, 407 , 355 , 324 *], TEGP 3.2. [420 *]).

3.1.2 The strong equivalence principle

In any metric theory of gravity, matter and non-gravitational fields respond only to the spacetime metric $g$ . In principle, however, there could exist other gravitational fields besides the metric, such as scalar fields, vector fields, and so on. If, by our strict definition of metric theory, matter does not couple to these fields, what can their role in gravitation theory be? Their role must be that of mediating the manner in which matter and non-gravitational fields generate gravitational fields and produce the metric; once determined, however, the metric alone acts back on the matter in the manner prescribed by EEP.

What distinguishes one metric theory from another, therefore, is the number and kind of gravitational fields it contains in addition to the metric, and the equations that determine the structure and evolution of these fields. From this viewpoint, one can divide all metric theories of gravity into two fundamental classes: “purely dynamical” and “prior-geometric”.

By “purely dynamical metric theory” we mean any metric theory whose gravitational fields have their structure and evolution determined by coupled partial differential field equations. In other words, the behavior of each field is influenced to some extent by a coupling to at least one of the other fields in the theory. By “prior geometric” theory, we mean any metric theory that contains “absolute elements”, fields or equations whose structure and evolution are given a priori, and are independent of the structure and evolution of the other fields of the theory. These “absolute elements” typically include flat background metrics $η$ or cosmic time coordinates $t$ .

General relativity is a purely dynamical theory since it contains only one gravitational field, the metric itself, and its structure and evolution are governed by partial differential equations (Einstein’s equations). Brans–Dicke theory and its generalizations are purely dynamical theories; the field equation for the metric involves the scalar field (as well as the matter as source), and that for the scalar field involves the metric. Visser’s bimetric massive gravity theory [401 *] is a prior-geometric theory: It has a non-dynamical, Riemann-flat background metric $η$ , and the field equations for the physical metric $g$ involve $η$ .

By discussing metric theories of gravity from this broad point of view, it is possible to draw some general conclusions about the nature of gravity in different metric theories, conclusions that are reminiscent of the Einstein equivalence principle, but that are subsumed under the name “strong equivalence principle”.

Consider a local, freely falling frame in any metric theory of gravity. Let this frame be small enough that inhomogeneities in the external gravitational fields can be neglected throughout its volume. On the other hand, let the frame be large enough to encompass a system of gravitating matter and its associated gravitational fields. The system could be a star, a black hole, the solar system, or a Cavendish experiment. Call this frame a “quasi-local Lorentz frame”. To determine the behavior of the system we must calculate the metric. The computation proceeds in two stages. First we determine the external behavior of the metric and gravitational fields, thereby establishing boundary values for the fields generated by the local system, at a boundary of the quasi-local frame “far” from the local system. Second, we solve for the fields generated by the local system. But because the metric is coupled directly or indirectly to the other fields of the theory, its structure and evolution will be influenced by those fields, and in particular by the boundary values taken on by those fields far from the local system. This will be true even if we work in a coordinate system in which the asymptotic form of $gμν$ in the boundary region between the local system and the external world is that of the Minkowski metric. Thus the gravitational environment in which the local gravitating system resides can influence the metric generated by the local system via the boundary values of the auxiliary fields. Consequently, the results of local gravitational experiments may depend on the location and velocity of the frame relative to the external environment. Of course, local non-gravitational experiments are unaffected since the gravitational fields they generate are assumed to be negligible, and since those experiments couple only to the metric, whose form can always be made locally Minkowskian at a given spacetime event. Local gravitational experiments might include Cavendish experiments, measurement of the acceleration of massive self-gravitating bodies, studies of the structure of stars and planets, or analyses of the periods of “gravitational clocks”. We can now make several statements about different kinds of metric theories.

A theory which contains only the metric $g$ yields local gravitational physics which is independent of the location and velocity of the local system. This follows from the fact that the only field coupling the local system to the environment is $g$ , and it is always possible to find a coordinate system in which $g$ takes the Minkowski form at the boundary between the local system and the external environment (neglecting inhomogeneities in the external gravitational field). Thus the asymptotic values of $gμν$ are constants independent of location, and are asymptotically Lorentz invariant, thus independent of velocity. GR is an example of such a theory.
A theory which contains the metric $g$ and dynamical scalar fields $φA$ yields local gravitational physics which may depend on the location of the frame but which is independent of the velocity of the frame. This follows from the asymptotic Lorentz invariance of the Minkowski metric and of the scalar fields, but now the asymptotic values of the scalar fields may depend on the location of the frame. An example is Brans–Dicke theory, where the asymptotic scalar field determines the effective value of the gravitational constant, which can thus vary as $φ$ varies. On the other hand, a form of velocity dependence in local physics can enter indirectly if the asymptotic values of the scalar field vary with time cosmologically. Then the rate of variation of the gravitational constant could depend on the velocity of the frame.
A theory which contains the metric $g$ and additional dynamical vector or tensor fields or prior-geometric fields yields local gravitational physics which may have both location and velocity-dependent effects. An example is the Einstein-Æther theory, which contains a dynamical timelike four-vector field; the large-scale distribution of matter establishes a frame in which the vector has no spatial components, and systems moving relative to that frame can experience motion-dependent effects.

These ideas can be summarized in the strong equivalence principle (SEP), which states that:

WEP is valid for self-gravitating bodies as well as for test bodies.
The outcome of any local test experiment is independent of the velocity of the (freely falling) apparatus.
The outcome of any local test experiment is independent of where and when in the universe it is performed.

The distinction between SEP and EEP is the inclusion of bodies with self-gravitational interactions (planets, stars) and of experiments involving gravitational forces (Cavendish experiments, gravimeter measurements). Note that SEP contains EEP as the special case in which local gravitational forces are ignored. For further discussion of SEP and EEP, see [128 ].

The above discussion of the coupling of auxiliary fields to local gravitating systems indicates that if SEP is strictly valid, there must be one and only one gravitational field in the universe, the metric $g$ . These arguments are only suggestive however, and no rigorous proof of this statement is available at present. Empirically it has been found that almost every metric theory other than GR introduces auxiliary gravitational fields, either dynamical or prior geometric, and thus predicts violations of SEP at some level (here we ignore quantum-theory inspired modifications to GR involving “ $R2$ ” terms). The one exception is Nordström’s 1913 conformally-flat scalar theory [303 ], which can be written purely in terms of the metric; the theory satisfies SEP, but unfortunately violates experiment by predicting no deflection of light. General relativity seems to be the only viable metric theory that embodies SEP completely. In Section 4.3, we shall discuss experimental evidence for the validity of SEP.

3.2 The parametrized post-Newtonian formalism

Despite the possible existence of long-range gravitational fields in addition to the metric in various metric theories of gravity, the postulates of those theories demand that matter and non-gravitational fields be completely oblivious to them. The only gravitational field that enters the equations of motion is the metric $g$ . The role of the other fields that a theory may contain can only be that of helping to generate the spacetime curvature associated with the metric. Matter may create these fields, and they plus the matter may generate the metric, but they cannot act back directly on the matter. Matter responds only to the metric.

Thus the metric and the equations of motion for matter become the primary entities for calculating observable effects, and all that distinguishes one metric theory from another is the particular way in which matter and possibly other gravitational fields generate the metric.

The comparison of metric theories of gravity with each other and with experiment becomes particularly simple when one takes the slow-motion, weak-field limit. This approximation, known as the post-Newtonian limit, is sufficiently accurate to encompass most solar-system tests that can be performed in the foreseeable future. It turns out that, in this limit, the spacetime metric $g$ predicted by nearly every metric theory of gravity has the same structure. It can be written as an expansion about the Minkowski metric ( $ημν = diag(− 1,1,1,1)$ ) in terms of dimensionless gravitational potentials of varying degrees of smallness. These potentials are constructed from the matter variables (see Box 2) in imitation of the Newtonian gravitational potential

The “order of smallness” is determined according to the rules $2 U ∼ v ∼ Π ∼ p ∕ρ ∼ 𝜖$ , $i 1∕2 v ∼ |d∕dt|∕|d ∕dx| ∼ 𝜖$ , and so on (we use units in which $G = c = 1$ ; see Box 2 for definitions and conventions).

A consistent post-Newtonian limit requires determination of $g00$ correct through $𝒪 (𝜖2)$ , $g0i$ through $𝒪 (𝜖3∕2)$ , and $g ij$ through $𝒪(𝜖)$ (for details see TEGP 4.1 [420 *]). The only way that one metric theory differs from another is in the numerical values of the coefficients that appear in front of the metric potentials. The parametrized post-Newtonian (PPN) formalism inserts parameters in place of these coefficients, parameters whose values depend on the theory under study. In the current version of the PPN formalism, summarized in Box 2, ten parameters are used, chosen in such a manner that they measure or indicate general properties of metric theories of gravity (see Table 2). Under reasonable assumptions about the kinds of potentials that can be present at post-Newtonian order (basically only Poisson-like potentials of conventional perfect fluid sources, absence of parity-violating potentials), one finds that ten PPN parameters exhaust the possibilities.

Table 2: The PPN Parameters and their significance (note that $α3$ has been shown twice to indicate that it is a measure of two effects).

Parameter	What it measures relative to GR	Value in GR	Value in semiconservative theories	Value in fully conservative theories
$γ$	How much space-curvature produced by unit rest mass?	$1$	$γ$	$γ$

$β$	How much “nonlinearity” in the superposition law for gravity?	$1$	$β$	$β$
$ξ$	Preferred-location effects?	$0$	$ξ$	$ξ$
$α1$	Preferred-frame effects?	$0$	$α1$	$0$
$α2$		$0$	$α2$	$0$
$α3$		$0$	$0$	$0$
$α3$	Violation of conservation	$0$	$0$	$0$
$ζ1$	of total momentum?	$0$	$0$	$0$
$ζ 2$		$0$	$0$	$0$
$ζ3$		$0$	$0$	$0$
$ζ4$		$0$	$0$	$0$

The parameters $γ$ and $β$ are the usual Eddington–Robertson–Schiff parameters used to describe the “classical” tests of GR, and are in some sense the most important; they are the only non-zero parameters in GR and scalar–tensor gravity. The parameter $ξ$ is non-zero in any theory of gravity that predicts preferred-location effects such as a galaxy-induced anisotropy in the local gravitational constant $G L$ (also called “Whitehead” effects); $α1$ , $α2$ , $α3$ measure whether or not the theory predicts post-Newtonian preferred-frame effects; $α3$ , $ζ1$ , $ζ2$ , $ζ3$ , $ζ4$ measure whether or not the theory predicts violations of global conservation laws for total momentum. In Table 2 we show the values these parameters take

in GR,
in any theory of gravity that possesses conservation laws for total momentum, called “semi-conservative” (any theory that is based on an invariant action principle is semi-conservative), and
in any theory that in addition possesses six global conservation laws for angular momentum, called “fully conservative” (such theories automatically predict no post-Newtonian preferred-frame effects).

Semi-conservative theories have five free PPN parameters ( $γ$ , $β$ , $ξ$ , $α 1$ , $α 2$ ) while fully conservative theories have three ( $γ$ , $β$ , $ξ$ ).

The PPN formalism was pioneered by Kenneth Nordtvedt [305 ], who studied the post-Newtonian metric of a system of gravitating point masses, extending earlier work by Eddington, Robertson and Schiff (TEGP 4.2 [420 *]). Will [413 ] generalized the framework to perfect fluids. A general and unified version of the PPN formalism was developed by Will and Nordtvedt [431 *]. The canonical version, with conventions altered to be more in accord with standard textbooks such as [289 ], is discussed in detail in TEGP 4 [420 *]. Other versions of the PPN formalism have been developed to deal with point masses with charge, fluid with anisotropic stresses, bodies with strong internal gravity, and post-post-Newtonian effects (TEGP 4.2, 14.2 [420 *]). Additional parameters or potentials are needed to deal with some theories, such as theories with massive fields (Yukawa-type potentials replace Poisson potentials), or theories like Chern–Simons theory, which permit parity violation in gravity.

Box 2. The Parametrized Post-Newtonian formalism

Coordinate system:: The framework uses a nearly globally Lorentz coordinate system in which the coordinates are $(t,x1,x2,x3)$ . Three-dimensional, Euclidean vector notation is used throughout. All coordinate arbitrariness (“gauge freedom”) has been removed by specialization of the coordinates to the standard PPN gauge (TEGP 4.2 [420 *]). Units are chosen so that $G = c = 1$ , where $G$ is the physically measured Newtonian constant far from the solar system.

Matter variables:

$ρ$ : density of rest mass as measured in a local freely falling frame momentarily comoving with the gravitating matter.
$i i v = (dx ∕dt)$ : coordinate velocity of the matter.
$wi$ : coordinate velocity of the PPN coordinate system relative to the mean rest-frame of the universe.
$p$ : pressure as measured in a local freely falling frame momentarily comoving with the matter.
$Π$ : internal energy per unit rest mass (it includes all forms of non-rest-mass, non-gravitational energy, e.g., energy of compression and thermal energy).

PPN parameters:: $γ$ , $β$ , $ξ$ , $α1$ , $α2$ , $α3$ , $ζ1$ , $ζ2$ , $ζ3$ , $ζ4$ .

Metric:

Metric potentials:

Stress–energy tensor (perfect fluid):

Equations of motion:

Stressed matter: $μν T ;ν = 0$ .
Test bodies: $d2xμ- μ dx-νdxλ- dλ2 + Γ νλ dλ d λ = 0$ .
Maxwell’s equations: $F μν;ν = 4πJ μ, Fμν = A ν;μ − Aμ;ν$ .

3.3 Competing theories of gravity

One of the important applications of the PPN formalism is the comparison and classification of alternative metric theories of gravity. The population of viable theories has fluctuated over the years as new effects and tests have been discovered, largely through the use of the PPN framework, which eliminated many theories thought previously to be viable. The theory population has also fluctuated as new, potentially viable theories have been invented.

In this review, we shall focus on GR, the general class of scalar–tensor modifications of it, of which the Jordan–Fierz–Brans–Dicke theory (Brans–Dicke, for short) is the classic example, and vector-tensor theories. The reasons are several-fold:

A full compendium of alternative theories circa 1981 is given in TEGP 5 [420 *].
Many alternative metric theories developed during the 1970s and 1980s could be viewed as “straw-man” theories, invented to prove that such theories exist or to illustrate particular properties. Few of these could be regarded as well-motivated theories from the point of view, say, of field theory or particle physics.
A number of theories fall into the class of “prior-geometric” theories, with absolute elements such as a flat background metric in addition to the physical metric. Most of these theories predict “preferred-frame” effects, that have been tightly constrained by observations (see Section 4.3.2). An example is Rosen’s bimetric theory.
A large number of alternative theories of gravity predict gravitational wave emission substantially different from that of GR, in strong disagreement with observations of the binary pulsar (see Section 9).
Scalar–tensor modifications of GR have become very popular in unification schemes such as string theory, and in cosmological model building. Because the scalar fields could be massive, the potentials in the post-Newtonian limit could be modified by Yukawa-like terms.
Theories that also incorporate vector fields have attracted recent attention, in the spirit of the SME (see Section 2.2.4), as models for violations of Lorentz invariance in the gravitational sector, and as potential candidates to account for phenomena such as galaxy rotation curves without resorting to dark matter.

3.3.1 General relativity

The metric $g$ is the sole dynamical field, and the theory contains no arbitrary functions or parameters, apart from the value of the Newtonian coupling constant $G$ , which is measurable in laboratory experiments. Throughout this article, we ignore the cosmological constant $ΛC$ . We do this despite recent evidence, from supernova data, of an accelerating universe, which would indicate either a non-zero cosmological constant or a dynamical “dark energy” contributing about 70 percent of the critical density. Although $ΛC$ has significance for quantum field theory, quantum gravity, and cosmology, on the scale of the solar-system or of stellar systems its effects are negligible, for the values of $ΛC$ inferred from supernova observations.

The field equations of GR are derivable from an invariant action principle $δI = 0$ , where

where $R$ is the Ricci scalar, and $I m$ is the matter action, which depends on matter fields $ψ m$ universally coupled to the metric $g$ . By varying the action with respect to $gμν$ , we obtain the field equations

where $Tμν$ is the matter energy-momentum tensor. General covariance of the matter action implies the equations of motion $μν T ;ν = 0$ ; varying $Im$ with respect to $ψm$ yields the matter field equations of the standard model. By virtue of the absence of prior-geometric elements, the equations of motion are also a consequence of the field equations via the Bianchi identities $G μν = 0 ;ν$ . According to our choice of units, we set $G = 1$ .

The general procedure for deriving the post-Newtonian limit of metric theories is spelled out in TEGP 5.1 [420 *], and is described in detail for GR in TEGP 5.2 [420 *] (see also Chapters 6 – 8 of [324 *]). The PPN parameter values are listed in Table 3.

Table 3: Metric theories and their PPN parameter values ( $α3 = ζi = 0$ for all cases). The parameters $γ′$ , $β ′$ , $α′ 1$ , and $α ′ 2$ denote complicated functions of the arbitrary constants and matching parameters.

Theory	Arbitrary	Cosmic	PPN parameters
	functions	matching	_______________________________________
	or constants	parameters	$γ$	$β$	$ξ$	$α1$	$α2$

General relativity	none	none	$1$	$1$	$0$	$0$	$0$

Scalar–tensor
Brans–Dicke	$ωBD$	$ϕ0$	$1+-ωBD-- 2+ ωBD$	$1$	$0$	$0$	$0$
General, $f(R )$	$A (φ)$ , $V (φ)$	$φ0$	$1+-ω- 2+ ω$	$---λ--- 1+ 4 + 2ω$	$0$	$0$	$0$


Vector–tensor
Unconstrained	$ω, c1,c2,c3,c4$	$u$	$′ γ$	$′ β$	$0$	$′ α 1$	$′ α2$
Einstein-Æther	$c1,c2,c3,c4$	none	$1$	$1$	$0$	$′ α 1$	$′ α2$
Khronometric	$α ,β ,λ k k k$	none	$1$	$1$	$0$	$α ′ 1$	$α′ 2$


Tensor–Vector–Scalar	$k,c,c ,c ,c 1 2 3 4$	$ϕ 0$	$1$	$1$	$0$	$α ′ 1$	$α′ 2$

3.3.2 Scalar–tensor theories

These theories contain the metric $g$ , a scalar field $φ$ , a potential function $V(φ )$ , and a coupling function $A (φ)$ (generalizations to more than one scalar field have also been carried out [102 *]). For some purposes, the action is conveniently written in a non-metric representation, sometimes denoted the “Einstein frame”, in which the gravitational action looks exactly like that of GR and the scalar action looks like a minimally coupled scalar field with a potential:

where $&tidle;R ≡ g&tidle;μν &tidle;Rμν$ is the Ricci scalar of the “Einstein” metric $g&tidle;μν$ . (Apart from the scalar potential term $V(φ )$ , this corresponds to Eq. (28 *) with $&tidle;G (φ ) ≡ (4πG )−1$ , $U(φ ) ≡ 1$ , and $&tidle;M (φ) ∝ A (φ)$ .) This representation is a “non-metric” one because the matter fields $ψm$ couple to a combination of $φ$ and $&tidle;gμν$ . Despite appearances, however, it is a metric theory, because it can be put into a metric representation by identifying the “physical metric”

The action can then be rewritten in the metric form

where

ϕ ≡ A(φ )−2, − 2 3 + 2ω (ϕ) ≡ α(φ ) , (36 ) d(lnA (φ)) α (φ ) ≡ ----------. dφ

The Einstein frame is useful for discussing general characteristics of such theories, for numerical relativity calculations, and for some cosmological applications, while the metric representation is most useful for calculating observable effects. The field equations, post-Newtonian limit and PPN parameters are discussed in TEGP 5.3 [420 *] (see also Section 13.5 of [324 *]), and the values of the PPN parameters are listed in Table 3.

The parameters that enter the post-Newtonian limit are

where $ϕ0$ is the value of $ϕ$ today far from the system being studied, as determined by appropriate cosmological boundary conditions. The Newtonian gravitational constant $G N$ , which is set equal to unity by our choice of units, is related to the coupling constant $G$ , $ϕ0$ and $ω$ by

In Brans–Dicke theory ( $ω(ϕ) ≡ ωBD = const.$ ), the larger the value of $ωBD$ , the smaller the effects of the scalar field, and in the limit $ωBD → ∞$ ( $α0 → 0$ ), the theory becomes indistinguishable from GR in all its predictions. In more general theories, the function $ω (ϕ)$ could have the property that, at the present epoch, and in weak-field situations, the value of the scalar field $ϕ0$ is such that $ω$ is very large and $λ$ is very small (theory almost identical to GR today), but that for past or future values of $ϕ$ , or in strong-field regions such as the interiors of neutron stars, $ω$ and $λ$ could take on values that would lead to significant differences from GR. It is useful to point out that all versions of scalar–tensor gravity predict that $γ ≤ 1$ (see Table 3).

Damour and Esposito-Farèse [102 *] have adopted an alternative parametrization of scalar–tensor theories, in which one expands $lnA (φ )$ about a cosmological background field value $φ0$ :

A precisely linear coupling function produces Brans–Dicke theory, with $2 α0 = 1∕(2ωBD + 3)$ , or $1∕(2 + ωBD ) = 2α20∕ (1 + α20)$ . The function $ln A(φ )$ acts as a potential for the scalar field $φ$ within matter, and, if $β0 > 0$ , then during cosmological evolution, the scalar field naturally evolves toward the minimum of the potential, i.e., toward $α ≈ 0$ , $ω → ∞$ , or toward a theory close to, though not precisely GR [112 , 113 ]. Estimates of the expected relic deviations from GR today in such theories depend on the cosmological model, but range from $−5 10$ to a few times $−7 10$ for $|γ − 1|$ .

Negative values of $β0$ correspond to a “locally unstable” scalar potential (the overall theory is still stable in the sense of having no tachyons or ghosts). In this case, objects such as neutron stars can experience a “spontaneous scalarization”, whereby the interior values of $φ$ can be very different from the exterior values, through non-linear interactions between strong gravity and the scalar field, dramatically affecting the stars’ internal structure and leading to strong violations of SEP [103 , 104 ]. There is evidence from recent numerical simulations of the occurrence of a dynamically induced scalarization during the inspirals of compact binary systems containing neutron stars, which can affect both the final motion and the gravitational-wave emission [32 , 313 , 364 ].

On the other hand, in the case $β0 < 0$ , one must confront that fact that, with an unstable $φ$ potential, cosmological evolution would presumably drive the system away from the peak where $α ≈ 0$ , toward parameter values that could be excluded by solar system experiments.

Scalar fields coupled to gravity or matter are also ubiquitous in particle-physics-inspired models of unification, such as string theory [384 , 266 , 117 , 114 , 115 ]. In some models, the coupling to matter may lead to violations of EEP, which could be tested or bounded by the experiments described in Section 2.1. In many models the scalar field could be massive; if the Compton wavelength is of macroscopic scale, its effects are those of a “fifth force”. Only if the theory can be cast as a metric theory with a scalar field of infinite range or of range long compared to the scale of the system in question (solar system) can the PPN framework be strictly applied. If the mass of the scalar field is sufficiently large that its range is microscopic, then, on solar-system scales, the scalar field is suppressed, and the theory is essentially equivalent to general relativity.

For a detailed review of scalar–tensor theories see [167 ].

3.3.3 f(R) theories

These are theories whose action has the form

where $f$ is a function chosen so that at cosmological scales, the universe will experience accelerated expansion without resorting to either a cosmological constant or dark energy. However, it turns out that such theories are equivalent to scalar–tensor theories: replace $f (R)$ by $f (χ ) − f (χ)(R − χ) ,χ$ , where $χ$ is a dynamical scalar field. Varying the action with respect to $χ$ yields $f,χχ(R − χ) = 0$ , which implies that $χ = R$ as long as $f,χχ ⁄= 0$ . Then defining a scalar field $ϕ ≡ f,χ(χ)$ one puts the action into the form of a scalar–tensor theory given by Eq. (35 *), with $ω (ϕ ) = 0$ and $ϕ2V = ϕχ (ϕ) − f(χ(ϕ ))$ . As we will see, this value of $ω$ would ordinarily strongly violate solar-system experiments, but it turns out that in many models, the potential $V (ϕ)$ has the effect of giving the scalar field a large effective mass in the presence of matter (the so-called “chameleon mechanism” [216 ]), so that the scalar field is suppressed at distances that extend outside bodies like the Sun and Earth. In this way, with only modest fine tuning, $f (R )$ theories can claim to obey standard tests, while providing interesting, non general-relativistic behavior on cosmic scales. For detailed reviews of this class of theories, see [370 ] and [122 ].

3.3.4 Vector–tensor theories

These theories contain the metric $g$ and a dynamical, typically timelike, four-vector field $μ u$ . In some models, the four-vector is unconstrained, while in others, called Einstein-Æther theories it is constrained to be timelike with unit norm. The most general action for such theories that is quadratic in derivatives of the vector is given by

where

The coefficients $ci$ are arbitrary. In the unconstrained theories, $λ ≡ 0$ and $ω$ is arbitrary. In the constrained theories, $λ$ is a Lagrange multiplier, and by virtue of the constraint $μ uμu = − 1$ , the factor $ωu μuμ$ in front of the Ricci scalar can be absorbed into a rescaling of $G$ ; equivalently, in the constrained theories, we can set $ω = 0$ . Note that the possible term $uμu νRμν$ can be shown under integration by parts to be equivalent to a linear combination of the terms involving $c 2$ and $c3$ .

Unconstrained theories were studied during the 1970s as “straw-man” alternatives to GR. In addition to having up to four arbitrary parameters, they also left the magnitude of the vector field arbitrary, since it satisfies a linear homogenous vacuum field equation of the form $ℒu μ = 0$ ( $c4 = 0$ in all such cases studied). Indeed, this latter fact was one of most serious defects of these theories. Each unconstrained theory studied corresponds to a special case of the action (41 *), all with $λ ≡ 0$ :

General vector–tensor theory; $ω$ , $τ$ , $𝜖$ , $η$: The gravitational Lagrangian for this class of theories had the form $μ μ ν μν μ ν R + ωuμu R + ηu u R μν − 𝜖F μνF + τ ∇μu ν∇ u$ , where $F μν = ∇ μuν − ∇ νuμ$ , corresponding to the values $c1 = 2 𝜖 − τ$ , $c2 = − η$ , $c1 + c2 + c3 = − τ$ , $c4 = 0$ . In these theories $γ$ , $β$ , $α1$ , and $α2$ are complicated functions of the parameters and of $u2 = − uμu μ$ , while the rest vanish (see TEGP 5.4 [420 *]).

Will–Nordtvedt theory: This is the special case $c1 = − 1$ , $c2 = c3 = c4 = 0$ . In this theory, the PPN parameters are given by $γ = β = 1$ , $α = u2∕(1 + u2∕2) 2$ , and zero for the rest [431 ]).

Hellings–Nordtvedt theory; $ω$: This is the special case $c1 = 2$ , $c2 = 2ω$ , $c1 + c2 + c3 = 0 = c4$ . Here $γ$ , $β$ , $α1$ and $α 2$ are complicated functions of the parameters and of $u2$ , while the rest vanish [187 ].
Einstein-Æther theory; $c1,c2, c3,c4$: The Einstein-Æther theories were motivated in part by a desire to explore possibilities for violations of Lorentz invariance in gravity, in parallel with similar studies in matter interactions, such as the SME. The general class of theories was analyzed by Jacobson and collaborators [204 , 274 , 205 *, 147 , 163 *], motivated in part by [230 ]. Analyzing the post-Newtonian limit,¹ they were able to infer values of the PPN parameters $γ$ and $β$ as follows [163 ]:

where $c123 = c1 + c2 + c3$ , $c13 = c1 + c3$ , $c14 = c1 + c4$ , subject to the constraints $c123 ⁄= 0$ , $c14 ⁄= 2$ , $2c1 − c21 + c23 ⁄= 0$ . By requiring that gravitational-wave modes have real (as opposed to imaginary) frequencies, one can impose the bounds $c1∕c14 ≥ 0$ and $c123∕c14 ≥ 0$ . Considerations of positivity of energy impose the constraints $c1 > 0$ , $c14 > 0$ and $c123 > 0$ .
Khronometric theory; $α ,β ,λ K K K$: This is the low-energy limit of “Hořava gravity”, a proposal for a gravity theory that is power-counting renormalizable [190 ]. The vector field is required to be hypersurface orthogonal ( $uα ∝ ∇ αT$ , where $T$ is a scalar field related to a preferred time direction; equivalently the twist $ωαβ = ∇ [αuβ] + u [αa β]$ must vanish, where $aβ = uμ∇ μu β$ ), so that higher spatial derivative terms could be introduced to effectuate renormalizability. A “healthy” version of the theory [61 , 62 ] can be shown to correspond to the values $c1 = − 𝜖$ $c2 = λK$ , $c3 = βK + 𝜖$ and $c4 = αK + 𝜖$ , where the limit $𝜖 → ∞$ is to be taken. (The idea is to extract $𝜖$ times $ω αβωα β$ from the Einstein-Æther action and let $𝜖 → ∞$ to enforce the twist-free condition [203 ].) In this case $α 1$ and $α 2$ are given by

3.3.5 Tensor–vector–scalar (TeVeS) theories

This class of theories was invented to provide a fully relativistic theory of gravity that could mimic the phenomenological behavior of so-called Modified Newtonian Dynamics (MOND). MOND is a phenomenological mechanism [283 ] whereby Newton’s equation of motion $2 a = Gm ∕r$ holds as long as $a$ is large compared to some fundamental scale $a0$ , but in a regime where $a < a0$ , the equation of motion takes the form $a2∕a0 = Gm ∕r2$ . With such a behavior, the rotational velocity of a particle far from a central mass would have the form $v ∼ √ar--∼ (Gma0 )1∕4$ , thus reproducing the flat rotation curves observed for spiral galaxies, without invoking a distribution of dark matter.

Devising a relativistic theory that would embody the MOND phenomenology turned out to be no simple matter, and the final result, TeVeS was rather complicated [36 ]. Furthermore, it was shown to have unexpected singular behavior that was most simply cured by incorporating features of the Einstein-Æther theory [366 ]. The extended theory is based on an “Einstein” metric $&tidle;gμν$ , related to the physical metric $g μν$ by

where $μ u$ is a vector field, and $ϕ$ is a scalar field. The action for gravity is the standard GR action of Eq. (31 *), but defined using the Einstein metric $&tidle;gμν$ , while the matter action is that of a standard metric theory, using $gμν$ . These are supplemented by the vector action, given by that of Einstein-Æther theory, Eq. (41 *), and a scalar action, given by

where $k$ is a constant, $ℓ$ is a distance, and $hμν ≡ g&tidle;μν − u μuν$ , indices being raised and lowered using the Einstein metric. The function $ℱ (y)$ is chosen so that $μ (y ) ≡ d ℱ ∕dy$ is unity in the high-acceleration, or normal Newtonian and post-Newtonian regimes, and nearly zero in the MOND regime.

The PPN parameters of the theory [346 ] have the values $γ = β = 1$ and $ξ = α3 = ζi = 0$ , while the parameters $α1$ and $α2$ are given by

κc (2 − c ) − c sinh 4ϕ + 2(1 − c) sinh2 2ϕ α1 = (α1)Æ − 16G --1------14-----3-------02---2------1--------0-, (52 ) ( 2c1 − c1 + c3 ) α2 = (α2)Æ − 2G A1κ − 2A2 sinh 4ϕ0 − A3 sinh22ϕ0 , (53 )

where $(α1)Æ$ and $(α2)Æ$ are given by Eqs. (46 *) and (47 *), where

2 A1 ≡ (2c13 −-c14)-+ 4c1(2-−-c14)-− 6-(1-+-c13-−-c14) , (54 ) c123(2 − c14) 2c1 − c21 + c23 2 − c14 (2c − c )2 4(1 − c ) 2(1 − c )( 2 3 ) A2 ≡ ---13----14--2 − -------21--2-+ -------13- ----+ ------- , (55 ) c123(2 − c14) 2c1 − c1 + c3 2 − c14 ( c123 2 − c14 ) (2c13 − c14)2 4c3 2 3(1 − c13) 2c13 − c14 A3 ≡ c--(2-−-c--)2 + 2c-−--c2-+-c2-+ (2-−-c--) ---c------− -2-−-c---- , (56 ) 123 14 1 1 3 14 123 14

where $κ ≡ k∕8 π$ ,

and $ϕ0$ is the asymptotic value of the scalar field. In the limit $κ → 0$ and $ϕ0 → 0$ , $α1$ and $α2$ reduce to their Einstein-Æther forms.

However, these PPN parameter values are computed in the limit where the function $ℱ (y)$ is a linear function of its argument $y = kℓ2hμνϕ,μϕ,ν$ . When one takes into account the fact that the function $μ (y ) = dℱ ∕dy$ must interpolate between unity and zero to reach the MOND regime, it has been found that the dynamics of local systems is more strongly affected by the fields of surrounding matter than was anticipated. This “external field effect” (EFE) [284 , 57 , 58 ] produces a quadrupolar contribution to the local Newtonian gravitational potential that depends on the external distribution of matter and on the shape of the function $μ(y)$ , and that can be significantly larger than the galactic tidal contribution. Although the calculations of EFE have been carried out using phenomenological MOND equations, it should be a generic phenomenon, applicable to TeVeS as well. Analysis of the orbit of Saturn using Cassini data has placed interesting constraints on the MOND interpolating function $μ (y )$ [186 ].

For thorough reviews of MOND and TeVeS, and their confrontation with the dark-matter paradigm, see [367 , 150 ].

3.3.6 Quadratic gravity and Chern–Simons theories

Quadratic gravity is a recent incarnation of an old idea of adding to the action of GR terms quadratic in the Riemann and Ricci tensors or the Ricci scalar, as “effective field theory” models for more fundamental string or quantum gravity theories. The general action for such theories can be written as

∫ [ I = κR + α1f1(ϕ)R2 + α2f2 (ϕ)RαβR αβ + α3f3 (ϕ )RαβγδR αβγδ + α4f4 (ϕ)∗RR ( ) ] β μν 1∕2 4 − -- g ∂μϕ ∂νϕ + 2V (ϕ) (− g) d x + Im (ψm,gμν), (58 ) 2

where $κ = (16πG )−1$ , $ϕ$ is a scalar field, $αi$ are dimensionless coupling constants (if the functions $fi(ϕ)$ are dimensionless), and $β$ is a constant whose dimension depends on that of $ϕ$ , and where $∗ ∗ α γδ β RR ≡ R β R αγδ$ , where $∗ α γδ 1 γδρσ α R β ≡ 2𝜖 R βρσ$ is the dual Riemann tensor.

One challenge inherent in these theories is to find an argument or a mechanism that evades making the natural choice for each of the $α$ parameters to be of order unity. Such a choice makes the effects of the additional terms essentially unobservable in most laboratory or astrophysical situations because of the enormous scale of $κ ∝ 1∕ℓ2Planck$ in the leading term. This class of theories is too vast and diffuse to cover in this review, and no comprehensive review is available, to our knowledge.

Chern–Simons gravity is the special case of this class of theories in which only the parity-violating term $∗RR$ is present ( $α1 = α2 = α3 = 0$ ) [201 ]. It can arise in various anomaly cancellation schemes in the standard model of particle physics, in cancelling the Green–Schwarz anomaly in string theory, or in effective field theories of inflation [408 ]. It can also arise in loop quantum gravity [382 , 276 ]. The action in this case is given by

where $α$ and $β$ are coupling constants with dimensions $ℓA$ , and $ℓ2A−2$ , assuming that the scalar field has dimensions $ℓ−A$ .

There are two different versions of Chern–Simons theory, a non-dynamical version in which $β = 0$ , so that $ϕ$ , given a priori as some specified function of spacetime, plays the role of a Lagrange multiplier enforcing the constraint $∗RR = 0$ , and a dynamical version, in which $β ⁄= 0$ .

The PPN parameters for a non-dynamical version of the theory with $α = κ$ and $β = 0$ are identical to those of GR; however, there is an additional, parity-even potential in the $g0i$ component of the metric that does not appear in the standard PPN framework, given by

Unfortunately, the non-dynamical version has been shown to be unstable [137 ], while the dynamical version is sufficiently complex that its observable consequences have been analyzed for only special situations [6 , 444 ]. Alexander and Yunes [5 ] give a thorough review of Chern–Simons gravity.

Einstein-Dilaton-Gauss–Bonnet gravity is another special case, in which the Chern–Simons term is neglected ( $α4 = 0$ ), and the three other curvature-squared terms collapse to the Gauss–Bonnet invariant, $R2 − 4R αβRαβ + R αβγδR αβγδ$ , i.e. $f1(ϕ ) = f2(ϕ) = f3(ϕ)$ and $α1 = − α2 ∕4 = α3$ (see [292 , 314 ]).

3.3.7 Massive gravity

Massive gravity theories attempt to give the putative “graviton” a mass. The simplest attempt to implement this in a ghost-free manner suffers from the so-called van Dam–Veltman–Zakharov (vDVZ) discontinuity [398 , 453 ]. Because of the 3 additional helicity states available to the massive spin-2 graviton, the limit of small graviton mass does not coincide with pure GR, and the predicted perihelion advance, for example, violates experiment. A model theory by Visser [401 *] attempts to circumvent the vDVZ problem by introducing a non-dynamical flat-background metric. This theory is truly continuous with GR in the limit of vanishing graviton mass; on the other hand, its observational implications have been only partially explored. Braneworld scenarios predict a tower or a continuum of massive gravitons, and may avoid the vDVZ discontinuity, although the full details are still a work in progress [125 , 96 ]. Attempts to avert the vDVZ problem involve treating non-linear aspects of the theory at the fundamental level; many models incorporate a second tensor field in addition to the metric. For recent reviews, see [188 , 123 ], and a focus issue in Vol. 30, Number 18 of Classical and Quantum Gravity.

	Abstract
1	Introduction
2	Tests of the Foundations of Gravitation Theory
	2.1	The Einstein equivalence principle
	2.2	Theoretical frameworks for analyzing EEP
	2.3	EEP, particle physics, and the search for new interactions
3	Metric Theories of Gravity and the PPN Formalism
	3.1	Metric theories of gravity and the strong equivalence principle
	3.2	The parametrized post-Newtonian formalism
	3.3	Competing theories of gravity
4	Tests of Post-Newtonian Gravity
	4.1	Tests of the parameter $γ$
	4.2	The perihelion shift of Mercury
	4.3	Tests of the strong equivalence principle
	4.4	Other tests of post-Newtonian gravity
	4.5	Prospects for improved PPN parameter values
5	Strong Gravity and Gravitational Waves: Tests for the 21st Century
	5.1	Strong-field systems in general relativity
	5.2	Motion and gravitational radiation in general relativity: A history
	5.3	Compact binary systems in general relativity
	5.4	Compact binary systems in scalar–tensor theories
6	Stellar System Tests of Gravitational Theory
	6.1	The binary pulsar and general relativity
	6.2	A zoo of binary pulsars
	6.3	Binary pulsars and alternative theories
	6.4	Binary pulsars and scalar–tensor gravity
7	Gravitational-Wave Tests of Gravitational Theory
	7.1	Gravitational-wave observatories
	7.2	Gravitational-wave amplitude and polarization
	7.3	Gravitational-wave phase evolution
	7.4	Speed of gravitational waves
8	Astrophysical and Cosmological Tests
9	Conclusions
	Acknowledgments
	References
	Footnotes
	Updates
	Figures
	Tables