-
PDF
- Split View
-
Views
-
Cite
Cite
X Hernandez, Internal kinematics of Gaia DR3 wide binaries: anomalous behaviour in the low acceleration regime, Monthly Notices of the Royal Astronomical Society, Volume 525, Issue 1, October 2023, Pages 1401–1415, https://doi.org/10.1093/mnras/stad2306
- Share Icon Share
ABSTRACT
The Gaia eDR3 catalogue has recently been used to study statistically the internal kinematics of wide binary populations using relative velocities of the two component stars, ΔV, total binary masses, mB, and separations, s. For s ≳ 0.01 pc, these binaries probe the low-acceleration a ≲ 2a0 regime where gravitational anomalies usually attributed to dark matter are observed in the flat rotation curves of spiral galaxies, where a0 ≈ 1.2 × 10−10 m s−2 is the acceleration scale of MOND. Such experiments test the degree of generality of these anomalies, by exploring the same acceleration regime using independent astronomical systems of vastly smaller mass and size. A signal above Newtonian expectations has been observed when a ≲ 2a0, alternatively interpreted as evidence of a modification of gravity, or as due to kinematic contaminants; undetected stellar components, unbound encounters, or spurious projection effects. Here I take advantage of the enhanced DR3 Gaia catalogue to perform a more rigorous study of the internal kinematics of wide binaries than what has previously been possible. Internally determined Gaia stellar masses and estimates of binary probabilities for each star using spectroscopic information, together with a larger sample of radial velocities, allow for a significant improvement in the analysis and careful exclusion of possible kinematic contaminants. Resulting ΔV scalings accurately tracing Newtonian expectations for the high acceleration regime, but markedly inconsistent with these expectations in the low acceleration one, are obtained. A non-Newtonian low acceleration phenomenology is thus confirmed.
1 INTRODUCTION
The presence of gravitational anomalies in the low acceleration regime of a < a0 ≈ 1.2 × 10−10 m s−2 at galactic scales has been alternatively interpreted as evidence for a dominant dark matter component of unknown origin and as yet lacking any direct confirmation, or as indicating a change of regime in the structure of gravity, generically termed modified gravity (e.g. Milgrom 1983), or even a validity limit for Newton’s second law, modified inertia proposals, for example Milgrom (1994), Milgrom (2022). Empirically, to first order such gravitationally anomalous regime is characterized by flat rotation curves at an amplitude satisfying the baryonic Tully–Fisher relation, VTF = (GMa0)1/4 = 0.35(M/M⊙)1/4 km s−1, where M refers to the total baryonic mass of a galaxy, McGaugh et al. (2000) and Lelli et al. (2017).
Deciding between the alternative interpretations could benefit from exploring the low acceleration regime in different astronomical contexts, to obtain evidence as to the generality, or lack thereof, of the gravitational anomalies present in the outskirts of spiral galaxies. Steps in this direction have been taken by studies probing the possible presence of Tully–Fisher phenomenology in pressure supported galactic systems by Jiménez et al. (2013), Durazo et al. (2018) and Chae et al. (2020), who find asymptotic velocity dispersion amplitudes also scaling with the fourth root of total baryonic masses. Extensions towards dwarf galaxies have also been explored, for example McGaugh et al. (2021). In going to smaller pressure supported systems, for example Scarpa, Marconi & Gilmozzi (2003), Hernandez & Jiménez (2012) and Hernandez & Lara-D I (2020) show asymptotically flat velocity dispersion profiles of galactic globular clusters also presenting the same empirical scalings with total baryonic mass of the baryonic Tully–Fisher relation, albeit the results of Claydon, Gieles & Zocchi (2017), who present an explanation for the observed Globular Cluster phenomenology within a standard gravitational framework.
As first identified in Hernandez, Jiménez & Allen (2012), large samples of wide binaries with internal separations larger than about 0.01 pc offer a probe of the low acceleration a ≲ 2a0 regime at mass and length scales many orders of magnitude below those of spiral galaxies, and even below the ones mentioned earlier, where any dark matter contribution would be negligible. Even at the widest separations considered here, of 0.06 pc, the local dark matter density inferred under a Newtonian framework of |$0.01 \, \mathrm{M}_{\odot }$| pc−3 (e.g. Read 2014), would only imply about |$1\times 10^{-5} \, \mathrm{M}_{\odot }$| of dark matter within the wide binary orbit, and hence only a contribution of order one part in 105 of the total mass of the system, with individual stellar masses not far from |$0.8 \, \mathrm{M}_{\odot }$|.
The internal kinematics of wide binary samples as a test of gravity in the low acceleration regime have more recently been considered by: Scarpa et al. (2017), Banik & Zhao (2018), Pittordis & Sutherland (2018), Hernandez et al. (2019a), Banik (2019), Pittordis & Sutherland (2019), Acedo (2020), Hernandez, Cookson & Cortés (2022), Manchanda, Sutherland & Pittordis (2023) and Pittordis & Sutherland (2023). The presence of anomalous internal velocities in wide binaries is now well established, with various interpretations having been proposed. In Hernandez et al. (2022) we used Gaia eDR3 to show that on reaching the low acceleration regime of sufficiently separated binaries, a qualitative regime change appears where the binned rms projected relative velocity for wide binary populations ceases to drop along Keplerian expectations, and settles to values consistent with the baryonic Tully–Fisher scaling of spiral galaxies, to argue in favour of a change in regime for the physics describing the problem.
Alternatively, Clarke (2020) simulating wide binary populations and Pittordis et al. (2023) using Gaia eDR3 data, show that introducing a hypothetical population of hidden tertiaries, cases where one or both components of an observed binary harbour an undetected stellar companion, results in a kinematic contaminant which if chosen judiciously can explain the observations within a Newtonian framework. This population of hidden tertiaries hence becomes a prediction of Newtonian gravity, which fortunately has recently been shown to lie within the reach of independent confirmation through dedicated follow-up studies using a variety of readily available techniques, Manchanda et al. (2023).
In this paper I explore from an empirical approach, as an extension of our previous study of Hernandez et al. (2022), the kinematics of solar neighbourhood wide binaries and the mass-velocity scalings these present, taking advantage for the first time in this context of the recent Gaia DR3 catalogue. This latest data release benefits from direct mass estimates using spectroscopic information and the Gaia work package FLAME for a sub-set of stellar sources, an internally assessed binary probability for each star, the CLASSPROB_DSC_COMBMOD_BINARYSTAR parameter, an inference using spectral, photometric, and astrometric information, henceforth BP, as well about 4.7 × more stars containing velocities along the line of sight. All of the above improvements on Gaia eDR3 permit a more accurate probing of the problem, with an enlarged sample of stars having radial velocities, useful to exclude unbound flyby events, and more accurate elimination of kinematic contaminants, for example, by imposing cuts in the BP parameter of a sample, as well as allowing for an accurate calibration of the luminosity-mass scalings used in all the studies mentioned earlier to infer individual stellar masses.
The structure of the paper is as follows: Section 2 describes the sample selection, driven by the philosophy of defining a small but very high quality sample where the consistency or otherwise of wide binary internal kinematics with Newtonian expectations might be assessed. Section 3 shows the calibration of a simple mass–luminosity relation to the high quality spectroscopically determined Gaia DR3 individual masses, which is then used throughout. Section 4 presents results for two samples having different distance cuts so as to obtain a measure of distance dependent effects. Section 5 gives a comparison to recent independent work, and section 6 a final discussion. Additional consistency checks appear in the appendix.
2 SAMPLE SELECTION
The sample selection used is a modified version of the Gaia one used by El-Badry & Rix (2018) who present and test a wide binary catalogue with a distance cut of D < 200 pc. Accounting for projection effects and using simulations modelling reasonable distributions of ellipticities and undetected companions to estimate completion factors, these authors report a level of contamination of below |$0.2 {{\ \rm per\ cent}}$|. Based on this same approach Tian et al. (2020) produced a lower quality but more extensive catalogue with a distance cut of D < 4 kpc containing 800 000 binary candidates.
I begin with a Gaia search returning all stars within 333 pc with accurate parallaxes and G magnitudes having a signal-to-noise ratio of (S/N)ϖ > 100 and a signal-to-noise ratio in the Gaia G band of (S/N)G > 5. I then search within a projected 0.5 pc radius on the plane of the sky about each star for a potential binary companion. Any resulting pair is then accepted as a binary candidate, provided the difference in distance along the line of sight between both stars, ΔD, is smaller than twice the projected separation between both stars, 2s, at a 3σ level, that is, that ΔD − 3σΔD < 2s < ΔD + 3σΔD. A 192 healpix scheme is used, to limit the number of lost binaries where component stars lie in adjacent healpix.
As noticed in El-Badry & Rix (2018), the fixed Gaia resolution implies that as the distance cut of the sample increases, a growing fraction of close binaries will be lost. Hence, the sample will have a distance-dependent completion factor. This is not a concern, as the objective is not to ensure valid candidates are all included with a fixed probability, but that unsound binary candidates are excluded. I modify the original El-Badry & Rix (2018) selection criteria, to remove the condition that the relative velocity between the two components of a binary system should be within Newtonian expectations, as it is precisely the validity of this assumption that is being probed.
The next step is to aggressively eliminate any binary candidates which might be under the gravitational influence of a third star, or indeed be part of a tertiary system. I search for any stars which are members of more than one candidate binary, and remove all such binary candidates. Thus, I do not try to decide which binary a given star flagged as a member of two or more binary candidate systems belongs to, but eliminate all candidate binaries which contain individual stars forming part of more than one such system. Given the parallax signal-to-noise ratio of the sample, for a binary system at an average distance for the final binaries used of ≈100 pc, by construction, no other Gaia sources remain within 1 pc along the line of sight at the 1σ level. For binary systems in the critical range of internal separations of ≈0.03 pc (see Section 4), this implies an isolation factor along the line of sight of about 30 × the internal binary separation. On the plane of the sky, the initial search criteria ensures an isolation factor of 5/3 × larger than along the line of sight at a 1σ level, ensuring the final selected binaries are indeed free from kinematic contamination from other Gaia sources, to a very large degree. Of the close to 10 million binary candidates originally identified, the above de-grouping algorithm leaves only 97 251 binary pairs.
Further, I apply a series of data quality cuts to each of the stars of each candidate binary system, and remove any such system where either of the constituent stars fails the cut. First, RP, G, and BP magnitude signal-to-noise cuts of >20, which given the strong correlations between (S/N)ϖ and magnitude signal-to-noise levels, actually excludes very few binary candidates, but ensures no suspicious sources are being considered. As discussed in the introduction, a primary concern is the possible kinematic contamination from hidden tertiaries, which is addressed through a variety of quality cuts. The first is the use of the Gaia internal binary probability field, BP, which provides a first order estimate of the probability that each individual Gaia source is actually a binary star, which would then make any of our binary candidate systems a hidden tertiary. I impose a BP < 0.2 quality cut. Also, it has been shown, for exampleBelokurov et al. (2020), that the probability of a Gaia source being an unresolved binary is a strong function of the RUWE quality index of the source, and actually identify a threshold of RUWE < 1.4, at distances of 1 kpc, below which hidden tertiaries can be reliably excluded using Gaia DR2 data. Since the DR3 data used reflect a 34 month timeline, rather than the 22 month one of DR2, and since our 200 pc distance cut-off is much smaller than the 1 kpc which Belokurov et al. (2020) consider, imposing a stringent RUWE < 1.2 limit will guarantee a sample relatively free of hidden tertiaries. Again, the two quality cuts mentioned earlier are applied to all stars, with any candidate binary system being excluded if either of its stars fails any of the cuts.
Then, I require all stars to have a reported radial velocity measurement in the DR3 catalogue. This cut serves three purposes:
as the probability of having a radial velocity measurement in the Gaia catalogue drops rapidly as the single star solution degrades, requiring radial velocities serves as a further quality control against hidden tertiaries.
Having radial velocities allows to consider full spherical geometric corrections (and perspective effects) when deriving the relative velocity between both components, Smart (1968). This last correction is only relevant to a few very close and very wide binaries, indeed, El-Badry (2019) shows that ignoring this correction will result in spuriously elevated relative velocities for wide binaries, but only for internal separations above about 0.1 pc. For my sample such effects are very minor, but are included for completion.
Radial velocities allow a filter on projection and unbound flyby systems, which we cut by requiring all our final binary candidate systems have individual stars showing a radial velocity difference below 4 km s−1.
The battery of quality and hidden tertiary cuts described earlier culls the original 97 251 binary candidates down to a small and highly curated sample, which for a distance cut of D < 125 pc leaves only 1352 binary pairs. These are shown in a sky plot in Fig. 1 as black dots, where grey dots show the original 97 251 binary candidates. The thorough de-grouping strategy described already removes all known nearby clusters and groups from the original binary candidate list, the strict series of exclusion cuts that then follow leave a small sample of highly isolated wide binaries not showing any evidence of unwanted groups or clusters.

The grey dots show a sky plot of 95 899 binary pairs with (D/pc) < 333, (S/N)ϖ > 100, and (S/N)G > 5 in each star, where all binary candidates having stars in common have already been discarded. The 1352 binary pairs within (D/pc) < 125 and further satisfying RUWE < 1.2, Bp < 0.2, ΔVLOS < 4 km s–1 and RP, G, BP, and proper motion signal-to-noise values >20 are shown as black dots. Notice no conspicuous groupings or local clusters remain after the aggressive de-grouping procedure used.
As can be seen in Fig. 1, explicitly excluding the low galactic latitudes is unnecessary given the aggressive de-grouping strategy adopted, as is the specific removal of local known groups. Indeed, the de-grouping implemented leaves no local over-densities which might correlate with either known local groups or with the galactic plane. The use of Gaia data restricted to distances below 125 pc with very high quality parallaxes makes this cut unnecessary, within that distance the data show no overcrowding. For the high quality D < 125 pc sample from which conclusions are drawn, the final mean signal-to-noise value for the parallaxes of the stars is 855.4. Even at the outer limit of this sample, this implies a distance uncertainty of only 0.15 pc, which ensures crowding against background sources is not an issue. A second safeguard against the inclusion of spurious binary pairs in overcrowded regions comes from the relative radial velocity cut introduced, which efficiently eliminates most projection effects. Indeed, Fig. 1 was included as a check of this point, the sky plot of binaries remaining after only cases where both components have a well-determined radial velocity, and a relative value in this quantity of <4 km s−1, does not show any conspicuous overdensities, not even along the galactic plane, which does remain evident in the binary candidates before the use of the relative radial velocity cut.
Finally, we take advantage of a CMD selection strategy to further reduce the probability of hidden tertiary systems remaining in the high-quality wide binary samples used. The left panel of Fig. 2 shows a CMD of all the stars in the original 97 251 candidate binaries as grey dots. The stars in the 1352 sample appear as black dots, already showing a much more well-defined main sequence than the original sample. A small number of photometric binaries remain above this well-defined main sequence.

Left(a): CMD for the stars in the 95 899 binaries mentioned in Fig. 1, shown as grey dots. The stars for the 1352 binary pairs shown as black dots in Fig. 1 appear here as black dots. Right(b): CMD for the stars in the 688 binary pairs within the colour magnitude region selected to eliminate photometric binaries as member stars of final selected binaries, and minimize the inclusion of hidden tertiaries in the kinematic samples.
We now impose a further cut defined on the CMD, following again the conclusions of Belokurov et al. (2020), who show that unresolved binaries can be further avoided by remaining within a region of the CMD below the turn-off and excluding the less massive and hence dimmer lower tail of the main sequence. We hence exclude all binary candidates containing one or two stars lying outside of a region having a G magnitude width of 0.6 magnitudes about the line connecting points (0.7, 4.7) and (2.2, 9.7) in the (colour, absolute magnitude) plane shown. This last cut leaves only 688 binary pairs, shown in the right panel of Fig. 2.
The above CMD selection now implies an absolute magnitude limit of 9.7, although only a handful of stars are actually above an absolute magnitude of 9. Indeed those few stars mentioned earlier are amongst the dimmest and with the exception of only one, are excluded by the final signal to noise cuts in the kinematic plot. Thus, an absolute magnitude cut of 9 is representative of the final sample. This then becomes a distance-dependant apparent magnitude cut, which for the mean distance of the final high-quality cut of 90.25 pc becomes 18.77. For the outer limit of this sample, the corresponding value is 19.5.
When comparing the present final high-quality sample to the final sample in our previous study, only 40|${{\ \rm per\ cent}}$| of the stars in Hernandez et al. (2022) appear in our present sample. This is due to the significant increase in Gaia sources having reported radial velocities in going from eDR3 to DR3, of a factor 4.7. Keeping a sample of comparable size to the one of our previous study now allows stricter quality cuts. In Hernandez et al. (2022), no binary probability cut as internally determined by Gaia was applied, as the Bp parameter was not available in eDR3. As discussed earlier, the present sample includes a Bp < 0.2 cut, with the final sample mean value of this internally determined Gaia binarity probability per source, which uses astrometric, photometric and spectroscopic information in ways complementary to the RUWE filter and the radial velocity determinations, of <BP > =0.07.
To summarize the strategy applied towards the removal of hidden tertiaries, the first cut is through the use of a limit in the Gaia DR3 internally determined binary probability parameter of each source, the Bp < 0.2 constraint described earlier. The second is through the use of the RUWE<1.2 constraint, this ensures sources where the single star photometric solution is poor will not be included. Then, keeping only binaries where radial velocities are reported ensures again a high quality spectroscopic single star solution resulted for the star in question, which is easily degraded by a hidden tertiary. Finally, following Belokurov et al. (2020), a careful CMD selection strategy is applied. This allows not only the exclusion of clear photometric binaries, but also limits consideration to regions of the CMD where errors and photometric blending are minimized. Indeed, through careful statistical re-sampling of mock observations of simulated hidden tertiaries and then ’observed’ assuming actual Gaia sensitivity parameters, the above authors estimate only a 5 per cent hidden tertiary contamination, down to Jupiter scales, when taking a RUWE<1.4, plus the restriction to a well defined region of the CMD, out to 1 kpc, using DR2. Necessarily, the same CMD criterion as applied here, in conjunction with a RUWE<1.2 cut and distances of <125 pc using the longer base line DR3 catalogue, will be highly clear of hidden tertiaries.
Hence it is astrometry, photometry, and spectroscopy, through four different and independent combinations of constraints that are used to eliminate hidden tertiaries. All cuts are applied sequentially to both components of each candidate binary, with all such candidates where even one of the components fails the cut being removed.
Before presenting the relative velocity versus internal separation and relative velocity versus mass scalings of our final samples, the following section details the mass estimates used.
3 ADDRESSING MASS ESTIMATE BIASES
Determining if the internal kinematics of a sample of wide binaries is consistent with Newtonian gravity, clearly requires estimating the masses of each star in each of the binaries being studied (e.g. Banik & Zhao 2018; Pittordis & Sutherland 2023). In the context of large Gaia samples of solar neighbourhood wide binaries, this mass estimate has so far been approached in terms of simple magnitude-mass scalings, such as
expected to be reasonably accurate and unbiased for the old low mass stars of the solar neighbourhood comprising the relevant wide binaries, for example Pittordis & Sutherland (2019), Hernandez et al. (2022). However, the availability of an internally determined Gaia FLAME work package mass estimate making use of spectroscopic information in DR3, allows for a much more accurate update in the masses of the binaries analysed. Unfortunately, the more demanding observations required for Gaia spectroscopic mass determinations imply that this parameter is available only for a fraction of even nearby stars. If we restrict ourselves to wide binaries where both components have a Gaia DR3 mass determination, we would have to eliminate all candidates where no such information is available, and also those cases where Gaia DR3 masses are available for only one of the two members of a binary. Therefore, supplementing internally determined spectroscopic Gaia DR3 masses with estimates from equation (1), in cases where the former is not available, is desirable.
The first test of this scheme is to check for the presence of any biases or inconsistencies when comparing the results of equation (1) to Gaia DR3 masses. I begin with a sample as described in the previous section, taking a distance cut of D < 135 pc, and changing only the RUWE and BP cuts to <2.0 and <0.4. This relaxation of the quality controls only at this point, serves to increase the sample so as to obtain a more complete comparison of the two mass estimates mentioned earlier. This sample yields 3696 binary systems.
The left panel in Fig. 3 shows the individual stars of the sample just described which have Gaia DR3 masses, in a plot comparing this quantity to the result of equation (1) for these same stars, in the mass range relevant for the wide binaries of interest. The grey dots show stars which are members of a binary system where the other star does not have a Gaia DR3 mass determination, and the black dots cases where the other component of the binary system also has a Gaia DR3 mass available. As can be seen, a significant fraction of binaries do not have Gaia DR3 masses for both components, indeed, in the sample shown only |$24{{\ \rm per\ cent}}$| of the stars have Gaia DR3 masses available for both components. Thus, given the small available high quality samples, it is important to supplement Gaia DR3 mass determinations with a reliable estimate inferring masses from Gaia DR3 quantities available for a larger fraction of stars. Unfortunately, we see from the left panel in Fig. 3 that the masses inferred using equation (1) are not an unbiased estimate of the Gaia DR3 masses. For masses below about |$0.7 \, \mathrm{M}_{\odot }$| a small systematic appears such that the inferences from equation (1) are on average |$10{{\ \rm per\ cent}}$| smaller than the more accurate Gaia DR3 ones which use more detailed spectroscopic information.

Left(a): inferred stellar masses using equation (1) compared to internally provided spectroscopic Gaia DR3 masses. A small systematic offset is clearly apparent in the low mass range. Right(b): inferred stellar masses using equation (1) compared to internally provided spectroscopic Gaia DR3 masses, after a |$0.05 \, \mathrm{M}_{\odot }$| correction factor was added to the result of equation (1), for masses obtained initially below |$0.7 \, \mathrm{M}_{\odot }$|. No substantial systematic offset remains at this point in the mass range shown, where the stars comprising the binaries in the kinematic samples used are found. In both panels grey dots indicate cases where the star in question is part of a binary where the other star has no Gaia DR3 mass reported, while the black dots indicate cases where the star shown is part of a binary where the other star also has a reported Gaia DR3 mass.
For masses |$\gt 0.7 \, \mathrm{M}_{\odot }$|, mass estimates from equation (1) and Gaia DR3 agree much better and show almost no systematic biases. However, stars with masses in the range where both inferences diverge form a large fraction of the constituent members of the relevant binaries, for example Hernandez et al. (2022) report a mean binary mass of |$m_{B}=1.6 \, \mathrm{M}_{\odot }$| for their Gaia eDR3 sample, and in the study by Pittordis & Sutherland (2019), inferred mass distributions for individual stars peak for |$0.4 \, \mathrm{M}_{\odot }\lt m_{\star }\lt 0.5 \, \mathrm{M}_{\odot }$|. Indeed, attempting to test particular MOND models requires masses determined to close to |$10 {{\ \rm per\ cent}}$| accuracy (Banik & Zhao 2018), and crucially, no systematic biases in this quantity, making the need of a non-biased mass estimate for all stars involved an important requirement in the context of wide binary gravity tests.
To this end we perform a simple correction on equation (1), adding |$0.05 \, \mathrm{M}_{\odot }$| to the mass of all stars which equation (1) returns as being below |$0.7 \, \mathrm{M}_{\odot }$|. The result of this adjustment is shown in the right panel of Fig. 3, where it is clear that both mass estimates are now in much better agreement and no substantial systematic bias remains. A more detailed adjustment is of course possible, but will only yield much smaller corrections, which in any case will eventually be rendered unnecessary as the fraction of accurate Gaia spectroscopically determined masses grows with time. For the remainder of this paper, I shall use Gaia DR3 masses when available, supplemented by the use of equation (1), with the inclusion of the correction described, in cases where no Gaia DR3 masses exist for any particular star. This is in effect equivalent to the more detailed higher order mass-magnitude scalings used by other authors, for example Chae (2023).
4 RESULTS
Starting from the distance cut described at the end of section 2 with D < 125 pc leaves 688 binary pairs, after the CMD cut criteria presented there. For each of these binaries, parallaxes, positions, and proper motion Gaia DR3 parameters are used to evaluate the relative velocities on the plane of the sky in RA and Dec., (ΔV)RA and (ΔV)Dec, including spherical geometric effects. We consider only velocity differences on the plane of the sky for the kinematic tests performed because these are more robust against contamination from the presence of hidden tertiaries than velocities along the line of sight. Velocities on the plane of the sky are being inferred through proper motions, while along the line of sight, velocities are derived through the Doppler effect. This is important since a hidden tertiary of short period could result in an added velocity component, which if along the line of sight, will directly result in an equal kinematic contaminant on the radial velocity, regardless of the amplitude of the stellar oscillation. However, on the plane of the sky, the same velocity contaminant, for a very tight hidden tertiary having a small internal semimajor axis, will not result in any kinematic contamination if the resulting oscillation is of small spatial amplitude. Distortions of small spatial amplitude (even those of high velocity) will to a large degree be averaged out over the 34 month timeline of the DR3 catalogue and produce negligible effects on plane of the sky velocities, whenever the inner orbital periods are significantly shorter than the integration period of the catalogue.
Also, restricting the relative velocity analysis to the plane of the sky (as was also done and clearly stated in Hernandez et al. 2022), makes this study robust to general relativistic gravitational redshift effects distorting relative velocities along the line of sight (e.g. El-Badry 2022), the opposite of what was mistakenly claimed by Loeb (2022). Our kinematic ΔV determinations include only relative motions on the plane of the sky determined primarily through Gaia proper motions, positions and parallaxes; radial velocities play only a very minor part, exclusively in the small perspective corrections, which as shown by El-Badry (2019), are second order for separations below 0.1 pc.
Once the (ΔV)RA and (ΔV)Dec parameters are obtained for the binaries in question, two final quality cuts are introduced. The first is the exclusion of low-signal-to-noise cases, I keep only binaries where both ΔV measurements satisfy (ΔV/σΔV) > 1.5. Lastly, I evaluate the final average binary probability Gaia parameter for the sample, <BP >, and remove half of that number as a percentage of the highest ΔV systems, uniformly distributed along the separation, s, interval considered. The logic for this last cut is that if say, <BP > =0.07, it is possible that |$7 {{\ \rm per\ cent}}$| of our remaining systems harbour a hidden tertiary. Some of those will have negligible effects on the resulting ΔV measurements of the binaries, when the hidden tertiary velocity perturbation lies primarily along the line of sight, but in some cases this kinematic contamination will occur substantially on the plane of the sky and hence distort the intended test. There is no guarantee that these last cases will be those showing the largest ΔV values across our sample, but as a measure to ensure the least possible effect of any degree of contamination from hidden tertiaries, the fastest |$3{{\ \rm per\ cent}}$| of cases per separation bin are eliminated across the entire s range covered.
The two final quality cuts mentioned earlier leave a small and very pure sample of 450 highly isolated wide binaries, a scatter plot of the (ΔV)RA and (ΔV)Dec values obtained is presented in the left panel of Fig. 4, as a function of the internal separation on the plane of the sky for each binary. In essence, each binary has been observed along two perpendicular directions, yielding two 1D ΔV observations which are independent from the point of view of projection effects. The robustness of the results obtained to this assumption, also included in all previous published wide binary papers by this group, is tested and confirmed in the appendix. The final average distance to these stars is 90.25 pc, with the selection criteria used resulting in very high average signal-to-noise for relative ΔV, stellar proper motion and parallax of 15.71, 3442, and 855.4, respectively, and average RUWE and BP values of 1.01 and 0.07 respectively, as detailed in Table 1.

Left(a): kinematic plot showing 1D ΔV values on the plane of the sky, dots, for binaries in the (D/pc) < 125 pc sample as a function of the internal separation of each binary in pc, s, RA and Dec. values appearing separately for each binary. The points with error bars give the binned rms values for this same quantity, with the horizontal bars giving the bin size and the vertical ones 1σ confidence intervals on the quantity being plotted, dashed and dot-dashed bins for RA and Dec. observations, respectively. The diagonal line gives the prediction for the binned rms 1D ΔV values on the plane of the sky for solar neighbourhood wide binaries from Jiang & Tremaine (2010), and the vertical one shows the MOND radius for the mean total mass of the binaries shown. Right(b): same as the left panel, but for the 125 < (D/pc) < 170 sample, see text and Table 1 for all other signal-to-noise, RUWE, Bp and CMD diagram selection cuts, which were identically applied to both distance samples shown in this figure.
Distance . | Initial number . | Numbers after . | Numbers clearing . | <D/pc > . | <S/N > ΔV . | <S/N > pm . | <S/N > ϖ . | <RUWE > . | <BP > . |
---|---|---|---|---|---|---|---|---|---|
selection . | of binaries . | CMD cut . | (ΔV/σΔV) > 1.5, . | . | . | . | . | . | . |
. | . | . | <top |$3\% \Delta V$| . | . | . | . | . | . | . |
(D/pc) < 125 | 1352 | 688 | 450 | 90.25 | 15.7 | 3442 | 855.4 | 1.01 | 0.07 |
125 < (D/pc) < 170 | 1562 | 914 | 450 | 147.6 | 7.5 | 1798 | 474.9 | 1.01 | 0.06 |
Distance . | Initial number . | Numbers after . | Numbers clearing . | <D/pc > . | <S/N > ΔV . | <S/N > pm . | <S/N > ϖ . | <RUWE > . | <BP > . |
---|---|---|---|---|---|---|---|---|---|
selection . | of binaries . | CMD cut . | (ΔV/σΔV) > 1.5, . | . | . | . | . | . | . |
. | . | . | <top |$3\% \Delta V$| . | . | . | . | . | . | . |
(D/pc) < 125 | 1352 | 688 | 450 | 90.25 | 15.7 | 3442 | 855.4 | 1.01 | 0.07 |
125 < (D/pc) < 170 | 1562 | 914 | 450 | 147.6 | 7.5 | 1798 | 474.9 | 1.01 | 0.06 |
Notes. For the two distance cuts described in the text, (D/pc) < 125 and 125 < (D/pc) < 170, the first three entries of the table give the initial number of (S/N)ϖ > 100, RUWE < 1.2, BP < 0.2 binaries after de-grouping, the remaining binaries after application of the CMD filtering procedure, and the final number of binaries in the kinematic plots after removal of (ΔV/σΔV) < 1.5 cases, and the exclusion of the |$3\%$| fastest binaries per bin. The following two entries show the final average distance in pc and ΔV signal-to-noise ratio, averaged over RA and Dec., for the binaries in the final samples. The last four entries give the average signal-to-noise values in proper motions (averaged over RA and Dec.) and parallax, and final mean RUWE and Bp values for all the individual stars included in the kinematic plots of figure 4.
Distance . | Initial number . | Numbers after . | Numbers clearing . | <D/pc > . | <S/N > ΔV . | <S/N > pm . | <S/N > ϖ . | <RUWE > . | <BP > . |
---|---|---|---|---|---|---|---|---|---|
selection . | of binaries . | CMD cut . | (ΔV/σΔV) > 1.5, . | . | . | . | . | . | . |
. | . | . | <top |$3\% \Delta V$| . | . | . | . | . | . | . |
(D/pc) < 125 | 1352 | 688 | 450 | 90.25 | 15.7 | 3442 | 855.4 | 1.01 | 0.07 |
125 < (D/pc) < 170 | 1562 | 914 | 450 | 147.6 | 7.5 | 1798 | 474.9 | 1.01 | 0.06 |
Distance . | Initial number . | Numbers after . | Numbers clearing . | <D/pc > . | <S/N > ΔV . | <S/N > pm . | <S/N > ϖ . | <RUWE > . | <BP > . |
---|---|---|---|---|---|---|---|---|---|
selection . | of binaries . | CMD cut . | (ΔV/σΔV) > 1.5, . | . | . | . | . | . | . |
. | . | . | <top |$3\% \Delta V$| . | . | . | . | . | . | . |
(D/pc) < 125 | 1352 | 688 | 450 | 90.25 | 15.7 | 3442 | 855.4 | 1.01 | 0.07 |
125 < (D/pc) < 170 | 1562 | 914 | 450 | 147.6 | 7.5 | 1798 | 474.9 | 1.01 | 0.06 |
Notes. For the two distance cuts described in the text, (D/pc) < 125 and 125 < (D/pc) < 170, the first three entries of the table give the initial number of (S/N)ϖ > 100, RUWE < 1.2, BP < 0.2 binaries after de-grouping, the remaining binaries after application of the CMD filtering procedure, and the final number of binaries in the kinematic plots after removal of (ΔV/σΔV) < 1.5 cases, and the exclusion of the |$3\%$| fastest binaries per bin. The following two entries show the final average distance in pc and ΔV signal-to-noise ratio, averaged over RA and Dec., for the binaries in the final samples. The last four entries give the average signal-to-noise values in proper motions (averaged over RA and Dec.) and parallax, and final mean RUWE and Bp values for all the individual stars included in the kinematic plots of figure 4.
The points with error bars give the binned rms values and corresponding 1σ confidence intervals for RA and Dec. measurements, dashed and dot-dashed cases, respectively, where the horizontal lines are just the bin sizes and the figures above the bins give the number of data points per bin. This can be compared to detailed Newtonian expectations for 1D relative velocities between components of wide binaries in the solar neighbourhood through the work of Jiang & Tremaine (2010), henceforth JT10. These authors simulate populations of 50 000 wide binaries in the solar neighbourhood, evolved over a 10 Gyr period under the influence of galactic tides and encounters with field stars and molecular clouds, under the assumptions of Newtonian dynamics, constant total binary masses of |$m_{B}=2 \, \mathrm{M}_{\odot }$| and expected distributions of ellipticities and isotropic orientations with respect to a simulated observational line of sight. The final rms value of the 1D ΔV measurements was then reported, and is shown by the solid line in the left panel of Fig. 4. In the s range given we see essentially a Keplerian ΔV ∝ s−1/2 scaling, as we remain well within the ≈1.7 pc Jacobi radius of the problem.
As explained earlier, an important aspect of the wide binary test is to consider projection effects and a distribution of ellipticities, even within the Newtonian region. I have not performed any dynamical modelling, neither within a Newtonian model nor within any modified gravity one. The main intent is only to test the validity of a full Newtonian model. To this end, the consequences of projection effects and of a distribution of ellipticities under a Newtonian scheme were taken into account through a detailed comparison to the results of JT10. In that paper the authors simulate large populations of wide binaries, and provide detailed predictions for the 1D projected rms values of the resulting relative velocities as a function of projected separations. It is for this reason that those are precisely the quantities calculated here, as it is only those quantities which can be compared in detail to the predictions of JT10. A further important point to note is that JT10 do not present initial instantaneous predictions for the rms value of binned 1D relative velocities as a function of projected separations, but those quantities after 10 Gyr of dynamical evolution in the tidal field of the solar neighbourhood and under the dynamical influence of gravitational interaction of the wide binary populations treated with field stars. This is important, as the results of this evolution alter the final distributions of ellipticities away from the initial assumed ones. One should not compare against birth distributions of ellipticities, e, but against the expected present day ones. To this effect, note that JT10 assume an initial uniform distribution of e2 values between 0 and 1, which is known as a thermal distribution of ellipticities and is justified in terms of thermodynamical expectations in the sampling of the energy distribution of initial wide binary parameters, e.g. Kroupa (2008). However, recent direct inferences of e distributions by Hwang, Ting & Zakamska (2022) using Gaia data of local wide binaries, show that the present-day distribution of ellipticities becomes super-thermal in going to the s > 5 × 10−3pc region, precisely the region of relevance to the wide binary test.
We see that in the separation range s < 0.01 pc (2000 au), results closely follow the predictions of JT10, confirming their Newtonian expectations for the rms values of 1D ΔV. Our inferred values lie slightly below the Newtonian predictions, in consistence with the final sample |$\lt m_{B}\gt =1.56 \, \mathrm{M}_{\odot }$| being a factor of 0.78 below the assumed |$m_{B}=2 \, \mathrm{M}_{\odot }$| of JT10. Indeed, our results are below their prediction by a factor of very close to 0.781/2 = 0.88.
The situation is qualitatively different in the s > 0.01 pc region, where the rms Gaia DR3 inferred ΔV values cease to follow the Newtonian ΔV ∝ s−1/2 scalings and settle to a constant value with distance. It is interesting that this asymptotic value of slightly below 0.4 km s−1 closely agrees with the spiral galaxy baryonic Tully–Fisher velocity scaling of VTF = (GMa0)1/4 = 0.35(M/M⊙)1/4 km s−1, which when evaluated at a total baryonic mass of |$1.56 \, \mathrm{M}_{\odot }$| yields 0.39 km s−1.
An indicative scale at which MOND phenomena are expected for a system of total mass M, is given by the MOND radius, RM = (GM/a0)1/2, the radius at which the acceleration of a test particle in orbit about a spherically symmetric mass distribution becomes equal to a0. For the average total binary mass of the sample in Fig. 4, |$\lt m_{B}\gt =1.56 \, \mathrm{M}_{\odot }$| and |$\lt m_{B}\gt =1.59 \, \mathrm{M}_{\odot }$|, left and right panels respectively, RM = 0.042 pc (8400 au). This value is shown by the red vertical line in Fig. 4. Comparing to the 0.01 pc (2000 au) separation where we see the regime transition, would leave a factor of 4.2 to account between a form factor due to the difference in symmetry conditions between circular equilibrium orbits and the highly dynamical problem of populations of two orbiting bodies with a distribution of effective eccentricities, and the appearance of gravitational anomalies at accelerations a little above the a < a0 threshold. Indeed, such is the case in spiral galaxy dynamics, for example at the solar radius a ≈ a0 and the internal dark matter fraction required under Newtonian gravity is already about 0.5. At the solar circle radius of about 8.2 kpc, the amplitude of the Newtonian baryonic rotation curve of the galaxy is about 185 km s−1 but the actually observed rotation velocity is about 230 km s−1. Thus, gravity is about 1.5× stronger than the Newtonian expectation with only baryons, implying a dark matter to baryon ratio of about 0.5, see for exampleZhu et al. (2023).
A finer test geared towards understanding the physics behind the ΔV versus s scalings discussed earlier comes from plotting the ΔV observations against the total binary mass of each system, mB. For the sample being discussed, this appears in the left panel of Fig. 5, including only binaries within the internal separation interval s < 0.01 pc, the region consistent with a Newtonian ΔV ∝ s−1/2 scaling, shown in the left panel of Fig. 4. The dots with error bars now give the average ΔV values for binned binaries, with R.A. and Dec. treated as independent data points. The horizontal lines give the width of the bins, while the vertical ones indicate the 1σ confidence intervals on <ΔV > and the numbers above show the number of data points per bin. A linear fit to the binned <ΔV > values gives a scaling of |$\lt \Delta V\gt \propto m_{B}^{0.46 \pm 0.13}$| with a correlation parameter of r = 0.87, perfectly consistent with Newtonian expectations of |$\lt \Delta V\gt \propto m_{B}^{0.5}$|.

Left(a): log–log plot of 1D ΔV values on the plane of the sky, dots, for binaries in the (s/pc) < 0.01, (D/pc) < 125 sample as a function of total binary mass, RA and Dec. values appearing separately for each binary. Dots with error bars give the mean values of said quantity per bin, together with 1σ confidence intervals on these means. The solid line shows a linear fit to the binned means having a slope of α = 0.46 ± 0.13 and giving a correlation parameter of r = 0.87, consistent with Newtonian expectations in this high acceleration region. Right(b): same as the left panel, but for the (s/pc) < 0.01, 125 < (D/pc) < 170 sample. The solid line shows a linear fit to the binned means having a slope of α = 0.089 ± 0.21 and giving a correlation parameter of r = 0.21, barely consistent with Newtonian expectations in this high acceleration region at a 2σ level, and indicating some combination of error dominated ΔV determinations and the presence of kinematic contaminants in this more distant sample.
The corresponding plot for the s > 0.01 pc region of the left panel in Fig. 4 is given in the left panel of Fig. 6. This time the linear fit gives |$\lt \Delta V\gt \propto m_{B}^{0.26 \pm 0.32}$|. This scaling, though still consistent with Newtonian expectations at a 1σ level, is in closer agreement with a galactic baryonic Tully–Fisher scaling of |$\lt \Delta V\gt \propto m_{B}^{0.25}$|. The narrow mass interval available and the small number of points limit the precision of this last test, in spite of which a strong reduction in the best fit |$\lt \Delta V\gt \propto m_{B}^{\alpha }$| value of α is apparent on crossing the s = 0.01 pc (2000 au) binary separation threshold which divides the region consistent with Newtonian expectations in the previous ΔV versus s plot, from the flat rms relative velocity regime appearing for s > 0.01 pc in that plot. Notice that not all points considered in the fits in Figs 5 and 6 appear in the plots, given the ΔV range displayed.

Left(a): log–log plot of 1D ΔV values on the plane of the sky, dots, for binaries in the (s/pc) > 0.01, (D/pc) < 125 sample as a function of total binary mass, R.A. and Dec. values appearing separately for each binary. Dots with error bars give the mean values of said quantity per bin, together with 1σ confidence intervals on these means. The solid line shows a linear fit to the binned means having a slope of α = 0.263 ± 0.32 and giving a correlation parameter of r = 0.50, consistent in this low acceleration region with the 1/4 of Tully–Fisher kinematics, and also with the 1/2 of Newtonian expectations, at a 1σ level. Right(b): same as the left panel, but for the (s/pc) > 0.01, 125 < (D/pc) < 170 sample. The solid line shows a linear fit to the binned means having a slope of α = 0.086 ± 0.49 and giving a correlation parameter of r = 0.12, consistent with either Newtonian or Tully–Fisher expectations in this low acceleration region, but at a low significance level, given the large confidence intervals, probably indicating some combination of error dominated ΔV determinations and the presence of kinematic contaminants in this more distant sample.
All parameters of the fits discussed appear in Table 2, including the percentage of binaries used where both masses were supplied directly by Gaia DR3, and the percentage of binaries where at least one mass was provided in such a way. For the s < 0.01 pc region, the above two values are |$40{{\ \rm per\ cent}}$| and |$61{{\ \rm per\ cent}}$|, respectively, with the corresponding values for the s > 0.01 pc region being |$41{{\ \rm per\ cent}}$| and |$63{{\ \rm per\ cent}}$|. We see no significant difference between these two sets of numbers, showing no bias in the availability of Gaia DR3 mass determinations, and hence no bias in the availability of high quality data, across the two regions being compared.
. | (D/pc) < 125 . | . | 125 < (D/pc) < 170 . | . |
---|---|---|---|---|
. | (s/pc) < 0.01 . | (s/pc) > 0.01 . | (s/pc) < 0.01 . | (s/pc) > 0.01 . |
N | 349 | 87 | 321 | 113 |
α | 0.46 ± 0.13 | 0.26 ± 0.32 | 0.089 ± 0.21 | 0.086 ± 0.49 |
r | 0.87 | 0.50 | 0.21 | 0.21 |
<mB > | |$1.56 \, \mathrm{M}_{\odot }$| | |$1.56 \, \mathrm{M}_{\odot }$| | |$1.59 \, \mathrm{M}_{\odot }$| | |$1.59 \, \mathrm{M}_{\odot }$| |
|$\% GM_{2}$| | 40 | 41 | 47 | 46 |
|$\% GM_{1}$| | 61 | 63 | 67 | 64 |
. | (D/pc) < 125 . | . | 125 < (D/pc) < 170 . | . |
---|---|---|---|---|
. | (s/pc) < 0.01 . | (s/pc) > 0.01 . | (s/pc) < 0.01 . | (s/pc) > 0.01 . |
N | 349 | 87 | 321 | 113 |
α | 0.46 ± 0.13 | 0.26 ± 0.32 | 0.089 ± 0.21 | 0.086 ± 0.49 |
r | 0.87 | 0.50 | 0.21 | 0.21 |
<mB > | |$1.56 \, \mathrm{M}_{\odot }$| | |$1.56 \, \mathrm{M}_{\odot }$| | |$1.59 \, \mathrm{M}_{\odot }$| | |$1.59 \, \mathrm{M}_{\odot }$| |
|$\% GM_{2}$| | 40 | 41 | 47 | 46 |
|$\% GM_{1}$| | 61 | 63 | 67 | 64 |
Notes. The table gives the number of binaries in each of the four samples used for the fits shown in Figs 5 and 6, together with the fitted total <ΔV > versus binary mass power-law index α, the 1σ confidence interval in this quantity, and the Pearson statistical correlation between <ΔV > and total binary binned mass, r, average total binary mass in |$\, \mathrm{M}_{\odot }$| and % of binaries having both, |$\% GM_{2}$|, and at least one, |$\% GM_{1}$|, stellar masses from internally supplied Gaia data.
. | (D/pc) < 125 . | . | 125 < (D/pc) < 170 . | . |
---|---|---|---|---|
. | (s/pc) < 0.01 . | (s/pc) > 0.01 . | (s/pc) < 0.01 . | (s/pc) > 0.01 . |
N | 349 | 87 | 321 | 113 |
α | 0.46 ± 0.13 | 0.26 ± 0.32 | 0.089 ± 0.21 | 0.086 ± 0.49 |
r | 0.87 | 0.50 | 0.21 | 0.21 |
<mB > | |$1.56 \, \mathrm{M}_{\odot }$| | |$1.56 \, \mathrm{M}_{\odot }$| | |$1.59 \, \mathrm{M}_{\odot }$| | |$1.59 \, \mathrm{M}_{\odot }$| |
|$\% GM_{2}$| | 40 | 41 | 47 | 46 |
|$\% GM_{1}$| | 61 | 63 | 67 | 64 |
. | (D/pc) < 125 . | . | 125 < (D/pc) < 170 . | . |
---|---|---|---|---|
. | (s/pc) < 0.01 . | (s/pc) > 0.01 . | (s/pc) < 0.01 . | (s/pc) > 0.01 . |
N | 349 | 87 | 321 | 113 |
α | 0.46 ± 0.13 | 0.26 ± 0.32 | 0.089 ± 0.21 | 0.086 ± 0.49 |
r | 0.87 | 0.50 | 0.21 | 0.21 |
<mB > | |$1.56 \, \mathrm{M}_{\odot }$| | |$1.56 \, \mathrm{M}_{\odot }$| | |$1.59 \, \mathrm{M}_{\odot }$| | |$1.59 \, \mathrm{M}_{\odot }$| |
|$\% GM_{2}$| | 40 | 41 | 47 | 46 |
|$\% GM_{1}$| | 61 | 63 | 67 | 64 |
Notes. The table gives the number of binaries in each of the four samples used for the fits shown in Figs 5 and 6, together with the fitted total <ΔV > versus binary mass power-law index α, the 1σ confidence interval in this quantity, and the Pearson statistical correlation between <ΔV > and total binary binned mass, r, average total binary mass in |$\, \mathrm{M}_{\odot }$| and % of binaries having both, |$\% GM_{2}$|, and at least one, |$\% GM_{1}$|, stellar masses from internally supplied Gaia data.
Despite the extensive pruning of the sample and all the various independent checks introduced to limit the presence of any kinematic contaminants, it is of course possible that some remain. To gauge the possibility of this, we explore the distance dependence of our results, given that all errors and kinematic contaminants necessarily grow in importance as the mean distance of the binaries considered increases. With increasing distance the fixed (S/N)ϖ > 100 constraint used implies a larger confidence interval along the line of sight and hence a greater probability of including projection interlopers, while on average stars will appear dimmer, leading to an unavoidable increase in the error intervals for the inferred proper motions and hence a noisier ΔV sample. Further, any hidden tertiary induced wobble will more easily hide below more uncertain stellar parameters.
I test for the sensitivity of the results to distance by repeating the experiment described previously, leaving all quality and kinematic contaminant exclusion cuts the same, but increasing this time the distance cut-off of the sample, to produce a second binary sample independent of the previous one. This time the distance cut is 125 < (D/pc) < 170. The ΔV versus s plot of this new sample is shown in the right panel of Fig. 4, while the s < 0.01 pc ΔV versus mB plot appears in the right panel of Fig. 5, the corresponding s > 0.01 pc ΔV versus mB scaling is shown in the right panel of Fig. 6. From the sample parameters given in Table 1 we see that the mean distance of the sample has increased by |$64{{\ \rm per\ cent}}$|, resulting in important decreases in the final average signal-to-noise values for binary ΔV, stellar proper motion and parallax of |$52.3{{\ \rm per\ cent}}$|, |$47.8{{\rm per\ cent}}$|, and |$44.5{{\rm per\ cent}}$|, respectively.
Such important decreases in the quality of the sample inevitably lead to noisier ΔV data where all trends will be less clear, independent of the probable increased appearance of kinematic contaminants. The right panel of Fig. 4 no longer shows such a close correspondence to the Newtonian expectations as seen in the left panel, rms values for ΔV are now above the JT10 line for almost all the interval probed. For s > 0.01 pc we still see the suggestion of a flat region, although it is now much less well defined and dispersion between bins is more prominent.
In comparing the right and left panels of Figs 5 and 6 we see that the ΔV versus mB scalings have disappeared, with <ΔV > values showing no clear scaling with mB. Indeed, the results of linear fits are both consistent with no dependence of ΔV on mB, yielding α = 0.089 ± 0.21 and α = 0.086 ± 0.49 with very low statistical correlations of r = 0.21 and r = 0.12 for the s < 0.01 pc and the s > 0.01 pc regions, respectively. Although these power-law scalings remain consistent with the Newtonian value of 0.5 at a 2σ level, the poor correlation obtained makes this final result more suggestive of indicating the presence of kinematic contamination, particularly when compared to the much more relevant and precise results obtained for these scalings using the D < 125 pc sample. From Table 2 we also notice a slight increase in <mB > for the more distant sample, showing stars of smaller masses beginning to drop from the sample, at fixed minimum (S/N)ϖ > 100.
Thus, a combination of noisier data, and a greater probability of kinematic contamination render wide binary samples with mean distances even |$64{{\ \rm per\ cent}}$| larger than 125 pc, inadequate for the purpose of inferring fine details regarding the scalings of their internal kinematics.
5 COMPARISON TO RECENT INDEPENDENT STUDIES
In this final section, I compare the results obtained to a couple of recent studies exploring also Gaia wide binaries in the context of testing gravity in the low acceleration regime. The first is the work of Pittordis & Sutherland (2023), who examine velocities on the plane of the sky for a much larger Gaia sample than the one treated here. These authors present their results in terms of distributions of the |$\tilde{v}$| parameter for various s cuts, where
a measure first introduced in Banik & Zhao (2018). Clearly, |$\tilde{v}=1$| for a Newtonian binary having a circular orbit within the plane of the sky. Projection effects will lead to a distribution of |$\tilde{v}$| values below 1 for an observed population, while the presence of a distribution of ellipticities will broaden the distribution, which under Newtonian dynamics, will have a sharp upper cut at the escape velocity of |$\tilde{v}=\sqrt{2}$|. I now calculate the |$\tilde{v}$| values for all the 450 wide binaries appearing in the left panel of Fig. 4. The resulting sample of |$\tilde{v}$| values was then divided into two sub-samples according to the separation of the binaries, distinguishing between s < 0.01 pc and s > 0.01 pc. These two distributions are shown in Fig. 7, with the left panel giving results for s < 0.01 pc, and the right panel for s > 0.01 pc, red lines. The blue histograms, taken from Banik & Zhao (2022), give predictions for a Newtonian model and for a MOND model including the external field effect of standard AQUAL or QUMOND proposals. The numbers in the y-axis give the binned occupancy values for the wide binaries treated, in the right panel, while in the left panel, the data has been re-scaled so as to allow a uniform comparison, by a factor of 0.66.

Left(a): distribution of |$\tilde{v}$| values for the wide binaries in the left panel of Fig. 4, for the s < 0.01 pc Newtonian region, red histogram. The blue histogram gives Newtonian predictions for this distribution from Banik & Zhao (2022). Right(b): distribution of |$\tilde{v}$| values for the wide binaries in the left panel of Fig. 4, for the s > 0.01 pc low acceleration region, red histogram. The blue histogram gives predictions for the distribution of |$\tilde{v}$| values under an AQUAL MOND model and the green one corresponding predictions for a MOND without external field effect model, both from Banik & Zhao (2022).
For context on the external field effect (EFE) of MOND, recall that the theory is inherently non-linear, and includes an explicit acceleration scale, a0, with the character of dynamics depending sensitively on the ratio of the acceleration present to a0. This implies that a small bound system immersed within a larger mass distribution will behave in ways which are determined both by its internal acceleration, and by the local acceleration felt as part of the larger system, beyond the inclusion of tides (see e.g. Milgrom 1986). As the details of exactly how this EFE applies are dependent on the exact particular MOND proposal, all of which share identical predictions regarding the low acceleration behaviour of centrifugal equilibrium orbits about static isolated systems, e.g. Milgrom (2011) or Milgrom (2022), it is common to find predictions for behaviour which is sensitive to the EFE presented for particular well studied proposals, such as AQUAL or QUMOND, and also for the extreme limiting case of no EFE, for example Pittordis & Sutherland (2018).
Looking at the left panel of Fig. 7 we see that the obtained distribution of |$\tilde{v}$| values is broadly consistent with the Newtonian expectations, the peak appears well within values of |$\tilde{v}=1$|, and no systems appear at values above |$\tilde{v}=1.5$|. The differences between both curves are due to the small numbers of binaries available after the cuts imposed (Poisson noise confidence intervals on the red curve, which have not been added to avoid cluttering the figures, make both histograms consistent), and to any offset between the assumed ellipticity distribution of the Newtonian models constructed by Banik & Zhao (2022) and the actual present-day e distribution. The same can be said of the red and blue histograms in the right panel of Fig. 7, which compare our results for the low acceleration s > 0.01 pc region and the AQUAL prediction of this quantity, also from Banik & Zhao (2022). Compared to the Newtonian s < 0.01 pc distributions, the s > 0.01 pc ones show a slight broadening, and also a slight displacement towards larger values of the |$\tilde{v}$| parameters, with a mode still well within |$\tilde{v}=1$|. Thus, our results are strongly suggestive of an AQUAL phenomenology at the low acceleration s > 0.01 pc region.
The green histogram in this last figure, also from Banik & Zhao (2022), shows the expectations of a MOND without external field effect model, corresponding to the deep MOND limit of an isolated system, for example the models known to accurately reproduce the rotation curves of isolated spiral galaxies, for example Lelli et al. (2017). With respect to the blue histograms, this last prediction shows a very distinct triangular shape, significantly shifted to the right, with a mode appearing at values of |$\tilde{v}$| larger than 1, a region where the observed distribution presents almost no points. This last distribution is clearly inconsistent, both qualitatively and quantitatively with the observed one.
The red histograms shown in Fig. 7 can now be compared to those presented recently by Pittordis & Sutherland (2023), their fig. 9. We see that results from these authors are quite similar to mine in the s > 0.01 pc right panel of Fig. 7, in the |$\tilde{v}\lt 2$| region, for all the large samples these authors present, of about 2000 binaries, all in various sections of the low acceleration s > 0.01 pc regime. However, theirs present extended distributions reaching values as high as |$\tilde{v}=7$|. This feature shows the extensive presence of kinematic contaminants in the large sample of Pittordis & Sutherland (2023), and alerts to their presence also within the sensitive |$\tilde{v}\lt 2$| region which has to be used to discriminate between different gravity models. These authors are aware of this problem, and model their results through the inclusion of a fraction of hidden tertiaries and flyby contaminants, rather than adopting the strategy followed here, of attempting the removal of these contaminants, which would necessarily result in a much smaller sample. Thus, an statistical approach is taken where the fraction of hidden tertiaries and of flybys is allowed to vary so that optimal values for these parameters are obtained. Considering three plausible ellipticity distribution functions for both Newtonian and MOND scenarios, the final results show optimal fits with a hidden tertiary fraction of |$50{{\ \rm per\ cent}}$| which are a good representation of the data for Newtonian models. Reduced chi squared values which are within 1.2σ of Newtonian expectations but over 3σ away from MOND models for thermal ellipticity distributions result, although the difference between the best MOND model and the worst Newtonian one is smaller than the range of values covered by either of these two options.
Hence, the above authors obtain results which favour standard Newtonian gravity over both MOND variants discussed earlier. However, having missed entirely the Newtonian region, as the smallest s value they consider is s = 0.025 pc, they have no firm Newtonian point against which to calibrate or test the robustness of the scheme used. It is quite possible that had they included a Newtonian calibration point, a regime transition at s = 0.01 pc would have been apparent. The inclusion of a firm calibration Newtonian region in Pittordis & Sutherland (2023) would also allow a clearer separation of the degeneracies inherent to a problem where the fraction of hidden tertiaries, the fraction of unbound flybys and any changes in the structure of gravity appear intermixed. The high best fit hidden tertiary fraction obtained by these authors could well be a reflection of an enhanced effective value of G in this low acceleration region. Indeed, Chae (2023) (see further), obtains a lower best fit hidden tertiary fraction than what Pittordis & Sutherland (2023) find, while having a sample which extends into the high acceleration region to calibrate this number against firm Newtonian expectations.
Returning to Fig. 7, right panel, the clear inconsistency of the MOND without external field effect green histogram when compared to the data presented here, shows that the analogy with a flat rotation curve galactic region suggested by the low acceleration behaviour in Fig. 4, is limited. The flat behaviour shown by the rms values of the ΔV populations in this figure is hence the result of the details of the distributions. While the deviation from Newtonian behaviour is evident, the precise manner of this deviation has to be deduced from more than just a single moment of the distribution obtained. The |$\tilde{v}$| distributions shown in Fig. 7 make it obvious that a MOND variant including an external field effect is preferred by the data.
That the hidden tertiary cleaning strategy implemented here is highly successful, albeit leaving only a very small final sample, can now be confirmed through three independent checks. First, the Newtonian prediction of JT10 is very accurately traced in the s < 0.01 pc region of the left panel of Fig. 4, in spite of the fact that the rms statistic being used in this comparison is highly sensitive to outliers. Then, the mass-velocity power-law scaling resulting for the earlier mentioned sample is 0.45 ± 0.13, in excellent agreement with Newtonian expectations, despite the small dynamical range in mass available. Lastly, the |$\tilde{v}$| distributions for all regimes of the final binaries used show none of the extended tail at values higher than 2.6, out to 6 or even 7, which are characteristic of the results of the Pittordis & Sutherland group, who perform no cleaning of hidden tertiaries or flyby contaminants. In the present sample only two binaries appears with |$\tilde{v}$| values above 2, and none above 2.6. If any meaningful population of hidden tertiaries remained, none of the above three conditions would be met.
As explained in Section 3, it is only through rms 1D values that the results can be compared against an existing Newtonian prediction including projection effects, a distribution of initial ellipticities, and crucially, 10 Gyr of evolution self-consistently included in the work of JT10, who present final 1D rms results. Note that no other wide binary analysis in the context of tests of gravity presently found in the literature has included any consideration of the effects of dynamical evolution in the galactic environment, tidal fields and interaction with field stars, a non-negligible effect on the loosely bound wide binaries being studied.
The use of the 1D results of JT10 also allow a careful comparison to the high-acceleration Newtonian region, as shown in the left panel of Fig. 4. This acts as a consistency check on the study as a whole. The highly accurate agreement between present results for s < 0.01 pc and the predictions of JT10 validates the procedure used here and shows that the final sample is highly free from kinematic contaminants of any type. This comparison is only possible in the variables currently shown. Still, the use of the rms statistic is not entirely intuitive and lead to the mistaken interpretation in previous work by my group of the flattening in this presentation of the data as evidence for a MOND without external field effect model. The results shown in the right panel of Fig. 7 show this last to have been a mistaken interpretation of results.
In order to explore the character of the non-Newtonian s > 0.01 pc region better, I now repeat the left panel of Fig. 4, but showing not the rms value but the means at each of the same binning intervals of the previous figure. This is given in Fig. 8, where the line shows again the Newtonian expectations of JT10, for comparison. As it is the mean rather than the rms values at each bin which are being plotted, the Newtonian region now falls slightly below the rms Newtonian expectations, but reassuringly, for s < 0.01 pc, the scaling still follows the ΔV ∝ s−1/2 of Newtonian expectations. In going to the low acceleration s > 0.01 pc (2000 au) region, just as in Fig. 4, a clear departure from Newtonian behaviour becomes apparent, although this time, the mean ΔV values no longer remain flat, but show a fixed fractional boost over the extrapolation of the Newtonian region. In the s > 0.01 pc region we see mean values tracing the Newtonian scaling law, but above by a small factor.

The figure shows the same data appearing in the left panel of Fig. 4, but showing this time the mean values at each bin, rather that the rms values of Fig. 4. For comparison, the diagonal line is the same as in Fig. 4, the vertical one again shows the MOND radius for the mean total mass of the binaries shown.
This last figure can be compared to the recent results of Chae (2023), which appeared while the present paper was being referred. This author takes a similar approach to Pittordis & Sutherland (2023) discussed earlier, in that a large wide binary sample is maintained by not attempting a clearing of hidden tertiaries, but in contrast to Pittordis & Sutherland (2023), Chae (2023) does efficiently remove flyby contamination through the use of an isolation criterion for the binaries being considered, taken from El-Badry, Rix & Heintz (2021). This results in a much cleaner sample, evident from the lack of a high |$\tilde{v}$| distribution, as is also the case in my present study. The treatment of hidden tertiaries between the above two studies is similar, in that the hidden tertiary fraction is treated as a free parameter to be determined statistically. However, the wide binary sample considered by Chae (2023) does extend into the Newtonian regime, reaching down to the same s = 0.001 pc lower value as I use here. This allows Chae (2023) a crucial calibration at the Newtonian region where an excellent agreement between the data and Newtonian expectations allows an accurate calibration of the hidden tertiary fraction, as well as a consistency check of the whole procedure.
Chae (2023) reports wide binary data accurately tracing Newtonian expectations out to s = 0.01 pc, after which a clear and highly significant departure appears, which for the first time was identified in that paper as being consistent with an AQUAL MOND description of the data, as confirmed in my present study. Chae (2023) then presents a comparison to the first version of my present paper as it appeared on the arXiv on submission to the journal, by plotting in his fig. 32 his data in a ΔV versus s scatter plot, with binned values showing mean values at each bin. This last figure is qualitatively and quantitatively consistent with the result shown in Fig. 8 here. It is very reassuring of the underlying robustness of the results obtained here that a completely independent and highly complementary (in terms of the treatment of hidden tertiaries) study should reach conclusions consistent with those presented here.
We note finally that Chae (2023) intended his fig. 32 as a comparison to Fig. 4 in the first draft of the present study, but failed to note that in Fig. 4 of my study, for reasons already detailed, what is being plotted is the rms value at each bin and not the mean. For this reason Chae (2023) mentions an accurate agreement with the results of the first draft of the present study in terms of an accurate tracing of the Newtonian expectations out to s = 0.01 pc, followed by a clear and highly significant departure upwards in velocity for s > 0.01 pc (2000 au). However, Chae (2023) also notes a qualitative difference with my earlier results, as the mean binned ΔV values do not remain flat but rather show a parallel tracing of the Newtonian scaling, but a small factor above. This is indeed the exact behaviour shown in Fig. 8, the flat binned behaviour is found here when plotting the rms values, not the means. Hence, agreement with Chae (2023) is complete.
6 DISCUSSION
The results shown in the left panels of Figs 4 and 5 for the <D/pc > =90.25 sample clearly display a gravitational anomaly in the s > 0.01 pc region which can not be attributed merely to the presence of noise due to random uncertainties in the Gaia data used; the mean relative velocity signal-to-noise ratio of this sample, <S/N > ΔV = 15.7. If random noise were determining the signal recovered, the clear Newtonian signal seen immediately before s = 0.01 would be lost and not show the accurate tracing of the JT10 prediction right up to the appearance of the flat rms ΔV regime. Indeed, in the much noisier <S/N > ΔV = 7.5, <D/pc > =147.6 sample, we see a gradual departure upwards of the recovered (ΔV)rms values which never closely trace the JT10 prediction.
Therefore, understanding the s > 0.01 pc scalings inferred here within a Newtonian framework requires the assumption of carefully crafted kinematic contaminants, so as to accurately reproduce the close accordance with the scalings obtained. Pittordis & Sutherland (2023), analysing only the low acceleration s > 0.01 pc region, have show that the observed ΔV versus s signal can be reproduced by assuming a suitable combination of unbound flybys and hidden tertiaries as kinematic contaminants of local wide binary samples. The above authors show that an optimal fit to the non-Newtonian signal is achieved by assuming a close to |$20{{\ \rm per\ cent}}$| flyby contamination (unbound flybies as a fraction of true bound binaries) in the largest separation intervals, see their Tables 4–6. From the left panel of Fig. 4 we see that this would have to apply to binaries with internal separations below 0.06 pc, at flyby encounter velocities of close to 0.4 km s−1.
Given the mean interstellar separation of 1 pc of the solar neighbourhood, stars with a Gaussian velocity distribution with a σV close to 40 km s−1, random encounter relative velocities will also show a Gaussian ΔV distribution with a |$\sigma _{\Delta V} = \sqrt{2} \times 40 \approx 60$| km s−1. Hence, both in terms of spatial separations and relative velocities, the required flybys as kinematic contaminants are inconsistent with random encounters in the field. One has to assume a strong degree of correlation invoking initial conditions, essentially a common origin for the stellar pairs making up the unbound flyby population. At current separations below 0.06 pc and current relative velocities of ≈0.4 km s−1, separation doubling times will be below 1.5 × 105 yr, a very small fraction of 1.5 × 10−5 the typical 10 Gyr ages of the low-mass solar neighbourhood stars in question. It is therefore far from obvious that a fully self-consistent solar neighbourhood dynamical model can be constructed where close to |$10{{\ \rm per\ cent}}$| of the wide binaries presented are unbound flybies, at the observed separations of s < 0.06 pc and relative velocities of ≈0.4 km s−1.
Regarding hidden tertiaries as kinematic contaminants of wide binary samples (as originally highlighted theoretically by Banik & Zhao 2018 and latter explicitly shown by Clarke 2020), Pittordis & Sutherland (2023) have shown that a very high |$50{{\ \rm per\ cent}}$| hidden tertiary fraction is required, for every bound binary formed from two single stars, one bound binary containing a hidden tertiary is required (in addition to the amount of flybies mentioned earlier) to reproduce the high velocity tail of the observed ΔV binary distribution.
Assuming such a high fraction of hidden tertiaries in our current sample is inconsistent with the estimates of Penoyre et al. (2020) and Belokurov et al. (2020), who estimate a hidden tertiary contamination (including even hot or outer Jupiters) below |$5{{\ \rm per\ cent}}$| for a RUWE < 1.4 cut, D < 1 kpc, using the CMD cleaning which they propose and which we follow, in their case for the 22 month Gaia DR2 sample. Invoking a |$50{{\ \rm per\ cent}}$| hidden tertiary contamination in our much cleaner 34 month Gaia DR3 (RUWE < 1.2, <RUWE > =1.01, D < 125 pc, <D > =90.3 pc) sample, further restricted by the imposed internal Gaia binary probability filter of BP < 0.2 resulting in a final <BP > =0.07 for all the stellar sources included, from which the fastest |$3{{\ \rm per\ cent}}$| ΔV values across all bins have been removed, is highly unlikely.
If the phenomenology of the s > 0.01 pc region is attributed to hidden tertiaries, the excellent agreement with Newtonian predictions even immediately below s = 0.01pc would require the abrupt termination of this hypothetical contamination, a scenario which is not supported by the available empirical evidence which shows no relevant trend with wide binary separation on the frequency of tertiary systems, in the regimes probed to date, for example Tokovinin et al. (2002) and Tokovinin, Hartung & Hayward (2010).
Finally, even if a suitable combination of kinematic contaminants can be put together to reproduce the ΔV versus s scalings seen in the left panel of Fig. 4, any such arrangement would also have to satisfy the ΔV versus mB scalings present in the data, as shown in Figs 5 and 6. Also, given the reported confidence intervals in all stellar properties as given in Gaia DR3, which have been used in terms of a full error propagation analysis leading to the error bars shown in all the figures presented, it appears unlikely that the signal obtained in the s > 0.01 pc region could be the result of error-inflated relative velocities, as <S/N > ΔV values in this low acceleration region of Figs 4 and 8 are 7.31 and 7.53 for Dec. and RA, respectively. Thus, it is possible that we could in fact be detecting the low acceleration validity limit of standard gravity, at a regime where the inclusion of dark matter would appear contrived. Indeed, such is the conclusion independently reached by Chae (2023), whose complementary wide binary results closely match what has been presented here.
Following the pioneering approach of MOND (Milgrom 1983), within a classical framework, and Bekenstein (2004) in terms of covariant extensions to GR, a multitude of modified gravity and modified inertia proposals now exist which do not require proposing a hypothetical dark matter component to explain galactic rotation curves and cosmological observations, for example Moffat & Toth (2008), Zhao & Famaey (2010), Capozziello & De Laurentis (2011), Verlinde (2016), Barrientos & Mendoza (2018), Hernandez, Sussman & Nasser (2019b) or Skordis & Złośnik (2021), to mention but a few. Unfortunately, the available predictions of most of these, particularly covariant extensions of GR, are limited to spherically symmetric and static metrics. Calculating modified gravity/modified inertia orbits of binary stars within the global potential of the Milky Way is something which has only been done for a very limited set of specific MOND variants (e.g. the particular QUMOND option explored by Banik & Zhao 2018, or within the superfluid dark matter proposal which yield much the same expectations as QUMOND, as shown by Berezhiani & Khoury 2015). Hence, the results presented here can not presently be compared to the majority of existing modified gravity/modified inertia proposals, with some exceptions as presented in Pittordis & Sutherland (2018) and in Section 5 here.
Finally note that the many orders of magnitude in mass, velocity, and scale which separate a < a0 local wide binaries from the a < a0 galactic regime makes these interesting binaries a crucial subject for further study, which could yield surprises in terms of unknown kinematic contaminants yet to be considered under a Newtonian framework, or important constraints to limit modified gravity scenarios helping to eventually find a complete extended theory. A more careful analysis of uncertainties and of any remaining systematics in the Gaia data, as the mission timeline continues to extend and the catalogue validation advances, will also increase the robustness of the conclusions presented, further reducing the probability of error-inflated relative velocities or hidden tertiaries affecting our results.
ACKNOWLEDGEMENTS
I acknowledge useful conversations with Charalambos Pittordis, Kyu-Hyun Chae, Indranil Banik, Hong-Sheng Zhao, and Ricardo Cortés on all aspects of this work. I also acknowledge the critical input of an anonymous referee as important towards clearing up a mistaken interpretation of my results as presented in the original version of this study, as well as having provided substantial constructive criticism leading to a much improved final version. All data retrieval, processing, statistical analysis, and presentation was performed using software developed jointly with Stephen Cookson. I acknowledge financial assistance from UNAM DGAPA grant number IN106220 and CONACYT as well. This work has made use of data from the European Space Agency (ESA) mission Gaia (https://www.cosmos.esa.int/gaia), processed by the Gaia Data Processing and Analysis Consortium (DPAC, https://www.cosmos.esa.int/web/gaia/dpac/consortium). Funding for the DPAC has been provided by national institutions, in particular the institutions participating in the Gaia Multilateral Agreement.
7 DATA AVAILABILITY
All data used in this work will be shared on reasonable request to the author.
References
APPENDIX: COMPLIMENTARY MATERIAL
To finalize, a series of variants on the results presented previously are shown here, as additional checks of the self-consistency of the study and the results presented. First, it is clear that although R.A. and Dec. observations for a single binary represent orthogonal observations providing two perpendicular 1D relative velocity measurements which can be used as such in terms of the projection effects of the problem, they are not independent observations, as they refer to the same binary star. Thus, the presence of any kinematic contaminant would affect both RA and Dec. measurements, and it is interesting to investigate if the signal found in the low acceleration regime is robust to the use of two data points per binary or not. The first figure in this appendix is a repeat of Fig. 4, where RA and Dec. relative velocity observations for each binary have been added in quadrature to provide a single 2D relative velocity measurement on the plane of the sky per binary. The Jiang & Tremaine (2010) prediction has hence been shifted upwards by a factor of |$\sqrt{(}2)$|, as there is nothing special about the RA Dec. directions. The reduction in the number of data points imposes a reduction in the number of bins considered, with a corresponding reduction in separation resolution of the figure, when compared to the original Fig. 4. Despite this, the left panel of Fig. A1 is fully consistent with the original left panel of Fig. 4. We see the data tracing the 2D rms prediction of Jiang & Tremaine (2010) from below in the s < 0.01 pc (2000 au) region, and then flattening out in the low acceleration s > 0.01 pc region. This validates the use of two perpendicular relative velocity entries per binary used throughout, as indeed was also the case in all the published papers by my group on wide binaries since Hernandez et al. (2012). In essence, each binary has been observed twice in terms of projected 1D relative velocity on the plane of the sky, through RA and Dec. perpendicular velocity projections.

Left(a): kinematic plot showing 2D ΔV values on the plane of the sky, dots, for binaries in the (D/pc) < 125 pc sample as a function of the internal separation of each binary in pc, s, RA and Dec. values have been added in quadrature for each binary. The points with error bars give the binned rms values for this same quantity, with the horizontal bars giving the bin size and the vertical ones 1σ confidence intervals on the quantity being plotted. The diagonal line gives the prediction for the binned rms 2D ΔV values on the plane of the sky for |$2\, \mathrm{M}_{\odot }$| solar neighbourhood wide binaries from Jiang & Tremaine (2010), and the vertical one shows the MOND radius for the mean total mass of the observed binaries shown, |$\lt m_{B}\gt =1.56 \, \mathrm{M}_{\odot }$|. Right(b): the figure shows the same data appearing in the left panel of Fig. 4, but showing this time the median values at each bin, rather that the rms values of Fig. 4 or the means of Fig. 8. For comparison, the diagonal line is the same as in Fig. 4, the vertical one again shows the MOND radius for the mean total mass of the binaries shown.
The further test is a remake of Fig. 8, again, using the same data as in the original, but this time showing the median value per bin rather than the means. This is motivated by the greater robustness of rank statistics and provides a further consistency check on the robustness of our results. This is shown in the right panel of Fig. A1, where the diagonal line is the same Jiang & Tremaine (2010) prediction for the Newtonian rms values as was shown in Fig. 8. We see the medians appearing slightly below the means, as the points with error bars have shifted downwards with respect to those of Fig. 8. The 1σ confidence intervals on this rank statistic have been calculated using j± = (n/2) ± (n/4)1/2, where j± are the ranks of the entries defining the 1σ confidence intervals and n is the occupancy number per bin, for example Conover (1999). The figure remains qualitatively the same, a tracing of the Newtonian scaling in the s < 0.01 (2000 au) separation limit, and then a transition to a boosted Newtonian scaling beyond about s = 0.015 pc, accurately tracing the AQUAL predictions, and consistent with the first mention of this feature in Chae (2023). No occupancy numbers per bin have been given in these last two figures, as the data are the ones already shown in the left panel of Fig. 4 and in Fig. 8, and also to allow a better, uncluttered appreciation of the trends present.
The last figure in the appendix shows the |$\tilde{v}$| models and observed distributions of Fig. 7, but this time both Newtonian and MOND AQUAL models are given in each panel, with the observed distribution appearing in the left panel for the s < 0.01 region of Fig. 4, and for the s > 0.01 one in the right one. The observed distributions are given by the red histograms, while the Newtonian models of Banik & Zhao (2022) appear in blue, and the MOND AQUAL models of these same authors in aqua. Although Newtonian and MOND predictions are quite similar in this metric, the Newtonian distribution raises faster than the MOND one for small values of |$\tilde{v}$|, and indeed, the observed distribution in the s < 0.01 pc traces this raise better than the MOND one, and better than the observed distribution for s > 0.01 pc, which traces better the raise of the MOND model. Similarly, in the |$1.0\lt \tilde{v}\lt 1.5$| region, the MOND model predicts a larger signal than the Newtonian model, as is also seen when comparing the observed distributions for the s > 0.01 pc region and for the s < 0.01 pc one. Thus, we see here a closer correspondence of the MOND expectations to the observed s > 0.01 pc region, and of the Newtonian ones to the observations for s < 0.01 pc data. However, it is also evident that the poor sampling afforded by the small binary candidates surviving all the rigorous kinematic cleansing procedures implemented, makes this comparison unsuitable for drawing any conclusions. For this reason groups using this |$\tilde{v}$| comparisons require binary samples running into the many thousands of stars, and for this reason too, conclusions in this present paper are drawn from the much clearer data presentations of Figs 4, 5, 6, 8, and A1.
Indeed, comparing Figs A1 and A2 it is clear that a departure from Newtonian expectations is much easier to spot in terms of the data presentation of Fig. A1 than from those of Fig. A2. Similarly as found here, the results of Chae (2023), fig. 32, which is equivalent to Fig. 8 here, very evidently show a departure from Newtonian expectations on reaching separations of 0.01 pc, 2000 au, as is also the case for the very clear acceleration plots by that same author, making a departure from Newtonian expectations for the low acceleration regime, in terms of a0, clearly explicit.

Left(a): distribution of |$\tilde{v}$| values for the wide binaries in the left panel of Fig. 4, for the s < 0.01 pc Newtonian region, red histogram. The blue and aqua histograms give Newtonian and MOND predictions, respectively, for this distribution from Banik & Zhao (2022). Right(b): distribution of |$\tilde{v}$| values for the wide binaries in the right panel of Fig. 4, for the s > 0.01 pc low acceleration region, red histogram. The blue and aqua histograms give Newtonian and MOND predictions, respectively, for this distribution from Banik & Zhao (2022).