- Split View
-
Views
-
CiteCitation
Christopher Hirata, Uroš Seljak; Shear calibration biases in weak-lensing surveys, Monthly Notices of the Royal Astronomical Society, Volume 343, Issue 2, 1 August 2003, Pages 459–480, https://doi.org/10.1046/j.1365-8711.2003.06683.x
Download citation file:
© 2018 Oxford University Press
Close -
Share
Abstract
We investigate biases induced by the conversion between the observed image shape and shear distortion in current weak-lensing analysis methods. Such overall calibration biases cannot be detected by the standard tests such as E/B decomposition or calibration with stars. We find that the non-Gaussianity of the point spread function has a significant effect and can lead to an error of up to 15 per cent on the linear amplitude of fluctuations σ8, depending on the method of analysis. This could explain some of the discrepancies seen in recent amplitude determinations from weak lensing. Using an elliptical Laguerre expansion method we develop a re-Gaussianization method that reduces the error to a calibration error of the order of 1 per cent, even for poorly resolved galaxies. We also discuss a new type of shear selection bias, which results in up to roughly an 8 per cent underestimation of the signal. It is expected to scale with redshift, inducing errors in the growth factor extraction if not properly corrected for. Understanding and correcting for such effects is crucial if weak lensing is to become a high-precision probe of cosmology.
Introduction
Gravitational lensing is a powerful tool for probing the large-scale matter distribution in the Universe (see, e.g., Bartelmann & Schneider 2001 for a review). As such it offers the possibility of breaking the degeneracies between many cosmological parameters (e.g. Hu & Tegmark 1999) and serves as a powerful complementary method to the cosmic microwave background anisotropies. Its main advantage is that it is sensitive to the dark matter directly and so avoids many of the complications present in other cosmological probes at low redshift such as galaxy clustering or cluster surveys. Recent first detections of shear correlations (Bacon, Refregier & Ellis 2000; Van Waerbeke et al. 2000; Rhodes, Refregier & Groth 2001; Brown et al. 2003; Hoekstra et al. 2002; Jarvis et al. 2003) have already demonstrated the promise of this method. Upcoming surveys such as the NOAO deep survey (www.noao.edu/noao/noaodeep/), the CFHT legacy survey (www.cfht.hawaii.edu/Science/CFHLS/) and the Deep Lens Survey (dls.bell-labs.com/) will go beyond simply detecting cosmic shear and will map it over large areas of the sky. Finally, proposed projects such as DMT/LSST (www.dmtelescope.org/darkhome.html), Pan-STARRS (www.ifa.hawaii.edu/pan-starrs/) or SNAP (snap.lbl.gov) will map a large fraction of the sky with high depth, achieving unprecedented statistical precision.
Advances in statistical precision require parallel progress on the systematic errors if the information from the current and upcoming experiments is to be exploited fully. The effects one is looking for are extremely small, so any small errors in the calibration or data reduction can lead to a large error in the final result. The most commonly used methods of reduction so far are based on work by Kaiser, Squires & Broadhurst (1995) (KSB95). Recently, there have been several new approaches proposed dealing with various aspects of this problem developed by Kaiser (2000), Bernstein & Jarvis (2002) (BJ02) and Refregier & Bacon (2003). What is somewhat unclear from this work is how much difference these new approaches make and what are the remaining errors in them. While most of these papers present at least some tests, these are often performed using simplified models and do not attempt to quantify the error as a function of size of the galaxy or their ellipticity. The original KSB95 method was calibrated using simulations, and other authors have investigated calibration issues in some of the variants and extensions of KSB95 numerically (Kaiser 2000; Bacon et al. 2001; Erben et al. 2001). However, the calibration of more recent methodologies such as BJ02 has not yet been studied in detail, nor have the various methods been compared in the same simulation. [Many of the simulations in the literature are not directly comparable since they have different assumptions concerning the distribution of galaxy shapes and sizes and the point spread function (PSF) and noise properties.] The goal of this paper is to address some of these issues in more detail. We pay particular attention to the problem of point spread function dilution in realistic PSF and galaxy shape models, which only affects the overall calibration of the shear (if seeing is not significantly variable across the lensing field) and does not show up in the systematics tests.
Extracting shear from the images is not the only place where a bias might arise in the weak-lensing analysis. Other possibilities include (but are not limited to) intrinsic correlations, errors in the source redshift distribution and so-called selection bias, where a galaxy is preferentially selected when its shape is aligned with the PSF distortion. However, there is another possible bias caused by the fact that a galaxy may also be preferentially selected when aligned orthogonally to the intrinsic shear, because detection algorithms preferentially detect circular objects. This bias depends on the detection threshold and the relative angular size of the galaxy and PSF; it becomes unimportant for poorly resolved galaxies (i.e. galaxies much smaller than the PSF) because in this case the size and shape of the image of the galaxy are determined by the PSF irrespective of the physical orientation of the galaxy. Since it is directly correlated with the signal it is very difficult to detect it using the standard systematics tests such as the 45° rotation test (E/B decomposition) and must be corrected for using simulations or some other method that explicitly takes it into account.
This paper is organized as follows. In Section 2 we review past approaches to PSF corrections, present the formalism for PSF correction based on the elliptical Laguerre expansion, and derive two approximate methods to correct for non-Gaussianity of the PSF. We compare various methods in the literature and estimate their biases using simulations. In Section 3 we discuss a shear selection bias and how it can affect the amplitude of fluctuations. Conclusions are presented in Section 4 and in the appendices we describe technical details on convolution in the Laguerre expansion formalism, the linear PSF method and the KSB95 method.
Psf Corrections
In actual measurements, we do not measure directly the moments of intrinsic galaxy images, but rather we measure the moments of the galaxy image after convolution with the PSF of the telescope. We will denote the intrinsic galaxy image by ƒ(x), the PSF by g(x) and the measured galaxy image by I(x). Our eventual objective is to take the PSF g(x) (which can be reconstructed by considering the images of stars in the field) and the observed image of the galaxy I(x), and construct from these the ellipticity eƒ of the intrinsic image that can (with an appropriate calibration factor) be used as an estimator for the local shear.
- (1)
Is the weight function a circular Gaussian, an elliptical Gaussian or some other shape?
- (2)
Is the weight function used for the PSF the same as that for the galaxy, or is it matched to the size of the PSF?
- (3)
Does the method (partially) deconvolve the galaxy image or does it directly correct the galaxy moments for the effect of the PSF? In the latter case, which corrections are performed using perturbation theory, which are performed by comparison with simulations, and which are performed using simple analytical results valid in limiting cases?
Question 1 has typically been decided in favour of a Gaussian weight due to its rapid convergence to zero at large radii (hence the polynomial moments are all convergent) and the absence of singularities, as well as general mathematical convenience. The majority of recent theoretical (Kaiser 2000; Refregier & Bacon 2003) and observational (Bacon et al. 2000; Van Waerbeke et al. 2000; Rhodes et al. 2001; Hoekstra et al. 2002) analyses have followed KSB95 in using a circular Gaussian weight function. The elliptical Gaussian weight function was introduced by BJ02; although it is considerably more complicated to work with, a general ellipse can be a better match to the actual shape of an object than a circle. This property is important for those methods that treat the deviation of an object or PSF from Gaussianity via perturbation theory, including our linear and re-Gaussianization methods.
There is little consensus in the literature on the best answer to Question 2. The advantage of measuring the PSF with its own weight function, independently of the galaxy, is that the PSF weight function can be matched to the size (and, in the case of elliptical moments, the shape) of the PSF. This is necessary for methods (e.g. re-Gaussianization) that treat the non-Gaussianity of the PSF perturbatively. The disadvantage of measuring moments of the PSF with a weight function matched to the PSF is that features of the PSF outside the scale radius of the weight function are suppressed. Features at radii large compared with the characteristic scale of the PSF are present in nearly all real-world PSFs: they are introduced as diffraction rings in space-based observations, and as a power-law (g∝r−11/3) tail from Kolmogorov turbulence in ground-based observations. If the galaxy is of size comparable to or somewhat larger than the PSF, these features can fall inside the scale radius of the weight function of the galaxy, with potentially disasterous results for attempts to unravel the effects of the PSF on the measured galaxy moments (see Section 2.2). The higher-order moments of the PSF must therefore be considered in order to take into account features at large radius if a PSF weight matched to the size of the PSF itself is used. Numerical tests (Hoekstra et al. 1998) indicate that the KSB95 method supplemented with a pre-seeing shear polarizability calculation (Luppino & Kaiser 1997) works best when the PSF is measured with the same weight function as the galaxy; this is unsurprising, since this method does not consider any moments higher than the fourth order (see Appendix C).
Finally, we arrive at Question 3: how do we convert the measured moments of the galaxy and PSF into an ellipticity of the intrinsic galaxy image? Most methods, including Luppino & Kaiser (1997), Hoekstra et al. (1998) and BJ02, have corrected the galaxy moments for the effect of the PSF using either perturbation theory, simple analytical corrections or some combination thereof. (The original KSB95 work used simulations to compute the diluting effect of the PSF on the galaxy.) Refregier & Bacon (2003) developed a deconvolution method based on the circular Laguerre expansion of the PSF and galaxy (see Section 2.2), writing the PSF convolution as a matrix acting on the vector of galaxy moments and cutting off the hierarchy of Laguerre moments at sixth order to prevent noise amplification in the deconvolution (matrix inversion). BJ02 have investigated matrix inversion methods as well using the raising and lowering operator formalism, with particular attention paid to the effects of the matrix inversion on noise. In this paper, we introduce and investigate two perturbative approaches, expanding around the case of elliptical Gaussian PSFs and galaxies. The ‘linear’ method (Appendix B) is essentially a first-order approximation to the matrix inverse of Refregier & Bacon (2003), generalized to elliptical moments. It improves significantly on the same order method proposed by BJ02 in their Appendix C and should be of particular interest to those using Sloan Digital Sky Survey photo reduction for weak lensing. The ‘re-Gaussianization’ method (Section 2.2) treats the deviations of the PSF from Gaussianity perturbatively in real space, without first measuring moments. In our tests this was the most successful method and the only one that comes close to the precision requirements of the future surveys. The introduction of non-linear calculations such as higher-order perturbation theory or direct matrix inversion can be expected to further improve the accuracy of the method because the full non-linear dependence of measured moments on the PSF is taken into account; however, one must be careful to avoid amplifying noise or creating unstable estimators.
We organize this section as follows. The adaptive moment determination is discussed in Section 2.2. In Section 2.2 (and in Appendix A), we explore the generalization of the Laguerre expansion of Refregier (2003) and BJ02 to the case of an elliptical weight. In Section 2.2, we explore the limitations of BJ02's analytical correction for PSF dilution; the re-Gaussianization method is introduced in Section 2.2. We conclude by comparing the methods in simulations (Section 2.2).
Adaptive moments
Because of the presence of the w, the adaptive moments are weighted. [It is simpler to work with unweighted moments, but these have divergent noise (KSB95).]
Elliptical Laguerre expansion
. This provides us with the property that u2+v2=xTM−1x. The Laguerre basis functions are where where m=p−q is the angular momentum quantum number and the + sign is used in u± iv if m > 0 and the − sign is used if m < 0. (Both signs are equivalent if m= 0.) Equations (7) and (8) can be seen to be equivalent to BJ02's equations (6)–(8) through the use of the explicit expression for the associated Laguerre polynomials. Note that Λpq is a polynomial in two variables with complex integer coefficients and Λpq is the complex conjugate of Λqp. In Table 1 the Λ polynomials are given through fourth order; in the last column we give the result in polar coordinates. We note that the Λ polynomials are related to Refregier (2003)'s polar Hermite polynomials according to Λpq(r cos φ, r sin φ) =Hp,q(r) eimφ.We will find it useful to use the reduced quantities βpq=bpq/b00. BJ02 observe that the Laguerre coefficients associated with the adaptively determined covariance M satisfy β01=β11=β02= 0. One can see this directly from the least-squares residual principle by differentiating equation (2) with respect to (A, x0, M) and setting the result to zero.
PSF corrections
PSF correction is typically performed by correcting the moments of the measured object using the moments of the PSF. There are generally two cases to consider here: either the PSF moments are measured with an elliptical Gaussian weight adapted to the PSF shape rather than the object shape, or they are measured with the same weight function as the object. The former approach is used by BJ02, and we will consider it in detail here; we defer investigation of PSF moments determined with the same weight function as the object.
Another issue that arises is whether to use a rounding-kernel method to symmetrize the PSF. Here we assume that the best-fitting Gaussian to the PSF is circular; this could be achieved using a rounding kernel. An alternative is to apply a shear (to both the image and PSF) to make the best-fitting Gaussian to the PSF circular, and then apply an inverse shear after making the ellipticity measurement. (The transformation of ellipticities is performed according to equation B17.) The rounding kernel method may be better suited to elimination of spurious power from PSF anisotropy, since a rounding kernel can be designed to eliminate arbitrarily many spin-2 moments of the PSF (at least in principle, see, e.g., BJ02); however, rounding kernels increase the effective PSF size which is not desirable. Circularizing the best-fitting Gaussian to the PSF with a shear does not increase PSF size, but the removal of spin-2 moments of the PSF (and hence of the associated spurious power) is incomplete as only the b02 moment is forced to zero by this method.
If we were using unweighted moments, this scheme would be valid for any PSF and galaxy. For weighted moments, equation (13) fails, and a correction for non-Gaussianity of the galaxy and PSF is necessary.
Several methods could be used to improve equation (14). One approach is to compute the correction to the galaxy ellipticity to linear order in the βpq; this approach is pursued in Appendix B. Unfortunately, realistic PSFs (e.g. turbulence- or diffraction-limited PSFs) have significant power at large radii and hence the convergence of the Laguerre expansion is rather slow. This is manifested in Section 2.2, where significant errors are found from terminating the Laguerre expansion after the 22 moment. Therefore, in the next section we will pursue an alternative approach: we fit a Gaussian to the PSF, compute the residuals and attempt to correct the galaxy image for the effects of these residuals. Then we apply equation (14) to this corrected galaxy image. As shown in the simulations (Section 2.2), this method yields good performance for non-Gaussian PSFs and galaxies, except for the most elongated galaxies.
Re-Gaussianization
The adaptive moments of I′ can then be computed, and the PSF correction of equation (14) can then be applied to recover the intrinsic ellipticity e(ƒ).
There are several ‘knobs’ that can be adjusted on the re-Gaussianization method. First, there is the choice of Gaussian approximation to the PSF, equation (20). We have chosen to set MG to be the adaptive covariance of the PSF, i.e. the covariance of the best-fitting Gaussian. However, rather than using the normalization of the best-fitting Gaussian, we have normalized equation (20) to integrate to unity. While this increases the overall power ∫(ϵ2) of the residual function, it yields ∫ ϵ = 0, which ensures that for well-resolved objects (i.e. objects for which the PSF is essentially a δ-function), the ‘correction’ε⊗ƒ0 applied by equation (23) does not corrupt the image I. One could also try a more sophisticated function ƒ0 than a Gaussian, perhaps based on a partial deconvolution of the PSF. We have not explored the possibilities here, but it is clear that an attempt to approximate ƒ will have to make some assumptions regarding the behaviour of ƒ in the regions of Fourier space where the PSF has little power (i.e. where the optical transfer function is close to zero).
Simulation
We may test the PSF correction algorithms via simulation for common categories of galaxy and PSF. The simulation procedure is as follows:
- (i)
construct an intrinsic image ƒ of the test galaxy;
- (ii)
construct a sample PSF g;
- (iii)
convolve the test galaxy and PSF, then pixelize this to yield a ‘measured object’I;
- (iv)
compute the adaptive moments of the test galaxy ƒ, and measured galaxy I;
- (v)
apply the PSF correction scheme to the moments of I to yield the ‘recovered’ ellipticity of the test galaxy;
- (vi)
compare the true and recovered ellipticities e(ƒ) of the test galaxy; and
- (vii)
propagate the ellipticity error through to an error in the shear estimate.
Our simulations here will compare five methods of recovering the initial ellipticity. The re-Gaussianization (‘G’) method is that described in Section 2.2. The linear (‘LIN’) method follows our prescription at the end of Appendix B. The ‘BJ’ method uses BJ02's resolution factor from Appendix C, equation (14), instead of ours, equation (B16). [BJ02 discuss several methods of correcting for PSF dilution, including a matrix inversion method which from theoretical expectations should do better than the resolution factor approach. The resolution factor used here is the same one that is used in the cosmic shear study of Jarvis et al. (2003) and can be implemented on Sloan Digital Sky Survey photo reductions.] The no-correction (‘NC’) method computes ellipticities based on equation (13) and hence is correct only for Gaussian galaxies and PSFs. Included for comparison is the ‘KSB’ method, which uses the KSB95 method with pre-seeing shear polarizability estimated by the method of Luppino & Kaiser (1997). Note that there are several versions of the KSB method in the literature, a comparison of some of these is provided by Erben et al. (2001). (For details of our implementation of KSB95, see Appendix C.)
. This allows us to determine 〈Δe〉, and thus correct the measured ellipticity for noise bias:
. Note that the statistical variance in e resulting from the Monte Carlo method after N realizations of the artificial noise have been computed is 〈Δe2〉/N, whereas the statistical noise in the galaxy shape measurement is 〈Δe2〉; hence the Monte Carlo convergence rapidly becomes a negligible contributor to the overall statistical uncertainty. Thus we can determine and remove the noise bias effect on any PSF correction method by evaluating equation (24), provided only that the significance ν is large enough to treat the noise as a perturbation. Since the O(1/ν2) noise bias is straightforward to remove and in any case the details of its removal process do not depend strongly on the PSF correction scheme, we have presented the simulations here without noise to illustrate the purely PSF-induced effects.This is the equation we use to determine the calibration for the no-correction, linear and BJ02 methods. In the plots (Figs 1–3) we show the error
.
The error in reconstructing the shear for the case of a circular Gaussian PSF and elliptical Gaussian galaxy. We have plotted the shear calibration error δγ/γ. Note that the adaptive moments method is exact for this case. The KSB95 method is not exact but gives good performance (calibration errors of the order of 1 per cent for most galaxies) even when the galaxy is not well resolved.
The error in reconstructing the shear for the case of a circular Gaussian PSF and elliptical Gaussian galaxy. We have plotted the shear calibration error δγ/γ. Note that the adaptive moments method is exact for this case. The KSB95 method is not exact but gives good performance (calibration errors of the order of 1 per cent for most galaxies) even when the galaxy is not well resolved.
The error in reconstructing the shear for the case of a circular Gaussian PSF and elliptical exponential galaxy. We have plotted the shear calibration error δγ/γ. Note that for the well-resolved galaxy (a), both BJ02 and linear methods provide good performance; the error in the linear method grows substantially for the poorly resolved galaxies in (b) and (c). The no-correction method underestimates ellipticities in all cases, with the situation worsening for the poorly resolved galaxies.
The error in reconstructing the shear for the case of a circular Gaussian PSF and elliptical exponential galaxy. We have plotted the shear calibration error δγ/γ. Note that for the well-resolved galaxy (a), both BJ02 and linear methods provide good performance; the error in the linear method grows substantially for the poorly resolved galaxies in (b) and (c). The no-correction method underestimates ellipticities in all cases, with the situation worsening for the poorly resolved galaxies.
The error in reconstructing the shear for the case of a circular PSF generated by atmospheric turbulence and an elliptical Gaussian galaxy. We have plotted the shear calibration error δγ/γ. For well-resolved (a) and moderately resolved (b) galaxies, the linear method substantially reduces errors when compared with the no-correction method. For poorly resolved galaxies, the linear method overcorrects for PSF non-Gaussianity. The PSF non-Gaussianity correction of BJ02 has the incorrect sign in all three cases. Note that the scale is different from that of Fig. 2.
The error in reconstructing the shear for the case of a circular PSF generated by atmospheric turbulence and an elliptical Gaussian galaxy. We have plotted the shear calibration error δγ/γ. For well-resolved (a) and moderately resolved (b) galaxies, the linear method substantially reduces errors when compared with the no-correction method. For poorly resolved galaxies, the linear method overcorrects for PSF non-Gaussianity. The PSF non-Gaussianity correction of BJ02 has the incorrect sign in all three cases. Note that the scale is different from that of Fig. 2.
. We can find 〈eK〉 by an argument similar to that leading to equation (27); the result is This is the calibration equation that we use for the KSB95 method.Simulation results are shown in the figures for three cases: a well-resolved galaxy (σ2g/σ2ƒ= 0.2; we remind the reader that subscript g is the PSF and ƒ denotes the galaxy), an intermediate case (σ2g/σ2ƒ= 1.0) and a poorly resolved galaxy (σ2g/σ2ƒ= 2.0). (Here
is proportional to the area covered by the object.) The simple case of a Gaussian galaxy and a Gaussian PSF is shown in Fig. 1. The adaptive moments methods are exact for this case. The KSB95 method, while not exact, has a relatively small calibration error except for the very elongated galaxies.
In Fig. 2, we test the ability of these methods to handle non-Gaussian galaxies by showing simulation results with a circular Gaussian PSF and an elliptical galaxy with exponential (∝ e
) radial profile. The exponential profile has a positive kurtosis β(ƒ)22≈ 0.17. Note that both the BJ02 and linear methods improve upon the no-correction method; BJ02's performance here is superior to that of the linear method, presumably because BJ02 works to all orders in β(ƒ)22. Once again, the KSB95 method is accurate at the 1 per cent level except for very eccentric galaxies.
function, as appropriate for observations through a turbulent atmosphere with Kolmogorov scaling (Kaiser 2000). This PSF has a kurtosis of β(g)22≈ 0.046; it is important to note, however, that the PSF drops to zero slowly at large radii and hence it possesses higher moments, e.g. β(g)33≈ 0.012 (see Table 2) that are not taken into account by either the BJ02 or linear methods. We can demonstrate that the error from the linear method is due to these higher moments by repeating the simulation using a PSF of the ‘quartic-Gaussian’ form: which has the same kurtosis as the turbulent PSF but no higher-order moments. The results using this form for the PSF are shown in Fig. 4. Finally, we have shown simulations where both the galaxy and PSF are non-Gaussian: one with a turbulent PSF and an exponential galaxy (Fig. 5), one with a turbulent PSF and a de Vaucouleurs profile (Fig. 6) and one with a diffraction-limited (Airy) PSF and an exponential galaxy (Fig. 7). Of the methods described, only the re-Gaussianization method retains high accuracy in these cases. This is primarily because of the sixth-order and higher moments of these PSFs, which are ignored by the BJ02 and fourth-order linear methods. For well-resolved objects, the KSB95 method avoids these problems because its non-adaptive weight function allows it to ‘see’ the turbulence-induced tails and/or diffraction rings in the PSF. For the less well-resolved objects, KSB suffers from loss of higher-moment information, although it is apparent from the simulations that the problem is not as serious as for the BJ02 or linear methods.PSFs used in the simulations. The adaptive (i.e. iterated until β11= 0) radial 2nth moments βnn characterize the non-Gaussianity of the PSF. Note the slow convergence of the Laguerre expansion for the diffraction-limited PSF.
PSFs used in the simulations. The adaptive (i.e. iterated until β11= 0) radial 2nth moments βnn characterize the non-Gaussianity of the PSF. Note the slow convergence of the Laguerre expansion for the diffraction-limited PSF.
The error in reconstructing the shear for the case of a circular quartic-Gaussian PSF (see equation 33) and an elliptical Gaussian galaxy. We have plotted the shear calibration error δγ/γ. For well-resolved (a) and moderately resolved (b) galaxies, the linear method substantially reduces errors when compared with the no-correction method. For poorly resolved galaxies, the linear method overcorrects for PSF non-Gaussianity. The PSF non-Gaussianity correction of BJ02 has the incorrect sign in all three cases.
The error in reconstructing the shear for the case of a circular quartic-Gaussian PSF (see equation 33) and an elliptical Gaussian galaxy. We have plotted the shear calibration error δγ/γ. For well-resolved (a) and moderately resolved (b) galaxies, the linear method substantially reduces errors when compared with the no-correction method. For poorly resolved galaxies, the linear method overcorrects for PSF non-Gaussianity. The PSF non-Gaussianity correction of BJ02 has the incorrect sign in all three cases.
The error in reconstructing the shear for the case of a circular turbulence-limited PSF and an elliptical exponential galaxy. We have plotted the shear calibration error δγ/γ. Note the significant loss of accuracy of all the methods except for re-Gaussianization (G) for poorly resolved galaxies.
The error in reconstructing the shear for the case of a circular turbulence-limited PSF and an elliptical exponential galaxy. We have plotted the shear calibration error δγ/γ. Note the significant loss of accuracy of all the methods except for re-Gaussianization (G) for poorly resolved galaxies.
The error in reconstructing the shear for the case of a circular turbulence-limited PSF and an elliptical de Vaucouleurs profile galaxy. We have plotted the shear calibration error δγ/γ. Note the significant loss of accuracy of all the methods except for re-Gaussianization (G) and KSB for poorly resolved galaxies.
The error in reconstructing the shear for the case of a circular turbulence-limited PSF and an elliptical de Vaucouleurs profile galaxy. We have plotted the shear calibration error δγ/γ. Note the significant loss of accuracy of all the methods except for re-Gaussianization (G) and KSB for poorly resolved galaxies.
The error in reconstructing the shear for the case of a circular diffraction-limited PSF and an elliptical exponential galaxy. Severe loss of accuracy occurs for most of the methods if the galaxy is poorly resolved (c); fortunately, diffraction-limited PSFs occur only for space-based surveys in which the PSF size is usually small compared with the size of the galaxy, hence the comparatively large errors exhibited in (c) are unlikely to be a problem in practice.
The error in reconstructing the shear for the case of a circular diffraction-limited PSF and an elliptical exponential galaxy. Severe loss of accuracy occurs for most of the methods if the galaxy is poorly resolved (c); fortunately, diffraction-limited PSFs occur only for space-based surveys in which the PSF size is usually small compared with the size of the galaxy, hence the comparatively large errors exhibited in (c) are unlikely to be a problem in practice.
The simulations indicate significant (at least several per cent) calibration errors for all of the currently used methods when the PSF is non-Gaussian and has size comparable to that of the galaxy. Both the KSB95 and BJ02 methods yield an underestimate of the shear for turbulence-limited PSFs, with the error for poorly resolved objects being of the order of 5 per cent for KSB95 and 10–15 per cent for BJ02. The errors have the opposite sign for a diffraction-limited PSF, however, the small size (typically of the order of 0.1 arcsec) of diffraction-limited PSFs helps to reduce the error in this case (cf. Figs 7a–c). The re-Gaussianization method introduced here shows promise in reducing these calibration errors to the 1–2 per cent level even for poorly resolved galaxies. It has the best performance of all the methods studied here. While we have not simulated Refregier & Bacon (2003)'s matrix inversion method, we note for comparison that in realistic Monte Carlo simulations (i.e. including noise and pixelization), Refregier & Bacon (2003) found calibration errors of −3 ± 4 per cent in e+ and 0 ± 4 per cent in e× using a turbulent PSF with σ2g/σ2ƒ= 0.32. This would thus be comparable to our tests of well-resolved galaxies, for which KSB95 and re-Gaussianization methods are accurate to within 2 per cent. It would be interesting to investigate the behaviour of the matrix inversion method at lower resolution where the other methods except for re-Gaussianization show more significant error.
Shear Selection Bias
Even if the PSF correction is perfect, an estimate of the shear can be biased if the object detection algorithm preferentially selects galaxies elongated in one direction over galaxies elongated in another direction. Kaiser (2000) and BJ02 investigate such a selection bias that results from asymmetries of the PSF and offer methods for correcting it. However, there is another selection bias that exists: suppose that we are looking at a part of the sky where the gravitational shear elongates objects in the x-direction (i.e. γ+ > 0). Then background galaxies intrinsically (i.e. pre-shear) elongated in the x-direction appear more elliptical than otherwise identical background galaxies elongated in the y-direction (although both have the same area, flux and surface brightness). Many object detection algorithms, such as significance cut-offs, will preferentially detect the circular objects in this situation, thus the catalogue of background galaxies that results will preferentially contain galaxies aligned orthogonal to the gravitational shear. This results in underestimation of the lensing signal. We will call this effect ‘shear selection bias’ to distinguish it from the ‘PSF selection bias’ discussed by Kaiser (2000) and BJ02.
Unlike PSF-related biases, shear selection bias is perfectly (anti)correlated with the shear signal. This means that shear selection bias takes the form of a calibration error, with the fractional error depending on the object detection algorithm and on the underlying population of galaxies. As previously mentioned, calibration errors are difficult to detect using, for example, E/B decomposition of the measured shear field, which is insensitive to a position-independent calibration error.
We organize this section as follows. In Section 3.3, we develop a formalism for relating the galaxy selection probability to the selection bias. In Sections 3.2 and 3.3, we apply this formalism to the problem of evaluating shear selection bias for galaxy detection algorithms based on a statistical significance threshold. We show in Section 3.3 that for this class of algorithms, shear selection bias is an effect of the order of several per cent. Suggestions for overcoming the shear selection bias are offered in Section 3.3.
Effect on measured shear
Here we analyse the effect of shear selection bias on measurements of gravitational shear. Following KSB95, we will use lowercase Roman letters for vector indices and Greek letters for spin-2 tensor (i.e. traceless symmetric matrix) indices. To avoid confusion, we call an object with two tensor indices a ‘hypertensor’.
Methods for computing Pγ can be found in terms of Laguerre coefficients in our Appendix C and in terms of real-space variables in Hoekstra et al. (1998)'s Appendix A[these methods are based on KSB95 and Luppino & Kaiser (1997), however, KSB95's shear polarizability contains algebra errors that have been corrected by Hoekstra et al. (1998)].
Significance threshold algorithms
Our principal objective here is to determine the probability P of the object being detected. The definition of ‘detected’, and hence the determination of P, is a delicate issue. If we simply assume some intrinsic intensity ƒ(x) peaked at x= 0 and ask whether J(0) ≥ν0, we will underestimate P because J is a random field and it is unlikely that 0 will be an exact maximum of J. However, the use of ν=J(0) as a detection statistic will serve as a first approximation to the shear selection bias, and due to the complexity of the problem it is unclear whether substantial gains can be made through a more detailed analytical investigation.
using integration by parts: where the mixed hypertensor δijβ has components and we have used the tracelessness of δβ to eliminate the ∂xi/∂xj term. In the special case where K is a Gaussian, K(x) ∝ exp(−1/2|x|2/σ2K), this reduces to For a single detection kernel K1, equation (51) describes the transfer hypertensor. However, the single-scale detection algorithms considered here are not optimal for detecting objects substantially larger or smaller than the size of the detection kernel. For this reason, most authors have used multiple detection kernels, which we consider next.
Multiscale detection algorithms
, that is where (Remember that σK depends implicitly on σd.) In the special case of a circular Gaussian object
, the left-hand side of equation (54) becomes 2/(σ−2K+σ−2ƒ); this gives a solution to equation (54) of (We ignore the imaginary solution σ2d=−σ2g.) This means that if the galaxy is a circular Gaussian, the detection algorithm will return maximum significance with a filter matched to the object size.How important is shear selection bias?
We can evaluate, at least at the order-of-magnitude level, the importance of the shear selection bias. Since σd is matched to the size of the object, the quantity 1 −σ2g/σ2d will be near unity for well-resolved galaxies, decreasing to zero for point-like objects. (That Ξ= 0 for point-like objects is unsurprising because a point source remains a point source when sheared.) The T-factor is shown in Fig. 8. The weighted ellipticities ě are half of the adaptive ellipticities (equation 5) for near-circular Gaussian objects with a matched Gaussian weight function. Thus the ě circular ellipticity has a shape variance of approximately V≈ (0.32/2)2≈ 0.026, where 0.32 is the rms ellipticity of galaxies (McKay et al. 2001). (This parameter will depend somewhat on the sample examined, so the value of 0.32 should be thought of as a rough guide rather than a law of nature.) It thus appears that the shear selection bias will be of the order of several per cent for well-resolved galaxies, falling to zero in the limit of a poorly resolved object.
The T-factor T(ν̄; ν0) = d 1n P/d 1n ν̄. This factor represents the (logarithmic) increase in probability of detection P if we increase the significance of the galaxy. The shear selection bias increases in proportion to an appropriate average of T (see equation 48). Note that T→ 0 as ν̄ → ∞; this means that there is no shear selection bias for objects far above threshold, as expected.
The T-factor T(ν̄; ν0) = d 1n P/d 1n ν̄. This factor represents the (logarithmic) increase in probability of detection P if we increase the significance of the galaxy. The shear selection bias increases in proportion to an appropriate average of T (see equation 48). Note that T→ 0 as ν̄ → ∞; this means that there is no shear selection bias for objects far above threshold, as expected.
A more quantitative approach to the problem is possible in the special case where the galaxy significance distribution is a power law and the shear estimator is constructed from an unweighted distribution of galaxies (i.e. w=constant). We suppose that the galaxy counts scale as
. The exponent λ is determined by noting that ν̄ is a signal-to-noise ratio and hence is proportional to the total flux of the galaxy, F, divided by the square root of its post-PSF-convolved area A; the number–magnitude relation is roughly N∝F−1. If the galaxies are poorly resolved, A is determined by the PSF and hence is constant, in which case λ= 1. For well-resolved, nearby galaxies (redshift z≪ 1) we can assume a Euclidean universe, in which the usual inverse square laws F∝A∝ 1/r2 apply; this gives
, since the number of objects scales as r3. This gives λ= 3. Thus we expect that λ will probably lie in the range 1 ≤λ≤ 3, with the distant galaxies of interest to cosmic shear measurements probably having λ closer to 1.
, which is 1 for well-resolved and 0 for poorly resolved objects. We also use the fact that
. The average transfer hypertensor is where Reff is a weighted (by T) average of the shear selection bias resolution factor R over the detected galaxies:
. Since T is a rapidly decreasing function of ν̄ (see Fig. 8), Reff should be evaluated for galaxies near the detection threshold.Thus for cosmic shear measurements where galaxies are selected via a statistical significance cut, where V= 0.026 and λ≈ 1, the shear selection bias causes an underestimate of the shear by 2.6Reff per cent. (It vanishes for the poorly resolved galaxies with Reff≈ 0.) This factor would be increased to roughly 8 per cent in the case of λ= 3, Reff= 1, i.e. of low-redshift source galaxies for which the Euclidean source counts are more nearly valid and the near-threshold objects are well resolved. Note that this result is independent of the significance threshold, at least when a power-law form for the galaxy count is assumed. Indeed, because it does not depend on the particular form for P, this result is still valid if only those source galaxies are selected that appear in multiple images of the same region of sky (this is the case in most current lensing surveys). It is also important to understand that shear selection bias is small not because the shear is small, but because the background galaxies are so nearly circular. If the galaxies are more elliptical in deeper surveys than in SDSS used here the bias will be larger. Note also that the mean rms ellipticity, λ and Reff are all likely to be redshift dependent and so will be the bias.
Correcting for shear selection bias
While the above analysis serves to illustrate the order of magnitude of the shear selection bias problem, it may not be adequate to provide an analytical correction due to the large number of assumptions and approximations made. There are also contributions to the selection bias other than significance thresholds; for example, it is generally necessary to remove overlapping objects from the background galaxy catalogue, but the overlap probability is a function of galaxy shape. There are four major approaches that could be taken to overcome shear selection bias.
- (1)
The bias could simply be ignored.
- (2)
The galaxy detection algorithm could be designed so as to eliminate, or at least substantially reduce, the bias.
- (3)
The transfer hypertensor Ξ could be computed and used as a correction to the shear polarizability or resolution factor.
- (4)
The bias could be determined from simulations.
Of these, Method 1 is appropriate when calibration errors of the order of several per cent are unimportant, for example in typical galaxy–galaxy lensing studies where the statistical error dominates. However, it is clearly inapplicable to future cosmic shear surveys that aim for much higher accuracy. Method 2 requires a detection statistic that does not suffer from shear selection bias, i.e. one that is independent of the shear. Examples would be the magnitude of the galaxy in a given band and the PSF-deconvolved area. Unfortunately, these are not truly independent of the shear when noise is taken into account, and the ‘overlapping object’ effect described above is not addressed. These effects are difficult to compute analytically, hence it is not clear whether we can verify that the residual errors in Method 2 are ‘small enough’ without resorting to simulation.
Method 3 requires that we compute
for each galaxy in our survey, then find Ξαβ=εαEβ, and compute the weighted average 〈Ξαβ〉wd to determine the calibration error due to shear selection bias. The problem is thus one of finding Eβ; unfortunately, Eβ can only be determined from the intrinsic intensity ƒ of the galaxy, not the observed image. Thus correction of cosmic shear measurements based on the determination of Ξ would require a careful analysis to ensure that noise and PSF effects do not invalidate the calculation. Method 4, the simulation, is straightforward albeit time consuming. Its advantage is that it can accommodate arbitrarily complicated detection algorithms. The main weakness is that the underlying distribution of galaxies in the simulation may not be correct; however, the shear selection bias is only an effect of the order of several per cent anyway, so determination of the bias to many significant figures is unnecessary.
Conclusion
In this paper we have addressed several possible biases present in the data reduction of cosmic shear measurements. We focus on effects that affect the overall normalization only and cannot be detected using the usual methods of finding systematics, such as the tests on stars and E/B decomposition. We pay particular attention to the deviations from the elliptical Gaussian profile of the galaxy or the PSF, which is often the starting assumption of the data reduction methods. The results in Figs 1–7 show that while for a Gaussian PSF most of the methods are successful in reconstructing the shear, the case of a more realistic non-Gaussian PSF is significantly more problematic except for the re-Gaussianization method introduced in this paper. This suggests that in existing surveys the shear calibration may be inaccurate at the 10 per cent level. Some methods, such as the current implementation of BJ02 used in Jarvis et al. (2003) and McKay et al. (2001), are particularly susceptible to this and can underestimate the linear amplitude of fluctuations by 10 per cent, while those based on KSB95 (e.g. Bacon et al. 2000; Van Waerbeke et al. 2000; Rhodes et al. 2001; Brown et al. 2003; Hoekstra et al. 2002) have a negative bias of up to 5 per cent. It is interesting to note that Jarvis et al. (2003) find a significantly (10–20 per cent) lower value for σ8 than other surveys. Calibrations errors of the type discussed here can explain part of the difference, although other factors such as statistical errors, uncertainties in the redshift distribution of background galaxies, non-linear evolution corrections of the dark matter power spectrum and residual systematics (which are known to be present because their effect on B-type correlation is seen in all the surveys that have looked for it) also play a role. At the moment, residual systematics due to the PSF correction can be as great as 10 per cent in σ8 and are comparable to the statistical errors in current samples.
There are other biases that one must worry about when an accuracy of the order of 1 per cent is required. One such effect is shear selection bias, which causes the galaxy selection to be correlated with the underlying shear and with the galaxy flux. This error is of the order of a few per cent, but can reach up to roughly 8 per cent in some limits. Since the shear selection bias preferentially selects galaxies with shear orthogonal to cosmic shear the bias translates into an overall underestimate of the amplitude of fluctuations. Moreover, since for a flux-limited survey the smaller and fainter galaxies are at a higher redshift the error in the amplitude may be correlated with redshift and can give an incorrect growth factor if not properly accounted for. Another effect that can create this is if the mean rms ellipticity of the galaxies changes with redshift.
Looking further ahead, weak lensing has the potential to break many of the degeneracies present in the cosmological parameters even after the cosmic microwave background has given us all of its information, especially if one can use the redshift information on background galaxies to extract the information on the growth factor (Hu 1999). It is an especially powerful probe of dark energy and massive neutrinos, since they both affect the growth factor at low redshifts probed by weak lensing. While the statistical precision of some of the proposed experiments such as LSST is impressive, the systematic effects must also be controlled at the 1 per cent level. Current reduction methods are not sufficient for this purpose, but improved methods such as our re-Gaussianization method (which can be further generalized to include higher-order terms) should be up to the task. Similarly, shear selection bias and other biases should be carefully simulated to estimate their importance as a function of galaxy detection significance, PSF, etc. One notes that realistic simulations would allow us to determine both the PSF dilution calibration and the shear selection bias, and indeed shear selection bias is implicitly taken into account in simulated observations such as those of Kaiser (2000), Erben et al. (2001) and Bacon et al. (2001). Future experiments will require that this line of work be extended, for example to include measured PSFs from the telescope and the correct distribution of galaxy shapes and fluxes as a function of redshift.
It is clear that the quality of the data of the upcoming measurements will exceed the current generation of reduction methods. However, this is not a cause for alarm, since none of the problems discussed here is fundamental in nature. We have proposed solutions for some of the problems, while others will require more detailed modelling using realistic simulations of the galaxies and their image processing through the atmosphere (if applicable), telescope, detector and software. Given enough attention to these details weak lensing should fulfil its promise in the era of high-precision cosmology.
Acknowledgments
The authors acknowledge useful discussions with Gary Bernstein, Mike Jarvis, Robert Lupton, Michael Strauss, Joseph Hennawi and Nikhil Padmanabhan. CH is supported through the NASA Graduate Student Researchers Program, grant NASA GSRP-02-OSS-079. This work was supported by NASA NAG5-1993 and NSF CAREER. US is a fellow of the David and Lucille Packard Foundation and the Alfred P. Sloan Foundation.
References
Appendix
Appendix A: Convolution with Elliptical Laguerre Expansion
; using equation (10) and writing the result in terms of
yields The trace of this equation yields
; the traceless-symmetric part yields
and
, or that χƒ and χg differ by a multiple of π/2; we may, however, assume the latter case, since in the former case the aforementioned continuous degeneracy allows us to choose χƒ=χg. Indeed, the discrete degeneracy allows us to set χƒ=χg(≡χ). From this we derive
. Finally, the positive determinant of A(ƒ) and A(g) implies that C̄ƒD̄ƒ and C̄gD̄g are positive; the discrete degeneracy then allows us to choose C̄ƒ, C̄g, D̄ƒ and D̄g positive.
,
and D̄g in terms of Mƒ,g,h may be obtained by directly finding the square roots of these matrices, thereby determining A(ƒ,g), and then substituting these matrices into equation (A5). In order to construct the As, it is necessary to determine the square root of a symmetric matrix. This is given by where D=MxxMyy−M2xy.
yields where we have used the result that Λpq(Rθk) = e−imθΛpq(k). Then equation (A9) is a linear equation for the cpq, thus where the coefficients
satisfy
can be calculated by noting that the right-hand side of equation (A11) is a polynomial of the order of p′+q′+p′′+q′′ in the two variables U and V. Since Λpq(U, V) with p+q≤N form a basis for the (N+ 1)(N+ 2)/2-dimensional vector space of polynomials in U and V of degree ≤N, the sum on the left-hand side of equation (A11) may be cut-off at p+q≤N. Equating coefficients of UmVn on both sides then results in a finite system of linear equations for
, which is easily solved using standard linear algebra techniques. An alternative route is to exploit the orthonormality relations among the Laguerre basis functions to convert equation (A11) into this ratio of Gaussian integrals can be computed using Wick's theorem with the Feynman rules 〈U〉=〈V〉=〈UV〉= 0 and 〈U2〉=〈V2〉 = 1/2.
in terms of finite sums. To do this, we expand the Λ: where and s= (p+q−μ−ν)/2. The sum in equation (A14) is over all values of t such that s, μ−t, t, and q−s−t are all non-negative. The ‘selection rules’ for X are as follows: Xpqμν≠ 0 only if μ+ν is equal to p+q, or less than p+q by an even integer. Having constructed X, we evaluate the integral in equation (A12) as We have computed several of the lower-order (up to degree six) Z coefficients for the simulations in this paper; it is found that for these moments, it is practical to perform either the linear system solution (equation A11) or the summation method (equation A15) using symbolic manipulation software, and then write ‘hard-wired’ C code to numerically evaluate the resulting expressions. Because the explicit polynomial form of the Λ functions (see equation A14) contains many positive and negative terms for large p and q, it is likely that round-off errors would be an issue for our computational approach if we were to consider moments of sufficiently high order. It is unlikely that moments of very high order would be useful in weak-lensing studies; nevertheless, if this turned out to be the case, or if the high-order Z coefficients were required for another application, it would be necessary to develop a better algorithm (likely involving a recursion relation) for computing them.
Appendix
Appendix B: Linear Psf Correction
in accordance with equation (B4). This gives us a system of linear equations that can be solved for the unknown β(ƒ)pq: for pqϵL̄. The β thus determined can be substituted into equation (B3) to yield c11/c00 and c02/c00, which then yield Mh via equations (B4) and (B5). Finally, we can determine Mƒ=Mh−Mg and then determine the intrinsic ellipticity from equation (5).The above procedure is somewhat involved, and we illustrate it with a simple example: that of L̄ = {22}. That is, we will correct for the radial fourth moment or kurtosis of the PSF and intrinsic galaxy image. Obviously, in ignoring the higher-order radial and angular moments we are making an approximation, however, the b22 moment is present for non-Gaussian galaxy profiles such as the exponential or de Vaucouleurs profiles (and is positive in both cases). As we show in Section 2.2, the higher-order moments can be important for some PSFs. The angular moments bpq for p≠q are not present for purely radial profiles, i.e. profiles for which the isophotes are concentric ellipses that share principal axes and axis ratios. If the angular moments, or the higher-order radial moments b33, b44, etc. are important, these would need to be included in L̄.
.[Warning: the ‘shear composition’ operator e′=δ⊕e as defined by BJ02 and in equation (B17) does not form a group because it is not associative. This problem arises because the shear e followed by the shear δ does not have the same effect as the shear e′ (they differ by an initial rotation). For this reason, it is probably better to think of ⊕ as representing the transformation law for ellipticities under shear, rather than as a composition operator.]
We can thus list the steps involved in PSF correction using the moments L̄ = {22}. If the PSF has been circularized using a rounding kernel, the steps marked with a ⋆ can be skipped.
- (i)
Compute the adaptive covariances MI and Mg for the galaxy and PSF, and the corresponding radial fourth moments β(I)22 and β(g)22.
- (ii)
⋆ Apply a shear to circularize the PSF, i.e. use equation (B17) with δ=−e(g). The radial fourth moments are unchanged by this shear.
- (iii)
Apply a rotation so that the measured galaxy has e(I)+ > 0 and e(I)×= 0.
- (iv)
Compute the dilation μ and shear η in accordance with equations (B7)–(B13).
- (v)
Compute the intrinsic ellipticity of the galaxy in the sheared rotated coordinate system using equation (B15).
- (vi)
Undo the rotation.
- (vii)
⋆ Shear to return the ellipticity to the unsheared unrotated coordinate system. The shear is undone using equation (B17) with δ=e(g), i.e. the exact opposite shear that was applied initially.
Appendix
Appendix C: The Ksb Method in Laguerre Coefficients
In these equations, the X contributions to the polarizabilities are due to the effect of shear on the quadrupole moments b20 whereas the eαesh,smβ contributions are due to the effect of the shear on the weighted trace Tw. When the X hypertensors are expressed in terms of the Laguerre moments, the decomposition into spin-0 (scalar) and spin-4 (traceless-symmetric) components is manifest.
Our only remaining issue is the choice of the width σ of the weight function. We use a weight function scale set by the requirement b11= 0, i.e. we use an adaptive circular variance. This is appropriate, at least for the simple case of a circular Gaussian object, for maximum significance detection with a Gaussian filter of variable width (see the discussion in Section 3.3). Following Hoekstra et al. (1998), we use the same σ2 for the PSF as for the object.














































































































