- Split View
-
Views
-
Cite
Cite
Keir K. Rogers, Hiranya V. Peiris, Boris Leistedt, Jason D. McEwen, Andrew Pontzen, SILC: a new Planck internal linear combination CMB temperature map using directional wavelets, Monthly Notices of the Royal Astronomical Society, Volume 460, Issue 3, 11 August 2016, Pages 3014–3028, https://doi.org/10.1093/mnras/stw1121
- Share Icon Share
Abstract
We present new clean maps of the cosmic microwave background (CMB) temperature anisotropies (as measured by Planck) constructed with a novel internal linear combination (ILC) algorithm using directional, scale-discretized wavelets – scale-discretized, directional wavelet ILC or Scale-discretised, directional wavelet Internal Linear Combination (SILC). Directional wavelets, when convolved with signals on the sphere, can separate the anisotropic filamentary structures which are characteristic of both the CMB and foregrounds. Extending previous component separation methods, which use the frequency, spatial and harmonic signatures of foregrounds to separate them from the cosmological background signal, SILC can additionally use morphological information in the foregrounds and CMB to better localize the cleaning algorithm. We test the method on Planck data and simulations, demonstrating consistency with existing component separation algorithms, and discuss how to optimize the use of morphological information by varying the number of directional wavelets as a function of spatial scale. We find that combining the use of directional and axisymmetric wavelets depending on scale could yield higher quality CMB temperature maps. Our results set the stage for the application of SILC to polarization anisotropies through an extension to spin wavelets.
1 INTRODUCTION
Accurate measurements of the cosmic microwave background (CMB) arguably form the bedrock of modern precision cosmology. In particular, the full-sky multifrequency CMB maps provided by three generations of satellite experiments – COBE (Mather et al. 1990; Boggess et al. 1992), Wilkinson Microwave Anisotropy Probe (WMAP; Bennett et al. 2003a) and Planck (Planck Collaboration I 2011) – represent milestones in our understanding of the cosmological model. However, to obtain a full-sky map of the CMB requires removing instrumental noise and signals due to astrophysical foregrounds (primarily in the Milky Way). Full-sky foreground-cleaned CMB maps are used for a wide variety of scientific purposes (see e.g. Planck Collaboration XVI 2015; Planck Collaboration XVII 2015).
There are numerous methods to perform foreground component separation. They broadly fall into two categories: blind methods which make minimal physical assumptions about the contributing signals and the so-called mixing matrix (which quantifies the strength of different components at different frequencies) and non-blind methods which are based on a physical modelling of the sky components. Examples of non-blind methods include the maximum entropy method (Hobson et al. 1998) and the parametric Bayesian CMB Gibbs sampler Commander (Eriksen et al. 2006, 2008). Correlated component analysis (Bedini et al. 2005) is a semiblind method that estimates the mixing matrix based on second-order statistics. Spectral Estimation via Expectation Maximization (SEVEM; Martínez-González et al. 2003; Leach et al. 2008; Fernández-Cobos et al. 2012) is a template-fitting technique. Examples of so-called blind source separation include the sparsity based method local-generalized morphological component analysis (Bobin et al. 2008, 2013) and the spectral matching independent component analysis (SMICA; Cardoso et al. 2008), although the latter work does discuss how the choice of component model affects the blindness of this method. Of particular interest to this work is another blind method, the internal linear combination (ILC), most recently implemented by the needlet ILC (NILC; Delabrouille et al. 2009). In its component separation analysis, the Planck Collaboration used Commander, NILC, SEVEM and SMICA (Planck Collaboration IX 2015). See, e.g. Delabrouille et al. (2009) and Bobin et al. (2013) for reviews of CMB component separation methods.
The ILC computes a weighted sum of CMB maps as measured at multiple frequencies. These weights are constrained to sum to unity at each point in the map, ensuring that the CMB signal is conserved, assuming that it is equal at each frequency. Under this constraint, the weights are calculated by minimizing the empirical variance of the ILC map, which in turn minimizes the variance of the error in CMB reconstruction (assuming the CMB and foregrounds and the CMB and noise are, respectively, uncorrelated). The variances we minimize are empirical in that they are calculated using the data themselves. In this way, the weights are calculated to remove foreground and noise, revealing the underlying primordial CMB anisotropies. The ILC method was originally used by the WMAP Collaboration (Bennett et al. 2003b) and then extended by Eriksen et al. (2004) through an analytical calculation of the weights. One limitation of the original ILC approach is the extent of localization of the weights. The initial versions calculated different weights in separate parts of the sky (e.g. Bennett et al. 2003b split the Galactic region into 11 parts). In order to further remove local contamination, the weights can be allowed to vary across the sky and also at different spatial scales. Tegmark, de Oliveira-Costa & Hamilton (2003) made an ILC map allowing the weights to vary at each multipole, as well as within different regions of the sky. A direct extension of this work is to make use of both spatial and frequency information simultaneously using wavelets. The weights are then defined across wavelet scales and within wavelet coefficient maps on the sky.
Wavelets are functions that are localized in both real and frequency space. To analyse full-sky CMB maps, wavelets defined on the surface of a sphere are required. A number of wavelet frameworks on the sphere have been developed recently (Antoine & Vandergheynst 1998, 1999; Barreiro et al. 2000; Wiaux, Jacques & Vandergheynst 2005; McEwen, Hobson & Lasenby 2006; Narcowich, Petrushev & Ward 2006; Sanz et al. 2006; Starck et al. 2006; Geller et al. 2008; McEwen & Scaife 2008; Marinucci et al. 2008; Wiaux et al. 2008; Baldi et al. 2009; Starck, Moudden & Bobin 2009; Geller & Marinucci 2010, 2011; McEwen, Wiaux & Eyers 2011; Leistedt et al. 2013, 2015; McEwen, Vandergheynst & Wiaux 2013; McEwen et al. 2014, 2015b; McEwen, Durastanti & Wiaux 2015a). In particular, needlets (Narcowich et al. 2006; Marinucci et al. 2008; Baldi et al. 2009) have been used in the latest generation of ILC methods. Needlets are a set of axisymmetric kernels defined on the surface of a sphere. Each member of the set has compact support in harmonic space over different multipole ranges. When each needlet is convolved with a signal defined on the sphere, the resulting signal (i.e. needlet coefficients) also has compact harmonic support. NILC (Delabrouille et al. 2009) computes its weights by considering needlet scales separately (harmonic localization) and then different parts of each needlet coefficient map separately (spatial localization). The needlets are constructed in such a way that the original signal can be recovered from its needlet coefficients with no loss of information (in practice, small losses can be introduced by approximate spherical harmonic transforms). NILC has been very successful at forming clean full-sky CMB maps, which contain very little residual foreground and noise contamination.
In this work, we introduce the scale-discretized, directional wavelet ILC or Scale-discretised, directional wavelet Internal Linear Combination (SILC), which extends the wavelet ILC framework by localizing the calculation of ILC weights in an additional domain. We use wavelets which are not only harmonically localized but also directional (Wiaux et al. 2008; McEwen et al. 2013; McEwen et al. 2015b). Unlike needlets, which are axisymmetric on the sphere, directional wavelets are non-axisymmetric, i.e. the kernels are ‘squeezed’. This means that for one wavelet scale, one axisymmetric kernel is replaced by a number of complementary directional kernels, each with a different orientation. When these directional wavelets are convolved with a signal on the sphere, (within each scale) different orientations of signal structure are separated. This directional localization allows the ILC weights to be additionally fine-tuned to better remove foreground and noise, in particular for signals with filamentary structure. Furthermore, directional wavelets exhibit exact reconstruction, allowing them to be embedded in an ILC such that no signal is lost.
SILC is being developed with the goal of analysing CMB polarization components through an extension to spin, directional wavelets (McEwen et al. 2014, 2015b; Leistedt et al. 2015), which are expected to be well suited to localizing the complex filamentary morphologies of polarized foregrounds. As a precursor step, in this work we test SILC on the scalar temperature field in order to demonstrate the quantitative consistency of its foreground cleaning performance compared with existing component separation methods, and to identify possible optimizations for the extension to spin fields.
Directional wavelets are explained briefly in Section 2. In Section 3, the SILC algorithm is explained in detail. Various sources of error in the method are considered in Section 4. In Section 5, we compare our method to previous component-separation methods. The application to Planck simulations (Section 6) is followed by application to Planck data (Section 7). We discuss the results and error estimation based on the data in Section 8 and conclude in Section 9.
2 DIRECTIONAL WAVELETS
Directional, scale-discretized wavelets on the sphere that support exact reconstruction have been developed in Wiaux et al. (2008), McEwen et al. (2013) and McEwen et al. (2015b), while their localization properties have been studied in McEwen et al. (2015a). Fig. 1 shows an example of the spatial localization of directional wavelets. Larger wavelet scales have larger kernels, and when these are convolved with signals defined on the sphere (such the CMB and astrophysical foregrounds), signal structure with the same scale and orientation as the wavelet is isolated. The kernels in Fig. 1 are shown for a single direction and (for complete reconstruction) would be complemented by two more sets of kernels of the same sizes but rotated to different orientations. Fig. 2 shows an example of the harmonic localization of directional wavelets (for the wavelets used in this work). The harmonic supports of the wavelets overlap, with each wavelet covering a finite set of multipoles. Fig. 3 shows an example of directional wavelet decomposition as applied to the CMB. Although the CMB anisotropies are statistically Gaussian, the CMB spots on the sky demonstrate anisotropy as a function of scale (Bond & Efstathiou 1987). When the CMB is convolved with directional wavelets, structure of different orientations is separated. This further supports the use of directional wavelets in CMB analysis: both filamentary foreground structure and the CMB itself are better localized. This particularly applies in the case of polarization, as will be discussed in Section 9 when we consider extensions to our method. For a mathematical description of directional wavelets, see Section 3.4.
In the spherical harmonic transforms used in the computation of directional wavelet coefficient maps, we adopt the sampling scheme on the sphere of McEwen & Wiaux (2011, hereafter MW sampling), rather than, e.g. healpix sampling (Górski et al. 2005), although in principle healpix could be used if desired. The corresponding sampling theorem of McEwen & Wiaux (2011) shows that the MW sampling scheme requires fewer samples for a band-limited signal than any other sampling theorem. Additionally, the use of a separation of variables and fast Fourier transforms (FFTs) yields a numerically efficient algorithm. In particular, our spherical harmonic transforms are theoretically exact, unlike healpix. This allows one to manipulate signals with the minimal number of samples and to perform the numerous spherical harmonic transforms involved in the ILC algorithm without any loss of information (other than that due to the finite representation of floating point numbers). Our final map products, however, are provided in healpix format. Finally, MW sampling of spin signals on the sphere requires no additional computational complexity and this will be vital in the extension of our method to polarization E and B modes (Section 9). Further details on MW sampling are given in Section 3.4.
3 METHOD
We start by outlining the SILC algorithm. The steps are explained in more detail in the subsequent subsections (Sections 3.1–3.6). We discuss our numerical implementation in Section 3.7.
The raw input data are multifrequency full-sky maps of CMB temperature fluctuations. These maps use the healpix format. (See Section 3.1.) The model we employ for the raw data is explained in Section 3.2.
The maps are ‘pre-processed’ by inpainting in a small point source mask (see Section 3.6).
The input maps are converted to thermodynamic (CMB) temperature (if necessary). For Planck temperature data, the 545 and 857 GHz maps are converted from spectral flux density per unit solid angle (MJy Sr−1) to CMB temperature (KCMB) by the unit conversions given in the Planck 2015 Release Explanatory Supplement.1
The maps are each convolved to have the same effective beam (see Section 3.3).
Each input map is converted into a set of wavelet coefficient maps. This separates both the scale and orientation of structure within each map. These wavelet coefficient maps use MW sampling (McEwen & Wiaux 2011). (See Section 3.4.)
The ILC method is then applied separately to each wavelet scale and orientation. For each scale and orientation, the multifrequency wavelet coefficient maps are weighted and added to form a single wavelet coefficient map that contains mainly CMB signal, as well as some residual foreground and noise. These weights are allowed to vary from wavelet coefficient to wavelet coefficient. The calculation of these weights is explained in Section 3.5.
The final ILC wavelet coefficient maps are synthesized to form the final product: a full-sky map of CMB temperature fluctuations (with some residual foreground and noise). The final map uses the healpix format. (See Section 3.4.)
The final map is inpainted in a small point source mask (see Section 3.6).
3.1 Input data
Our main CMB temperature map products use full-mission 2015 release Planck temperature maps as their input.2 All nine frequency channels are used. At 70 GHz, we use the higher resolution version at Nside = 2048. We also use the full-mission full focal plane 8 (FFP8) simulations (Planck Collaboration XII 2015) without bandpass mismatch.3
3.2 Data model
3.3 Beam convolution
3.4 Wavelet analysis and synthesis
In the original definition of scale-discretized wavelets, the size of all harmonic kernels (setting the harmonic localization of the wavelets) is parametrized by a unique wavelet dilation parameter |$\lambda \in \mathbb {R}_{{\ast }}^{+}$|, λ > 1. Similarly, the number of directions is set by a unique azimuthal band-limit N. These two parameters, respectively, characterize κj(ℓ) and sℓn for all j. In this work, we vary λ as a function of multipole in order to allow more flexible harmonic localization. We achieve this by defining different values of λ in different multipole regions and then stitching together harmonically truncated wavelets at the region boundaries. We use the values λ = 2, 1.3, 1.2 with transitions at the multipoles ℓ = 512, 2015. If at a transition multipole the harmonic peak of the larger wavelet does not equal the peak of the smaller wavelet, then a small amount of unit response is used so that the two wavelets can be continuously combined. Wavelets constructed in this manner satisfy the standard admissibility criterion required for exact reconstruction. The harmonic tiling of the resulting wavelets is shown in Fig. 2. The technical details of the construction of each kernel is described in McEwen et al. (2015a). Finally, we use a single parameter N for all scales, i.e. each wavelet is divided into the same number of directions. However, a possible extension of this work is to vary N as a function of scale j, e.g. by using curvelet kernels (Chan et al. 2015) or other directional optimizations.
In the case of a single parameter λ, the limits of the wavelet harmonic window for scale j are simply |$(\ell _\mathrm{min}^j,\ell _\mathrm{max}^j) = (\lambda ^{j-1},\lambda ^{j+1})$|, with their peak response at λj. In our hybrid scheme, this property remains but j and λ must be adjusted in each harmonic region. The full details of our tiling are given in Table 1. When the limits of the harmonic windows of the maximum wavelet scales extend beyond the overall band-limit ℓmax, the windows are truncated at ℓmax. Finally, note that a scaling function WΦ is needed to capture the very low frequency content of the signal. It is axisymmetric and the corresponding scaling coefficients therefore live on the sphere. Here, we do not give the full details of the construction of the scaling function or the factors κj(ℓ) and sℓn, since these can be straightforwardly reproduced by following previous approaches (Wiaux et al. 2008; McEwen et al. 2013; McEwen et al. 2015b) and using Table 1.
Wavelet scale j . | |$\ell ^j_{\mathrm{min}}$| . | |$\ell ^j_{\mathrm{peak}}$| . | |$\ell ^j_{\mathrm{max}}$| . | |$N^j_\mathrm{samp}$| . |
---|---|---|---|---|
Scal. | 0 | 64 | 64 | 8385 |
0 | 32 | 64 | 128 | 33 153 |
1 | 64 | 128 | 256 | 131 841 |
2 | 128 | 256 | 512 | 525 825 |
3 | 256 | 512 | 706 | 998 991 |
4 | 542 | 705 | 918 | 1688 203 |
5 | 705 | 917 | 1193 | 2850 078 |
6 | 917 | 1192 | 1551 | 4815 856 |
7 | 1192 | 1550 | 2015 | 8126 496 |
8 | 1550 | 2015 | 2540 | 12 910 821 |
9 | 2116 | 2539 | 3048 | 18 589 753 |
10 | 2539 | 3047 | 3600 | 25 930 801 |
11 | 3047 | 3600 | 3600 | 25 930 801 |
Wavelet scale j . | |$\ell ^j_{\mathrm{min}}$| . | |$\ell ^j_{\mathrm{peak}}$| . | |$\ell ^j_{\mathrm{max}}$| . | |$N^j_\mathrm{samp}$| . |
---|---|---|---|---|
Scal. | 0 | 64 | 64 | 8385 |
0 | 32 | 64 | 128 | 33 153 |
1 | 64 | 128 | 256 | 131 841 |
2 | 128 | 256 | 512 | 525 825 |
3 | 256 | 512 | 706 | 998 991 |
4 | 542 | 705 | 918 | 1688 203 |
5 | 705 | 917 | 1193 | 2850 078 |
6 | 917 | 1192 | 1551 | 4815 856 |
7 | 1192 | 1550 | 2015 | 8126 496 |
8 | 1550 | 2015 | 2540 | 12 910 821 |
9 | 2116 | 2539 | 3048 | 18 589 753 |
10 | 2539 | 3047 | 3600 | 25 930 801 |
11 | 3047 | 3600 | 3600 | 25 930 801 |
Wavelet scale j . | |$\ell ^j_{\mathrm{min}}$| . | |$\ell ^j_{\mathrm{peak}}$| . | |$\ell ^j_{\mathrm{max}}$| . | |$N^j_\mathrm{samp}$| . |
---|---|---|---|---|
Scal. | 0 | 64 | 64 | 8385 |
0 | 32 | 64 | 128 | 33 153 |
1 | 64 | 128 | 256 | 131 841 |
2 | 128 | 256 | 512 | 525 825 |
3 | 256 | 512 | 706 | 998 991 |
4 | 542 | 705 | 918 | 1688 203 |
5 | 705 | 917 | 1193 | 2850 078 |
6 | 917 | 1192 | 1551 | 4815 856 |
7 | 1192 | 1550 | 2015 | 8126 496 |
8 | 1550 | 2015 | 2540 | 12 910 821 |
9 | 2116 | 2539 | 3048 | 18 589 753 |
10 | 2539 | 3047 | 3600 | 25 930 801 |
11 | 3047 | 3600 | 3600 | 25 930 801 |
Wavelet scale j . | |$\ell ^j_{\mathrm{min}}$| . | |$\ell ^j_{\mathrm{peak}}$| . | |$\ell ^j_{\mathrm{max}}$| . | |$N^j_\mathrm{samp}$| . |
---|---|---|---|---|
Scal. | 0 | 64 | 64 | 8385 |
0 | 32 | 64 | 128 | 33 153 |
1 | 64 | 128 | 256 | 131 841 |
2 | 128 | 256 | 512 | 525 825 |
3 | 256 | 512 | 706 | 998 991 |
4 | 542 | 705 | 918 | 1688 203 |
5 | 705 | 917 | 1193 | 2850 078 |
6 | 917 | 1192 | 1551 | 4815 856 |
7 | 1192 | 1550 | 2015 | 8126 496 |
8 | 1550 | 2015 | 2540 | 12 910 821 |
9 | 2116 | 2539 | 3048 | 18 589 753 |
10 | 2539 | 3047 | 3600 | 25 930 801 |
11 | 3047 | 3600 | 3600 | 25 930 801 |
To apply the ILC algorithm, the above continuous wavelet coefficients must be discretized. Since they live on the rotation group SO(3), we represent them using the sampling scheme of McEwen et al. (2015c), which is a generalization of the MW sampling scheme (McEwen & Wiaux 2011). Because our wavelets have well-defined band-limits, this approach allows a multiresolution scheme where each scale is pixellated with a minimal number of samples. In practice, the j-th wavelet scale has a band-limit |$\ell _\mathrm{max}^j$| and is only evaluated at locations |$(\theta _t^j, \phi _p^j, \chi _n)$| with |$t \in \lbrace 0,1,\ldots ,\ell _\mathrm{max}^j\rbrace$|, |$p \in \lbrace 0,1,\ldots ,2\ell _\mathrm{max}^j\rbrace$| and n ∈ {0, 1, …, N − 1}. Although wavelet coefficients are evaluated at discrete samples only, for a band-limited signal they capture the total information content of the underlying continuous wavelet coefficient representation, probed up to harmonic band-limit |$\ell _\mathrm{max}^j$| and azimuthal band-limit N. Thanks to the sampling theory on the rotation group SO(3) of McEwen et al. (2015c). In the full ensemble of realizations, the ILC (see Section 3.5 for details) has no sensitivity to the choice of coordinate convention for directions χ. In a single realization, there will be some marginal sensitivity to this choice manifesting in the localization of the empirical covariances we use. However, this effect is sub-dominant to the choice of N, on which we concentrate our analysis.
The wavelet analysis and synthesis are performed using the latest version of the s2let6 code (Leistedt et al. 2013; McEwen et al. 2015c), which in turn relies on the ssht7 (McEwen & Wiaux 2011) and so38 (McEwen et al. 2015c) codes to compute spin spherical harmonics and Wigner transforms exactly and efficiently using the MW sampling scheme. Thanks to the sampling theorem, the wavelet coefficients can be transformed using Wigner transforms without any loss of information (McEwen et al. 2015c).
3.5 ILC method
The size of the Gaussian kernel used to smooth the covariance matrices is chosen to be proportional to the size of the wavelet used to form a particular set of wavelet coefficient maps.9 In general, the estimation of covariance matrices in ILC methods could be further optimized. It may be preferable to dynamically adapt the smoothing kernel used based on local data. Delabrouille et al. (2009) suggested using a larger kernel at high Galactic latitudes, where Galactic emission does not vary so much and a smaller kernel towards the Galactic equator, where emission is more complex. It could involve masking equatorial regions when estimating the covariance at higher latitudes [somewhat akin to Planck Collaboration XII (2014)]. It could involve convolving the maps of elements of covariance matrices with the same directional wavelet in order to pick out how the local covariance follows the directional structure of the underlying signal. As mentioned above, in this work, we use a similar method as in previous work for ease of comparison.
Having established the main equations governing the ILC method, we now present the main steps in the ILC algorithm that we use.
Form the |$(R_{jnk}^\mathrm{approx})^{cc^{\prime }}$| by equation (14).
Smooth the |$(R_{jnk}^\mathrm{approx})^{cc^{\prime }}$| in harmonic space by equation (16).
Take the inverse of each covariance matrix at each pixel to form |$(\hat{R}_{jnk}^{-1})^{cc^{\prime }}$|.
Calculate the ILC weights |$\omega _{jnk}^c$| by equation (13), where we assume that ac = 1 for all c and we substitute the empirical estimates for the inverse covariance matrices.
Finally, calculate the ILC estimate wavelet coefficient maps |$W_{jnk}^\mathrm{ILC}$| by applying equation (11).
3.6 Point source masking
The input frequency maps are diffusively inpainted in a small point source mask following the method employed by Planck Collaboration XVII (2015). This recognizes that the ILC fails when the CMB is obscured by bright extragalactic sources or complex emission near the Galactic equator. The inpainting removes these sources and attempts to replace them with an extrapolation of the surrounding signal. The mask supplied is taken from the NILC section of Planck Collaboration I (2014) and is constructed from the Planck Catalogue of Compact Sources (Planck Collaboration XXVIII 2014; Planck Collaboration XXVI 2015).10 It masks about 2.2 per cent of the whole sky, predominantly along the Galactic equator towards the Galactic Centre.
Because of this inpainting, the final ILC map is inpainted within the point source mask. For the purposes of this inpainting, we have split the mask into two, based on the size of its constituent individual contiguous holes.11 For holes consisting of less than or equal to 800 pixels, we inpaint with a constrained Gaussian realization following the method of Benoit-Lévy et al. (2013), itself an approximate implementation of the Hoffman–Ribak algorithm (Hoffman & Ribak 1991). For holes consisting of more than 800 pixels (the largest 131 out of 10 031), we inpaint with a standard diffusive algorithm (in particular, following the method employed by Planck Collaboration XVII 2015). The result is that the ILC map is 1.3 per cent Gaussian inpainted and 0.9 per cent diffusively inpainted. This follows Benoit-Lévy et al. (2013), who do not recommend using their Gaussian inpainting for large holes near the Galactic equator.
3.7 Numerical implementation
SILC is implemented in python and is parallelized. At full Planck resolution (Nside = 2048, ℓmax = 3600), when run on a 60-core symmetric multiprocessor (SMP) with 1.5 TB RAM and a 24-core cluster node with 256 GB RAM,12 the pipeline takes approximately 12 h per direction. As shown by equation (16), we perform spherical harmonic transforms to 2ℓmax = 7200. For a given number of directions N, the full pipeline takes approximately N times as long as the axisymmetric limit of our method (when N = 1). In our infrastructure, the code was usually memory-limited (due to the very high resolution of the covariance matrix maps |$(R_{jnk})^{cc^{\prime }}$| at harmonic band-limit |$2 \ell _\mathrm{max}^j$|); the amount of parallelization sometimes had to be reduced to prevent memory overloads on a single node. As mentioned in Section 3.4, the wavelet transforms employ the latest version of s2let (Leistedt et al. 2013; McEwen et al. 2015c), written in c with python wrappers, itself employing ssht (McEwen & Wiaux 2011) and so3 (McEwen et al. 2015c). Despite the use of MW sampling and FFTs, spherical harmonic transforms are the most time-consuming part of the pipeline, again due to the very high resolution of the |$(R_{jnk})^{cc^{\prime }}$| [for the smallest wavelets, these covariance maps are band-limited at ℓ = 7200]. There is scope to further optimize the implementation. Our wavelet analysis and synthesis functions do not, respectively, output and take as input wavelet coefficient maps at double-resolution (i.e. a map band-limited at |$\ell _j^\mathrm{max}$| sampled at |$2 \ell _j^\mathrm{max}$|), requiring additional spherical harmonic transforms to double the resolution. Also, our spherical harmonic transform function does not calculate harmonic coefficients to a multipole less than the band-limit of the input map (i.e. to only calculate aℓm for ℓ < L where |$L < \ell _\mathrm{max}^j$|), resulting in excess computation at certain steps in the algorithm. These optimizations are left as further work.
4 SOURCES OF ERROR IN THE ILC
There are additional sources of error in the ILC method. The first is due to inaccuracy in the calculation of covariance matrices |$(R_{jnk})^{cc^{\prime }}$|, i.e. deviations in the empirical estimate |$(\hat{R}_{jnk})^{cc^{\prime }}$| from the true covariance |$(R_{jnk})^{cc^{\prime }}$|. Delabrouille et al. (2009) estimated the first order expansion of the reconstruction error in the ILC map estimate due to this covariance error. They showed that the covariance of the ILC error with the CMB is inversely proportional to the number of ‘effective modes’ used in the ILC calculation. This covariance bias is negative. In our directional wavelet decomposition, our ‘effective modes’ are spherical harmonic coefficients weighted to take account of the fact that the harmonic responses of wavelets overlap in both scale and direction. As N, the number of orientations probed, increases and so does the number of wavelets, each wavelet coefficient map contains fewer ‘effective modes’. We therefore conclude that our directional wavelet ILC may be susceptible to this negative ILC bias by increasing N. Delabrouille et al. (2009) also showed that due to chance correlations between the CMB and foregrounds, the variance minimization leads to the unintentional cancellation of Nc − 1 CMB modes. For Planck, Nc = 9, whereas for WMAP, Nc = 5. We therefore expect the magnitude of this negative bias to double simply by using more input frequency channels. Also, since this covariance bias is due to the cancellation of CMB modes, Delabrouille et al. (2009) showed that the absolute value is proportional to the CMB power. Therefore, the absolute value of the bias is greatest on large scales where CMB power is concentrated. In general, these biases are best estimated through suites of Monte Carlo simulations.
Another source of error is due to inaccuracy in the calibration ac of the CMB. Dick, Remazeilles & Delabrouille (2010) calculated the consequence of a first-order error in ac on a multiplicative correction to the CMB term in equation (20). They showed that even a small error in calibration can lead to a significant negative multiplicative bias in the CMB term, when the signal-to-noise ratio is large. (Here, the noise in this ratio also includes foreground signal.) They consider the implications for using an ILC on Planck data, where the signal-to-noise ratio is larger than for WMAP data. They estimate that a 1 per cent error in ac can cancel about a third of the CMB signal, while even a 0.1 per cent error in ac can remove about 1 per cent of the CMB. Since our main map products use Planck data as input, they will be susceptible to this additional negative calibration bias. In this work, we assume that the CMB is calibrated to have unit response for all frequency channels, i.e. ac = 1 for all c.
As mentioned in Sections 3.2 and 3.3, we assume all beams to be circularly symmetric. Therefore, non-axisymmetric beam components will propagate into the ILC calculation but are assumed sufficiently small to be ignored.
5 COMPARISON TO PREVIOUS WORK
We now consider how SILC compares with existing component separation methods, particularly those adopted for the Planck analysis. We applied the axisymmetric limit (when N = 1) of SILC to full-mission Planck data and compared the results to existing Planck analyses using the NILC and SMICA methods: the former because it is the closest in spirit to SILC, and the latter because it is the baseline method adopted by the Planck Collaboration for high-resolution analyses. Fig. 4 shows the CMB reconstructed by the axisymmetric limit of SILC, while Fig. 5 shows the differences between this map and the NILC and SMICA (full-mission 2015 release) CMB maps and the difference between NILC and SMICA. The differences between the three maps are small in magnitude and mostly concentrated at the edges of the Galactic mask towards the Galactic Centre, where foreground emission is most intense and complex. Quantitatively, we can compare the mean values and standard deviations of the masked difference maps. The mean values of Figs 5(a), (b) and (c) are, respectively, 0.44, −0.63 and −1.07 μK, while the standard deviations are, respectively, 4.24, 3.38 and 3.43 μK2. These values are small and similar, suggesting a strong consistency between the three methods. These difference maps have been formed from maps which have been smoothed and downgraded in resolution and so visually highlight differences at the lowest multipoles.
Fig. 6 compares point source masked TT angular power spectra (Dℓ = ℓ(ℓ + 1)Cℓ/2π) at the full multipole range of the three maps (up to ℓ = 3400)13 with a CMB spectrum derived from the Planck 2015 TT and low TEB likelihood.14 The SILC spectrum is remarkably similar to that of NILC. This is unsurprising since the axisymmetric limit of SILC (when N = 1) is very similar to the NILC method. None the less, there are a number of pipeline differences. In particular, we use a different set of wavelets than the needlets employed in NILC (as discussed in Section 3.4), even in the axisymmetric limit, with different harmonic responses. Fig. 2 shows the harmonic response of the wavelets used in this work and Table 1 lists their harmonic band-limits |$\ell ^j_{\mathrm{min}}$| and |$\ell ^j_{\mathrm{max}}$|. The SMICA spectrum has lower residuals at higher multipoles than both the axisymmetric limit of SILC and NILC.
Fig. 7 compares full-sky angular power spectra of the three maps, including the inpainted point source regions. The spectra are similar to those in Fig. 6. The main difference is the lower noise tail in the SILC map at high multipoles above ℓ = 1500 (where all component separation CMB maps are dominated by residual instrumental noise). This is because, unlike NILC and SMICA, we do not Gaussian inpaint the very largest point source holes, but rather use diffusive inpainting (as discussed in Section 3.6). The Gaussian inpainting of large irregular holes is poorly constrained and adds residual noise relative to diffusive inpainting.
We have shown that the axisymmetric limit of SILC gives comparable performance to NILC and SMICA. In Sections 6 and 7, we ‘turn on’ the directionality of the wavelets and consider the impact on CMB reconstruction from simulated and real data, respectively.
6 Application to PLANCK simulations
We now apply SILC to the fiducial full-mission Planck FFP8 simulated sky maps, focusing on the impact on CMB reconstruction by increasing directionality as a function of scale. Fig. 8 shows the difference between our reconstructed CMB (using N = 1) and the input simulated CMB. There are small-magnitude differences particularly at the edge of the Galactic mask where the strength and complexity of foreground emission is greatest. As in Fig. 5, this difference map is at low resolution and so highlights residuals at the lowest multipoles. Fig. 9 compares point source masked TT angular power spectra (up to ℓ = 3400) of CMB maps reconstructed using values of N from 1 to 5. It can be seen that the introduction of directionality has the greatest effect at multipoles around ℓ = 800; the residuals are beginning to converge for |$\ell \gtrapprox 2000$|. Fig. 10 shows the differences between simulated CMB maps reconstructed using N = [2, 3, 4, 5] minus the input CMB. The four maps and the axisymmetric difference map in Fig. 8 are almost identical with small magnitude residuals. This is because these low-resolution difference maps again highlight residuals on the very largest scales. However, as discussed in Section 3.4, the wavelets we use are constructed to have an axisymmetric scaling function at the very lowest multipoles. The scaling function we use (as detailed in Table 1) means that no directionality is applied for ℓ < 32.
Fig. 11 shows equivalent difference maps as in Fig. 10 but for the simulated CMB reconstructed using directional wavelets at all scales,15 including for ℓ < 32. It can be seen that the reconstruction errors are significantly larger in magnitude and cover almost the entire sky. The errors are also dominated by the largest scales, in particular a large error in the quadrupole increasing with magnitude as the amount of directionality N increases. We attribute this effect most probably to the ILC ‘biases’ discussed in Section 4, in particular the cancellation of CMB modes due to chance correlations with foregrounds in the ILC variance minimization. Delabrouille et al. (2009) showed that the absolute value of this effect is largest on large scales where CMB power is concentrated, since the cancelled CMB modes on large scales have the greatest magnitude. Further, as discussed in Section 4 and shown in Fig. 11, these errors are expected to increase in magnitude as a function of N. This is because as N increases, each directional wavelet coefficient map (the space in which our ILC operates) contains fewer ‘effective modes’ of the input data and so the error in our empirical covariance estimation is expected to increase. This error propagates to the final maps.
These map reconstruction errors due to the implementation of directionality on the very largest scales are accompanied by increasingly negative power spectrum residuals as N increases, in particular in the first multipole bin from ℓ = 2 to ℓ = 11. This is also indicative of the negative ILC bias due to empirical CMB cancellation, as discussed in Section 4 and Delabrouille et al. (2009). The results in Fig. 11 thus motivate the use of an axisymmetric scaling function, which ensures that no directionality is used for ℓ < 32 and so reduces the errors in CMB reconstruction. In principle, these biases can be estimated and corrected through large suites of simulations, which is beyond the scope of this work.
7 Application to PLANCK data
We now study the application of SILC with increasing directionality to the full-mission Planck sky maps. The left column of Fig. 12 shows the full-resolution reconstructed CMB maps as calculated with different values of N from 2 to 5, which visually appear very similar. The right column of Fig. 12 shows the differences between the CMB reconstructed using N = [2, 3, 4, 5] minus the axisymmetric limit (when N = 1), highlighting the differences at the lowest multipoles. The differences are of largest magnitude towards the Galactic plane where foreground emission is concentrated. This shows how the different wavelet kernels are localizing the ILC weights differently in response to the directional structure of the foregrounds and CMB. The differences are small, reflecting the implementation of an axisymmetric scaling function, meaning that no directionality is applied at ℓ < 32. Fig. 13 compares point source masked TT angular power spectra of the CMB reconstructed using values of N from 1 to 5. The power spectrum residuals from a Planck best-fitting ΛCDM model remain small for most scales until the reconstructed spectra reach a characteristic noise spectrum for |$\ell \gtrapprox 1500$| where the different values of N converge. At these high multipoles, the ILC solution is dominated by residual instrumental noise. We see the biggest impact from directionality at intermediate multipoles (from ℓ = 400 to ℓ = 1500). For comparison, we plot the SMICA power spectrum. In further support to the discussion in Section 5, SILC matches the performance of SMICA. We note that, as with the simulations in Section 6, directionality changes the reconstructed CMB power spectrum most significantly at intermediate multipoles around ℓ = 800.
8 DISCUSSION
The comparisons in Section 5 demonstrate that SILC matches the performance of two previous methods, NILC and SMICA, in both maps and power spectra, with particular similarity between the axisymmetric limit of SILC and NILC, as expected. Both map residuals and power spectra in Sections 6 and 7 show that switching on directionality changes CMB reconstruction most significantly at intermediate multipoles ℓ = 400–1500. There appears to be little benefit in localizing the ILC with directional wavelets at the very smallest scales, where the ILC result is noise-limited. We also adopt an axisymmetric scaling function on the very largest scales, meaning that there is no directionality at ℓ < 32. In Fig. 11, we show the large CMB reconstruction errors arising from using directionality on the largest scales. This motivates the use of an axisymmetric scaling function, which significantly reduces the errors as seen in Fig. 10. In Section 6, we sketched out an argument that attributes these errors to empirical CMB cancellation (Section 4). However, the precise source and exact magnitude of any ILC errors are best estimated through suites of simulations.
We have presented this analysis by producing CMB maps (in Sections 6 and 7) each with a different single value of N at all wavelet scales. Our method can be simply extended to allow different values of N at each wavelet scale. In the same way that each wavelet scale has different harmonic band-limits, they can also have different azimuthal band-limits, optimized as identified above to reduce foreground and noise residuals.
The negative ILC power spectrum biases discussed in Section 4 must be quantified in parallel to this directionality optimization if using the resulting map for power spectrum analyses. It is possible to estimate variance biases in the data through suites of realistic simulations. However, we can also calculate this using the data itself and a fiducial CMB spectrum. In wavelet space, the variance estimator at each wavelet coefficient is |$\langle W^\mathrm{ILC} W^{\mathrm{ILC}\dagger }\rangle = \boldsymbol {\omega }^\dagger \langle \boldsymbol {W} \boldsymbol {W}^\dagger \rangle \boldsymbol {\omega } = \boldsymbol {\omega }^\dagger \bf{R} \boldsymbol {\omega } = (\boldsymbol {a}^\dagger \bf{R}^{-1} \boldsymbol {a})^{-1} = (\sum _{c,c^{\prime }=1}^{N_{\rm c}} (R^{-1})^{cc^{\prime }})^{-1}$|, where each equality follows by applying in turn equations (11), (14) and (13) [from Section 3.5] and then expanding the vector notation [the vectors span the space defined by the number of input channels; explicitly, we assume unit CMB calibration |$\boldsymbol {a} = (1,1,1,1,1,1,1,1,1)$|]. In order to calculate the variance bias in the ILC, we can subtract the expected CMB variance |$\sum _{\ell m} C_\ell {| \Psi _{\ell m}^j |}^2$|. If analysis was done in wavelet space, the above would define the variance bias. If only considering the diagonal terms in wavelet space, it is possible to straightforwardly transform this estimate to real space through an inverse wavelet transform as in equation (10), substituting |$\Phi (\hat{\boldsymbol {n}})$| and |$\Psi ^j (\hat{\boldsymbol {\rho }}),$| respectively, for |${| \Phi (\hat{\boldsymbol {n}}) |}^2$| and |${| \Psi ^j (\hat{\boldsymbol {\rho }}) |}^2$|. However, for a full treatment of the variance bias including off-diagonal terms, full wavelet space covariances need to be calculated. Although many off-diagonal terms would decay, this would still be computationally demanding and will be the focus of future research. However, we reiterate that if analysis is carried out in wavelet space, then variance biases can be straightforwardly calculated from the information already contained in the results.
9 CONCLUSIONS
We have presented SILC, a new form of ILC that uses directional, scale-discretized wavelets to localize the ILC weighting according to the frequency, spatial, harmonic and, for the first time, morphological information in the CMB and its foregrounds. This is motivated by the anisotropic or filamentary morphology of both the CMB and astrophysical foregrounds in the microwave sky. We have tested SILC on 2015-release Planck data and simulations, demonstrating comparable performance to two existing component separation algorithms, NILC and SMICA, and investigated how to optimize the use of morphological information through directionality. We have explored increasing the amount of directionality in the algorithm, showing that on the largest and the smallest scales, the axisymmetric limit of the ILC works well, while at intermediate multipoles (from ℓ = 400–1500), increasing N (the number of directions per scale) leads to lower residuals. At high multipoles (|$\ell \gtrapprox 1500$|), the input data are already noise-limited, as is the ILC reconstruction, and directionality does not reduce the reconstruction error, as instrumental noise has no directional structure. We adopt an axisymmetric scaling function to analyse the largest scales, meaning that we use no directionality for ℓ < 32. This is motivated by the observation that increasing directionality on large scales gives increased reconstruction errors over the axisymmetric limit. We argue that these errors are due to empirical CMB cancellation in the ILC calculation, though the exact source must be estimated through large suites of realistic simulations. Allowing N to vary with wavelet scale is analogous to the choice of different harmonic band-limits at different scales.
We conclude that the introduction of directional wavelets allows greater flexibility in the ILC to make use of morphological information at targeted scales. Our multiprocessing implementation takes advantage of the wavelet scales to allow large-scale results to be analysed while small scales are still being processed. Moreover, our wavelet transforms are quick and exact, using MW sampling and FFTs (see Section 2). As discussed in Section 4, the ILC is prone to several sources of error and variance bias, including empirical CMB cancellation. This bias can be estimated through suites of Monte Carlo simulations, but we have also outlined (in Section 8) the ability to estimate biases directly from the data, most straightforwardly in wavelet space. We make our map products available at http://www.silc-cmb.org.16
This work on scalar signals (i.e. the temperature I component of the CMB) can be extended to spin signals (i.e. the polarization Q and U components of the CMB, or, equivalently, the E and B modes), by using spin wavelets (McEwen et al. 2014, 2015b; Leistedt et al. 2015). These are an extension of directional, scale-discretized wavelets to represent spin signals, such as CMB polarization, a spin ±2 signal. We expect that the directionality will be particularly suited to the anisotropic, filamentary nature of polarized foregrounds when observed on the sky, and in future work will present the application of SILC to CMB polarization data.
KKR thanks Franz Elsner for valuable discussions. KKR was supported by the Science and Technology Facilities Council. HVP and BL were partially supported by the European Research Council under the European Community's Seventh Framework Programme (FP7/2007-2013)/ERC grant agreement number 306478-CosmicDawn. JDM was partially supported by the Engineering and Physical Sciences Research Council (grant number EP/M011852/1). AP was supported by the Royal Society. Based on observations obtained with Planck (http://www.esa.int/planck), an ESA science mission with instruments and contributions directly funded by ESA Member States, NASA and Canada.
Planck 2015 Release Explanatory Supplement: UC CC tables (http://wiki.cosmos.esa.int/planckpla2015/index.php/UC_CC_Tables). For the 545 GHz map, the unit conversion is (58.0356 ± 0.0278) MJy Sr−1 KCMB− 1 and for the 857 GHz map, the unit conversion is (2.2681 ± 0.0270) MJy Sr−1 KCMB− 1.
FFP8 simulations are also available with bandpass mismatch, accounting for differences in the bandpasses of detectors nominally at the same frequency, leading to spurious signals in the frequency maps.
Planck 2015 Release Explanatory Supplement: the 2015 instrument model (http://wiki.cosmos.esa.int/planckpla2015/index.php/The_RIMO).
We adopt the zyz Euler convention corresponding to the rotation of a physical body in a fixed coordinate system about the z, y and z axes by χ, θ and ϕ, respectively.
|$\mathrm{FWHM}^j = 50 \sqrt{\frac{1200}{N_\mathrm{samp}^j}}$|. This value is the same as used in the NILC implementation on Planck data.
The details of its construction are given in Planck Collaboration I (2014). It can be downloaded from http://pla.esac.esa.int/pla and is labelled I_MASK in the NILC data products.
Note that many holes can be large and irregularly shaped due to the overlapping of smaller circular holes.
The exact specification for our infrastructure is an Intel Xeon E7-4890 2.8 GHz SMP with 4 × 15-core CPUs with 25.6 GB RAM per core, and an Intel Xeon E5-2697 2.7 GHz node with 2 × 12-core CPUs with 10.7 GB RAM per core.
In order to estimate full-sky spectra from a masked map, we correct the Cℓ by dividing by fsky = 0.978, a good approximation for a small mask. We elect to use point source masked spectra in order to concentrate our analysis on foreground and noise removal, rather than how maps are inpainted; all three maps are inpainted (at least) within the mask used.
The parameters come from the base_plikHM_TT_lowTEB likelihood. The values are available in the Planck 2015 Release Explanatory Supplement: 2015 Cosmological parameters and MC chains (http://wiki.cosmos.esa.int/planckpla2015/images/f/f7/Baseline_params_table_2015_limit68.pdf).
In particular, the scaling function and j = 0 wavelet are replaced by two directional wavelets with harmonic band-limits [1, 60] and [1, 128].
The DOI for our data release is 10.5281/zenodo.44373.
REFERENCES