Abstract

This is the third installment in a series of papers in which we investigate calibration artefacts. Calibration artefacts (also known as ghosts or spurious sources) are created when we calibrate with an incomplete model. In the first two papers of this series, we developed a mathematical framework which enabled us to study the ghosting mechanism itself. An interesting concomitant of the second paper was that ghosts appear in symmetrical pairs. This could possibly account for spurious symmetrization. Spurious symmetrization refers to the appearance of a spurious source (the antighost) symmetrically opposite an unmodelled source around a modelled source. The analysis in the first two papers indicates that the antighost is usually very faint, in particular, when a large number of antennas are used. This suggests that spurious symmetrization will mainly occur at an almost undetectable flux level. In this paper, we show that phase-only calibration produces an antighost that is N-times (where N denotes the number of antennas in the array) as bright as the one produced by phase and amplitude calibration and that this already bright ghost can be further amplified by the primary beam correction.

1 INTRODUCTION

Interferometric data can be severely degraded by instrumental and environmental errors. These errors need to be removed by calibration. Direction independent calibration is normally achieved by finding antenna gains that minimize the difference between observed and modelled visibilities (Rau et al. 2009; Wijnholds et al. 2010; Smirnov 2011; Yatawatta, Kazemi & Zaroubi 2012; van der Veen & Wijnholds 2013; Salvini & Wijnholds 2014; Smirnov & Tasse 2015). However, when we calibrate with an incomplete sky model, we inadvertently cause calibration artefacts, which appear as spurious positive or negative flux in the image. We refer to these spurious emission features as ghost sources or ghosts.

This is the third installment in a series of papers on these ghost phenomena. In the first paper (Grobler et al. 2014), which we will refer to as Paper I, we studied ghost formation by a regular east–west array using the Westerbork Synthesis Radio Telescope (WSRT) as an example. This analysis was extended to arbitrary array layouts in the second paper (Wijnholds, Grobler & Smirnov 2016), which we will refer to as Paper II. An interesting result of Paper II was that ghosts appear to form in symmetrical pairs. It could therefore explain the phenomenon of spurious symmetrization. In the context of this paper, spurious symmetrization refers to the appearance of spurious emission symmetrically opposite the unmodelled emission around modelled emission in a radio interferometric image (Cornwell & Fomalont 1999). Various cases of spurious symmetrization have been reported (Linfield 1986; Wilkinson, Conway & Biretta 1988; Cornwell & Fomalont 1999; Stewart 2014; Stewart et al. 2016; Paper II). Cornwell & Fomalont (1999) were one of the first to realize that a direct link exists between calibrating with an incomplete model and spurious symmetrization. They also observed that spurious symmetrization is particularly noticeable if an observation is made with a sufficiently small number of antennas; the implication being that the more antennas we use, the less likely it is for spurious symmetrization to occur at a detectable flux level.

In Papers I and II, we considered a simple two-source test case with one modelled and one unmodelled source. We found that a negative suppression ghost forms on top of the unmodelled source with a flux that is inversely proportional to the number of antennas in the array. In addition to the suppression ghost, an antighost also forms. The antighost forms symmetrically opposite the unmodelled source around the modelled source. The antighost is positive and has a flux that is inversely proportional to the square of the number of antennas in the array. This instance of spurious symmetrization is consistent with the assertion by Cornwell & Fomalont (1999) that the more antennas we use, the lower the flux level becomes at which spurious symmetrization will be detectable. Based on the analysis in Paper I and II, the brightness of the antighost should also be independent of the location of the unmodelled source.

However, Stewart (2014) recently reported on an intriguing instance of spurious symmetrization in which the antighost was much brighter than expected. Stewart was investigating whether a transient would be detected with Low Frequency Array (LOFAR's) Standard Imaging Pipeline (SIP; Heald et al. 2015). He devised this test to help him make sense of a puzzling transient that was recently detected by the Transients Pipeline (Swinbank et al. 2015; Stewart et al. 2016). For this test, he added a 60 Jy transient to an existing NCP (North Celestial Pole) data set. He then determined whether one could detect the synthetic transient after running it through the SIP. The 60 Jy source was not included in the calibration model. He ran the experiment multiple times, varying the location of the transient. The NCP field to which the 60-Jy source was added is dominated by 3C 61.1. The outcome of his test was quite unexpected as he was able to produce a very bright antighost when the transient was placed in some locations and no antighost at all when placed at others. In some instances, the antighost was even brighter than the transient itself, which could lead to it being incorrectly identified as a source. Two snapshots of Stewart's experiment are presented in Fig. 1.

Two snapshots created during Stewart's experiment. Stewart added a transient to an existing LOFAR data set which he then ran through the SIP. He found that when he placed the unmodelled transient in some positions (left-hand image), no antighost formed, but when he placed it at other locations, a bright antighost appeared (right-hand image).
Figure 1.

Two snapshots created during Stewart's experiment. Stewart added a transient to an existing LOFAR data set which he then ran through the SIP. He found that when he placed the unmodelled transient in some positions (left-hand image), no antighost formed, but when he placed it at other locations, a bright antighost appeared (right-hand image).

Stewart's result is surprising for two reasons.

  • According to Cornwell & Fomalont (1999) and Paper II, the antighost should be extremely faint when we use an array like LOFAR due to its large number of elements.

  • Paper II established that the brightness of the antighost was independent of the location of the unmodelled source.

Obviously, we want to be able to explain the outcome of Stewart's experiment quantitatively, since understanding Stewart's experiment will deepen our understanding of ghost phenomena and spurious symmetrization in particular. There are two major differences between Stewart's experimental setup and the analysis in Paper II.

  • Phase-only calibration was used in Stewart's case, while full-complex calibration was used in Paper II. Phase-only calibration can produce more stable solutions since we have fewer unknowns. Moreover, it is generally accepted that at the imaging stage, the amplitude error will be small and constant. The phase-error, however, can still be large and varying. Furthermore, as phase-only calibration conserves phase closures, it is also accepted to be quite safe.1

  • The primary beam was included in Stewart's simulations, but was not taken into account in Paper II.

In this paper, we will therefore study the impact of using phase-only calibration instead of full-complex calibration on ghost formation as well as the impact of primary beam correction. We will show that these two factors can indeed quantitatively explain the surprising instance of spurious symmetrization found in Stewart's experiment.

Studying the impact of phase-only calibration and primary beam correction further deepens our understanding of ghost phenomena. As mentioned earlier in this series, this is becoming crucial in light of the improved sensitivity (and consequently, increased susceptibility to artefacts) of current and upcoming telescopes like the LOFAR (van Haarlem et al. 2013), the Karoo Array Telescope (MeerKAT; Jonas 2009) and the Square Kilometre Array (SKA; Dewdney et al. 2009). Another important motivation, as demonstrated by Stewart's experiment, is that transient pipelines run the risk of mistakingly identifying the antighost as a transient source. An improved understanding of ghost phenomena may help to avoid such errors.

In the next two sections, we formulate the full-complex as well as the phase-only calibration problem and briefly review some of the ghost analysis techniques we developed in the first two papers. In Section 4, we present a theoretical analysis of ghost formation with phase-only calibration. We compare the results with the results for full-complex calibration described in Paper II in Section 5. The effect of primary beam correction is investigated in Section 6 before we conclude our paper with a summary of our findings. The images in this paper use the cubehelix colour scheme (Green 2011).

2 CALIBRATION

2.1 Full-complex calibration

In its most basic form, calibration boils down to minimizing the difference between observed and predicted visibilities by estimating the complex instrumental gain response. Mathematically speaking, unpolarized calibration is realized by minimizing the following optimization problem
(1)
where
  • The operator (*)H denotes the Hermitian transpose. Moreover, ⊙ denotes the Hadamard product and ‖*‖ denotes the appropriate matrix or vector norm.

  • |$\boldsymbol {\mathcal {R}}$| is the observed visibility matrix. Each entry of |$\boldsymbol {\mathcal {R}}$|⁠, which we denote by rpq, represents the visibility that was observed by the baseline formed by antennas p and q.

  • |$\boldsymbol {\mathcal {M}}$| is the model visibility matrix and we denote an entry of |$\boldsymbol {\mathcal {M}}$| with mpq. The entries of |$\boldsymbol {\mathcal {M}}$| contain model visibilities, i.e. synthetic visibilities that were created from the calibration sky model.

  • |$\boldsymbol {G}$| is the antenna gain matrix and is equal to |$\mathrm{diag} \left({\boldsymbol {g}} \right)$|⁠, with |$\boldsymbol {g}= [g_{1},g_{2},\ldots ,g_{N}]^T$| being the complex instrumental responses of the antennas. The operator diag(*) creates a matrix by placing the input vector on the main diagonal of an otherwise zero matrix, while (*)T denotes the transpose of its operand. We can decompose the gain of the pth element into its amplitude γp and a phasor |$\rho _p = e^{i \varrho _p}$|⁠. The amplitudes can be stacked in a vector |${{\bf {\gamma }}}= \left[ \gamma _1, \ldots , \gamma _N \right]^T$| and the phasors in a vector |$\rho = \left[ \rho _1, \ldots , \rho _N \right]^T$|⁠, such that |$\boldsymbol {g}= {{\bf {\gamma}}}{\odot } \rho$|⁠. We will denote the number of antennas in our array by N and the number of baselines by B. The number of baselines can be calculated from the number of antennas with B = (N2 − N)/ 2.

  • |$\boldsymbol {\mathcal {G}}= \boldsymbol {g}\boldsymbol {g}^H$| is known as the visibility gain matrix. An entry of |$\boldsymbol {\mathcal {G}}$| is denoted by gpq. We can compute gpq with
    (2)
    where |$\overline{(\ast )}$| denotes complex conjugation. The Hadamard inverse of |$\boldsymbol {\mathcal {G}}$| is denoted with |$\boldsymbol {\mathcal {G}}^{{\odot }-1}$| and is known as the visibility calibration matrix. We use the visibility calibration matrix to correct our visibilities once we have successfully estimated |$\boldsymbol {g}$|⁠.

2.2 Phase-only calibration

When we perform full-complex calibration, we assume that the phase and amplitude of our visibilities were corrupted by our antenna gains. In the case of phase-only calibration, we assume that only the phases of our visibilities are affected by the antenna gains. The phase-only equivalent of equation (1) is equal to
(3)
where
  • |$\boldsymbol {P}= \mathrm{diag} \left({\rho } \right)$| is the antenna phase matrix.

  • |$\boldsymbol {\mathcal {P}}= \rho \rho ^H$| denotes the visibility phase matrix. An entry of |$\boldsymbol {\mathcal {P}}$| is denoted by ρpq.

Once we have estimated |$\rho$|⁠, we can correct our visibilities, i.e. remove the errors caused by the antenna gains, with
(4)
where
  • |$\boldsymbol {\mathcal {R}}_{\boldsymbol {\mathcal {P}}}^{(c)}$| is the corrected visibility matrix.

  • |$\boldsymbol {\mathcal {P}}^{{\odot }-1}$| represents the visibility phase calibration matrix. The phase calibration matrix is computed by taking the Hadamard inverse of |$\boldsymbol {\mathcal {P}}$|⁠.

3 PRELIMINARIES

In this section, we present the remaining building blocks needed for our analysis. In Section 3.1, we present the experimental assumptions we adhere to in this paper. In Section 3.2, we define a couple of useful linear transformations. The extrapolation algorithm is presented in Section 3.3. Extrapolation employs the linear transformations in Section 3.2 to transform |$\boldsymbol {\mathcal {R}}$| and |$\boldsymbol {\mathcal {M}}$| into function-valued matrices. We can now use these newly created matrices and standard calibration to create per-baseline artefact maps (a map containing only the calibration systematics). Moreover, these artefact maps contain no sidelobes as extrapolation operates on continuous functions (see Algorithm 1).

3.1 Two-source scenario

Unless explicitly stated otherwise, all results presented in this paper are subject to the following assumptions.

  • We are dealing with a monochromatic observation.

  • No error was added to our visibilities by the interferometer, i.e. |$\boldsymbol {g}_{\rm t} = \boldsymbol {1}$|⁠. We denote an all one vector of size N with |$\boldsymbol {1}$|⁠. The subscript t of |$\boldsymbol {g}_{\rm t}$| is used to indicate that we are referring to the true antenna gain vector which is corrupting the observed visibilities instead of the gain vector estimate which is obtained when we calibrate.

  • We assume the w-term is negligible, i.e. we are observing in a narrow field of- view.

  • We do not consider the effect of noise as O.M. Smirnov has shown that including noise does not alter the positions at which the ghosts form (Smirnov 2010).

With regards to the true sky and the calibration sky model, we assume the following.

  • Our true sky consists of two sources, one in the field centre with flux A1 and another off-centre with flux A2. We denote the positional vector of the off-centre source by |$\boldsymbol {s}_0 = (l_0,m_0)$| (a direction-cosine vector). Stated differently, if we were to observe these two sources, then antennas p and q would observe
    (5)
    where |$\boldsymbol {b}_{pq} = (u_{pq},v_{pq})^T$| denotes the uv-coordinate at which rpq was measured. We use the symbol ‘·’ to denote the standard dot product which operates on equal length row vectors. It is important to realize that the coordinates upq and vpq in equation (5) are already expressed in wavelengths and not in m (we have already divided by the observational wavelength λ). In Paper II, we showed that all the uv-tracks of an array can be generated from a single circular reference track |$\boldsymbol {b}_0^\circ (t)$|⁠. The circular reference track |$\boldsymbol {b}_0^\circ (t)$| is depicted in Fig. 2.
  • We only include the centre source in our calibration model, i.e.
    (6)
  • Since equation (3) is invariant to amplitude scaling and positional shifts, we may assume that our modelled source is in the centre and that A1 = 1 without any loss of generality,

  • Unless otherwise stated, A2 ≪ A1, i.e. that our calibration model is reasonably accurate.

The linear transformations $\widehat{X}_{pq}(\boldsymbol {b}_0^{\circ })$ and $\widetilde{X}_{pq}^{-1}(\boldsymbol {b}_{pq})$.
Figure 2.

The linear transformations |$\widehat{X}_{pq}(\boldsymbol {b}_0^{\circ })$| and |$\widetilde{X}_{pq}^{-1}(\boldsymbol {b}_{pq})$|⁠.

3.2 Linear transformations

All the uv-tracks of an interferometer can be mapped on to a single circular reference track |$\boldsymbol {b}_0^\circ (t)$| by using three linear transformations, namely

  • a scaling described by
    (7)
  • a rotation described by
    (8)
  • and a translation |$\Delta \boldsymbol {b}_{pq}$|⁠.

A composition of two or more linear transformations results in another linear transformation. In this paper, we will use the following compositions:

  • a composition of scaling, rotation and translation
    (9)
  • a composition of scaling and translation
    (10)
  • a composition of scaling and rotation
    (11)

In the expressions above, δ0 is the declination of the field centre. The quantities ϕpq, θpq and |$\Delta \boldsymbol {b}_{pq} = (0,\Delta b_{pq})^T$| are determined by the array geometry and are formally defined in Fig. 2. The linear transformations |$\widehat{X}_{pq}(\boldsymbol {b}_0^{\circ })$| and |$\widetilde{X}_{pq}^{-1}(\boldsymbol {b}_{pq})$| are also depicted in Fig. 2.

We denote the functional inverse of |$\widehat{X}_{pq}$|⁠, |$\widetilde{X}_{pq}$| and Xpq with |$\widehat{X}_{pq}^{-1}$|⁠, |$\widetilde{X}_{pq}^{-1}$| and |$X_{pq}^{-1}$|⁠, respectively.

Within this paper, we use each of the aforementioned linear transformations for a specific purpose.

  • |$\widehat{X}_{pq}$|⁠: the circular reference track |$\boldsymbol {b}_{0}^{\circ }(t)$| is depicted in the leftmost coordinate system of Fig. 2. We can generate any uv-track of an array by applying |$\widehat{X}_{pq}$| to |$\boldsymbol {b}_{0}^{\circ }(t)$|⁠. This fact is illustrated by the mapping of |$\boldsymbol {b}_{0}^{\circ }(t)$| in the leftmost coordinate system to |$\boldsymbol {b}_{pq}(t)$| in the middle coordinate system of Fig. 2. The linear transformation |$\widehat{X}_{pq}(\boldsymbol {b}_{0}^{\circ })$| is exploited by the extrapolation procedure (presented in Section 3.3) to create per-baseline artefact maps.

  • |$\widetilde{X}_{pq}$|⁠: the effect of applying |$\widetilde{X}_{pq}^{-1}$| to |$\boldsymbol {b}_{pq}(t)$| is depicted by the mapping from |$\boldsymbol {b}_{pq}(t)$| in the middle coordinate system of Fig. 2 to the uv′-track in the rightmost coordinate system of Fig. 2. Since there is no rotation transformation present in |$\widetilde{X}_{pq}$|⁠, the uv′-track in the rightmost coordinate system of Fig. 2 does not start at coordinate (1,0). An additional de-rotation by θpq is required to achieve this, i.e. |$\widetilde{X}_{pq}^{-1}(\boldsymbol {b}_{pq})$| allows us to calculate the rotation angle θpq which is a quantity that is required by |$\widehat{X}_{pq}$| to function properly.

  • Xpq: it turns out that the translation |$\Delta \boldsymbol {b}_{pq}$| does not affect the positions of the ghosts, only their fluxes as discussed in Paper II. Since Xpq does not contain a translation transformation, it can be used to determine the positions at which the ghosts arise [see equation (38)].

3.3 Extrapolation

Extrapolation is a technique, which exploits the fact that all uv-tracks of an interferometer can be derived from a single imaginary reference track (via three linear transformations) to create clean per-baseline artefact maps (an artefact map untarnished by uv-sampling).

Extrapolation requires the following three steps.

  • Reformulate visibilities as functionals. Rewrite the visibility matrix |$\boldsymbol {\mathcal {Y}}$| as a function of the imaginary reference track |$\boldsymbol {b}_0^{\circ }$|⁠, i.e. |$\boldsymbol {\mathcal {Y}}(\boldsymbol {b}_0^{\circ })$|⁠. The |$\boldsymbol {\mathcal {Y}}$| matrix is a proxy matrix and can refer to an observed visibility, model visibility, predicted visibility, visibility gain, visibility calibration, corrected visibility matrix or any one of the distilled visibility matrices. Distillation is discussed in Section 4.4.

  • Derive the intermediate extrapolated visibility matrix. Replace |$\boldsymbol {b}_0^{\circ }$| with the free parameter |$\boldsymbol {b}= (u,v)^T$| to obtain |$\boldsymbol {\mathcal {Y}}(\boldsymbol {b})$|⁠.

  • Calculate the extrapolated visibility matrix. It turns out that the entries of |$\boldsymbol {\mathcal {Y}}(\boldsymbol {b})$| have no physical meaning if left as it is. To make the entries meaningful, we substitute |$\boldsymbol {b}$| with |$\widehat{X}_{pq}^{-1}(\boldsymbol {b})$| to obtain |$\boldsymbol {\mathcal {Y}}(\widehat{X}_{pq}^{-1}(\boldsymbol {b}))$|⁠.

If we now image the pqth entry of |$\boldsymbol {\mathcal {Y}}(\widehat{X}_{pq}^{-1}(\boldsymbol {b}))$|⁠, then the resulting image will be completely devoid of sidelobes as the pqth entry of |$\boldsymbol {\mathcal {Y}}(\widehat{X}_{pq}^{-1}(\boldsymbol {b}))$| is defined over the entire uv plane. Extrapolation is a per-baseline operation.

In Algorithm 1, we use the extrapolation procedure to specifically construct an empirical estimate of |$g_{pq}(\widehat{X}_{pq}^{-1} (\boldsymbol {b}))$|⁠. In this algorithm, |$\mathcal {K}$| denotes the set containing all the index values of the sources that were ‘observed’, while |$\mathcal {H}$| denotes the set containing the index values of the sources that were included in the calibration model, i.e. |$\mathcal {H}\subseteq \mathcal {K}$|⁠. The extrapolation procedure can therefore be applied to more complicated sky model scenarios (and is not limited to the two-source test case presented in Section 3.1). The artefact map associated with |$g_{pq}(\widehat{X}_{pq}^{-1} (\boldsymbol {b}))-1$| (we subtract one to distill out the calibration artefacts) is obtained by taking the inverse Fast Fourier Transform (FFT) of the empirical estimate of |$g_{pq}(\widehat{X}_{pq}^{-1} (\boldsymbol {b}))-1$|⁠, which is now trivial as the empirical estimate of |$g_{pq}(\widehat{X}_{pq}^{-1} (\boldsymbol {b}))-1$| is defined on a regular grid. This results in an artefact map without any sidelobes. Moreover, in Algorithm 1, ‘○’ denotes operator composition. Also note that any calibration algorithm may be used to realize Algorithm 1.

A more detailed explanation of extrapolation can be found in Paper II.

Algorithm 1

Extrapolation: |$g_{pq}(\widehat{X}_{pq}^{-1} (\bf b))$|

graphic
graphic
Algorithm 1

Extrapolation: |$g_{pq}(\widehat{X}_{pq}^{-1} (\bf b))$|

graphic
graphic

4 PHASE-ONLY GHOST PATTERN

In this section, we derive the phase-only ghost pattern that is associated with the two-source test case presented in Section 3.1. We do so by following the same approach we took for full-complex calibration in Paper II. We therefore start by developing a general perturbation analysis framework for phase-only calibration. We then use this general framework to analyse the two-source test case in Section 4.2. We then apply extrapolation in Section 4.3 to the result from the perturbation analysis. For the sake of completeness, we also briefly touch on the topics of distillation and corrected visibilities in Sections 4.4 and 4.5. A brief overview of imaging and how it pertains to calibration artefacts is presented in Section 4.6. In the last section, we extend the basic per-baseline taxonomy from Paper II. This extended taxonomy makes it easier to compare the ghost patterns that are produced by full-complex and phase-only calibration, respectively.

4.1 Perturbation analysis

Perturbation analysis is based on the assumption that we are using a fairly accurate calibration sky model, implying that the antenna gains are only slightly perturbed due to the incomplete sky model. In the perturbation approach that we present here, we estimate the amount of perturbation the gains experience, by first linearizing the estimation problem, making it easier to solve. This linearization is only possible if the perturbation we are trying to estimate is relatively small.

The observed visibility matrix is equal to
(12)
where |$\boldsymbol {\mathcal {R}}_k$| contains the visibilities of the sources that we know exist, i.e. |$\boldsymbol {\mathcal {R}}_k = \boldsymbol {\mathcal {M}}$|⁠, |$\boldsymbol {\mathcal {R}}_u$| contains the visibilities of the sources that we are unaware of, |$\boldsymbol {g}_{\rm t}$| is the true antenna gain vector and |$\boldsymbol {G}_{\rm t} = \mathrm{diag} \left({\boldsymbol {g}_{\rm t}} \right)$|⁠. We can also decompose |$\boldsymbol {g}_{\rm t}$| into an amplitude vector |${{\bf {\gamma }}}_{\rm t}=\left[ \gamma _1^{\rm t}, \ldots , \gamma _N^{\rm t} \right]^T$| and phasor vector |$\rho _{\rm t}=[ e^{{i\varrho _1^{\rm t}}}, \ldots , e^{i\varrho _N^{\rm t}} ]^T$|⁠, such that |$\boldsymbol {g}_{\rm t} = {{\bf {\gamma }}}_{\rm t} {\odot } \rho _{\rm t}$|⁠. Equation (12) allows us to reformulate equation (3) as
(13)
If we ignore the true amplitude variations |${{\bf {\gamma }}}_{\rm t}$| or we neglect the mismatch between the model visibilities (described by |$\boldsymbol {\mathcal {R}}_k$|⁠) and the observed visibilities (described by |$\boldsymbol {\mathcal {R}}_k + \boldsymbol {\mathcal {R}}_u$|⁠), then we will induce a phase bias |$\Delta \varrho$| into |$\rho$|⁠, i.e.
(14)
Without loss of generality, we may assume that |$\varrho _{\rm t} = \boldsymbol {0}$|⁠. Assuming that we are justified in using phase-only calibration and that we have a reasonably accurate sky model, the biases will be small. We may therefore use the approximation |$e^{i\Delta \varrho } \approx 1 + i \Delta \varrho$|⁠. It then follows that
(15)
Equation (15) allows us to rewrite equation (13) to
(16)
Since |$\Delta \varrho$| is small, we may ignore the product |$\Delta \varrho \Delta \varrho ^H$| and write our minimization problem as
(17)
In equation (17), we made use of the fact that |$\boldsymbol {x}\boldsymbol {x}^H{\odot }\boldsymbol {\mathcal {X}}= \mathrm{diag} \left({\boldsymbol {x}} \right)\boldsymbol {\mathcal {X}}\mathrm{diag} \left({\boldsymbol {x}} \right)^H$|⁠, where |$\boldsymbol {x}$| is a vector and |$\boldsymbol {\mathcal {X}}$| is matrix of compatible size. In the next step, we will use
(18)
where |$\boldsymbol {\mathcal {A}}$| and |$\boldsymbol {\mathcal {C}}$| are matrices of compatible size and • denotes the Khatri–Rao or column-wize Kronecker product of two matrices. We swapped around and added identity matrices to equation (17), denoted by |$\boldsymbol {I}$|⁠, as visual cues to enable the reader to follow how one can apply equation (18) to equation (17) to obtain equation (19). We thus find
(19)
Note that the first mentioned |$\Delta \varrho$| in the first equation of equation (19) is not conjugated, although strictly speaking, it should be. We do not conjugate |$\Delta \varrho$| in equation (19), since |$\Delta \varrho$| is, per definition, a real valued vector.
If we apply the normal equation method to equation (19), we readily obtain
(20)
where
(21)
There is an inherent ambiguity present in equation (20), as we did not explicitly select a reference antenna. However, this ambiguity has no effect on the outcome of our analysis, as it drops out when we calculate |$\boldsymbol {\mathcal {P}}$|⁠.

4.2 Two-source perturbation results

In this section, we apply the general result derived in the previous section to our two-source test case. It follows from equations (5) and (6) (if A1 = 1) that
(22)
(23)
where |$\boldsymbol {a}= \left[ e^{-2\pi i \boldsymbol {b}_1^T \cdot \boldsymbol {s}_0},\ldots , e^{-2\pi i \boldsymbol {b}_N^T \cdot \boldsymbol {s}_0} \right]^T$| and |$\boldsymbol {b}_n$| is the (x, y, z)-position of the nth antenna in units of λ. It also follows from Section 3.1 that |$\boldsymbol {g}_{\rm t} = \boldsymbol {1}$|⁠. Substitution of these results in equation (21) gives
(24)
Note that the dimensions of the matrices in equation (24) are N2 × N.
To find an expression for our two-source model using equation (20), we need to find the inverse of
(25)
Since this matrix is rank deficient, we will resort to the pseudo-inverse, denoted by (*) to emulate the behaviour of most least-squares fitting routines. We find that
(26)
for which a proof can be found in Appendix A.
If we substitute equations (24) and (26) in equation (20), we obtain
(27)
where |$\scr {I}\lbrace \ast \rbrace$| denotes the imaginary operator. Substitution of this result into equation (15) gives
(28)
where c = A2i/N. Concentrating on a single element of the visibility phase matrix, we find
(29)
Simplifying equation (29) results in
(30)
where |$\boldsymbol {b}_{ps} = \boldsymbol {b}_p - \boldsymbol {b}_s$| and |$\boldsymbol {b}_{rq} = \boldsymbol {b}_r - \boldsymbol {b}_q$| (also see Section 3.1).
The fact that a sine can be written in terms of two complex exponentials allows us to rewrite equation (30) to
(31)
where
(32)
and
(33)
with
(34)
where ‘∧’ denotes the logical and. Moreover, we define
(35)

4.3 Extrapolation

We will use extrapolation to show that each term in equation (31) produces a ghost artefact in the image. We can express ρpq as a function of the reference baseline |$\boldsymbol {b}_0^{\circ }$|⁠, i.e.
(36)
To extrapolate, we replace |$\boldsymbol {b}_0^{\circ }$| with |$\boldsymbol {b}=(u,v)^T$| to obtain
(37)
As discussed in Section 3.3, we can create physically meaningful extrapolated visibilities by replacing |$\boldsymbol {b}$| in equation (31) with |$\widehat{X}_{pq}^{-1}(\boldsymbol {b})$| to obtain
(38)
where τpq, 0 = σpq, 0, τpq, rs = σpq, rsβpq, rs and
(39)
and
(40)
with
(41)
Since the Fourier inverse of a complex exponential is a delta function, each term in the sum of equation (38) can be interpreted as a spurious point source, i.e. as a calibration artefact or ghost. The amplitude of the spurious point source is equal to |$\tau _{pq,rs}\in \bf{C}$| and its position vector is equal to |$\boldsymbol {s}_{pq}^{rs}$|⁠. Furthermore, the amplitudes of the ghosts exhibit the following symmetry
(42)
In equation (35), we defined a set of baseline indices to uniquely identify all baselines that produce a ghost in the ghost pattern for baseline pq. Since each term in equation (38) is associated with a specific ghost, we can map our set of baseline indices to a set of ghost locations for baseline pq, i.e. we can define the mapping |$M_{pq}: \bf{P}(\mathcal {J}) \rightarrow \bf{P}(\mathcal {Q}_{pq})$| as
(43)
where |$\mathcal {X}= \lbrace rs\rbrace _{r\ne s}$|⁠, |$\mathcal {Q}_{pq} = \lbrace \boldsymbol {s}_{pq}^{rs}\rbrace _{r\ne s}$| and |$\bf{P}(\ast )$| denotes the power set of its operand (which has to be a set). We can use equation (43) to group all per-baseline phase-only ghost positions that are predicted by our perturbation analysis into one set
(44)
We are actually interested in studying the ghost pattern that is generated by the instrument as a whole. It is therefore more appropriate to merge the per-baseline perturbation results into one set |$\mathcal {T}$|⁠. We can construct |$\mathcal {T}$| as follows
(45)
Note that |$\mathcal {T}_{pq}$| and |$\mathcal {T}$| contain no recurring elements.2
Let |$\boldsymbol {0}= (0,0)$|⁠. Equation (45) allows us to reformulate equation (38) as
(46)
where |$\widehat{\mathcal {T}} = \mathcal {T}\cup \lbrace \boldsymbol {0}\rbrace$| and
(47)
In equation (47), |$\mathcal {I}_{pq,\tilde{\boldsymbol {s}}}=\lbrace rs|(\boldsymbol {s}_{pq}^{rs}\in \mathcal {T}_{pq}) \wedge (\tilde{\boldsymbol {s}} = \boldsymbol {s}_{pq}^{rs}) \wedge (\tilde{\boldsymbol {s}} \ne \boldsymbol {0})\rbrace$|⁠.

4.4 Distillation

In our analysis, we assume that our model is reasonably accurate, such that the deviations from the true gain values will be small. As a result, the ghosts will be very faint. We therefore have to distill |$\boldsymbol {\mathcal {P}}$| and |$\boldsymbol {\mathcal {P}}^{{\odot }-1}$| to make the ghosts visible. Distillation effectively removes all real emission from the image, such that we can image only the deviations from the true gain values (see Papers I and II). We can distill |$\boldsymbol {\mathcal {P}}$| by subtracting |$\breve{\boldsymbol {1}}= \boldsymbol {1}\boldsymbol {1}^H$| from it, i.e.
(48)
The visibility phase calibration matrix can be described by
(49)
since |$(\rho _{pq})^{-1} = \overline{\rho }_{pq}$|⁠. As was the case in Paper II, equation (49) implies that the brightest ghost positions are preserved under the Hadamard inverse. Although the positions are preserved, the fluxes of the ghosts change sign. Comparing equations (28) and (49), we can easily see that
(50)
where |$\tau _{pq,\tilde{\boldsymbol {s}}}^{{\odot }-1} = - \tau _{pq,\tilde{\boldsymbol {s}}}$|⁠. Note that the index set |$\mathcal {T}$| is used in equations (48) and (50), while the index set |$\widehat{\mathcal {T}}$| was used in equation (46). The reason being, according to the perturbation approach presented in Section 4.2, there is no ghost at |$\boldsymbol {0}$|⁠.

4.5 Corrected visibilities

The corrected visibility matrix |$\boldsymbol {\mathcal {R}}_{\boldsymbol {\mathcal {P}}}^{(c)}$| was defined in equation (4). As in the case of |$\boldsymbol {\mathcal {P}}$|⁠, distilling |$\boldsymbol {\mathcal {R}}_{\boldsymbol {\mathcal {P}}}^{(c)}$| allows us to study the fainter spurious emission in |$\boldsymbol {\mathcal {R}}_{\boldsymbol {\mathcal {P}}}^{(c)}$|⁠. The distilled corrected visibility matrix |$\boldsymbol {\mathcal {R}}^{\Delta }_{\boldsymbol {\mathcal {P}}}$| (also known as the residual visibility matrix) can be calculated with
(51)
implying that
(52)

Inspecting equation (52) reveals that when we correct, we actually convolve the atomic ghost pattern in |$\boldsymbol {\mathcal {P}}^{{\odot }-1}-\breve{\boldsymbol {1}}$| with the modelled and unmodelled source. Our corrected residuals are therefore made up of two ghost patterns, one around the modelled source and one around the unmodelled source, that were added together coherently (Papers I and II).

4.6 Imaging

Imaging and how it relates to calibration systematics was discussed at length in Paper II. In this section, we will only summarize the most important quantities we obtained in Paper II. We denote the dirty images associated with |$\boldsymbol {\mathcal {P}}^{{\odot }-1}-\breve{\boldsymbol {1}}$| and |$\boldsymbol {\mathcal {R}}^{\Delta }_{\boldsymbol {\mathcal {P}}}$| with |$I_D^{\boldsymbol {\mathcal {P}}^{{\odot }-1}-\breve{\boldsymbol {1}}}$| and |$I_D^{\boldsymbol {\mathcal {R}}^{\Delta }_{\boldsymbol {\mathcal {P}}}}$|⁠, respectively. As we showed in Section 4.3, each baseline has its own unique ghost pattern. When we image, we effectively average all the individual per-baseline ghost patterns together. If a ghost happens to form consistently in each baseline, its per-baseline flux will add up coherently. On the other hand, if a ghost is only present in one baseline averaging, the different baselines together will result in a dimmer ghost. In mathematical terms, the aforementioned result translates into the following: if we assume natural weighting, then we can calculate the flux of the ghost at |$\tilde{\boldsymbol {s}}$| in |$I_D^{\boldsymbol {\mathcal {P}}^{{\odot }-1}-\breve{\boldsymbol {1}}}$| with
(53)
where 〈*〉p<q denotes averaging over all the baselines and |$\scr {R}\lbrace \ast \rbrace$| denotes the real operator. In Paper II, we showed that only the real flux of a ghost manifests at |$\tilde{\boldsymbol {s}}$|⁠, which explains why we only average the real flux together in equation (53). It is actually more sensible to calculate the maximum possible flux that the ghost at position |$\tilde{\boldsymbol {s}}$| can contribute to the image, since some of the imaginary flux of a ghost manifests in the vicinity of |$\tilde{\boldsymbol {s}}$| due to the Ghost Spread Function (Papers I and II). We therefore also define
(54)
Equation (54) represents the maximum flux the ghost at |$\tilde{\boldsymbol {s}}$| could have contributed to the image. It contributes this maximum amount if it is completely real, and can therefore be interpreted as a type of upper bound. Equation (52) also allows us to define
(55)
and
(56)
which are the |$I_D^{\boldsymbol {\mathcal {R}}^{\Delta }_{\boldsymbol {\mathcal {P}}}}$| equivalent of equations (53) and (54).

4.7 Augmented per-baseline ghost taxonomy

Based on Paper II, the result equivalent of equation (31) for full-complex calibration is
(57)
where
(58)
and
(59)
The index sets |$\mathcal {I}_1$|⁠, |$\mathcal {I}_2$| and |$\mathcal {I}_3$| are already defined in equation (34). The fourth index set is defined as
(60)
The index set |$\mathcal {J}$| was defined in Section 4.3.

We can make the following observations if we inspect equations (31) and (57).

  • Some of the coefficients (σpq, rs and cpq, rs) in equations (31) and (57) are inversely proportional to N, while others are inversely proportional to N2. The ghosts that are associated with the coefficients that are inversely proportional to N are called proto-ghosts, and those associated with the coefficients that are inversely proportional to N2 are called deutero-ghosts (Paper II).

  • Some coefficients in equations (31) and (57) have a positive sign, while others have a negative sign. If the sign is positive and the ghost flux is completely real, it will manifest itself as a negative ghost, i.e. as a suppressing ghost. If the sign is negative and the ghost flux is completely real, it will manifest itself as a positive ghost, i.e. as an amplifying ghost. The inverse relation between the sign of the coefficients and whether a ghost manifests as a positive or negative ghost is due to the fact that |$(\rho _{pq})^{-1} = \overline{\rho }_{pq}$| (also see equation 49).

We can augment the per-baseline taxonomy of Paper II by using the second observation. The details of this proposed augmentation follows below.

Assigning the false colour blue to the coefficients with a positive sign and the false colour red to those with a negative sign enables us to divide |$\mathcal {I}$| into the following mutually exclusive subsets: |$\mathcal {I}_{PB} = \bigcup _{i=1}^3 \mathcal {I}_i$|⁠, |$\mathcal {I}_{PR} = \bigcup _{i=1}^3 \mathcal {I}_i^\prime$|⁠, |$\mathcal {I}_{DB} = \emptyset$| and |$\mathcal {I}_{DR} = \emptyset$|⁠. Similarly, we can divide |$\mathcal {J}$| into |$\mathcal {J}_{PB} = \bigcup _{i=1}^3 \mathcal {I}_i$|⁠, |$\mathcal {J}_{PR} = \emptyset$|⁠, |$\mathcal {J}_{DB} = \emptyset$| and |$\mathcal {J}_{DR} = \mathcal {I}_4$|⁠. The aforementioned index sets are also presented in Table 1.

Table 1.

Comparison between the composite baseline index sets produced by phase-only calibration and full-complex calibration. The entries in the last row of each subtable were obtained by taking the union of all the other entries in the column in which they reside. The same applies to the entries in the last column of each subtable.

Phase-onlyFull-complex
Sign|$\frac{1}{N}$||$\frac{1}{N^2}$|all|$\frac{1}{N}$||$\frac{1}{N^2}$|all
+|$\mathcal {I}_{PB} = \bigcup _{i=1}^3 \mathcal {I}_i$||$\mathcal {I}_{DB} = \emptyset$||$\mathcal {I}_B = \mathcal {I}_{PB} \cup \mathcal {I}_{DB}$||$\mathcal {J}_{PB} = \bigcup _{i=1}^3 \mathcal {I}_i$||$\mathcal {J}_{DB} = \emptyset$||$\mathcal {J}_B = \mathcal {J}_{PB} \cup \mathcal {J}_{DB}$|
|$\mathcal {I}_{PR} = \bigcup _{i=1}^3 \mathcal {I}_i^\prime$||$\mathcal {I}_{DR} = \emptyset$||$\mathcal {I}_R = \mathcal {I}_{PR} \cup \mathcal {I}_{DR}$||$\mathcal {J}_{PR} = \emptyset$||$\mathcal {J}_{DR} = \mathcal {I}_4$||$\mathcal {J}_R = \mathcal {J}_{PR} \cup \mathcal {J}_{DR}$|
|$\mathcal {I}_P = \mathcal {I}_{PB} \cup \mathcal {I}_{PR}$||$\mathcal {I}_D = \emptyset$||$\mathcal {I}$||$\mathcal {J}_P = \mathcal {J}_{PB} \cup \mathcal {J}_{PR}$||$\mathcal {J}_D = \mathcal {J}_{DB} \cup \mathcal {J}_{DR}$||$\mathcal {J}$|
Phase-onlyFull-complex
Sign|$\frac{1}{N}$||$\frac{1}{N^2}$|all|$\frac{1}{N}$||$\frac{1}{N^2}$|all
+|$\mathcal {I}_{PB} = \bigcup _{i=1}^3 \mathcal {I}_i$||$\mathcal {I}_{DB} = \emptyset$||$\mathcal {I}_B = \mathcal {I}_{PB} \cup \mathcal {I}_{DB}$||$\mathcal {J}_{PB} = \bigcup _{i=1}^3 \mathcal {I}_i$||$\mathcal {J}_{DB} = \emptyset$||$\mathcal {J}_B = \mathcal {J}_{PB} \cup \mathcal {J}_{DB}$|
|$\mathcal {I}_{PR} = \bigcup _{i=1}^3 \mathcal {I}_i^\prime$||$\mathcal {I}_{DR} = \emptyset$||$\mathcal {I}_R = \mathcal {I}_{PR} \cup \mathcal {I}_{DR}$||$\mathcal {J}_{PR} = \emptyset$||$\mathcal {J}_{DR} = \mathcal {I}_4$||$\mathcal {J}_R = \mathcal {J}_{PR} \cup \mathcal {J}_{DR}$|
|$\mathcal {I}_P = \mathcal {I}_{PB} \cup \mathcal {I}_{PR}$||$\mathcal {I}_D = \emptyset$||$\mathcal {I}$||$\mathcal {J}_P = \mathcal {J}_{PB} \cup \mathcal {J}_{PR}$||$\mathcal {J}_D = \mathcal {J}_{DB} \cup \mathcal {J}_{DR}$||$\mathcal {J}$|
Table 1.

Comparison between the composite baseline index sets produced by phase-only calibration and full-complex calibration. The entries in the last row of each subtable were obtained by taking the union of all the other entries in the column in which they reside. The same applies to the entries in the last column of each subtable.

Phase-onlyFull-complex
Sign|$\frac{1}{N}$||$\frac{1}{N^2}$|all|$\frac{1}{N}$||$\frac{1}{N^2}$|all
+|$\mathcal {I}_{PB} = \bigcup _{i=1}^3 \mathcal {I}_i$||$\mathcal {I}_{DB} = \emptyset$||$\mathcal {I}_B = \mathcal {I}_{PB} \cup \mathcal {I}_{DB}$||$\mathcal {J}_{PB} = \bigcup _{i=1}^3 \mathcal {I}_i$||$\mathcal {J}_{DB} = \emptyset$||$\mathcal {J}_B = \mathcal {J}_{PB} \cup \mathcal {J}_{DB}$|
|$\mathcal {I}_{PR} = \bigcup _{i=1}^3 \mathcal {I}_i^\prime$||$\mathcal {I}_{DR} = \emptyset$||$\mathcal {I}_R = \mathcal {I}_{PR} \cup \mathcal {I}_{DR}$||$\mathcal {J}_{PR} = \emptyset$||$\mathcal {J}_{DR} = \mathcal {I}_4$||$\mathcal {J}_R = \mathcal {J}_{PR} \cup \mathcal {J}_{DR}$|
|$\mathcal {I}_P = \mathcal {I}_{PB} \cup \mathcal {I}_{PR}$||$\mathcal {I}_D = \emptyset$||$\mathcal {I}$||$\mathcal {J}_P = \mathcal {J}_{PB} \cup \mathcal {J}_{PR}$||$\mathcal {J}_D = \mathcal {J}_{DB} \cup \mathcal {J}_{DR}$||$\mathcal {J}$|
Phase-onlyFull-complex
Sign|$\frac{1}{N}$||$\frac{1}{N^2}$|all|$\frac{1}{N}$||$\frac{1}{N^2}$|all
+|$\mathcal {I}_{PB} = \bigcup _{i=1}^3 \mathcal {I}_i$||$\mathcal {I}_{DB} = \emptyset$||$\mathcal {I}_B = \mathcal {I}_{PB} \cup \mathcal {I}_{DB}$||$\mathcal {J}_{PB} = \bigcup _{i=1}^3 \mathcal {I}_i$||$\mathcal {J}_{DB} = \emptyset$||$\mathcal {J}_B = \mathcal {J}_{PB} \cup \mathcal {J}_{DB}$|
|$\mathcal {I}_{PR} = \bigcup _{i=1}^3 \mathcal {I}_i^\prime$||$\mathcal {I}_{DR} = \emptyset$||$\mathcal {I}_R = \mathcal {I}_{PR} \cup \mathcal {I}_{DR}$||$\mathcal {J}_{PR} = \emptyset$||$\mathcal {J}_{DR} = \mathcal {I}_4$||$\mathcal {J}_R = \mathcal {J}_{PR} \cup \mathcal {J}_{DR}$|
|$\mathcal {I}_P = \mathcal {I}_{PB} \cup \mathcal {I}_{PR}$||$\mathcal {I}_D = \emptyset$||$\mathcal {I}$||$\mathcal {J}_P = \mathcal {J}_{PB} \cup \mathcal {J}_{PR}$||$\mathcal {J}_D = \mathcal {J}_{DB} \cup \mathcal {J}_{DR}$||$\mathcal {J}$|

Using Mpq, we can map the sets of baseline indices to sets of ghost positions. Interestingly enough, when the array is not redundant then Mpq is an isomorphism, i.e. it is structure preserving. If we inspect Table 1 whilst keeping the aforementioned fact in mind, we realize that |$\mathcal {T}_{pq} = M_{pq} \left(\mathcal {I}\right)$| can be divided into the following mutually exclusive subsets: |$\mathcal {T}_{pq}^{PR}$|⁠, |$\mathcal {T}_{pq}^{PB}$|⁠, |$\mathcal {T}_{pq}^{DR}$| and |$\mathcal {T}_{pq}^{DB}$|⁠. We can use a similar argument to divide the set |$\mathcal {S}_{pq} = M_{pq} \left(\mathcal {J}\right)\cup \lbrace \boldsymbol {0}\rbrace$| into four mutually exclusive subsets, namely |$\mathcal {S}_{pq}^{PR}$|⁠, |$\mathcal {S}_{pq}^{PB}$|⁠, |$\mathcal {S}_{pq}^{DR}$| and |$\mathcal {S}_{pq}^{DB}$|⁠. The per-baseline ghosts can therefore by extension, also be divided into four categories, namely red proto-ghosts (associated with |$\mathcal {T}_{pq}^{PR}$| and |$\mathcal {S}_{pq}^{PR}$|⁠), blue proto-ghosts (associated with |$\mathcal {T}_{pq}^{PB}$| and |$\mathcal {S}_{pq}^{PB}$|⁠), red deutero-ghosts (associated with |$\mathcal {T}_{pq}^{DR}$| and |$\mathcal {S}_{pq}^{DR}$|⁠) and blue deutero-ghosts (associated with |$\mathcal {T}_{pq}^{DB}$| and |$\mathcal {S}_{pq}^{DB}$|⁠). The augmented per-baseline ghost taxonomy is summarized in Table 2.

Table 2.

Comparison between the index sets of ghost locations produced by phase-only calibration and full-complex calibration using the augmented taxonomy with blue and red proto-ghosts and blue and red deutero-ghosts. Since Mpq is structure preserving, the entries in the last row of each subtable can also be obtained by taking the union of all the other entries in the column in which they reside. The same applies to the entries in the last column of each subtable. Note that, in the table, we have used the shorthand |$\tilde{0}$| to denote |$\lbrace \boldsymbol {0}\rbrace$|⁠.

Phase-onlyFull-complex
ColourProtoDeuteroallProtoDeuteroall
B|$\mathcal {T}_{pq}^{PB} = M_{pq} \left(\mathcal {I}_{PB} \right)$||$\mathcal {T}_{pq}^{DB} = \emptyset$||$\mathcal {T}_{pq}^B = M_{pq}(\mathcal {I}_B)$||$\mathcal {S}_{pq}^{PB} = M_{pq} \left(\mathcal {J}_{PB} \right) \cup \tilde{0}$||$\mathcal {S}_{pq}^{DB} = \emptyset$||$\mathcal {S}_{pq}^B =M_{pq}\left(\mathcal {J}_B \right)\cup \tilde{0}$|
R|$\mathcal {T}_{pq}^{PR} = M_{pq} \left(\mathcal {I}_{PR} \right)$||$\mathcal {T}_{pq}^{DR} = \emptyset$||$\mathcal {T}_{pq}^R = M_{pq}(\mathcal {I}_R)$||$\mathcal {S}_{pq}^{PR} = \emptyset$||$\mathcal {S}_{pq}^{DR} = M_{pq} \left(\mathcal {J}^{DR} \right)$||$\mathcal {S}_{pq}^R = M_{pq}\left(\mathcal {J}_R \right)$|
|$\mathcal {T}_{pq}^P = M_{pq} \left(\mathcal {I}_P \right)$||$\mathcal {T}_{pq}^D = \emptyset$||$\mathcal {T}_{pq} = M_{pq} \left(\mathcal {I}\right)$||$\mathcal {S}_{pq}^P= M_{pq} \left(\mathcal {J}_P \right)\cup \tilde{0}$||$\mathcal {S}_{pq}^D= M_{pq} \left(\mathcal {J}_D \right)$||$\mathcal {S}_{pq}=M_{pq} \left(\mathcal {J}\right)\cup \tilde{0}$|
Phase-onlyFull-complex
ColourProtoDeuteroallProtoDeuteroall
B|$\mathcal {T}_{pq}^{PB} = M_{pq} \left(\mathcal {I}_{PB} \right)$||$\mathcal {T}_{pq}^{DB} = \emptyset$||$\mathcal {T}_{pq}^B = M_{pq}(\mathcal {I}_B)$||$\mathcal {S}_{pq}^{PB} = M_{pq} \left(\mathcal {J}_{PB} \right) \cup \tilde{0}$||$\mathcal {S}_{pq}^{DB} = \emptyset$||$\mathcal {S}_{pq}^B =M_{pq}\left(\mathcal {J}_B \right)\cup \tilde{0}$|
R|$\mathcal {T}_{pq}^{PR} = M_{pq} \left(\mathcal {I}_{PR} \right)$||$\mathcal {T}_{pq}^{DR} = \emptyset$||$\mathcal {T}_{pq}^R = M_{pq}(\mathcal {I}_R)$||$\mathcal {S}_{pq}^{PR} = \emptyset$||$\mathcal {S}_{pq}^{DR} = M_{pq} \left(\mathcal {J}^{DR} \right)$||$\mathcal {S}_{pq}^R = M_{pq}\left(\mathcal {J}_R \right)$|
|$\mathcal {T}_{pq}^P = M_{pq} \left(\mathcal {I}_P \right)$||$\mathcal {T}_{pq}^D = \emptyset$||$\mathcal {T}_{pq} = M_{pq} \left(\mathcal {I}\right)$||$\mathcal {S}_{pq}^P= M_{pq} \left(\mathcal {J}_P \right)\cup \tilde{0}$||$\mathcal {S}_{pq}^D= M_{pq} \left(\mathcal {J}_D \right)$||$\mathcal {S}_{pq}=M_{pq} \left(\mathcal {J}\right)\cup \tilde{0}$|
Table 2.

Comparison between the index sets of ghost locations produced by phase-only calibration and full-complex calibration using the augmented taxonomy with blue and red proto-ghosts and blue and red deutero-ghosts. Since Mpq is structure preserving, the entries in the last row of each subtable can also be obtained by taking the union of all the other entries in the column in which they reside. The same applies to the entries in the last column of each subtable. Note that, in the table, we have used the shorthand |$\tilde{0}$| to denote |$\lbrace \boldsymbol {0}\rbrace$|⁠.

Phase-onlyFull-complex
ColourProtoDeuteroallProtoDeuteroall
B|$\mathcal {T}_{pq}^{PB} = M_{pq} \left(\mathcal {I}_{PB} \right)$||$\mathcal {T}_{pq}^{DB} = \emptyset$||$\mathcal {T}_{pq}^B = M_{pq}(\mathcal {I}_B)$||$\mathcal {S}_{pq}^{PB} = M_{pq} \left(\mathcal {J}_{PB} \right) \cup \tilde{0}$||$\mathcal {S}_{pq}^{DB} = \emptyset$||$\mathcal {S}_{pq}^B =M_{pq}\left(\mathcal {J}_B \right)\cup \tilde{0}$|
R|$\mathcal {T}_{pq}^{PR} = M_{pq} \left(\mathcal {I}_{PR} \right)$||$\mathcal {T}_{pq}^{DR} = \emptyset$||$\mathcal {T}_{pq}^R = M_{pq}(\mathcal {I}_R)$||$\mathcal {S}_{pq}^{PR} = \emptyset$||$\mathcal {S}_{pq}^{DR} = M_{pq} \left(\mathcal {J}^{DR} \right)$||$\mathcal {S}_{pq}^R = M_{pq}\left(\mathcal {J}_R \right)$|
|$\mathcal {T}_{pq}^P = M_{pq} \left(\mathcal {I}_P \right)$||$\mathcal {T}_{pq}^D = \emptyset$||$\mathcal {T}_{pq} = M_{pq} \left(\mathcal {I}\right)$||$\mathcal {S}_{pq}^P= M_{pq} \left(\mathcal {J}_P \right)\cup \tilde{0}$||$\mathcal {S}_{pq}^D= M_{pq} \left(\mathcal {J}_D \right)$||$\mathcal {S}_{pq}=M_{pq} \left(\mathcal {J}\right)\cup \tilde{0}$|
Phase-onlyFull-complex
ColourProtoDeuteroallProtoDeuteroall
B|$\mathcal {T}_{pq}^{PB} = M_{pq} \left(\mathcal {I}_{PB} \right)$||$\mathcal {T}_{pq}^{DB} = \emptyset$||$\mathcal {T}_{pq}^B = M_{pq}(\mathcal {I}_B)$||$\mathcal {S}_{pq}^{PB} = M_{pq} \left(\mathcal {J}_{PB} \right) \cup \tilde{0}$||$\mathcal {S}_{pq}^{DB} = \emptyset$||$\mathcal {S}_{pq}^B =M_{pq}\left(\mathcal {J}_B \right)\cup \tilde{0}$|
R|$\mathcal {T}_{pq}^{PR} = M_{pq} \left(\mathcal {I}_{PR} \right)$||$\mathcal {T}_{pq}^{DR} = \emptyset$||$\mathcal {T}_{pq}^R = M_{pq}(\mathcal {I}_R)$||$\mathcal {S}_{pq}^{PR} = \emptyset$||$\mathcal {S}_{pq}^{DR} = M_{pq} \left(\mathcal {J}^{DR} \right)$||$\mathcal {S}_{pq}^R = M_{pq}\left(\mathcal {J}_R \right)$|
|$\mathcal {T}_{pq}^P = M_{pq} \left(\mathcal {I}_P \right)$||$\mathcal {T}_{pq}^D = \emptyset$||$\mathcal {T}_{pq} = M_{pq} \left(\mathcal {I}\right)$||$\mathcal {S}_{pq}^P= M_{pq} \left(\mathcal {J}_P \right)\cup \tilde{0}$||$\mathcal {S}_{pq}^D= M_{pq} \left(\mathcal {J}_D \right)$||$\mathcal {S}_{pq}=M_{pq} \left(\mathcal {J}\right)\cup \tilde{0}$|

We can make a number of interesting observations from Table 2. First, phase-only calibration does not produce deutero-ghosts, which implies that the flux of all ghosts only decreases in proportion to the number of antennas in the array. Moreover, phase-only calibration produces an equal amount of red (potential positive/amplification) and blue (potential negative/suppressing)3 ghosts. In the case of full-complex calibration, all red ghosts are deutero-ghosts, while all blue ghosts are proto-ghosts. As a result, the red ghosts will typically be less bright in the case of full-complex calibration than they will be in the case of phase-only calibration. Moreover, the red ghosts will be less bright than the blue ghosts in the case of full-complex calibration, while they are of similar brightness in the case of phase-only calibration.

Note that our classification of ghosts assumes that the array is not redundant. If the array is redundant, some per-baseline ghosts would have the same position vector and would therefore be added together; changing their characteristics. This would require a more elaborate classification scheme, which we do not pursue here, since it not necessary to understand the symmetry relationships that exist between pairs of ghosts.

5 COMPARISON

In this section. we compare the full-complex ghost response of Paper II with the phase-only results we obtained in Section 4. We first consider the ghost pattern of an individual baseline in Section 5.1 before studying the conglomerated patterns in Section 5.2. All of the results in this section were obtained using the seven dish Karoo Array Telescope (KAT-7) array layout. KAT-7 is a precursor to the MeerKAT array and is located in the Karoo (South Africa). The layout of this array was depicted in Paper II. Moreover, we only consider the two-source test case from Section 3.1 with the unmodelled source being 0.2 Jy at (1°, 0°), δ0 = −74| $_{.}^{\circ}$|66 and ν = 1.445 GHz. We denote the observational frequency with ν.

5.1 Per-baseline Results

In Fig. 3, we present extrapolated artefact maps which were created by using the KAT-7 array layout. More specifically, we have plotted |$\mathcal {F}^{-1}\lbrace g_{45}^{-1}(\widehat{X}_{45}^{-1}(\boldsymbol {b}))-1\rbrace$| and |$\mathcal {F}^{-1}\lbrace \rho _{45}^{-1}(\widehat{X}_{45}^{-1}(\boldsymbol {b}))-1\rbrace$| in Figs 3(a) and (b) and Figs 3(d) and (e), respectively. The inverse Fourier transform is denoted by |$\mathcal {F}^{-1}\lbrace \cdot \rbrace$|⁠. Figs 3(c) and (f) show the theoretical ghost patterns based on the index sets summarized in Table 2. The theoretical ghost positions shown in Figs 3(c) and (f) help us visualize the similarities and differences that exist between full-complex calibration and phase-only calibration. The brightest ghosts in the artefact maps match perfectly with the theoretical ghost positions, indicating that the results from raw extrapolation and perturbation align well.

The left-hand and middle column show the real (left-hand column) and imaginary (middle-hand column) part of the extrapolated artefact map for baseline $\boldsymbol {b}_{45}$ obtained for full-complex calibration (top row) and phase-only calibration (bottom row). These maps were created by using KAT-7. The modelled source was in the centre and the unmodelled source (0.2 Jy) was at (1°, 0°). Moreover, δ0 = −74 $_{.}^{\circ}$66 and ν = 1.445 Ghz. The two panels in the right-hand column show the theoretical ghost patterns for full-complex (top) and phase-only (bottom) calibration for this scenario.
Figure 3.

The left-hand and middle column show the real (left-hand column) and imaginary (middle-hand column) part of the extrapolated artefact map for baseline |$\boldsymbol {b}_{45}$| obtained for full-complex calibration (top row) and phase-only calibration (bottom row). These maps were created by using KAT-7. The modelled source was in the centre and the unmodelled source (0.2 Jy) was at (1°, 0°). Moreover, δ0 = −74| $_{.}^{\circ}$|66 and ν = 1.445 Ghz. The two panels in the right-hand column show the theoretical ghost patterns for full-complex (top) and phase-only (bottom) calibration for this scenario.

We can draw the following conclusions from Table 2 and Fig. 3.

  • Proto-ghosts: full-complex calibration only produces blue proto-ghosts. In contrast, phase-only calibration produces an equal number of red and blue proto-ghosts. The blue and red proto-ghosts form symmetrically opposite each other around the modelled source. Moreover, the set of blue proto-ghosts that are created by phase-only calibration and full-complex calibration are almost equal.

  • Deutero-ghosts: all the deutero-ghosts that are produced by full-complex calibration are red. No deutero-ghosts are generated by phase-only calibration. The position vector set associated with the red proto-ghosts is a subset of the position vector set that is associated with the full-complex deutero-ghosts, i.e. |$\mathcal {T}_{pq}^{PR}\subset \mathcal {S}_{pq}^{DR}$|⁠. We therefore produce fewer first-order ghosts, i.e. the ghosts associated with the first-order perturbation of the visibility gains, with phase-only calibration when compared with full-complex calibration. We do, however, produce more proto-ghosts.

In Paper II, we realized that the following three ghosts were exceptionally bright.

  • The primary suppressor: the primary suppressor forms on top of the modelled source, and it can therefore suppress the modelled source. It is important to mention here that, in the full-complex case, the primary suppressor only forms when we incorporate the autocorrelations during calibration, which is rarely done in practice (Paper II). The primary suppressor forms at |$\boldsymbol {0}$|⁠.

  • The secondary suppressor: the secondary suppressor lies on top of the unmodelled source, i.e. at |$\boldsymbol {s}_0$|⁠. This ghost can suppress the unmodelled source.

  • The antighost: the antighost appears symmetrically opposite the secondary suppressor around the primary suppressor, i.e. at |$-\boldsymbol {s}_0$|⁠.

Concentrating on these specific sources in Table 2 and Fig. 3, we find:

  • Primary suppressor: the primary suppressor is a blue proto-ghost, but only forms if full-complex calibration is employed. Mathematically speaking |$\mathcal {S}_{pq}^{PB}\backslash \mathcal {T}_{pq}^{PB} = \lbrace \boldsymbol {0}\rbrace$|⁠. We denote set-theoretic difference by ‘\’. Intuitively, this is in agreement with the fact that the primary suppressor disappears when the autocorrelations are ignored in full-complex calibration, since the autocorrelations do not contain phase information. As we showed in Paper II, the primary suppressor is completely real, and since it is blue, it will appear as a negative source. This can be confirmed by inspecting Figs 3(a) and (b).

  • Secondary suppressor and its antighost: in the case of full-complex calibration, the secondary suppressor is a blue proto-ghost, and its antighost is a red deutero-ghost. As is the case with full-complex calibration, the phase-only secondary suppressor is also a blue proto-ghost, but its antighost is a red proto-ghost. This explains why the phase-only antighost is so much brighter than the full-complex antighost in Fig. 3. In Paper II, we showed that the secondary suppressor and its antighost are real valued. The fact that the secondary suppressor is real valued and blue explains why the secondary suppressor ends up being a negative ghost. The antighost of the secondary suppressor manifests as a positive ghost, because it is real valued and red.

5.2 Conglomerated results

In the previous section we only looked at the per-baseline ghost response but, as we mentioned in Section 4.6, the conglomerated ghost pattern and the ghost pattern of each individual baseline are completely different. It turns out that some ghost add up coherently for all baselines while others do not. In Paper II, we developed the following taxonomy for ghosts in the conglomerated pattern to address this issue.

  • Line ghosts: all ghosts that form at |$\tilde{\boldsymbol {s}}\in \boldsymbol {s}_0\bf{Z}$| are line ghosts. Line ghosts are present in each baseline and therefore add up coherently when we image.

  • Scattered ghosts: if a ghost is not a line ghost, it is a scattered ghost. In the non-redundant case, scattered ghosts are only present in one baseline, which is why they appear dimmer in the conglomerated ghost pattern.

The extrapolated conglomerated ghost patterns for full-complex and phase-only calibration for the full KAT-7 array are shown in Figs 4(a) and (b), respectively. These maps are clearly different from the per-baseline results shown in Fig. 3. One of the major differences is that the secondary suppressor and its antighost are much brighter relative to the remaining background ghosts.

Conglomerated ghost pattern for full-complex calibration (top) and phase-only calibration (bottom). The suppression ghost, the ghost forming on top of the unmodelled source, is brighter in the top panel than in the bottom panel. In the case of the antighost, the ghost forming symmetrically opposite the suppression ghost around the modelled source, we observe the exact opposite. Moreover, fewer bright ghosts are visible in the bottom panel than in the top panel.
Figure 4.

Conglomerated ghost pattern for full-complex calibration (top) and phase-only calibration (bottom). The suppression ghost, the ghost forming on top of the unmodelled source, is brighter in the top panel than in the bottom panel. In the case of the antighost, the ghost forming symmetrically opposite the suppression ghost around the modelled source, we observe the exact opposite. Moreover, fewer bright ghosts are visible in the bottom panel than in the top panel.

The fluxes of the three most important line ghosts in the conglomerated ghost pattern are shown in Figs 5(a) and 6(a). The scattered equivalents of Figs 5(a) and 6(a) are depicted in Figs 5(b) and 6(b). Figs 5 and 6 were produced by employing full-complex and phase-only calibration, respectively. To be more descriptive, Figs 5(a) and (b) and Figs 6(a) and 6(b) illustrate the full-complex equivalent of equation (54) (which can be found in Paper II) and equation (54), respectively. Similarly, Figs 5(c) and (d) and Figs 6(c) and (d) were created by employing the full-complex equivalent of equations (56) and (56), respectively.

The relative amplitude of the primary suppressor, secondary suppressor and its antighost on the left and the relative amplitude of a random proto-ghost and its antighost on the right. These curves were created by employing full-complex calibration. The dashed line represents the theoretically derived values that we obtained with perturbation, while the solid line represents a measured value which was obtained from a clean artefact map that was created via extrapolation.
Figure 5.

The relative amplitude of the primary suppressor, secondary suppressor and its antighost on the left and the relative amplitude of a random proto-ghost and its antighost on the right. These curves were created by employing full-complex calibration. The dashed line represents the theoretically derived values that we obtained with perturbation, while the solid line represents a measured value which was obtained from a clean artefact map that was created via extrapolation.

The relative amplitude of the primary suppressor, secondary suppressor and its antighost on the left and the relative amplitude of a random proto-ghost and its antighost on the right. These curves were created by employing phase-only calibration. The dashed line represents the theoretically derived values that we obtained with perturbation, while the solid line represents a measured value which was obtained from a clean artefact map that was created via extrapolation.
Figure 6.

The relative amplitude of the primary suppressor, secondary suppressor and its antighost on the left and the relative amplitude of a random proto-ghost and its antighost on the right. These curves were created by employing phase-only calibration. The dashed line represents the theoretically derived values that we obtained with perturbation, while the solid line represents a measured value which was obtained from a clean artefact map that was created via extrapolation.

In Figs 5(a) and 6(a), the blue curve represents the primary suppression ghost, the red curve the secondary suppression ghost and the green curve the antighost of the secondary suppressor. In Fig. 5(b), the blue curve represents a random blue full-complex proto-ghost, while its antighost is depicted in red. Its antighost is a red deutero-ghost. In Fig. 6(b), the blue curve is associated with a random blue phase-only proto-ghost, the red curve is associated with its antighost, a red proto-ghost.

The dashed curves in Figs 5 and 6 are the perturbation theoretical estimates we derived in Section 4.2 and in Paper II. In effect, we plot equations (33) and (59). The fact, that the results we obtained from perturbation and raw extrapolation agree, supports the validity of the first-order perturbation derivation in Section 4. The colours of the dashed lines indicate with which source we can associate each theoretical curve. In Fig. 6, we use black dashed lines instead, because in the case of phase-only calibration, perturbation predicts the same absolute flux value for every ghost and antighost pair. This can be verified by inspecting equations (33) and (42). Perturbation also predicts that the primary suppressor will always have zero flux. This explains why the dashed blue line is absent in Fig. 6.

The primary suppressor is, on average, 14 per cent of A2 and 2.5 per cent of A2 when we employ full-complex and phase-only calibration, respectively. Moreover, in the case of full-complex calibration, the secondary suppressor and its antighost are, on average, 30 per cent of A2 and 2 per cent of A2, respectively. For the phase-only setup, we find that, on average, both the secondary suppressor and its antighost are 15 per cent of A2.

A random blue proto-ghost is, on average, 0.7 per cent of A2 and 0.35 per cent of A2 when we perform full-complex and phase-only calibration, respectively. The blue proto-ghost's antighost is, on average, 0.1 per cent of A2 and 0.35 per cent of A2 when we use the full-complex and phase-only calibration algorithms, respectively.

Also note that correcting our visibilities does not significantly impact the brightness of our ghosts (compare the top panel of Fig. 5 with the bottom panel in Fig. 5). The slight differences between the top and bottom panel in Fig. 5 are discussed in Paper I and II. The same reasoning can be applied to Fig. 6.

We can reach the following conclusions by comparing Figs 4(a), (b), 5 and 6 with each other.

  • Line ghosts: the primary suppressor, the secondary suppressor and its antighost are all line ghosts, which is why they are so much brighter than the other ghosts in Fig. 4. According to perturbation theory, the primary suppressor only forms when performing full-complex calibration. However, Fig. 6 clearly indicates that a phase-only primary suppressor does form when A2 becomes large enough. The phase-only primary suppressor, however, is much dimmer than its full-complex counterpart. Moreover, the phase-only antighost in Figs 4(b) and 6 is much brighter than the antighost in Figs 4(a) and 5, since the antighost which is produced by phase-only calibration is a proto-ghost, while the antighost produced by full-complex calibration is a deutero-ghost. The full-complex secondary suppressor is also brighter than its phase-only counterpart. This can be explained by comparing equations (33) and (59). These two equations tell us that the secondary suppressor is proportional to 2/N and 1/N when we employ full-complex and phase-only calibration, respectively.

  • Scattered ghosts: we see that we produce less scattered ghosts with phase-only calibration than with full-complex calibration, since we do not produce any deutero-ghosts while performing phase-only calibration. Furthermore, in the case of phase-only calibration, the blue and red symmetric ghost pairs in Fig. 6 are of the same order of magnitude (both are proto-ghosts). This is not true when performing full-complex calibration (Fig. 3c clearly shows that at least one of the ghosts in each symmetric pair is a deutero-ghost). This follows trivially from equation (42), which is only true for phase-only calibration. Moreover, the full-complex proto-ghosts in Fig. 5 are brighter than the phase-only proto-ghosts in Fig. 6, which is corroborated by equations (33) and (59).

6 PRIMARY BEAM

In this section, we replicate and then try to explain the synthetic anti-ghost experiment in Stewart (2014), which we discussed in detail in Section 1 (see also Fig. 1). In Section 6.1, we will modify Stewart's experiment slightly to be able to explain his results using just a single graph. Our findings are discussed in detail in Section 6.2. Although the strange behaviour of the antighost appears to contradict some of the results in Paper II and Section 4, we will see that the primary beam correction causes the apparent dependence of the antighost's brightness on the position of the unmodelled source. The results in this section were obtained using standard reduction software packages and techniques. We provide the reader with more implementation specifics in Section 6.1.

6.1 Spiral experiment

In this section, we present an experiment that can shed some light on Stewart's puzzling result (see Section 1). We actually discuss a simple, modified version of Stewart's experiment below. Instead of sampling the locations of the unmodelled source from a rectangular grid, we propose that the possible locations of the unmodelled source all lie on a spiral. As will become apparent later, altering Stewart's experiment in this manner allows us to create a one-dimensional plot which can help us shed some light on Stewart's findings.

Just as with Stewart's experiment, we decided to conduct our experiment with the LOFAR antenna layout. To mirror Stewart's original experiment as much as possible, we used 11 min observations. Moreover, the field centre and the NCP coincided. The measurement set we used was created assuming 33 LOFAR stations.4 We also assumed a monochromatic observation at a frequency of ν = 58.66MHz.

Furthermore, we only considered a two-source scenario. We place an 88.7 Jy source at (02h22m52|${^{\rm s}_{.}}$|1496, 86°18′59|${^{\prime\prime}_{.}}$|184)5. This bright source is included in our calibration model. We also introduce a second source of 60 Jy to our simulation. This second source is not included in our calibration model. Moreover, the modelled source is placed at the same location as 3C 61.1.

We start our experiment by placing the unmodelled source at the field centre. We then generate the visibilities associated with the modelled and unmodelled source, which we then store in the measurement set. The modelled and unmodelled source were attenuated by the array's primary beam before the visibilities were generated. The effect of beam rotation was ignored. We then perform phase-only calibration. We then image and clean the resulting corrected visibilities and measure the apparent and intrinsic flux of the modelled source, the unmodelled source and the antighost.

The intrinsic flux density of a source is the true flux value of a specific source and its apparent flux is the flux it appears to have after it has been attenuated by the observing array's primary beam response.

It is important to note here that we apply the aforementioned terminology to both the input/a priori models that enabled us to create the visibilities that we used, and the output/a posteriori simulated images that we generated, during the spiral experiment in the discussion that is found in Section 6.2.

We can now change the position of the unmodelled source and then repeat the aforementioned simulation, calibration and imaging steps to generate another simulated observation. If we continue in this way, varying the position of the unmodelled source followed by the generation of a simulated observation, we will be able to produce multiple snapshot images (the unmodelled source will be in a different location in every snapshot).

The unmodelled source can be placed at completely random positions or it can follow a deterministic trajectory. We opted for a deterministic trajectory. During the course of the spiral experiment, the unmodelled source follows an Archimedean spiral trajectory, which can be described by the equation
(61)
where r is the radial distance (in degrees) of the unmodelled source from the field centre and θ is the polar angle of the spiral. In equation (61), θ is a positive real number and is measured in radians. The results we obtained by performing this experiment are presented in Section 6.2.

The red spiral in Fig. 7(a) depicts the trajectory that the unmodelled source follows during the experiment. The antighost, in turn, follows the blue trajectory in Fig. 7(a). Moreover, the LOFAR primary beam which we used during the spiral experiment is presented in Fig. 7(a).

The source trajectories that the unmodelled source and its antighost follow during the spiral experiment are depicted in Fig. 7(a). The LOFAR primary beam model, which we used in the spiral experiment, is also shown in Fig. 7(a). Figs 7(b)–(d) depict three distinct spiral experiment snapshots. The radial distance the unmodelled source is from the field centre is given in the individual sub-captions of the three figures. The contours of the LOFAR primary beam model are depicted in Figs 7(a)–(d) with dashed yellow lines. The positions of the modelled source, the unmodelled source and the antighost in Figs 7(b)–(d) are indicated with labeled circular markers. The labels M, U, and G were used to tag the modelled source, the unmodelled source and the antighost, respectively. The positions of the modelled source, the unmodelled source and the antighost in each of the three snapshots can also be found in Fig. 7(a) and are indicated with circular markers. The colour of the marker should be used as a key to identify to which snapshot each position belongs.
Figure 7.

The source trajectories that the unmodelled source and its antighost follow during the spiral experiment are depicted in Fig. 7(a). The LOFAR primary beam model, which we used in the spiral experiment, is also shown in Fig. 7(a). Figs 7(b)–(d) depict three distinct spiral experiment snapshots. The radial distance the unmodelled source is from the field centre is given in the individual sub-captions of the three figures. The contours of the LOFAR primary beam model are depicted in Figs 7(a)–(d) with dashed yellow lines. The positions of the modelled source, the unmodelled source and the antighost in Figs 7(b)–(d) are indicated with labeled circular markers. The labels M, U, and G were used to tag the modelled source, the unmodelled source and the antighost, respectively. The positions of the modelled source, the unmodelled source and the antighost in each of the three snapshots can also be found in Fig. 7(a) and are indicated with circular markers. The colour of the marker should be used as a key to identify to which snapshot each position belongs.

Fig. 7 also contains three other images: Figs 7(b)–(d). Each of these figures represents a spiral experiment snapshot. Figs 7(b)–(d) were obtained by placing the unmodelled source 0| $_{.}^{\circ}$|32, 3| $_{.}^{\circ}$|08 and 3| $_{.}^{\circ}$|48 away from the field centre, respectively. The antighost is clearly visible in Figs 7(b) and (d), but is not visible in Fig. 7(c). This confirms Stewart's observation (see Section 1) that the brightness of the antighost is highly dependent on the position of the unmodelled source. The contours of the primary beam that was used in this experiment are depicted in Fig. 7 with yellow dashed lines.

Most of the software we used to perform the spiral experiment were created from first principals, i.e. we created a custom built software package which could perform the following functions.

  • The creation of the visibilities associated with the modelled and the unmodelled source.

  • The calibration task: a phase-only variant of the StEFCal calibration algorithm was implemented (Salvini & Wijnholds 2014).

  • The incorporation of the primary beam into the experiment's pipeline.

  • The source finding task.

We also employed lwimager (an FFT-based imager derived from the casa (Common Astronomy Software Applications)6 libraries and functionally equivalent to the casa imager).

6.2 Spiral experiment results

6.2.1 Preliminary observations

The results of the spiral experiment are shown in Fig. 8. We find the different flux values of the modelled source, the unmodelled source and the antighost for the spiral experiment as a function of the radial distance r (see equation 61). The line-types indicate which curves can be associated with which source: the dotted curves, the dashed curves and the solid curves in Fig. 8 can be associated with the modelled source, the unmodelled source and the antighost, respectively. The only exception is the solid magenta line, which denotes the beam gain the antighost experiences when the unmodelled source is r degrees from the field centre (and is not measured in Jy, but is a unitless gain value).

The flux values of the spiral experiment for the modelled source, the unmodelled source and the antighost as a function of the radial distance r (the distance the unmodelled source is located from the field centre). The curves that are associated with the modelled source, the unmodelled source and the antighost are depicted by dotted, dashed and solid lines, respectively. Four curves are associated with each source, each one depicted in a different colour. The blue curves represent the true intrinsic flux of a source, the red curves depict the true apparent flux of a source, the green curves represent the measured apparent flux values of a source and the black curves denote the measured intrinsic flux of a source. The solid magenta line is an exception and denotes the beam gain the antighost experiences when the unmodelled source is r degrees from the field centre. It is not measured in Jy, but is actually a unitless quantity. The yellow blocks represent regions where the radial distance the ghost is from the field centre is larger than 9°. The graph is also divided into two regions, a red region (r ≤ 2 $_{.}^{\circ}$5) and a blue region (r > 2 $_{.}^{\circ}$5). The antighost appears much brighter in the red region than in the blue region on average. The coloured vertical dashed lines indicate the values of r at which the snapshots in Fig. 7 were made.
Figure 8.

The flux values of the spiral experiment for the modelled source, the unmodelled source and the antighost as a function of the radial distance r (the distance the unmodelled source is located from the field centre). The curves that are associated with the modelled source, the unmodelled source and the antighost are depicted by dotted, dashed and solid lines, respectively. Four curves are associated with each source, each one depicted in a different colour. The blue curves represent the true intrinsic flux of a source, the red curves depict the true apparent flux of a source, the green curves represent the measured apparent flux values of a source and the black curves denote the measured intrinsic flux of a source. The solid magenta line is an exception and denotes the beam gain the antighost experiences when the unmodelled source is r degrees from the field centre. It is not measured in Jy, but is actually a unitless quantity. The yellow blocks represent regions where the radial distance the ghost is from the field centre is larger than 9°. The graph is also divided into two regions, a red region (r ≤ 2| $_{.}^{\circ}$|5) and a blue region (r > 2| $_{.}^{\circ}$|5). The antighost appears much brighter in the red region than in the blue region on average. The coloured vertical dashed lines indicate the values of r at which the snapshots in Fig. 7 were made.

In Fig. 8, there are four different-coloured curves for each of the three aforementioned sources/line-types (i.e. there are 12 curves in total). The blue and red curves denote the true intrinsic and the true apparent flux values of the sources with which they are associated, i.e. true simulated model and true primary beam attenuated simulated model flux values. The values of the red and blue curves are known a priori. The solid red and blue line, however, warrant further explanation. The solid red line represents the theoretical flux density of the antighost, as derived in Section 4, and the solid blue curve represents the product of the primary beam gain (magenta curve) and the theoretical flux density of the antighost, i.e. the effect that the primary beam had on the theoretical flux density of the antighost once we correct for it. The green and black curves denote the measured intrinsic and the apparent flux values of the sources with which they are associated, i.e. raw measured flux values and primary beam corrected measured flux values. The green and black curve values were obtained from the images produced by the spiral experiment. To summarize, the red and blue curves can be interpreted as theoretical predictions, i.e. as the values that are fed into the spiral experiment, while the green and black curves can be interpreted as observed values, i.e. the quantities that are produced by the spiral experiment.

The yellow bars in Fig. 8 indicate regions where the antighost lies outside the boundaries of the images that were created by the spiral experiment. This explains why the yellow regions do not contain any solid curves.

Note that, since the modelled source remains at the same location during the experiment, its intrinsic flux curves are scalar multiples of its apparent flux curves. This assertion can be verified by comparing the dotted blue curve with the dotted red curve. A similar conclusion can be drawn by comparing the dotted black curve with the dotted green curve. In contrast, the apparent flux of the unmodelled source slowly dims as it spirals outwards due to the array's primary beam response. This can be verified by inspecting the dashed red curve.

We have indicated two regions in Fig. 8. In the red region with r ≤ 2| $_{.}^{\circ}$|5, the apparent flux of the unmodelled source is larger than the apparent flux of the modelled source, i.e. the red dashed line is above the red dotted line. In this region, the assumption A2 ≪ A1 of our perturbation analysis is clearly violated. The opposite is true in the blue region with r > 2| $_{.}^{\circ}$|5. Not unexpectedly, we find that when the apparent flux of the unmodelled source is larger than the modelled source, we get a very different ghost response than the well-studied case in which the apparent flux of the unmodelled source is smaller than the apparent flux of the modelled source.

6.2.2 Model accuracy

The fact that the ghost response is different in the red and blue regions of Fig. 8 is easier to spot if Fig. 9 is inspected instead. In Fig. 9(a), we have the measured value of the secondary suppressor (as a per cent of A2) as a function of the radial distance the unmodelled source is from the field centre. In Fig. 9(b), we have the measured value of the antighost (as a per cent of A2) as a function of the radial distance the unmodelled source is from the field centre. The crosses in each region of Fig. 9 represent actual measured flux values (as a per cent of A2). These values were obtained from the images that were produced during the spiral experiment. The coloured lines in each region represent the mean flux value measured within each region and the filled in block in each region depicts the standard deviation about the mean flux in each region. The black line in both of these figures is the theoretical value of the secondary suppressor and the antighost, as derived in Section 4.

The crosses represent the measured flux values of the secondary suppressor and the anti-ghost (as a per cent of A2) as a function of the radial distance r. The plots are divided into two regions: a red region and a blue region. The coloured horizontal lines represent the mean flux values within each region, while the horizontal filled in blocks represent the standard deviation about the mean within each region. The black line represents the theoretical value of the secondary suppressor and the antighost as predicted in Section 4. In Fig. 9(b), the yellow blocks represent regions where the radial distance the ghost is from the field centre is larger than 9°.
Figure 9.

The crosses represent the measured flux values of the secondary suppressor and the anti-ghost (as a per cent of A2) as a function of the radial distance r. The plots are divided into two regions: a red region and a blue region. The coloured horizontal lines represent the mean flux values within each region, while the horizontal filled in blocks represent the standard deviation about the mean within each region. The black line represents the theoretical value of the secondary suppressor and the antighost as predicted in Section 4. In Fig. 9(b), the yellow blocks represent regions where the radial distance the ghost is from the field centre is larger than 9°.

Inspection of Figs 8 and 9 reveals the following: when the calibration model is accurate enough, i.e. A2 < A1 or r > 2| $_{.}^{\circ}$|5, the measured flux values are close to the theoretical value, which we derived in Section 4. However, when the model becomes too inaccurate, i.e. A2A1 or r ≤ 2| $_{.}^{\circ}$|5, then it deviates completely from the two black lines in Fig. 9. Fig. 9, therefore, shows us that when the unmodelled source is much brighter than the modelled source, we produce a much brighter secondary suppressor and antighost than that predicted in Section 4. This result does not contradict anything in Section 4, because one of the main assumptions that we make in Section 3 is that A2 ≪ A1. A similar result was obtained in Grobler & Smirnov (2015) for the WSRT array. Fig. 9 also shows us that the theory we derived in Section 4 holds up if A2 < A1. We can now use the insight we gained from inspecting Fig. 9 to make sense of the remaining curves in Fig. 8.

6.2.3 Modelled source

The only interesting thing one can conclude by inspecting the curves of the modelled source is that the modelled source seems to be amplified a bit when we calibrate with a very inaccurate model (A2A1), but that this quickly improves when the calibration model becomes more accurate. This conclusion can be reached by comparing either the dotted blue and black curves or the dotted red and green curves with each other.

6.2.4 Unmodelled source

The first thing one notices when inspecting the curves associated with the unmodelled source is that the measured apparent flux curve (dashed green) of the unmodelled source is far below the true apparent flux curve (dashed red) when r ≤ 2| $_{.}^{\circ}$|5. We are now able to explain this as, according to Section 6.2.2, the suppression ghost becomes exceptionally bright if the model is too inaccurate (A2A1). In contrast, when r > 2| $_{.}^{\circ}$|5, we see that the dashed green and dashed red curves are virtually indistinguishable, which can also be explained with the aid of Section 6.2.2. Section 6.2.2 also tells us that if an accurate enough model (A2 < A1) is used during calibration, the unmodelled source is only suppressed slightly if N is large. Moreover, when r ≤ 2| $_{.}^{\circ}$|5, the black dashed line and the green dashed line are fairly close to each other, as the primary beam gain is close to unity when r ≤ 2| $_{.}^{\circ}$|5. As we mentioned previously, when r > 2| $_{.}^{\circ}$|5, the degree of suppression is reduced significantly, which also explains why the black dashed curve ends up lying just below the intrinsic flux value of the unmodelled source (the dashed blue curve).

6.2.5 Antighost

When we inspect the antighost curves, we see that the antighost behaves differently when r ≤ 2| $_{.}^{\circ}$|5 than when r > 2| $_{.}^{\circ}$|5. When r ≤ 2| $_{.}^{\circ}$|5, we are actually using a very inaccurate calibration model (A2A1), implying that the antighost will be exceptionally bright in this region of the graph (see Section 6.2.2). We can confirm this by inspecting the solid green curve in the red region of Fig. 8. The antighost can now be made even brighter if we also correct for the primary beam. This can be verified by inspecting the solid black curve in the red region of Fig. 8.

The combination of these two factors explain why the antighost is clearly visible during the course of the spiral experiment when r ≤ 2| $_{.}^{\circ}$|5. Fig. 7(b) is a snapshot of the spiral experiment. In this snapshot, the unmodelled source is 0| $_{.}^{\circ}$|34 from the field centre. Fig. 7(b) should be used as a visual aid that shows us just how bright the antighost can be when r ≤ 2| $_{.}^{\circ}$|5.

When r > 2| $_{.}^{\circ}$|5, we see that the antighost becomes very dim and becomes close to the theoretically predicted value (the solid red and green lines are close together). This is expected as, when r > 2| $_{.}^{\circ}$|5, we satisfy the criterion A2 < A1 (see Section 6.2.2). However, if we now apply the beam gain, we see that the antighost can still become exceptionally bright even if the ‘apparent’ antighost is fairly weak. This is due to the relative positions of the modelled and unmodelled source. If the relative positions of the modelled and unmodelled source just happens to be of such an orientation that the antighost that forms lies in a region of the primary beam where the primary beam attenuation is high, then the antighost can be significantly amplified when we do the primary beam correction. If the antighost forms in a region of the primary beam where there is little attenuation, then the antighost will remain fairly faint even after we do the primary beam correction. Note that the theoretical ‘intrinsic’ flux curve (the solid blue curve which is associated with the antighost) accurately describes the behaviour of the measured intrinsic antighost flux values (solid black curve).

Fig. 7(c) is a snapshot from the spiral experiment which shows us how hard it can be to spot the antighost when it forms in a region of the primary beam where there is little attenuation. In stark contrast, Fig. 7(d) is a snapshot from the spiral experiment that shows that the antighost can be very bright even if the model is quite accurate, provided the antighost happens to form in a region of the primary beam where the attenuation is high. The radial distances at which the three snapshots were made are depicted in Fig. 8 with three vertical dashed lines.

The most important observation that we can make while inspecting Fig. 8 is that the solid black line is highly correlated with the solid magenta line. This clearly shows the large impact of the primary beam on the visibility of the antighost. This observation helps explain Stewart's original result. Stewart observed that the brightness of the antighost was highly dependent on the location of the unmodelled source, since the location of the unmodelled source determines the location of its antighost, which may thus end up in a spot where the primary beam has hardly any gain.

7 CONCLUSION

In this paper, we showed that spurious symmetrization can be quite bright even if we employ a large number of antennas. We found that two factors working in unison are needed for this to occur: phase-only calibration and the array's primary beam. We showed that phase-only calibration produces a brighter antighost than full-complex calibration. This already bright antighost can, in turn, be further amplified by the primary beam correction. We studied both of these factors in detail. It is crucial for us to be cognisant of these results while reducing transient data, in order to prevent the misclassification of artefacts as sources.

We first present the major results we obtained while investigating phase-only calibration. We used extrapolation (see Section 4.3) in conjunction with perturbation analysis (see Section 4.2) to derive the two-source phase-only ghost pattern, which we then compared to the full-complex pattern in Section 5. We summarize the main conclusions we made in Section 5 below.

  • Bright antighost: phase-only calibration produces a brighter antighost if we compare it to the antighost which forms when we perform full-complex calibration. The reason for this is that, in the case of phase-only calibration, the antighost is a proto-ghost, while for full-complex calibration, it is a deutero-ghost. This can be confirmed by inspecting Table 2.

  • Fewer ghosts: phase-only calibration produces fewer scattered ghosts than full-complex calibration as no deutero-ghosts form when we employ phase-only calibration.

  • Colour: we extended the basic per-baseline taxonomy in Paper II in Section 4.7. We realized that we can use the sign of a ghost's real amplitude factor to assign one of the two following false colours to it: red and blue. If a ghost is completely real, its colour determines whether it manifests as a positive or negative ghost and, consequently, whether it suppresses or amplifies. Real blue ghosts are suppressors and real red ghosts are amplifiers. The primary suppressor, secondary supressor and its antighost are all usually real valued which, in turn, means that only their colour is generally important. The primary suppressor and secondary suppressor are blue which explains why they suppress flux. The antighost of the secondary suppressor is red which explains why it manifests as a positive source during imaging. The fact that the antighost is red is important. If this were not the case, spurious symmetrization would not be possible.

  • Symmetric pairs: the blue proto-ghosts of phase-only calibration and full-complex calibration, in general, form a symmetrical pair with a red antighost. However, the red antighosts are proto-ghosts in the case of phase-only calibration and deutero-ghosts when full-complex calibration is employed. This explains why the antighosts of the phase-only blue proto-ghosts are brighter than their full-complex counterparts.

To conclude our phase-only calibration summary, we present ghost flux values as predicted by equations (33) and (59) for different values of N in Table 3. The reader can use this table to directly compare the ghost responses of full-complex and phase-only calibration and, by implication, enable them to validate most of the aforementioned conclusions.

Table 3.

The flux of the primary suppressor, the secondary suppressor, the antighost of the secondary suppressor, a proto-ghost and a deutero-ghost for a non-redundant interferometer with 7, 14, 27 and 100 antennas as a percentage of A2. In this table, N and B denote the number of antennas and baselines in the array, respectively. We generated the values in this table by employing two different calibration strategies, namely full-complex and phase-only calibration. We use the abbreviations FC and PO to denote full-complex and phase-only calibration, respectively.

Ghost Approach FormulaN
fraction of A271427100
PrimaryFC|$\frac{1}{N}$|14%7%4%1%
PON/AN/AN/AN/AN/A
SecondaryFC|$\frac{2}{N}-\frac{1}{N^2}$|27%14%7%2%
PO|$\frac{1}{N}$|14%7%4%1%
AntighostFC|$\frac{1}{N^2}$|2%0.5%0.1%0.01%
PO|$\frac{1}{N}$|14%7%4%1%
ProtoFC|$\frac{1}{B}(\frac{1}{N}-\frac{1}{N^2})$|0.6%0.07%0.01%0.0002%
PO|$\frac{1}{2BN}$|0.3%0.04%0.005%0.0001%
DeuteroFC|$\frac{1}{BN^2}$|0.1%0.006%0.0004%0.000 002%
PON/AN/AN/AN/AN/A
Ghost Approach FormulaN
fraction of A271427100
PrimaryFC|$\frac{1}{N}$|14%7%4%1%
PON/AN/AN/AN/AN/A
SecondaryFC|$\frac{2}{N}-\frac{1}{N^2}$|27%14%7%2%
PO|$\frac{1}{N}$|14%7%4%1%
AntighostFC|$\frac{1}{N^2}$|2%0.5%0.1%0.01%
PO|$\frac{1}{N}$|14%7%4%1%
ProtoFC|$\frac{1}{B}(\frac{1}{N}-\frac{1}{N^2})$|0.6%0.07%0.01%0.0002%
PO|$\frac{1}{2BN}$|0.3%0.04%0.005%0.0001%
DeuteroFC|$\frac{1}{BN^2}$|0.1%0.006%0.0004%0.000 002%
PON/AN/AN/AN/AN/A
Table 3.

The flux of the primary suppressor, the secondary suppressor, the antighost of the secondary suppressor, a proto-ghost and a deutero-ghost for a non-redundant interferometer with 7, 14, 27 and 100 antennas as a percentage of A2. In this table, N and B denote the number of antennas and baselines in the array, respectively. We generated the values in this table by employing two different calibration strategies, namely full-complex and phase-only calibration. We use the abbreviations FC and PO to denote full-complex and phase-only calibration, respectively.

Ghost Approach FormulaN
fraction of A271427100
PrimaryFC|$\frac{1}{N}$|14%7%4%1%
PON/AN/AN/AN/AN/A
SecondaryFC|$\frac{2}{N}-\frac{1}{N^2}$|27%14%7%2%
PO|$\frac{1}{N}$|14%7%4%1%
AntighostFC|$\frac{1}{N^2}$|2%0.5%0.1%0.01%
PO|$\frac{1}{N}$|14%7%4%1%
ProtoFC|$\frac{1}{B}(\frac{1}{N}-\frac{1}{N^2})$|0.6%0.07%0.01%0.0002%
PO|$\frac{1}{2BN}$|0.3%0.04%0.005%0.0001%
DeuteroFC|$\frac{1}{BN^2}$|0.1%0.006%0.0004%0.000 002%
PON/AN/AN/AN/AN/A
Ghost Approach FormulaN
fraction of A271427100
PrimaryFC|$\frac{1}{N}$|14%7%4%1%
PON/AN/AN/AN/AN/A
SecondaryFC|$\frac{2}{N}-\frac{1}{N^2}$|27%14%7%2%
PO|$\frac{1}{N}$|14%7%4%1%
AntighostFC|$\frac{1}{N^2}$|2%0.5%0.1%0.01%
PO|$\frac{1}{N}$|14%7%4%1%
ProtoFC|$\frac{1}{B}(\frac{1}{N}-\frac{1}{N^2})$|0.6%0.07%0.01%0.0002%
PO|$\frac{1}{2BN}$|0.3%0.04%0.005%0.0001%
DeuteroFC|$\frac{1}{BN^2}$|0.1%0.006%0.0004%0.000 002%
PON/AN/AN/AN/AN/A

We investigated the effect that primary beam correction can have on the antighost in Section 6. We found that the primary beam can amplify the brightness of the antighost in two main ways.

  • Model accuracy: if the apparent flux of the unmodelled source happens to be larger than the apparent flux of the brightest source in the field, then our calibration model becomes very inaccurate. If we calibrate with a very inaccurate model (A2A1), then a very bright antighost is created. We discussed this in detail in Sections 6.2.2 and 6.2.5.

  • Source positions: if the relative positions of the modelled and unmodelled source is such that the antighost forms in a region of the primary beam where the attenuation is high, then the antighost is significantly amplified when we perform our beam correction (see Section 6.2.5).

This work is based upon research supported by the South African Research Chairs Initiative of the Department of Science and Technology and National Research Foundation. This work is funded by IBM, ASTRON, the Dutch Ministry of Economic Affairs and the Province of Drenthe. It is also part of the SKA-TSM project and supported by The Northern Netherlands Provinces Alliance (SNN), Koers Noord and the Province of Drenthe, and the European Community FP7 programme MIDPREP, Grant Agreement PIRSES-GA-2013-612599.

The authors would like to thank Gianni Bernardi and Sandeep Sirothia for the valuable contributions that they made. We would also like to thank the reviewer, Dr Maxim Voronkov, for the useful feedback he provided; his feedback has definitely improved the quality of the paper.

2

Recall that {1, 2, 3}∪{1, 2, 4} = {1, 2, 3, 4}.

3

The sign refers to the sign of the ghost itself and not to the sign of the coefficient associated with it.

4

LBA CS 1–7, 11, 13, 17, 21, 24, 26, 28, 30–32, 101, 103, 201, 301, 302, 401, 501; LBA RS 106, 205, 208, 306–307, 406, 503, 508–509.

5

These coordinates are J2000 coordinates.

REFERENCES

Cornwell
T. J.
Fomalont
E. B.
1999
Taylor
G. B.
Carilli
C. L.
Perley
R. A.
ASP Conf. Ser. Vol. 180, Synthesis Imaging in Radio Astronomy II
Astron. Soc. Pac.
San Francisco
197

Dewdney
P. E.
Hall
P. J.
Schilizzi
R. T.
Lazio
T. J. L. W.
2009
Proc. IEEE
97
1482

Green
D.
2011
Bull. Astr. Soc. India
39
289

Grobler
T. L.
Smirnov
O. M.
2015
Proc. 1st URSI AT-RASC, Calibration artefacts: Phase-only Calibration. International Union of Radio Science (URSI)
1

Grobler
T. L.
Nunhokee
C. D.
Smirnov
O. M.
van Zyl
A. J.
de Bruyn
A. G.
2014
MNRAS
439
4030
(Paper I)

Heald
G.
et al.
2015
A&A
582
A123

Jonas
J. L.
2009
Proc. IEEE
97
1522

Linfield
R. P.
1986
AJ
92
213

Rau
U.
Bhatnagar
S.
Voronkov
M. A.
Cornwell
T. J.
2009
Proc. IEEE
97
1472

Salvini
S.
Wijnholds
S. J.
2014
A&A
571
A97

Smirnov
O. M.
2010
Ghostbusters: The Unknown Unknowns Of Selfcal, presentation at CALIM2010 conference
Dwingeloo

Smirnov
O. M.
2011
A&A
527
A106

Smirnov
O.
Tasse
C.
2015
MNRAS
449
2668

Stewart
A. J.
2014
LOFAR Image Plane Transient Candidate #3, presentation at LOFAR TKP meeting
Amsterdam

Stewart
A.
et al.
2016
MNRAS
456
2321

Swinbank
J. D.
et al.
2015
Astron. Comput.
11
25

van der Veen
A.-J.
Wijnholds
S. J.
2013
Bhattacharyya
S. S.
Deprettere
E. F.
Leupers
R.
Takala
J.
Signal Processing Tools for Radio Astronomy, Handbook of Signal Processing Systems
Springer
New York
421

van Haarlem
M. P.
et al.
2013
A&A
556
1

Wijnholds
S. J.
van der Tol
S.
Nijboer
R.
van der Veen
A.-J.
2010
IEEE Signal Process. Mag.
27
30

Wijnholds
S. J.
Grobler
T. L.
Smirnov
O. M.
2016
MNRAS
457
2331
(Paper II)

Wilkinson
P. N.
Conway
J.
Biretta
J.
1988
Reid
M. J.
Moran
J. M.
Proc. IAU Symp. 129, The Impact of VLBI on Astrophysics and Geophysics
Kluwer
Dordrecht
509

Yatawatta
S.
Kazemi
S.
Zaroubi
S.
2012
Proc. GPU Accelerated Nonlinear Optimization in Radio Interferometric Calibration. Innovative Parallel Computing (InPar)
1

APPENDIX A: DERIVATION OF EQUATION (26)

In this appendix, we derive a closed form expression for the pseudo-inverse of equation (25). We do this by employing eigenvalue decomposition, i.e. we first find
(A1)
which enables us to compute
(A2)
where |$\boldsymbol {\Lambda }^\dagger$| is defined such that all non-zero eigenvalues are replaced by their reciprocals while the zero eigenvalues remain zero.
It is rather easy to show that
(A3)
where the vector of ones has length N and |$\boldsymbol {V}_0$| provides an orthonormal basis for the null space of the explicitly stated eigenvector. Substituting equation (A3) into (A1) shows that
(A4)
We therefore have
(A5)
Hence
(A6)
which completes our derivation of equation (26).