-
PDF
- Split View
-
Views
-
Cite
Cite
I A Eckley, G P Nason, A test for the absence of aliasing or local white noise in locally stationary wavelet time series, Biometrika, Volume 105, Issue 4, December 2018, Pages 833–848, https://doi.org/10.1093/biomet/asy040
- Share Icon Share
SUMMARY
Aliasing is often overlooked in time series analysis but can seriously distort the spectrum, the autocovariance and their estimates. We show that dyadic subsampling of a locally stationary wavelet process, which can cause aliasing, results in a process that is the sum of asymptotic white noise and another locally stationary wavelet process with a modified spectrum. We develop a test for the absence of aliasing in a locally stationary wavelet series at a fixed location, and illustrate its application on simulated data and a wind energy time series. A useful by-product is a new test for local white noise. The tests are robust with respect to model misspecification in that the analysis and synthesis wavelets do not need to be identical. Hence, in principle, the tests work irrespective of which wavelet is used to analyse the time series, although in practice there is a trade-off between increasing statistical power and time localization of the test.
1. Introduction
Typically a data analyst is presented with a time series sampled at a fixed rate. However, the series might have been sampled at a higher rate, or the analyst could request future samples at a higher rate. In practice, it is important to question whether time series are sampled often enough to successfully capture their second-order structure. Improper sampling can lead to aliasing, which this article proposes to detect and locate via a new test.
For a time series sampled at intervals of length |$\Delta$|, the range of angular frequencies in the spectrum that can be observed undistorted is |$[0, \pi/\Delta)$|, where |$\pi/\Delta$| is the Nyquist frequency (Chatfield, 2003, p.109). If the highest frequencies in a series exceed the Nyquist frequency, then aliasing occurs and distorts the spectrum estimable by any method. As the spectrum and autocovariance are Fourier duals, distortion of the spectrum implies distortion of the autocovariance, and techniques that rely on either can be affected. Consequently, aliasing can have a critical impact.
Without knowledge of the data-generation process, the analyst will, a priori, be unaware of whether aliasing has occurred, and could misguidedly analyse the series assuming it is free from distortion. For example, if |$\{ X_t \}$| is a real-valued stationary process with spectrum |$f_X(\omega)$| for |$\omega \in [0, \pi)$|, and |$Y_t = X_{2t}$| is observed, then the spectrum of |$Y_t$| is |$f_Y(\omega) = f_X(\omega) + f_X (\pi - \omega)$| for |$\omega \in [0, \pi/2)$|. Generally, |$f_X$| cannot be identified from |$f_Y$|, so estimation of |$f_X$| is impossible from an estimate of |$f_Y$|. Thus, there is a need for principled approaches to testing for the presence of aliasing.
Hannan (1960), Priestley (1983), Hamilton (1994), Bloomfield (2000), Brillinger (2001) and Chatfield (2003) all describe aliasing, but provide no advice on how to detect or locate it. Instead, they suggest steps that can be taken to guard against it. One possibility is to apply a low-pass anti-aliasing filter prior to sampling, to ensure that the highest frequency in the filtered series is below the Nyquist rate. Although anti-alias filtering can be useful for analogue or very high sample-rate signal acquisitions, it is not useful when data are acquired at slow rates, such as in economics. Moreover, even if anti-alias filtering is used, valuable high-frequency information could be lost. Another possibility is that one might know, a priori, the highest frequency contained within a time series, and can choose the sample rate high enough to prevent aliasing.
Hinich & Wolinsky (1988) and Hinich & Messer (1995) introduced a hypothesis test for aliasing in stationary time series based on the bispectrum, a third-order quantity. Our wavelet test is based on simpler second-order quantities, and is designed for nonstationary time series. Computing the bispectrum typically takes |$O(T^2)$| operations. Our test is faster, requiring only |$O( T \log T)$| operations, which can be important for long time series.
Previous work has considered alias detection at a fixed location for a nonstationary time series, where the true signal or spectrum is known beforehand. Wunsch & Gunn (2003) used ice core time series and induced aliasing to demonstrate how it can lead to misleading scientific conclusions. Our test does not require knowledge of the underlying true spectrum or the higher-rate time series.
In a single realization, a nonstationary series can sometimes be aliased and sometimes not, depending on its spectral content relative to the Nyquist frequency at a given point. Hence, with a nonstationary series one may ask not only whether the series is aliased, but also where. Our test is designed to help answer both questions for locally stationary wavelet time series. That wavelets bring something genuinely new to the aliasing problem can be seen by replicating our method using Fourier-based quantities. Unlike the wavelet equations, which have a solution as given below, the equivalent Fourier-based equations are underdetermined, which is precisely the usual aliasing problem.
If one has prior knowledge that the time series has been properly sampled and there is no possibility of aliasing, then our test becomes a test for local white noise. Although global white noise tests are popular and useful, at the time of writing we are unaware of any local method tailored to locally stationary series. Local tests can be used for similar purposes to global ones, such as assisting with model selection, understanding time-varying forecasting performance or, as suggested by a referee, detecting measurement error in some circumstances.
2. Review of locally stationary wavelet processes
Locally stationary wavelet processes are time series models, constructed from wavelets, that change their statistical properties slowly over time. They are particularly useful for their ability to model time series operating at dominant scales in areas such as finance (Fryzlewicz, 2005), economics (Winkelmann, 2016), ocean engineering (Killick et al., 2013), structural engineering (Spanos & Kougioumtzoglou, 2012), energy (Nowotarski et al., 2013; de Menezes et al., 2016) and business (Michis, 2009). Dahlhaus (2012) provides a comprehensive review of locally stationary time series. We now briefly review essential definitions from Nason et al. (2000).
Spectral power for a locally stationary wavelet time series is quantified by the evolutionary wavelet spectrum, the time-scale analogue of the usual stationary spectrum, |$f(\omega)$|.
The locally stationary wavelet process |$\{ X_{t, T} \}_{t=0, \ldots, T-1}$|, for |$T \geqslant 1$|, has evolutionary wavelet spectrum defined by |$S_j(z) = | W_j (z) |^2$| for |$j\in{\mathbb N}$| and |$z \in (0,1)$| with respect to |$\{ \psi_{j,\,k} \}$|.
Hence, the |$\{ w_{j,\,k} \}$| are a collection of amplitudes such that |$w^2_{j,\,k} \approx S_j (k/T)$|. Evolution of the second-order properties of |$X_{t, T}$| is controlled by smoothness constraints on |$S_j(z)$| as a function of |$z$| via those imposed on |$W_j(z)$| in Definition 2(i)–(iii). For brevity, we henceforth drop the second subscript, |$T$|, in |$X_{t, T}$|.
The spectrum, |$S_j(z)$|, governs the contribution to variance in |$X_t$| at different scales at time |$z$|. Informally, |$S_j(z)$| corresponds to the process variance integrated over the approximate frequency band |$[ 2^{-j} \pi, 2^{1-j} \pi]$|. For example, the approximate band for |$S_1(z)$| is |$[ \pi/2, \pi]$|, that for |$S_2(z)$| is |$[\pi/4, \pi/2]$|, and so on. The precise frequency bands and their degree of overlap depend on the particular wavelet |$\psi_{j,\,k} (t)$| used in (3). We next recall several key quantities associated with locally stationary wavelet theory.
The raw wavelet periodogram of |$Y_t$| is defined to be |$I_{\ell,\,m} = d_{\ell,\,m}^2$| for |$\ell \in {\mathbb N}$| and |$m \in {\mathbb Z}$|, where |$\{ d_{\ell,\,m} \}$| are the nondecimated wavelet coefficients of |$Y_t$| given by |$d_{\ell,\,m} = \sum_t Y_t \psi_{\ell,\,m} (t)$| for |$\ell \in {\mathbb N}$| and |$m \in {\mathbb Z}$|. The autocorrelation wavelet is given by |$\Psi_j (\tau) = \sum_k \psi_{j,\,k} \psi_{j,\,k-\tau}$| for |$j \in {\mathbb N}$| and |$\tau \in {\mathbb Z}$|, and the inner product operator of the autocorrelation wavelets is |$A_{j, \ell} = \langle \Psi_j, \Psi_{\ell}\rangle = \sum_\tau \Psi_j (\tau) \Psi_{\ell} (\tau)$| for |$j, \ell \in {\mathbb N}$|.
Further information on locally stationary wavelet processes can be found in Fryzlewicz & Nason (2006), Van Bellegem & von Sachs (2008) and Nason (2008, Chapter 5).
3. Locally stationary wavelet models under dyadic sampling
3.1. Dyadic subsampling of locally stationary wavelet processes
In this article, aliasing is induced by starting with a locally stationary wavelet process, |$X_t$|, and then forming |$Y_t = X_{2^r t}$| (|$t\in{\mathbb Z}$|; |$r= 1, \ldots, J-1$|) by dyadic subsampling of |$X_t$|. Since |$Y_t$| is sampled at a slower rate than |$X_t$|, high-frequency power in |$X_t$| could reappear as aliased power in |$Y_t$|. Our first result shows that |$Y_t$| can be represented by the sum of a locally stationary wavelet process, constructed using the same wavelets as the original, and another process, which is asymptotically white noise.
Let |$\{ X_t \}_{t \in {\mathbb Z}}$| be a locally stationary wavelet process with evolutionary wavelet spectrum |$\{ S_j (z) \}_{j=1}^{\infty}$|. If |$Y_t = X_{2 t}$|, then |$\{ Y_t \}_{t\in {\mathbb Z}}$| admits the decomposition |$Y_t = L_t + F_t$|, where |$L_t$| is a locally stationary wavelet process with the same underlying wavelets as |$X_t$| and having raw wavelet periodogram expectation given by the right-hand side of (4), and |$F_t$| is a process with mean zero and autocovariance |$\operatorname{cov} ( F_t, F_{t+\tau} ) = S_1(2t/T)\delta_{0, \tau} + O(T^{-1})$|. Hence, |$F_t$| is asymptotically white noise with variance |$S_1(z)$|. Further, if |$S_1(z)$| is constant for |$z \in (0,1)$|, then |$F_t$| is stationary white noise with variance |$S_1$|.
The proof is given in the Appendix. Theorem 1 can be extended to repeated dyadic sampling as follows.
Let |$\{ X_t \}_{t \in {\mathbb Z}}$| be as in Theorem 1, and let |$Y_t = X_{2^r t}$| (|$r \in {\mathbb N}$|). Then, asymptotically, |$\{ Y_t\}$| admits the decomposition |$Y_t = L_t + F_t$|, where |$L_t$| is a locally stationary wavelet process with the same underlying wavelets as |$X_t$| and having raw wavelet periodogram expectation as in the right-hand side of (4), and |$F_t$| is a process with mean zero and autocovariance |$\operatorname{cov} ( F_t, F_{t+\tau}) = \delta_{0, \tau} \sum_{j=1}^r S_j (2^r t/T) + O(T^{-1})$|. Further, if |$S_1(z), \ldots, S_{r}(z)$| are all constant functions of |$z\in (0,1)$|, then |$F_t$| is stationary white noise with variance |$\sum_{j=1}^r S_j$|.
Nason et al. (2000) developed an estimator for the wavelet spectrum by exploiting the raw wavelet periodogram |$d^2_{\ell,\,m}$| of |$X_t$| using the result |${E}(d^2_{\ell,\,m}) = \sum_{j} A_{\ell,\,j} S_j(m/T) + O(T^{-1})$|. The next result explains what happens to |${E} (d^2_{\ell,\,m})$| after dyadic subsampling.
Theorem 2 generalizes Proposition 4 of Nason et al. (2000), in which |$r=0$|, and additionally establishes the result for Shannon wavelets. The Shannon wavelet can be thought of as the limiting case of Daubechies wavelets with an infinite number of vanishing moments. A consequence is that |$A_{j, \ell} = 2^j \delta_{j, \ell}$| for Shannon wavelets; see Nason et al. (2000) for further details.
There are three key differences between (5) and (6). First, |$E( d^2_{\ell,\,m})$| in (5) is contaminated by |$S_1 (2m/ T)$| at every analysis scale, i.e., for all |$\ell \geqslant 1$|. This contamination is the manifestation of aliasing in the locally stationary wavelet domain and could be used to detect aliasing, as described in §3.2. The second difference is that the mixing matrix on the right-hand side of (5) is |$A_{j-1, \ell}$|, not |$A_{j, \ell}$|. The third difference is that the dyadically subsampled periodogram exists on the grid |$2m/T$| rather than on |$m/T$|.
3.2. White noise confounding and hypothesis specification
If a locally stationary wavelet process |$X_t$| is white noise with variance |$\sigma^2$|, then the unsampled process is such that |$E(d^2_{\ell,\,m}) = \sigma^2$| for |$\ell \in {\mathbb N}$| and |$m\in {\mathbb Z}$|; see the proof of Lemma B.3 in Fryzlewicz et al. (2003). The same quantity appears at every scale as in the first term of (4) or (5) when subsampling. Hence, the effects of white noise and aliasing are confounded in our set-up.
Such confounding is not a surprise, as white noise can be seen as the ultimate aliased signal. For example, suppose that |$X_t$| is a stationary process with variance |$\sigma^2 < \infty$| and autocovariance |$\gamma_X (\tau) \rightarrow 0$| as |${\tau \rightarrow \infty}$|. If |$Y_t = X_{2^r t}$|, then |$\gamma_Y (\tau) \rightarrow \sigma^2 \delta_{\tau, 0}$| as |$r \rightarrow \infty$|. In other words, repeated subsampling of |$X_t$| leads to white noise. Taking a broader view, in a practical situation, what appears to be white noise may be the result of repeatedly subsampling a time series. Similar confounding can be found in the test for aliasing for stationary series in Hinich & Messer (1995); rejection of the null hypothesis there can mean that the series is not random, not stationary, aliased, not mixing, or any combination of these.
However, even with confounding, we can still test the null hypothesis |$H_0$| of there being no white noise component and no aliased component at |$z_0 \in (0, 1)$| against the hypothesis |$H_{\rm A}$| that a white noise or aliased component exists at |$z_0$|. So although we cannot separate the two components, our hypothesis test can test locally whether they are both absent.
4. Aliasing/white noise test
4.1. The test procedure
Let |$t_0 \in \{ 1, \ldots, T \}$| and let |$z_0 = t_0 / T$| be the rescaled-time equivalent of |$t_0$|. If we could observe the evolutionary wavelet spectrum |$\{ S_j^{(Y)} (z_0) \}_{j \in {\mathbb N}}$| directly, it would be possible to test the ideal null hypothesis |$H^{\rm (I)}_0$| that there exists |$j^{\ast} \in {\mathbb N}$| such that |$S_{j^{\ast}} (z_0) = 0$|, versus the alternative |$H^{\rm (I)}_{\rm A}\colon S_j (z_0) > 0$| for all |$j \in {\mathbb N}$|. In view of §3.2, the null hypothesis means that |$\{ Y_t \}$| cannot have any additive white noise component; and, in view of Theorem 2, it would also mean that no aliasing has occurred from any underlying subsampled |$\{ X_t \}$| process.
However, the spectrum |$\{ S_j^{(Y)} (z) \}_{j \in {\mathbb N}}$| cannot be observed directly but can only be estimated from a realization |$\{ Y_t \}_{t=1}^T$| on a finite set of scales |$j \leqslant J^{\dagger} < J$|, where |$J^{\dagger}$| is chosen to avoid boundary effects. Hence, we can only gain information on |$S_j (z_0)$| for |$j \in \{ 1, \ldots, J^{\dagger}\}$|, and therefore only test the null hypothesis |$H_0$| that there exists |$j^{\ast} \in \{ 1, \ldots, J^{\dagger} \}$| such that |$S_j^{\ast} (z_0) = 0$|, versus the alternative |$H_{\rm A}\colon S_{j} (z_0) > 0$| for |$j \in \{1 \ldots, J^{\dagger}\}$|. Clearly, if |$H_0$| is true, then |$H_0^{\rm (I)}$| is true. However, if |$H_{\rm A}$| is true, then either |$H_0^{\rm (I)}$| or |$H_{\rm A}^{\rm (I)}$| could be true, as |$S_j (z_0)$| might be zero for scales |$j > J^{\dagger}$| that cannot be discerned from a finite-length series.
We proceed by testing whether |$S_j(z_0) = 0$| for each scale |$j \in \{ 1, \ldots, J^{\dagger} \}$| separately. Let |$b \in {\mathbb N}, b>0$| be a window width, and define |$R_b = \{ -b, \ldots, b \}$|. Define the sample |$\{ \hat{S}_{j,\,t_0 + r} \}_{j=1, \ldots,\,J^{\dagger},\,r \in R_b}$|, where |$\hat{S}_{j, \ell}$| is the estimator given by |$\sum_{i=1}^{J^{\dagger}} A^{-1}_{j,\,i} I_{i,\,\ell}$| with |$I_{i,\,\ell}$| and |$A$| as in Definition 4. Proposition 4 of Nason et al. (2000) and the Lipschitz continuity of |$S_j(z)$| imply that |${E} ( \hat{S}_{j,\,t_0 + r} ) = S_j ( z_0 ) + O( r/T)$|. Hence, under |$H_0$|, there exists |$j^{\ast} \in \{ 1, \ldots, J^{\dagger} \}$| such that |${E} (\hat{S}_{j^{\ast},\,t_0 + r} ) = O(r/ T)$|. For small |$r$| and large |$T$| we have |${E} (\hat{S}_{j^{\ast},\,t_0 +r} ) \approx 0$|, with the |$\hat{S}_{j^{\ast},\,t_0 + r}$| having the same distribution on |$r\in R_b$| asymptotically. This approximation improves with increasing |$T$| and for smoother |$S$| around |$z_0$|. Hence, we employ standard tests of zero location on the set |$\{ \hat{S}_{j,\,t_0 + r} \}_{r \in R_b}$|, one for each scale |$j=1, \ldots, J^{\dagger}$|. For example, we could use Student’s |$t$|-test on the sample |$\{ \hat{S}_{j,\,t_0 + r} \}_{r \in R_b}$|. Let |$\bar{S}_{j,\,t_0}$| and |$\hat{\sigma}^2_{j,\,t_0}$| be the sample mean and variance of |$\{ \hat{S}_{j,\,t_0 + r} \}_{j=1, \ldots,\,J^{\dagger},\,r \in R_b}$|. The next result establishes the asymptotic distribution of the usual |$t$|-statistic operating on the |$\{ \hat{S}_{j,\,t_0 + r} \}_{j=1, \ldots,\, J^{\dagger},\,r \in R_b}$| sample.
Let |$j \in \{ 1, \ldots, J^{\dagger} \}$| be fixed, and let |$b\in {\mathbb N}, b>0$|. Suppose that |$\{ Y_t\}$| is a stationary Shannon wavelet process with innovations that satisfy |$E ( |\xi_{j,\,k} |^6 ) <\infty$|. Let |$t = (2b)^{1/2} \bar{S}_{j,\,t_0} \hat{\sigma}^{-1}_{j,\,t_0}$| be Student’s |$t$|-statistic defined on the sample |$\{ \hat{S}_{j,\,t_0 + r} \}_{j=1, \ldots,\,J^{\dagger},\,r \in R_b}$|. Then, as |$b \rightarrow \infty$|, |$\:t$| converges in distribution to a normal variable with mean |$2^j S_{j,\,t_0}$| and variance |$1$|.
We use Student’s |$t$|-test or, in the case of significant violations of normality, a nonparametric test such as the Wilcoxon signed rank or the signmedian test. Student’s |$t$|-test does not work well in the presence of autocorrelation, which is known to exist in the |$\hat{S}_{j,\,t_0 +r}$| sequence as a function of |$r$|; see Jones (1975), for example. To improve the performance of the |$t$|-test, we employ the equivalent sample size method from Zwiers & von Storch (1995), which uses estimated autocorrelation to alter the effective sample size. The |$t$|-test is robust against violations of the normality assumption; see the Supplementary Material.
We use Holm’s method (Holm, 1979) to control the overall size of our test over the multiple |$J^\dagger$| hypotheses. This is a reasonable choice, as no assumption is made about correlations between the |$\hat{S}_j(z_0)$| at different scales, but as a result our test is somewhat conservative.
4.2. Practical considerations
Our test is robust with respect to mismatch of the synthesis wavelet in (3) and the analysis wavelet given in Definition 4, because a white noise component and aliasing caused by subsampling both cause a constant to be added to each level of the wavelet periodogram, regardless of the type of wavelet used to synthesize the process. Our test uses a wavelet method to detect whether a constant has been added to each level, and it works irrespective of the analysis wavelet.
All Daubechies compactly supported wavelet spectral estimates suffer from spectral leakage, where power from one scale can leak to adjacent scales. Such leakage makes it harder to determine whether power in a given scale is zero, and this influences the statistical size and power of our test. In practice, there is a trade-off between mitigating spectral leakage by using longer, smoother wavelets (Nason et al., 2000, §4.1) and achieving good time localization by using shorter, rougher wavelets such as Haar wavelets. Therefore, we recommend using Daubechies D|$5$|, D|$6$| or D|$7$| mid-range wavelets, which achieve a good compromise.
Similarly, the size and power of our test will depend on the window width, |$b$|, which should be large enough to enable detection of modest departures from |$H_0$|, but small enough to ensure that the estimates |$\hat{S}_{j,\,t_0 + r}$| are representative of the behaviour of the spectrum at |$z_0$|. An appropriate upper bound for |$b$| could be obtained by computing an estimator of the spectrum (Nason et al., 2000; Fryzlewicz & Nason, 2006) and then choosing |$b$| such that the window of spectral estimates on |$R_b$| is approximately constant for each scale. For fixed |$T$| our test operates over a small interval |$(z_0 - b/T, \, z_0 + b/T)$|, although, conceptually, as |$T$| increases this interval will shrink to |$z_0$|.
Torrence & Compo (1998) introduced the wavelet cone of influence as the region of the wavelet spectrum degraded by edge effects. The extent of the cone depends on the length |$T$| of the time series and the length |$N_h$| of the wavelet filter used to compute the wavelet spectrum from Definition 1. The coarsest non-cone scale is the largest |$J_{\rm NC} \in {\mathbb N}$| such that |$J_{\rm NC} \leqslant \log_2 \{ (T/2 + N_h - 2)/(N_h - 1)\}$|. Our test uses scales |$1, \ldots, J^{\dagger}$| and, to avoid edge effects, we choose |$J^{\dagger} \leqslant J_{\rm NC}$|. For a functioning practical test, with reasonable power, we recommend |$T \geqslant 512$|, and for the range of sample sizes considered we choose |$J^{\dagger} = 4$|. This provides us with enough scales to furnish a nontrivial test, but not too many scales, which would reduce power and expose the test to large autocorrelations at the coarser scales, as mentioned earlier. For practical use, we recommend that |$J^{\dagger}$| grow linearly with |$\log T$|.
We already employ multiple hypothesis testing methods to control the overall size of our test by combining the results over |$J^{\dagger}$| scales using Holm’s method. Similar methods could be used if we repeated our test at multiple time locations, not just at a single time-point |$z_0$|.
4.3. Post hoc investigation if the test is rejected
If the null hypothesis is rejected, then evidence exists that aliasing or a white noise component is present, and we describe three post hoc strategies to distinguish them.
One strategy might be to perform stationarity tests on a range of times around |$z_0$| and, if stationary, use the Hinich & Wolinsky (1988) or Hinich & Messer (1995) aliasing tests. Another strategy could be to reanalyse the series at a faster rate, if possible. If significant power exists at finer scales, then aliasing probably has occurred, and the new rate should be considered in future.
A third approach, illustrated in §5.2, investigates the local spectral properties of the series in clear patches, where there is no aliasing or white noise, to discover whether power moves from low to higher frequencies just before an aliasing/white noise event or vice versa. This exploratory approach has the advantage of not requiring faster-sampled data required by the second approach or the stationarity tests of the first approach, which could have poor power for moderate sample sizes.
For the specific case of Shannon locally stationary wavelet processes, Eckley & Nason (2014) showed that aliased power can be detected and the aliasing effects removed.
5. Simulations and a wind energy example
5.1. Simulation study
Our simulation study uses the test-bed evolutionary wavelet spectrum |$\{ S^{\text{TB1}}_{j}(z) \}$| shown in Fig. 1 to produce realizations without subsampling, |$r=0$|. The test-bed spectrum satisfies the null hypothesis for |$J^{\dagger}=4$| at |$z_0 = 1/2$|, and the alternative at |$z_0 = 1/4$|.

The test-bed spectrum |$\{ S^{\text{TB1}}_{j}(z) \}_{j=1}^{J^{\dagger}=4}$|.
We drew 1000 realizations of length |$T=4096$| from the locally stationary process model of (3) using the test-bed spectrum and Daubechies D|$5$| wavelets with normally distributed innovations. Tables 1–4 show the empirical rejection rates; in each table, the rightmost column gives the rejection rate obtained by combining the per-scale |$p$|-values via Holm’s method, whereas the rates for individual scales |$j=1, \ldots, 4$| are reported in the preceding columns. Using the method described in §4.1, we test |$H_0 \colon S_j (z_0) = 0$| versus |$H_{\rm A} \colon S_j(z_0) > 0$|, for each scale |$j=1, \ldots, J^{\dagger}$| separately, at a nominal size of 5%, using the equivalent sample size version of Student’s |$t$|-test.
Empirical rejection rate |$(\%)$|, per scale and overall, at |$z_0 = 1/2$| with no subsampling
. | Size . | Power . | Power . | Power . | Size . |
---|---|---|---|---|---|
|$b$| . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Overall . |
|$16$| | |$3{\cdot}2$| | |$42{\cdot}1$| | |$28{\cdot}0$| | |$30{\cdot}9$| | |$0{\cdot}6$| |
|$32$| | |$2{\cdot}5$| | |$76{\cdot}2$| | |$36{\cdot}4$| | |$39{\cdot}4$| | |$1{\cdot}1$| |
|$64$| | |$1{\cdot}8$| | |$99{\cdot}2$| | |$69{\cdot}7$| | |$65{\cdot}6$| | |$1{\cdot}2$| |
|$128$| | |$2{\cdot}2$| | |$100{\cdot}0$| | |$98{\cdot}0$| | |$90{\cdot}0$| | |$2{\cdot}1$| |
. | Size . | Power . | Power . | Power . | Size . |
---|---|---|---|---|---|
|$b$| . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Overall . |
|$16$| | |$3{\cdot}2$| | |$42{\cdot}1$| | |$28{\cdot}0$| | |$30{\cdot}9$| | |$0{\cdot}6$| |
|$32$| | |$2{\cdot}5$| | |$76{\cdot}2$| | |$36{\cdot}4$| | |$39{\cdot}4$| | |$1{\cdot}1$| |
|$64$| | |$1{\cdot}8$| | |$99{\cdot}2$| | |$69{\cdot}7$| | |$65{\cdot}6$| | |$1{\cdot}2$| |
|$128$| | |$2{\cdot}2$| | |$100{\cdot}0$| | |$98{\cdot}0$| | |$90{\cdot}0$| | |$2{\cdot}1$| |
Empirical rejection rate |$(\%)$|, per scale and overall, at |$z_0 = 1/2$| with no subsampling
. | Size . | Power . | Power . | Power . | Size . |
---|---|---|---|---|---|
|$b$| . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Overall . |
|$16$| | |$3{\cdot}2$| | |$42{\cdot}1$| | |$28{\cdot}0$| | |$30{\cdot}9$| | |$0{\cdot}6$| |
|$32$| | |$2{\cdot}5$| | |$76{\cdot}2$| | |$36{\cdot}4$| | |$39{\cdot}4$| | |$1{\cdot}1$| |
|$64$| | |$1{\cdot}8$| | |$99{\cdot}2$| | |$69{\cdot}7$| | |$65{\cdot}6$| | |$1{\cdot}2$| |
|$128$| | |$2{\cdot}2$| | |$100{\cdot}0$| | |$98{\cdot}0$| | |$90{\cdot}0$| | |$2{\cdot}1$| |
. | Size . | Power . | Power . | Power . | Size . |
---|---|---|---|---|---|
|$b$| . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Overall . |
|$16$| | |$3{\cdot}2$| | |$42{\cdot}1$| | |$28{\cdot}0$| | |$30{\cdot}9$| | |$0{\cdot}6$| |
|$32$| | |$2{\cdot}5$| | |$76{\cdot}2$| | |$36{\cdot}4$| | |$39{\cdot}4$| | |$1{\cdot}1$| |
|$64$| | |$1{\cdot}8$| | |$99{\cdot}2$| | |$69{\cdot}7$| | |$65{\cdot}6$| | |$1{\cdot}2$| |
|$128$| | |$2{\cdot}2$| | |$100{\cdot}0$| | |$98{\cdot}0$| | |$90{\cdot}0$| | |$2{\cdot}1$| |
Empirical power |$(\%)$| of test, per scale and overall, at |$z_0 = 1/4$| with no subsampling
|$b$| . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Overall . |
---|---|---|---|---|---|
|$16$| | |$76{\cdot}7$| | |$28{\cdot}6$| | |$25{\cdot}9$| | |$31{\cdot}4$| | |$3{\cdot}2$| |
|$32$| | |$98{\cdot}4$| | |$55{\cdot}2$| | |$40{\cdot}4$| | |$43{\cdot}6$| | |$11{\cdot}7$| |
|$64$| | |$100{\cdot}0$| | |$94{\cdot}6$| | |$77{\cdot}6$| | |$73{\cdot}0$| | |$54{\cdot}6$| |
|$80$| | |$100{\cdot}0$| | |$98{\cdot}8$| | |$86{\cdot}6$| | |$83{\cdot}0$| | |$70{\cdot}6$| |
|$96$| | |$100{\cdot}0$| | |$99{\cdot}4$| | |$94{\cdot}1$| | |$89{\cdot}1$| | |$83{\cdot}3$| |
|$128$| | |$100{\cdot}0$| | |$99{\cdot}8$| | |$98{\cdot}4$| | |$94{\cdot}0$| | |$92{\cdot}3$| |
|$b$| . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Overall . |
---|---|---|---|---|---|
|$16$| | |$76{\cdot}7$| | |$28{\cdot}6$| | |$25{\cdot}9$| | |$31{\cdot}4$| | |$3{\cdot}2$| |
|$32$| | |$98{\cdot}4$| | |$55{\cdot}2$| | |$40{\cdot}4$| | |$43{\cdot}6$| | |$11{\cdot}7$| |
|$64$| | |$100{\cdot}0$| | |$94{\cdot}6$| | |$77{\cdot}6$| | |$73{\cdot}0$| | |$54{\cdot}6$| |
|$80$| | |$100{\cdot}0$| | |$98{\cdot}8$| | |$86{\cdot}6$| | |$83{\cdot}0$| | |$70{\cdot}6$| |
|$96$| | |$100{\cdot}0$| | |$99{\cdot}4$| | |$94{\cdot}1$| | |$89{\cdot}1$| | |$83{\cdot}3$| |
|$128$| | |$100{\cdot}0$| | |$99{\cdot}8$| | |$98{\cdot}4$| | |$94{\cdot}0$| | |$92{\cdot}3$| |
Empirical power |$(\%)$| of test, per scale and overall, at |$z_0 = 1/4$| with no subsampling
|$b$| . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Overall . |
---|---|---|---|---|---|
|$16$| | |$76{\cdot}7$| | |$28{\cdot}6$| | |$25{\cdot}9$| | |$31{\cdot}4$| | |$3{\cdot}2$| |
|$32$| | |$98{\cdot}4$| | |$55{\cdot}2$| | |$40{\cdot}4$| | |$43{\cdot}6$| | |$11{\cdot}7$| |
|$64$| | |$100{\cdot}0$| | |$94{\cdot}6$| | |$77{\cdot}6$| | |$73{\cdot}0$| | |$54{\cdot}6$| |
|$80$| | |$100{\cdot}0$| | |$98{\cdot}8$| | |$86{\cdot}6$| | |$83{\cdot}0$| | |$70{\cdot}6$| |
|$96$| | |$100{\cdot}0$| | |$99{\cdot}4$| | |$94{\cdot}1$| | |$89{\cdot}1$| | |$83{\cdot}3$| |
|$128$| | |$100{\cdot}0$| | |$99{\cdot}8$| | |$98{\cdot}4$| | |$94{\cdot}0$| | |$92{\cdot}3$| |
|$b$| . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Overall . |
---|---|---|---|---|---|
|$16$| | |$76{\cdot}7$| | |$28{\cdot}6$| | |$25{\cdot}9$| | |$31{\cdot}4$| | |$3{\cdot}2$| |
|$32$| | |$98{\cdot}4$| | |$55{\cdot}2$| | |$40{\cdot}4$| | |$43{\cdot}6$| | |$11{\cdot}7$| |
|$64$| | |$100{\cdot}0$| | |$94{\cdot}6$| | |$77{\cdot}6$| | |$73{\cdot}0$| | |$54{\cdot}6$| |
|$80$| | |$100{\cdot}0$| | |$98{\cdot}8$| | |$86{\cdot}6$| | |$83{\cdot}0$| | |$70{\cdot}6$| |
|$96$| | |$100{\cdot}0$| | |$99{\cdot}4$| | |$94{\cdot}1$| | |$89{\cdot}1$| | |$83{\cdot}3$| |
|$128$| | |$100{\cdot}0$| | |$99{\cdot}8$| | |$98{\cdot}4$| | |$94{\cdot}0$| | |$92{\cdot}3$| |
Empirical rejection rate |$(\%)$|, per scale and overall, at |$z_0 = 1/2$| with dyadic subsampling |$(r = 1)$|
. | . | Power . | Size . | Power . | Size . |
---|---|---|---|---|---|
|$b$| . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Overall . |
|$16$| | – | |$87{\cdot}7$| | |$9{\cdot}7$| | |$63{\cdot}5$| | |$2{\cdot}8$| |
|$32$| | – | |$95{\cdot}5$| | |$5{\cdot}8$| | |$88{\cdot}6$| | |$1{\cdot}3$| |
|$64$| | – | |$100{\cdot}0$| | |$5{\cdot}9$| | |$99{\cdot}8$| | |$2{\cdot}2$| |
|$128$| | – | |$100{\cdot}0$| | |$2{\cdot}4$| | |$100{\cdot}0$| | |$0{\cdot}9$| |
. | . | Power . | Size . | Power . | Size . |
---|---|---|---|---|---|
|$b$| . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Overall . |
|$16$| | – | |$87{\cdot}7$| | |$9{\cdot}7$| | |$63{\cdot}5$| | |$2{\cdot}8$| |
|$32$| | – | |$95{\cdot}5$| | |$5{\cdot}8$| | |$88{\cdot}6$| | |$1{\cdot}3$| |
|$64$| | – | |$100{\cdot}0$| | |$5{\cdot}9$| | |$99{\cdot}8$| | |$2{\cdot}2$| |
|$128$| | – | |$100{\cdot}0$| | |$2{\cdot}4$| | |$100{\cdot}0$| | |$0{\cdot}9$| |
Empirical rejection rate |$(\%)$|, per scale and overall, at |$z_0 = 1/2$| with dyadic subsampling |$(r = 1)$|
. | . | Power . | Size . | Power . | Size . |
---|---|---|---|---|---|
|$b$| . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Overall . |
|$16$| | – | |$87{\cdot}7$| | |$9{\cdot}7$| | |$63{\cdot}5$| | |$2{\cdot}8$| |
|$32$| | – | |$95{\cdot}5$| | |$5{\cdot}8$| | |$88{\cdot}6$| | |$1{\cdot}3$| |
|$64$| | – | |$100{\cdot}0$| | |$5{\cdot}9$| | |$99{\cdot}8$| | |$2{\cdot}2$| |
|$128$| | – | |$100{\cdot}0$| | |$2{\cdot}4$| | |$100{\cdot}0$| | |$0{\cdot}9$| |
. | . | Power . | Size . | Power . | Size . |
---|---|---|---|---|---|
|$b$| . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Overall . |
|$16$| | – | |$87{\cdot}7$| | |$9{\cdot}7$| | |$63{\cdot}5$| | |$2{\cdot}8$| |
|$32$| | – | |$95{\cdot}5$| | |$5{\cdot}8$| | |$88{\cdot}6$| | |$1{\cdot}3$| |
|$64$| | – | |$100{\cdot}0$| | |$5{\cdot}9$| | |$99{\cdot}8$| | |$2{\cdot}2$| |
|$128$| | – | |$100{\cdot}0$| | |$2{\cdot}4$| | |$100{\cdot}0$| | |$0{\cdot}9$| |
Empirical rejection rate |$(\%)$|, per scale and overall, at |$z_0 = 1/4$| with subsampling |$(r=1)$|
|$b$| . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Overall . |
---|---|---|---|---|---|
|$16$| | – | |$78{\cdot}6$| | |$16{\cdot}6$| | |$38{\cdot}7$| | |$2{\cdot}1$| |
|$32$| | – | |$98{\cdot}7$| | |$29{\cdot}9$| | |$60{\cdot}7$| | |$6{\cdot}5$| |
|$64$| | – | |$100{\cdot}0$| | |$62{\cdot}4$| | |$90{\cdot}6$| | |$29{\cdot}0$| |
|$128$| | – | |$100{\cdot}0$| | |$94{\cdot}3$| | |$100{\cdot}0$| | |$65{\cdot}8$| |
|$b$| . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Overall . |
---|---|---|---|---|---|
|$16$| | – | |$78{\cdot}6$| | |$16{\cdot}6$| | |$38{\cdot}7$| | |$2{\cdot}1$| |
|$32$| | – | |$98{\cdot}7$| | |$29{\cdot}9$| | |$60{\cdot}7$| | |$6{\cdot}5$| |
|$64$| | – | |$100{\cdot}0$| | |$62{\cdot}4$| | |$90{\cdot}6$| | |$29{\cdot}0$| |
|$128$| | – | |$100{\cdot}0$| | |$94{\cdot}3$| | |$100{\cdot}0$| | |$65{\cdot}8$| |
Empirical rejection rate |$(\%)$|, per scale and overall, at |$z_0 = 1/4$| with subsampling |$(r=1)$|
|$b$| . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Overall . |
---|---|---|---|---|---|
|$16$| | – | |$78{\cdot}6$| | |$16{\cdot}6$| | |$38{\cdot}7$| | |$2{\cdot}1$| |
|$32$| | – | |$98{\cdot}7$| | |$29{\cdot}9$| | |$60{\cdot}7$| | |$6{\cdot}5$| |
|$64$| | – | |$100{\cdot}0$| | |$62{\cdot}4$| | |$90{\cdot}6$| | |$29{\cdot}0$| |
|$128$| | – | |$100{\cdot}0$| | |$94{\cdot}3$| | |$100{\cdot}0$| | |$65{\cdot}8$| |
|$b$| . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Overall . |
---|---|---|---|---|---|
|$16$| | – | |$78{\cdot}6$| | |$16{\cdot}6$| | |$38{\cdot}7$| | |$2{\cdot}1$| |
|$32$| | – | |$98{\cdot}7$| | |$29{\cdot}9$| | |$60{\cdot}7$| | |$6{\cdot}5$| |
|$64$| | – | |$100{\cdot}0$| | |$62{\cdot}4$| | |$90{\cdot}6$| | |$29{\cdot}0$| |
|$128$| | – | |$100{\cdot}0$| | |$94{\cdot}3$| | |$100{\cdot}0$| | |$65{\cdot}8$| |
Table 1 illustrates our test working as a local white noise test, with no subsampling. Under the null hypothesis, at |$z_0 = 1/2$|, the test shows increasing empirical power at scales |$j=2, 3, 4$| as the window width |$b$| increases, and has empirical size ranging from 1|$\cdot$|8% to 3|$\cdot$|2% for scale |$j=1$|. These empirical sizes are somewhat conservative, and we attribute this to negative correlations present in the |$\{ \hat{S}_{j,\,t_0 + r} \}_{r \in R_b}$| sample. The equivalent sample size method mentioned in §4.1 attempts to improve the size calibration, but the outcome is not perfect.
The overall empirical size of our test is |$1$|–|$2$|%, lower than the nominal rate. Under the alternative hypothesis, at |$z_0 = 1/4$|, Table 2 shows the increasing empirical power for all scales.
The next simulation set illustrates application of our absence-of-aliasing test using a second test-bed spectrum, |$S^{\text{TB2}}_j (z)$|, which is identical to |$S^{\text{TB1}}_j (z)$| except that the scale |$j=3$| has zero power. This set subsamples using |$r=1$|, which reduces the realization length from |$4096$| to |$2048$| according to §3.1, and induces aliasing where power existed at scale |$j=1$|. The subsampling causes scale |$j=1$| to disappear, and so we test the individual scale hypotheses at |$j=2, 3, 4$|. Table 3 shows the results for |$z_0 = 1/2$|, where previously there was no power at the finest scale and therefore no occurrence of aliasing. The empirical power for the |$j=2, 4$| columns is high, due to the spectral power present at those scales. However, for scale |$j=3$|, for reasonable sample sizes, the rejection rate becomes consistent with the nominal size. The overall rejection rate at |$z_0 = 1/2$| is well controlled, but conservative, with respect to the nominal rate.
Table 4 shows the results for |$z_0 = 1/4$|, where the power at the finest scale, |$\,j=1$|, is redistributed into coarser scales according to Theorem 2. As before, the null hypothesis is routinely rejected at scales |$j=2, 4$|. In contrast to Table 3, Table 4 shows increasing, and eventually high, power at scale |$j=3$|. Hence, we would often reject our null hypothesis in favour of the alternative |$H_{\rm A}\colon S_j(z_0) > 0$| for all |$j = 1, \ldots, J^{\dagger}$|, and therefore the overall empirical power of the test increases and reaches 65|$\cdot$|8% for |$b=128$|. In practice we would, for the time being, accept |$H_0$| at |$z_0 = 1/2$| and conclude that no aliasing occurred and nor were white noise components present. However, at |$z_0 = 1/4$|, although we can reject |$H_0$|, we cannot conclude that there was definitely aliasing or that white noise components were present, because we have no information on scales for |$j > 4$|.
5.2. A wind speed example
Wind power forecasting has received much attention in recent years, motivated by the need to develop reliable forecasting tools to enable effective integration of wind farm output into power grids (Landberg et al., 2003; Genton & Hering, 2007). Several authors have used autoregressive integrated moving average models that implicitly assume the absence of aliasing; see Huang & Chalabi (1995) or Sfetsos (2002), for example. We seek to identify whether any such corruption might occur in data provided by an industrial collaborator. High-resolution wind speed data, sampled at 1 Hz, were acquired from a proposed wind farm in the midwestern U.S.A. during March 2011 and differenced to remove trend. Figure 2 displays the differenced series, which exhibits nonstationarity. For example, the sample variances of the first and last 100 observations are |$0{\cdot}063$| and |$0{\cdot}093$|, respectively, and statistically significant variance differences have been confirmed using methods from Nason (2013). Q-Q plots and goodness-of-fit tests strongly suggest that the series’ marginal distribution has heavy tails.

First differences of wind speed (|$\mbox{m}\,\mbox{s}^{-1}$|) for a proposed wind farm site in the midwestern U.S.A., with |$T=512$|. The dashed vertical lines indicate the time-points |$t=245, 330$|.
We test |$H_0$| at the two times |$t=245, 330$|, and the Holm-adjusted |$p$|-values are displayed in Table 5. Results are presented for the three wavelets recommended in §4.2 for the |$J^{\dagger}=4$| non-cone scales. At time |$t=330$|, for each wavelet |$H_0$| is not rejected, so we have no evidence of a white noise component or aliasing at this location. At time |$t=245$|, |$H_0$| is rejected at the 5% level for wavelets D|$6$| and D|$8$| but not for D|$7$|. However, |$H_0$| would have been rejected at the 6|$\cdot$|5% level for the D|$7$| wavelet. Hence there is evidence to reject |$H_0$| in favour of |$H_{\rm A}$|.
The |$p$|-values and |$H_0$| rejection status for the wind series with our test at two reference times, using the signmedian test, three different Daubechies D|$v$| wavelets with |$v$| vanishing moments, and |$b=25$|
Time . | Wavelet . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Reject |$H_0$| . |
---|---|---|---|---|---|---|
|$245$| . | D|$6$| . | |$0{\cdot}0$| . | |$0{\cdot}3$| . | |$3{\cdot}3$| . | |$0{\cdot}3$| . | Yes . |
|$245$| | D|$7$| | |$0{\cdot}0$| | |$2{\cdot}0$| | |$6{\cdot}5$| | |$2{\cdot}0$| | No |
|$245$| | D|$8$| | |$0{\cdot}1$| | |$3{\cdot}3$| | |$0{\cdot}8$| | |$0{\cdot}8$| | Yes |
|$330$| | D|$6$| | |$0{\cdot}0$| | |$100{\cdot}0$| | |$0{\cdot}0$| | |$100{\cdot}0$| | No |
|$330$| | D|$7$| | |$0{\cdot}0$| | |$100{\cdot}0$| | |$0{\cdot}8$| | |$100{\cdot}0$| | No |
|$330$| | D|$8$| | |$0{\cdot}0$| | |$100{\cdot}0$| | |$0{\cdot}1$| | |$100{\cdot}0$| | No |
Time . | Wavelet . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Reject |$H_0$| . |
---|---|---|---|---|---|---|
|$245$| . | D|$6$| . | |$0{\cdot}0$| . | |$0{\cdot}3$| . | |$3{\cdot}3$| . | |$0{\cdot}3$| . | Yes . |
|$245$| | D|$7$| | |$0{\cdot}0$| | |$2{\cdot}0$| | |$6{\cdot}5$| | |$2{\cdot}0$| | No |
|$245$| | D|$8$| | |$0{\cdot}1$| | |$3{\cdot}3$| | |$0{\cdot}8$| | |$0{\cdot}8$| | Yes |
|$330$| | D|$6$| | |$0{\cdot}0$| | |$100{\cdot}0$| | |$0{\cdot}0$| | |$100{\cdot}0$| | No |
|$330$| | D|$7$| | |$0{\cdot}0$| | |$100{\cdot}0$| | |$0{\cdot}8$| | |$100{\cdot}0$| | No |
|$330$| | D|$8$| | |$0{\cdot}0$| | |$100{\cdot}0$| | |$0{\cdot}1$| | |$100{\cdot}0$| | No |
The |$p$|-values and |$H_0$| rejection status for the wind series with our test at two reference times, using the signmedian test, three different Daubechies D|$v$| wavelets with |$v$| vanishing moments, and |$b=25$|
Time . | Wavelet . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Reject |$H_0$| . |
---|---|---|---|---|---|---|
|$245$| . | D|$6$| . | |$0{\cdot}0$| . | |$0{\cdot}3$| . | |$3{\cdot}3$| . | |$0{\cdot}3$| . | Yes . |
|$245$| | D|$7$| | |$0{\cdot}0$| | |$2{\cdot}0$| | |$6{\cdot}5$| | |$2{\cdot}0$| | No |
|$245$| | D|$8$| | |$0{\cdot}1$| | |$3{\cdot}3$| | |$0{\cdot}8$| | |$0{\cdot}8$| | Yes |
|$330$| | D|$6$| | |$0{\cdot}0$| | |$100{\cdot}0$| | |$0{\cdot}0$| | |$100{\cdot}0$| | No |
|$330$| | D|$7$| | |$0{\cdot}0$| | |$100{\cdot}0$| | |$0{\cdot}8$| | |$100{\cdot}0$| | No |
|$330$| | D|$8$| | |$0{\cdot}0$| | |$100{\cdot}0$| | |$0{\cdot}1$| | |$100{\cdot}0$| | No |
Time . | Wavelet . | |$j=1$| . | |$j=2$| . | |$j=3$| . | |$j=4$| . | Reject |$H_0$| . |
---|---|---|---|---|---|---|
|$245$| . | D|$6$| . | |$0{\cdot}0$| . | |$0{\cdot}3$| . | |$3{\cdot}3$| . | |$0{\cdot}3$| . | Yes . |
|$245$| | D|$7$| | |$0{\cdot}0$| | |$2{\cdot}0$| | |$6{\cdot}5$| | |$2{\cdot}0$| | No |
|$245$| | D|$8$| | |$0{\cdot}1$| | |$3{\cdot}3$| | |$0{\cdot}8$| | |$0{\cdot}8$| | Yes |
|$330$| | D|$6$| | |$0{\cdot}0$| | |$100{\cdot}0$| | |$0{\cdot}0$| | |$100{\cdot}0$| | No |
|$330$| | D|$7$| | |$0{\cdot}0$| | |$100{\cdot}0$| | |$0{\cdot}8$| | |$100{\cdot}0$| | No |
|$330$| | D|$8$| | |$0{\cdot}0$| | |$100{\cdot}0$| | |$0{\cdot}1$| | |$100{\cdot}0$| | No |
We undertake a post hoc investigation as described in §4.3. We did not reject |$H_0$| at |$t=330$|, are suspicious that aliasing or white noise components could exist at |$t=245$|, and suspect that there might be a transition between the two time-points, possibly around |$t=290$|. If prior to |$t=290$| the series was truly subject to aliasing, and after that time it was not, then power will have moved from higher to lower frequencies over time or, at the very least, have disappeared from frequencies above the Nyquist frequency. Since we suspect aliasing prior to |$t=300$|, we do not apply classical spectral analysis here.
However, as we have no evidence of aliasing after the suspected transition, we subject the series there to post hoc classical stationary periodogram analyses on three rolling windows, using methods described by Fryzlewicz et al. (2008) and displayed in Fig. 3. The results indicate that the peak frequency associated with each window decreases. Hence, there is supporting evidence for the conjecture that, after |$t=300$|, the high-frequency content in the series reduces over time.

Three regular local kernel-smoothed periodograms of wind speed series taken over three time windows: 260–323 (solid), 270–333 (dashed) and 280–343 (dotted). The frequencies of the maximum values of each smoothed periodogram are indicated by vertical lines.
Using the centre time, |$t$|, for each rolling window and its associated peak frequency, |$f$|, we obtain |$(t, f) \in \{ (292, 0{\cdot}438), (302, 0{\cdot}406), (312, 0{\cdot}395)\}$|, which can be modelled approximately by the linear relationship |$f = 1{\cdot}06 - 0{\cdot}0022 t$|. Assuming that the peak frequency reduces linearly according to this model, and extrapolating backwards, we hypothesize that the series was aliased before |$t=261$|, when |$f=0{\cdot}5$| in the model. This is merely a hypothesis, as the series could have been contaminated by white noise before the transition rather than by aliased high-frequency components. In addition, the transition time itself, as well as this analysis, is subject to uncertainty.
Acknowledgement
We thank C. Ziesler of Wind Capital Group for the wind speed data and several reviewers who have helped to improve this paper. The authors were partially supported by the Research Councils UK Energy Programme and gratefully acknowledge financial support from the Engineering and Physical Sciences Research Council.
Supplementary material
Supplementary material available at Biometrika online includes the proofs of Theorems 2 and 3 and definition of the test-bed spectrum, together with advice on the selection of a suitable hypothesis test of location, and theoretical justification of the asymptotic equivalence of the distribution of |$d_{j,\,k}$| to |$d_{j,\,k + r}$|.
Appendix
Proof of Theorem 1
The |$\{ W^{\ast}_{j}(z) \}$| satisfy the same smoothness conditions as the |$\{ W_{j}(z) \}$|. Hence |$U^{(p)}_t$| is a locally stationary wavelet process for all |$p$|. Although the range of |$p$| values in (A3) seems to be infinite, only a finite, and typically small, number of |$h_{2p}$| are nonzero, so the sum over |$p$| and |$\ell$| in (A3) is never infinite.
Therefore, |$V_t$| is the sum of a finite number of locally stationary wavelet processes with constant coefficients, not depending on |$t$|, and hence is itself a locally stationary wavelet process (Cardinali & Nason, 2010). The same arguments can be applied to |$O_t$|. Thus, |$V_t + O_t$| is also a locally stationary wavelet process.
The last equality holds because of the orthonormality relations satisfied by the quadrature mirror filter coefficients associated with the underlying mother wavelet; see Burrus et al. (1997, formula (5.28)).
The transition from (A5) to (A6) is due to formula (10) in Nason et al. (2000), and that from (A6) to (A7) is due to the Lipschitz continuity of |$W_1(z)$|, with |$L_1$| the associated Lipschitz constant, from Definition 1c in Nason et al. (2000). The remainder terms parallel those found in Nason et al. (2000, Proposition 1) and the proof of Theorem 2.
Proof of Corollary 1
This follows by iteration of Theorem 1 and reference to Theorem 2.
Proof of Theorem 2
The remainders in (A9) for Daubechies’ wavelets are derived using standard arguments (Nason et al., 2000, Proposition 4). The Supplementary Material gives details of the remainder for the Shannon case.
Lemma 1 of Eckley & Nason (2005) shows that when |$j>r$| we have |$\Psi_j (2^r v) = \Psi_{j-r} (v)$|. Further, for |$j=r$| we have |$\Psi_j (2^r v) = \Psi_1 (2v) = \delta_{v,\,0}$|, and for |$j<r$| we have |$\Psi_1 (2^{r-j} v) = \delta_{v,\,0}$|.
When not subsampling, i.e., |$r=0$|, formula (A11) reduces to the formula for the expectation of the raw wavelet periodogram in Nason et al. (2000, Proposition 4).