This paper presents a comprehensive examination of the X-ray continuum variability of the bright Seyfert 1 galaxy MCG-6-30-15. The source clearly shows the strong, linear correlation between rms variability amplitude and flux first seen in Galactic X-ray binaries. The high-frequency power spectral density (PSD) of MCG-6-30-15 is examined in detail using a Monte Carlo fitting procedure and is found to be well represented by a steep power law at high frequencies (with a power-law index α≈ 2.5), breaking to a flatter slope (α≈ 1) below fbr≈ 0.6–2.0 × 10−4 Hz, consistent with the previous results of Uttley, McHardy & Papadakis. The slope of the power spectrum above the break is energy dependent, with the higher energies showing a flatter PSD. At low frequencies the variations between different energy bands are highly coherent, while at high frequencies the coherence is significantly reduced. Time lags are detected between energy bands, with the soft variations leading the hard ones. The magnitude of the lag is small (≲200 s for the frequencies observed) and is most likely frequency dependent. These properties are remarkably similar to the temporal properties of the Galactic black hole candidate Cygnus X-1. The characteristic time-scales in these two types of source differ by ∼105; assuming that these time-scales scale linearly with black hole mass then suggests a black hole mass ∼106 M⊙ for MCG-6-30-15. We speculate that the timing properties of MCG-6-30-15 may be analogous to those of Cyg X-1 in its high/soft state and discuss a simple phenomenological model, originally developed to explain the timing properties of Cyg X-1, which can explain many of the observed properties of MCG-6-30-15.
The X-ray emission from radio-quiet active galactic nuclei (AGN) shows persistent variability that is stronger and more rapid than in any other waveband. This rapid X-ray variability provided early support for the black hole/accretion disc model of AGN (e.g. Rees 1984) and suggested that the X-ray emission is produced in the inner regions of the central engine (see Mushotzky, Done & Pounds 1993 for a review). The X-ray fluctuations appear erratic and random, with variability occurring on a range of time-scales (e.g. McHardy 1989).
In the study of time-variable processes one of the most widely used statistical tools is the variability power spectrum (e.g. Priestley 1981; Bloomfield 2000). The power spectral density (PSD) describes the amount of variability power (i.e. amplitude2) as a function of temporal frequency (time-scale−1). The long, uninterrupted observations provided by EXOSAT allowed for the measurement of the first accurate X-ray PSDs for Seyfert 1 galaxies. These EXOSAT PSDs were shown to rise at lower frequencies as a power law: , where is the power at frequency f and α is the PSD slope, found to be α∼ 1.5 over the frequencies ∼10−5 to ∼10−3 Hz (Green, McHardy & Lehto 1993; Lawrence & Papadakis 1993). This type of broad-band variability, rising to lower frequencies with a power spectral slope α≳ 1, is usually called ‘red noise’ (for an introduction to red noise in astronomy and elsewhere see Press 1978).
The red-noise PSDs of Seyfert 1s are similar to those observed in Galactic black hole candidates (GBHCs, see e.g. van der Klis 1995) on much shorter time-scales, and suggests that the X-ray emission mechanisms may be the same in these objects that differ in black hole mass by factors of ≳105 (McHardy 1989). The PSD of the best studied GBHC, Cygnus X-1 in its low/hard state, can be approximated by a doubly broken power law with a steep slope at high frequencies (α≈ 2 above ∼3 Hz), breaking to a flatter slope at intermediate frequencies (α≈ 1 in the range ≈0.2–3 Hz) then breaking to a flat (‘white’) spectrum at low frequencies (α≈ 0 below 0.2 Hz). See Nowak et al. (1999a) and Belloni & Hasinger (1990) for more details of Cyg X-1. Other GBHCs such as GX 339-4 show similar behaviour (Nowak, Wilms & Dove 2002). The position of the breaks in the PSD represent ‘characteristic time-scales’ of the system, but these time-scales have no clear physical interpretation at present (e.g. Belloni, Psaltis & van der Klis 2002 and references therein).
The featureless EXOSAT PSDs of Seyfert 1s provided no such time-scales. Such breaks must exist, however, or the total power would diverge at low frequencies (for PSD slopes α⩾ 1). Indeed, in recent years dedicated X-ray monitoring programmes with RXTE have revealed deviations from the power-law PSD in a few Seyfert 1 galaxies; at low frequencies the PSD breaks to a flatter slope (Edelson & Nandra 1999; Uttley, McHardy & Papadakis 2002; Markowitz et al. 2003). The time-scale of this break frequency is consistent with the idea that the PSD of AGN and GBHCs are essentially the same, with the characteristic time-scales scaling linearly with the black hole mass. Such a linear scaling of time-scales with black hole mass might be expected if the characteristic time-scales are related to, for example, a light-crossing time-scale or an orbital time-scale. This linear scaling also means that a 100-ks light curve of a Seyfert galaxy (such as presented here) is comparable to a ≲1 s light curve of Cyg X-1. On these time-scales a bright Seyfert 1 galaxy provides many more photons (by more than two orders of magnitude) than the GBHC, i.e. Seyferts have a higher count rate per characteristic time-scale. This means that intensive monitoring of Seyferts can provide a view of the very high-frequency variations in accreting black holes that is difficult to access even in bright GBHCs.
Not only do AGN and GBHCs show similar PSDs, but they display other striking similarities in their X-ray timing properties. Uttley & McHardy (2001) showed that both X-ray binary and AGN light curves show a tight, linear correlation between flux and rms variability amplitude (see also Edelson et al. 2002). The variability properties of GBHCs such as Cyg X-1 are a function of photon energy. The PSD shape above the high-frequency break is energy dependent, with the harder-energy bands showing flatter PSD slopes (e.g. Nowak et al. 1999a; Lin et al. 2000). In addition the variations between different bands are highly coherent (well correlated at each Fourier frequency) and show time delays, with variations at softer energies leading those at harder energies (Miyamoto & Kitamoto 1989; Cui et al. 1997; Nowak et al. 1999a). The magnitude of the time delay is frequency dependent, with the delay increasing at lower frequencies. To date these properties have only been measured in one AGN, NGC 7469 (Nandra & Papadakis 2001; Papadakis, Nandra & Kazanas 2001), and in this object the energy dependence of the PSD slope, the coherence and time-delay properties appeared remarkably similar to those seen in Cyg X-1. These similarities re-enforce the idea that the X-ray emission mechanisms are the same in GBHCs and AGN.
This paper presents a detailed analysis of the X-ray continuum variability of the bright, variable Seyfert 1 galaxy MCG-6-30-15 (z = 0.007 749) using a long XMM-Newton observation. The large collecting area, wide bandpass and long orbit (∼2 d) of XMM-Newton make it ideal for examining the high-frequency (≳10−5 Hz) timing properties of Seyfert galaxies. The analysis concentrates on frequency-domain (i.e. Fourier) time series analysis techniques, as these are used extensively in the analysis of GBHC data. The rest of this paper is organized as follows. In Section 2 the data reduction procedures are outlined. In Section 3 the basic properties of the light curves are discussed, followed by an analysis of the PSD in Section 4. Spectral variability is then examined in detail by measuring the coherence and time lags between different energy bands, as discussed in Section 5. As these analyses are relatively new to AGN studies we discuss the methodology in some detail. Finally, the implications of these results are discussed in Section 6 and some concluding remarks are given in Section 7.
2 Data Reduction
XMM-Newton (Jansen et al. 2001) observed MCG-6-30-15 over the period 2001 July 31–2001 August 5 (revolutions 301, 302 and 303), during which all instruments were operating nominally. Fabian et al. (2002) discuss details of the observation and basic data reduction. For the present paper the data were reprocessed entirely with the latest software (sas v5.3.3), but the procedure followed that discussed in the previous paper (except that the extraction region was a circle of radius 35 arcsec for both MOS and pn).
Light curves were extracted from the EPIC pn data in four different energy bands: 0.2–10.0 keV (full band), 0.2–0.7 keV (soft band), 0.7–2.0 keV (medium band) and 2.0–10.0 keV (hard band). These were corrected for telemetry drop outs (less than 1 per cent of the total time), background subtracted and binned to 100-s time resolution. The errors on the light curves were calculated by propagating the Poisson noise. The light curves were not corrected for the ∼71 per cent‘live time’ of the pn camera (Strüder et al. 2001), which is only a scaling factor. The EPIC MOS light curves were also examined, but these are essentially identical to the pn light curves except with a lower signal-to-noise ratio. Therefore, in the rest of this paper the analysis concentrates on the pn data, but the MOS data are used to confirm the pn results of Section 5. The full-band light curves are shown in Fig. 1. The revolutions provided three uninterrupted light curves of 80-, 123- and 124-ks duration.
As discussed in Fabian et al. (2002), BeppoSAX observed MCG-6-30-15 simultaneously with XMM-Newton, but these data are not used in the present paper. The LECS and MECS data cover approximately the same bandpass as the EPIC (but with a lower signal-to-noise ratio) and the higher-energy PDS data (with a count rate of ∼0.7 count s−1) do not have sufficient statistics to determine the high-energy variability properties accurately. In addition, analysis of the BeppoSAX data is complicated by the poor sampling of the light curves (owing to repeated Earth occultations and SAA passages), indeed the amount of ‘good’ exposure time obtained for the PDS was 40 ks spread over a >400 ks observing period.
3 Basic Temporal Properties
The source shows a factor of 5 variation between minimum and maximum flux (both occurring during revolution 303), and significant variability can be seen on time-scales as short as ∼100 s.
The fractional excess rms variability amplitude (; Edelson et al. 2002) and mean count rate were calculated for each of the three revolutions and are tabulated in Table 1. The errors on Fvar were estimated using the prescription given by Edelson et al. (2002, but see their appendix A for caveats on the interpretation of these estimates). The medium band showed the highest variability amplitude. This can also been seen in fig. 6 of Fabian et al. (2002), where the fractional rms spectrum peaked in the 0.7–2.0 keV band.
3.1 Radiative Efficiency
Fig. 2 shows a rapid ‘event’ from the revolution 302 light curve. During this the source changed in 0.2–10 keV luminosity by ΔL≈ 1.5 × 1042 erg s−1 over 100 s. This was used to estimate the radiative efficiency η of the source, assuming photon diffusion through a spherical mass of accreting matter: ΔL/Δt≲η× 2.1 × 1042 erg s−2 (Fabian 1979; Brandt et al. 1999). This gave a limit of η≳ 0.7 per cent, below the expected maximum for accretion on to a black hole (6 and 30 per cent for non-rotating and maximally rotating black holes, respectively; Thorne 1974).
3.2 Rms-flux correlation
Using the 100-s resolution light curves the mean count rate and σrms were calculated in bins of 15 data points. The values of σrms from all three orbits were then binned as a function of count rate such that there were 20 measurements per bin, and an error was assigned based on the scatter within each bin (equation 4.14 of Bevington & Robinson 1992). Fig. 3 shows the resulting correlation between σrms and count rate in the four energy bands. The significance of the correlation was tested using the Spearman rank-order correlation coefficient, rs, and the Kendall τ coefficient (Press et al. 1992). Both of these indicate that the correlation is significant at ≳99.5 per cent confidence in all energy bands. The effect of the rms-flux correlation can be seen in the light curves (Fig. 1): the peaks of the light curves appear more variable (‘jagged’) while the troughs appear much smoother.
Table 2 shows the results of fitting a linear model to these data; in all cases the linear model gives a good fit to the data, with a y-axis intercept consistent with zero. The gradient of this relation corresponds to the Fvar measured only on these time-scales (between 100 and 1500 s). The 1σ upper limit on any constant flux offset (x-intercept) is 1.2 count s−1 for the full band, or 6.2 per cent of the total count rate in that band. In the three energy subbands the upper limits on the constant flux offset are as follows: 1.04 count s−1 (12 per cent), 0.09 count s−1 (1.3 per cent) and 0.43 count s−1 (12.8 per cent) for the soft, medium and hard bands, respectively.
4 Power Spectral Properties
4.1 Estimating the power spectrum
The PSD (sometimes known as the autospectrum) describes the amount of variability ‘power’ as a function of Fourier frequency. The PSD and autocorrelation function (ACF) are Fourier pairs, i.e. they are Fourier transforms of one another. For a general introduction to Fourier methods of time series analysis see, e.g., Priestley (1981) and Bloomfield (2000), and for a review of Fourier techniques as applied to X-ray time series analysis see van der Klis (1989).
4.1.1 Measuring the periodogram
If the light curve x(ti) is evenly sampled (with a sampling period ΔT) then the PSD can be estimated by calculating the periodogram.1 This is the normalized, modulus-squared of the discrete Fourier transform (DFT) of the data. The DFT of the light curve, X(fj), is calculated as (see Press et al. 1992)
The transforms are normalized by (ΔT/N)1/2 so that the amplitudes are independent of the sampling and length of the light curve. The DFT is calculated at N/2 evenly spaced frequencies fj=j/NΔT (where j = 1, 2, …, N/2). Thus the highest frequency accessible to the DFT is the Nyquist frequency, fNyq=fN/2 = 1/2ΔT, and the lowest frequency is f1 = 1/NΔT. Note that it is customary to subtract the mean flux from the light curve before calculating the DFT, this eliminates the zero-frequency power.equation (4) and equation (5) to account for the data sampling.]
This normalization (defined in van der Klis 1997, see also Miyamoto et al. 1991) is most commonly used in analysis of AGN and X-ray binaries because the integrated periodogram yields the fractional variance of the data (i.e. F2var). This normalization will be used throughout this paper. The factor of 2 arises because this is a ‘one-sided’ normalization; the periodogram is defined for both negative and positive frequencies, with this normalization the integral over only positive frequencies yields the fractional variance. The integrated periodogram between two frequencies, say f1 and f2, yields the contribution to the fractional variance owing to variations between the corresponding time-scales 1/f2 and 1/f1 (this follows from Parseval's theorem, see e.g. van der Klis 1989). The units for the periodogram ordinate are (rms/mean)2 Hz−1 (where rms/mean is the dimensionless quantity Fvar) or simply Hz−1.
4.1.2 Poisson noise
If the time series is a continuous photon counting signal binned into intervals of ΔT, as it is here, the effect of Poisson noise is to add an approximately constant amount of power to the periodogram at all frequencies. Thus, at high frequencies, where the red-noise spectrum of the AGN is weak, the observed periodogram will be dominated by the flat (‘white’) Poisson noise spectrum. With the above normalization this constant Poisson noise level is . This formally assumes zero background flux, but for the present observation this is a reasonable approximation (even in the hard band the background count rate is ∼0.6 per cent that of the source).
4.1.3 Scatter in the periodogram
A periodogram measured from a single light curve shows a great deal of scatter around the underlying PSD. This is a natural outcome of measuring the periodogram from a single realization of a stochastic process. The result of this is that the periodogram at a given frequency, P(f), is scattered around the underlying PSD, , following a χ2 distribution with two degrees of freedom (dof) (see Section 6.2 of Priestley 1981):2 The expectation value of the periodogram at a frequency f is thus equal to the PSD but so is its standard deviation, as a result the periodogram shows a great deal of scatter. This fundamental property of the periodogram is discussed in terms of X-ray time series analysis by, for example, Leahy et al. (1983), van der Klis (1989), Papadakis & Lawrence (1993), Timmer & König (1995) and Stella et al. (1997). In addition the periodogram is an inconsistent estimator of the PSD because this intrinsic scatter does not decrease as the number of data points in the light curve is increased. The periodogram can, however, be averaged (binned over separate light curves or adjacent frequencies) to reduce this scatter and produce a consistent estimate of the PSD (Jenkins & Watts 1968).
In GBHC analysis the light curves are typically broken into many separate segments and periodograms are calculated for each segment. At a given Fourier frequency the periodogram estimates from each light curve are identically and independently distributed (assuming the variability is stationary) and so their average will become Gaussian (following the central limit theorem). Therefore, averaging the periodograms from many independent light-curve segments will produce a consistent estimate of the PSD with Gaussian errors (e.g. van der Klis 1997). However, dividing up the light curve limits the lowest frequency probed by the periodogram and so is not so suitable for AGN analysis where it is crucial to examine the widest frequency range possible.
An alternative technique is discussed by Papadakis & Lawrence (1993) and is in some sense ‘optimal’ for red-noise light curves such as those from AGN. In this method a single periodogram is computed from the light curve, reaching down to the lowest accessible frequencies, and periodogram estimates at consecutive frequencies are averaged. For evenly sampled data the periodogram estimates at each Fourier frequency are independently distributed (according to equation (8)) and so the averaged periodogram estimate in each frequency bin will tend to a Gaussian distribution. In the method of Papadakis & Lawrence (1993) it is the logarithm of the periodogram that is binned, such that each bin contains N⩾ 20 periodogram estimates, and an error is assigned based on the scatter within each bin. Using the logarithm of the periodogram means the distribution of the binned periodogram converges to a Gaussian with fewer data points per bin, and therefore allows for finer frequency binning. This binning method is used throughout this paper to produce binned periodogram estimates with errors. (See also van der Klis 1997 for more on binned periodogram estimates.)
4.1. Bias in the periodogram
A further point is that periodograms measured from finite data tend to be biased by windowing effects that further complicate their interpretation (van der Klis 1989; Papadakis & Lawrence 1993; Uttley et al. 2002). The observed light curve [f(t)] is the true, continuous light curve of the source [l(t)] multiplied by the sampling, ‘window’, function [w(t)]:
This leads to two main effects. The first is because of the discrete sampling of the data. If the light curve is not contiguously sampled then power from frequencies above the Nyquist frequency can be folded back, or ‘aliased’, into the observed frequency range. For binned, continuous light curves such as those presented here, the aliasing will be negligible (see van der Klis 1989). In any event, from continuous data the very high-frequency spectrum will be dominated by the Poisson noise level and not aliasing.
The second effect is ‘red-noise leak’ and arises as a result of the finite duration of the light curve. Power is transferred from low to high frequencies by the lobes of the spectral window function (see, e.g., Deeter & Boynton 1982; van der Klis 1997). If there is significant power at frequencies below the lowest frequency probed by the periodogram (i.e. on time-scales longer than the length of the observation), this can manifest itself as slow rising or falling trends across the light curve. These trends will contribute to the total variance of the light curve (and thus the total power) but, as the periodogram does not extend to such low frequencies, the power will be transferred into the observed frequency bandpass (i.e. it will ‘leak’ to higher frequencies). Red-noise leak tends to add a constant component to the observed periodogram in the form of a power law of slope α = 2 (i.e. the PSD of a linear trend; see van der Klis 1997 for more details), the amplitude of this depends on the PSD of the source at low frequencies (see also Section 3.3 of Papadakis & Lawrence 1995). The effect of red-noise leak on the periodogram can be accounted for using the Monte Carlo simulation procedure outlined below.
4.2 PSD fitting procedure
The method employed in the present work is similar to the methods discussed by Done et al. (1992), Green, McHardy & Done (1999) and Uttley et al. (2002). A model PSD is assumed, multiple realizations (light curves) are simulated using this PSD and are resampled to match the original data (thereby including the distorting effects of the sampling). Periodograms are then calculated from the simulated light curves in the same manner as the real data, averaged over the realizations, and this ‘average distorted model’ (ADM), which includes the effects of red-noise leak, is compared with the original periodogram using a goodness-of-fit measure. This method therefore accounts for the biases in the periodogram introduced by the window function. It is important to bear in mind that, as with any model-fitting procedure, the results depend on the assumptions made, i.e. the choice of specific models to test.
The errors derived from the periodograms (see Section 4.1.3) are approximately Gaussian (Papadakis & Lawrence 1993) and therefore the χ2 statistic can be used to estimate the goodness of the fit. Once a reasonable fit is found, uncertainties on the model parameters can be estimated using the Δχ2 criterion (e.g. Lampton, Margon & Bowyer 1976), as in X-ray energy spectrum fitting.
The algorithm described by Timmer & König (1995) was used to generate the simulated light curves from the model PSD. This algorithm generates a random time series from an arbitrary broad-band PSD, correctly accounting for the intrinsic scatter in the powers (i.e. equation (8)), and is computationally very efficient. The effects of red-noise leak can be accounted for by extending the PSD model to much lower frequencies than required, generating light curves much longer than the observed light curve and using only a segment of the required length. Data simulated in such a fashion will include power on time-scales much longer than those probed by each segment alone.
The fitting procedure is as follows.
A model PSD is chosen and 500 random light curves are generated3 using the algorithm of Timmer & König (1995). The simulated light curves were of ≈26 times longer duration than the orbital light curves in order to account for red-noise leak.
From each of the simulated light curves a section from the middle is resampled to match the data. Specifically, from each (long) simulated light curve three consecutive light curves were extracted, to match the three consecutive revolutions, using exactly the same sampling as the real data. The logarithmically binned periodogram is computed in exactly the same way as for the real data (Section 4.3).
The 500 binned periodograms are then averaged to produce the ADM — this represents the periodogram after being distorted by the light-curve sampling (folded through the window function).
The constant Poisson noise level is added and the comparison between the ADM and the data is then made by calculating the χ2 of the fit.
The normalization of the ADM is then adjusted to minimize the χ2 (keeping the Poisson noise level fixed).
The χ2 of the fit is then evaluated over a grid of PSD models and the minimum, corresponding to the best-fitting model, is found.
This method differs slightly from the method of Uttley et al. (2002). First, because the XMM-Newton light curves are continuous the effects of aliasing are negligible, thus the simulated light curves were generated with the same time resolution as the data (100 s). The sparsely sampled RXTE data used by Uttley et al. (2002) meant that aliasing was a significant factor and needed to be accounted for in the simulations. The second major difference is that Uttley et al. (2002) calculated errors on the ADM (based on the scatter in the simulated periodograms), and these were used to estimate the goodness of fit. In the present analysis there are enough data points to apply the Papadakis & Lawrence (1993) binning (with N = 20) and estimate the error on the observed periodogram directly from the data, for use with χ2-fitting.
The χ2-fitting employed in this method relies on the data being binned sufficiently. It would, in principle, be possible to make use of the maximum frequency resolution offered by the unbinned periodogram by choosing a different goodness-of-fit estimator (e.g. deriving the rejection probability directly from the Monte Carlo trials, as in Uttley et al. 2002). However, the results (the best-fitting parameters) are model dependent in any case and so using a different method is unlikely to yield any further information.
4.3 The periodogram of MCG-6-30-15
The periodogram of the full-band data was computed by calculating the periodogram for each revolution of data (following equation (7)). The light curves from each revolution, when taken separately, are perfectly evenly sampled and uninterrupted. This means that for each light curve the periodogram estimates are independent at each Fourier frequency, and they are also independent between the periodograms from the three light curves. The periodograms for the three light curves were therefore combined to produce a single, binned periodogram by sorting the periodogram points by frequency and then applying the logarithmic binning (Papadakis & Lawrence 1993).
Furthermore, no additional windowing was applied to the data (see Section 13.4 of Press et al. 1992 and Section 7.5 of Priestley 1981). Windowing is often used to suppress the leak of power from one frequency to another, as it suppresses the side-lobes of the spectral window function W(f) and therefore the amount of power displaced to distant frequency bins. However, in exchange for this the central lobe of the spectral window function is broadened, meaning more power is lost to neighbouring frequency bins. This transfer of power between consecutive frequency bins means they are no longer independently distributed, which is required by the binning method applied above.
In order to accurately estimate the Poisson noise level, the periodograms were first calculated using the full-band light curves with 10-s resolution. This allows the periodogram to extend up to high frequencies, at which the variability is dominated by Poisson noise. Fig. 4 shows the periodograms from each of the three orbits (binned by N = 50) and the combined periodogram (binned by N = 100 to improve the signal-to-noise ratio). The red-noise PSD of the source can be seen below ∼3 × 10−3 Hz, above which the (white) Poisson noise spectrum dominates. Also marked are the predicted noise levels assuming just Poisson noise in the data. These show that the predicted noise level is clearly an accurate estimate of the true noise level in the data. In the following analysis only the 100-s resolution light curves are used, since above the Nyquist frequency (5 × 10−3 Hz) the periodogram is dominated by the Poisson noise and not the source variability, and the Poisson noise level is assumed to be at the expected value.
Fig. 5 shows the raw periodograms (binned by N = 20), calculated by combining the periodograms from each revolution. The red-noise PSD of the source is clearly detected above the Poisson noise level at frequencies below few × 10−3 Hz in all bands. In the following sections these ‘dirty’ periodograms were fitted using the Monte Carlo procedure in order to derive the properties of the PSD in a robust manner.
4.4.1 The full-band PSD of MCG-6-30-15
As expected, the periodograms shown in Fig. 5 show an approximately power-law spectrum for the intrinsic (source) variability. Various trial models were fitted to the data (the models are illustrated in Fig. 6), using the procedure outlined above to constrain the PSD. The first trial model was a power law (model 1):Table 3, this model provided an unacceptable fit to the data (>99.99 per cent rejection probability). Fig. 7 shows the variation in χ2 with α (for 79 degrees of freedom). The χ2 of the fit changes little for PSD slopes steeper than 2. This is an effect of the red-noise leak. If the PSD continues to very low frequencies as a steep power law (α > 2), a significant amount of power will leak into the observed periodogram in the form of an α = 2 power law. This red-noise leak contribution will cause the observed periodogram to have a slope of α≈ 2 even when the true underlying PSD slope is much steeper. Thus, if the power-law PSD continues unbroken to low frequencies, it is difficult to distinguish slopes steeper than 2 in the fitting.
Fig. 8 shows the best-fitting power-law model, with a slope of α = 2.1. This steep slope implies that the PSD must break at low frequencies or the integrated power would diverge. Therefore, PSD models including a flattening at low frequencies were explored in an attempt to constrain the frequency of the break. The residuals from the power-law fit show that the model is too flat above 2 × 10−4 Hz and too steep at lower frequencies, suggesting the PSD may break in the observed frequency range.
Indeed, based on long time-scale RXTE monitoring, Uttley et al. (2002) found the best-fitting PSD model for MCG-6-30-15, in the 2–10 keV band, was a broken power law. When the slope below the break was fixed at αlow = 1 the slope above the break was found to be α = 2.0 ± 0.3, with a break frequency in the range 1.3 × 10−5 to 1.0 × 10−4 Hz (90 per cent confidence limits). Therefore, the next trial model fitted was a broken power law (model 2)Table 3. The model provided an acceptable fit to the data (77 per cent rejection probability) and the improvement in the fit was Δχ2 = 73.6 for the addition of one free parameter (the break frequency), which, according to the F-test, is significant at >99.99 per cent confidence. Fig. 9 shows the fitting residuals using this model and the confidence contours for the model parameters are shown in Fig. 10.
Although there are only two periodogram points at frequencies below the break, the detection of the break is nevertheless highly significant. This is because the high-frequency PSD slope is observed to be steeper than 2, implying that the effect of red-noise leak is not strong. The steep PSD observed at high frequencies cannot continue below the lowest observed frequencies; if the PSD remained steep to low frequencies the contribution from red-noise leak would cause the periodogram to resemble an α= 2 power law over the whole observed frequency range. The steep PSD at high frequencies must therefore break to a flatter slope close to the lowest frequencies probed in order for the high-frequency periodogram to have a steep slope.
The slope of the PSD below the break, however, is not well constrained from the XMM-Newton data. Fitting the data with a PSD model that breaks from a slope of αlow = 0 (model 3) gives a similarly acceptable fit but with a slightly lower break frequency (Table 3, Fig. 11). As these two models both give good fits it is not possible to distinguish between them based on these data alone. However, it is interesting to note that the break frequency found from the RXTE data is consistent with the break frequency found from the XMM-Newton data when the PSD is assumed to break to αlow= 1. The break frequencies found from fitting the model assuming a break to αlow= 0 are inconsistent between RXTE (fbr = 5.1+5.1−2.6× 10−6 Hz) and XMM-Newton(fbr = 5+3−1× 10−5 Hz).
The exact shape of the break is not well defined from these data. As an alternative to a sharply broken power law, a model representing a power law with a smooth break from a slope of αlow = 1 to α was tested (model 4):Table 3, Fig. 12), comparable to the models including a sharp break, and the break frequency was consistent with that found for model 2. Table 3, Fig. 13). The best-fitting slope for this model is much flatter than for the previous models, and the cut-off frequency is much higher than the break frequencies in the broken power-law models. That this model provides a good fit to the data is perhaps suggesting that the PSD of MCG-6-30-15 steepens further at high frequencies.
4.4.2 Energy dependence of the PSD
On the basis of the XMM-Newton data alone, PSD models 2–5 all provide reasonable fits to the full-band data. In the following analysis, model 2 is fitted to the data from the soft, medium and hard energy bands. in order to provide a uniform parametrization of the PSD at different energies. The residuals are shown in Fig. 14 and the results from these fits are shown in Fig. 15 and Table 4.
The fits to the soft- and hard-band data using model 2 are somewhat worse than the fit to the full-band data. However, as can been seen from Fig. 14, there are no obvious, strong, systematic residuals. Refitting these data with model 4 provided very similar fits and refitting using model 5 gave a slightly better fit to the hard-band data but slightly worse fits to the soft and medium bands. The possibility remains that there is some marginally significant structure in the PSDs at high frequencies (around ∼4 × 10−3 Hz in Fig. 14), not accounted for in these simple continuum models, as was also suggested for the high-frequency PSD of NGC 7469 by Nandra & Papadakis (2001). These points notwithstanding, model 2 was used to parametrize, in a simple and uniform fashion, the PSD as a function of energy.
The break frequencies as determined in each energy band are consistent with one another. However, the slope of the PSD above the break shows significant changes between the energy bands, with the hardest band showing the flattest slope. Table 4 also gives the total variability powers, in Fvar form, derived from each band (cf. Table 1). For each band the total power was estimated by integrating a binned periodogram (after subtracting off the constant Poisson noise level) to give Fvar,1, and also by integrating the PSD model (excluding the Poisson noise component) over the frequency range 10−5 to 5 × 10−3 Hz to give Fvar,2. As was noted in Section 3, the medium band contains the highest total variability power.
4.4.3 Low-frequency PSD
The binned periodograms used above reached down to 3 × 10−5 Hz in the lowest-frequency bin. It would, in principle, be possible to treat the three consecutive light curves as a single, longer light curve (top panel of Fig. 1) and use this to reach even lower frequencies in the power spectrum. However, this three-revolution light curve is no longer uninterrupted, owing to the gaps between revolutions. This means that the standard DFT (as discussed in Section 4.1.1) is no longer suitable. There do exist periodogram estimators designed to deal with data containing gaps, such as the Lomb-Scargle periodogram (Lomb 1976; Scargle 1982) but there are drawbacks to using such techniques. First, the gaps in the sampling mean the periodogram estimates at each frequency are no longer independent of one another (Scargle 1982), so the binning and error estimation used above is not strictly valid. Secondly, the more complicated window function introduces much greater distortion on the low-frequency periodogram, which would severely limit the amount of information that could be recovered from the lowest frequencies. For these reasons the results presented in this paper are based entirely on the more robust, standard DFT method, but for completeness the Lomb-Scargle periodogram (computed using the algorithm of Press & Rybicki 1989) of the full-band light curve is shown in Fig. 16. Down to low frequencies this appears to be broadly consistent with the results presented above.
The fundamental assumption underpinning the above analysis is that the variability process is stationary. A stationary process is one where the statistical properties do not depend on time (see Section 1.3 of Bendat & Piersol 1986 for more on stationarity in random data). Specifically, the assumption made here is that the PSD is stationary, i.e. that the light curves are all realizations of the same process. The rms-flux correlation found in Section 3.2 shows that the variability of MCG-6-30-15 is in some sense non-stationary (the variance does change), but in a ‘well-behaved’ fashion. Indeed, the linear correlation between σrms and means that the average Fvar is constant. In other words, the variations, when normalized by the local flux, appear stationary.
The method outlined in Appendix A of Papadakis & Lawrence (1995) was used as a simple check of the validity of this assumption. If the PSD is constant then the (unbinned) periodograms obtained from each revolution should be identical except for the scatter dictated by equation (8). Papadakis & Lawrence (1995) define the statistic S from the logarithm of the ratio of two periodograms, summed over a range of frequencies.Papadakis & Lawrence 1993).
If the underlying PSD is the same in the two periodograms then S should be normally distributed with a mean of zero and a variance of unity.
A normalized periodogram was computed for the first 80 ks of each of the three revolutions (the light curves were clipped to equal duration so that their periodograms had identical Fourier frequencies) and S was measured for frequencies below 10−3 Hz, where the source variability dominates over the Poisson noise. Comparing rev. 301 and 302 gave S = 1.18, comparing rev. 301 and 303 gave S=−0.26 and comparing rev. 302 and 303 gave S=−1.44. These three estimates are all within 2σ of the expected value assuming stationarity in the data. Therefore, to first order at least, the assumption of stationarity in these data seems a reasonable one.
5 Cross-Spectral Properties
5.1 The cross-spectrum
The PSD is often called the autospectrum because it is the Fourier transform of the light curve, multiplied by itself (equation (6)). The cross-spectrum is a related tool used for comparing the properties of two simultaneous light curves, such as from different energy bands, say x(t) and y(t). The product of the Fourier transforms of the two light curves gives the cross-spectrum (cf. equation (6)):
The cross-spectrum is complex valued and can be represented in the complex plane by an amplitude and a phase. The complex Fourier transforms of the two light curves can be written as X(f) =|X(f)| e and Y(f) =|Y(f)| e and the cross-spectrum can be written as
The phase component of the cross-spectrum represents the phase difference between the two light curves (which corresponds to a time delay at that frequency) and is discussed more fully in Section 5.3. The squared magnitude of the cross-spectrum is used to define the ‘coherence’ of the two light curves.
5.2.1 The meaning of coherenceequation (18) defines the linear correlation coefficient of X(f) and Y(f) (see Section 9.1 of Priestley 1981).4 The coherence can thus be thought of as a measure of the degree of linear correlation between the two light curves as a function of Fourier frequency, and is related to the amplitude of the CCF.
This means that if the two light curves are related by a single, linear transfer function (e.g. a delay or a smoothing) then they will have unity coherence. In other words, if knowledge of one light curve can, in principle, be used to predict the other, the two are said to be perfectly coherent. If, on the other hand, the light curves contain contributions from completely independent processes then the coherence will be less than unity.
Another way of thinking about coherence is in terms of equation (17). If the light curves are related by some simple transform then, at a frequency f, there will be a fixed delay between the light curves, i.e. the lag will be the same in each realization of the process. If the phase difference is constant then the value of the numerator of equation (18) (the squared mean of equation (17)) is equal to the denominator and hence the coherence is unity. The interpretation of coherence is illustrated more fully in Vaughan & Nowak (1997) and Nowak et al. (1999a).
5.2.2 Estimating the coherence
In an analogous manner to the PSD, the cross-spectrum is estimated using the cross-periodogram. If the two discretely sampled light curves, x(ti) and y(ti), have DFTs X(fj) and Y(fj) then the cross-periodogram is given by the discrete form of equation (16), namelyequation (5). The coherence is estimated using the discrete form of equation (18): Vaughan & Nowak (1997). In the following analysis the noise-corrected coherence and its uncertainty were estimated using equation (8) of Vaughan & Nowak (1997). The coherence was calculated only for frequencies below 10−3 Hz, above this frequency the Poisson noise power becomes comparable to the intrinsic source variability and equation (8) of Vaughan & Nowak (1997) becomes increasingly inaccurate.
As mentioned above, the coherence needs to be averaged in some fashion. For the present analysis the cross- and autospectra were calculated separately for each revolution. The data from the three revolutions were then combined by sorting in order of frequency and then averaged in bins of N = 40 frequency points.
5.2.3 Coherence properties of MCG-6-30-1
The coherence function was calculated for the three possible combinations of the three energy bands (soft/medium, soft/hard and medium/hard). This was repeated using data from MOS1 and MOS2 as well as the pn and the results were consistent between the detectors. (The coherence was also estimated individually for each revolution of data and the results were consistent between the three revolutions.) The resulting coherence functions are shown in Fig. 17. At low frequencies the coherence is close to unity for all energy bands. At higher frequencies the coherence becomes much lower, particularly between the soft and hard bands. For comparison, Fig. 17 shows the function γ2(f) = exp(−f/8 × 10−4 Hz) compared with the coherence between soft and hard bands.
This apparent loss of coherence at high frequencies could be an effect of Poisson noise in the data, since at high frequencies the intrinsic source power is weaker compared with the noise power. In order to test this possibility the coherence was evaluated by comparing light curves in identical energy ranges from the three different detectors. The light curves extracted from the three detectors should be identical, in a given energy band, except for Poisson noise. Therefore, the coherence should be unity and any deviation from unity will be a result of Poisson noise. As can been seen from Fig. 18, the coherence is close to unity until the highest frequencies, and the deviation from unity coherence at high frequencies is much smaller than the observed loss of coherence between the different energy bands. This, and the fact that the loss of coherence is repeated in all three detectors suggests that Poisson noise is not seriously biasing these results, and that the error bars are reasonable estimates of the true uncertainties.
Simulated data were used as an additional test of the accuracy of the coherence estimation. Two simulated light curves were constructed, representing light curves in two different bands, and the coherence was calculated for them exactly as for the real data. These simulations were used as a further test of the effect of Poisson noise and to measure the effect of different PSD slopes on the coherence function.
The effect of Poisson noise was tested by simulating coherent data in two bands using the same PSD. The PSD model used was taken from the fit to the soft-band data using model 2 (Table 4). Two long light curves were generated using this PSD and an identical random number sequence. The two light curves, having identical PSDs as well as Fourier phases and amplitudes, were thus identical and therefore have unity coherence at all frequencies.
The simulated light curves for the two bands were resampled to match the window function of the real data, i.e. three consecutive light curves were extracted from each to band match the three revolutions of data. The light curves for the first band were rescaled to match the average count rate of the soft-band light curve and the light curves from the second band were rescaled to match the count rate of the hard-band data. (The simulated light curves were produced using the absolute PSD normalizations found from fitting the data. Thus the absolute variability amplitude of the rescaled simulated data is as expected based on the real data.) Poisson noise was then added independently to the simulated light curves (i.e. the number of counts in each time bin was randomized according to the Poisson distribution). These two sets of simulated data represent soft- and hard-band light curves in terms of their Poisson noise, but intrinsically are perfectly coherent and have the same PSD.
100 sets of simulated data were produced and for each set the coherence and its error were calculated as discussed in Section 5.2.2 (i.e. using the recipe of Vaughan & Nowak 1997). The average and standard deviation of the coherence estimates from the 100 sets of simulated data are shown in the top panel of Fig. 19. Also shown are the average error bars. The coherence is clearly very close to unity over the frequency range examined, and the standard deviation of the estimates is close to the mean error bar, indicating that the Vaughan & Nowak (1997) method did accurately recover the intrinsic coherence from the effect of the Poisson noise. (Note that these error bars are slightly smaller than those derived from the real data because the uncertainty is itself a function of the intrinsic coherence, which is lower in the real data.)
The tests above used intrinsically coherent data with identical PSD shapes. The PSD fits of Section 4.4.2 show that the PSD shape is energy dependent. In order to confirm that the coherence is not biased because of the PSD shape being different in different energy bands, the experiment was repeated using PSDs appropriate for the two bands. Specifically, the best-fitting PSD models for the soft and hard bands (listed in Table 4) were used to generate simulated soft- and hard-band light curves, but again the two sets of data used an identical random number sequence. Thus the simulated soft- and hard-band light curves again have unity intrinsic coherence but different PSDs. As before, 100 sets of simulated data were used and the average coherence and error were calculated. The results are shown in the bottom panel of Fig. 19. The average coherence is again very close to unity, indicating the different PSD slopes in the different bands did not bias the results.
The loss of coherence observed in MCG-6-30-15 is thus not an artefact of Poisson noise in the data, nor the change in PSD slope with energy, and is repeated in each of the three EPIC cameras. As such, in the discussion below, it is assumed that this loss of coherence is intrinsic to the continuum variations of the source.
5.3 Time lags
5.3.1 Estimating the time lags
As mentioned in Section 5.1, the cross-spectrum also contains information concerning the phase differences between the two light curves (equation (17)). This information is similar to the delays measured by the CCF, except measures the delay as a function of Fourier frequency. The phase lag is measured from the phase of the averaged cross-periodogram:equation (25) and the angled brackets denote an averaging over consecutive frequency points and/or light curves. This phase difference can then be converted into a time delay at Fourier frequency f:
For the present analysis, the time lag was calculated using equation (28) and equation (29), with the averaging of the cross-spectrum performed in bins of N= 40 frequency points. The error on the time lag estimate was computed using equation (16) of Nowak et al. (1999a). See Section 9.1.3 of Bendat & Piersol (1986) for a derivation of this equation.
5.3.2 Poisson noise effects
The effect of Poisson noise is to add an independent, random component to the phases of the two light curves and hence randomize the phase difference. This means that small time lags are difficult to detect owing to photon noise. The recipe outlined in Nowak et al. (1999a) (see their Section 4.2) was used to estimate the effective noise limit, below which it is difficult to detect reliable lags. The estimated error on the phase lag is computed using their equation (14):Section 4.4, and coherence estimated using the function given in Section 5.2.3). This estimated uncertainty, which should be considered merely a lower limit on uncertainty introduced by Poisson noise, was compared with the function used in Section 5.3.3 to illustrate the observed time lags and the results are shown in Fig. 20. This indicates that above ∼6 × 10−4 Hz the lags are too small to be detected, i.e. they fall below the sensitivity limit defined by the Poisson noise. A further point is that the coherence is significantly below unity at higher frequencies (≳few 10−4 Hz), in which case any measured time lag is no longer meaningful. Therefore, from these data, significant time lags will only be detectable below a few × 10−4 Hz.
5.3.3 Time lags in MCG-6-30-1
The time lags were computed (for each EPIC detector) for the three combinations of energy bands. The results are shown in Fig. 21. In each test (three combinations of energy bands and three different detectors) there was a significant lag at the lowest frequencies, with the softer band leading the harder band. At higher frequencies the lag becomes consistent with zero. The lag is small, only ∼1 per cent between soft and hard bands (e.g. on a time-scale of 20 000 s the lag is only ∼200 s). This is illustrated in Fig. 21, which shows the function τ(f) = 0.013f−1 compared with the measured time lags between the soft and hard bands.
The time lags were calculated using light curves for the three different detectors in identical energy ranges, to test the possible influence of Poisson noise (see Section 5.2.3). These are shown in Fig. 22. In these tests the lag was consistent with zero over most of the useful frequency range. This, combined with the analysis of Section 5.3.2 and the fact that the lags were recovered independently from each EPIC camera, suggests that the lags observed between different energy bands are not an artefact of Poisson noise (which in any event would destroy any time lag signal).
The simulated data used in Section 5.2.4 to assess the accuracy of the coherence function were also used to assess the accuracy of the time-delay spectrum.
In the first set of simulations the data in the two bands were generated using the same PSD and the same random number sequence. In addition to having unity coherence, the pairs of simulated data also have identical Fourier phases and therefore zero intrinsic phase lag between them. The top panel of Fig. 23 shows the average of the time-delay estimates, their standard deviation and the average error estimate from the 100 pairs of simulated data (including Poisson noise as described in Section 5.2.4). This average spectrum is close to the zero-delay expectation meaning that the Poisson noise did not bias the time-delay measurements, and the average estimated error is close to the standard deviation of the estimates.
In the second set of simulations the pairs of simulated data were generated using different PSDs for the two bands. The bottom panel of Fig. 23 shows the average time-delay spectrum, which again is as expected if the delay estimate is unbiased. It therefore seems reasonable to assume that the time delays observed in MCG-6-30-15 at low frequencies are intrinsic to the source.
6.1 Summary of results
This paper presents an analysis of the X-ray continuum variability of MCG-6-30-15. A strong, linear correlation is found between the rms variability amplitude and the source flux (Section 3.2). The PSD was examined by fitting model power spectra to the data after allowing for the distorting effects of sampling using a Monte Carlo procedure (Section 4). However, the excellent sampling of the XMM-Newton light curves means that these biases are in any case minimized and so there was a minimum of information lost from the PSD. The PSD is well represented by a power law of slope αlow= 1 at low frequencies, breaking to a slope of α≈ 2.5 at a frequency of fbr≈ 10−4 Hz. However, it should be re-iterated that these PSD results are model dependent, and that parametrizing the PSD using a different model yields slightly different results.
The PSD is energy dependent, with the variations in the higher-energy band showing a flatter PSD slope (above the break frequency) than variations in the softer bands. There is no evidence for any energy dependence of the break frequency. Fig. 24 illustrates the PSD shape and energy dependence. Fitting the PSD as a function of energy revealed tentative hints of energy-dependent, high-frequency structure in the PSD (Section 4.4.2). This could be caused by subtle instrument effects or may represent low-amplitude structure intrinsic to the source PSD (see also Nandra & Papadakis 2001). These time-scales (≳10−3 Hz) are comparable to the dynamical time-scale at the innermost stable orbit around the black hole. However, further progress on the very high-frequency variability of relatively faint sources such as Seyfert galaxies must await future high-throughput missions such as Constellation-X and XEUS.
The variations observed in different energy subbands are highly coherent at low frequencies (long time-scales) but the coherence falls significantly below unity above a few × 10−4 Hz, close to the break in the PSD (Section 5.2). The high coherence between soft and hard bands at low frequencies means that the two bands are linked by a single, linear transfer function. The direction and magnitude of the time lag shows that the hard photons lag the soft photons, on average, by ∼200 s at a frequency f≈ 6 × 10−5 Hz (Section 5.3). It is not possible to reliably constrain the frequency dependence of the time lags, but the data seem consistent with an approximately τ∝f−1 dependence of the lag [i.e. a constant phase difference φ(f)] similar to that seen in the Seyfert galaxy NGC 7469 (Papadakis et al. 2001) and in GBHCs (Nowak et al. 1999a; Pottschmidt et al. 2000). On shorter time-scales the coherence falls well below unity, implying that there is no simple transfer function relating soft and hard bands.
6.2 Comparison with previous analyses
Uttley et al. (2002) presented an analysis of long time-scale monitoring of MCG-6-30-15 using RXTE. The PSD was found to be best described by a broken power law with slope of αlow = 1 at low frequencies, breaking to α = 2.0 ± 0.3 at a frequency fbr = 0.13-1.02 × 10−4 Hz. Using the same PSD model, the break frequency found from the high-frequency XMM-Newton data is fbr = 0.6 − 2.0 × 10−4 Hz, consistent with the RXTE result. The slope of the PSD above the break is different, but this is most likely an effect of the energy dependence of the PSD. Indeed, using the 2–10 keV XMM-Newton data (i.e. the same energy range used by Uttley et al. 2002) the slope of the high-frequency PSD is α≈ 2.10 ± 0.15, which is consistent with the RXTE result.
The break frequency presented here (and in Uttley et al. 2002) is model dependent. This is because a model PSD had to be assumed and fitted to the data to derive the appropriate parameters. However, the broken power-law model (with a break to αlow = 1 at low frequencies) provides consistent results when applied to the long time-scale RXTE monitoring and the short time-scale XMM-Newton observation. This, and the similarity of the observed PSD to those of GBHCs (see Section 6.3) suggests that the chosen model (and therefore the derived break frequency) is reasonable.
Nowak & Chiang (2000) used RXTE and ASCA data to measure the PSD of MCG-6-30-15. They found the PSD to break from α≈ 0 to 1 at a frequency ∼10−5 Hz and then break again to α≈ 2 in the range 10−4–10−3 Hz, The position of the high-frequency break, and the PSD slopes, are roughly consistent with those found from the XMM-Newton data. However, as noted by Uttley et al. (2002), the method employed by Nowak & Chiang (2000) did not account for the distorting effects of sampling (see Section 4.1.4) and underestimated the errors at low frequencies. Once these effects are included the low-frequency flattening is no longer significant (Section 6.2 of Uttley et al. 2002). Nowak & Chiang (2000) also place a limit on any time delay between soft (0.5–2.0 keV) and hard (8–15 keV) bands of τ < 2000 s, consistent with the time lag results presented here. Reynolds (2000) previously claimed variations in the 2–4 keV band led those in the 8–15 keV band by 50–100 s based on an RXTE observation of MCG-60-30-15.
Hayashida et al. (1998) also estimated the PSD of MCG-6-30-15, this time using Ginga data. They found a broken power law to be a good fit to the data. However, these authors also did not account for the biases in the periodogram, which will be significant because of the patchy sampling of the Ginga light curves. The unusually flat low-frequency slope (α=−0.5) found by these authors is most likely a result of bias in the low-frequency periodogram, and therefore the position of their claimed break frequency should be considered with some caution.
The only other AGN for which detailed cross-spectral analysis has been performed is NGC 7469 (Nandra & Papadakis 2001; Papadakis et al. 2001). In this object the PSD does not show any significant breaks, but does show the same energy dependence as found in MCG-6-30-15. The present analysis shows that the energy dependence of the PSD is caused by the slope of the PSD becoming flatter at higher energies, rather than a change in the break frequency. In addition, NGC 7469 shows time lags between energy bands in the same direction (soft leading hard) and with the similar relative magnitude (∼1 per cent) as in MCG-6-30-15. Papadakis & Lawrence (1995) showed that in NGC 4051 the harder band showed a flatter PSD than the softer band. Recently, McHardy et al. (in preparation) used XMM-Newton data of NGC 4051 and also found this energy dependence of the PDS.
6.3 Comparison with Cygnus X-1
Since EXOSAT first showed the medium time-scale PSDs of Seyfert galaxies to have a power-law form, their timing properties have been compared with those of GBHCs. Now, with long time-scale monitoring available from RXTE (Edelson & Nandra 1999; Nandra & Papadakis 2001; Uttley et al. 2002) and short time-scale observations possible with XMM-Newton (this paper) it is possible to re-assess this comparison.
The best studied GBHC, Cyg X-1, when in its low/hard state shows a PSD with an approximately broken power-law form. The PSD breaks from a low-frequency slope of α≈ 0 to 1 and then to α≈ 2 at high frequencies. The position of the high-frequency break is typically fbr≈ 3 Hz (Belloni & Hasinger 1990; Nowak et al. 1999a). The comparable break in the PSD of MCG-6-30-15 occurs at fbr≈ 10−4 Hz. This suggests that the characteristic time-scales in these two systems differ by a factor of ∼3 × 104. Under the assumption that the time-scales scale linearly with the mass of the central black hole, and assuming Cyg X-1 to contain a10 M⊙ black hole (Herrero et al. 1995), then suggests the mass of the black hole in MCG-6-30-15 is ∼3 × 105 M⊙ (in the range 1.5–5.0 × 105 M⊙ using the 90 per cent limits on the break frequency in MCG-6-30-15). Assuming a bolometric luminosity of 4 × 1043 erg s−1 for MCG-6-30-15 (Reynolds et al. 1997; converting to H0 = 75 km s−1 Mpc−1) then implies that MCG-6-30-15 is radiating at the Eddington luminosity. However, since independent mass estimates (such as from reverberation mapping) are not available for MCG-6-30-15 it remains possible that this mass estimate is inaccurate if the relevant time-scales do not scale linearly with black hole mass (as assumed above).
If the frequency of the PSD break is related to the orbital frequency of matter close to where the peak emissivity of the accretion disc occurs, rpeak, then it will depend on rpeak/rin, where rin is the inner radius of the disc. For a standard disc, rpeak/rin∼ 2. This will also introduce a dependence on rin/rg (where rg=GM/c2 is the gravitational radius of the black hole) and thus the spin parameter of the black hole a/M. If the black hole in MCG-6-30-15 has a higher spin parameter than that in Cyg X-1, then the mass deduced from the above scaling could be increased.
The other temporal characteristics seen in MCG-6-30-15 are remarkably similar to those observed in Cyg X-1, implying there is indeed a strong connection between these two types of source. The PSD (above the break frequency) is flatter in harder-energy bands in both MCG-6-30-15 and Cyg X-1 (Nowak et al. 1999a; Lin et al. 2000). The strong rms-flux correlation first seen in X-ray binaries (Uttley & McHardy 2001) is exhibited by MCG-6-30-15 (see Section 3.2). MCG-6-30-15 also displays a loss of coherence between hard and soft bands at high temporal frequencies (i.e. above the break in the PSD) very similar to that seen in Cyg X-1 (Nowak et al. 1999a) and time lags between these bands that appear similar (in direction and relative magnitude) to those observed in Cyg X-1. In particular, the time lags measured in MCG-6-30-15 between soft and hard bands is ∼200 s in the lowest-frequency bin, where the coherence is near unity. In Cyg X-1 the shortest time lags observed are ∼2 × 10−3 s (Nowak et al. 1999a; Pottschmidt et al. 2000) at frequencies just below where the coherence falls significantly below unity. The ratio of these two time lags is ∼105, similar to the ratio between the PSD break time-scales.
It should also be noted that it is not yet clear whether Seyfert galaxies more closely resemble the low/hard or high/soft states of GBHCs (see also the discussion in Uttley et al. 2002). In the high/soft state the PSD of Cyg X-1 is steep at high frequencies (α≳ 2) and flattens to α≈ 1 at fbr≈ 10 Hz, with no further flattening evident at lower frequencies (Cui et al. 1997; Churazov, Gilfanov & Revnivtsev 2001; Reig, Papadakis & Kylafis 2002). This ratio of characteristic time-scales in this case is ∼105 and the corresponding mass estimate for MCG-6-30-15 is ∼106 M⊙, implying a luminosity ∼30 per cent of the Eddington limit. This high (but nevertheless sub-Eddington) accretion rate might be expected if MCG-6-30-15 is indeed in a state similar to the high/soft state seen in Cyg X-1.
It is interesting to note that the best-fitting linear model to the rms-flux correlation observed in MCG-6-30-15 is consistent with passing through the zero-flux origin (Section 3.2). The simplest explanation of this is if there is no significant constant component to the light curve. A strong constant component (∼25 per cent of the source flux) is observed from the rms-flux correlation in Cyg X-1 (Uttley & McHardy 2001), but only when the source is in the low/hard state (Gleissner et al. 2002). This is perhaps also indicating that the variability properties of MCG-6-30-15 are more similar to the high/soft state than the low/hard state of Cyg X-1. (The time-delay properties of Cyg X-1 when in its high/soft state are rather similar to those seen in its low/hard state, see, e.g., Pottschmidt et al. 2000.)
6.4 Physical implications
The X-ray emission mechanism operating in Seyfert galaxies is usually thought to be inverse-Compton scattering of soft photons in a hot corona (e.g. Sunyaev & Titarchuk 1980; Haardt & Maraschi 1991). In the simplest models the harder photons are expected to lag behind the softer photons owing to the larger number of scatterings required to produce harder photons, and the delay should be of the order of the light-crossing time of the corona. The longest observed time delay in MCG-6-30-15 between soft and hard bands is ∼200 s. Assuming a black hole mass of 106 M⊙, this lag corresponds to a light-crossing distance of ∼40rg. Thus the direction and magnitude of the observed lag in MCG-6-30-15 are consistent with an origin in a Comptonizing corona. However, if the lags are frequency dependent (as expected by analogy with Cyg X-1 and also seen in NGC 7469; Papadakis et al. 2001) the lags at lower temporal frequencies would become much longer than expected for a compact corona (see the discussion in Nowak et al. 1999b). Indeed, the simplest such models predict the time delay between soft and hard photons to be independent of Fourier frequency (Miyamoto et al. 1988), contrary to what is observed. In addition, some models of Compton scattering coronae predict that the high-frequency PSD should be steeper for higher-energy photons, owing to the high-frequency fluctuations being washed out by multiple scatterings (Hua & Titarchuk 1996; Nowak & Vaughan 1996), again contrary to the observations.
Alternatively, the time delay between soft and hard bands could be caused by the spectral evolution of individual X-ray events. If the spectrum of the event becomes harder as the event unfolds this can lead to an average lag between soft and hard bands. Such models have been discussed by Böttcher & Liang (1998) and Poutanen & Fabian (1999).
Another alternative is that the delay between soft and hard bands could be caused by a propagation effect, i.e. a ‘trigger’ signal reaching the soft emission region before the hard emission region. In this scenario the magnitude of the lag is related to the propagation speed and the distance between soft and hard emission regions. The following section describes a model based on this kind of signal propagation.
At higher temporal frequencies the coherence falls well below unity, meaning that there is no longer a simple transfer function between the soft and hard-band emission. As a static corona should maintain a transfer function (whatever the physical mechanism), on time-scales longer than the break in the PSD (≳104 s), where the coherence is high, the corona may be effectively static (i.e. it maintains its average properties on these time-scales). On shorter time-scales the corona may be dynamic and so the steep PSD above the break and the loss of coherence may be caused by changes in the corona itself.
The time-scale of the break in the PSD from f−1 to a steeper slope is 104 s, which corresponds to a light-crossing distance of ∼2 × 104rg. The orbital time-scale at 40rg around a 106−M⊙ black hole is torb∼ 103 s, while the thermal time-scale for a standard, thin accretion disc is tth∼ 104 s (assuming a viscosity parameter α = 0.1), which is of the same order as the time-scale of the break in the PSD. The viscous time-scale in such a disc is many orders of magnitude larger than these other time-scales (see the discussion of accretion disc time-scales in Section 5.8 of Frank, King & Raine 1985). However, if that part of the accretion flow responsible for modulating the X-ray emission is geometrically thick, the break in the PSD could correspond to a viscous time-scale [which depends on the ratio of disc height H to radius R as ∼(H/R)2].
6.5 A phenomenological model
Various models have been considered to explain the variability of GBHCs and Seyfert galaxies. These include: various examples of ‘shot noise’ (e.g. Terrell 1972; Lehto 1989; Merloni & Fabian 2001); rotating ‘hotspots’ on the surface of an accretion disc (Abramowicz et al. 1991; Abramowicz 1992; Bao & Østgaard 1995); occultation by moving clouds (Abrassart & Czerny 2000); self-organized criticality in an accretion disc (Mineshige, Ouchi & Nishimori 1994); fluctuations in the accretion rate propagating through the disc (Lyubarskii 1997; Churazov et al. 2001; Kotov, Churazov & Gilfanov 2001) and magnetohydrodynamic instabilities in the inner accretion disc (Hawley & Krolik 2001).
Most of these models were developed to explain only one aspect of the variations (e.g. its red-noise nature). Now, however, it is possible to compare the predictions of these models with the well-determined timing properties of MCG-6-30-15. A viable physical model has to be able to reproduce not only the correct PSD shape and time-scales but also the other temporal characteristics such as the rms-flux correlation, the energy dependence of the PSD, the time lags and the coherence. It is not clear whether any of the above mechanism can explain all the observations, and indeed some models (e.g. the obscuration model of Abrassart & Czerny 2000) seem to offer little hope of explaining properties such as the coherence and time lags.
A simple phenomenological model has been developed by Lyubarskii (1997), Churazov et al. (2001) and Kotov et al. (2001) that seems to explain (qualitatively, at least) the temporal properties of Cyg X-1. As the timing properties of MCG-6-30-15 discussed above are very similar to those of Cyg X-1, it is worth discussing this model as a possible explanation for the behaviour of Seyfert galaxies.
In its high/soft state, Cyg X-1 shows an f−1 PSD over several decades in frequency (Cui et al. 1997; Churazov et al. 2001; Reig et al. 2002), down to time-scales far longer than would normally be associated with the X-ray-emitting region itself (which is assumed to be compact). The model put forward by Lyubarskii (1997) to explain this very broad range in time-scale has variations in accretion rate occurring over a large range of radii from the central black hole and propagating in towards the X-ray-emitting region (assumed to extend over only a few tens of gravitational radii).
If the viscosity parameter of the accretion flow is varying randomly and with the same amplitude at all radii, variations in mass accretion rate at much smaller radii will have an f−1 PSD. The accretion rate at the X-ray-emitting region (assumed to be concentrated at small radii) is thus being modulated by variations produced over a large range in radius. If the X-ray emission emerging from this region is proportional to the local accretion rate then the X-ray variations will also have an f−1 PSD extending down to low frequencies. Each radius produces variations at a characteristic time-scale comparable to the time-scale for inward propagation, hence long-time-scale fluctuations are produced at large radii and shorter-time-scale fluctuations are produced closer in. This works because any variations on time-scales shorter than the propagation time-scale are damped as they move through the disc (see Lyubarskii 1997; Churazov et al. 2001). This naturally explains both the f−1 part of the PSD (i.e. below the high-frequency break; Churazov et al. 2001) and the rms-flux correlation (Uttley & McHardy 2001).
Once these variations reach the inner regions, where accretion energy is released as X-ray emission, variations on shorter time-scales are further suppressed, leading to a steepening of the PSD. Thus the high-frequency break in the PSD from f−1 to a steeper slope (as seen in MCG-6-30-15) is caused by suppression of fluctuations within the X-ray-emitting region, and the time-scale of the break in the PSD corresponds to the characteristic time-scale at the outer radius of the X-ray-emitting region (Churazov et al. 2001). The emissivity as a function of radius defines a Green function for the incoming f−1 fluctuations. In frequency terms, the radial extent of the X-ray-producing region makes it act as a ‘low-pass filter’ for the incident f−1 fluctuations, resulting in a break in the PSD, to a steeper slope, at high frequencies.
The specific model described by Churazov et al. (2001) has a relatively stable, optically thick, geometrically thin accretion disc sandwiched by an optically thin, geometrically thick corona. The disc produces relatively slowly varying thermal emission (which would emerge presumably in the optical/ultraviolet for a Seyfert galaxy), while it is the variations within the corona that propagate inwards to produce the X-ray variability. The time-scale of the high-frequency break thus corresponds to the accretion time-scale (∼the viscous time-scale) of the geometrically thick corona at the outer edge of the X-ray-emitting region.
Kotov et al. (2001) extended this model by adding the simple condition that the spectrum of the X-ray-emitting region is a function of radius, with the outer regions producing a softer spectrum than the inner regions. This model can then reproduce the observed time lags between soft and hard emission. Accretion rate variations propagating inwards reach the softer emitting region (larger radii) before the harder emitting region (smaller radii), explaining why the soft emission leads the hard. This can explain the energy dependence of the time lags (as delay increases with the separation between energy bands) and the frequency dependence (since different radii correspond to different characteristic frequencies and different energy spectra). In addition, this model explains the energy dependence of the PSD above the high-frequency break. The harder emission is produced from smaller radii, which can be modulated by high-frequency fluctuations, hence the harder emission will have more high-frequency power than the softer emission (i.e. the hard-band PSD will be flatter), as observed (see Fig. 24). Also, as the variations on short time-scales are occurring within the X-ray-emitting region, i.e. are a result of the dynamic corona, the coherence between bands will fall off at high frequencies, as mentioned in Section 6.4 (see also the discussion in Nandra & Papadakis 2001).
Such phenomenological models require that variations occurring at large radii can be fed into much smaller radii and so influence the PSD. The observed iron line profile of MCG-6-30-15 from this observation (Fabian et al. 2002) indicates that the disc extends down to less than 2rg, and, from the steep inferred emissivity profile, that much of the X-ray power is generated very close to the black hole. This presents a challenge for all such models, given that the dynamical time-scale of this innermost region is 10–100 s, whereas the time-scales measured (e.g. the PSD break time-scale) are orders of magnitude longer. If the f−1 PSD of MCG-6-30-15 is similar to that in the high/soft state of Cyg X-1 (Reig et al. 2002) then the variability time-scales will extend to many orders of magnitude longer still. Indeed, the total power in f−1 spectra diverges so slowly that the PSD could, in principle, continue as f−1 until very low frequencies without the total power becoming large. (For example, model 2 from Section 4.4 could continue down to the unphysically low frequency ∼10−28 Hz without the integrated power exceeding Fvar = 1!)
This paper presents a detailed time series analysis of the continuum variability of MCG-6-30-15. The PSD shows a break from f−1 to f−5/2 at a frequency fbr∼ 10−4 Hz. Comparing the break time-scale with the analogous time-scale seen in Cyg X-1, and assuming that these scale linearly with black hole mass, gives a black hole mass of ∼106 M⊙ for MCG-6-30-15 and suggests it is accreting at a significant fraction of the Eddington limit. The slope of the PSD above the break is energy dependent, with the harder band showing more high-frequency power than the softer band. The variations in different energy bands also become incoherent above the break in the PSD. This leads us to speculate that on time-scales shorter than the break time-scale, the X-ray-emitting corona is dynamic, and that harder photons are produced at smaller radii than softer photons. At the lowest Fourier frequencies the soft and hard bands are highly coherent and the soft band is found to lead the hard band by ∼200 s. The direction and magnitude of this lag is consistent with simple Comptonization models. A phenomenological model, originally developed to explain the f−1 PSD of GBHCs such as Cyg X-1, is discussed and found to be able to reproduce (qualitatively) the observed temporal characteristics of MCG-6-30-15. Although this model is rather simple and phenomenological it can explain the observed temporal characteristics of the X-ray continuum in MCG-6-30-15.
Based on observations obtains with XMM-Newton, an ESA science mission with instruments and contributions directly funded by ESA Member States and the USA (NASA). We are grateful to Phil Uttley for many valuable discussions on power spectral issues and to Mike Nowak for discussions on coherence estimation. SV acknowledges many valuable discussions with P. Thomas and financial support from PPARC. We thank an anonymous referee for useful comments.