## Summary

We extend ambient noise surface wave tomography both in bandwidth (10–50 s period) and in geographical extent (across much of Europe) compared with previous applications. 12 months of ambient noise data from 2004 are analysed. The data are recorded at about 125 broad-band Seismic stations from the Global Seismic Network and the Orfeus Virtual European Broad-band seismic Network. Cross-correlations are computed in daily segments, stacked over 1 yr, and Rayleigh wave group dispersion curves from 8 to 50 s period are measured using a phase-matched filter, frequency time analysis technique. We estimate measurement uncertainties using the seasonal variation of the dispersion curves revealed in 3 month stacks. On average, uncertainties in group delays increase with period from ∼3 to ∼7 s from periods of 10 to 50 s, respectively. Group speed maps at periods from 10 to 50 s are estimated. The resulting path coverage is denser and displays a more uniform azimuthal distribution than from earthquake-emitted surface waves. The fit of the group speed maps to the ambient noise data is significantly improved below 30 s compared to the fit achieved with earthquake data. Average resolution is estimated to be about 100 km at 10 s period, but degrades with increasing period and toward the periphery of the study region. The resulting ambient noise group speed maps demonstrate significant agreement with known geological and tectonic features. In particular, the signatures of sedimentary basins and crustal thickness are revealed clearly in the maps. These results are evidence that surface wave tomography based on cross-correlations of long time-series of ambient noise data can be achieved over a broad period band on nearly a continental scale and yield higher resolution and more reliable group speed maps than based on traditional earthquake-based measurements.

## Introduction

Traditional inference of seismic wave speeds in Earth's interior is based on observations of waves emitted by earthquakes or human-made explosions. In the past decades, with the deployment of a large number of high-quality broad-band seismic stations across the globe, there have been numerous tomographic studies performed on both global and regional scales using the traveltimes of body waves, dispersion curves of surface waves, and waveform fitting methods. Much information has been obtained about the structure of Earth's interior using these methods.

Surface wave tomography has proven particularly useful in imaging Earth's crust and uppermost mantle on both regional and global scales. Because they propagate in a region directly beneath Earth's surface, surface waves typically generate better path coverage of the upper regions of Earth than body waves with the same distribution of seismic stations. Surface waves at different periods are sensitive to Earth structure at different depths, with the longer period waves exhibiting sensitivity to greater depths. By measuring the depressive character of surface waves, the structure of the crust and upper mantle can be relatively well constrained. In Europe, many traditional surface wave studies have been performed in the past, both at regional (e.g. Papazachos 1969; Nolet 1977; Mueller & Sprecher 1978; Maupin & Cara 1992; Stange & Friederich 1993; Vaccari & Panza 1993; Pedersen 1994; Lomax & Snieder 1995; Pontevivo & Panza 2002; Levshin 2006) and continental scales (e.g. Panza 1980; Patton 1980; Snieder 1988; Marquering & Snieder 1996; Ritzwoller & Levshin 1998; Yanovskaya 2000; Pasyanos & Walter 2002; Pasyanos 2005).

There are, however, basic limitations to earthquake-based surface wave tomography independent of the number of broad-band stations available. First, due to the uneven distribution of earthquakes around the world, seismic surface waves only sample certain preferential azimuths. In addition, in aseismic regions surface wave dispersion can be measured only from distant earthquakes. Second, it is difficult to obtain high-quality short-period (<20 s) dispersion measurements from teleseismic events due to intrinsic attenuation and scattering along ray paths. It is, however, the short-period waves that are most useful to constrain the structure of the crust and uppermost mantle. Third, inversions of seismic surface waves require some information about sources, such as earthquake hypocentral locations and moment tensors in some cases, which have a substantial intrinsic inaccuracy, particularly for small events.

Some of the problems that beset traditional earthquake surface wave tomography can be alleviated by observations made on diffuse wavefields (e.g. ambient noise, scattered coda waves). Theoretical research has shown that, under the right circumstances, the cross-correlation of records from two seismic stations provides an estimate of the Green's function between the stations, modulated by the spectrum of the noise source (Weaver & Lobkis 2001a, b, 2004; Derode 2003; Snieder 2004; Wapenaar 2004; Larose 2005). Seismic observations have confirmed the theory using both coda waves (Campillo & Paul 2003; Paul 2005) and ambient noise for surface waves (Shapiro & Campillo 2004; Sabra 2005a) and crustal body waves (Roux 2005). Oceanic applications also appear to be feasible (Lin 2006). The method has been successfully applied to stations in Southern California to obtain high-resolution surface wave tomography maps at short periods ranging from 7.5 to 15 s (Sabra 2005b; Shapiro 2005). The group velocity maps from these studies show a striking correlation with geological units in California with low-speed anomalies corresponding to the principal sedimentary basins and high-speed anomalies corresponding to the igneous cores of the major mountain ranges. More recent applications have arisen across all of California and the Pacific Northwest while tracking the growth of the Transportable Array component of EarthScope (Moschetti 2005), in South Korea at very short periods (Cho 2006), in Tibet (Yao 2006), and elsewhere in the world.

This research has shown, therefore, that information about Earth's structure between a pair of seismic stations can be extracted by cross-correlating long time sequences of seismic noise. With sufficient station density, cross-correlations of ambient seismic noise promise to yield more homogenous sampling over study regions than earthquakes, to provide information to shorter periods so as to improve constraints on crustal structure, and partially to release seismologists from the dependence on earthquakes in surface wave tomography.

The purpose of this paper is to determine whether ambient noise Rayleigh wave tomography can be applied reliably on a nearly continental scale and extended to intermediate periods up to ∼50 s. The study is based on data across Europe from 125 broad-band seismic stations from the Global Seismic Network (GSN) and the Virtual European Broad-band Seismic Network (VEBSN). We compute broad-band cross-correlations to produce estimated Green's functions for all station pairs using 1 yr time-series. The assessment of the reliability of the measurements is based on four primary criteria which are as follows.

- (i)
Signal-to-noise ratio (SNR) of the cross-correlations.

- (ii)
Seasonal repeatability of the dispersion measurements, which is the basis for the error analysis.

- (iii)
Misfit of the measurements determined from tomography.

- (iv)
Consistency of the estimated group speed maps with known structural features such as sedimentary basins and crustal roots beneath mountain ranges.

Group velocity curves across Europe are measured from 10 to 50 s period and we present group speed maps from 10 to 40 s period. The results are compared, whenever possible, to earthquake-based measurements and maps produced earlier by the University of Colorado at Boulder (CU-B) in order to quantify the relative merits of ambient noise and earthquake tomography.

## Data Processing and Group Velocity Measurements

Europe is an excellent region to test ambient noise surface wave tomography. Broad-band seismic station coverage is dense across much of the continent, particularly in its centre, and the substantial *a priori* knowledge of geological structures allows us to evaluate the reliability of the resulting group velocity maps. We have collected continuous vertical-component seismic data from 125 stations including data from the GSN and VEBSN (Fig. 1) over the 12 months of 2004.

The data processing procedure that is applied here is very similar to that discussed at greater length by Bensen (2006). Using vertical-component seismic data implies that the resulting cross-correlations contain only Rayleigh wave signals. Data are processed one day at a time for each station after being decimated to 1 sample per second and are bandpass filtered in the period band from 5 to 150 s after the daily trend, the mean and the instrument response are removed. Data are then normalized in time and whitened over the frequency-band of interest prior to cross-correlation. Spectral whitening is a straightforward procedure, but the temporal normalization requires more discussion.

Temporal normalization is a general phrase that describes a variety of methods used to remove earthquake signals and instrumental irregularities from the time-series prior to cross-correlation. At periods where earthquake signals are stronger than ambient noise, roughly above about 20 s period, some method of removing earthquake signals is imperative. For this purpose, Shapiro (2005) used one-bit normalization, which generates a data stream composed only of the values 1 and −1, retaining only the sign and disregarding the amplitude of the signal completely (Larose 2004). Other researchers have used different methods; for example, Sabra (2005b) used a truncation threshold for each station to clip seismograms. Based on observations of the SNR of the cross-correlations, we have come to prefer applying a weighted running average on the envelope of each seismogram in the time domain, which effectively removes earthquake signals while keeping much of the information on the amplitude of the ambient noise. The process applied here computes the average of the envelope of the seismogram over a normalization time interval, and weights the seismogram by the inverse of this average at the centre of the window. The width of the normalization window determines how much amplitude information is retained. A 1 s window is equivalent to one-bit normalization; while an infinitely long window is equivalent to the original signal renormalized. After testing various time windows, we find that a time window with width equal to half the maximum period of the bandpass filter works well. The maximum period here is 150 s, so the normalization window is about 75 s in duration. Normalization windows from 50 to 100 s in duration produce nearly indistinguishable results. This procedure of temporal normalization is found to generate a somewhat higher SNR than 1-bit normalization, on average. In addition, the procedure is more flexible, allowing, for example, the computation of the temporal weights in a subband (e.g. 15–50 s designed to remove earthquakes) but with application to the broad-band signal.

After the time-series has been processed for each day, we then compute daily cross-correlations in the period band from 5 to 150 s between all station pairs and then stack the results into a set of 3 month and 1 yr time-series. The stacking simply involves adding the daily cross-correlations without weights if the time-series covers more than 80 per cent of the day, otherwise the day of data is discarded. For the year of data, twelve 3 month stacks are produced starting at each month; namely, months 1, 2, 3; months 2, 3, 4; …; months 11, 12, 1; and months 12, 1, 2. The 3 month stacks are used to investigate the seasonal variability of the measurements, which is the basis for the error analysis and is part of the data selection procedure discussed further below.

The resulting cross-correlations contain surface wave signals coming from opposite directions along the path linking the stations. These two signals appear at positive and negative correlation lag and are sometimes called the ‘causal’ and ‘acausal signals’. Although the two signals sample the same structure between a station pair, the noise (source) characteristics in the two directions may be very different, so the spectral content of the cross-correlations may differ appreciably. Asymmetric cross-correlations are, in fact, common. To simplify data analysis and enhance the SNR, we separate each cross-correlation into positive and negative lag components and then average these two components to form a final cross-correlation, which we call the ‘symmetric component’. The following analysis is done on the symmetric components exclusively.

An example of a broad-band (5 to 150 s) symmetric-component cross-correlation for the station pair IBBN and TIRR is shown in Fig. 2 together with the cross-correlation filtered into five frequency subbands. Rayleigh waves emerge clearly in each frequency band with the earlier arriving waves being at longer periods. In Fig. 3, we plot an example of a cross-correlation record-section with the central station being TUE. This figure shows that a variety of azimuths produce visible signals, but not all azimuths are covered for every station. Good azimuthal path coverage is important for resolution and reduce smearing effects in surface wave tomography.

To begin to evaluate the quality of the cross-correlations quantitatively, we calculate the SNR for each cross-correlation. SNR is defined as the ratio of the peak amplitude within a time window containing the signals to the root-mean-square of noise trailing the signal arrival window. The signal window is determined using the arrival times of Rayleigh waves at the minimum and maximum periods of the chosen passband. The group velocities used to predict arrival times here are calculated from the global 3-D shear velocity model of Shapiro & Ritzwoller (2002). Because the SNR can vary strongly with frequency, we filter the broad-band cross-correlations in three narrow passbands at 8–25, 20–50 and 33–70 s and compute the SNR in each band. We reject cross-correlations with SNR less than 7 in each band. This value is chosen as a compromise between optimizing measurement quality and quantity. We identify the cross-correlations with SNR > 7 as the estimated Green's functions. The data selection criteria are significantly more complicated than this, and are described in Section 3. Group velocity curves are measured for the estimated Green's functions that emerge from both the 3 month and 1 yr stacks in each of the three period bands using automatic Frequency Time Analysis (FTAN) as described in the next paragraph.

As in earthquake dispersion analysis (e.g. Ritzwoller & Levshin 1998), our automated dispersion measurement is based on FTAN in a two-step process. Bensen (2006) discuss this procedure in greater detail, but here we provide a brief overview. In the first step, traditional FTAN creates a 2-D diagram of signal power as a function of time and the central frequency of the applied filters (Fig. 4, middle). The automatic procedure tracks the local power maximum along the frequency axis. The group arrival times of the maximum amplitude as a function of filter frequency are used to calculate the tentative (raw) group velocity curve. Measures are taken to ensure the continuity of this curve by rejecting jumps in group arrival times. Formal criteria are set to reject curves with distinctly irregular behaviour or to interpolate through small spectral holes by selecting realistic local instead of absolute maxima. The second part of the method is the application of a phase-matched or antidispersion filter (Fig. 4, bottom). In the non-automated method that has been applied to large numbers of earthquake signals (e.g. Ritzwoller & Levshin 1998), an analyst defines both the phase-matched filter and the frequency band of the measurement. In the automated method that we use here, the frequency band of measurement is pre-set and the phase-matched filter is defined by the dispersion curve that results from the first step of the process. Phase-matched filtering collapses the signal into a ‘delta-like’ arrival, ideally. In the traditional analysis, the analyst defines a window to extract this signal, which is then redispersed and a cleaned FTAN image is returned (Fig. 4, bottom). The automated procedure is similar, but the windowing and extraction of the signal is done automatically. In both the traditional and automated analyses, the actual frequency of a given filter is found from the phase derivative of the output at the group time of the selected amplitude maximum (Levshin & Ritzwoller 2001). Fig. 4 shows an example of the FTAN procedure for the cross-correlation between 12 month time-series from station IBBN and station OBN. Bensen (2006) also discuss the method to measure phase velocities, but only group velocities are used in the present paper.

## Data Selection and Uncertainty Estimation

The automated measurement procedure must be followed by the application of criteria to select the data. We apply three general types of criteria: (1) SNR; (2) repeatability of the measurements (particularly seasonal variability) and (3) coherence across the set of measurements. The formal uncertainty analysis is based on seasonal variability.

First, we reject a cross-correlation if its SNR < 7. (The definition of SNR is presented in Section 2.) Fig. 5(a) shows an example histogram of the distribution of SNR for signals bandpass filtered between 20 and 50 s. Distributions are similar in the other two passbands, between 8 and 25 s and between 33 and 70 s. The mode of the distribution is less than a SNR of 5, and the SNR cut-off of 7 eliminates about half the measurements, as Fig. 5(b) shows. The jump between the two levels seen in Fig. 5(b) is caused by lower SNR in the passband from 25 to 50 s than in the band between 5 and 25 s period. Although we have found that it is often possible to obtain high quality dispersion measurements on waveforms with SNR less than 7, such measurements are frequently erroneous and in an automated procedure it is best to reject them.

Second, for a measurement to be accepted, it needs to be repeatable in the sense that measurements obtained at different times should be similar. This is not a trivial constraint, because both the spectrum and the directional-dependence of ambient noise vary strongly with season. The repeatability of measurements obtained on different time intervals, in particular in different seasons, is in addition to a data selection criterion, the basis for our formal assessment of measurement uncertainties. For surface waves that are emitted from earthquakes, it is difficult to estimate uncertainties in the group velocity measurements. Only when several earthquakes occur near to the same location or when waves follow nearly the same path connecting two stations can uncertainties be estimated. These criteria are not achieved for the vast majority of earthquake measurements, so only average error statistics can be estimated for earthquakes. In contrast, dispersion measurements from cross-correlations of ambient noise are naturally repetitive.

The repeatability criterion is based on quantifying seasonal variability. To do so, we select 12 overlapping 3 month time-series for each station pair. The 3 month time windows capture the seasonal variation of dispersion measurement, but are also long enough to obtain reliable dispersion measurements in many (but not all) cases. Fig. 6 shows an example of the seasonal variability in the observed dispersion curves. The variability typically increases with period, partially due to the fact that the amplitude of ambient noise decreases above 20 s period but also because ambient noise above the microseism band appears to be less azimuthally homogenous. Thus, the longer period signals are more likely to be contaminated by precursory noise that results from incomplete destructive interference of the signals that are off the azimuth between the stations. The precursory signals will give a velocity with a bias towards high values.

The dispersion curves that are used for tomography here are taken from the 12 month stacks. The standard deviation is computed for a station pair if more than four 3 month stacks have a SNR > 7. The measurement is retained if the standard deviation is less than 100 m s^{−1}. It is rejected if the standard deviation either cannot be computed due to the fact that too few measurements can be obtained on the 3 month stacks or if the standard deviation is too large. The effect of this step in eliminating measurements is shown in Fig. 5(b). Equating the seasonal variability with the measurement uncertainty is probably a conservative error estimate. Repeated biased measurements, however, could bias the uncertainty low. Such bias is unlikely due to the fact that the sources of ambient noise change significantly with season in azimuth, amplitude, and spectral content.

Figs 5(d) and (e) show the average measurement uncertainties taken over the entire European data set. Average uncertainties of group velocities and group arrival times increase with period from ∼0.02 km s^{−1} or ∼3 s at 10 s period to 0.09 km s^{−1} or 7.5 at 50 s period. Compared to uncertainties of group velocities of earthquake data using cluster analysis by Ritzwoller & Levshin (1998), we find that our group velocity uncertainties tend to be smaller at periods shorter than 20 s and larger at periods longer than 30 s. The reason for larger uncertainties at periods longer than 30 s is due to the shorter average path length in our data set, which is about 1200 km (Fig. 5c) compared to more than 4000 km in Ritzwoller and Levshin. The uncertainties in group arrival times are much smaller in this study, however. The traveltime uncertainties average less than 20 per cent of a period. This is still a substantial error, attributable largely to the fact that the average SNR of the retained measurements remains fairly low. The SNR of earthquake data tends to be larger, but earthquake measurements are affected by uncertainties in source characteristics that provide a lower bound for measurement errors.

Third, we require that the measurements cohere with one another across the data set. Incoherent measurements that disagree with other measurements in the data set are identified during tomography. This is discussed further in Section 5.

Finally, we find that to obtain a reliable measurement, the stations must be separated by at least 3 wavelengths. Each dispersion curve is retained only up to a period equal to one-third of the observed traveltime. For a phase velocity of 4 km s^{−1}, for example, this implies that if a station pair is separated by a distance Δ in km, a dispersion curve is accepted only up to a maximum cut-off period of above Δ/12 in seconds. In practice, this criterion has only limited practical effect because short-path measurements tend to be eliminated by the previous criteria.

The number of measurements that remain after all criteria have been applied is shown in Fig. 5(b). Less than a third of the original measurements are retained for tomography. The least satisfying part of the procedure is eliminating what appears to be a good measurement on the 12 month stack because the uncertainty estimate could not be determined, usually because of an insufficient number of high SNR 3 month stacks. To retain a higher percentage of measurements, it would probably be necessary to process two or more years of data. This would increase the SNRs of the seasonal stacks and be preferable over a single year of data.

## Method of Surface Wave Tomography

The dispersion measurements of Rayleigh waves from 1 yr cross-correlations are used to invert for group velocity maps on a 1°× 1° grid across Europe using the ray theoretic tomographic method of Barmin (2001). This method is based on minimizing a penalty function composed of a linear combination of data misfit, model smoothness and the perturbation **m** to a reference model **m**_{0} for isotropic wave speed:

*G*is the forward operator that computes traveltimes from a map;

*d*is the data vector whose components are the observed traveltime residuals relative to the reference map;

*C*is the data covariance matrix or matrix of data weights whose diagonal elements are determined from standard deviations of the dispersion measurements based on seasonal variability and off-diagonal elements are set to zero;

*F*is a Gaussian spatial smoothing operator; and

*H*is an operator that penalizes the norm of the model in regions with poor path coverage. The result of

*H*is to blend smoothly the estimated map

*m*into the reference map

**m**

_{0}where data coverage is low. If an input reference map in not given, then the average of the measurements specify

**m**

_{0}. The spatial smoothing operator

*F(m)*is defined over a 2-D tomographic map as follows: where

*S*is a smoothing kernel defined as below: σ is the spatial smoothing width or correlation length and the vector

*r*is a position vector on Earth's surface. A more detailed discussion of this method is given by Barmin (2001). The choice of the damping parameters α and β and the smoothing width σ is subjective. We perform a series of tests using different combinations of these parameters to determine acceptable values by considering data misfit, model resolution and model norm.

In this study, we use ray theory to compute the traveltimes of surface waves. In recent years, surface wave studies have increasing moved toward diffraction tomography using spatially extended finite-frequency sensitivity kernels based on the Born/Rytov approximation (e.g. Clevede 2000; Spetzler 2002; Ritzwoller 2002; Yoshizawa & Kennett 2002; Zhou 2004, many others). Ritzwoller (2002) showed that diffraction tomography recovers similar structure to ray theory at periods shorter than 50 s in most continental regions. In the context of regional tomography with dense path coverage, Sieminski (2004) showed that nearly identical resolution can be achieved using ray theory as that using finite-frequency theory. In this study, we concentrate on Europe where station coverage and the resulting ray paths are both dense, so that ray theory suffices for surface wave tomography. We also assume great-circle propagation between any station pair. In an inhomogeneous medium, the ray path between two stations will deviate from the great-circle path; a phenomenon called off-great-circle propagation. Ritzwoller & Levshin (1998) have shown that the off-great-circle propagation can be ignored at periods above 30 s for ray paths with distance less than 5000 km in global surface wave tomography. In this study, the average distance of ray paths is ∼1200 km, which is much smaller than that in the global study. The shorter ray paths imply that the effects of off-great-circle propagation is much less, but for less than about 15 s it may be still important due to the more inhomogeneous group velocity structures at short periods. We ignore the effect here, but it deserves further investigation in the future.

Resolution is estimated using the method described by Barmin (2001) with modifications presented by Levshin (2005). Each row of the resolution matrix is a resolution surface (or kernel), which is a map defining the resolution at one spatial node. Information in the resolution surface is summarized by a single scalar quantity at each node, called the spatial resolution. Because the shape of the resolution surface for most nodes resembles a 2-D spatial Gaussian function, we fit a 2-D symmetric spatial Gaussian function to the surface at each node:

The spatial resolution at each node is summarized by the standard deviation γ of the best-fitting Gaussian function to the resolution map.

## Group Speed Maps

Group speed tomography is performed in two steps. The first, preliminary, step generates overly smoothed maps at each period in order to identify and reject remaining bad measurements. The over-smoothed model fits most data well as the example misfit histogram for the 16 s data set shows in Fig. 7(a). We discard group velocity measurements with traveltime residuals larger than 15 s, which is about the rms value of the traveltime residuals at most periods. The number of paths remaining after data reduction is presented as the bottom curve in Fig. 5(b). Although there are about 7800 interstation paths, after data rejection is completed, only about 20 per cent measurements remain.

The second step of tomography is the construction of the final maps. Maps are constructed on a 1°× 1° grid across Europe and are defined relative to the reference maps computed from the 3-D model of Shapiro & Ritzwoller (2002). In an important sense, the reference maps already have a priori information imposed on crustal structure. The 3-D model itself was constructed as a perturbation to a starting model that included information about sediments and crustal thicknesses. For this reason, we choose to seek only smooth perturbations to the reference maps.

Examples of path density and resolution are shown in Fig. 8 for the 16 s measurements. Results are similar from 10 to 50 s period, although there is a reduction in path density and resolution with increasing period due to the decrease in the number of measurements at long periods, which is caused by the reduction in SNR above 20 s period. The resolution of surface wave tomography depends primarily on path density and the azimuthal distribution of paths. The azimuthal distribution of paths is good in the centre of the study region as demonstrated by Fig. 3, but deteriorates near the boundaries. Path density is highest in the centre of Europe and also gradually degrades toward the edge of the study region. As Fig. 8 indicates, average resolution is estimated to be about 100 km in the centre of the study region in Europe, and degrades toward the periphery of the map where station coverage is minimal.

The results of group velocity tomography at 10, 16, 20, 30, 35 and 40 s periods are shown in Figs 9, 10(a) and 11(a). In the inversion, reference group velocity maps predicted from the global 3-D shear velocity model of Shapiro & Ritzwoller (2002) are used as starting models. Examples of the reference maps are shown in Fig. 12 at 16 and 35 s period. Figs 10(b) and 11(b) illustrate that the perturbations introduced during tomography are smooth and occur only in regions where path coverage is high, predominantly in the centre of the region of study. In most of the Mediterranean Sea, the Atlantic Ocean, North Africa, the Iberian Peninsula, and far eastern Europe, the perturbations are small and the estimated maps are very similar to the reference maps. The features of the maps are observed to vary gradually with period due to the overlapping depth ranges of Rayleigh wave depth-sensitivity. Many of the observed anomalies are correlated with known geological units, which is discussed further in Section 6.

Fig. 13(a) shows the improvement in fit to the measured dispersion curves achieved by the resulting group velocity maps, expressed as the variance reduction relative to the predicted group velocity maps. In addition, variance reduction relative to the average across each map is shown. Variance reductions relative to predicted group velocity (solid line) are larger at short periods (<25 s) and smaller at long periods (>35 s) than relative to the observed average (dashed line). Variance reductions are highest at short periods and gradually decrease with period. The observed trend of variance reductions results from the group speed anomalies being largest at short periods and also because the 3-D model is more reliable for the longer periods. In particular, Fig. 10(b) illustrates that below about 20 s period the average of the reference model is too high. Only periods above about 20 s went into the construction of the 3-D model of Shapiro & Ritzwoller (2002). The rms group velocity and traveltime misfits after tomography are also shown in Figs 13(b) and (c). Traveltime misfit of the final data set is ∼5–6 s and is nearly independent of period. Contrast this with the mean uncertainty that trends from ∼2 to 7 s shown in Fig. 5(e). Most of the misfit at long periods, therefore, is probably due to measurement error. At short periods, however, about half of the misfit has another cause, perhaps being due to unspecified structures such as azimuthal anisotropy or smaller scale structures.

## Discussion

The principal purpose of this paper is to assess the reliability of ambient noise tomography applied to large regions, such as Europe, across a broad frequency band that extends from short to intermediate periods. We base our assessment following lines of evidence:

- (i)
The repeatability of the measurements.

- (ii)
The coherence of the measurements with one another.

- (iii)
The agreement of the resulting group speed maps with known geological structures.

The first two of these criteria have already been invoked as part of the data selection procedure.

First, the repeatability of the measurements, particularly as they change with the variable ambient noise conditions in different seasons, is discussed above and is the basis for the estimates of measurement uncertainties. As discussed in Section 5, a measurement is retained only if it is shown to be robust over multiple seasons. Not all measurements satisfy this criterion, but many do, as Fig. 5(b) illustrates. This is the foundation for much of our confidence in the ambient noise dispersion measurements.

Second, the ‘coherence’ of the measurements relates to the mutual agreement of measurements that cross the same region. This can be determined from the ability to fit the data with smooth tomographic maps, particularly as that fit compares to that achieved with earthquake data. Fig. 7 plots example histograms of tomographic misfit from over-smooth maps using ambient noise and earthquake data in Europe measured at CU-B. The ambient noise data are fit better, partially due to the fact that earthquake data possess sensitivity to uncertainties in source characteristics. Fig. 14 summarizes the standard deviation of tomographic misfit for both earthquake and ambient noise data across Europe using the over-smooth maps. These results are taken prior to the last stage in data rejection; the rejection due to tomographic misfit. This is in contrast to Fig. 13(c), which is misfit after data rejection is completed. Fig. 13(c), therefore, has a lower misfit level. As periods reduce below 30 s in Fig. 14, relative misfit of ambient noise improves, particularly below about 15 s period. We take this as evidence that short-period dispersion measurements (<20 s) obtained from ambient noise typically are preferable to earthquake derived measurements.

The final criterion to assess the credibility of the estimated maps is agreement with known geological structures. Although the best test may be to interpret a 3-D model constructed from the group speed maps, to address the question more simply one can exploit how surface waves at different periods are sensitive to Earth structure over different depth ranges. The depth of maximum sensitivity is about one-third of a wavelength. At the short-period end of this study (10–20 s), group velocities are dominantly sensitive to shear velocities in the upper crust. Because the seismic velocities of sediments are very low, short-period low-velocity anomalies are a good indicator of sedimentary basins. At the intermediate periods of this study (25–35 s), Rayleigh waves are primarily sensitive to crustal thickness and the shear velocities in the lower crust and uppermost mantle. Due to the large velocity contrasts across the Moho, the group velocities in this band should vary approximately inversely with crustal thickness, with high velocities in regions with a thin crust and low velocities regions with a thick crust.

To aid assessment, Fig. 15 presents a map of sediment thickness from CRUST1.0, which Laske and Masters digitized across most of Europe from the EXXON Tectonic Map of the World (Laske and Masters, 1997), and a map of crustal thickness taken from CRUST2.0 (Bassin 2000). We identify the names of several geological units on these maps, mainly sedimentary basins and mountain ranges.

Comparison of the estimated group speed maps with known geological structures is the thorniest of the assessment tests, both because understanding of known structures is imperfect but more pointedly because a priori information has already been imparted to the reference maps (derived from the 3-D model of Shapiro & Ritzwoller 2002) that form the basis for tomography. In fact, the reference model does not strongly affect the estimated group speed maps in areas of high data coverage. To demonstrate this, Figs 10(c) and 11(c) present maps constructed with no input reference model. Comparison with Figs 10(a) and 11(a) establishes the point—in regions of high data coverage an estimated map can be compared with *a priori* knowledge without worry of contamination by information contained in the reference map. Figs 10(d) and 11(d) also show explicitly that the reference model imposes little constraint on the estimated map in areas of high data coverage.

The 10 and 16 s group speed maps in Figs 9 and 10 exhibit low velocity anomalies associated with most of the known sedimentary basins across Europe. In regions of high data coverage, low-velocity anomalies are observed in the North Sea Basin, the Silesian Basin (North Germany, Poland), the Pannonian Basin (Hungary, Slovakia), the Po Basin (North Italy), the Rhone Basin (Southern France) and the Adriatic Sea. Not surprisingly, many of these features already exist in the reference map. Because there is little resolution in the Mediterranean and Black Seas, as Fig. 8 shows, the basins there are imposed by the reference model. The new features recovered from ambient noise tomography below 20 s period are (1) a general reduction of group speeds in the centre of the region of study, (2) generation of much more pronounced low velocity anomalies associated with the Po and Pannonian Basins and (3) increase in group speeds directly north of the Hellenic Arc. In particular, the low velocity anomaly observed in the Pannonian Basin is stronger than expected from the Exxon sediment maps, which estimates the sediments to be about 2 km thick. A recent study by Grad (2006) includes a 2-D tomographic model, constructed from a refraction profile running from the East European Craton to the Pannonian Basin. In their model, the sediments of the Pannonian Basin extend to a depth of ∼5 km, which is more consistent with our observed 5–10 per cent lower group velocities in this region.

At intermediate periods (25 to 40 s), group velocities become increasingly sensitive to crustal thicknesses. The estimated maps exhibit low-velocity anomalies associated with the Alps, the Carpathians and the mountains in the Balkan region. The low velocity anomalies are probably caused by deeper crustal roots beneath mountain regions which occur due to isostatic compensation. The general reduction in Rayleigh wave velocity in the eastern part of the 30–40 s maps in Figs 9 and 11 is probably related to the general thickening of the crust toward the east European craton, where the crust is about 45 km thick (Grad 2006).

More detailed interpretation of the observed anomalies is beyond the scope of this work, and awaits complementary research, which will include inversion for the *Vs* structure of the crust and upper mantle in Europe. The group velocity maps of our study, particularly at periods below ∼20 s, provide unique constraints on crustal thickness and shear wave velocities in the crust and uppermost mantle.

## Conclusions

In this study, we use ambient noise data recorded at 125 broad-band seismic stations available from the Global Seismic Network (GSN) and the Orfeus Virtual European Broad-band Seismic Network (VEBSN). Cross-correlations are computed in a broad period band from 5 to 150 s in daily segments with a duration of 1 yr. Rayleigh wave group velocities for station pairs are measured from 1 yr stacks of daily cross-correlations using a phase-match filtering frequency time analysis procedure. Uncertainties of the group velocities are estimated based on seasonal variations of the dispersion curves using 3 month time-series. An automated data selection procedure is applied to all dispersion measurements using SNR and seasonal repeatability as selection criteria. About 20 per cent of all station–station cross-correlations are retained for surface wave tomography after data rejection. All measurements have uncertainty estimates that derive from observations of seasonal variability. The data set would benefit from time-series of 2 or more years, which would allow more uncertainties to be measured and, therefore, more measurements to pass the selection criterion.

Group velocity maps at periods from 10 to 50 s are obtained using ambient noise tomography. These maps provide a significant improvement in the understanding of surface wave dispersion in Europe, particularly at periods below about 20 s. This study has a denser and more uniform data coverage and demonstrates higher resolution than pervious studies which have relied on traditional earthquake-based surface wave tomography. Estimated resolution is ∼100 km at 10 s period, but degrades slowly with period above 25 s. Tomographic misfit of group speeds measured from ambient noise shows a significant improvement over earthquake-based measurements below 30 s period, particularly below 15 s. The group velocity maps agree quantitatively with known geological features, such as sedimentary basins and the lateral variation of crustal thickness. Observations at short periods (10–20 s) provide entirely new constraints on sediment thickness, crustal thickness and the shear velocity structure of crust.

In summary, this study demonstrates that surface wave tomography based on cross-correlations of long time-series of ambient noise data can be achieved over a broad period band on a nearly continental scale and yield higher resolution and more reliable group velocity maps than based on traditional earthquake-based measurements.

### Acknowledgments

All of the data used in this research were downloaded from the continuous ftp database of the Orfeus (Observatories and Research Facilities for European Seismology) Data Center and from the IRIS Data Management Center. In particular, the authors are deeply grateful to the data contributors to the Virtual European Broad-band Seismic Network (VEBSN), a partnership of more than 30 local, regional and global arrays and networks. The list of contributors is located at . Comments from two anonymous reviewers are appreciated. This research was supported by a contract from the US Department of Energy, DE-FC52-2005NA26607.

## References

*Alps, Apennines, Hellenides*

*Q*of surface wave