- Split View
-
Views
-
CiteCitation
Nozomu Takeuchi, Robert J. Geller; Accurate numerical methods for solving the elastic equation of motion for arbitrary source locations, Geophysical Journal International, Volume 154, Issue 3, 1 September 2003, Pages 852–866, https://doi.org/10.1046/j.1365-246X.2003.02009.x
Download citation file:
© 2018 Oxford University Press
Close -
Share
Summary
In our previous studies we derived a general criterion for optimally accurate numerical operators for computing synthetic seismograms. In this paper we extend our results by deriving source representations that are ‘tuned’ to match the errors of the numerical operators, so that optimally accurate synthetic seismograms are obtained. The same accuracy can be obtained whether the sources are at nodes of the numerical grid or between nodes. The accuracy of the synthetic seismograms is confirmed by numerical experiments. These results should facilitate inversion of seismic waveform data for 3-D Earth structure and earthquake source parameters.
1 Introduction
Accurate and efficient methods for computing synthetic seismograms in laterally and vertically heterogeneous media are required to perform inversion of seismic waveform data for 3-D Earth structure (e.g. Geller & Hara 1993). Geller & Takeuchi (1995, 1998, hereafter referred to as GT95 and GT98, respectively) and Takeuchi & Geller (2000, hereafter referred to as TG00) derived optimally accurate numerical operators for computing synthetic seismograms that are well suited to waveform inversion studies. Schemes using these numerical operators require far less CPU time to compute synthetics of any given accuracy than conventional numerical schemes (e.g. an improvement by a factor of about 50 is obtained using optimally accurate finite-difference operators for a 2-D heterogeneous medium; see TG00 for details).
In deriving the above numerical schemes, GT95, GT98 and TG00 considered only the discretization errors of the numerical operators. In this paper we extend their treatment by also considering the discretization error due to the force term. We first derive such source representations for the Direct Solution Method of Geller & Ohminato (1994, hereafter referred to as GO94); however, we also demonstrate that our approach can be applied to the staggered grid and pseudo-spectral methods. The numerical examples presented in this paper confirm that when the theoretical results derived here are implemented using optimally accurate numerical operators, comparable accuracy is achieved for sources in the interval between nodes as for sources at nodes, while the accuracy of synthetic seismograms obtained by conventional source representations seriously degrades for sources between nodes.
2 General Condition for Optimal Accuracy
2.1 Theoretical Background
When non-analytic trial functions such as linear splines are used in eq. (2), the numerical operators in eq. (3)will not in general satisfy the general criterion for optimally accurate numerical operators derived by GT95 (). However, as shown by GT95, GT98 and TG00, non-optimally accurate operators can be replaced by the corresponding optimally accurate operators in a straightforward fashion.
The choice between optimally accurate operators of different types depends on the nature of the problem (Mizutani et al. 2000). For example, for homogeneous media pseudo-spectral spatial operators and fourth-order Runge–Kutta temporal operators outperform optimally accurate second-order (in both space and time) finite-difference operators, but the latter are more cost effective for a heterogeneous medium with sharp boundaries between layers. As the Earth is a heterogeneous medium with sharp internal boundaries, optimally accurate local schemes (finite-difference or finite-element) appear best suited to actual seismological applications. The discussion in this paper is therefore limited to such numerical schemes.
2.2 Extension Of Gt95 Theory To The Source Term
GT95 derived a general criterion for optimally accurate numerical schemes (their eq. 2.20). However, their theoretical analysis did not consider the source term. Here we extend GT95's theory by deriving a general criterion that the source term should satisfy to obtain an optimally accurate solution.
We now consider the intuitive meaning of the above derivation. If the numerical operators are not optimally accurate (i.e. do not satisfy eq. 19), then the second term on the right-hand side of the first line of eq. (20)will dominate the error of the numerical solution, and the error due to the force term will be comparatively negligible. However, if we are using optimally accurate numerical operators, then the contribution of the first term on the right-hand side of the first line of eq. (20)is no longer negligible. Eq. (22)shows that in this case the error of the synthetic seismograms can be minimized if the error of the force term is ‘tuned’ to match the error of the numerical operators.
3 Source Representation
We now derive source representations that satisfy eq. (22). We first derive the source representation for a point force at an arbitrary location when we use analytic trial functions. We then apply this representation to the case of numerical trial functions and derive a source representation satisfying eq. (22). Note that the source representations derived in this section are applicable to sources at nodes as well as to sources between nodes. The following discussion is for sources between nodes, but the same formulation can be applied to sources at a node because their discretized representation is the same as that for a source between nodes, which is infinitesimally close to the node. (See the discussion following eq. 48, below.)
3.1 Source Representation For Analytic Trial Functions
As the trial functions used in weak-form solutions are continuous, it is not possible to account for the displacement or traction discontinuity for a seismic source between gridpoints solely in terms of a trial function expansion. Such discontinuities must therefore be accounted for by representing the displacement in the vicinity of the source as the sum of a particular solution and a trial function expansion.
Source region (VS) and remainder of medium (VR) for (a) plane-layered medium and (b) general 2-D medium. Note that VRextends in all directions to the outer boundary of the medium. The diagram for the 3-D case is not shown, but it is a straightforward extension of Fig. 1(b).
Source region (VS) and remainder of medium (VR) for (a) plane-layered medium and (b) general 2-D medium. Note that VRextends in all directions to the outer boundary of the medium. The diagram for the 3-D case is not shown, but it is a straightforward extension of Fig. 1(b).
3.2 General Source Representation
The starting point of the above transformation is the last line of eq. (42). Even if there is a discontinuity in elastic properties across SS, CijklU(p)k,lnj will be continuous. Therefore, rather than evaluating the surface integral on the ‘inner surface’ of SS, with nl pointing outward, we evaluate the surface integral on the outer surface, which we designate as S+S, with the normal vector pointing inward. As there are no sources outside VS, U(p)imust be a solution of the homogeneous equation of motion outside VS, and therefore can be represented as a superposition of trial functions. (If U(p)iis not defined outside Vswe can obtain its value outside Vsby analytic continuation, e.g. Carrier et al. 1966, pp. 63–66). We then use eq. (31)to transform the surface integral in the second line of eq. (43)into the volume integrals in the third line. Note that the volume of integration is VR; i.e. the source region VSis excluded. Finally, in the last line of eq. (43)we express the final result in terms of matrix elements. The matrix elements T″ and H″ are defined following eqs (4)and (5). However, the elements of the matrices T″ and H″ are set to zero except for elements in rows corresponding to trial functions for which one of the components has unit value on SS. Note that the boundary conditions for U(p)ioutside the region for which the elements of the matrices T″ and H″ are non-zero are arbitrary.
3.3 Source Representation For Numerical Trial Functions
Now we apply the above results to the case of numerical trial functions. When using numerical trial functions such as linear splines, some of the terms in the above derivation, such as the surface integrals in the first two lines of eq. (43), may not, strictly speaking, be computable. Nevertheless, the above derivation may be regarded as being valid in the sense of the theory of distributions, as the trial functions could be represented as the limit of a series of functions for which the required computations are rigorously possible. Therefore, the last line of eq. (43)may be used in numerical computations even for the case of trial functions for which the first two lines of eq. (43)cannot be rigorously evaluated. See, for example, Strang & Fix (1973, pp. 11–16) or GO94 () for further discussions of this point.
3.4 Example
As an example, we present the explicit elements of the source vector defined in eq. (43)for the case of a homogeneous 1-D string (see GT95) for which the density is ρ, the elastic constant is κ and with a constant grid interval, Δz. The source is located between the n th and (n + 1)th gridpoints. (Note that in this subsection n denotes a particular node rather than a trial function.)
For completeness, we discuss the case of sources at nodes, as well as those between nodes. Suppose U(p)nis a particular solution for the point force f =δ(z −z0), where z0=n Δz +ε is located an infinitesimal distance from the n th node. By expanding eq. (48)in a Taylor series and using the equation of motion satisfied by the particular solution, it can be readily shown that in the limit as ε→ 0 the same solution (to O (Δz2)) is obtained as for the case of three point force sources at the nodes z = (n − 1)Δz, z =n Δz and z = (n + 1)Δz with weights of 1/12, 10/12 and 1/12, respectively. Similarly, for a point force and z0=n Δz +ε (ε→ 0), we can readily show that the solution obtained using eq. (48)is the same (to O (Δz2)) as the solution obtained for dislocations (displacement discontinuities) with weights of 1/12, 10/12 and 1/12 at z = (n − 1)Δz, z =n Δz and z = (n + 1)Δz, respectively. It can also be shown that the source representation of Cummins et al. (1994) gives the same solution to O (Δz2) when optimally accurate operators are used.
As the computational domain required for the evaluation of the non-zero elements of his small compared with the whole region, the CPU time required to evaluate his negligible.
4 Time Domain Solutions
In the above section, we derived results in the frequency domain. We now derive the corresponding results for the time domain. Note that the time domain solution presented here is for the weak form of the equation of motion (e.g. the weak-form finite-difference method, GT98 and TG00). Such methods differ from staggered grid finite-difference methods (e.g. Virieux 1986) in the following two points. (1) The dependent variable is displacement (rather than stress and velocity). (2) Boundary conditions (such as free surface conditions and continuity of traction) are natural boundary (continuity) conditions that are automatically satisfied (see GO94), whereas they must be imposed explicitly for the staggered grid methods.
4.1 Equation Of Motion For Arbitrary Sources
4.2 Example
4.3 Comparison With The Source Box Approach
The above time domain solution is closely related to the source box approach of Alterman & Karal (1968). Although we do not present a derivation here, Takeuchi (1996) showed for frequency domain numerical solutions that, if we use the analytic solution for an infinite medium as the particular solution and use the conventional numerical operators rather than the optimally accurate numerical operators, the resulting formulation is equivalent to the source box approach. Essentially, the same derivation can be applied to the time domain case as well.
The methods of this paper are preferable to the source box formulation for several reasons. The first is the improvement in accuracy due to using optimally accurate operators (although optimally accurate operators could also be used in the source box formulation). A second reason is the simplicity of coding. The source box approach solves two coupled equations: the equation of motion outside the source region and that inside the source region. These two equations use different dependent variables, and the boundary conditions relating these variables at the boundary of the source region are essential boundary conditions, i.e. boundary conditions that must be imposed explicitly (see GO94). The software must thus be coded to account for different source locations. On the other hand, in our approach there is only one equation of motion (eq. 39in the frequency domain or eq. 53in the time domain). An arbitrary source is accounted for by the source term hin eq. (39)or pin eq. (53). If the source location is changed, all that is necessary is to change this term. This greatly reduces the complexity of the software, as compared with that required for the source box formulation.
5 Extension To Other Methods
5.1 Staggered Grid Solution
Numerical examples (not presented here) show that the synthetic seismograms for the source between nodes computed by using the force term of eq. (59)in the staggered grid formulation give almost equivalent accuracy to that for a source at the nodes. However, as shown by GT98, computationally tractable optimally accurate operators cannot be defined for staggered grid methods. Thus, staggered grid solutions will be much less accurate than solutions using optimally accurate operators for the same CPU time.
5.2 Pseudo-Spectral Method
Eqs (55)and (61)are formally equivalent, but the cost for numerical evaluation of these expressions is quite different. As the PSM uses a Fourier basis, evaluation of eq. (61)requires the evaluation of U(p), X″PSMand K″PSMfor all gridpoints. Furthermore, the evaluation of X″PSMand K″PSMis not straightforward. Thus, the evaluation of eq. (61)would require considerable CPU time. However, this difficulty can readily be eliminated by using eq. (55)directly instead of eq. (61). In other words, we use the finite-difference method to evaluate the source term, but we use the PSM to solve the equation of motion. In a PSM formulation using both velocity and stress as dependent variables, we can use the source term in the same fashion for the staggered grid finite-difference method (e.g. eq. 59). Nonetheless, as pointed out by Mizutani (2000), the poor cost–performance ratio of the PSM for heterogeneous media with sharp discontinuities is a strong argument against using the PSM for modelling wave propagation in realistic Earth models.
6 Application To Cmt Inversion
One of the prime reasons for developing accurate and efficient methods for computing synthetic seismograms is to use them to conduct accurate and efficient waveform inversion for Earth structure and earthquake source parameters. In this section, we show how one can use our source representation (eq. 32or eq. 43) in CMT inversion. The quantities that must be computed for CMT inversion are the synthetic seismograms ui and their partial derivatives δui with respect to the perturbation of the source parameters (centroid and moment tensor) at the receiver position r.
The key advantage of this algorithm is that, as zis common between eqs (64)and (65), the required number of solutions of eq. (62), which is the most computationally intensive step, can be reduced. Using this algorithm only one LU-factorization and one forward- and back-substitution per receiver are required (see Hara 1997, for details).
When we apply the source representation derived in this paper to the above algorithm, there will be no difficulty in defining the terms on the right-hand side of eqs (62)and (64). These are a single force at the receiver and the source term of the actual earthquake, respectively, and we can apply eqs (32)and (43)as they are.
7 Numerical Examples
In this section, we present numerical examples that confirm that optimal accuracy is achieved if and only if optimally accurate numerical operators are used in combination with the source representation derived in this paper.
The error of synthetic seismograms computed by the source representation derived in this paper (solid line) and the conventional source representation (broken line) for various source locations. Vertical lines show the location of numerical nodes. The error of synthetics computed for sources at nodes by the source representation of Cummins et al. (1994) (crosses) is also shown. Optimally accurate operators are used for all cases. The vertical axis is on a log scale.
The error of synthetic seismograms computed by the source representation derived in this paper (solid line) and the conventional source representation (broken line) for various source locations. Vertical lines show the location of numerical nodes. The error of synthetics computed for sources at nodes by the source representation of Cummins et al. (1994) (crosses) is also shown. Optimally accurate operators are used for all cases. The vertical axis is on a log scale.
When the source is exactly at a node (i.e. when r0= 3000, 3005, …, 3025 for this case), we can use the source representation of Cummins et al. (1994), which explicitly imposes the discontinuity condition at the source. The accuracy of synthetic seismograms computed using this source representation is also shown in Fig. 2. This confirms that our source representation achieves comparable accuracy to this method even when the source is not located at a node.
Fig. 3(a) shows a comparison of the accuracy of the various methods as a function of the number of grid intervals. Because the accuracy of the synthetics is highly dependent on ξ for the conventional source representation, we change the source location for each grid interval to satisfy ξ= 0.283 around r0= 3000 km. The highly accurate synthetic seismograms used to evaluate the accuracy are computed by the same method as that for the numerical examples in Fig. 2 except that we put an additional node at the source location. As expected, synthetics computed using a combination of conventional operators and the conventional source representation have the worst accuracy. However, if we replace the conventional matrix operators by optimally accurate operators but use the conventional source representation, the accuracy of the synthetics does not improve significantly. If and only if we use both optimally accurate matrix operators and the source representation derived in this paper, we obtain high accuracy. As shown by Fig. 3(a), this combination yields O (Δr2) accuracy, whereas the combination of optimally accurate matrix operators and a conventional source representation gives only O (Δr) accuracy for a source between node points. This is because the conventional source representation of eq. (6)has only O (Δr) accuracy. For this computation (iii in Fig. 3a), the source representation in eq. (43)was evaluated using the optimally accurate operators for T″ and H″. In Fig. 3(b), we evaluate the accuracy when we use the conventional operators for T″ and H″. Even for this case the accuracy is proportional to O (Δr2) and is greatly improved compared with cases i and ii in Fig. 3(a), for which the conventional source representation is used. However, as expected from the theoretical considerations in and eqs (44)and (45), better accuracy is obtained by using the optimally accurate operators for T″ and H″ in eq. (43).
(a) (i) The error of synthetic seismograms computed using a combination of the conventional operators and the conventional source representation (dashed line); (ii) error of synthetic seismograms computed by a combination of the optimally accurate operators and the conventional source representation (broken line); and (iii) error of synthetic seismograms computed by a combination of the optimally accurate operators and the source representation derived in this paper (solid line) for various grid intervals. The vertical and horizontal axes are both on log scales. (b) (iii) the same as (iii) in (a); (iv) the same as (iii) in (a), except that conventional operators are used for T″ and H″ to evaluate the source representation in eq. (43).
(a) (i) The error of synthetic seismograms computed using a combination of the conventional operators and the conventional source representation (dashed line); (ii) error of synthetic seismograms computed by a combination of the optimally accurate operators and the conventional source representation (broken line); and (iii) error of synthetic seismograms computed by a combination of the optimally accurate operators and the source representation derived in this paper (solid line) for various grid intervals. The vertical and horizontal axes are both on log scales. (b) (iii) the same as (iii) in (a); (iv) the same as (iii) in (a), except that conventional operators are used for T″ and H″ to evaluate the source representation in eq. (43).
The above results also hold for more realistic earth models. Fig. 4 shows synthetic seismograms computed for various combinations of operators and source representations. All of the synthetic seismograms are computed for a spherically symmetric model using the Direct Solution Method software of Cummins et al. (1994). The synthetics are for the toroidal component of velocity seismograms for the isotropic PREM model (Dziewonski & Anderson 1981) for a vertical dip point source at 597 km depth with a step-function time history. The distance between the epicentre and the station is 108°. Computations are made for the period range 20–5120 s, and a six-pole low-pass filter with a corner of 31.25 mHz is applied. We use 200 nodes in the radial direction in the mantle. The very accurate synthetic seismograms used to assess the accuracy are computed using the optimally accurate operators for very fine grids (3200 nodes) with an additional node at the source location. For the computations in Figs 4(a)–(d), the source is located between nodes. The synthetics in Figs 4(a) and (c) are computed using eq. (3)for the conventional and optimally accurate operators, respectively. Because the trial functions are continuous between nodes the displacement discontinuity required at the source cannot be reproduced by the numerical solution; the relatively large errors in Figs 4(a) and (c) are thus expected. Figs 4(b) and (d) show numerical solutions obtained using the source representations derived in this paper (eqs 29and 43) for the conventional and optimally accurate operators, respectively. These figures show that accurate results are only achieved when the source representation derived in this paper is used in combination with optimally accurate operators.
Comparison of the accuracy of synthetic seismograms computed using various combinations of operators and source representations for PREM. (a) Conventional operators and a conventional source representation, (b) conventional operators and the source representation derived in this paper, (c) optimally accurate operators and a conventional source representation, (d) optimally accurate operators and the source representation derived in this paper, (e) optimally accurate operators and a source representation obtained by putting an additional node at the source location and explicitly imposing the discontinuity boundary condition. In each figure, the top trace is the synthetic seismograms for the respective method, the second trace is an almost exact synthetic seismogram (computed using a very fine grid), and the bottom trace is the residual (the difference between the synthetic seismogram and the almost exact seismogram).
Comparison of the accuracy of synthetic seismograms computed using various combinations of operators and source representations for PREM. (a) Conventional operators and a conventional source representation, (b) conventional operators and the source representation derived in this paper, (c) optimally accurate operators and a conventional source representation, (d) optimally accurate operators and the source representation derived in this paper, (e) optimally accurate operators and a source representation obtained by putting an additional node at the source location and explicitly imposing the discontinuity boundary condition. In each figure, the top trace is the synthetic seismograms for the respective method, the second trace is an almost exact synthetic seismogram (computed using a very fine grid), and the bottom trace is the residual (the difference between the synthetic seismogram and the almost exact seismogram).
Although the source is not at a node for the examples in Fig. 4, we could also have computed the synthetics by placing an additional node at the source and explicitly imposing the discontinuity condition using the method of Cummins et al. (1994), as shown in Fig. 4(e). The accuracy obtained in Fig. 4(e) is comparable to that in Fig. 4(d). Similar accuracy could be obtained if we varied the numerical grid spacing to coincide with the source location. However, since, as discussed below, there are good reasons not to reconfigure the grid for each seismic source, and since the accuracy in Figs 4(d) and (e) is comparable, the use of the methods of this paper appears to be preferable.
8 Discussion
Geller & Hara (1993) showed that the most computationally intensive step in iterative waveform inversion for earth structure is the LU-factorization and the forward and back substitutions in eq. (29), and formulated an algorithm that minimized the required number of such operations. Using their algorithm, the required number of LU-factorizations is only one for each iteration (for each frequency) if the matrix elements are source-independent, while the required number of LU-factorizations would be equal to the number of events if the matrix elements were source-dependent. Thus the lack of dependence of the matrix elements on the left-hand side of eq. (29)on the source location is a key point for efficiently utilizing the algorithm of Geller & Hara (1993). If it was necessary to redefine the grid for each source, as was done to compute the synthetics in Fig. 4(e), this would no longer be possible.
The fact that the matrix elements in our formulation do not depend on the source location also facilitates efficient CMT inversion using Hara's (1997) algorithm, as only one LU-factorization is required for each frequency if the matrix elements are independent of the source location, while the required number of LU-factorizations would be the product of the number of events and the number of iterations if the matrix elements were dependent on the source location. Hara (1997) showed that his algorithm is also useful for simultaneous inversion for CMT solutions and 3-D Earth structure because the LU-factorization operations are common between the CMT inversion and the 3-D structure inversion. However, this also only holds if the matrix elements are independent of the source location. The weak-form solution for arbitrary source locations derived in this paper will thus facilitate efficient and accurate waveform inversion for earthquake source parameters and 3-D Solid Earth structure.
Acknowledgments
We thank Hiromitsu Mizutani for a critical reading of earlier drafts. Discussions with Hiromitsu Mizutani and with Nobuyasu Hirabayashi helped us to clarify the theory in and eqs (44)and (45). We also thank two anonymous reviewers for helpful comments. This research was partly supported by grants from the Japanese Ministry of Education, Culture, Sports, Science and Technology (nos 12740257 and 14540393) and by the Japanese Solid Earth Simulator Project.









































































