-
PDF
- Split View
-
Views
-
Cite
Cite
G Molchan, E Varini, A Peresan, Productivity within the epidemic-type seismicity model, Geophysical Journal International, Volume 231, Issue 3, December 2022, Pages 1545–1557, https://doi.org/10.1093/gji/ggac269
- Share Icon Share
SUMMARY
The productivity of a magnitude m event can be characterized in term of triggered events of magnitude above m − Δ: it is the number of direct ‘descendants’ |$\nu _\Delta$| and the number of all ‘descendants’ |$V_\Delta$|. There is evidence in favour of the discrete exponential distribution for both |$\nu _\Delta$| and |$V_\Delta$| with a dominant initial magnitude m (the case of aftershock cluster). We consider the general Epidemic Type Aftershock Sequence model adapted to any distribution of |$\nu _\Delta$|. Our first result shows that models with branching aftershock structure do not allow for the coincidence of distribution types of |$\nu _\Delta$| and |$V_\Delta$| (say, the discrete exponential, as in the scientific literature). The second problem is related to the tail behaviour of the |$V_\Delta$| distribution. We show the fundamental difference in tail behaviour of the |$V_\Delta$|-distributions for general-type clusters and clusters with a dominant initial magnitude: the tail is heavy in the former case and light in the latter. The real data demonstrate the possibilities of this kind. This result provides theoretical and practical constraints for distributional analysis of |$V_\Delta$|.
1 INTRODUCTION
Earthquakes clustering, aftershocks occurrence in particular, is a prominent feature in observed seismicity. Different models have been proposed to describe the features of triggered events, especially in the case of aftershocks clusters with an initial dominant magnitude earthquake; yet, several theoretical aspects still need to be explored. In this study, we investigate the distributions of the number of triggered events as described by a branching process, which might provide useful practical constraints for empirical analysis of earthquake data.
Let |$V_\Delta$| denote the number of seismic events of magnitude m ≥ m0 − Δ caused by an earthquake of magnitude m0 ≥ mc + Δ, where mc is the lower magnitude threshold and Δ a positive default value. |$V_\Delta$| is further referred to as the total Δ-productivity.
Considering an earthquake cluster as the result of a branching process (Zaliapin et al. 2008), we can also deal with the directly triggered events of m0, that is the first generation events of an m0-event. The number |$\nu _\Delta$| of such events is in the same magnitude range m ≥ m0 − Δ and is later named Δ-productivity of m0.
Similarly, the geometric distribution (eq. 1) was adopted for statistics |$V_\Delta$|, provided that it refers to clusters with a dominant initial magnitude m0, that is, to aftershocks. For the first time this hypothesis appeared in the work by Soloviev & Solovieva (1962); it was based on relatively scarce data (1954–1961) for the Pacific Belt (m0 ≈ 7; Δ = 2) and for the Kamchatka and Kuril Islands region (m0 ≈ 6; Δ = 2). Shebalin et al. (2018) expanded the analysis and confirmed the hypothesis using 850 aftershock sequences with main shocks m0 ≥ 6.5, Δ = 2 from the ANSS (1975–2018) catalogue. In the same time Kagan (2010), using aftershocks m ≥ 4.7 from PDE (1977–2007) catalogue for m0 = 7.1−7.2, found that the empirical distribution of |$V_\Delta$| is bimodal with the dominant peak at n = 0.
Zaliapin & Ben-Zion (2016) provided a thorough analysis of earthquake clustering and showed its spatial dependence and connection with heat flow production. They showed that, on the global scale, the total Δ-productivity distribution (for clusters with max m ≥ 6 and Δ = 2) looks like a power-law for locations with high heat flow (H > 0.2 W m–2), and as a log-normal distribution otherwise. These conclusions are qualitative because they are based on log–log plots. Nevertheless, the data for the regions of high heat flow differ significantly from the geometric hypothesis (eq. 1) for aftershocks. The same should be expected for any cluster with max m ≤ 5. In this case, according to Zaliapin & Ben-Zion (2016), the cluster size distribution in zones with both high and low heat flow is a power type distribution, f(n)∝n−κ − 1, κ ≈ 2.5.
These observations rise the following questions: (i) is it possible to consider the geometric distribution of |$\nu _\Delta$| and |$V_\Delta$| (for aftershocks) as laws of seismicity and (ii) how to explain the observed difference in the tail behaviour of the |$V_\Delta$| distribution for arbitrary clusters and aftershocks. A rigorous analysis of these issues is possible within the framework of a model adequately related to the concepts of |$\nu _\Delta$| and |$V_\Delta$|. A model suitable for this purpose may be the general ETAS model, namely the ETAS model with an arbitrary (not only Poisson) distribution of direct aftershocks associated to any event (Saichev et al. 2005). The purpose of this note is twofold: first, to show the incompatibility of the hypotheses about the geometric distribution of |$\nu _\Delta$| and |$V_\Delta$| (for aftershocks) within the general ETAS model; and second, to prove the fundamental difference between size distributions for general type clusters and aftershock clusters.
2 ETAS MODELS
The independence of the magnitudes of direct ‘descendants’ between themselves and from the parent event x0 is an important property of the model. This allows us to consider the distribution f1(m) as a frequency–magnitude law for the cluster as a whole. This also is important for understanding the choice of the distribution models for |$\nu _\Delta$| and |$V_\Delta$|.
Note: The exponents (α, β) in the laws of Utsu and Gutenberg–Richter are more convenient (Saichev et al. 2005) to use with the base ‘e’ instead of the usual 10.
A theoretical analysis of various aspects of the general ETAS model with the regular components λ(m) and f1(m) is provided in the works of Saichev and co-authors (see Saichev & Sornette 2017, for further details). In particular, they proved the power law behaviour of the cluster size distribution fcl(n)∝n−1 − κ with κ ∈ [0.5, 1] (Saichev et al. 2005; Baró 2020). Note that, according to Zaliapin & Ben-Zion (2016), κ ∈ [1, 2.5] for real data.
To provide the G-distribution of Δ-productivity for any Δ, it is natural to consider the G-distribution of ν(m). It follows from the following seemingly obvious but useful statement.
The proof of the statement and all subsequent ones are contained in the Appendix.
The conditions for the independence of the |$\nu _\Delta (m_0)$|-distribution from m0 are very strict, and therefore, in contrast with Shebalin et al. (2020), this property should be unstable.
The Random Thinning (RT) property. Statement 1 allows the following generalization. Consider the RT(π) operation of a random cluster thinning: RT(π) saves each cluster event independently with probability π, and deletes it with probability 1 − π; the transformed cluster size V is RT(π)V. Given the magnitude independence in the ETAS(F) model, the productivity |$\nu _\Delta$| has the same distribution as RT(π)ν provided that π = P(m ≥ m0 − Δ).
In the case F = (P or G), E[RT(π)ν] = πEν and the distribution type of ν and RT(π)ν is identical. We will call these two properties of the discrete distribution the RT property. We can formalize the RT property as follows.
Examples of the RT property.
The Negative Binomial Distribution (NBD) for whichφ(z, λ) = (1 − λ(z − 1))−τ, τ > 0, holds the RT property. In the case τ = 1 it corresponds to the Geometric distribution.
- Combining Nindependent clusters withRT property also preserves this property. Formally this follows from eq. (7) and the relation for the cluster characteristics φi(z, λi):The NBD model was used by Kagan (2014) to describe the distribution of annual earthquake numbers for the CIT catalogue 1932–2001, m ≥ 5.0. Below we will show that the RT property, which NBD possesses, is poorly compatible with the branching cluster structure. However, in the Kagan’s case, the statistics of events occurring worldwide in one year consists mainly of independent weakly grouped events. Such case is closer to the model (eq. 8) with components: φi(z, λi) = 1 + λi(z − 1), λi < 1.(8)$$\begin{eqnarray*} \varphi (z,\lambda ) = \prod _i \varphi _i(z,\lambda _i) = \prod _i \psi _i(p_i\lambda (z-1)), \quad \quad p_i\lambda =\lambda _i \quad {\rm and} \quad \sum p_i=1 \end{eqnarray*}$$
- A cluster selected randomly with probabilities {ρi > 0} from N clusters withRT property retains RT property. In this caseHowever, if the cluster sizes distributions are of the same type, that isφi(z, λi) = ψ(λi(z − 1)), then the random cluster usually loses this type provided that λi ≠ const.(9)$$\begin{eqnarray*} \varphi (z,\lambda )=\sum _i \varphi _i(z,\lambda _i)\rho _i =\sum _i \psi _i(p_i\lambda (z-1))\rho _i , \quad \quad \lambda p_i=\lambda _i \quad {\rm and} \quad \sum _i \lambda _i \rho _i = \lambda . \quad \end{eqnarray*}$$
For example, assuming a geometric distribution for both the size of a random cluster and the sizes of its components, (9) gives (1 − w)−1 = ∑i(1 − wpi)−1ρi, w = λ(z − 1), which is possible only when pi = 1. This is also true in general case if φ(z, λi) is N-smooth at z = 1. The considered case is typical for the observed clusters taken as one statistical population. Consequently, empirical distributions of |$\nu _\Delta$| or |$V_\Delta$| obtained over large territories generally do not represent the true types of their distributions. This circumstance makes it difficult to prove the universality of the type of |$\nu _\Delta (m_0)$| distribution.
ETAS* model. To get the geometric distribution of |$\nu _\Delta$| in framework of the ETAS model, Shebalin et al. (2020) used randomization of the parameter λ(m0). They used the fact that a Poisson distribution with an exponentially distributed random parameter is equivalent to a geometric distribution. The resulting model was named ETAS*. Its analogue is the ETAS(G) model in which the average productivity does not depend on the initial magnitude, namely λ(m0) = λ. Because of (6), in this case |$\lambda _\Delta (m_0)$| becomes dependent on m0 for any f1(m) > 0.
3 THE TOTAL Δ -PRODUCTIVITY
We will consider the ETAS(F) model in a subcritical regime: Λ1 = Eλ(m) < 1. Eν(m) = λ(m) is a non-decreasing, finite, positive function; the other components, namely f1(m), f2(t) and f3(g∣m), are free from additional restrictions.
As above m0 > 0 is the magnitude of an initial cluster event and |$V_\Delta (m_0)$| is the total Δ-productivity of m0. Any initial event can be fixed or random, namely it has the f1(m) distribution. A further distinction concerns the dominant initial events, that is those initial events having maximum magnitude within their cluster. The case of a random m0 is important because identifying the initial cluster event is an unstable practical procedure.
Statement 2. Consider the specified ETAS(F) model. Assume that all moments of F-distribution are finite and F has the RT property.
- Let |$V_\Delta (m_0)$|, m0 > Δ > 0, be the size of the cluster with any condition C: the initial magnitude m0 is fixed or random. Then the number |$N_{V_\Delta }$| of finite moments of |$V_\Delta (m_0)$| is independent of C. Moreover,$$\begin{eqnarray*} N_{V_\Delta } = N_\lambda := max\left\lbrace n: I_n = \int _0^\infty \lambda ^n(m) f_1(m) dm \lt \infty \right\rbrace \ . \end{eqnarray*}$$
- Let F be Poisson or Geometric distribution and m0 is fixed and dominant. Then |$N_{V_\Delta }=\infty$| regardless of the Nλ-value. This is also true for the random dominant m0-event if additionally(13)$$\begin{eqnarray*} \lambda (m) \int _0^\Delta f_1(m-u) {\rm d}u \le {\rm const}\lt \infty \ . \end{eqnarray*}$$
Statement 2 allows us to judge on the tail attenuation of the |$V_\Delta$| distribution because, by the Chebyshev–Markov inequality, P(ξ ≥ u) ≤ Eξn/un for any random variable ξ ≥ 0 and u > 0. Distributions having all moments will be called distributions with light tails. Otherwise, we will talk about heavy tails.
Let’s consider clusters with fixed or random initial magnitude. The conditions of the Statement 2 are easily verified for the regular ETAS components: λ(m) = λeαm, α > 0 and f1(m) = βe−βm, β > α. In this case we have Nλ < β/α. This means that the total Δ-productivity distribution has only a heavy tail for small β/α. The data of Zaliapin & Ben-Zion (2016) demonstrate the possibilities of this kind. In the considered case Saichev et al. (2005) give more information on the tail behaviour of |$V_\Delta$| distribution: if β/α is large, then the tail is subexponential and looks like eq. (11).
The case α = 0 corresponds to the analogue of the ETAS* model; here Nλ = ∞ in accordance with eq. (10).
Given α ≤ β, condition (13) is satisfied. This guarantees the light tails in the distribution of the total Δ-productivity of main shocks. All the observations mentioned above support this fact.
The difference in tail behaviour of |$V_\Delta$| statistics can be partially explained as follows. The total Δ-productivity of the main shock with a fixed magnitude m0 is based on events in the finite range of magnitudes (0, m0). In this case, the frequency–magnitude law f1(m) is replaced by its truncated counterpart in the interval (0, m0), where all moments of λ(m) are finite. Since |$N_{V_\Delta }=N_\lambda$|, the tail of the |$V_\Delta$| distribution becomes typical for the case Nλ = ∞ that is quasi exponential. The similar effect for a random dominant magnitude is not so obvious and is a non-trivial analytical fact.
Statement 3. (1) Let m0 be a dominant initial magnitude in the ETAS(G) cluster and let f1(m) be strictly positive. Then the statistics |$V_\Delta (m_0)$| can have a geometric distribution for no more than one value Δ ∈ (0, m0).
(2) Let ν and V be the sizes of clusters of direct descendants and all descendants, respectively, in the ETAS(F) cluster with the random initial event. Assume that both distributions of ν and V have the RT property, as well as Eν2 < ∞ and ∫λ2(m)f1(m)dm < ∞. Then the distribution types of ν and V cannot be the same.
According to Statement 3 we can say that, in the framework of generalized ETAS model, the geometric laws for |$\nu _\Delta$| and |$V_\Delta$|, including the case of aftershocks, are incompatible. In the ETAS(G) model case with λ(m) = const., the geometric law of |$\nu _\Delta$| completely excludes the RT property for |$V_\Delta$| (see eq. 12). The reason for this lies in the branching structure of the cluster.
Numerical examples
To support the theoretical conclusions, we simulated a synthetic earthquake catalogue from the regular ETAS(P) model. To mimic the features of real seismicity, the set of ETAS parameters was mainly derived from the analysis of the earthquake catalogue of Northeastern Italy (Benali et al. 2020). The simulation region also corresponds to that area, namely: Lon 11.5–14.0 and Lat 45.5–47.0.
The model was based on the following parameters:
The uniform background seismicity with the rate μ = 0.04 events (degree2·day)–1, which roughly corresponds to the rate of background events identified in the earthquake catalogue of Northeastern Italy. The uniformity makes it easier the declustering process, as it allows to better identify the clustered component.
Magnitude range m ≥ 2; the Gutenberg–Richter parameter β = 2.07, which corresponds to the b-value 0.9.
The Utsu Law parameters: λ = 0.22 (event/day), α = 1.54. The estimated λ = 0.67 for Northeastern Italy was reduced to 0.22 to satisfy the stability conditions Λ1 = Eλ(m) < 1.
The Omori law parameters: c = 0.015 (day), p = 1.037. The estimated p parameter very close to 1 implies an extremely long tail of f2(t), so that offspring events may span several thousands years (Harte 2013); this practically hinders the analysis of cluster size for synthetic catalogues. However, we recall that, from the theoretical viewpoint, the temporal component of the ETAS model does not affect cluster size. Therefore, in order to prevent this inconvenience, instead of increasing the p parameter, we considered a truncated distribution f2(t) on the support t < 0.7 yr, which corresponds to the 95 per cent quantile of the cluster lifetime for Northeastern Italy.
The space scattering law parameters: d = 0.0085 (degree), q = 2.25, γ = 0.62.
The synthetic earthquake catalogue, consisting of 143 568 events, spans about 500 yr and includes information about the links between each event and its direct descendants. The synthetic time-series were processed by the nearest-neighbour method (Zaliapin & Ben-Zion 2013) to identify clusters of events. This method allows partitioning earthquakes into background and clustered components, based on nearest-neighbour distances between earthquakes in the space–time–magnitude domain (Baiesi & Paczuski 2004; Zaliapin et al. 2008). In this study, the parameters necessary for the computation of nearest-neighbour distances, namely the b-value and the fractal dimension of epicentres df, were set according to the values estimated from the analysis of Northeastern Italy catalogue (Benali et al. 2020, and references therein): b-value = 0.9 and df = 1.1. The threshold of the nearest-neighbour distance, which separates the clustered and background components, was automatically set (log10η0 = −4.2) following a criterion based on a 1-D Gaussian mixture model with two modes, where the threshold is the maximum likelihood boundary between the two modes (Zaliapin & Ben-Zion 2013). The total Δ-productivity distribution was examined considering the clusters extracted from the synthetic earthquake catalogue.
Fig. 1 represents the total Δ-productivity distributions (Δ = 2) (a) for arbitrary initial events and (b) for main shocks, as identified by nearest-neighbour method applied to the synthetic ETAS(P) seismicity. In the first case, the distribution |$V_\Delta$| demonstrates a heavy tail (requiring the use of logarithmic values in the figure), which is also confirmed by the coefficient of variation |$\sigma (V_\Delta )/EV_\Delta \approx 10.8$|. On the contrary, the distribution |$V_\Delta$| for main shocks (Fig. 1b) demonstrates the light tail behaviour; for comparison with the previous case |$\sigma (V_\Delta )/EV_\Delta \approx 0.7$|.

Clusters identified by nearest-neighbour method applied to the synthetic ETAS(P) seismicity: The |$V_\Delta$| distributions, Δ = 2, for (a) arbitrary initial events, m0 ≥ 4 and (b) main shocks, m0 ≥ 4.
Fig. 2 shows similar results based on the clusters originally defined by the branching structure of the synthetic ETAS(P) catalogue. The coefficients of variation are also consistent with previous results: (a) |$\sigma (V_\Delta )/EV_\Delta \approx 9.7$| for arbitrary initial events and (b) |$\sigma (V_\Delta )/EV_\Delta \approx 0.8$| for main shocks.

Clusters in the synthetic ETAS(P) seismicity: The |$V_\Delta$| distributions, Δ = 2, for (a) arbitrary initial events, m0 ≥ 4 and (b) main shocks, m0 ≥ 4.
A good agreement of the empirical data Figs 1 and 2 demonstrates the robustness of conclusions about the tail behaviour of |$V_\Delta$| distribution.
4 CONCLUSION
The concept of the Δ-productivity |$\nu _\Delta$| is associated with a priori ideas about the structure of a cluster of seismic events in the form of a random branching tree. Such a representation is not the only possible one. Therefore, the Geometric distribution of |$\nu _\Delta$| can be considered as a useful alternative within the framework of the generalized ETAS(F) model. The other characteristic of the Δ-productivity, namely |$V_\Delta$| for main shocks, is more objective and stable.
For large class of the ETAS(F) models, we answered the question when the total Δ-productivity distribution has light tails for main shocks and heavy ones for arbitrary events. The real data demonstrate the possibilities of this kind.
Note the difficulties of substantiating the mentioned geometric laws for |$\nu _\Delta$| and aftershock’s |$V_\Delta$|. They are associated both with the declustering of seismicity and with the effect of averaging the distributions of |$\nu _\Delta$| and |$V_\Delta$| when grouping data.
The first difficulty can be judged by experiments of Zaliapin & Ben-Zion (2013) with the nearest-neighbour method. This method adapted to the ETAS structure has 40 per cent error of incorrectly determining the parent of an event. The stability of the conclusions in this case may depend on the strict similarity in the real hierarchical structure of the seismic cluster.
The second difficulty is expressed in the fact that the mixture of geometric distributions retains the original property of monotonicity, but ceases to be geometric, that is loses the original G-type. This is unavoidable if the distribution parameter is dependent on location or the magnitude of the initial event (see eq. 6). Therefore, we can speak more confidently about the light tail of the |$V_\Delta$| distribution for the main shocks.
Finally, the theoretical result for ETAS(F) model shows that the types of distributions of |$\nu _\Delta$| and |$V_\Delta$| cannot be the same due to the branching structures of the clusters.
ACKNOWLEDGEMENTS
We are grateful to Prof J. Zhuang and another anonymous reviewer for their careful reading of the manuscript and comments that allowed us to significantly revise and improve the text.
DATA AVAILABILITY
The data (synthetic earthquake catalogue) underlying this paper will be shared on reasonable request to the corresponding author.
REFERENCES
APPENDIX
A1 Proof of Statement 1
A2 Proof of the statement 2
A2.1 General remarks.
We will need the following notation: NM(m) is the total number of events of magnitude ≥M triggered by an event of magnitude m. In particular, the total Δ-productivity |$V_\Delta (m_0)$| of the initial event with magnitude m0 is |$N_{M(m_0)}(m_0)$|, M(m) = (m − Δ)+, where a+ = max(a, 0). The notation E(*∣m) denotes the conditional mean given m.
The spatio-temporal components of the ETAS model do not affect cluster size, since the law of generating new events depends only on the parent magnitude, and the problem of declustering does not arise in our theoretical analysis. Therefore, the spatio-temporal components in theoretical analysis can be any and the distribution of NM(m) is the same for any x = (t, g, m).
The moments |$\mathbf {EV_\Delta ^n}$|.
Mean value: |$\mathbf {EV_\Delta }$|.
The case |$\mathbf {n\gt 1}$|.
(a) Fixed/random initial |$\mathbf {m_0}$|
(b) Fixed dominant |$\mathbf {m_0}$|
Let us consider the ETAS cluster with a dominant initial magnitude m0. Let |$\hat{N}_{0M}(m)$| be the number of events of magnitude >M in the cluster with the initial event m ≤ m0 in which all events have magnitude <m0. In such cluster, direct descendants of any initial magnitude m are realized under the condition Ω(m) = {mi < m0: i = 1, …, ν(m0)}.
(c) Random dominant |$\mathbf {m_0}$|
The proof is complete.
A3 Proof of the statement 3.1
The proof is complete.