Abstract

We demonstrate that universal scaling behavior is observed in the current coronavirus (SARS-CoV-2) spread, the COVID-19 pandemic, in various countries. We analyze the numbers of infected people who tested positive (cases) in 11 selected countries (Japan, USA, Russia, Brazil, China, Italy, Indonesia, Spain, South Korea, UK, and Sweden). By using a double exponential function called the Gompertz function, |$f_\mathrm{G}(x)=\exp(-e^{-x})$|⁠, the number of cases is well described as |$N(t)=N_0 f_\mathrm{G}(\gamma(t-t_0))$|⁠, where |$N_0$|⁠, |$\gamma$|⁠, and |$t_0$| are the final number of cases, the damping rate of the infection probability, and the peak time of the daily number of new cases, |$dN(t)/dt$|⁠, respectively. The scaled data of cases in most of the analyzed countries are found to collapse onto a common scaling function |$f_\mathrm{G}(x)$| with |$x=\gamma(t-t_0)$| being the scaling variable in the range of |$f_\mathrm{G}(x)\pm 0.05$|⁠. The recently proposed indicator, the so-called |$K$| value, the increasing rate of cases in one week, is also found to show universal behavior. The mechanism for the Gompertz function to appear is discussed from the time dependence of the produced pion numbers in nucleus–nucleus collisions, which is also found to be described by the Gompertz function.

1. Introduction

The COVID-19 pandemic is the worst disease spread in this century. As of May 20, 2020, over 4 million people have tested positive in the world, and the number of cases |$N(t)$|⁠, the number of infected people who tested positive at the time |$t$|⁠, is still increasing rapidly. In order to control the spread of infection, it is desirable to understand the diffusion mechanism of COVID-19.

Recently, a double exponential function called the Gompertz function was found to catch the features of |$N(t)$| [121]. The Gompertz function appears when the infection probability per infected people exponentially decreases as a function of time. With the Gompertz function, the daily number of new cases |$dN(t)/dt$|⁠, the daily increase of infected people who tested positive, shows an asymmetric time profile rather than the symmetric one found in the prediction of the susceptible–infected (SI) model [22], one of the standard models for the spread of infection. The Gompertz function was proposed by B. Gompertz in 1825 to discuss life contingencies [23]. It is interesting to note that the Gompertz function also appears as the number of tumors [24] and the number of detected bugs in a piece of software [25], as well as particle multiplicities at high energies [26].

The exponential decrease of the infection probability is also important to deduce when the restrictions can be relaxed. For example, Nakano and Ikeda [27] found that the number of cases is well characterized by the newly proposed indicator |$K$|⁠, which represents the increasing rate of cases in one week. The indicator |$K$| takes a value between zero and unity, is not affected by the weekly schedule of the test, and is found to decrease almost linearly as a function of time in the region |$0.25<K<0.9$| provided that there is only a single outbreak affecting the infection. In order to understand the linearly decreasing behavior of |$K$|⁠, Nakano and Ikeda proposed the “constant damping hypothesis” in their paper [27], and Akiyama proved that the hypothesis of exponential decrease in discrete time shows the Gompertz curve in continuous time [28], independently of other works [121]. Since the indicator |$K$| is expected to be useful to predict the date when the restrictions can be relaxed as |$K(t) \simeq 0.05$|⁠, it would be valuable to analyze its solution, the Gompertz function, in more detail.

In this article, we analyze the number of cases by using the Gompertz function. We examine that |$N(t)$| in one outbreak is well described by the Gompertz function in several countries. Then with the time shift and the scale transformation of time and |$N(t)$|⁠, the data are found to show universal behavior; they are on one curve described by the basic Gompertz function, |$\exp(-e^{-x})$|⁠. The newly proposed indicator |$K$| [27] also shows universality. We further discuss that the number of produced pions in nucleus–nucleus collisions is described by the Gompertz function. This similarity may be helpful to understand the mechanism of COVID-19 spread.

This article is organized as follows. In Sect. 2, we give a brief review of the Gompertz function and its relevance to the disease spread. In Sect. 3, we show a comparison between the number of cases and the Gompertz function fitting results. We demonstrate that the numbers of cases in many countries show universal scaling behavior. In Sect. 4, we show that the number of pions in nuclear collisions is well described by the Gompertz function, and we deduce a mechanism to produce the time dependence described by the Gompertz function. In Sect. 5, we summarize our work.

2. Gompertz function, indicator |$K$|⁠, and scaling variables

When the infection probability |$k(t)$| is given as a function of time, the evolution equation for the number of infected people |$N(t)$| and its solution are given as
(1)
Adopting an exponentially decreasing function as |$k(t)$|⁠, the solution is found to be
(2)
By choosing the reference time |$t_0$| so that |$k(t_0)=\gamma$|⁠, the solution is given as
(3)
where |$N_0=N(t_0)\exp(k(t_0)/\gamma)=eN(t_0)$| is the asymptotic value of |$N(t)$|⁠, |$N_0=\lim_{t\to\infty}N(t)$|⁠. Here, |$N_0$| can be interpreted as “terminal velocity”, since the evolution equation (1) has an analogy to the equation of motion of particle feeling a viscous resistance, where its coefficient is not constant but exponentially decreasing.
The double exponential function appearing in Eq. (3) is called the Gompertz function [23]:
(4)
By using the Gompertz function, |$N(t)$| is given as
(5)
where |$x$| is the scaling variable for |$N(t)$|⁠.
One of the characteristic features of the Gompertz function is the asymmetry of its derivative in the early (⁠|$x<0$|⁠) and late (⁠|$x>0$|⁠) stages. In Fig. 1, we show the Gompertz function |$f_\mathrm{G}(x)$| and its derivative,
(6)
as functions of |$x$|⁠. The derivative takes the maximum at |$x=0$|⁠, and the asymmetry in the negative and positive |$x$| region is clearly seen.
Comparison of Gompertz $f_\mathrm{G}(x)$ and sigmoid $f_\mathrm{S}(x)$ functions and their derivatives, $f'_\mathrm{G}(x)=df_\mathrm{G}(x)/dx$ and $f'_\mathrm{S}(x)=df_\mathrm{S}(x)/dx$, on linear (top) and logarithmic (bottom) scales. We also show $f'_\mathrm{S}(2x)$, the sigmoid function with doubled infection probability, by green dot-dashed curves.
Fig. 1

Comparison of Gompertz |$f_\mathrm{G}(x)$| and sigmoid |$f_\mathrm{S}(x)$| functions and their derivatives, |$f'_\mathrm{G}(x)=df_\mathrm{G}(x)/dx$| and |$f'_\mathrm{S}(x)=df_\mathrm{S}(x)/dx$|⁠, on linear (top) and logarithmic (bottom) scales. We also show |$f'_\mathrm{S}(2x)$|⁠, the sigmoid function with doubled infection probability, by green dot-dashed curves.

This asymmetry should be compared with the solution of the SI model [22], in which the susceptible people (⁠|$S(t)$|⁠) are infected by the infectious people (⁠|$I(t)$|⁠) at a rate proportional to the product |$S(t)I(t)$|⁠,
(7)
where |$k$| is a constant. The solution is found to be
(8)
where |$N_0=S+I$| is a constant, |$\gamma=N_0k$|⁠, and |$t_0$| is the time when half of the people are infected, |$I(t_0)=N_0/2$|⁠. The characteristic function |$f_\mathrm{S}(x)$| is called the sigmoid function, which is also referred to as the logistic function:
(9)
Then the daily number of infected people is a symmetric function of |$t-t_0$|⁠,
(10)
where |$f'_\mathrm{S}(x)=df_\mathrm{S}(x)/dx$|⁠. In Fig. 1, we also show |$f_\mathrm{S}(x)$| and |$df_\mathrm{S}(x)/dx$| by green curves. At large |$x$| (⁠|$x \geq 1$|⁠), the Gompertz and sigmoid functions show similar behavior, |$f_\mathrm{G}(x)\simeq f_\mathrm{S}(x)\simeq 1-\exp(-x)$| and |$f'_\mathrm{G}(x)\simeq f'_\mathrm{S}(x)\simeq \exp(-x)$|⁠. The difference found in the derivatives in the figure comes from the factor (⁠|$e$| and |$4$|⁠) multiplied to normalize them to be unity at |$x=0$|⁠. By comparison, the sigmoid function with doubled transition probability (⁠|$f'_\mathrm{S}(2x)$|⁠) approximately agrees with |$f'_\mathrm{G}(x)$| on average in the negative |$x$| region, |$-2\leq x \leq 0$|⁠. As already noticed [121] and as discussed later, the asymmetry is clearly found in |$dN(t)/dt$| for COVID-19, so |$N(t)$| is better understood by the Gompertz function than by the sigmoid function.
Once the solution is given, one can obtain the |$K$| value [27], the increasing rate of infected people in a week, as
(11)
where |$x_K$| is the scaling variable for |$K(t)$|⁠,
(12)
 
(13)
with |$w=7$| days.

In the later discussions, we regard the number of cases (the number of infected people who tested positive) as the number of infected people, since the latter is difficult to measure. We expect that the former takes a similar value to the latter, as long as the infected people are defined as infectious people who already have symptoms or bear enough coronavirus DNA.

3. Comparison with data

3.1. Adopted dataset

Let us now examine the universal behavior given by the Gompertz function in real data. We use the data given in Ref. [29] as of May 21, which contain the |$N(t)$| data from December 31, 2019 |$(t=0)$| till May 20 (⁠|$t=141$|⁠). Throughout this article, we measure the time |$t$| in units of days. In order to avoid the discontinuity coming from the definition change, we have removed the spikes in the daily increase (⁠|$dN(t)/dt$|⁠) data in Japan (April 12, |$t=103$|⁠) and China (February 13, |$t=44$|⁠), and instead the daily numbers in previous days are increased by multiplying a common factor, which is determined to keep the total number of cases in the days after the spike. The number of cases (⁠|$N(t)$|⁠) is obtained as the integral of the thus-smoothed |$dN(t)/dt$|⁠. In addition, seven-day averages (⁠|$\pm 3$| days) are considered in the analysis of |$dN(t)/dt$| in order to remove the fluctuations in a week. Thus-smoothed |$dN(t)/dt$| data are available in |$3 \leq t \leq 138$|1.

We have chosen countries with large numbers of cases (the USA, Russia, Brazil, and the UK as of May 20, 2020), the first three Asian countries where COVID-19 spread explosively (China, South Korea, and Japan), the first two countries of spread in Europe (Italy and Spain), a country with a unique policy (Sweden), and a country with a somewhat different |$dN(t)/dt$| profile (Indonesia).

3.2. Number of cases and its daily increase

In the left panel of Fig. 2, we show the daily number of new cases |$dN(t)/dt$| given in Ref. [29] with the smoothing mentioned above. The legend gives the abbreviation of the country name (internet country domain code, see Table 1). The |$dN(t)/dt$| data show there is one big peak in each country, and the shape of the peak is asymmetric; fast rise and slow decay. In many of the countries, there are several other peaks, which are smaller than the dominant one but visible at least in the log-scale plot. The fitting results using the Gompertz function are shown by dotted lines, and are found to explain the dominant peak region of data well.

Table 1.

Fitting results with the fitting ranges of |$t$|⁠. Parameters in rows with (*) are used for drawing figures.

Country|$\gamma\,[\%/\text{day}]$||$t_0\,[\text{day}]$||$N_0\,[10^3]$||$\Delta N$|fit range|$\chi^2$|/dof
Japan|$ 9.1\pm 0.2$||$104.0\pm 0.3$||$ 15.4\pm 0.3$||$ 1554\pm 22$|85–1387.2(*)
 |$ 8.7\pm 0.3$||$103.8\pm 0.4$||$ 15.6\pm 0.4$||$ 0.5\pm 1.3$|3–13811 
USA|$ 7.2\pm 0.1$||$100.3\pm 0.3$||$ 1202\pm 23$||$ -800\pm 240$|75–1386.7 
 |$ 5.2\pm 0.1$||$108.5\pm 0.4$||$ 1780\pm 28$||$ 0.1\pm 2.2$|3–138240(*)
Russia|$ 3.7\pm 0.1$||$140.8\pm 0.9$||$ 819\pm 28$||$ 0.1\pm 0.3$|3–13814(*)
Brazil|$ 1.3\pm 0.1$||$ 245\pm 15$||$ 15\,100\pm 5600$||$ -806\pm 45$|74–13824 
 |$ 1.9\pm 0.1$||$193.2\pm 4.3$||$ 3890\pm 510$||$ -0.7\pm 0.6$|3–13822(*)
China|$11.9\pm 0.2$||$ 36.4\pm 0.2$||$ 88.5\pm 1.5$||$ 52.6\pm 9.5$|3– 5014(*)
 |$14.1\pm 0.2$||$ 35.4\pm 0.2$||$ 78.9\pm 1.7$||$ 73\pm 19$|3–13838 
Italy|$ 6.0\pm 0.1$||$ 89.1\pm 0.2$||$ 234.6\pm 2.0$||$ 0.0\pm 0.4$|3–13816(*)
Indonesia|$ 2.7\pm 0.2$||$135.0\pm 3.0$||$ 42.4\pm 3.5$||$ -173\pm 12$|75–1384.6 
 |$ 3.4\pm 0.1$||$126.7\pm 1.2$||$ 32.8\pm 1.2$||$ -0.2\pm 0.2$|3–1383.2(*)
Spain|$ 8.1\pm 0.1$||$ 88.9\pm 0.2$||$ 231.5\pm 2.6$||$ 0.1\pm 0.4$|3–13828(*)
S. Korea|$17.5\pm 0.4$||$ 60.4\pm 0.1$||$ 8.5\pm 0.2$||$ 0.2\pm 0.3$|3– 753.4(*)
 |$14.1\pm 0.5$||$ 61.1\pm 0.3$||$ 9.1\pm 0.4$||$ 0.2\pm 0.7$|3–13814 
UK|$ 4.8\pm 0.1$||$110.6\pm 0.4$||$ 310.1\pm 4.6$||$ 20\pm 19$|60–13828 
 |$ 4.8\pm 0.1$||$110.6\pm 0.3$||$ 310.1\pm 3.5$||$ 0.1\pm 0.4$|3–13816(*)
Sweden|$ 3.5\pm 0.1$||$116.0\pm 0.8$||$ 46.3\pm 1.1$||$ -36.1\pm 5.7$|60–1384.3 
 |$ 3.7\pm 0.1$||$114.9\pm 0.6$||$ 44.7\pm 0.8$||$ -0.0\pm 0.2$|3–1382.9(*)
Country|$\gamma\,[\%/\text{day}]$||$t_0\,[\text{day}]$||$N_0\,[10^3]$||$\Delta N$|fit range|$\chi^2$|/dof
Japan|$ 9.1\pm 0.2$||$104.0\pm 0.3$||$ 15.4\pm 0.3$||$ 1554\pm 22$|85–1387.2(*)
 |$ 8.7\pm 0.3$||$103.8\pm 0.4$||$ 15.6\pm 0.4$||$ 0.5\pm 1.3$|3–13811 
USA|$ 7.2\pm 0.1$||$100.3\pm 0.3$||$ 1202\pm 23$||$ -800\pm 240$|75–1386.7 
 |$ 5.2\pm 0.1$||$108.5\pm 0.4$||$ 1780\pm 28$||$ 0.1\pm 2.2$|3–138240(*)
Russia|$ 3.7\pm 0.1$||$140.8\pm 0.9$||$ 819\pm 28$||$ 0.1\pm 0.3$|3–13814(*)
Brazil|$ 1.3\pm 0.1$||$ 245\pm 15$||$ 15\,100\pm 5600$||$ -806\pm 45$|74–13824 
 |$ 1.9\pm 0.1$||$193.2\pm 4.3$||$ 3890\pm 510$||$ -0.7\pm 0.6$|3–13822(*)
China|$11.9\pm 0.2$||$ 36.4\pm 0.2$||$ 88.5\pm 1.5$||$ 52.6\pm 9.5$|3– 5014(*)
 |$14.1\pm 0.2$||$ 35.4\pm 0.2$||$ 78.9\pm 1.7$||$ 73\pm 19$|3–13838 
Italy|$ 6.0\pm 0.1$||$ 89.1\pm 0.2$||$ 234.6\pm 2.0$||$ 0.0\pm 0.4$|3–13816(*)
Indonesia|$ 2.7\pm 0.2$||$135.0\pm 3.0$||$ 42.4\pm 3.5$||$ -173\pm 12$|75–1384.6 
 |$ 3.4\pm 0.1$||$126.7\pm 1.2$||$ 32.8\pm 1.2$||$ -0.2\pm 0.2$|3–1383.2(*)
Spain|$ 8.1\pm 0.1$||$ 88.9\pm 0.2$||$ 231.5\pm 2.6$||$ 0.1\pm 0.4$|3–13828(*)
S. Korea|$17.5\pm 0.4$||$ 60.4\pm 0.1$||$ 8.5\pm 0.2$||$ 0.2\pm 0.3$|3– 753.4(*)
 |$14.1\pm 0.5$||$ 61.1\pm 0.3$||$ 9.1\pm 0.4$||$ 0.2\pm 0.7$|3–13814 
UK|$ 4.8\pm 0.1$||$110.6\pm 0.4$||$ 310.1\pm 4.6$||$ 20\pm 19$|60–13828 
 |$ 4.8\pm 0.1$||$110.6\pm 0.3$||$ 310.1\pm 3.5$||$ 0.1\pm 0.4$|3–13816(*)
Sweden|$ 3.5\pm 0.1$||$116.0\pm 0.8$||$ 46.3\pm 1.1$||$ -36.1\pm 5.7$|60–1384.3 
 |$ 3.7\pm 0.1$||$114.9\pm 0.6$||$ 44.7\pm 0.8$||$ -0.0\pm 0.2$|3–1382.9(*)
Table 1.

Fitting results with the fitting ranges of |$t$|⁠. Parameters in rows with (*) are used for drawing figures.

Country|$\gamma\,[\%/\text{day}]$||$t_0\,[\text{day}]$||$N_0\,[10^3]$||$\Delta N$|fit range|$\chi^2$|/dof
Japan|$ 9.1\pm 0.2$||$104.0\pm 0.3$||$ 15.4\pm 0.3$||$ 1554\pm 22$|85–1387.2(*)
 |$ 8.7\pm 0.3$||$103.8\pm 0.4$||$ 15.6\pm 0.4$||$ 0.5\pm 1.3$|3–13811 
USA|$ 7.2\pm 0.1$||$100.3\pm 0.3$||$ 1202\pm 23$||$ -800\pm 240$|75–1386.7 
 |$ 5.2\pm 0.1$||$108.5\pm 0.4$||$ 1780\pm 28$||$ 0.1\pm 2.2$|3–138240(*)
Russia|$ 3.7\pm 0.1$||$140.8\pm 0.9$||$ 819\pm 28$||$ 0.1\pm 0.3$|3–13814(*)
Brazil|$ 1.3\pm 0.1$||$ 245\pm 15$||$ 15\,100\pm 5600$||$ -806\pm 45$|74–13824 
 |$ 1.9\pm 0.1$||$193.2\pm 4.3$||$ 3890\pm 510$||$ -0.7\pm 0.6$|3–13822(*)
China|$11.9\pm 0.2$||$ 36.4\pm 0.2$||$ 88.5\pm 1.5$||$ 52.6\pm 9.5$|3– 5014(*)
 |$14.1\pm 0.2$||$ 35.4\pm 0.2$||$ 78.9\pm 1.7$||$ 73\pm 19$|3–13838 
Italy|$ 6.0\pm 0.1$||$ 89.1\pm 0.2$||$ 234.6\pm 2.0$||$ 0.0\pm 0.4$|3–13816(*)
Indonesia|$ 2.7\pm 0.2$||$135.0\pm 3.0$||$ 42.4\pm 3.5$||$ -173\pm 12$|75–1384.6 
 |$ 3.4\pm 0.1$||$126.7\pm 1.2$||$ 32.8\pm 1.2$||$ -0.2\pm 0.2$|3–1383.2(*)
Spain|$ 8.1\pm 0.1$||$ 88.9\pm 0.2$||$ 231.5\pm 2.6$||$ 0.1\pm 0.4$|3–13828(*)
S. Korea|$17.5\pm 0.4$||$ 60.4\pm 0.1$||$ 8.5\pm 0.2$||$ 0.2\pm 0.3$|3– 753.4(*)
 |$14.1\pm 0.5$||$ 61.1\pm 0.3$||$ 9.1\pm 0.4$||$ 0.2\pm 0.7$|3–13814 
UK|$ 4.8\pm 0.1$||$110.6\pm 0.4$||$ 310.1\pm 4.6$||$ 20\pm 19$|60–13828 
 |$ 4.8\pm 0.1$||$110.6\pm 0.3$||$ 310.1\pm 3.5$||$ 0.1\pm 0.4$|3–13816(*)
Sweden|$ 3.5\pm 0.1$||$116.0\pm 0.8$||$ 46.3\pm 1.1$||$ -36.1\pm 5.7$|60–1384.3 
 |$ 3.7\pm 0.1$||$114.9\pm 0.6$||$ 44.7\pm 0.8$||$ -0.0\pm 0.2$|3–1382.9(*)
Country|$\gamma\,[\%/\text{day}]$||$t_0\,[\text{day}]$||$N_0\,[10^3]$||$\Delta N$|fit range|$\chi^2$|/dof
Japan|$ 9.1\pm 0.2$||$104.0\pm 0.3$||$ 15.4\pm 0.3$||$ 1554\pm 22$|85–1387.2(*)
 |$ 8.7\pm 0.3$||$103.8\pm 0.4$||$ 15.6\pm 0.4$||$ 0.5\pm 1.3$|3–13811 
USA|$ 7.2\pm 0.1$||$100.3\pm 0.3$||$ 1202\pm 23$||$ -800\pm 240$|75–1386.7 
 |$ 5.2\pm 0.1$||$108.5\pm 0.4$||$ 1780\pm 28$||$ 0.1\pm 2.2$|3–138240(*)
Russia|$ 3.7\pm 0.1$||$140.8\pm 0.9$||$ 819\pm 28$||$ 0.1\pm 0.3$|3–13814(*)
Brazil|$ 1.3\pm 0.1$||$ 245\pm 15$||$ 15\,100\pm 5600$||$ -806\pm 45$|74–13824 
 |$ 1.9\pm 0.1$||$193.2\pm 4.3$||$ 3890\pm 510$||$ -0.7\pm 0.6$|3–13822(*)
China|$11.9\pm 0.2$||$ 36.4\pm 0.2$||$ 88.5\pm 1.5$||$ 52.6\pm 9.5$|3– 5014(*)
 |$14.1\pm 0.2$||$ 35.4\pm 0.2$||$ 78.9\pm 1.7$||$ 73\pm 19$|3–13838 
Italy|$ 6.0\pm 0.1$||$ 89.1\pm 0.2$||$ 234.6\pm 2.0$||$ 0.0\pm 0.4$|3–13816(*)
Indonesia|$ 2.7\pm 0.2$||$135.0\pm 3.0$||$ 42.4\pm 3.5$||$ -173\pm 12$|75–1384.6 
 |$ 3.4\pm 0.1$||$126.7\pm 1.2$||$ 32.8\pm 1.2$||$ -0.2\pm 0.2$|3–1383.2(*)
Spain|$ 8.1\pm 0.1$||$ 88.9\pm 0.2$||$ 231.5\pm 2.6$||$ 0.1\pm 0.4$|3–13828(*)
S. Korea|$17.5\pm 0.4$||$ 60.4\pm 0.1$||$ 8.5\pm 0.2$||$ 0.2\pm 0.3$|3– 753.4(*)
 |$14.1\pm 0.5$||$ 61.1\pm 0.3$||$ 9.1\pm 0.4$||$ 0.2\pm 0.7$|3–13814 
UK|$ 4.8\pm 0.1$||$110.6\pm 0.4$||$ 310.1\pm 4.6$||$ 20\pm 19$|60–13828 
 |$ 4.8\pm 0.1$||$110.6\pm 0.3$||$ 310.1\pm 3.5$||$ 0.1\pm 0.4$|3–13816(*)
Sweden|$ 3.5\pm 0.1$||$116.0\pm 0.8$||$ 46.3\pm 1.1$||$ -36.1\pm 5.7$|60–1384.3 
 |$ 3.7\pm 0.1$||$114.9\pm 0.6$||$ 44.7\pm 0.8$||$ -0.0\pm 0.2$|3–1382.9(*)
Daily number of new cases $dN(t)/dt$ (left, symbols) and the scaling behavior (right, symbols). In the left panel, dotted lines show the fitting results in the derivative of the Gompertz function, $dN(t)/dt\simeq N_0\gamma f'_\mathrm{G}(\gamma(t-t_0))$, and open circles show the peak points in the fitting function $(t_0,N_0\gamma f'_\mathrm{G}(0))$. In the right panel, the solid black curve shows the derivative of the Gompertz function $f'_\mathrm{G}(x)$, and the gray band shows its region with 5% uncertainty in $\gamma$ and 20% in $N_0$. The green solid (dot-dashed) curve shows the derivative of the sigmoid function normalized to reproduce the peak height, $4f'_\mathrm{S}(x)/e$ ($4f'_\mathrm{S}(2x)/e$).
Fig. 2

Daily number of new cases |$dN(t)/dt$| (left, symbols) and the scaling behavior (right, symbols). In the left panel, dotted lines show the fitting results in the derivative of the Gompertz function, |$dN(t)/dt\simeq N_0\gamma f'_\mathrm{G}(\gamma(t-t_0))$|⁠, and open circles show the peak points in the fitting function |$(t_0,N_0\gamma f'_\mathrm{G}(0))$|⁠. In the right panel, the solid black curve shows the derivative of the Gompertz function |$f'_\mathrm{G}(x)$|⁠, and the gray band shows its region with 5% uncertainty in |$\gamma$| and 20% in |$N_0$|⁠. The green solid (dot-dashed) curve shows the derivative of the sigmoid function normalized to reproduce the peak height, |$4f'_\mathrm{S}(x)/e$| (⁠|$4f'_\mathrm{S}(2x)/e$|⁠).

The fitting to |$dN(t)/dt$| data is carried out by using the derivative of the Gompertz function:
(14)

In order to concentrate on the dominant peak, we first limit the time region of the fit to |$t_\mathrm{min}~\leq~t~\leq~t_\mathrm{max}$|⁠, which covers it. Next, the fitting time region is extended to the whole range, |$0 \leq t \leq 140$|⁠. In the fitting procedure, we have assumed a Poisson distribution for the daily number of new cases, then the uncertainty in |$dN(t)/dt$| is assumed to be |$\sqrt{dN(t)/dt+\varepsilon}$|⁠, where |$\varepsilon=0.1$| is introduced to avoid zero uncertainty in the case of zero daily number. We summarize the obtained parameters |$(N_0,\gamma,t_0)$| in Table 1; parameters in rows with (*) are used for drawing the figures. When the |$\chi^2$| value is smaller in the whole range analysis and the parameters obtained in two cases are similar, the single outbreak assumption is supported and we show only the results in the whole range analysis. In other cases, we in principle adopt the results giving the smaller reduced |$\chi^2$|⁠. The exception is the USA, where the reduced |$\chi^2$| is larger but we adopt the whole range analysis results. This is closely related to multiple outbreaks, and will be discussed in Appendix A. It should also be noted that, unfortunately, the reduced |$\chi^2$| values are large in the single-outbreak model with the present error estimate of |$dN(t)/dt$|⁠. There are non-negligible contributions of other outbreaks as discussed in Appendix A. In addition, while we use seven-day average data, we cannot completely remove the daily oscillations of |$dN(t)/dt$| in a week coming from the test schedule. The Gompertz function does not take care of such oscillators; thus |$\chi^2$|/dof remains large.

In the right panel of Fig. 2, we show the normalized daily numbers, |$(dN(t)/dt)/(N_0\gamma)$|⁠, as functions of the scaling variable, |$x=\gamma(t-t_0)$|⁠. Most of the data points are around the derivative of the Gompertz function and inside the gray band, which shows the region with |$5\%$| uncertainty in |$\gamma$| and |$20\%$| uncertainty in |$N_0$|⁠.

We also show the sigmoid function by the green curves in the right panel of Fig. 2. It is clear that a single sigmoid function cannot describe the behavior of the |$dN(t)/dt$| data. If we try to fit |$dN(t)/dt$| data by the sigmoid function, we need to adopt larger |$\gamma$| in the negative |$x$| region as shown by the green dot-dashed curve in the right panel of Fig. 2, |$f'_\mathrm{S}(2x)$|⁠, while |$f'_\mathrm{S}(x)$| approximately agrees with |$f'_\mathrm{G}(x)$| in shape in the positive |$x$| region.

The left panel of Fig. 3 shows the number of cases |$N(t)$|⁠, obtained as an integral of the |$dN(t)/dt$| data after removing the spike. We also show the Gompertz function results with the parameters |$(N_0,\gamma,t_0)$| determined from the |$dN(t)/dt$| data, and an integration constant |$\Delta N$|⁠,
(15)
where |$\Delta N$| is obtained by fitting to the |$N(t)$| data. In most of the countries, the Gompertz function with |$\Delta N$| explains the data in the large-|$N(t)$| region and deviations are found only in the region with small |$N(t)$|⁠. In Japan, the fitted time range is limited to |$t \geq 85$| and earlier time data are not fitted. Thus deviations at |$t<80$| are visible in the log scale, while the value is less than |$10\%$| of the total number of cases at |$t=140$|⁠.
The reported number of cases $N(t)$ (left) and the scaling behavior (right). In the left panel, dotted lines show the Gompertz function fit, $N_0 f_\mathrm{G}(\gamma(t-t_0))$, open circles show the reflection points, $(t_0, N_0/e)$, and open squares show the offset points, $(t_\mathrm{offset},N_\mathrm{offset})$. In the right panel, the black solid curve shows the Gompertz function $f_\mathrm{G}(x)$ and the gray band shows the region $f_\mathrm{G}(x)\pm 0.05$. The green solid (dot-dashed) curve shows the sigmoid function $f_\mathrm{S}(x)$ ($f_\mathrm{S}(2x)$).
Fig. 3

The reported number of cases |$N(t)$| (left) and the scaling behavior (right). In the left panel, dotted lines show the Gompertz function fit, |$N_0 f_\mathrm{G}(\gamma(t-t_0))$|⁠, open circles show the reflection points, |$(t_0, N_0/e)$|⁠, and open squares show the offset points, |$(t_\mathrm{offset},N_\mathrm{offset})$|⁠. In the right panel, the black solid curve shows the Gompertz function |$f_\mathrm{G}(x)$| and the gray band shows the region |$f_\mathrm{G}(x)\pm 0.05$|⁠. The green solid (dot-dashed) curve shows the sigmoid function |$f_\mathrm{S}(x)$| (⁠|$f_\mathrm{S}(2x)$|⁠).

We show the normalized |$N(t)$| as a function of the scaling variables in the right panel of Fig. 3. We subtract |$\Delta N$| from |$N(t)$|⁠. It is interesting to find that most of the world data are on the Gompertz function |$f_\mathrm{G}(x)$|⁠. In Japan, China, and South Korea, the fitted time range is limited and deviation from the Gompertz function results are found at earlier times (Japan) and at later times (China and South Korea). It should be noted that the agreement at |$x<0$| is largely a result of the large denominator compared with the number of cases in the early stage. Compared with the number of cases after explosive spread, the number of cases in the early stage is much smaller and the ratio is seen to be very small. Nevertheless, it is impressive to find the agreement of the Gompertz function and the observed number of cases after scaling.

We also show the sigmoid functions, |$f_\mathrm{S}(x)$| and |$f_\mathrm{S}(2x)$|⁠, by green solid and dot-dashed curves in the right panel of Fig. 3. As in the |$dN(t)/dt$| case, the sigmoid function |$f_\mathrm{S}(x)$| agrees with the scaled data at |$x\geq1$|⁠, but we need to use |$f_\mathrm{S}(2x)$|⁠, the sigmoid function with a larger infection probability, to explain the scaled data in the region of |$x\leq 0$|⁠.

3.3. |$K$| value

We now proceed to discuss the |$K$| value. We choose the offset day when explosive spread started, as shown by open squares in Fig. 3. The offset day and the offset number of cases, |$(t_\mathrm{offset}, N_\mathrm{offset})$|⁠, are summarized in Table 2. With these offset parameters, the |$K$| value is obtained as
(16)
Table 2.

Offset parameters |$(t_\mathrm{offset}, N_\mathrm{offset})$| used to evaluate the |$K$| value.

Country|$t_\mathrm{offset}$| [day]|$N_\mathrm{offset}$|
Japan851193
USA6066
Russia7110
Brazil7498
China1880
Italy5317
Indonesia7596
Spain62136
S. Korea5046
UK6018
Sweden6012
Country|$t_\mathrm{offset}$| [day]|$N_\mathrm{offset}$|
Japan851193
USA6066
Russia7110
Brazil7498
China1880
Italy5317
Indonesia7596
Spain62136
S. Korea5046
UK6018
Sweden6012
Table 2.

Offset parameters |$(t_\mathrm{offset}, N_\mathrm{offset})$| used to evaluate the |$K$| value.

Country|$t_\mathrm{offset}$| [day]|$N_\mathrm{offset}$|
Japan851193
USA6066
Russia7110
Brazil7498
China1880
Italy5317
Indonesia7596
Spain62136
S. Korea5046
UK6018
Sweden6012
Country|$t_\mathrm{offset}$| [day]|$N_\mathrm{offset}$|
Japan851193
USA6066
Russia7110
Brazil7498
China1880
Italy5317
Indonesia7596
Spain62136
S. Korea5046
UK6018
Sweden6012

Since |$N_\mathrm{offset}$| is generally much smaller than the number of cases after explosive spread, the |$K$| value is not sensitive to the choice of the offset parameters as long as we discuss the long-time behavior.

In the left panel of Fig. 4, we show the |$K$| factor as a function of time. Data for |$K(t)$| are explained by the prediction from the scaling function, |$1-f_\mathrm{G}(x_K)$|⁠, while the fluctuations around the predictions are large compared with the number of cases, |$N(t)$|⁠. In the right panel of Fig. 4, we show |$K$| values as functions of the scaling variable |$x_K=\gamma(t-t_0-\Delta t)$|⁠. Except for several countries, the scaling behavior in |$K$| is observed.

$K$ value as a function of $t$ (left) and as a function of the scaling variable $x_K$ (right). In the left panel, dotted lines show the $K$ value from the Gompertz function fit. In the right panel, the black solid curve shows $1-f_\mathrm{G}(x_K)$ and the gray band shows the region $1-f_\mathrm{G}(x_K)\pm 0.05$.
Fig. 4

|$K$| value as a function of |$t$| (left) and as a function of the scaling variable |$x_K$| (right). In the left panel, dotted lines show the |$K$| value from the Gompertz function fit. In the right panel, the black solid curve shows |$1-f_\mathrm{G}(x_K)$| and the gray band shows the region |$1-f_\mathrm{G}(x_K)\pm 0.05$|⁠.

In Ref. [27], |$K(t)$| is found to show linear dependence on time in the large-|$K(t)$| region, |$0.25 < K(t) < 0.9$|⁠. Actually, the Gompertz function |$f_\mathrm{G}(x)$| shows linear dependence on time in the small-|$|x_K|$| region. When the scaling variable is small, the first-order Taylor expansion would work:
(17)
The precision of this first-order approximation is around |$1\%$| (⁠|$10\%$|⁠) for |$|x_K|\leq 0.5$| (⁠|$|x_K|\leq 1$|⁠), where |$1-f_\mathrm{G}(x_K)$| amounts to |$0.45$||$0.81$| (⁠|$0.28$||$0.93$|⁠). When |$K$| is smaller (⁠|$K<0.25$|⁠), |$e^{-x_K}$| should also be small and the Taylor expansion with respect to |$e^{-x_K}$| may work. Then |$K$| decreases exponentially:
(18)
With the sigmoid function, while scaling behavior is observed in |$N(t)$| and |$dN(t)/dt$|⁠, |$K$| does not scale as a function of a single scaling variable. In Fig. 5, we compare the functions in the |$K$| value derived from the Gompertz and sigmoid functions for |$N(t)$| as functions of |$x=\gamma(t-t_0)$|⁠. With the Gompertz function, the shift of the scaling variable is enough for the |$K$| value to be described by the basic Gompertz function. In contrast, the |$K$| value from the sigmoid function reads
(19)
Comparison of $K$ values from the Gompertz and sigmoid functions in terms of $x = \gamma (t - t_0)$.
Fig. 5

Comparison of |$K$| values from the Gompertz and sigmoid functions in terms of |$x = \gamma (t - t_0)$|⁠.

In addition to the shift in the scaling variable, the amplitude also depends on |$\gamma w=7\gamma$|⁠.

3.4. Days of expected relaxing COVID-19 restrictions from the |$K$| value

When the |$K(t)$| value goes down to around 0.05 [27], it would be possible to relax COVID-19 restrictions such as lifting the lockdown of a city or relaxing the state of emergency. Let us call these days |$t_\mathrm{relax}$|⁠. The days of relaxing the restrictions in several countries roughly correspond to the time at |$K(t)=0.05$|⁠. With this condition, the number of cases in the current outbreak will increase by around |$5\%$| and the infection in the current outbreak will converge. In terms of the scaling variable, this corresponds to the solution of |$1-f_\mathrm{G}(x_K)=0.05$|⁠, denoted by |$x_R$|⁠. It is found to be |$x_K=x_R\simeq 2.97$|⁠. Thus the expected day of relaxing the restrictions can be evaluated to be around
(20)
provided that there are no further outbreaks. In Table 3, we summarize the expected day of relaxing COVID-19 restrictions |$t_\mathrm{relax}$| from the Gompertz function analyses in comparison with the relaxed day |$t_\mathrm{relaxed}$| in several countries.
Table 3.

Expected day of relaxing COVID-19 restrictions |$t_\mathrm{relax}$| and the relaxed day |$t_\mathrm{relaxed}$|⁠.

Country|$t_\mathrm{relax}$| [day]|$t_\mathrm{relaxed}$| [day]
Japan135.4|$^{+1.5}_{-0.7}$|146
USA149.9|$^{+2.0}_{-0.7}$|141
Russia188.3|$^{+3.7}_{-1.2}$|133
Brazil246.9|$^{+15.2}_{-4.2}$|
China63.6|$^{+0.8}_{-0.4}$|131
Italy127.8|$^{+0.9}_{-0.4}$|125
Indonesia175.5|$^{+5.2}_{-1.5}$|
Spain122.2|$^{+0.7}_{-0.3}$|123
S. Korea82.4|$^{+0.7}_{-0.4}$|127
UK153.3|$^{+1.5}_{-0.5}$|132
Sweden162.3|$^{+2.7}_{-0.8}$|
Country|$t_\mathrm{relax}$| [day]|$t_\mathrm{relaxed}$| [day]
Japan135.4|$^{+1.5}_{-0.7}$|146
USA149.9|$^{+2.0}_{-0.7}$|141
Russia188.3|$^{+3.7}_{-1.2}$|133
Brazil246.9|$^{+15.2}_{-4.2}$|
China63.6|$^{+0.8}_{-0.4}$|131
Italy127.8|$^{+0.9}_{-0.4}$|125
Indonesia175.5|$^{+5.2}_{-1.5}$|
Spain122.2|$^{+0.7}_{-0.3}$|123
S. Korea82.4|$^{+0.7}_{-0.4}$|127
UK153.3|$^{+1.5}_{-0.5}$|132
Sweden162.3|$^{+2.7}_{-0.8}$|
Table 3.

Expected day of relaxing COVID-19 restrictions |$t_\mathrm{relax}$| and the relaxed day |$t_\mathrm{relaxed}$|⁠.

Country|$t_\mathrm{relax}$| [day]|$t_\mathrm{relaxed}$| [day]
Japan135.4|$^{+1.5}_{-0.7}$|146
USA149.9|$^{+2.0}_{-0.7}$|141
Russia188.3|$^{+3.7}_{-1.2}$|133
Brazil246.9|$^{+15.2}_{-4.2}$|
China63.6|$^{+0.8}_{-0.4}$|131
Italy127.8|$^{+0.9}_{-0.4}$|125
Indonesia175.5|$^{+5.2}_{-1.5}$|
Spain122.2|$^{+0.7}_{-0.3}$|123
S. Korea82.4|$^{+0.7}_{-0.4}$|127
UK153.3|$^{+1.5}_{-0.5}$|132
Sweden162.3|$^{+2.7}_{-0.8}$|
Country|$t_\mathrm{relax}$| [day]|$t_\mathrm{relaxed}$| [day]
Japan135.4|$^{+1.5}_{-0.7}$|146
USA149.9|$^{+2.0}_{-0.7}$|141
Russia188.3|$^{+3.7}_{-1.2}$|133
Brazil246.9|$^{+15.2}_{-4.2}$|
China63.6|$^{+0.8}_{-0.4}$|131
Italy127.8|$^{+0.9}_{-0.4}$|125
Indonesia175.5|$^{+5.2}_{-1.5}$|
Spain122.2|$^{+0.7}_{-0.3}$|123
S. Korea82.4|$^{+0.7}_{-0.4}$|127
UK153.3|$^{+1.5}_{-0.5}$|132
Sweden162.3|$^{+2.7}_{-0.8}$|

In Fig. 6, we show the expected days of relaxing the restrictions in comparison with some of the relaxed days. The lockdown in Wuhan city was lifted on May 10, 2020 (⁠|$t=131$|⁠) in China; the lockdown was relaxed on May 2, 2020 (⁠|$t=123$|⁠) in Spain; May 4, 2020 (⁠|$t=125$|⁠) in Italy; and May 11, 2020 (⁠|$t=132$|⁠) in the UK. Restrictions were relaxed in part on May 6, 2020 (⁠|$t=127$|⁠) in South Korea; May 12, 2020 (⁠|$t=133$|⁠) in Russia; and May 20, 2020 (⁠|$t=141$|⁠) in the USA. In Japan, a state of emergency was declared on April 7, 2020 (⁠|$t=98$|⁠) and canceled on May 25 (⁠|$t=146$|⁠). The relaxed days in Italy, Spain, Japan, the UK, and the USA are close to those expected from the Nakano–Ikeda model analyses. In China and Korea, the relaxed days were significantly later than the expectations from the model. One can guess that the governments tried to be on the safe side in these first two countries of COVID-19 spread.

Expected days of relaxing COVID-19 restrictions and the relaxed days.
Fig. 6

Expected days of relaxing COVID-19 restrictions and the relaxed days.

The expected day of relaxing COVID-19 restrictions strongly depends on the value of the damping rate of the infection probability, |$\gamma$|⁠, which people and governments should try to enhance. In South Korea and China, the damping rate is larger than |$0.1$|⁠. Then the infection probability decreases by a factor of |$1/e$| within 10 days. In these countries, the test-containment processes have been performed strongly. In many of the countries under consideration (Japan, USA, UK, Italy, and Spain), the damping rates take values between 4 and 10%/day. In these countries, many of the restaurants and shops are closed and people are requested to stay home for one month or more. Sweden may be an interesting example. The Swedish government does not require restaurants and shops to be closed and does not ask people to stay home. The government asks people to be responsible for their behavior and social distancing is encouraged. The damping rate in Sweden, |$\gamma=3.7$|%/day, may be regarded as a value representing the intrinsic nature of COVID-19.

It would be valuable to comment on the use of the effective reproduction number, |$R_\mathrm{e}$|⁠, which is defined as the ratio of the number of new cases in one week to that in the previous week and has the merit that it is not necessary to define the offset. For a constant infection probability per infected people, |$k(t)=k_0=\mathrm{const.}$| in Eq. (1), |$R_\mathrm{e}$| is found to be |$R_\mathrm{e}=\exp(k_0w)$| and we can guess |$k_0$| from |$R_\mathrm{e}$|⁠. In contrast, when the number of cases is given by the Gompertz function, |$N(t)=N_0 f_\mathrm{G}(x)$|⁠, the effective reproduction number |$R_\mathrm{e}(t)$| is a function of two variables, |$x_K$| and |$\gamma w$|⁠,
(21)
while |$K(t)$| is a function of a single scaling variable |$x_K$|⁠. Thus the universality observed in |$K(t)$| is lost in |$R_\mathrm{e}(t)$|⁠, and it would be less easy to give a prediction of |$t_\mathrm{relax}$| from |$R_\mathrm{e}(t)$| than from |$K(t)$|⁠. In Fig. 7, we show |$R_\mathrm{e}(t)$| from the Gompertz function at |$\gamma=4$|⁠, 8, and |$16\%$|/day. The value of |$R_\mathrm{e}(t)$| at |$x_K=x_R$| depends on |$\gamma$|⁠; the value of |$x_K$| at |$R_\mathrm{e}(t)=1$| is different from |$x_R$|⁠, |$x_K=0.49$| (⁠|$1.42$|⁠) for |$\gamma=16\%\,(4\%)$|/day; and apparent large values of |$R_\mathrm{e}(t)$| appear in the early stage, |$x_K<0$| or |$t<t_0+\Delta t(\gamma)$|⁠.
The effective reproduction number $R_\mathrm{e}$ as a function of the scaling variable $x_K$.
Fig. 7

The effective reproduction number |$R_\mathrm{e}$| as a function of the scaling variable |$x_K$|⁠.

4. Mechanism of appearance of the Gompertz function

Fast and slow rises in the early and late stages are found in many physical processes such as particle production in nuclear collisions. In Fig. 8, we show the number of |$\Delta$| particles (⁠|$\Delta^{++},\ \Delta^+,\ \Delta^0$|⁠, and |$\Delta^-$|⁠) and |$\pi$| particles (⁠|$\pi^+,\ \pi^0$|⁠, and |$\pi^-$|⁠) produced in central Au+Au collisions at an incident energy of |$1~\mathrm{GeV/nucleon}$| [30]. Histograms show the calculated results by using the hadronic transport model JAM (Jet AA Microscopic transport model) [31]. The main production mechanism of |$\pi$| particles at this incident energy is the |$\Delta$| production and its decay. In the early stage (⁠|$t < 15~\mathrm{fm}/c$|⁠), nucleons are excited to resonances such as |$\Delta$| particles in nucleon–nucleon collisions, |$NN \to N\Delta$|⁠. The |$\Delta$| particles produced collide with other nucleons and |$\Delta$| particles; some of them produce additional |$\Delta$| particles, |$\Delta N\to \Delta\Delta$|⁠, and some of them are de-excited to nucleons in |$\Delta$| absorption processes such as |$\Delta N\to NN$|⁠. The |$\Delta$| particles decay and produce |$\pi$| particles, |$\Delta \to N\pi$|⁠. The |$\pi$| particles produced may collide with other nucleons, |$\Delta$| particles, and |$\pi$| particles, and occasionally produce additional |$\Delta$|⁠, |$\pi N\to\pi\Delta$|⁠. In the later stage, the system expands, particle density decreases, and interaction rate goes down. When the density becomes low enough, all |$\Delta$| particles decay to |$N\pi$| with the lifetime |$\tau_\Delta=\hbar/\Gamma_\Delta\simeq 2~\mathrm{fm}/c$| and |$\pi$| particles go out from the reaction region and are detected.

The number of $\pi$ and $\Delta$ particles as a function of time $t$ in central Au+Au collisions at an incident energy of $1~\mathrm{GeV}$ per nucleon. Histograms show the $\pi$ (red) and $\Delta$ (blue) numbers taken from Ref. [30], and solid curves show the fitting results using the Gompertz function. The green dashed curve shows the sum of $\pi$ and $\Delta$ numbers.
Fig. 8

The number of |$\pi$| and |$\Delta$| particles as a function of time |$t$| in central Au+Au collisions at an incident energy of |$1~\mathrm{GeV}$| per nucleon. Histograms show the |$\pi$| (red) and |$\Delta$| (blue) numbers taken from Ref. [30], and solid curves show the fitting results using the Gompertz function. The green dashed curve shows the sum of |$\pi$| and |$\Delta$| numbers.

The number of |$\pi$| and |$\Delta$| particles in the above nucleus–nucleus collision is found to be well fitted by the Gompertz function |$f_\mathrm{G}(x)$| and its derivative |$f'_\mathrm{G}(x)$|⁠. Curves in Fig. 8 show the results of fitting by the Gompertz function and its derivative,
(22)
where the parameters are obtained as |$(N_\pi,\gamma_\pi, t_\pi) =(62.0, 0.113~(\mathrm{fm}/c)^{-1}, 14.6~\mathrm{fm}/c)$| and |$(N_\Delta,\gamma_\Delta, t_\Delta) =(760, 0.160~(\mathrm{fm}/c)^{-1}, 14.3~\mathrm{fm}/c)$|⁠. Compared with the lifetime of |$\Delta$|⁠, the number of |$\Delta$| during nucleus–nucleus collisions has a longer tail. This may be because of the relativistic effects, resonance mass dependence of the width, and sequential decay and production of |$\Delta$| such as |$\Delta \to \pi N$| followed by |$\pi N \to \Delta$|⁠.
Let us discuss the time dependence of pion production in a simplified treatment, where the microscopic processes are limited to the |$\Delta$| production from |$NN$| collisions and |$\Delta$| decay, and Markov processes are assumed:
(23)
where |$C(t)$| denotes the production rate of |$\Delta$| from |$NN$| collisions and |$\Gamma_\mathrm{eff}$| represents the effective decay rate of |$\Delta$|⁠. The above |$\Delta$| decay represents the net decay rate, the difference of the rate of |$\Delta\to N\pi$| and |$N\pi\to\Delta$| in nuclear collisions. By further assuming that |$\Gamma_\mathrm{eff}$| is constant in time, the solution of Eq. (23) is obtained as |$N^\Delta(t)=\int^t_{-\infty} dt' e^{-\Gamma_\mathrm{eff}(t-t')} C(t')$|⁠. Then after the time where |$C(t)$| vanishes, |$N^\Delta$| decays exponentially and its integral, |$N^\pi$|⁠, approaches a constant. These features agree with those of the Gompertz function in the late stage. Earlier time behavior depends on the function form of |$C(t)$|⁠. In terms of the scaling variable, |$x=\Gamma_\mathrm{eff}(t-t_0)$|⁠, Eq. (23) reads |$N^{\Delta\pi}(x)\equiv N^\Delta+N^\pi=dN^\pi/dx+N^\pi=\int^{t_0+x/\Gamma_\mathrm{eff}}_{-\infty} dt\, C(t)$|⁠. The function |$N^{\Delta\pi}$| is a rapidly increasing function of time in the early stage, and the increasing rate becomes smaller in the later stage because of the energy loss and expansion of nuclear matter. If we fit the curve to the |$N^{\Delta\pi}$| data in Ref. [30] by using |$f_\mathrm{S}(x),\ f_\mathrm{S}(x)+f'_\mathrm{S}(x),\ f_\mathrm{G}(x)$|⁠, and |$f_\mathrm{G}(x)+f'_\mathrm{G}(x)$| in the |$\Delta$| production region (⁠|$t\leq 15~\mathrm{fm}/c$|⁠), the variance of residuals (reduced |$\chi^2$| with an uncertainty of unity being assumed in the data) is lowest with |$f_\mathrm{G}(x)+f'_\mathrm{G}(x)$|⁠, as shown by the green dashed curve in Fig. 8. This supports that |$N^\pi$| is described by a Gompertz function, while the mechanism of the appearance of this function form is not understood. It should be noted that the number of |$\Delta$| is proportional to the increasing rate of |$\pi$| number in the present simplified treatment, but the fitting to the transport model data results in |$\gamma_\Delta \not= \gamma_\pi$|⁠, implying that this is not the case.

It should be noted that one |$\Delta$| particle mostly decays into |$N\pi$| and the additionally produced number of |$\pi$| is less than unity in the present pion production in nucleus–nucleus collisions. A rough estimate of the upper bound of the basic reproduction number |$R_0$| from |$\Delta$| to |$\pi$| may be obtained as follows. The lower bound of the number of produced |$\Delta$| particles is the peak number of |$\Delta$|⁠, which is |$N^\Delta=45.3$| at |$t=14~\mathrm{fm}/c$| in the calculated data and is expected to be |$N^\Delta(t_\Delta)=N_\Delta\gamma_\Delta/e\simeq 44.7$| at |$t=t_\Delta$| from the Gompertz function. Then the upper bound of the additionally produced |$\pi$| number is given as |$N^\pi(t=\infty)-N^\Delta(t_\Delta)\simeq 17$|⁠. Consequently, the upper bound of |$R_0$| is given as |$R_0=[N^\pi(t=\infty)-N^\Delta(t_\Delta)]/N^\Delta(t_\Delta)\simeq 0.38$|⁠, which is less than unity. As a result, the number of pions in the final state is already determined in the early stage. The green dashed curve in Fig. 8 shows the “|$\pi$|-like” particle number, |$N^{\pi\Delta}=N^\pi+N^\Delta$|⁠, which shows a peak at |$t\simeq 16~\mathrm{fm}/c$|⁠, gradually decreases by the |$\Delta$| absorption processes, and converges to |$N^\pi(t=\infty)$|⁠.

Based on the success in describing |$N^\pi(t)$| by using the Gompertz function as in the case of |$N(t)$| associated with COVID-19, we may make a conjecture about the correspondence between COVID-19 spread and |$\pi$| production in nuclear collisions. Let us assume that |$\pi$| particles correspond to cases. Then |$\Delta$| particles are regarded as coronavirus carriers, defined as infected people who have not tested positive yet. Carriers will test positive later or will recover without testing positive. Susceptible people are infected and become carriers in the initial dense stage (⁠|$\Delta$| production), occasionally infect other susceptible people (additional |$\Delta$| production) or recover (⁠|$\Delta$| absorption), develop symptoms and test positive (⁠|$\Delta \to N\pi$|⁠). Thus the number of carriers (⁠|$N^\Delta$|⁠) increases rapidly by explosive spread of infection in the early stage (⁠|$\Delta$| production stage), while it decreases more slowly by developing symptoms in the late stage (⁠|$\Delta$| decay stage). This causes the asymmetric time profile in the number of carriers (⁠|$N^\Delta$|⁠), which is roughly proportional to the daily number of new cases (⁠|$dN^\pi/dt$|⁠). By comparison, the number of infected people including carriers (⁠|$N^\pi+N^\Delta$|⁠) grows rapidly in the early dense stage but does not change much in the later stage. Hence, except for the early dense stage, the basic reproduction number would be less than unity.

5. Summary

We have analyzed the number of cases (the number of infected people who tested positive for COVID-19), as a function of time, |$N(t)$|⁠, by using the double exponential function referred to as the Gompertz function, |$f_\mathrm{G}(x)=\exp(-e^{-x})$|⁠. The Gompertz function appears when the infection probability is an exponentially decreasing function of time. One of the characteristic features of the Gompertz function is the asymmetry of its derivative, |$f'_\mathrm{G}(x)=df_\mathrm{G}(x)/dx$|⁠: fast rise and slow decay.

This feature is found in the daily new cases, |$dN(t)/dt$|⁠. We have assumed that the number of cases from one outbreak is given as |$N(t) \simeq N_0\exp[-e^{-\gamma(t-t_0)}]$|⁠, where |$N_0$|⁠, |$\gamma$|⁠, and |$t_0$| are the final number of cases, damping rate of the infection probability, and the time at which the daily number of new cases peaks. These parameters are obtained by |$\chi^2$| fitting to the |$dN(t)/dt$| data. Then we have found that |$N(t)$| and |$dN(t)/dt$| show universal scaling, |$N(t)/N_0=f_\mathrm{G}(x)$| and |$(dN(t)/dt)/(N_0\gamma)=f'_\mathrm{G}(x)$|⁠, where |$x=\gamma(t-t_0)$| is the scaling variable. The |$K$| value, the increasing rate of cases in one week, is also found to show scaling behavior, |$K(t)=1-f_\mathrm{G}(x_K)$|⁠, where |$x_K=\gamma(t-t_0-\Delta t(\gamma))$| is the scaling variable for |$K(t)$| with |$\Delta t(\gamma)$| being a given function of |$\gamma$|⁠.

We have also found that the time dependence of the produced pion number in nucleus–nucleus collisions is described by the Gompertz function. Since both COVID-19 spread and pion production are transport phenomena, the mechanism of the former may be similar to the latter. If this is the case, there is a possibility that the basic reproduction number is high only in the initial stages of the outbreak.

Throughout this article, we have imposed the single-outbreak assumption. Since this assumption may be too restrictive, we show the results of multiple-outbreak analyses in Appendix A. The multiple-outbreak analyses also show that the COVID-19 spread in one outbreak is well described by the Gompertz function. We also note that after submitting the original manuscript of this article, the daily numbers of new cases were found to be significantly larger than the predictions given in Fig. 2 (and Fig. A.1 in Appendix A). We give brief descriptions of the data observed later in some countries in Appendix B.

Daily number of new cases $dN/dt$ with single-outbreak (gray) and multiple-outbreak (red) model analyses on linear (top) and logarithmic (bottom) scales. Magenta, green, and blue dotted curves show the contributions from the first, second, and third outbreaks, respectively.
Fig. A.1

Daily number of new cases |$dN/dt$| with single-outbreak (gray) and multiple-outbreak (red) model analyses on linear (top) and logarithmic (bottom) scales. Magenta, green, and blue dotted curves show the contributions from the first, second, and third outbreaks, respectively.

Acknowledgements

The authors thank T. Kunihiro and T. T. Takahashi for constructive comments and suggestions. They also thank N. Ikeno, A. Ono, and Y. Nara for useful discussions. This work is supported in part by Grants-in-Aid for Scientific Research from Japan Society for the Promotion of Science (JSPS) (Nos. 19H01898 and 19H05151).

Appendix A. Multiple-outbreak model analysis

In the analyses in the main text, we have assumed that there is only one dominant outbreak. Let us consider here multiple-outbreak cases, where we assume that the number of cases is described by the sum of several Gompertz functions:
(A.1)
As in the single-outbreak model discussed in the main text, we analyze the daily number of new cases, |$dN(t)/dt$|⁠,
(A.2)
where |$f'_\mathrm{G}(x)=df_\mathrm{G}(x)/dx$|⁠.

In Fig. A.1, we show |$dN/dt$| in Eq. (A.2) in comparison with the daily number of new cases. In the multiple-outbreak analysis, we use data in the whole range, |$0 \leq t \leq 140$|⁠. Two or three outbreaks are considered, and we try to describe the region with large |$dN/dt$| by adding outbreaks. The parameters obtained are summarized in Table A.1.

Table A.1.

Parameters in multiple-outbreak model analyses.

Country|$\gamma_i\ [\%/\text{day}]$||$t_i$| [day]|$N_i\ [10^3]$||$\Delta N$|
Japan|$ 5.0\pm 0.5$||$ 78.5\pm3.0$||$ 2.2\pm 0.3$||$ 0.3\pm 0.3$|
 |$ 8.2\pm 0.3$||$104.3\pm0.2$||$ 11.4\pm 0.6$| 
 |$19.4\pm 1.8$||$105.0\pm0.3$||$ 3.2\pm 0.6$| 
USA|$ 9.4\pm 0.2$||$ 95.4\pm0.4$||$ 560\pm 33$||$ 0.2\pm 0.5$|
 |$ 3.7\pm 0.1$||$124.8\pm1.2$||$ 1682\pm 21$| 
Russia|$15.4\pm 0.5$||$111.8\pm0.2$||$ 28.9\pm 1.8$||$ 0.1\pm 0.1$|
 |$16.1\pm 0.6$||$128.8\pm0.2$||$ 65.8\pm 5.1$| 
 |$ 3.3\pm 0.0$||$144.1\pm0.9$||$ 647\pm 31$| 
Brazil|$ 5.0\pm 0.4$||$111.3\pm3.5$||$ 86.6\pm 22.4$||$ 0.0\pm 0.1$|
 |$ 3.4\pm 0.4$||$160.6\pm4.9$||$ 1380\pm 330$| 
China|$14.8\pm 2.1$||$ 32.0\pm1.9$||$ 36.5\pm 14.6$||$ 52.2\pm 6.6$|
 |$17.0\pm 1.3$||$ 38.5\pm0.3$||$ 43.8\pm 14.6$| 
 |$10.4\pm 0.6$||$ 90.0\pm0.8$||$ 2.5\pm 0.2$| 
Italy|$ 8.3\pm 0.5$||$ 76.6\pm1.8$||$ 47.3\pm 11.1$||$ 0.0\pm 0.1$|
 |$10.6\pm 0.4$||$ 85.0\pm0.3$||$ 110\pm 12$| 
 |$ 7.4\pm 0.1$||$110.0\pm0.6$||$ 76.7\pm 3.7$| 
Indonesia|$ 4.1\pm 0.1$||$116.5\pm1.0$||$ 22.3\pm 1.0$||$ -0.1\pm 0.1$|
 |$ 6.0\pm 2.4$||$ 152\pm 12$||$ 23\pm 23$| 
Spain|$19.1\pm 0.9$||$ 86.3\pm0.2$||$ 36.9\pm 3.2$||$ 0.1\pm 0.2$|
 |$ 7.5\pm 0.1$||$ 89.5\pm0.1$||$ 195.4\pm 3.2$| 
 |$39.8\pm 3.1$||$129.9\pm0.2$||$ 3.6\pm 0.3$| 
S. Korea|$17.5\pm 0.3$||$ 60.4\pm0.1$||$ 8.5\pm 0.2$||$ 0.2\pm 0.2$|
 |$12.6\pm 0.6$||$ 87.8\pm0.5$||$ 2.0\pm 0.1$| 
 |$15.0\pm 3.7$||$134.1\pm1.9$||$ 0.4\pm 0.1$| 
UK|$ 5.7\pm 0.1$||$106.1\pm0.3$||$ 251.5\pm 3.5$||$ 0.1\pm 0.3$|
 |$15.5\pm 1.3$||$127.6\pm0.4$||$ 29.4\pm 3.0$| 
 |$ 50\pm 38$||$137.0\pm1.4$||$ 2.0\pm 1.9$| 
Sweden|$17.2\pm 1.4$||$ 73.2\pm0.5$||$ 1.3\pm 0.2$||$ 0.0\pm 0.1$|
 |$12.0\pm 1.1$||$ 94.9\pm0.7$||$ 4.3\pm 0.8$| 
 |$ 3.4\pm 0.1$||$124.1\pm1.7$||$ 44.6\pm 0.9$| 
Country|$\gamma_i\ [\%/\text{day}]$||$t_i$| [day]|$N_i\ [10^3]$||$\Delta N$|
Japan|$ 5.0\pm 0.5$||$ 78.5\pm3.0$||$ 2.2\pm 0.3$||$ 0.3\pm 0.3$|
 |$ 8.2\pm 0.3$||$104.3\pm0.2$||$ 11.4\pm 0.6$| 
 |$19.4\pm 1.8$||$105.0\pm0.3$||$ 3.2\pm 0.6$| 
USA|$ 9.4\pm 0.2$||$ 95.4\pm0.4$||$ 560\pm 33$||$ 0.2\pm 0.5$|
 |$ 3.7\pm 0.1$||$124.8\pm1.2$||$ 1682\pm 21$| 
Russia|$15.4\pm 0.5$||$111.8\pm0.2$||$ 28.9\pm 1.8$||$ 0.1\pm 0.1$|
 |$16.1\pm 0.6$||$128.8\pm0.2$||$ 65.8\pm 5.1$| 
 |$ 3.3\pm 0.0$||$144.1\pm0.9$||$ 647\pm 31$| 
Brazil|$ 5.0\pm 0.4$||$111.3\pm3.5$||$ 86.6\pm 22.4$||$ 0.0\pm 0.1$|
 |$ 3.4\pm 0.4$||$160.6\pm4.9$||$ 1380\pm 330$| 
China|$14.8\pm 2.1$||$ 32.0\pm1.9$||$ 36.5\pm 14.6$||$ 52.2\pm 6.6$|
 |$17.0\pm 1.3$||$ 38.5\pm0.3$||$ 43.8\pm 14.6$| 
 |$10.4\pm 0.6$||$ 90.0\pm0.8$||$ 2.5\pm 0.2$| 
Italy|$ 8.3\pm 0.5$||$ 76.6\pm1.8$||$ 47.3\pm 11.1$||$ 0.0\pm 0.1$|
 |$10.6\pm 0.4$||$ 85.0\pm0.3$||$ 110\pm 12$| 
 |$ 7.4\pm 0.1$||$110.0\pm0.6$||$ 76.7\pm 3.7$| 
Indonesia|$ 4.1\pm 0.1$||$116.5\pm1.0$||$ 22.3\pm 1.0$||$ -0.1\pm 0.1$|
 |$ 6.0\pm 2.4$||$ 152\pm 12$||$ 23\pm 23$| 
Spain|$19.1\pm 0.9$||$ 86.3\pm0.2$||$ 36.9\pm 3.2$||$ 0.1\pm 0.2$|
 |$ 7.5\pm 0.1$||$ 89.5\pm0.1$||$ 195.4\pm 3.2$| 
 |$39.8\pm 3.1$||$129.9\pm0.2$||$ 3.6\pm 0.3$| 
S. Korea|$17.5\pm 0.3$||$ 60.4\pm0.1$||$ 8.5\pm 0.2$||$ 0.2\pm 0.2$|
 |$12.6\pm 0.6$||$ 87.8\pm0.5$||$ 2.0\pm 0.1$| 
 |$15.0\pm 3.7$||$134.1\pm1.9$||$ 0.4\pm 0.1$| 
UK|$ 5.7\pm 0.1$||$106.1\pm0.3$||$ 251.5\pm 3.5$||$ 0.1\pm 0.3$|
 |$15.5\pm 1.3$||$127.6\pm0.4$||$ 29.4\pm 3.0$| 
 |$ 50\pm 38$||$137.0\pm1.4$||$ 2.0\pm 1.9$| 
Sweden|$17.2\pm 1.4$||$ 73.2\pm0.5$||$ 1.3\pm 0.2$||$ 0.0\pm 0.1$|
 |$12.0\pm 1.1$||$ 94.9\pm0.7$||$ 4.3\pm 0.8$| 
 |$ 3.4\pm 0.1$||$124.1\pm1.7$||$ 44.6\pm 0.9$| 
Table A.1.

Parameters in multiple-outbreak model analyses.

Country|$\gamma_i\ [\%/\text{day}]$||$t_i$| [day]|$N_i\ [10^3]$||$\Delta N$|
Japan|$ 5.0\pm 0.5$||$ 78.5\pm3.0$||$ 2.2\pm 0.3$||$ 0.3\pm 0.3$|
 |$ 8.2\pm 0.3$||$104.3\pm0.2$||$ 11.4\pm 0.6$| 
 |$19.4\pm 1.8$||$105.0\pm0.3$||$ 3.2\pm 0.6$| 
USA|$ 9.4\pm 0.2$||$ 95.4\pm0.4$||$ 560\pm 33$||$ 0.2\pm 0.5$|
 |$ 3.7\pm 0.1$||$124.8\pm1.2$||$ 1682\pm 21$| 
Russia|$15.4\pm 0.5$||$111.8\pm0.2$||$ 28.9\pm 1.8$||$ 0.1\pm 0.1$|
 |$16.1\pm 0.6$||$128.8\pm0.2$||$ 65.8\pm 5.1$| 
 |$ 3.3\pm 0.0$||$144.1\pm0.9$||$ 647\pm 31$| 
Brazil|$ 5.0\pm 0.4$||$111.3\pm3.5$||$ 86.6\pm 22.4$||$ 0.0\pm 0.1$|
 |$ 3.4\pm 0.4$||$160.6\pm4.9$||$ 1380\pm 330$| 
China|$14.8\pm 2.1$||$ 32.0\pm1.9$||$ 36.5\pm 14.6$||$ 52.2\pm 6.6$|
 |$17.0\pm 1.3$||$ 38.5\pm0.3$||$ 43.8\pm 14.6$| 
 |$10.4\pm 0.6$||$ 90.0\pm0.8$||$ 2.5\pm 0.2$| 
Italy|$ 8.3\pm 0.5$||$ 76.6\pm1.8$||$ 47.3\pm 11.1$||$ 0.0\pm 0.1$|
 |$10.6\pm 0.4$||$ 85.0\pm0.3$||$ 110\pm 12$| 
 |$ 7.4\pm 0.1$||$110.0\pm0.6$||$ 76.7\pm 3.7$| 
Indonesia|$ 4.1\pm 0.1$||$116.5\pm1.0$||$ 22.3\pm 1.0$||$ -0.1\pm 0.1$|
 |$ 6.0\pm 2.4$||$ 152\pm 12$||$ 23\pm 23$| 
Spain|$19.1\pm 0.9$||$ 86.3\pm0.2$||$ 36.9\pm 3.2$||$ 0.1\pm 0.2$|
 |$ 7.5\pm 0.1$||$ 89.5\pm0.1$||$ 195.4\pm 3.2$| 
 |$39.8\pm 3.1$||$129.9\pm0.2$||$ 3.6\pm 0.3$| 
S. Korea|$17.5\pm 0.3$||$ 60.4\pm0.1$||$ 8.5\pm 0.2$||$ 0.2\pm 0.2$|
 |$12.6\pm 0.6$||$ 87.8\pm0.5$||$ 2.0\pm 0.1$| 
 |$15.0\pm 3.7$||$134.1\pm1.9$||$ 0.4\pm 0.1$| 
UK|$ 5.7\pm 0.1$||$106.1\pm0.3$||$ 251.5\pm 3.5$||$ 0.1\pm 0.3$|
 |$15.5\pm 1.3$||$127.6\pm0.4$||$ 29.4\pm 3.0$| 
 |$ 50\pm 38$||$137.0\pm1.4$||$ 2.0\pm 1.9$| 
Sweden|$17.2\pm 1.4$||$ 73.2\pm0.5$||$ 1.3\pm 0.2$||$ 0.0\pm 0.1$|
 |$12.0\pm 1.1$||$ 94.9\pm0.7$||$ 4.3\pm 0.8$| 
 |$ 3.4\pm 0.1$||$124.1\pm1.7$||$ 44.6\pm 0.9$| 
Country|$\gamma_i\ [\%/\text{day}]$||$t_i$| [day]|$N_i\ [10^3]$||$\Delta N$|
Japan|$ 5.0\pm 0.5$||$ 78.5\pm3.0$||$ 2.2\pm 0.3$||$ 0.3\pm 0.3$|
 |$ 8.2\pm 0.3$||$104.3\pm0.2$||$ 11.4\pm 0.6$| 
 |$19.4\pm 1.8$||$105.0\pm0.3$||$ 3.2\pm 0.6$| 
USA|$ 9.4\pm 0.2$||$ 95.4\pm0.4$||$ 560\pm 33$||$ 0.2\pm 0.5$|
 |$ 3.7\pm 0.1$||$124.8\pm1.2$||$ 1682\pm 21$| 
Russia|$15.4\pm 0.5$||$111.8\pm0.2$||$ 28.9\pm 1.8$||$ 0.1\pm 0.1$|
 |$16.1\pm 0.6$||$128.8\pm0.2$||$ 65.8\pm 5.1$| 
 |$ 3.3\pm 0.0$||$144.1\pm0.9$||$ 647\pm 31$| 
Brazil|$ 5.0\pm 0.4$||$111.3\pm3.5$||$ 86.6\pm 22.4$||$ 0.0\pm 0.1$|
 |$ 3.4\pm 0.4$||$160.6\pm4.9$||$ 1380\pm 330$| 
China|$14.8\pm 2.1$||$ 32.0\pm1.9$||$ 36.5\pm 14.6$||$ 52.2\pm 6.6$|
 |$17.0\pm 1.3$||$ 38.5\pm0.3$||$ 43.8\pm 14.6$| 
 |$10.4\pm 0.6$||$ 90.0\pm0.8$||$ 2.5\pm 0.2$| 
Italy|$ 8.3\pm 0.5$||$ 76.6\pm1.8$||$ 47.3\pm 11.1$||$ 0.0\pm 0.1$|
 |$10.6\pm 0.4$||$ 85.0\pm0.3$||$ 110\pm 12$| 
 |$ 7.4\pm 0.1$||$110.0\pm0.6$||$ 76.7\pm 3.7$| 
Indonesia|$ 4.1\pm 0.1$||$116.5\pm1.0$||$ 22.3\pm 1.0$||$ -0.1\pm 0.1$|
 |$ 6.0\pm 2.4$||$ 152\pm 12$||$ 23\pm 23$| 
Spain|$19.1\pm 0.9$||$ 86.3\pm0.2$||$ 36.9\pm 3.2$||$ 0.1\pm 0.2$|
 |$ 7.5\pm 0.1$||$ 89.5\pm0.1$||$ 195.4\pm 3.2$| 
 |$39.8\pm 3.1$||$129.9\pm0.2$||$ 3.6\pm 0.3$| 
S. Korea|$17.5\pm 0.3$||$ 60.4\pm0.1$||$ 8.5\pm 0.2$||$ 0.2\pm 0.2$|
 |$12.6\pm 0.6$||$ 87.8\pm0.5$||$ 2.0\pm 0.1$| 
 |$15.0\pm 3.7$||$134.1\pm1.9$||$ 0.4\pm 0.1$| 
UK|$ 5.7\pm 0.1$||$106.1\pm0.3$||$ 251.5\pm 3.5$||$ 0.1\pm 0.3$|
 |$15.5\pm 1.3$||$127.6\pm0.4$||$ 29.4\pm 3.0$| 
 |$ 50\pm 38$||$137.0\pm1.4$||$ 2.0\pm 1.9$| 
Sweden|$17.2\pm 1.4$||$ 73.2\pm0.5$||$ 1.3\pm 0.2$||$ 0.0\pm 0.1$|
 |$12.0\pm 1.1$||$ 94.9\pm0.7$||$ 4.3\pm 0.8$| 
 |$ 3.4\pm 0.1$||$124.1\pm1.7$||$ 44.6\pm 0.9$| 

In many countries under consideration, |$dN/dt$| is shown to be decreasing on May 21, 2020, and the parameters are well determined. Then we adopt the three-outbreak model (⁠|$n=3$|⁠). In Japan, China, and South Korea, the multiple-outbreak structure of |$dN/dt$| is clearly seen in the logarithmic plot and can be fitted by using Eq. (A.2). In Russia, Italy, Spain, the UK, and Sweden, additional outbreaks improve the reduced |$\chi^2$| by filling the peaks that are not covered by the single outbreak. In Brazil and Indonesia, where the numbers of cases are still rapidly increasing, we need at least one outbreak term with |$t_i > t_\mathrm{now}$|⁠. In these cases, the parameters generally have large uncertainties, so the third outbreak, if included, has extremely large uncertainties larger than 100%. Thus we use the two-outbreak model (⁠|$n=2$|⁠) in these countries.

In the USA, there are many centers of outbreaks. In Fig. A.2, we show the |$dN(t)/dt$| data in New York State, Massachusetts, and California [32]. It is possible to fit the data in each state by using the Gompertz function, but the results show significantly different values in |$(\gamma, t_0)$|⁠. This could be the reason for the slow decrease of |$dN(t)/dt$| in the USA. As a result, a single-outbreak treatment is not appropriate. By comparison, the |$dN(t)/dt$| data are reasonably explained by two outbreaks (⁠|$n=2$|⁠). We have used the data in Ref. [29], updated on June 7, 2020. The daily number of new cases does not decrease and it seems that it is taking more time to settle down.

Daily number of new cases $dN(t)/dt$ in the USA with single-outbreak (gray) and multiple-outbreak (red and blue) model analyses. The red and blue curves show the fitting results to the data up to May 20 and June 6, respectively. Magenta, brown and cyan histograms (curves) show the data (fitting results in the single-outbreak model) in New York State, Massachusetts, and California, respectively.
Fig. A.2

Daily number of new cases |$dN(t)/dt$| in the USA with single-outbreak (gray) and multiple-outbreak (red and blue) model analyses. The red and blue curves show the fitting results to the data up to May 20 and June 6, respectively. Magenta, brown and cyan histograms (curves) show the data (fitting results in the single-outbreak model) in New York State, Massachusetts, and California, respectively.

After obtaining |$(N_i, \gamma_i, t_i)$| by fitting |$dN/dt$| data, the constant part (⁠|$\Delta N$|⁠) is obtained by fitting |$N(t)$|⁠. Thus obtained multiple-outbreak functions in Eq. (A.1) are compared with the data in Fig. A.3. In the region with |$N(t)>100$|⁠, the multiple-outbreak functions are found to explain the data well. This supports the idea that the number of cases in one outbreak would be described by the Gompertz function.

Number of cases $N(t)$ with single-outbreak and multiple-outbreak model analyses on linear (top) and logarithmic (bottom) scales. Magenta, green, and blue dotted curves show the contributions from the first, second, and third outbreaks, respectively.
Fig. A.3

Number of cases |$N(t)$| with single-outbreak and multiple-outbreak model analyses on linear (top) and logarithmic (bottom) scales. Magenta, green, and blue dotted curves show the contributions from the first, second, and third outbreaks, respectively.

Appendix B. COVID-19 spread in July and August

After submitting the original manuscript in June, 2020, we find excess cases over predictions in several countries. It is also valuable to verify the consequences of the lockdowns. We give a brief description of the above-mentioned issues. This appendix is given as a “note added”, and the details will be reported elsewhere.

In Sweden, the USA, and Japan, a significant excess of cases over the Gompertz function analyses was observed after mid-June as shown in Fig. B.1. In addition to |$dN(t)/dt$|⁠, the daily number of deaths |$dD(t)/dt$| can also be explained by the Gompertz functions [1,2], as shown by blue (⁠|$dN(t)/dt$|⁠) and red (⁠|$dD(t)/dt$|⁠) solid curves. Each of the dotted curves shows the contribution from one outbreak. We have used two and three Gompertz function terms for Sweden and the USA, respectively, and three and four terms for |$dN(t)/dt$| and |$dD(t)/dt$| in Japan. In Sweden, |$dN(t)/dt$| started to increase again around |$t=140$|⁠, but |$dD(t)/dt$| kept decreasing. In the USA, |$dN(t)/dt$| started to increase again around |$t=165$|⁠. Compared with the new cases, the increase of |$dD(t)/dt$| after mid-June is less prominent. The gray dashed line shows the fitting results of the new cases shifted later by seven days and multiplied by 0.07, |$A \times dN(t-t_D)/dt$| with |$A=0.07$| and |$t_D=7$|⁠. This curve roughly agrees with |$dD(t)/dt$| until |$t\simeq 130$| but is larger than |$dD(t)/dt$| later. A similar but stronger trend is observed in Japan. We find that |$dN(t)/dt$| once decreased rapidly but increased again around |$t=160$|⁠. The shifted and scaled function of the new cases (⁠|$A=0.05$| and |$t_D=14$|⁠) roughly explains the daily number of deaths until |$t\simeq 170$| but it overestimates |$dD(t)/dt$| later.

The number of new cases $dN(t)/dt$ (blue histograms), the daily number of deaths $dD(t)/dt$ (red histograms), and the number of new PCR tests with the rescaling factor 0.1 or 0.2 (green histograms) in Sweden (left), the USA (middle), and Japan (right). Histograms show data taken from Ref. [29]. Solid (dotted) curves show the fitting results (their breakdowns) in the multiple-outbreak model for $dN(t)/dt$ (blue) and $dD(t)/dt$ (red). The gray dashed curves show the shifted and scaled $dN(t)/dt$ results in the model (see the text).
Fig. B.1

The number of new cases |$dN(t)/dt$| (blue histograms), the daily number of deaths |$dD(t)/dt$| (red histograms), and the number of new PCR tests with the rescaling factor 0.1 or 0.2 (green histograms) in Sweden (left), the USA (middle), and Japan (right). Histograms show data taken from Ref. [29]. Solid (dotted) curves show the fitting results (their breakdowns) in the multiple-outbreak model for |$dN(t)/dt$| (blue) and |$dD(t)/dt$| (red). The gray dashed curves show the shifted and scaled |$dN(t)/dt$| results in the model (see the text).

Let us guess the reason for these behaviors. There are several possible reasons for the |$dN(t)/dt$| increase in July, such as (a) an increase in the number of polymerase chain reaction (PCR) tests and improved sensitivity, (b) lifting the lockdown and economic resumption, and (c) genomic mutation [33,34]. In terms of the decrease in mortality (death rate), (ab|$'$|⁠) domination by the young and mild symptom cases from (a) and (b) with improved medical care, (c) genomic mutation [33,34], and (d) T cell immunity [3537] are the candidate reasons. In Sweden, the government expanded coronavirus testing to include people with mild symptoms on May 19 (⁠|$t=141$|⁠); thus the increase in cases after |$t=140$| may have been caused mainly by candidate reason (a), an increase in the number of PCR tests. The increase in cases would have been dominated by people with mild symptoms and the actual number of infected people may have not increased. Thus it is reasonable that |$dD(t)/dt$| did not increase. In the USA and Japan, |$dD(t)/dt$| also started to increase in late June (⁠|$t \geq 160$|⁠), and then reason (a) is insufficient. We guess that candidate reason (b), lifting the lockdown and economic resumption, would have affected the spread, and that (ab|$'$|⁠), domination by the young and mild symptom cases with improved medical care, may have suppressed the mortality. We would like to mention other possibilities, (c) and (d). It has already been pointed out that genomes with advanced mutation were found after late June in Japan [33,34]. Candidate reason (c), a genomic mutation, may have caused the July and August epidemic and reduced mortality. Another candidate is (d), T cell immunity [3537]. The memory T cells exposed to SARS-CoV-2 may work for long-term immune protection against COVID-19 [3537], and these effects may be enhanced by BCG vaccination [38,39]. This mechanism could be universal and explains the suppression of mortality in many countries. The actual mechanism of the present coronavirus spread could be a combination of (a)–(d) and others. Analysis of more data and further research into infectious diseases are necessary to elucidate the mechanism. These are beyond the scope of this article.

The numbers of total confirmed cases and the |$K$| value should depend on both the lockdown of the social systems and the amount of PCR testing carried out in the different countries. Here let us take a look at these effects in the case of the UK, as an example. In Fig. B.2, we show the |$K$| value before |$t=140$| (left) and the number of new cases, deaths, and PCR tests (right). The peak time was |$t_0=110.6$| (⁠|$t_1=106.1$| and |$t_2=127.6$|⁠) in the single-outbreak (multiple-outbreak) analysis as given in Table 1 (Table A.1). The lockdown in the UK started on March 23 (⁠|$t=83$|⁠), and the |$K$| value started to decrease around two weeks after that (⁠|$t=97$|⁠) as seen in the left panel of Fig. B.2, although the decrease was not enough to go well below the Gompertz curve. Nevertheless, we should mention that the lockdown prevented the next outbreak from starting. The increase of new cases around |$t=120$| is considered to be caused by the speedy increase of PCR testing since the daily number of deaths kept decreasing, and the next outbreak started later than the lift of the lockdown, |$t_\mathrm{relaxed}=132$|⁠. It should be noted that these are conjectures deduced from the |$dN(t)/dt$| and |$dD(t)/dt$| data, and confirmation by further analyses is necessary.

The $K$ value as a function of $t$ (left) and the number of new cases $dN(t)/dt$, the daily number of deaths $dD(t)/dt$, and the number of new PCR tests with the rescaling factor 0.5 (right) in the UK. In the left panel, we show the lockdown time, two weeks after the lockdown, and the peak time(s) in the single (multiple) outbreak model $t_0$ ($t_1$ and $t_2$) by arrows. In the right panel, the meanings of the curves are the same as those in Fig. B.1. The gray dashed line in the right panel shows the shifted and scaled function of the new cases with $A=0.15$ and $t_D=7$.
Fig. B.2

The |$K$| value as a function of |$t$| (left) and the number of new cases |$dN(t)/dt$|⁠, the daily number of deaths |$dD(t)/dt$|⁠, and the number of new PCR tests with the rescaling factor 0.5 (right) in the UK. In the left panel, we show the lockdown time, two weeks after the lockdown, and the peak time(s) in the single (multiple) outbreak model |$t_0$| (⁠|$t_1$| and |$t_2$|⁠) by arrows. In the right panel, the meanings of the curves are the same as those in Fig. B.1. The gray dashed line in the right panel shows the shifted and scaled function of the new cases with |$A=0.15$| and |$t_D=7$|⁠.

Footnotes

1 In actual fitting processes, we regard the daily difference as the derivative, |$N(t)-N(t-1)=dN/dt|_{t-1/2}$|⁠; then the actual fitting range is |$2.5 \leq t \leq 137.5$| for |$dN/dt$|⁠. In the following discussions, this half-day difference is not explicitly written but should be understood.

References

[1]

Levitt
 
M.
,
A.
 
Scaiewicz
, and
F.
 
Zonta
(
2020
), “
Predicting the trajectory of any COVID19 epidemic from the best straight line
”,
medRxiv
, doi:http://dx.doi.org/10.1101/2020.06.26.20140814, preprint: not peer reviewed.

[2]

Levitt
 
M.
,
“Exponential Growth is Terrifying” and “Curve Fitting for Understanding”
, online lectures on
May
 
14th
,
2020
, linked from http://med.stanford.edu/levitt.html (2020).

[3]

Castorina
 
P.
,
Iorio
 
A.
, and
Lanteri
 
D.
,
Int. J. Mod. Phys. C
 
31
,
2050103
(
2020
) [arXiv:2003.00507 [physics.soc-ph]]. ()

[4]

Jia
 
L.
,
Li
 
K.
,
Jiang
 
Y.
,
Guo
 
X.
, and
Zhao
 
T.
, arXiv:2003.05447 [q-bio.PE].

[5]

Dattoli
 
G.
,
Di Palma
 
E.
,
Licciardi
 
S.
, and
Sabia
 
E.
, arXiv:2003.08684 [q-bio.PE].

[6]

Lanteri
 
D.
,
D.
 
Carco’
, and
P.
 
Castorina
, arXiv:2003.12457 [q-bio.PE].

[7]

Ponciano
 
J. M.
,
Ponciano
 
J. A.
,
Gómez
 
J. P.
,
Holt
 
R. D.
, and
Blackburn
 
J. K.
, arXiv:2005.11201 [q-bio.PE].

[8]

J.
 
Mazurek
and
Z.
 
Neničkovà
(
2020
), “
Predicting the number of total COVID-19 cases in the USA by a Gompertz curve
”,
Research Gate
, doi:10.13140/RG.2.2.19841.81761, preprint: not peer reviewed.

[9]

Yang
 
W.
,
D.
 
Zhang
,
L.
 
Peng
,
C.
 
Zhuge
, and
L.
 
Hong
(
2020
), “
Rational evaluation of various epidemic models based on the COVID-19 data of China
”,
medRxiv
, doi:10.1101/2020.03.12.20034595, preprint: not peer reviewed.

[10]

Ahmadi
 
A.
,
Y.
 
Fadaei
,
M.
 
Shirani
, and
F.
 
Rahmani
(
2020
), “
Modeling and forecasting trend of COVID-19 epidemic in Iran
”,
medRxiv
, doi:10.1101/2020.03.17.20037671, preprint: not peer reviewed.

[11]

Utsunomiya
 
Y. T.
,
Utsunomiya
 
A. T. H.
,
Torrecilha
 
R. B. P.
,
Paulan
 
S. C.
,
Milanesi
 
M.
, and
Garcia
 
J. F.
,
Front Med
 
7
,
247
(
2020
). )

[12]

D. J.
 
Austin
(
2020
), “
Near- and forecasting the SARS-COV-2 epidemic requires a global view and multiple methods
”,
medRxiv
, doi:10.1101/2020.04.09.20059402, preprint: not peer reviewed.

[13]

Donsimoni
 
J. R.
,
R.
 
Glawion
,
B.
 
Plachter
,
C.
 
Weiser
, and
K.
 
Wälde
(
2020
), “
Should contact bans be lifted in Germany? A quantitative prediction of its effects
”,
medRxiv
, doi:10.1101/2020.04.10.20060301, preprint: not peer reviewed.

[14]

T.
 
Klabunde
and
C.
 
Giegerich
(
2020
), “
How high and long will the COVID-19 wave be? A data-driven approach to model and predict the COVID-19 epidemic and the required capacity for the German health system
”,
medRxiv
, doi:10.1101/2020.04.14.20064790, preprint: not peer reviewed.

[15]

Silva
 
R. R.
,
W. D.
 
Velasco
,
W.
 
da Silva Marques
, and
C. A. G.
 
Tibirica
(
2020
), “
A Bayesian analysis of the total number of cases of the COVID 19 when only a few data is available. A case study in the state of Goias, Brazil
”,
medRxiv
, doi:10.1101/2020.04.19.20071852, preprint: not peer reviewed.

[16]

C. M.
 
Dutra
 Sr. (
2020
), “
Non-linear fitting of sigmoidal growth curves to predict a maximum limit to the total number of COVID-19 cases in the United States
”,
medRxiv
, doi:10.1101/2020.04.22.20074898, preprint: not peer reviewed.

[17]

Attanayake
 
A. M. C. H.
,
Perera
 
S.
, and
Jayasinghe
 
S.
,
Comput Math Method M
 
2020
,
6397063
(
2020
). ()

[18]

Catala
 
M.
,
S.
 
Alonso
,
E. A.
 
Lacalle
,
D.
 
Lopez
,
P.-J.
 
Cardona
, and
C.
 
Prats
(
2020
), “
Empiric model for short-time prediction of COVID-19 spreading
”,
medRxiv
, doi:10.1101/2020.05.13.20101329, preprint: not peer reviewed.

[19]

R. P.
 
Hughes
and
D. A.
 
Hughes
(
2020
), “
Impact of relaxing Covid-19 social distancing measures on rural North Wales: A simulation analysis
”,
medRxiv
, doi:10.1101/2020.05.15.20102764, preprint: not peer reviewed.

[20]

C. Z. Z.
 
Guilmoto
(
2020
), “
COVID-19 death rates by age and sex and the resulting mortality vulnerability of countries and regions in the world
”,
medRxiv
, doi:10.1101/2020.05.17.20097410, preprint: not peer reviewed.

[21]

Bartolomeo
 
N.
,
P.
 
Trerotoli
, and
G.
 
Serio
(
2020
), “
Estimating the size of the COVID-19 outbreak in Italy: Application of an exponential decay model to the weighted and cumulative average daily growth rate
”,
medRxiv
, doi:10.1101/2020.05.20.20108241, preprint: not peer reviewed.

[22]

Dagon
 
D.
,
Zou
 
C.
, and
Lee
 
W.
,
Proc. 13th Annual Network and Distributed System Security Symp
. (NDSS’06), p.
235
(
2006
).

[23]

Gompertz
 
B.
,
Phil. Trans. R. Soc. Lond.
 
115
,
513
(
1825
). ()

[24]

Laird
 
A. K.
,
Br. J. Cancer
 
13
,
490
(
1964
). ()

[25]

Ohishi
 
K.
,
Okamura
 
H.
, and
Dohi
 
T.
,
J. Syst. Softw.
 
82
,
535
(
2009
). ()

[26]

Chawla
 
R.
and
Kaur
 
M.
,
Adv. High Energy Phys.
 
2018
,
5129341
(
2018
). ()

[27]

T.
 
Nakano
and
Y.
 
Ikeda
(
2020
), “
Novel indicator of change in COVID-19 spread status
”,
medRxiv
, doi:10.1101/2020.04.25.20080200, preprint: not peer reviewed.

[28]

Y.
 
Akiyama
(
2020
), “
The K indicator epidemic model follows the Gompertz curve
”, in the note linked from http://www.bi.cs.titech.ac.jp/COVID-19/.

[29]

Hannah
 
Ritchie
(
2020
), “
Our World in Data COVID-19 dataset
”, https://ourworldindata.org/coronavirus-source-data.

[30]

Ikeno
 
N.
,
Ono
 
A.
,
Nara
 
Y.
, and
Ohnishi
 
A.
,
Phys. Rev. C
 
93
,
044612
(
2016
);
97
,
069902
(
2018
) [erratum]. ()

[31]

Nara
 
Y.
,
Otuka
 
N.
,
Ohnishi
 
A.
,
Niita
 
K.
, and
Chiba
 
S.
,
Phys. Rev. C
 
61
,
024901
(
1999
). ()

[32]

The COVID tracking project (

2020
), https://covidtracking.com/.

[33]

Sekizuka
 
T.
 et al. (2020),
mSphere
, in press [doi:].

[34]

M.
 
Kuroda
(
2020
), “
Genome molecular epidemiological survey of the new coronavirus SARS-CoV-2 2
”, https://www.niid.go.jp/niid/ja/basic-science/467-genome/9787-genome-2020-2.html [in Japanese].

[35]

Le Bert
 
N.
 et al. ,
Nature
 
584
,
457
(
2020
). ()

[36]

Nelde
 
A.
 et al. (2020), “SARS-CoV-2 T-cell epitopes define heterologous and COVID-19-induced T-cell recognition”, Research Square, doi:10.21203/rs.3.rs-35331/v1, preprint: not peer reviewed.

[37]

Sekine
 
T.
 et al. (2020),
Cell
 
183
,
158
(
2020
). ()

[38]

Amirlak
 
I.
,
R.
 
Haddad
,
J. D.
 
Hardy
,
N. S.
 
Khaled
,
M.
 
Hsiang
, and Chung, B. Amirlak (
2020
), “
Effectiveness of booster BCG vaccination in preventing Covid-19 infection
”,
medRxiv
, doi:10.1101/2020.08.10.20172288, preprint: not peer reviewed.

[39]

J.
 
Sato
(
2020
), “
T cell enhanced by BCG/TB is the key to fight against C19 and Dectin-2+TB/BCG may prevent the severe stage of C19, cytokine storm
”, https://www.jsatonotes.com/.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.