Abstract

Background

The origins of Ebola disease outbreaks remain enigmatic. Historically outbreaks have been attributed to spillover events from wildlife. However, recent data suggest that some outbreaks may originate from human-to-human transmission of prior outbreak strains instead of spillover. Clarifying the origins of Ebola disease outbreaks could improve detection and mitigation of future outbreaks.

Methods

We reviewed the origins of all Ebola disease outbreaks from 1976 to 2022 to analyze the earliest cases and characteristics of each outbreak. The epidemiology and phylogenetic relationships of outbreak strains were used to further identify the likely source of each outbreak.

Results

From 1976 to 2022 there were 35 Ebola disease outbreaks with 48 primary/index cases. While the majority of outbreaks were associated with wildlife spillover, resurgence of human-to-human transmission could account for roughly a quarter of outbreaks caused by Ebola virus. Larger outbreaks were more likely to lead to possible resurgence, and nosocomial transmission was associated with the majority of outbreaks.

Conclusions

While spillover from wildlife has been a source for many Ebola disease outbreaks, multiple outbreaks may have originated from flare-ups of prior outbreak strains. Improving access to diagnostics as well as identifying groups at risk for resurgence of ebolaviruses will be crucial to preventing future outbreaks.

Despite global advancements in vaccines, diagnostics, and therapeutics for ebolaviruses, devastating outbreaks of Ebola disease (EBOD) continue to occur across Africa. Understanding where and when ebolaviruses emerge in human populations could enhance preparedness for future outbreaks. Since ebolaviruses were first detected in humans in 1976, it has been widely thought that all EBOD outbreaks originate from spillover of ebolaviruses from wildlife into humans [1], but recent outbreaks have changed our understanding of the emergence of ebolaviruses [2]. Therefore, reassessing historic EBOD outbreaks through modern epidemiologic and phylogenetic lenses could reveal patterns for preventing further outbreaks.

Ebolaviruses are members of the family Filoviridae, and viruses from 4 Ebolavirus species are known to cause disease in humans (Table 1). We now generally refer to the disease caused by all ebolaviruses in humans as EBOD, ever since the International Classification of Disease for filoviruses (ICD-11) in 2019 [3]. These different ebolaviruses likely have distinct reservoir hosts and ecological niches [4]. Ebola virus (EBOV) RNA has been detected in 3 fruit bat species [5], whereas Sudan virus (SUDV), Bundibugyo virus (BDBV), and Taï Forest virus (TAFV) have not been detected in any potential wildlife reservoirs. While no ebolavirus that is known to be pathogenic in humans has ever been isolated from a putative reservoir host, multiple bat species have been considered possible reservoir hosts and other mammals have been deemed likely incidental hosts [4]. Spillover from these reservoir hosts or incidental hosts has been historically determined to be the likely source of EBOD outbreaks.

Table 1.

Ebolaviruses Known to Cause Disease in Humans

EbolavirusEbolavirus SpeciesEbola Disease (EBOD)
Ebola virus (EBOV)Zaire ebolavirusEbola virus disease (EVD)
Sudan virus (SUDV)Sudan ebolavirusSudan virus disease (SVD)
Bundibugyo virus (BDBV)Bundibugyo ebolavirusBundibugyo virus disease (BVD)
Taï Forest virus (TAFV)Taï Forest ebolavirusOther specified Ebola disease
EbolavirusEbolavirus SpeciesEbola Disease (EBOD)
Ebola virus (EBOV)Zaire ebolavirusEbola virus disease (EVD)
Sudan virus (SUDV)Sudan ebolavirusSudan virus disease (SVD)
Bundibugyo virus (BDBV)Bundibugyo ebolavirusBundibugyo virus disease (BVD)
Taï Forest virus (TAFV)Taï Forest ebolavirusOther specified Ebola disease
Table 1.

Ebolaviruses Known to Cause Disease in Humans

EbolavirusEbolavirus SpeciesEbola Disease (EBOD)
Ebola virus (EBOV)Zaire ebolavirusEbola virus disease (EVD)
Sudan virus (SUDV)Sudan ebolavirusSudan virus disease (SVD)
Bundibugyo virus (BDBV)Bundibugyo ebolavirusBundibugyo virus disease (BVD)
Taï Forest virus (TAFV)Taï Forest ebolavirusOther specified Ebola disease
EbolavirusEbolavirus SpeciesEbola Disease (EBOD)
Ebola virus (EBOV)Zaire ebolavirusEbola virus disease (EVD)
Sudan virus (SUDV)Sudan ebolavirusSudan virus disease (SVD)
Bundibugyo virus (BDBV)Bundibugyo ebolavirusBundibugyo virus disease (BVD)
Taï Forest virus (TAFV)Taï Forest ebolavirusOther specified Ebola disease

However, recent findings have changed how we think about the emergence of ebolaviruses in humans. During the West Africa Ebola virus disease (EVD) epidemic, a male survivor infected his female partner 179 days after being diagnosed with EVD, likely through sexual transmission [6]. Towards the end of the West Africa epidemic, a new cluster of EVD cases were found in Guinea and Liberia, which were linked to sexual transmission from a survivor that occurred about 470 days after he had symptoms [7]. Subsequent cohort studies have demonstrated prolonged EBOV persistence in the semen of EVD survivors, further indicating risk for delayed sexual transmission [8]. EBOV was also found to persist in aqueous humor as well as cerebrospinal fluid in survivors who developed uveitis and meningoencephalitis after their initial convalescence [9, 10]. Therefore, the testes, eyes, central nervous system, as well as placenta and potentially breast milk were deemed immune-privileged sites where EBOV can persist and evade the immune system [11].

Another potential source for EVD outbreaks was identified after a survivor had relapse of infection 149 days after leaving an Ebola treatment unit, which caused a new chain of transmission [12]. Although risk factors for relapse of infection remain unknown, it was noted that the survivor had received antibody-based therapies during their initial treatment [12]. Researchers also found EBOV persistence in the central nervous system and recrudescence of infection in nonhuman primates who were treated with monoclonal antibody-based therapies, leading to concerns such therapies may contribute to relapse of EVD in humans [13].

Recently, phylogenetic analyses of EBOV genomes from the 2021 outbreak in Guinea showed that the outbreak strain was genetically similar to the Makona strain from the West Africa epidemic from 2013 to 2016 [2]. It was postulated that this outbreak may have originated from delayed sexual transmission, reactivation of latent infection, or unrecognized chains of transmission among humans instead of spillover from wildlife. Hereon we refer to outbreaks originating from such sources as flare-ups or resurgence events of human-to-human transmission. Unpublished sequences from the 2020 Equateur, 2021 North Kivu, and 2022 North Kivu outbreaks were also shown to be similar to sequences from prior outbreaks in those regions, and it is thought that these outbreaks were resurgence events, while the 2022 Equateur outbreak has been recognized to be likely a separate spillover event [14–18].

Investigating the spatiotemporal and phylogenetic relationships of prior EBOD outbreaks could further elucidate how outbreaks originate. While reviewing the exposure history and contexts of initial cases could help identify the sources of EBOD outbreaks, additional insights can be gained through phylogenetic analyses. During initial EVD outbreaks researchers found little genetic variation within the same epidemic chain and that the glycoprotein (GP) gene was conserved [19–22]. Rapid sequencing in the field during the West Africa EVD epidemic also initially revealed that EBOV sequences from human-to-human transmission had few mutations that were mostly synonymous or in noncoding regions [23]. However, as the outbreak grew and became the largest and longest EVD epidemic in history, multiple EBOV lineages were subsequently found, and a higher amount of genetic diversity was found in the GP gene [24]. The mean EBOV evolutionary rate from human-to-human transmission during this outbreak was estimated at approximately 1.2 × 10−3 nucleotide substitutions/site/year [24]. The EBOV genome is approximately 19 kb so this is roughly 23 substitutions per year.

While less is known about the evolutionary rate during persistent EBOV infection, researchers found low genetic divergence in reemerged strains from persistently infected individuals [25]. In samples from acutely infected individuals, researchers found an evolutionary rate of 0.96 × 10−3 nucleotide substitutions/site/year and similar or reduced evolutionary rates in semen, aqueous humor, and urine during persistent infection [26]. In fact, the virus from the EVD survivor who infected his partner through sexual transmission 470 days after symptom onset had an evolutionary rate of 0.19 × 10−3 substitutions/site/year, or approximately 4 substitutions per year [7]. Therefore, these evolutionary rates provide rough estimates of the expected variation one might see from very prolonged human-to-human transmission versus relapse of persistent infection as the origin of a new outbreak.

The evolutionary rate in the natural reservoir host for EBOV remains unknown. However, there was found to be significant sequence variation between EBOV strains from separate spillover events from wildlife [20, 22]. Because spillover events from wildlife occur with viruses from different populations, host species, and locations, it is expected that there would be greater genetic variation between viruses from different spillover events. Therefore, considering both epidemiologic and phylogenetic patterns is necessary to characterize the origins of EBOD outbreaks.

Knowing how EBOD outbreaks originate and whether they are more likely to occur from spillover or resurgence of human-to-human transmission could change the way that we prepare for future outbreaks. Therefore, the goal of this study is to identify the origins of all EBOD outbreaks from 1976 to 2022. Through comparing epidemiologic and phylogenetic relationships, we aim to determine the origins and characteristics of each EBOD outbreak. Additionally, we aim to analyze if the detection of ebolaviruses and EBOD outbreaks has changed, and what gaps remain in detecting and preventing future outbreaks.

METHODS

We investigated the epidemiology of all known EBOD outbreaks from 1976 to 2022. The US Centers for Disease Control and Prevention's chronology of EBOD outbreaks was used as a guide to determine outbreaks (excluding laboratory-acquired and exported cases) [27]. We then reviewed the primary literature as well as World Health Organization (WHO) and international outbreak reports. For each outbreak, we sought to determine the earliest infected individual, known as the primary case. If this was not possible, we identified the first case recognized by health care authorities, the index case. We labeled some individuals as both primary and index cases if they fulfilled both of these criteria. In certain circumstances, we identified multiple primary/index cases from the same outbreak (if multiple individuals were exposed to the same animal or if there was an epizootic event). We categorized each primary/index case as confirmed, probable, or suspected according to WHO Integrated Disease Surveillance and Response EVD case definitions [28]. We created a dataset of all primary/index cases from 1976 to 2022 along with their associated locations, dates of onset, demographics, suspected sources, and diagnostics [29]. We also made a dataset of all EBOD outbreaks from 1976 to 2022, including locations, most likely origins, cases, deaths, case fatality rates, and factors associated with transmission [29].

To determine whether EBOD outbreaks originated from spillover or resurgence of human-to-human transmission, we conducted a phylogenetic analysis with representative sequences from each outbreak (Supplementary Table 1). We then compared epidemiological contexts with phylogenetic relationships to determine the most likely origin of each outbreak. We aligned 26 ebolavirus whole-genome sequences from 1976 to 2021 using MAFFT [30] with the default parameters in Geneious Prime version 2023.0 (https://www.geneious.com) and constructed a maximum likelihood tree from PhyML [31] using the GTR nucleotide substitution model with 100 bootstraps in Geneious Prime. Given representative sequence similarities, we also performed separate alignments of multiple EBOV whole-genome sequences from the 1976 and 1977 outbreaks, as well as the 2007 and 2008 outbreaks in the Democratic Republic of the Congo (Supplementary Table 2). We used a distance matrix to compare the number of nucleotide differences between aligned sequences. We considered the previously described EBOV evolutionary rates in humans when comparing nucleotide differences, with 23 nucleotide substitutions/year as the upper boundary in variation one might expect from an outbreak originating from sustained human-to-human transmission. Therefore, genome sequences from consecutive years that had alignment differences of less than 23 nucleotide substitutions and no epidemiologic evidence of spillover were considered to potentially originate from resurgence of human transmission.

We created a map of the initial locations of each outbreak using QGIS (http://www.qgis.org). We used the point coordinates from prior analyses [4, 32], as well as where initial cases were treated for the 2017–2022 outbreaks. For outbreaks that were possibly from resurgence events, we calculated the number of days from the first case of a possible flare-up to the date that the previous zoonotic outbreak was declared to be over. We also performed a logistic regression analysis to determine the association between the number of cases in an outbreak and whether that outbreak strain was associated with a subsequent flare-up. We conducted statistical analyses using GraphPad Prism 9 (https://www.graphpad.com/).

RESULTS

We identified a total of 35 EBOD outbreaks (Table 2); 24 of 35 (68.5%) of EBOD outbreaks were due to EVD. If a primary case seemed more likely due to spillover and there was an epidemiologic association with particular wildlife exposure, we included these wildlife species as suspected sources (Supplementary Table 3). Only outbreaks of EVD were recognized as potentially originating from resurgence of prior outbreak strains. TAFV, BDBV, and SUDV have not been associated with transmission via relapse or epizootic events, so outbreaks from these viruses were characterized as separate spillover events, likely from different wildlife hosts. The potential origins of EBOD outbreaks included a single spillover event from a wildlife host, multiple spillover events from a wildlife host, epizootic events with spillover from multiple different wildlife hosts, and resurgence of human-to-human transmission/flare-ups from previous outbreaks. At least one EVD outbreak may have originated from both a spillover event and a resurgence event. Seven of 24 (25.9%) of EVD outbreaks were found to possibly originate from resurgence events, 5 of which occurred after the 2013–2016 epidemic. The initial locations and origins for each outbreak are shown in Figure 1.

Locations of Ebolavirus emergence 1976–2022. The locations and potential origins of each Ebola disease outbreak are shown. Potential resurgence of human-to-human transmission generally occurred in close proximity to the locations of prior outbreaks.
Figure 1.

Locations of Ebolavirus emergence 1976–2022. The locations and potential origins of each Ebola disease outbreak are shown. Potential resurgence of human-to-human transmission generally occurred in close proximity to the locations of prior outbreaks.

Table 2.

Ebola Disease Outbreak Origins 1976–2022

YearCountryInitial LocationVirusOrigin, Suspected SourceCases (CFR %)
1976South SudanNzaraSUDVSpillover284 (53)
1976DRCYambukuEBOVSpillover318 (88)
1977DRCBonduniEBOV1976 Yambuku outbreaka1 (100)
1979South SudanNzaraSUDVSpillover34 (64)
1994GabonMékoukaEBOVSpillover52 (60)
1994Cote d’IvoireTai National ParkTAFVSpillover1 (0)
1995DRCKikwitEBOVSpillover315 (81)
1996GabonMayiboutEBOVSpillover31 (67)
1996GabonBoouéEBOVSpillover60 (75)
2000–2001UgandaRwot-ObilloSUDVSpillover425 (53)
2001–2002GabonMendembaEBOVSpillover65 (82)
2001ROCOllobaEBOVSpillover59 (75)
2003ROCYembelangoye; MvoulaEBOVSpillover143 (90)
2003ROCMbandzaEBOVSpillover35 (83)
2004South SudanYambioSUDVSpillover17 (41)
2005ROCParc d’OdzalaEBOVSpillover12 (83)
2007DRCLueboEBOVSpillover264 (71)
2007UgandaKabangoBDBVSpillover149 (25)
2008–2009DRCLueboEBOV2007 Luebo outbreaka32 (47)
2011UgandaNakisamataSUDVSpillover1 (1)
2012DRCIsiroBDBVSpillover62 (55)
2012UgandaNyanswigaSUDVSpillover24 (71)
2012UgandaLuweroSUDVSpillover7 (57)
2013–2016GuineaMeliandouEBOVSpillover28 656 (40)
2014DRCIkanamongoEBOVSpillover69 (71)
2017DRCLikatiEBOVSpillover8 (50)
2018DRCIkoko-ImpengeEBOVSpillover54 (61)
2018–2020DRCManginaEBOVSpillover3470 (66)
2020DRCMbandakaEBOVSpillover and 2018 Equateur outbreaka130 (42)
2021DRCButemboEBOV2018–2020 North Kivu outbreaka12 (50)
2021GuineaGouékéEBOV2013–2016 epidemica23 (52)
2021DRCButsiliEBOV2018–2020 North Kivu outbreaka11 (55)
2022DRCMbandakaEBOVSpillover5 (5)
2022DRCBeni Health ZoneEBOV2018–2020 North Kivu outbreaka1 (1)
2022UgandaMubendeSUDVSpillover164 (47)
YearCountryInitial LocationVirusOrigin, Suspected SourceCases (CFR %)
1976South SudanNzaraSUDVSpillover284 (53)
1976DRCYambukuEBOVSpillover318 (88)
1977DRCBonduniEBOV1976 Yambuku outbreaka1 (100)
1979South SudanNzaraSUDVSpillover34 (64)
1994GabonMékoukaEBOVSpillover52 (60)
1994Cote d’IvoireTai National ParkTAFVSpillover1 (0)
1995DRCKikwitEBOVSpillover315 (81)
1996GabonMayiboutEBOVSpillover31 (67)
1996GabonBoouéEBOVSpillover60 (75)
2000–2001UgandaRwot-ObilloSUDVSpillover425 (53)
2001–2002GabonMendembaEBOVSpillover65 (82)
2001ROCOllobaEBOVSpillover59 (75)
2003ROCYembelangoye; MvoulaEBOVSpillover143 (90)
2003ROCMbandzaEBOVSpillover35 (83)
2004South SudanYambioSUDVSpillover17 (41)
2005ROCParc d’OdzalaEBOVSpillover12 (83)
2007DRCLueboEBOVSpillover264 (71)
2007UgandaKabangoBDBVSpillover149 (25)
2008–2009DRCLueboEBOV2007 Luebo outbreaka32 (47)
2011UgandaNakisamataSUDVSpillover1 (1)
2012DRCIsiroBDBVSpillover62 (55)
2012UgandaNyanswigaSUDVSpillover24 (71)
2012UgandaLuweroSUDVSpillover7 (57)
2013–2016GuineaMeliandouEBOVSpillover28 656 (40)
2014DRCIkanamongoEBOVSpillover69 (71)
2017DRCLikatiEBOVSpillover8 (50)
2018DRCIkoko-ImpengeEBOVSpillover54 (61)
2018–2020DRCManginaEBOVSpillover3470 (66)
2020DRCMbandakaEBOVSpillover and 2018 Equateur outbreaka130 (42)
2021DRCButemboEBOV2018–2020 North Kivu outbreaka12 (50)
2021GuineaGouékéEBOV2013–2016 epidemica23 (52)
2021DRCButsiliEBOV2018–2020 North Kivu outbreaka11 (55)
2022DRCMbandakaEBOVSpillover5 (5)
2022DRCBeni Health ZoneEBOV2018–2020 North Kivu outbreaka1 (1)
2022UgandaMubendeSUDVSpillover164 (47)

Abbreviations: CFR, case fatality rate; DRC, Republic of the Congo; EBOV, Ebola virus; ROC, Republic of the Congo; SUDV, Sudan virus; TAFV, Taï Forest virus.

aPotential resurgence.

Table 2.

Ebola Disease Outbreak Origins 1976–2022

YearCountryInitial LocationVirusOrigin, Suspected SourceCases (CFR %)
1976South SudanNzaraSUDVSpillover284 (53)
1976DRCYambukuEBOVSpillover318 (88)
1977DRCBonduniEBOV1976 Yambuku outbreaka1 (100)
1979South SudanNzaraSUDVSpillover34 (64)
1994GabonMékoukaEBOVSpillover52 (60)
1994Cote d’IvoireTai National ParkTAFVSpillover1 (0)
1995DRCKikwitEBOVSpillover315 (81)
1996GabonMayiboutEBOVSpillover31 (67)
1996GabonBoouéEBOVSpillover60 (75)
2000–2001UgandaRwot-ObilloSUDVSpillover425 (53)
2001–2002GabonMendembaEBOVSpillover65 (82)
2001ROCOllobaEBOVSpillover59 (75)
2003ROCYembelangoye; MvoulaEBOVSpillover143 (90)
2003ROCMbandzaEBOVSpillover35 (83)
2004South SudanYambioSUDVSpillover17 (41)
2005ROCParc d’OdzalaEBOVSpillover12 (83)
2007DRCLueboEBOVSpillover264 (71)
2007UgandaKabangoBDBVSpillover149 (25)
2008–2009DRCLueboEBOV2007 Luebo outbreaka32 (47)
2011UgandaNakisamataSUDVSpillover1 (1)
2012DRCIsiroBDBVSpillover62 (55)
2012UgandaNyanswigaSUDVSpillover24 (71)
2012UgandaLuweroSUDVSpillover7 (57)
2013–2016GuineaMeliandouEBOVSpillover28 656 (40)
2014DRCIkanamongoEBOVSpillover69 (71)
2017DRCLikatiEBOVSpillover8 (50)
2018DRCIkoko-ImpengeEBOVSpillover54 (61)
2018–2020DRCManginaEBOVSpillover3470 (66)
2020DRCMbandakaEBOVSpillover and 2018 Equateur outbreaka130 (42)
2021DRCButemboEBOV2018–2020 North Kivu outbreaka12 (50)
2021GuineaGouékéEBOV2013–2016 epidemica23 (52)
2021DRCButsiliEBOV2018–2020 North Kivu outbreaka11 (55)
2022DRCMbandakaEBOVSpillover5 (5)
2022DRCBeni Health ZoneEBOV2018–2020 North Kivu outbreaka1 (1)
2022UgandaMubendeSUDVSpillover164 (47)
YearCountryInitial LocationVirusOrigin, Suspected SourceCases (CFR %)
1976South SudanNzaraSUDVSpillover284 (53)
1976DRCYambukuEBOVSpillover318 (88)
1977DRCBonduniEBOV1976 Yambuku outbreaka1 (100)
1979South SudanNzaraSUDVSpillover34 (64)
1994GabonMékoukaEBOVSpillover52 (60)
1994Cote d’IvoireTai National ParkTAFVSpillover1 (0)
1995DRCKikwitEBOVSpillover315 (81)
1996GabonMayiboutEBOVSpillover31 (67)
1996GabonBoouéEBOVSpillover60 (75)
2000–2001UgandaRwot-ObilloSUDVSpillover425 (53)
2001–2002GabonMendembaEBOVSpillover65 (82)
2001ROCOllobaEBOVSpillover59 (75)
2003ROCYembelangoye; MvoulaEBOVSpillover143 (90)
2003ROCMbandzaEBOVSpillover35 (83)
2004South SudanYambioSUDVSpillover17 (41)
2005ROCParc d’OdzalaEBOVSpillover12 (83)
2007DRCLueboEBOVSpillover264 (71)
2007UgandaKabangoBDBVSpillover149 (25)
2008–2009DRCLueboEBOV2007 Luebo outbreaka32 (47)
2011UgandaNakisamataSUDVSpillover1 (1)
2012DRCIsiroBDBVSpillover62 (55)
2012UgandaNyanswigaSUDVSpillover24 (71)
2012UgandaLuweroSUDVSpillover7 (57)
2013–2016GuineaMeliandouEBOVSpillover28 656 (40)
2014DRCIkanamongoEBOVSpillover69 (71)
2017DRCLikatiEBOVSpillover8 (50)
2018DRCIkoko-ImpengeEBOVSpillover54 (61)
2018–2020DRCManginaEBOVSpillover3470 (66)
2020DRCMbandakaEBOVSpillover and 2018 Equateur outbreaka130 (42)
2021DRCButemboEBOV2018–2020 North Kivu outbreaka12 (50)
2021GuineaGouékéEBOV2013–2016 epidemica23 (52)
2021DRCButsiliEBOV2018–2020 North Kivu outbreaka11 (55)
2022DRCMbandakaEBOVSpillover5 (5)
2022DRCBeni Health ZoneEBOV2018–2020 North Kivu outbreaka1 (1)
2022UgandaMubendeSUDVSpillover164 (47)

Abbreviations: CFR, case fatality rate; DRC, Republic of the Congo; EBOV, Ebola virus; ROC, Republic of the Congo; SUDV, Sudan virus; TAFV, Taï Forest virus.

aPotential resurgence.

Forty-eight primary/index cases were identified for the 35 EBOD outbreaks. The mean age of these cases was 32 years (range, 2–65 years). Thirty of 48 (62.5%) primary/index cases had available demographic information: of these thirty, 9 (30%) were hunters, 6 (20%) were children, 2 (6.67%) were health care workers, and 3 (10%) were pregnant. One primary/index case had a husband who was an EVD survivor. Full case details are available in our Index and Primary Cases dataset [29] and demographic details are included in Supplementary Table 4. The initial diagnoses for index cases included: malaria, unknown viral hemorrhagic fever, yellow fever, dysentery/gastroenteritis, and salmonellosis. Thirty-six of 48 (75%) were diagnosed only via clinical review of signs/symptoms and epidemiological history, whereas the other cases were diagnosed via reverse transcription polymerase chain reaction (RT-PCR), serology, and/or virus isolation. From 2021 to 2022 all 6 index cases were confirmed via RT-PCR.

Thirty-one of 35 (88.6%) EBOD outbreaks involved multiple cases, for which there were documented chains of transmission. Of these 31 outbreaks, 25 (80.6%) had reported nosocomial transmission, 14 (45.1%) had transmission from burial practices, and 11 (35.5%) were associated with bushmeat contact. An average of 637 days (range, 222–1690 days) occurred between the end of an original zoonotic outbreak and a potential subsequent flare-up. A logistic regression analysis showed that the odds of an EVD outbreak being associated with a future flare-up increased as the total cases increased (odds ratio, 1.013; 95% confidence interval, 1.002–1.028).

The maximum likelihood phylogenetic tree was constructed from 26 ebolavirus whole-genome sequences (16 EBOV, 7 SUDV, 2 BDBV sequences, and 1 TAFV) (Figure 2). There were multiple outbreaks in which phylogenetic and epidemiologic evidence supported the presumption of spillover as the etiology. The 1994, 1995, 1996, 2002, 2003, 2013, 2014, 2017, and 2018 EBOV genomes all had greater sequence variation and/or suspected wildlife sources, so these outbreaks likely started from different spillover events. Only sequences from the EBOV GP or nucleoprotein were available for some of the 2001–2003 outbreaks in the Republic of the Congo (ROC) and Gabon, so these were not included in our phylogenetic tree. Reviewing previous analyses of these sequences and their contexts revealed that there were 4 separate outbreaks during this period due to multiple epizootic and spillover events [33–35]. For each of these outbreaks there were multiple suspected primary/index cases in the context of different spillover sources, chains of transmission, and sequences. We were also unable to find a whole-genome sequence for the 2005 ROC outbreak, where the primary/index cases were 2 hunters with bushmeat contact [36]. We also found that the representative sequences from the 2013–2016 epidemic and the 2021 outbreak in Guinea were closely related (28 nucleotide differences), which was consistent with prior analyses [2].

Maximum likelihood phylogenetic tree of ebolavirus whole genomes. Representative whole-genome sequences of EBOV, SUDV, TAFV, and BDBV from 1976 to 2021 were used to construct the phylogenetic tree. Numbers above the main nodes are bootstrap values (100 replicates). Each tip is labeled by the ebolavirus, year of onset, and initial country or province/district of each outbreak. Abbreviations: BDBV, Bundibugyo virus; DRC, Republic of the Congo; EBOV, Ebola virus; ROC, Republic of the Congo; SUDV, Sudan virus; TAFV, Taï Forest virus
Figure 2.

Maximum likelihood phylogenetic tree of ebolavirus whole genomes. Representative whole-genome sequences of EBOV, SUDV, TAFV, and BDBV from 1976 to 2021 were used to construct the phylogenetic tree. Numbers above the main nodes are bootstrap values (100 replicates). Each tip is labeled by the ebolavirus, year of onset, and initial country or province/district of each outbreak. Abbreviations: BDBV, Bundibugyo virus; DRC, Republic of the Congo; EBOV, Ebola virus; ROC, Republic of the Congo; SUDV, Sudan virus; TAFV, Taï Forest virus

There were a few historic outbreaks that may have originated from resurgence of prior outbreak strains based on phylogenetic and epidemiologic evidence. The 1977 sequence from Tandala/Bonduni was 99.97%–99.99% similar (2–5 nucleotide differences) to the 3 sequences from the 1976 outbreak in Yambuku (isolates named Mayinga, deRoover, and Ecran). In fact, the Bonduni sequence was more similar to the deRoover isolate (2 nucleotide differences) than all the 1976 isolates were to each other (5 nucleotide differences). Given this sequence similarity, it is possible that the 1977 outbreak may have originated from a resurgence event instead of spillover. The locations of the index cases are roughly 390 kilometers apart and there was no clear epidemiologic chain of transmission. However, during the 1976 outbreak cases were detected within 120 kilometers of Yambuku, and an additional cluster occurred via even further travel by a nurse to Kinshasa [37]. Therefore, a human origin of the 1977 outbreak could not be ruled out.

Given close similarities between the 2007 and 2008 Luebo strains, an additional alignment was done for the 2008 sequence with 5 sequences from the 2007 outbreak in the same location. The sequence from the 2008 Luebo outbreak had 99.9%–99.94% sequence similarity (11–19 nucleotide differences) to the 2007 viruses that occurred in the same location. A prior analysis of the nucleotide substitutions between these outbreak strains postulated that the 2008 virus could be a direct descendent of the 2007 strain [38]. Given the sequence similarities and epidemiologic relationships between these outbreaks, it is possible that the 2008 outbreak originated from resurgence of human-to-human transmission instead of spillover.

The 1979 SUDV sequence from Nzara was 99.75% similar (46 nucleotide differences) to the 1976 outbreak in the same region. The 2004 SUDV strain from Yambio was 99.6% similar (70 nucleotide differences) to the nearby 1976 Nzara sequence. The 2000, 2011, and 2012 SUDV sequences were more divergent, and the preliminary 2022 sequence was not included in our phylogenetic analysis [39]. The evolutionary rate of SUDV from sustained human-to-human transmission or persistent infection has not been determined, and it is unknown whether SUDV can cause flare-ups, so it was assumed that these outbreaks were all separate spillover events. The 2012 Isiro BDBV sequence was 98.6% similar (252 nucleotide differences) to the 2007 Uganda BDBV sequence; there were no identified sources for these outbreaks, and given the sequence differences it was assumed that they occurred from separate spillover events.

DISCUSSION

We identified multiple origins for EBOD outbreaks. While wildlife spillover seemed the likely etiology for the majority of outbreaks, there are both recent and historical outbreaks that may have emerged from human-to-human transmission of prior outbreak strains instead of spillover from wildlife. We identified 2 historic outbreaks (1977 Tandala/Bonduni and 2008 Luebo) that were previously deemed due to spillover events, but it now seems difficult to exclude resurgence of human-to-human transmission as the potential origin for these outbreaks. Since the 2013–2016 West Africa epidemic, 5 EBOD outbreaks have been considered to originate from possible flare-ups of previous outbreak strains. Given these findings we must reconsider the ways that we prepare for future outbreaks.

Previous research on ebolaviruses has focused on mitigating spillover through reducing contact with wildlife and bushmeat. While roughly a third of all EBOD outbreaks and primary/index cases involved suspected bushmeat contact, this is not the only source that we must consider. Identifying which human populations are at risk for reemergence of outbreak strains and increasing surveillance among these populations will also be key to outbreak prevention. Additionally, understanding risk factors for resurgence, such as whether certain antibody-based therapies increase the likelihood of viral persistence, will also be important to determining populations at risk for future outbreaks.

We found that as cases increase during a particular EVD outbreak, there appear to be greater odds of subsequent resurgence. Therefore, early detection and mitigation of EBOD outbreaks is not only critical for stopping current outbreaks but also for preventing future outbreaks. Identifying regions that have had high numbers of survivors from prior outbreaks could help target surveillance and improve early detection of future outbreaks. While we found that there were fewer historical outbreaks that seemed to originate from delayed sexual transmission or relapse of infection, a potential confounding factor is that detection of flare-ups may have increased recently due to enhanced diagnostic methods.

We also found that the majority of EBOD outbreaks were associated with nosocomial transmission. Equipping and training responders with personal protective equipment and the appropriate resources to quickly diagnose and isolate patients will be crucial to stopping health care-acquired transmission. Additionally, multiple transmission events occurred among family groups during burials and by sexual transmission from survivors. Community-based approaches are therefore essential for educating about ebolavirus transmission while also mitigating stigma.

Our epidemiologic and phylogenetic analyses provide a rough estimation of what outbreaks could plausibly occur from resurgence of prior strains, but there are multiple limitations. Given limited sequence data from wildlife, the genetic diversity in wildlife reservoirs remains unknown. Although it has been observed that there are greater differences between strains from separate spillover events compared to human-to-human transmission, further wildlife and human sequences are needed to clarify these relationships and the degree of genetic variation. We also used representative whole-genome sequences from each outbreak to estimate phylogenetic relationships because of limited availability of multiple sequences from different outbreaks. We used previously published estimates of EBOV evolutionary rates as a guide, but estimates may vary depending on the outbreak strain and size of the outbreak. Additionally, ebolaviruses may exhibit heterogeneous evolutionary rates during persistent infection [26], and further studies among survivors are needed to determine these phylogenetic relationships.

Another limitation in our phylogenetic analysis is that methods for sequencing and virus isolation have changed over time, which could influence comparisons of evolutionary relationships. The ability to detect genetic variation within and between outbreaks has improved, making it difficult to compare sequences determined multiple decades ago to those from recent outbreaks. Virus isolates used to go through plaque purification prior to sequencing, which could introduce cell-adapted mutations and clonal selection, which could minimize nucleotide variations. Resequencing archived samples using newer methods and as well as more in-depth phylogenetic analyses including evolutionary rate estimations could provide further information about genetic relationships between ebolaviruses.

The evolutionary rates of SUDV, BDBV, and TAFV are less characterized, and it is also unknown whether these viruses can cause resurgent outbreaks through sexual transmission or relapse of infection. Additional phylogenetic analyses and cohort studies of survivors from infections from these other ebolaviruses are needed to determine genetic variation and the risk for resurgence. A recent genomic analysis of BDBV sequences from the 2012 outbreak supports the hypothesis that multiple spillover events could have contributed to this outbreak, but further epidemiologic and phylogenetic data are needed [40].

Ebolaviruses can emerge in human populations in multiple ways. Recently, there have been large EBOD outbreaks, which create the potential for future resurgence. Therefore, we need additional research and resources dedicated to improving risk assessment and early detection in human populations. Improving the rapid detection and isolation of primary and index cases will be essential to preventing further EBOD outbreaks.

Supplementary Data

Supplementary materials are available at The Journal of Infectious Diseases online. Consisting of data provided by the authors to benefit the reader, the posted materials are not copyedited and are the sole responsibility of the authors, so questions or comments should be addressed to the corresponding author.

Notes

Author contributions. S. D. J. contributed conceptualization, analysis, and figures, and wrote the original draft. S. D. J. and V. J. M. reviewed and edited the manuscript.

Disclaimer. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Data availability. All relevant data are within the manuscript and its online files. Datasets and analyses are available online (https://github.com/sethdjudson/ebola-origins) and can be cited at https://doi.org/10.6084/m9.figshare.22257433.v4.

Financial support. This work was supported by the National Institute of Health (grant number T32 AI007291–27 to S. D. J.). V. J. M. is supported by the Intramural Research Program of the National Institute of Allergy and Infectious Diseases, National Institutes of Health.

Supplement sponsorship. This article appears as part of the supplement “10th International Symposium on Filoviruses.”

References

1

Feldmann
H
,
Geisbert
TW
.
Ebola haemorrhagic fever
.
Lancet
2011
;
377
:
849
62
.

2

Keita
AK
,
Koundouno
FR
,
Faye
M
, et al.
Resurgence of Ebola virus in 2021 in Guinea suggests a new paradigm for outbreaks
.
Nature
2021
;
597
:
539
43
.

3

Kuhn
JH
,
Adachi
T
,
Adhikari
NKJ
, et al.
New filovirus disease classification and nomenclature
.
Nat Rev Microbiol
2019
;
17
:
261
.

4

Judson
SD
,
Fischer
R
,
Judson
A
,
Munster
VJ
.
Ecological contexts of index cases and spillover events of different ebolaviruses
.
PLOS Pathogens
2016
;
12
:
e1005780
.

5

Leroy
EM
,
Kumulungui
B
,
Pourrut
X
, et al.
Fruit bats as reservoirs of Ebola virus
.
Nature
2005
;
438
:
575
6
.

6

Mate
SE
,
Kugelman
JR
,
Nyenswah
TG
, et al.
Molecular evidence of sexual transmission of Ebola virus
.
N Engl J Med
2015
;
373
:
2448
54
.

7

Diallo
B
,
Sissoko
D
,
Loman
NJ
, et al.
Resurgence of Ebola virus disease in Guinea linked to a survivor with virus persistence in seminal fluid for more than 500 days
.
Clin Infect Dis
2016
;
63
:
1353
6
.

8

Thorson
AE
,
Deen
GF
,
Bernstein
KT
, et al.
Persistence of Ebola virus in semen among Ebola virus disease survivors in Sierra Leone: a cohort study of frequency, duration, and risk factors
.
PLoS Medicinec
2021
;
18
:
e1003273
.

9

Varkey
JB
,
Shantha
JG
,
Crozier
I
, et al.
Persistence of Ebola virus in ocular fluid during convalescence
.
N Engl J Med
2015
;
372
:
2423
7
.

10

Jacobs
M
,
Rodger
A
,
Bell
DJ
, et al.
Late Ebola virus relapse causing meningoencephalitis: a case report
.
Lancet
2016
;
388
:
498
503
.

11

Jacob
ST
,
Crozier
I
,
Fischer
WA
, et al.
Ebola virus disease
.
Nat Rev Dis Primers
2020
;
6
:
1
31
.

12

Mbala-Kingebeni
P
,
Pratt
C
,
Mutafali-Ruffin
M
, et al.
Ebola virus transmission initiated by relapse of systemic Ebola virus disease
.
N Engl J Med
2021
;
384
:
1240
7
.

13

Liu
J
,
Trefry
JC
,
Babka
AM
, et al.
Ebola virus persistence and disease recrudescence in the brains of antibody-treated nonhuman primate survivors
.
Sci Transl Med
2022
;
14
:
eabi5229
.

14

Virological.
Ebolavirus. Oct 2021 EVD case in DRC linked to 2018–2020 Nord Kivu EVD outbreak.
https://virological.org/t/oct-2021-evd-case-in-drc-linked-to-2018-2020-nord-kivu-evd-outbreak/762. Accessed 15 February 2022.

15

Virological.
Ebolavirus. April 2022 Ebola virus disease case in Equateur Province, DRC, represents a new spillover event.
https://virological.org/t/april-2022-ebola-virus-disease-case-in-equateur-province-drc-represents-a-new-spillover-event/795. Accessed 15 February 2022.

16

Virological.
Ebolavirus. Two Ebola virus variants circulating during the 2020 Equateur Province outbreak.
https://virological.org/t/two-ebola-virus-variants-circulating-during-the-2020-equateur-province-outbreak/538. Accessed 15 February 2022.

17

Virological.
Ebolavirus. August 2022 EVD case in DRC linked to 2018–2020 Nord Kivu EVD outbreak.
https://virological.org/t/august-2022-evd-case-in-drc-linked-to-2018-2020-nord-kivu-evd-outbreak/889. Accessed 15 February 2022.

18

Centers for Disease Control and Prevention
.
February 2021 Democratic Republic of the Congo, North Kivu Province.
https://www.cdc.gov/vhf/ebola/outbreaks/drc/2021-february.html. Accessed 15 February 2022.

19

Rodriguez
LL
,
De Roo
A
,
Guimard
Y
, et al.
Persistence and genetic stability of Ebola virus during the outbreak in Kikwit, Democratic Republic of the Congo, 1995
.
J Infect Dis
1999
;
179
(
Suppl 1
):
S170
6
.

20

Leroy
EM
,
Rouquet
P
,
Formenty
P
, et al.
Multiple Ebola virus transmission events and rapid decline of central African wildlife
.
Science
2004
;
303
:
387
90
.

21

Leroy
EM
,
Baize
S
,
Mavoungou
E
,
Apetrei
C
.
Sequence analysis of the GP, NP, VP40 and VP24 genes of Ebola virus isolated from deceased, surviving and asymptomatically infected individuals during the 1996 outbreak in Gabon: comparative studies and phylogenetic characterization
.
J Gen Virol
2002
;
83
:
67
73
.

22

Georges-Courbot
MC
,
Lu
CY
,
Lansoud-Soukate
J
,
Leroy
E
,
Baize
S
.
Isolation and partial molecular characterisation of a strain of Ebola virus during a recent epidemic of viral haemorrhagic fever in Gabon
.
Lancet
1997
;
349
:
181
.

23

Hoenen
T
,
Groseth
A
,
Rosenke
K
, et al.
Nanopore sequencing as a rapidly deployable Ebola outbreak tool
.
Emerg Infect Dis
2016
;
22
:
331
.

24

Holmes
EC
,
Dudas
G
,
Rambaut
A
,
Andersen
KG
.
The evolution of Ebola virus: insights from the 2013–2016 epidemic
.
Nature
2016
;
538
:
193
200
.

25

Blackley
DJ
,
Wiley
MR
,
Ladner
JT
, et al.
Reduced evolutionary rate in reemerged Ebola virus transmission chains
.
Sci Adv
2016
;
2
.

26

Whitmer
SLM
,
Ladner
JT
,
Wiley
MR
, et al.
Active Ebola virus replication and heterogeneous evolutionary rates in EVD survivors
.
Cell Rep
2018
;
22
:
1159
68
.

27

Centers for Disease Control and Prevention
.
History of Ebola outbreaks.
https://www.cdc.gov/vhf/ebola/history/chronology.html#anchor_1526565058132. Accessed 15 February 2022.

28

World Health Organization, Regional Office for Africa
. Technical guidelines for integrated disease surveillance and response in the WHO African region, 3rd edn.
https://www.afro.who.int/publications/technical-guidelines-integrated-disease-surveillance-and-response-african-region-third. Accessed 15 February 2022.

29

Judson
SD
,
Munster
VJ
.
The multiple origins of Ebola disease outbreaks (datasets). figshare, 2023.
https://doi.org/10.6084/m9.figshare.22257433.v4. Accessed 15 February 2022.

30

Katoh
K
,
Standley
DM
.
MAFFT multiple sequence alignment software version 7: improvements in performance and usability
.
Mol Biol Evol
2013
;
30
:
772
80
.

31

Guindon
S
,
Dufayard
JF
,
Lefort
V
,
Anisimova
M
,
Hordijk
W
,
Gascuel
O
.
New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0
.
Syst Biol
2010
;
59
:
307
21
.

32

Mylne
A
,
Brady
OJ
,
Huang
Z
, et al.
A comprehensive database of the geographic spread of past human Ebola outbreaks
.
Sci Data
2014
;
1
:
140042
.

33

Pourrut
X
,
Kumulungui
B
,
Wittmann
T
, et al.
The natural history of Ebola virus in Africa
.
Microbes Infect
2005
;
7
:
1005
14
.

34

Boumandouki
P
,
Formenty
P
,
Epelboin
A
, et al.
L ‘épidémie de fièvre hémorragique due au virus Ebola d ‘Octobre à Décembre 2003 au Congo
.
P rise en charge des malades et des défunts lors de 2005
.
Bull Soc Pathol Exot
.
2005
;
98
:
218
23
.

35

Nkoghe
D
,
Formenty
P
,
Leroy
E
, et al.
Plusieurs epidémies de fièvre hémorragique due au virus Ebola au Gabon, d'Octobre 2001 a Avril 2002. [Multiple Ebola virus haemorrhagic fever outbreaks in Gabon, from October 2001 to April 2002]
.
Bull Soc Pathol Exot
2005
;
98
:
224
9
.

36

Nkoghe
D
,
Kone
ML
,
Yada
A
,
Leroy
E
.
A limited outbreak of Ebola haemorrhagic fever in Etoumbi, Republic of Congo, 2005
.
Trans R Soc Trop Med Hyg
2011
;
105
:
466
72
.

37

Burke
J
,
Declerq
R
,
Ghysebrechts
G
.
Ebola haemorrhagic fever in Zaire, 1976.
Bull World Health Organ
1978
;
56
:
271
.

38

Grard
G
,
Biek
R
,
Muyembe Tamfum
JJ
, et al.
Emergence of divergent Zaire Ebola virus strains in Democratic Republic of the Congo in 2007 and 2008
.
J Infect Dis
2011
;
204
(
Suppl 3
):
S776
84
.

39

Virological.
Ebolavirus. September 2022 Sudan Ebola virus disease outbreak in Uganda.
https://virological.org/t/september-2022-sudan-ebola-virus-disease-outbreak-in-uganda/902. Accessed 15 February 2022.

40

Hulseberg
CE
,
Kumar
R
,
Di Paola
N
, et al.
Molecular analysis of the 2012 Bundibugyo virus disease outbreak
.
Cell Rep Med
2021
;
2
:
100351
.

Author notes

Presented in part: IDWeek, 18–21 September 2022, San Diego, CA, USA, poster presentation by S. D. J.

Potential conflicts of interest. Both authors: No reported conflicts. Both authors have submitted the ICMJE Form for Disclosure of Potential Conflicts of Interest. Conflicts that the editors consider relevant to the content of the manuscript have been disclosed.

This work is written by (a) US Government employee(s) and is in the public domain in the US.

Supplementary data