-
PDF
- Split View
-
Views
-
Cite
Cite
Nadine Bachmann, Katharina Kusejko, Huyen Nguyen, Sandra E Chaudron, Claus Kadelka, Teja Turk, Jürg Böni, Matthieu Perreau, Thomas Klimkait, Sabine Yerly, Manuel Battegay, Andri Rauch, Alban Ramette, Pietro Vernazza, Enos Bernasconi, Matthias Cavassini, Huldrych F Günthard, Roger D Kouyos, Swiss HIV Cohort Study , Phylogenetic Cluster Analysis Identifies Virological and Behavioral Drivers of Human Immunodeficiency Virus Transmission in Men Who Have Sex With Men, Clinical Infectious Diseases, Volume 72, Issue 12, 15 June 2021, Pages 2175–2183, https://doi.org/10.1093/cid/ciaa411
- Share Icon Share
Abstract
Identifying local outbreaks and their drivers is a key step toward curbing human immunodeficiency virus (HIV) transmission and potentially achieving HIV elimination. Such outbreaks can be identified as transmission clusters extracted from phylogenetic trees constructed of densely sampled viral sequences. In this study, we combined phylogenetic transmission clusters with extensive data on virological suppression and behavioral risk of cluster members to quantify the drivers of ongoing transmission over 10 years.
Using the comprehensive Swiss HIV Cohort Study and its drug-resistance database, we reconstructed phylogenetic trees for each year between 2007 and 2017. We identified HIV transmission clusters dominated by men who have sex with men (MSM) and determined their annual growth. We used Poisson regression to assess if cluster growth was associated with a per-cluster infectivity and behavioral risk score.
Both infectivity and behavioral risk scores were significantly higher in growing MSM transmission clusters compared to nongrowing clusters (P ≤ .01). The fraction of transmission clusters without infectious members acquiring new infections increased significantly over the study period. The infectivity score was significantly associated with per-capita incidence of MSM transmission clusters in 8 years, while the behavioral risk score was significantly associated with per-capita incidence of MSM transmission clusters in 3 years.
We present a phylogenetic method to identify hotspots of ongoing transmission among MSM. Our results demonstrate the effectiveness of treatment as prevention at the population level. However, the significantly increasing number of new infections among transmission clusters without infectious members highlights a relative shift from diagnosed to undiagnosed individuals as drivers of HIV transmission in Swiss MSM.
(See the Editorial Commentary by Mehta and Little on pages 2184–6.)
The global epidemic of human immunodeficiency virus (HIV) is still not under control, with approximately 1.7 million new HIV infections in 2018 [1]. As the HIV community embraces the Joint United Nations Programme on HIV/AIDS 90-90-90 targets for HIV diagnosis, treatment, and viral suppression, the focus has largely been on increasing access to antiretroviral treatment (ART) [2]. However, reducing HIV incidence remains a challenge in HIV medicine. Thus, optimizing prevention using state-of-the-art methodology and allocating resources for prevention to subpopulations at risk are crucial to curbing the epidemic.
Molecular epidemiology is essential for the improvement of prevention of new HIV infections [3, 4]. The possibilities of molecular epidemiology have advanced with increasing computational power and availability of densely sampled HIV sequencing data from genotypic resistance testing (GRT). Representative HIV phylogenies can be reconstructed to subsequently extract phylogenetic clusters. Epidemiologically, these phylogenetic clusters reflect transmission chains that are or were ongoing and can identify local outbreaks within epidemics. Subgroups within the epidemic with a higher risk of leading to local outbreaks may be identified by specific clinical, demographic, or behavioral factors [5–9]. This study presents a new longitudinal method of identifying HIV clusters at risk of transmission, combining molecular epidemiology tools with classical clinical and behavioral risk factors. Our method was applied to men who have sex with men (MSM) enrolled in the Swiss HIV Cohort Study (SHCS) as a case study.
Within the SHCS it has been shown that more than half of Swiss people with HIV were included in transmission clusters and that most of these individuals were infected via the same transmission route [10]. MSM having unprotected anal sex represent the major risk group for ongoing transmission of HIV in Western Europe and North America [11]. Based on numbers of newly diagnosed cases over the last decade, the epidemic in MSM does not seem to be under control [12]. Recent studies in this community indicate reduced condom use in some subgroups of MSM [9, 13, 14]. Additionally, the numbers of HIV-positive MSM infected with sexually transmitted infections, such as syphilis, chlamydia, and hepatitis C, have been on the rise in the past decade [15–19].
In 2011 global HIV prevention efforts progressed considerably when the HIV Prevention Trials Network (HPTN) 052 trial demonstrated that ART reduces the sexual transmission of HIV in HIV-serodiscordant couples by > 96% [20]. In MSM, the risk of HIV transmission through condomless sex when the HIV viral load is suppressed was recently shown to be effectively zero (upper 95% confidence interval of 0.23 per 100 couple-years of follow-up) [21], thereby confirming the concept of treatment as prevention (TasP) on the individual level for MSM. The impact of TasP at the population level is less clear. The results of the HPTN 071 (PopART) trial revealed that initiating ART immediately after diagnosis did not significantly reduce incidence [22]. In Switzerland, TasP was gradually introduced after 2008 based on the Swiss statement [23]. What still remains controversial about TasP is its impact on risk behavior of HIV-positive individuals potentially compensating effects of TasP by increasing risk behavior [16]. Utilizing the comprehensive database of the SHCS, we sought to better understand impacts of TasP and the potentially increased risk behavior on HIV transmission in MSM at both the individual and population levels.
METHODS
The Swiss HIV Cohort Study
The SHCS is a large, prospective, multicenter study established in 1988 [24]. At each semiannual follow-up, clinical and laboratory data of quarterly visits are obtained from SHCS participants. The SHCS has an estimated coverage of approximately 75% of all HIV-positive individuals on ART in Switzerland [24]. It includes a drug resistance database that contains HIV sequences for approximately 60% of the > 20 000 SHCS participants and 75% of all participants enrolled after 1995, because > 11 000 genotypes were retrospectively sequenced from the SHCS biobank [25, 26]. Written informed consent was obtained from each SHCS study participant.
Phylogenetic Tree Construction
For each of the years from 2007 to 2017, we considered all partial pol sequences of SHCS participants, which were available prior to the beginning of the respective year and had adequate quality (at least 250 bp from protease and at least 500 bp from reverse transcriptase). If multiple sequences per patient were available, the first one was considered. Importantly, all sequences included in a certain year were also included in subsequent years. For each year, we blasted non-Swiss Los Alamos background sequences against the described choice of SHCS sequences (targeting the 10 closest sequences with a threshold of 90%). SHCS and Los Alamos sequences were pooled and pairwise aligned to the HXB2 reference genome using MUSCLE [27]. Insertions relative to HXB2 and resistance mutations according to the Stanford (http://hivdb.stanford.edu/) and International Antiviral Society–USA (https://www.iasusa.org/) lists were removed, and positions with many gaps were eliminated by trimAl [28]. Maximum likelihood phylogenetic trees were reconstructed with FastTree using a generalized time-reversible model [29]. We obtained 11 phylogenetic trees reflecting the observed transmission network within the SHCS in the beginning of each year from 2007 to 2017 (Figure 1).
![Methods overview. For all pairs of 2 consecutive years (year x and year x + 1, x being one of the years from 2007 to 2016), phylogenetic trees were built of sequences available prior to the respective years (green tips refer to men who have sex with men [MSM] transmission clusters; blue tips refer to all other). We extracted MSM transmission clusters from these phylogenetic trees and checked which MSM transmission clusters were preexistent in the previous year with at least 50% of the already infected members in MSM transmission clusters. For all preexisting clusters, we calculated the number of newly infected MSM that were acquired by the respective cluster within the year of interest. Furthermore, we assigned a behavioral risk and an infectivity score to each preexistent cluster of year x. We performed a regression analysis to investigate the association of behavioral risk and infectivity score and the number of newly infected individuals with MSM transmission clusters (indicated by question mark). Abbreviations: HIV, human immunodeficiency virus; MSM, men who have sex with men.](https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/cid/72/12/10.1093_cid_ciaa411/1/m_ciaa411f0001.jpeg?Expires=1747938107&Signature=LfqN0ikZdnYElhKCUNB24DYmGiTOsxtOfo0fIrPaN9uXdo5an7H--fmJ4SpVuNUe1~5-BQawGDQCEijvQgzRudPhP-byg8sQuzbwOj~t32rt1kR~vbCKWMjjE9b8TvUUg03pv-xB~E4bU81Ja3js8vtQsE0hqqCmNzDkjHUS1TLw5vjVXYwgg1Tglj-FHp38e01XqqpDRAZMdvx3J4ydvJfrDuyfM381PBvRQXpmdpWmskX8uExYUtjVjLKdaASmgDmoS~IhA~hFztt5jIK7XsPlMTGVNeEHQp3hNoZOLmon6DMv-obNKxHSoFmKwQlNNOU9xV8NDlWPeCsCWmLH9A__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA)
Methods overview. For all pairs of 2 consecutive years (year x and year x + 1, x being one of the years from 2007 to 2016), phylogenetic trees were built of sequences available prior to the respective years (green tips refer to men who have sex with men [MSM] transmission clusters; blue tips refer to all other). We extracted MSM transmission clusters from these phylogenetic trees and checked which MSM transmission clusters were preexistent in the previous year with at least 50% of the already infected members in MSM transmission clusters. For all preexisting clusters, we calculated the number of newly infected MSM that were acquired by the respective cluster within the year of interest. Furthermore, we assigned a behavioral risk and an infectivity score to each preexistent cluster of year x. We performed a regression analysis to investigate the association of behavioral risk and infectivity score and the number of newly infected individuals with MSM transmission clusters (indicated by question mark). Abbreviations: HIV, human immunodeficiency virus; MSM, men who have sex with men.
Extraction of MSM Transmission Clusters
We identified transmission clusters using phylogenetic tree topology including Los Alamos background sequences to split individual groups that were not closely related in the transmission chain into phylogenetic clades consisting of at least 80% SHCS sequences. As we aimed to study MSM transmission clusters, we discarded transmission clusters if they did not consist of at least 50% Swiss MSM. This definition allowed us to also capture transmission clusters that did not exclusively contain MSM, which is especially important considering MSM who did not disclose as MSM and women infected by bisexual men. We did not consider newly emerging clusters as we were interested in the predictive power of the clusters that already existed the year before. Therefore, we defined “preexistent” clusters as clusters that existed in the previous year containing at least 50% of the already infected members appearing in MSM transmission clusters (Figure 1). Transmission clusters were extracted using custom scripts incorporating the R package APE [30].
Identification of Newly Infected MSM
For each study year, we identified all MSM in the SHCS who were newly infected in that year and reported to have been infected in Switzerland. (For the new infections considered, we excluded MSM reporting to have been infected abroad. This information is only collected in the SHCS since 2007; thus, we did not consider earlier years.) The time period spanned the first to the last day of the year (ie, newly infected MSM in 2016 were infected between 1 January 2016 and 31 December 2016). MSM were considered newly infected in a certain year, if they both (1) had their first GRT done within the year and (2) their estimated date of infection was within the year (MSM with first GRT and estimated date of infection in different years were discarded as newly infected).
We used the following 2 methods to estimate HIV infection dates as accurately as possible:
Defined HIV primary infection: For individuals with available seroconversion dates (negative and positive HIV tests < 1 year apart) or with a diagnosis of primary infection as previously described [31], we used the midpoint between the dates of the negative and the positive test, or, if known, the date of the primary infection as best estimate of the HIV infection date.
Defined recent HIV infection: For individuals who (1) exhibited low HIV diversity (< 0.5% of ambiguous nucleotides) in a genotypic HIV drug resistance test within the first year after diagnosis and (2) had > 200 CD4+ cells/µL blood at cohort registration [32–34], we used the diagnosis date as best estimate of the HIV infection date.
Individuals for whom neither (1) nor (2) was possible to estimate the date of infection were viewed as not newly infected within that given year and thus excluded from analysis (ie, newly diagnosed chronic infections were excluded from our analysis).
Definition of Explanatory Variables
For each MSM transmission cluster and each considered calendar year, we calculated the following explanatory variables using all individuals within the transmission clusters (not only those who disclosed as MSM).
Annual Infectivity Score
For each MSM transmission cluster, the infectivity score is defined as the fraction of cluster members who had at least 1 viral load > 1000 copies in the year of interest. Laboratory values such as the viral load are usually measured during every quarterly to half-yearly follow-up visit for individuals participating in the SHCS.
Annual Behavioral Risk Score
For each MSM transmission cluster, the behavioral risk score is defined as the fraction of cluster members who either reported risky behavior and/or had a coinfection with hepatitis B virus (HBV), hepatitis C virus (HCV), or syphilis as a surrogate for risky behavior. Risky behavior was defined as reported condomless anal or vaginal intercourse with occasional partners or refusal to answer this question (a median of 3 [interquartile range, 2–5] participants per year refused an answer). HBV coinfection was present if both the qualitative result of an hepatitis B virus core antibody test and the qualitative result of a hepatitis B surface antigen test were positive. HCV coinfection was present if either the qualitative result of a HCV antibody test was positive or HCV RNA viral load from a quantitative test was detectable. Syphilis coinfection was present if the qualitative result of the screening test and the interpretation of the quantitative result were positive. Coinfections were primarily included as proxies for risk behavior [19] and not because we assumed that they biologically facilitate HIV transmission.
Statistical Analysis
Regression Analysis
To evaluate the infectivity and behavioral risk score as predictors for new infections, we performed multivariable Poisson regression analyses using behavioral risk score and infectivity score as predictor variables. For each study year, we identified preexisting clusters from the beginning of that year and the beginning of the subsequent year and quantified the number of newly infected MSM within each cluster. We then used the number of new infections within each cluster as an outcome variable and included infectivity and behavioral risk score of each cluster from the previous year as predictor variables. By separating predictor and outcome variables by time, we aimed at establishing a causal relationship (ie, to test that the former cause the latter). Furthermore, we included the logarithm of the number of cluster members as an offset to exclude the risk of observing an effect driven by the higher chance of acquisition of newly infected MSM in larger MSM transmission clusters. Thus, we model the numbers of new infections per cluster as
where
and thus
is the per-capita growth rate of the clusters. Accordingly, the exponential of the coefficients derived from the Poisson regression analysis can be interpreted as the effect from the risk and the infectivity scores on the per-capita cluster growth rate of the considered transmission clusters. In other words, these coefficients can be interpreted as the per-capita incidence risk ratios.
P Values for Differences and Time Trends
P values for time trends over our study period were derived using a linear regression analysis. P values for differences were derived using the paired Wilcoxon rank-sum test.
RESULTS
Study Population of MSM Within MSM Transmission Clusters
Table 1 displays characteristics of MSM within MSM transmission clusters and of all nonclustered MSM on the respective phylogeny for the first and last study year (Supplementary Table 1 for all study years). Of the individuals in the MSM transmission clusters, at least 77.9% were MSM, while up to 9.5% of individuals were males reporting infection through heterosexual contacts (Supplementary Figure 1).
Participant Characteristics in the First (2007) and the Last (2017) Year of the Study
Characteristic . | MSM in Clusters . | Nonclustered MSM on Tree . | Pcluster/tree . | MSM Not on Phylogeny . | Ptree/SHCS . |
---|---|---|---|---|---|
First year of study (2007) | |||||
No. | 1767 | 1124 | 785 | ||
Age, y, median (IQR) | 45 (40–52) | 45 (40–54) | .091 | 49 (43–58) | < .001 |
White ethnicity | 95.59% | 92.45% | < .001 | 93% | .51 |
CD4 cell count at diagnosis, median (IQR) | 367 (196.5–557) | 345 (184–537) | .065 | 290 (115–487) | < .001 |
Year of SHCS registration, median (IQR) | 1998 (1994–2003) | 1998 (1994–2003) | .5 | 1995 (1992–2000) | < .001 |
Last year of study (2017) | |||||
No. | 3429 | 1544 | 1372 | ||
Age, y, median (IQR) | 51 (44–58) | 52 (43–60) | .056 | 53 (45–63) | .003 |
White ethnicity | 92.97% | 86.69% | < .001 | 88.63% | .104 |
CD4 cell count at diagnosis, median (IQR) | 381.5 (222–561) | 367.5 (200–546.5) | .03 | 369.5 (194–577.25) | .735 |
Year of SHCS registration, median (IQR) | 2005 (1997–2010) | 2004.5 (1996–2010) | .888 | 2004.5 (1995–2012) | .301 |
Characteristic . | MSM in Clusters . | Nonclustered MSM on Tree . | Pcluster/tree . | MSM Not on Phylogeny . | Ptree/SHCS . |
---|---|---|---|---|---|
First year of study (2007) | |||||
No. | 1767 | 1124 | 785 | ||
Age, y, median (IQR) | 45 (40–52) | 45 (40–54) | .091 | 49 (43–58) | < .001 |
White ethnicity | 95.59% | 92.45% | < .001 | 93% | .51 |
CD4 cell count at diagnosis, median (IQR) | 367 (196.5–557) | 345 (184–537) | .065 | 290 (115–487) | < .001 |
Year of SHCS registration, median (IQR) | 1998 (1994–2003) | 1998 (1994–2003) | .5 | 1995 (1992–2000) | < .001 |
Last year of study (2017) | |||||
No. | 3429 | 1544 | 1372 | ||
Age, y, median (IQR) | 51 (44–58) | 52 (43–60) | .056 | 53 (45–63) | .003 |
White ethnicity | 92.97% | 86.69% | < .001 | 88.63% | .104 |
CD4 cell count at diagnosis, median (IQR) | 381.5 (222–561) | 367.5 (200–546.5) | .03 | 369.5 (194–577.25) | .735 |
Year of SHCS registration, median (IQR) | 2005 (1997–2010) | 2004.5 (1996–2010) | .888 | 2004.5 (1995–2012) | .301 |
MSM transmission clusters were defined as phylogenetic transmission clusters consisting of at least 50% MSM. The first column “MSM in clusters” refers to MSM cluster members, the second column “nonclustered MSM on tree” refers to all MSM who were on the respective phylogeny (ie, with available genotypic resistance testing [GRT]), while the column “MSM not on phylogeny” includes all MSM registered in the SHCS but not on the phylogeny (ie, individuals without sequencing data) with follow-up data after the beginning of the year 1995, when the collection of plasma samples was initiated. P values comparing characteristics of MSM in clusters to nonclustered MSM on the tree and all MSM included in the analysis to all MSM without GRT, respectively, were derived using paired Wilcoxon rank-sum test.
Abbreviations: HIV, human immunodeficiency virus; IQR, interquartile range; SHCS, Swiss HIV Cohort Study.
Participant Characteristics in the First (2007) and the Last (2017) Year of the Study
Characteristic . | MSM in Clusters . | Nonclustered MSM on Tree . | Pcluster/tree . | MSM Not on Phylogeny . | Ptree/SHCS . |
---|---|---|---|---|---|
First year of study (2007) | |||||
No. | 1767 | 1124 | 785 | ||
Age, y, median (IQR) | 45 (40–52) | 45 (40–54) | .091 | 49 (43–58) | < .001 |
White ethnicity | 95.59% | 92.45% | < .001 | 93% | .51 |
CD4 cell count at diagnosis, median (IQR) | 367 (196.5–557) | 345 (184–537) | .065 | 290 (115–487) | < .001 |
Year of SHCS registration, median (IQR) | 1998 (1994–2003) | 1998 (1994–2003) | .5 | 1995 (1992–2000) | < .001 |
Last year of study (2017) | |||||
No. | 3429 | 1544 | 1372 | ||
Age, y, median (IQR) | 51 (44–58) | 52 (43–60) | .056 | 53 (45–63) | .003 |
White ethnicity | 92.97% | 86.69% | < .001 | 88.63% | .104 |
CD4 cell count at diagnosis, median (IQR) | 381.5 (222–561) | 367.5 (200–546.5) | .03 | 369.5 (194–577.25) | .735 |
Year of SHCS registration, median (IQR) | 2005 (1997–2010) | 2004.5 (1996–2010) | .888 | 2004.5 (1995–2012) | .301 |
Characteristic . | MSM in Clusters . | Nonclustered MSM on Tree . | Pcluster/tree . | MSM Not on Phylogeny . | Ptree/SHCS . |
---|---|---|---|---|---|
First year of study (2007) | |||||
No. | 1767 | 1124 | 785 | ||
Age, y, median (IQR) | 45 (40–52) | 45 (40–54) | .091 | 49 (43–58) | < .001 |
White ethnicity | 95.59% | 92.45% | < .001 | 93% | .51 |
CD4 cell count at diagnosis, median (IQR) | 367 (196.5–557) | 345 (184–537) | .065 | 290 (115–487) | < .001 |
Year of SHCS registration, median (IQR) | 1998 (1994–2003) | 1998 (1994–2003) | .5 | 1995 (1992–2000) | < .001 |
Last year of study (2017) | |||||
No. | 3429 | 1544 | 1372 | ||
Age, y, median (IQR) | 51 (44–58) | 52 (43–60) | .056 | 53 (45–63) | .003 |
White ethnicity | 92.97% | 86.69% | < .001 | 88.63% | .104 |
CD4 cell count at diagnosis, median (IQR) | 381.5 (222–561) | 367.5 (200–546.5) | .03 | 369.5 (194–577.25) | .735 |
Year of SHCS registration, median (IQR) | 2005 (1997–2010) | 2004.5 (1996–2010) | .888 | 2004.5 (1995–2012) | .301 |
MSM transmission clusters were defined as phylogenetic transmission clusters consisting of at least 50% MSM. The first column “MSM in clusters” refers to MSM cluster members, the second column “nonclustered MSM on tree” refers to all MSM who were on the respective phylogeny (ie, with available genotypic resistance testing [GRT]), while the column “MSM not on phylogeny” includes all MSM registered in the SHCS but not on the phylogeny (ie, individuals without sequencing data) with follow-up data after the beginning of the year 1995, when the collection of plasma samples was initiated. P values comparing characteristics of MSM in clusters to nonclustered MSM on the tree and all MSM included in the analysis to all MSM without GRT, respectively, were derived using paired Wilcoxon rank-sum test.
Abbreviations: HIV, human immunodeficiency virus; IQR, interquartile range; SHCS, Swiss HIV Cohort Study.
Incidence of Newly Infected MSM Within MSM Transmission Cluster During 10-Year Study Period
The annual number of newly infected MSM, identified by incorporating estimated infection dates and excluding new diagnosis of chronic infections (see Methods), decreased significantly over our study period (Ptrend< .001; Figure 2). Between 30.6% (2009) and 57.1% (2014) of the newly infected MSM appeared within preexistent MSM transmission clusters (Ptrend= .802; Figure 2). The majority of newly infected MSM not in preexisting MSM transmission clusters appeared in newly emerged clusters containing only 2 individuals including the newly infected MSM (Supplementary Figure 2).

Newly infected men who have sex with men (MSM) within MSM transmission clusters. A, Number of newly infected MSM with acute or recent human immunodeficiency virus infection (as defined in the Methods) (solid green line) and fraction of all newly infected MSM who appear within preexisting phylogenetic MSM transmission clusters (as defined in the Methods) (dashed pink line). P values for time trends were derived using linear regression. B, Example of 3 MSM transmission clusters acquiring newly infected MSM between 1 January 2007 and 1 January 2008. Olive green color represents individuals who were already in the cluster in the previous year; pink, newly infected individuals within the cluster; black, Los Alamos background sequences; and light green, other Swiss HIV Cohort Study (SHCS) participants (SHCS participants who were not in the same cluster in the previous year). Abbreviation: HIV, human immunodeficiency virus.
Characteristics of MSM Transmission Clusters Acquiring Newly Infected MSM
The mean annual infectivity score for MSM transmission clusters significantly decreased over the study period (Figure 3A). This paralleled the observation that the median time between HIV diagnosis to treatment initiation decreased from a maximum of 22 weeks in 2008 to a minimum of 3 weeks in 2016 (Supplementary Figure 3), indicating a decreasing fraction of infectious SHCS participants over our study period. In contrast, the mean annual behavioral risk score for MSM transmission clusters strongly increased over the study period (Figure 3B). Both the infectivity and the behavioral risk score were significantly higher in MSM transmission clusters that acquired newly infected MSM, compared to MSM transmission clusters not acquiring newly infected MSM (Figure 3A and 3B). The fraction of transmission clusters acquiring new infections, even though not containing any infectious members (ie, infectivity score of 0), increased significantly over the study period (Ptrend = .017; Figure 3C), reaching levels of up to 50% by 2016.

Characteristics of men who have sex with men (MSM) transmission clusters acquiring newly infected individuals. The x-axis denotes the study year (first year per tick: year of transmission clusters used to identify cluster characteristics; second year per tick: year of transmission clusters used to identify cluster growth). A, Mean annual per-cluster infectivity scores of clusters acquiring newly infected individuals (squares) compared to clusters not acquiring newly infected individuals (circles). The fraction of cluster members who were infectious (defined as at least 1 viral load ≥ 1000 copies/mL) within the study years is displayed on the y-axis. B, Mean annual per cluster behavioral risk scores of clusters acquiring newly infected individuals (squares) compared to clusters not acquiring newly infected individuals (circles). The fraction of cluster members who either reported risk behavior and/or had a coinfection (hepatitis B, hepatitis C, or syphilis) within the study years is displayed on the y-axis. C, Infectivity and behavioral risk scores of zero. The fraction of clusters acquiring newly infected individuals even though harboring an infectivity score (green) or behavioral risk score (pink) of zero is displayed on the y-axis. Total number of clusters identified at the beginning of the first year per tick (“clusters”) and number of clusters acquiring newly infected MSM during the next year (“growing”) are displayed. P values for time trends were derived using linear regression. P values for differences were derived using paired Wilcoxon rank-sum test.
Per-capita Cluster Growth Was Determined by Annual Infectivity and Behavioral Risk Scores
Multivariable Poisson regression analysis revealed that the annual infectivity score was significantly and independently associated with per-capita cluster growth of MSM transmission clusters in 8 of 10 study years (2007–2011, 2013–2014, 2016) (Figure 4). At the same time, the annual behavioral risk score was significantly associated with the per-capita MSM transmission cluster growth in 3 of 10 study years (2011–2012, 2014). Combining the 2 scores did not improve the observed associations (Supplementary Table 2). These associations highlight the potential of identifying transmission clusters at high risk of transmission using these already available variables. At the same time, our power to predict cluster growth in the subsequent year using the previous regression analysis decreased strongly over our study period (Ptrend = .0002; Supplementary Figure 4).

Association of cluster infectivity and cluster risk behavior with per-capita cluster growth. Results of a Poisson regression with per-capita growth as outcome, and infectivity as well as behavioral risk score as dependent variables. The effect of the infectivity score (A) and the behavioral risk score (B) on the incidence rate ratio is depicted on the y-axis, whereas the x-axis shows the study years. Coefficients from univariable (green) and multivariable (purple) models are shown. The y-axes depict the incidence rate ratio. *P < .05; **P < .01; ***P < .001.
DISCUSSION
This is the first study quantifying the risk of ongoing transmission in phylogenetically identified clusters by combining data on virological suppression with behavioral risk, thereby identifying the changing drivers of the HIV epidemic. Based on the comprehensive SHCS, this analysis was performed for over a decade and thus provides a long-term perspective on challenges underlying HIV elimination. The decreasing infectivity score along with the decreasing total number of newly infected MSM over the study period provides evidence for a beneficial effect of TasP at the population level. Despite the increase of the behavioral risk score in the last years of our study period, HIV infections still decreased, illustrating that TasP also works in the context of risky behavior. Over a decade, we found a significantly increasing fraction of transmission clusters that acquired new infections but did not contain any infectious members. In particular, in the last study years up to one-half of MSM transmission clusters acquiring newly infected MSM contained only virally suppressed cluster members. Given the proven effectiveness of TasP at the individual level [20, 21, 35], this implies that there were unsampled or undiagnosed HIV-positive individuals within the transmission networks corresponding to the MSM transmission clusters we identified. This is additionally supported by our finding of a significantly decreasing power to predict cluster growth over our study period and illustrates the declining role of people living with diagnosed but unsuppressed HIV in driving the more recent epidemic. Furthermore, this highlights that in recent years a relative shift occurred from the diagnosed to the undiagnosed population as the driver of the HIV epidemic in Swiss MSM.
Our study has limitations and strengths. As with any study in molecular epidemiology, we cannot achieve perfect sampling. Even though the SHCS is highly representative of the HIV epidemic in Swiss MSM, phylogenetically identified transmission clusters are likely to have unsampled members. Thus, the detected transmission clusters should be interpreted as proxies for the actual transmission networks. However, despite the incomplete sampling, our approach allows for relevant conclusions about the drivers of the epidemic. In particular, in the last study years we observed newly infected individuals in clusters where all members were virally suppressed, implying that there were unsampled/undiagnosed HIV-positive individuals within the transmission networks of the corresponding MSM transmission clusters. Moreover, the estimated dates of infection sometimes harbor large confidence intervals. Thus, we may not exclude the possibility of incorrectly categorizing individuals as “newly infected.” However, the quality of our estimated infection dates using defined HIV primary or recent infections was not time-dependent (data not shown) and misclassification of “newly infected” individuals was equally likely for individuals of clusters with varying characteristics over the time period examined. These shortcomings are therefore unlikely to explain any of the observed outcomes. Furthermore, there is no consensus or gold standard for phylogenetically defining HIV transmission clusters. The frequently used cluster detection methods fall into 3 categories: distance-based, topology-based, or a combination of the 2. Suitability of the methods highly depends on the research question of interest. In our case, a distance-based approach was not suitable as we did not want to prioritize new infections within recently established compared to long-circulating transmission clusters—thus, we applied a purely topological approach. The strengths of this study include that the statistical analysis directly yields the impact of risk and infectivity scores on the observed per-capita growth rate of the cluster over a long-term perspective in a representative population.
We demonstrate a new method to track the epidemiology of HIV, thereby identifying hotspots of ongoing transmission among MSM. The identified high-risk clusters could be used to target social networks harboring undiagnosed individuals spreading HIV. In these social networks, intensification of prevention (eg, testing campaigns and preexposure prophylaxis) is likely to have the largest impact. The significantly increasing number of new infections within transmission clusters without infectious members indicates a relative shift of epidemiological importance from diagnosed to undiagnosed individuals as drivers of the epidemic. Furthermore, this highlights the need for additional efforts to identify infectious individuals as early as possible and to link them into care. Our method combining phylogenetics, clinical data, and behavioral data sheds light on high-risk social networks, where prevention and HIV testing should be intensified, and may thus inform the optimal allocation of resources for ending HIV transmission.
SUPPLEMENTARY DATA
Supplementary materials are available at Clinical Infectious Diseases online. Consisting of data provided by the authors to benefit the reader, the posted materials are not copyedited and are the sole responsibility of the authors, so questions or comments should be addressed to the corresponding author.
Notes
Members of the Swiss HIV Cohort Study (SHCS). A. Anagnostopoulos, M. Battegay, E. Bernasconi, J. Böni, D. L. Braun, H. C. Bucher, A. Calmy, M. Cavassini, A. Ciuffi, G. Dollenmaier, M. Egger, L. Elzi, J. Fehr, J. Fellay, H. Furrer, C. A. Fux, H. F. Günthard (President of the SHCS), D. Haerry (deputy of “Positive Council”), B. Hasse, H. H. Hirsch, M. Hoffmann, I. Hösli, M. Huber, C. R. Kahlert (Chairman of the Mother and Child Substudy), L. Kaiser, O. Keiser, T. Klimkait, R. D. Kouyos, H. Kovari, B. Ledergerber, G. Martinetti, B. Martinez de Tejada, C. Marzolini, K. J. Metzner, N. Müller, D. Nicca, P. Paioni, G. Pantaleo, M. Perreau, A. Rauch (Chairman of the Scientific Board), C. Rudin, A. U. Scherrer (Head of Data Center), P. Schmid, R. Speck, M. Stöckle (Chairman of the Clinical and Laboratory Committee), P. Tarr, A. Trkola, P. Vernazza, G. Wandeler, R. Weber, S. Yerly.
Acknowledgments. The authors thank the patients for participating in the SHCS; the study nurses and physicians for excellent patient care; A. Scherrer, A. Traytel, and S. Wild for excellent data management; and D. Perraudin and M. Amstutz for administrative assistance.
Financial support. This work was funded within the framework of the Swiss HIV Cohort Study (Swiss National Science Foundation [SNF] grant number 33CS30_177499 to H. F. G.). The data were gathered by the 5 Swiss university hospitals, 2 cantonal hospitals, 15 affiliated hospitals, and 36 private physicians listed at http://www.shcs.ch/180-health-care-providers. The work was further supported by the SNF (grant numbers 324730B_179571 and 310030_141067 to H. F. G. and PZ00P3-142411 and BSSGI0_155851 to R. D. K.); the Yvonne-Jacob Foundation (to H. F. G.); the University of Zurich’s Clinical Research Priority Program for viral infectious disease; the Zurich Primary HIV Infection Cohort Study (to H. F. G.); and an unrestricted research grant from Gilead Sciences to the SHCS research foundation.
Potential conflicts of interest. M. C. has received research and travel grants for his institution from ViiV and Gilead. E.B. has received fees for his institution for participation on advisory boards for MSD, Gilead Sciences, ViiV Healthcare, Pfizer, Sandoz, and AbbVie. M. B. has received research or educational grants by AbbVie AG, Gilead Sciences Switzerland Sàrl, Janssen-Cilag AG, Merck Sharp & Dohme AG, and ViiV Healthcare GmbH. P. V.’s institution has received advisory fees for participation of team members in advisory boards from MSD, ViiV Healthcare GmbH, Teva, and Gilead Sciences. T. K. has received honoraria from Gilead Sciences and Roche Diagnostics. A. R. has received honoraria for advisory boards and/or travel grants from MSD, Gilead Sciences, AbbVie, Bristol-Myers Squib, Janssen-Cilag, and Pfizer, and an investigator-initiated trial grant from Gilead Sciences. All remuneration went to his home institution and not to A. R. personally. R. D. K. has received grants from the SNF and personal fees from Gilead Sciences, outside the submitted work. H. F. G. has received unrestricted research grants from Gilead Sciences and Roche; fees for data and safety monitoring board membership from Merck; and consulting/advisory board membership fees from Gilead Sciences, ViiV, Sandoz, and Mepha. All other authors report no potential conflicts of interest. All authors have submitted the ICMJE Form for Disclosure of Potential Conflicts of Interest. Conflicts that the editors consider relevant to the content of the manuscript have been disclosed.
References
Author notes
H. F. G. and R. D. K. contributed equally to this work.