Linkage disequilibrium suggests genomic stability in Omicron clades of SARS-CoV-2 from the ASEAN countries

Genome-wide population structure analyses showed the presence of linkage disequilibrium and low recombination rates in the SARS-CoV-2.

After more than 2 years of pandemic caused by SARS-CoV-2, COVID-19 is still a national concern in many countries worldwide.One of the key investigations is to understand the factors contributing to the evolutionary dynamics of SARS-CoV-2 as a pathogen.Currently, almost all countries have lifted border control orders and have allowed inter-country travel with minimal restrictions.This provides better resolutions on genomic patterns and the evolution of circulating SARS-CoV-2 in each community with the influence of imported strains.
In this report, we surveyed genomes of SARS-CoV-2 strains circulating in the Association of Southeast Asian Nations (ASEAN) countries.This project serves as a collaborative effort from the ASEAN Member States that had participated in the programme 'Strengthening Laboratory Capacity on COVID-19 Bio Genomic for ASEAN Countries'.A total of 124 SARS-CoV-2 samples were collected by the national level public health laboratories: Malaysia (n = 24), Brunei Darussalam (n = 20), Cambodia (n = 20), Indonesia (n = 20), Thailand (n = 20) and Vietnam (n = 20).All samples were sequenced using shortread technology.The version of the genomes used in this study was statistically quality-assessed and preprocessed in FASTQC and Trimmomatics, respectively, prior to de novo assembly in Megahit. 1 -3 The genomes were submitted to GISAID by the respective laboratories (Supplementary Data 1).
Phylogenomics analyses using 210 representative genomes from the NextStrain database (accessed on 25 August 2022) as background, assigned all the genomes into four different clades in the Omicron lineage.The majority of the strains were clustered into 22B (48.39%), followed by 22L (44.35%), 22A (4.03%) and, lastly, 21K (3.23%).Each strain was clustered into a respective clade and showed a non-observable cascadelike structure, indicating the genomes of SARS-CoV-2 exerted a slow impact on evolution (Figure 1).
There are speculations on the roles of recombination in viruses, such as mechanisms for rapid repair and host adaptation, as well as viral genome integrity. 4In this study, there were no observable recombination trends in the reconstructed phylogenomics trees.Thus, the hypothesis of recombination in the SARS-CoV-2 was further tested with inter-and intra-clades PHI test of recombination.Both tests showed insignificant recombination events among the strains at a P-value of 0.405 and 0.253, respectively.
Linkage disequilibrium measures the non-random association of nucleotides at different sites.Analyses of linkage disequilibrium have been reported to be able to infer evolutionary features in pathogens. 5In this study, linkage disequilibrium analyses were performed on the genomes of the SARS-CoV-2 analysed (Table 1).Using a threshold of 0.8 for a correlation between alleles at two loci (R 2 ), 22 sites were predicted to be under the effect of linkage disequilibrium.All the sites achieved highconfidence disequilibrium coefficients (D ) ranging from 0.96 to 1.0, which were supported by Fisher's Exact test ranging from 3.65 × 10 −37 to 1.06 × 10 −10 .One linkage disequilibrium was found to involve 13 out of the 22 predicted sites (LD Set 1).The sites involved 11 SNP-SNP, single SNP-INDEL and single triplet-nucleotide linkage disequilibria.The linkage disequilibrium for the remaining sites involved only SNP-SNP associations, with four (LD Set 2), three (LD Set 3) and two (LD Set 4) affected sites, respectively.Recombination processes were found to disrupt linkage disequilibrium and managed to alter the variants-associated sites. 6In this study, the positions of the linkage disequilibrium pairs ranged from 22 to 26 712 bp (Figure 1).Coupled with the PHI test of recombination, the closeto-wide pairing distances of linkage disequilibrium suggested that the genomes of SARS-CoV-2 had little to no recombination influence.
Successful reproduction in a population is the indicator of genome stability.Recombination is one of the common processes used for viral genome repair and can introduce mutations into the host (reviewed by Kockler and Gordenin). 7Hence, the process increases the diversity in a population.However, it has been reported that there is a lack of genomic diversity (low frequency of nucleotide changes) among SARS-CoV-2 strains. 8Although the recombination in SARS-CoV-2 has been discussed in multiple publications, 9 the number of recombinant strains has not been found to be alarming.It was reported that only approximately 2-3% out of the total SARS-CoV-2 genomes deposited in the public database exerted recombination events. 10The recombinant strains were also predicted to be present in the population only for a short period of time.Together with all these aspects, a strong influence of linkage disequilibrium from our study suggests the stability of Omicron clades SARS-CoV-2 genomes.The stability in the genomes indicates that random nucleotide changes are less likely to occur in the SARS-CoV-2.Nucleotide changes, especially linkage disequilibrium, can have a direct influence on the virulence and vaccine effectiveness in the infected hosts.This characteristic of SARS-CoV-2 will raise public health concerns if new variants are identified.Further studies are needed to evaluate the linkage disequilibrium among all SARS-CoV-2 clades and the impact on their fitness factor.This study also provides better Note: The genomes will either have LD Pattern 1 or LD Pattern 2 across the positions in each LD set.
insights for other researchers to thoroughly customize specific parameters to analyse the trend of infection and drug design in combating SARS-CoV-2.

Funding
This study is a part of the support under the project 'Strength-

Figure 1 .
Figure 1.(a) Unrooted representation for phylogenomics of 124 genomes yielded from this study with 210 NextStrain clade representative genomes.Each colour represented single clade.(b) Illustration of linkage disequilibrium in the SARS-CoV-2 genomes.

Table 1 .
Details of linkage disequilibrium ening Regional Initiatives in ASEAN on COVID-19 Response and other Public Health Emergencies', an immediate COVID-19 support programme of the German Federal Ministry of Economic Cooperation and Development (BMZ), implemented by the Deutsche Gesellschaft für Internationale Zusammenarbeit (GIZ) GmbH.