Impact of Homologous Recombination on Core Genome Evolution and Host Adaptation of Pectobacterium parmentieri

Abstract Homologous recombination is a major force mechanism driving bacterial evolution, host adaptability, and acquisition of novel virulence traits. Pectobacterium parmentieri is a plant bacterial pathogen distributed worldwide, primarily affecting potatoes, by causing soft rot and blackleg diseases. The goal of this investigation was to understand the impact of homologous recombination on the genomic evolution of P. parmentieri. Analysis of P. parmentieri genomes using Roary revealed a dynamic pan-genome with 3,742 core genes and over 55% accessory genome variability. Bayesian population structure analysis identified 7 lineages, indicating species heterogeneity. ClonalFrameML analysis displayed 5,125 recombination events, with the lineage 4 exhibiting the highest events. fastGEAR analysis identified 486 ancestral and 941 recent recombination events ranging from 43 bp to 119 kb and 36 bp to 13.96 kb, respectively, suggesting ongoing adaptation. Notably, 11% (412 genes) of the core genome underwent recent recombination, with lineage 1 as the main donor. The prevalence of recent recombination (double compared to ancient) events implies continuous adaptation, possibly driven by global potato trade. Recombination events were found in genes involved in vital cellular processes (DNA replication, DNA repair, RNA processing, homeostasis, and metabolism), pathogenicity determinants (type secretion systems, cell-wall degrading enzymes, iron scavengers, lipopolysaccharides (LPS), flagellum, etc.), antimicrobial compounds (phenazine and colicin) and even CRISPR-Cas genes. Overall, these results emphasize the potential role of homologous recombination in P. parmentieri's evolutionary dynamics, influencing host colonization, pathogenicity, adaptive immunity, and ecological fitness.


Introduction
In various organisms, including plant bacteria, homologous recombination constitutes an essential genetic mechanism (González-Torres et al. 2019).Plant bacteria engage in homologous recombination through a process known as transformation, enabling them to uptake foreign DNA from the surrounding environment.The process involves the exchange and recombination of genetic material between 2 similar DNA sequences, leading to the emergence of new gene combinations and potential changes in phenotype (Kuzminov 2011;González-Torres et al. 2019).For homologous recombination to occur, both ends of the recombined fragment must exhibit a high degree of DNA sequence similarity.As a result, this molecular mechanism primarily occurs between strains of the same species and taxonomically related bacteria (David et al. 2017;Jibrin et al. 2018;González-Torres et al. 2019).The need for a certain level of identity explains why the highest rates of homologous recombination are found in sections of the core genome where sequence similarities are preserved (Everitt et al. 2014;Dixit et al. 2015;Jibrin et al. 2018).
Homologous recombination in bacterial pathogens is of great importance for genetic diversity and adaptation, as it enables the acquisition of new traits and the evolution of these microorganisms.Through homologous recombination, bacteria can acquire new genes and alleles that confer advantages, such as the rapid spread of traits enabling pathogen survival, overcome host defense mechanism, and establishing successful infections (David et al. 2017;Jibrin et al. 2018;González-Torres et al. 2019).During the recombination process, bacteria can exchange genes involved in virulence, antibiotic resistance, and environmental adaptation, leading to the emergence of new pathogenic variants (Prokchorchik et al. 2020).Importantly, homologous recombination has been shown to accelerate evolution by up to a hundred times when different strains are present in the same host (Didelot et al. 2016).This genetic diversity helps plant bacterial pathogens in colonizing a range of hosts, evade plant defenses, and thrive in various environments (Vanhove et al. 2019).
In addition, homologous recombination functions as the main method for DNA repair in plant bacteria (David et al. 2017).It aids in repairing DNA damage caused by environmental factors, such as breaks or mutations (González-Torres et al. 2019).Plant bacteria may repair their genomes and maintain genomic integrity by exchanging genetic information with a homologous undamaged sequence (Potnis et al. 2019).This ensures their existence and aids in their ability to adapt to shifting environmental factors (Jibrin et al. 2018).Besides its role in DNA repair, recombination has also been observed to help in the deletion of deleterious mutations and the introduction of beneficial mutations during DNA synthesis, acting as a positive side effect leading to an energy source (David et al. 2017).Finally, some studies have found that bacteria involved engage in homologous recombination to delete mobile genetic elements from their genomes (David et al. 2017).In summary, homologous recombination in plant bacterial pathogens is a key process that drives genetic diversity, adaptation, DNA repair, and the acquisition of new traits.Therefore, understanding the mechanisms and dynamics of this process can have profound implications for developing effective strategies to manage and control bacterial pathogens, as well as for optimizing crop health and productivity in agricultural settings.
Pectobacterium parmentieri is classified as a pectinolytic rod-shaped, facultatively anaerobic bacterium belonging to the genus Pectobacterium within the Pectobacteriace family (Adeolu et al. 2016).This species was formerly known as P. wasabiae, the horse radish pathogen, and was later assigned as new species (Khayi et al. 2016).Like other members within the Pectobacterium genus, P. parmentieri is a plantpathogenic bacterium causing soft rot and black leg diseases (Nykyri et al. 2012;Charkowski 2018;Zoledowska, et al. 2018a).The bacterium is widely distributed across the globe, having been reported in several countries, including Belgium, France, Finland, Germany, Norway, Switzerland, The Netherlands, Canada, South Africa, New Zealand, Malaysia, Poland, Spain, Turkey, China, and different states within the United States (Pitman et al. 2010;De Boer et al. 2012;Ngadze et al. 2012;Pasanen et al. 2013;Moleleki et al. 2017;Suárez et al. 2017;Zoledowska, et al. 2018b;Arizala et al. 2019;Cao et al. 2021;Ge et al. 2021).The strains of this species have been isolated from various environments and sources, including plants, soil, water, and plant debris.In a recent study, the metabolic modeling of the Finland strain SCC3193 revealed that P. parmentieri can adapt and survive in either soil or rhizosphere (Zoledowska et al. 2019).Although most P. parmentieri strains deposited in the NCBI GenBank database have been isolated mainly from potato (accessed on 2022 September 14), there are different surveys reporting the isolation of this bacterium from other hosts such as cabbage, tomato, eggplant, onions, carrot, maize, sugar beet, calla lily, sweet potato, and star of Bethlehem, impacting both yield and quality (Zoledowska et al. 2018b).
The ability of P. parmentieri to cause soft rot diseases and establish a successful infection depends on the production of plant cell wall-degrading enzymes, namely cellulases, pectinases, and proteases (Nykyri et al. 2012;Zoledowska et al. 2018a;Arizala and Arif 2019).These enzymes are delivered via the type I (TISS) or II (TIISS) secretion systems, allowing the pathogen to macerate plant tissues, and spread throughout the host tissues (Nykyri et al. 2012;Zoledowska et al. 2018a;Arizala and Arif 2019;Arif et al. 2022).Importantly, due to the absence of the type III (T3SS) secretion system, studies have reported that P. parmentieri might be less virulent compared to other Pectobacterium species (Kim et al. 2009;Nykyri et al. 2012;Arizala and Arif 2019).However, other research studies have found highly pathogenic P. parmentieri strains on potato plants and tubers (Ge et al. 2021), demonstrating that the lack of T3SS in this bacterium is not a detrimental pathogenicity determinant for disease development (Kim et al. 2009).
Genomic analysis has revealed that this bacterium has a relatively large genome size, with approximately 4.6 to 5.6 million base pairs (Mbp), containing approximately 4,400 to 5,400 predicted protein-coding genes (Nykyri et al. 2012;Khayi et al. 2015;Zoledowska et al. 2018aZoledowska et al. , 2018b;;Arizala and Arif 2019).The P. parmentieri genome typically comprises a single circular chromosome, with few strains harboring a plasmid (Zoledowska et al. 2018a(Zoledowska et al. , 2018b)).Numerous genes involved in important biological functions, including metabolism, replication, transcription, and translation have been found on the chromosome (Nykyri et al. 2012;Zoledowska et al. 2018aZoledowska et al. , 2018b)).In addition, the genome contains many coding regions linked to pathogenicity, including those responsible for producing pectinolytic enzymes, extracellular enzymes, iron scavengers, toxins, protein effectors, and specialized secretion systems (Nykyri et al. 2012;Zoledowska et al. 2018aZoledowska et al. , 2018b;;Arizala and Arif 2019).Together, these factors contribute to the infection and colonization of this pathogen inside its host plants.
Comparative genomic analyses have provided valuable insights into the genetic variations within the P. parmentieri species and their implications for host specificity and pathogenicity (Nykyri et al. 2012;Zoledowska et al. 2018a;Arizala and Arif 2019).Zoledowska et al. (2018a), reported several genes found to be part of the dispensable genome, indicating a high genomic plasticity across the P. parmentieri strains.These findings could be associated with the widespread distribution of this bacterium across the globe, its broad host range, and its rapid adaptation to thrive in different climatic zones exhibiting different temperatures and humidity ranges (Zoledowska et al. 2018b).On the other hand, widely associated genome studies have shown that P. parmetieri has horizontally acquired gene regions crucial for pathogenicity and environment adaptation (Nykyri et al. 2012;Zoledowska et al. 2018a;Arizala and Arif 2019).The French and type strain RNS 08.42.1A (CFBP 8475 T ), possesses several quorum sensing genes that seem to be acquired thorough horizontal gene transfer (HGT) (Khayi et al. 2015).Moreover, the genome of the Finland strain SCC3193, which has been used as a model for studies for decades, was observed to harbor different genome islands, some of which contain important pathogenicity determinants.These outcomes provide a glimpse of the impact of gene transfer in the evolution of this phytopathogen.
No study has been conducted regarding the role of homologous recombination and its implication for the evolution of P. parmentieri.Considering this background, the primary objective of the present research article was to examine the importance of homologous recombination and its potential contribution to the evolutionary relationships, wide-strain genomic diversity, pathogenicity, host adaptation, and immunity of P. parmentieri.To our knowledge, this is a pioneering research that delves deeply into the study of homologous recombination across the core genome of the soft rot bacterium P. parmentieri.

Pan Genome and Population Structural Analyses Reveal High Diversity Within Strains of P. parmentieri
In total 17 complete and 15 draft genomes belonging to different P. parmentieri strains, isolated from various time periods and locations, were retrieved from the NCBI GenBank database and used in the analyses (supplementary table S1, Supplementary Material online).The pan genome analysis conducted in Roary identified 3,742 and 71 genes in the core (genes present in 99% of strains) and soft core (genes present in 95% to 99% of strains) genomes, respectively (Fig. 1a).The accessory genome composed of the cloud and shell genomes, represented 55% of the pan-genome (8,433 genes).The shell genome (genes present in 15% to 94% of strains) and cloud genome (genes present in less than 15% of strains) accounted for 19% (1,605 genes) and 36% (3,015 genes) of the pan genome, respectively (Fig. 1a).The higher number of genes identified in the accessory genome compared to the core/ soft core genomes highlights the relevance of cloud and shell genes in the genetic diversity within the P. parmentieri species.Comparable results were found in a previous pangenome study performed with 15 P. parmentieri strains, where 1,468 genes were part of the accessory genome and 1,847 were unique genes while the core genome was scarcely bigger than 50% of the pan-genome.Overall, these outcomes reflect an open pan-genome within P. parmentieri, which, as previously pointed (Zoledowska et al. 2018a), might be linked to the high genome plasticity and adaptation of this species to diverse environmental regions.Indeed, the presence of a large number of genes in the accessory genome (found also in our analysis) has been highlighted as the potential basis for the widespread diffusion of P. parmentieri (Zoledowska et al. 2018b) as the pathogen has been reported in numerous geographic locations worldwide, including, Finland, New Zealand, Canada, China, Turkey, South Africa, and several states within the USA.
To examine the population structure within the 32 P. parmentieri strains, a clustering analysis was conducted using a hierarchical approach with Bayesian Analysis of Population Structure (BAPS, Cheng et al. 2013) implemented in R (RhierBAPS, Tonkin-Hill et al. 2018).RhierBAPS identified 7 lineages (clusters) among the P. parmentieri strains (Fig. 1b).
The first lineage consisted of strains CFIA1002 and SS09 isolated in 2007 and 2017 in Canada: Alberta and Pakistan, Punjab, respectively.Lineage 2 encloses the New York (USA) strains NY1584A, NY1587A, and NY1532B isolated in 2016.Lineage 3 comprises one New York (USA) strain NY1712A isolated in 2017 and 3 Poland strains, 2 collected in 2013 (IFB5441, IFB5432) and IFB5623 isolated in 2014.Interestingly, lineage 4 was the most heterogenous, composed of 4 isolates which formed like-outgroups subclades within the lineages 3, 5, and 6.Specifically, strains WPP163 and WC19161 isolated in 2004 (USA, Wisconsin) and 2019 (China, Hohhot), respectively, grouped separately but closely to the isolates assigned to the lineage 3, whereas IFB5597 isolated in Poland in 2014 grouped closely to the strains of lineage 6.The South Korea strain HC isolated in 2016 was distinctively separated from all lineages and clustered close to strains of lineage 5. Lineage 5, on the other hand, was integrated by 6 isolates, including 2 Poland strains isolated in 2014 (IFB5604, IFB5619), a Belgium strain (IFB5485) isolated in 2012, QK-5 isolated in 2019 in China (QingKou), the Russian strain PB20 isolated in 2014 in the Moscow region, and strain IPO1955 isolated in 2002 (unknown location).Like lineage 5, lineage 6 is constituted by 6 isolates, including 5 New York strains (NY1588A, NY1548A, NY1585A, NY1540A, and NY1533B) isolated in 2016 and the Finland strain SCC3193 isolated in the early 1980s, used as a model for different molecular biology studies concerning Pectobacterium over many decades (Nykyri et al. 2012;Zoledowska et al. 2019).This strain was previously observed to cluster far away from the other P. parmentieri strains in a pan-genome tree due to the high number of unique genes present in this isolate (Zoledowska et al. 2018a).Here, we found that SCC3193 grouped together with strain NY1588A in a sub-clade within the lineage 6.Finally, the lineage 7 harbors 7 strains, including the type strain RNS 08-42-1Aᵀ isolated in France in 2008, 4 Poland strains isolated in 2013 (IFB5427, IFB5408) and 2014 (IFB5605, IFB5626), strain IFB5486 isolated from Belgium in 2012 and strain NY1722A isolated in New York in 2017.Except for strain IFB5427, which was isolated from weed, all other strains were isolated from either symptomatic tuber or potato stem.
Overall, all strains assigned to each of the 7 lineages grouped differently regardless of their geographic origin and isolation year (Fig. 1b), correlating with former phylogeographic analysis of this pathogen (Zoledowska et al. 2018b).This highly heterogenous pattern exhibited in the P. parmentieri population was suggested to be a consequence of high geographical mobility (Zoledowska et al. 2018b).Similarly, an elevated level of genetic diversity has been observed in the population structure analyses of the causal agent of tomato bacterial canker, Clavibacter michiganensis, as result of presumably multiple introductions events, where the international tomato seed trade has been highlighted as the leading force (Thapa et al. 2017;Ansari et al. 2019).Likewise, in P. parmentieri, the potato The phylogenetic tree was built using iTOL v6 (https://itol.embl.de/)and the tips of each branch were annotated and colored according to the level clusters identified using RhierBAPS and for matching the colors used by fastGEAR.
trading business appears to be one of the major contributors to the remarkably heterogeneity population of this pathogen and its global expansion.

Reconstructed Core-genome Genealogy and Rates of Homologous Recombination
To assess recombination events in the core genome of P. parmentieri and reconstruct its genealogy, a recombination analysis using the ClonalFrameML (Didelot and Wilson 2015) was carried out employing a maximum likelihood tree as starter.ClonalFrameML detected 5,125 recombination events, with 3,094 occurring in the nodes that gave origin to the different lineages identified by RhierBAPS, and 2,031 accounted for recombination regions across the distinct P. parmentieri strains used in this study (Fig. 2).The ClonalFrameML algorithm estimated the following homologous recombination rates (Table 1): the average length of recombined fragment (δ) was 83.456 bp, the average divergence between donor and recipient (ν) displayed a value of 0.0685 and the ratio of recombination to mutation rate was (R/θ) 0.236.This data shows that mutations have occurred 4.24 times more frequently compared to recombination.However, since each recombination event introduced an average of δν = 5.71 substitutions, the impact of recombination over mutation (r/m) was 1.348 times higher than mutation, emphasizing the relevance of recombination in P. parmentieri evolution.
In general, recombination events were detected in the ancestors of each lineage (Fig. 2, indicated with a red arrow).Supplementary fig.S1, Supplementary Material online shows the positions and numbers of the nodes assigned by ClonalFrameML.The highest numbers of recombination events were observed in specific nodes, with node 48 having FIG.2.-Reconstructed genealogy and inferred homologous recombination events in the core genome of 32 p. parmentieri strains are depicted in the graphic.The graphic depicts the phylogenomic relationships among the 32 isolates, along with locations of inferred recombination and substitutions events.Reconstructed ML phylogenetic tree is depicted on the left; a color-coded vertical bar based on the lineages (clades) identified in RHierBAPS is shown in the center; inferred recombination events are illustrated in the right frame according to each position across the core genome (x-axis) and for each branch of the tree (y-axis).Unlabeled branches indicate ancestral lineages in the phylogeny.Dark blue horizontal lines pinpoint homologous recombination events, while vertical lines represent reconstructed substitutions or mutations.Light blue background color in the frame refers to no substitution or invariant polymorphic sites.White lines mark the absence of homoplasy, whereas the range of redness lines, from yellow to red, represent an increase of homoplasy levels.Inference of recombination analysis was conducted in ClonalFrameML using an RaxML tree with 1,000 bootstraps as a guide tree.Arrows indicate the nodes with the highest recombination events, while pink framed squares point to the most recombinant strains as inferred by ClonalFrameML.supplementary fig.S1, Supplementary Material online shows the position and names of all nodes as assigned by ClonalFrameML. the most (343 events), followed by node 56 (341 events), node 42 (339 events), node 40 (338 events), node 37 (317 events), node 45 (292 events), and node 50 (150 events).Moreover, we observed distinct degrees of homoplasy (Fig. 2, represented by yellow to red lines) across the lineages, seemingly imported as a consequence of mutation events occurring on the main branches of the genealogy tree.A similar effect has been reported in Staphylococcus aureus (Didelot and Wilson 2015).Intriguingly, all P. parmentieri strains assigned to the lineage 4 (Fig. 2, marked in a pink square) were detected as the most recombinant isolates in the clonal phylogeny.The Poland strain IFB5597 led the list with 320 recombination events, featuring recombinant fragments ranging from 2,283 to 9 bp.The South Korean strain HC ranked second in the list, exhibiting 311 recombination regions with sizes ranging 1,995 to 3 bp.Following, the American strain WPP163, isolated in Wisconsin, presented 306 recombination events with fragment lengths ranging from 1,273 to 5 bp.Finally, the Chinese strain WC19161 displayed 298 recombinant events with size ranging from 2,094 to 5 bp.Within the lineage 7, the Poland isolates IFB5626 and IFB5605 exhibited the highest number of recombinant events, with 72 and 60 inferred recombinant sites ranging from 1,584 to 4 bp and 5,490 to 6 bp, respectively.Additionally, the Belgium strain IFB5486 showed 57 recombinant sites with a maximum and minimum lengths of 885 and 4 bp, respectively, whereas the type strain RNS 08-42-1Aᵀ, isolated from France, and the Poland strains IFB5408 and IFB5427, along with NY1722A from New York, appeared as non-recombinant strains.Regarding lineage 5, the strains IFB5619 (Poland) and IPO1955 (unknown origin) showed 70 and 95 recombination events, with size ranging from 935 to 6 bp and 1,335 to 8 bp, respectively.In contrast, the strains IFB5604 (from Poland) and QK-5 (from China) solely presented 1 and 10 recombination regions of 48 bp and 2,152 to 25 bp, respectively, while PB20 (from Russia) and IFB5604 (from Poland) did not show any recombination.
In lineage 6, only 2 strains NY1588A and the model organism SCC3193, presented recombination events of 26 and 33, with fragments sizes oscillating between 5,490 to 88 bp and 1,016 to 6 bp, respectively.On the other hand, the New York NY1712A and Poland IFB5432 strains clustering in lineage 3 displayed 36 and 29 recombinant regions with lengths ranging from 624 to 36 bp and 780 to 2 bp, respectively, while no recombination events were inferred in strains IFB5623 and IFB5441.All strains in lineage 2 have undergone through homologous recombination, thus recombination numbers of 68, 58, and 51 were observed in the New York strains NY1584A, NY1587A, and NY1532B, with fragments sizes of 997 to 6 bp, 1,009 to 6 bp and 927 to 3 bp, respectively.In the case of lineage 1, the Canadian CFIA1002 (74 recombination events) and Pakistani strain SS90 (51 recombination events) strains showed recombinant sizes ranging from 1,261 to 4 bp and 606 to 5 bp, respectively.Altogether, these results indicate that the frequency of recombination varies across the strains.Related results were observed in recombination studies among Xylella fastidiosa subspecies (Vanhove et al. 2019).It seems reasonable that, similar to X. fastidiosa subspecies, due to geographic isolation, the introduction of foreign P. parmentieri isolates into new regions where other strains were already coexisting has contributed to homologous recombination events, leading to the emergence of novel strains.

Screening of Recombinant Genes Inferred in the P. parmentieri Strains and Ancestral Nodes in the Genealogy Tree
All recombinant fragments predicted by ClonalFrameML in either each strain or the ancestral nodes in the genealogy tree were scanned and tracked using a customized BLASTN tool.Genes related to several vital cell functions, such as DNA repair, RNA methylation, RNA processing and degradation, pyrimidine and purine conversions, nitrate and nitrite ammonification, biosynthesis of amino acids (methionine, alanine, cysteine, glutamine, glutamate, aspartate, asparagine, lysine), utilization of D-galacturonate, D-glucoronate, and D-ribose, uptake and utilization of lactose and galactose, phosphate metabolism, potassium homeostasis, biosynthesis of vitamins (thiamin, biotin, folate, pyridoxin), fatty acid biosynthesis, glycogen, glycerate and phosphate metabolism, protein chaperones, and response to stress showed to be affected by homologous recombination in the analysis (supplementary tables S2 to S44, Supplementary Material online).These findings suggest that homologous recombination may play a role in essential cellular processes for P. parmentieri.Indeed, homologous recombination has been highlighted to be fundamental for completion of DNA replication (Syeda et al. 2014); moreover, homologous recombination has showed to participate in the repair of DNA double-stranded breaks (DSBs) by incorporating foreign DNA as a template (Li andHeyer 2008, Wright et al. 2018).Interestingly, recombination was also found to occur within the genes involved in the Clustered Regularly Interspaced Short palindromic Repeats (CRISPR)-Cas (CRISPR-associated proteins) system.The CRISPR-Cas system confers an innate and adaptive immunity in many bacteria by defending the bacterial cell against bacteriophages or other exogenous invaders (Koonin and Makarova 2013;Makarova et al. 2015).In the case of P. parmentieri, 2 types of CRISPR-Cas systems have been reported: subtype I-F and subtype I-E (Arizala and Arif 2019).In this research, recombination fragments were detected in the cas genes encoding the CRISPR-associated (Cas) proteins Cas1 (subtype I-F) and Cas3 (subtype I-E) with maximum sizes of 1,009 and 469 bp, respectively.Strains WPP163 and WC19161 exhibited recombination signals in both genes cas1 and cas3 (supplementary tables S27 to S28, Supplementary Material online).While strains HC and NY1587A showed recombination only in cas1 (supplementary tables S29 and S38, Supplementary Material online).Conversely, in the strains IFB5597, NY1712A, and NY1584A, homologous recombination events were detected solely in cas3 (supplementary tables S26 and S36 to S39, Supplementary Material online).Likewise, in the genealogy tree (Fig. 2), recombination events within both cas1 and cas3 genes were also found in the ancestral nodes 37, 40, and 56, while nodes 41, 42, 48, and 53 showed recombination only in cas3 gene, and node 45, and 50 exhibited recombination only in cas1 (supplementary tables S4 to S22, Supplementary Material online).Altogether, this data suggests that homologous recombination within cas1 and cas3 derived from the main ancestral node 56 and was transmitted during evolution to the different lineage nodes, and finally, to the P. parmentieri strains mentioned above.
In line with our findings, previous studies have reported HGT events in CRISPR-Cas regions (Tyson and Banfield 2008;Chakraborty et al. 2010).Arizala and Arif (2019) described the acquisition of an entire type III-A CRISPR-Cas system in Pectobacterium aroidearum PC1 from Serratia species through HGT mechanism.The transfer of cas genes between identical loci surrounding CRISPR regions has been reported in Escherichia coli and Salmonella enterica genomes, suggesting that CRISPR-associated regions might serve as hot spots for recombination of cas genes (Touchon and Rocha 2010).Additionally, homologous recombination has been implicated in facilitating this genetic exchange of cas genes (Touchon and Rocha 2010).Subsequently, studies identified widespread microscale recombination events in several cas genes, highlighting the dynamic evolution of cas genes beyond cas1 (Takeuchi et al. 2012).Building on these earlier investigations, the homologous recombination detected in cas1 and cas3 in P. parmentieri in our study supports previous hypotheses about the coevolutionary competition of the CRISPR-Cas system as an immune defense mechanism against rapidly evolving viruses (Takeuchi et al. 2012;González-Torres et al. 2019).
Strikingly, recombination events consistently appeared in genes integral to critical pathogenicity determinants, such as the type VI secretion system (T6SS), iron uptake systems (siderophore aerobactin, ferric citrate, and enterobactin), LPS, plant cell wall degrading enzymes (CWDE), type IVb fimbrial low-molecular-weight protein/tight adherence protein (Flp/Tad) pilus, citrate metabolism, butanediol metabolism, flagella (supplementary tables S2 to S43, Supplementary Material online).These factors collectively play pivotal roles in the infection process, bacterial colonization, and the progression of soft rot/black leg diseases in Pectobacterium sp.(Nykyri et al. 2012;Charkowski 2018;Arizala and Arif 2019;Arif et al. 2022).This underscores the significant impact of homologous recombination on the evolution of several virulence-associated factors across the core genome of P. parmentieri.
The T6SS comprises 15 to 23 proteins, with 13 highly conserved proteins believed to be essential for its function (Shyntum et al. 2019).Although numerous functions such as virulence, bacterial competition, and host interaction have been attributed to the T6SS (Morgado and Vicente 2022), a recent study has suggested its involvement in biofilm formation in Pseudomonas aeruginosa (Chen et al. 2020).Within our analysis, we found recombination regions in 10 T6SS-related genes (supplementary tables S2 to S28 and S38, Supplementary Material online), including: impA encoding for the T6SS component TssA (ImpA) in the nodes 33 (252 bp) and 46 (98 bp), impC encoding for the T6SS component TssC (ImpC/VipB) in the strains IFB5597 (113 bp) and WC19161 (113 bp), impE encoding for the T6SS associated component TagJ (ImpE) in the node 49 (144 bp), impF encoding for the T6SS lysozymelike component TssE in the nodes 40 (76 bp), 42 (261 bp), 45 (114 bp), 48 (375 bp), 56 (261 bp) and in the strains IFB5597 (364 bp), WPP163 (76 bp), and WC19161 (129 bp), vasA ecoding for the T6SS component TssF (ImpG/VasA) in the strain IFB5597 (30 bp), vasC encoding for the T6SS fork head associated domain protein ImpI/ VasC in the node 44 (141 bp), vasD encoding for the T6SS secretion lipoprotein TssJ (VasD) in the node 37 (59 bp), vasK encoding for the T6SS component TssM (IcmF/VasK) in the node 44 (93 bp), vasL encoding for the Type VI secretion-related protein VasL in the nodes 46 (63 bp) and 37 (79 bp) as well as in the strain NY1587A (138 bp), stk1 encoding for the T6SS Serine/threonine protein kinase PpkA in the node 52 (104 bp).Additionally, ClonalFrameML detected recombination events in the genes encoding for the products haemolysin-coregulated protein (Hcp) in the nodes 40 (306 bp) and 43 (48 bp), in the strains HC (12 bp) and SS90 (93 bp), and the valineglycine repeat protein (VgrG) in the nodes 36 (220 bp) and 40 (220 bp) (supplementary tables S3 to S9 and S29 and S33, Supplementary Material online).Both Hcp and VgrG are considered as major effector proteins injected from T6SS into host cells (Pukatzki et al. 2006;Mattinen et al. 2007).Importantly, the ImpC protein, a T6SS element showing recombination in our analysis, has been identified as a crucial component of the T6SS, playing a role in virulence and participating in the intracellular host response in Pseudomonas syringae pv.tomato (Sarris et al. 2010).Moreover, T6SS gene clusters are often found within predicted genomic islands, suggesting the potential of this system for HGT between bacteria (Morgado and Vicente 2022).Nykyri et al. (2012) reported 2 T6SS in the genome of the Finland strain P. parmentieri SCC3193, with one located on a genomic island.Our findings further underscore that in P. parmentieri, homologous recombination seems to be another important mechanism in the evolution of the Impact of Homologous Recombination on Core Genome of P. parmentieri T6SS core genes.Consistent with our findings, frequent homologous recombination has been observed in Vibrio cholera and is considered a major contributor to the high diversity of T6SS effector genes (Kirchberger et al. 2017).Comprehensive studies are warranted to establish a conclusive understanding of the specific role of homologous recombination in the evolution of the T6SS in P. parmentieri.
The secretion of CWDE is the most well-recognized and the most studied pathogenicity determinant in pectinolytic bacteria of the genera Pectobacterium and Dickeya; indeed, efficient production of CWDE is directly linked with soft rot and black leg symptoms (Nykyri et al. 2012;Charkowski 2018;Arizala and Arif 2019;Boluk et al. 2021;Arif et al. 2022).In our study, recombining signatures were detected mainly in the pelX gene (supplementary tables S6, S11, S26 to S29, Supplementary Material online), encoding for an exopolygalacturonate lyase, in the strains IFB5597 (267 bp), WC19161 (225 bp) and HC (91 bp) as well as in the ancestral nodes 40 and 45 (91 bp).The pectin methylesterase, pemB, also exhibited recombination in the node 49 (44 bp; supplementary table S15, Supplementary Material online), while in the case of the oligogalacturonides degradation, the kdfF (pectin degradation protein KdgF) locus underwent recombination in the Wisconsin strain WPP163 (81 bp; supplementary table S27, Supplementary Material online) and in the ancestral nodes 42 (81 bp; supplementary table S8, Supplementary Material online) and 45 (91 bp; supplementary table S11, Supplementary Material online).In agreement with our analysis, recombination regions have also been encountered in the CWDE-genes of the pathogen Xylella fastidiosa (Potnis et al. 2019).We also observed recombination in genes encoding for proteases and peptidases.For instance, a recombinant fragment of 98 bp was identified in the gene pqqL (putative zinc protease) in the node 51 (supplementary table S17, Supplementary Material online).On the other hand, 3 loci (citC, dcuC, citF) involved with citrate metabolism, uptake and regulation showed evidence of recombination.The genes citC (citrate [pro-3S]-lyase ligase) and citF (citrate lyase alpha chain) displayed recombination fragments of 78 and 321 bp, only in the strain HC and node 48, respectively (supplementary tables S14 and S29, Supplementary Material online).Conversely, dcuC has undergone recombination in 4 strains (SCC3193, NY1588A, IFB5605, and IFB5432; supplementary tables S34, S37, S41, and S43, Supplementary Material online) and 4 nodes (33, 46, 49, and 57; supplementary tables S2, S12, S15, and S23, Supplementary Material online).The New York strain NY1588A harbored the largest recombined fragment of 1,263 bp (supplementary table S37, Supplementary Material online).The citrate uptake system was reported to be critical for bacterial virulence of P. atrosepticum by reducing the citrate concentration during colonization in potato tubers (Urbany and Neuhaus 2008).Our findings highlight the relevance of homologous recombination in the evolution of the citrate uptake system and its potential association with the adaptation of certain P. parmentieri strains to host environments containing high citrate levels.The 3-hydroxy-2butanone (3H2B-acetoin, butanediol) pathway has been characterized as another relevant factor for P. carotovorum pathogenesis by promoting media alkalization and, consequently, favoring plant cell maceration (Marquez-Villavicencio et al. 2011).In our analysis, the strains IFB5597 (36 bp), WC19161 (60 bp), and ancestral nodes 40 (147 bp) and 45 (60 bp) experienced recombination events in the gene product acetolactate synthase large subunit (supplementary table S6, S11, S26 to S28, Supplementary Material online).
The Flp/Tad (Fimbrial low-molecular-weight protein/ Tight adherence protein) pilus, encoded by the flp/tad genes, was characterized as another novel virulence determinant in Pectobacterium due to its implication with maceration of potato tubers (Nykyri et al. 2013).ClonalFramelML revealed recombination events in 4 genes of the flp/tad biogenesis cluster (supplementary tables S2 to S8 and S16 to S31, Supplementary Material online).The tadA (type II/IV secretion system ATP hydrolase TadA/ VirB11/CpaF, TadA subfamily) gene showed recombination in the node 37 (96 bp), 42 (66 bp), 57 (79 bp), and strain WC19161 (102 bp).Recombination in the rcpC (Flp pilus assembly protein RcpC/CpaB) gene was found in the nodes 40 (42 bp), 50 (52 bp), and in the strains HC (85 bp) and IF5626 (52 bp).The tadC (type II/IV secretion system protein TadC, associated with Flp pilus assembly) gene displayed recombination regions in the nodes 33 (21 bp) and 52 (21 bp), whereas recombination within the rcpB (Flp pilus assembly protein RcpB/CpaD) locus was predicted only in the node 57 (32 bp).A comparative genomics study showed that the flp/tad cluster is harbored by all Pectobacterium species (Arizala and Arif 2019).The discovery of recombination signals in some of the flp/tad genes highlights how this system is evolving via genetic exchange in P. parmentieri, presumably linked to the host adaptation of the pathogen to efficiently macerate tissues.
The flagella-encoding cluster has been shown to be pivotal for motility, colonization, and virulent lifestyle in numerous pathogenic bacteria, including Pectobacterium (Toth et al. 2006;Nykyri et al. 2012;Arizala and Arif 2019).Three flagella-genes have undergone recombination, including fliD, fliK, flgI (supplementary tables S14 to S16 and S26 to S29 and S41, Supplementary Material online).While recombinant fragments of 155 bp located in node 50 and 300 bp in strain IFB5605 were found in the loci fliD (flagellar cap protein FliD) and fliK (flagellar hook-length control protein FliK), respectively, the flgI (flagellar P-ring protein FlgI) gene displayed recombination in 4 strains (IFB5597-286 bp, WPP163-286 bp, WC19161-393 bp, and HC-387 bp) and 2 nodes (48 to 279 bp and 50 to 99 bp).In a former study, the flagellar genes were shown to participate in chemotaxis, motility, and were fundamental for pathogenesis of D. dadantii (Jahn et al. 2008).Moreover, a reduced virulence was observed in tobacco plants infected with P. parmentieri SCC3193 mutants lacking motility (Pirhonen et al. 1991).In another report, the flagellum was related t the secretion of colin, a toxin released to kill competitors in the niche, thereby suggesting an antagonist role of the flagellar apparatus in Pectobacterium (Charkowski et al. 2012).Finally, the gene predicted as pgaC, encoding for the biofilm PGA (polygamma-glutamic acid) synthesis N-glycosyltransferase PgaC protein, showed a recombination region of 246 bp in the strain WC19161.PgaC was demonstrated to participate in host-bacteria interactions and modulate biofilm formation in the human pathogen Klebsiella pneumoniae (Chen et al. 2014).

Insights into Ancient and Recent Recombination Events in P. parmentieri
To examine the ancestral and recent recombinogenic regions between and within the 7 lineages (inferred by RhierBAPS-Fig.1b), we used the fastGEAR software (Mostowy et al. 2017), utilizing the core-genome alignment obtained from the Roary pipeline as input data.Ancestral recombinations are defined as recombinant regions present in all strains belonging to the lineage (Mostowy et al. 2017).Since the lineage 7 contained the largest number of strains within the P. parmentieri population, according to the principle of parsimony, this lineage was assigned as the main donor, while the other lineages (1 to 6) served as recipients for the ancestral recombination analysis.Thus, excluding lineage 7, all other lineages have undergone recombination, as depicted by Fig. 3a, FIG.3.-Ancestral recombination events identified using fastGEAR across the core genome of 32 P. parmentieri strains are illustrated.a) ML phylogeny, using a bootstrap support of 1,000 replicates, is illustrated on the left, whereas a panel showing the recombination sites and the population genetic structure is located on the right.The y-axis on the panel displays the 32 strains present in the core genome alignment, and the x-axis displays the sequence positions.Strains on the y-axis were positioned according to the ML phylogeny.Seven lineages previously defined with RhierBAPS were used in the analysis.The vertical color-coded bar on the right side of the panel shows the division of the 32 strains into the 7 predicted lineages.Each lineage represents a specific clade and is labeled and colored as indicated in the supplementary fig.S2, Supplementary Material online.The black color denotes recombination from outside origin of any lineage in the data set.The presence of different color lines within a lineage in the frame pinpoints ancestral recombination event sites occurred respect to the sequence position.White gaps in the panel show the recent recombination events that were removed before the analysis of ancestral recombination.b) Horizontal bar graphic illustrating the number of ancestral recombinations that were received in each lineage.The bars were colored based on the color of the lineages previously assigned in Fig. 1b.
where a mosaic-like pattern across the core genome is evident.In total, 486 ancient recombination events were detected in the core genome obtained from the 32 P. parmentieri isolates (Fig. 3b).
The overall size of ancestral recombination events ranged from 43 to 119 kb, with a median length of 4.8 kb.Lineage 2 exhibited the highest number of recombinations, totaling 126 events originating from all 6 lineages (lineage 2-16, lineage 3-14, lineage 4-1, lineage 5-23, lineage 6-19, and lineage 7-22).This finding highlights the 3 New York strains (NY1584A, NY1532B, NY1587A) belonging to the lineage 2 as highly ancient recombinants.In contrast, lineage 5 presented 114 ancestral recombination events, with fragments donated by lineages 6 (80 events) and 7 (34 events).Notably, this lineage transferred the highest number of recombinogenic regions (191 events) to the other lineages, potentially resulting from coexistence of this lineage with the other ones in the same ecological niche, promoting DNA recombination.In Xyllela fastidiosa, for instance, homologous recombination was reported in the South American strains sharing the same geographic area (Coletta-Filho et al. 2017).Lineage 4 has experienced the lowest level of ancestral recombination events, with 1 and 5 fragments donated by lineages 5 and 4, respectively.In the case of lineage 6, 41 events were detected, originating from lineage 7.
Regarding recent recombinations, these events refer to recombinant regions present in a subset of strains within a lineage (Mostowy et al. 2017).Similar to the ancestral recombination plot, a mosaic like-pattern indicative of recent recombination signals was observed in the core genome of most of the strains (Fig. 4a).A total of 941 unique events were detected, with some events repeating across the strains, resulting in a total of 1,177 events (Fig. 4b).Upon scanning these recent recombination regions, we identified 412 genes that have undergone recent recombination, which represents 11% of the P. parmentieri core genome.The recombinant fragments displayed an average length of 1.38 kb, ranging from 36 bp to 13.96 kb.The 2-fold number of recent recombination events compared to the ancestral ones highlights that P. parmentieri is continuously evolving and adapting to new environments.This observation aligns with the widespread distribution of this bacterium reported in many countries in recent years (Pitman et al. 2010;Pasanen et al. 2013;Zoledowska et al. 2018b;Arizala et al. 2019).No signs of homologous recombination could be inferred in the strains IFB5408, RNS 08-42-1A, NY1722A, IFB5427, IFB5623, and IFB5441.In contrast, the strains IFB5626, IFB5597, HC, and NY1584A showed high levels of recombinants with 140, 122, 113, and 87 events, respectively (Fig. 4b).Strains CFIA1002, QK-5, and SS90 experienced a relatively low rate of recombination with 9, 10, and 13 events, respectively.Lineage 1 appeared as the major donor with 272 events (Fig. 4c), donating mainly to the strains IPO1955 (24%), IFB5626 (21%) and NY1584A (18%).On the contrary, 5% of the total recent recombination events were categorized as outside origins (Fig. 4c) and identified in 10 strains: HC, IFB5485, IFB5486, IFB5597, IFB5626, NY1588A, PB20, SCC3193, WC19161, and WPP163.These outside origin events are defined as recombined regions originating from outside the population or not corresponding to any of the lineages found in the population (Mostowy et al. 2017).Strains NY1588A and IFB5597 led the highest position in this category, each exhibited 15 and 16 outside origin events, respectively.Previous studies have documented the coexistence of soft rot bacteria from the genera Pectobacterium and Dickeya in the same niche (Degefu 2021;Ge et al. 2021;Kastelein et al. 2021).Considering this, it seems logical that these external origin regions might have been incorporated in the genomes of these 9 P. parmentieri strains through genetic exchange with either other Pectobacterium species or members of the Dickeya genus.
Subsequently, we traced and identified the coordinates and functional implications of all gene regions affected by ancestral (supplementary tables S45 to S50, Supplementary Material online) and recent recombination events (supplementary tables S51 to S76, Supplementary Material online).Genes associated with cellular functions, including DNA repair, DNA replication, RNA methylation, RNA processing and degradation, tRNA aminoacylation, cell respiratory pathways, dehydrogenase complexes, purine and pyrimidine conversions, fatty acid biosynthesis, redox cycle, lactose, and galactose utilization, sucrose utilization, glycogen metabolism, and oxidative stress have been affected by both ancestral and recent recombination events (Tables 2 to 3).This underscores the potential significance of homologous recombination in the evolution of P. parmentieri.Recombinant fragments from recent and ancestral events were also observed in genes encoding amino acids and vitamins biosynthesis, phosphate metabolism, copper, and potassium homeostasis, nitrite and nitrate ammonification, carbon starvation, nitrosative stress, and ammonia assimilation (Tables 2 to 3).In X. fastidiosa and Xanthomonas species, recombination in these loci has been associated with an adaptability mechanism for surviving in niches with limited nutrient sources (Chen et al. 2018;Potnis et al. 2019).
Secretion of CWDE is crucial for host colonization and disease development in soft rot bacteria (Toth et al. 2006;Nykyri et al. 2012;Charkowski 2018;Arizala and Arif 2019).In P. parmentieri, 9 genes (pelI, pehA, pelA, pelB, pelC, pelL, pehX, pelX, pemA) were identified as being affected by ancestral recombination events in lineages 1, 2, 5, and 6 (Table 2).Recombination in genes pelA, pelB, and pelC, which encode for pectate lyases, was observed in lineages 2 and 5, while the other genes experienced recombination event in at least one lineage.Previous studies on the activity of CWDE and potato maceration efficacy have revealed significant statistical differences among P. parmentieri strains (Ngadze et al. 2012;Moleleki et al. 2013;Dees et al. 2017;Zoledowska et al. 2018a;Zoledowska et al. 2018b).The evidence of ancestral recombination found in our study in some of the CWDE genes might be related to the reported variation in the ability to macerate potato tissue, as well as the high differences displayed in CWDE activities within the P. parmentieri intraspecies.
Flagellum is another detrimental pathogenicity associated factor specially in soft rot bacteria (Charkowski et al. 2012).Mutations in flagella genes have led to a reduced virulence in Dickeya (Jahn et al. 2008).In P. carotovorum Pcc21, mutants with defective flagella biosynthesis in the genes flgA and fliA produced less than 50% of biofilm in comparison with the wild type (Lee et al. 2013).Our analysis found ancestral recombination occurring in 12 components (FliK, FlgA, FlgB, FlgC, FlgD, FlgE, FlgF, FlgG, FlgI, FliD, FliE, and FliF) of the flagellum biogenesis cluster in the lineage 1, whereas lineages 6, 2, and 3 experienced recombination in less number of genes (6 and 3 genes, respectively) (Table 2).In addition, recent recombination events (Table 3) were also observed within the coding sequences FliK (in strains HC, IFB5605, and WPP163) and FliO (NY1584A).These results indicate that the recombination seen in the flagellum biosynthesis cluster may be connected to the pathogen's ability to adapt to various host conditions, given that the flagellum is a crucial component for ensuring a successful host colonization.In synchronization with the data obtained from the ClonalFrameML analysis, the ancestral and recent recombinations derived from the fastGEAR algorithm also put in evidence that other distinct and highly important pathogenicity determinants, such as genes encoding for components of the T1SS (type I secretion system), T6SS, T1SS for aggregation, type IV pilus biogenesis, the type IVb Flp/Tad cluster, butanediol Important ancestral recombination events predicted in the core-genome of Pectobacterium parmentieri.The genes where the recombination events were identified are listed per subsystem and their product name is described as annotated in the RASTtk server Arizala and Arif

GBE
metabolism, citrate metabolism, siderophore production, and polysaccharide biosynthesis, have undergone to homologous recombination (Tables 2 and 3).These results seem to indicate that homologous recombination has a pivotal implication in the pathogenic lifestyle of P. parmentieri.Notably, gene regions encoding for antimicrobial products, namely phenazine and colicin, were also affected by ancestral and recent recombination events, respectively (Tables 2 and 3).Phenazines are secondary metabolites produced by bacteria and are associated with multiple roles, namely, biofilm formation, enhanced virulence, host colonization, and as weapon against other competitors like fungi or bacteria (Pierson and Pierson 2010).Colicins, on the other hand, are proteins produced by bacteria to kill other closely related members, giving them an advantage over other bacteria in the race for nutrients (Grinter et al. 2012;Arizala and Arif 2019).P. carotovorum Pcc21, for instance, has been reported to secrete a colicin-like bacteriocin product, Carocin D, which inhibited the growth of other Pectobacterium members (Roh et al. 2010).The fact that we found recombination within genes involved with the synthesis of these 2 antimicrobial compounds highlights the contribution of homologous recombination in the ecological fitness of P. parmentieri.
Moreover, our fastGEAR output showed recombination within 2 CRISPR-Cas adaptative immune systems of P. parmentieri.In the CRISPR-Cas subtype I-F system, ancestral recombination patterns were shown within the genes cse4 in lineage 1 as well as csy2, and csy3 in lineage 2 (Table 2).In the CRISPR-Cas subtype I-E system, recent recombination events were observed within the cas3 gene of the strains NY1584A and NY1712A (Table 3).These findings suggest that homologous recombination seems to have participated in the evolution of the CRISPR-Cas system of P. parmentieri.This is consistent with earlier research that pointed recombination events spanning the genetic exchange of several cas gene regions (Touchon and Rocha 2010;Takeuchi et al. 2012).

Conclusions
In the present study, we assessed the impact of homologous recombination on the core-genome of the newly, but widely spread, described species P. parmentieri.Clear evidence of diverse recombination patterns was found, with data indicating variations in the sizes of DNA recombinant fragments occurring randomly across the core genome of the 32 P. parmentieri isolates.The fastGEAR analysis identified 941 recent recombination events and 486 ancestral recombination events.The ancestral recombination data suggested that strains from lineages 1st to 6th evolved from strains of the 7th lineage.Importantly, several ancient recombinant regions (3,094) were inferred in the reconstructed ClonalFrameML genealogy tree in the main nodes, giving origin to different Numbers 1 to 7 refer to the lineages detected by the population structural analysis hierBAPS; for instance, 1 means to all strains belonging to Lineage 1.
a Numbers between parenthesis in column 3 indicates the number of ancestral recombination events identified in each of the recipient lineages (1 to 6) for each gene product shown in column 2. For instance, 3 (1) means that Lineage 3 in that specific gene product presented 1 recombination event.
b Presence of numbers in column 4 "Donor Lineages" points which lineage acted as donor (1 to 7) or donated the recombination segment to its respective recipient lineage (1 to 6) displayed in column 3. Subcolumns 1 to 6 (located in panel beneath "Donor Lineages" header) refers to the respective recipient lineages provided in column 3.For instance, a number 6 in the sub-column 1 within the "Donor Lineage" column means that the Lineage 6 donated that recombination event to the Lineage 1.
c Numbers separated by a comma in the subcolumns (1 to 6) within the "Donor Lineages" column indicates that 2 or more lineages acted as donors.For instance, "6,7" within the subcolumn 1 means that Lineages 6 and 7 acted as donors for that recombination event in the Lineage 1.
d Column 5 indicates the largest recombined fragment in base pair (bp) found for that gene product provided in column 2.
Impact of Homologous Recombination on Core Genome of P. parmentieri

Table 3
Summary of main recent recombination events predicted in the core-genome of Pectobacterium parmentieri.The genes where the recombination events were identified are listed per subsystem and their product name are described as annotated in the RASTtk server  (1), PB20 (1), IFB5604 (1), NY1548A (1), NY1585A   Numbers between parenthesis in column 3 indicates the number of recent recombination events identified in each of the recipient strains for each gene product shown in column 2. For instance, IFB5626 (1) means that strain IFB5626 presented 1 recombination event for that specific gene product.
b L1 to L7 in column 4 "Donor Lineages" points which lineage acted as donor or donated the recombination segment to its respective recipient strain whereas O, refers to recombination events categorized as outside origin or not corresponding to any of the lineages identified in the P. parmentieri population structural analysis.The order of the letter coding (L1 to L7, O) separated by a comma within column 4 in each row for each gene product (column 2) corresponds to the same order of the recipient lineages as listed in each row displayed within column 3.For instance, in row 1 within column 4, "L2, L3" indicates that lineages 2 and 3 acted as donors for recombination events in the strains IFB5626 and IFB5605 (row 1 in column 3), respectively.
c Column 5 displays the largest recombined fragment in base pair (bp) found for that gene product provided in column 2.
d Letter coding (L1 to L7, O) separated by a "&" symbol within the "Donor Lineages" column indicates that 2 or more lineages acted as donors.For instance, "L1&L2" in row 2 within column 4 means that Lineages 1 and 2 acted as donors for the two recombination events detected in strain HC (row 2 in column 3) for the gene product "soluble pyridine nucleotide transhydrogenase".
clades.Genes affected by homologous recombination are associated with vital cell functions such as replication, transcription, metabolism, and homeostasis.Strikingly, recombination was also observed within gene loci categorized as pathogenicity determinants, including type secretion systems, iron acquisition, flagellum biosynthesis, CWDE, citrate and butanediol metabolism, type IV pilus biogenesis, etc.Recombinant fragments were also present in sequences encoding antimicrobial metabolites (phenazine and colicin) and genes of the CRISPR-Cas systems.These findings highlight the significant role of homologous recombination in shaping P. parmentieri genomes, influencing evolution, strain diversity, lifestyle, pathogenicity, adaptative immunity, and ecological fitness of this phytopathogenic bacterium.

Isolate Collection
In total, 32 P. parmentieri strains from different geographic origins and isolated in distinct years were used in this study (supplementary table S1, Supplementary Material online).All genomes (17 complete and 15 draft genomes) available in the NCBI GenBank database as of the time of accession (2022 September 14) were retrieved in FASTA (Fast-All) format.The accession numbers used to retrieve the genomic data of the 32 isolates are provided in supplementary table S1, Supplementary Material online.

Core-genome Analysis
All 32 genomes in FASTA format were annotated using the rapid prokaryotic genome annotation tool Prokka (Seemann 2014).The annotated genomes in GFF3 (general feature format) were used as input data to carry out the pan-genome analysis in the Roary pipeline (Page et al. 2015) with a minimum BLASTP percentage identity of 90%.A multi-FASTA alignment of the previously identified core genes was generated using PRANK (Löytynoja 2014).

Population Genetic Structure, Genealogy Reconstruction, and Inference of Homologous Recombination
The core-genome alignment generated with Roary for the 32 P. parmentieri isolates was used as input file to determine the intraspecies population structure using the hierarchical Bayesian Analysis of Population Structure (hierBAPS; Cheng et al. 2013) implemented in R (RhierBAPS;Tonkin-Hill et al. 2018).Each strain was assigned to its specific lineage genetic cluster (lineage).
Later, the generated core genome served to compute the ML phylogenetic tree.The inferred phylogeny was constructed using the General Time-Reversible GAMMA (GTR-GAMMA) distribution model with 1,000 bootstraps replicates.The analysis was carried out using the RAxML-NG (RAxML Next Generation) tool (Kozlov et al. 2019).The RAxML-NG ML tree was visualized and plotted in FigTree v1.4.4 (http://tree.bio.ed.ac.uk/software/figtree/) and ordered based on increasing order of nodes.
ClonalFrameML (Didelot and Wilson 2015) analysis was deployed to reconstruct the genealogy, infer homologous recombined fragments and polymorphisms sites within the branches and nodes across the core genome of P. parmentieri.The CLonalFrameML phylogeny was inferred using the previous RAxML-NG ML tree as the starting topology tree and the core genome alignment was obtained using ROARY.Recombination parameters in ClonalFrameML were calculated using the hidden Markov model (HMM) based on 100 simulations (emsim = 100) to refine the reliability of the analysis.Moreover, the impact of introduced recombination events relative to single-point mutations (r/m) was calculated for each branch in ClonalFrameML.

Inference of Recent and Ancestral Recombination Events
The origin, number, and location of recent and ancestral recombination events within the P. parmentieri core-genome was predicted using the novel algorithm fastGEAR, which provides information about lineages across microbial alignments and identifies recombination sites between the studied isolates as well as from external origins (Mostowy et al. 2017).A pre-specified partition file containing the number of lineages obtained in RhierBAPS was used in the fastGEAR analysis instead of the BAPS3 clustering algorithm.Once the lineages were identified in the core-alignment, recent recombinations within each lineage were detected by applying a HMM approach to all subsets of strains assigned to that lineage.Ancestral recombinations were analyzed between lineages and using the same HMM approach.Regions inferred as recent recombination were removed prior to the analysis of ancestral recombination.Bayes factor >1 and >10 were used to evaluate the statistical significance of putative recent and ancestral recombination events.Additionally, the order of the 32 P. parmentieri strains shown in the recent and ancestral recombination plots was specified according to the phylogenetic tree generated in RAxML-NG.To extract the coordinates of the recombined regions and determine the annotation of genes that underwent recent recombination, the next 3 steps, modified from Potnis et al. (2019), were followed.First, the recombination sites obtained from fastGEAR were organized into specific BED (Browser Extensible Data) files for each P. parmentieri strain as well as for each node identified by ClonalFrameML (supplementary fig.S1, Supplementary Material online).Second, the recombined fragments were extracted from their respective genomes using the GetFastaBed tool.Finally, the extracted sequences were submitted to a nucleotide Basic Local Alignment Search Tool (BLASTn) analysis against a customized database of its corresponding P. parmentieri annotated core-genome.The coregenomes were annotated using the Rapid Annotation using Subsystem Technology server (RASTtk; Brettin et al. 2015).The custom BLAST analysis was performed in Geneious Prime.The same approach was followed to determine the genes affected by ancestral recombination in each of the lineages as well as to locate the recombinant fragments inferred by ClonalFrameML.

FIG. 1 .
FIG.1.-Pan-coregenome and population structure analyses in P. parmentieri.a) Pie chart displays the proportion of core, soft-core, shell, and cloud genes in the 32 P. parmentieri genomes analyzed in the Roary pipeline.b) ML tree depicting the population structure analysis of 32 P. parmentieri strains.The phylogenetic tree was built using iTOL v6 (https://itol.embl.de/)and the tips of each branch were annotated and colored according to the level clusters identified using RhierBAPS and for matching the colors used by fastGEAR.

FIG. 4 .
FIG. 4.-Recent recombination events identified using fastGEAR across the core genome of 32 P. parmentieri strains are illustrated.a) The y-axis on the panel displays the 32 strains present in the core genome alignment, and the x-axis displays the sequence positions.Strains on the y-axis were positioned according to the ML phylogeny depicted on the left.Seven lineages previously defined with RhierBAPS were used in the population clustering analysis.The vertical color-coded bar on the right side of the panel shows the division of the 32 strains into the 7 predicted lineages.Each lineage represents a specific clade and is labeled and colored as indicated in the supplementary fig.S1, Supplementary Material online.Black color denotes recombination from outside origin of any lineage in the data set.The presence of different color lines within a lineage in the frame pinpoints recent recombination event sites occurred respect to the sequence position.b) Bar graphic depicting the number of recent recombinations received by each strain.The bars were color-coded based on the color of the lineages assigned in supplementary fig.S2, Supplementary Material online.ML tree based on the core genome alignment of 32 P. parmentieri strains.The positions and names of all nodes assigned by ClonalFrameML are indicated and highlighted with different colors.The tree was visualized using FigTree v1.4.4 and midpoint rooted.c) Pie chart showing the number of recent recombinations percentagewise donated by each lineage and outside of the P. parmentieri population.