Todd A Smith, Timothy Driscoll, Joseph J Gillespie, Rahul Raghavan, A Coxiella-Like Endosymbiont Is a Potential Vitamin Source for the Lone Star Tick, Genome Biology and Evolution, Volume 7, Issue 3, March 2015, Pages 831–838, https://doi.org/10.1093/gbe/evv016
Abstract
Amblyomma americanum (Lone star tick) is an important disease vector in the United States. It transmits several human pathogens, including the agents of human monocytic ehrlichiosis, tularemia, and southern tick-associated rash illness. Blood-feeding insects (Class Insecta) depend on bacterial endosymbionts to provide vitamins and cofactors that are scarce in blood. It is unclear how this deficiency is compensated in ticks (Class Arachnida) that feed exclusively on mammalian blood. A bacterium related to Coxiella burnetii, the agent of human Q fever, has been observed previously within cells of A. americanum. Eliminating this bacterium (CLEAA, Coxiella-like endosymbiont of A. americanum) with antibiotics reduced tick fecundity, indicating that it is an essential endosymbiont. In an effort to determine its role within this symbiosis, we sequenced the CLEAA genome. While highly reduced (656,901 bp) compared with C. burnetii (1,995,281 bp), the CLEAA genome encodes most major vitamin and cofactor biosynthesis pathways, implicating CLEAA as a vitamin provisioning endosymbiont. In contrast, CLEAA lacks any recognizable virulence genes, indicating that it is not a pathogen despite its presence in tick salivary glands. As both C. burnetii and numerous “Coxiella-like bacteria” have been reported from several species of ticks, we determined the evolutionary relationship between the two bacteria. Phylogeny estimation revealed that CLEAA is a close relative of C. burnetii, but was not derived from it. Our results are important for strategies geared toward controlling A. americanum and the pathogens it vectors, and also contribute novel information regarding the metabolic interdependencies of ticks and their nutrient-provisioning endosymbionts.
Introduction
Nutritional symbiosis is widespread in insects that depend on unbalanced diets (Buchner 1965). Endosymbiotic bacteria provision essential amino acids to insects that feed on plant sap that has low levels of essential amino acids (Shigenobu et al. 2000; Baumann 2005; Nakabachi et al. 2006; McCutcheon et al. 2009; McCutcheon and Moran 2010; Wernegreen 2012; Bennett and Moran 2013). Similarly, obligate blood-sucking insects, including tsetse fly, human body louse, and bedbug depend on bacterial endosymbionts for vitamins and cofactors that are lacking from mammalian blood (Akman et al. 2002; Kirkness et al. 2010; Sassera et al. 2013; Nikoh et al. 2014). These so-called primary endosymbionts live in specialized host cells and are transmitted vertically (Baumann 2005; Moran et al. 2008; Wernegreen 2012). Another form of nutritional symbiosis is also found in certain insects (e.g., termites and cockroaches) where a complex community of gut microbes provides all needed nutrients, enabling the host to live on a nutrient-poor diet, such as wood (Warnecke et al. 2007; Sabree et al. 2012). Unlike blood-feeding insects (Arthropoda; Class Insecta), it is not clear how ticks (Arthropoda; Class Arachnida) that also feed exclusively on vertebrate blood overcome a diet lacking essential vitamins and cofactors. Several bacteria that are suspected to be endosymbionts have been detected from various tick species (Rounds et al. 2012), but it is not known whether any can provide the required nutrients lacking in a bloodmeal.
Amblyomma americanum, a hard-backed tick common in the southeastern United States, is an important disease vector that transmits Ehrlichia chaffeensis, Francisella tularensis, and Borrelia lonestari, the etiologic agents of human monocytic ehrlichiosis, tularemia, and southern tick-associated rash illness, respectively (Childs and Paddock 2003). An intracellular “Coxiella-like bacterium” has previously been detected in various tissues (salivary gland, gut, and ovaries) of all field-caught and lab-reared A. americanum (Jasinskas et al. 2007; Klyachko et al. 2007). Interestingly, eliminating this bacterium with antibiotics caused severe reduction in tick fecundity and fitness, suggesting an important role for the bacterium in promoting host fitness (Zhong et al. 2007). We sequenced the whole genome of this Coxiella-like endosymbiont of A. americanum (CLEAA) and determined that it is closely related to Coxiella burnetii but was not directly derived from it, as previously assumed (Zhong 2012). Although the CLEAA genome is severely reduced, it is replete with metabolic pathways for the biosynthesis of several vitamins and cofactors that are scarce in vertebrate blood. Thus, CLEAA may provide A. americanum with essential nutrients not obtained in its diet.
CLEAA Is Highly Prevalent in A. americanum
A 16S rRNA gene diversity analysis of DNA isolated from four adult female A. americanum ticks revealed that over 95% of reads from all four ticks were from a bacterium closely related to C. burnetii (fig. 1). Additionally, as reported earlier, Rickettsia and Brevibacterium species were also present in all four ticks but at a significantly lower prevalence than CLEAA (Klyachko et al. 2007; Clay et al. 2008; Heise et al. 2010). Although Staphylococcus was also detected in all ticks, this common human skin bacterium was most probably introduced during tick handling or sample preparation.
Fig. 1. CLEAA is highly abundant in A. americanum. Abundance of Coxiella-like bacteria in four lab-reared female A. americanum ticks is depicted. The y axis denotes the number of sequencing reads obtained from each tick.
CLEAA Is a Sister Taxon of C. burnetii
Previous studies based on 16S rRNA analyses have indicated that CLEAA was derived from C. burnetii, which is occasionally vectored by ticks, including A. americanum (Childs and Paddock 2003; Zhong et al. 2007). Our phylogeny estimation based on 161 orthologous proteins across 34 gammaproteobacteria placed CLEAA as a sister taxon to C. burnetii (fig. 2). A phylogeny estimated from 16S rRNA genes also supported the finding that CLEAA is closely related to C. burnetii, but not directly derived from the human Q fever pathogen (fig. 3). Additionally, our analyses revealed that a large group of Coxiella-like bacteria are present in a variety of hard and soft ticks, suggestive of an ancient symbiotic relationship from which both CLEAA and C. burnetii were derived (fig. 3). Intriguingly, as shown recently for order Rickettsiales (Kang et al. 2014), a common ancestor of CLEAA and Coxiella appears to have an aquatic origin (fig. 3).
Fig. 2. CLEAA is a sister taxon of Coxiella. Phylogenetic tree based on 161 orthologous genes (supplementary table S3, Supplementary Material online) in CLEAA and 33 fully sequenced bacterial genomes.
Fig. 3. Phylogeny estimation of Coxiellaceae 16S rRNA genes. Number of taxa collapsed into each branch is shown within parentheses. Dotted lines depict branches that have been extended for clarity. Taxa colored red depict probable Coxiella-like organisms erroneously named as species of Legionella. Coxiella burnetii is highlighted in blue and CLEAA is boxed in yellow.
CLEAA Genome Is Highly Reduced and Does Not Contain Coxiella Virulence Genes
The complete genome of CLEAA is severely reduced in size when compared with that of C. burnetii (fig. 4 and supplementary fig. S1, Supplementary Material online). It consists of a circular 656,901 bp chromosome that is predicted to encode 537 protein-coding genes, 39 tRNAs, and a single rRNA operon (table 1). Similar to Coxiella, central metabolic processes such as glycolysis, the nonoxidative branch of the pentose phosphate pathway, the TCA cycle, and nucleotide biosynthesis are largely intact in CLEAA; however, only approximately 30% of Coxiella’s gene content has been retained in the CLEAA genome. The small size, low G+C content (34.6%), low gene density (83%), and the presence of 23 pseudogenes suggest that the genome has undergone reductive evolution due to CLEAA’s host-dependent lifestyle (McCutcheon and Moran 2012).
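The degree of reduction can be checked with simple arithmetic. The sketch below uses only figures quoted in this article (genome sizes and protein counts); the helper names and the `gc_percent` function are illustrative assumptions and not part of the authors' pipeline. Note that a ratio of protein counts is only a rough proxy for retained gene content, which the authors presumably assessed through orthology.

```python
# Figures quoted in the article; the helper functions are illustrative only.
CLEAA_GENOME_BP = 656_901
CBURNETII_GENOME_BP = 1_995_281
CLEAA_PROTEINS = 537
CBURNETII_RSA493_PROTEINS = 1_818


def fraction(part: int, whole: int) -> float:
    """Return part/whole as a fraction."""
    if whole <= 0:
        raise ValueError("whole must be positive")
    return part / whole


def gc_percent(sequence: str) -> float:
    """Percent G+C in a nucleotide sequence, ignoring non-ACGT symbols."""
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    acgt = gc + seq.count("A") + seq.count("T")
    if acgt == 0:
        raise ValueError("sequence contains no A, C, G, or T")
    return 100.0 * gc / acgt


size_ratio = fraction(CLEAA_GENOME_BP, CBURNETII_GENOME_BP)
protein_ratio = fraction(CLEAA_PROTEINS, CBURNETII_RSA493_PROTEINS)
print(f"genome size ratio:   {size_ratio:.1%}")    # 32.9%
print(f"protein count ratio: {protein_ratio:.1%}")  # 29.5%
```

Applied to the deposited chromosome sequence, a function like `gc_percent` would be expected to return a value near the reported 34.6%, although that has not been verified here.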
Fig. 4. Alignment of C. burnetii RSA 493 and CLEAA genomes. Each contiguously colored locally collinear block (LCB) represents a region without rearrangement of the homologous backbone sequence. LCBs were calculated with the Mauve 2.1.0 aligner (Darling et al. 2010). Lines between genomes indicate orthologous LCBs. LCBs below the center in the Coxiella genome represent blocks in the reverse orientation. LCB regions in Coxiella without homologs in CLEAA are indicated in white.
Table 1. General Genome Features of CLEAA
| Feature | Value |
|---|---|
| Genome size (bp) | 656,901 |
| G+C content (%) | 34.6 |
| Gene density (%) | 83 |
| Protein-coding genes | 537 |
| rRNA genes | 3 |
| tRNA genes | 39 |
| ncRNA genes | 3 |
| GenBank accession | CP007541 |
Previous studies have reported a high density of CLEAA in A. americanum salivary glands, suggestive of an infective life cycle for this bacterium, as has been suggested for other tick-associated bacteria (Ahantarig et al. 2013). A dot/icm type IV secretion system (T4SS) and its secreted effectors are major contributors to the pathogenesis of C. burnetii (Beare et al. 2011) and of other pathogens in the order Legionellales (e.g., Legionella pneumophila and Rickettsiella grylli) that are closely related to C. burnetii and CLEAA. None of the genes that encode either T4SS components or effector proteins is present in CLEAA. Furthermore, no other known virulence genes or bacterial secretion systems are encoded within the CLEAA genome, although an intact Sec translocon (Stead et al. 2013) is present (supplementary table S1, Supplementary Material online).
Vitamin and Cofactor Biosynthetic Capability of CLEAA
Obligate blood-sucking insects such as tsetse fly (Glossina morsitans), body louse (Pediculus humanus humanus), and bedbug (Cimex lectularius) depend on bacterial endosymbionts to provide vitamins and cofactors that are available only in trace amounts in mammalian blood (Akman et al. 2002; Kirkness et al. 2010; Sassera et al. 2013; Nikoh et al. 2014). To determine whether CLEAA can play a similar role in A. americanum, we reconstructed its vitamin and cofactor biosynthesis pathways. As shown in figure 5, CLEAA has complete (or almost complete) pathways for the biosynthesis of several vitamins and cofactors, including folic acid (vitamin B9), riboflavin (B2), pantothenic acid (B5), nicotinamide (B3), pyridoxine (B6), and thiamine (B1), as well as biotin (B7) and lipoic acid, two cofactors synthesized using the fatty acid biosynthesis pathway. As shown for other endosymbionts, enzymes from either the host or from co-occurring endosymbionts might compensate for the enzymes missing from the cofactor biosynthesis pathways in CLEAA (Husnik et al. 2013; Sloan et al. 2014). Interestingly, pabA (and possibly pabB), required for the synthesis of para-aminobenzoic acid (PABA), from which folic acid is synthesized, has been acquired by CLEAA through horizontal gene transfer (HGT) from an alphaproteobacterium (fig. 5 and supplementary fig. S2, Supplementary Material online). This scenario is similar to what has been observed in the biotin-provisioning Wolbachia endosymbiont of bedbug and in a Rickettsia endosymbiont of Ixodes scapularis, where genes in the biotin biosynthesis pathway were gained through HGT (Gillespie et al. 2012; Nikoh et al. 2014). Additionally, an analysis of A. americanum expressed sequence tags (ESTs) deposited in the National Center for Biotechnology Information (NCBI) dbEST database confirmed the expression within A. americanum of CLEAA genes from six of the eight cofactor biosynthesis pathways (supplementary table S2, Supplementary Material online), suggesting a role for vitamin and cofactor provisioning in maintaining the host–endosymbiont relationship.
Fig. 5. Cofactor biosynthetic pathways in CLEAA. Arrows represent each step catalyzed by the named enzymes in each pathway. Cofactors are within yellow boxes. Question marks indicate enzymes for which no genes were found in the CLEAA genome. Green indicates genes involved in the conversion of chorismate to PABA, with phylogeny estimation indicating pabA was acquired from Alphaproteobacteria (supplementary table S2, Supplementary Material online).
Gaining new functions through bacterial endosymbionts allows eukaryotes to expand into novel niches (Moran 2007). Because all life stages of A. americanum feed exclusively on mammalian blood, which contains a limited supply of essential vitamins and cofactors, the presence of a vitamin-provisioning endosymbiont enables the tick to subsist on this nutrient-poor diet. As shown recently in bedbugs (Nikoh et al. 2014), detailed functional analyses are required to determine the specific cofactor(s) provided by CLEAA to its host, but considering the potentially vital function of CLEAA in tick reproduction and development (Zhong et al. 2007), it is surprising that the endosymbiont is not retained within specialized host cells (Klyachko et al. 2007) as observed for many insect primary endosymbionts (McCutcheon and Moran 2012; Wernegreen 2012). The CLEAA–A. americanum relationship could be a fairly recent phenomenon, with CLEAA in the process of transitioning from a facultative endosymbiont to an obligate endosymbiont. However, several species of both soft and hard ticks seem to contain bacteria related to CLEAA, suggestive of an ancient relationship. In addition to providing vitamins, CLEAA might play a role in host immune system development, as shown for Wigglesworthia in tsetse flies (Weiss et al. 2011), and due to its presence in salivary glands CLEAA might be contributing certain metabolites to tick saliva, or this behavior could be a relic from an ancestral sylvatic life cycle. Functional studies of CLEAA and analyses of the genomes of other “Coxiella-like bacteria” detected in ticks from around the world may shed more light on this apparent paradox. Nevertheless, this dependency of the Lone star tick on CLEAA for reproductive success could be exploited in the field for improved control of A. americanum and the pathogens it vectors (Zhong et al. 2007) and further illuminates the complex metabolic interdependencies that exist between arthropods and their nutrient-provisioning endosymbionts.
Materials and Methods
Tick Microbiome Analysis
DNA isolated from four adult female A. americanum ticks (procured from Oklahoma State University Tick Rearing Facility) was amplified using a 16S rDNA polymerase chain reaction (PCR) primer set (V3-V4; 550 bp amplicon; Nextera DNA sample preparation kit, Illumina) and sequenced using a MiSeq system (Illumina). The sequencing reads were trimmed using Trimmomatic v0.30 (Bolger et al. 2014) and approximately 65,000 reads obtained from each sample were searched against the Greengenes 16S rRNA gene database (DeSantis et al. 2006) to identify bacterial species present in each tick.
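Once each read has been assigned to a taxon by a database search, per-tick composition reduces to counting assignments. The following is a minimal sketch of that final step only, assuming the assignments are already available as a list of genus labels; the function name and the toy counts are hypothetical and are not the study's data.

```python
from collections import Counter
from typing import Dict, Iterable


def relative_abundance(assignments: Iterable[str]) -> Dict[str, float]:
    """Return the fraction of reads assigned to each taxon."""
    counts = Counter(assignments)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no read assignments supplied")
    return {taxon: n / total for taxon, n in counts.items()}


# Hypothetical toy assignments for one tick (100 reads), for illustration only.
toy_reads = (
    ["Coxiella"] * 96
    + ["Rickettsia"] * 2
    + ["Brevibacterium"]
    + ["Staphylococcus"]
)
abundance = relative_abundance(toy_reads)
for taxon, share in sorted(abundance.items(), key=lambda kv: -kv[1]):
    print(f"{taxon:15s} {share:.1%}")
```

Running the same function once per tick would give the per-sample proportions that underlie a read-count plot such as figure 1.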
CLEAA Genome Sequencing and Assembly
To analyze the genome of CLEAA, DNA from four female ticks was pooled and sequenced on a single lane of an Illumina HiSeq 2000, which produced approximately 152 million paired-end 100-bp reads. Sequencing reads were trimmed using Trimmomatic v0.30 (Bolger et al. 2014), and approximately 119 million paired reads were aligned to several genomes related to C. burnetii (accession numbers: NC_011527, NC_009726, NZ_AAUP01000001, NC_010115, NZ_AAYJ01000001, NC_002971.3, DQ912980.1, NC_002942.5, AY939824.1, and NZ_AAQJ01000001) to obtain a smaller set of 79,628 paired-end reads, as described earlier (Sloan and Moran 2012). These reads, along with approximately 22 million pairs of original reads, were assembled into six contigs using Velvet (Zerbino and Birney 2008) and SOAP2 (Li et al. 2009), and the gaps were closed using PCR. The full set of trimmed reads was mapped back to the closed genome using Bowtie2 (Langmead and Salzberg 2012) (∼4.3 million paired reads mapped uniquely) to obtain the final genome sequence, which was submitted to NCBI GenBank (accession number CP007541). The finished genome was annotated using the Joint Genome Institute IMG ER pipeline (Markowitz et al. 2009) and the NCBI Prokaryotic Genome Annotation Pipeline.
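The mapping figures above imply a very deep average coverage of the closed chromosome. The back-of-the-envelope sketch below assumes that "∼4.3 million paired reads" means read pairs and that every read is the full 100 bp; because trimming shortens reads, the result is an upper bound rather than a value reported by the authors.

```python
GENOME_BP = 656_901            # closed CLEAA chromosome (from the article)
MAPPED_READ_PAIRS = 4_300_000  # "~4.3 million paired reads mapped uniquely"
READ_LENGTH_BP = 100           # untrimmed length; trimmed reads are shorter


def mean_coverage(read_pairs: int, read_length_bp: int, genome_bp: int) -> float:
    """Average per-base depth if every mapped read has the given length."""
    if genome_bp <= 0:
        raise ValueError("genome_bp must be positive")
    return read_pairs * 2 * read_length_bp / genome_bp


upper_bound = mean_coverage(MAPPED_READ_PAIRS, READ_LENGTH_BP, GENOME_BP)
print(f"mean coverage (upper bound): {upper_bound:.0f}x")  # 1309x
```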
To identify metabolic pathways, proteins from CLEAA (537) and C. burnetii RSA 493 (1,818) were annotated with metabolic information from the Kyoto Encyclopedia of Genes and Genomes (KEGG) using blastKOALA (Kanehisa et al. 2014). Proteins with existing KEGG pathway, module, or functional hierarchy (BRITE) annotations were binned into 18 general metabolic categories based on their annotation. Nonhypothetical proteins that had no significant matches in any KEGG database were searched against the Conserved Domain Database at NCBI and subsequently manually binned. Orthologs of CLEAA genes in Wigglesworthia glossinidia (NC_004344.2), Candidatus Riesia pediculicola (NC_014109.1), and Wolbachia wCle (AP013028.1) were identified by reciprocal blast, and proteins were binned into metabolic groups using the RAST server (Aziz et al. 2008) (supplementary fig. S3, Supplementary Material online).
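Ortholog identification by reciprocal BLAST is commonly implemented as a reciprocal-best-hit test: two proteins are paired only if each is the other's top-scoring hit. The sketch below illustrates that logic on pre-parsed hits; the tuple layout, function names, and protein identifiers are hypothetical, and the authors' exact thresholds and tie-breaking rules are not stated in the text.

```python
from typing import Dict, Iterable, List, Tuple

Hit = Tuple[str, str, float]  # (query_id, subject_id, bit_score)


def best_hits(hits: Iterable[Hit]) -> Dict[str, str]:
    """Map each query to its highest-scoring subject (first wins on ties)."""
    best: Dict[str, Tuple[str, float]] = {}
    for query, subject, score in hits:
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(a_vs_b: Iterable[Hit], b_vs_a: Iterable[Hit]) -> List[Tuple[str, str]]:
    """Return (a, b) pairs in which a's best hit is b and b's best hit is a."""
    forward = best_hits(a_vs_b)
    reverse = best_hits(b_vs_a)
    return sorted((a, b) for a, b in forward.items() if reverse.get(b) == a)


# Hypothetical identifiers and scores, for illustration only.
cleaa_vs_other = [("cleaa_1", "other_7", 410.0), ("cleaa_1", "other_9", 120.0),
                  ("cleaa_2", "other_9", 95.0)]
other_vs_cleaa = [("other_7", "cleaa_1", 405.0), ("other_9", "cleaa_1", 118.0)]

orthologs = reciprocal_best_hits(cleaa_vs_other, other_vs_cleaa)
print(orthologs)  # [('cleaa_1', 'other_7')]
```

In the toy data, `cleaa_2` is excluded because its best hit, `other_9`, points back to a different CLEAA protein.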
Phylogenetic Analyses
We included 33 fully sequenced genomes to estimate a robust phylogenetic framework for placing CLEAA, a presumed member of the family Coxiellaceae (Gammaproteobacteria: Legionellales). Taxa were sampled based on a previous phylogeny of Gammaproteobacteria (Williams et al. 2010), including representatives from major groups at the base of the tree, wherein Legionellales was positioned: Xanthomonadales (Xanthomonas axonopodis pv. citri str. 306); Cardiobacteriales (Cardiobacterium hominis ATCC 15826, Dichelobacter nodosus VCS1703A); Chromatiales (Alkalilimnicola ehrlichii MLHE-1, Nitrosococcus oceani ATCC 19707); Methylococcales (Methylococcus capsulatus str. Bath); Oceanospirillales (Marinomonas sp. MWYL1); Aeromonadales (Aeromonas salmonicida subsp. salmonicida A449); and Thiotrichales (Francisella tularensis subsp. tularensis SCHU S4). Four additional species were included from derived gammaproteobacterial lineages: Alteromonadales (Shewanella oneidensis str. MR-1); Vibrionales (Vibrio cholerae O1 biovar El Tor str. N16961); Pasteurellales (Pasteurella multocida subsp. multocida str. Pm70); and Enterobacteriales (Escherichia coli str. K-12 substr. MG1655). To ensure robust sampling within Legionellales, both of its families, Legionellaceae and Coxiellaceae, were sampled. For Legionellaceae, Legionella longbeachae str. NSW150 and six strains of L. pneumophila (2300/99 Alcoy, Corby, Lens, Paris, 130b, and ATCC 43290) were included. For Coxiellaceae, Rickettsiella grylli, Diplorickettsia massiliensis str. 20B, and eleven strains of C. burnetii (cb109, RSA_331, CbuK Q154, Z3055, Cb175 Guyana, RSA 493, Dugway 5J108-111, CbuG Q212, MSU Goat Q177, Q321, and Cb185) were included.
Using FastOrtho, an in-house modified version of OrthoMCL (Li et al. 2003), we generated orthologous groups of 161 proteins (supplementary table S3, Supplementary Material online) and processed them for phylogeny estimation as previously described (Driscoll et al. 2013). For Bayesian inference, two independent Markov chains were run in parallel using PhyloBayes MPI v.1.2e (Lartillot et al. 2013) under the CAT-GTR (general time reversible) model, with the bipartition frequencies analyzed at various time points using the bpcomp program. For tree-building, appropriate burn-in values were determined by plotting the log-likelihoods for each chain over sampled generations (time). Analyses were considered complete when the maximum difference in bipartition frequencies between the two chains was less than 0.1. Ultimately, a burn-in value of 1,000, with sampling every two trees, was used to build a consensus tree. Maximum-likelihood trees (Whelan and Goldman [WAG] and Le and Gascuel [LG] substitution models) were also generated using RAxML (Stamatakis et al. 2008) to confirm the Bayesian tree.
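The convergence criterion can be illustrated with a small sketch: discard the burn-in, subsample each chain, compute how often each bipartition appears, and take the largest between-chain difference. This is a simplified stand-in for what bpcomp reports, not its implementation; each tree is represented here as a set of bipartitions (one side of each split as a frozenset of taxon names), and all names and toy data are hypothetical.

```python
from collections import Counter
from typing import Dict, FrozenSet, List, Set

Bipartition = FrozenSet[str]
Tree = Set[Bipartition]


def bipartition_frequencies(trees: List[Tree], burn_in: int, every: int) -> Dict[Bipartition, float]:
    """Frequency of each bipartition after burn-in removal and subsampling."""
    kept = trees[burn_in::every]
    if not kept:
        raise ValueError("no trees left after burn-in")
    counts = Counter(bp for tree in kept for bp in tree)
    return {bp: n / len(kept) for bp, n in counts.items()}


def max_diff(freq_a: Dict[Bipartition, float], freq_b: Dict[Bipartition, float]) -> float:
    """Largest absolute difference in bipartition frequency between two chains."""
    union = set(freq_a) | set(freq_b)
    return max((abs(freq_a.get(bp, 0.0) - freq_b.get(bp, 0.0)) for bp in union), default=0.0)


# Toy chains over three taxa; AB and AC are competing groupings.
AB, AC = frozenset({"A", "B"}), frozenset({"A", "C"})
chain_a = [{AC}, {AB}, {AB}, {AB}, {AC}, {AB}]
chain_b = [{AC}, {AC}, {AB}, {AB}, {AB}, {AC}]

diff = max_diff(bipartition_frequencies(chain_a, burn_in=2, every=2),
                bipartition_frequencies(chain_b, burn_in=2, every=2))
converged = diff < 0.1  # threshold stated in the text
print(diff, converged)  # 0.5 False
```

With the settings described above, the equivalent call would use `burn_in=1000` and `every=2` on the sampled trees of each chain.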
To better evaluate the phylogenetic position of CLEAA within Coxiellaceae, a phylogeny was estimated based on 16S rDNA sequences. The CLEAA 16S rDNA sequence was used in a BLASTN search against the NCBI nr database, with 160 sequences retrieved having 91% identity or greater. Six outgroup sequences from species of Legionella, Rickettsiella, and Diplorickettsia were also included. All 167 16S rDNA sequences were aligned using MUSCLE v3.6 (Edgar 2004) with default parameters. Ambiguously aligned positions, the majority being present within the variable regions of the small subunit rRNA structure, were culled using Gblocks (Castresana 2000; Talavera and Castresana 2007). Phylogenies of both the unmasked and masked alignments were estimated under maximum likelihood using RAxML (Stamatakis et al. 2008). The GTR substitution model was used with estimation of GAMMA and the proportion of invariable sites. Branch support was measured with bootstrapping (1,000 replications). A Bayesian tree was also generated using PhyloBayes MPI v.1.2e (Lartillot et al. 2013) as described above.
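The identity cutoff applied to the BLASTN results can be sketched as a filter over tabular output. The example assumes the standard 12-column BLAST tabular layout, in which the second field is the subject identifier and the third is percent identity; the article does not state which output format was used, and the sequence identifiers below are hypothetical.

```python
from typing import Iterable, List


def filter_by_identity(tabular_lines: Iterable[str], min_identity: float = 91.0) -> List[str]:
    """Return unique subject IDs whose percent identity meets the cutoff."""
    kept: List[str] = []
    for line in tabular_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment headers
        fields = line.split("\t")
        subject_id, percent_identity = fields[1], float(fields[2])
        if percent_identity >= min_identity and subject_id not in kept:
            kept.append(subject_id)
    return kept


# Hypothetical hits: query, subject, % identity, then the remaining columns.
toy_hits = [
    "cleaa_16S\tseq_A\t97.20\t1400\t39\t0\t1\t1400\t1\t1400\t0.0\t2300",
    "cleaa_16S\tseq_B\t91.00\t1390\t125\t0\t1\t1390\t5\t1394\t0.0\t1900",
    "cleaa_16S\tseq_C\t88.50\t1380\t158\t1\t1\t1380\t3\t1382\t0.0\t1700",
]
selected = filter_by_identity(toy_hits)
print(selected)  # ['seq_A', 'seq_B']
```

The cutoff is inclusive, matching the "91% identity or greater" wording in the text.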
Acknowledgments
The authors thank Zakee Sabree for assistance with 16S rDNA sequencing, Daniel Sloan for helpful discussions, and Christine Sislak for tick DNA extraction. This work was supported in part by Portland State University and by grants from Medical Research Foundation of Oregon and American Heart Institution to R.R. J.J.G. acknowledges support from National Institutes of Health/National Institute of Allergy and Infectious Diseases grants R01AI017828 and R01AI043006 (Abdu F. Azad, University of Maryland Baltimore, School of Medicine).
Author notes
Associate editor: Daniel Sloan
Data deposition: This project has been deposited at NCBI GenBank under the accession CP007541.





