Abstract

Chromatin immunoprecipitation identified 191 binding sites of Mycobacterium tuberculosis cAMP receptor protein (CRPMt) at endogenous expression levels using a specific α-CRPMt antibody. Under these native conditions an equal distribution between intragenic and intergenic locations was observed. CRPMt binding overlapped a palindromic consensus sequence. Analysis by RNA sequencing revealed widespread changes in transcriptional profile in a mutant strain lacking CRPMt during exponential growth, and in response to nutrient starvation. Differential expression of genes with a CRPMt-binding site represented only a minor portion of this transcriptional reprogramming with ∼19% of those representing transcriptional regulators potentially controlled by CRPMt. The subset of genes that are differentially expressed in the deletion mutant under both culture conditions conformed to a pattern resembling canonical CRP regulation in Escherichia coli, with binding close to the transcriptional start site associated with repression and upstream binding with activation. CRPMt can function as a classical transcription factor in M. tuberculosis, though this occurs at only a subset of CRPMt-binding sites.

INTRODUCTION

The success of Mycobacterium tuberculosis as one of the most deadly human pathogens depends on its ability to adapt to diverse intracellular and extracellular environments encountered during infection, persistence and transmission [reviewed in (1)]. This is mediated in part by an extensive repertoire of transcriptional regulators, including alternative sigma factors, two-component signal transduction proteins and serine–threonine protein kinase sensors (2). Defining the scope of individual regulators and their participation in integrated regulatory networks generates insights into the in vivo physiology of M. tuberculosis that will assist in the selection of optimal treatment strategies (3). A combination of chromatin immunoprecipitation (ChIP) with sequence-based transcriptional profiling provides a powerful approach to this goal. Whilst some mycobacterial transcription factors display a restricted profile of binding to a limited set of regulated promoters (4–6), a recent study of EspR revealed a much broader profile resembling that of nucleoid-associated proteins (NAPs) involved in structural organization of the chromosome (7,8). High-throughput ChIP experiments based on overexpressed transcription factors in M. tuberculosis systematically detect a wide repertoire of low affinity binding sites, however, suggesting that there may be no clear-cut distinction between proteins with localized and generalized binding profiles (9). In the present study, we set out to define the genomic binding profile of cAMP (cyclic adenosine monophosphate) receptor protein (CRPMt) of M. tuberculosis with a particular emphasis on the localization of CRPMt-binding sites relative to transcription start sites (TSSs).

CRP is one of the best-characterized transcription factors in the model organism, Escherichia coli, with a role in regulation of around 200 transcriptional units. CRP binds to a palindromic sequence (TGTGAN6TCACA) in the promoters of target genes and, depending on the distance between the binding site and the transcriptional start site, enhances or restricts recruitment of ribonucleic acid (RNA) polymerase [reviewed in (10,11)]. In addition to binding to target promoters, ChIP-chip analysis of E. coli CRP uncovered an extensive background pattern of low affinity sites suggesting that CRP may have an additional role as a more general chromosomal organizer (12). A key feature of E. coli CRP is that its affinity for deoxyribonucleic acid (DNA) is strongly dependent on binding of the cAMP ligand, allowing it to play a central role in the global coordination of transcriptional reprogramming required for optimal utilization of different carbon substrates.

The corresponding M. tuberculosis CRPMt, encoded by Rv3676, recognizes a similar binding motif and has been shown to regulate transcription of several promoters—for example, the serC promoter (13), the rpfA promoter (14), the whiB1 promoter (15,16) and the frdA promoter (17). Computational approaches and in vitro DNA-binding studies suggest that M. tuberculosis CRPMt has multiple targets and is likely to share the global profile of the E. coli homologue (18–20). A significant difference between the two proteins is that cAMP binds to M. tuberculosis CRPMt with weak affinity and has less effect on its binding to DNA [(21), for review see (22)]. This may reflect differences in the abundance and regulation of cAMP in mycobacteria (23,24). In contrast to the single enzyme in E. coli, M. tuberculosis has 17 genes encoding adenylate cyclases (25,26), and the dynamics of cAMP synthesis and secretion are proposed to play an important role during infection [(27), for review see (23)]. Consistent with a role in pathogenesis, deletion of the crp gene results in significant impairment of in vitro growth of M. tuberculosis and attenuates virulence in a mouse model (14). We anticipated that comprehensive mapping of the binding profile of M. tuberculosis CRPMt would assist in characterization of this global regulator and contribute more broadly to a fundamental understanding of gene regulation in mycobacteria.

MATERIALS AND METHODS

Bacterial strains, plasmids and growth conditions

The strains used were E. coli strain DH5α, for all plasmid construction, and M. tuberculosis strains H37Rv (wild type) and M. tuberculosis Δcrp, an H37Rv mutant in which Rv3676 has been deleted (14). E. coli cultures were grown in Luria-Bertani broth and agar (15 g/l). Where needed, ampicillin and kanamycin were used at final concentrations of 100 and 50 μg/ml, respectively. M. tuberculosis cultures were grown in Dubos broth containing 0.05% (v/v) Tween, Middlebrook 7H9 broth or Middlebrook 7H11 agar and supplemented with 0.5% (v/v) glycerol and 4% albumin. To monitor the response to nutrient starvation (28), M. tuberculosis was grown to mid-exponential phase (OD600 0.6), the cells pelleted and washed twice in phosphate buffered saline (PBS) supplemented with 0.025% tyloxapol. Cells were then resuspended in 100 ml PBS plus tyloxapol and incubated without shaking at 37 °C.

Chromatin immunoprecipitation

ChIP was performed as previously described (29) with some modifications to the protocol. Rv3676 protein tagged with hexa-His at the N-terminus was purified in E. coli as described previously (30) and used to produce CRPMt-specific polyclonal antibodies in rabbits. The primary dose (300-μg protein) was administered subcutaneously in Freund's complete adjuvant, followed by two booster injections in Freund's incomplete adjuvant after 14 and 30 days. Sera were prepared and then stored at −20°C until required. Immunoglobulin G (IgG) was purified from the serum using T-Gel purification kit (Pierce) as per manufacturer's instructions. Purified IgG was used for ChIP-seq analysis. Rv3597c antibody was raised by Cambridge Research Biochemicals, using a peptide antigen.

M. tuberculosis H37Rv (wild type) and M. tuberculosis Δcrp cells were grown in roller bottles to mid-exponential phase (OD600 0.6) and formaldehyde was added to a final concentration of 1%. After 10 min of incubation, glycine was added to a final concentration of 0.5 M to quench the reaction and incubated for a further 5 min. Cross-linked cells were harvested by centrifugation and washed twice with ice-cold Tris-buffered saline (TBS, pH 7.5). Cell pellets were resuspended in 4-ml immunoprecipitation (IP) buffer [50-mM HEPES-KOH (pH 7.5), 150-mM NaCl, 1-mM ethylenediaminetetraacetic acid (EDTA), 1% Triton X-100, 0.1% (w/v) sodium deoxycholate, 0.1% sodium dodecyl sulphate (SDS), 0.1-mg/ml RNase A and one protease inhibitor cocktail tablet (Roche)]. Cells were lysed and the DNA sheared to an average size of ∼300 base pairs (bp) using a Bioruptor (Diagenode) with 40 cycles of 30-s on/off at high setting. Insoluble matter was removed by centrifugation for 10 min at 4°C, and the supernatant was split into two 1.8-ml aliquots. The remaining ∼400 μl was kept to check the size of the DNA fragments and for sequencing to remove any sequencing bias (input).

Each 1.8-ml aliquot was incubated with 20-μl Protein A/G UltraLink Resin (Pierce) on a rotary shaker for 45 min at room temperature to eliminate complexes bound to the resin non-specifically. The supernatant was then removed and incubated with either no antibody (mock-IP), specific α-CRPMt antibody raised against purified CRPMt (Rv3676) protein (Supplementary Figure S1) and 50-μl Protein A/G UltraLink Resin, pre-incubated with 1-mg/ml bovine serum albumin in TBS, on a rotary shaker at 4°C overnight. Samples were washed once with IP buffer, twice with IP buffer containing 500-mM NaCl, once with wash buffer [10-mM Tris (pH 8.0), 250-mM LiCl, 1-mM EDTA, 0.5% Tergitol (Sigma) and 0.5% sodium deoxycholate] and once with TE (pH 7.5). Immunoprecipitated complexes were eluted from the resin in 100-μl elution buffer [10-mM Tris (pH 7.5), 10-mM EDTA and 1% SDS] at 65°C for 30 min. Immunoprecipitated samples and the input DNA were de-cross-linked in 0.5x elution buffer containing 0.8-mg/ml Pronase at 42°C for 2 h followed by 65°C for 6 h. DNA was purified using a polymerase chain reaction (PCR) purification kit (QIAGEN). Prior to library preparation and sequencing, the DNA fragment sizes were checked by agarose gel electrophoresis and gene-specific quantitative PCR was carried out.

RNA isolation

RNA was extracted from exponentially growing wild-type and Δcrp M. tuberculosis H37Rv in Middlebrook 7H9 media, as previously described (3). Briefly, 25 ml of mid-log phase cultures were pelleted at 4°C, and RNA was isolated using the FastRNA Pro Blue kit (MP Bio), according to the manufacturer's guidelines. Following extraction, RNA was treated with Turbo DNase (Ambion) to degrade all DNA present, and the quality and integrity were assessed using a Nanodrop (ND-1000, Labtech) and Agilent bioanalyzer.

Library construction and Illumina sequencing

For ChIP-seq, prior and post library construction, the concentration of the immunoprecipitated DNA samples was measured using the Qubit HS DNA kit (Invitrogen). Library construction and sequencing were performed using the ChIP-seq Sample Prep kit, Reagent Preparation kit and Cluster Station kit (Illumina). Samples were sequenced on an Illumina Genome Analyzer IIx (GAIIx) instrument and loaded at a concentration of 10 pM. For RNA sequencing (RNA-seq), strand-specific cDNA libraries were constructed using the Illumina Small RNA Sample preparation kit with the v1.5 small RNA (sRNA) 3′ Adaptor and sequenced on a GAIIx (Illumina).

Data analysis

Sequence reads were aligned to the reference sequence of M. tuberculosis H37Rv (EMBL accession number AL123456) as single-end data using the Burrows-Wheeler aligner (BWA) (31). The genome coverage was calculated using Samtools (32) and visualized in the Artemis genome browser. For ChIP-seq, peaks were called using a combination of CisGenome (33) and the BayesPeak package in R/Bioconductor (34). For RNA-seq, differential gene expression was analysed using the DESeq package in R/Bioconductor (35). For evaluating the location of binding sites as potential transcriptional activators or repressors, a region between −200 and +20 bp with respect to the transcriptional start site was considered. For functional enrichment analysis, the 87 genes with CRP-binding sites up to 200 bp upstream from the annotated TSS (36) were analysed. All functional categories were assigned using Tuberculist annotations. GraphPad Prism v6.04 was used to compare the frequencies of different functional categories in respect to the H37Rv genome distribution using two-tailed Chi-square tests.

Quantitative real-time PCR

Quantitative real-time PCR (qRT-PCR) was used to determine whether the ChIP experiment had worked prior to library construction and to validate both the ChIP-seq and RNA-seq data. To measure the enrichment of TF-binding targets in the immunoprecipitated DNA samples, 1 µl of IP or mock-IP DNA was used with Quantitect SYBR Green (QIAGEN) together with specific primers to the upstream region of Rv3616c, known to be bound by CRPMt. To validate the RNA-seq data, specific primers were used to quantify the messenger RNA targets showing up- or downregulation, and control targets showing no differential expression. RNA was extracted as described above, and 30 ng total RNA from wild-type and Δcrp cells was used with the Express One-Step SYBR GreenER kit (Invitrogen) according to the manufacturer's guidelines. All primer sequences and qRT-PCR results are in Supplementary Tables S1 and S2.

cAMP measurement

Samples for cAMP measurement during the starvation response were taken at time points 0 h, 24 h, 48 h and 96 h. At each time point, 2 ml of culture was spun down, resuspended in 0.1-M HCl and boiled for 10 min. The samples were lysed in the presence of glass beads (150–212 μm; Sigma) using a Ribolyser (Hybaid) at a speed setting of 6.0 for 40 s. The supernatant was collected by centrifugation and stored at −20°C until the assay was performed. Levels of cAMP in the cells were measured using the Direct cAMP Enzyme Immunoassay kit (Sigma), following the acetylated version according to the manufacturer's guidelines, and normalized to the total amount of protein in the samples (measured using a Nanodrop ND-1000).

RESULTS

Genomic mapping of the CRPMt-binding profile

The aim of this work was to investigate CRPMt binding to the M. tuberculosis chromosome by ChIP combined with high-throughput sequencing (ChIP-seq) and to integrate these data with transcriptional profiling by RNA-seq. ChIP-seq was done using a specific antibody against CRPMt, thus enabling us to study the binding under native conditions without the need to tag and overexpress the protein. Under these native conditions, CRPMt is expressed at high levels in the cell; based on published quantitative mass spectrometry and electron microscopy (36,37) and western blot analysis we estimate the number of CRPMt molecules to be approximately stoichiometric to the number of ribosomes per cell (∼3500).

We were able to map 98% of the sequences uniquely to the H37Rv genome (allowing for up to two mismatches per read) and achieved a near-complete representation of the entire genome (98% of the genome was mapped). The remaining 2% of the genome includes PE/PPE genes, which contain highly repetitive sequences that are poorly resolved by short-read sequencing. To visualize the genome coverage, the number of reads mapping to each position on the M. tuberculosis chromosome was calculated and the traces visualized in the Artemis genome browser. Peaks were called using the CisGenome software (33) to identify enriched regions in the CRPMt-IP compared to the mock-IP (performed in the absence of antibody) and input (sheared genomic DNA). To validate the results, the data were also analysed using the BayesPeak package in R/Bioconductor, and peaks called in only one of the two methods were discarded. As seen in Figure 1A, there was no significant enrichment of any regions of the M. tuberculosis chromosome in the mock-IP (or the input; data not shown) indicating negligible non-specific binding to the resin or the antibody. In the CRPMt-IP, however, 191 peaks were found, denoting CRPMt-binding sites (here abbreviated to CBSs) on the chromosome (Figure 1A). The average length of the CBSs was 276 bp, and in total this represents 0.7% of the entire M. tuberculosis genome. No differences were observed in CRPMt-DNA binding between cells grown in Dubos medium and cells grown in Middlebrook 7H9 medium (data not shown). No ChIP-seq signals were detected for the crp-deletion strain.

ChIP-seq mapping of CRPMt-binding sites. (A) Sequence reads from ChIP are mapped onto the M. tuberculosis H37Rv genome and displayed using the Artemis browser. No peak enrichment was observed in a mock-IP sample (red trace). One hundred and ninety one peaks were identified in the IP sample using anti-CRPMt antibody (black trace). (B) A consensus motif resembling that defined for E. coli CRP was identified within 50 bp of the bound centre for 97% of the 191 of ChIP-seq peaks. (C) A clear correlation was observed between peak enrichment and match to the consensus motif. (D) Representative Artemis profiles of CBSs. CRPMt binding (i) to the divergent promoter region between Rv0078A and Rv0079; (ii) in the intragenic region within Rv1592c; and (iii) overlapping the sRNA ncRv13943. Red boxes denote the CBS and blue boxes denote the TSS.
Figure 1.

ChIP-seq mapping of CRPMt-binding sites. (A) Sequence reads from ChIP are mapped onto the M. tuberculosis H37Rv genome and displayed using the Artemis browser. No peak enrichment was observed in a mock-IP sample (red trace). One hundred and ninety one peaks were identified in the IP sample using anti-CRPMt antibody (black trace). (B) A consensus motif resembling that defined for E. coli CRP was identified within 50 bp of the bound centre for 97% of the 191 of ChIP-seq peaks. (C) A clear correlation was observed between peak enrichment and match to the consensus motif. (D) Representative Artemis profiles of CBSs. CRPMt binding (i) to the divergent promoter region between Rv0078A and Rv0079; (ii) in the intragenic region within Rv1592c; and (iii) overlapping the sRNA ncRv13943. Red boxes denote the CBS and blue boxes denote the TSS.

De novo motif discovery by MEME-ChIP (38), using 50 bp upstream and downstream of the centre of each peak, identified a consensus motif present in 97% of the 191 binding sites that is similar to motifs previously predicted for M. tuberculosis CRPMt and experimentally defined for E. coli CRP (Figure 1B). The midpoint of each ChIP peak was compared to the centre of the CRPMt-binding site, as predicted from the consensus sequence, and the average difference was found to be 6 bp, with a correlation coefficient of 0.995 between the two data sets. At 14 of the sites, we identified more than one copy of the consensus motif. At three of the sites we were unable to identify a match to the consensus motif. The enrichment of the DNA fragments in the CRPMt-IP compared to the mock-IP was inversely proportional to the number of mismatches found at each site (Figure 1C). The top 14 sites, with an enrichment factor (maxT) of 10 or more, are shown in Table 1 and representative Artemis profiles are illustrated in Figure 1D (for details of all sites, see Supplementary Table S3).

ChIP peaks with highest fold enrichment

Table 1.
ChIP peaks with highest fold enrichment
maxTaBound centreCRP-binding site (CBS)bChIP to CBS (bp)cConsensus mismatchGeneNameCBS to TSS (bp)dLocation of CBS
17.1122322122323.51.50Rv0104Rv0104+6.5Inside 5′ end of gene
14.130150723015072.50.50Rv2699cRv2699c−62.5between divergent genes
Rv2700Rv2700−130.5
13.71347413470.53.50Rv0010cRv0010c217.5Internal to gene
13.110617991061798.50.51Rv0950cRv0950c−69.5between divergent genes
1061835.536.52Rv0951sucC−95.5
Rv0950cRv0950c−106.5
Rv0951sucC−58.5
13.041314524131461.59.51Rv3689Rv3689>500Internal to gene
12.425181642518166.52.51Rv2245kasA>500Internal to gene
12.017574041757404.50.51Rv1552frdA+4.5Upstream of operon
11.740574244057425.51.51Rv3616cespA−983.5between divergent genes
Rv3617ephA−55.5
11.6571602571601.50.51Rv0482murB>500Internal to gene
571517.584.53Rv0483lprQ−150.5Upstream of gene
Rv0482murB>500Internal to gene
Rv0483lprQ−234.5Upstream of gene
11.027526692752680.511.52Rv2451Rv2451>500between convergent genes
2752616.552.52Rv2451Rv2451>500Internal to gene
2752593.577.52Rv2451Rv2451>500Internal to gene
10.929215192921520.51.51Rv2591PE_PGRS44+16.5Upstream of gene
10.729113892911389.50.51Rv2584capt−57.5Upstream of gene
Rv2585cRv2585c>500Internal to gene
10.314874991487299.55.51Rv1324Rv1324+138.5Internal to gene
1487370.576.53Rv1324Rv1324+209.5Internal to gene
1487064.5229.51Rv1324Rv1324−96.5Upstream of gene
10.043299424329946.54.51MT3972.1MT3972.1+9.5Upstream of gene
maxTaBound centreCRP-binding site (CBS)bChIP to CBS (bp)cConsensus mismatchGeneNameCBS to TSS (bp)dLocation of CBS
17.1122322122323.51.50Rv0104Rv0104+6.5Inside 5′ end of gene
14.130150723015072.50.50Rv2699cRv2699c−62.5between divergent genes
Rv2700Rv2700−130.5
13.71347413470.53.50Rv0010cRv0010c217.5Internal to gene
13.110617991061798.50.51Rv0950cRv0950c−69.5between divergent genes
1061835.536.52Rv0951sucC−95.5
Rv0950cRv0950c−106.5
Rv0951sucC−58.5
13.041314524131461.59.51Rv3689Rv3689>500Internal to gene
12.425181642518166.52.51Rv2245kasA>500Internal to gene
12.017574041757404.50.51Rv1552frdA+4.5Upstream of operon
11.740574244057425.51.51Rv3616cespA−983.5between divergent genes
Rv3617ephA−55.5
11.6571602571601.50.51Rv0482murB>500Internal to gene
571517.584.53Rv0483lprQ−150.5Upstream of gene
Rv0482murB>500Internal to gene
Rv0483lprQ−234.5Upstream of gene
11.027526692752680.511.52Rv2451Rv2451>500between convergent genes
2752616.552.52Rv2451Rv2451>500Internal to gene
2752593.577.52Rv2451Rv2451>500Internal to gene
10.929215192921520.51.51Rv2591PE_PGRS44+16.5Upstream of gene
10.729113892911389.50.51Rv2584capt−57.5Upstream of gene
Rv2585cRv2585c>500Internal to gene
10.314874991487299.55.51Rv1324Rv1324+138.5Internal to gene
1487370.576.53Rv1324Rv1324+209.5Internal to gene
1487064.5229.51Rv1324Rv1324−96.5Upstream of gene
10.043299424329946.54.51MT3972.1MT3972.1+9.5Upstream of gene

aEnrichment factor (maxT) of the peaks between the CRPMt-IP and the mock-IP as calculated by the CisGenome software.

bCentre of the CRPMt-binding site based on the consensus sequence (Figure 1B).

cDistance of the centre of the ChIP peak (bound centre) to the centre of the CRPMt-binding site.

dDistance of the centre of the CRPMt-binding site to the TSS, as defined by (36) and documented in Supplementary Table S3.

Table 1.
ChIP peaks with highest fold enrichment
maxTaBound centreCRP-binding site (CBS)bChIP to CBS (bp)cConsensus mismatchGeneNameCBS to TSS (bp)dLocation of CBS
17.1122322122323.51.50Rv0104Rv0104+6.5Inside 5′ end of gene
14.130150723015072.50.50Rv2699cRv2699c−62.5between divergent genes
Rv2700Rv2700−130.5
13.71347413470.53.50Rv0010cRv0010c217.5Internal to gene
13.110617991061798.50.51Rv0950cRv0950c−69.5between divergent genes
1061835.536.52Rv0951sucC−95.5
Rv0950cRv0950c−106.5
Rv0951sucC−58.5
13.041314524131461.59.51Rv3689Rv3689>500Internal to gene
12.425181642518166.52.51Rv2245kasA>500Internal to gene
12.017574041757404.50.51Rv1552frdA+4.5Upstream of operon
11.740574244057425.51.51Rv3616cespA−983.5between divergent genes
Rv3617ephA−55.5
11.6571602571601.50.51Rv0482murB>500Internal to gene
571517.584.53Rv0483lprQ−150.5Upstream of gene
Rv0482murB>500Internal to gene
Rv0483lprQ−234.5Upstream of gene
11.027526692752680.511.52Rv2451Rv2451>500between convergent genes
2752616.552.52Rv2451Rv2451>500Internal to gene
2752593.577.52Rv2451Rv2451>500Internal to gene
10.929215192921520.51.51Rv2591PE_PGRS44+16.5Upstream of gene
10.729113892911389.50.51Rv2584capt−57.5Upstream of gene
Rv2585cRv2585c>500Internal to gene
10.314874991487299.55.51Rv1324Rv1324+138.5Internal to gene
1487370.576.53Rv1324Rv1324+209.5Internal to gene
1487064.5229.51Rv1324Rv1324−96.5Upstream of gene
10.043299424329946.54.51MT3972.1MT3972.1+9.5Upstream of gene
maxTaBound centreCRP-binding site (CBS)bChIP to CBS (bp)cConsensus mismatchGeneNameCBS to TSS (bp)dLocation of CBS
17.1122322122323.51.50Rv0104Rv0104+6.5Inside 5′ end of gene
14.130150723015072.50.50Rv2699cRv2699c−62.5between divergent genes
Rv2700Rv2700−130.5
13.71347413470.53.50Rv0010cRv0010c217.5Internal to gene
13.110617991061798.50.51Rv0950cRv0950c−69.5between divergent genes
1061835.536.52Rv0951sucC−95.5
Rv0950cRv0950c−106.5
Rv0951sucC−58.5
13.041314524131461.59.51Rv3689Rv3689>500Internal to gene
12.425181642518166.52.51Rv2245kasA>500Internal to gene
12.017574041757404.50.51Rv1552frdA+4.5Upstream of operon
11.740574244057425.51.51Rv3616cespA−983.5between divergent genes
Rv3617ephA−55.5
11.6571602571601.50.51Rv0482murB>500Internal to gene
571517.584.53Rv0483lprQ−150.5Upstream of gene
Rv0482murB>500Internal to gene
Rv0483lprQ−234.5Upstream of gene
11.027526692752680.511.52Rv2451Rv2451>500between convergent genes
2752616.552.52Rv2451Rv2451>500Internal to gene
2752593.577.52Rv2451Rv2451>500Internal to gene
10.929215192921520.51.51Rv2591PE_PGRS44+16.5Upstream of gene
10.729113892911389.50.51Rv2584capt−57.5Upstream of gene
Rv2585cRv2585c>500Internal to gene
10.314874991487299.55.51Rv1324Rv1324+138.5Internal to gene
1487370.576.53Rv1324Rv1324+209.5Internal to gene
1487064.5229.51Rv1324Rv1324−96.5Upstream of gene
10.043299424329946.54.51MT3972.1MT3972.1+9.5Upstream of gene

aEnrichment factor (maxT) of the peaks between the CRPMt-IP and the mock-IP as calculated by the CisGenome software.

bCentre of the CRPMt-binding site based on the consensus sequence (Figure 1B).

cDistance of the centre of the ChIP peak (bound centre) to the centre of the CRPMt-binding site.

dDistance of the centre of the CRPMt-binding site to the TSS, as defined by (36) and documented in Supplementary Table S3.

Location of CBSs

Sixty nine of the CRPMt-binding sites mapped uniquely to a location within a protein coding gene or stable RNA, with a possible role in long-distance regulation and/or chromosome organization. Of the remaining sites, 86 CBSs mapped uniquely to intergenic loci corresponding to potential promoter regions, whilst 35 CBSs were located within a protein coding sequence in a region that could also serve as the potential promoter of a downstream gene. This represents a significant enrichment of intergenic regions over that predicted by chance, considering ∼90% of the entire M. tuberculosis genome is intragenic. The distribution between intragenic and intergenic locations remained approximately equal irrespective of the fold-enrichment used as cutoff, indicating that CRPMt binds with similar affinity to both types of site. In 32 cases, the CRPMt-binding site was located between divergently transcribed gene pairs; this is proportional to the genome average of 16% of all genes with divergent orientation in M. tuberculosis. CRP regulation of divergent gene pairs has also been observed in E. coli (39). In some instances, CBSs mapped to the intergenic region between convergent gene pairs, like Rv2866 and Rv2867c and Rv2451 and Rv2452c. Three CRPMt-binding sites also mapped upstream of the sRNAs, ncRv13843, ncRv11373 and ncRv13660c (40).

Canonically positioned CBSs are associated with functional categories

To define the precise location of CBSs with respect to transcriptional start sites (TSSs), we integrated the ChIP data set with a M. tuberculosis TSS map generated by sequence analysis of 5′-triphosphate-enriched RNA libraries (36). The spacing between the midpoint of each CBS motif and adjacent primary TSSs is recorded in Supplementary Table S3. Including data from peaks with multiple motifs, and CBSs mapping to more than one TSS, we measured a total of 242 CBS-TSS pairs; in 203 cases, the CBS was located within 500 bp of a TSS, 127 sites were upstream and 76 downstream, 41 of which were between the TSS and the start codon (i.e. within the 5′-Untranslated Region -UTR-) and 35 within the coding sequence. Plotting of the distribution of CBS-TSS spacing revealed clustering in the regions from −60 to −40, and from +1 to +20 (Figure 2A and B). Genes harbouring CRP-binding sites in the −200 to 0 region were analysed for functional categories. Amongst the 87 genes that contain a CRP site in a putative promoter region, genes involved in cell wall and cell processes were enriched in our data set compared to the H37Rv genome (Chi-square test, P = 0.021; Figure 2C).

CBS distribution around transcriptional start sites. (A) CRPMt sites in M. tuberculosis show a clustering in the region around TSSs (n = 203). (B) The distribution of CBS-TSS distances for M. tuberculosis CRPMt sites is compared to a similar data set from E. coli (39). (C) Genes harbouring CRPMt sites in the −220 to 0 region (n = 87) were analysed for functional categories according to Tuberculist annotations. Bar graphs represent the percentage of each functional class for CBS genes (black bars) compared to the distribution of these classes amongst all H37Rv genes (grey bars). Asterisks denote functional categories that are statistically significant after Chi-square test analyses.
Figure 2.

CBS distribution around transcriptional start sites. (A) CRPMt sites in M. tuberculosis show a clustering in the region around TSSs (n = 203). (B) The distribution of CBS-TSS distances for M. tuberculosis CRPMt sites is compared to a similar data set from E. coli (39). (C) Genes harbouring CRPMt sites in the −220 to 0 region (n = 87) were analysed for functional categories according to Tuberculist annotations. Bar graphs represent the percentage of each functional class for CBS genes (black bars) compared to the distribution of these classes amongst all H37Rv genes (grey bars). Asterisks denote functional categories that are statistically significant after Chi-square test analyses.

Several of the CBSs have also been identified as binding sites for other transcription factors, suggesting that CRPMt may act in concert with other regulators. The promoter region of Rv1057, for example, has binding sites for MprA, EspR and TrcR in addition to CRPMt (7,41) (Figure 3). Additional promoter regions with binding sites for multiple transcription factors include fadD26 (Rv2930) with an EspR-binding site (7) and espA (Rv3616c) with binding sites for EspR, MprA and CRP (7,42–45).

Transcription factor binding to the promoter region of Rv1057. (A) Artemis traces showing the binding of CRP (blue) to the AT-rich region upstream of Rv1057. TSS mapping (green) according to (36). Traces record the normalized number of mapped reads and the maximum normalized read count is indicated. (B) The promoter region of Rv1057 has several binding sites for other transcription factors suggesting that CRPMt may act in concert with other regulators. Transcription factor binding sites are shown as coloured boxes [black: LexA (5); red: TrcR (41); green: MprA (44); pink: EspR (7); turquoise: CRP). The arrow denotes the TSS (36). Genome coordinates indicate the start of LexA-binding site (1178795) (5), Rv1057 TSS (36) and translational start site (1179396).
Figure 3.

Transcription factor binding to the promoter region of Rv1057. (A) Artemis traces showing the binding of CRP (blue) to the AT-rich region upstream of Rv1057. TSS mapping (green) according to (36). Traces record the normalized number of mapped reads and the maximum normalized read count is indicated. (B) The promoter region of Rv1057 has several binding sites for other transcription factors suggesting that CRPMt may act in concert with other regulators. Transcription factor binding sites are shown as coloured boxes [black: LexA (5); red: TrcR (41); green: MprA (44); pink: EspR (7); turquoise: CRP). The arrow denotes the TSS (36). Genome coordinates indicate the start of LexA-binding site (1178795) (5), Rv1057 TSS (36) and translational start site (1179396).

Transcriptional regulation of CBS genes

Previous microarray and targeted qRT-PCR analyses have demonstrated differential expression of CBS genes following deletion of the crp gene (14). Using an RNA-seq approach to compare the transcriptional profile of wild-type and crp-deletion strains during exponential growth, we observed widespread changes in gene expression affecting more than 20% of the total transcriptome (Supplementary Table S4). Filtering based on an adjusted P-value of <0.05 identified 453 genes with >2-fold increased abundance in the knockout and 412 with >2-fold decrease. CBS genes comprised only a minor fraction of the differential expression profile, with statistically significant upregulation of 37 genes and downregulation of 15 genes. Forty eight per cent of the CBS-regulated genes corresponded to genes annotated as key metabolic enzymes or genes with predicted roles on transcription regulation that could amplify the CRPMt regulatory signal (Supplementary Table S5). Fifty percent of the differential expression profile (212 of the upregulated genes and 211 of the downregulated genes) was shared with the response to nutrient starvation (Supplementary Table S4), and is likely to reflect secondary effects associated with the marked growth defect of the crp mutant.

We anticipated that if CRPMt was acting together with other transcription factors, differential expression of CBS genes may be enhanced under alternative growth conditions. ChIP-seq analysis revealed a general decrease in CRPMt binding to DNA after incubation for 24 h in PBS (Figure 4A). Furthermore, a reduction in the amount of cAMP was observed in the nutrient starvation model (Figure 4B). There was no significant change in CRPMt protein abundance in the starvation model (36). The majority of the ChIP-seq peaks identified in exponential culture were also detected after starvation, though with a reduction in fold-enrichment and loss of 33 of the 76 peaks having an enrichment ratio of less than 5 in the exponential data set. The ChIP-seq peaks not identified after starvation were not enriched in any specific functional category and only 53% of the CBS-associated genes in exponential culture showed downregulation during starvation. Comparison of the wild-type and mutant strains under starvation conditions revealed wide-ranging differences in the overall transcript profile, with 361 genes having >2-fold higher abundance and 465 reduced abundance in the knockout (Supplementary Table S4), but again CBS genes made only a minor contribution, with 28 genes upregulated and 33 downregulated.

CBSs during nutrient starvation. (A) Enrichment of CRPMt binding to DNA during exponential and starvation phase. Box plots indicating median (horizontal line), interquartile range (box) and minimum and maximum values (whiskers) of the enrichment factor (maxT) of the 151 shared peaks between exponential growth and the starvation model (Mann–Whitney test; ** indicates significant difference between values P < 0.01). (B) cAMP concentration in the starvation model.
Figure 4.

CBSs during nutrient starvation. (A) Enrichment of CRPMt binding to DNA during exponential and starvation phase. Box plots indicating median (horizontal line), interquartile range (box) and minimum and maximum values (whiskers) of the enrichment factor (maxT) of the 151 shared peaks between exponential growth and the starvation model (Mann–Whitney test; ** indicates significant difference between values P < 0.01). (B) cAMP concentration in the starvation model.

Twenty nine CBS genes showing a concordant response in a comparison of wild-type and crp deletion strains under both culture conditions are shown in Table 2, ranked according to the distance between the CBS and the TSS. Whilst the number of differentially expressed genes is low, the results are consistent with the canonical E. coli model of CRPMt binding close to the TSS inhibiting transcription and upstream binding enhancing transcription. There was no obvious pattern of up- or downregulation associated with CRPMt binding at sites distant from the TSS.

Differential expression of CBS genes during exponential growth and nutrient starvation

Table 2.
Differential expression of CBS genes during exponential growth and nutrient starvation
ExponentialStarved
geneCRP to TSSlog2 Fold ChangeP adjlog2 Fold ChangeP adj
Rv0046cino1500−1.220.000−1.250.003
Rv0169mce1A5000.840.0052.360.000
Rv0469umaA500−0.710.020−0.990.043
Rv1660pks105000.870.0361.020.044
Rv2145cwag31500−0.630.035−1.350.002
Rv2189cRv2189c5001.670.0004.010.000
Rv2200cctaC500−2.120.000−3.080.000
Rv3629cRv3629c5001.310.0011.140.039
Rv0054ssb475.5−1.020.006−1.670.000
Rv3801cfadD32159.5−1.370.000−1.750.000
Rv3680Rv3680119.50.910.0031.510.000
Rv2990cRv2990c42.5−1.890.000−1.670.000
Rv0104Rv01046.51.100.0021.500.002
Rv0167yrbE1A3.50.970.0042.100.000
Rv1230cRv1230c3.50.720.0391.240.005
Rv0655mkl−0.5−0.880.002−1.320.002
Rv2107PE22−44.5−1.630.000−2.250.005
Rv1592caRv1592c−54.5−1.310.000−1.820.000
Rv3219bwhiB1−57.5−1.130.000−1.390.001
Rv1057Rv1057−59.5−4.210.000−2.890.000
Rv0452Rv0452−76.51.090.0091.580.017
Rv0885Rv0885−88.5−0.910.004−1.400.001
Rv3053cnrdH−128.5−1.410.000−2.440.000
Rv0467icl1−242.5−1.680.000−2.570.000
Rv2173idsA2−246.51.000.0041.650.000
Rv1379pyrR−365.51.290.0001.950.000
Rv2703sigA−407.5−1.640.003−1.380.007
Rv2846cefpA−416.5−0.930.001−2.670.000
Rv3616cespA−983.5−0.650.028−1.350.002
ExponentialStarved
geneCRP to TSSlog2 Fold ChangeP adjlog2 Fold ChangeP adj
Rv0046cino1500−1.220.000−1.250.003
Rv0169mce1A5000.840.0052.360.000
Rv0469umaA500−0.710.020−0.990.043
Rv1660pks105000.870.0361.020.044
Rv2145cwag31500−0.630.035−1.350.002
Rv2189cRv2189c5001.670.0004.010.000
Rv2200cctaC500−2.120.000−3.080.000
Rv3629cRv3629c5001.310.0011.140.039
Rv0054ssb475.5−1.020.006−1.670.000
Rv3801cfadD32159.5−1.370.000−1.750.000
Rv3680Rv3680119.50.910.0031.510.000
Rv2990cRv2990c42.5−1.890.000−1.670.000
Rv0104Rv01046.51.100.0021.500.002
Rv0167yrbE1A3.50.970.0042.100.000
Rv1230cRv1230c3.50.720.0391.240.005
Rv0655mkl−0.5−0.880.002−1.320.002
Rv2107PE22−44.5−1.630.000−2.250.005
Rv1592caRv1592c−54.5−1.310.000−1.820.000
Rv3219bwhiB1−57.5−1.130.000−1.390.001
Rv1057Rv1057−59.5−4.210.000−2.890.000
Rv0452Rv0452−76.51.090.0091.580.017
Rv0885Rv0885−88.5−0.910.004−1.400.001
Rv3053cnrdH−128.5−1.410.000−2.440.000
Rv0467icl1−242.5−1.680.000−2.570.000
Rv2173idsA2−246.51.000.0041.650.000
Rv1379pyrR−365.51.290.0001.950.000
Rv2703sigA−407.5−1.640.003−1.380.007
Rv2846cefpA−416.5−0.930.001−2.670.000
Rv3616cespA−983.5−0.650.028−1.350.002

CBS genes with concordant patterns of differential expression in the crp deletion mutant under both growth conditions are listed. According to the canonical model for CRPMt regulation, it is anticipated that genes having a CBS overlapping the transcription start site will show increased expression in the absence of CRPMt (highlighted in bold), while genes with a CBS in the upstream region will show decreased expression (underlined). CBSs between −200 and +20 bp with respect to the transcriptional start site were considered for highlighting differences in expression.

aadditional CBS at −101.5.

badditional CBS at −57.5 and 47.5.

Table 2.
Differential expression of CBS genes during exponential growth and nutrient starvation
ExponentialStarved
geneCRP to TSSlog2 Fold ChangeP adjlog2 Fold ChangeP adj
Rv0046cino1500−1.220.000−1.250.003
Rv0169mce1A5000.840.0052.360.000
Rv0469umaA500−0.710.020−0.990.043
Rv1660pks105000.870.0361.020.044
Rv2145cwag31500−0.630.035−1.350.002
Rv2189cRv2189c5001.670.0004.010.000
Rv2200cctaC500−2.120.000−3.080.000
Rv3629cRv3629c5001.310.0011.140.039
Rv0054ssb475.5−1.020.006−1.670.000
Rv3801cfadD32159.5−1.370.000−1.750.000
Rv3680Rv3680119.50.910.0031.510.000
Rv2990cRv2990c42.5−1.890.000−1.670.000
Rv0104Rv01046.51.100.0021.500.002
Rv0167yrbE1A3.50.970.0042.100.000
Rv1230cRv1230c3.50.720.0391.240.005
Rv0655mkl−0.5−0.880.002−1.320.002
Rv2107PE22−44.5−1.630.000−2.250.005
Rv1592caRv1592c−54.5−1.310.000−1.820.000
Rv3219bwhiB1−57.5−1.130.000−1.390.001
Rv1057Rv1057−59.5−4.210.000−2.890.000
Rv0452Rv0452−76.51.090.0091.580.017
Rv0885Rv0885−88.5−0.910.004−1.400.001
Rv3053cnrdH−128.5−1.410.000−2.440.000
Rv0467icl1−242.5−1.680.000−2.570.000
Rv2173idsA2−246.51.000.0041.650.000
Rv1379pyrR−365.51.290.0001.950.000
Rv2703sigA−407.5−1.640.003−1.380.007
Rv2846cefpA−416.5−0.930.001−2.670.000
Rv3616cespA−983.5−0.650.028−1.350.002
ExponentialStarved
geneCRP to TSSlog2 Fold ChangeP adjlog2 Fold ChangeP adj
Rv0046cino1500−1.220.000−1.250.003
Rv0169mce1A5000.840.0052.360.000
Rv0469umaA500−0.710.020−0.990.043
Rv1660pks105000.870.0361.020.044
Rv2145cwag31500−0.630.035−1.350.002
Rv2189cRv2189c5001.670.0004.010.000
Rv2200cctaC500−2.120.000−3.080.000
Rv3629cRv3629c5001.310.0011.140.039
Rv0054ssb475.5−1.020.006−1.670.000
Rv3801cfadD32159.5−1.370.000−1.750.000
Rv3680Rv3680119.50.910.0031.510.000
Rv2990cRv2990c42.5−1.890.000−1.670.000
Rv0104Rv01046.51.100.0021.500.002
Rv0167yrbE1A3.50.970.0042.100.000
Rv1230cRv1230c3.50.720.0391.240.005
Rv0655mkl−0.5−0.880.002−1.320.002
Rv2107PE22−44.5−1.630.000−2.250.005
Rv1592caRv1592c−54.5−1.310.000−1.820.000
Rv3219bwhiB1−57.5−1.130.000−1.390.001
Rv1057Rv1057−59.5−4.210.000−2.890.000
Rv0452Rv0452−76.51.090.0091.580.017
Rv0885Rv0885−88.5−0.910.004−1.400.001
Rv3053cnrdH−128.5−1.410.000−2.440.000
Rv0467icl1−242.5−1.680.000−2.570.000
Rv2173idsA2−246.51.000.0041.650.000
Rv1379pyrR−365.51.290.0001.950.000
Rv2703sigA−407.5−1.640.003−1.380.007
Rv2846cefpA−416.5−0.930.001−2.670.000
Rv3616cespA−983.5−0.650.028−1.350.002

CBS genes with concordant patterns of differential expression in the crp deletion mutant under both growth conditions are listed. According to the canonical model for CRPMt regulation, it is anticipated that genes having a CBS overlapping the transcription start site will show increased expression in the absence of CRPMt (highlighted in bold), while genes with a CBS in the upstream region will show decreased expression (underlined). CBSs between −200 and +20 bp with respect to the transcriptional start site were considered for highlighting differences in expression.

aadditional CBS at −101.5.

badditional CBS at −57.5 and 47.5.

DISCUSSION

The 191 CRPMt-binding sites identified by ChIP-seq analysis have approximately equal distribution between intragenic and intergenic locations across the genome of M. tuberculosis. This is similar to the distribution recently reported for EspR-binding sites (7), and is intermediate between the dominant upstream preference identified by antibody-based ChIP with the ‘classical’ DosR transcription factor (4) and the predominantly intragenic distribution of NAP Lsr2 (8). Using fold-enrichment of bound sequences as a surrogate measure, there was no evidence of any difference in the affinity of CRPMt binding to intergenic as compared to intragenic sites. The antibody-based pull-down strategy we used to identify the binding profile of the native protein did not result in detection of a background pattern of low affinity sites as described for E. coli CRP (12), though results from high-throughput screening of tagged recombinant proteins reveal this to be a common feature of ChIP analysis of M. tuberculosis transcription factors (9). Whilst most transcription factors are present at low or undetectable abundance in proteome profiles measured by shotgun mass spectrometry, CRPMt resembles EspR and MtrA in having a copy number approaching that of NAPs HupB, MihF and Lsr2 (46). The number of CRPMt molecules present in the cell is in several fold excess of that required to saturate binding to the CBSs detected by ChIP-seq.

Taking advantage of a comprehensive map of M. tuberculosis transcriptional start sites, we were able to calculate the distance between each CRPMt-binding site and the closest TSS. This revealed a pattern of clustering in upstream and downstream regions, suggesting a parallel with canonical E. coli CRP regulation, in which CRP binding upstream of the TSS can activate expression, and CRP binding close to the TSS is inhibitory. Data generated by combined ChIP-seq/TSS mapping reproduced previous studies of validated CRPMt regulatory targets, confirming an upstream binding site consistent with activation of serC and Rv0885 (13), and a binding site occluding the TSS of CRPMt-repressed frdA (17). Alignment of ChIP-seq and TSS data sets also reproduced the more complex situation of tandem activating and inhibitory binding sites in the whiB1 promoter (21). Additional potential internal regulatory features include identification of a CBS shared by mmpS4, implicated in siderophore export (47), and its potential regulator Rv0452, and CBSs upstream of both the PE13/PPE18 operon and its Rv0485 regulator (48).

Deletion of the crp gene had a pronounced impact on the growth of M. tuberculosis and on transcription profiles measured in exponential and starved cultures. The effect of crp deletion on expression of the set of CBS genes was similar to its effect on the overall transcriptome, with ∼20% of the genes showing increased or decreased abundance. Consistent with previous publications (13–14,17,21) our results showed that M. tuberculosis CRPMt can act as a ‘classical’ transcription factor in reducing or enhancing expression (dependent on spacing of CBS and TSS), but that this paradigm operates at only a subset of CRPMt-binding sites. Two models can be proposed to reconcile the limited direct effect of crp deletion on expression of CBS genes with the extensive impact of crp deletion on the total transcriptome and growth phenotype. It can be envisaged that the primary transcription changes are amplified through their effect on secondary networks and co-regulation with additional transcription factors. This model is illustrated by recent mapping of the multiple regulatory layers that contribute to the overall function of the FNR transcription factor in E. coli (49). In an alternative model, it can be envisaged that rather than acting as primary determinants, transcription factors such as CRPMt play a complementary role within a regulatory environment dominated by global physiological control mechanisms (50). Deletion of CRPMt may have an influence on global physiology in addition to its localized effect on individual genes. CBS genes that did not show expression changes in the crp-deletion strain could be due to a specific role of CRPMt in regulating transcription states that will be dependent on environmental conditions or a combined regulation in conjunction with other transcription factors. This is in agreement with the findings of Hollands et al., who observed no CRP-dependent regulation in E. coli at several promoters containing high-affinity DNA-binding sites for CRP (51). They also suggest that there may be some specific conditions where CRP-dependent regulation becomes important. Alternatively, they suggest that this may be due to CRP playing a role as an NAP, thus influencing the dynamic spatial arrangement of the chromosome (51).

There is a need for further analysis of the effects of cAMP on CRPMt binding and of the available intracellular concentration of cAMP in different growth states. In addition to its possible role in CRPMt regulation (52), cAMP binds to other transcription factors (53) and has important allosteric effects on enzyme function (54). However, the role of cAMP in CRPMt regulation remains unclear. Whilst some structural studies demonstrate a conformational change associated with binding of cAMP (55), biochemical analysis shows that this has little or no effect on binding to DNA (21). Therefore, based on the current understanding of protein allostery (56), it is likely that CRPMt is a ‘dynamic’ protein (more dynamic than E. coli CRP) that can readily switch to a conformation that promotes DNA interaction, even when cAMP is not bound. cAMP binding to CRP may increase the fraction of protein in the DNA-binding state, via conformational selection, which is reflected in ‘enhanced’ DNA binding. By lowering cAMP levels during starvation, the fraction of CRP in the DNA-binding conformation is lowered, thereby reducing occupancy at CRP-binding sites. This significant DNA binding by CRPMt in the absence of cAMP suggests an evolutionary adaptation given the high levels of cAMP seen in mycobacteria. Therefore CRPMt could have evolved to act as a DNA-coating protein or a recruiting protein for other transcription factors or RNA polymerase. CRPMt levels are quite high in the cell, so any decrease in cAMP levels could also be offset by the high levels of CRPMt present at any given time. However, cAMP binding to CRP resulting in the sustained occupancy of a CRP site may influence the interaction of CRPMt with other transcriptional regulators and chromosome organizing proteins.

ACCESSION NUMBER

ChIPseq and RNAseq data have been submitted to the EBI ArrayExpress under accession number E-MTAB-2390.

ACKNOWLEDGEMENT

We thank Abdul Sesay and the High Throughput Sequencing team at the MRC National Institute for Medical Research for library sequencing.

FUNDING

European 7th Framework Program SysteMTb, European Community [HEALTH-F4-2010-241587]; UK Medical Research Council [U117581288 and U117585867]; Indian Institute of Science, Bangalore [Senior Research Fellowship to N.M.]; Department of Biotechnology, Government of India [DBT/PR7240 to S.S.V]. Funding for open access charge: UK Medical Research Council [U117581288].

Conflict of interest statement. None declared.

REFERENCES

1.
Barry
C.E.
III
Boshoff
H.I.
Dartois
V.
Dick
T.
Ehrt
S.
Flynn
J.
Schnappinger
D.
Wilkinson
R.J.
Young
D.
,
The spectrum of latent tuberculosis: rethinking the biology and intervention strategies
Nat. Rev. Microbiol.
,
2009
, vol.
7
(pg.
845
-
855
)
2.
Cole
S.T.
Brosch
R.
Parkhill
J.
Garnier
T.
Churcher
C.
Harris
D.
Gordon
S.V.
Eiglmeier
K.
Gas
S.
Barry
C.E.
III
et al.
,
Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence
Nature
,
1998
, vol.
393
(pg.
537
-
544
)
3.
Galagan
J.E.
Minch
K.
Peterson
M.
Lyubetskaya
A.
Azizi
E.
Sweet
L.
Gomes
A.
Rustad
T.
Dolganov
G.
Glotova
I.
et al.
,
The Mycobacterium tuberculosis regulatory network and hypoxia
Nature
,
2013
, vol.
499
(pg.
178
-
183
)
4.
Lun
D.S.
Sherrid
A.
Weiner
B.
Sherman
D.R.
Galagan
J.E.
,
A blind deconvolution approach to high-resolution mapping of transcription factor binding sites from ChIP-seq data
Genome Biol.
,
2009
, vol.
10
pg.
R142
5.
Smollett
K.L.
Smith
K.M.
Kahramanoglou
C.
Arnvig
K.B.
Buxton
R.S.
Davis
E.O.
,
Global analysis of the regulon of the transcriptional repressor LexA, a key component of SOS response in Mycobacterium tuberculosis
J. Biol. Chem.
,
2012
, vol.
287
(pg.
22004
-
22014
)
6.
Sala
C.
Haouz
A.
Saul
F.A.
Miras
I.
Rosenkrands
I.
Alzari
P.M.
Cole
S.T.
,
Genome-wide regulon and crystal structure of BlaI (Rv1846c) from Mycobacterium tuberculosis
Mol. Microbiol.
,
2009
, vol.
71
(pg.
1102
-
1116
)
7.
Blasco
B.
Chen
J.M.
Hartkoorn
R.
Sala
C.
Uplekar
S.
Rougemont
J.
Pojer
F.
Cole
S.T.
,
Virulence regulator EspR of Mycobacterium tuberculosis is a nucleoid-associated protein
PLoS Pathog.
,
2012
, vol.
8
pg.
e1002621
8.
Gordon
B.R.
Li
Y.
Wang
L.
Sintsova
A.
van Bakel
H.
Tian
S.
Navarre
W.W.
Xia
B.
Liu
J.
,
Lsr2 is a nucleoid-associated protein that targets AT-rich sequences and virulence genes in Mycobacterium tuberculosis
Proc. Natl. Acad. Sci. U.S.A.
,
2010
, vol.
107
(pg.
5154
-
5159
)
9.
Galagan
J.
Lyubetskaya
A.
Gomes
A.
,
ChIP-Seq and the complexity of bacterial transcriptional regulation
Curr. Top. Microbiol. Immunol.
,
2013
, vol.
363
(pg.
43
-
68
)
10.
Busby
S.
Kolb
A
Lin
ECC
Lynch
AS
,
The CAP modulon
Regulation and Gene Expression in Escherichia coli
,
1996
TX
R.G. Landes & Co.
(pg.
255
-
279
)
11.
Busby
S.
Ebright
R.H.
,
Transcription activation by catabolite activator protein (CAP)
J. Mol. Biol.
,
1999
, vol.
293
(pg.
199
-
213
)
12.
Grainger
D.C.
Hurd
D.
Harrison
M.
Holdstock
J.
Busby
S.J.
,
Studies of the distribution of Escherichia coli cAMP-receptor protein and RNA polymerase along the E. coli chromosome
Proc. Natl. Acad. Sci. U.S.A.
,
2005
, vol.
102
(pg.
17693
-
17698
)
13.
Bai
G.
Schaak
D.D.
Smith
E.A.
McDonough
K.A.
,
Dysregulation of serine biosynthesis contributes to the growth defect of a Mycobacterium tuberculosis crp mutant
Mol. Microbiol.
,
2011
, vol.
82
(pg.
180
-
198
)
14.
Rickman
L.
Scott
C.
Hunt
D.M.
Hutchinson
T.
Menendez
M.C.
Whalan
R.
Hinds
J.
Colston
M.J.
Green
J.
Buxton
R.S.
,
A member of the cAMP receptor protein family of transcription regulators in Mycobacterium tuberculosis is required for virulence in mice and controls transcription of the rpfA gene coding for a resuscitation promoting factor
Mol. Microbiol.
,
2005
, vol.
56
(pg.
1274
-
1286
)
15.
Agarwal
N.
Raghunand
T.R.
Bishai
W.R.
,
Regulation of the expression of whiB1 in Mycobacterium tuberculosis: role of cAMP receptor protein
Microbiology
,
2006
, vol.
152
(pg.
2749
-
2756
)
16.
Stapleton
M.R.
Smith
L.J.
Hunt
D.M.
Buxton
R.S.
Green
J.
,
Mycobacterium tuberculosis WhiB1 represses transcription of the essential chaperonin GroEL2
Tuberculosis
,
2012
, vol.
92
(pg.
328
-
332
)
17.
Akhter
Y.
Tundup
S.
Hasnain
S.E.
,
Novel biochemical properties of a CRP/FNR family transcription factor from Mycobacterium tuberculosis
Int. J. Med. Microbiol.
,
2007
, vol.
297
(pg.
451
-
457
)
18.
Bai
G.
McCue
L.A.
McDonough
K.A.
,
Characterization of Mycobacterium tuberculosis Rv3676 (CRPMt), a cyclic AMP receptor protein-like DNA binding protein
J. Bacteriol.
,
2005
, vol.
187
(pg.
7795
-
7804
)
19.
Akhter
Y.
Yellaboina
S.
Farhana
A.
Ranjan
A.
Ahmed
N.
Hasnain
S.E.
,
Genome scale portrait of cAMP-receptor protein (CRP) regulons in mycobacteria points to their role in pathogenesis
Gene
,
2008
, vol.
407
(pg.
148
-
158
)
20.
Krawczyk
J.
Kohl
T.A.
Goesmann
A.
Kalinowski
J.
Baumbach
J.
,
From Corynebacterium glutamicum to Mycobacterium tuberculosis–towards transfers of gene regulatory networks and integrated data analyses with MycoRegNet
Nucleic Acids Res.
,
2009
, vol.
37
pg.
e97
21.
Stapleton
M.
Haq
I.
Hunt
D.M.
Arnvig
K.B.
Artymiuk
P.J.
Buxton
R.S.
Green
J.
,
Mycobacterium tuberculosis cAMP receptor protein (Rv3676) differs from the Escherichia coli paradigm in its cAMP binding and DNA binding properties and transcription activation properties
J. Biol. Chem.
,
2010
, vol.
285
(pg.
7016
-
7027
)
22.
Green
J.
Stapleton
M.R.
Smith
L.J.
Artymiuk
P.J.
Kahramanoglou
C.
Hunt
D.M.
Buxton
R.S.
,
Cyclic-AMP and bacterial cyclic-AMP receptor proteins revisited: adaptation for different ecological niches
Curr. Opin. Microbiol.
,
2014
, vol.
18C
(pg.
1
-
7
)
23.
Shenoy
A.R.
Visweswariah
S.S.
,
New messages from old messengers: cAMP and mycobacteria
Trends Microbiol.
,
2006
, vol.
14
(pg.
543
-
550
)
24.
Dass
B.K.
Sharma
R.
Shenoy
A.R.
Mattoo
R.
Visweswariah
S.S.
,
Cyclic AMP in mycobacteria: characterization and functional role of the Rv1647 ortholog in Mycobacterium smegmatis
J. Bacteriol.
,
2008
, vol.
190
(pg.
3824
-
3834
)
25.
McCue
L.A.
McDonough
K.A.
Lawrence
C.E.
,
Functional classification of cNMP-binding proteins and nucleotide cyclases with implications for novel regulatory pathways in Mycobacterium tuberculosis
Genome Res.
,
2000
, vol.
10
(pg.
204
-
219
)
26.
Shenoy
A.R.
Sivakumar
K.
Krupa
A.
Srinivasan
N.
Visweswariah
S.S.
,
A survey of nucleotide cyclases in actinobacteria: unique domain organization and expansion of the class III cyclase family in Mycobacterium tuberculosis
Comp. Funct. Genomics
,
2004
, vol.
5
(pg.
17
-
38
)
27.
Agarwal
N.
Lamichhane
G.
Gupta
R.
Nolan
S.
Bishai
W.R.
,
Cyclic AMP intoxication of macrophages by a Mycobacterium tuberculosis adenylate cyclase
Nature
,
2009
, vol.
460
(pg.
98
-
102
)
28.
Betts
J.C.
Lukey
P.T.
Robb
L.C.
McAdam
R.A.
Duncan
K.
,
Evaluation of a nutrient starvation model of Mycobacterium tuberculosis persistence by gene and protein expression profiling
Mol. Microbiol.
,
2002
, vol.
43
(pg.
717
-
731
)
29.
Kahramanoglou
C.
Seshasayee
A.S.
Prieto
A.I.
Ibberson
D.
Schmidt
S.
Zimmermann
J.
Benes
V.
Fraser
G.M.
Luscombe
N.M.
,
Direct and indirect effects of H-NS and Fis on global gene expression control in Escherichia coli
Nucleic Acids Res.
,
2011
, vol.
39
(pg.
2073
-
2091
)
30.
Shenoy
A.R.
Sreenath
N.
Podobnik
M.
Kovacevic
M.
Visweswariah
S.S.
,
The Rv0805 gene from Mycobacterium tuberculosis encodes a 3’,5’-cyclic nucleotide phosphodiesterase: biochemical and mutational analysis
Biochemistry
,
2005
, vol.
44
(pg.
15695
-
15704
)
31.
Li
H.
Durbin
R.
,
Fast and accurate short read alignment with Burrows-Wheeler transform
Bioinformatics
,
2009
, vol.
25
(pg.
1754
-
1760
)
32.
Li
H.
Handsaker
B.
Wysoker
A.
Fennell
T.
Ruan
J.
Homer
N.
Marth
G.
Abecasis
G.
Durbin
R.
,
The Sequence Alignment/Map format and SAMtools
Bioinformatics
,
2009
, vol.
25
(pg.
2078
-
2079
)
33.
Ji
H.
Jiang
H.
Ma
W.
Johnson
D.S.
Myers
R.M.
Wong
W.H.
,
An integrated software system for analyzing ChIP-chip and ChIP-seq data
Nat. Biotechnol.
,
2008
, vol.
26
(pg.
1293
-
1300
)
34.
Cairns
J.
Spyrou
C.
Stark
R.
Smith
M.L.
Lynch
A.G.
Tavare
S.
,
BayesPeak—an R package for analysing ChIP-seq data
Bioinformatics
,
2011
, vol.
27
(pg.
713
-
714
)
35.
Anders
S.
Huber
W.
,
Differential expression analysis for sequence count data
Genome Biol.
,
2010
, vol.
11
pg.
R106
36.
Cortes
T.
Schubert
O.T.
Rose
G.
Arnvig
K.B.
Comas
I.
Aebersold
R.
Young
D.B.
,
Genome-wide mapping of transcriptional start sites defines an extensive leaderless transcriptome in Mycobacterium tuberculosis
Cell Rep.
,
2013
, vol.
5
(pg.
1121
-
1131
)
37.
Yamada
H.
Chikamatsu
K.
Aono
A.
Mitarai
S.
,
Pre-fixation of virulent Mycobacterium tuberculosis with glutaraldehyde preserves exquisite ultrastructure on transmission electron microscopy through cryofixation and freeze-substitution with osmium-acetone at ultralow temperature
J. Microbiol. Methods
,
2014
, vol.
96
(pg.
50
-
55
)
38.
Bailey
T.L.
Elkan
C.
,
Fitting a mixture model by expectation maximization to discover motifs in biopolymers
Proc. Int. Conf. Intell. Syst. Mol. Biol.
,
1994
, vol.
2
(pg.
28
-
36
)
39.
Shimada
T.
Fujita
N.
Yamamoto
K.
Ishihama
A.
,
Novel roles of cAMP receptor protein (CRP) in regulation of transport and metabolism of carbon sources
PloS ONE
,
2011
, vol.
6
pg.
e20081
40.
Lamichhane
G.
Arnvig
K.B.
McDonough
K.A.
,
Definition and annotation of (myco)bacterial non-coding RNA
Tuberculosis (Edinb)
,
2013
, vol.
93
(pg.
26
-
29
)
41.
Pang
X.
Cao
G.
Neuenschwander
P.F.
Haydel
S.E.
Hou
G.
Howard
S.T.
,
The beta-propeller gene Rv1057 of Mycobacterium tuberculosis has a complex promoter directly regulated by both the MprAB and TrcRS two-component systems
Tuberculosis
,
2011
, vol.
91(Suppl. 1)
(pg.
S142
-
S149
)
42.
Blasco
B.
Stenta
M.
Alonso-Sarduy
L.
Dietler
G.
Peraro
M.D.
Cole
S.T.
Pojer
F.
,
Atypical DNA recognition mechanism used by the EspR virulence regulator of Mycobacterium tuberculosis
Mol. Microbiol.
,
2011
, vol.
82
(pg.
251
-
264
)
43.
Hunt
D.M.
Sweeney
N.P.
Mori
L.
Whalan
R.H.
Comas
I.
Norman
L.
Cortes
T.
Arnvig
K.B.
Davis
E.O.
Stapleton
M.R.
et al.
,
Long-range transcriptional control of an operon necessary for virulence-critical ESX-1 secretion in Mycobacterium tuberculosis
J. Bacteriol.
,
2012
, vol.
194
(pg.
2307
-
2320
)
44.
Pang
X.
Samten
B.
Cao
G.
Wang
X.
Tvinnereim
A.R.
Chen
X.L.
Howard
S.T.
,
MprAB regulates the espA operon in Mycobacterium tuberculosis and modulates ESX-1 function and host cytokine response
J. Bacteriol.
,
2013
, vol.
195
(pg.
66
-
75
)
45.
Rosenberg
O.S.
Dovey
C.
Tempesta
M.
Robbins
R.A.
Finer-Moore
J.S.
Stroud
R.M.
Cox
J.S.
,
EspR, a key regulator of Mycobacterium tuberculosis virulence, adopts a unique dimeric structure among helix-turn-helix proteins
Proc. Natl. Acad. Sci. U.S.A.
,
2011
, vol.
108
(pg.
13450
-
13455
)
46.
Schubert
O.T.
Mouritsen
J.
Ludwig
C.
Rost
H.L.
Rosenberger
G.
Arthur
P.K.
Claassen
M.
Campbell
D.S.
Sun
Z.
Farrah
T.
et al.
,
The Mtb proteome library: a resource of assays to quantify the complete proteome of Mycobacterium tuberculosis
Cell Host Microbe
,
2013
, vol.
13
(pg.
602
-
612
)
47.
Wells
R.M.
Jones
C.M.
Xi
Z.
Speer
A.
Danilchanka
O.
Doornbos
K.S.
Sun
P.
Wu
F.
Tian
C.
Niederweis
M.
,
Discovery of a siderophore export system essential for virulence of Mycobacterium tuberculosis
PLoS Pathog.
,
2013
, vol.
9
pg.
e1003120
48.
Goldstone
R.M.
Goonesekera
S.D.
Bloom
B.R.
Sampson
S.L.
,
The transcriptional regulator Rv0485 modulates the expression of a pe and ppe gene pair and is required for Mycobacterium tuberculosis virulence
Infect. Immun.
,
2009
, vol.
77
(pg.
4654
-
4667
)
49.
Myers
K.S.
Yan
H.
Ong
I.M.
Chung
D.
Liang
K.
Tran
F.
Keles
S.
Landick
R.
Kiley
P.J.
,
Genome-scale analysis of Escherichia coli FNR reveals complex features of transcription factor binding
PLoS Genet.
,
2013
, vol.
9
pg.
e1003565
50.
Berthoumieux
S.
de Jong
H.
Baptist
G.
Pinel
C.
Ranquet
C.
Ropers
D.
Geiselmann
J.
,
Shared control of gene expression in bacteria by transcription factors and global physiology of the cell
Mol. Syst. Biol.
,
2013
, vol.
9
(pg.
1
-
11
)
51.
Hollands
K.
Busby
S.J.
Lloyd
G.S.
,
New targets for the cyclic AMP receptor protein in the Escherichia coli K-12 genome
FEMS Microbiol. Lett.
,
2007
, vol.
274
(pg.
89
-
94
)
52.
Harman
J.G.
,
Allosteric regulation of the cAMP receptor protein
Biochim. Biophys. Acta
,
2001
, vol.
1547
(pg.
1
-
17
)
53.
Gazdik
M.A.
Bai
G.
Wu
Y.
McDonough
K.A.
,
Rv1675c (cmr) regulates intramacrophage and cyclic AMP-induced gene expression in Mycobacterium tuberculosis-complex mycobacteria
Mol. Microbiol.
,
2009
, vol.
71
(pg.
434
-
448
)
54.
Nambi
S.
Basu
N.
Visweswariah
S.S.
,
cAMP-regulated protein lysine acetylases in mycobacteria
J. Biol. Chem.
,
2010
, vol.
285
(pg.
24313
-
24323
)
55.
Reddy
M.C.
Palaninathan
S.K.
Bruning
J.B.
Thurman
C.
Smith
D.
Sacchettini
J.C.
,
Structural insights into the mechanism of the allosteric transitions of Mycobacterium tuberculosis cAMP receptor protein
J. Biol. Chem.
,
2009
, vol.
284
(pg.
36581
-
36591
)
56.
Motlagh
H.N.
Wrabl
J.O.
Li
J.
Hilser
V.J.
,
The ensemble nature of allostery
Nature
,
2014
, vol.
508
(pg.
331
-
339
)
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Supplementary data

Comments

0 Comments
Submit a comment
You have entered an invalid code
Thank you for submitting a comment on this article. Your comment will be reviewed and published at the journal's discretion. Please check for further notifications by email.