Abstract

probeBase is a curated database of annotated rRNA-targeted oligonucleotide probes and supporting information. Rapid access to probe, microarray and reference data is achieved by powerful search tools and via different lists that are based on selected categories such as functional or taxonomic properties of the target organism(s) or the hybridization format (fluorescence in situ hybridization or microarray) in which the probes were applied. Additional information on probe coverage and specificity is available through direct submissions of probe sequences from probeBase to RDP-II and Greengenes, two major rRNA sequence databases. A freely editable user comments field for each probe entry allows any user to add, modify or remove information or to report errors in real-time. probeBase entries increased from 700 to more than 1200 during the past three years. Several options for submission of single probes or entire probe sets, even prior to publication of newly developed probes, should further contribute to keeping probeBase an up-to-date and useful resource. probeBase is freely accessible at . Email correspondence can be addressed to probebase@microbial-ecology.net.

INTRODUCTION

Numerous ribosomal RNA (rRNA)-targeted oligonucleotide probes have been developed and are widely employed in a variety of hybridization formats for identification (1–3), quantification (4) and physiological characterization (5) of cultivated and un-cultivated organisms in environmental, bioengineered and clinical settings. In order to release researchers from the necessity to tediously survey the literature for already published probes, probeBase, an internet database for rRNA-targeted oligonucleotides [NAR Molecular Biology Database Collection entry number 390 (6)], was established in 2002 to provide a freely accessible, rapid platform to search for published probes and annotated/pre-calculated information and to facilitate an up-to-date evaluation of probe coverage and specificity (7). Suitable probes in probeBase can be found via the ‘Search’ tool by indicating the name of a probe, a target organism, a certain probe target site or by searching for references. The ‘Match’ service can be used to easily retrieve all probes that are fully complementary to up to 150 query rRNA gene sequences without prior aligning and phylogenetic analysis.

Today, probeBase contains 1258 probes and 8 microarrays from 266 references (status October 2006) and is frequently used by researchers worldwide. For example, probeBase user statistics (accessible on the start page) have recorded an average of 723 hits (corresponding to 554 opened pages) per day in the year 2005.

Here we describe novel features and search services thatwere implemented into probeBase since 2003, accompanied by significant layout and interface changes, which should collectively enhance data interconnectivity and usability of the database. Furthermore, we present a meta-analysis of all probes deposited at probeBase and have evaluated the in silico specificity of frequently used probes targeting major bacterial groups.

NEW DATA AND ANALYSIS TOOLS

Additional probe features

A distinctive accession number that begins with the prefix ‘pB’ is now assigned to each probe, which is essentially defined by its unique sequence. For example, the probes EUB338, EUB338-II and EUB338-III, together targeting most Bacteria (8), have the accession numbers pB-00159, pB-00160 and pB-00161, respectively. This minor but very important new feature of probeBase prevents potential confusion in cases where the same name was incidentally used for more than one probe, especially if a new probe is just a modification of an already existing one.

probeBase now also includes data on rRNA-targeted microarrays [so-called PhyloChips e.g. (9)]. Entries for probes that are used on microarrays contain links to more detailed information on the respective microarray, including a brief description, the technology applied, bibliographic data and a list of all probes used on this particular microarray. probeBase will generally host and welcome submission of all microarrays that consist of rRNA-targeted oligonucleotide probes with hierarchically nested and identical specificity for phylogenetically defined groups of organisms. Microarrays with random probes will not be accommodated.

Theoretical thermodynamic probe parameters according to Yilmaz et al. (10) have been pre-calculated and added to each probe entry. The standard free energy changes ΔG1 and ΔG2 for probe/rRNA duplex formation and intra-molecular probe folding (hairpin formation), respectively, and the thermodynamic affinity of a probe to a hypothetical perfectly accessible target site, ΔG12, should aid in the optimization of existing probes, which have so far not been tested for fluorescence in situ hybridization (FISH) or which show only weak fluorescence signals [See (10,11) for details].

Furthermore, each probe entry contains a so-called Wiki user comments field, which allows users to instantly add, modify or remove information on the probe and its application without the need for prior registration. The opportunity to easily annotate probes in real-time should promote up-to-dateness of probe information, reporting of bugs and errors in the database or the original publication, and interactions among probe users. This combination of community driven and protected content as realized in the current version of probeBase should generate an optimal resource, less susceptible to abuse than pure Wiki systems.

Each probe entry contains links to directly match the probe sequence against RDP-II and Greengenes, major encompassing, regularly updated 16S rRNA sequence online databases (12,13). probeBase users thereby benefit from the more efficient probe match tool that was recently implemented in RDP-II to directly match probe sequences against Bacteria and for the first time are able to match probe sequences against Bacteria and Archaea at Greengenes. These tools enable researchers to quickly check the current in silico specificity and coverage of a probe and thus to validate its originally intended specificity—a step that becomes increasingly important with ever-accumulating rRNA sequence data in public databases.

Search probeBase

In order to extend the search options of probeBase, it is now also possible to search for a certain probe sequence, the accession number of a probe and references. Regarding the latter option, users can survey all references (including titles, authors and abstracts) that are available in the database via a simple key word search. probeBase then returns a list of references and all probes originally described therein, allowing a quick overview of all probes that were published by a certain author. Finally, all search options can now also be restricted to only those probes that were already applied for FISH or microarray hybridization.

Lists and probe categories

A new option to view and retrieve probes, references and microarray layouts from probeBase is the ‘Lists’ service. The following dynamic lists, (pre)compiled according to different criteria, are currently available (suggestions of new probe categories by users are welcome). Because probes targeting major taxonomic groups of bacteria are those most frequently applied in ecological, biotechnological and clinical studies, we have re-evaluated their in silico coverage against the RDP-II database [release 9.41, (12)]. A few selected probes with a good coverage of >70% for the respective target taxon are exemplarily shown in Table 1. Knowing the exact target range of a given probe is vital for meaningful interpretation of the hybridization results.

  • Lists of probes that target a certain functional or taxonomic category of organisms (e.g. sulphate-reducing prokaryotes or organisms of medical/hygienical relevance)

  • A list of all probes

  • A list of all references

  • A list of all probes that were successfully used for FISH

  • A list of all rRNA-targeted oligonucleotide microarrays

  • A pre-calculated, but regularly updated list showing the coverage of selected probes targeting major taxonomic groups of bacteria (exemplified by Table 1)

Table 1

Selected in silico coverages of published group-specific probes in probeBase, as analyzed with the RDP II probe match tool

pB accession number Probe name Specificityb Reference Probe evaluation with RDP II probe matcha 
    RDP II target taxon Coveragec(%) Hits in target taxond Total non-target hitse Major non-target taxaf 
pB-00327 UNIV1390 all organisms (16domain Bacteria 92 75934 NAg — 
pB-00159 EUB338 Bacteria (17) domain Bacteria 90 169389 NAg — 
pB-00021 ALF968 Alphaproteobacteria, except of Rickettsiales (18) class Alphaproteobacteria 79 9750 1876 class Deinococci (38/364),phylum Proteobacteria (11138/51072), class Fibrobacteres (48/53) 
pB-00432 DELTA495a Deltaproteobacteria (19) class Deltaproteobacteria 73 4236 729 phylum Gemmatimonadetes (343/391), phylum Genera_incertae_sedis_WS3 (23/68) 
pB-00050 CIV/V1342 heterocystous strains of cyanobacteria (sections IV and V) (20) Cyanobacteria subsection 4 96 491 36 — 
pB-00044 CFB1082 class Bacteroidetes (21) class Bacteroidetes 85 8485 — 
pB-00046 CFB563 Flavobacteriales (21) order Flavobacteriales 84 3301 302 — 
pB-00718 GNSB-941 Chloroflexi (green nonsulfur bacteria) (22) phylum Chloroflexi 94 1240 816 phylum Thermomicrobia (8/11), phylum Genera_incertae_sedis_Dehalococcoides (81/89), unclassified_Bacteria (706/6563) 
pB-00645 GSB-532 Chlorobiaceae (green sulfur bacteria) (23) family Chlorobiaceae 91 221 — 
pB-00283 PLA46 Planctomycetales (24) family Planctomycetaceae 75 599 310 phylum Lentisphaerae (38/54), phylum Genera_incertae_sedis_BRC1 (10/11), phylum Genera_incertae_sedis_WS3 (21/25), unclassified_Bacteria (232/6286) 
pB-00049 Chls-0523 Chlamydiales (25) order Chlamydiales 93 189 21 — 
pB-00183 HoAc1402 phylum Acidobacteria (26) phylum Acidobacteria 95 693 718 unclassified_Bacteria (602/3249) 
pB-00056 CYA361 phylum Cyanobacteria (20) phylum Cyanobacteria 91 4043 438 phylum Firmicutes(143/49033), unclassified_Bacteria (254/10078) 
pB-00435 NTSPA714 phylum Nitrospira, not Thermodesulfovibrio islandicus (19) phylum Nitrospira 71 373 — 
pB-00535 TDSBM1282 genera Thermodesulfobacterium and Geothermobacterium (19) phylum Thermodesulfobacteria 84 36 — 
pB-00601 TM7522 candidate phylum TM7 (27) phylum Genera_incertae_sedis_TM7 92 264 12 — 
pB accession number Probe name Specificityb Reference Probe evaluation with RDP II probe matcha 
    RDP II target taxon Coveragec(%) Hits in target taxond Total non-target hitse Major non-target taxaf 
pB-00327 UNIV1390 all organisms (16domain Bacteria 92 75934 NAg — 
pB-00159 EUB338 Bacteria (17) domain Bacteria 90 169389 NAg — 
pB-00021 ALF968 Alphaproteobacteria, except of Rickettsiales (18) class Alphaproteobacteria 79 9750 1876 class Deinococci (38/364),phylum Proteobacteria (11138/51072), class Fibrobacteres (48/53) 
pB-00432 DELTA495a Deltaproteobacteria (19) class Deltaproteobacteria 73 4236 729 phylum Gemmatimonadetes (343/391), phylum Genera_incertae_sedis_WS3 (23/68) 
pB-00050 CIV/V1342 heterocystous strains of cyanobacteria (sections IV and V) (20) Cyanobacteria subsection 4 96 491 36 — 
pB-00044 CFB1082 class Bacteroidetes (21) class Bacteroidetes 85 8485 — 
pB-00046 CFB563 Flavobacteriales (21) order Flavobacteriales 84 3301 302 — 
pB-00718 GNSB-941 Chloroflexi (green nonsulfur bacteria) (22) phylum Chloroflexi 94 1240 816 phylum Thermomicrobia (8/11), phylum Genera_incertae_sedis_Dehalococcoides (81/89), unclassified_Bacteria (706/6563) 
pB-00645 GSB-532 Chlorobiaceae (green sulfur bacteria) (23) family Chlorobiaceae 91 221 — 
pB-00283 PLA46 Planctomycetales (24) family Planctomycetaceae 75 599 310 phylum Lentisphaerae (38/54), phylum Genera_incertae_sedis_BRC1 (10/11), phylum Genera_incertae_sedis_WS3 (21/25), unclassified_Bacteria (232/6286) 
pB-00049 Chls-0523 Chlamydiales (25) order Chlamydiales 93 189 21 — 
pB-00183 HoAc1402 phylum Acidobacteria (26) phylum Acidobacteria 95 693 718 unclassified_Bacteria (602/3249) 
pB-00056 CYA361 phylum Cyanobacteria (20) phylum Cyanobacteria 91 4043 438 phylum Firmicutes(143/49033), unclassified_Bacteria (254/10078) 
pB-00435 NTSPA714 phylum Nitrospira, not Thermodesulfovibrio islandicus (19) phylum Nitrospira 71 373 — 
pB-00535 TDSBM1282 genera Thermodesulfobacterium and Geothermobacterium (19) phylum Thermodesulfobacteria 84 36 — 
pB-00601 TM7522 candidate phylum TM7 (27) phylum Genera_incertae_sedis_TM7 92 264 12 — 

Note: The RDP II probe match only includes bacterial but no archaeal or eukaryal small subunit rRNA sequences in the analysis.

aRDP II probe match was performed with database release 9.41 (July 28, 2006) containing 253 813 bacterial 16S rRNA sequences. The search for each probe was restricted to sequences of good quality with data in the respective probe binding region.

bThe intended specificity of the probe.

cThe percentage of sequences within the RDP II target taxon that show a full match to the probe sequence.

dThe number of sequences within the RDP II target taxon that show a full match to the probe sequence.

eThe number of sequences outside the RDP II target taxon that show a full match to the probe sequence.

fIf applicable, additional information on the major non-target taxa are provided. The number of sequences that show a full match to the probe versus the total number of sequences within a non-target taxon are shown in brackets.

gNA, not applicable because RDP II only contains bacterial 16S rRNA sequences.

Distribution of probes along the 16S/18S rRNA sequence

Using the roughly 1200 probes currently available at probeBase we have analyzed the distribution of their respective binding sites on the 16S/18S rRNA sequence (Figure 1). The number of all probes and FISH probes covering a certain target position on the 16S/18S rRNA displayed a weak but significant positive correlation with the variability of the respective sequence position (the non-parametric Spearman-Rho correlation coefficients were 0.404, P < 0.001 and 0.410,P < 0.001, respectively). Most probes in the database target highly variable sequence regions, e.g. around Escherichia coli position 450 and 650, while conserved regions, e.g. around position 350 and 1400 are typically occupied by only few probes that are specific for higher taxa, such as the bacterial probe EUB338 or the universal probe UNIV1390. This analysis may provide guidance for the development of novel probes and the selection of suitable target sites.

Figure 1

Number of probes per target site and positional variability of the 16/18S rRNA molecule (status July 2006). Positional variability profile of the 16S/18S rRNA in percentage was created using the ARB database “ssu_jan04_corr_opt.arb” (14). Background colors indicate the different fluorescence brightness classes (I, bright to VI, dim) according to the consensus accessibility map of bacterial and archaeal 16S rRNA for FISH probes, established by Behrens et al. (15); uncolored regions were not analyzed.

Figure 1

Number of probes per target site and positional variability of the 16/18S rRNA molecule (status July 2006). Positional variability profile of the 16S/18S rRNA in percentage was created using the ARB database “ssu_jan04_corr_opt.arb” (14). Background colors indicate the different fluorescence brightness classes (I, bright to VI, dim) according to the consensus accessibility map of bacterial and archaeal 16S rRNA for FISH probes, established by Behrens et al. (15); uncolored regions were not analyzed.

SUBMISSION OF PROBES

Several options are provided on the probeBase website for submission of single probes and whole probe sets, including microarray (PhyloChip) probes. It is also possible to deposit newly developed probes in probeBase already prior to their publication in a scientific journal, thereby apprising probeBase users of the new probes and the upcoming publication. In this case, the information on the actual probe sequence might be hidden until the day of publication, and only the short name, the specificity, and the target molecule of the probe are displayed. Additionally, the contact details of the person who has deposited the probe are shown.

AVAILABILITY, ERROR REPORTING AND CONTACT

probeBase is maintained by the Department of Microbial Ecology, University of Vienna, Wien, Austria. Data and analysis tools are available at . We welcome questions or comments concerning probeBase and highly appreciate reports of bugs, errors or missing probes. You may contact us by email to probebase@microbial-ecology.net.

The authors are indebted to Şafak Yilmaz and Daniel Noguera for calculation of the ΔG values. We greatly thank Todd DeSantis for his efforts to facilitate the access to Greengenes and acknowledge help of Christian Baranyi for statistical analysis and of Carmen Krammer, Manuela Hartmann and Silvia Weber for database maintenance. We further thank Phil Hugenholtz for valuable comments and all users who contributed to probeBase by submitting probes. probeBase was partly supported by grants of the bmb+f (01 LC 0021A-TP2 in the framework of the BIOLOG II program) to M.W. and the Austrian Science Fund to A.L. (P18836-B17) and M.H. (Y277-B03). Funding to pay the Open Access publication charges for this article was provided by the Austrian Science Fund.

Conflict of interest statement. None declared.

REFERENCES

1
Amann
R.
Fuchs
B.M.
Behrens
S.
The identification of microorganisms by fluorescence in situ hybridisation
Curr. Opin. Biotechnol.
 , 
2001
, vol. 
12
 (pg. 
231
-
236
)
2
Loy
A.
Bodrossy
L.
Highly parallel microbial diagnostics using oligonucleotide microarrays
Clin. Chim. Acta
 , 
2006
, vol. 
363
 (pg. 
106
-
119
)
3
Wagner
M.
Horn
M.
Daims
H.
Fluorescence in situ hybridisation for the identification and characterisation of prokaryotes
Curr. Opin. Microbiol.
 , 
2003
, vol. 
6
 (pg. 
302
-
309
)
4
Daims
H.
Lücker
S.
Wagner
M.
daime, a novel image analysis program for microbial ecology and biofilm research
Environ. Microbiol.
 , 
2006
, vol. 
8
 (pg. 
200
-
213
)
5
Wagner
M.
Nielsen
P.H.
Loy
A.
Nielsen
J.L.
Daims
H.
Linking microbial community structure with function: fluorescence in situ hybridization-microautoradiography and isotope arrays
Curr. Opin. Biotechnol.
 , 
2006
, vol. 
17
 (pg. 
1
-
9
)
6
Galperin
M.Y.
The Molecular Biology Database Collection: 2006 update
Nucleic Acids Res.
 , 
2006
, vol. 
34
 (pg. 
D3
-
D5
)
7
Loy
A.
Horn
M.
Wagner
M.
probeBase: an online resource for rRNA-targeted oligonucleotide probes
Nucleic Acids Res.
 , 
2003
, vol. 
31
 (pg. 
514
-
516
)
8
Daims
H.
Brühl
A.
Amann
R.
Schleifer
K.-H.
Wagner
M.
The domain-specific probe EUB338 is insufficient for the detection of all Bacteria: development and evaluation of a more comprehensive probe set
Syst. Appl. Microbiol.
 , 
1999
, vol. 
22
 (pg. 
434
-
444
)
9
Loy
A.
Schulz
C.
Lücker
S.
Schöpfer-Wendels
A.
Stoecker
K.
Baranyi
C.
Lehner
A.
Wagner
M.
16S rRNA gene-based oligonucleotide microarray for environmental monitoring of the betaproteobacterial order “Rhodocyclales
Appl. Environ. Microbiol.
 , 
2005
, vol. 
71
 (pg. 
1373
-
1386
)
10
Yilmaz
L.S.
Okten
H.E.
Noguera
D.R.
Making all parts of the 16S rRNA of Escherichia coli accessible in situ to single DNA oligonucleotides
Appl. Environ. Microbiol.
 , 
2006
, vol. 
72
 (pg. 
733
-
744
)
11
Yilmaz
L.S.
Noguera
D.R.
Mechanistic approach to the problem of hybridization efficiency in fluorescent in situ hybridization
Appl. Environ. Microbiol.
 , 
2004
, vol. 
70
 (pg. 
7126
-
7139
)
12
Cole
J.R.
Chai
B.
Farris
R.J.
Wang
Q.
Kulam
S.A.
McGarrell
D.M.
Garrity
G.M.
Tiedje
J.M.
The Ribosomal Database Project (RDP-II): sequences and tools for high-throughput rRNA analysis
Nucleic Acids Res.
 , 
2005
, vol. 
33
 (pg. 
294
-
296
)
13
DeSantis
T.Z.
Hugenholtz
P.
Larsen
N.
Rojas
M.
Brodie
E.L.
Keller
K.
Huber
T.
Dalevi
D.
Hu
P.
Andersen
G.L.
Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB
Appl. Environ. Microbiol.
 , 
2006
, vol. 
72
 (pg. 
5069
-
5072
)
14
Ludwig
W.
Strunk
O.
Westram
R.
Richter
L.
Meier
H.
Yadhukumar
Buchner
A.
Lai
T.
Steppi
S.
Jobb
G.
, et al.  . 
ARB: a software environment for sequence data
Nucleic Acids Res.
 , 
2004
, vol. 
32
 (pg. 
1363
-
1371
)
15
Behrens
S.
Ruhland
C.
Inacio
J.
Huber
H.
Fonseca
A.
Spencer-Martins
I.
Fuchs
B.M.
Amann
R.
In situ accessibility of small-subunit rRNA of members of the domains Bacteria, Archaea, and Eucarya to Cy3-labeled oligonucleotide probes
Appl. Environ. Microbiol.
 , 
2003
, vol. 
69
 (pg. 
1748
-
1758
)
16
Zheng
D.
Alm
E.W.
Stahl
D.A.
Raskin
L.
Characterization of universal small-subunit rRNA hybridization probes for quantitative molecular microbial ecology studies
Appl. Environ. Microbiol.
 , 
1996
, vol. 
62
 (pg. 
4504
-
4513
)
17
Amann
R.I.
Binder
B.J.
Olson
R.J.
Chisholm
S.W.
Devereux
R.
Stahl
D.A.
Combination of 16S rRNA-targeted oligonucleotide probes with flow cytometry for analyzing mixed microbial populations
Appl. Environ. Microbiol.
 , 
1990
, vol. 
56
 (pg. 
1919
-
1925
)
18
Neef
A.
1997
München
Technische Universität München
 
Ph. D. thesis. Anwendung der in situ Einzelzell-Identifizierung von Bakterien zur Populations analyse in komplexen mikrobiellen Biozönosen. Technische Universität München, München
19
Loy
A.
Lehner
A.
Lee
N.
Adamczyk
J.
Meier
H.
Ernst
J.
Schleifer
K.-H.
Wagner
M.
Oligonucleotide microarray for 16S rRNA gene-based detection of all recognized lineages of sulfate-reducing prokaryotes in the environment
Appl. Environ. Microbiol.
 , 
2002
, vol. 
68
 (pg. 
5064
-
5081
)
20
Schönhuber
W.
Zarda
B.
Eix
S.
Rippka
R.
Herdman
M.
Ludwig
W.
Amann
R.
In situ identification of cyanobacteria with horseradish peroxidase-labeled, rRNA-targeted oligonucleotide probes
Appl. Environ. Microbiol.
 , 
1999
, vol. 
65
 (pg. 
1259
-
1267
)
21
Weller
R.
Glöckner
F.O.
Amann
R.
16S rRNA-targeted oligonucleotide probes for the in situ detection of members of the phylum Cytophaga-Flavobacterium-Bacteroides
Syst. Appl. Microbiol.
 , 
2000
, vol. 
23
 (pg. 
107
-
114
)
22
Gich
F.
Garcia-Gil
J.
Overmann
J.
Previously unknown and phylogenetically diverse members of the green nonsulfur bacteria are indigenous to freshwater lakes
Arch. Microbiol.
 , 
2001
, vol. 
177
 (pg. 
1
-
10
)
23
Tuschak
C.
Glaeser
J.
Overmann
J.
Specific detection of green sulfur bacteria by in situ hybridization with a fluorescently labeled oligonucleotide probe
Arch. Microbiol.
 , 
1999
, vol. 
171
 (pg. 
265
-
272
)
24
Neef
A.
Amann
R.
Schlesner
H.
Schleifer
K.-H.
Monitoring a widespread bacterial group: in situ detection of planctomycetes with 16S rRNA-targeted probes
Microbiology
 , 
1998
, vol. 
144
 (pg. 
3257
-
3266
)
25
Poppert
S.
Essig
A.
Marre
R.
Wagner
M.
Horn
M.
Detection and differentiation of chlamydiae by fluorescence in situ hybridization
Appl. Environ. Microbiol.
 , 
2002
, vol. 
68
 (pg. 
4081
-
4089
)
26
Juretschko
S.
Loy
A.
Lehner
A.
Wagner
M.
The microbial community composition of a nitrifying-denitrifying activated sludge from an industrial sewage treatment plant analyzed by the full-cycle rRNA approach
Syst. Appl. Microbiol.
 , 
2002
, vol. 
25
 (pg. 
84
-
99
)
27
Hugenholtz
P.
Tyson
G.W.
Webb
R.I.
Wagner
A.M.
Blackall
L.L.
Investigation of candidate division TM7, a recently recognized major lineage of the domain Bacteria with no known pure-culture representatives
Appl. Environ. Microbiol.
 , 
2001
, vol. 
67
 (pg. 
411
-
419
)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Comments

0 Comments