Abstract

Summary: Rapid development, transparency and small size are the outstanding features of zebrafish that make it as an increasingly important vertebrate system for developmental biology, functional genomics, disease modeling and drug discovery. Zebrafish has been regarded as ideal animal specie for studying the relationship between genotype and phenotype, for pathway analysis and systems biology. However, the tremendous amount of data generated from large numbers of embryos has led to the bottleneck of data analysis and modeling. The zebrafish image quantitator (ZFIQ) software provides streamlined data processing and analysis capability for developmental biology and disease modeling using zebrafish model.

Availability: ZFIQ is available for download at http://www.cbi-platform.net

Contact:  [email protected]

Supplementary information: Additional documentation for this software package is referred to http://www.cbi-platform.net/document.htm. Application examples of this software are referred to http://www.cbi-platform.net/download.htm

1 OVERVIEW

The overall architecture of the zebrafish image quantitator (ZFIQ) platform is illustrated in Figure 1a. The major four components of the ZFIQ platform, represented by the blue boxes, include: (1) Pipeline construction environment. The ZFIQ platform allows developers or users to dynamically construct data processing and analysis pipelines in an interactive graphical environment. The pipeline construction is driven by the data quantification problem raised by the zebrafish research community. The data types depend on specific applications. Current supported data types include image data and text data. (2) Pipeline pool. Once the components and parameters of a data quantification pipeline are tested and validated, the pipeline is archived in the pipeline pool, which is the aggregation of users’ knowledge and experience of data quantification. (3) Algorithm and method repository. Currently, the algorithms and method repository has already integrated most of the data analysis routines (over 400) in the Intel OpenCV library (Gary et al., 2000), and in-house built zebrafish data quantification methods. (4) Pipeline running environment. This environment provides users with graphic tools to perform large scale of data quantification studies, and to output the quantification results for specific statistical modeling and analysis. The ZFIQ platform runs on the Microsoft Windows system and accepts most of popular formats of input images.

(a) Overall architecture of the ZFIQ platform. (b). An example of zebrafish nuclei segmentation result produced by ZFIQ. The algorithm is referred to Li et al. (2007b).
Fig. 1.

(a) Overall architecture of the ZFIQ platform. (b). An example of zebrafish nuclei segmentation result produced by ZFIQ. The algorithm is referred to Li et al. (2007b).

2 FUNCTIONALITIES

2.1 Pipeline construction environment

There are four windows in the pipeline construction environment, as shown in http://www.cbi-platform.net/document.htm. Here, a constructed pipeline for cell nuclei segmentation is used as an example. Specifically, the left window in Figure 1b lists all of the integrated computational algorithms in the ZFIQ platform. The user can drag-and-drop any of these algorithms or filters, into the middle window of pipeline construction window, and link them together into a pipeline. In particular, the ZFIQ platform provides automatic checking mechanism to make sure that the links that connects algorithm components are legal in terms of parameter types. At the same time, the execution order of the pipeline is rendered in the execution sequence window on the top right. The execution sequence can be adjusted by user's drag-and-drop operation in this window, if there are independent executions in parallel. Finally, the parameters in the pipeline can be interactively edited in the parameter editing window, as shown in the right bottom. This functionality provides the capability to adaptively fine-tune some parameters of the pipeline for a specific dataset. In this sense, a pipeline not only means a sequence of data analysis algorithms, but a set of parameters associated with these algorithms.

2.2 Algorithm and method repository

The ZFIQ algorithm and method repository has integrated: (1) the data analysis routines (over 400 functions) in the Intel OpenCV library (Gary et al., 2000) and (2) zebrafish data quantification methods and tools developed by the CBI (Center for Biomedical Informatics) and its collaborators. The ZFIQ platform focuses on data quantification problems in the zebrafish biology community. In particular, we are interested in the following common zebrafish data quantification problems: (1) zebrafish cell quantification and pattern analysis, (2) zebrafish neuron and axon quantification, (3) zebrafish somite quantification, (4) zebrafish gene expression quantification and (5) zebrafish morphological analysis. Currently, we have integrated over 30 algorithms and tools developed by CBI and its collaborators into the ZFIQ platform (Li et al., 2007c; Liu et al., 2006).

2.3 Pipeline pool

So far, the ZFIQ platform has over 50 pipelines for zebrafish cell detection, zebrafish cell quantification and zebrafish neuron detection, as well as for general data manipulation and processing. Although these pipelines are already extensively tested and validated using datasets from our collaborating zebrafish research labs, other users might need to fine-tune the parameters according to their own datasets. The pipeline pool is under the folder./CBI_platform/pipelines after the installation of ZFIQ.

2.4 Pipeline running environment

The pipeline running environment provides users with different modes to run the selected pipeline, depending on the amount of dataset to be analyzed. For example, the running environment includes graphical tools that allow users to select large amounts of datasets for batch mode running using a specific pipeline. This support is particularly useful when large datasets generated from zebrafish experiments are to be processed and quantified. The analyzed results and relevant meta-data can be output to a structured database for statistical analysis and modeling of the relationship between genotypes and phenotypes. The pipeline running environment also provides users graphical tool to fine-tune the parameters specific for their zebrafish data quantification applications.

3 APPLICATION

The ZFIQ platform has been applied for zebrafish data quantification in study of Alzheimer's disease genes using zebrafish models, zebrafish retina development and zebrafish somitogenesis. We emphasize that this article focuses on the demonstration of the software platform and that the specific algorithm details and biological questions represented by the datasets are addressed in other publications (Campbell et al., 2006; Li et al., 2007a,b,c; Liu et al., 2006; Liu, 2007)

4 CONCLUSION

Developing a data analysis system that can deal with the diversity of zebrafish datasets and data quality is a challenge (Holley et al., 2000; Patton and Zon, 2001; Peterson et al., 2004; Pujic et al., 2006). The development of the ZFIQ software package aims to fill the void. By building ZFIQ on top of the public domain OpenCV platform, we welcome the computation community to contribute their data quantification pipelines to the ZFIQ, and, in turn, expect the zebrafish research community to contribute their quantification problems and associated datasets to the development. The growth of the pipeline pool of the ZFIQ package would depend on the fruitful interaction of the two communities. In addition, the development of ZFIQ would be complementary to available tools for other model systems such as Caenorhabditis elegans and fly (Peng et al., 2007a,b)

ACKNOWLEDGEMENTS

We would like to thank Dr Weiming Xia and Dr Jarema Malicki of Harvard Medical School and Dr Scott Holley of Yale University for providing the zebrafish datasets discussed in this article. The research was funded by the Bioinformatics Research Center Program Grant (PI: S.T.C.W.) from Harvard Neurodiscovery Center (formally known as HCNR), Harvard Medical School and The Bioinformatics Program Grant (PI: S.T.C.W.) from the Methodist Hospital Research Institute.

Conflict of Interest: none declared.

REFERENCES

Campbell
WA
et al.
,
Zebrafish lacking Alzheimer presenilin enhancer 2 (Pen-2) demonstrate excessive p53-dependent apoptosis and neuronal loss
J. Neurochem
,
2006
, vol.
96
(pg.
1423
-
1440
)
Gary
RB
Vadim
P
,
Intel's computer vision library: applications in calibration, stereo, segmentation, tracking, gesture, face and object recognition
,
2000
Proceeding of IEEE Computer Vision and Pattern Recognition
Hilton Head Island, South Carolina
Peng
H
et al.
,
Automatic image analysis for gene expression patterns of fly embryos
BMC Cell Biol
,
2007
, vol.
8
Supp.1
pg.
S7
Peng
H
et al.
,
Straightening C. elegans images
Bioinformatics
,
2007
 
Published online on 19 November, 2007, doi: 10.1093/bioinformatics/btm569
Holley
SA
et al.
,
Control of her1 expression during zebrafish somitogenesis by a Delta-dependent oscillator and an independent wavefront activity
Genes Dev
,
2000
, vol.
14
(pg.
1678
-
1690
)
Li
G
et al.
,
Detection of blob objects in microscopic Zebrafish images based on gradient vector diffusion
Cytometry A
,
2007
, vol.
71
(pg.
835
-
845
)
Li
G
et al.
,
3D cell segmentation based on gradient flow tracking, accepted with revision
BMC Cell Biol
,
2007
, vol.
8
pg.
40
Li
G
et al.
,
Segmentation of touching cell nuclei using gradient flow tracking
J. Microsc
,
2007
 
in press
Liu
T
et al.
,
Computerized image analysis for quantitative neuronal phenotyping in zebrafish
J. Neurosci. Methods
,
2006
, vol.
153
(pg.
190
-
202
)
Liu
T
,
A quantitative zebrafish phenotyping tool for developmental biology and disease modeling
IEEE Signal Proc. Mag
,
2007
, vol.
24
(pg.
126
-
129
)
Patton
EE
Zon
LI
,
The art and design of genetic screens: zebrafish
Nat. Rev. Genet
,
2001
, vol.
2
(pg.
956
-
966
)
Peterson
RT
et al.
,
Chemical suppression of a genetic mutation in a zebrafish model of aortic coarctation
Nat. Biotechnol
,
2004
, vol.
22
(pg.
595
-
599
)
Pujic
Z
et al.
,
Reverse genetic analysis of neurogenesis in the zebrafish retina
Dev. Biol
,
2006
, vol.
293
(pg.
330
-
347
)

Author notes

Associate Editor: Chris Stoeckert