Abstract

The Arabidopsis Information Portal (https://www.araport.org) is a new online resource for plant biology research. It houses the Arabidopsis thaliana genome sequence and associated annotation. It was conceived as a framework that allows the research community to develop and release ‘modules’ that integrate, analyze and visualize Arabidopsis data that may reside at remote sites. The current implementation provides an indexed database of core genomic information. These data are made available through feature-rich web applications that provide search, data mining, and genome browser functionality, and also by bulk download and web services. Araport uses software from the InterMine and JBrowse projects to expose curated data from TAIR, GO, BAR, EBI, UniProt, PubMed and EPIC CoGe. The site also hosts ‘science apps,’ developed as prototypes for community modules that use dynamic web pages to present data obtained on-demand from third-party servers via RESTful web services. Designed for sustainability, the Arabidopsis Information Portal strategy exploits existing scientific computing infrastructure, adopts a practical mixture of data integration technologies and encourages collaborative enhancement of the resource by its user community.

INTRODUCTION

The flowering plant Arabidopsis thaliana is a model organism for molecular, cellular and systems biology. Arabidopsis offers the favorable characteristics of short lifecycle, prolific seed production and genomic transformation efficiency. The A. thaliana Col-0 ecotype offers a nearly complete genome sequence assembly and high-quality gene annotation, along with many informatic and experimental resources. Other ecotypes and mutant lines, available in seed form, have been genotypically and phenotypically characterized. For over a decade, The Arabidopsis Information Resource (TAIR) served as the collector, curator and provider of pertinent information centered on the reference Col-0 genome. In 2009, the Arabidopsis community was faced with a growing number of online resources and potential loss of its central organizing resource, TAIR. After a series of workshops, members of the Arabidopsis research community published a white paper calling for a new Arabidopsis Information Portal (AIP) (1). The portal would assume responsibility for curation of the genome sequence and gene annotation. Intensive human curation of gene function, literature association etc. is not part of AIP's funded mandate. Rather, it will use an extensible architecture that will permit growth through contributions of community-curated data-centric modules embracing a wide range of data types. With NSF and BBSRC funding, we presented in 2014 a first functional release of Araport (ICAR 2014 proceedings).

Following its termination of funding and in parallel with the development of the AIP, TAIR began to require subscription fees in 2014, allowing it to continue its gene-centric data curation services (ICAR 2014 proceedings) that will complement the work of AIP. The AIP project does, and will continue to, incorporate all the pertinent data that TAIR makes public on a time-delayed schedule. Future Araport releases could act as a gateway by providing access to additional data for registered users of TAIR and other subscription-based services.

The AIP is freely available online at https://www.araport.org. It offers two core web applications: ThaleMine for data mining and JBrowse for genome browsing. It hosts an embryonic collection of ‘science apps’ that should serve as prototypes for community-contributed modules. It offers a reproducible and extensible framework, based on data federation and community-generated content, that should allow it to serve the growing and changing Arabidopsis fields of omics research.

CORE FUNCTIONALITY

The core of AIP is a database of basic genomic data within a data mining web application called ThaleMine. ThaleMine is a customized instance of the InterMine software (2), which provides parsers for many data sources, rapid retrieval of indexed data, dynamic table presentation, saved and shared list functionality, and web services (3) support. Currently, ThaleMine includes the TAIR 10 Col-0 sequence and annotation dataset (4). Assimilation of other data types (polymorphisms, phenotypes, stocks, etc.) is in progress. ThaleMine integrates data from UniProtKB (5), PubMed (6), Bio-Analytic Resource (BAR) (7), PANTHER (8), Sequence Ontology (9) and Gene Ontology (10). ThaleMine links to sources including PhytoMine implemented by Phytozome (11), FlyMine (12) and YeastMine (13) (Table 1). ThaleMine organizes 33 602 genes (including transposable element and pseudogenes), 41 671 transcripts, 31 189 transposable elements, 47 698 proteins, 16 127 publications, 125 expression studies, 6365 protein domains, 197 625 interactions and 5323 gene ontology terms. AIP loaded ThaleMine with an extensible set of template-based queries, which provide simple web interfaces to common kinds of searches (Table 2). The application supports free-text search as well as structured queries. For instance, searches can return lists of loci with specified attributes within the A. thaliana Col-0 reference genome sequence. List analysis is facilitated by widgets that generate column-wise statistics, identify over-represented terms or publication links or compare lists by set-union and set-intersection. Capability to drill down by locus is facilitated by report pages for genes, transcripts and proteins. These pages display gene structure, functional annotation, protein domains, protein interactions, associated publications, expression patterns, co-expressed genes, orthologs in other species and links to other resources. Locus-specific pages also display an embedded BAR Electronic Fluorescent Pictograph (eFP) viewer (14) (Figure 1). The ThaleMine User Guide (https://www.araport.org/thalemine/user-guide) has more details.

Figure 1.

A Gene report page showing: (A) attributes like standard gene locus identifier, symbols/synonyms, TAIR curator summary, confidence rating, etc. Other useful information is segregated into the aspects, as follows: (B) ‘Genomics’ showing gene structure information; (C) ‘Function’ displaying the Gene Ontology annotation; (D) ‘Proteins’ with links to the relevant protein records populated with data from UniProt; (E) ‘Homology’ listing the computed orthologs and paralogs across a diverse set of species; (F) ‘Interactions’ showing a visual representation of the genes’ physical/genetic interactions (tabular format is also available); (G) ‘Expression’ reporting gene expression levels based on AtGenExpress project data, also visualized using an embedded eFP view; (H) ‘Publications’ reports papers associated with the current gene; (I) ‘Links’ to other InterMine databases where homologs of the current gene exist, and external links to several important Arabidopsis and plant genomics data providers.

Summary of the datasets used to populate ThaleMine and their corresponding sources

Table 1.
Summary of the datasets used to populate ThaleMine and their corresponding sources
DatasetData source
Genome sequence and annotationTAIR (version 10; 8/31/13)
Protein sequence and propertiesUniprotKB
Protein interactionsBio-Analytic Resource (BAR)
Affymetrix expression dataAtGenExpress via BAR
Electronic Fluorescent PictographsBio-Analytic Resource
PublicationsUniprotKB and NCBI Entrez
OrthologsPANTHER and PhytoMine
DatasetData source
Genome sequence and annotationTAIR (version 10; 8/31/13)
Protein sequence and propertiesUniprotKB
Protein interactionsBio-Analytic Resource (BAR)
Affymetrix expression dataAtGenExpress via BAR
Electronic Fluorescent PictographsBio-Analytic Resource
PublicationsUniprotKB and NCBI Entrez
OrthologsPANTHER and PhytoMine
Table 1.
Summary of the datasets used to populate ThaleMine and their corresponding sources
DatasetData source
Genome sequence and annotationTAIR (version 10; 8/31/13)
Protein sequence and propertiesUniprotKB
Protein interactionsBio-Analytic Resource (BAR)
Affymetrix expression dataAtGenExpress via BAR
Electronic Fluorescent PictographsBio-Analytic Resource
PublicationsUniprotKB and NCBI Entrez
OrthologsPANTHER and PhytoMine
DatasetData source
Genome sequence and annotationTAIR (version 10; 8/31/13)
Protein sequence and propertiesUniprotKB
Protein interactionsBio-Analytic Resource (BAR)
Affymetrix expression dataAtGenExpress via BAR
Electronic Fluorescent PictographsBio-Analytic Resource
PublicationsUniprotKB and NCBI Entrez
OrthologsPANTHER and PhytoMine

List of commonly executed searches provided as ‘Template searches’

Table 2.
List of commonly executed searches provided as ‘Template searches’
Input (accepts wildcard characters)Output (reported as a table, exportable in different file formats)
Gene (identifier/alias)Related protein identifiers
List of interactors
Set of homologous genes
Expression values
Publications referencing this gene
FASTA sequences of the CDS
FASTA sequence of the proteins
FASTA sequences of the 5′/3′ UTRs
Protein (identifier/domain)Related gene identifiers
Publications referencing this protein
Ontology termList of genes
Input (accepts wildcard characters)Output (reported as a table, exportable in different file formats)
Gene (identifier/alias)Related protein identifiers
List of interactors
Set of homologous genes
Expression values
Publications referencing this gene
FASTA sequences of the CDS
FASTA sequence of the proteins
FASTA sequences of the 5′/3′ UTRs
Protein (identifier/domain)Related gene identifiers
Publications referencing this protein
Ontology termList of genes
Table 2.
List of commonly executed searches provided as ‘Template searches’
Input (accepts wildcard characters)Output (reported as a table, exportable in different file formats)
Gene (identifier/alias)Related protein identifiers
List of interactors
Set of homologous genes
Expression values
Publications referencing this gene
FASTA sequences of the CDS
FASTA sequence of the proteins
FASTA sequences of the 5′/3′ UTRs
Protein (identifier/domain)Related gene identifiers
Publications referencing this protein
Ontology termList of genes
Input (accepts wildcard characters)Output (reported as a table, exportable in different file formats)
Gene (identifier/alias)Related protein identifiers
List of interactors
Set of homologous genes
Expression values
Publications referencing this gene
FASTA sequences of the CDS
FASTA sequence of the proteins
FASTA sequences of the 5′/3′ UTRs
Protein (identifier/domain)Related gene identifiers
Publications referencing this protein
Ontology termList of genes

AIP hosts a customized instance of JBrowse (15) that, like InterMine, is actively supported by the GMOD consortium (http://www.gmod.org). The browser integrates gene structure, various kinds of evidence and genomic attributes, expression data, and more. Below the reference sequence axis, the browser presents a stack of ‘tracks’ representing, for example, the TAIR 10 gene models. The browser currently incorporates 20 tracks, including datasets such as pseudogenes, expression, non-coding RNAs, epigenetics and T-DNA insertion lines, which are presented to the user via the hierarchical track selector (Figure 2A). Additionally, 28 more tracks (not in the hierarchical selector) are obtained on-demand from EPIC CoGe (16) via web services, which are cataloged within the faceted track selector (Figure 2B). We modified JBrowse to enable the hierarchical and faceted track selectors to co-exist. This allows users to select from our 20 standard tracks and also our 28 metadata-rich tracks organized by user-set facets of their metadata. The browser also offers bulk download of interval-specific track data in common file formats (FASTA, GFF3, BED). Similarly, it can integrate user-provided datasets, either local or hosted on third party servers (i.e. displaying tracks of read mappings without transmitting entire files, by exploiting indexed BAM files) without data transmission to AIP servers (Figure 2C). While searching or scrolling, the page URL is dynamically updated with sufficient information to fully reproduce the display. This feature enables collaboration by URL sharing between remote users.

Figure 2.

(A) (i) List of tracks available via the hierarchical track selector; (ii) a link to access the faceted track selector; (iii) track display panel showing the currently selected tracks for viewing (green inset: track chosen from the faceted selector). (B) Faceted track selector offers a search and filtering interface to enable choosing tracks based on the experiment metadata. (C) File upload capability ensures privacy to view ones own datasets alongside the Arabidopsis genome, and flexibility to work with large datasets such as short read alignments.

EXTENSIBLE FUNCTIONALITY

The AIP design relies on data federation and software modularity for scalability and sustainability. The AIP module concept consists of a visual front end to an externally provided database or computational resource. AIP encourages the Arabidopsis research community to contribute modules as a means of disseminating their results. AIP provides the framework for developing, hosting and integrating modules as well as software and training materials. The AIP software development kit (SDK) provides boilerplate code (as an example it deploys a ‘Hello World’ app, the classic first task for programmers), serving as a starting point for developers. The SDK was built with node.js (http://nodejs.org), grunt (http://gruntjs.com) and yeoman (http://yeoman.io) software. Using this SDK, AIP has developed and hosts several demonstration modules. One is a BLAST (17) module offering homology search and sequence alignment visualization. Another provides graphical and tabular viewers for Arabidopsis protein–protein interactions, for which the data are obtained on demand from EBI IntAct web services (18) and the visualization relies on BioJS (19) and Cytoscape (20) open-source software. AIP science apps can access web services directly or through the AIP middleware. They can access AIP-hosted web services that expose the data, metadata and indexes associated with ThaleMine and JBrowse at AIP.

AIP hosts community information. This includes notices of news regarding the AIP project, postings syndicated from GARNet (http://www.garnetcommunity.org.uk), conferences/events and job openings. AIP has a forum for posting, and answering, questions to AIP staff and the community. AIP offers files for bulk download including the latest public TAIR release of whole genome sequence (FASTA), gene structure annotation (GFF3) and other supporting datasets.

ARCHITECTURE

With the goal of providing the Arabidopsis research community with a sustainable resource, Araport is designed to be scalable and reproducible. For scalability, its organizing principle is federation. AIP hosts the middleware and services that enable the development and deployment of community-contributed science apps. As enablers, AIP hosts the two data-centric applications, ThaleMine and JBrowse, which provide data services, visualization tools and optional launch points for third-party modules. For reproducibility, AIP maintains public repositories hosted on GitHub (https://www.github.com), tracking the data, configuration and software code that are required to recreate the Araport website. We already rebuild parts of Araport (e.g. ThaleMine) automatically from scripts in the repository. We aim to automatically deploy the complete site at two locations (JCVI, TACC) and possibly others.

The AIP core applications, ThaleMine and JBrowse, were both implemented by customizing open-source GMOD software. The core applications represent a mixture of the data federation and data hosting approaches to website development. Both applications use federation to obtain data on demand from remote sites. ThaleMine displays links to orthologs and eFP expression pattern images, both dependent on run-time data requests to third-party web services. ThaleMine and JBrowse not only consume but also expose web services. These web services have RESTful interfaces, as first described by Fielding (21). They are exposed through an AIP mediation layer in an effort to increase their uniformity, stability and ease of use (e.g. by precluding cross-origin resource sharing (CORS) security restrictions). Both ThaleMine and JBrowse also rely on AIP-hosted data, currently ∼150 GB, sufficient to provide rapid displays and search responses. Hosted data include genome sequence; IDs and functional descriptions of genes, transcripts, proteins, domains and ontological terms; and metadata used to identify genome browser tracks. A portion of ThaleMine data is obtained from TAIR on a delayed-release schedule. Other data are downloaded from FTP sites and web services shortly before deploying the applications. A Chado database (22), with Tripal modules (23), is under construction. It will act as a data warehouse in support of future application deployments.

The Araport software employs a strongly decoupled three-tier design in which front end presentation software communicates through a middle layer to back end databases. This architecture was adopted to promote site sustainability by enabling future re-engineering of each tier independently. The AIP presentation layer uses HTML5 (http://www.w3.org/TR/html5/), CSS (http://www.w3.org/Style/CSS/), JavaScript, AJAX and jQuery (http://jquery.com) delivered via HTTPS to provide a responsive and secure experience to clients using modern web browsers on any platform. AIP's middle layer employs the Apache Tomcat web server, InterMine software, the Drupal content management system, the iPlant Collaborative-developed Agave API platform (24) and OAuth2 (http://tools.ietf.org/html/rfc6749) authentication. On the back end, AIP employs PostgreSQL (http://www.postgresql.org) databases and the iPlant Data Store (25) for data persistence. Modules of AIP communicate with databases via AIP's mediating layer that provides authentication and other services (logging, throttling, load balancing, data transformation, caching). Communications between front-to-middle layers and middle-to-back layers are implemented as web services. These exchange JSON-formatted data over the HTTPS protocol using RESTful URL standards being developed and documented at AIP.

FUTURE DIRECTIONS

As TAIR's NSF funding ended, responsibility for the A. thaliana Col-0 reference genome and annotation passed to AIP. We plan to update the standard ‘TAIR10′ sequence and annotation with releases labeled ‘AIP11’, ‘AIP12’, etc. We are implementing an automated re-annotation pipeline that combines publicly available RNA-seq reads, their Trinity de novo and reference-guided assemblies (26) and PASA annotation comparison (27). One goal is to delineate tissue-specific transcript isoforms. We will solicit expert community review to be implemented with Tripal and WebApollo (28) software.

Araport is designed to respond to a recent whitepaper (1) that called for a sustainable community portal whose growth would depend on modular contributions from the Arabidopsis research community. Now entering its second year of funding, Araport provides two fully functional web applications for browsing and data mining A. thaliana genomics data. More importantly for growth, the AIP staff is working with the community to extend its documentation, middleware and SDKs that enable third-party module development and dissemination. By early November 2014, AIP will have hosted its first workshop for community developers. The workshop is expected to generate about a dozen modules addressing research areas including expression, regulation and epigenetics. Each module will offer dynamic web pages providing visualization of remote data that is accessed on demand through RESTful web services. This and other workshops should enhance the repertoire of available modules, increase community involvement, lower the technical hurdles for contribution, and eventually enable increasing levels of integration and sophistication of module design. AIP will continue to enhance its presentation layer to aid users in the discovery, use and combination of modules. AIP will integrate third-party modules into its core applications and simultaneously promote the integration of core application web services and visualizations within third-party modules. This federated model of data sharing and software development should help AIP continuously adapt to the omics data explosion and the resulting needs of the broader Arabidopsis user community.

We would like to thank Eva Huala and Bob Muller (Phoenix Bioinformatics, formerly TAIR) for their assistance with the migration of Arabidopsis data to AIP; summer interns Eleanor Pence and Jane Smitham for their contributions; Nicholas Provart and the Bio-Analytic Resource group for providing eFP web services and databases of expression and protein interaction data; Eric Lyons and the CoGe team for their assistance with integrating EPIC-CoGe data; the iPlant Collaborative for assistance in developing BAM file streaming; Joe Carlson and David Goodstein for assistance with integrating Phytozome's PhytoMine.

FUNDING

National Science Foundation [DBI-1262414 to C.D.T., G.M., M.V., J.R.M., K.K.]; Biotechnology and Biological Sciences Research Council [BB/L027151/1 to G.M.]. Funding for open access charge: National Science Foundation [DBI-1262414].

Conflict of interest statement. None declared.

Present address: Konstantinos Krampis, Biological Sciences, Hunter College at City University of New York, New York, NY 10065, USA.

REFERENCES

1.
International Arabidopsis Informatics Consortium
Taking the next step: building an Arabidopsis information portal
Plant Cell
2012
, vol. 
24
 (pg. 
2248
-
2256
)
2.
Smith
R.N.
Aleksic
J.
Butano
D.
Carr
A.
Contrino
S.
Hu
F.
Lyne
M.
Lyne
R.
Kalderimis
A.
Rutherford
K.
InterMine: a flexible data warehouse system for the integration and analysis of heterogeneous biological data
Bioinformatics
2012
, vol. 
28
 (pg. 
3163
-
3165
)
3.
Kalderimis
A.
Lyne
R.
Butano
D.
Contrino
S.
Lyne
M.
Heimbach
J.
Hu
F.
Smith
R.
Stepan
R.
Sullivan
J.
, et al. 
InterMine: extensive web services for modern biology
Nucleic Acids Res.
2014
, vol. 
42
 (pg. 
W468
-
W472
)
4.
Lamesch
P.
Berardini
T.Z.
Li
D.
Swarbreck
D.
Wilks
C.
Sasidharan
R.
Muller
R.
Dreher
K.
Alexander
D.L.
Garcia-Hernandez
M.
, et al. 
The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools
Nucleic Acids Res.
2012
, vol. 
40
 (pg. 
D1202
-
D1210
)
5.
UniProt Consortium.
Activities at the Universal Protein Resource (UniProt)
Nucleic Acids Res.
2014
, vol. 
42
 (pg. 
D191
-
D198
)
6.
Maglott
D.
Ostell
J.
Pruitt
K.D.
Tatusova
T.
Entrez Gene: gene-centered information at NCBI
Nucleic Acids Res.
2011
, vol. 
39
 (pg. 
D52
-
D57
)
7.
Brady
S.M.
Provart
N.J.
Web-queryable large-scale data sets for hypothesis generation in plant biology
Plant Cell
2009
, vol. 
21
 (pg. 
1034
-
1051
)
8.
Mi
H.
Muruganujan
A.
Thomas
P.D.
PANTHER in 2013: modeling the evolution of gene function, and other gene attributes, in the context of phylogenetic trees
Nucleic Acids Res.
2013
, vol. 
41
 (pg. 
D377
-
D386
)
9.
Eilbeck
K.
Lewis
S.E.
Mungall
C.J.
Yandell
M.
Stein
L.
Durbin
R.
Ashburner
M.
The Sequence Ontology: a tool for the unification of genome annotations
Genome Biol.
2005
, vol. 
6
 pg. 
R44
 
10.
Gene Ontology Consortium.
Blake
J.A.
Dolan
M.
Drabkin
H.
Hill
D.P.
Li
N.
Sitnikov
D.
Bridges
S.
Burgess
S.
Buza
T.
Gene Ontology Annotations and Resources
Nucleic Acids Res.
2013
, vol. 
41
 (pg. 
D530
-
D535
)
11.
Goodstein
D.M.
Shu
S.
Howson
R.
Neupane
R.
Hayes
R.D.
Fazo
J.
Mitros
T.
Dirks
W.
Hellsten
U.
Putnam
N.
, et al. 
Phytozome: a comparative platform for green plant genomics
Nucleic Acids Res.
2012
, vol. 
40
 (pg. 
D1178
-
D1186
)
12.
Lyne
R.
Smith
R.
Rutherford
K.
Wakeling
M.
Varley
A.
Guillier
F.
Janssens
H.
Ji
W.
McLaren
P.
North
P.
, et al. 
FlyMine: an integrated database for Drosophila and Anopheles genomics
Genome Biol.
2007
, vol. 
8
 pg. 
R129
 
13.
Balakrishnan
R.
Park
J.
Karra
K.
Hitz
B.C.
Binkley
G.
Hong
E.L.
Sullivan
J.
Micklem
G.
Michael Cherry
J.
YeastMine—an integrated data warehouse for Saccharomyces cerevisiae data as a multipurpose tool-kit
Database
2012
, vol. 
2012
 pg. 
bar062
 
14.
Winter
D.
Vinegar
B.
Nahal
H.
Ammar
R.
Wilson
G.V.
Provart
N.J.
An ‘Electronic Fluorescent Pictograph’ browser for exploring and analyzing large-scale biological data sets
PloS One
2007
, vol. 
2
 pg. 
e718
 
15.
Skinner
M.E.
Uzilov
A.V.
Stein
L.D.
Mungall
C.J.
Holmes
I.H.
JBrowse: a next-generation genome browser
Genome Res.
2009
, vol. 
19
 (pg. 
1630
-
1638
)
16.
Lyons
E.
Pedersen
B.
Kane
J.
Alam
M.
Ming
R.
Tang
H.
Wang
X.
Bowers
J.
Paterson
A.
Lisch
D.
, et al. 
Finding and comparing syntenic regions among Arabidopsis and the outgroups papaya, poplar, and grape: CoGe with rosids
Plant Physiol.
2008
, vol. 
148
 (pg. 
1772
-
1781
)
17.
Altschul
S.F.
Gish
W.
Miller
W.
Myers
E.W.
Lipman
D.J.
Basic local alignment search tool
J. Mol. Biol.
1990
, vol. 
215
 (pg. 
403
-
410
)
18.
Orchard
S.
Ammari
M.
Aranda
B.
Breuza
L.
Briganti
L.
Broackes-Carter
F.
Campbell
N.H.
Chavali
G.
Chen
C.
del-Toro
N.
, et al. 
The MIntAct project—IntAct as a common curation platform for 11 molecular interaction databases
Nucleic Acids Res.
2014
, vol. 
42
 (pg. 
D358
-
D363
)
19.
Gomez
J.
Garcia
L.J.
Salazar
G.A.
Villaveces
J.
Gore
S.
Garcia
A.
Martin
M.J.
Launay
G.
Alcantara
R.
Del-Toro
N.
BioJS: an open source JavaScript framework for biological data visualization
Bioinformatics
2013
, vol. 
29
 (pg. 
1103
-
1104
)
20.
Shannon
P.
Markiel
A.
Ozier
O.
Baliga
N.S.
Wang
J.T.
Ramage
D.
Amin
N.
Schwikowski
B.
Ideker
T.
Cytoscape: a software environment for integrated models of biomolecular interaction networks
Genome Res.
2003
, vol. 
13
 (pg. 
2498
-
2504
)
21.
Fielding
R.T.
PhD Thesis
2000
, vol. 
AAI9980887
 
Irvine
University of California
(pg. 
1
-
162
)
22.
Zhou
P.
Emmert
D.
Zhang
P.
Using Chado to store genome annotation data
Curr. Protoc. Bioinformatics
2006
, vol. 
Chapter 9
 pg. 
Unit 9.6
 
23.
Sanderson
L.A.
Ficklin
S.P.
Cheng
C.H.
Jung
S.
Feltus
F.A.
Bett
K.E.
Main
D.
Tripal v1.1: a standards-based toolkit for construction of online genetic and genomic databases
Database
2013
, vol. 
2013
 pg. 
bat075
 
24.
Dooley
R.
Hanlon
M.R.
Recipes 2.0: building for today and tomorrow
Concurrency Computat.: Pract. Exper.
2014
 
doi:10.1002/cpe.3285
25.
Goff
S.A.
Vaughn
M.
McKay
S.
Lyons
E.
Stapleton
A.E.
Gessler
D.
Matasci
N.
Wang
L.
Hanlon
M.
Lenards
A.
, et al. 
The iPlant Collaborative: cyberinfrastructure for plant biology
Front. Plant Sci.
2011
, vol. 
2
 (pg. 
1
-
16
)
26.
Haas
B.J.
Papanicolaou
A.
Yassour
M.
Grabherr
M.
Blood
P.D.
Bowden
J.
Couger
M.B.
Eccles
D.
Li
B.
Lieber
M.
, et al. 
De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis
Nat. Protoc.
2013
, vol. 
8
 (pg. 
1494
-
1512
)
27.
Haas
B.J.
Delcher
A.L.
Mount
S.M.
Wortman
J.R.
Smith
R.K.
Jr
Hannick
L.I.
Maiti
R.
Ronning
C.M.
Rusch
D.B.
Town
C.D.
, et al. 
Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies
Nucleic Acids Res.
2003
, vol. 
31
 (pg. 
5654
-
5666
)
28.
Lee
E.
Helt
G.A.
Reese
J.T.
Munoz-Torres
M.C.
Childers
C.P.
Buels
R.M.
Stein
L.
Holmes
I.H.
Elsik
C.G.
Lewis
S.E.
Web Apollo: a web-based genomic annotation editing platform
Genome Biol.
2013
, vol. 
14
 pg. 
R93
 
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Comments

0 Comments
Submit a comment
You have entered an invalid code
Thank you for submitting a comment on this article. Your comment will be reviewed and published at the journal's discretion. Please check for further notifications by email.