Saccharomyces Genome Database (SGD) provides biochemical and structural information for budding yeast proteins

Weng, Shuai; Dong, Qing; Balakrishnan, Rama; Christie, Karen; Costanzo, Maria; Dolinski, Kara; Dwight, Selina S.; Engel, Stacia; Fisk, Dianna G.; Hong, Eurie; Issel-Tarver, Laurie; Sethuraman, Anand; Theesfeld, Chandra; Andrada, Rey; Binkley, Gail; Lane, Christopher; Schroeder, Mark; Botstein, David; Cherry, J. Michael

doi:10.1093/nar/gkg054

Abstract

The Saccharomyces Genome Database (SGD: http://genome-www.stanford.edu/Saccharomyces/) has recently developed new resources to provide more complete information about proteins from the budding yeast Saccharomyces cerevisiae. The PDB Homologs page provides structural information from the Protein Data Bank (PDB) about yeast proteins and/or their homologs. SGD has also created a resource that uses the eMOTIF database to provide motif information about a given protein. A third new resource is the Protein Information page, which contains protein physical and chemical properties, such as molecular weight and hydropathicity scores, predicted from the translated ORF sequence.

Received September 13, 2002; Accepted September 24, 2002

INTRODUCTION

The Saccharomyces Genome Database (SGD) collects, organizes, and presents information about the molecular biology and genetics of the budding yeast Saccharomyces cerevisiae. SGD contains diverse types of biological data and provides tools for their search and analysis. Information in SGD is generally organized around a gene; each gene in the genome has a Locus page (2), which contains basic information about that gene, such as Gene Ontology (GO) annotations (1, 3) and phenotype data, as well as links to additional tools and resources. The protein resources described below are newly provided links located in the Protein Info and Structure Resources section of the Locus page.

The Protein Data Bank (PDB) Homologs page provides a list of structures available in PDB (7) relevant to a given S. cerevisiae protein (Fig. 1A). The list is generated by comparing the sequences of the systematically defined S. cerevisiae proteins against the protein sequences within PDB using the Smith–Waterman (6) sequence comparison algorithm. All PDB sequences with a p-value of 0.01 or less are presented, regardless of species. Thus, if the structure of an S. cerevisiae protein is unknown, the structure of the homologous protein in another species may be available. For each PDB homolog, Smith–Waterman scores are provided, with links to PDB, other protein databases, and an alignment page. The alignment page (Fig. 1B) displays the Smith–Waterman alignment of the S. cerevisiae protein against the PDB protein, a color ribbon image of the structure, and a link to the interactive QuickPDB structure viewer provided by PDB.
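To illustrate the kind of comparison described above, the following is a minimal sketch of Smith–Waterman local alignment scoring. The scoring scheme (a fixed match reward, mismatch penalty, and linear gap penalty) and the function name are assumptions chosen for brevity; they are not the parameters or substitution matrix used by SGD, and the sketch does not compute the p-values used for the 0.01 cutoff.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Return the best local alignment score between sequences a and b.

    Assumed scoring: +match for identical residues, mismatch otherwise,
    and a linear gap penalty. Real protein searches normally use a
    substitution matrix and affine gap penalties instead.
    """
    prev = [0] * (len(b) + 1)  # previous row of the dynamic programming table
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            curr[j] = max(
                0,                   # start a new local alignment
                prev[j - 1] + pair,  # align a[i-1] with b[j-1]
                prev[j] + gap,       # gap in b
                curr[j - 1] + gap,   # gap in a
            )
            best = max(best, curr[j])
        prev = curr
    return best
```

Because every cell is floored at zero, the score reflects only the best-matching local region, which is why a yeast protein can be matched to a PDB entry that covers just one of its domains.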

SGD has recently released another protein information tool. The new Motif tool displays the motifs in a particular protein using results from the eMOTIF resource (4). The Motif results page includes an image illustrating all the motifs in the protein of interest and also lists all the other S. cerevisiae proteins that share these motifs. Links to other motif databases are also provided.
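The idea of listing proteins that share a motif can be sketched as follows, assuming that each motif can be written as a regular expression over amino acid letters. The function name, the motif identifier, the pattern, and the sequences are all hypothetical toy values; they are not taken from eMOTIF or from SGD.

```python
import re
from collections import defaultdict


def proteins_by_motif(proteins, motifs):
    """Map each motif ID to the proteins containing it and the match positions.

    proteins: dict of protein name -> amino acid sequence
    motifs:   dict of motif ID -> regular expression (assumed representation)
    Positions are 0-based starts of non-overlapping matches.
    """
    shared = defaultdict(list)
    for motif_id, pattern in motifs.items():
        regex = re.compile(pattern)
        for name, seq in proteins.items():
            starts = [m.start() for m in regex.finditer(seq)]
            if starts:
                shared[motif_id].append((name, starts))
    return dict(shared)


# Toy input, invented for illustration only.
toy_motifs = {"toy_motif_1": r"G[ST].[ILV]K"}
toy_proteins = {
    "ProtA": "MAGSAIKQ",
    "ProtB": "MKGTLVKGSPLK",
    "ProtC": "MAAAA",
}
toy_result = proteins_by_motif(toy_proteins, toy_motifs)
```

Grouping the results by motif rather than by protein mirrors the results page described above, where the proteins sharing a motif are listed together.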

The third new protein resource in SGD is the Protein Information page. This page contains calculated data based on protein sequence, such as molecular weight, amino acid content, protein length, pI, and codon adaptation index (5). Links are provided to graphical displays of hydropathy plots, helical wheels and other tools to predict secondary structure. SGD is developing enhancements to this page and will soon be incorporating predicted transmembrane domains, signal sequences and other types of data that can be generated based on protein sequence. All of these data for the entire predicted yeast proteome are available for download on the SGD anonymous FTP site at: ftp://genome-ftp.stanford.edu/yeast/data_download/protein_info/.
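Several of the sequence-derived values mentioned above can be computed directly from a translated ORF. The sketch below calculates length, amino acid composition, an average molecular weight, and a mean hydropathicity (GRAVY) score. It assumes the Kyte-Doolittle hydropathy scale and commonly tabulated average residue masses; the paper does not state which scales or mass tables SGD uses, and the function name is hypothetical.

```python
from collections import Counter

# Kyte-Doolittle hydropathy values (assumed scale).
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Average residue masses in daltons (amino acid minus one water; assumed table).
RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886, "C": 103.1388,
    "Q": 128.1307, "E": 129.1155, "G": 57.0519, "H": 137.1411, "I": 113.1594,
    "L": 113.1594, "K": 128.1741, "M": 131.1926, "F": 147.1766, "P": 97.1167,
    "S": 87.0782, "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER_MASS = 18.01528  # added once for the free N- and C-termini


def protein_properties(seq):
    """Return length, composition, molecular weight and GRAVY for a protein."""
    seq = seq.upper()
    unknown = set(seq) - set(HYDROPATHY)
    if not seq or unknown:
        raise ValueError(f"empty sequence or unsupported residues: {sorted(unknown)}")
    length = len(seq)
    counts = Counter(seq)
    return {
        "length": length,
        # Fraction of the sequence made up by each residue type.
        "composition": {aa: counts[aa] / length for aa in sorted(counts)},
        # Sum of residue masses plus one water molecule.
        "molecular_weight": sum(RESIDUE_MASS[aa] for aa in seq) + WATER_MASS,
        # GRAVY: mean hydropathy over all residues.
        "gravy": sum(HYDROPATHY[aa] for aa in seq) / length,
    }
```

The function rejects stop symbols and ambiguity codes rather than guessing their contribution, so a sequence should be trimmed of any trailing `*` before it is passed in.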

From its beginning in 1994, SGD has provided biological information about yeast genes as well as search and analysis tools. We are now expanding our scope to provide more protein information by developing tools like the PDB Homolog, Motif, and Protein Information pages. SGD is also creating resources for global analyses, in addition to the gene-by-gene tools and resources currently provided. As more eukaryotic genomes are completely sequenced, comparative genomics will become an increasingly powerful method for solving biological problems; thus, SGD is exploring new ways of presenting the results of these and other types of genome-wide studies. Check the SGD home page at http://genome-www.stanford.edu/Saccharomyces/ for announcements as these new tools become available. Supplemental material for this paper can be found online at: http://genome-www.stanford.edu/Saccharomyces/help/NAR2003Supplement.html.

Figure 1. Protein structure information at SGD: the new PDB Homologs tool. Due to space considerations, both (A) and (B) show only a portion of the web display. (A) The PDB Homologs results page lists the results of the Smith–Waterman sequence comparison of an S. cerevisiae protein against the proteins in PDB. Links are provided to the PDB and other external protein structure databases, as well as the alignment page shown in B. In this example, Fpr1p results are shown. A structure of the S. cerevisiae protein was identified (first row in the table), as well as additional structures of the Bos taurus homolog. Additional homologs were found but are not shown. This page can be reached from the ‘Protein Info and Structure’ pull-down menu on Locus and Protein Information pages. (B) The PDB Alignment page displays the alignment of the S. cerevisiae protein and the PDB protein, a color ribbon image of the structure (provided by PDB), and links to other databases and tools such as the interactive structural viewer QuickPDB provided by the PDB.

References

1. Ashburner,M., Ball,C.A., Blake,J.A., Botstein,D., Butler,H., Cherry,J.M., Davis,A.P., Dolinski,K., Dwight,S.S., Eppig,J.T., Harris,M.A., Hill,D.P., Issel-Tarver,L., Kasarskis,A., Lewis,S., Matese,J.C., Richardson,J.E., Ringwald,M., Rubin,G.M. and Sherlock,G. (2000) Gene ontology: tool for the unification of biology. Nature Genet., 25, 25–29.

2. Ball,C.A., Dolinski,K., Dwight,S.S., Harris,M.A., Issel-Tarver,L., Kasarskis,A., Scafe,C.R., Sherlock,G., Binkley,G., Jin,H., Kaloper,M., Orr,S.D., Schroeder,M., Weng,S., Zhu,Y., Botstein,D. and Cherry,J.M. (2000) Integrating functional genomic information into the Saccharomyces genome database. Nucleic Acids Res., 28, 77–80.

3. Dwight,S.S., Harris,M.A., Dolinski,K., Ball,C.A., Binkley,G., Christie,K.R., Fisk,D.G., Issel-Tarver,L., Schroeder,M., Sherlock,G., Sethuraman,A., Weng,S., Botstein,D. and Cherry,J.M. (2002) Saccharomyces Genome Database (SGD) provides secondary gene annotation using the Gene Ontology (GO). Nucleic Acids Res., 30, 69–72.

4. Huang,J.Y. and Brutlag,D.L. (2001) The EMOTIF database. Nucleic Acids Res., 29, 202–204.

5. Sharp,P.M. and Li,W.H. (1987) The codon Adaptation Index—a measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res., 15, 1281–1295.

6. Smith,T.F. and Waterman,M.S. (1981) Identification of common molecular subsequences. J. Mol. Biol., 147, 195–197.

7. Westbrook,J., Feng,Z., Jain,S., Bhat,T.N., Thanki,N., Ravichandran,V., Gilliland,G.L., Bluhm,W., Weissig,H., Greer,D.S., Bourne,P.E. and Berman,H.M. (2002) The Protein Data Bank: unifying the archive. Nucleic Acids Res., 30, 245–248.
