GeneBase 1.1: a tool to summarize data from NCBI Gene datasets and its application to an update of human gene statistics

3

Speir

M.L.

,

Zweig

A.S.

,

Rosenbloom

K.R.

et al. (

2016

)

The UCSC Genome Browser database: 2016 update

.

Nucleic Acids Res

.,

44

,

D717

–

D725

.

4

Piovesan

A.

,

Vitale

L.

,

Pelleri

M.C.

et al. (

2013

)

Universal tight correlation of codon bias and pool of RNA codons (codonome): the genome is optimized to allow any distribution of gene expression values in the transcriptome from bacteria to humans

.

Genomics

,

101

,

282

–

289

.

5

Vitale

L.

,

Lenzi

L.

,

Huntsman

S.A.

et al. (

2006

)

Differential expression of alternatively spliced mRNA forms of the insulin-like growth factor 1 receptor in human neuroendocrine tumors

.

Oncol. Rep

.,

15

,

1249

–

1256

.

PubMed

6

Piovesan

A.

,

Caracausi

M.

,

Ricci

M.

et al. (

2015

)

Identification of minimal eukaryotic introns through GeneBase, a user-friendly tool for parsing the NCBI Gene databank

.

DNA Res

.,

22

,

495

–

503

.

7

de Koning

A.P.

,

Gu

W.

,

Castoe

T.A.

et al. (

2011

)

Repetitive elements may comprise over two-thirds of the human genome

.

PLoS Genet

.,

7

,

e1002384.

8

O'Leary

N.A.

,

Wright

M.W.

,

Brister

J.R.

et al. (

2016

)

Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation

.

Nucleic Acids Res

.,

44

,

D733

–

D745

.

9

Harrow

J.

,

Frankish

A.

,

Gonzalez

J.M.

et al. (

2012

)

GENCODE: the reference human genome annotation for The ENCODE Project

.

Genome Res

.,

22

,

1760

–

1774

.

10

Frankish

A.

,

Uszczynska

B.

,

Ritchie

G.R.

et al. (

2015

)

Comparison of GENCODE and RefSeq gene annotation and the impact of reference geneset on variant effect prediction

.

BMC Genomics

,

16

,

S2

.

11

Schwerk

J.

,

Savan

R.

(

2015

)

Translating the untranslated region

.

J. Immunol

.,

195

,

2963

–

2971

.

12

Uddin

B.

,

Chen

N.P.

,

Panic

M.

et al. (

2015

)

Genome editing through large insertion leads to the skipping of targeted exon

.

BMC Genomics

,

16

,

1082.

13

Speicher

M.

,

Antonarakis

S.E.

,

Motulsky

A.G.

(

2010

)

Vogel and Motulsky's Human Genetics: Problems and Approaches

.

Springer-Verlag

,

Berlin Heidelberg

.

14

Kim

D.

,

Pertea

G.

,

Trapnell

C.

et al. (

2013

)

TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions

.

Genome Biol

.,

14

,

R36.

15

Lander

E.S.

,

Linton

L.M.

,

Birren

B.

et al. (

2001

)

Initial sequencing and analysis of the human genome

.

Nature

,

409

,

860

–

921

.

16

Venter

J.C.

,

Adams

M.D.

,

Myers

E.W.

et al. (

2001

)

The sequence of the human genome

.

Science

,

291

,

1304

–

1351

.

17

Makalowski

W.

(

2001

)

The human genome structure and organization

.

Acta Biochim. Pol

.,

48

,

587

–

598

.

PubMed

18

Doglio

L.

,

Goode

D.K.

,

Pelleri

M.C.

et al. (

2013

)

Parallel evolution of chordate cis-regulatory code for development

.

PLoS Genet

.,

9

,

e1003904.

19

Ezkurdia

I.

,

Juan

D.

,

Rodriguez

J.M.

et al. (

2014

)

Multiple evidence strands suggest that there may be as few as 19,000 human protein-coding genes

.

Hum. Mol. Genet

.,

23

,

5866

–

5878

.

20

Zhang

D.L.

,

Ji

L.

,

Li

Y.D.

(

2004

)

Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes

.

Yi Chuan Xue Bao

,

31

,

431

–

443

.

PubMed

21

Zhang

D.L.

,

Li

Y.D.

,

Ji

L.

(

2004

)

Correction of five different types of errors of model REFSEQs appeared in NCBI human gene database only by using two novel human genes C17orf32 and ZNF362

.

Yi Chuan Xue Bao

,

31

,

325

–

334

.

22

Strippoli

P.

,

Canaider

S.

,

Noferini

F.

et al. (

2005

)

Uncertainty principle of genetic information in a living cell

.

Theor. Biol. Med. Model

.,

2

,

40.

23

Boguski

M.S.

,

Lowe

T.M.

,

Tolstoshev

C.M.

(

1993

)

dbEST—database for "expressed sequence tags"

.

Nat. Genet

.,

4

,

332

–

333

.

24

Caracausi

M.

,

Vitale

L.

,

Pelleri

M.C.

et al. (

2014

)

A quantitative transcriptome reference map of the normal human brain

.

Neurogenetics

,

15

,

267

–

287

.

25

Lenzi

L.

,

Frabetti

F.

,

Facchin

F.

et al. (

2006

)

UniGene Tabulator: a full parser for the UniGene format

.

Bioinformatics

,

22

,

2570

–

2571

.

26

Caracausi

M.

,

Piovesan

A.

,

Vitale

L.

et al. (

2016

)

Integrated transcriptome map highlights structural and functional aspects of the normal human heart

.

J. Cell. Physiol

., doi: 10.1002/jcp.25471. [Epub ahead of print].

27

Bill

B.R.

,

Lowe

J.K.

,

Dybuncio

C.T.

et al. (

2013

)

Orchestration of neurodevelopmental programs by RBFOX1: implications for autism spectrum disorder

.

Int. Rev. Neurobiol

.,

113

,

251

–

267

.

28

Verkerk

A.J.

,

Mathews

C.A.

,

Joosse

M.

et al. (

2003

)

CNTNAP2 is disrupted in a family with Gilles de la Tourette syndrome and obsessive compulsive disorder

.

Genomics

,

82

,

1

–

9

.

29

Tuffery-Giraud

S.

,

Beroud

C.

,

Leturcq

F.

et al. (

2009

)

Genotype-phenotype analysis in 2,405 patients with a dystrophinopathy using the UMD-DMD database: a model of nationwide knowledgebase

.

Hum. Mutat

.,

30

,

934

–

945

.

30

Vendola

C.

,

Canfield

M.

,

Daiger

S.P.

et al. (

2010

)

Survival of Texas infants born with trisomies 21, 18, and 13

.

Am. J. Med. Genet. A

,

152a

,

360

–

366

.

31

Facchin

F.

,

Vitale

L.

,

Bianconi

E.

et al. (

2011

)

Complexity of bidirectional transcription and alternative splicing at human RCAN3 locus

.

PLoS One

,

6

,

e24508.

32

Casadei

R.

,

Pelleri

M.C.

,

Vitale

L.

et al. (

2014

)

Characterization of human gene locus CYYR1: a complex multi-transcript system

.

Mol. Biol. Rep

.,

41

,

6025

–

6038

.

33

de Klerk

E.

,

T Hoen

P.A.

(

2015

)

Alternative mRNA transcription, processing, and translation: insights from RNA sequencing

.

Trends Genet

.,

31

,

128

–

139

.

34

Morino

H.

,

Matsuda

Y.

,

Muguruma

K.

et al. (

2015

)

A mutation in the low voltage-gated calcium channel CACNA1G alters the physiological properties of the channel, causing spinocerebellar ataxia

.

Mol. Brain

,

8

,

89.

35

Laaser

I.

,

Theis

F.J.

,

de Angelis

M.H.

et al. (

2011

)

Huge splicing frequency in human Y chromosomal UTY gene

.

Omics

,

15

,

141

–

154

.

36

Walport

L.J.

,

Hopkinson

R.J.

,

Vollmar

M.

et al. (

2014

)

Human UTY(KDM6C) is a male-specific N-methyl lysyl demethylase

.

J. Biol. Chem

.,

289

,

18302

–

18313

.

37

Frabetti

F.

,

Casadei

R.

,

Lenzi

L.

et al. (

2007

)

Systematic analysis of mRNA 5' coding sequence incompleteness in Danio rerio: an automated EST-based approach

.

Biol. Direct

.,

2

,

34.

38

Casadei

R.

,

Piovesan

A.

,

Vitale

L.

et al. (

2012

)

Genome-scale analysis of human mRNA 5' coding sequences based on expressed sequence tag (EST) database

.

Genomics

,

100

,

125

–

130

.

39

Hangauer

M.J.

,

Vaughn

I.W.

,

McManus

M.T.

(

2013

)

Pervasive transcription of the human genome produces thousands of previously unidentified long intergenic noncoding RNAs

.

PLoS Genet

.,

9

,

e1003569

.

40

Pelleri

M.C.

,

Cicchini

E.

,

Locatelli

C.

et al. (

2016

)

Systematic reanalysis of partial trisomy 21 cases with or without Down syndrome suggests a small region on 21q22.13 as critical to the phenotype

.

Hum. Mol. Genet

., pii: ddw116. [Epub ahead of print].

41

Strippoli

P.

,

Pelleri

M.C.

,

Caracausi

M.

et al. (

2013

)

An integrated route to identifying new pathogenesis-based therapeutic approaches for trisomy 21 (Down Syndrome) following the thought of Jérôme Lejeune

.

Sci. Postprint

,

1

,

e00010

.

Crossref