Exploiting and assessing multi-source data for supervised biomedical named entity recognition

Cejuela

J.M.

et al. (

2017

)

nala: text mining natural language mutation mentions

.

Bioinformatics

,

33

,

1852

–

1858

.

Comeau

D.C.

et al. (

2013

)

Bioc: a minimalist approach to interoperability for biomedical text processing

.

Database

,

2013

,

bat064.

Crichton

G.

et al. (

2017

)

A neural network multi-task learning approach to biomedical named entity recognition

.

BMC Bioinformatics

,

18

,

368

.

Ding

J.

et al. (

2001

). Mining MEDLINE: abstracts, sentences, or phrases? In:

Biocomputing 2002

.

World Scientific

.

Google Preview

Figueroa

R.L.

et al. (

2012

)

Predicting sample size required for classification performance

.

BMC Med. Inf. Dec. Mak

.,

12

.

Finkel

J.R.

et al. (

2005

). Incorporating non-local information into information extraction systems by Gibbs sampling. In:

Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, ACL ’05

.

Association for Computational Linguistics

,

Stroudsburg, PA, USA

, pp.

363

–

370

.

Fundel

K.

et al. (

2007

)

RelEx–relation extraction using dependency parse trees

.

Bioinformatics

,

23

,

365

–

371

.

Furlong

L.I.

et al. (

2008

)

OSIRISv1.2: a named entity recognition system for sequence variants of genes in biomedical literature

.

BMC Bioinformatics

,

9

,

84

.

Gerner

M.

et al. (

2010

). An exploration of mining gene expression mentions and their anatomical locations from biomedical text. In: Proceedings of the 2010 Workshop on Biomedical Natural Language Processing, BioNLP ’10. Association for Computational Linguistics, Stroudsburg, PA, USA, pp.

72

–

80

.

Gridach

M.

(

2017

)

Character-level neural network for biomedical named entity recognition

.

J. Biomed. Inf

.,

70

,

85

–

91

.

GuoDong

Z.

,

Jian

S.

(

2004a

) Exploring deep knowledge resources in biomedical name recognition. In: Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and Its Applications, JNLPBA ’04. Association for Computational Linguistics, Stroudsburg, PA, USA, pp.

96

–

99

.

GuoDong

Z.

,

Jian

S.

(

2004b

) Exploring deep knowledge resources in biomedical name recognition. In: Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and Its Applications, JNLPBA ’04. Association for Computational Linguistics, Stroudsburg, PA, USA, pp.

96

–

99

.

Herrero-Zazo

M.

et al. (

2013

)

The DDI corpus: an annotated corpus with pharmacological substances and drug–drug interactions

.

J. Biomed. Inf

.,

46

,

914

–

920

.

Hsu

C.

et al. (

2008

)

Integrating high dimensional bi-directional parsing models for gene mention tagging

.

Bioinformatics

,

24

,

i286

–

i294

.

Jimeno Yepes

A.

,

Verspoor

K.

(

2014

)

Mutation extraction tools can be combined for robust recognition of genetic variants in the literature

.

F1000Res

,

3

,

18

. [pmid].

Kim

J.-D.

et al. (

2003

)

Genia corpus-a semantically annotated corpus for bio-textmining

.

Bioinformatics

,

19

,

i180

–

i182

.

Krallinger

M.

et al. (

2015

)

The CHEMDNER corpus of chemicals and drugs and its annotation principles

.

J. Cheminf

.,

7

,

S2.

Liu

Y.

et al. (

2015

)

PolySearch2: a significantly improved text-mining system for discovering associations between human diseases, genes, drugs, metabolites, toxins and more

.

Nucleic Acids Res

.,

43

,

W535

–

W542

.

McCallum

A.K.

(

2002

) Mallet: a machine learning for language toolkit.

Neves

M.

et al. (

2012

) Annotating and evaluating text for stem cell research.

Ohta

T.

et al. (

2012

) Open-domain anatomical entity mention detection. In: Proceedings of the Workshop on Detecting Structure in Scholarly Discourse, ACL ’12. Association for Computational Linguistics, Stroudsburg, PA, USA, pp.

27

–

36

.

Pyysalo

S.

,

Ananiadou

S.

(

2014

)

Anatomical entity mention recognition at literature scale

.

Bioinformatics

,

30

,

868

–

875

.

Pyysalo

S.

et al. (

2007

)

Bioinfer: a corpus for information extraction in the biomedical domain

.

BMC Bioinformatics

,

8

,

50.

Pyysalo

S.

et al. (

2012a

)

Event extraction across multiple levels of biological organization

.

Bioinformatics

,

28

,

i575

–

i581

.

Pyysalo

S.

et al. (

2012b

)

Overview of the ID, EPI and REL tasks of BioNLP shared task 2011

.

BMC Bioinformatics

,

13

,

S2

.

Rei

M.

et al. (

2016

) Attending to characters in neural sequence labeling models. Proceedings of COLING, 309–318.

Settles

B.

(

2005

)

Abner: an open source tool for automatically tagging genes, proteins and other entity names in text

.

Bioinformatics

,

21

,

3191

–

3192

.

Thomas

P.E.

et al. (

2011

)

Challenges in the association of human single nucleotide polymorphism mentions with unique database identifiers

.

BMC Bioinformatics

,

12

,

S4

.

Thompson

P.

et al. (

2009

)

Construction of an annotated corpus to support biomedical information extraction

.

BMC Bioinformatics

,

10

,

349.

Tsai

R.

et al. (

2006

)

NERBio: using selected word conjunctions, term normalization, and global patterns to improve biomedical named entity recognition

.

BMC Bioinformatics

,

7

,

S11.

Xu

D.

et al. (

2016

)

Dtminer: identification of potential disease targets through biomedical literature mining

.

Bioinformatics

,

32

,

3619

–

3626

.

PubMed

Yeh

A.

et al. (

2005

)

Biocreative task 1a: gene mention finding evaluation

.

BMC Bioinformatics

,

6

,

S2.

Zeng

D.

et al. (

2017

)

LSTM-CRF for drug-named entity recognition

.

Entropy

,

19

,

283.