Abstract

Motivation

Bacterial resistance to antibiotics is a growing concern. Antimicrobial peptides (AMPs), natural components of innate immunity, are popular candidates for new drug development. Machine learning methods are now commonly adopted by wet-laboratory researchers to screen for promising candidates.

Results

In this work, we utilize deep learning to recognize antimicrobial activity. We propose a neural network model with convolutional and recurrent layers that leverage primary sequence composition. Results show that the proposed model outperforms state-of-the-art classification models on a comprehensive dataset. By utilizing the embedding weights, we also present a reduced-alphabet representation and show that reasonable AMP recognition can be maintained using nine amino acid types.

Availability and implementation

Models and datasets are made freely available through the Antimicrobial Peptide Scanner vr.2 web server at www.ampscanner.com.

Supplementary information

Supplementary data are available at Bioinformatics online.

1 Introduction

Antimicrobial resistance remains a serious problem for humans and livestock around the world, as more pathogens lose sensitivity to the drugs designed to eliminate them (Price et al., 2012; U.S. Department of Health and Human Services, 2013; World Health Organization, 2014). Over the past few decades, natural antimicrobial peptides (AMPs) have been an active area of research and have shown a lower likelihood of bacteria developing resistance to them than to many conventional drugs (Boman, 2003; Zelezetsky et al., 2006). AMPs are short innate immunity peptides that fall into a number of diverse sequence families (e.g. cathelicidins, defensins, cecropins, etc.) and kill their targets through various mechanisms, such as cell membrane damage, DNA interference or signaling for adaptive immune responses, as reviewed by Wimley and Hristova (2011). While we focus in this work exclusively on peptides that kill Gram-positive and/or Gram-negative bacteria, we note that some AMPs have also been shown to be effective against a variety of fungal and viral pathogens (Wang, 2010).

To aid wet-laboratory researchers in novel AMP discovery, a variety of computational approaches have been proposed for AMP recognition. Many incorporate machine learning algorithms or statistical analysis techniques, such as artificial neural networks (ANN) (Lata et al., 2010; Thomas et al., 2009; Torrent et al., 2011), discriminant analysis (DA) (Thomas et al., 2009), fuzzy k-nearest neighbor (Xiao et al., 2013), hidden Markov models (Fjell et al., 2009), logistic regression (Veltri et al., 2017; Randou et al., 2013), random forests (RF) (Thomas et al., 2009; Veltri, 2015) and support vector machines (SVM) (Lata et al., 2010; Lee et al., 2016; Meher et al., 2017; Thomas et al., 2009; Torrent et al., 2011).

Popular features/predictors for peptide sequences are based on sequence composition. Basic amino acid counts over the N- and C-termini or the full peptide are used by the AntiBP (Antibacterial Peptide) methods (Lata et al., 2007, 2010). The pseudo-amino acid composition (PseAAC) method (Chou, 2001) also incorporates information about sequence order (Meher et al., 2017; Xiao et al., 2013). In the evolutionary feature construction (EFC) method (Kamath et al., 2014), features encode nonlocal correlations between position-specific sequence motifs (Veltri et al., 2017, 2014).

Physicochemical properties, such as charge, hydrophobicity, isoelectric point, aggregation propensity and more, are also used to encode sequences as numerical vectors; typically, features are average values of considered physicochemical properties calculated over the full-length or terminal ends of a peptide sequence (Fernandes et al., 2012; Randou et al., 2013; Thomas et al., 2009; Torrent et al., 2011). While a few methods attempt to numerically predict the ‘strength’ of antimicrobial activity (i.e. inhibition concentration) (Cherkasov and Jankovic, 2004), most methods use a binary prediction/recognition setting to assign an ‘AMP’ or ‘non-AMP’ label to a given query peptide sequence.

Many AMP recognition methods are provided to the research community via web servers. Recent ones include iAMPpred (Meher et al., 2017), iAMP-2L (Xiao et al., 2013), AntiBP2 (Lata et al., 2010), CAMP (Thomas et al., 2009) and AMPer (Fjell et al., 2007). Reported recognition accuracy (ACC) has steadily improved over the past decade, but there is room for improvement. Comparative surveys report that state-of-the-art servers, including CAMP and AntiBP2, miss many true positives (Bishop et al., 2015; Veltri, 2015). Others, such as the AMP predictor of the Antimicrobial Peptide Database (APD) (Wang et al., 2016), only accept individual query sequences, which limits their use in high-throughput recognition experiments by wet-laboratory researchers.

In this paper, we improve upon the state of the art in AMP recognition and make several contributions. First, we offer a new training and testing dataset that reflects the latest available antibacterial peptide data from the recently updated APD vr.3 release. Second, we introduce a new deep neural network (DNN) classifier that achieves better AMP recognition compared to the existing methods. Third, by using a DNN with convolutional (Conv) and recurrent layers, we remove the burden of a priori feature construction and consequently reduce our reliance on domain experts. Fourth, we make the proposed DNN model and all datasets freely available to the research community via the AMP Scanner server at: http://www.ampscanner.com. The server is specifically designed to support high-throughput screening experiments, where wet-laboratory researchers want to conduct systematic virtual screenings of peptide libraries to identify promising peptides for further characterization and modification. Finally, to directly support the design of such libraries, we ‘open’ the box of the proposed DNN model and learn from it a smaller alphabet via which to represent peptide sequences. We demonstrate that the smaller alphabet retains good AMP recognition ACC. Moreover, by reducing the number of amino acids from 20 to 8 pseudo-amino acids (plus a padding character), we shrink the size of the sequence space wet-laboratory researchers have to consider when designing peptide libraries.

The superiority of DNN models has been demonstrated on many problems in bioinformatics. While, to the best of our knowledge, this paper is the first to propose DNN models for AMP classification, other recent DNN models improve protein secondary structure prediction (Spencer et al., 2015), protein fold recognition (Jo et al., 2015), drug discovery and more (LeCun et al., 2015). The interested reader is directed to the survey by LeCun et al. (2015) for a review of deep learning and its performance on diverse problems spanning from genomics to drug discovery.

The DNN model proposed in this paper captures position-invariant patterns along an amino acid sequence through the use of Conv (LeCun et al., 2015) and ‘long short-term memory’ (LSTM) layers (Hochreiter and Schmidhuber, 1997) based on a popular architecture in speech recognition tasks (Bahdanau et al., 2014; Vinyals et al., 2015). The proposed DNN uses separate Conv and LSTM layers and is different from the ‘Convolutional LSTM’ architecture outlined recently by Xingjian et al. (2015) for precipitation nowcasting. Our choice of LSTM layers is due to the fact that, as a type of recurrent neural network, LSTMs have the ability to ‘recognize’ and ‘forget’ gap-separated patterns (Schmidhuber, 2015), and they have recently been shown successful in bioinformatics contexts like identifying protein subcellular localization (Nielsen et al., 2016). We note our use of the term ‘deep’ in this work is intended to reflect the structure of our model rather than the number of layers employed.

The rest of this paper is organized as follows. The DNN model and the construction of the reduced alphabet are detailed in Section 2. The model is evaluated in a comparative setting in Section 3, which also evaluates the impact of the reduced alphabet gleaned from the model on recognition ACC. The paper concludes in Section 4.

2 Materials and methods

2.1 Datasets

We build our dataset over experimentally validated AMPs available in the APD vr.3 database (at http://aps.unmc.edu/AP). These AMPs are active against Gram-positive and/or Gram-negative bacteria. We curate this dataset as follows. After filtering out sequences shorter than 10 amino acids in length and removing redundant sequences at the 90% sequence-identity level with the CD-HIT program (Huang et al., 2010), we are left with 1778 AMPs. We assign 712 AMPs to training, 354 to tuning/evaluation and 712 to testing.

As there is currently no large public repository of peptides experimentally shown to lack antimicrobial activity, we build a negative dataset of non-AMP sequences using an approach similar to previous work (Torrent et al., 2011; Xiao et al., 2013). Specifically, we download peptide sequences from UniProt (Magrane and the UniProt consortium, 2011) (http://www.uniprot.org) by setting the ‘subcellular location’ filter to cytoplasm and remove any entry that matches the following keywords: antimicrobial, antibiotic, antiviral, antifungal, effector or excreted. We also remove sequences <10 amino acids in length, redundant sequences at the 40% sequence-identity level (again, using CD-HIT) and any sequence found to match known AMPs by running a BLAT (Kent, 2002) vr.35 protein-versus-protein search using default settings. We then randomly select 1778 peptide fragments from the remaining sequences, ensuring that the length distribution of the selected non-AMPs approximates that of the AMP dataset constructed as described earlier. The 1778 non-AMPs selected in this manner are assigned to training (712 of them), tuning (354 of them) and testing (712 of them) partitions as done with the AMP dataset.

The Supplementary Information includes the AMP and non-AMP sequence length distributions as well as additional analyses to evaluate the impact of the size and composition of the dataset on model performance. Sequences are available for download in multi-FASTA format from the AMP Scanner web server that accompanies this paper.

2.2 Architecture of proposed DNN

To identify position-invariant AMP sequence patterns, we build a DNN with the Keras framework (http://www.keras.io) using a sequential model and a TensorFlow (Abadi et al., 2016) deep learning library back-end. We consider deep learning due to its ability to identify multiple, ambiguous patterns possibly hiding within the diverse families of AMP peptides represented in our comprehensive dataset. Specifically, the DNN employs Conv and maximal (max) pooling layers to generate filters that generalize sequence patterns and uses an LSTM layer to characterize a possibly highly complex order in which these patterns may occur across various AMPs. An overview of the approach for sequence embedding and DNN architecture is shown in Figure 1.

Fig. 1.

The proposed DNN uses Conv and LSTM layers. Peptide sequences are encoded into uniform numerical vectors of length 200. These vectors (X) are fed to an embedding layer of length 128, followed by a convolutional layer comprised of 64 filters. Each of these filters undergoes a 1D convolution and is downsampled via a maximal pooling layer of size 5. Next, an LSTM layer with 100 units allows the DNN to remember or ignore old information passed along the horizontal dotted arrows extending from each Xi input. The final output from the DNN is passed through a sigmoid function so that predictions (Y) are scaled between 0 and 1

Figure 1 shows that peptide sequences are first converted into zero-padded numerical vectors of length 200, which fits our dataset’s longest AMP (183 amino acids) and non-AMP (175 amino acids). Specifically, each of the 20 basic amino acids is assigned a number 1–20, and unknown ‘X’ characters (not present in any sequence in our dataset) are assigned 0. Since the DNN quickly learns to ignore the leading padding characters, it can handle variable-length sequences. As detailed in Figure 1, such encoded sequences are then fed into an embedding layer (embedding_vector_length: 128 in the Keras framework syntax) and then to a 1D Conv layer (nb_filter: 64, filter_length: 16, init: normal, strides: 1, border_mode: same, activation: relu). The purpose of the embedding layer is to convert the indices of discrete symbols (e.g. amino acids) into fixed-size vector representations. For instance, the layer can convert the indices (1, 2, …, 20) corresponding to the 20 naturally occurring amino acids into three-number vector representations. The benefit of the embedding is that it can create a more compact representation of input symbols and can place semantically similar symbols close to each other in the vector space. The embedding layer can be trained with the other layers of a DNN, and its weights can be updated during training.
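As a concrete illustration, the following is a minimal sketch of this integer encoding step. The text only specifies that the 20 amino acids map to 1–20, the padding character to 0 and that padding is leading; the particular alphabetical ordering used below is an assumption for illustration.

```python
import numpy as np

# Assumed alphabetical ordering; the paper only states that the 20 standard
# amino acids receive indices 1-20 and the padding character 'X' receives 0.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
AA_TO_INT = {aa: i + 1 for i, aa in enumerate(AMINO_ACIDS)}  # 1..20
MAX_LEN = 200  # fits the longest AMP (183 aa) and non-AMP (175 aa) in the dataset

def encode_peptide(seq):
    """Convert a peptide string into a zero-padded integer vector of length 200."""
    indices = [AA_TO_INT[aa] for aa in seq.upper()]
    # Leading (left) zero padding, matching the 'leading padding characters'
    # described in the text.
    return np.array([0] * (MAX_LEN - len(indices)) + indices, dtype=np.int32)

X = np.stack([encode_peptide(s) for s in ["GIGKFLHSAKKFGKAFVGEIMNS", "KWKLFKKI"]])
print(X.shape)  # (2, 200)
```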

The 64 filters, which activate when a matching sequence pattern/motif is detected, are convolved in a single (forward) direction. A max pooling layer (pool_length: 5) then downsamples these filter values by sliding nonoverlapping windows of length 5 and selecting the largest value. This helps to reduce overfitting and provides slight invariance to residue position. Next, an LSTM layer with 100 units is applied (unroll: True, stateful: False, dropout: 0.1 and rest default settings), which identifies patterns along a sequence that can be separated by large gaps. The LSTM dropout parameter (Srivastava et al., 2014), which helps with overfitting by randomly ignoring 10% of inputs, is used, as it slightly improves the DNN’s predictive performance. Each LSTM unit is composed of the following gates, where the W and U terms are parameter matrices and the b terms are bias vectors, as outlined in Hochreiter and Schmidhuber (1997):
$$
\begin{aligned}
i_t &= \sigma(W_i x_t + U_i h_{t-1} + b_i) && \text{input gate}\\
\tilde{C}_t &= \tanh(W_c x_t + U_c h_{t-1} + b_c) && \text{candidate cell state}\\
f_t &= \sigma(W_f x_t + U_f h_{t-1} + b_f) && \text{forget gate}\\
C_t &= i_t \circ \tilde{C}_t + f_t \circ C_{t-1} && \text{cell activation gate}\\
o_t &= \sigma(W_o x_t + U_o h_{t-1} + V_o C_t + b_o) && \text{output gate}\\
h_t &= o_t \circ \tanh(C_t) && \text{hidden gate}
\end{aligned}
$$

Above, $\circ$ refers to the entry-wise (Schur) product (Davis, 1962). The cell activation gate produces the cell state $C_t$ by combining the candidate state $\tilde{C}_t$ with the previous cell state $C_{t-1}$; the forget gate $f_t$ controls how much of the old state is carried forward, and the result shapes the hidden state $h_t$ and the final output $o_t$. We note that LSTMs can be implemented so that information is passed in reverse or bidirectionally, but we did not find this to significantly impact recognition performance. In addition, we note that, while the best model we found uses LSTM, similar performance (within 1–2% ACC) was found when replacing LSTM with Gated Recurrent Units (Chung et al., 2014) (another type of recurrent layer). Additional information for the interested reader on the architecture of LSTM and other recurrent layers is available in Hochreiter and Schmidhuber (1997), Gers et al. (2000) and Graves et al. (2013).

Finally, as Figure 1 shows, the output from the LSTM layer is passed through a final dense layer that uses a sigmoid function to force predictions into the range [0, 1]. We compile our Keras model with the ‘adam’ optimizer (loss: binary_crossentropy, metrics: accuracy) and train it for 10 epochs. In a 10-fold cross validation (CV) setting, we find that removing the Conv and max pooling layers reduces the ACC by 4% and increases the runtime by approximately 39%.
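Putting the hyperparameters quoted above together, a minimal sketch of the described architecture in current Keras 2 syntax might look as follows. This is a reconstruction from the parameters listed in the text (which uses the older Keras 1 argument names), not the authors' released code; the released production model is available from the AMP Scanner server.

```python
from keras.models import Sequential
from keras.layers import Embedding, Conv1D, MaxPooling1D, LSTM, Dense

def build_amp_dnn(max_len=200, vocab_size=21):
    """Sketch of the Embedding -> Conv1D -> MaxPooling -> LSTM -> Dense model."""
    model = Sequential()
    # 21 input symbols: the 20 amino acids (indices 1-20) plus the padding index 0.
    model.add(Embedding(input_dim=vocab_size, output_dim=128, input_length=max_len))
    # 64 one-dimensional filters of length 16; padding='same' mirrors border_mode: same.
    model.add(Conv1D(filters=64, kernel_size=16, strides=1, padding='same',
                     activation='relu', kernel_initializer='normal'))
    # Non-overlapping windows of length 5.
    model.add(MaxPooling1D(pool_size=5))
    # 100 LSTM units with 10% input dropout.
    model.add(LSTM(100, unroll=True, stateful=False, dropout=0.1))
    # Sigmoid output scales predictions to [0, 1].
    model.add(Dense(1, activation='sigmoid'))
    model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])
    return model

model = build_amp_dnn()
# model.fit(X_train, y_train, epochs=10)  # 10 epochs, as described in the text
```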

2.3 Model tuning and construction

As common practice dictates, we build two separate models: first a ‘training’ model used for evaluating the testing partition and second a ‘production’ model for the web server, which is trained on all available data. Model parameters are first tuned through a randomized search using the Hyperas wrapper package (https://github.com/maxpumperla/hyperas) for Keras. The tuning step only uses the training and tuning partitions. After parameters are selected, the training model is built by merging the training and tuning partitions, and performance is evaluated on the testing dataset. All data partitions are combined to generate the production model and for the 10-fold CV setting, with model parameters unchanged. A prediction probability >0.5 denotes an ‘AMP’, and a probability ≤0.5 denotes a ‘non-AMP’.
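For readers unfamiliar with Hyperas, the sketch below shows how such a randomized search can be wired up. It is illustrative only: the {{choice(...)}} ranges, the placeholder data() loader and the choice of hyperopt's random-search algorithm are assumptions on our part and are not taken from the paper.

```python
import numpy as np
from hyperopt import Trials, STATUS_OK, rand
from hyperas import optim
from hyperas.distributions import choice

def data():
    # Placeholder loader: replace with the real training and tuning partitions.
    x_train = np.random.randint(0, 21, size=(1424, 200))
    y_train = np.random.randint(0, 2, size=(1424,))
    x_tune = np.random.randint(0, 21, size=(708, 200))
    y_tune = np.random.randint(0, 2, size=(708,))
    return x_train, y_train, x_tune, y_tune

def create_model(x_train, y_train, x_tune, y_tune):
    from keras.models import Sequential
    from keras.layers import Embedding, Conv1D, MaxPooling1D, LSTM, Dense
    model = Sequential()
    model.add(Embedding(21, {{choice([64, 128, 256])}}, input_length=200))       # assumed range
    model.add(Conv1D({{choice([32, 64, 128])}}, {{choice([8, 16, 32])}},
                     padding='same', activation='relu'))                          # assumed range
    model.add(MaxPooling1D(pool_size=5))
    model.add(LSTM({{choice([50, 100, 200])}}, dropout=0.1))                      # assumed range
    model.add(Dense(1, activation='sigmoid'))
    model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])
    model.fit(x_train, y_train, validation_data=(x_tune, y_tune), epochs=10, verbose=0)
    _, acc = model.evaluate(x_tune, y_tune, verbose=0)
    # Hyperas minimizes the loss, so maximize tuning accuracy by negating it.
    return {'loss': -acc, 'status': STATUS_OK, 'model': model}

best_run, best_model = optim.minimize(model=create_model, data=data,
                                      algo=rand.suggest,  # random search
                                      max_evals=100,      # 100 model permutations
                                      trials=Trials())
```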

2.4 Model evaluation

We evaluate classification performance in terms of sensitivity (SENS), specificity (SPEC), ACC and Matthews Correlation Coefficient (MCC), which are defined using the number of true positive (TP), true negative (TN), false-positive (FP) and false-negative (FN) predictions. Specifically, SENS = TP/(TP+FN) × 100%, SPEC = TN/(TN+FP) × 100%, ACC = (TP+TN)/(TP+FP+TN+FN) × 100% and MCC = (TP×TN − FN×FP) / √((TP+FN)(TN+FP)(TP+FP)(TN+FN)).
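For reference, these four metrics can be computed directly from confusion-matrix counts; the counts in the example call below are hypothetical.

```python
import math

def classification_metrics(tp, tn, fp, fn):
    """Return SENS, SPEC, ACC (in %) and MCC from confusion-matrix counts."""
    sens = tp / (tp + fn) * 100.0
    spec = tn / (tn + fp) * 100.0
    acc = (tp + tn) / (tp + fp + tn + fn) * 100.0
    mcc = (tp * tn - fn * fp) / math.sqrt((tp + fn) * (tn + fp) * (tp + fp) * (tn + fn))
    return sens, spec, acc, mcc

# Hypothetical counts on a 712 + 712 testing partition:
print(classification_metrics(tp=640, tn=656, fp=56, fn=72))
```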

We also make use of the receiver operating characteristic (ROC) curve (Hanley and McNeil, 1982). The ROC curve shows the performance of a classification model as one varies a discrimination threshold. Specifically, predictions are ranked in descending order, and a threshold value, used to delineate true and false predictions, is adjusted so as to capture the TP rate as a function of the FP rate (Hanley and McNeil, 1982). In addition to drawing the ROC curve, we report the area under the ROC curve (auROC) to evaluate performance in a quantitative, comparative setting. auROC ranges from 0.5 (corresponding to a random guess) to 1 (corresponding to the case when all predictions are correct). We calculate auROC using the pROC vr.1.8 package (Robin et al., 2011) in R (R Core Team, 2015) vr.3.4.1.
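The paper computes auROC with the pROC package in R; an equivalent Python computation (a sketch, not the authors' pipeline, with toy labels and scores) uses scikit-learn:

```python
import numpy as np
from sklearn.metrics import roc_curve, roc_auc_score

# y_true holds 1 for AMP and 0 for non-AMP; y_score holds the DNN's sigmoid outputs.
y_true = np.array([1, 1, 0, 1, 0, 0, 1, 0])
y_score = np.array([0.91, 0.67, 0.44, 0.82, 0.12, 0.55, 0.73, 0.30])

fpr, tpr, thresholds = roc_curve(y_true, y_score)  # TP rate as a function of FP rate
auroc = roc_auc_score(y_true, y_score)             # 0.5 = random guess, 1.0 = perfect
print(f"auROC = {auroc:.3f}")
```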

2.5 Embedding visualization

To generate the amino acid embedding vector visualization, the embedding layer weights are first extracted from the Keras model trained on all the data. Then, t-distributed stochastic neighbor embedding (t-SNE) (Van der Maaten and Hinton, 2008) is applied (n_components: 2, init: pca, perplexity: 30, n_iter_without_progress: 300, method: exact) through the scikit-learn (Pedregosa et al., 2011) vr.0.18.1 package to reduce the original 128D vectors down to two dimensions. Plotting is done using matplotlib (Hunter, 2007) vr.2.0.2.
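A minimal sketch of this visualization step follows. The random weight matrix keeps the sketch self-contained and stands in for the trained embedding weights, the symbol ordering is assumed and the perplexity guard accounts for newer scikit-learn releases; none of these details come from the paper beyond the listed t-SNE parameters.

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.manifold import TSNE

AMINO_ACIDS = "XACDEFGHIKLMNPQRSTVWY"  # padding symbol plus the 20 amino acids (assumed order)

# In practice these come from the trained production model, e.g.
# embedding_weights = model.layers[0].get_weights()[0]   # shape (21, 128)
# A random placeholder matrix keeps this sketch self-contained:
embedding_weights = np.random.rand(21, 128)

# The paper uses perplexity 30 with scikit-learn 0.18.1; newer releases require
# perplexity < n_samples, hence the min() guard.
tsne = TSNE(n_components=2, init='pca', perplexity=min(30, len(AMINO_ACIDS) - 1),
            n_iter_without_progress=300, method='exact')
coords = tsne.fit_transform(embedding_weights)

fig, ax = plt.subplots()
ax.scatter(coords[:, 0], coords[:, 1])
for letter, (x, y) in zip(AMINO_ACIDS, coords):
    ax.annotate(letter, (x, y))
plt.show()
```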

2.6 Reduced alphabet construction and testing

Amino acid letters are assigned to clusters in the embedding space using scikit-learn’s k-means algorithm (Lloyd, 1982) (n_clusters: 8), considering only the 20 classic amino acids. The setting of k = 8 is selected after plotting the sum of squared distances between samples and cluster centroids for k ranging over [1, 19] and looking for a bend based on the elbow method (Thorndike, 1953) (a detailed analysis is shown in the Supplementary Information). The nonbiological ‘X’ padding character is manually assigned to its own cluster to bring the total number of clusters to 9.
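A sketch of the clustering and elbow analysis is shown below. The placeholder embedding matrix and the fixed random_state are assumptions added to keep the sketch self-contained and reproducible; only the k range, n_clusters: 8 and the exclusion of the padding symbol follow the text.

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.cluster import KMeans

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# Placeholder for the 20 x 128 embedding vectors of the classic amino acids
# (the padding symbol's row is excluded from clustering).
aa_vectors = np.random.rand(20, 128)

# Elbow analysis: within-cluster sum of squared distances (inertia) for k in [1, 19].
inertias = [KMeans(n_clusters=k, random_state=0).fit(aa_vectors).inertia_
            for k in range(1, 20)]
plt.plot(range(1, 20), inertias, marker='o')
plt.xlabel('k')
plt.ylabel('Sum of squared distances')
plt.show()

# Final clustering with the selected k = 8; the padding character 'X' is kept
# as its own ninth cluster outside of k-means.
clusters = KMeans(n_clusters=8, random_state=0).fit_predict(aa_vectors)
print(dict(zip(AMINO_ACIDS, clusters)))
```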

After a unique letter is randomly chosen to represent each cluster (maintaining the padding cluster as ‘X’), a new ‘DNN-reduced’ alphabet is obtained. The DNN model is then trained using the same hyperparameters as above but representing peptide sequences using the DNN-reduced alphabet. This process is repeated 100 times (using different seeds), and ACC and MCC results on the testing dataset (also converted to the new, reduced alphabet) are reported as averages, along with standard deviations (SD) to check for consistency.
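Once cluster labels have been chosen, re-encoding a peptide into the reduced alphabet is a simple mapping. The grouping below is a hypothetical placeholder used purely for illustration; the actual DNN-reduced groups come from the k-means assignment and are listed in Table S6 of the Supplementary Information.

```python
# Hypothetical cluster membership: amino acid -> representative cluster letter.
# The real DNN-reduced groups come from the k-means assignment described above.
REDUCED_MAP = {
    'A': 'A', 'G': 'A', 'S': 'A', 'T': 'A',   # placeholder group 1
    'D': 'D', 'E': 'D',                        # placeholder group 2
    'K': 'K', 'R': 'K', 'H': 'K',              # placeholder group 3
    'I': 'I', 'L': 'I', 'V': 'I', 'M': 'I',    # placeholder group 4
    'F': 'F', 'Y': 'F', 'W': 'F',              # placeholder group 5
    'N': 'N', 'Q': 'N',                        # placeholder group 6
    'C': 'C',                                  # placeholder group 7
    'P': 'P',                                  # placeholder group 8
    'X': 'X',                                  # padding keeps its own cluster
}

def to_reduced_alphabet(seq):
    """Map a peptide sequence onto the 9-letter reduced alphabet."""
    return ''.join(REDUCED_MAP[aa] for aa in seq.upper())

print(to_reduced_alphabet("GIGKFLHSAKKFGKAFVGEIMNS"))
```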

To assess whether the DNN-reduced alphabet’s particular cluster membership outperforms random cluster membership, 100 additional ‘randomized’ alphabets are also constructed. For each, ‘X’ is maintained as its own cluster, but the 20 real amino acids are randomly assigned to 8 clusters of the same relative sizes as for the DNN-reduced alphabet. The experiment described above is then repeated, training the DNN model (using the same hyperparameters) on peptide sequences represented using a particular randomized alphabet. Differences in recognition performance between the DNN-reduced and randomized models are evaluated as follows. A two-sample, one-tailed t-test with 95% confidence interval using R is applied to see whether the ACC or MCC means of the 100 DNN-reduced alphabet model classifications are statistically higher than the ACC or MCC means, respectively, of the 100 random alphabet model classifications.

To ensure that findings are applicable beyond DNNs, we repeat the DNN-reduced versus randomized alphabet experiment described above using gkmSVM vr.2.0 (Ghandi et al., 2014). The gkmSVM uses a gapped k-mer SVM algorithm based on sequence composition. Unlike the proposed DNN, the gkmSVM is not stochastic, so only one result per alphabet is produced for a given dataset. Accordingly, we use a one-sample, one-tailed t-test (treating the reduced alphabet result as a ‘true mean’) and check whether the mean of the 100 random alphabet results is significantly lower.
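The tests are run in R in the paper; an equivalent formulation with SciPy is sketched below. The accuracy arrays are synthetic placeholders standing in for the 100 per-run ACC values, and the use of a Welch (unequal-variance) two-sample test is an assumption consistent with the non-integer degrees of freedom reported in Section 3.

```python
import numpy as np
from scipy import stats  # requires SciPy >= 1.6 for the 'alternative' keyword

# Placeholder arrays standing in for the 100 per-run ACC values of each setting.
acc_reduced = np.random.normal(89.6, 0.9, 100)   # DNN, DNN-reduced alphabet
acc_random = np.random.normal(81.3, 3.2, 100)    # DNN, randomized alphabets

# Two-sample, one-tailed Welch t-test: is the DNN-reduced mean significantly higher?
t_stat, p_value = stats.ttest_ind(acc_reduced, acc_random,
                                  equal_var=False, alternative='greater')
print(t_stat, p_value)

# One-sample, one-tailed t-test for gkmSVM: is the mean of the 100 random-alphabet
# results significantly lower than the single reduced-alphabet result (87.78% ACC)?
gkm_random = np.random.normal(79.1, 3.2, 100)    # placeholder values
t_stat, p_value = stats.ttest_1samp(gkm_random, popmean=87.78, alternative='less')
print(t_stat, p_value)
```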

2.7 Experimental setup and runtime performance

Experiments are conducted on an Intel i5 laptop with a four-core 1.7 GHz processor and 8 GB of RAM. The DNN is constructed using the Keras vr.2.0.6 open-source Python neural network library with a CPU-based TensorFlow vr.1.2.1 back-end. Training takes approximately 2 min with the training partition, 5 min using all data and 1 h for 10-fold CV. Hyperparameter tuning using the Hyperas vr.0.4 library takes approximately 8 h for 100 model permutations. Running the trained network on the testing partition takes <1 min.

3 Results

3.1 Model performance

Table 1 shows the classification performance for each of the data partitions. The partitions (training set and evaluation set) are shown in columns 1 and 2. Columns 3–7 list SENS, SPEC, ACC, MCC and auROC. In particular, row 4 shows the performance of the DNN model used for comparing performance to other AMP recognition servers. Row 5 shows the performance of the ‘production model’ made available on the AMP Scanner web server (we recall that this model is trained on all available data). Row 6 shows recognition performance in a 10-fold CV setting, where each of the 10 folds is used once as a testing partition (with the model trained on the other nine folds).

Table 1.

Model performance on different training and evaluation data partitions

Training set | Evaluation set | SENS (%) | SPEC (%) | ACC (%) | MCC | auROC (%)
Train-Only | Train | 98.60 | 98.87 | 98.69 | 0.9706 | 99.87
Train-Only | Tune | 95.76 | 83.85 | 87.80 | 0.7582 | 96.67
Train+Tune | Train+Tune | 97.19 | 99.53 | 98.36 | 0.9674 | 99.75
Train+Tune | Test | 89.89 | 92.13 | 91.01 | 0.8204 | 96.48
All Data | All Data | 98.26 | 99.66 | 98.96 | 0.9793 | 99.94
All Data | 10-fold CV | 88.81 (±3.53) | 94.21 (±2.68) | 91.51 (±0.89) | 0.8327 (±0.02) | 96.58 (±0.66)

Note: Performance is shown for DNN models built and evaluated on the datasets listed in columns 1 and 2, respectively, on metrics listed in columns 3–7. The bottom row shows 10-fold CV performance; accompanying SD values are shown in parentheses.


Table 1 reports averages along with SD listed in parentheses. All models show generally good recognition, with SENS, SPEC, ACC and auROC values all above 80% and MCC scores ranging from 0.76 to 0.98. The 10-fold CV results in row 6 may best reflect how the model would perform given new, unseen AMPs. The relatively low SD values reported in row 6 for ACC, MCC and auROC suggest the models have strong performance on about 90% of the dataset but may struggle with AMP families that are still poorly represented in the APD vr.3 database. Inspecting the FN sequences from the ‘All Data’ model (row 5) reveals 34 (of 1778) AMPs are missed (the detailed list is available in the Supplementary Information). Many of these are sequences that share low (<30%) sequence identity with other AMPs in the APD vr.3 database.

3.2 Comparison with state-of-the-art methods

Table 2 compares our DNN model (trained on the merged training and tuning partitions and evaluated on the withheld testing partition) to eight state-of-the-art machine learning methods for AMP recognition. Column 1 in Table 2 lists the methods being compared, and columns 2–6 list the five performance metrics. Row 9 shows our DNN model. The performance of the DNN model on the DNN-reduced alphabets and random alphabets is also shown in rows 10–11, respectively. The performance of gkmSVM on the DNN-reduced and random alphabets is shown in rows 12–13, respectively. Results in bold in Table 2 denote the best performance for a given metric (column). Default settings are used with AntiBP2 (full sequence composition, SVM threshold: 0); we note that other AMP servers do not allow adjustment of algorithm parameters.

Table 2.

Performance comparison on the AMP dataset testing partition

Method | SENS (%) | SPEC (%) | ACC (%) | MCC | auROC (%)
AntiBP2 | 87.91 | 90.80 | 89.37 | 0.7876 | 89.36
CAMP-ANN | 82.98 | 85.09 | 84.04 | 0.6809 | 84.06
CAMP-DA | 87.08 | 80.76 | 83.92 | 0.6797 | 89.97
CAMP-RF | 92.70 | 82.44 | 87.57 | 0.7554 | 93.63
CAMP-SVM | 88.90 | 79.92 | 84.41 | 0.6910 | 90.63
iAMP-2L | 83.99 | 85.86 | 84.90 | 0.6983 | 84.90
iAMPpred | 89.33 | 87.22 | 88.27 | 0.7656 | 94.44
gkmSVM | 88.34 | 90.59 | 89.46 | 0.7895 | 94.98
Our DNN | 89.89 | 92.13 | 91.01 | 0.8204 | 96.48
DNN reduced amino acid | 88.66 (±4.06) | 90.47 (±3.05) | 89.57 (±0.94) | 0.7938 (±0.02) | 96.13 (±0.32)
DNN random amino acid | 81.00 (±5.95) | 81.64 (±7.73) | 81.32 (±3.19) | 0.6310 (±0.06) | 89.55 (±2.55)
gkmSVM reduced amino acid | 87.92 | 87.64 | 87.78 | 0.7556 | 94.16
gkmSVM random amino acid | 80.02 (±3.77) | 78.13 (±3.22) | 79.07 (±3.16) | 0.5819 (±0.06) | 86.68 (±3.17)

Note: Recognition performance on the testing dataset is shown for state-of-the-art methods (listed in column 1) on the metrics listed in columns 2–6. Best performance on a metric is marked in bold. Our DNN model is shown in row 9. The four bottom rows show performance of the DNN model and the gkmSVM model on the DNN-reduced versus random alphabets.


Table 2 shows that our DNN model achieves the best performance in terms of SPEC, ACC, MCC and auROC. The CAMP Database RF model obtains the highest SENS score (3% higher than our model). The next best performer on ACC and MCC is the gkmSVM method, while the AntiBP2 server obtains similar performance (approximately 1% and 0.03 lower ACC and MCC values compared to our DNN model, respectively). The method implemented in the AntiBP2 server is a nongapped SVM method based on terminal sequence composition (which explains the similar performance with gkmSVM), but we note that results obtained with AntiBP2 exclude 211 testing sequences that fail the server’s 15–100 amino acid length requirement. The most recent version of the iAMPpred server also obtains good performance overall (2.7% and 0.05 lower ACC and MCC values compared to our DNN model, respectively), likely due to its ability to capture correlated sequence positions via PseAAC in addition to other physicochemical features.

Figure 2 visually compares the performance of the different servers and of our DNN model by plotting ROC curves. As Table 2 shows, the auROCs range between approximately 84% and 96%. The auROC for our DNN model (ROC curve drawn in black) is roughly 2% higher than the next best server curve, which is achieved by iAMPpred (ROC curve drawn in blue).

Fig. 2.

ROC curves are shown for the various methods compared, ordered from high to low performance in terms of area under the curve (AUC). Straight lines for AntiBP2, CAMP-ANN and iAMP-2L approximate the ROC curve using binary prediction results, as probability values are not provided. For the AntiBP2 curve, 211 testing sequences are excluded due to length restrictions set by the AntiBP2 server

Comparative analysis showing our DNN to have better performance against the above AMP prediction servers on three additional datasets is provided in the Supplementary Information.

3.3 Reduced alphabet model analysis

A 2D t-SNE representation for the trained amino acid embedding vectors of our DNN production model (described in Section 2) is shown in Figure 3. Based on the proximity of neighboring amino acids in this representation, one can see that the DNN has learned basic amino acid physicochemical properties. Residues located closer together share more similar activation patterns, while more dissimilar amino acids lie farther apart in the projection. For example, the negative amino acids aspartic acid (D) and glutamic acid (E) are close to each other, and their distance from the padding character (X) may reflect the importance of charge in attracting AMPs to bacterial membranes (Boman, 2003; Wang, 2010). Amino acids with uncharged side chains, such as serine (S), threonine (T), asparagine (N) and glutamine (Q), are also neighbors. It is perhaps unsurprising to see amino acids with unique roles in structure formation or ligand interactions appear slightly isolated and more distant. For example, cysteine (C), which forms disulfide bonds, stands on its own in the bottom right. Tryptophan (W), slightly separated on the right, has a preference for hydrophobic cores and a bulky indole side chain that is involved in aromatic stacking interactions (Betts and Russell, 2003).

Fig. 3.

 A 2D t-SNE (Van der Maaten and Hinton, 2008) projection of the 128D amino acid embedding vectors. K-means was used to select clusters for the DNN-reduced alphabet as listed in Table S6 of the Supplementary Information

The coloring of the amino acids in Figure 3 denotes the 9 clusters (based on k-means) that can now be used to encode a peptide sequence in place of the classic amino acids and the padding ‘X’ letter. We evaluate whether this reduced encoding still retains enough information to distinguish AMPs from non-AMPs. As described in Section 2, average ACC and MCC values (over 100 independent evaluations via our DNN model) and SD values are shown in row 10 of Table 2. Comparison with the DNN model operating on the full alphabet (of 21 letters) shows a slight drop of about 1.4% in ACC and 0.03 in MCC for the reduced alphabet. Despite using only 9 of the original 21 letters, the DNN-reduced model still outperforms all other non-DNN predictors listed in Table 2 in terms of ACC, MCC and auROC. Table 2 also shows that the DNN-reduced alphabet confers good performance to gkmSVM as well (row 12). ACC and MCC scores are similarly reduced (compared to the full-size alphabet) by around 1.7% and 0.03, respectively. Evaluations using additional values for k are available in the Supplementary Information.

To ensure specific amino acid membership in clusters is playing a role and that a DNN would not perform just as well with randomly assigned clusters (randomly grouping amino acids into pseudo-amino acids), we generate 100 random alphabets (as described in Section 2) and evaluate them via our DNN model on the testing dataset. The performance of the DNN models and the gkmSVM models on the random alphabets (averaged over the 100 runs), shown in rows 11 and 13 of Table 2, respectively, is the worst among all methods in terms of SENS, ACC and MCC, suggesting cluster membership is important; the gkmSVM model on the random alphabets yields the lowest ACC overall. A one-sided t-test reveals that the greater performance of the DNN-reduced alphabet DNN model over the random alphabet DNN models is statistically significant in terms of both ACC (P < 2.2 × 10⁻¹⁶, df = 116.1, t = 24.8) and MCC (P < 2.2 × 10⁻¹⁶, df = 112.3, t = 25.5). Similarly, a one-sided t-test reveals that the greater performance of the DNN-reduced alphabet gkmSVM model over the random alphabet gkmSVM models is statistically significant in terms of both ACC (P < 2.2 × 10⁻¹⁶, df = 99, t = 27.5) and MCC (P < 2.2 × 10⁻¹⁶, df = 99, t = 27.4).

4 Discussion

We have presented a new DNN-based classifier that achieves better AMP recognition performance compared to existing, state-of-the-art methods. To the best of our knowledge, this is the first time a deep learning approach has been applied to address AMP classification. By utilizing a deep network architecture, the proposed model automatically extracts expert-free features and so removes the reliance on domain experts for feature construction. The production model and all datasets are made available via the AMP Scanner web server that accompanies this paper. The production model can correctly identify over 98% of the AMPs currently available in the APD vr.3 that are listed as active against Gram-positive and/or Gram-negative bacteria and are at least 10 amino acids in length. The server supports high-throughput screening experiments to aid systematic virtual screenings of peptide libraries by wet-laboratory researchers. The reduced alphabet (of 9 letters) learned from the DNN model (additionally utilizing statistical techniques) further aids these efforts by reducing the size of the sequence space that can be considered when exploring for novel peptides with AMP activity. The DNN model on the reduced alphabet retains good AMP recognition accuracy, even outperforming other non-DNN methods operating on the full alphabet of 21 letters.

An open question in AMP research is how computational AMP predictions may relate to actual biological activity. Recent work by Lee et al. (2016) supports the notion that some models may predict more indirect properties. After comparing predictions on α-helical AMPs using an SVM and 12 physicochemical descriptors to small-angle X-ray scattering data, Lee et al. (2016) find that results best correlated with negative Gaussian membrane curvature. While results for the particular features used in (Lee et al., 2016) do not correlate with a more direct measure of AMP activity like minimum inhibitory concentration, disruption of bulk membrane properties (e.g. membrane curvature, lipid clustering, etc.) is a well-known mechanism of action for AMPs to kill bacteria and has been extensively studied in AMPs such as the human cathelicidin LL-37 (Epand et al., 2016).

It is possible that indirect AMP signatures are also being identified by the DNN model which relate more to their structural properties than direct measures of their antibacterial activity (such as minimum inhibitory concentration). Therefore, probabilities associated with model predictions ought to be limited to the context of recognizing AMP-like peptides within a list of one or more unknown sequences. In other words, the probabilities for two peptides predicted as ‘AMP’ should not be directly compared and interpreted as one having ‘stronger’ or ‘weaker’ antibacterial activity against any given bacteria. The availability of reliable direct measures of AMP activity on large AMP datasets will allow the field to proceed beyond binary classification to build models, DNN ones included, that directly predict antibacterial activity against specific species of bacteria.

Additional future work can consider different network architectures, such as variational encoders and/or generative adversarial networks, which have been shown highly effective in finding latent structures and features (Goodfellow et al., 2014; Kingma and Welling, 2014). More recent methods, such as Dynamic Memory Networks, which have seen great success in complex natural language-processing and speech recognition tasks, may also present appealing directions for further investigation (Kumar et al., 2016).

Acknowledgements

We thank the members of the NIAID BCBB and Shehu laboratories as well as Jianlin Cheng for helpful feedback and suggestions to improve this work.

Funding

This work was supported in part by National Science Foundation [1144106 to A.S.].

Conflict of Interest: none declared.

References

Abadi, M. et al. (2016) TensorFlow: a system for large-scale machine learning. In Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI). USENIX Association, Savannah, Georgia.
Bahdanau, D. et al. (2014) Neural machine translation by jointly learning to align and translate. arXiv e-Print, arXiv:1409.0473.
Betts, M.J., Russell, R.B. (2003) Amino acid properties and consequences of substitutions. In: Barnes, M.R., Grays, I.C. (eds) Bioinformatics for Geneticists. Wiley, West Sussex, England, pp. 291–314.
Bishop, B.M. et al. (2015) Bioprospecting the American alligator (Alligator mississippiensis) host defense peptidome. PLoS ONE, 10, e0117394.
Boman, H.G. (2003) Antibacterial peptides: basic facts and emerging concepts. J. Intern. Med., 254, 197–215.
Cherkasov, A., Jankovic, B. (2004) Application of ‘inductive’ QSAR descriptors for quantification of antibacterial activity of cationic polypeptides. Molecules, 9, 1034–1052.
Chou, K.-C. (2001) Prediction of protein cellular attributes using pseudo-amino acid composition. Proteins, 43, 246–255.
Chung, J. et al. (2014) Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv e-Print, arXiv:1412.3555.
Davis, C. (1962) The norm of the Schur product operation. Numerische Math., 4, 343–344.
Epand, R.M. et al. (2016) Molecular mechanisms of membrane targeting antibiotics. Biochim. Biophys. Acta, 1858, 980–987.
Fernandes, F.C. et al. (2012) Prediction of antimicrobial peptides based on the adaptive neuro-fuzzy inference system application. Peptide Sci., 98, 280–287.
Fjell, C. et al. (2007) AMPer: a database and an automated discovery tool for antimicrobial peptides. Bioinformatics, 23, 1148–1155.
Fjell, C. et al. (2009) Identification of novel antibacterial peptides by chemoinformatics and machine learning. J. Med. Chem., 52, 2006–2015.
Gers, F.A. et al. (2000) Learning to forget: continual prediction with LSTM. Neural Comput., 12, 2451–2471.
Ghandi, M. et al. (2014) Enhanced regulatory sequence prediction using gapped k-mer features. PLoS Comput. Biol., 10, e1003711.
Goodfellow, I.J. et al. (2014) Generative adversarial nets. In: Ghahramani, Z. et al. (eds) Advances in Neural Information Processing Systems 27. Curran Associates, Inc., pp. 2672–2680.
Graves, A. et al. (2013) Hybrid speech recognition with deep bidirectional LSTM. In 2013 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU). IEEE, pp. 273–278.
Hanley, J.A., McNeil, B.J. (1982) The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology, 143, 29–36.
Hochreiter, S., Schmidhuber, J. (1997) Long short-term memory. Neural Comput., 9, 1735–1780.
Huang, Y. et al. (2010) CD-HIT suite: a web server for clustering and comparing biological sequences. Bioinformatics, 26, 680–682.
Hunter, J.D. (2007) Matplotlib: a 2D graphics environment. Comput. Sci. Eng., 9, 90–95.
Jo, T. et al. (2015) Improving protein fold recognition by deep learning networks. Sci. Rep., 5, 17573.
Kamath, U. et al. (2014) Effective automated feature construction and selection for classification of biological sequences. PLoS ONE, 9, e99982.
Kent, W.J. (2002) BLAT - the BLAST-like alignment tool. Genome Res., 12, 656–664.
Kingma, D.P., Welling, M. (2014) Auto-encoding variational Bayes. In International Conference on Learning Representations (ICLR), pp. 1–14.
Kumar, A. et al. (2016) Ask me anything: dynamic memory networks for natural language processing. In: Balcan, M.F., Weinberger, K.Q. (eds) International Conference on Machine Learn Res (MLR), Vol. 48, New York, NY, pp. 1378–1387.
Lata, S. et al. (2007) Analysis and prediction of antibacterial peptides. BMC Bioinformatics, 8, 263–272.
Lata, S. et al. (2010) AntiBP2: improved version of antibacterial peptide prediction. BMC Bioinformatics, 11, S19.
LeCun, Y. et al. (2015) Deep learning. Nature, 521, 436–444.
Lee, E.Y. et al. (2016) Mapping membrane activity in undiscovered peptide sequence space using machine learning. Proc. Natl. Acad. Sci. USA, 113, 13588–13593.
Lloyd, S. (1982) Least squares quantization in PCM. IEEE Trans. Inf. Theory, 28, 129–137.
Magrane, M. and the UniProt consortium (2011) UniProt knowledgebase: a hub of integrated protein data. Database, 2011, bar009.
Meher, P.K. et al. (2017) Predicting antimicrobial peptides with improved accuracy by incorporating the compositional, physico-chemical and structural features into Chou’s general PseAAC. Sci. Rep., 7, 42362.
Nielsen, H. et al. (2016) Convolutional LSTM networks for subcellular localization of proteins. Mach. Learn., 25, 01.
Pedregosa, F. et al. (2011) Scikit-learn: machine learning in Python. J. Mach. Learn. Res., 12, 2825–2830.
Price, L.B. et al. (2012) Staphylococcus aureus CC398: host adaptation and emergence of methicillin resistance in livestock. MBio, 3, e00305-11–e00311.
R Core Team (2015) R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria.
Randou, E.G. et al. (2013) Binary response models for recognition of antimicrobial peptides. In Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics. ACM, p. 76.
Robin, X. et al. (2011) pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics, 12, 77.
Schmidhuber, J. (2015) Deep learning in neural networks: an overview. Neural Netw., 61, 85–117.
Spencer, M. et al. (2015) A deep learning network approach to ab initio protein secondary structure prediction. IEEE/ACM Trans. Comput. Biol. Bioinform., 12, 103–112.
Srivastava, N. et al. (2014) Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res., 15, 1929–1958.
Thomas, S. et al. (2009) CAMP: a useful resource for research on antimicrobial peptides. Nucleic Acids Res., 38(Suppl. 1), D774–D780.
Thorndike, R.L. (1953) Who belongs in the family? Psychometrika, 18, 267–276.
Torrent, M. et al. (2011) Connecting peptide physicochemical and antimicrobial properties by a rational prediction model. PLoS ONE, 6, e16968.
U.S. Department of Health and Human Services (2013) Antibiotic Resistance Threats in the United States. U.S. Department of Health and Human Services, Atlanta, GA.
Van der Maaten, L., Hinton, G. (2008) Visualizing data using t-SNE. J. Mach. Learn. Res., 9, 2579–2605.
Veltri, D. (2015) A computational and statistical framework for screening novel antimicrobial peptides. PhD Dissertation, George Mason University, Fairfax, VA.
Veltri, D. et al. (2014) A novel method to improve recognition of antimicrobial peptides through distal sequence-based features. In 2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, pp. 371–378.
Veltri, D. et al. (2017) Improving recognition of antimicrobial peptides and target selectivity through machine learning and genetic programming. Trans. Comput. Biol. Bioinform., 14, 300–313.
Vinyals, O. et al. (2015) Grammar as a foreign language. In: Cortes, C. et al. (eds) Advances in Neural Information Processing Systems. Curran Associates, Inc., pp. 2773–2781.
Wang, G. (2010) Antimicrobial Peptides: Discovery, Design and Novel Therapeutic Strategies. CABI Bookshop, Wallingford, England.
Wang, G. et al. (2016) APD3: the antimicrobial peptide database as a tool for research and education. Nucleic Acids Res., 44, D1087–D1093.
Wimley, W.C., Hristova, K. (2011) Antimicrobial peptides: successes, challenges and unanswered questions. J. Membr. Biol., 239, 27–34.
World Health Organization (2014) Antimicrobial Resistance: Global Report on Surveillance 2014. WHO, Geneva, Switzerland.
Xiao, X. et al. (2013) iAMP-2L: a two-level multi-label classifier for identifying antimicrobial peptides and their functional types. Anal. Biochem., 436, 168–177.
Xingjian, S. et al. (2015) Convolutional LSTM network: a machine learning approach for precipitation nowcasting. In: Cortes, C. et al. (eds) Advances in Neural Information Processing Systems. Curran Associates, Inc., pp. 802–810.
Zelezetsky, I. et al. (2006) Evolution of the primate cathelicidin. J. Biol. Chem., 281, 19861–19871.

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com