Organism-specific training improves performance of linear B-cell epitope prediction

Basáñez

M.-G.

et al. (

2006

)

River blindness: a success story under threat?

PLoS Med

.,

3

,

e371

.

Blythe

M.J.

,

Flower

D.R.

(

2005

)

Benchmarking b cell epitope prediction: underperformance of existing methods

.

Protein Sci

.,

14

,

246

–

248

.

Chicco

D.

,

Jurman

G.

(

2020

)

The advantages of the Matthews correlation coefficient (MCC) over f1 score and accuracy in binary classification evaluation

.

BMC Genomics

,

21

,

6

.

Collatz

,

M.

et al. (

2021

)

EpiDope: a deep neural network for linear B-cell epitope prediction

.

Bioinformatics

,

37

,

448

–

455

.

Davison

A.C.

,

Hinkley

D.V.

(

2013

)

Bootstrap Methods and Their Application

.

Cambridge University Press

,

USA

.

Google Preview

EL-Manzalawy

Y.

et al. (

2008

)

Predicting linear B-cell epitopes using string kernels

.

J. Mol. Recognit. Interdiscipl. J

.,

21

,

243

–

255

.

Ferri

C.

et al. (

2015

)

HCV syndrome: a constellation of organ- and non-organ specific autoimmune disorders, B-cell non-Hodgkin’s lymphoma, and cancer

.

World J. Hepatol

.,

7

,

327

–

343

.

Forsström

B.

et al. (

2015

)

Dissecting antibodies with regards to linear and conformational epitopes

.

PLoS One

,

10

,

e0121673

.

Georgiev

A.G.

(

2009

)

Interpretable numerical descriptors of amino acid space

.

J. Comput. Biol

.,

16

,

703

–

723

.

Getzoff

E.D.

et al. (

1988

) The chemistry and mechanism of antibody binding to protein antigens. In: Dixon F.J.(ed.), Advances in Immunology, Vol.

43

, pp.

1

–

98

, Academic Press, Cambridge, MA, USA.

Giacò

L.

et al. (

2012

)

B-pred, a structure based B-cell epitopes prediction server

.

Adv. Appl. Bioinf. Chem

.,

5

,

11

–

21

.

Greenbaum

J.A.

et al. (

2007

)

Towards a consensus on datasets and evaluation metrics for developing B-cell epitope prediction tools

.

J. Mol. Recognit. Interdiscipl. J

.,

20

,

75

–

82

.

Haste Andersen

P.

et al. (

2006

)

Prediction of residues in discontinuous B-cell epitopes using protein 3D structures

.

Protein Sci

.,

15

,

2558

–

2567

.

Holm

S.

(

1979

)

A simple sequentially rejective multiple test procedure

.

Scand. J. Stat

.,

6

,

65

–

70

.

Hopp

T.P.

,

Woods

K.R.

(

1981

)

Prediction of protein antigenic determinants from amino acid sequences

.

Proc. Natl. Acad. Sci. USA

,

78

,

3824

–

3828

.

Jespersen

M.C.

et al. (

2017

)

Bepipred-2.0: improving sequence-based B-cell epitope prediction using conformational epitopes

.

Nucleic Acids Res

.,

45

,

W24

–

W29

.

Jespersen

M.C.

et al. (

2019

)

Antibody specific B-cell epitope predictions: leveraging information from antibody-antigen protein complexes

.

Front. Immunol

.,

10

,

298

.

Kaufman

S.

et al. (

2011

) Leakage in data mining. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining – KDD’11. ACM Press, San Diego, CA, USA.

Kindt

T.J.

et al. (

2007

) Kuby Immunology.

Macmillan Learning. New York, NY, USA

.

Kolaskar

A.

,

Tongaonkar

P.C.

(

1990

)

A semi-empirical method for prediction of antigenic determinants on protein antigens

.

FEBS Lett

.,

276

,

172

–

174

.

Kulkarni-Kale

U.

et al. (

2005

)

CEP: a conformational epitope prediction server

.

Nucleic Acids Res

.,

33

,

W168

–

W171

.

Dudek

N.

et al. (

2010

)

Epitope discovery and their use in peptide based vaccines

.

Curr. Pharm. Des

.,

16

,

3149

–

3157

.

Larsen

J.E.P.

et al. (

2006

)

Improved method for predicting linear B-cell epitopes

.

Immunome Res

.,

2

,

2

.

Leinikki

P.

et al. (

1993

) Synthetic peptides as diagnostic tools in virology. In: K. Maramorosh (eds.) et al., Advances in Virus Research, Vol.

42

, pp.

149

–

186

, Academic Press, Cambridge, MA, USA.

Lo

Y.-T.

et al. (

2013

)

Prediction of conformational epitopes with the use of a knowledge-based energy function and geometrically related neighboring residue characteristics

.

BMC Bioinformatics

,

14

,

S3

.

Lodish

H.

et al. (

2000

) Molecular Cell Biology, 4th edn.

W.H.Freeman & Co Ltd

. New York, NY, USA.

Manavalan

B.

et al. (

2018

)

iBCE-EL: a new ensemble learning framework for improved linear B-cell epitope prediction

.

Front. Immunol

.,

9

,

1695

.

NCBI Resource Coordinators. (

2015

)

Database resources of the national center for biotechnology information

.

Nucleic Acids Res

.,

44

,

D7

–

D19

.

PubMed

Osei-Atweneboana

M.Y.

et al. (

2007

)

Prevalence and intensity of Onchocerca volvulus infection and efficacy of ivermectin in endemic communities in Ghana: a two-phase epidemiological study

.

Lancet

,

369

,

2021

–

2029

.

Osorio

D.

et al. (

2015

)

Peptides: a package for data mining of antimicrobial peptides

.

R. J

.,

7

,

4

–

14

.

Pandurangan

A.P.

,

Blundell

T.L.

(

2020

)

Prediction of impacts of mutations on protein structure and interactions: SDM, a statistical approach, and MCSM, using machine learning

.

Protein Sci

.,

29

,

247

–

257

.

Parker

J.

et al. (

1986

)

New hydrophilicity scale derived from high-performance liquid chromatography peptide retention data: correlation of predicted surface residues with antigenicity and X-ray-derived accessible sites

.

Biochemistry

,

25

,

5425

–

5432

.

Paul

W.

(

2012

)

Fundamental Immunology

, 7th edn.

Lippincott Williams & Wilkins

,

London

.

Google Preview

Pellequer

J.

,

Westhof

E.

(

1993

)

Preditop: a program for antigenicity prediction

.

J. Mol. Graph

.,

11

,

204

–

210

.

Pellequer

J.

et al. (

1991

) Predicting location of continuous epitopes in proteins from their primary structures. In: Langone J.J. (ed.), Methods in Enzymology,

Vol. 203

.

pp.

176

–

201, Elsevier, Amsterdam

.

Pellequer

J.-L.

et al. (

1993

)

Correlation between the location of antigenic sites and the prediction of turns in proteins

.

Immunol. Lett

.,

36

,

83

–

99

.

Ponomarenko

J.V.

,

Bourne

P.E.

(

2007

)

Antibody-protein interactions: benchmark datasets and prediction tools evaluation

.

BMC Struct. Biol

.,

7

,

64

.

Potocnakova

L.

et al. (

2016

)

An introduction to B-cell epitope mapping and in silico epitope prediction

.

J. Immunol. Res

.,

2016

,

6760830

.

R Core Team. (

2020

) R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria.

Rezk

S.A.

et al. (

2018

)

Epstein–Barr virus (EBV)-associated lymphoid proliferations, a 2018 update

.

Hum. Pathol

.,

79

,

18

–

41

.

Saha

S.

,

Raghava

G.P.S.

(

2004

) BcePred: prediction of continuous B-cell epitopes in antigenic sequences using physico-chemical properties. In: Third International Conference on Artificial Immune Systems, Sicily, Italy, pp.

197

–

204, Springer

.

Saha

S.

,

Raghava

G.P.S.

(

2006

)

Prediction of continuous B-cell epitopes in an antigen using recurrent neural network

.

Proteins Struct. Funct. Bioinf

.,

65

,

40

–

48

.

Sanchez-Trincado

J.L.

et al. (

2017

)

Fundamentals and methods for T- and B-cell epitope prediction

.

J. Immunol. Res

.,

2017

,

2680160

.

Saravanan

V.

,

Gautham

N.

(

2015

)

Harnessing computational biology for exact linear B-cell epitope prediction: a novel amino acid composition-based feature descriptor

.

Omics J. Integr. Biol

.,

19

,

648

–

658

.

Shen

J.

et al. (

2007

)

Predicting protein–protein interactions based only on sequences information

.

Proc. Natl. Acad. Sci. USA

,

104

,

4337

–

4341

.

Singh

H.

et al. (

2013

)

Improved method for linear B-cell epitope prediction using antigen’s primary sequence

.

PLoS One

,

8

,

e62216

.

Tan

P.-N.

et al. (

2005

) Introduction to Data Mining.

Addison Wesley

, Boston, MA, USA.

UniProt Consortium. (

2020

)

UniProt: the universal protein knowledgebase in 2021

.

Nucleic Acids Res

.,

49

,

D480

–

D489

.

Van der Maaten

L.

,

Hinton

G.

(

2008

)

Visualizing data using t-SNE

.

J. Mach. Learn. Res

.,

9

,

2579

–

2605

.

Van Regenmortel

M.H.

(

1996

)

Mapping epitope structure and activity: from one-dimensional prediction to four-dimensional description of antigenic specificity

.

Methods

,

9

,

465

–

472

.

Vita

R.

et al. (

2019

)

The immune epitope database (IEDB): 2018 update

.

Nucleic Acids Res

.,

47

,

D339

–

D343

.

Wang

J.

et al. (

2017

)

Protein–protein interactions prediction using a novel local conjoint triad descriptor of amino acid sequences

.

Int. J. Mol. Sci

.,

18

,

2373

.

World Health Organization. (

2019

) Onchocerciasis Fact Sheet. https://www.who.int/news-room/fact-sheets/detail/onchocerciasis (22 July 2020, date last accessed).

Wright

M.N.

,

Ziegler

A.

(

2017

)

ranger: a fast implementation of random forests for high dimensional data in C++ and R

.

J. Stat. Softw

.,

77

,

1

–

17

.