lordFAST: sensitive and Fast Alignment Search Tool for LOng noisy Read sequencing Data

1000 Genomes Project Consortium

(

2012

)

An integrated map of genetic variation from 1, 092 human genomes

.

Nature

,

491

,

56

–

65

.

Alkan

C.

et al. (

2009

)

Personalized copy number and segmental duplication maps using next-generation sequencing

.

Nat. Genet

.,

41

,

1061

–

1067

.

Alkan

C.

et al. (

2011

)

Genome structural variation discovery and genotyping

.

Nat. Rev. Genet

.,

12

,

363

–

376

.

Bashir

A.

et al. (

2012

)

A hybrid approach for the automated finishing of bacterial genomes

.

Nat. Biotechnol

.,

30

,

701

–

707

.

Berlin

K.

et al. (

2015

)

Assembling large genomes with single-molecule sequencing and locality-sensitive hashing

.

Nat. Biotechnol

.,

33

,

623

–

630

.

Brown

S.D.

et al. (

2014

)

Comparison of single-molecule sequencing and hybrid approaches for finishing the genome of clostridium autoethanogenum and analysis of CRISPR systems in industrial relevant clostridia

.

Biotechnol. Biofuels

,

7

,

40.

Burrows

M.

,

Wheeler

D.J.

(

1994

) A block-sorting lossless data compression algorithm. Technical Report. DEC Labs.

Chaisson

M.J.

,

Tesler

G.

(

2012

)

Mapping single molecule sequencing reads using basic local alignment with successive refinement (blasr): application and theory

.

BMC Bioinformatics

,

13

,

238.

Chaisson

M.J.

et al. (

2015

)

Resolving the complexity of the human genome using single-molecule sequencing

.

Nature

,

517

,

608

–

611

.

Chaisson

M.J.

et al. (

2017

) Resolving multicopy duplications de novo using polyploid phasing. In:

International Conference on Research in Computational Molecular Biology

, pp.

117

–

133

.

Springer

,

Cham

.

Google Preview

Cherf

G.M.

et al. (

2012

)

Automated forward and reverse ratcheting of dna in a nanopore at 5-a precision

.

Nat. Biotechnol

.,

30

,

344

–

348

.

Chin

C.-S.

et al. (

2013

)

Nonhybrid, finished microbial genome assemblies from long-read smrt sequencing data

.

Nat. Methods

,

10

,

563

–

569

.

David

M.

et al. (

2011

).

Shrimp2: sensitive yet practical short read mapping

.

Bioinformatics

,

27

,

1011

–

1012

.

Doi

K.

et al. (

2014

)

Rapid detection of expanded short tandem repeats in personal genomics using hybrid sequencing

.

Bioinformatics

,

30

,

815

–

822

.

Eid

J.

et al. (

2009

)

Real-time dna sequencing from single polymerase molecules

.

Science

,

323

,

133

–

138

.

Eisenstein

M.

(

2012

)

Oxford nanopore announcement sets sequencing sector abuzz

.

Nat. Biotechnol

.,

30

,

295

–

296

.

English

A.C.

et al. (

2012

)

Mind the gap: upgrading genomes with pacific biosciences rs long-read sequencing technology

.

PLoS One

,

7

,

e47768.

Fan

X.

et al. (

2017

)

Hysa: a hybrid structural variant assembly approach using next-generation and single-molecule sequencing technologies

.

Genome Res

.,

27

,

793

–

800

.

Ferragina

P.

,

Manzini

G.

(

2000

)

Opportunistic data structures with applications

. In:

Proceedings 41st Annual Symposium on Foundations of Computer Science (FOCS'00)

, pp.

390

–

398

.

IEEE Computer Society, Redondo Beach

,

California, USA

.

Gnerre

S.

et al. (

2011

)

High-quality draft assemblies of mammalian genomes from massively parallel sequence data

.

Proc. Natl. Acad. Sci. USA

,

108

,

1513

–

1518

.

Gontarz

P.M.

et al. (

2013

)

SRmapper: a fast and sensitive genome-hashing alignment tool

.

Bioinformatics

,

29

,

316

–

321

.

Goodwin

S.

et al. (

2015

)

Oxford nanopore sequencing, hybrid error correction, and de novo assembly of a eukaryotic genome

.

Genome Res

.,

25

,

1750

–

1756

.

Hach

F.

et al. (

2010

)

mrsfast: a cache-oblivious algorithm for short-read mapping

.

Nat. Methods

,

7

,

576

–

577

.

Hach

F.

et al. (

2014

)

mrsfast-ultra: a compact, snp-aware mapper for high performance sequencing applications

.

Nucleic Acids Res

.,

42

,

gku370

.

Hormozdiari

F.

et al. (

2009

)

Combinatorial algorithms for structural variation detection in high-throughput sequenced genomes

.

Genome Res

.,

19

,

1270

–

1278

.

Huddleston

J.

et al. (

2014

)

Reconstructing complex regions of genomes using long-read sequencing technology

.

Genome Res

.,

24

,

688

–

696

.

Huddleston

J.

et al. (

2017

)

Discovery and genotyping of structural variation from long-read haploid genome sequence data

.

Genome Res

.,

27

,

677

–

685

.

Koren

S.

et al. (

2012

)

Hybrid error correction and de novo assembly of single-molecule sequencing reads

.

Nat. Biotechnol

.,

30

,

693

–

700

.

Koren

S.

et al. (

2013

)

Reducing assembly complexity of microbial genomes with single-molecule sequencing

.

Genome Biol

.,

14

,

R101.

Korlach

J.

et al. (

2010

)

Real-time dna sequencing from single polymerase molecules

.

Methods Enzymol

.,

472

,

431

–

455

.

Langmead

B.

,

Salzberg

S.L.

(

2012

)

Fast gapped-read alignment with Bowtie 2

.

Nat. Methods

,

9

,

357

–

359

.

Li

H.

(

2012

)

Exploring single-sample snp and indel calling with whole-genome de novo assembly

.

Bioinformatics

,

28

,

1838

–

1844

.

Li

H.

(

2013

)

Aligning sequence reads, clone sequences and assembly contigs with bwa-mem

.

arXiv

,

1303

,

3997

.

Li

H.

(

2018

)

Minimap2: pairwise alignment for nucleotide sequences

.

Bioinformatics

,

1

,

7

.

Li

H.

,

Durbin

R.

(

2009

)

Fast and accurate short read alignment with Burrows-Wheeler transform

.

Bioinformatics

,

25

,

1754

–

1760

.

Li

R.

et al. (

2009

)

SOAP2: an improved ultrafast tool for short read alignment

.

Bioinformatics

,

25

,

1966

–

1967

.

Lin

H.

et al. (

2008

)

Zoom! zillions of oligos mapped

.

Bioinformatics

,

24

,

2431

–

2437

.

Liu

B.

et al. (

2016

)

rhat: fast alignment of noisy long reads with regional hashing

.

Bioinformatics

,

32

,

1625

–

1631

.

Liu

B.

et al. (

2017

)

Lamsa: fast split read alignment with long approximate matches

.

Bioinformatics

,

33

,

192

–

201

.

Loman

N.J.

et al. (

2015

)

A complete bacterial genome assembled de novo using only nanopore sequencing data

.

Nat. Methods

,

12

,

733

–

735

.

Manber

U.

,

Myers

G.

(

1993

)

Suffix arrays: a new method for on-line string searches

.

SIAM J. Comput

.,

22

,

935

–

948

.

Manrao

E.A.

et al. (

2012

)

Reading dna at single-nucleotide resolution with a mutant MsPa nanopore and phi29 dna polymerase

.

Nat. Biotechnol

.,

30

,

349

–

353

.

Marco-Sola

S.

et al. (

2012

)

The GEM mapper: fast, accurate and versatile alignment by filtration

.

Nat. Methods

,

9

,

1185

–

1188

.

Margulies

M.

et al. (

2005

)

Genome sequencing in microfabricated high-density picolitre reactors

.

Nature

,

437

,

376

–

380

.

Myers

G.

(

1999

)

A fast bit-vector algorithm for approximate string matching based on dynamic programming

.

JACM

,

46

,

395

–

415

.

Ohlebusch

E.

,

Abouelhoda

M.I.

(

2005

)

Chaining Algorithms and Applications in Comparative Genomics

. Chapman & Hall/CRC.

Ono

Y.

et al. (

2013

)

PBSIM: PacBio reads simulator toward accurate genome assembly

.

Bioinformatics

,

29

,

119

–

121

.

O'Roak

B.J.

et al. (

2011

)

Exome sequencing in sporadic autism spectrum disorders identifies severe de novo mutations

.

Nat. Genet

.,

43

,

585

–

589

.

Otto

C.

et al. (

2011

)

Fast local fragment chaining using sum-of-pair gap costs

.

Algorithms Mol. Biol

.,

6

,

4.

Pendleton

M.

et al. (

2015

)

Assembly and diploid architecture of an individual human genome via single-molecule technologies

.

Nat. Methods

,

12

,

780

–

786

.

Rand

A.C.

et al. (

2017

)

Mapping dna methylation with high-throughput nanopore sequencing

.

Nat. Methods

,

14

,

411

–

413

.

Rang

F.J.

et al. (

2018

)

From squiggle to basepair: computational approaches for improving nanopore sequencing read accuracy

.

Genome Biol.

,

19

,

90

.

Roberts

M.

et al. (

2004

)

Reducing storage requirements for biological sequence comparison

.

Bioinformatics

,

20

,

3363

–

3369

.

Scott

D.

,

Ely

B.

(

2014

)

Comparison of genome sequencing technology and assembly methods for the analysis of a GC-rich bacterial genome

.

Curr. Microbiol

.,

70

,

1

–

7

.

Sedlazeck

F.J.

et al. (

2018

)

Accurate detection of complex structural variations using single-molecule sequencing

.

Nat. Methods

,

15

,

461

–

468

.

Shin

S.C.

et al. (

2013

)

Advantages of single-molecule real-time sequencing in high-GC content genomes

.

PLoS One

,

8

,

e68824.

Simpson

J.T.

et al. (

2017

)

Detecting DNA cytosine methylation using nanopore sequencing

.

Nat. Methods

,

14

,

407

–

410

.

Siragusa

E.

et al. (

2013

)

Fast and accurate read mapping with approximate seeds and multiple backtracking

.

Nucleic Acids Res

.,

41

,

e78.

Šošić

M.

,

Šikić

M.

(

2017

)

Edlib: a c/c++ library for fast, exact sequence alignment using edit distance

.

Bioinformatics

,

33

,

1394

–

1395

.

Sović

I.

et al. (

2016

)

Fast and sensitive mapping of nanopore sequencing reads with GraphMap

.

Nat. Commun

.,

7

,

11307

.

Thompson

J.F.

,

Milos

P.M.

(

2011

)

The properties and applications of single-molecule DNA sequencing

.

Genome Biol

.,

12

,

217.

Travers

K.J.

et al. (

2010

)

A flexible and efficient template format for circular consensus sequencing and snp detection

.

Nucleic Acids Res

.,

38

,

e159

.

Ummat

A.

,

Bashir

A.

(

2014

)

Resolving complex tandem repeats with long reads

.

Bioinformatics

,

30

,

3491

–

3498

.

Weese

D.

et al. (

2012

)

Razers 3: faster, fully sensitive read mapping

.

Bioinformatics

,

28

,

2592

–

2599

.

Xin

H.

et al. (

2013

)

Accelerating read mapping with fastHASH

.

BMC Genomics

,

14 (Suppl. 1)

,

S13.