Assembling short reads from jumping libraries with large insert sizes
- PMID: 26040456
- DOI: 10.1093/bioinformatics/btv337
Abstract
Motivation: Advances in next-generation sequencing technologies and sample preparation have recently enabled the generation of high-quality jumping libraries that have the potential to significantly improve short-read assemblies. However, assembly algorithms must catch up with these experimental innovations to benefit from them and produce high-quality assemblies.
Results: We present a new algorithm that extends the recently described exSPAnder universal repeat resolution approach to several challenging data types, including jumping libraries generated by the recently developed Illumina Nextera Mate Pair protocol. We demonstrate that, with these improvements, bacterial genomes can often be assembled into a few contigs using only a single Nextera Mate Pair library of short reads.
Availability and implementation: The described algorithms are implemented in C++ as part of the SPAdes genome assembler, which is freely available at bioinf.spbau.ru/en/spades.
Contact: ap@bioinf.spbau.ru
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.