BLAST+: architecture and applications
- PMID: 20003500
- PMCID: PMC2803857
- DOI: 10.1186/1471-2105-10-421
BLAST+: architecture and applications
Abstract
Background: Sequence similarity searching is a very important bioinformatics task. While Basic Local Alignment Search Tool (BLAST) outperforms exact methods through its use of heuristics, the speed of the current BLAST software is suboptimal for very long queries or database sequences. There are also some shortcomings in the user-interface of the current command-line applications.
Results: We describe features and improvements of rewritten BLAST software and introduce new command-line applications. Long query sequences are broken into chunks for processing, in some cases leading to dramatically shorter run times. For long database sequences, it is possible to retrieve only the relevant parts of the sequence, reducing CPU time and memory usage for searches of short queries against databases of contigs or chromosomes. The program can now retrieve masking information for database sequences from the BLAST databases. A new modular software library can now access subject sequence data from arbitrary data sources. We introduce several new features, including strategy files that allow a user to save and reuse their favorite set of options. The strategy files can be uploaded to and downloaded from the NCBI BLAST web site.
Conclusion: The new BLAST command-line applications, compared to the current BLAST tools, demonstrate substantial speed improvements for long queries as well as chromosome length database sequences. We have also improved the user interface of the command-line applications.
Figures
Similar articles
-
SS-Wrapper: a package of wrapper applications for similarity searches on Linux clusters.BMC Bioinformatics. 2004 Oct 28;5:171. doi: 10.1186/1471-2105-5-171. BMC Bioinformatics. 2004. PMID: 15511296 Free PMC article.
-
BOV--a web-based BLAST output visualization tool.BMC Genomics. 2008 Sep 15;9:414. doi: 10.1186/1471-2164-9-414. BMC Genomics. 2008. PMID: 18793422 Free PMC article.
-
Database indexing for production MegaBLAST searches.Bioinformatics. 2008 Aug 15;24(16):1757-64. doi: 10.1093/bioinformatics/btn322. Epub 2008 Jun 21. Bioinformatics. 2008. PMID: 18567917 Free PMC article.
-
Finding homologs to nucleotide sequences using network BLAST searches.Curr Protoc Bioinformatics. 2002 Aug;Chapter 3:Unit 3.3. doi: 10.1002/0471250953.bi0303s00. Curr Protoc Bioinformatics. 2002. PMID: 18792938 Review.
-
Sequence Similarity Searching.Curr Protoc Protein Sci. 2019 Feb;95(1):e71. doi: 10.1002/cpps.71. Epub 2018 Aug 13. Curr Protoc Protein Sci. 2019. PMID: 30102464 Review.
Cited by
-
Deciphering the anthocyanin metabolism gene network in tea plant (Camellia sinensis) through structural equation modeling.BMC Genomics. 2024 Nov 15;25(1):1093. doi: 10.1186/s12864-024-11012-8. BMC Genomics. 2024. PMID: 39548396
-
Chromosome-level genome assembly of Indo-Pacific king mackerel (Scomberomorus guttatus).Sci Data. 2024 Nov 13;11(1):1224. doi: 10.1038/s41597-024-04110-5. Sci Data. 2024. PMID: 39537638 Free PMC article.
-
Virulence perspective genomic research unlocks the secrets of Rhizoctonia solani associated with banded sheath blight in Barnyard Millet (Echinochloa frumentacea).Front Plant Sci. 2024 Oct 28;15:1457912. doi: 10.3389/fpls.2024.1457912. eCollection 2024. Front Plant Sci. 2024. PMID: 39529934 Free PMC article.
-
Whole genome sequencing and de novo genome assembly of the Kazakh native horse Zhabe.Front Genet. 2024 Oct 21;15:1466382. doi: 10.3389/fgene.2024.1466382. eCollection 2024. Front Genet. 2024. PMID: 39529846 Free PMC article. No abstract available.
-
Genomics sequence data of a drug-resistant Pseudomonas aeruginosa producing Tripoli Metallo-β-lactamase 1 isolated from Sudan.Data Brief. 2024 Oct 16;57:111040. doi: 10.1016/j.dib.2024.111040. eCollection 2024 Dec. Data Brief. 2024. PMID: 39525649 Free PMC article.
References
-
- Altschul S, Gish W, Miller W, Myers E, Lipman D. Basic local alignment search tool. J Mol Biol. 1990;215(3):403–410. - PubMed
-
- NCBI C toolkit. http://www.ncbi.nlm.nih.gov/IEB/ToolBox/SDKDOCS/INDEX.HTML
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials