iFeature: a Python package and web server for features extraction and selection from protein and peptide sequences
- PMID: 29528364
- PMCID: PMC6658705
- DOI: 10.1093/bioinformatics/bty140
iFeature: a Python package and web server for features extraction and selection from protein and peptide sequences
Abstract
Summary: Structural and physiochemical descriptors extracted from sequence data have been widely used to represent sequences and predict structural, functional, expression and interaction profiles of proteins and peptides as well as DNAs/RNAs. Here, we present iFeature, a versatile Python-based toolkit for generating various numerical feature representation schemes for both protein and peptide sequences. iFeature is capable of calculating and extracting a comprehensive spectrum of 18 major sequence encoding schemes that encompass 53 different types of feature descriptors. It also allows users to extract specific amino acid properties from the AAindex database. Furthermore, iFeature integrates 12 different types of commonly used feature clustering, selection and dimensionality reduction algorithms, greatly facilitating training, analysis and benchmarking of machine-learning models. The functionality of iFeature is made freely available via an online web server and a stand-alone toolkit.
Availability and implementation: http://iFeature.erc.monash.edu/; https://github.com/Superzchen/iFeature/.
Supplementary information: Supplementary data are available at Bioinformatics online.
Similar articles
-
protr/ProtrWeb: R package and web server for generating various numerical representation schemes of protein sequences.Bioinformatics. 2015 Jun 1;31(11):1857-9. doi: 10.1093/bioinformatics/btv042. Epub 2015 Jan 24. Bioinformatics. 2015. PMID: 25619996
-
POSSUM: a bioinformatics toolkit for generating numerical sequence feature descriptors based on PSSM profiles.Bioinformatics. 2017 Sep 1;33(17):2756-2758. doi: 10.1093/bioinformatics/btx302. Bioinformatics. 2017. PMID: 28903538
-
propy: a tool to generate various modes of Chou's PseAAC.Bioinformatics. 2013 Apr 1;29(7):960-2. doi: 10.1093/bioinformatics/btt072. Epub 2013 Feb 19. Bioinformatics. 2013. PMID: 23426256
-
FEPS: A Tool for Feature Extraction from Protein Sequence.Methods Mol Biol. 2022;2499:65-104. doi: 10.1007/978-1-0716-2317-6_3. Methods Mol Biol. 2022. PMID: 35696075
-
iLearn: an integrated platform and meta-learner for feature engineering, machine-learning analysis and modeling of DNA, RNA and protein sequence data.Brief Bioinform. 2020 May 21;21(3):1047-1057. doi: 10.1093/bib/bbz041. Brief Bioinform. 2020. PMID: 31067315
Cited by
-
ACVPICPred: Inhibitory activity prediction of anti-coronavirus peptides based on artificial neural network.Comput Struct Biotechnol J. 2024 Oct 2;23:3625-3633. doi: 10.1016/j.csbj.2024.09.015. eCollection 2024 Dec. Comput Struct Biotechnol J. 2024. PMID: 39469670 Free PMC article.
-
Neoantigen immunogenicity landscapes and evolution of tumor ecosystems during immunotherapy with nivolumab.Nat Med. 2024 Sep 30. doi: 10.1038/s41591-024-03240-y. Online ahead of print. Nat Med. 2024. PMID: 39349627
-
DeepPBI-KG: a deep learning method for the prediction of phage-bacteria interactions based on key genes.Brief Bioinform. 2024 Sep 23;25(6):bbae484. doi: 10.1093/bib/bbae484. Brief Bioinform. 2024. PMID: 39344712 Free PMC article.
-
Advances in Computational Intelligence-Based Methods of Structure and Function Prediction of Proteins.Biomolecules. 2024 Aug 29;14(9):1083. doi: 10.3390/biom14091083. Biomolecules. 2024. PMID: 39334850 Free PMC article.
-
Current computational tools for protein lysine acylation site prediction.Brief Bioinform. 2024 Sep 23;25(6):bbae469. doi: 10.1093/bib/bbae469. Brief Bioinform. 2024. PMID: 39316944
References
-
- Bellman R.E. (1961) Adaptive Control Processes: A Guided Tour. Princeton University Press, Princeton, NJ.
-
- Bhasin M., Raghava G.P. (2004) Classification of nuclear receptors based on amino acid composition and dipeptide composition. J. Biol. Chem., 279, 23262–23266. - PubMed
-
- Cao D.S. et al. (2013) propy: a tool to generate various modes of Chou’s PseAAC. Bioinformatics, 29, 960–962. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources