This package contains various tools, datasets and ML model implementations from the field of (phospho-)proteomics. It is intended to facilitate the testing and comparison of different neural network architectures and existing models, using the same datasets. Both retention time and fragmentation (MSMS) models are included.
Implementations of existing models from the literature are intended to be modifiable and extendable so that, for example, tests may be carried out with different peptide input lengths.
The latest release of aiproteomics can be installed from the Python Package Index using pip as follows:
pip install aiproteomics
Alternatively, the latest version of the source code can be installed using pip after cloning the repository:
git clone git@github.com:aiproteomics/aiproteomics.git
cd aiproteomics/
pip install .
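To confirm that either installation route worked, you can ask Python which version of the distribution it sees. The snippet below is a minimal sketch using only the standard library (`importlib.metadata`, Python 3.8+); the helper name `installed_version` is hypothetical and is not part of aiproteomics.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name):
    """Return the installed version string of a distribution, or None if absent.

    Hypothetical helper for illustration; it is not part of aiproteomics.
    """
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


# Prints a version string if the package is installed, otherwise None.
print(installed_version("aiproteomics"))
```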
After installation, you can try out the demo notebooks by following the instructions here.
This package is in the process of being redesigned to make it more general and portable. The redesign is focussing on the creation of:
- Generators of models (in the open and portable ONNX format)
- Converters from .msp format to input for each model type
- Converters from each model type to .msp
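To give a feel for what the .msp converters have to deal with, here is a minimal sketch of reading NIST-style .msp text into Python dictionaries. It is not the aiproteomics implementation: the function name `parse_msp` and the returned structure are assumptions made for illustration, the example spectra contain made-up values, and only the common layout is handled (a `Name:` line, optional `Key: value` header lines, a `Num peaks:` line, then one peak per line as m/z, intensity and an optional annotation).

```python
def parse_msp(text):
    """Parse NIST-style .msp text into a list of spectrum dicts.

    Hypothetical helper for illustration; it is not part of aiproteomics.
    Each dict holds the lower-cased header fields as strings plus a
    "peaks" list of (mz, intensity) float tuples.
    """
    spectra = []
    entry = None
    in_peaks = False
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue  # blank lines only separate entries
        if line.lower().startswith("name:"):
            # A Name line always starts a new spectrum.
            entry = {"name": line.split(":", 1)[1].strip(), "peaks": []}
            spectra.append(entry)
            in_peaks = False
        elif entry is None:
            continue  # ignore anything before the first Name line
        elif in_peaks:
            # Peak line: m/z, intensity, then an optional annotation.
            mz, intensity = line.split()[:2]
            entry["peaks"].append((float(mz), float(intensity)))
        else:
            key, _, value = line.partition(":")
            if key.strip().lower() == "num peaks":
                in_peaks = True  # every following line is a peak
            else:
                entry[key.strip().lower()] = value.strip()
    return spectra


# Illustrative input only; the values are not real measurements.
EXAMPLE = """\
Name: AAAK/2
MW: 359.2
Comment: Parent=180.6
Num peaks: 3
147.11\t1000.0\t"y1"
218.15\t550.5\t"y2"
289.19\t120.0\t"y3"

Name: PEPTIDEK/2
Num peaks: 1
147.11 999.0 "y1"
"""

spectra = parse_msp(EXAMPLE)
for spectrum in spectra:
    print(spectrum["name"], len(spectrum["peaks"]))
```

A converter to model input would then map each entry (peptide sequence, charge, peak list) onto whatever encoding the chosen model type expects, and the reverse converter would write model predictions back out in the same layout.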
Below is a diagram showing how the proposed tools will be combined to produce a pipeline for training proteomics models and using them to generate synthetic spectral libraries:
If you want to contribute to the development of aiproteomics, have a look at the contribution guidelines.