Abstract
Motivation: Solubility is one of the fundamental protein properties. It is of great interest because of its relevance to protein expression. Reduced solubility and protein aggregation are also associated with many diseases. Results: We collected from literature the largest experimentally verified solubility affecting amino acid substitution (AAS) dataset and used it to train a predictor called PON-Sol. The predictor can distinguish both solubility decreasing and increasing variants from those not affecting solubility. PONSol has normalized correct prediction ratio of 0.491 on cross-validation and 0.432 for independent test set. The performance of the method was compared both to solubility and aggregation predictors and found to be superior. PON-Sol can be used for the prediction of effects of disease-related substitutions, effects on heterologous recombinant protein expression and enhanced crystallizability. One application is to investigate effects of all possible AASs in a protein to aid protein engineering.
Original language | English |
---|---|
Pages (from-to) | 2032-2034 |
Journal | Bioinformatics |
Volume | 32 |
Issue number | 13 |
DOIs | |
Publication status | Published - 2016 Jul 1 |
Subject classification (UKÄ)
- Other Basic Medicine
- Medicinal Chemistry