PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms

Yang Yang, Aibin Shao, Mauno Vihinen

Research output: Contribution to journalArticlepeer-review

Abstract

Genetic variations are investigated in human and many other organisms for many purposes (e.g., to aid in clinical diagnosis). Interpretation of the identified variations can be challenging. Although some dedicated prediction methods have been developed and some tools for human variants can also be used for other organisms, the performance and species range have been limited. We developed a novel variant pathogenicity/tolerance predictor for amino acid substitutions in any organism. The method, PON-All, is a machine learning tool trained on human, animal, and plant variants. Two versions are provided, one with Gene Ontology (GO) annotations and another without these details. GO annotations are not available or are partial for many organisms of interest. The methods provide predictions for three classes: pathogenic, benign, and variants of unknown significance. On the blind test, when using GO annotations, accuracy was 0.913 and MCC 0.827. When GO features were not used, accuracy was 0.856 and MCC 0.712. The performance is the best for human and plant variants and somewhat lower for animal variants because the number of known disease-causing variants in animals is rather small. The method was compared to several other tools and was found to have superior performance. PON-All is freely available at http://structure.bmc.lu.se/PON-All and http://8.133.174.28:8999/.

Original languageEnglish
Article number867572
JournalFrontiers in Molecular Biosciences
Volume9
DOIs
Publication statusPublished - 2022 Jun 16

Subject classification (UKÄ)

  • Genetics

Free keywords

  • amino acid substitution
  • animal variants
  • machine learning
  • mutation
  • pathogenicity
  • plant variants
  • prediction
  • variation interpretation

Fingerprint

Dive into the research topics of 'PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms'. Together they form a unique fingerprint.

Cite this