MutaFrame—an interpretative visualization framework for deleteriousness prediction of missense variants in the human exome

Ancien, François; Pucci, Fabrizio; Vranken, Wim; Rooman, Marianne

doi:10.1093/bioinformatics/btab453

Abstract

Motivation

High-throughput experiments are generating ever increasing amounts of various -omics data, so shedding new light on the link between human disorders, their genetic causes and the related impact on protein behavior and structure. While numerous bioinformatics tools now exist that predict which variants in the human exome cause diseases, few tools predict the reasons why they might do so. Yet, understanding the impact of variants at the molecular level is a prerequisite for the rational development of targeted drugs or personalized therapies.

Results

We present the updated MutaFrame webserver, which aims to meet this need. It offers two deleteriousness prediction softwares, DEOGEN2 and SNPMuSiC, and is designed for bioinformaticians and medical researchers who want to gain insights into the origins of monogenic diseases. It contains information at two levels for each human protein: its amino acid sequence and its three-dimensional structure; we used the experimental structures whenever available, and modeled structures otherwise. MutaFrame also includes higher-level information, such as protein essentiality and protein–protein interactions. It has a user-friendly interface for the interpretation of results and a convenient visualization system for protein structures, in which the variant positions introduced by the user and other structural information are shown. In this way, MutaFrame aids our understanding of the pathogenic processes caused by single-site mutations and their molecular and contextual interpretation.

Availability and implementation

Mutaframe webserver at http://mutaframe.com/.

Supplementary information

Supplementary data are available at Bioinformatics online.

Whereas the amount of genetic data obtained through high-throughput sequencing experiments has exploded in the last twenty years (1000 Genomes Project Consortium, 2015), it remains challenging to accurately predict and interpret how some gene variants lead to diseases, which are often caused by changes in the protein(s) the gene encodes (Andreoletti et al., 2019). Especially difficult to predict are the changes these variants cause at the level of protein behavior, which can often explain the pathogenic mechanisms involved and allows optimizing the rational development of targeted drugs. Multiple bioinformatics tools have been developed to classify variants in the human exome as deleterious or neutral (Chen et al., 2020; Livesey and Marsh, 2020), but their explanatory power remains limited.

We present a substantial extension of the Mutaframe webserver (Raimondi et al., 2017), which is designed to improve the interpretability of such protein-level predictions via an easy-to-use graphical interface (Fig. 1). The new version features two complementary state-of-the-art predictors, DEOGEN2 (Raimondi et al., 2016, 2017) and SNPMuSiC (Ancien et al., 2018). DEOGEN2 is a protein sequence-based predictor that utilizes evolutionary information as well as contextual information, such as the relevance of the gene containing the variant or the interactions of the encoded protein. SNPMuSiC uses as input experimental or modeled three-dimensional (3D) protein structures and predicts deleterious variants on the basis of the changes in stability these cause.

Fig. 1.

Open in new tab Download slide

The capabilities of the MutaFrame webserver at the mapping (a), variant effect interpretation (b) and 3D structure visualization (c) levels (Supplementary Material)

Combining these two predictors, which already individually have good performances (Chen et al., 2020; Livesey and Marsh, 2020), yields a consensus predictor with a balanced accuracy of 92% and a positive predictive value of 97% on 80% of the variants. Moreover, the combination of the explanatory power of DEOGEN2 in terms of evolutionary and contextual features, and of SNPMuSiC in terms of structure and stability, improves the contextualization of the impact that a mutation has at the protein level. For example, highly conserved residues located in the protein core, whose variants are predicted as deleterious by both DEOGEN2 and SNPMuSiC, are highly likely to be destabilizing, thus inducing (partial) unfolding of the protein. A full description of these predictors, their performance, large-scale applications and case studies related to the Niemann–Pick disease is available from Supplementary Material.

The new version of the MutaFrame server also provides additional computational and visualization utilities that help the users in the interpretation of the prediction results:

Visualization of the experimental or modeled 3D structure of the wild-type target protein, if available, and of the localization of the variant residue.
Per-residue solvent accessibility and secondary structure as well as additional information on the 3D protein structures such as the resolution of the X-ray structure or of the template used for the homology modeling.
DEOGEN2 and SNPMuSiC prediction scores of specific variants introduced by the user.
Heatmap showing the DEOGEN2 and SNPMuSiC scores of all possible variants in a target protein, both along the sequence and in the 3D protein structure.
Influence of the different features (residue conservation, protein essentiality,….) in the DEOGEN2 prediction.
Mapping between gene, protein sequence and protein structure identifiers and corresponding sequence alignments, for the entire human proteome.

Note, moreover, that all the results available on the webserver can easily be downloaded for offline analyses. In summary, MutaFrame facilitates the analysis of human variants at the molecular, evolutionary and contextual levels, thus going beyond the simple binary deleterious/benign classification. This constitutes an important asset in the clinical and biopharmaceutical fields.

Funding

This work was supported by the European Regional Development Fund and Brussels-Capital Region-Innoviris within the framework of the Operational Programme 2014–2020 [ERDF-2020 project ICITY-RDI.BRU]. F.P. and M.R. are post-doctoral Researcher and Research Director, respectively, at the F.R.S.-FNRS Fund for Scientific Research.

Conflict of Interest: none declared.

Acknowledgements

The authors thank I. Tanyalcin for his help in the technical setup of the web server.

References

1000 Genomes Project Consortium. (

2015

)

A global reference for human genetic variation

.

Nature

,

526

,

68

–

74

.

Crossref

PubMed

WorldCat

Livesey

B.J.

,

Marsh

J.A.

(

2020

)

Using deep mutational scanning data to benchmark computational phenotype predictors and identify pathogenic missense mutations

. Mol. Syst. Biol., 16, e9380.

OpenURL Placeholder Text

WorldCat

Ancien

F.

et al. (

2018

)

Prediction and interpretation of deleterious coding variants in terms of protein structural stability

.

Sci. Rep

.,

8

,

4480

.

Andreoletti

G.

et al. (

2019

)

Reports from the fifth edition of Cagi: the critical assessment of genome interpretation

.

Hum. Mutat

.,

40

,

1197

–

1201

.

Chen

H.

et al. (

2020

)

Comprehensive assessment of computational algorithms in predicting cancer driver mutations

.

Genome Biol

.,

21

,

43

.

Raimondi

D.

et al. (

2016

)

Multilevel biological characterization of exomic variants at the protein level significantly improves the identification of their deleterious effects

.

Bioinformatics

,

32

,

1797

–

1804

.

Raimondi

D.

et al. (

2017

)

DEOGEN2: prediction and interactive visualization of single amino acid variant deleteriousness in human proteins

.

Nucleic Acids Res

.,

45

,

W201

–

W206

.

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com

Associate Editor:

Download all slides

Month:	Total Views:
June 2021	40
July 2021	54
August 2021	65
September 2021	63
October 2021	32
November 2021	48
December 2021	92
January 2022	95
February 2022	65
March 2022	37
April 2022	54
May 2022	46
June 2022	41
July 2022	31
August 2022	25
September 2022	31
October 2022	53
November 2022	31
December 2022	13
January 2023	31
February 2023	24
March 2023	17
April 2023	17
May 2023	24
June 2023	14
July 2023	21
August 2023	22
September 2023	20
October 2023	15
November 2023	27
December 2023	22
January 2024	51
February 2024	15
March 2024	49
April 2024	10

Article Contents

MutaFrame—an interpretative visualization framework for deleteriousness prediction of missense variants in the human exome

Abstract

Funding

Acknowledgements

References

Supplementary data

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

Looking for your next opportunity?

Article Contents

MutaFrame—an interpretative visualization framework for deleteriousness prediction of missense variants in the human exome

Abstract

Funding

Acknowledgements

References

Supplementary data

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

Looking for your next opportunity?

This Feature Is Available To Subscribers Only