[en] The majority of missense variants in clinical genetic tests are classified as variants of uncertain significance. Prior research has shown that the deleterious effects and the subsequent molecular consequence of variants are often conserved among paralogous protein sequences within a gene family. Here, we systematically quantified on an exome-wide scale if the existence of pathogenic variants in paralogous genes at a conserved position could serve as evidence for the pathogenicity of a new variant. For the gene family of voltage-gated sodium channels where variants and expert-curated clinical phenotypes were available, we also assessed whether phenotype patterns of multiple disorders for each gene were also conserved across variant positions within the gene family.
Methods
We developed a framework that assesses the presence of pathogenic missense variants located in conserved residues across paralogous genes. We systematically mapped 2.5 million pathogenic and general population variants from the ClinVar, HGMD, and gnomAD databases onto a total of 9,990 genes and aligned them by gene families. We evaluated the quantity of classifiable amino acids by utilizing pathogenic variants identified in databases alone and then compared this assessment to the inclusion of paralogous pathogenic variants. We validated and quantified the evidence of conserved pathogenic paralogous variants in variant pathogenicity classification.
Results
Considering conserved pathogenic variants in paralogous genes, increased the number of classifiable variants 2.8-fold across the exome, compared to pathogenic variants in the gene of interest alone. The presence of a pathogenic variant in a paralogous gene is associated with a positive likelihood ratio of 8.32 for variant pathogenicity. The likelihood ratio was gene family-specific. Across ten genes encoding voltage-gated sodium channels and 22 expert-curated disorders, we identified cross-paralog correlated phenotypes based on 3D structure spatial position. For example, the established loss-of-function disorders <italic>SCN1A</italic>-associated Dravet syndrome, <italic>SCN2A-</italic>associated autism, <italic>SCN5A</italic>-associated Brugarda Syndrome, and <italic>SCN8A-</italic>associated neurodevelopmental disorder without seizures were correlated in their spatial variant position on structure. Finally, we show that phenotype integration in paralog variant selection improves variant classification.
Conclusion
Our results show that paralogous variants, in particular with phenotype information can enhance our understanding of variant effects.</p>
Research center :
Luxembourg Centre for Systems Biomedicine (LCSB): Bioinformatics Core (R. Schneider Group)
Disciplines :
Genetics & genetic processes
Author, co-author :
Bruenger, Tobias; Department of Neurology, The University of Texas Health Science Center at Houston, Houston, TX
Ivanuk, Alina; Department of Neurology, Mayo Clinic Florida, Jacksonville, FL
Pérez-Palma, Eduardo; Universidad del Desarrollo, Centro de Genética y Genómica, Facultad de Medicina Clínica Alemana. Santiago
Montanucci, Ludovica; Department of Neurology, The University of Texas Health Science Center at Houston, Houston, TX
Cohen, Stacey; Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA
Smith, Lacey; Epilepsy Genetics Program, Division of Epilepsy and Clinical Neurophysiology, Department of Neurology, Boston Children's Hospital, Boston, MA
Parthasarathy, Shridhar; Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA
Helbig, Ingo; Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA
Nothnagel, Michael; Cologne Center for Genomics (CCG), University of Cologne, Cologne
MAY, Patrick ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Bioinformatics Core
Lal, Dennis; Department of Neurology, The University of Texas Health Science Center at Houston, Houston, TX
Language :
English
Title :
Conserved missense variant pathogenicity and correlated phenotypes across paralogous genes
FNR16394868 - Epileptogenesis Of Genetic Epilepsies, 2021 (01/10/2021-...) - Alexander Skupin
Name of the research project :
U-AGR-8268 - Treat-ION 2 - MAY Patrick
Funders :
BMBF - Federal Ministry of Education and Research FNR - Fonds National de la Recherche
Funding text :
Page 16/23
Not applicable
Availability of data and materials
Data is available in the Supplementary Tables. Code to annotate para-PS1/PM5 criteria with a custom
variant dataset for any gene family can be obtained from https://github.com/TobiasBruenger/paraPS1-
PM5-annotation.
Competing interests
The authors report no conicts of interest.
Funding
Funding for this work was provided by the German Federal Ministry for Education and Research (BMBF,
Treat-ION, 01GM1907D) to D.L., T.B., and P.M., by the BMBF (Treat-Ion2, 01GM2210B) to P.M, the Fonds
Nationale de la Recherche in Luxembourg (FNR, Research Unit FOR-2715, INTER/DFG/21/16394868
MechEPI2) to P.M., the Agencia Nacional de Investigación y Desarrollo de Chile (ANID, Fondecyt 1221464
grant) to E.P., the Familie SCN2A foundation 2020 Action Potential Grant to E.P., the Dravet Syndrome
Foundation (grant number, 272016) to D.L, and the NIH NINDS (Channelopathy-Associated Epilepsy
Research Center, 5-U54-NS108874) to D.L.