[en] BACKGROUND: The majority of missense variants in clinical genetic tests are classified as variants of uncertain significance. Prior research shows that the deleterious effects and the subsequent molecular consequences of variants are often conserved among paralogous protein sequences within a gene family. Here, we systematically quantify on an exome-wide scale whether the existence of pathogenic variants in paralogous genes at a conserved position can serve as evidence for the pathogenicity of a new variant. For the gene family of voltage-gated sodium channels, where variants and expert-curated clinical phenotypes are available, we also assess whether phenotype patterns of multiple disorders for each gene are conserved across variant positions within the gene family.
RESULTS: Mapping 590,000 pathogenic and 1.9 million population variants onto 9928 genes grouped into 2054 paralogous families increases the number of residues with classifiable evidence 5.1-fold compared with gene-specific data alone. The presence of a pathogenic variant in a paralogous gene is associated with a positive likelihood ratio of 13.0 for variant pathogenicity. Across ten genes encoding voltage-gated sodium channels and 22 expert-curated disorders, we identify cross-paralog correlated phenotypes based on 3D structure spatial position. For example, multiple established loss-of-function related disorders across SCN1A, SCN2A, SCN5A, and SCN8A show overlapping spatial variant clusters. Finally, we show that phenotype integration in paralog variant selection improves variant classification.
CONCLUSION: Conserved pathogenic missense variants in paralogous genes provide robust, quantifiable support for clinical variant interpretation, and phenotype-informed mapping further improves predictions.
Research center :
Luxembourg Centre for Systems Biomedicine (LCSB): Bioinformatics Core (R. Schneider Group)
Disciplines :
Genetics & genetic processes
Author, co-author :
Brünger, Tobias; Cologne Center for Genomics (CCG), University of Cologne, Cologne, 50931, Germany ; Department of Neurology, The University of Texas Health Science Center at Houston, Houston, TX, USA
Ivaniuk, Alina; Department of Neurology, Mayo Clinic in Florida, Jacksonville, Fl, 32224, USA
Pérez-Palma, Eduardo; Centro de Genética y Genómica, Facultad de Medicina Clínica Alemana, Universidad del Desarrollo, Santiago, Chile
Montanucci, Ludovica; Department of Neurology, The University of Texas Health Science Center at Houston, Houston, TX, USA
Cohen, Stacey; Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA, 19104, USA ; The Epilepsy NeuroGenetics Initiative (ENGIN), Children's Hospital of Philadelphia, Philadelphia, PA, USA ; Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, 19104, USA
Smith, Lacey; Epilepsy Genetics Program, Division of Epilepsy and Clinical Neurophysiology, Department of Neurology, Boston Children's Hospital, Boston, MA, USA
Parthasarathy, Shridhar; Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA, 19104, USA ; The Epilepsy NeuroGenetics Initiative (ENGIN), Children's Hospital of Philadelphia, Philadelphia, PA, USA ; Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, 19104, USA
Helbig, Ingo; Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA, 19104, USA ; The Epilepsy NeuroGenetics Initiative (ENGIN), Children's Hospital of Philadelphia, Philadelphia, PA, USA ; Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, 19104, USA ; Department of Neurology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
Nothnagel, Michael; Cologne Center for Genomics (CCG), University of Cologne, Cologne, 50931, Germany
MAY, Patrick ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Bioinformatics Core
Lal, Dennis; Cologne Center for Genomics (CCG), University of Cologne, Cologne, 50931, Germany. dennis.lal@uth.tmc.edu ; Department of Neurology, The University of Texas Health Science Center at Houston, Houston, TX, USA. dennis.lal@uth.tmc.edu ; Stanley Center of Psychiatric Research, Broad Institute of Harvard and MIT, 75 Ames Street, Cambridge, MA, 02142, USA. dennis.lal@uth.tmc.edu ; Epilepsy Center, Neurological Institute, Cleveland Clinic, Cleveland, OH, 44106, USA. dennis.lal@uth.tmc.edu ; Center for Neurogenetics, The University of Texas Health Science Center at Houston, Houston, TX, USA. dennis.lal@uth.tmc.edu
External co-authors :
yes
Language :
English
Title :
Conserved missense variant pathogenicity and correlated phenotypes across paralogous genes.
FNR16394868 - MechEpi-2 - Epileptogenesis Of Genetic Epilepsies, 2021 (01/01/2022-31/12/2024) - Alexander Skupin
Name of the research project :
U-AGR-7124 - INTER/DFG/21/16394868/MechEPI2 - SKUPIN Alexander U-AGR-8268 - Treat-ION 2 - MAY Patrick R-AGR-3917 - Treat-ION_EKUT - KRAUSE Roland
Funders :
Bundesministerium für Bildung und Forschung Fonds National de la Recherche Luxembourg Agencia Nacional de Investigación y Desarrollo SCN2A Foundation Dravet Syndrome Foundation National Institute of Neurological Disorders and Stroke
Funding text :
unding for this work was provided by the German Federal Ministry for Education and Research (BMBF, Treat-ION, 01GM1907D) to D.L., T.B., and P.M., by the BMBF (Treat-Ion2, 01GM2210B) to P.M, the Fonds Nationale de la Recherche in Luxembourg (FNR, Research Unit FOR-2715, INTER/DFG/21/16394868 MechEPI2) to P.M., the Agencia Nacional de Investigación y Desarrollo de Chile (ANID, Fondecyt 1221464 grant) to E.P., the Familie SCN2A foundation 2020 Action Potential Grant to E.P., the Dravet Syndrome Foundation (grant number, 272016) to D.L, and the NIH NINDS (Channelopathy-Associated Epilepsy Research Center, 5-U54-NS108874) to D.L.