Reference : Identification and Characterization of Variant Intolerant Sites across Human Protein ...
Scientific congresses, symposiums and conference proceedings : Paper published in a journal
Life sciences : Genetics & genetic processes
Systems Biomedicine
Identification and Characterization of Variant Intolerant Sites across Human Protein 3-Dimensional Structures
Iqbal, Sumaiya [> >]
Berg Jespersen, Jakob [> >]
Perez-Palma, Eduardo [> >]
May, Patrick mailto [University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) >]
Heyne, Henrike [> >]
Lage, Kasper [> >]
Steensbjerre Møller, Rikke [> >]
Wagner, Florence F. [> >]
Daly, Mark [> >]
Campbell, Arthur J. [> >]
Lal, Dennis [> >]
Biophysical Journal
Biophysical Society
3, Suppl. 1
Yes (verified by ORBilu)
Biophysical Society Meeting
San Francisco
[en] 3D structure ; variants ; genetics
[en] The functional interpretation of genetic variation in disease-associated genes is far outpaced by data generation. Existing algorithms for prediction of variant consequences do not adequately distinguish pathogenic variants from benign rare variants. This lack of statistical and bioinformatics analyses, accompanied by an ever-increasing number of identified variants in biomedical
research and clinical applications, has become a major challenge. Established methods to predict the functional effect of genetic variation use the degree of amino acid conservation across species in linear protein sequence alignment. More recent methods include the spatial distribution pattern of known patient and control variants. Here, we propose to combine the linear conservation and spatial constrained based scores to devise a novel score that incorporates 3-dimensional structural properties of amino acid residues, such as the solvent-accessible surface area, degree of flexibility, secondary structure propensity and binding tendency, to quantify the effect of amino acid substitutions. For this study, we develop a framework for large-scale mapping of established linear sequence-based paralog and ortholog conservation scores onto the tertiary structures of human proteins. This framework can be utilized to map the spatial distribution of mutations on solved protein structures as well as homology models. As a proof of concept, using a homology
model of the human Nav1.2 voltage-gated sodium channel structure, we observe spatial clustering in distinct domains of mutations, associated with Autism Spectrum Disorder (>20 variants) and Epilepsy (>100 variants), that exert opposing effects on channel function. We are currently characterizing all variants (>300k individuals) found in ClinVar, the largest disease variant database, as well as variants identified in >140k individuals from general population. The variant mapping framework and our score, informed with structural information, will be useful in identifying structural motifs of proteins associated with disease risk.
Luxembourg Centre for Systems Biomedicine (LCSB): Bioinformatics Core (R. Schneider Group)

File(s) associated to this reference

Fulltext file(s):

Limited access
PIIS0006349517348154.pdfPublisher postprint46.08 kBRequest a copy

Bookmark and Share SFX Query

All documents in ORBilu are protected by a user license.