Automatic methods for predicting functionally important residues.

DEL SOL MESA, Antonio; Pazos, Florencio; Valencia, Alfonso

doi:10.1016/S0022-2836(02)01451-1

No full text

Article (Scientific journals)

Automatic methods for predicting functionally important residues.

DEL SOL MESA, Antonio; Pazos, Florencio; Valencia, Alfonso

2003 • In Journal of Molecular Biology, 326 (4), p. 1289-302

Peer reviewed

Permalink
https://hdl.handle.net/10993/17852

DOI
10.1016/S0022-2836(02)01451-1

PubMed
12589769

Files (0)Send to Details Statistics Bibliography Similar publications

Files

Full Text

No document available.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Keywords :

Algorithms; Models, Molecular; Models, Theoretical; Multigene Family; Phylogeny; Proteins/chemistry; Proto-Oncogene Proteins p21(ras)/chemistry; Sequence Analysis, Protein

Abstract :

[en] Sequence analysis is often the first guide for the prediction of residues in a protein family that may have functional significance. A few methods have been proposed which use the division of protein families into subfamilies in the search for those positions that could have some functional significance for the whole family, but at the same time which exhibit the specificity of each subfamily ("Tree-determinant residues"). However, there are still many unsolved questions like the best division of a protein family into subfamilies, or the accurate detection of sequence variation patterns characteristic of different subfamilies. Here we present a systematic study in a significant number of protein families, testing the statistical meaning of the Tree-determinant residues predicted by three different methods that represent the range of available approaches. The first method takes as a starting point a phylogenetic representation of a protein family and, following the principle of Relative Entropy from Information Theory, automatically searches for the optimal division of the family into subfamilies. The second method looks for positions whose mutational behavior is reminiscent of the mutational behavior of the full-length proteins, by directly comparing the corresponding distance matrices. The third method is an automation of the analysis of distribution of sequences and amino acid positions in the corresponding multidimensional spaces using a vector-based principal component analysis. These three methods have been tested on two non-redundant lists of protein families: one composed by proteins that bind a variety of ligand groups, and the other composed by proteins with annotated functionally relevant sites. In most cases, the residues predicted by the three methods show a clear tendency to be close to bound ligands of biological relevance and to those amino acids described as participants in key aspects of protein function. These three automatic methods provide a wide range of possibilities for biologists to analyze their families of interest, in a similar way to the one presented here for the family of proteins related with ras-p21.

Disciplines :

Life sciences: Multidisciplinary, general & others

Author, co-author :

DEL SOL MESA, Antonio ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB)

Pazos, Florencio

Valencia, Alfonso

Language :

English

Title :

Automatic methods for predicting functionally important residues.

Publication date :

2003

Journal title :

Journal of Molecular Biology

ISSN :

0022-2836

Volume :

326

Issue :

Pages :

1289-302

Peer reviewed :

Peer reviewed

Available on ORBilu :

since 03 September 2014

Statistics

Number of views

191 (3 by Unilu)

Number of downloads

0 (0 by Unilu)

More statistics

Scopus citations^®

182

Scopus citations^®
without self-citations

165

OpenAlex citations

207

WoS citations^™

178