Article (Périodiques scientifiques)
A procedure to recruit members to enlarge protein family databases--the building of UECOG (UniRef-Enriched COG Database) as a model.
Fernandes, G. R.; BARBOSA DA SILVA, Adriano; Prosdocimi, F. et al.
2008In Genetics and Molecular Research, 7 (3), p. 910-24
Peer reviewed vérifié par ORBi
 

Documents


Texte intégral
uecog.pdf
Postprint Éditeur (887.57 kB)
Télécharger

Tous les documents dans ORBilu sont protégés par une licence d'utilisation.

Envoyer vers



Détails



Mots-clés :
Computational Biology/methods; Databases, Protein; Reproducibility of Results
Résumé :
[en] A procedure to recruit members to enlarge protein family databases is described here. The procedure makes use of UniRef50 clusters produced by UniProt. Current family entries are used to recruit additional members based on the UniRef50 clusters to which they belong. Only those additional UniRef50 members that are not fragments and whose length is within a restricted range relative to the original entry are recruited. The enriched dataset is then limited to contain only genomes from selected clades. We used the COG database - used for genome annotation and for studies of phylogenetics and gene evolution - as a model. To validate the method, a UniRef-Enriched COG0151 (UECOG) was tested with distinct procedures to compare recruited members with the recruiters: PSI-BLAST, secondary structure overlap (SOV), Seed Linkage, COGnitor, shared domain content, and neighbor-joining single-linkage, and observed that the former four agree in their validations. Presently, the UniRef50-based recruitment procedure enriches the COG database for Archaea, Bacteria and its subgroups Actinobacteria, Firmicutes, Proteobacteria, and other bacteria by 2.2-, 8.0-, 7.0-, 8.8-, 8.7-, and 4.2-fold, respectively, in terms of sequences, and also considerably increased the number of species.
Disciplines :
Biotechnologie
Auteur, co-auteur :
Fernandes, G. R.
BARBOSA DA SILVA, Adriano ;  University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB)
Prosdocimi, F.
Pena, I. A.
Santana-Santos, L.
Coelho Junior, O.
Barbosa-Silva, A.
Velloso, H. M.
Mudado, M. A.
Natale, D. A.
Faria-Campos, A. C.
Aguiar, S. C. V.
Ortega, J. M.
Plus d'auteurs (3 en +) Voir moins
Co-auteurs externes :
yes
Langue du document :
Anglais
Titre :
A procedure to recruit members to enlarge protein family databases--the building of UECOG (UniRef-Enriched COG Database) as a model.
Date de publication/diffusion :
2008
Titre du périodique :
Genetics and Molecular Research
ISSN :
1676-5680
Maison d'édition :
Fundacao de Pesquisas Cientificas de Ribeirao Preto, Brésil
Volume/Tome :
7
Fascicule/Saison :
3
Pagination :
910-24
Peer reviewed :
Peer reviewed vérifié par ORBi
Focus Area :
Systems Biomedicine
Disponible sur ORBilu :
depuis le 13 avril 2016

Statistiques


Nombre de vues
276 (dont 3 Unilu)
Nombre de téléchargements
190 (dont 0 Unilu)

citations Scopus®
 
6
citations Scopus®
sans auto-citations
1
citations OpenAlex
 
9
citations WoS
 
5

Bibliographie


Publications similaires



Contacter ORBilu