TY - JOUR
T1 - A gold standard set of mechanistically diverse enzyme superfamilies
AU - Brown, Shoshana D.
AU - Gerlt, John A.
AU - Seffernick, Jennifer L.
AU - Babbitt, Patricia
PY - 2006/1/31
Y1 - 2006/1/31
AB - Superfamily and family analyses provide an effective tool for the functional classification of proteins, but must be automated for use on large datasets. We describe a 'gold standard' set of enzyme superfamilies, clustered according to specific sequence, structure, and functional criteria, for use in the validation of family and superfamily clustering methods. The gold standard set represents four fold classes and differing clustering difficulties, and includes five superfamilies, 91 families, 4,887 sequences and 282 structures.
UR - http://www.scopus.com/inward/record.url?scp=33745027619&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33745027619&partnerID=8YFLogxK
U2 - 10.1186/gb-2006-7-1-r8
DO - 10.1186/gb-2006-7-1-r8
M3 - Article
C2 - 16507141
AN - SCOPUS:33745027619
SN - 1465-6914
VL - 7
JO - Genome Biology
JF - Genome Biology
IS - 1
M1 - R8
ER -