Title
|
|
Nearest-neighbor classifier as a tool for classification of protein families |
Authors
|
Mona Chaurasiya1*, Gohel Bakul Chandulal1, Krishna Misra1*, Vivek Kumar Chaurasiya2 | |
Affiliation
|
1 Indian Institute of Information Technology, Allahabad, India; 2 Indian Institute of Technology, Roorkee, India
| |
|
| |
Article Type
|
Hypothesis | |
Date
|
Received December 07, 2009; Revised January 30, 2010; accepted November 13, 2010; published March 31, 2010 | |
Abstract |
Knowledge about protein function is essential in understanding the biological processes. A specific class or family of protein shares common structural and chemical properties amongst its member sequences. The set of properties that display its unique characteristics for clearly classifying a protein sequence into its corresponding protein family needs to be studied. Our study of these important properties conducted on four major classes of proteins namely Globins, Homeoboxes, Heat Shock proteins (HSP) and Kinase have shown that frequency of twenty naturally occurring amino acids, hydrophobic content of protein, molecular weight of protein, isoelectric point of protein, secondary structure composition of amino acid residues as helices, coils and sheets and the composition of helices, coils and sheets in the secondary structure topology plays a significant role in correctly classifying the protein into its corresponding class or family as indicated by the overall efficiency of Nearest Neighbor Classifier as 84.92%.
| |
Keywords |
proteins, family, classification, classifier
| |
Citation
|
Chaurasiya et al., Bioinformation 4(9): 396-398 (2010)
| |
Edited by
|
P. Kangueane
| |
ISSN
|
0973-2063
| |
Publisher
|
||
License
|
This is an Open Access article which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. This is distributed under the terms of the Creative Commons Attribution License. |