BACK TO CONTENTS   |    PDF   |    PREVIOUS   |    NEXT

Title

 

 

 

 

 

Nearest-neighbor classifier as a tool for classification of protein families

Authors

 

Mona Chaurasiya1*, Gohel Bakul Chandulal1, Krishna Misra1*, Vivek Kumar Chaurasiya2

Affiliation

 

1 Indian Institute of Information Technology, Allahabad, India; 2 Indian Institute of Technology, Roorkee, India

 

Email

 

kkmisra@yahoo.com

 

Article Type

 

Hypothesis

Date

 

Received December 07, 2009; Revised January 30, 2010; accepted November 13, 2010; published March 31, 2010

Abstract

Knowledge about protein function is essential in understanding the biological processes. A specific class or family of protein shares common structural and chemical properties amongst its member sequences. The set of properties that display its unique characteristics for clearly classifying a protein sequence into its corresponding protein family needs to be studied. Our study of these important properties conducted on four major classes of proteins namely Globins, Homeoboxes, Heat Shock proteins (HSP) and Kinase have shown that frequency of twenty naturally occurring amino acids, hydrophobic content of protein, molecular weight of protein, isoelectric point of protein, secondary structure composition of amino acid residues as helices, coils and sheets and the composition of helices, coils and sheets in the secondary structure topology plays a significant role in correctly classifying the protein into its corresponding class or family as indicated by the overall efficiency of Nearest Neighbor Classifier as 84.92%.

 

Keywords

proteins, family, classification, classifier

 

Citation

 

Chaurasiya et al., Bioinformation 4(9): 396-398 (2010)

 

Edited by

 

P. Kangueane

 

ISSN

 

0973-2063

 

Publisher

 

Biomedical Informatics

License

 

 

This is an Open Access article which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. This is distributed under the terms of the Creative Commons Attribution License.