
Title

Length constraints of multi-domain proteins in metazoans

Authors

 

Sarah Middleton, Timothy Song, Sudhir Nayak*

Affiliation

 

Department of Biology, The College of New Jersey, 2000 Pennington Rd., Ewing, NJ 08628

 

Email

 

nayak@tcnj.edu

Article Type

 

Hypothesis

Date

 

Received March 06, 2010; accepted April 09, 2010; published April 30, 2010

Abstract

The increasing number of annotated genome sequences in public databases has made it possible to study the length distributions and domain composition of proteins at unprecedented resolution. To identify factors that influence protein length in metazoans, we analyzed all domain-annotated proteins from 49 animal species in Ensembl (v.56) or EnsemblMetazoa (v.3). Our results indicate that protein length constraints are not fixed as a linear function of domain count and can vary with domain content. The presence of repeating domains was associated with relaxation of the constraints that govern protein length. Conversely, for proteins with unique domains, length constraints were generally maintained as domain counts increased. Mean (and median) protein length and domain composition clearly vary significantly between metazoans and other kingdoms; however, the connections between function, domain content, and length remain unclear. We incorporated Gene Ontology (GO) annotation to identify biological processes, cellular components, or molecular functions that favor the incorporation of multi-domain proteins. Using this approach, we identified multiple such GO terms; interestingly, several of the GO terms with elevated domain counts were not restricted to a single gene family. The findings presented here represent an important step in resolving the complex relationship between protein length, function, and domain content. Comparison of these data with data from other kingdoms is likely to reveal additional differences in the regulation of protein length.

 

Keywords

length, domain, proteins, metazoans, constraints

Citation

 

Middleton et al. Bioinformation 4(10): 000-000 (2010)

 

Edited by

 

P. Kangueane

ISSN

 

0973-2063

 

Publisher

 

Biomedical Informatics

License

This is an Open Access article which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. This is distributed under the terms of the Creative Commons Attribution License.