BACK TO CONTENTS   |    PDF   |    PREVIOUS   |    NEXT

Title

 

 

 

 

 

GSTaxClassifier: a genomic signature based taxonomic classifier for metagenomic data analysis

 

Authors

 

Fahong Yu$, Yijun SunS, Li Liu, William Farmerie

Affiliation

 

Interdisciplinary Center for Biotechnology Research, University of Florida, Gainesville, FL 32610; $ Equal contribution

 

Email

 

fyu@ufl.edu

Article Type

 

Hypothesis

Date

 

Received April 07, 2009; Accepted June 18, 2009; Published August 20, 2009

Abstract

GSTaxClassifier (Genomic Signature based Taxonomic Classifier) is a program for metagenomics analysis of shotgun DNA sequences. The program includes (1) a simple but effective algorithm, a modification of the Bayesian method, to predict the most probable genomic origins of sequences at different taxonomical ranks, on the basis of genome databases; (2) a function to generate genomic profiles of reference sequences with tri-, tetra-, penta-, and hexa-nucleotide motifs for setting a user-defined database; (3) two different formats (tabular- and tree-based summaries) to display taxonomic predictions with improved analytical methods; and (4) effective ways to retrieve, search, and summarize results by integrating the predictions into the NCBI tree-based taxonomic information. GSTaxClassifier takes input nucleotide sequences and using a modified Bayesian model evaluates the genomic signatures between metagenomic query sequences and reference genome databases. The simulation studies of a numerical data sets showed that GSTaxClassifier could serve as a useful program for metagenomics studies, which is freely available at http://helix2.biotech.ufl.edu:26878/metagenomics/.

 

Keywords

 

Genomic signature; meta-genomics; taxonomy; Bayesian method

 

Citation

 

Yu et al., Bioinformation 4(1): 46-49 (2009)

Edited by

 

P. Kangueane

 

ISSN

 

0973-2063

 

Publisher

 

Biomedical Informatics 

 

License

 

 

This is an Open Access article which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. This is distributed under the terms of the Creative Commons Attribution License.