Home About us MoEF Contact us Sitemap Tamil Website  
About Envis
Whats New
Microorganisms
Research on Microbes
Database
Bibliography
Publications
Library
E-Resources
Microbiology Experts
Events
Online Submission
mn

Site Visitors

blog tracking


 
Nucleic Acids Research
Vol.
40, No. 15, 2012; Pages: xxx - xxx

Composition-based classification of short metagenomic sequences elucidates the landscapes of taxonomic and functional enrichment of microorganisms

Jiemeng Liu, Haifeng Wang, Hongxing Yang, Yizhe Zhang, Jinfeng Wang, Fangqing Zhao and Ji Qi

State Key Laboratory of Genetic Engineering.

Abstract

Compared with traditional algorithms for long metagenomic sequence classification, characterizing microorganisms’ taxonomic and functional abundance based on tens of millions of very short reads are much more challenging. We describe an efficient composition and phylogeny-based algorithm [Metagenome Composition Vector (MetaCV)] to classify very short metagenomic reads (75–100 bp) into specific taxonomic and functional groups. We applied MetaCV to the Meta-HIT data (371-Gb 75-bp reads of 109 human gut metagenomes), and this single-read-based, instead of assembly-based, classification has a high resolution to characterize the composition and structure of human gut microbiota, especially for low abundance species. Most strikingly, it only took MetaCV 10 days to do all the computation work on a server with five 24-core nodes. To our knowledge, MetaCV, benefited from the strategy of composition comparison, is the first algorithm that can classify millions of very short reads within affordable time.

Keywords: phylogeny-based algorithm; human gut metagenomes; characterizing microorganisms; enrichment of microorganisms.


 

 

 
Copyright © 2005 ENVIS Centre ! All rights reserved
This site is optimized for 1024 x 768 screen resolution