Parallel-META: efficient metagenomic data analysis based on high-performance computation | |
Su, Xiaoquan; Xu, Jian; Ning, Kang | |
2012-07-16 | |
发表期刊 | BMC SYSTEMS BIOLOGY |
卷号 | 6期号:1 |
摘要 | BACKGROUND: Metagenomics method directly sequences and analyses genome information from microbial communities. There are usually more than hundreds of genomes from different microbial species in the same community, and the main computational tasks for metagenomic data analyses include taxonomical and functional component examination of all genomes in the microbial community. Metagenomic data analysis is both data- and computation- intensive, which requires extensive computational power. Most of the current metagenomic data analysis softwares were designed to be used on a single computer or single computer clusters, which could not match with the fast increasing number of large metagenomic projects' computational requirements. Therefore, advanced computational methods and pipelines have to be developed to cope with such need for efficient analyses. RESULT: In this paper, we proposed Parallel-META, a GPU- and multi-core-CPU-based open-source pipeline for metagenomic data analysis, which enabled the efficient and parallel analysis of multiple metagenomic datasets and the visualization of the results for multiple samples. In Parallel-META, the similarity-based database search was parallelized based on GPU computing and multi-core CPU computing optimization. Experiments have shown that Parallel-META has at least 15 times speed-up compared to traditional metagenomic data analysis method, with the same accuracy of the results http://www.computationalbioenergy.org/parallel-meta.html. CONCLUSION: The parallel processing of current metagenomic data would be very promising: with current speed up of 15 times and above, binning would not be a very time-consuming process any more. Therefore, some deeper analysis of the metagenomic data, such as the comparison of different samples, would be feasible in the pipeline, and some of these functionalities have been included into the Parallel-META pipeline. ; Background: Metagenomics method directly sequences and analyses genome information from microbial communities. There are usually more than hundreds of genomes from different microbial species in the same community, and the main computational tasks for metagenomic data analyses include taxonomical and functional component examination of all genomes in the microbial community. Metagenomic data analysis is both data- and computation-intensive, which requires extensive computational power. Most of the current metagenomic data analysis softwares were designed to be used on a single computer or single computer clusters, which could not match with the fast increasing number of large metagenomic projects' computational requirements. Therefore, advanced computational methods and pipelines have to be developed to cope with such need for efficient analyses. |
文章类型 | Article |
学科领域 | 功能基因组 |
WOS标题词 | Science & Technology ; Life Sciences & Biomedicine |
DOI | 10.1186/1752-0509-6-S1-S16 |
关键词[WOS] | PHYLOGENETIC CLASSIFICATION ; MICROBIAL GENOMES ; DNA FRAGMENTS ; SEQUENCES ; RESOURCE ; SEARCHES ; TOOLS ; HMMER ; ARB |
收录类别 | SCI ; ISTP |
语种 | 英语 |
WOS研究方向 | Mathematical & Computational Biology |
WOS类目 | Mathematical & Computational Biology |
WOS记录号 | WOS:000306568400016 |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://ir.qibebt.ac.cn/handle/337004/1417 |
专题 | 单细胞中心组群 |
作者单位 | Chinese Acad Sci, Qingdao Inst Bioenergy & Bioproc Technol, Qingdao, Shandong, Peoples R China |
推荐引用方式 GB/T 7714 | Su, Xiaoquan,Xu, Jian,Ning, Kang. Parallel-META: efficient metagenomic data analysis based on high-performance computation[J]. BMC SYSTEMS BIOLOGY,2012,6(1). |
APA | Su, Xiaoquan,Xu, Jian,&Ning, Kang.(2012).Parallel-META: efficient metagenomic data analysis based on high-performance computation.BMC SYSTEMS BIOLOGY,6(1). |
MLA | Su, Xiaoquan,et al."Parallel-META: efficient metagenomic data analysis based on high-performance computation".BMC SYSTEMS BIOLOGY 6.1(2012). |
条目包含的文件 | 下载所有文件 | |||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
Parallel-META effici(964KB) | 开放获取 | CC BY-NC-SA | 浏览 下载 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论