QIBEBT-IR  > 单细胞中心组群
Towards a Better Solution to the Shortest Common Supersequence Problem: The Deposition and Reduction Algorithm
Kang Ning; Hon Wai Leong
2006
发表期刊BMC Bioinformatics
期号7
摘要

Background: The problem of finding a Shortest Common Supersequence (SCS) of a set of sequences is an important problem with applications in many areas. It is a key problem in biological sequences analysis. The SCS problem is well-known to be NP-complete. Many heuristic algorithms have been proposed. Some heuristics work well on a few long sequences (as in sequence comparison applications); others work well on many short sequences (as in oligo-array synthesis). Unfortunately, most do not work well on large SCS instances where there are many, long sequences. Results: In this paper, we present a Deposition and Reduction (DR) algorithm for solving large SCS instances of biological sequences. There are two processes in our DR algorithm: deposition process, and reduction process. The deposition process is responsible for generating a small set of common supersequences; and the reduction process shortens these common supersequences by removing some characters while preserving the common supersequence property. Our evaluation on simulated data and real DNA and protein sequences show that our algorithm consistently produces the best results compared to many well-known heuristic algorithms, and especially on large instances. Conclusion: Our DR algorithm provides a partial answer to the open problem of designing efficient heuristic algorithm for SCS problem on many long sequences. Our algorithm has a bounded approximation ratio. The algorithm is efficient, both in running time and space complexity and our evaluation shows that it is practical even for SCS problems on many long sequences.

;

Background: The problem of finding a Shortest Common Supersequence (SCS) of a set of sequences is an important problem with applications in many areas. It is a key problem in biological sequences analysis. The SCS problem is well-known to be NP-complete. Many heuristic algorithms have been proposed. Some heuristics work well on a few long sequences (as in sequence comparison applications); others work well on many short sequences (as in oligo-array synthesis). Unfortunately, most do not work well on large SCS instances where there are many, long sequences. Results: In this paper, we present a Deposition and Reduction (DR) algorithm for solving large SCS instances of biological sequences. There are two processes in our DR algorithm: deposition process, and reduction process. The deposition process is responsible for generating a small set of common supersequences; and the reduction process shortens these common supersequences by removing some characters while preserving the common supersequence property. Our evaluation on simulated data and real DNA and protein sequences show that our algorithm consistently produces the best results compared to many well-known heuristic algorithms, and especially on large instances. Conclusion: Our DR algorithm provides a partial answer to the open problem of designing efficient heuristic algorithm for SCS problem on many long sequences. Our algorithm has a bounded approximation ratio. The algorithm is efficient, both in running time and space complexity and our evaluation shows that it is practical even for SCS problems on many long sequences.

关键词Towards a Better Solution To The Shortest Common Supersequence Problem: The Deposition And Reduction Algorithm
学科领域功能基因组
文献类型期刊论文
条目标识符http://ir.qibebt.ac.cn/handle/337004/979
专题单细胞中心组群
推荐引用方式
GB/T 7714
Kang Ning,Hon Wai Leong. Towards a Better Solution to the Shortest Common Supersequence Problem: The Deposition and Reduction Algorithm[J]. BMC Bioinformatics,2006(7).
APA Kang Ning,&Hon Wai Leong.(2006).Towards a Better Solution to the Shortest Common Supersequence Problem: The Deposition and Reduction Algorithm.BMC Bioinformatics(7).
MLA Kang Ning,et al."Towards a Better Solution to the Shortest Common Supersequence Problem: The Deposition and Reduction Algorithm".BMC Bioinformatics .7(2006).
条目包含的文件 下载所有文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
Towards a Better Sol(294KB) 开放获取使用许可浏览 下载
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Kang Ning]的文章
[Hon Wai Leong]的文章
百度学术
百度学术中相似的文章
[Kang Ning]的文章
[Hon Wai Leong]的文章
必应学术
必应学术中相似的文章
[Kang Ning]的文章
[Hon Wai Leong]的文章
相关权益政策
暂无数据
收藏/分享
文件名: Towards a Better Solution to the Shortest Common Supersequence Problem The Deposition and Reduction Algorithm.pdf
格式: Adobe PDF
此文件暂不支持浏览
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。