TY - GEN
T1 - Accelerating dna sequencing-by-hybridization with noise
AU - Chen, Chen
AU - Xin, Dong
AU - Han, Jiawei
PY - 2005
Y1 - 2005
N2 - As a potential alternative to current wet-lab technologies, DNA sequencing-by-hybridization (SBH) has received much attention from different research communities. In order to deal with real applications, experiment environments should not be considered as error-free. Previously, under the assumption of random independent hybridization errors, Leong et, al. [9 J presented an algorithm for sequence reconstruction which exhibits graceful degradation of output accuracy as the error rate increases. However, as the authors also admitted, a notable downside of their method is its too high computational cost. In this paper, we show that the poor efficiency of [9] is due to its mixing-up of situations with widely different characteristics and treating everything in the safest but also slowest way. Our new algorithm addresses this problem and pushes analysis down to a finer level where a more effective solution is proposed. As demonstrated by experimentations on real human genome datasets, this new methodology yields significant performance improvements and at the same time guarantees almost the same degree of output accuracy.
AB - As a potential alternative to current wet-lab technologies, DNA sequencing-by-hybridization (SBH) has received much attention from different research communities. In order to deal with real applications, experiment environments should not be considered as error-free. Previously, under the assumption of random independent hybridization errors, Leong et, al. [9 J presented an algorithm for sequence reconstruction which exhibits graceful degradation of output accuracy as the error rate increases. However, as the authors also admitted, a notable downside of their method is its too high computational cost. In this paper, we show that the poor efficiency of [9] is due to its mixing-up of situations with widely different characteristics and treating everything in the safest but also slowest way. Our new algorithm addresses this problem and pushes analysis down to a finer level where a more effective solution is proposed. As demonstrated by experimentations on real human genome datasets, this new methodology yields significant performance improvements and at the same time guarantees almost the same degree of output accuracy.
KW - Algorithmic efficiency
KW - Clues from the genome
KW - Noise
KW - Sequencing-by-hybridization
UR - http://www.scopus.com/inward/record.url?scp=84885629716&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84885629716&partnerID=8YFLogxK
U2 - 10.1145/1134030.1134037
DO - 10.1145/1134030.1134037
M3 - Conference contribution
AN - SCOPUS:84885629716
SN - 1595932135
SN - 9781595932136
T3 - Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
SP - 29
EP - 36
BT - Proceedings of the 5th International Workshop on Bioinformatics, BIOKDD 2005
T2 - 5th International Workshop on Bioinformatics, BIOKDD 2005 - In Conjunction with 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2005
Y2 - 21 August 2005 through 21 August 2005
ER -