Publication Date: 01 Feb 2015
Type: Review
Journal: Cancer Informatics
Citation: Cancer Informatics 2014:Suppl. 1 159-165
doi: 10.4137/CIN.S13887
The advance in human genome sequencing technology has significantly reduced the cost of data generation and overwhelms the computing capability of sequence analysis. Efficiency, efficacy, and scalability remain challenging in sequence alignment, which is an important and foundational operation for genome data analysis. In this paper, we propose a two-stage approach to tackle this problem. In the preprocessing step, we match blocks of reference and target sequences based on the similarities between their empirical transition probability distributions using belief propagation. We then conduct a refined match using our recently published sparse-coding belief propagation (SCoBeP) technique. Our experimental results demonstrated robustness in nucleotide sequence alignment, and our results are competitive to those of the SOAP aligner and the BWA algorithm. Moreover, compared to SCoBeP alignment, the proposed technique can handle sequences of much longer lengths.
PDF (945.98 KB PDF FORMAT)
RIS citation (ENDNOTE, REFERENCE MANAGER, PROCITE, REFWORKS)
BibTex citation (BIBDESK, LATEX)
I would like to extend my gratitude for creating the next generation of a scientific journal -- the science journal of tomorrow. The entire process bespoke of exceptional efficiency, celerity, professionalism, competency, and service.
Facebook Google+ Twitter
Pinterest Tumblr YouTube