Publication Date: 02 Dec 2014
Type: Review
Journal: Cancer Informatics
Citation: Cancer Informatics 2014:Suppl. 1 123-131
doi: 10.4137/CIN.S13879
We aim at developing a streamlined genome sequence compression algorithm to support alternative miniaturized sequencing devices, which have limited communication, storage, and computation power. Existing techniques that require heavy client (encoder side) cannot be applied. To tackle this challenge, we carefully examined distributed source coding theory and developed a customized reference-based genome compression protocol to meet the low-complexity need at the client side. Based on the variation between source and reference, our protocol will pick adaptively either syndrome coding or hash coding to compress subsequences of changing code length. Our experimental results showed promising performance of the proposed method when compared with the state-of-the-art algorithm (GRS).
PDF (1.83 MB PDF FORMAT)
RIS citation (ENDNOTE, REFERENCE MANAGER, PROCITE, REFWORKS)
BibTex citation (BIBDESK, LATEX)
PMC HTML
Compared with other journals we considered for publishing, Cancer Informatics provided extremely rapid but quality turnaround from draft submission to a flawlessly typeset final publication. Moreover, sharing the article is now as easy as sharing a link with no subscriptions required, and additional code and data files are equally accessible, supporting reproducible research. Because it has published many of our references we feel confident that our target readership must follow the journal. This is further ...
Facebook Google+ Twitter
Pinterest Tumblr YouTube