Publication Date: 02 Apr 2012
Type: Original Research
Journal: Evolutionary Bioinformatics
Citation: Evolutionary Bioinformatics 2012:8 171-180
doi: 10.4137/EBO.S9131
In spite of the recognized importance of tandem duplications in genome evolution, commonly adopted sequence comparison algorithms do not take into account complex mutation events involving more than one residue at the time, since they are not compliant with the underlying assumption of statistical independence of adjacent residues. As a consequence, the presence of tandem repeats in sequences under comparison may impair the biological significance of the resulting alignment. Although solutions have been proposed, repeat-aware sequence alignment is still considered to be an open problem and new efficient and effective methods have been advocated. The present paper describes an alternative lossy compression scheme for genomic sequences which iteratively collapses repeats of increasing length. The resulting approximate representations do not contain tandem duplications, while retaining enough information for making their comparison even more significant than the edit distance between the original sequences. This allows us to exploit traditional alignment algorithms directly on the compressed sequences. Results confirm the validity of the proposed approach for the problem of duplication-aware sequence alignment.
PDF (864.44 KB PDF FORMAT)
RIS citation (ENDNOTE, REFERENCE MANAGER, PROCITE, REFWORKS)
BibTex citation (BIBDESK, LATEX)
PMC HTML
My co-authors and I had a very positive experience with the review and publication process in Evolutionary Bioinformatics. The reviewers were rapid and on point, and publication was also rapid after we made the necessary revisions.
All authors are surveyed after their articles are published. Authors are asked to rate their experience in a variety of areas, and their responses help us to monitor our performance. Presented here are their responses in some key areas. No 'poor' or 'very poor' responses were received; these are represented in the 'other' category.See Our Results
Copyright © 2013 Libertas Academica Ltd (except open access articles and accompanying metadata and supplementary files.)
Facebook Google+ Twitter
Pinterest Tumblr YouTube