Close
Help




JOURNAL

Cancer Informatics

Improving Cancer Gene Expression Data Quality through a TCGA Data-Driven Evaluation of Identifier Filtering

Submit a Paper


Cancer Informatics 2015:14 149-161

Original Research

Published on 16 Dec 2015

DOI: 10.4137/CIN.S33076


Further metadata provided in PDF



Sign up for email alerts to receive notifications of new articles published in Cancer Informatics

Abstract

Data quality is a recognized problem for high-throughput genomics platforms, as evinced by the proliferation of methods attempting to filter out lower quality data points. Different filtering methods lead to discordant results, raising the question, which methods are best? Astonishingly, little computational support is offered to analysts to decide which filtering methods are optimal for the research question at hand. To evaluate them, we begin with a pair of expression data sets, transcriptomic and proteomic, on the same samples. The pair of data sets form a test-bed for the evaluation. Identifier mapping between the data sets creates a collection of feature pairs, with correlations calculated for each pair. To evaluate a filtering strategy, we estimate posterior probabilities for the correctness of probesets accepted by the method. An analyst can set expected utilities that represent the trade-off between the quality and quantity of accepted features. We tested nine published probeset filtering methods and combination strategies. We used two test-beds from cancer studies providing transcriptomic and proteomic data. For reasonable utility settings, the Jetset filtering method was optimal for probeset filtering on both test-beds, even though both assay platforms were different. Further intersection with a second filtering method was indicated on one test-bed but not the other.



Downloads

PDF  (1.42 MB PDF FORMAT)

RIS citation   (ENDNOTE, REFERENCE MANAGER, PROCITE, REFWORKS)

BibTex citation   (BIBDESK, LATEX)


Sharing


What Your Colleagues Say About Cancer Informatics
This is the first time for us to submit a manuscript to Cancer Informatics.  We thank the peer reviewers for their insightful comments, which have improved our manuscript markedly. We were pleased to find that the staff were extremely helpful and kept us informed of the progress of the submission step-by-step. Our experience with Cancer Informatics has been tremendous. Thank you very much!
Dr Yirong Wu (University of Wisconsin, Madison, WI, USA)
More Testimonials

Quick Links


New article and journal news notification services
Email Alerts RSS Feeds
Facebook Google+ Twitter
Pinterest Tumblr YouTube