Close
Help




JOURNAL

Cancer Informatics

Preprocessing Steps for Agilent MicroRNA Arrays: Does the Order Matter?

Submit a Paper


Cancer Informatics 2014:Suppl. 4 105-109

Original Research

Published on 03 Sep 2015

DOI: 10.4137/CIN.S21630


Further metadata provided in PDF



Sign up for email alerts to receive notifications of new articles published in Cancer Informatics

Abstract

Motivation/background: Previous publications on microarray preprocessing mostly focused on method development or comparison for an individual preprocessing step. Very few, if any, focused on recommending an effective ordering of the preprocessing steps, in particular, normalization in relationship to log transformation and probe set summarization. In this study, we aim to study how the relative ordering of the preprocessing steps influences differential expression analysis for Agilent microRNA array data.

Methods: A set of 192 untreated primary gynecologic tumor samples (96 endometrial tumors and 96 ovarian tumors) were collected at Memorial Sloan Kettering Cancer Center during the period of 2000–2012. From this same sample set, two datasets were generated: one dataset had no confounding array effects by experimental design and served as the benchmark, and another dataset exhibited array effects and served as the test data. We preprocessed our test dataset using different orderings between the following three steps: quantile normalization, log transformation, and median summarization. Differential expression analysis was performed on each preprocessed test dataset, and the results were compared against the results from the benchmark dataset. True positive rate, false positive rate, and false discovery rate were used to assess the effectiveness of the orderings.

Results: The ordering of log transformation, quantile normalization (on probe-level data), and median summarization slightly outperforms the other orderings.

Conclusion: Our results ease the anxiety over the uncertain effect that the orderings could have on the analysis of Agilent microRNA array data.



Downloads

PDF  (650.94 KB PDF FORMAT)

RIS citation   (ENDNOTE, REFERENCE MANAGER, PROCITE, REFWORKS)

BibTex citation   (BIBDESK, LATEX)


Sharing


What Your Colleagues Say About Cancer Informatics
Publishing in Cancer Informatics was the fastest publication I have ever experienced and has received the highest viewing rate.  So it is a great place to publish your very latest research.
Dr Yue Zhang (Boston, MA, USA)
More Testimonials

Quick Links


New article and journal news notification services
Email Alerts RSS Feeds
Facebook Google+ Twitter
Pinterest Tumblr YouTube