Cancer Informatics 2014:Suppl. 4 105-109
Original Research
Published on 03 Sep 2015
DOI: 10.4137/CIN.S21630
Motivation/background: Previous publications on microarray preprocessing mostly focused on method development or comparison for an individual preprocessing step. Very few, if any, focused on recommending an effective ordering of the preprocessing steps, in particular, normalization in relationship to log transformation and probe set summarization. In this study, we aim to study how the relative ordering of the preprocessing steps influences differential expression analysis for Agilent microRNA array data.
Methods: A set of 192 untreated primary gynecologic tumor samples (96 endometrial tumors and 96 ovarian tumors) were collected at Memorial Sloan Kettering Cancer Center during the period of 2000–2012. From this same sample set, two datasets were generated: one dataset had no confounding array effects by experimental design and served as the benchmark, and another dataset exhibited array effects and served as the test data. We preprocessed our test dataset using different orderings between the following three steps: quantile normalization, log transformation, and median summarization. Differential expression analysis was performed on each preprocessed test dataset, and the results were compared against the results from the benchmark dataset. True positive rate, false positive rate, and false discovery rate were used to assess the effectiveness of the orderings.
Results: The ordering of log transformation, quantile normalization (on probe-level data), and median summarization slightly outperforms the other orderings.
Conclusion: Our results ease the anxiety over the uncertain effect that the orderings could have on the analysis of Agilent microRNA array data.
PDF (650.94 KB PDF FORMAT)
RIS citation (ENDNOTE, REFERENCE MANAGER, PROCITE, REFWORKS)
BibTex citation (BIBDESK, LATEX)
Publishing in Cancer Informatics was the fastest publication I have ever experienced and has received the highest viewing rate. So it is a great place to publish your very latest research.
Facebook Google+ Twitter
Pinterest Tumblr YouTube