Abstract

Background: Gene set enrichment analysis (GSEA) is an analytic approach that simultaneously reduces the dimensionality of microarray data and enables ready inference of the biological meaning of observed gene expression patterns. Here we invert the GSEA process to identify class-specific gene signatures. Because our approach uses the Kolmogorov-Smirnov statistic both to define class-specific signatures and to classify samples using those signatures, we have termed this methodology “Dual-KS” (DKS).

Results: The optimum gene signature identified by the DKS algorithm was smaller than those identified by the other methods to which it was compared in 5 out of 10 datasets. The estimated error rate of DKS using the optimum gene signature was smaller than the estimated error rate of the random forest method in 4 out of the 10 datasets, and was equivalent in two additional datasets. DKS performance relative to the other benchmarked algorithms was similar to its performance relative to random forests.

Conclusions: DKS is an efficient analytic methodology that can identify highly parsimonious gene signatures useful for classification in the context of microarray studies. The algorithm is available as the dualKS package for R as part of the Bioconductor project.
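To make the classification half of the idea concrete, the following is a minimal Python sketch of a GSEA-style Kolmogorov-Smirnov running sum, assuming an unweighted, one-sided score and a sample's genes ranked by decreasing expression. It is not the dualKS package's implementation: the function names `ks_running_score` and `classify`, the gene identifiers, and the toy signatures are all hypothetical, and the signature-discovery step is not shown.

```python
from typing import Dict, Sequence, Set


def ks_running_score(ranked_genes: Sequence[str], signature: Set[str]) -> float:
    """Return the maximum of an unweighted KS-style running sum.

    Walking down the ranked list, the sum steps up for each signature gene
    and down for every other gene. A signature concentrated near the top of
    the ranking yields a score close to 1. (Illustrative assumption: the
    published method may weight or normalise the steps differently.)
    """
    n = len(ranked_genes)
    hits = sum(1 for gene in ranked_genes if gene in signature)
    if hits == 0 or hits == n:
        return 0.0  # the running sum is undefined or uninformative here
    step_up = 1.0 / hits
    step_down = 1.0 / (n - hits)
    running = 0.0
    best = 0.0
    for gene in ranked_genes:
        running += step_up if gene in signature else -step_down
        best = max(best, running)
    return best


def classify(expression: Dict[str, float], signatures: Dict[str, Set[str]]) -> str:
    """Assign a sample to the class whose signature scores highest."""
    # Rank genes from highest to lowest expression within this one sample.
    ranked = sorted(expression, key=expression.get, reverse=True)
    scores = {label: ks_running_score(ranked, sig) for label, sig in signatures.items()}
    return max(scores, key=scores.get)


# Toy data, invented for illustration only.
sample = {"g1": 9.0, "g2": 8.0, "g3": 2.0, "g4": 1.0}
toy_signatures = {"A": {"g1", "g2"}, "B": {"g3", "g4"}}
predicted = classify(sample, toy_signatures)
```

In this toy sample the two genes of signature A occupy the top of the ranking, so A receives the higher score and the sample is assigned to class A. Small signatures keep this scoring cheap, which is in the spirit of the parsimony the abstract emphasises.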
Copyright © 2010 Libertas Academica Ltd (except open access articles and accompanying metadata and supplementary files.)