Publication Date: 22 Jul 2009
Type: Short Report
Journal: Bioinformatics and Biology Insights
Microarray data repositories as well as large clinical applications of gene expression allow to analyse several hundreds of microarrays at one time. The preprocessing of large amounts of microarrays is still a challenge. The algorithms are limited by the available computer hardware. For example, building classification or prognostic rules from large microarray sets will be very time consuming. Here, preprocessing has to be a part of the cross-validation and resampling strategy which is necessary to estimate the rule’s prediction quality honestly. This paper proposes the new Bioconductor package affyPara for parallelized preprocessing of Affymetrix microarray data. Partition of data can be applied on arrays and parallelization of algorithms is a straightforward consequence. The partition of data and distribution to several nodes solves the main memory problems and accelerates preprocessing by up to the factor 20 for 200 or more arrays. affyPara is a free and open source package, under GPL license, available form the Bioconductor project at www.bioconductor.org. A user guide and examples are provided with the package.
PDF (487.68 KB PDF FORMAT)
RIS citation (ENDNOTE, REFERENCE MANAGER, PROCITE, REFWORKS)
BibTex citation (BIBDESK, LATEX)
PMC HTML
Publishing in Air, Soil and Water and Water Research was the best experience I have had so far in an academic context. The review process was fair, quick and efficient. I congratulate the team at Libertas Academica for a very well managed journal.Magnus Karlsson (IVL Swedish Environmental Research Institute, Stockholm, Sweden) What Your Colleagues Say
Copyright © 2012 Libertas Academica Ltd (except open access articles and accompanying metadata and supplementary files.)
FacebookGoogle+Twitter
PinterestTumblrYouTube