2736 Article Views
Publication Date: 07 May 2010
Type: Original Research
Journal: Cancer Informatics
doi: 10.4137/CIN.S3805
There have been relatively few publications using linear regression models to predict a continuous response based on microarray expression profiles. Standard linear regression methods are problematic when the number of predictor variables exceeds the number of cases. We have evaluated three linear regression algorithms that can be used for the prediction of a continuous response based on high dimensional gene expression data. The three algorithms are the least angle regression (LAR), the least absolute shrinkage and selection operator (LASSO), and the averaged linear regression method (ALM). All methods are tested using simulations based on a real gene expression dataset and analyses of two sets of real gene expression data and using an unbiased complete cross validation approach. Our results show that the LASSO algorithm often provides a model with somewhat lower prediction error than the LAR method, but both of them perform more efficiently than the ALM predictor. We have developed a plug-in for BRB-ArrayTools that implements the LAR and the LASSO algorithms with complete cross-validation.
Discussion
No comments yet...Be the first to comment.
As a peer reviewer for Environmental Health Insights, I have had the opportunity to read several very important research articles in my field. Based on my experience, the submission process, review standards, and publication expectations are rigorous and demanding as other high impact journals. I look forward to further reviewing papers for Environmental Health Insights and learning from my peers and other leaders in the field.Dr Jianbo Jiang (Monell Chemical Senses Center, Philadelphia, PA, USA ) What Your Colleagues Say
Copyright © 2012 Libertas Academica Ltd (except open access articles and accompanying metadata and supplementary files.)
FacebookGoogle+Twitter
PinterestTumblrYouTube