Edward R. Dougherty1,2 and Marcel Brun2
1Department of Electrical and Computer Engineering, Texas A & M University, College Station, TX. 2Computational Biology Division, Translational Genomics Research Institute, Phoenix, AZ.
Abstract: The issue of wide feature-set variability has recently been raised in the context of expression-based classification using microarray data. This paper addresses that concern by demonstrating the natural manner in which many feature sets of a given size, chosen from a large collection of potential features, can be so close to optimal that they are statistically indistinguishable. Feature-set optimality is inherently related to sample size because it arises only on account of the tendency for classifier accuracy to diminish as the number of features grows too large for satisfactory design from the sample data. The paper considers optimal feature sets in the framework of a model in which the features are grouped so that intra-group correlation is substantial whereas inter-group correlation is minimal, the intent being to model the situation in which there are groups of highly correlated co-regulated genes and little correlation between the co-regulated groups. This is accomplished by using a block model for the covariance matrix that reflects these conditions. Focusing on linear discriminant analysis, we demonstrate how these assumptions can lead to very large numbers of close-to-optimal feature sets.
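To make the setting concrete, the following is a minimal simulation sketch, not the authors' code: it builds a block covariance matrix with strong intra-group correlation and zero inter-group correlation, trains a pooled-covariance LDA classifier on every feature subset of a fixed size from a small training sample, and counts how many subsets achieve a test error within a small margin of the best subset found. All numerical values (group sizes, correlation, mean shift, sample sizes, margin) are illustrative assumptions, not values taken from the paper.

```python
# Hedged sketch of the block-covariance / LDA setting described in the abstract.
# Not the authors' implementation; all parameter values are assumptions.
from itertools import combinations
import numpy as np
from numpy.linalg import inv

rng = np.random.default_rng(0)

# Block covariance: g groups of b features each; correlation rho within a group,
# zero between groups (modeling groups of co-regulated genes).
g, b, rho = 4, 5, 0.8
p = g * b
block = rho * np.ones((b, b)) + (1 - rho) * np.eye(b)
sigma = np.kron(np.eye(g), block)

# Two Gaussian classes with a modest mean shift on every feature (assumed value).
mu0 = np.zeros(p)
mu1 = 0.6 * np.ones(p)

def lda_error(idx, n_train=30, n_test=2000):
    """Train LDA with a pooled covariance estimate on the features in idx;
    return the error estimated on an independent test sample."""
    idx = list(idx)
    def sample(mu, n):
        return rng.multivariate_normal(mu[idx], sigma[np.ix_(idx, idx)], n)
    x0, x1 = sample(mu0, n_train), sample(mu1, n_train)
    m0, m1 = x0.mean(0), x1.mean(0)
    pooled = 0.5 * (np.cov(x0.T) + np.cov(x1.T)) + 1e-6 * np.eye(len(idx))
    w = inv(pooled) @ (m1 - m0)          # discriminant direction
    c = w @ (m0 + m1) / 2                # decision threshold
    t0, t1 = sample(mu0, n_test), sample(mu1, n_test)
    return ((t0 @ w > c).mean() + (t1 @ w <= c).mean()) / 2

# Exhaustively score every feature subset of the chosen size and count how many
# are within `margin` of the best estimated error.
size, margin = 3, 0.01
errors = {s: lda_error(s) for s in combinations(range(p), size)}
best = min(errors.values())
near_opt = sum(e <= best + margin for e in errors.values())
print(f"{near_opt} of {len(errors)} size-{size} feature sets are within "
      f"{margin} of the best estimated error ({best:.3f})")
```

Under these assumed settings the strong within-group correlation makes features in the same group largely interchangeable, so a substantial fraction of the subsets typically land within the margin of the best one, which is the qualitative effect the abstract describes.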