A Method to Detect Differential Gene Expression in Cross-Species Hybridization Experiments at Gene and Probe Level
Ying Chen1,5, Rebekah Wu4, James Felton4, David M. Rocke2 and Anu Chakicherla3,5
1Department of Statistics, University of California Davis, Davis, CA. 2Division of Biostatistics, University of California Davis, Davis, CA. 3Computation Directorate, Lawrence Livermore National Laboratory, Livermore, CA. 4Chemistry, Materials and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA. 5Y.C. and A.C made equal contributions to this study.
Abstract
Motivation: Whole genome microarrays are increasingly becoming the method of choice to study responses in model organisms to disease, stressors or other stimuli. However, whole genome sequences are available for only some model organisms, and there are still many species whose genome sequences are not yet available. Cross-species studies, where arrays developed for one species are used to study gene expression in a closely related species, have been used to address this gap, with some promising results. Current analytical methods have included filtration of some probes or genes that showed low hybridization activities. But consensus filtration schemes are still not available.
Results: A novel masking procedure is proposed based on currently available target species sequences to filter out probes and study a cross-species data set using this masking procedure and gene-set analysis. Gene-set analysis evaluates the association of some priori defined gene groups with a phenotype of interest. Two methods, Gene Set Enrichment Analysis (GSEA) and Test of Test Statistics (ToTS) were investigated. The results showed that masking procedure together with ToTS method worked well in our data set. The results from an alternative way to study cross-species hybridization experiments without masking are also presented. We hypothesize that the multi-probes structure of Affymetrix microarrays makes it possible to aggregate the effects of both well-hybridized and poorly- hybridized probes to study a group of genes. The principles of gene-set analysis were applied to the probe-level data instead of gene- level data. The results showed that ToTS can give valuable information and thus can be used as a powerful technique for analyzing cross-species hybridization experiments.
Availability: Software in the form of R code is available at http://anson.ucdavis.edu/~ychen/cross-species.html
Supplementary Data: Supplementary data are available at http://anson.ucdavis.edu/~ychen/cross-species.html
Readers of this also read:
- The Bimodality Index: A Criterion for Discovering and Ranking Bimodal Signatures from Cancer Gene Expression Profiling Data
- Gender Ratios in Autism, Asperger Syndrome and Autism Spectrum Disorder
- ETS-FUSions Networking, Triggering and Beyond
- Evolutionary Pattern of N-Glycosylation Sequon Numbers in Eukaryotic ABC Protein Superfamilies
- A Method to Detect Differential Gene Expression in Cross-Species Hybridization Experiments at Gene and Probe Level