Home Journals Subjects About My LA Reviewers Authors News Submit
Username: Password:
.
(close)

(Ctrl-click to select multiple journals)


How should we address you?

Your email address


Enter the three character code
Visual CAPTCHA
Privacy Statement
 
 
 
 
 
 

Cross-platform Analysis of Cancer Biomarkers: A Bayesian Network Approach to Incorporating Mass Spectrometry and Microarray Data

Authors: Xutao Deng, Huimin Geng and Hesham H. Ali
Publication Date: 29 Apr 2007
Cancer Informatics 2007:3 183-202

Xutao Deng1, Huimin Geng2 and Hesham H. Ali1

1College of Information Science and Technology, University of Nebraska at Omaha, Omaha, NE 68182, U.S.A. 2Department of Pathology and Microbiology, University of Nebraska Medical Center, Omaha, NE 68198, U.S.A.

Abstract: Many studies showed inconsistent cancer biomarkers due to bioinformatics artifacts. In this paper we use multiple data sets from microarrays, mass spectrometry, protein sequences, and other biological knowledge in order to improve the reliability of cancer biomarkers. We present a novel Bayesian network (BN) model which integrates and cross-annotates multiple data sets related to prostate cancer. The main contribution of this study is that we provide a method that is designed to find cancer biomarkers whose presence is supported by multiple data sources and biological knowledge. Relevant biological knowledge is explicitly encoded into the model parameters, and the biomarker finding problem is formulated as a Bayesian inference problem. Besides diagnostic accuracy, we introduce reliability as another quality measurement of the biological relevance of biomarkers. Based on the proposed BN model, we develop an empirical scoring scheme and a simulation algorithm for inferring biomarkers. Fourteen genes/proteins including prostate specific antigen (PSA) are identified as reliable serum biomarkers which are insensitive to the model assumptions. The computational results show that our method is able to find biologically relevant biomarkers with highest reliability while maintaining competitive predictive power. In addition, by combining biological knowledge and data from multiple platforms, the number of putative biomarkers is greatly reduced to allow more-focused clinical studies.