A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer

Ming Li; Yalu Wen; Wenjiang Fu

JOURNAL

Cancer Informatics

Journal Analytics

A Single-Array-Based Method for Detecting Copy Number Variants Using Affymetrix High Density SNP Arrays and its Application to Breast Cancer

Submit a Paper

Ming Li, Yalu Wen and Wenjiang Fu

Cancer Informatics 2014:Suppl. 4 95-103

Technical Advance

Published on 16 Jul 2015

DOI: 10.4137/CIN.S15203

Further metadata provided in PDF

Download Article PDF

Sign up for email alerts to receive notifications of new articles published in Cancer Informatics

Abstract and Sharing
Article Metrics
Discuss

Abstract

Cumulative evidence has shown that structural variations, due to insertions, deletions, and inversions of DNA, may contribute considerably to the development of complex human diseases, such as breast cancer. High-throughput genotyping technologies, such as Affymetrix high density single-nucleotide polymorphism (SNP) arrays, have produced large amounts of genetic data for genome-wide SNP genotype calling and copy number estimation. Meanwhile, there is a great need for accurate and efficient statistical methods to detect copy number variants. In this article, we introduce a hidden-Markov-model (HMM)-based method, referred to as the PICR-CNV, for copy number inference. The proposed method first estimates copy number abundance for each single SNP on a single array based on the raw fluorescence values, and then standardizes the estimated copy number abundance to achieve equal footing among multiple arrays. This method requires no between-array normalization, and thus, maintains data integrity and independence of samples among individual subjects. In addition to our efforts to apply new statistical technology to raw fluorescence values, the HMM has been applied to the standardized copy number abundance in order to reduce experimental noise. Through simulations, we show our refined method is able to infer copy number variants accurately. Application of the proposed method to a breast cancer dataset helps to identify genomic regions significantly associated with the disease.

Downloads

PDF (680.36 KB PDF FORMAT)

RIS citation (ENDNOTE, REFERENCE MANAGER, PROCITE, REFWORKS)

BibTex citation (BIBDESK, LATEX)

XML

PMC HTML

What Your Colleagues Say About Cancer Informatics

I would like to extend my gratitude for creating the next generation of a scientific journal -- the science journal of tomorrow. The entire process bespoke of exceptional efficiency, celerity, professionalism, competency, and service.

Dr Jason B. Nikas (Medical School University of Minnesota, Minneapolis, MN, USA)

More Testimonials