Improving detection of copy-number variation by simultaneous bias correction and read-depth segmentation

Sullivan, Patrick F.; Sun, Wei; Szatkiewicz, Jin P.; Wang, Waibo; Wang, Wei

Improving detection of copy-number variation by simultaneous bias correction and read-depth segmentation

Authors: Patrick F. Sullivan
Wei Sun
Jin P. Szatkiewicz
Waibo Wang
Wei Wang
Publication date: 1 January 2013
Publisher
Doi

Abstract

Structural variation is an important class of genetic variation in mammals. High-throughput sequencing (HTS) technologies promise to revolutionize copy-number variation (CNV) detection but present substantial analytic challenges. Converging evidence suggests that multiple types of CNV-informative data (e.g. read-depth, read-pair, split-read) need be considered, and that sophisticated methods are needed for more accurate CNV detection. We observed that various sources of experimental biases in HTS confound read-depth estimation, and note that bias correction has not been adequately addressed by existing methods. We present a novel read-depth–based method, GENSENG, which uses a hidden Markov model and negative binomial regression framework to identify regions of discrete copy-number changes while simultaneously accounting for the effects of multiple confounders. Based on extensive calibration using multiple HTS data sets, we conclude that our method outperforms existing read-depth–based CNV detection algorithms. The concept of simultaneous bias correction and CNV detection can serve as a basis for combining read-depth with other types of information such as read-pair or split-read in a single analysis. A user-friendly and computationally efficient implementation of our method is freely available

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

CiteSeerX

oai:CiteSeerX.psu:10.1.1.810.1...

Last time updated on 30/10/2017

Carolina Digital Repository

cdr.lib.unc.edu:0g354p055

Last time updated on 24/11/2020