Search CORE

6 research outputs found

Penalized weighted low-rank approximation for robust recovery of recurrent copy number variations

Author: Gao Xiaoli
NC DOCKS at The University of North Carolina at Greensboro
Publication venue
Publication date: 01/01/2015
Field of study

2015-2016 UNCG University Libraries Open Access Publishing Fund Grant Winner. BackgroundCopy number variation (CNV) analysis has become one of the most important researchareas for understanding complex disease. With increasing resolution of array-basedcomparative genomic hybridization (aCGH) arrays, more and more raw copy numberdata are collected for multiple arrays. It is natural to realize the co-existence of bothrecurrent and individual-specific CNVs, together with the possible data contaminationduring the data generation process. Therefore, there is a great need for an efficient androbust statistical model for simultaneous recovery of both recurrent and individualspecificCNVs.ResultWe develop a penalized weighted low-rank approximation method (WPLA) for robustrecovery of recurrent CNVs. In particular, we formulate multiple aCGH arrays into arealization of a hidden low-rank matrix with some random noises and let an additionalweight matrix account for those individual-specific effects. Thus, we do not restrict therandom noise to be normally distributed, or even homogeneous. We show itsperformance through three real datasets and twelve synthetic datasets from different typesof recurrent CNV regions associated with either normal random errors or heavilycontaminated errors.ConclusionOur numerical experiments have demonstrated that the WPLA can successfully recoverthe recurrent CNV patterns from raw data under different scenarios. Compared with twoother recent methods, it performs the best regarding its ability to simultaneously detectboth recurrent and individual-specific CNVs under normal random errors. Moreimportantly, the WPLA is the only method which can effectively recover the recurrentCNVs region when the data is heavily contaminated

The University of North Carolina at Greensboro

nbCNV: a multi-constrained optimization model for discovering copy number variants in single-cell sequencing data

Author: A Abyzov
A Krepischi
A Magi
AB Olshen
AH Handyside
B Langmead
C Xie
C Zong
Changsheng Zhang
D Grün
D Pinkel
D Wells
DY Chiang
G Klambauer
G Nilsen
H Carén
Hongmin Cai
J Duan
Jingying Huang
JT Glessner
KC Amarasinghe
MF Berger
MK Ng
N Navin
NE Navin
P Medvedev
RS Lasken
S Steinberg
T Baslan
T Baslan
V Boeva
X Cai
X Zhou
Yan Song
Z Zhang
Z Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Multisample aCGH Data Analysis via Total Variation and Spectral Regularization

Author: Wan Xiang
Yang Can
Yu Weichuan
Zhao Hongyu
Zhou Xiaowei
Publication venue
Publication date: 01/01/2013
Field of study

DNA copy number variation (CNV) accounts for a large proportion of genetic variation. One commonly used approach to detecting CNVs is array-based comparative genomic hybridization (aCGH). Although many methods have been proposed to analyze aCGH data, it is not clear how to combine information from multiple samples to improve CNV detection. In this paper, we propose to use a matrix to approximate the multisample aCGH data and minimize the total variation of each sample as well as the nuclear norm of the whole matrix. In this way, we can make use of the smoothness property of each sample and the correlation among multiple samples simultaneously in a convex optimization framework. We also developed an efficient and scalable algorithm to handle large-scale data. Experiments demonstrate that the proposed method outperforms the state-of-the-art techniques under a wide range of scenarios and it is capable of processing large data sets with millions of probes

Hong Kong University of Science and Technology Institutional Repository