19 research outputs found
wuHMM: a robust algorithm to detect DNA copy number variation using long oligonucleotide microarray data
Copy number variants (CNVs) are currently defined as genomic sequences that are polymorphic in copy number and range in length from 1000 to several million base pairs. Among current array-based CNV detection platforms, long-oligonucleotide arrays promise the highest resolution. However, the performance of currently available analytical tools suffers when applied to these data because of the lower signal:noise ratio inherent in oligonucleotide-based hybridization assays. We have developed wuHMM, an algorithm for mapping CNVs from array comparative genomic hybridization (aCGH) platforms comprised of 385 000 to more than 3 million probes. wuHMM is unique in that it can utilize sequence divergence information to reduce the false positive rate (FPR). We apply wuHMM to 385K-aCGH, 2.1M-aCGH and 3.1M-aCGH experiments comparing the 129X1/SvJ and C57BL/6J inbred mouse genomes. We assess wuHMM's performance on the 385K platform by comparison to the higher resolution platforms and we independently validate 10 CNVs. The method requires no training data and is robust with respect to changes in algorithm parameters. At a FPR of <10%, the algorithm can detect CNVs with five probes on the 385K platform and three on the 2.1M and 3.1M platforms, resulting in effective resolutions of 24 kb, 2–5 kb and 1 kb, respectively
Recommended from our members
A High-Resolution Map of Segmental DNA Copy Number Variation in the Mouse Genome
Submicroscopic (less than 2 Mb) segmental DNA copy number changes are a recently recognized source of genetic variability between individuals. The biological consequences of copy number variants (CNVs) are largely undefined. In some cases, CNVs that cause gene dosage effects have been implicated in phenotypic variation. CNVs have been detected in diverse species, including mice and humans. Published studies in mice have been limited by resolution and strain selection. We chose to study 21 well-characterized inbred mouse strains that are the focus of an international effort to measure, catalog, and disseminate phenotype data. We performed comparative genomic hybridization using long oligomer arrays to characterize CNVs in these strains. This technique increased the resolution of CNV detection by more than an order of magnitude over previous methodologies. The CNVs range in size from 21 to 2,002 kb. Clustering strains by CNV profile recapitulates aspects of the known ancestry of these strains. Most of the CNVs (77.5%) contain annotated genes, and many (47.5%) colocalize with previously mapped segmental duplications in the mouse genome. We demonstrate that this technique can identify copy number differences associated with known polymorphic traits. The phenotype of previously uncharacterized strains can be predicted based on their copy number at these loci. Annotation of CNVs in the mouse genome combined with sequence-based analysis provides an important resource that will help define the genetic basis of complex traits
Recommended from our members
NUBPL mitochondrial disease: new patients and review of the genetic and clinical spectrum
Background: The nucleotide binding protein-like (NUBPL) gene was first reported as a cause of mitochondrial complex I deficiency (MIM 613621, 618242) in 2010. To date, only eight patients have been reported with this mitochondrial disorder. Five other patients were recently reported to have NUBPL disease but their clinical picture was different from the first eight patients. Here, we report clinical and genetic findings in five additional patients (four families). Methods: Whole exome sequencing was used to identify patients with compound heterozygous NUBPL variants. Functional studies included RNA-Seq transcript analyses, missense variant biochemical analyses in a yeast model (Yarrowia lipolytica) and mitochondrial respiration experiments on patient fibroblasts. Results: The previously reported c.815-27T>C branch-site mutation was found in all four families. In prior patients, c.166G>A [p.G56R] was always found in cis with c.815-27T>C, but only two of four families had both variants. The second variant found in trans with c.815-27T>C in each family was: C.311T>C [p.L104P] in three patients, c.693+1G>A in one patient and c.545T>C [p.V182A] in one patient. Complex I function in the yeast model was impacted by p.L104P but not p.V182A. Clinical features include onset of neurological symptoms at 3-18 months, global developmental delay, cerebellar dysfunction (including ataxia, dysarthria, nystagmus and tremor) and spasticity. Brain MRI showed cerebellar atrophy. Mitochondrial function studies on patient fibroblasts showed significantly reduced spare respiratory capacity. Conclusion: We report on five new patients with NUBPL disease, adding to the number and phenotypic variability of patients diagnosed worldwide, and review prior reported patients with pathogenic NUBPL variants
Progressive multifocal leukoencephalopathy genetic risk variants for pharmacovigilance of immunosuppressant therapies
BackgroundProgressive multifocal leukoencephalopathy (PML) is a rare and often lethal brain disorder caused by the common, typically benign polyomavirus 2, also known as JC virus (JCV). In a small percentage of immunosuppressed individuals, JCV is reactivated and infects the brain, causing devastating neurological defects. A wide range of immunosuppressed groups can develop PML, such as patients with: HIV/AIDS, hematological malignancies (e.g., leukemias, lymphomas, and multiple myeloma), autoimmune disorders (e.g., psoriasis, rheumatoid arthritis, and systemic lupus erythematosus), and organ transplants. In some patients, iatrogenic (i.e., drug-induced) PML occurs as a serious adverse event from exposure to immunosuppressant therapies used to treat their disease (e.g., hematological malignancies and multiple sclerosis). While JCV infection and immunosuppression are necessary, they are not sufficient to cause PML.MethodsWe hypothesized that patients may also have a genetic susceptibility from the presence of rare deleterious genetic variants in immune-relevant genes (e.g., those that cause inborn errors of immunity). In our prior genetic study of 184 PML cases, we discovered 19 candidate PML risk variants. In the current study of another 152 cases, we validated 4 of 19 variants in both population controls (gnomAD 3.1) and matched controls (JCV+ multiple sclerosis patients on a PML-linked drug ≥ 2 years).ResultsThe four variants, found in immune system genes with strong biological links, are: C8B, 1-57409459-C-A, rs139498867; LY9 (alias SLAMF3), 1-160769595-AG-A, rs763811636; FCN2, 9-137779251-G-A, rs76267164; STXBP2, 19-7712287-G-C, rs35490401. Carriers of any one of these variants are shown to be at high risk of PML when drug-exposed PML cases are compared to drug-exposed matched controls: P value = 3.50E-06, OR = 8.7 [3.7–20.6]. Measures of clinical validity and utility compare favorably to other genetic risk tests, such as BRCA1 and BRCA2 screening for breast cancer risk and HLA-B*15:02 pharmacogenetic screening for pharmacovigilance of carbamazepine to prevent Stevens-Johnson Syndrome and Toxic Epidermal Necrolysis.ConclusionFor the first time, a PML genetic risk test can be implemented for screening patients taking or considering treatment with a PML-linked drug in order to decrease the incidence of PML and enable safer use of highly effective therapies used to treat their underlying disease
Representative Germline CNVs in Mice Identified by High-Resolution aCGH
<div><p>The log<sub>2</sub> ratios of signal intensity for C57BL/6J (reference) versus 20 test strains are shown. Inset, an expanded view of the CNVs in NOD/LtJ and A/J from (A) and (B). Scale, 500 kb.</p><p>(A) A 135.6-kb segment of reduced copy number (mean log<sub>2</sub> = −1.02) on Chromosome 14 is present in most strains.</p><p>(B) A 61.7-kb amplified segment (mean log<sub>2</sub> = +1.01) on Chromosome 1 is present in most strains.</p></div
Heatmap Representation of Copy Number Changes in Mice
<p>Unsupervised clustering of segmental gains (green) and losses (red) yields a dendrogram that recapitulates features of the known genealogy of these strains. Clustering in the vertical axis demonstrates three clusters: segments amplified in most strains, segments reduced in most strains, and a third cluster containing either singleton CNVs or mixtures of amplifications and deletions.</p