14,099 research outputs found
Change-point model on nonhomogeneous Poisson processes with application in copy number profiling by next-generation DNA sequencing
We propose a flexible change-point model for inhomogeneous Poisson Processes,
which arise naturally from next-generation DNA sequencing, and derive score and
generalized likelihood statistics for shifts in intensity functions. We
construct a modified Bayesian information criterion (mBIC) to guide model
selection, and point-wise approximate Bayesian confidence intervals for
assessing the confidence in the segmentation. The model is applied to DNA Copy
Number profiling with sequencing data and evaluated on simulated spike-in and
real data sets.Comment: Published in at http://dx.doi.org/10.1214/11-AOAS517 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Pathway relevance ranking for tumor samples through network-based data integration
The study of cancer, a highly heterogeneous disease with different causes and clinical outcomes, requires a multi-angle approach and the collection of large multi-omics datasets that, ideally, should be analyzed simultaneously. We present a new pathway relevance ranking method that is able to prioritize pathways according to the information contained in any combination of tumor related omics datasets. Key to the method is the conversion of all available data into a single comprehensive network representation containing not only genes but also individual patient samples. Additionally, all data are linked through a network of previously identified molecular interactions. We demonstrate the performance of the new method by applying it to breast and ovarian cancer datasets from The Cancer Genome Atlas. By integrating gene expression, copy number, mutation and methylation data, the method's potential to identify key pathways involved in breast cancer development shared by different molecular subtypes is illustrated. Interestingly, certain pathways were ranked equally important for different subtypes, even when the underlying (epi)-genetic disturbances were diverse. Next to prioritizing universally high-scoring pathways, the pathway ranking method was able to identify subtype-specific pathways. Often the score of a pathway could not be motivated by a single mutation, copy number or methylation alteration, but rather by a combination of genetic and epi-genetic disturbances, stressing the need for a network-based data integration approach. The analysis of ovarian tumors, as a function of survival-based subtypes, demonstrated the method's ability to correctly identify key pathways, irrespective of tumor subtype. A differential analysis of survival-based subtypes revealed several pathways with higher importance for the bad-outcome patient group than for the good-outcome patient group. Many of the pathways exhibiting higher importance for the bad-outcome patient group could be related to ovarian tumor proliferation and survival
Recommended from our members
A Mitochondrial Health Index Sensitive to Mood and Caregiving Stress.
BACKGROUND:Chronic life stress, such as the stress of caregiving, can promote pathophysiology, but the underlying cellular mechanisms are not well understood. Chronic stress may induce recalibrations in mitochondria leading to changes either in mitochondrial content per cell, or in mitochondrial functional capacity (i.e., quality). METHODS:Here we present a functional index of mitochondrial health (MHI) for human leukocytes that can distinguish between these two possibilities. The MHI integrates nuclear and mitochondrial DNA-encoded respiratory chain enzymatic activities and mitochondrial DNA copy number. We then use the MHI to test the hypothesis that daily emotional states and caregiving stress influence mitochondrial function by comparing healthy mothers of a child with an autism spectrum disorder (high-stress caregivers, n = 46) with mothers of a neurotypical child (control group, n = 45). RESULTS:The MHI outperformed individual mitochondrial function measures. Elevated positive mood at night was associated with higher MHI, and nightly positive mood was also a mediator of the association between caregiving and MHI. Moreover, MHI was correlated to positive mood on the days preceding, but not following the blood draw, suggesting for the first time in humans that mitochondria may respond to proximate emotional states within days. Correspondingly, the caregiver group, which had higher perceived stress and lower positive and greater negative daily affect, exhibited lower MHI. This effect was not explained by a mismatch between nuclear and mitochondrial genomes. CONCLUSIONS:Daily mood and chronic caregiving stress are associated with mitochondrial functional capacity. Mitochondrial health may represent a nexus between psychological stress and health
Combining chromosomal arm status and significantly aberrant genomic locations reveals new cancer subtypes
Many types of tumors exhibit chromosomal losses or gains, as well as local
amplifications and deletions. Within any given tumor type, sample specific
amplifications and deletionsare also observed. Typically, a region that is
aberrant in more tumors,or whose copy number change is stronger, would be
considered as a more promising candidate to be biologically relevant to cancer.
We sought for an intuitive method to define such aberrations and prioritize
them. We define V, the volume associated with an aberration, as the product of
three factors: a. fraction of patients with the aberration, b. the aberrations
length and c. its amplitude. Our algorithm compares the values of V derived
from real data to a null distribution obtained by permutations, and yields the
statistical significance, p value, of the measured value of V. We detected
genetic locations that were significantly aberrant and combined them with
chromosomal arm status to create a succint fingerprint of the tumor genome.
This genomic fingerprint is used to visualize the tumors, highlighting events
that are co ocurring or mutually exclusive. We allpy the method on three
different public array CGH datasets of Medulloblastoma and Neuroblastoma, and
demonstrate its ability to detect chromosomal regions that were known to be
altered in the tested cancer types, as well as to suggest new genomic locations
to be tested. We identified a potential new subtype of Medulloblastoma, which
is analogous to Neuroblastoma type 1.Comment: 34 pages, 3 figures; to appear in Cancer Informatic
Recommended from our members
Identification and separation of DNA mixtures using peak area information (Updated version of Statistical Research Paper No. 25)
We introduce a new methodology, based upon probabilistic expert systems, for analysing forensic identification problems involving DNA mixture traces using quantitative peak area information. Peak area is modelled with conditional Gaussian distributions. The expert system can be used for ascertaining whether individuals, whose profiles have been measured, have contributed to the mixture, but also to predict DNA profiles of unknown contributors by separating the mixture into its individual components. The potential of our probabilistic methodology is illustrated on case data examples and compared with alternative approaches. The advantages are that identification and separation issues can be handled in a unified way within a single probabilistic model and the uncertainty associated with the analysis is quantified. Further work, required to bring the methodology to a point where it could be applied to the routine analysis of casework, is discussed
Recommended from our members
DNA copy number motifs are strong and independent predictors of survival in breast cancer.
Somatic copy number alterations are a frequent sign of genome instability in cancer. A precise characterization of the genome architecture would reveal underlying instability mechanisms and provide an instrument for outcome prediction and treatment guidance. Here we show that the local spatial behavior of copy number profiles conveys important information about this architecture. Six filters were defined to characterize regional traits in copy number profiles, and the resulting Copy Aberration Regional Mapping Analysis (CARMA) algorithm was applied to tumors in four breast cancer cohorts (nā=ā2919). The derived motifs represent a layer of information that complements established molecular classifications of breast cancer. A score reflecting presence or absence of motifs provided a highly significant independent prognostic predictor. Results were consistent between cohorts. The nonsite-specific occurrence of the detected patterns suggests that CARMA captures underlying replication and repair defects and could have a future potential in treatment stratification
Pooled DNA sequencing to identify SNPs associated with a major QTL for bacterial wilt resistance in Italian ryegrass (Lolium multiflorum Lam.)
peer-reviewedItalian ryegrass (Lolium multiflorum Lam.) is one of the most important forage grass species in temperate regions. Its yield, quality and persistency can significantly be reduced by bacterial wilt, a serious disease caused by Xanthomonas translucens pv. graminis. Although a major QTL for bacterial wilt resistance has previously been reported, detailed knowledge on underlying genes and DNA markers to allow for efficient resistance breeding strategies is currently not available. We used pooled DNA sequencing to characterize a major QTL for bacterial wilt resistance of Italian ryegrass and to develop inexpensive sequence-based markers to efficiently target resistance alleles for marker-assisted recurrent selection. From the mapping population segregating for the QTL, DNA of 44 of the most resistant and 44 of the most susceptible F1 individuals was pooled and sequenced using the Illumina HiSeq 2000 platform. Allele frequencies of 18āĆā106 single nucleotide polymorphisms (SNP) were determined in the resistant and susceptible pool. A total of 271 SNPs on 140 scaffold sequences of the reference parental genome showed significantly different allele frequencies in both pools. We converted 44 selected SNPs to KASPā¢ markers, genetically mapped these proximal to the major QTL and thus validated their association with bacterial wilt resistance. This study highlights the power of pooled DNA sequencing to efficiently target binary traits in biparental mapping populations. It delivers genome sequence data, SNP markers and potential candidate genes which will allow to implement marker-assisted strategies to fix bacterial wilt resistance in outcrossing breeding populations of Italian ryegrass
- ā¦