19 research outputs found
TrAp: a Tree Approach for Fingerprinting Subclonal Tumor Composition
Revealing the clonal composition of a single tumor is essential for
identifying cell subpopulations with metastatic potential in primary tumors or
with resistance to therapies in metastatic tumors. Sequencing technologies
provide an overview of an aggregate of numerous cells, rather than
subclonal-specific quantification of aberrations such as single nucleotide
variants (SNVs). Computational approaches to de-mix a single collective signal
from the mixed cell population of a tumor sample into its individual components
are currently not available. Herein we propose a framework for deconvolving
data from a single genome-wide experiment to infer the composition, abundance
and evolutionary paths of the underlying cell subpopulations of a tumor. The
method is based on the plausible biological assumption that tumor progression
is an evolutionary process where each individual aberration event stems from a
unique subclone and is present in all its descendants subclones. We have
developed an efficient algorithm (TrAp) for solving this mixture problem. In
silico analyses show that TrAp correctly deconvolves mixed subpopulations when
the number of subpopulations and the measurement errors are moderate. We
demonstrate the applicability of the method using tumor karyotypes and somatic
hypermutation datasets. We applied TrAp to SNV frequency profile from Exome-Seq
experiment of a renal cell carcinoma tumor sample and compared the mutational
profile of the inferred subpopulations to the mutational profiles of twenty
single cells of the same tumor. Despite the large experimental noise, specific
co-occurring mutations found in clones inferred by TrAp are also present in
some of these single cells. Finally, we deconvolve Exome-Seq data from three
distinct metastases from different body compartments of one melanoma patient
and exhibit the evolutionary relationships of their subpopulations
Picking ChIP-seq peak detectors for analyzing chromatin modification experiments
Numerous algorithms have been developed to analyze ChIP-Seq data. However, the complexity of analyzing diverse patterns of ChIP-Seq signals, especially for epigenetic marks, still calls for the development of new algorithms and objective comparisons of existing methods. We developed Qeseq, an algorithm to detect regions of increased ChIP read density relative to background. Qeseq employs critical novel elements, such as iterative recalibration and neighbor joining of reads to identify enriched regions of any length. To objectively assess its performance relative to other 14 ChIP-Seq peak finders, we designed a novel protocol based on Validation Discriminant Analysis (VDA) to optimally select validation sites and generated two validation datasets, which are the most comprehensive to date for algorithmic benchmarking of key epigenetic marks. In addition, we systematically explored a total of 315 diverse parameter configurations from these algorithms and found that typically optimal parameters in one dataset do not generalize to other datasets. Nevertheless, default parameters show the most stable performance, suggesting that they should be used. This study also provides a reproducible and generalizable methodology for unbiased comparative analysis of high-throughput sequencing tools that can facilitate future algorithmic development
Combined Immunofluorescence and DNA FISH on 3D-preserved Interphase Nuclei to Study Changes in 3D Nuclear Organization
International audienc
Language Models are Few-shot Learners for Prognostic Prediction
Clinical prediction is an essential task in the healthcare industry. However,
the recent success of transformers, on which large language models are built,
has not been extended to this domain. In this research, we explore the use of
transformers and language models in prognostic prediction for immunotherapy
using real-world patients' clinical data and molecular profiles. This paper
investigates the potential of transformers to improve clinical prediction
compared to conventional machine learning approaches and addresses the
challenge of few-shot learning in predicting rare disease areas. The study
benchmarks the efficacy of baselines and language models on prognostic
prediction across multiple cancer types and investigates the impact of
different pretrained language models under few-shot regimes. The results
demonstrate significant improvements in accuracy and highlight the potential of
NLP in clinical research to improve early detection and intervention for
different diseases.Comment: 7 pages, 5 figures, 5 table
Combined Immunofluorescence and DNA FISH on 3D-preserved Interphase Nuclei to Study Changes in 3D Nuclear Organization
Combined Immunofluorescence and DNA FISH on 3D-preserved Interphase Nuclei to Study Changes in 3D Nuclear Organization
The RAG2 C-terminus and ATM protect genome integrity by controlling antigen receptor gene cleavage.
International audienceTight control of antigen-receptor gene rearrangement is required to preserve genome integrity and prevent the occurrence of leukaemia and lymphoma. Nonetheless, mistakes can happen, leading to the generation of aberrant rearrangements, such as Tcra/d-Igh inter-locus translocations that are a hallmark of ataxia telangiectasia-mutated (ATM) deficiency. Current evidence indicates that these translocations arise from the persistence of unrepaired breaks converging at different stages of thymocyte differentiation. Here we show that a defect in feedback control of RAG2 activity gives rise to bi-locus breaks and damage on Tcra/d and Igh in the same T cell at the same developmental stage, which provides a direct mechanism for generating these inter-locus rearrangements. Both the RAG2 C-terminus and ATM prevent bi-locus RAG-mediated cleavage through modulation of three-dimensional conformation (higher-order loops) and nuclear organization of the two loci. This limits the number of potential substrates for translocation and provides an important mechanism for protecting genome stability
Genome-wide remodeling of the epigenetic landscape during myogenic differentiation
We have examined changes in the chromatin landscape during muscle differentiation by mapping the genome-wide location of ten key histone marks and transcription factors in mouse myoblasts and terminally differentiated myotubes, providing an exceptionally rich dataset that has enabled discovery of key epigenetic changes underlying myogenesis. Using this compendium, we focused on a well-known repressive mark, histone H3 lysine 27 trimethylation, and identified novel regulatory elements flanking the myogenin gene that function as a key differentiation-dependent switch during myogenesis. Next, we examined the role of Polycomb-mediated H3K27 methylation in gene repression by systematically ablating components of both PRC1 and PRC2 complexes. Surprisingly, we found mechanistic differences between transient and permanent repression of muscle differentiation and lineage commitment genes and observed that the loss of PRC1 and PRC2 components produced opposing differentiation defects. These phenotypes illustrate striking differences as compared to embryonic stem cell differentiation and suggest that PRC1 and PRC2 do not operate sequentially in muscle cells. Our studies of PRC1 occupancy also suggested a “fail-safe” mechanism, whereby PRC1/Bmi1 concentrates at genes specifying nonmuscle lineages, helping to retain H3K27me3 in the face of declining Ezh2-mediated methyltransferase activity in differentiated cells.</jats:p
