64 research outputs found

    A somatic-mutational process recurrently duplicates germline susceptibility loci and tissue-specific super-enhancers in breast cancers

    Get PDF
    Somatic rearrangements contribute to the mutagenized landscape of cancer genomes. Here, we systematically interrogated rearrangements in 560 breast cancers by using a piecewise constant fitting approach. We identified 33 hotspots of large (>100 kb) tandem duplications, a mutational signature associated with homologous-recombination-repair deficiency. Notably, these tandem-duplication hotspots were enriched in breast cancer germline susceptibility loci (odds ratio (OR) = 4.28) and breast-specific 'super-enhancer' regulatory elements (OR = 3.54). These hotspots may b

    Dynamics of gene silencing during X inactivation using allele-specific RNA-seq

    Get PDF
    Background: During early embryonic development, one of the two X chromosomes in mammalian female cells is inactivated to compensate for a potential imbalance in transcript levels with male cells, which contain a single X chromosome. Here, we use mouse female embryonic stem cells (ESCs) with non-random X chromosome inactivation (XCI) and polymorphic X chromosomes to study the dynamics of gene silencing over the inactive X chromosome by high-resolution allele-specific RNA-seq. Results: Induction of XCI by differentiation of female ESCs shows that genes proximal to the X-inactivation center are silenced earlier than distal genes, while lowly expressed genes show faster XCI dynamics than highly expressed genes. The active X chromosome shows a minor but significant increase in gene activity during differentiation, resulting in complete dosage compensation in differentiated cell types. Genes escaping XCI show little or no silencing during early propagation of XCI. Allele-specific RNA-seq of neural progenitor cells generated from the female ESCs identifies three regions distal to the X-inactivation center that escape XCI. These regions, which stably escape during propagation and maintenance of XCI, coincide with topologically associating domains (TADs) as present in the female ESCs. Also, the previously characterized gene clusters escaping XCI in human fibroblasts correlate with TADs. Conclusions: The gene silencing observed during XCI provides further insight in the establishment of the repressive complex formed by the inactive X chromosome. The association of e

    Processed pseudogenes acquired somatically during cancer development

    Get PDF
    Cancer evolves by mutation, with somatic reactivation of retrotransposons being one such mutational process. Germline retrotransposition can cause processed pseudogenes, but whether this occurs somatically has not been evaluated. Here we screen sequencing data from 660 cancer samples for somatically acquired pseudogenes. We find 42 events in 17 samples, especially non-small cell lung cancer (5/27) and colorectal cancer (2/11). Genomic features mirror those of germline LINE element retrotranspositions, with frequent target-site duplications (67%), consensus TTTTAA sites at insertion points, inverted rearrangements (21%), 5′ truncation (74%) and polyA tails (88%). Transcriptional consequences include expression of pseudogenes from UTRs or introns of target genes. In addition, a somatic pseudogene that integrated into the promoter and first exon of the tumour suppressor gene, MGA, abrogated expression from that allele. Thus, formation of processed pseudogenes represents a new class of mutation occurring during cancer development, with potentially diverse functional consequences depending on genomic context

    Retrospective evaluation of whole exome and genome mutation calls in 746 cancer samples

    No full text
    Funder: NCI U24CA211006Abstract: The Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC) curated consensus somatic mutation calls using whole exome sequencing (WES) and whole genome sequencing (WGS), respectively. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2,658 cancers across 38 tumour types, we compare WES and WGS side-by-side from 746 TCGA samples, finding that ~80% of mutations overlap in covered exonic regions. We estimate that low variant allele fraction (VAF < 15%) and clonal heterogeneity contribute up to 68% of private WGS mutations and 71% of private WES mutations. We observe that ~30% of private WGS mutations trace to mutations identified by a single variant caller in WES consensus efforts. WGS captures both ~50% more variation in exonic regions and un-observed mutations in loci with variable GC-content. Together, our analysis highlights technological divergences between two reproducible somatic variant detection efforts

    Vol. 23 ISMB/ECCB 2007, pages i195–i204 BIOINFORMATICS doi:10.1093/bioinformatics/btm200 Optimized design and assessment of whole genome tiling arrays

    No full text
    Motivation: Recent advances in microarray technologies have made it feasible to interrogate whole genomes with tiling arrays and this technique is rapidly becoming one of the most important highthroughput functional genomics assays. For large mammalian genomes, analyzing oligonucleotide tiling array data is complicated by the presence of non-unique sequences on the array, which increases the overall noise in the data and may lead to false positive results due to cross-hybridization. The ability to create custom microarrays using maskless array synthesis has led us to consider ways to optimize array design characteristics for improving data quality and analysis. We have identified a number of design parameters to be optimized including uniqueness of the probe sequences within the whole genome, melting temperature and selfhybridization potential. Results: We introduce the uniqueness score, U, a novel quality measure for oligonucleotide probes and present a method to quickly compute it. We show that U is equivalent to the number of shortest unique substrings in the probe and describe an efficient greedy algorithm to design mammalian whole genome tiling arrays using probes that maximize U. Using the mouse genome, we demonstrate how several optimizations influence the tiling array design characteristics. With a sensible set of parameters, our designs cover 78 % of the mouse genome including many regions previously considered ‘untilable ’ due to the presence of repetitive sequence. Finally, we compare our whole genome tiling array designs with commercially available designs. Availability: Source code is available under an open source license fro

    Histone variant innovation in a rapidly evolving chordate lineage

    No full text
    Abstract Background Histone variants alter the composition of nucleosomes and play crucial roles in transcription, chromosome segregation, DNA repair, and sperm compaction. Modification of metazoan histone variant lineages occurs on a background of genome architecture that shows global similarities from sponges to vertebrates, but the urochordate, Oikopleura dioica, a member of the sister group to vertebrates, exhibits profound modification of this ancestral architecture. Results We show that a histone complement of 47 gene loci encodes 31 histone variants, grouped in distinct sets of developmental expression profiles throughout the life cycle. A particularly diverse array of 15 male-specific histone variants was uncovered, including a testes-specific H4t, the first metazoan H4 sequence variant reported. Universal histone variants H3.3, CenH3, and H2A.Z are present but O. dioica lacks homologs of macroH2A and H2AX. The genome encodes many H2A and H2B variants and the repertoire of H2A.Z isoforms is expanded through alternative splicing, incrementally regulating the number of acetylatable lysine residues in the functionally important N-terminal "charge patch". Mass spectrometry identified 40 acetylation, methylation and ubiquitylation posttranslational modifications (PTMs) and showed that hallmark PTMs of "active" and "repressive" chromatin were present in O. dioica. No obvious reduction in silent heterochromatic marks was observed despite high gene density in this extraordinarily compacted chordate genome. Conclusions These results show that histone gene complements and their organization differ considerably even over modest phylogenetic distances. Substantial innovation among all core and linker histone variants has evolved in concert with adaptation of specific life history traits in this rapidly evolving chordate lineage.</p
    corecore