38 research outputs found

    Rearrangement processes and structural variations show evidence of selection in oesophageal adenocarcinomas

    Get PDF
    Oesophageal adenocarcinoma (OAC) provides an ideal case study to characterize large-scale rearrangements. Using whole genome short-read sequencing of 383 cases, for which 214 had matched whole transcriptomes, we observed structural variations (SV) with a predominance of deletions, tandem duplications and inter-chromosome junctions that could be identified as LINE-1 mobile element (ME) insertions. Complex clusters of rearrangements resembling breakage-fusion-bridge cycles or extrachromosomal circular DNA accounted for 22% of complex SVs affecting known oncogenes. Counting SV events affecting known driver genes substantially increased the recurrence rates of these drivers. After excluding fragile sites, we identified 51 candidate new drivers in genomic regions disrupted by SVs, including ETV5, KAT6B and CLTC. RUNX1 was the most recurrently altered gene (24%), with many deletions inactivating the RUNT domain but preserved the reading frame, suggesting an altered protein product. These findings underscore the importance of identification of SV events in OAC with implications for targeted therapies.</p

    Rearrangement processes and structural variations show evidence of selection in oesophageal adenocarcinomas

    Get PDF
    Oesophageal adenocarcinoma (OAC) provides an ideal case study to characterize large-scale rearrangements. Using whole genome short-read sequencing of 383 cases, for which 214 had matched whole transcriptomes, we observed structural variations (SV) with a predominance of deletions, tandem duplications and inter-chromosome junctions that could be identified as LINE-1 mobile element (ME) insertions. Complex clusters of rearrangements resembling breakage-fusion-bridge cycles or extrachromosomal circular DNA accounted for 22% of complex SVs affecting known oncogenes. Counting SV events affecting known driver genes substantially increased the recurrence rates of these drivers. After excluding fragile sites, we identified 51 candidate new drivers in genomic regions disrupted by SVs, including ETV5, KAT6B and CLTC. RUNX1 was the most recurrently altered gene (24%), with many deletions inactivating the RUNT domain but preserved the reading frame, suggesting an altered protein product. These findings underscore the importance of identification of SV events in OAC with implications for targeted therapies.</p

    The repertoire of mutational signatures in human cancer.

    Get PDF
    Somatic mutations in cancer genomes are caused by multiple mutational processes, each of which generates a characteristic mutational signature1. Here, as part of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium2 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), we characterized mutational signatures using 84,729,690 somatic mutations from 4,645 whole-genome and 19,184 exome sequences that encompass most types of cancer. We identified 49 single-base-substitution, 11 doublet-base-substitution, 4 clustered-base-substitution and 17 small insertion-and-deletion signatures. The substantial size of our dataset, compared with previous analyses3-15, enabled the discovery of new signatures, the separation of overlapping signatures and the decomposition of signatures into components that may represent associated-but distinct-DNA damage, repair and/or replication mechanisms. By estimating the contribution of each signature to the mutational catalogues of individual cancer genomes, we revealed associations of signatures to exogenous or endogenous exposures, as well as to defective DNA-maintenance processes. However, many signatures are of unknown cause. This analysis provides a systematic perspective on the repertoire of mutational processes that contribute to the development of human cancer

    Large expert-curated database for benchmarking document similarity detection in biomedical literature search

    Get PDF
    Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article/s. The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency-Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that these methods each tend to produce distinct collections of recommended articles, suggesting that a hybrid method may be required to completely capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for the downloading of annotation data and the blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of new powerful techniques for title and title/abstract-based search engines for relevant articles in biomedical research.Peer reviewe

    Retrospective evaluation of whole exome and genome mutation calls in 746 cancer samples

    No full text
    Funder: NCI U24CA211006Abstract: The Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC) curated consensus somatic mutation calls using whole exome sequencing (WES) and whole genome sequencing (WGS), respectively. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2,658 cancers across 38 tumour types, we compare WES and WGS side-by-side from 746 TCGA samples, finding that ~80% of mutations overlap in covered exonic regions. We estimate that low variant allele fraction (VAF < 15%) and clonal heterogeneity contribute up to 68% of private WGS mutations and 71% of private WES mutations. We observe that ~30% of private WGS mutations trace to mutations identified by a single variant caller in WES consensus efforts. WGS captures both ~50% more variation in exonic regions and un-observed mutations in loci with variable GC-content. Together, our analysis highlights technological divergences between two reproducible somatic variant detection efforts

    MUTATIONAL SIGNATURES AND THEIR APPLICATION TO DETECT AA EXPOSURE IN CANCER

    No full text
    Ph.DDOCTOR OF PHILOSOPHY (NGS

    The Drosophila Dicer-1 Partner Loquacious Enhances miRNA Processing from Hairpins with Unstable Structures at the Dicing Site

    Get PDF
    In Drosophila, Dicer-1 binds Loquacious-PB (Loqs-PB) as its major co-factor. Previous analyses indicated that loqs mutants only partially impede miRNA processing, but the activity of minor isoforms or maternally deposited Loqs was not eliminated in these studies. We addressed this by generating a cell line from loqs-null embryos and found that only ∼40% of miRNAs showed clear Loqs dependence. Genome-wide comparison of the hairpin structure and Loqs dependence suggested that Loqs substrates are influenced by base-pairing status at the dicing site. Artificial alteration of base-pairing stability at this position in model miRNA hairpins resulted in predicted changes in Loqs dependence, providing evidence for this hypothesis. Finally, we found that evolutionarily young miRNA genes tended to be Loqs dependent. We propose that Loqs may have roles in assisting the de novo emergence of miRNA genes by facilitating dicing of suboptimal hairpin substrates

    Understanding the malignant potential of gastric metaplasia of the oesophagus and its relevance to Barrett’s oesophagus surveillance: individual-level data analysis

    Get PDF
    OBJECTIVE: Whether gastric metaplasia (GM) of the oesophagus should be considered as Barrett's oesophagus (BO) is controversial. Given concern intestinal metaplasia (IM) may be missed due to sampling, the UK guidelines include GM as a type of BO. Here, we investigated whether the risk of misdiagnosis and the malignant potential of GM warrant its place in the UK surveillance. DESIGN: We performed a thorough pathology and endoscopy review to follow clinical outcomes in a novel UK cohort of 244 patients, covering 1854 person years of follow-up. We complemented this with a comparative genomic analysis of 160 GM and IM specimens, focused on early molecular hallmarks of BO and oesophageal adenocarcinoma (OAC). RESULTS: We found that 58 of 77 short-segment (<3 cm) GM (SS-GM) cases (75%) continued to be observed as GM-only across a median of 4.4 years of follow-up. We observed that disease progression in GM-only cases and GM+IM cases (cases with reported GM on some occasions, IM on others) was significantly lower than in the IM-only cases (Kaplan-Meier, p=0.03). Genomic analysis revealed that the mutation burden in GM is significantly lower than in IM (p<0.01). Moreover, GM does not bear the mutational hallmarks of OAC, with an absence of associated signatures and driver gene mutations. Finally, we established that GM found adjacent to OAC is evolutionarily distant from cancer. CONCLUSION: SS-GM is a distinct entity from SS-IM and the malignant potential of GM is lower than IM. It is questionable whether SS-GM warrants inclusion in BO surveillance

    Global epidemiology and genetics of hepatocellular carcinoma

    No full text
    Hepatocellular carcinoma (HCC) is one of the leading cancers worldwide. Classically, HCC develops in genetically susceptible individuals who are exposed to risk factors, especially in the presence of liver cirrhosis. Significant temporal and geographic variations exist for HCC and its etiologies. Over time, the burden of HCC has shifted from the low-moderate to the high sociodemographic index regions, reflecting the transition from viral to nonviral causes. Geographically, the hepatitis viruses predominate as the causes of HCC in Asia and Africa. Although there are genetic conditions that confer increased risk for HCC, these diagnoses are rarely recognized outside North America and Europe. In this review, we will evaluate the epidemiologic trends and risk factors of HCC, and discuss the genetics of HCC, including monogenic diseases, single-nucleotide polymorphisms, gut microbiome, and somatic mutations
    corecore