131 research outputs found
A cancer cell-line titration series for evaluating somatic classification.
BackgroundAccurate detection of somatic single nucleotide variants and small insertions and deletions from DNA sequencing experiments of tumour-normal pairs is a challenging task. Tumour samples are often contaminated with normal cells confounding the available evidence for the somatic variants. Furthermore, tumours are heterogeneous so sub-clonal variants are observed at reduced allele frequencies. We present here a cell-line titration series dataset that can be used to evaluate somatic variant calling pipelines with the goal of reliably calling true somatic mutations at low allele frequencies.ResultsCell-line DNA was mixed with matched normal DNA at 8 different ratios to generate samples with known tumour cellularities, and exome sequenced on Illumina HiSeq to depths of >300×. The data was processed with several different variant calling pipelines and verification experiments were performed to assay >1500 somatic variant candidates using Ion Torrent PGM as an orthogonal technology. By examining the variants called at varying cellularities and depths of coverage, we show that the best performing pipelines are able to maintain a high level of precision at any cellularity. In addition, we estimate the number of true somatic variants undetected as cellularity and coverage decrease.ConclusionsOur cell-line titration series dataset, along with the associated verification results, was effective for this evaluation and will serve as a valuable dataset for future somatic calling algorithm development. The data is available for further analysis at the European Genome-phenome Archive under accession number EGAS00001001016. Data access requires registration through the International Cancer Genome Consortium's Data Access Compliance Office (ICGC DACO)
Next-generation sequencing identifies rare variants associated with Noonan syndrome
Noonan syndrome (NS) is a relatively common genetic disorder, characterized by typical facies, short stature, developmental delay, and cardiac abnormalities. Known causative genes account for 70-80% of clinically diagnosed NS patients, but the genetic basis for the remaining 20-30% of cases is unknown. We performed next-generation sequencing on germ-line DNA from 27 NS patients lacking a mutation in the known NS genes. We identified gain-of-function alleles in Ras-like without CAAX 1 (RIT1) and mitogen-activated protein kinase kinase 1 (MAP2K1) and previously unseen loss-of-function variants in RAS p21 protein activator 2 (RASA2) that are likely to cause NS in these patients. Expression of the mutant RASA2, MAP2K1, or RIT1 alleles in heterologous cells increased RAS-ERK pathway activation, supporting a causative role in NS pathogenesis. Two patients had more than one disease-associated variant. Moreover, the diagnosis of an individual initially thought to have NS was revised to neurofibromatosis type 1 based on an NF1 nonsense mutation detected in this patient. Another patient harbored a missense mutation in NF1 that resulted in decreased protein stability and impaired ability to suppress RAS-ERK activation; however, this patient continues to exhibit a NS-like phenotype. In addition, a nonsense mutation in RPS6KA3 was found in one patient initially diagnosed with NS whose diagnosis was later revised to Coffin-Lowry syndrome. Finally, we identified other potential candidates for new NS genes, as well as potential carrier alleles for unrelated syndromes. Taken together, our data suggest that next-generation sequencing can provide a useful adjunct to RASopathy diagnosis and emphasize that the standard clinical categories for RASopathies might not be adequate to describe all patients
The Canadian VirusSeq Data Portal & Duotang: open resources for SARS-CoV-2 viral sequences and genomic epidemiology
The COVID-19 pandemic led to a large global effort to sequence SARS-CoV-2
genomes from patient samples to track viral evolution and inform public health
response. Millions of SARS-CoV-2 genome sequences have been deposited in global
public repositories. The Canadian COVID-19 Genomics Network (CanCOGeN -
VirusSeq), a consortium tasked with coordinating expanded sequencing of
SARS-CoV-2 genomes across Canada early in the pandemic, created the Canadian
VirusSeq Data Portal, with associated data pipelines and procedures, to support
these efforts. The goal of VirusSeq was to allow open access to Canadian
SARS-CoV-2 genomic sequences and enhanced, standardized contextual data that
were unavailable in other repositories and that meet FAIR standards (Findable,
Accessible, Interoperable and Reusable). The Portal data submission pipeline
contains data quality checking procedures and appropriate acknowledgement of
data generators that encourages collaboration. Here we also highlight Duotang,
a web platform that presents genomic epidemiology and modeling analyses on
circulating and emerging SARS-CoV-2 variants in Canada. Duotang presents
dynamic changes in variant composition of SARS-CoV-2 in Canada and by province,
estimates variant growth, and displays complementary interactive
visualizations, with a text overview of the current situation. The VirusSeq
Data Portal and Duotang resources, alongside additional analyses and resources
computed from the Portal (COVID-MVP, CoVizu), are all open-source and freely
available. Together, they provide an updated picture of SARS-CoV-2 evolution to
spur scientific discussions, inform public discourse, and support communication
with and within public health authorities. They also serve as a framework for
other jurisdictions interested in open, collaborative sequence data sharing and
analyses
The dental calculus metabolome in modern and historic samples.
INTRODUCTION: Dental calculus is a mineralized microbial dental plaque biofilm that forms throughout life by precipitation of salivary calcium salts. Successive cycles of dental plaque growth and calcification make it an unusually well-preserved, long-term record of host-microbial interaction in the archaeological record. Recent studies have confirmed the survival of authentic ancient DNA and proteins within historic and prehistoric dental calculus, making it a promising substrate for investigating oral microbiome evolution via direct measurement and comparison of modern and ancient specimens. OBJECTIVE: We present the first comprehensive characterization of the human dental calculus metabolome using a multi-platform approach. METHODS: Ultra performance liquid chromatography-tandem mass spectrometry (UPLC-MS/MS) quantified 285 metabolites in modern and historic (200Â years old) dental calculus, including metabolites of drug and dietary origin. A subset of historic samples was additionally analyzed by high-resolution gas chromatography-MS (GC-MS) and UPLC-MS/MS for further characterization of metabolites and lipids. Metabolite profiles of modern and historic calculus were compared to identify patterns of persistence and loss. RESULTS: Dipeptides, free amino acids, free nucleotides, and carbohydrates substantially decrease in abundance and ubiquity in archaeological samples, with some exceptions. Lipids generally persist, and saturated and mono-unsaturated medium and long chain fatty acids appear to be well-preserved, while metabolic derivatives related to oxidation and chemical degradation are found at higher levels in archaeological dental calculus than fresh samples. CONCLUSIONS: The results of this study indicate that certain metabolite classes have higher potential for recovery over long time scales and may serve as appropriate targets for oral microbiome evolutionary studies
Campylobacter jejuni transcriptome changes during loss of culturability in water
Background:
Water serves as a potential reservoir for Campylobacter, the leading cause of bacterial gastroenteritis in humans. However, little is understood about the mechanisms underlying variations in survival characteristics between different strains of C. jejuni in natural environments, including water.
Results:
We identified three Campylobacter jejuni strains that exhibited variability in their ability to retain culturability after suspension in tap water at two different temperatures (4°C and 25°C). Of the three strains C. jejuni M1 exhibited the most rapid loss of culturability whilst retaining viability. Using RNAseq transcriptomics, we characterised C. jejuni M1 gene expression in response to suspension in water by analyzing bacterial suspensions recovered immediately after introduction into water (Time 0), and from two sampling time/temperature combinations where considerable loss of culturability was evident, namely (i) after 24 h at 25°C, and (ii) after 72 h at 4°C. Transcript data were compared with a culture-grown control. Some gene expression characteristics were shared amongst the three populations recovered from water, with more genes being up-regulated than down. Many of the up-regulated genes were identified in the Time 0 sample, whereas the majority of down-regulated genes occurred in the 25°C (24 h) sample.
Conclusions:
Variations in expression were found amongst genes associated with oxygen tolerance, starvation and osmotic stress. However, we also found upregulation of flagellar assembly genes, accompanied by down-regulation of genes involved in chemotaxis. Our data also suggested a switch from secretion via the sec system to via the tat system, and that the quorum sensing gene luxS may be implicated in the survival of strain M1 in water. Variations in gene expression also occurred in accessory genome regions. Our data suggest that despite the loss of culturability, C. jejuni M1 remains viable and adapts via specific changes in gene expression
Landscape of somatic single nucleotide variants and indels in colorectal cancer and impact on survival
Colorectal cancer (CRC) is a biologically heterogeneous disease. To characterize its mutational profile, we conduct targeted sequencing of 205 genes for 2,105 CRC cases with survival data. Our data shows several findings in addition to enhancing the existing knowledge of CRC. We identify PRKCI, SPZ1, MUTYH, MAP2K4, FETUB, and TGFBR2 as additional genes significantly mutated in CRC. We find that among hypermutated tumors, an increased mutation burden is associated with improved CRC-specific survival (HR=0.42, 95% CI: 0.21-0.82). Mutations in TP53 are associated with poorer CRC-specific survival, which is most pronounced in cases carrying TP53 mutations with predicted 0% transcriptional activity (HR=1.53, 95% CI: 1.21-1.94). Furthermore, we observe differences in mutational frequency of several genes and pathways by tumor location, stage, and sex. Overall, this large study provides deep insights into somatic mutations in CRC, and their potential relationships with survival and tumor features. Large scale sequencing study is of paramount importance to unravel the heterogeneity of colorectal cancer. Here, the authors sequenced 205 cancer genes in more than 2000 tumours and identified additional mutated driver genes, determined that mutational burden and specific mutations in TP53 are associated with survival odds
Pancreatic cancer genomes reveal aberrations in axon guidance pathway genes.
Pancreatic cancer is a highly lethal malignancy with few effective therapies. We performed exome sequencing and copy number analysis to define genomic aberrations in a prospectively accrued clinical cohort (n = 142) of early (stage I and II) sporadic pancreatic ductal adenocarcinoma. Detailed analysis of 99 informative tumours identified substantial heterogeneity with 2,016 non-silent mutations and 1,628 copy-number variations. We define 16 significantly mutated genes, reaffirming known mutations (KRAS, TP53, CDKN2A, SMAD4, MLL3, TGFBR2, ARID1A and SF3B1), and uncover novel mutated genes including additional genes involved in chromatin modification (EPC1 and ARID2), DNA damage repair (ATM) and other mechanisms (ZIM2, MAP2K4, NALCN, SLC16A4 and MAGEA6). Integrative analysis with in vitro functional data and animal models provided supportive evidence for potential roles for these genetic aberrations in carcinogenesis. Pathway-based analysis of recurrently mutated genes recapitulated clustering in core signalling pathways in pancreatic ductal adenocarcinoma, and identified new mutated genes in each pathway. We also identified frequent and diverse somatic aberrations in genes described traditionally as embryonic regulators of axon guidance, particularly SLIT/ROBO signalling, which was also evident in murine Sleeping Beauty transposon-mediated somatic mutagenesis models of pancreatic cancer, providing further supportive evidence for the potential involvement of axon guidance genes in pancreatic carcinogenesis
Neuroanatomical heterogeneity and homogeneity in individuals at clinical high risk for psychosis
Individuals at Clinical High Risk for Psychosis (CHR-P) demonstrate heterogeneity in clinical profiles and outcome features. However, the extent of neuroanatomical heterogeneity in the CHR-P state is largely undetermined. We aimed to quantify the neuroanatomical heterogeneity in structural magnetic resonance imaging measures of cortical surface area (SA), cortical thickness (CT), subcortical volume (SV), and intracranial volume (ICV) in CHR-P individuals compared with healthy controls (HC), and in relation to subsequent transition to a first episode of psychosis. The ENIGMA CHR-P consortium applied a harmonised analysis to neuroimaging data across 29 international sites, including 1579 CHR-P individuals and 1243 HC, offering the largest pooled CHR-P neuroimaging dataset to date. Regional heterogeneity was indexed with the Variability Ratio (VR) and Coefficient of Variation (CV) ratio applied at the group level. Personalised estimates of heterogeneity of SA, CT and SV brain profiles were indexed with the novel Person-Based Similarity Index (PBSI), with two complementary applications. First, to assess the extent of within-diagnosis similarity or divergence of neuroanatomical profiles between individuals. Second, using a normative modelling approach, to assess the ‘normativeness’ of neuroanatomical profiles in individuals at CHR-P. CHR-P individuals demonstrated no greater regional heterogeneity after applying FDR corrections. However, PBSI scores indicated significantly greater neuroanatomical divergence in global SA, CT and SV profiles in CHR-P individuals compared with HC. Normative PBSI analysis identified 11 CHR-P individuals (0.70%) with marked deviation (>1.5 SD) in SA, 118 (7.47%) in CT and 161 (10.20%) in SV. Psychosis transition was not significantly associated with any measure of heterogeneity. Overall, our examination of neuroanatomical heterogeneity within the CHR-P state indicated greater divergence in neuroanatomical profiles at an individual level, irrespective of psychosis conversion. Further large-scale investigations are required of those who demonstrate marked deviation.publishedVersio
- …