225 research outputs found

    A cancer cell-line titration series for evaluating somatic classification.

    Get PDF
    BackgroundAccurate detection of somatic single nucleotide variants and small insertions and deletions from DNA sequencing experiments of tumour-normal pairs is a challenging task. Tumour samples are often contaminated with normal cells confounding the available evidence for the somatic variants. Furthermore, tumours are heterogeneous so sub-clonal variants are observed at reduced allele frequencies. We present here a cell-line titration series dataset that can be used to evaluate somatic variant calling pipelines with the goal of reliably calling true somatic mutations at low allele frequencies.ResultsCell-line DNA was mixed with matched normal DNA at 8 different ratios to generate samples with known tumour cellularities, and exome sequenced on Illumina HiSeq to depths of >300×. The data was processed with several different variant calling pipelines and verification experiments were performed to assay >1500 somatic variant candidates using Ion Torrent PGM as an orthogonal technology. By examining the variants called at varying cellularities and depths of coverage, we show that the best performing pipelines are able to maintain a high level of precision at any cellularity. In addition, we estimate the number of true somatic variants undetected as cellularity and coverage decrease.ConclusionsOur cell-line titration series dataset, along with the associated verification results, was effective for this evaluation and will serve as a valuable dataset for future somatic calling algorithm development. The data is available for further analysis at the European Genome-phenome Archive under accession number EGAS00001001016. Data access requires registration through the International Cancer Genome Consortium's Data Access Compliance Office (ICGC DACO)

    Surgical site infection after gastrointestinal surgery in high-income, middle-income, and low-income countries: a prospective, international, multicentre cohort study

    Get PDF
    Background: Surgical site infection (SSI) is one of the most common infections associated with health care, but its importance as a global health priority is not fully understood. We quantified the burden of SSI after gastrointestinal surgery in countries in all parts of the world. Methods: This international, prospective, multicentre cohort study included consecutive patients undergoing elective or emergency gastrointestinal resection within 2-week time periods at any health-care facility in any country. Countries with participating centres were stratified into high-income, middle-income, and low-income groups according to the UN's Human Development Index (HDI). Data variables from the GlobalSurg 1 study and other studies that have been found to affect the likelihood of SSI were entered into risk adjustment models. The primary outcome measure was the 30-day SSI incidence (defined by US Centers for Disease Control and Prevention criteria for superficial and deep incisional SSI). Relationships with explanatory variables were examined using Bayesian multilevel logistic regression models. This trial is registered with ClinicalTrials.gov, number NCT02662231. Findings: Between Jan 4, 2016, and July 31, 2016, 13 265 records were submitted for analysis. 12 539 patients from 343 hospitals in 66 countries were included. 7339 (58·5%) patient were from high-HDI countries (193 hospitals in 30 countries), 3918 (31·2%) patients were from middle-HDI countries (82 hospitals in 18 countries), and 1282 (10·2%) patients were from low-HDI countries (68 hospitals in 18 countries). In total, 1538 (12·3%) patients had SSI within 30 days of surgery. The incidence of SSI varied between countries with high (691 [9·4%] of 7339 patients), middle (549 [14·0%] of 3918 patients), and low (298 [23·2%] of 1282) HDI (p < 0·001). The highest SSI incidence in each HDI group was after dirty surgery (102 [17·8%] of 574 patients in high-HDI countries; 74 [31·4%] of 236 patients in middle-HDI countries; 72 [39·8%] of 181 patients in low-HDI countries). Following risk factor adjustment, patients in low-HDI countries were at greatest risk of SSI (adjusted odds ratio 1·60, 95% credible interval 1·05–2·37; p=0·030). 132 (21·6%) of 610 patients with an SSI and a microbiology culture result had an infection that was resistant to the prophylactic antibiotic used. Resistant infections were detected in 49 (16·6%) of 295 patients in high-HDI countries, in 37 (19·8%) of 187 patients in middle-HDI countries, and in 46 (35·9%) of 128 patients in low-HDI countries (p < 0·001). Interpretation: Countries with a low HDI carry a disproportionately greater burden of SSI than countries with a middle or high HDI and might have higher rates of antibiotic resistance. In view of WHO recommendations on SSI prevention that highlight the absence of high-quality interventional research, urgent, pragmatic, randomised trials based in LMICs are needed to assess measures aiming to reduce this preventable complication

    Next-generation sequencing identifies rare variants associated with Noonan syndrome

    Get PDF
    Noonan syndrome (NS) is a relatively common genetic disorder, characterized by typical facies, short stature, developmental delay, and cardiac abnormalities. Known causative genes account for 70-80% of clinically diagnosed NS patients, but the genetic basis for the remaining 20-30% of cases is unknown. We performed next-generation sequencing on germ-line DNA from 27 NS patients lacking a mutation in the known NS genes. We identified gain-of-function alleles in Ras-like without CAAX 1 (RIT1) and mitogen-activated protein kinase kinase 1 (MAP2K1) and previously unseen loss-of-function variants in RAS p21 protein activator 2 (RASA2) that are likely to cause NS in these patients. Expression of the mutant RASA2, MAP2K1, or RIT1 alleles in heterologous cells increased RAS-ERK pathway activation, supporting a causative role in NS pathogenesis. Two patients had more than one disease-associated variant. Moreover, the diagnosis of an individual initially thought to have NS was revised to neurofibromatosis type 1 based on an NF1 nonsense mutation detected in this patient. Another patient harbored a missense mutation in NF1 that resulted in decreased protein stability and impaired ability to suppress RAS-ERK activation; however, this patient continues to exhibit a NS-like phenotype. In addition, a nonsense mutation in RPS6KA3 was found in one patient initially diagnosed with NS whose diagnosis was later revised to Coffin-Lowry syndrome. Finally, we identified other potential candidates for new NS genes, as well as potential carrier alleles for unrelated syndromes. Taken together, our data suggest that next-generation sequencing can provide a useful adjunct to RASopathy diagnosis and emphasize that the standard clinical categories for RASopathies might not be adequate to describe all patients

    The Canadian VirusSeq Data Portal & Duotang: open resources for SARS-CoV-2 viral sequences and genomic epidemiology

    Full text link
    The COVID-19 pandemic led to a large global effort to sequence SARS-CoV-2 genomes from patient samples to track viral evolution and inform public health response. Millions of SARS-CoV-2 genome sequences have been deposited in global public repositories. The Canadian COVID-19 Genomics Network (CanCOGeN - VirusSeq), a consortium tasked with coordinating expanded sequencing of SARS-CoV-2 genomes across Canada early in the pandemic, created the Canadian VirusSeq Data Portal, with associated data pipelines and procedures, to support these efforts. The goal of VirusSeq was to allow open access to Canadian SARS-CoV-2 genomic sequences and enhanced, standardized contextual data that were unavailable in other repositories and that meet FAIR standards (Findable, Accessible, Interoperable and Reusable). The Portal data submission pipeline contains data quality checking procedures and appropriate acknowledgement of data generators that encourages collaboration. Here we also highlight Duotang, a web platform that presents genomic epidemiology and modeling analyses on circulating and emerging SARS-CoV-2 variants in Canada. Duotang presents dynamic changes in variant composition of SARS-CoV-2 in Canada and by province, estimates variant growth, and displays complementary interactive visualizations, with a text overview of the current situation. The VirusSeq Data Portal and Duotang resources, alongside additional analyses and resources computed from the Portal (COVID-MVP, CoVizu), are all open-source and freely available. Together, they provide an updated picture of SARS-CoV-2 evolution to spur scientific discussions, inform public discourse, and support communication with and within public health authorities. They also serve as a framework for other jurisdictions interested in open, collaborative sequence data sharing and analyses

    Proximal and distal effects of genetic susceptibility to multiple sclerosis on the T cell epigenome

    Full text link
    Identifying the effects of genetic variation on the epigenome in disease-relevant cell types can help advance our understanding of the first molecular contributions of genetic susceptibility to disease onset. Here, we establish a genome-wide map of DNA methylation quantitative trait loci in CD4+ T-cells isolated from multiple sclerosis patients. Utilizing this map in a colocalization analysis, we identify 19 loci where the same haplotype drives both multiple sclerosis susceptibility and local DNA methylation. We also identify two distant methylation effects of multiple sclerosis susceptibility loci: a chromosome 16 locus affects PRDM8 methylation (a chromosome 4 region not previously associated with multiple sclerosis), and the aggregate effect of multiple sclerosis-associated variants in the major histocompatibility complex influences DNA methylation near PRKCA (chromosome 17). Overall, we present a new resource for a key cell type in inflammatory disease research and uncover new gene targets for the study of predisposition to multiple sclerosis

    Molecular taxonomy of myelodysplastic syndromes and its clinical implications

    Get PDF
    Myelodysplastic syndromes (MDS) are clonal hematologic disorders characterized by morphologic abnormalities of myeloid cells and peripheral cytopenias. Although genetic abnormalities underlie the pathogenesis of these disorders and their heterogeneity, current classifications of MDS rely predominantly on morphology. We performed genomic profiling of 3233 patients with MDS or related disorders to delineate molecular subtypes and define their clinical implications. Gene mutations, copy-number alterations, and copy-neutral loss of heterozygosity were derived from targeted sequencing of a 152-gene panel, with abnormalities identified in 91%, 43%, and 11% of patients, respectively. We characterized 16 molecular groups, encompassing 86% of patients, using information from 21 genes, 6 cytogenetic events, and loss of heterozygosity at the TP53 and TET2 loci. Two residual groups defined by negative findings (molecularly not otherwise specified, absence of recurrent drivers) comprised 14% of patients. The groups varied in size from 0.5% to 14% of patients and were associated with distinct clinical phenotypes and outcomes. The median bone marrow (BM) blast percentage across groups ranged from 1.5% to 10%, and the median overall survival ranged from 0.9 to 8.2 years. We validated 5 well-characterized entities, added further evidence to support 3 previously reported subsets, and described 8 novel groups. The prognostic influence of BM blasts depended on the genetic subtypes. Within genetic subgroups, therapy-related MDS and myelodysplastic/myeloproliferative neoplasms had comparable clinical and outcome profiles to primary MDS. In conclusion, genetically-derived subgroups of MDS are clinically relevant and might inform future classification schemas and translational therapeutic research

    Landscape of somatic single nucleotide variants and indels in colorectal cancer and impact on survival

    Get PDF
    Colorectal cancer (CRC) is a biologically heterogeneous disease. To characterize its mutational profile, we conduct targeted sequencing of 205 genes for 2,105 CRC cases with survival data. Our data shows several findings in addition to enhancing the existing knowledge of CRC. We identify PRKCI, SPZ1, MUTYH, MAP2K4, FETUB, and TGFBR2 as additional genes significantly mutated in CRC. We find that among hypermutated tumors, an increased mutation burden is associated with improved CRC-specific survival (HR=0.42, 95% CI: 0.21-0.82). Mutations in TP53 are associated with poorer CRC-specific survival, which is most pronounced in cases carrying TP53 mutations with predicted 0% transcriptional activity (HR=1.53, 95% CI: 1.21-1.94). Furthermore, we observe differences in mutational frequency of several genes and pathways by tumor location, stage, and sex. Overall, this large study provides deep insights into somatic mutations in CRC, and their potential relationships with survival and tumor features. Large scale sequencing study is of paramount importance to unravel the heterogeneity of colorectal cancer. Here, the authors sequenced 205 cancer genes in more than 2000 tumours and identified additional mutated driver genes, determined that mutational burden and specific mutations in TP53 are associated with survival odds

    Structural covariance network topology in individuals at clinical high risk for psychosis: the ENIGMA-CHR Study

    Get PDF
    Brain network architecture is anticipated to influence future grey matter loss in individuals at Clinical High Risk (CHR) for psychosis. However, existing studies on grey matter structural network properties in CHR are scarce and constrained by small sample sizes. Here, we examined network topology differences comparing a) CHR versus healthy controls (HC); b) CHR who transitioned to psychosis (CHR-T) versus those who did not (CHR-NT); and c) different subsyndromes. We included structural scans from 1842 CHR individuals and 1417 HC individuals from 31 sites within the Enhancing NeuroImaging Genetics through Meta-Analysis (ENIGMA) consortium. At the global level, CHR individuals exhibited lower structural covariance (q < 0.001; Cohen's d = 0.164) and less optimal structural network configuration than HC (lower global efficiency and clustering coefficient, d = 0.100,0.087, qs <= 0.027). Though no global difference between CHR-T and CHR-NT, network distinctiveness of the frontal and temporal surface area networks was higher in CHR-T than CHR-NT (d = 0.223,0.237) and HC (d = 0.208,0.219) (qs < 0.001). Network distinctiveness of the frontal cortical thickness network was lower in CHR-T (d = 0.218, q < 0.001) than CHR-NT and HC (d = 0.165, q < 0.001). Importantly, higher network distinctiveness was associated with worse positive symptoms in CHR-NT (frontal surface area, q = 0.008, R2 = 0.013) and at trend with worse negative symptoms in CHR-T (frontal thickness, q = 0.063, R2 = 0.049). Further, the brief intermittent psychotic syndrome subgroup showed more severe network alterations. Together, brain structural networks inform symptoms and the risk of transition to psychosis in CHR individuals
    corecore