97 research outputs found

    Multi-tissue integrative analysis of personal epigenomes

    Evaluating the impact of genetic variants on transcriptional regulation is a central goal in biological science that has been constrained by reliance on a single reference genome. To address this, we constructed phased, diploid genomes for four cadaveric donors (using long-read sequencing) and systematically charted noncoding regulatory elements and transcriptional activity across more than 25 tissues from these donors. Integrative analysis revealed over a million variants with allele-specific activity, coordinated, locus-scale allelic imbalances, and structural variants impacting proximal chromatin structure. We relate the personal genome analysis to the ENCODE encyclopedia, annotating allele- and tissue-specific elements that are strongly enriched for variants impacting expression and disease phenotypes. These experimental and statistical approaches, and the corresponding EN-TEx resource, provide a framework for personalized functional genomics

    Breast cancer polygenic risk score and contralateral breast cancer risk

    Previous research has shown that polygenic risk scores (PRSs) can be used to stratify women according to their risk of developing primary invasive breast cancer. This study aimed to evaluate the association between a recently validated PRS of 313 germline variants (PRS313) and contralateral breast cancer (CBC) risk. We included 56,068 women of European ancestry diagnosed with first invasive breast cancer from 1990 onward with follow-up from the Breast Cancer Association Consortium. Metachronous CBC risk (N = 1,027) according to the distribution of PRS313 was quantified using Cox regression analyses. We assessed PRS313 interaction with age at first diagnosis, family history, morphology, ER status, PR status, and HER2 status, and (neo)adjuvant therapy. In studies of Asian women, with limited follow-up, CBC risk associated with PRS313 was assessed using logistic regression for 340 women with CBC compared with 12,133 women with unilateral breast cancer. Higher PRS313 was associated with increased CBC risk: hazard ratio per standard deviation (SD) = 1.25 (95%CI = 1.18–1.33) for Europeans, and an OR per SD = 1.15 (95%CI = 1.02–1.29) for Asians. The absolute lifetime risks of CBC, accounting for death as competing risk, were 12.4% for European women at the 10th percentile and 20.5% at the 90th percentile of PRS313. We found no evidence of confounding by or interaction with individual characteristics, characteristics of the primary tumor, or treatment. The C-index for the PRS313 alone was 0.563 (95%CI = 0.547–0.586). In conclusion, PRS313 is an independent factor associated with CBC risk and can be incorporated into CBC risk prediction models to help improve stratification and optimize surveillance and treatment strategies
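The per-SD hazard ratio reported above can be turned into a contrast between two PRS percentiles. The sketch below is illustrative only: it assumes a standard-normal PRS313 distribution and a log-linear effect, neither of which is stated in the abstract, and the function name `hazard_ratio_between` is hypothetical.

```python
from math import exp, log
from statistics import NormalDist

HR_PER_SD = 1.25  # hazard ratio per SD of PRS313 reported for European women


def hazard_ratio_between(p_low, p_high, hr_per_sd=HR_PER_SD):
    """Relative hazard between two PRS percentiles.

    Assumption (not from the abstract): the PRS is standard normal and its
    effect is log-linear, so HR = hr_per_sd ** (z_high - z_low).
    """
    std_normal = NormalDist()
    z_gap = std_normal.inv_cdf(p_high) - std_normal.inv_cdf(p_low)
    return exp(z_gap * log(hr_per_sd))


# 90th versus 10th percentile of the score
hr_90_vs_10 = hazard_ratio_between(0.10, 0.90)
print(f"{hr_90_vs_10:.2f}")
```

Under these assumptions the 90th-versus-10th-percentile contrast comes out at roughly 1.77. This is a relative hazard, not the ratio of the absolute lifetime risks quoted in the abstract, which also account for death as a competing risk.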

    Evaluation of prognostic risk models for postoperative pulmonary complications in adult patients undergoing major abdominal surgery: a systematic review and international external validation cohort study

    Background: Stratifying risk of postoperative pulmonary complications after major abdominal surgery allows clinicians to modify risk through targeted interventions and enhanced monitoring. In this study, we aimed to identify and validate prognostic models against a new consensus definition of postoperative pulmonary complications.
    Methods: We did a systematic review and international external validation cohort study. The systematic review was done in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines. We searched MEDLINE and Embase on March 1, 2020, for articles published in English that reported on risk prediction models for postoperative pulmonary complications following abdominal surgery. External validation of existing models was done within a prospective international cohort study of adult patients (≥18 years) undergoing major abdominal surgery. Data were collected between Jan 1, 2019, and April 30, 2019, in the UK, Ireland, and Australia. Discriminative ability and prognostic accuracy summary statistics were compared between models for the 30-day postoperative pulmonary complication rate as defined by the Standardised Endpoints in Perioperative Medicine Core Outcome Measures in Perioperative and Anaesthetic Care (StEP-COMPAC). Model performance was compared using the area under the receiver operating characteristic curve (AUROCC).
    Findings: In total, we identified 2903 records from our literature search; of which, 2514 (86·6%) unique records were screened, 121 (4·8%) of 2514 full texts were assessed for eligibility, and 29 unique prognostic models were identified. Nine (31·0%) of 29 models had score development reported only, 19 (65·5%) had undergone internal validation, and only four (13·8%) had been externally validated. Data to validate six eligible models were collected in the international external validation cohort study. Data from 11 591 patients were available, with an overall postoperative pulmonary complication rate of 7·8% (n=903). None of the six models showed good discrimination (defined as AUROCC ≥0·70) for identifying postoperative pulmonary complications, with the Assess Respiratory Risk in Surgical Patients in Catalonia score showing the best discrimination (AUROCC 0·700 [95% CI 0·683–0·717]).
    Interpretation: In the pre-COVID-19 pandemic data, variability in the risk of pulmonary complications (StEP-COMPAC definition) following major abdominal surgery was poorly described by existing prognostication tools. To improve surgical safety during the COVID-19 pandemic recovery and beyond, novel risk stratification tools are required.
    Funding: British Journal of Surgery Society
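For readers unfamiliar with the AUROCC metric used above, it can be read as the probability that a randomly chosen patient with a complication receives a higher risk score than a randomly chosen patient without one. A minimal sketch of that pairwise definition follows; the function name and the toy scores and outcomes are made up for illustration and are not study data.

```python
def auroc(scores, outcomes):
    """Area under the ROC curve via pairwise comparison (ties count half)."""
    cases = [s for s, y in zip(scores, outcomes) if y == 1]
    controls = [s for s, y in zip(scores, outcomes) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one non-case")
    wins = 0.0
    for case_score in cases:
        for control_score in controls:
            if case_score > control_score:
                wins += 1.0
            elif case_score == control_score:
                wins += 0.5
    return wins / (len(cases) * len(controls))


# Made-up risk scores and outcomes (1 = complication), for illustration only.
toy_scores = [0.1, 0.4, 0.35, 0.8, 0.2, 0.7]
toy_outcomes = [0, 0, 1, 1, 0, 1]
toy_auc = auroc(toy_scores, toy_outcomes)
print(round(toy_auc, 3))
```

The quadratic loop is fine for a small illustration; a cohort of the size reported here would normally use a rank-based implementation from a statistics library instead.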

    Comparative analysis of syntenic genes in grass genomes reveals accelerated rates of gene structure and coding sequence evolution in polyploid wheat

    Citation: Akhunov, E., . . . & Gill, B. (2013). Comparative Analysis of Syntenic Genes in Grass Genomes Reveals Accelerated Rates of Gene Structure and Coding Sequence Evolution in Polyploid Wheat. Plant Physiology, 161(1), 252-265. https://doi.org/10.1104/pp.112.205161
    Cycles of whole-genome duplication (WGD) and diploidization are hallmarks of eukaryotic genome evolution and speciation. Polyploid wheat (Triticum aestivum) has had a massive increase in genome size largely due to recent WGDs. How these processes may impact the dynamics of gene evolution was studied by comparing the patterns of gene structure changes, alternative splicing (AS), and codon substitution rates among wheat and model grass genomes. In orthologous gene sets, significantly more acquired and lost exonic sequences were detected in wheat than in model grasses. In wheat, 35% of these gene structure rearrangements resulted in frame-shift mutations and premature termination codons. An increased codon mutation rate in the wheat lineage compared with Brachypodium distachyon was found for 17% of orthologs. The discovery of premature termination codons in 38% of expressed genes was consistent with ongoing pseudogenization of the wheat genome. The rates of AS within the individual wheat subgenomes (21%–25%) were similar to diploid plants. However, we uncovered a high level of AS pattern divergence between the duplicated homeologous copies of genes. Our results are consistent with the accelerated accumulation of AS isoforms, nonsynonymous mutations, and gene structure rearrangements in the wheat lineage, likely due to genetic redundancy created by WGDs. Whereas these processes mostly contribute to the degeneration of a duplicated genome and its diploidization, they have the potential to facilitate the origin of new functional variations, which, upon selection in the evolutionary lineage, may play an important role in the origin of novel traits

    Identification of Potential Solid-State Li-Ion Conductors with Semi-Supervised Learning

    Despite ongoing efforts to identify high-performance electrolytes for solid-state Li-ion batteries, thousands of prospective Li-containing structures remain unexplored. Here, we employ a semi-supervised learning approach to expedite identification of ionic conductors. We screen 180 unique descriptor representations and use agglomerative clustering to cluster ~26,000 Li-containing structures. The clusters are then labeled with experimental ionic conductivity data to assess the fitness of the descriptors. By inspecting clusters containing the highest conductivity labels, we identify 212 promising structures that are further screened using bond valence site energy and nudged elastic band calculations. Li3BS3 is identified as a potential high-conductivity material and selected for experimental characterization. With sufficient defect engineering, we show that Li3BS3 is a superionic conductor with room temperature ionic conductivity greater than 1 mS cm⁻¹. While the semi-supervised method shows promise for identification of superionic conductors, the results illustrate a continued need for descriptors that explicitly encode for defects.
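The cluster-then-label idea in this abstract can be sketched in a few lines. This is a toy illustration, not the authors' pipeline: single linkage is an assumption (the abstract does not name a linkage), and the two-dimensional descriptors, the conductivity labels, and both function names are hypothetical.

```python
from math import dist


def agglomerate(points, n_clusters):
    """Naive single-linkage agglomerative clustering; returns lists of point indices."""
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > n_clusters:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Single linkage: distance between the closest pair of members.
                gap = min(dist(points[i], points[j])
                          for i in clusters[a] for j in clusters[b])
                if best is None or gap < best[0]:
                    best = (gap, a, b)
        _, a, b = best
        clusters[a].extend(clusters[b])
        del clusters[b]
    return clusters


def candidates_from_best_cluster(clusters, known_conductivity):
    """Find the cluster holding the highest known label; return its unlabeled members."""
    labeled = [c for c in clusters if any(i in known_conductivity for i in c)]
    best = max(
        labeled,
        key=lambda c: max(known_conductivity[i] for i in c if i in known_conductivity),
    )
    return sorted(i for i in best if i not in known_conductivity)


# Hypothetical 2-D descriptors for six structures and two made-up labels (mS/cm).
toy_descriptors = [(0.0, 0.1), (0.1, 0.0), (0.2, 0.1),
                   (5.0, 5.1), (5.1, 5.0), (5.2, 5.2)]
toy_labels = {0: 1e-4, 4: 2.0}
toy_clusters = agglomerate(toy_descriptors, n_clusters=2)
toy_candidates = candidates_from_best_cluster(toy_clusters, toy_labels)
print(toy_candidates)
```

Only a few structures carry labels, which is what makes the approach semi-supervised: the unlabeled members of the best-labeled cluster become the candidates passed on to more expensive screening.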

    Identification of potential solid-state Li-ion conductors with semi-supervised learning

    A semi-supervised machine learning pipeline is reported for the discovery of new Li-ion solid-state electrolytes. The approach is experimentally validated with the synthesis and characterization of a new superionic conductor predicted by the model.
