83 research outputs found

    PIVOINE: Instruction Tuning for Open-world Information Extraction

    We consider the problem of Open-world Information Extraction (Open-world IE), which extracts comprehensive entity profiles from unstructured texts. Different from the conventional closed-world setting of Information Extraction (IE), Open-world IE considers a more general situation where entities and relations could be beyond a predefined ontology. More importantly, we seek to develop a large language model (LLM) that is able to perform Open-world IE to extract desirable entity profiles characterized by (possibly fine-grained) natural language instructions. We achieve this by finetuning LLMs using instruction tuning. In particular, we construct INSTRUCTOPENWIKI, a substantial instruction tuning dataset for Open-world IE enriched with a comprehensive corpus, extensive annotations, and diverse instructions. We finetune the pretrained BLOOM models on INSTRUCTOPENWIKI and obtain PIVOINE, an LLM for Open-world IE with strong instruction-following capabilities. Our experiments demonstrate that PIVOINE significantly outperforms traditional closed-world methods and other LLM baselines, displaying impressive generalization capabilities on both unseen instructions and out-of-ontology cases. Consequently, PIVOINE emerges as a promising solution to tackle the open-world challenge in IE effectively.

    Statin use and non-melanoma skin cancer risk: a meta-analysis of randomized controlled trials and observational studies

    Background: Existing evidence of the association between statin use and non-melanoma skin cancer (NMSC) risk has been inconsistent. Objective: To maximize statistical power to synthesize prospective evidence on this relationship. Materials and Methods: PubMed, EMBASE, Web of Science, Cochrane Central Register of Controlled Trials, and ClinicalTrials.gov were systematically searched up to December 11, 2016. A random-effects meta-analysis was conducted to calculate summary estimates. Results: Our meta-analysis of 14 randomized controlled trials (RCTs) including 63,157 subjects showed no significant association between statin use and NMSC risk (RR = 1.09, 95%CI = 0.85–1.39). However, meta-analysis of four observational studies including 1,528,215 participants showed significantly increased risk of NMSC among statin users compared to non-users (RR = 1.11, 95%CI = 1.02–1.22). Furthermore, ever using lipophilic statins (RR = 1.14, 95%CI = 1.04–1.24) or lower-potency statins (RR = 1.14, 95%CI = 1.03–1.26), as well as usage of any statin longer than one year (RR = 1.14, 95%CI = 1.09–1.18), were significantly associated with increased NMSC risk based on observational studies. Conclusions: Evidence from observational studies supported an association between statin use and increased NMSC risk. This finding should be interpreted with caution due to the modest number of included studies, possible between-study heterogeneity, and inherent limitations of observational studies.
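The two pooled estimates above are close in size (RR = 1.09 and RR = 1.11) yet only one is reported as significant. A minimal sketch of why, assuming the reported intervals are symmetric on the log scale and based on a normal approximation (the abstract does not state how they were computed). The function name `z_from_ci` is hypothetical and this is not the authors' analysis; it only back-calculates a standard error, z statistic, and two-sided p-value from the published RR and 95% CI.

```python
import math

def z_from_ci(rr, lower, upper, z_crit=1.96):
    """Back-calculate SE, z, and two-sided p from a ratio and its 95% CI.

    Assumes the interval is exp(log(rr) +/- z_crit * se), i.e. a
    normal-approximation interval on the log scale (an assumption).
    """
    se = (math.log(upper) - math.log(lower)) / (2 * z_crit)
    z = math.log(rr) / se
    # Two-sided p-value under the standard normal distribution.
    p = math.erfc(abs(z) / math.sqrt(2))
    return se, z, p

# Pooled estimates quoted in the abstract.
se_rct, z_rct, p_rct = z_from_ci(1.09, 0.85, 1.39)   # 14 RCTs
se_obs, z_obs, p_obs = z_from_ci(1.11, 1.02, 1.22)   # 4 observational studies

print(f"RCTs:          SE={se_rct:.3f}  z={z_rct:.2f}  p={p_rct:.3f}")
print(f"Observational: SE={se_obs:.3f}  z={z_obs:.2f}  p={p_obs:.3f}")
```

Under these assumptions the observational estimate has a much smaller standard error, which is consistent with its far larger sample, so a similar point estimate clears the 0.05 threshold there but not in the RCTs.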

    Multiple major increases and decreases in mitochondrial substitution rates in the plant family Geraniaceae

    Background: Rates of synonymous nucleotide substitutions are, in general, exceptionally low in plant mitochondrial genomes, several times lower than in chloroplast genomes, 10-20 times lower than in plant nuclear genomes, and 50-100 times lower than in many animal mitochondrial genomes. Several cases of moderate variation in mitochondrial substitution rates have been reported in plants, but these mostly involve correlated changes in chloroplast and/or nuclear substitution rates and are therefore thought to reflect whole-organism forces rather than ones impinging directly on the mitochondrial mutation rate. Only a single case of extensive, mitochondrial-specific rate changes has been described, in the angiosperm genus Plantago. Results: We explored a second potential case of highly accelerated mitochondrial sequence evolution in plants. This case was first suggested by relatively poor hybridization of mitochondrial gene probes to DNA of Pelargonium hortorum (the common geranium). We found that all eight mitochondrial genes sequenced from P. hortorum are exceptionally divergent, whereas chloroplast and nuclear divergence is unexceptional in P. hortorum. Two mitochondrial genes were sequenced from a broad range of taxa of variable relatedness to P. hortorum, and absolute rates of mitochondrial synonymous substitutions were calculated on each branch of a phylogenetic tree of these taxa. We infer one major, approximately 10-fold increase in the mitochondrial synonymous substitution rate at the base of the Pelargonium family Geraniaceae, and a subsequent approximately 10-fold rate increase early in the evolution of Pelargonium. We also infer several moderate to major rate decreases following these initial rate increases, such that the mitochondrial substitution rate has returned to normally low levels in many members of the Geraniaceae. Finally, we find unusually little RNA editing of Geraniaceae mitochondrial genes, suggesting high levels of retroprocessing in their history. Conclusion: The existence of major, mitochondrial-specific changes in rates of synonymous substitutions in the Geraniaceae implies major and reversible underlying changes in the mitochondrial mutation rate in this family. Together with the recent report of a similar pattern of rate heterogeneity in Plantago, these findings indicate that the mitochondrial mutation rate is a more plastic character in plants than previously realized. Many molecular factors could be responsible for these dramatic changes in the mitochondrial mutation rate, including nuclear gene mutations affecting the fidelity and efficacy of mitochondrial DNA replication and/or repair and, consistent with the lack of RNA editing, exceptionally high levels of mutagenic retroprocessing. That the mitochondrial mutation rate has returned to normally low levels in many Geraniaceae raises the possibility that, akin to the ephemerality of mutator strains in bacteria, selection favors a low mutation rate in plant mitochondria.

    A division-of-labor mode contributes to the cardioprotective potential of mesenchymal stem/stromal cells in heart failure post myocardial infarction

    Background: Treatment of heart failure post myocardial infarction (post-MI HF) with mesenchymal stem/stromal cells (MSCs) holds great promise. Nevertheless, 2-dimensional (2D) GMP-grade MSCs from different labs and donor sources differ in therapeutic efficacy and are still obtained in low yield. Therefore, it is crucial to increase production and find novel ways to assess the therapeutic efficacy of MSCs. Materials and methods: hUC-MSCs were cultured in a 3-dimensional (3D) expansion system to obtain enough cells for clinical use; these are referred to as 3D MSCs. A post-MI HF mouse model was employed to conduct in vivo and in vitro experiments. Single-cell and bulk RNA-seq analyses were performed on 3D MSCs. A total of 125 combination algorithms were leveraged to screen for core ligand genes. Shinyapp and shinycell workflows were used to deploy the web server. Results: 3D GMP-grade MSCs can significantly and stably reduce the extent of post-MI HF. scRNA-seq was used to investigate the mechanism behind this stable cardioprotective potential and revealed the heterogeneity and division-of-labor mode of 3D MSCs at the cellular level. Specifically, scissor phenotypic analysis identified a previously reported wound-healing CD142+ MSC subpopulation that is also associated with cardioprotective ability, and a CD142- MSC subpopulation in a proliferative state, contributing to cardioprotective function and self-renewal, respectively. Differential expression analysis was conducted on CD142+ MSCs and CD142- MSCs, and a model based on differentially expressed ligands was obtained by employing 125 combination algorithms. The present study developed a machine learning predictive model based on 13 ligands. Further analysis using CellChat demonstrated that CD142+ MSCs have a stronger secretion capacity than CD142- MSCs, and flow cytometry sorting of the CD142+ MSCs and qRT-PCR validation confirmed the significant upregulation of these 13 ligand factors in CD142+ MSCs. Conclusion: Clinical GMP-grade 3D MSCs could serve as a stable cardioprotective cell product. Using scissor analysis on scRNA-seq data, we have identified the potential functional and proliferative subpopulations, which cooperatively contribute to self-renewal and functional maintenance of 3D MSCs, a pattern we term the “division of labor” mode of MSCs. Moreover, a ligand model was robustly developed for predicting the secretory efficacy of MSCs. A user-friendly web server and a predictive model were constructed and are available (https://wangxc.shinyapps.io/3D_MSCs/).

    Pre-diagnostic leukocyte mitochondrial DNA copy number and colorectal cancer risk

    Mitochondrial DNA (mtDNA) is susceptible to oxidative stress and mutation. Few epidemiological studies have assessed the relationship between mtDNA copy number (mtDNAcn) and risk of colorectal cancer (CRC), with inconsistent findings. In this study, we examined the association between pre-diagnostic leukocyte mtDNAcn and CRC risk in a case–control study of 324 female cases and 658 matched controls nested within the Nurses’ Health Study (NHS). Relative mtDNAcn in peripheral blood leukocytes was measured by quantitative polymerase chain reaction-based assay. Conditional logistic regression models were applied to estimate odds ratios (ORs) and 95% confidence intervals (95% CIs) for the association of interest. Results showed lower log-mtDNAcn was significantly associated with increased risk of CRC, in a dose-dependent relationship (P for trend < 0.0001). Compared to the fourth quartile, the multivariable-adjusted OR (95% CI) was 1.10 (0.69, 1.76) for the third quartile, 1.40 (0.89, 2.19) for the second quartile and 2.19 (1.43, 3.35) for the first quartile. In analysis by anatomic subsite of CRC, we found a significant inverse association for proximal colon cancer [lowest versus highest quartile, multivariable-adjusted OR (95% CI) = 3.31 (1.70, 6.45), P for trend = 0.0003]. Additionally, stratified analysis according to the follow-up time since blood collection showed that the inverse association between mtDNAcn and CRC remained significant among individuals with ≥ 5 years’ follow-up, and marginally significant among those with ≥ 10 years’ follow-up since mtDNAcn testing, suggesting that mtDNAcn may serve as a long-term predictor for risk of CRC. In conclusion, pre-diagnostic leukocyte mtDNAcn was inversely associated with CRC risk. Further basic experimental studies are needed to explore the underlying biological mechanisms linking mtDNAcn to CRC carcinogenesis.

    The 5th International Conference on Biomedical Engineering and Biotechnology (ICBEB 2016)


    Molecular Cloning and Characterization of Two Genes Encoding Dihydroflavonol-4-Reductase from Populus trichocarpa

    Dihydroflavonol 4-reductase (DFR, EC 1.1.1.219) is a rate-limiting enzyme in the biosynthesis of anthocyanins and condensed tannins (proanthocyanidins) that catalyzes the reduction of dihydroflavonols to leucoanthocyanins. In this study, two full-length transcripts encoding PtrDFR1 and PtrDFR2 were isolated from Populus trichocarpa. Sequence alignment of the two PtrDFRs with other known DFRs reveals the homology of these genes. The expression profile of PtrDFRs was investigated in various tissues of P. trichocarpa. To determine their functions, the two PtrDFRs were overexpressed in tobacco (Nicotiana tabacum) via Agrobacterium-mediated transformation. The associated color change in the flowers was observed in all 35S:PtrDFR1 lines, but not in 35S:PtrDFR2 lines. Compared to the wild-type control, a significantly higher accumulation of anthocyanins was detected in transgenic plants harboring PtrDFR1. Furthermore, overexpressing PtrDFR1 in Chinese white poplar (P. tomentosa Carr.) resulted in a higher accumulation of both anthocyanins and condensed tannins, whereas constitutively expressing PtrDFR2 only improved condensed tannin accumulation, indicating the potential regulation of condensed tannins by PtrDFR2 in the biosynthetic pathway in poplars.