197 research outputs found

    Automatic extraction of biomolecular interactions: an empirical approach

    Get PDF
    Background We describe a method for extracting data about how biomolecule pairs interact from texts. This method relies on empirically determined characteristics of sentences. The characteristics are efficient to compute, making this approach to extraction of biomolecular interactions scalable. The results of such interaction mining can support interaction network annotation, question answering, database construction, and other applications. Results We constructed a software system to search MEDLINE for sentences likely to describe interactions between given biomolecules. The system extracts a list of the interaction-indicating terms appearing in those sentences, then ranks those terms based on their likelihood of correctly characterizing how the biomolecules interact. The ranking process uses a tf-idf (term frequency-inverse document frequency) based technique using empirically derived knowledge about sentences, and was applied to the MEDLINE literature collection. Software was developed as part of the MetNet toolkit (http://www.metnetdb.org). Conclusions Specific, efficiently computable characteristics of sentences about biomolecular interactions were analyzed to better understand how to use these characteristics to extract how biomolecules interact. The text empirics method that was investigated, though arising from a classical tradition, has yet to be fully explored for the task of extracting biomolecular interactions from the literature. The conclusions we reach about the sentence characteristics investigated in this work, as well as the technique itself, could be used by other systems to provide evidence about putative interactions, thus supporting efforts to maximize the ability of hybrid systems to support such tasks as annotating and constructing interaction networks

    PathBinder – text empirics and automatic extraction of biomolecular interactions

    Get PDF
    Motivation The increasingly large amount of free, online biological text makes automatic interaction extraction correspondingly attractive. Machine learning is one strategy that works by uncovering and using useful properties that are implicit in the text. However these properties are usually not reported in the literature explicitly. By investigating specific properties of biological text passages in this paper, we aim to facilitate an alternative strategy, the use of text empirics, to support mining of biomedical texts for biomolecular interactions. We report on our application of this approach, and also report some empirical findings about an important class of passages. These may be useful to others who may also wish to use the empirical properties we describe. Results We manually analyzed syntactic and semantic properties of sentences likely to describe interactions between biomolecules. The resulting empirical data were used to design an algorithm for the PathBinder system to extract biomolecular interactions from texts. PathBinder searches PubMed for sentences describing interactions between two given biomolecules. PathBinder then uses probabilistic methods to combine evidence from multiple relevant sentences in PubMed to assess the relative likelihood of interaction between two arbitrary biomolecules. A biomolecular interaction network was constructed based on those likelihoods. Conclusion The text empirics approach used here supports computationally friendly, performance competitive, automatic extraction of biomolecular interactions from texts

    Performing Arm-Based Network Meta-Analysis in R with the pcnetmeta Package

    Get PDF
    Network meta-analysis is a powerful approach for synthesizing direct and indirect evidence about multiple treatment comparisons from a collection of independent studies. At present, the most widely used method in network meta-analysis is contrast-based, in which a baseline treatment needs to be specified in each study, and the analysis focuses on modeling relative treatment effects (typically log odds ratios). However, populationaveraged treatment-specific parameters, such as absolute risks, cannot be estimated by this method without an external data source or a separate model for a reference treatment. Recently, an arm-based network meta-analysis method has been proposed, and the R package pcnetmeta provides user-friendly functions for its implementation. This package estimates both absolute and relative effects, and can handle binary, continuous, and count outcomes

    Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs

    Full text link
    We present an Open-Vocabulary 3D Scene Graph (OVSG), a formal framework for grounding a variety of entities, such as object instances, agents, and regions, with free-form text-based queries. Unlike conventional semantic-based object localization approaches, our system facilitates context-aware entity localization, allowing for queries such as ``pick up a cup on a kitchen table" or ``navigate to a sofa on which someone is sitting". In contrast to existing research on 3D scene graphs, OVSG supports free-form text input and open-vocabulary querying. Through a series of comparative experiments using the ScanNet dataset and a self-collected dataset, we demonstrate that our proposed approach significantly surpasses the performance of previous semantic-based localization techniques. Moreover, we highlight the practical application of OVSG in real-world robot navigation and manipulation experiments.Comment: The code and dataset used for evaluation can be found at https://github.com/changhaonan/OVSG}{https://github.com/changhaonan/OVSG. This paper has been accepted by CoRL202

    UHRF1 is required for basal stem cell proliferation in response to airway injury

    Get PDF
    Cellular senescence is a cell fate characterized by an irreversible cell cycle arrest, but the molecular mechanism underlying this senescence hallmark remains poorly understood. Through an unbiased search for novel senescence regulators in airway basal cells, we discovered that the epigenetic regulator ubiquitin-like with PHD and ring finger domain-containing protein 1 (UHRF1) is critical for regulating cell cycle progression. Upon injury, basal cells in the mouse airway rapidly induce the expression of UHRF1 in order to stimulate stem cell proliferation and tissue repair. Targeted depletion of Uhrf1 specifically in airway basal cells causes a profound defect in cell cycle progression. Consistently, cultured primary human basal cells lacking UHRF1 do not exhibit cell death or differentiation phenotypes but undergo a spontaneous program of senescence. Mechanistically, UHRF1 loss induces G1 cell cycle arrest by abrogating DNA replication factory formation as evidenced by loss of proliferating cell nuclear antigen (PCNA) puncta and an inability to enter the first cell cycle. This proliferation defect is partially mediated by the p15 pathway. Overall, our study provides the first evidence of an indispensable role of UHRF1 in somatic stem cells proliferation during the process of airway regeneration
    corecore