75 research outputs found

    Domain-independent Extraction of Scientific Concepts from Research Articles

    Get PDF
    We examine the novel task of domain-independent scientific concept extraction from abstracts of scholarly articles and present two contributions. First, we suggest a set of generic scientific concepts that have been identified in a systematic annotation process. This set of concepts is utilised to annotate a corpus of scientific abstracts from 10 domains of Science, Technology and Medicine at the phrasal level in a joint effort with domain experts. The resulting dataset is used in a set of benchmark experiments to (a) provide baseline performance for this task, (b) examine the transferability of concepts between domains. Second, we present two deep learning systems as baselines. In particular, we propose active learning to deal with different domains in our task. The experimental results show that (1) a substantial agreement is achievable by non-experts after consultation with domain experts, (2) the baseline system achieves a fairly high F1 score, (3) active learning enables us to nearly halve the amount of required training data.Comment: Accepted for publishing in 42nd European Conference on IR Research, ECIR 202

    Requirements Analysis for an Open Research Knowledge Graph

    Get PDF
    Current science communication has a number of drawbacks and bottlenecks which have been subject of discussion lately: Among others, the rising number of published articles makes it nearly impossible to get an overview of the state of the art in a certain field, or reproducibility is hampered by fixed-length, document-based publications which normally cannot cover all details of a research work. Recently, several initiatives have proposed knowledge graphs (KGs) for organising scientific information as a solution to many of the current issues. The focus of these proposals is, however, usually restricted to very specific use cases. In this paper, we aim to transcend this limited perspective by presenting a comprehensive analysis of requirements for an Open Research Knowledge Graph (ORKG) by (a) collecting daily core tasks of a scientist, (b) establishing their consequential requirements for a KG-based system, (c) identifying overlaps and specificities, and their coverage in current solutions. As a result, we map necessary and desirable requirements for successful KG-based science communication, derive implications and outline possible solutions.Comment: Accepted for publishing in 24th International Conference on Theory and Practice of Digital Libraries, TPDL 202

    Cdx ParaHox genes acquired distinct developmental roles after gene duplication in vertebrate evolution

    Get PDF
    BACKGROUND: The functional consequences of whole genome duplications in vertebrate evolution are not fully understood. It remains unclear, for instance, why paralogues were retained in some gene families but extensively lost in others. Cdx homeobox genes encode conserved transcription factors controlling posterior development across diverse bilaterians. These genes are part of the ParaHox gene cluster. Multiple Cdx copies were retained after genome duplication, raising questions about how functional divergence, overlap, and redundancy respectively contributed to their retention and evolutionary fate. RESULTS: We examined the degree of regulatory and functional overlap between the three vertebrate Cdx genes using single and triple morpholino knock-down in Xenopus tropicalis followed by RNA-seq. We found that one paralogue, Cdx4, has a much stronger effect on gene expression than the others, including a strong regulatory effect on FGF and Wnt genes. Functional annotation revealed distinct and overlapping roles and subtly different temporal windows of action for each gene. The data also reveal a colinear-like effect of Cdx genes on Hox genes, with repression of Hox paralogy groups 1 and 2, and activation increasing from Hox group 5 to 11. We also highlight cases in which duplicated genes regulate distinct paralogous targets revealing pathway elaboration after whole genome duplication. CONCLUSIONS: Despite shared core pathways, Cdx paralogues have acquired distinct regulatory roles during development. This implies that the degree of functional overlap between paralogues is relatively low and that gene expression pattern alone should be used with caution when investigating the functional evolution of duplicated genes. We therefore suggest that developmental programmes were extensively rewired after whole genome duplication in the early evolution of vertebrates

    Gene Expression Profiling in Cells with Enhanced γ-Secretase Activity

    Get PDF
    BACKGROUND: Processing by gamma-secretase of many type-I membrane protein substrates triggers signaling cascades by releasing intracellular domains (ICDs) that, following nuclear translocation, modulate the transcription of different genes regulating a diverse array of cellular and biological processes. Because the list of gamma-secretase substrates is growing quickly and this enzyme is a cancer and Alzheimer's disease therapeutic target, the mapping of gamma-secretase activity susceptible gene transcription is important for sharpening our view of specific affected genes, molecular functions and biological pathways. METHODOLOGY/PRINCIPAL FINDINGS: To identify genes and molecular functions transcriptionally affected by gamma-secretase activity, the cellular transcriptomes of Chinese hamster ovary (CHO) cells with enhanced and inhibited gamma-secretase activity were analyzed and compared by cDNA microarray. The functional clustering by FatiGO of the 1,981 identified genes revealed over- and under-represented groups with multiple activities and functions. Single genes with the most pronounced transcriptional susceptibility to gamma-secretase activity were evaluated by real-time PCR. Among the 21 validated genes, the strikingly decreased transcription of PTPRG and AMN1 and increased transcription of UPP1 potentially support data on cell cycle disturbances relevant to cancer, stem cell and neurodegenerative diseases' research. The mapping of interactions of proteins encoded by the validated genes exclusively relied on evidence-based data and revealed broad effects on Wnt pathway members, including WNT3A and DVL3. Intriguingly, the transcription of TERA, a gene of unknown function, is affected by gamma-secretase activity and was significantly altered in the analyzed human Alzheimer's disease brain cortices. CONCLUSIONS/SIGNIFICANCE: Investigating the effects of gamma-secretase activity on gene transcription has revealed several affected clusters of molecular functions and, more specifically, 21 genes that hold significant potential for a better understanding of the biology of gamma-secretase and its roles in cancer and Alzheimer's disease pathology

    Tips and tricks for using the internet for professional purposes

    No full text

    TSC1 binding to lysosomal PIPs is required for TSC complex translocation and mTORC1 regulation

    No full text
    The TSC complex is a critical negative regulator of the small GTPase Rheb and mTORC1 in cellular stress signaling. The TSC2 subunit contains a catalytic GTPase activating protein domain and interacts with multiple regulators, while the precise function of TSC1 is unknown. Here we provide a structural characterization of TSC1 and define three domains: a C-terminal coiled-coil that interacts with TSC2, a central helical domain that mediates TSC1 oligomerization, and an N-terminal HEAT repeat domain that interacts with membrane phosphatidylinositol phosphates (PIPs). TSC1 architecture, oligomerization, and membrane binding are conserved in fungi and humans. We show that lysosomal recruitment of the TSC complex and subsequent inactivation of mTORC1 upon starvation depend on the marker lipid PI3,5P2, demonstrating a role for lysosomal PIPs in regulating TSC complex and mTORC1 activity via TSC1. Our study thus identifies a vital role of TSC1 in TSC complex function and mTORC1 signaling
    corecore