5 research outputs found

    Building a Bio-Event Annotated Corpus for the Acquisition of Semantic Frames from Biomedical Corpora

    This paper reports on the design and construction of a bio-event annotated corpus which was developed with a specific view to the acquisition of semantic frames from biomedical corpora. We describe the adopted annotation scheme and the annotation process, which is supported by a dedicated annotation tool. The annotated corpus contains 677 abstracts of biomedical research article

    Bootstrapping a Verb Lexicon for Biomedical Information Extraction

    The accurate extraction of information from texts requires both syntactic and semantic resources. We are developing a verb dictionary for use in the processing of biomedical texts that includes both syntactic subcategorisation frames and semantic event frames, and links them together. In this paper, we describe the acquisition of syntactic subcategorisation frames from a large corpus of abstracts on the subject of E. coli, together with the extraction of linguistic event frames from a subset of this corpus, in which the biological process of E. coli gene regulation has been linguistically annotated by a group of biologists. Finally, we report on work carried out to link the syntactic and semantic information together by mapping the syntactic arguments of subcategorisation frames to the semantic arguments of the event frames.

    The BioLexicon: a Large-Scale Domain-Specific Lexical Resource for Biomedical Text Mining

    The talk will focus on building a biolexicon by leveraging existing bio-resources, combining them within a common, standardized lexical and terminological framework, and employing advanced NL technologies to discover new terms, concepts, relations and linguistic lexical information from text.