5 research outputs found
Building a Bio-Event Annotated Corpus for the Acquisition of Semantic Frames from Biomedical Corpora
This paper reports on the design and construction of a bio-event annotated corpus which was developed with a specific view to the acquisition of semantic frames from biomedical corpora. We describe the adopted annotation scheme and the annotation process, which is supported by a dedicated annotation tool. The annotated corpus contains 677 abstracts of biomedical research article
Bootstrapping a Verb Lexicon for Biomedical Information Extraction
The accurate extraction of information from texts requires both syntactic and semantic resources. We are developing a verb dictionary for use in the processing of biomedical texts that includes both syntactic subcategorisation frames and semantic event frames, and links them together. In this paper, we describe the acquisition of syntactic subcategorisation frames from a large corpus of abstracts on the subject of E. coli, together with the extraction of linguistic event frames from a subset of this corpus, in which the biological process of E. coli gene regulation has been linguistically annotated by a group of biologists. Finally, we report on work carried out to link the syntactic and semantic information together by mapping the syntactic arguments of subcategorisation frames to the semantic arguments of the event frames.
The BioLexicon: a Large-Scale Domain-Specific Lexical Resource for Biomedical Text Mining
The talk will focus on building a biolexicon by leveraging existing bio-resources, combining them within a common, standardized lexical and terminological framework, and employing advanced NL technologies to discover new terms, concepts, relations and linguistic lexical information from text.