58 research outputs found

    Extracting information from short messages

    Get PDF
    Much currently transmitted information takes the form of e-mails or SMS text messages and so extracting information from such short messages is increasingly important. The words in a message can be partitioned into the syntactic structure, terms from the domain of discourse and the data being transmitted. This paper describes a light-weight Information Extraction component which uses pattern matching to separate the three aspects: the structure is supplied as a template; domain terms are the metadata of a data source (or their synonyms), and data is extracted as those words matching placeholders in the templates

    LSA Based Approach to Domain Detection

    No full text

    Fuzzy Pattern Rule Induction for Information Extraction

    No full text

    Machine Learning in Human Language Technology

    No full text

    Evaluating an Information Extraction System

    No full text
    Many natural language researchers are now turning their attention to a relatively new task orientation known as information extraction. Information extraction systems are predicated on an I/O orientation that makes it possible to conduct formal evaluations and meaningful cross-system comparisons. This paper presents the challenge of information extraction and shows how information extraction systems are currently being evaluated. We describe a specific system developed at the University of Massachusetts, identify key research issues of general interest, and conclude with some observations about the role of performance evaluations as a stimulus for basic research

    Using induced rules as complex features in memory-based language learning

    No full text
    corecore