39,813 research outputs found

    IMAGINE Final Report

    No full text

    Telematics programme (1991-1994). EUR 15402 EN

    Get PDF

    Multimedia search without visual analysis: the value of linguistic and contextual information

    Get PDF
    This paper addresses the focus of this special issue by analyzing the potential contribution of linguistic content and other non-image aspects to the processing of audiovisual data. It summarizes the various ways in which linguistic content analysis contributes to enhancing the semantic annotation of multimedia content, and, as a consequence, to improving the effectiveness of conceptual media access tools. A number of techniques are presented, including the time-alignment of textual resources, audio and speech processing, content reduction and reasoning tools, and the exploitation of surface features

    A road map for interoperable language resource metadata

    Get PDF
    LRs remain expensive to create and thus rare relative to demand across languages and technology types. The accidental re-creation of an LR that already exists is a nearly unforgiveable waste of scarce resources that is unfortunately not so easy to avoid. The number of catalogs the HLT researcher must search, with their different formats, make it possible to overlook an existing resource. This paper sketches the sources of this problem and outlines a proposal to rectify along with a new vision of LR cataloging that will to facilitates the documentation and exploitation of a much wider range of LRs than previously considered

    Knowledge will Propel Machine Understanding of Content: Extrapolating from Current Examples

    Full text link
    Machine Learning has been a big success story during the AI resurgence. One particular stand out success relates to learning from a massive amount of data. In spite of early assertions of the unreasonable effectiveness of data, there is increasing recognition for utilizing knowledge whenever it is available or can be created purposefully. In this paper, we discuss the indispensable role of knowledge for deeper understanding of content where (i) large amounts of training data are unavailable, (ii) the objects to be recognized are complex, (e.g., implicit entities and highly subjective content), and (iii) applications need to use complementary or related data in multiple modalities/media. What brings us to the cusp of rapid progress is our ability to (a) create relevant and reliable knowledge and (b) carefully exploit knowledge to enhance ML/NLP techniques. Using diverse examples, we seek to foretell unprecedented progress in our ability for deeper understanding and exploitation of multimodal data and continued incorporation of knowledge in learning techniques.Comment: Pre-print of the paper accepted at 2017 IEEE/WIC/ACM International Conference on Web Intelligence (WI). arXiv admin note: substantial text overlap with arXiv:1610.0770

    Automated system for the creation and replenishment of users' electronic lexicographical resources

    Get PDF
    This article proposes a solution to improve the efficiency of automated generation of electronic lexicographical resources based on strongly-structured electronic information arrays processing. The developed automated information system for lexicographical resources creation and replenishment have been described is this article. Several supporting subsystems of developed automated system have been characterized. The effectiveness of the information system has been evaluated

    Vicarious learning through capturing task‐directed discussions

    Get PDF
    The vicarious learner group has been developing a multimedia database system to promote and enhance the role of dialogue in learning. A specific interest, and the origin of the projects' collective name, is in the question of whether and how dialogue can be helpfully ‘reused’. What benefits can students gain from dialogue as observers, not just as participants? We describe our initial attempts to generate and capture educationally effective discourse exchanges amongst and between students and tutors. Problems encountered with available CMC discourse formats led to our development of a set of Task Directed Discussions (TDDs). A medium‐sized corpus of discourse exchanges was collected using the TDDs. A selection of nearly two hundred of these TDD exchanges formed the multimedia discourse database to the implemented prototype system, Dissemination. Initial results from a controlled experiment and evaluation of Dissemination are outline
    corecore