23,442 research outputs found

    A quick search method for audio signals based on a piecewise linear representation of feature trajectories

    Full text link
    This paper presents a new method for a quick similarity-based search through long unlabeled audio streams to detect and locate audio clips provided by users. The method involves feature-dimension reduction based on a piecewise linear representation of a sequential feature trajectory extracted from a long audio stream. Two techniques enable us to obtain a piecewise linear representation: the dynamic segmentation of feature trajectories and the segment-based Karhunen-L\'{o}eve (KL) transform. The proposed search method guarantees the same search results as the search method without the proposed feature-dimension reduction method in principle. Experiment results indicate significant improvements in search speed. For example the proposed method reduced the total search time to approximately 1/12 that of previous methods and detected queries in approximately 0.3 seconds from a 200-hour audio database.Comment: 20 pages, to appear in IEEE Transactions on Audio, Speech and Language Processin

    Video browsing interfaces and applications: a review

    Get PDF
    We present a comprehensive review of the state of the art in video browsing and retrieval systems, with special emphasis on interfaces and applications. There has been a significant increase in activity (e.g., storage, retrieval, and sharing) employing video data in the past decade, both for personal and professional use. The ever-growing amount of video content available for human consumption and the inherent characteristics of video data—which, if presented in its raw format, is rather unwieldy and costly—have become driving forces for the development of more effective solutions to present video contents and allow rich user interaction. As a result, there are many contemporary research efforts toward developing better video browsing solutions, which we summarize. We review more than 40 different video browsing and retrieval interfaces and classify them into three groups: applications that use video-player-like interaction, video retrieval applications, and browsing solutions based on video surrogates. For each category, we present a summary of existing work, highlight the technical aspects of each solution, and compare them against each other

    TRECVID 2004 - an overview

    Get PDF

    CHORUS Deliverable 2.1: State of the Art on Multimedia Search Engines

    Get PDF
    Based on the information provided by European projects and national initiatives related to multimedia search as well as domains experts that participated in the CHORUS Think-thanks and workshops, this document reports on the state of the art related to multimedia content search from, a technical, and socio-economic perspective. The technical perspective includes an up to date view on content based indexing and retrieval technologies, multimedia search in the context of mobile devices and peer-to-peer networks, and an overview of current evaluation and benchmark inititiatives to measure the performance of multimedia search engines. From a socio-economic perspective we inventorize the impact and legal consequences of these technical advances and point out future directions of research

    TRECVID 2007 - Overview

    Get PDF

    Learnable PINs: Cross-Modal Embeddings for Person Identity

    Full text link
    We propose and investigate an identity sensitive joint embedding of face and voice. Such an embedding enables cross-modal retrieval from voice to face and from face to voice. We make the following four contributions: first, we show that the embedding can be learnt from videos of talking faces, without requiring any identity labels, using a form of cross-modal self-supervision; second, we develop a curriculum learning schedule for hard negative mining targeted to this task, that is essential for learning to proceed successfully; third, we demonstrate and evaluate cross-modal retrieval for identities unseen and unheard during training over a number of scenarios and establish a benchmark for this novel task; finally, we show an application of using the joint embedding for automatically retrieving and labelling characters in TV dramas.Comment: To appear in ECCV 201

    Ariadne: An interface to support collaborative database browsing:Technical Report CSEG/3/1995

    Get PDF
    This paper outlines issues in the learning of information searching skills. We report on our observations of the learning of browsing skills and the subsequent iterative development and testing of the Ariadne system – intended to investigate and support the collaborative learning of search skills. A key part of this support is a mechanism for recording an interaction history and providing students with a visualisation of that history that they can reflect and comment upon

    Information System for NGO Libraries in Pakistan: A Proposed Model for Organizing the Grey Literature by Syed Attaullah Shah and Humera Ilhaq

    Get PDF
    Abstract In recent years, especially in developed countries, various systems have been created to advance the management and organization of grey literature. Such systems use the latest communication technology and electronic and digital resources, and have developed huge networking systems to distribute and mange grey literature. Because of the scarcity of a global standardized organization system for grey literature and often limited access to computer technology, however, awareness of existence and access to grey literature is still seriously lacking, particularly in developing countries. Based on a survey of selected Pakistani NGOs from various sectors, this study proposes a new model. This paper explains the current usage patterns of grey literature in Pakistani organizations, then assesses their needs and resources for grey literature and finally recommends anew standardized model for organizing grey literature in the developing world. In this model a separate subject and classification scheme to control various types of grey literature, a shelving arrangement system and a networking system have been introduce
    corecore