17,544 research outputs found

    Searching Spontaneous Conversational Speech

    The ACM SIGIR Workshop on Searching Spontaneous Conversational Speech was held as part of the 2007 ACM SIGIR Conference in Amsterdam. The workshop program was a mix of elements, including a keynote speech, paper presentations and panel discussions. This brief report describes the organization of this workshop and summarizes the discussions.

    Examining the contributions of automatic speech transcriptions and metadata sources for searching spontaneous conversational speech

    Searching spontaneous speech can be enhanced by combining automatic speech transcriptions with semantically related metadata. An important question is what can be expected from search of such transcriptions and different sources of related metadata in terms of retrieval effectiveness. The Cross-Language Speech Retrieval (CL-SR) track at recent CLEF workshops provides a spontaneous speech test collection with manual and automatically derived metadata fields. Using this collection we investigate the comparative search effectiveness of individual fields comprising automated transcriptions and the available metadata. A further important question is how transcriptions and metadata should be combined for the greatest benefit to search accuracy. We compare simple merging of individual fields with the extended BM25 model for weighted field combination (BM25F). Results indicate that BM25F can produce improved search accuracy, but that it is currently important to tune its parameters on a suitable training set.
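    The difference between simple field merging and weighted field combination can be illustrated with a minimal sketch of BM25F-style scoring. This is not the authors' implementation: the field names (`asr`, `keywords`), the toy documents, the weights, the shared `b` and `k1` values, and the non-negative IDF variant are all assumptions chosen for illustration. Each field's term frequency is length-normalized and weighted, the weighted frequencies are summed into a single pseudo-frequency, and saturation is applied once to that sum.

```python
import math
from collections import Counter


def bm25f_scores(docs, query, weights, b=0.75, k1=1.2):
    """Score each document for a query with a simple BM25F variant.

    docs    : list of dicts mapping field name -> list of tokens
    query   : list of query tokens
    weights : dict mapping field name -> field weight (hypothetical values)
    b, k1   : length-normalization and saturation parameters (shared by all
              fields here for brevity; BM25F allows a separate b per field)
    """
    fields = list(weights)
    n_docs = len(docs)

    # Average length of each field across the collection.
    avg_len = {f: sum(len(d.get(f, [])) for d in docs) / n_docs for f in fields}

    # Document frequency: documents containing the term in any field.
    df = Counter()
    for d in docs:
        vocab = set()
        for f in fields:
            vocab.update(d.get(f, []))
        df.update(vocab)

    scores = []
    for d in docs:
        score = 0.0
        for term in set(query):
            if df[term] == 0:
                continue
            # Combine per-field frequencies BEFORE saturation.
            pseudo_tf = 0.0
            for f in fields:
                tokens = d.get(f, [])
                tf = tokens.count(term)
                if tf == 0 or avg_len[f] == 0:
                    continue
                norm = 1.0 - b + b * len(tokens) / avg_len[f]
                pseudo_tf += weights[f] * tf / norm
            # Non-negative IDF variant (an assumption, not from the source).
            idf = math.log(1.0 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            score += idf * pseudo_tf / (k1 + pseudo_tf)
        scores.append(score)
    return scores


# Toy collection: an ASR transcript field and a metadata keyword field.
docs = [
    {"asr": ["camp", "train", "camp"], "keywords": ["deportation"]},
    {"asr": ["family", "school"], "keywords": ["camp"]},
    {"asr": ["music", "school"], "keywords": ["education"]},
]

weighted = bm25f_scores(docs, ["camp"], {"asr": 1.0, "keywords": 3.0})
uniform = bm25f_scores(docs, ["camp"], {"asr": 1.0, "keywords": 1.0})
```

    With uniform weights the sketch behaves much like merging the fields, and the document with two transcript matches ranks first; raising the keyword weight moves the document with the metadata match ahead of it. In practice the weights and `b` values would be tuned on training topics, which is the sensitivity the abstract points to.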

    Proceedings of the ACM SIGIR Workshop "Searching Spontaneous Conversational Speech"

    Access to recorded interviews: A research agenda

    Recorded interviews form a rich basis for scholarly inquiry. Examples include oral histories, community memory projects, and interviews conducted for broadcast media. Emerging technologies offer the potential to radically transform the way in which recorded interviews are made accessible, but this vision will demand substantial investments from a broad range of research communities. This article reviews the present state of practice for making recorded interviews available and the state of the art for key component technologies. A large number of important research issues are identified, and from that set of issues, a coherent research agenda is proposed.

    Subword-based Indexing for a Minimal False Positive Rate

    Overview of the CLEF-2005 cross-language speech retrieval track

    The task for the CLEF-2005 cross-language speech retrieval track was to identify topically coherent segments of English interviews in a known-boundary condition. Seven teams participated, performing both monolingual and cross-language searches of ASR transcripts, automatically generated metadata, and manually generated metadata. Results indicate that monolingual search technology is sufficiently accurate to be useful for some purposes (the best mean average precision was 0.18) and cross-language searching yielded results typical of those seen in other applications (with the best systems approximating monolingual mean average precision).
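    For readers unfamiliar with the measure quoted above, mean average precision can be computed as in the following minimal sketch. The function names, segment identifiers, and relevance judgments are made up for illustration and do not come from the CLEF-2005 data; the sketch assumes binary relevance and a ranked list without duplicates.

```python
def average_precision(ranked_ids, relevant_ids):
    """Average precision for one topic, assuming binary relevance."""
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, seg_id in enumerate(ranked_ids, start=1):
        if seg_id in relevant:
            hits += 1
            # Precision at the rank of each relevant segment retrieved.
            precision_sum += hits / rank
    # Relevant segments never retrieved contribute zero.
    return precision_sum / len(relevant)


def mean_average_precision(runs):
    """Mean of per-topic average precision.

    runs: list of (ranked_ids, relevant_ids) pairs, one per topic.
    """
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


# Hypothetical ranked segment lists and judgments for two topics.
topic_1 = (["s1", "s2", "s3", "s4"], {"s1", "s3"})
topic_2 = (["s5", "s6"], {"s9"})

ap_1 = average_precision(*topic_1)
ap_2 = average_precision(*topic_2)
map_score = mean_average_precision([topic_1, topic_2])
```

    Because each topic's score is divided by the total number of relevant segments, a system is penalized both for ranking relevant segments low and for missing them entirely.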

    Spoken content retrieval: A survey of techniques and technologies

    Speech media, that is, digital audio and video containing spoken content, has blossomed in recent years. Large collections are accruing on the Internet as well as in private and enterprise settings. This growth has motivated extensive research on techniques and technologies that facilitate reliable indexing and retrieval. Spoken content retrieval (SCR) requires the combination of audio and speech processing technologies with methods from information retrieval (IR). SCR research initially investigated planned speech structured in document-like units, but has subsequently shifted focus to more informal spoken content produced spontaneously, outside of the studio and in conversational settings. This survey provides an overview of the field of SCR encompassing component technologies, the relationship of SCR to text IR and automatic speech recognition, and user interaction issues. It is aimed at researchers with backgrounds in speech technology or IR who are seeking deeper insight on how these fields are integrated to support research and development, thus addressing the core challenges of SCR.

    Conversation therapy for agrammatism: exploring the therapeutic process of engagement and learning by a person with aphasia.

    A recent systematic review of conversation training for communication partners of people with aphasia has shown that it is effective and improves participation in conversation for people with chronic aphasia. Other research suggests that people with aphasia are better able to learn communication strategies in an environment which closely mirrors that of expected use, and that cognitive flexibility may be a better predictor of response to therapy than severity of language impairment. This study reports results for a single case, one of a case series evaluation of a programme of conversation training for agrammatism that directly involves a person with aphasia (PWA) as well as their communication partner. It explores how a PWA is able to engage with and learn from the therapy, and whether this leads to qualitative change in post-therapy conversation behaviours.

    Word searches: on the use of verbal and non-verbal resources during classroom talk

    Word finding difficulties in children are typically characterised by search behaviours such as silence, circumlocution, repetition and empty words. Yet, how children’s word searches are constructed (including gesture, gaze and prosody) and the actions accomplished during interaction have not yet been researched. In this study, eight-year-old Ciara is interacting with her teacher in the classroom. Thirty-seven segments containing word searches were analysed according to the procedures used by conversation analysts. Ciara’s interactional resources include co-ordinated deployment of syntax, pitch height and downward gaze during solitary searching that assist the enterprise of self-repair. Gaze shift towards the teacher signals a transition relevance place, thus constituting a direct invitation for her to participate in the search. Ciara’s interactional resources include semantic category labelling, phonological self-cuing and pronominal substitution that supply valuable linguistic information to the teacher and trigger production of the searched-for item. Recommendations for language teaching and therapy are presented.