67 research outputs found

    Semantic Processing of Out-Of-Vocabulary Words in a Spoken Dialogue System

    Full text link
    One of the most important causes of failure in spoken dialogue systems is usually neglected: the problem of words that are not covered by the system's vocabulary (out-of-vocabulary or OOV words). In this paper a methodology is described for the detection, classification and processing of OOV words in an automatic train timetable information system. The various extensions that had to be effected on the different modules of the system are reported, resulting in the design of appropriate dialogue strategies, as are encouraging evaluation results on the new versions of the word recogniser and the linguistic processor.Comment: 4 pages, 2 eps figures, requires LaTeX2e, uses eurospeech.sty and epsfi

    Towards Understanding Spontaneous Speech: Word Accuracy vs. Concept Accuracy

    Full text link
    In this paper we describe an approach to automatic evaluation of both the speech recognition and understanding capabilities of a spoken dialogue system for train time table information. We use word accuracy for recognition and concept accuracy for understanding performance judgement. Both measures are calculated by comparing these modules' output with a correct reference answer. We report evaluation results for a spontaneous speech corpus with about 10000 utterances. We observed a nearly linear relationship between word accuracy and concept accuracy.Comment: 4 pages PS, Latex2e source importing 2 eps figures, uses icslp.cls, caption.sty, psfig.sty; to appear in the Proceedings of the Fourth International Conference on Spoken Language Processing (ICSLP 96

    A generic template for the evaluation of dialogue management systems

    Get PDF
    We present a generic template for spoken dialogue systems integrating speech recognition and synthesis with 'higher-level' natural language dialogue modelling components. The generic model is abstracted from a number of real application systems targetted at very different domains. Our research aim in developing this generic template is to investigate a new approach to the evaluation of Dialogue Management Systems. Rather than attempting to measure accuracy/speed of output, we propose principles for the evaluation of the underlying theoretical linguistic model of Dialogue Management in a given system, in terms of how well it fits our generic template for Dialogue Management Systems. This is a measure of 'genericness' or 'application-independence' of a given system, which can be used to moderate accuracy/speed scores in comparisons of very unlike DMSs serving different domains. This relates to (but is orthogonal to) Dialogue Management Systems evaluation in terms of naturalness and like measurable metrics (eg. Dybkjaer et al 1995, Vilnat 1996, EAGLES 1994, Fraser 1995); it follows more closely emerging qualitative evaluation techniques for NL grammatical parsing schemes (Leech et al 1996, Atwell 1996)

    Dialogue-Oriented Review Summary Generation for Spoken Dialogue Recommendation Systems

    Get PDF
    In this paper we present an opinion summarization technique in spoken dialogue systems. Opinion mining has been well studied for years, but very few have considered its application in spoken dialogue systems. Review summarization, when applied to real dialogue systems, is much more complicated than pure text-based summarization. We conduct a systematic study on dialogue-system-oriented review analysis and propose a three-level framework for a recommendation dialogue system. In previous work we have explored a linguistic parsing approach to phrase extraction from reviews. In this paper we will describe an approach using statistical models such as decision trees and SVMs to select the most representative phrases from the extracted phrase set. We will also explain how to generate informative yet concise review summaries for dialogue purposes. Experimental results in the restaurant domain show that the proposed approach using decision tree algorithms achieves an outperformance of 13% compared to SVM models and an improvement of 36% over a heuristic rule baseline. Experiments also show that the decision-tree-based phrase selection model can achieve rather reliable predictions on the phrase label, comparable to human judgment. The proposed statistical approach is based on domain-independent learning features and can be extended to other domains effectively

    Porting the galaxy system to Mandarin Chinese

    Get PDF
    Thesis (M.S.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 1997.Includes bibliographical references (leaves 83-86).by Chao Wang.M.S

    Talking agents: arquitectura de agentes conversacionales

    Get PDF
    Los Talking Agents son entidades software con la capacidad de reconocer el habla humana y sintetizar una respuesta hablada. Se han desarrollado este tipo de agentes para construir instalaciones artísticas, con el propósito de trabajar en nuevos tipos de experiencias en la interacción del espectador con la obra de arte. El diseño e implementación de los talking agents afronta varios problemas por sus requisitos de rendimiento y alto grado de configurabilidad para múltiples escenarios con distintos tipos de recursos en juego. Una forma de abordar estos requisitos ha sido la distribución de los componentes de cada agente y una clara separación entre agentes y recursos. El trabajo describe la arquitectura de los talking agents a distintos niveles, y algunos escenarios de experimentación de los mismos. [ABSTRACT] Talking agents are software entities with the ability to recognize human speech and synthesize a spoken response. We have developed this kind of agents for building Installation-art works, with the purpose of work in new kinds of experiences in the interaction of the spectator with the art work. The design and implementation of talking agents confronts several issues because of their requirements on performance and high degree of configuration for multiple scenarios with dierent kinds of resources in play. One way to cope with these requirements has been the distribution of the components of each agent and a clear separation between agents and resources. This work describes the architecture of talking agents at dierent levels, and several experimentation scenarios with them

    Harvesting and summarizing user-generated content for advanced speech-based human-computer interaction

    Get PDF
    Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2012.Cataloged from PDF version of thesis.Includes bibliographical references (p. 155-164).There have been many assistant applications on mobile devices, which could help people obtain rich Web content such as user-generated data (e.g., reviews, posts, blogs, and tweets). However, online communities and social networks are expanding rapidly and it is impossible for people to browse and digest all the information via simple search interface. To help users obtain information more efficiently, both the interface for data access and the information representation need to be improved. An intuitive and personalized interface, such as a dialogue system, could be an ideal assistant, which engages a user in a continuous dialogue to garner the user's interest and capture the user's intent, and assists the user via speech-navigated interactions. In addition, there is a great need for a type of application that can harvest data from the Web, summarize the information in a concise manner, and present it in an aggregated yet natural way such as direct human dialogue. This thesis, therefore, aims to conduct research on a universal framework for developing speech-based interface that can aggregate user-generated Web content and present the summarized information via speech-based human-computer interaction. To accomplish this goal, several challenges must be met. Firstly, how to interpret users' intention from their spoken input correctly? Secondly, how to interpret the semantics and sentiment of user-generated data and aggregate them into structured yet concise summaries? Lastly, how to develop a dialogue modeling mechanism to handle discourse and present the highlighted information via natural language? This thesis explores plausible approaches to tackle these challenges. We will explore a lexicon modeling approach for semantic tagging to improve spoken language understanding and query interpretation. We will investigate a parse-and-paraphrase paradigm and a sentiment scoring mechanism for information extraction from unstructured user-generated data. We will also explore sentiment-involved dialogue modeling and corpus-based language generation approaches for dialogue and discourse. Multilingual prototype systems in multiple domains have been implemented for demonstration.by Jingjing Liu.Ph.D

    Writing the railway: biosemiotic strategies for enforming meaning and dispersing authorship in site-specific text-based artworks

    Get PDF
    This practice-led PhD is concerned with the subject matter of contemporary art. It proposes methods by which a writer-maker’s authorship can be dispersed throughout reticulated networks of interpretation, and tests the limits of detail articulable in an artwork. To counter the literary and discursive turns that have dominated art theory and practice since the 1970s, the thesis demands a reassessment of the privileging of the viewer and of the adoption of indeterminacy as a generic style. It proposes instead a turn to biosemiotics as a means to situate the artwork materially, bodily, historically. That ambiguity and pluralism can consequently be deployed strategically, affectively and to critical effect is tested and evaluated in the accompanying practice. The thesis gives an account of the theorising and devising of text-based artworks which take the UK railway as site, and considers site-specificity a particular sort of engagement with subject matter. The railway is approached as a complex technical object consisting in multiple entangled intentions and interpretations – social, emotional and political valences, diffracted by a spectrum of practices, knowledges and semiotic ontologies – all of which are available to the writermaker as immanent materials of the artwork. Part One of the thesis presents a transdisciplinary argument that draws on biosemiotics, linguistic anthropology, philosophy of time and socio-psychology as well as art history and critical theory. Part Two performs an analysis of paradigmatic descriptions of the railway, speculates on the social dynamics of a train carriage interior and empirically tests the bureaucratic structures of London Underground. Part Three is an exegesis of three pieces submitted as documentation in the practice portfolio: an audio work, a guided tour and a live performance on a train carriage tabletop
    corecore