75 research outputs found

    Introduction to the Special Issue on Dialogue State Tracking

    Get PDF
    This short article introduces the Special Issue on Dialogue State Tracking

    The Dialog State Tracking Challenge Series: A Review

    Get PDF
    In a spoken dialog system, dialog state tracking refers to the task of correctly inferring the state of the conversation -- such as the user's goal -- given all of the dialog history up to that turn.  Dialog state tracking is crucial to the success of a dialog system, yet until recently there were no common resources, hampering progress.  The Dialog State Tracking Challenge series of 3 tasks introduced the first shared testbed and evaluation metrics for dialog state tracking, and has underpinned three key advances in dialog state tracking: the move from generative to discriminative models; the adoption of discriminative sequential techniques; and the incorporation of the speech recognition results directly into the dialog state tracker.  This paper reviews this research area, covering both the challenge tasks themselves and summarizing the work they have enabled

    Introduction to the special issue on Machine learning for multiple modalities in interactive systems and robots

    Get PDF
    This special issue highlights research articles that apply machine learning to robots and other systems that interact with users through more than one modality, such as speech, gestures, and vision. For example, a robot may coordinate its speech with its actions, taking into account (audio-)visual feedback during their execution. Machine learning provides interactive systems with opportunities to improve performance not only of individual components but also of the system as a whole. However, machine learning methods that encompass multiple modalities of an interactive system are still relatively hard to find. The articles in this special issue represent examples that contribute to filling this gap

    La filiacion y la fecundacion "in vitro"

    Get PDF
    Las tecnicas de reproduccion asistida no solo representan una solucion para ayudar a superar problemas de esterilidad, sino que su practica conlleva problemas eticos y juridicos. Esta Tesis analiza los problemas que plantea la fecundacion "in vitro", desde el punto de vista de la filiacion, para determinar la paternidad y maternidad cuando se utilizan los gametos de la pareja o de un tercero. Desde este punto de vista, se estudian la situacion juridica del tercero -llamado donante- y de las madres subrogadas, asi como las acciones de filiacion Tambien se examina la problematica que plantea la congelacion de semen y embriones, al poder un hombre engendrar un hijo despues de muerto. Entre las fuentes que se analizan estan los principales informes extranjeros que han estudiado la problematica de estas tecnicas, asi como el Informe especial de..

    Automated Lexical Adaptation and Speaker Clustering based on Pronunciation Habits for Non-Native Speech Recognition

    No full text
    This paper describes a method to improve speech recognition for non-native speech in a spoken dialogue system. Based on very general rules about possible vocalic substitutions, the frequency of occurrence of each substitution in different phonetic contexts is estimated on a small set of recordings. The most frequently observed substitutions are applied to the lexicon of the recognizer. Speakers in the training set are automatically clustered according to their preferred phonetic variants, and a specific lexicon is built for each cluster. Acoustic adaptation is also performed on each cluster. Experiments show that lexical adaptation provides a 16.4% relative WER reduction over acoustic adaptation alone. Lexical clustering can further reduce WER if the system can reliably select the cluster best matching each input utterance

    Using task-oriented spoken dialogue systems for language learning: potential, practical applications and challenges

    No full text
    The technology developed for task-based spoken dialogue systems (SDS) has a significant potential for Computer-Assisted Language Learning. Based on the CMU Let’s Go SDS, we describe two areas in which we investigated adaptations of the technology to non-native speakers: speech recognition and correction prompt generation. Although difficulties remain, particularly towards robust understanding, results prove that this technology can be used to provide realistic, involved environment for language learning.
    corecore