Search CORE

149,070 research outputs found

Information Structure in Discourse

Author: Traat Maarika
Publication venue: University of Edinburgh. College of Science and Engineering. School of Informatics.
Publication date: 01/06/2006
Field of study

Institute for Communicating and Collaborative SystemsThe present dissertation proposes integrating Discourse Representation Theory (DRT), information structure (IS) and Combinatory Categorial Grammar (CCG) into a single framework. It achieves this by making two new contributions to computational treatment of information structure. First, it presents an uncomplicated approach to incorporating information structure in DRT. Second, it shows how the new DRT representation can be integrated into a unification-based grammar framework in a straightforward manner. We foresee the main application of the new formalism to be in spoken language systems: the approach presented here has the potential to considerably facilitate spoken language systems benefiting from insights derived from information structure. The DRT representation with information structure which is proposed in this dissertation is simpler than the previous attempts to include information structure in DRT. We believe that the simplicity of the Information-Structure-marked Discourse Representation Structure (IS-DRS) is precisely what makes it attractive and easy to use for practical tasks like determining the intonation in spoken language applications. The IS component in ISDRS covers a range of aspects of information structural semantics. A further advantage of IS-DRS is that in its case a single semantic representation is suitable for both the generation of context-appropriate prosody and automatic reasoning. A semantic representation on its own is useful for describing and analysing a language. However, it is of even greater utility if it is accompanied by a mechanism that allows one to directly infer the semantic representation from a natural language expression. We incorporated the IS-DRS into the Categorial Grammar (CG) framework, developing a unification based realisation of Combinatory Categorial Grammar, which we call Unification-based Combinatory Categorial Grammar (UCCG). UCCG inherits elements from Combinatory Categorial Grammar and Unification Categorial Grammar. The UCCG framework is developed gradually throughout the dissertation. The information structural component is included as the final step. The IS-DRSs for linguistic expressions are built up compositionally from the IS-DRSs of their sub-expressions. Feature unification is the driving force in this process. The formalism is illustrated by numerous examples which are characterised by different levels of syntactic complexity and diverse information structure. We believe that the main assets of both the IS-DRSs as well as the Unification-based Combinatory Categorial Grammar framework are their simplicity, transparency, and inherent suitability for computational implementation. This makes them an appealing choice for use in practical applications like spoken language systems

Edinburgh Research Archive

Individual and Domain Adaptation in Sentence Planning for Dialogue

Author: Mairesse F.
Prasad R.
Stent A.
Walker M. A.
Publication venue: 'AI Access Foundation'
Publication date: 31/10/2011
Field of study

One of the biggest challenges in the development and deployment of spoken dialogue systems is the design of the spoken language generation module. This challenge arises from the need for the generator to adapt to many features of the dialogue domain, user population, and dialogue context. A promising approach is trainable generation, which uses general-purpose linguistic knowledge that is automatically adapted to the features of interest, such as the application domain, individual user, or user group. In this paper we present and evaluate a trainable sentence planner for providing restaurant information in the MATCH dialogue system. We show that trainable sentence planning can produce complex information presentations whose quality is comparable to the output of a template-based generator tuned to this domain. We also show that our method easily supports adapting the sentence planner to individuals, and that the individualized sentence planners generally perform better than models trained and tested on a population of individuals. Previous work has documented and utilized individual preferences for content selection, but to our knowledge, these results provide the first demonstration of individual preferences for sentence planning operations, affecting the content order, discourse structure and sentence structure of system responses. Finally, we evaluate the contribution of different feature sets, and show that, in our application, n-gram features often do as well as features based on higher-level linguistic representations

arXiv.org e-Print Archive

Crossref

Access to recorded interviews: A research agenda

Author: Heeren W.F.L.
Jong F.M.G. de
Oard D.W.
Ordelman R.J.F.
Publication venue: ACM
Publication date: 01/01/2008
Field of study

Recorded interviews form a rich basis for scholarly inquiry. Examples include oral histories, community memory projects, and interviews conducted for broadcast media. Emerging technologies offer the potential to radically transform the way in which recorded interviews are made accessible, but this vision will demand substantial investments from a broad range of research communities. This article reviews the present state of practice for making recorded interviews available and the state-of-the-art for key component technologies. A large number of important research issues are identified, and from that set of issues, a coherent research agenda is proposed

University of Twente Research Information

Description of the Chinese-to-Spanish rule-based machine translation system developed with a hybrid combination of human annotation and statistical techniques

Author: Centelles Jordi
Ruiz Costa-Jussà Marta
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2015
Field of study

Two of the most popular Machine Translation (MT) paradigms are rule based (RBMT) and corpus based, which include the statistical systems (SMT). When scarce parallel corpus is available, RBMT becomes particularly attractive. This is the case of the Chinese--Spanish language pair. This article presents the first RBMT system for Chinese to Spanish. We describe a hybrid method for constructing this system taking advantage of available resources such as parallel corpora that are used to extract dictionaries and lexical and structural transfer rules. The final system is freely available online and open source. Although performance lags behind standard SMT systems for an in-domain test set, the results show that the RBMT’s coverage is competitive and it outperforms the SMT system in an out-of-domain test set. This RBMT system is available to the general public, it can be further enhanced, and it opens up the possibility of creating future hybrid MT systems.Peer ReviewedPostprint (author's final draft

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Towards a Unified Knowledge-Based Approach to Modality Choice

Author: Bachvarova Yulia
Dijk Betsy van
Nijholt Anton
Publication venue: University of Twente, Centre for Telematics and Information Technology
Publication date: 01/01/2007
Field of study

This paper advances a unified knowledge-based approach to the process of choosing the most appropriate modality or combination of modalities in multimodal output generation. We propose a Modality Ontology (MO) that models the knowledge needed to support the two most fundamental processes determining modality choice – modality allocation (choosing the modality or set of modalities that can best support a particular type of information) and modality combination (selecting an optimal final combination of modalities). In the proposed ontology we model the main levels which collectively determine the characteristics of each modality and the specific relationships between different modalities that are important for multi-modal meaning making. This ontology aims to support the automatic selection of modalities and combinations of modalities that are suitable to convey the meaning of the intended message

University of Twente Research Information

Parsing of Spoken Language under Time Constraints

Author: Menzel Wolfgang
Publication venue
Publication date: 01/01/1994
Field of study

Spoken language applications in natural dialogue settings place serious requirements on the choice of processing architecture. Especially under adverse phonetic and acoustic conditions parsing procedures have to be developed which do not only analyse the incoming speech in a time-synchroneous and incremental manner, but which are able to schedule their resources according to the varying conditions of the recognition process. Depending on the actual degree of local ambiguity the parser has to select among the available constraints in order to narrow down the search space with as little effort as possible. A parsing approach based on constraint satisfaction techniques is discussed. It provides important characteristics of the desired real-time behaviour and attempts to mimic some of the attention focussing capabilities of the human speech comprehension mechanism.Comment: 19 pages, LaTe

arXiv.org e-Print Archive

CiteSeerX

Universaar

Acronym

Modelling Users, Intentions, and Structure in Spoken Dialog

Author: Goerz Guenther
Ludwig Bernd
Niemann Heinrich
Publication venue
Publication date: 01/01/1998
Field of study

We outline how utterances in dialogs can be interpreted using a partial first order logic. We exploit the capability of this logic to talk about the truth status of formulae to define a notion of coherence between utterances and explain how this coherence relation can serve for the construction of AND/OR trees that represent the segmentation of the dialog. In a BDI model we formalize basic assumptions about dialog and cooperative behaviour of participants. These assumptions provide a basis for inferring speech acts from coherence relations between utterances and attitudes of dialog participants. Speech acts prove to be useful for determining dialog segments defined on the notion of completing expectations of dialog participants. Finally, we sketch how explicit segmentation signalled by cue phrases and performatives is covered by our dialog model.Comment: 17 page

arXiv.org e-Print Archive

CiteSeerX

University of Regensburg Publication Server

Estudios acerca del establecimiento de conexiones entre enunciados hablados: ¿qué pueden contribuir a la promoción de la construcción de una representación coherente del discurso por parte de los estudiantes?

Author: Broek Paul van den
Yomha Cevasco Jazmin
Publication venue: 'Elsevier BV'
Publication date: 01/12/2013
Field of study

The aim of this article is to provide an overview of how the establishment of discourse connections among spoken statements has been studied by approaches to discourse analysis and psycholinguistic studies, in order to highlight what variables appear to be important for understanding how comprehension of spoken discourse can be facilitated. The consideration of discourse analysis approaches allows us to think about the role of the establishment of discourse connections among speech acts in the classroom, the uses of contextualization cues by bilingual students, the identification of social and cultural notions in teachers’ discourse, and the interactional effects of teachers’ interventions. Preliminary psycholinguistic studies contribute to our understanding of the role of establishing causal connections and integrating adjacent statements through the presence of discourse markers in the comprehension of spoken discourse by college students. The results of these approaches and studies provide insight into students’ comprehension of classroom discourse, and hold the potential for implications for instruction.El propósito de este artículo es realizar un recorrido a través de enfoques de análisis del discurso y estudios de psicolingüística que han investigado el establecimiento de conexiones entre enunciados hablados, a fin de destacar las variables que parecen ser centrales para facilitar la comprensión. La consideración de los enfoques del análisis del discurso nos permitirán pensar acerca del rol del establecimiento de conexiones entre actos del lenguaje en el aula, las funciones de las claves de contextualización, la identificación de las nociones sociales y culturales en el discurso de los profesores, los efectos de las intervenciones de los profesores en la interacción con los estudiantes. Los estudios preliminares de psicolingüística contribuirán a nuestra comprensión del rol del establecimiento de conexiones causales e integración de enunciados adyacentes a través de marcadores del discurso por parte de estudiantes universitarios. La consideración de estos enfoques y estudios nos ayudarán a pensar acerca de las contribuciones que sus propuestas y métodos pueden hacer al enriquecimiento de nuestro entendimiento de cómo los estudiantes comprenden el discurso producido durante las clases.Fil: Yomha Cevasco, Jazmin. Universidad de Buenos Aires; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas; ArgentinaFil: Broek, Paul van den. Leiden University; Países Bajo

Elsevier - Publisher Connector

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

CONICET Digital