1,220 research outputs found

    Generating Tailored, Comparative Descriptions with Contextually Appropriate Intonation

    Get PDF
    Generating responses that take user preferences into account requires adaptation at all levels of the generation process. This article describes a multi-level approach to presenting user-tailored information in spoken dialogues which brings together for the first time multi-attribute decision models, strategic content planning, surface realization that incorporates prosody prediction, and unit selection synthesis that takes the resulting prosodic structure into account. The system selects the most important options to mention and the attributes that are most relevant to choosing between them, based on the user model. Multiple options are selected when each offers a compelling trade-off. To convey these trade-offs, the system employs a novel presentation strategy which straightforwardly lends itself to the determination of information structure, as well as the contents of referring expressions. During surface realization, the prosodic structure is derived from the information structure using Combinatory Categorial Grammar in a way that allows phrase boundaries to be determined in a flexible, data-driven fashion. This approach to choosing pitch accents and edge tones is shown to yield prosodic structures with significantly higher acceptability than baseline prosody prediction models in an expert evaluation. These prosodic structures are then shown to enable perceptibly more natural synthesis using a unit selection voice that aims to produce the target tunes, in comparison to two baseline synthetic voices. An expert evaluation and f0 analysis confirm the superiority of the generator-driven intonation and its contribution to listeners' ratings

    Prosody and sentence disambiguation in European Portuguese

    Get PDF
    Our investigation focuses on several types of structural ambiguity in European Portuguese. The materials include sentences with set-divider adverbs ambiguous as to the direction of syntactic attachment, adjunct and complement PPs ambiguous as to the level of syntactic embedding, nonrestrictive clauses with local and non-local possible antecedents, and relative clauses ambiguous as to their restrictive/non-restrictive meaning. Besides providing a prosodic description of sentences with these various sorts of ambiguity, the relation between prosody and syntactic structure is addressed. It is concluded that structural ambiguity is not always cued by prosody, and it may be resolved by prosodic means that are optional. Additionally, some options on sentence partition in intonational phrases are only available under some interpretations, and in specific configurations I-breaks may not be inserted (namely, between a head and an adjacent complement or modifier). In all cases studied intonational phrase level properties play a crucial role in sentence disambiguation. An intonational phrase boundary after set-divider adverbs indicates leftattachment and between a constituent and the preceding material implies non-local attachment. These facts are seen to follow in a principled way from the conditions on the formation of intonational phrases

    Information structure in linguistic theory and in speech production : validation of a cross-linguistic data set

    Get PDF
    The aim of this paper is to validate a dataset collected by means of production experiments which are part of the Questionnaire on Information Structure. The experiments generate a range of information structure contexts that have been observed in the literature to induce specific constructions. This paper compares the speech production results from a subset of these experiments with specific claims about the reflexes of information structure in four different languages. The results allow us to evaluate and in most cases validate the efficacy of our elicitation paradigms, to identify potentially fruitful avenues of future research, and to highlight issues involved in interpreting speech production data of this kind

    Intonation, word order and focus projection in Serbo-Croatian

    Get PDF
    LoC Class: PG1224.7, LoC Subject Headings: Serbo-Croatian language--Intonation, Serbo-Croatian language--Word orde

    Prosodic phrase break prediction: problems in the evaluation of models against a gold standard

    Get PDF
    The goal of automatic phrase break prediction is to identify prosodic-syntactic boundaries in text which correspond to the way a native speaker might process or chunk that same text as speech. This is treated as a classification task in machine learning and output predictions from language models are evaluated against a ‘gold standard’: human-labelled prosodic phrase break annotations in transcriptions of recorded speech - the speech corpus. Despite the introduction of rigorous metrics such as precision and recall, the evaluation of phrase break models is still problematic because prosody is inherently variable; morphosyntactic analysis and prosodic annotations for a given text are not representative of the range of parsing and phrasing strategies available to, and exhibited by, native speakers. This article recommends creating automatically-generated POS tagged and prosodically annotated variants of a text to enrich the gold standard and enable more robust ‘noise-tolerant’ evaluation of language models

    On Left and Right Dislocation: A Dynamic Perspective

    Get PDF
    The paper argues that by modelling the incremental and left-right process of interpretation as a process of growth of logical form (representing logical forms as trees), an integrated typology of left-dislocation and right-dislocation phenomena becomes available, bringing out not merely the similarities between these types of phenomena, but also their asymmetry. The data covered include hanging topic left dislocation, clitic left dislocation, left dislocation, pronoun doubling, expletives, extraposition, and right node raising, with each set of data analysed in terms of general principles of tree growth. In the light of the success in providing a characterisation of the asymmetry between left and right periphery phenomena, a result not achieved in more wellknown formalisms, the paper concludes that grammar formalisms should model the dynamics of language processing in time.Articl
    • 

    corecore