458 research outputs found

    Comparing phrase-final patterns across speech styles and groups in European Portuguese

    This study examines the phonetic-phonological effects and the types of tones found at the boundaries of major and minor intonational units (break indices 4 and 3 in the ToBI system), comparing spontaneous speech with prepared (non-read) speech produced by adolescents and adults at school. The corpus analyzed consists of 1041 intonational units extracted from 18 oral presentations given by 6 students (aged 14-15) and 3 teachers (a subset of the CPE-FACES corpus). The results show that phonetic and prosodic cues at the end of intonational units vary according to (i) speech style and (ii) the age/status of the speakers.

    Prosodic exercises for children with ASD via virtual therapy

    which means that there is a wide degree of variation in the way it affects people. Even though the spectrum is very broad, the literature consistently characterizes the speech of autistic children as lacking the wealth of prosodic parameters shown by healthy children, such as the emotional aspects reflected in communicative interaction. The use of technology as a teaching tool has been growing, and educational exercises presented on electronic devices prove more attractive and captivating for children than traditional methods. In this project, we developed prosodic exercises for intonation assessment in an imitation task, where the main focus is the development and enrichment of the prosodic abilities of children with autism spectrum disorders (ASD), as a complement to therapy sessions. We evaluated the intonation assessment method, achieving accuracy values between 70% and 83.3%, depending on the feature set used (pitch, energy, Mel-Frequency Cepstral features, and pseudo-syllable information), and also by making a fusion of all features. Although the original intention was to integrate these exercises into an existing platform for children diagnosed with ASD, the current implementation is a stand-alone mobile application.

    Extending automatic transcripts in a unified data representation towards a prosodic-based metadata annotation and evaluation

    This paper describes a framework that extends automatic speech transcripts in order to accommodate relevant information coming from manual transcripts, the speech signal itself, and other resources, like lexica. The proposed framework automatically collects, relates, computes, and stores all relevant information together in a self-contained data source, making it possible to easily provide a wide range of interconnected information suitable for speech analysis, training, and evaluating a number of automatic speech processing tasks. The main goal of this framework is to integrate different linguistic and paralinguistic layers of knowledge for a more complete view of their representation and interactions in several domains and languages. The processing chain is composed of two main stages: the first integrates the relevant manual annotations into the speech recognition data, and the second further enriches that output in order to accommodate prosodic information. The described framework has been used for the identification and analysis of structural metadata in automatic speech transcripts. Initially put to use for automatic detection of punctuation marks and for capitalization recovery from speech data, it has also recently been used for studying the characterization of disfluencies in speech. It has already been applied to several domains of Portuguese corpora, and also to English and Spanish Broadcast News corpora.

    Acoustic-prosodic entrainment in structural metadata events

    This paper presents an acoustic-prosodic analysis of entrainment in a Portuguese map-task corpus. Our aim is to analyze how turn-by-turn entrainment varies with distinct structural metadata events: types of sentence-like units (SU) in consecutive turns (e.g. interrogatives followed by declaratives, or both declaratives), and with the presence of discourse markers, affirmative cue words, and disfluencies at the beginning of turns. Entrainment at turn-exchanges may be observed in terms of pitch, energy, duration, and voice quality. Regarding SU types, question-answer turns are the ones with stronger similarity, and declarative-interrogative pairs are the ones where less entrainment occurs, as expected. Moreover, in question-answer pairs, there is also stronger evidence of entrainment with Yes/No and Tag questions than with Wh- questions. In fact, these subtypes are coded in distinctive prosodic ways (moreover, the first subtype has no associated lexical-syntactic cues in Portuguese, only prosodic). As for turn-initial structures, entrainment is stronger when the second turn begins with an affirmative cue word; less strong with ambiguous structures (such as ‘OK’), emphatic affirmative answers, and negative answers; and scarce with disfluencies and discourse markers. The different degrees of local entrainment may be related to the informative structure of distinct structural metadata events.

    Obliterative Surgery for Pelvic Organ Prolapse - an Option that Remains Valid?

    Objective: To investigate the results of vaginal obliterative surgery in elderly women with pelvic organ prolapse. Design: Observational retrospective study. Material and Methods: A total of 69 women with the diagnosis of pelvic prolapse were submitted to obliterative surgery in the urogynecology unit of a tertiary care hospital centre over the course of 8 years (2001 to 2008). The following data were collected from their clinical records: age, number of vaginal births, body mass index (BMI), hormone therapy, other existing diseases, type of prolapse and stage, anesthetic risk score, duration of surgery, length of hospital stay, and short-term complications. Results: Of the 69 women studied, 31 were submitted to colpocleisis and the remaining 38 were managed by the LeFort technique. Mean age was 74.8 years with a standard deviation (sd) of 7.14 years. Average BMI was 26.2 (sd = 3.76). Vaginal births were recorded in all patients. Only three patients were taking hormone therapy at the time of surgery. Sixty-three women were classified as having an anesthetic risk of II or III and 55 underwent local-regional anesthesia. Complications were reported in five cases, four of them in the first days after surgery. Nearly all were mild and resolved within the first 6 weeks. Conclusion: Complication rates appear to be low after obliterative surgery for pelvic organ prolapse in elderly women.

    Comparing different machine learning approaches for disfluency structure detection in a corpus of university lectures

    This paper presents a number of experiments focusing on assessing the performance of different machine learning methods on the identification of disfluencies and their distinct structural regions over speech data. Several machine learning methods have been applied, namely Naive Bayes, Logistic Regression, Classification and Regression Trees (CARTs), J48 and Multilayer Perceptron. Our experiments show that CARTs outperform the other methods on the identification of the distinct structural disfluent regions. Reported experiments are based on audio segmentation and prosodic features, calculated from a corpus of university lectures in European Portuguese, containing about 32h of speech and about 7.7% of disfluencies. The set of features automatically extracted from the forced alignment corpus proved to be discriminant of the regions contained in the production of a disfluency. This work shows that using fully automatic prosodic features, disfluency structural regions can be reliably identified using CARTs, where the best results achieved correspond to 81.5% precision, 27.6% recall, and 41.2% F-measure. The best results concern the detection of the interregnum, followed by the detection of the interruption point.
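
    The precision, recall, and F-measure figures quoted in the abstract above can be related with a short sketch. It assumes the reported F-measure is the balanced F1 score (the harmonic mean of precision and recall), which the abstract does not state explicitly; the function name `f_measure` is hypothetical and chosen only for illustration.

```python
def f_measure(precision: float, recall: float, beta: float = 1.0) -> float:
    """Return the F-beta score; beta=1.0 gives the balanced F1 (harmonic mean).

    Assumption: the abstract's "F-measure" is F1. Inputs are fractions in [0, 1].
    """
    if precision < 0 or recall < 0:
        raise ValueError("precision and recall must be non-negative")
    denominator = beta**2 * precision + recall
    if denominator == 0:
        # Both precision and recall are zero: define the score as 0.
        return 0.0
    return (1 + beta**2) * precision * recall / denominator


# Figures taken from the abstract: 81.5% precision, 27.6% recall.
score = f_measure(0.815, 0.276)
print(f"F1 = {score * 100:.1f}%")
```

    Under that assumption the computed value rounds to the 41.2% given in the abstract, which illustrates how low recall pulls the harmonic mean down despite high precision.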

    Cross-domain analysis of discourse markers in European Portuguese

    This paper presents an analysis of discourse markers in two spontaneous speech corpora for European Portuguese - university lectures and map-task dialogues - and also in a collection of tweets, aiming at contributing to their categorization, scarcely existent for European Portuguese. Our results show that the selection of discourse markers is domain and speaker dependent. We also found that the most frequent discourse markers are similar in all three corpora, despite tweets containing discourse markers not found in the other two corpora. In this multidisciplinary study, comprising both a linguistic perspective and a computational approach, discourse markers are also automatically discriminated from other structural metadata events, namely sentence-like units and disfluencies. Our results show that discourse markers and disfluencies tend to co-occur in the dialogue corpus, but have a complementary distribution in the university lectures. We used three acoustic-prosodic feature sets and machine learning to automatically distinguish between discourse markers, disfluencies and sentence-like units. Our in-domain experiments achieved an accuracy of about 87% in university lectures and 84% in dialogues, in line with our previous results. The eGeMAPS features, commonly used for other paralinguistic tasks, achieved a considerable performance on our data, especially considering the small size of the feature set. Our results suggest that turn-initial discourse markers are usually easier to classify than disfluencies, a result also previously reported in the literature. We conducted a cross-domain evaluation in order to evaluate the robustness of the models across domains. The results achieved are about 11%-12% lower, but we conclude that data from one domain can still be used to classify the same events in the other. Overall, despite the complexity of this task, these are very encouraging state-of-the-art results. 
    Ultimately, using exclusively acoustic-prosodic cues, discourse markers can be fairly discriminated from disfluencies and SUs. In order to better understand the contribution of each feature, we have also reported the impact of the features in both the dialogues and the university lectures. Pitch features, namely pitch slopes, are the most relevant ones for the distinction between discourse markers and disfluencies. These features are in line with the wide pitch range of discourse markers, in a continuum from a very compressed pitch range to a very wide one, expressed by totally deaccented material or H+L* L* contours, with upstep H tones.

    Correlation Effects in Nuclear Transparency

    The Glauber approximation is used to calculate the contribution of nucleon correlations in high-energy A(e,e'N) reactions. When the excitation energy of the residual nucleus is small, the increase of the nuclear transparency due to correlations between the struck nucleon and the other nucleons is mostly compensated by a decrease of the transparency due to the correlations between non-detected nucleons. We derive Glauber model predictions for nuclear transparency for the differential cross section when nuclear shell level excitations are measured. The role of correlations in color transparency is briefly discussed. Comment: 24 pages revtex, 4 uuencoded PostScript Figures as separate fil