21 research outputs found

    Automatically Recognising European Portuguese Children's Speech

    Get PDF
    International audienceThis paper reports findings from an analysis of errors made by an automatic speech recogniser trained and tested with 3-10-year-old European Portuguese children's speech. We expected and were able to identify frequent pronunciation error patterns in the children's speech. Furthermore, we were able to correlate some of these pronunciation error patterns and automatic speech recognition errors. The findings reported in this paper are of phonetic interest but will also be useful for improving the performance of automatic speech recognisers aimed at children representing the target population of the study

    Correlating ASR Errors with Developmental Changes in Speech Production: A Study of 3-10-Year-Old European Portuguese Children's Speech

    Get PDF
    International audienceAutomatically recognising children's speech is a very difficult task. This difficulty can be attributed to the high variability in children's speech, both within and across speakers. The variability is due to developmental changes in children's anatomy, speech production skills et cetera, and manifests itself, for example, in fundamental and formant frequencies, the frequency of disfluencies, and pronunciation quality. In this paper, we report the results of acoustic and auditory analyses of 3-10-year-old European Portuguese children's speech. Furthermore, we are able to correlate some of the pronunciation error patterns revealed by our analyses - such as the truncation of consonant clusters - with the errors made by a children's speech recogniser trained on speech collected from the same age group. Other pronunciation error patterns seem to have little or no impact on speech recognition performance. In future work, we will attempt to use our findings to improve the performance of our recogniser

    Automatically Recognising European Portuguese Children's Speech

    Get PDF
    This paper reports findings from an analysis of errors made by an automatic speech recogniser trained and tested with 3-10-year-old European Portuguese children's speech. We expected and were able to identify frequent pronunciation error patterns in the children's speech. Furthermore, we were able to correlate some of these pronunciation error patterns and automatic speech recognition errors. The findings reported in this paper are of phonetic interest but will also be useful for improving the performance of automatic speech recognisers aimed at children representing the target population of the study

    Design and analysis of a database to evaluate children’s reading aloud performance

    Get PDF
    To evaluate the reading performance of children, human assessment is usually involved, where a teacher or tutor has to take time to individually estimate the performance in terms of fluency (speed, accuracy and expression). Automatic estimation of reading ability can be an important alternative or complement to the usual methods, and can improve other applications such as e-learning. Techniques must be developed to analyse audio recordings of read utterances by children and detect the deviations from the intended correct reading i.e. disfluencies. For that goal, a database of 284 European Portuguese children from 6 to 10 years old (1st–4th grades) reading aloud amounting to 20 h was collected in private and public Portuguese schools. This paper describes the design of the reading tasks as well as the data collection procedure. The presence of different types of disfluencies is analysed as well as reading performance compared to known curricular goals.info:eu-repo/semantics/acceptedVersio

    The Processing of Emotional Sentences by Young and Older Adults: A Visual World Eye-movement Study

    Get PDF
    Carminati MN, Knoeferle P. The Processing of Emotional Sentences by Young and Older Adults: A Visual World Eye-movement Study. Presented at the Architectures and Mechanisms of Language and Processing (AMLaP), Riva del Garda, Italy

    Combining translation into the second language and second language learning : an integrated computational approach

    Get PDF
    This thesis explores the area where translation and language learning intersects. However, this intersection is not one in the traditional sense of second language teaching: where translation is used as a means for learning a foreign language. This thesis treats translating into the foreign language as a separate entity, one that is as important as learning the foreign language itself. Thus the discussion in this thesis is especially relevant to an academic institution which contemplates training foreign language learners who can perform translation into the foreign language at a professional level. The thesis concentrates on developing a pedagogical model which can achieve the goal of fostering linguistic competence and translation competence at the same time. It argues that constructing such a model under a computerised framework is a viable approach, since the task of translation nowadays relies heavily on all kinds o

    The price of a perfect system: learnability and the distribution of errors in the speech of children learning English as a first language

    Get PDF
    This study reports on a strictly-cognitive and symptomatic approach to the treatment of phonological disorders, by an effect which can also be reproduced in most normally- developing children. To explain how this works, it is necessary to address certain asymmetries and singularities in the distribution of children's speech errors over the whole range of development. Particular words occasion particular errors. In early phonology there is 'fronting' with Coronal displacing Dorsal, and harmonies where Coronal is lost. In the middle of phonological acquisition, the harmonic pattern changes with coronal harmony coming to prevail over other forms. As well as these asymmetries, there is also the case of harmonic or migratory errors involving the property of affrication, but not the affricate as a whole, i.e. ignoring the property of voicing. Many of these asymmetries and singularities and the harmony or movement of affrication are described here for the first time. They are all difficult to explain in current theoretical models, especially in 'bottom-up' models. On the basis of the 'top-down' notion of 'parameters' from recent work in phonology, I shall assume that: A) finite learnability has to be ensured; B) there can be no privileged information about the learnability target; and C) phonological theory and the study of speech development (normal and otherwise) have an object in common. I shall propose: A) a Parameter Setting Function, as part of the human genome, possibly a defining part; B) Phonological Parapraxis', as a way of characterising the generalisations here about incompetent phonology by the general mechanisms of floating' and 'non-association'; C) a Stage (_n-1) as a necessary construct in the theory of acquisition, typically not reached before 8;6; D) a' Representability Inspection' relating normal competence to Chomsky's Articulatory/ Perceptual interface', sensitive to a relation between featural properties such as roundness or labiality and prosodic properties such as the foot and syllable; E) a syndrome. Specific Speech and Language Impairment, SSLI, extending the notion of Specific Language Impairment, SLI.I shall hypothesise that: A) segmental and suprasegmental representations interact; B) the phonological learnability space is uniform and consistent; C) it is the very minimality of the learnability system which makes it vulnerable to SSLI. This: A) side-steps the implausible inference that development proceeds by the loss of 'processes'; B) accounts for at least some of the asymmetries noted above; C) lets parameters set' a degree of abstract exponence; D) makes it possible to abolish 'processes' such as fronting, lisping, consonant harmony, in favour of successive degrees of imprecision in the parameterisation; E) provides a conceptual mechanism for the cognitive and symptomatic therapy, mentioned above: the therapy effects an increase in the set of phonological structures which are 'representable' by the child
    corecore