306,008 research outputs found

    Contextual confidence measures for continuous speech recognition

    This paper explores the effect of contextual information on confidence measures for continuous speech recognition results. Our approach comprises three steps: extracting confidence predictors from recognition results; combining those predictors into confidence measures by means of a fuzzy inference system whose parameters are estimated directly from examples with an evolutionary strategy; and, finally, refining the confidence measures by including contextual information. In experiments on two different continuous speech application tasks, the context re-scoring procedure improves the ability of confidence measures to discriminate between correct and incorrect recognition results at every threshold level, even when a rather simple method of adding contextual information is used.

    A summary of research in reading readiness

    Thesis (Ed.M.)--Boston University. Purpose: To measure the various abilities presented in the readiness workbooks of basal reading series and to relate the findings to reading achievement of Grade One in January; also, to measure the knowledge of letter names and sounds and relate the findings to reading achievement of Grade One in January. Materials Used: Workbooks of nine systems were analyzed to discover the types and frequency of suggested exercises. Four general areas were in evidence: auditory discrimination, language development, motor skills, and visual discrimination. Group tests were constructed to include exercises comparable to the published ones, with ceilings in all areas beyond the workbook material. In addition to these four tests, the Boston University Individual Test and the Boston University First Grade Success Study (January Test) were given. Intelligence was measured by the Otis Quick Scoring Mental Ability Test, which had been given in October [TRUNCATED]

    Vocabulary size influences spontaneous speech in native language users: Validating the use of automatic speech recognition in individual differences research

    Previous research has shown that vocabulary size affects performance on laboratory word production tasks. Individuals who know many words show faster lexical access and retrieve more words belonging to pre-specified categories than individuals who know fewer words. The present study examined the relationship between receptive vocabulary size and speaking skills as assessed in a natural sentence production task. We asked whether measures derived from spontaneous responses to everyday questions correlate with the size of participants' vocabulary. Moreover, we assessed the suitability of automatic speech recognition for the analysis of participants' responses in complex language production data. We found that vocabulary size predicted indices of spontaneous speech: individuals with a larger vocabulary produced more words and had a higher speech-silence ratio than individuals with a smaller vocabulary. Importantly, these relationships were reliably identified using both manual and automated transcription methods. Taken together, our results suggest that spontaneous speech elicitation is a useful method to investigate natural language production and that automatic speech recognition can alleviate the burden of labor-intensive speech transcription.

    Frequency drives lexical access in reading but not in speaking: the frequency-lag hypothesis

    To contrast mechanisms of lexical access in production versus comprehension, we compared the effects of word frequency (high, low), context (none, low constraint, high constraint), and level of English proficiency (monolingual, Spanish-English bilingual, Dutch-English bilingual) on picture naming, lexical decision, and eye fixation times. Semantic constraint effects were larger in production than in reading. Frequency effects were larger in production than in reading without constraining context but larger in reading than in production with constraining context. Bilingual disadvantages were modulated by frequency in production but not in eye fixation times, were not smaller in low-constraint contexts, and were reduced by high-constraint contexts only in production and only at the lowest level of English proficiency. These results challenge existing accounts of bilingual disadvantages and reveal fundamentally different processes during lexical access across modalities, entailing a primarily semantically driven search in production but a frequency-driven search in comprehension. The apparently more interactive process in production than in comprehension could simply reflect a greater number of frequency-sensitive processing stages in production.

    Analysis of ceiling effects occurring with speech recognition tests in adult cochlear-implanted patients

    This article presents a simple method of analysing speech test scores that are biased by ceiling effects. Eighty postlingually deafened adults implanted with a MED-EL COMBI 40/40+ cochlear implant (CI) were administered a numbers test and a sentence test at initial device activation and at 1, 3, 6, 12 and 24 months thereafter. As a measure of speech recognition performance, the number of patients who scored at the 'ceiling level' (i.e. at least 95% correct answers) was counted at each test interval. Results showed a quick increase in this number soon after device activation as well as a continuous improvement over time (numbers test: 1 month: 51%; 6 months: 73%; 24 months: 88%; sentence test: 1 month: 33%; 6 months: 49%; 24 months: 64%). The new method allows speech recognition progress in CI patient samples to be detected even at late test intervals, where improvement curves based on averaged scores usually flatten. Copyright (C) 2004 S. Karger AG, Basel
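
The counting step described in this abstract can be illustrated with a minimal sketch. It assumes scores are percent-correct values grouped by test interval; the function name `ceiling_rate`, the data layout, and the example scores are hypothetical and are not taken from the article.

```python
from typing import Dict, List

# "At least 95% correct answers" is the ceiling level stated in the abstract.
CEILING_THRESHOLD = 95.0


def ceiling_rate(
    scores_by_interval: Dict[str, List[float]],
    threshold: float = CEILING_THRESHOLD,
) -> Dict[str, float]:
    """Return, per test interval, the fraction of patients at or above the ceiling.

    Intervals with no scores are skipped rather than reported as zero.
    """
    rates: Dict[str, float] = {}
    for interval, scores in scores_by_interval.items():
        if not scores:
            continue
        at_ceiling = sum(1 for score in scores if score >= threshold)
        rates[interval] = at_ceiling / len(scores)
    return rates


# Made-up scores for four hypothetical patients (not data from the study).
example_scores = {
    "1 month": [96.0, 80.0, 100.0, 70.0],
    "6 months": [96.0, 95.0, 100.0, 70.0],
}
rates = ceiling_rate(example_scores)
```

Because each patient contributes only a yes/no value per interval, this fraction can keep rising after the mean score has saturated near 100%.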

    Durations of repeated non-words for children with cochlear implants

    Durations of syllables for repeated non-words were calculated for 76 children with cochlear implants (CIs) and 16 children with normal hearing (NH). Average syllable durations did not differ significantly between the groups; however, the final syllable lengthening ratio in CI children was significantly shorter than that of their NH peers. Measures of hearing-related demographics were not correlated with CI syllable measures.

    Comparison and Adaptation of Automatic Evaluation Metrics for Quality Assessment of Re-Speaking

    Re-speaking is a mechanism for obtaining high-quality subtitles for use in live broadcasts and other public events. Because it relies on humans performing the actual re-speaking, estimating the quality of the results is non-trivial. Most organisations rely on humans to perform the quality assessment, but purely automatic methods have been developed for similar problems such as Machine Translation. This paper compares several of these methods: BLEU, EBLEU, NIST, METEOR, METEOR-PL, TER and RIBES. These are then matched to the human-derived NER metric commonly used in re-speaking.

    The listening talker: A review of human and algorithmic context-induced modifications of speech

    Speech output technology is finding widespread application, including in scenarios where intelligibility might be compromised, at least for some listeners, by adverse conditions. Unlike most current algorithms, talkers continually adapt their speech patterns in response to the immediate context of spoken communication, where the type of interlocutor and the environment are the dominant situational factors influencing speech production. Observations of talker behaviour can motivate the design of more robust speech output algorithms. Starting with a listener-oriented categorisation of possible goals for speech modification, this review article summarises the extensive set of behavioural findings related to human speech modification, identifies which factors appear to be beneficial, and goes on to examine previous computational attempts to improve intelligibility in noise. The review concludes by tabulating 46 speech modifications, many of which have yet to be perceptually or algorithmically evaluated. Consequently, the review provides a roadmap for future work on improving the robustness of speech output.