639 research outputs found
Reading acquisition: from digital screening to neurocognitive bases in a transparent orthography
155 p.El aprendizaje de la lectura es un ĂĄrea activa de investigaciĂłn en la psicologĂa y la neurociencia cognitiva. En las Ășltimas dĂ©cadas se ha avanzado enormemente en la comprensiĂłn de los procesos neurocognitivos subyacentes al aprendizaje de la lectura y a sus dificultades. Sin embargo, existen al menos dos dimensiones en las que es necesario seguir trabajando arduamente. Por un lado, el conocimiento actual sobre el aprendizaje de la lectura no ha impactado en las prĂĄcticas educativas. Por otro lado, la diversidad de las caracterĂsticas del aprendizaje de la lectura en distintas ortografĂas no se comprende cabalmente. La presente tesis se enfoca en el estudio del aprendizaje de la lectura combinando estrategias de identificaciĂłn oportuna de niños en riesgo lector en el contexto escolar, y estudios de laboratorio enfocados en comprender las bases neurocognitivas del aprendizaje de la lectura en una ortografĂa transparente como el español. Estos objetivos se lograron a travĂ©s de un diseño longitudinal comenzando desde la educaciĂłn inicial, siguiendo a un mismo grupo de aproximadamente 600 niños hasta segundo año de escuela. Los resultados muestran, por una parte, que es factible identificar a niños en riesgo lector incluso antes de la educaciĂłn primaria, y, por otra parte, que el aprendizaje de la lectura en una ortografĂa transparente como el español tiene caracterĂsticas comunes y caracterĂsticas distintivas respecto a ortografĂas opacas. Estos resultados ponen en evidencia la factibilidad de la identificaciĂłn oportuna de riesgo lector, y remarcan la importancia de considerar las caracterĂsticas de la ortografĂa durante el aprendizaje de la lectura
The production and perception of peripheral geminate/singleton coronal stop contrasts in Arabic
Gemination is typologically common word-medially but is rare at the periphery of the word (word-initially and -finally). In line with this observation, prior research on production and perception of gemination has focused primarily on medial gemination. Much less is known about the production and perception of peripheral gemination. This PhD thesis reports on comprehensive articulatory, acoustic and perceptual investigations of geminate-singleton contrasts according to the position of the contrast in the word and in the utterance. The production component of the project investigated the articulatory and acoustic features of medial and peripheral gemination of voiced and voiceless coronal stops in Modern standard Arabic and regional Arabic vernacular dialects, as produced by speakers from two disparate and geographically distant countries, Morocco and Lebanon. The perceptual experiment investigated how standard and dialectal Arabic gemination contrasts in each word position were categorised and discriminated by three groups of non-native listeners, each differing in their native language experience with gemination at different word positions. The first experiment used ultrasound and acoustic recordings to address the extent to which word-initial gemination in Moroccan and Lebanese dialectal Arabic is maintained, as well as the articulatory and acoustic variability of the contrast according to the position of the gemination contrast in the utterance (initial vs. medial) and between the two dialects. The second experiment compared the production of word-medial and -final gemination in Modern Standard Arabic as produced by Moroccan and Lebanese speakers. The aim of the perceptual experiment was to disentangle the contribution of phonological and phonetic effects of the listenersâ native languages on the categorisation and discrimination of non-lexical Moroccan gemination by three groups of non-native listeners varying in their phonological (native Lebanese group and heritage Lebanese group, for whom Moroccan is unintelligible, i.e., non-native language) and phonetic-only (native English group) experience with gemination across the three word positions. The findings in this thesis constitute important contributions about positional and dialectal effects on the production and perception of gemination contrasts, going beyond medial gemination (which was mainly included as control) and illuminating in particular the typologically rare peripheral gemination
Recommended from our members
Perceptual learning of context-sensitive phonetic detail
[Abstract abbreviated due to inability of DSpace@Cambridge to display phonetic symbols. Please see the full abstract in the attached pdf file.]
Although familiarity with a talker or accent is known to facilitate perception, it is not clear what underlies this phenomenon. Previous research has focused primarily on whether listeners can learn to associate novel phonetic characteristics with low-level units such as features or phonemes. However, this neglects the potential role of phonetic information at many other levels of representation. To address this shortcoming, this thesis investigated perceptual learning of systematic phonetic detail relating to higher levels of linguistic structure, including prosodic, grammatical and morphological contexts. Furthermore, in contrast to many previous studies, this research used relatively natural stimuli and tasks, thus maximising its relevance to perceptual learning in ordinary listening situations.
This research shows that listeners can update their phonetic representations in response to incoming information and its relation to linguistic-structural context. In addition, certain patterns of systematic phonetic detail were more learnable than others. These findings are used to inform an account of how new information is integrated with prior experience in speech processing, within a framework that emphasises the importance of phonetic detail at multiple levels of representation.This work was funded by an AHRC grant
Multiple Media Correlation: Theory and Applications
This thesis introduces multiple media correlation, a new technology for the automatic alignment of multiple media objects such as text, audio, and video. This research began with the question: what can be learned when multiple multimedia components are analyzed simultaneously? Most ongoing research in computational multimedia has focused on queries, indexing, and retrieval within a single media type. Video is compressed and searched independently of audio, text is indexed without regard to temporal relationships it may have to other media data. Multiple media correlation provides a framework for locating and exploiting correlations between multiple, potentially heterogeneous, media streams. The goal is computed synchronization, the determination of temporal and spatial alignments that optimize a correlation function and indicate commonality and synchronization between media objects. The model also provides a basis for comparison of media in unrelated domains. There are many real-world applications for this technology, including speaker localization, musical score alignment, and degraded media realignment. Two applications, text-to-speech alignment and parallel text alignment, are described in detail with experimental validation. Text-to-speech alignment computes the alignment between a textual transcript and speech-based audio. The presented solutions are effective for a wide variety of content and are useful not only for retrieval of content, but in support of automatic captioning of movies and video. Parallel text alignment provides a tool for the comparison of alternative translations of the same document that is particularly useful to the classics scholar interested in comparing translation techniques or styles. The results presented in this thesis include (a) new media models more useful in analysis applications, (b) a theoretical model for multiple media correlation, (c) two practical application solutions that have wide-spread applicability, and (d) Xtrieve, a multimedia database retrieval system that demonstrates this new technology and demonstrates application of multiple media correlation to information retrieval. This thesis demonstrates that computed alignment of media objects is practical and can provide immediate solutions to many information retrieval and content presentation problems. It also introduces a new area for research in media data analysis
Mechanisms of vowel devoicing in Japanese
The processes of vowel devoicing in Standard Japanese were examined with respect
to the phonetic and phonological environments and the syllable structure of Japanese, in
comparison with vowel reduction processes in other languages, in most of which vowel
reduction occurs optionally in fast or casual speech. This thesis examined whether
Japanese vowel devoicing was a phonetic phenomenon caused by glottal assimilation
between a high vowel and its adjacent voiceless consonants, or it was a more
phonologically controlled compulsory process.
Experimental results showed that Japanese high vowel devoicing must be analysed
separately in two devoicing conditions, namely single and consecutive devoicing
environments. Devoicing was almost compulsory regardless of the presence of
proposed blocking factors such as type of preceding consonant, accentuation, position
in an utterance, as long as there was no devoiceable vowel in adjacent morae (single
devoicing condition). However, under consecutive devoicing conditions, blocking
factors became effective and prevented some devoiceable vowels from becoming
voiceless.
The effect of speaking rate was also generally minimal in the single devoicing
condition, but in the consecutive devoicing condition, the vowels were devoiced more
at faster tempi than slower tempi, which created many examples of consecutively
devoiced vowels over two morae.
Durational observations found that vowel devoicing involves not only phonatory
change, but also slight durational reduction. However, the shorter duration of devoiced
syllables were adjusted at the word level, so that the whole duration of a word with
devoiced vowels remained similar to the word without devoiced vowels, regardless of
the number of devoiced vowels in the word.
It must be noted that there was no clear-cut distinction between voiced and
devoiced vowels, and the phonetic realisation of a devoiced vowel could vary from
fully voiced to completely voiceless. A high vowel may be voiced in a typical
devoicing environment, but its intensity is significantly weaker than those of vowels in
a non-devoicing environment, at all speaking tempi. The mean differences of vowel
intensities between these environments were generally higher at faster tempi.
The results implied that even when the vowel was voiced, its production process
moved in favour of devoicing. However, in consecutive devoicing conditions, this
process did not always apply. When some of the devoiceable vowels were devoiced in
the consecutive devoicing environment, the intensities of devoiceable vowels were not
significantly lower than those of other vowels.
The results of intensity measurements of voiced vowels in the devoicing and nondevoicing
environments suggested that Japanese vowel devoicing was part of the
overall process of complex vowel weakening, and that a completely devoiced vowel
was the final state of the weakening process. Japanese vowel devoicing is primarily a
process of glottal assimilation, but the results in the consecutive devoicing condition
showed that this process was constrained by Japanese syllable structure
- âŠ