
3,072 research outputs found

Machine Assisted Analysis of Vowel Length Contrasts in Wolof

Author: Besacier Laurent
Gauthier Elodie
Voisin Sylvie
Publication venue
Publication date: 01/06/2017
Field of study

Growing digital archives and improving algorithms for automatic analysis of text and speech create new research opportunities for fundamental research in phonetics. Such empirical approaches allow statistical evaluation of a much larger set of hypotheses about phonetic variation and its conditioning factors (among them geographical / dialectal variants). This paper illustrates this vision and proposes to challenge automatic methods for the analysis of a not easily observable phenomenon: vowel length contrast. We focus on Wolof, an under-resourced language from Sub-Saharan Africa. In particular, we propose multiple features to make a fine evaluation of the degree of length contrast under different factors such as: read vs semi-spontaneous speech; standard vs dialectal Wolof. Our measures made fully automatically on more than 20k vowel tokens show that our proposed features can highlight different degrees of contrast for each vowel considered. We notably show that contrast is weaker in semi-spontaneous speech and in a non-standard semi-spontaneous dialect. Comment: Accepted to Interspeech 201

arXiv.org e-Print Archive

Crossref

Hal - Université Grenoble Alpes

Strategies for Representing Tone in African Writing Systems

Author: Bird Steven
Publication venue: John Benjamins
Publication date: 01/01/1999
Field of study

Tone languages provide some interesting challenges for the designers of new orthographies. One approach is to omit tone marks, just as stress is not marked in English (zero marking). Another approach is to do phonemic tone analysis and then make heavy use of diacritic symbols to distinguish the 'tonemes' (exhaustive marking). While orthographies based on either system have been successful, this may be thanks to our ability to manage inadequate orthographies rather than to any intrinsic advantage which is afforded by one or the other approach. In many cases, practical experience with both kinds of orthography in sub-Saharan Africa has shown that people have not been able to attain the level of reading and writing fluency that we know to be possible for the orthographies of non-tonal languages. In some cases this can be attributed to a sociolinguistic setting which does not favour vernacular literacy. In other cases, the orthography itself might be to blame. If the orthography of a tone language is difficult to use or to learn, then a good part of the reason, I believe, is that the designer either has not paid enough attention to the function of tone in the language, or has not ensured that the information encoded in the orthography is accessible to the ordinary (non-linguist) user of the language. If the writing of tone is not going to continue to be a stumbling block to literacy efforts, then a fresh approach to tone orthography is required, one which assigns high priority to these two factors. This article describes the problems with orthographies that use too few or too many tone marks, and critically evaluates a wide range of creative intermediate solutions. I review the contributions made by phonology and reading theory, and provide some broad methodological principles to guide someone who is seeking to represent tone in a writing system.
The tone orthographies of several languages from sub-Saharan Africa are presented throughout the article, with particular emphasis on some tone languages of Cameroon.

CogPrints Cognitive Sciences Eprint Archive

Speed perturbation and vowel duration modeling for ASR in Hausa and Wolof languages

Author: Besacier Laurent
Gauthier Elodie
Voisin Sylvie
Publication venue: HAL CCSD
Publication date: 01/09/2016
Field of study

Automatic Speech Recognition (ASR) for (under-resourced) Sub-Saharan African languages faces several challenges: small amount of transcribed speech, written language normalization issues, few text resources available for language modeling, as well as specific features (tones, morphology, etc.) that need to be taken into account seriously to optimize ASR performance. This paper tries to address some of the above challenges through the development of ASR systems for two Sub-Saharan African languages: Hausa and Wolof. First, we investigate a data augmentation technique (speed perturbation) to overcome the lack of resources. Secondly, the main contribution is our attempt to model the vowel length contrast that exists in both languages. For reproducible experiments, the ASR systems developed for Hausa and Wolof are made available to the research community on GitHub. To our knowledge, the Wolof ASR system presented in this paper is the first large vocabulary continuous speech recognition system ever developed for this language.

Crossref

Hal - Université Grenoble Alpes

A Tutorial on Acoustic Phonetic Feature Extraction for Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) Applications in African Languages

Author: Koffi Ettien
Publication venue: The Repository at St. Cloud State
Publication date: 24/03/2020
Field of study

At present, Siri, Dragon Dictate, Google Voice, and Alexa-like functionalities are not available in any indigenous African language. Yet, a 2015 Pew Research study found that between 2002 and 2014, mobile phone usage increased tenfold in Africa, from 8% to 83%.[1] The Acoustic Phonetic Approach (APA) discussed in this paper lays the foundation that will make Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) applications possible in African languages. The paper is written as a tutorial so that others can use the information therein to help digitalize many of the continent’s indigenous languages. [1] http://www.pewglobal.org/2015/04/15/cell-phones-in-africa-communication-lifeline/. Retrieved on November 10, 2017.

St. Cloud State University

Speech Technologies for African Languages: Example of a Multilingual Calculator for Education

Author: Bagshaw Paul
Besacier Laurent
Bretier Philippe
Gauthier Elodie
Mangeot Mathieu
Marsico Egidio
Moudenc Thierry
Nocera Pascal
Pellegrino François
Rosec Olivier
Voisin Sylvie
Publication venue: HAL CCSD
Publication date: 06/09/2015
Field of study

This paper presents our achievements after 18 months of the ALFFA project dealing with African language technologies. We focus on a multilingual calculator (Android app) that will be demonstrated during the Show and Tell session.

Hal - Université Grenoble Alpes

HAL

Recommended from our members

Intrinsic vowel F0, the size of vowel inventories and second language acquisition

Author: Van Hoof Sarah
Verhoeven Jo
Publication venue: Elsevier BV
Publication date: 01/01/2011
Field of study

The phenomenon of intrinsic vowel F0 (IF0), in which high vowels exhibit higher F0 than low vowels, has been widely attested in languages of the world. Most often, IF0 is regarded as an automatic, physiologically determined phenomenon, whereas some claim that IF0 is a controlled feature, introduced to enhance vowel contrasts. This paper presents new evidence on this issue by means of a cross-linguistic investigation of the influence of vowel inventory size on IF0 and a study of IF0 in second language (L2) acquisition. IF0 was measured in three language varieties: Arabic (a language with 3 vowels), Dutch (a 12-vowel system), and Dutch spoken by native Arabic-speaking learners. IF0 was significantly larger in Dutch than Arabic, but did not differ significantly between Arabic and Dutch produced by L2 learners. No spectral differences between the corresponding vowels of the three language varieties were found. While confirming the universality of IF0, these results also suggest that the size of IF0 may be language-specific, depending on the need to enhance vowel contrasts. Thus, these results agree well with a mixed physiological-enhancement account, which assumes that IF0 is physiologically determined, but also at least in part the effect of an interacting, controlled mechanism.

City Research Online

Crossref

Ghent University Academic Bibliography

Institutional Repository Universiteit Antwerpen

Skilled reading in isiZulu: what can we learn from it?

Author: Land Sandra
Publication venue: Academy of Science of South Africa
Publication date: 12/10/2020
Field of study

Research on reading in African languages is particularly pertinent in South Africa now, in view of the poor reading performance in many South African schools. This paper is based on a study of competent adult readers of isiZulu that analysed what its orthography (the way it is written) requires of readers. As an agglutinative language with a conjoined writing system, isiZulu carries meaning not only in separate words, but also in morphemes that cluster together, forming long complex words. Eye tracking data shows that competent readers of isiZulu move their eyes across text in saccades (shifts of the point of focus) that are short in comparison with the saccades of efficient reading of English. It also shows that readers of isiZulu fixate on points of text for longer periods than do readers of English. The study links eye movement data to information gained from a stimulated recall process, to discover strategies consciously used by competent readers of isiZulu. Some of these strategies, such as visualisation, are common to efficient readers of all languages, while others might be peculiar to agglutinating and/or tonal languages. These strategies inform suggestions for the development of effective reading skills in isiZulu. The key argument of the paper is that the orthography of isiZulu has features that require attention by teachers of reading if their learners are to benefit from the advantages that reading in their first language should bring.

Open Journals@UKZN

Speech Communication

Author: Halle Morris
Stevens Kenneth N.
Wolf J. J.
Publication venue: Research Laboratory of Electronics (RLE) at the Massachusetts Institute of Technology (MIT)
Publication date: 15/07/1969
Field of study

Contains reports on two research projects. U. S. Air Force Cambridge Research Laboratories, Office of Aerospace Research, under Contract F19628-69-C-0044. National Institutes of Health (Grant 2 RO1 NB-04332-06)

DSpace@MIT

Proceedings of the EACL 2009 Workshop on Language Technologies for African Languages

Author: De Pauw Guy
de Schryver Gilles-Maurice
Levin Lori
Publication venue: Association for Computational Linguistics (ACL)
Publication date: 01/01/2009
Field of study

Ghent University Academic Bibliography