Search CORE

118 research outputs found

Characterization of Arabic sibilant consonants

Author: Abajaddi Nesrine
Elfahm Youssef
Elmaazouzi Laila
Farchi Abdelmajid
Mounir Badia
Mounir Ilham
Publication venue: 'Institute of Advanced Engineering and Science'
Publication date: 01/04/2023
Field of study

The aim of this study is to develop an automatic speech recognition system in order to classify sibilant Arabic consonants into two groups: alveolar consonants and post-alveolar consonants. The proposed method is based on the use of the energy distribution, in a consonant-vowel type syllable, as an acoustic cue. The application of this method on our own corpus reveals that the amount of energy included in a vocal signal is a very important parameter in the characterization of Arabic sibilant consonants. For consonants classifications, the accuracy achieved to identify consonants as alveolar or post-alveolar is 100%. For post-alveolar consonants, the rate is 96% and for alveolar consonants, the rate is over 94%. Our classification technique outperformed existing algorithms based on support vector machines and neural networks in terms of classification rate

ZENODO

Institute of Advanced Engineering and Science

Segmental intonation information in French fricatives

Author: Niebuhr Oliver
Welby Pauline
Publication venue: HAL CCSD
Publication date: 04/08/2019
Field of study

International audienceWe examined the "segmental intonation" hypothesis (Niebuhr, 2012), according to which voiceless consonants contain spectral information that may contribute to the percept of high or low pitch in the absence of fundamental frequency (F0). French speakers read target words embedded in a carrier phrase and containing fricatives in accentual phrase-initial,-medial or-final position (e.g. sidéré 'stunned', nécessite 'require', ressaisisse 'seize again'), expected to correspond to regions of low, intermediate or high F0, respectively, as well as control words containing only sonorants (e.g. laminé 'rolled'). Analyses show lower center of gravity (CoG) for word-initial (low F0 region) than for word-final (high F0 region) fricatives. For word-final fricatives, CoG is higher at the end than in the beginning of the fricative, which may contribute to the percept of the continuation of the F0 rise across the preceding vowel.ReferenceNiebuhr, Oliver. 2012. At the edge of intonation – The interplay of utterance-final F0 movements and voiceless fricative sounds. Phonetica 69, 7–27

Wavelet-based techniques for speech recognition

Author: Omar Farooq (7204418)
Publication venue
Publication date: 01/01/2002
Field of study

In this thesis, new wavelet-based techniques have been developed for the extraction of features from speech signals for the purpose of automatic speech recognition (ASR). One of the advantages of the wavelet transform over the short time Fourier transform (STFT) is its capability to process non-stationary signals. Since speech signals are not strictly stationary the wavelet transform is a better choice for time-frequency transformation of these signals. In addition it has compactly supported basis functions, thereby reducing the amount of computation as opposed to STFT where an overlapping window is needed. [Continues.

Phoneme Recognition on the TIMIT Database

Author: Carla Lopes
Fernando Perdigão
Publication venue: 'IntechOpen'
Publication date: 13/06/2011
Field of study