Search CORE

22 research outputs found

Estimation of Severity of Speech Disability through Speech Envelope

Author: Gudi Anandthirtha B.
Nagaraj H. C.
Shreedhar H. K.
Publication venue: 'Academy and Industry Research Collaboration Center (AIRCC)'
Publication date: 21/07/2011

In this paper, envelope detection of speech is discussed as a means of distinguishing pathological cases among speech-disabled children. Speech signal samples from children aged five to eight years are considered for the present study. These speech signals are digitized and used to determine the speech envelope. The envelope is subjected to ratio mean analysis to estimate the disability. This analysis is conducted on ten speech signal samples related to both place of articulation and manner of articulation. The overall speech disability of a pathological subject is estimated from the results of this analysis. Comment: 8 pages, 4 figures, Signal & Image Processing Journal AIRC
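As a rough illustration of what determining a speech envelope from a digitized signal can look like, the sketch below rectifies the samples and smooths them with a moving average. This is only one common way to obtain an envelope and is an assumption here; the abstract does not say which detector the authors used, and the function name, window length and synthetic test tone are all hypothetical. The ratio mean analysis is not reproduced.

```python
import math


def envelope(signal, window=5):
    """Amplitude envelope via full-wave rectification and a moving average.

    Assumption: a simple smoother stands in for whatever envelope
    detector the paper actually uses.
    """
    rectified = [abs(x) for x in signal]
    half = window // 2
    out = []
    for i in range(len(rectified)):
        lo = max(0, i - half)
        hi = min(len(rectified), i + half + 1)
        # Average over the part of the window that lies inside the signal.
        out.append(sum(rectified[lo:hi]) / (hi - lo))
    return out


# Synthetic amplitude-modulated tone standing in for a digitized speech sample.
fs = 8000  # hypothetical sampling rate in Hz
tone = [
    math.sin(2 * math.pi * 5 * n / fs) * math.sin(2 * math.pi * 1000 * n / fs)
    for n in range(fs)
]
env = envelope(tone, window=81)
```

The smoothed curve `env` follows the slow 5 Hz modulation rather than the 1 kHz carrier, which is the kind of slowly varying outline an envelope-based analysis would work on.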

arXiv.org e-Print Archive

Crossref

The Effect of Silence Removal and Speech Segmentation on Turkish Automatic Speech Recognition

Author: Hayri Sever
Hüseyin Polat
Saadin Oyucu
Publication venue: 'Duzce Universitesi Bilim ve Teknoloji Dergisi'
Publication date: 01/01/2020

Automatic speech recognition systems are developed primarily from acoustic information. Paired speech and text data are used to derive phoneme information from the acoustic information. Acoustic models trained on these data cannot model all of the acoustic information encountered in real life. For this reason, certain preprocessing steps must be applied, and acoustic information that would degrade the performance of automatic speech recognition systems must be removed. In this study, a method is proposed for removing the silences that occur within speech. The aim of the proposed method is to remove silence information and to split into segments those utterances that create long dependencies in the acoustic information. The silence-free, segmented speech produced by the method was given as input to a Turkish automatic speech recognition system. At the output of the recognition system, the texts corresponding to the input speech segments were concatenated and presented. The experiments showed that removing silence and segmenting speech improves the performance of automatic speech recognition systems.
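The abstract does not describe how silences are detected, so the following is only a minimal sketch of one conventional approach: frames whose short-time energy falls below a fixed threshold are treated as silence, and a long enough run of silent frames closes the current speech segment. The function name, frame length, threshold and minimum silence length are hypothetical choices, not values from the study.

```python
def split_on_silence(samples, frame_len=4, threshold=0.01, min_silence_frames=2):
    """Return (start, end) sample ranges of speech, split at silent runs.

    Assumption: fixed-threshold short-time energy is used as the
    silence detector; the paper's actual criterion is not stated.
    """
    n_frames = len(samples) // frame_len
    voiced = []
    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        energy = sum(x * x for x in frame) / frame_len
        voiced.append(energy > threshold)

    segments = []
    start = None      # first frame of the segment currently open
    silent_run = 0    # consecutive silent frames seen inside that segment
    for i, is_voiced in enumerate(voiced):
        if is_voiced:
            if start is None:
                start = i
            silent_run = 0
        elif start is not None:
            silent_run += 1
            if silent_run >= min_silence_frames:
                # Close the segment at the first frame of the silent run.
                end_frame = i - silent_run + 1
                segments.append((start * frame_len, end_frame * frame_len))
                start = None
                silent_run = 0
    if start is not None:
        # Signal ended inside a segment; drop any trailing silent frames.
        end_frame = n_frames - silent_run
        segments.append((start * frame_len, end_frame * frame_len))
    return segments


# Toy signal: silence, speech, a long pause, a short burst, silence.
toy = [0.0] * 8 + [1.0] * 8 + [0.0] * 12 + [1.0] * 4 + [0.0] * 4
print(split_on_silence(toy))
```

Each returned range could then be passed to a recognizer separately and the resulting texts concatenated, which mirrors the pipeline the abstract outlines.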

Directory of Open Access Journals

Duzce University Open Access

A Novel Adaptive method for Acoustic Echo Cancellation

Author: Pravat Ku. Dash, Pradyumna Ku. Mohapatra, Sunil Ku. Bisoi
Publication venue: 'Auricle Technologies, Pvt., Ltd.'
Publication date: 30/06/2016

Speech is essential in audio teleconferencing systems. At present, acoustic echo is a major setback for users and degrades the quality of communication. With adaptive filtering methods, acoustic echo can be suppressed and brought to a desired level. A detailed performance assessment is reported, including echo return loss enhancement (ERLE), convergence time and system distance metrics. We have also compared two different signals and shown how noise can be cancelled out using the NLMS algorithm.
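For readers unfamiliar with NLMS, the sketch below shows the textbook normalized least-mean-squares update applied to a toy echo path, together with an ERLE figure computed as the ratio of microphone power to residual power in decibels. It is not the authors' implementation: the filter length, step size, echo path and noise-free simulated signals are all illustrative assumptions.

```python
import math
import random


def nlms(far_end, mic, taps=4, mu=0.5, eps=1e-8):
    """Textbook NLMS: adapt FIR weights so the filtered far-end signal matches mic."""
    w = [0.0] * taps
    buf = [0.0] * taps  # most recent far-end samples, newest first
    errors = []
    for x, d in zip(far_end, mic):
        buf = [x] + buf[:-1]
        y = sum(wi * bi for wi, bi in zip(w, buf))   # echo estimate
        e = d - y                                     # residual after cancellation
        norm = sum(b * b for b in buf) + eps          # input power normalization
        w = [wi + mu * e * bi / norm for wi, bi in zip(w, buf)]
        errors.append(e)
    return w, errors


def erle_db(mic, errors):
    """Echo return loss enhancement: microphone power over residual power, in dB."""
    p_mic = sum(d * d for d in mic)
    p_err = max(sum(e * e for e in errors), 1e-30)  # guard against log of zero
    return 10 * math.log10(p_mic / p_err)


# Hypothetical noise-free simulation: white far-end signal through a short echo path.
random.seed(0)
far_end = [random.uniform(-1.0, 1.0) for _ in range(2000)]
echo_path = [0.6, 0.3, -0.1]
mic = [
    sum(h * far_end[n - k] for k, h in enumerate(echo_path) if n - k >= 0)
    for n in range(len(far_end))
]
weights, residual = nlms(far_end, mic)
tail_erle = erle_db(mic[-500:], residual[-500:])
```

In this idealized setting the weights approach the simulated echo path and the ERLE over the final samples becomes very large; with real room responses, near-end speech and noise, the achievable figures are far more modest.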

International Journal on Recent and Innovation Trends in Computing and Communication

Spotting Agreement and Disagreement: A Survey of Nonverbal Audiovisual Cues and Tools

Author: Bousmalis Konstantinos
Mehu Marc
Pantic Maja
Publication venue: IEEE Computer Society Press
Publication date: 01/01/2009

While detecting and interpreting temporal patterns of non-verbal behavioral cues in a given context is a natural and often unconscious process for humans, it remains a rather difficult task for computer systems. Nevertheless, it is an important one to achieve if the goal is to realise a naturalistic communication between humans and machines. Machines that are able to sense social attitudes like agreement and disagreement and respond to them in a meaningful way are likely to be welcomed by users due to the more natural, efficient and human-centered interaction they are bound to experience. This paper surveys the nonverbal cues that could be present during agreement and disagreement behavioural displays and lists a number of tools that could be useful in detecting them, as well as a few publicly available databases that could be used to train these tools for analysis of spontaneous, audiovisual instances of agreement and disagreement.

CiteSeerX

University of Twente Research Information

A Family of Coherence-Based Multi-Microphone Speech Enhancement Systems

Author: Quang Hung Pham
Sovka Pavel
Publication venue: Společnost pro radioelektronické inženýrství
Publication date: 01/06/2003

This contribution addresses the problem of additive noise reduction in speech picked up by a microphone in a noisy environment. Two systems belonging to the family of coherence-based noise cancellers are presented. The suggested systems have a modular structure using 2 or 4 microphones and suppress non-stationary noise by 4 to 17 dB, depending on the chosen structure and noise characteristics. The common properties are acceptable noise suppression, low speech distortion and residual noise.

Directory of Open Access Journals

Digital library of Brno University of Technology

Voice activity detection based on conjugate subspace matching pursuit and likelihood ratio test

Author: Jiqing Han
Shiwen Deng
Publication venue: Springer Nature
Publication date: 01/01/2011

Springer - Publisher Connector

New Advances in Voice Activity Detection using HOS and Optimization Strategies

Author: C.G. Puntonet
J. Ramirez
J.M. Gorriz
Publication venue: 'IntechOpen'
Publication date: 01/06/2007

IntechOpen

A Novel Robust Mel-Energy Based Voice Activity Detector for Nonstationary Noise and Its Application for Speech Waveform Compression

Author: Waheeduddin Syed, Q.
Publication venue: LSU Digital Commons
Publication date: 01/01/2006

Voice activity detection (VAD) is crucial in all kinds of speech applications. However, almost all existing VAD algorithms suffer from the nonstationarity of both speech and noise. To combat this difficulty, we propose a new voice activity detector, which is based on Mel-energy features and an adaptive threshold related to the signal-to-noise ratio (SNR) estimates. In this thesis, we first justify the robustness of the Bayes classifier using the Mel-energy features over that using the Fourier spectral features in various noise environments. Then, we design an algorithm using the dynamic Mel-energy estimator and the adaptive threshold, which depends on the SNR estimates. In addition, a realignment scheme is incorporated to correct the sparse-and-spurious noise estimates. Numerous simulations are carried out to evaluate the performance of our proposed VAD method, and comparisons are made with a couple of existing representative schemes, namely the VAD using the likelihood ratio test with Fourier spectral energy features and that based on the enhanced time-frequency parameters. Three types of noise, namely white noise (stationary), babble noise (nonstationary) and vehicular noise (nonstationary), were artificially added by computer for our experiments. As a result, our proposed VAD algorithm significantly outperforms other existing methods, as illustrated by the corresponding receiver operating characteristic (ROC) curves. Finally, we demonstrate one of the major applications, namely speech waveform compression, associated with our new robust VAD scheme and quantify the effectiveness in terms of compression efficiency.
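To make the idea of an adaptive threshold concrete, here is a deliberately simplified sketch: a noise-floor estimate is updated recursively during frames judged to be noise, and a frame is flagged as speech when its energy exceeds a fixed multiple of that floor. Plain frame energies stand in for the Mel-energy features, and the SNR-dependent threshold and realignment scheme of the thesis are not reproduced; the function name and all parameter values are hypothetical.

```python
def adaptive_vad(frame_energies, init_frames=3, margin=3.0, alpha=0.9):
    """Flag frames as speech when energy exceeds margin times a tracked noise floor.

    Assumptions: the first init_frames frames contain only noise, and
    plain frame energy replaces the Mel-energy features of the thesis.
    """
    noise = sum(frame_energies[:init_frames]) / init_frames
    decisions = []
    for energy in frame_energies:
        is_speech = energy > margin * noise
        if not is_speech:
            # Update the noise floor only on frames judged to be noise,
            # so that speech does not inflate the estimate.
            noise = alpha * noise + (1 - alpha) * energy
        decisions.append(is_speech)
    return decisions


# Toy energies: quiet start, two loud frames, a slowly rising floor, one loud frame.
energies = [1.0, 1.0, 1.0, 1.2, 10.0, 12.0, 1.1, 2.0, 2.0, 2.0, 9.0]
flags = adaptive_vad(energies)
```

Because the floor rises gradually while the background gets louder, the threshold follows slow changes in the noise level, which is the basic motivation for adapting it rather than fixing it in advance.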

Louisiana State University

Voice Activity Detection. Fundamentals and Speech Recognition System Robustness

Author: J. C. Segura
J. M. Gorriz
J. Ramirez
Publication venue: 'IntechOpen'
Publication date: 01/01/2007

IntechOpen

CiteSeerX