Search CORE

363 research outputs found

Low Complexity Rate-Distortion Optimized Time-Segmentation for Audio Coding

Author: Christensen Mads Græsbøll
Jensen Søren Holdt
Nordén Fredrik
Rødbro Christoffer A.
Publication venue: Electrical Engineering/Electronics, Computer, Communications and Information Technology Association
Publication date: 01/01/2005
Field of study

VBN

Subspace-based Fundamental Frequency Estimation

Author: Andersen S. V.
Christensen Mads Græsbøll
Jakobsson , A.
Jensen Søren Holdt
Publication venue: IEEE Signal Processing Society
Publication date: 01/01/2004
Field of study

Publication in the conference proceedings of EUSIPCO, Viena, Austria, 200

VBN

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Compressed Domain Packet Loss Concealment of Sinusoidally Coded Speech

Author: Andersen Søren Vang
Christensen Mads Græsbøll
Jensen Søren Holdt
Rødbro Christoffer A.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2003
Field of study

In this paper we consider the problem of packet loss concealment for Voice over IP (VoIP). The speech signal is compressed at the transmitter using A sinusoidal coding scheme working at 8 kbit/s. At the receiver, packet loss concealment is carried out working directly on the quantized sinusoidal parameters, based on time-scaling of the packets surrounding the missing ones. Subjective listening tests show promising results indicating the potential of sinusoidal speech coding for VoIP

Lund University Publications

VBN

Dominant distortion classification for pre-processing of vowels in remote biomedical voice analysis

Author: Christensen Mads Græsbøll
Jensen Jesper Rindom
Little Max A.
Poorjam Amir Hossein
Publication venue: 'International Speech Communication Association'
Publication date: 01/08/2017
Field of study

Advances in speech signal analysis facilitate the development of techniques for remote biomedical voice assessment. However, the performance of these techniques is affected by noise and distortion in signals. In this paper, we focus on the vowel /a/ as the most widely-used voice signal for pathological voice assessments and investigate the impact of four major types of distortion that are commonly present during recording or transmission in voice analysis, namely: background noise, reverberation, clipping and compression, on Mel-frequency cepstral coefficients (MFCCs) - the most widely-used features in biomedical voice analysis. Then, we propose a new distortion classification approach to detect the most dominant distortion in such voice signals. The proposed method involves MFCCs as frame-level features and a support vector machine as classifier to detect the presence and type of distortion in frames of a given voice signal. Experimental results obtained from the healthy and Parkinson's voices show the effectiveness of the proposed approach in distortion detection and classification

Crossref

Aston Publications Explorer

VBN

A Parametric Approach for Classification of Distortions in Pathological Voices

Author: Christensen Mads Græsbøll
Jensen Jesper Rindom
Little Max A
Poorjam Amir Hossein
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 10/09/2018
Field of study

In biomedical acoustics, distortion in voice signals, commonly present during acquisition and transmission, adversely affects acoustic features extracted from pathological voice. Information on the type of distortion can help in compensating for its effects. This paper proposes a new approach to detecting four major types of commonly encountered distortion in remote analysis of pathological voice, namely background noise, reverberation, clipping and coding. In this approach, by applying factor analysis to Gaussian mixture model mean supervectors, distortions in variable-duration recordings are modeled by fixed-length, low-dimensional channel vectors. Then, linear discriminant analysis (LDA) is used to remove the remaining nuisance effects in the channel vectors. Finally, two different classifiers, namely support vector machines and probabilistic LDA classify the different types of distortion. Experimental results obtained using Parkinson's voices, as an example of pathological voice, show 11.4% relative improvement in performance over systems which directly use acoustic features for distortion classification

Crossref

Aston Publications Explorer

VBN

Combination of inflammatory and hemostatic markers in mortality model for critically ill canine icu patients significantly improves efficacy

Author: Jensen A.L.
Kjeldgaard-Hansen Mads
Kristiansen A. T.
Rozanski Elizabeth
Wiinberg Bo
Publication venue
Publication date: 01/01/2009
Field of study

Copenhagen University Research Information System

A Supervised Approach to Global Signal-to-Noise Ratio Estimation for Whispered and Pathological Voices

Author: Christensen Mads Græsbøll
Jensen Jesper Rindom
Little Max A
Poorjam Amir Hossein
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 10/09/2018
Field of study

The presence of background noise in signals adversely affects the performance of many speech-based algorithms. Accurate estimation of signal-to-noise-ratio (SNR), as a measure of noise level in a signal, can help in compensating for noise effects. Most existing SNR estimation methods have been developed for normal speech and might not provide accurate estimation for special speech types such as whispered or disordered voices, particularly, when they are corrupted by non-stationary noises. In this paper, we first investigate the impact of stationary and non-stationary noise on the behavior of mel-frequency cepstral coefficients (MFCCs) extracted from normal, whispered and pathological voices. We demonstrate that, regardless of the speech type, the mean and the covariance of MFCCs are predictably modified by additive noise and the amount of change is related to the noise level. Then, we propose a new supervised method for SNR estimation which is based on a regression model trained on MFCCs of the noisy signals. Experimental results show that the proposed approach provides accurate estimation and consistent performance for various speech types under different noise conditions

Aston Publications Explorer

VBN

Quality Control in Remote Speech Data Collection

Author: Christensen Mads Graesboll
Jensen Jesper Rindom
Little Max A.
Poorjam Amir Hossein
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/03/2019
Field of study

There is a need for algorithms that can automatically control the quality of the remotely collected speech databases by detecting potential outliers, which deserve further investigation. In this paper, a simple and effective approach for identification of outliers in a speech database is proposed. Using the deterministic minimum covariance determinant (DetMCD) algorithm to estimate the mean and covariance of the speech data in the mel-frequency cepstral domain, this approach identifies potential outliers based on the statistical distance of the observations in the feature space from the central location of the data that are larger than a predefined threshold. DetMCD is a computationally efficient algorithm, which provides a highly robust estimate of the mean and covariance of multivariate data even when 50% of the data are outliers. Experimental results using eight different speech databases with manually inserted outliers show the effectiveness of the proposed method for outlier detection in speech databases. Moreover, applying the proposed method to a remotely collected Parkinson's voice database shows that the outliers that are part of the database are detected with 97.4% accuracy, resulting in a significant decrease in the effort required for manually controlling the quality of the database

Aston Publications Explorer

VBN

Robust Bayesian Pitch Tracking Based on the Harmonic Model

Author: Christensen Mads Graesboll
Jensen Jesper Rindom
Little Max A.
Nielsen Jesper Kjaer
Shi Liming
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/11/2019
Field of study

Fundamental frequency is one of the most important characteristics of speech and audio signals. Harmonic model-based fundamental frequency estimators offer a higher estimation accuracy and robustness against noise than the widely used autocorrelation-based methods. However, the traditional harmonic model-based estimators do not take the temporal smoothness of the fundamental frequency, the model order, and the voicing into account as they process each data segment independently. In this paper, a fully Bayesian fundamental frequency tracking algorithm based on the harmonic model and a first-order Markov process model is proposed. Smoothness priors are imposed on the fundamental frequencies, model orders, and voicing using first-order Markov process models. Using these Markov models, fundamental frequency estimation and voicing detection errors can be reduced. Using the harmonic model, the proposed fundamental frequency tracker has an improved robustness to noise. An analytical form of the likelihood function, which can be computed efficiently, is derived. Compared to the state-of-the-art neural network and nonparametric approaches, the proposed fundamental frequency tracking algorithm has superior performance in almost all investigated scenarios, especially in noisy conditions. For example, under 0 dB white Gaussian noise, the proposed algorithm reduces the mean absolute errors and gross errors by 15% and 20% on the Keele pitch database and 36% and 26% on sustained /a/ sounds from a database of Parkinson's disease voices. A MATLAB version of the proposed algorithm is made freely available for reproduction of the results. 1 1An implementation of the proposed algorithm using MATLAB may be found in https://tinyurl.com/yxn4a543

Crossref

Aston Publications Explorer

VBN