Search CORE

25 research outputs found

Non-intrusive speech quality prediction using modulation energies and LSTM-network

Author: Cauchi B.
Doclo S.
Falk T.H.
Goetze S.
Santos J.F.
Siedenburg K.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/07/2019
Field of study

Many signal processing algorithms have been proposed to improve the quality of speech recorded in the presence of noise and reverberation. Perceptual measures, i.e., listening tests, are usually considered the most reliable way to evaluate the quality of speech processed by such algorithms but are costly and time-consuming. Consequently, speech enhancement algorithms are often evaluated using signal-based measures, which can be either intrusive or non-intrusive. As the computation of intrusive measures requires a reference signal, only non-intrusive measures can be used in applications for which the clean speech signal is not available. However, many existing non-intrusive measures correlate poorly with the perceived speech quality, particularly when applied over a wide range of algorithms or acoustic conditions. In this paper, we propose a novel non-intrusive measure of the quality of processed speech that combines modulation energy features and a recurrent neural network using long short-term memory cells. We collected a dataset of perceptually evaluated signals representing several acoustic conditions and algorithms and used this dataset to train and evaluate the proposed measure. Results show that the proposed measure yields higher correlation with perceptual speech quality than that of benchmark intrusive and non-intrusive measures when considering various categories of algorithms. Although the proposed measure is sensitive to mismatch between training and testing, results show that it is a useful approach to evaluate specific algorithms over a wide range of acoustic conditions and may, thus, become particularly useful for real-time selection of speech enhancement algorithm settings

White Rose Research Online

A brief overview of speech enhancement with linear filtering

Author: B Kollmeier
GH Golub
H Huang
J Benesty
J Benesty
J Benesty
J Benesty
J Benesty
J Benesty
J Benesty
J Benesty
J Chen
J Chen
J Chen
J Freudenberger
JR Jensen
JR Jensen
K Hermus
LR Rabiner
M Dendrinos
M Souden
P Loizou
P Vary
R Martin
RC Hendriks
RJ McAulay
S Boll
S Doclo
S Srinivasan
SH Jensen
T Long
T Lotter
Y Ephraim
Y Ephraim
Y Hu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Electronic Structure of the fcc Transition Metals Ir, Rh, Pt, and Pd

Author: A. E. Dixon
A. J. Freeman
A. M. Clogstron
A. R. Mackintosh
A. Y.-C. Yu
B. W. Veal
C. G. Robbins
D. D. Koelling
D. D. Koelling
D. Liberman
D. W. Budworth
E. Bucher
E. C. Snow
E. P. Wohlfarth
E. P. Wohlfarth
F. E. Hoare
F. Herman
F. M. Mueller
F. M. Mueller
F. M. Mueller
F. M. Mueller
F. M. Mueller
G. A. Burdick
G. Chouteau
G. E. Shoemake
J. B. Ketterson
J. B. Ketterson
J. B. Ketterson
J. B. Ketterson
J. C. Phillips
J. C. Phillips
J. C. Slater
J. Callaway
J. Friedel
J. J. Vuillemin
J. J. Vuillemin
J. M. Ziman
J. R. Schrieffer
J. W. D. Connolly
K. P. Gupta
L. F. Mattheiss
L. F. Mattheiss
L. Hedin
L. Hodges
L. J. Sham
L. L. Foldy
L. R. Windmiller
L. R. Windmiller
M. A. Jensen
M. Dixon
M. Dixon
O. K. Andersen
O. K. Andersen
O. Krogh Andersen
P. Lederer
P. Lederer
P. Lenglart
P. T. Coleridge
R. Doclo
S. Foner
S. Foner
S. Hörnfeldt
S. Hörnfeldt
T. A. Seitchik
T. L. Loucks
T. L. Loucks
U. Rössler
V. Heine
W. L. McMillan
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/1970
Field of study

Crossref

Online Research Database In Technology

A comparative study of galactose oxidase and active site analogs based on QM/MM Car-Parrinello simulations

Author: K. Doclo
M. Parrinello
P. Carloni
U. Rothlisberger
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Density functional calculations of magnetic exchange interactions in polynuclear transition metal complexes

Author: A
Barone V
BENCINI
CA
DAUL
Doclo
F
Fantucci
K
P
TOTTI
Publication venue
Publication date: 01/01/1997
Field of study

Archivio istituzionale della Ricerca - Scuola Normale Superiore

Special issue on DSP in hearing aids and cochlear implants

Author: Doclo Simon
Jensen Søren Holdt
Pango Philippe A.
Riis Søren K.
Wouters Jan
Publication venue
Publication date: 01/01/2005
Field of study

VBN

Speech enhancement for multimicrophone binaural hearing aids aiming to preserve the spatial auditory scene

Author: AS Bregman
AW Bronkhorst
B Cornelis
BD Van Veen
D Marquardt
DS Brungart
EC Cherry
H Ye
J Bitzer
J Blauert
J Peissig
J Wouters
JE Greenberg
JI Marin-Hurtado
K Wagener
K Wagener
K Wagener
KU Simmer
M Cooke
M Dietz
ML Hawley
R Beutelmann
S Doclo
S Doclo
S Gannot
T Lotter
T Rohdenburg
T Van den Bogaert
T Van den Bogaert
TJ Klasen
V Hamacher
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Neutral atoms in ionic lattices: Stability and ground state properties of KCl:Ag(0)

Author: A. D. Becke
A. G. Breñosa
B. Villacampa
B. Villacampa
C. de Graaf
C. J. Delbecq
C. K. Jorgensen
C. Marco de Lucas
C. Marco de Lucas
C. Sousa
F. Agulló-López
F. C. Brown
F. Illas
G. E. Holmberg
G. te Velde
H. Seidel
I. Cabria
I. Cabria
I. Cabria
I. V. Abarenkov
J. A. Aramburu
J. A. Aramburu
J. H. Barkyoumb
J. L. Pascual
J. M. Spaeth
J. P. Huke
J. P. Perdew
J. Tejeda
K. Andersson
K. Andersson
K. Doclo
K. Nakamoto
K. Wissing
L. Seijo
M. Bucher
M. Moreno
M. Moreno
M. Moreno
M. Moreno
M. P. Tosi
M. Saidoh
M. Saidoh
M. T. Barriuso
M. T. Barriuso
M. T. Barriuso
M. T. Barriuso
M. T. Bennebroek
M. T. Olm
P. Belanzoni
P. G. Baranov
P. G. Baranov
P. W. Jacobs
R. C. Baetzold
R. G. Parr
R. T. Poole
R. T. Poole
S. H. Vosko
S. J. Duclos
S. Sugano
S. V. Nistor
S. Veliah
V. S. Osminin
W. Hayes
Y. Ravi Sekhar
Z. Barandiarán
Z. Barandiarán
Z. Barandiarán
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2000
Field of study

The equilibrium geometry of Ag0 centers formed at cation sites in KCl has been investigated by means of total-energy calculations carried out on clusters of different sizes. Two distinct methods have been employed: First, an ab initio wave-function based method on embedded clusters and second, density-functional theory ~DFT! methods on clusters in vacuo involving up to 117 atoms. In the ab initio calculations the obtained equilibrium Ag0 -Cl2 distance Re is 3.70 Å, implying a large outward relaxation of 18%, along with 7% relaxation for the distance between Ag0 and the first K1 ions in ^100& directions. A very similar result is reached through DFT with a 39-atom cluster. Both approaches lead to a rather shallow minimum of the total-energy surface, the associated force constant of the A1g mode is several times smaller than that found for other impurities in halides. These conclusions are shown to be compatible with available experimental results. The shallow minimum is not clearly seen in DFT calculations with larger clusters. The unpaired electron density on silver and Cl ligands has been calculated as function of the metal-ligand distance and has been compared with values derived from electron-paramagnetic resonance data. The DFT calculations for all cluster sizes indicate that the experimental hyperfine and superhyperfine constants are compatible when Re is close to 3.70 Å. The important relation between the electronic stability of a neutral atom inside an ionic lattice and the local relaxation is established through a simple electrostatic model. As most remarkable features it is shown that ~i! the cationic Ag0 center is not likely to be formed inside AgCl, ~ii! in the Ag0 center encountered in SrCl2, the silver atom is probably located at an anion site, and ~iii! the properties of a center-like KCl:Ag0 would experience significant changes under hydrostatic pressures of the order of 6 GPa

Crossref

Secretaría de Estado de Cultura

Diposit Digital de la Universitat de Barcelona

Blind Suppression of Nonstationary Diffuse Acoustic Noise Based on Spatial Covariance Matrix Decomposition

Author: A Kurematsu
A Ozerov
AP Dempster
D-T Pham
Emmanuel Vincent
H Sawada
IA McCowan
K Toh
N Ito
Nobutaka Ito
Nobutaka Ono
NQK Duong
S Doclo
SF Boll
Shigeki Sagayama
Shoko Araki
T Nakatani
Tomohiro Nakatani
Y Ephraim
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/05/2015
Field of study

International audienceWe propose methods for blind suppression of nonstationary diffuse noise based on decomposition of the observed spatial covariance matrix into signal and noise parts. In modeling noise to regularize the ill-posed decomposition problem, we exploit spatial invariance (isotropy) instead of temporal invariance (stationarity). The isotropy assumption is that the spatial cross-spectrum of noise is dependent on the distance between microphones and independent of the direction between them. We propose methods for spatial covariance matrix decomposition based on least squares and maximum likelihood estimation. The methods are validated on real-world recordings

Crossref

INRIA a CCSD electronic archive server

HAL-Rennes 1

An Improved Speech Enhancement Method Based on SVD

Author: Boll S F
Comon P and Golub G H
Doclo S and Moonen M
Fischer S and Simmer K U
Flanagan J L
Griffiths L J and Jim C W
Luk FT
Veen A J D
Publication venue: 'China Science Publishing & Media Ltd.'
Publication date
Field of study

Crossref