Search CORE

10 research outputs found

A Robust Capon Beamformer against Uncertainty of Nominal Steering Vector

Author
Publication venue: Springer
Publication date
Field of study

Acoustics - Spatial properties

Author: Affes
Asari
Bartlett
Benichoux
Blauert
Duong
Duong
Ehlers
Févotte
Gannot
Gorlow
Gupta
Gustafsson
Ito
Jeub
Knaak
Koldovský
Kowalski
Li
Lin
Markovich
Nguyen Thi
Nikunen
Parra
Polack
Reindl
Ribas
Sawada
Sawada
Sturmel
Vincent
Wallach
Wightman
Yılmaz
Publication venue: 'Royal College of Obstetricians & Gynaecologists (RCOG)'
Publication date: 03/08/2018
Field of study

International audienceIn Chapter 2, we presented the spectral properties of sound sources which can be exploited for the separation or enhancement of single-channel signals. In multichannel scenarios, the fact the acoustic scene is observed from different positions in space can also be exploited. In this chapter, we recall basic elements of acoustics and sound engineering, and use them to model multichannel mixtures. We consider the relationship between a source signal and its spatial image in a given channel in Section 3.1, and examine how it translates in the case of microphone recordings or artificial mixtures in Sections 3.2 and 3.3, respectively. We then introduce several possible models in Section 3.4. We summarize the main concepts and provide links to other chapters and more advanced topics in Section 3.5

Crossref

INRIA a CCSD electronic archive server

Hal-Diderot

HAL-Rennes 1

Blind-Matched Filtering for Speech Enhancement with Distributed Microphones

Author
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2012
Field of study

Crossref

Blind Subband Beamforming With Time-Delay Constraints for Moving Source Speech Enhancement

Author: Ingvar Claesson
Nedelko Grbic
Zohra Yermeche
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Sector-Based Detection for Hands-Free Speech Enhancement in Cars

Author: Bourgeois Julien
Freudenberger Jürgen
Lathoud Guillaume
Publication venue: Martigny, Switzerland, IDIAP
Publication date: 10/03/2006
Field of study

Speech-based command interfaces are becoming more and more common in cars. Applications include automatic dialog systems for hands-free phone calls as well as more advanced features such as navigation systems. However, interferences, such as speech from the codriver, can hamper a lot the performance of the speech recognition component, which is crucial for those applications. This issue can be addressed with {\em adaptive} interference cancellation techniques such as the Generalized Sidelobe Canceller~(GSC). In order to cancel the interference (codriver) while not cancelling the target (driver), adaptation must happen only when the interference is active and dominant. To that purpose, this paper proposes two efficient adaptation control methods called ``implicit'' and ``explicit''. While the ``implicit'' method is fully automatic, the ``explicit'' method relies on pre-estimation of target and interference energies. A major contribution of this paper is a direct, robust method for such pre-estimation, directly derived from sector-based detection and localization techniques. Experiments on real in-car data validate both adaptation methods, including a case with 100 km/h background road noise

Infoscience - École polytechnique fédérale de Lausanne

Blind identification of acoustic systems and enhancement of reverberant speech

Author: Gaubitch Nikolay Dian
Gaubitch Nikolay Dian
Publication venue
Publication date: 01/01/2007
Field of study

Imperial Users onl

Spiral - Imperial College Digital Repository

Fonctions de coût pour l'estimation des filtres acoustiques dans les mélanges réverbérants

Author: BENICHOUX Alexis
GRIBONVAL Rémi
VINCENT Emmanuel
Publication venue
Publication date: 01/01/2013
Field of study

On se place dans le cadre du traitement des signaux audio multicanaux et multi-sources. À partir du mélange de plusieurs sources sonores enregistrées en milieu réverbérant, on cherche à estimer les réponses acoustiques (ou filtres de mélange) entre les sources et les microphones. Ce problème inverse ne peut être résolu qu'en prenant en compte des hypothèses sur la nature des filtres. Notre approche consiste d'une part à identifier mathématiquement les hypothèses nécessaires sur les filtres pour pouvoir les estimer et d'autre part à construire des fonctions de coût et des algorithmes permettant de les estimer effectivement. Premièrement, nous avons considéré le cas où les signaux sources sont connus. Nous avons développé une méthode d'estimation des filtres basée sur une régularisation convexe prenant en compte à la fois la nature parcimonieuse des filtres et leur enveloppe de forme exponentielle décroissante. Nous avons effectué des enregistrements en environnement réel qui ont confirmé l'efficacité de cet algorithme. Deuxièmement, nous avons considéré le cas où les signaux sources sont inconnus, mais statistiquement indépendants. Les filtres de mélange peuvent alors être estimés à une indétermination de permutation et de gain près à chaque fréquence par des techniques d'analyse en composantes indépendantes. Nous avons apporté une étude exhaustive des garanties théoriques par lesquelles l'indétermination de permutation peut être levée dans le cas où les filtres sont parcimonieux dans le domaine temporel. Troisièmement, nous avons commencé à analyser les hypothèses sous lesquelles notre algorithme d'estimation des filtres pourrait être étendu à l'estimation conjointe des signaux sources et des filtres et montré un premier résultat négatif inattendu : dans le cadre de la déconvolution parcimonieuse aveugle, pour une famille assez large de fonctions de coût régularisées, le minimum global est trivial. Des contraintes supplémentaires sur les signaux sources ou les filtres sont donc nécessaires.This work is focused on the processing of multichannel and multisource audio signals. From an audio mixture of several audio sources recorded in a reverberant room, we wish to estimate the acoustic responses (a.k.a. mixing filters) between the sources and the microphones. To solve this inverse problem one need to take into account additional hypotheses on the nature of the acoustic responses. Our approach consists in first identifying mathematically the necessary hypotheses on the acoustic responses for their estimation and then building cost functions and algorithms to effectively estimate them. First, we considered the case where the source signals are known. We developed a method to estimate the acoustic responses based on a convex regularization which exploits both the temporal sparsity of the filters and the exponentially decaying envelope. Real-world experiments confirmed the effectiveness of this method on real data. Then, we considered the case where the sources signal are unknown, but statistically independent. The mixing filters can be estimated up to a permutation and scaling ambiguity. We brought up an exhaustive study of the theoretical conditions under which we can solve the indeterminacy, when the multichannel filters are sparse in the temporal domain. Finally, we started to analyse the hypotheses under which this algorithm could be extended to the joint estimation of the sources and the filters, and showed a first unexpected results : in the context of blind deconvolution with sparse priors, for a quite large family of regularised cost functions, the global minimum is trivial. Additional constraints on the source signals and the filters are needed.RENNES1-Bibl. électronique (352382106) / SudocSudocFranceF

OpenGrey Repository