Structured Sparsity Models for Multiparty Speech Recovery from Reverberant Recordings

Authors: Asaei Afsaneh, Bourlard Hervé, Cevher Volkan, Golbabaee Mohammad
Publication date: 01/01/2012

We tackle the multi-party speech recovery problem by modeling the acoustics of reverberant chambers. Our approach exploits structured sparsity models to perform room modeling and speech recovery. We propose a scheme for characterizing the room acoustics from the unknown competing speech sources, relying on localization of the early images of the speakers by sparse approximation of the spatial spectra of the virtual sources in a free-space model. The images are then clustered by exploiting the low-rank structure of the spectro-temporal components belonging to each source. This enables us to identify the early support of the room impulse response function and its unique map to the room geometry. To further tackle the ambiguity of the reflection ratios, we propose a novel formulation of the reverberation model and estimate the absorption coefficients through a convex optimization that exploits a joint sparsity model formulated upon the spatio-spectral sparsity of the concurrent speech representation. The acoustic parameters are then incorporated for separating individual speech signals through either structured sparse recovery or inverse filtering of the acoustic channels. Experiments conducted on real data recordings demonstrate the effectiveness of the proposed approach for multi-party speech recovery and recognition.

Comment: 31 pages

arXiv.org e-Print Archive

Edinburgh Research Explorer

Acoustic modeling using the digital waveguide mesh

Authors: Kelloniemi Antti, Mullen Jack, Murphy Damian, Shelley Simon
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/01/2007

The digital waveguide mesh has been an active area of music acoustics research for over ten years. Although founded in 1-D digital waveguide modeling, the principles on which it is based are not new to researchers grounded in numerical simulation, FDTD methods, electromagnetic simulation, etc. This article has attempted to provide a considerable review of how the DWM has been applied to acoustic modeling and sound synthesis problems, including new 2-D object synthesis and an overview of recent research activities in articulatory vocal tract modeling, RIR synthesis, and reverberation simulation. The extensive, although not by any means exhaustive, list of references indicates that though the DWM may have parallels in other disciplines, it still offers something new in the field of acoustic simulation and sound synth

CiteSeerX

Crossref

White Rose Research Online

Waveguide physical modeling of vocal tract acoustics: flexible formant bandwidth control from increased model dimensionality

Authors: Howard D M, Mullen J, Murphy D T
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/05/2006

Digital waveguide physical modeling is often used as an efficient representation of acoustical resonators such as the human vocal tract. Building on the basic one-dimensional (1-D) Kelly-Lochbaum tract model, various speech synthesis techniques demonstrate improvements to the wave scattering mechanisms in order to better approximate wave propagation in the complex vocal system. Some of these techniques are discussed in this paper, with particular reference to an alternative approach in the form of a two-dimensional waveguide mesh model. Emphasis is placed on its ability to produce vowel spectra similar to those present in natural speech, and on how it improves upon the 1-D model. The tract area function is accommodated as model width, rather than translated into acoustic impedance, and as such offers extra control as an additional bounding limit to the model. Results show that the two-dimensional (2-D) model introduces approximately linear control over formant bandwidths, leading to attainable, realistic values across a range of vowels. Similarly, the 2-D model allows for application of theoretical reflection values within the tract, which, when applied to the 1-D model, result in small formant bandwidths and, hence, unnatural-sounding synthesized vowels.

Crossref

White Rose Research Online

Virtual Audio - Three-Dimensional Audio in Virtual Environments

Author: Adler Daniel
Publication venue: Swedish Institute of Computer Science
Publication date: 01/01/1996

Three-dimensional interactive audio has a variety of potential uses in human-machine interfaces. After lagging seriously behind the visual components, the importance of sound is now becoming increasingly accepted. This paper mainly discusses background and techniques to implement three-dimensional audio in computer interfaces. A case study of a system for three-dimensional audio, implemented by the author, is described in great detail. The audio system was, moreover, integrated with a virtual reality system, and conclusions on user tests and use of the audio system are presented along with proposals for future work at the end of the paper. The thesis begins with a definition of three-dimensional audio and a survey of the human auditory system to give the reader the needed knowledge of what three-dimensional audio is and how human auditory perception works.

RISE – Research Institutes of Sweden

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Swedish Institute of Computer Science Publications Database

High Finesse Fiber Fabry-Perot Cavities: Stabilization and Mode Matching Analysis

Authors: Alavi Seyed Khalil, Alt Wolfgang, Gallego Jose, Ghosh Sutapa, Martinez-Dorantes Miguel, Meschede Dieter, Ratschbacher Lothar
Publication venue: Springer Science and Business Media LLC
Publication date: 10/03/2016

Fiber Fabry-Perot cavities, formed by micro-machined mirrors on the end-facets of optical fibers, are used in an increasing number of technical and scientific applications, where they typically require precise stabilization of their optical resonances. Here, we study two different approaches to construct fiber Fabry-Perot resonators and stabilize their length for experiments in cavity quantum electrodynamics with neutral atoms. A piezo-mechanically actuated cavity with feedback based on the Pound-Drever-Hall locking technique is compared to a novel rigid cavity design that makes use of the high passive stability of a monolithic cavity spacer and employs thermal self-locking and external temperature tuning. Furthermore, we present a general analysis of the mode matching problem in fiber Fabry-Perot cavities, which explains the asymmetry in their reflective line shapes and has important implications for the optimal alignment of the fiber resonators. Finally, we discuss the issue of fiber-generated background photons. We expect that our results contribute towards the integration of high-finesse fiber Fabry-Perot cavities into compact and robust quantum-enabled devices in the future.

Comment: The Supplemental Material is included in the source code of the article that can be downloaded from this arXiv page (see "Other formats"). Peer-reviewed version with changes to text and figure

arXiv.org e-Print Archive

Springer - Publisher Connector