2,884 research outputs found

Systems for Personalization of Hearing Instruments: A Machine Learning Approach

Author: Nielsen Jens Brehm
Publication venue: DTU Compute
Publication date: 01/01/2015

AIDA: An Active Inference-based Design Agent for Audio Processing Algorithms

Author: de Vries Bert
Koudahl Magnus
Podusenko Albert
van Erp Bart
Publication date: 10/01/2022

In this paper we present AIDA, which is an active inference-based agent that iteratively designs a personalized audio processing algorithm through situated interactions with a human client. The target application of AIDA is to propose on-the-spot the most interesting alternative values for the tuning parameters of a hearing aid (HA) algorithm, whenever a HA client is not satisfied with their HA performance. AIDA interprets searching for the "most interesting alternative" as an issue of optimal (acoustic) context-aware Bayesian trial design. In computational terms, AIDA is realized as an active inference-based agent with an Expected Free Energy criterion for trial design. This type of architecture is inspired by neuro-economic models on efficient (Bayesian) trial design in brains and implies that AIDA comprises generative probabilistic models for acoustic signals and user responses. We propose a novel generative model for acoustic signals as a sum of time-varying auto-regressive filters and a user response model based on a Gaussian Process Classifier. The full AIDA agent has been implemented in a factor graph for the generative model and all tasks (parameter learning, acoustic context classification, trial design, etc.) are realized by variational message passing on the factor graph. All verification and validation experiments and demonstrations are freely accessible at our GitHub repository

arXiv.org e-Print Archive

Pure OAI Repository

Probabilistic Modeling Paradigms for Audio Source Separation

Author: A. P. Dempster
A. Gelman
D. L. Wang
D. FitzGerald
J. Nocedal
J. Winn
M. I. Mandel
R. J. Weiss
R. Mukai
S. T. Roweis
S. Makino
Publication venue: 'IGI Global'
Publication date: 01/01/2010

This is the author's final version of the article, first published as E. Vincent, M. G. Jafari, S. A. Abdallah, M. D. Plumbley, M. E. Davies. Probabilistic Modeling Paradigms for Audio Source Separation. In W. Wang (Ed), Machine Audition: Principles, Algorithms and Systems. Chapter 7, pp. 162-185. IGI Global, 2011. ISBN 978-1-61520-919-4. DOI: 10.4018/978-1-61520-919-4.ch007

Most sound scenes result from the superposition of several sources, which can be separately perceived and analyzed by human listeners. Source separation aims to provide machine listeners with similar skills by extracting the sounds of individual sources from a given scene. Existing separation systems operate either by emulating the human auditory system or by inferring the parameters of probabilistic sound models. In this chapter, the authors focus on the latter approach and provide a joint overview of established and recent models, including independent component analysis, local time-frequency models and spectral template-based models. They show that most models are instances of one of two general paradigms: linear modeling or variance modeling. They compare the merits of each paradigm and report objective performance figures. They conclude by discussing promising combinations of probabilistic priors and inference algorithms that could form the basis of future state-of-the-art systems.

HAL-CentraleSupelec

CiteSeerX

Crossref

INRIA a CCSD electronic archive server

Queen Mary Research Online

Surrey Research Insight

HAL-Rennes 1

Message Passing-based Inference in Hierarchical Autoregressive Models

Author: Podusenko Albert
Publication venue: Eindhoven University of Technology
Publication date: 20/12/2022

Pure OAI Repository

Bayesian linear regression for user-adaptive hearing aids

Author: Özer S.
Publication date: 01/01/2007

Repository TU/e

Pure OAI Repository

Minimum Mean-Square Error Single-Channel Signal Estimation

Author: Beierholm Thomas
Publication date: 01/04/2008

Online Research Database In Technology

AeGAN: Time-Frequency Speech Denoising via Generative Adversarial Networks

Author: Abdulatif Sherif
Armanious Karim
Guirguis Karim
Sajeev Jayasankar T.
Yang Bin
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 05/06/2020

Automatic speech recognition (ASR) systems are of vital importance nowadays in commonplace tasks such as speech-to-text processing and language translation. This has created the need for an ASR system that can operate in realistic crowded environments. Thus, speech enhancement is a valuable building block in ASR systems and other applications such as hearing aids, smartphones and teleconferencing systems. In this paper, a generative adversarial network (GAN) based framework is investigated for the task of speech enhancement, more specifically speech denoising of audio tracks. A new architecture based on a CasNet generator and an additional feature-based loss are incorporated to get realistically denoised speech phonetics. Finally, the proposed framework is shown to outperform other learning and traditional model-based speech enhancement approaches. Comment: 5 pages, 4 figures and 2 Tables. Accepted in EUSIPCO 202

arXiv.org e-Print Archive

Crossref

Integration of top-down and bottom-up information for audio organization and retrieval

Author: Jensen Bjørn Sand
Publication venue: Technical University of Denmark
Publication date: 01/01/2012

Online Research Database In Technology