Search CORE

3 research outputs found

User-Symbiotic Speech Enhancement for Hearing Aids

Author: Hoang Poul
Publication venue: Aalborg Universitetsforlag
Publication date: 01/01/2022
Field of study

Akustinen monimikrofonikeilanmuodostus käytännöllisessä laitteessa

Author: Kiiski Roope
Publication venue
Publication date: 03/04/2017
Field of study

Signal quality and noiselessness is important for Automatic Speech Recognition and communications. These can be achieved with the help of microphone arrays and beamforming. Multiple microphones can be used to pick sound coming from one direction while attenuating other directions. In this thesis, three arrays and three beamformers are compared. The target is to see how different structures affect the output signal and if one of the arrays is superior to the others. The results show that the baffled array appears bigger than the reference array or the array on the top surface of a cylinder, but it adds complexity to the beamformers. For Automatic Speech Recognition, all three arrays performed the same, but there were clear differences between the beamforming algorithms. Thus the array to use depends on the use-case and especially on the industrial design of the device.Äänenlaatu ja kohinattomuus ovat tärkeitä automaattisen puheentunnistuksen ja puheviestinnän toimivuuden kannalta. Mikrofoniryhmiä ja keilanmuodostusta voidaan käyttää parantamaan signaalin laatua ja vähentämään kohinaa nauhoittamalla vain yhdestä suunnasta tulevaa ääntä. Tässä työssä vertaillaan kolmea mikrofoniryhmää ja kolmea keilanmuodostusalgoritmia. Tarkoituksena on selvittää miten erilaiset rakenteet, joihin mikrofoniryhmä on kiinnitettynä vaikuttaa nauhoitettuun signaaliin, ja jos jokin ryhmistä on merkittävästi parempi kuin muut. Tulokset osoittavat, että sylinterin ulkoreunalle kiinnitetty mikrofoniryhmä vaikuttaa akustisesti suuremmalta kuin referenssi mikrofoniryhmä tai sylinterin päälipinnalle kiinnitetty mikrofoniryhmä, mutta se lisää keilanmuodostusalgoritmin monimutkaisuutta. Automaattisessa puheentunnistuksessa kaikki kolme mikrofoniryhmää suoriutuivat yhtä hyvin, mutta keilanmuodostusalgoritmien suorituskyvyssä oli selkeät erot. Käytännössä siis mikrofoniryhmä kannattaa valita käyttötarkoitukseen ja etenkin lopullisen tuotteen teollisen muotoiluun sopivaksi

Aaltodoc Publication Archive

Principled methods for mixtures processing

Author: Liutkus Antoine
Publication venue: HAL CCSD
Publication date: 11/02/2022
Field of study

This document is my thesis for getting the habilitation à diriger des recherches, which is the french diploma that is required to fully supervise Ph.D. students. It summarizes the research I did in the last 15 years and also provides the shortterm research directions and applications I want to investigate. Regarding my past research, I first describe the work I did on probabilistic audio modeling, including the separation of Gaussian and αstable stochastic processes. Then, I mention my work on deep learning applied to audio, which rapidly turned into a large effort for community service. Finally, I present my contributions in machine learning, with some works on hardware compressed sensing and probabilistic generative models.My research programme involves a theoretical part that revolves around probabilistic machine learning, and an applied part that concerns the processing of time series arising in both audio and life sciences

INRIA a CCSD electronic archive server