Search CORE

31 research outputs found

Generalized identifiability conditions for blind convolutive MIMO separation

Author: Castella Marc
Moreau Eric
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/07/2009
Field of study

International audienceThis paper deals with the problem of source separation in the case where the output of a multivariate convolutive mixture is observed: we propose novel and generalized conditions for the blind identifiability of a separating system. The results are based on higher-order statistics and are valid in the case of stationary but not necessarily i.i.d. signals. In particular, we extend recent results based on second-order statistics only. The approach relies on the use of so called reference signals. Our new results also show that only weak conditions are required on the reference signals: this is illustrated by simulations and opens up the possibility of developing new methods

HAL-INSU

Decomposition methods for unsupervised learning

Author: Mørup Morten
Publication venue
Publication date: 01/09/2008
Field of study

Online Research Database In Technology

Content-based music classification, summarization and retrieval

Author: SHAO XI
Publication venue
Publication date: 05/04/2007
Field of study

Ph.DDOCTOR OF PHILOSOPH

ScholarBank@NUS

New ISO standards for hearing protectors (A)

Author: Poulsen Torben
Publication venue
Publication date: 01/01/2000
Field of study

Online Research Database In Technology

Efficient and Robust Methods for Audio and Video Signal Analysis

Author: Mahkonen Katariina
Publication venue: Tampere University of Technology
Publication date: 01/01/2018
Field of study

This thesis presents my research concerning audio and video signal processing and machine learning. Specifically, the topics of my research include computationally efficient classifier compounds, automatic speech recognition (ASR), music dereverberation, video cut point detection and video classification.Computational efficacy of information retrieval based on multiple measurement modalities has been considered in this thesis. Specifically, a cascade processing framework, including a training algorithm to set its parameters has been developed for combining multiple detectors or binary classifiers in computationally efficient way. The developed cascade processing framework has been applied on video information retrieval tasks of video cut point detection and video classification. The results in video classification, compared to others found in the literature, indicate that the developed framework is capable of both accurate and computationally efficient classification. The idea of cascade processing has been additionally adapted for the ASR task. A procedure for combining multiple speech state likelihood estimation methods within an ASR framework in cascaded manner has been developed. The results obtained clearly show that without impairing the transcription accuracy the computational load of ASR can be reduced using the cascaded speech state likelihood estimation process.Additionally, this thesis presents my work on noise robustness of ASR using a nonnegative matrix factorization (NMF) -based approach. Specifically, methods for transformation of sparse NMF-features into speech state likelihoods has been explored. The results reveal that learned transformations from NMF activations to speech state likelihoods provide better ASR transcription accuracy than dictionary label -based transformations. The results, compared to others in a noisy speech recognition -challenge show that NMF-based processing is an efficient strategy for noise robustness in ASR.The thesis also presents my work on audio signal enhancement, specifically, on removing the detrimental effect of reverberation from music audio. In the work, a linear prediction -based dereverberation algorithm, which has originally been developed for speech signal enhancement, was applied for music. The results obtained show that the algorithm performs well in conjunction with music signals and indicate that dynamic compression of music does not impair the dereverberation performance

Trepo - Institutional Repository of Tampere University

Acoustic source separation based on target equalization-cancellation

Author: Mi Jing
Publication venue
Publication date: 20/02/2018
Field of study

Normal-hearing listeners are good at focusing on the target talker while ignoring the interferers in a multi-talker environment. Therefore, efforts have been devoted to build psychoacoustic models to understand binaural processing in multi-talker environments and to develop bio-inspired source separation algorithms for hearing-assistive devices. This thesis presents a target-Equalization-Cancellation (target-EC) approach to the source separation problem. The idea of the target-EC approach is to use the energy change before and after cancelling the target to estimate a time-frequency (T-F) mask in which each entry estimates the strength of target signal in the original mixture. Once the mask is calculated, it is applied to the original mixture to preserve the target-dominant T-F units and to suppress the interferer-dominant T-F units. On the psychoacoustic modeling side, when the output of the target-EC approach is evaluated with the Coherence-based Speech Intelligibility Index (CSII), the predicted binaural advantage closely matches the pattern of the measured data. On the application side, the performance of the target-EC source separation algorithm was evaluated by psychoacoustic measurements using both a closed-set speech corpus and an open-set speech corpus, and it was shown that the target-EC cue is a better cue for source separation than the interaural difference cues

Boston University Institutional Repository (OpenBU)

HEALTH MONITORING, FAULT DETECTION AND DIAGNOSIS IN INDUSTRIAL ROTATING MACHINERY BY ADVANCED VIBRATION ANALYSIS

Author: DIARRASSOUBA Karamoko
Publication venue: place:Palermo
Publication date
Field of study

Archivio istituzionale della ricerca - Università di Palermo

Correlative Information Maximization: A Biologically Plausible Approach to Supervised Deep Neural Networks without Weight Symmetry

Author: Bozkurt Bariscan
Erdogan Alper T
Pehlevan Cengiz
Publication venue: NeurIPS
Publication date: 21/09/2023
Field of study

The backpropagation algorithm has experienced remarkable success in training large-scale artificial neural networks; however, its biological plausibility has been strongly criticized, and it remains an open question whether the brain employs supervised learning mechanisms akin to it. Here, we propose correlative information maximization between layer activations as an alternative normative approach to describe the signal propagation in biological neural networks in both forward and backward directions. This new framework addresses many concerns about the biological-plausibility of conventional artificial neural networks and the backpropagation algorithm. The coordinate descent-based optimization of the corresponding objective, combined with the mean square error loss function for fitting labeled supervision data, gives rise to a neural network structure that emulates a more biologically realistic network of multi-compartment pyramidal neurons with dendritic processing and lateral inhibitory neurons. Furthermore, our approach provides a natural resolution to the weight symmetry problem between forward and backward signal propagation paths, a significant critique against the plausibility of the conventional backpropagation algorithm. This is achieved by leveraging two alternative, yet equivalent forms of the correlative mutual information objective. These alternatives intrinsically lead to forward and backward prediction networks without weight symmetry issues, providing a compelling solution to this long-standing challenge

UCL Discovery

Blind channel equalization and instantaneous blind source separation

Author: Abrar Shafayat
Publication venue
Publication date
Field of study

University of Liverpool Repository