36,849 research outputs found

Disentangling the Horowitz Factor: Learning Content and Style From Expressive Piano Performance

Authors: Dixon S, Zhang H
Conference: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 04/06/2023

Queen Mary Research Online

On the relevance of the differences between HRTF measurement setups for machine learning

Authors: Pauwels J, Picinali L
Conference: 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023)
Publication date: 17/02/2023

Queen Mary Research Online

Audio Quality Assessment of Vinyl Music Collections Using Self-Supervised Learning

Authors: Benetos E, Hines A, Ragano A
Conference: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 04/06/2023

Queen Mary Research Online

Modeling plate and spring reverberation using a DSP-informed deep neural network

Authors: Benetos E, Martinez Ramirez M, Reiss J
Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020)
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 04/05/2020

Plate and spring reverberators are electromechanical systems first used and researched as a means to substitute real room reverberation. Currently, they are often used in music production for aesthetic reasons due to their particular sonic characteristics. Modeling these audio processors and their perceptual qualities is difficult because they combine mechanical elements with analog electronics, resulting in an extremely complex response. Based on digital reverberators that use sparse FIR filters, we propose a signal processing-informed deep learning architecture for the modeling of artificial reverberators. We explore the capabilities of deep neural networks to learn such highly nonlinear electromechanical responses and we perform modeling of plate and spring reverberators. To measure the performance of the model, we conduct a perceptual evaluation experiment, and we also analyze how the given task is accomplished and what the model is actually learning.
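
The abstract mentions digital reverberators built on sparse FIR filters. As a rough illustration of that building block only (not the paper's neural architecture), a sparse FIR filter can be sketched as a handful of delayed, scaled copies of the input. The function name `sparse_fir` and the tap values below are hypothetical and chosen purely for demonstration.

```python
def sparse_fir(signal, taps):
    """Apply a sparse FIR filter.

    taps is a list of (delay_in_samples, gain) pairs; every coefficient
    not listed is implicitly zero, which is what makes the filter sparse.
    """
    max_delay = max(delay for delay, _ in taps)
    out = [0.0] * (len(signal) + max_delay)
    for delay, gain in taps:
        for n, sample in enumerate(signal):
            out[n + delay] += gain * sample  # add one delayed, scaled copy
    return out


# Illustrative taps: a direct path plus two echoes (assumed values).
taps = [(0, 1.0), (3, 0.5), (7, -0.25)]
impulse = [1.0, 0.0, 0.0, 0.0]
response = sparse_fir(impulse, taps)
```

Feeding a unit impulse through the filter returns the tap gains at their delay positions, so `response` is the filter's impulse response.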

Queen Mary Research Online

Modelling Black-Box Audio Effects with Time-Varying Feature Modulation

Authors: Comunità M, Phan H, Reiss JD, Steinmetz CJ
Conference: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 04/06/2023

Queen Mary Research Online

A-CRNN: a domain adaptation model for sound event detection

Authors: Benetos E, Wang Y, Wei W, Zhu H
Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020)
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 04/05/2020

This paper presents a domain adaptation model for sound event detection. A common challenge for sound event detection is how to deal with the mismatch among different datasets. Typically, the performance of a model decreases if it is tested on a dataset different from the one it was trained on. To address this problem, based on convolutional recurrent neural networks (CRNNs), we propose an adapted CRNN (A-CRNN) as an unsupervised adversarial domain adaptation model for sound event detection. We have collected and annotated a dataset in Singapore with two types of recording devices to complement existing datasets in the research community, especially with respect to domain adaptation. We perform experiments on recordings from different datasets and from different recording devices. Our experimental results show that the proposed A-CRNN model achieves better performance on an unseen dataset than the baseline non-adapted CRNN model.

Queen Mary Research Online

Playing Technique Recognition by Joint Time–Frequency Scattering

Authors: Benetos E, Chew E, Lostanlen V, Wang C
Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020)
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 04/05/2020

Playing techniques are important expressive elements in music signals. In this paper, we propose a recognition system based on the joint time–frequency scattering transform (jTFST) for pitch evolution-based playing techniques (PETs), a group of playing techniques with monotonic pitch changes over time. The jTFST represents spectro-temporal patterns in the time–frequency domain, capturing discriminative information of PETs. As a case study, we analyse three commonly used PETs of the Chinese bamboo flute: acciacatura, portamento, and glissando, and encode their characteristics using the jTFST. To verify the proposed approach, we create a new dataset, the CBF-petsDB, containing PETs played in isolation as well as in the context of whole pieces performed and annotated by professional players. Feeding the jTFST to a machine learning classifier, we obtain F-measures of 71% for acciacatura, 59% for portamento, and 83% for glissando detection, and provide explanatory visualisations of scattering coefficients for each technique.

Queen Mary Research Online

A Study on the Transferability of Adversarial Attacks in Sound Event Classification

Authors: Benetos E, McDonald S, Pankajakshan A, Sandler M, Subramanian V, Xu N
Conference: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020)
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 04/05/2020

An adversarial attack is an algorithm that perturbs the input of a machine learning model in an intelligent way in order to change the output of the model. An important property of adversarial attacks is transferability. According to this property, it is possible to generate adversarial perturbations on one model and apply them to the input of a different model to fool its output. Our work focuses on studying the transferability of adversarial attacks in sound event classification. We are able to demonstrate differences in transferability properties from those observed in computer vision. We show that dataset normalization techniques such as z-score normalization do not affect the transferability of adversarial attacks, and we show that techniques such as knowledge distillation do not increase the transferability of attacks.
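
To make the notion of transferability concrete, here is a minimal sketch in which a perturbation is computed on one model and then applied to a different one. The abstract does not say which attack the authors use; the fast gradient sign method (FGSM) is used here as a stand-in, and the two logistic-regression "models", their weights, the input, and `eps` are all hypothetical toy values rather than anything from the paper.

```python
import math


def predict(w, b, x):
    """Probability of class 1 under a logistic-regression model."""
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1.0 / (1.0 + math.exp(-z))


def sign(v):
    return (v > 0) - (v < 0)


def fgsm_perturbation(w, b, x, y, eps):
    """FGSM step: eps * sign of the cross-entropy gradient w.r.t. the input.

    For logistic regression that gradient is (p - y) * w.
    """
    err = predict(w, b, x) - y
    return [eps * sign(err * wi) for wi in w]


# Hypothetical source model (attacked directly) and target model (never queried).
w_src, b_src = [1.0, -2.0, 0.5], 0.0
w_tgt, b_tgt = [0.8, -1.5, 0.7], 0.1

x, y = [0.5, -0.5, 0.2], 1.0  # toy input whose true label is 1
delta = fgsm_perturbation(w_src, b_src, x, y, eps=1.0)
x_adv = [xi + di for xi, di in zip(x, delta)]

# The perturbation transfers if the target model's decision also flips.
transferred = predict(w_tgt, b_tgt, x) > 0.5 and predict(w_tgt, b_tgt, x_adv) < 0.5
```

In this toy setting the two models have similar weights, so the perturbation crafted on the source model also flips the target model's decision. Real sound event classifiers may behave quite differently, which is the question the paper studies.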

Queen Mary Research Online

Convex separable problems with linear and box constraints

Authors: D'Amico Antonio A., Palomar Daniel P., Sanguinetti Luca
Publication date: 01/01/2014

In this work, we focus on separable convex optimization problems with linear and box constraints and compute the solution in closed form as a function of some Lagrange multipliers that can be easily computed in a finite number of iterations. This allows us to bridge the gap between a wide family of power allocation problems of practical interest in signal processing and communications and their efficient implementation in practice.

Comment: 5 pages, 2 figures. Published at IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014)
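
The idea of a solution that is closed-form in the Lagrange multiplier can be illustrated on a much simpler special case than the one the paper treats. The sketch below assumes a quadratic separable objective, a single sum equality constraint, and box bounds. Each variable is then a clipped function of one multiplier. The multiplier is located here by plain bisection, which is a swapped-in technique and not the finite-iteration procedure the abstract refers to. All names and numbers are hypothetical.

```python
def clip(v, lo, hi):
    return max(lo, min(hi, v))


def solve_box_sum_qp(c, lower, upper, total, tol=1e-10):
    """Minimize sum_i 0.5 * (x_i - c_i)**2
    subject to sum_i x_i == total and lower_i <= x_i <= upper_i.

    Stationarity of the Lagrangian gives x_i(lam) = clip(c_i - lam, lower_i, upper_i),
    and sum_i x_i(lam) is non-increasing in lam, so lam can be found by bisection.
    """
    if not (sum(lower) <= total <= sum(upper)):
        raise ValueError("infeasible: total lies outside the box sums")

    def x_of(lam):
        return [clip(ci - lam, lo, hi) for ci, lo, hi in zip(c, lower, upper)]

    # Bracket: at lam_lo every x_i sits at its upper bound, at lam_hi at its lower bound.
    lam_lo = min(ci - hi for ci, hi in zip(c, upper))
    lam_hi = max(ci - lo for ci, lo in zip(c, lower))
    while lam_hi - lam_lo > tol:
        mid = 0.5 * (lam_lo + lam_hi)
        if sum(x_of(mid)) > total:
            lam_lo = mid  # sum too large: increase the multiplier
        else:
            lam_hi = mid
    return x_of(0.5 * (lam_lo + lam_hi))


# Toy allocation: three variables, budget 3, each bounded to [0, 2].
x = solve_box_sum_qp(c=[3.0, 1.0, 2.0], lower=[0.0] * 3, upper=[2.0] * 3, total=3.0)
# x is approximately [2.0, 0.0, 1.0], reached at multiplier lam = 1
```

The first variable is held at its upper bound and the second at its lower bound, while the third lies strictly inside its box. This is the water-filling-like pattern typical of power allocation problems.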

arXiv.org e-Print Archive

HAL-CentraleSupelec

CiteSeerX

Crossref

Archivio della Ricerca - Università di Pisa

Hal-Diderot

HAL-Rennes 1