518 research outputs found

    Synthesis of Soundfields through Irregular Loudspeaker Arrays Based on Convolutional Neural Networks

    Full text link
    Most soundfield synthesis approaches deal with extensive and regular loudspeaker arrays, which are often not suitable for home audio systems, due to physical space constraints. In this article we propose a technique for soundfield synthesis through more easily deployable irregular loudspeaker arrays, i.e. where the spacing between loudspeakers is not constant, based on deep learning. The input are the driving signals obtained through a plane wave decomposition-based technique. While the considered driving signals are able to correctly reproduce the soundfield with a regular array, they show degraded performances when using irregular setups. Through a Convolutional Neural Network (CNN) we modify the driving signals in order to compensate the errors in the reproduction of the desired soundfield. Since no ground-truth driving signals are available for the compensated ones, we train the model by calculating the loss between the desired soundfield at a number of control points and the one obtained through the driving signals estimated by the network. Numerical results show better reproduction accuracy both with respect to the plane wave decomposition-based technique and the pressure-matching approach

    Timbre transfer using image-to-image denoising diffusion implicit models

    Full text link
    Timbre transfer techniques aim at converting the sound of a musical piece generated by one instrument into the same one as if it was played by another instrument, while maintaining as much as possible the content in terms of musical characteristics such as melody and dynamics. Following their recent breakthroughs in deep learning-based generation, we apply Denoising Diffusion Models (DDMs) to perform timbre transfer. Specifically, we apply the recently proposed Denoising Diffusion Implicit Models (DDIMs) that enable to accelerate the sampling procedure. Inspired by the recent application of DDMs to image translation problems we formulate the timbre transfer task similarly, by first converting the audio tracks into log mel spectrograms and by conditioning the generation of the desired timbre spectrogram through the input timbre spectrogram. We perform both one-to-one and many-to-many timbre transfer, by converting audio waveforms containing only single instruments and multiple instruments, respectively. We compare the proposed technique with existing state-of-the-art methods both through listening tests and objective measures in order to demonstrate the effectiveness of the proposed model

    The wayward spectator

    Get PDF
    Through a heterogeneous set of contributions from film studies, psychoanalysis and critical theory, including Leo Bersani and Laura Marks, Jacques Rancière and Jean-Bertrand Pontalis, the dissertation confronts spectatorship, film theory, and their relation, on the issue of emancipation and of its discursive regulation. Against the pedagogical forms of film theory and the authoritarian framing of the spectator’s position that can be seen to be integral to the functioning of the cinematographic apparatus, this work suggests that we consider theory as an internal aspect of film experience, rather than as its external explanation. Arguing for the fundamental emancipation of the spectator together with the heteronomy of the subject and the discursivity of film experience, the dissertation addresses what, in film experience, resists being reduced within intellectual mastery, metapsychological structures, and the logic of interpretation, and rather remains radically incommensurable with the principles of its intelligibility. Indeterminacy and a lack in mastery are thus taken to be the constitutional ground of spectatorship as a praxis and of the spectator as a site of tensions and dissensus. More specifically, three basic dimensions and categories of this “wayward” ground of film experience will be examined in their correspondences and connections: contingency, free association, and embodiment

    Frequency-Sliding Generalized Cross-Correlation: A Sub-band Time Delay Estimation Approach

    Full text link
    The generalized cross correlation (GCC) is regarded as the most popular approach for estimating the time difference of arrival (TDOA) between the signals received at two sensors. Time delay estimates are obtained by maximizing the GCC output, where the direct-path delay is usually observed as a prominent peak. Moreover, GCCs play also an important role in steered response power (SRP) localization algorithms, where the SRP functional can be written as an accumulation of the GCCs computed from multiple sensor pairs. Unfortunately, the accuracy of TDOA estimates is affected by multiple factors, including noise, reverberation and signal bandwidth. In this paper, a sub-band approach for time delay estimation aimed at improving the performance of the conventional GCC is presented. The proposed method is based on the extraction of multiple GCCs corresponding to different frequency bands of the cross-power spectrum phase in a sliding-window fashion. The major contributions of this paper include: 1) a sub-band GCC representation of the cross-power spectrum phase that, despite having a reduced temporal resolution, provides a more suitable representation for estimating the true TDOA; 2) such matrix representation is shown to be rank one in the ideal noiseless case, a property that is exploited in more adverse scenarios to obtain a more robust and accurate GCC; 3) we propose a set of low-rank approximation alternatives for processing the sub-band GCC matrix, leading to better TDOA estimates and source localization performance. An extensive set of experiments is presented to demonstrate the validity of the proposed approach.Comment: Article accepted in IEEE/ACM Transactions on Audio, Speech, and Language Processin

    El abuso del derecho y la interpretaciĂłn jurĂ­dica

    Get PDF
    El ensayo presenta y critica, en general, la teoría de los ilícitos atípicos de Atienza y Ruiz Manero. Luego pone en tela de juicio, de manera más específica, la teoría normativa que Atienza y Ruiz Manero han delineado sobre el abuso del derecho. El autor basa sus críticas en una teoría escéptica de la interpretación jurídica y en un enfoque meta-ético de corte no-objetivist
    • …
    corecore