Synthesis of Soundfields through Irregular Loudspeaker Arrays Based on Convolutional Neural Networks
Most soundfield synthesis approaches deal with extensive and regular
loudspeaker arrays, which are often not suitable for home audio systems, due to
physical space constraints. In this article we propose a technique for
soundfield synthesis through more easily deployable irregular loudspeaker
arrays, i.e. where the spacing between loudspeakers is not constant, based on
deep learning. The inputs are the driving signals obtained through a plane wave
decomposition-based technique. While the considered driving signals are able to
correctly reproduce the soundfield with a regular array, they show degraded
performance when using irregular setups. Through a Convolutional Neural
Network (CNN) we modify the driving signals in order to compensate for the errors
in the reproduction of the desired soundfield. Since no ground-truth driving
signals are available for the compensated ones, we train the model by
calculating the loss between the desired soundfield at a number of control
points and the one obtained through the driving signals estimated by the
network. Numerical results show better reproduction accuracy both with respect
to the plane wave decomposition-based technique and the pressure-matching
approach.
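
The loss described in this abstract compares the desired soundfield at a set of control points with the field produced by the driving signals. The following is a minimal sketch of that comparison at a single frequency, not the authors' implementation: it assumes free-field point-source loudspeakers, and the function names, array geometry, frequency, and regularization value are all hypothetical. The CNN itself is not modeled; a regularized pressure-matching solution (the baseline named in the abstract) stands in for the driving signals.

```python
import numpy as np

def green_function(sources, points, k):
    """Free-field 3D Green's function, shape (n_points, n_sources).
    Assumes ideal point-source loudspeakers (an assumption of this sketch)."""
    r = np.linalg.norm(points[:, None, :] - sources[None, :, :], axis=-1)
    return np.exp(-1j * k * r) / (4 * np.pi * r)

def reproduction_loss(d, G, p_des):
    """Mean squared error between reproduced (G @ d) and desired pressure."""
    return np.mean(np.abs(G @ d - p_des) ** 2)

def pressure_matching(G, p_des, reg=1e-3):
    """Regularized least-squares driving signals (pressure-matching baseline)."""
    A = G.conj().T @ G + reg * np.eye(G.shape[1])
    return np.linalg.solve(A, G.conj().T @ p_des)

rng = np.random.default_rng(0)
k = 2 * np.pi * 500 / 343.0                      # wavenumber at 500 Hz, c = 343 m/s

# Irregular circular array: 16 loudspeakers at random (non-constant) angles.
angles = np.sort(rng.uniform(0, 2 * np.pi, 16))
speakers = np.stack([2 * np.cos(angles), 2 * np.sin(angles), np.zeros(16)], axis=1)

# Control points on a small grid inside the array.
gx, gy = np.meshgrid(np.linspace(-0.5, 0.5, 6), np.linspace(-0.5, 0.5, 6))
control = np.stack([gx.ravel(), gy.ravel(), np.zeros(gx.size)], axis=1)

G = green_function(speakers, control, k)
p_des = np.exp(-1j * k * control[:, 0])          # desired field: plane wave along +x

d_pm = pressure_matching(G, p_des)
loss_pm = reproduction_loss(d_pm, G, p_des)
loss_zero = reproduction_loss(np.zeros(16, dtype=complex), G, p_des)
print(loss_pm, loss_zero)
```

In a training setup like the one the abstract describes, `reproduction_loss` would be evaluated on the network's output instead of `d_pm`, so no ground-truth driving signals are needed.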
Timbre transfer using image-to-image denoising diffusion implicit models
Timbre transfer techniques aim at converting the sound of a musical piece
generated by one instrument into the same piece as if it were played by another
instrument, while maintaining as much as possible the content in terms of
musical characteristics such as melody and dynamics. Following their recent
breakthroughs in deep learning-based generation, we apply Denoising Diffusion
Models (DDMs) to perform timbre transfer. Specifically, we apply the recently
proposed Denoising Diffusion Implicit Models (DDIMs), which make it possible to accelerate
the sampling procedure. Inspired by the recent application of DDMs to image
translation problems, we formulate the timbre transfer task similarly, by first
converting the audio tracks into log mel spectrograms and by conditioning the
generation of the desired timbre spectrogram through the input timbre
spectrogram. We perform both one-to-one and many-to-many timbre transfer, by
converting audio waveforms containing only single instruments and multiple
instruments, respectively. We compare the proposed technique with existing
state-of-the-art methods both through listening tests and objective measures in
order to demonstrate the effectiveness of the proposed model.
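
As an illustration of why DDIMs allow accelerated sampling, the sketch below shows the standard deterministic DDIM update (the zero-variance case) in NumPy. It is not the authors' model: the array `x0` is a random stand-in for a log mel spectrogram, the noise-schedule values are arbitrary, and the true noise is used in place of a trained noise-prediction network. In the setting the abstract describes, that network would additionally be conditioned on the input timbre spectrogram.

```python
import numpy as np

def ddim_step(x_t, eps_pred, alpha_bar_t, alpha_bar_prev):
    """One deterministic DDIM update from step t to an earlier step.
    alpha_bar_* are cumulative noise-schedule products in (0, 1]."""
    # Estimate the clean sample implied by the current noise prediction.
    x0_pred = (x_t - np.sqrt(1.0 - alpha_bar_t) * eps_pred) / np.sqrt(alpha_bar_t)
    # Re-noise that estimate to the (lower) noise level of the earlier step.
    return np.sqrt(alpha_bar_prev) * x0_pred + np.sqrt(1.0 - alpha_bar_prev) * eps_pred

rng = np.random.default_rng(0)
x0 = rng.standard_normal((8, 8))     # hypothetical stand-in for a spectrogram patch
eps = rng.standard_normal((8, 8))    # noise; a trained network would predict this
a_t, a_prev = 0.5, 0.8               # arbitrary schedule values for the demo

x_t = np.sqrt(a_t) * x0 + np.sqrt(1.0 - a_t) * eps      # forward noising
x_prev = ddim_step(x_t, eps, a_t, a_prev)               # one reverse step
expected = np.sqrt(a_prev) * x0 + np.sqrt(1.0 - a_prev) * eps
print(np.allclose(x_prev, expected))
```

Because the update does not require consecutive steps, `alpha_bar_prev` can belong to a step far from `t`, which is what lets the sampler skip steps.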
The wayward spectator
Through a heterogeneous set of contributions from film studies, psychoanalysis and critical theory, including Leo Bersani and Laura Marks, Jacques Rancière and Jean-Bertrand Pontalis, the dissertation confronts spectatorship, film theory, and their relation, on the issue of emancipation and of its discursive regulation. Against the pedagogical forms of film theory and the authoritarian framing of the spectator’s position that can be seen to be integral to the functioning of the cinematographic apparatus, this work suggests that we consider theory as an internal aspect of film experience, rather than as its external explanation. Arguing for the fundamental emancipation of the spectator together with the heteronomy of the subject and the discursivity of film experience, the dissertation addresses what, in film experience, resists being reduced within intellectual mastery, metapsychological structures, and the logic of interpretation, and rather remains radically incommensurable with the principles of its intelligibility. Indeterminacy and a lack in mastery are thus taken to be the constitutional ground of spectatorship as a praxis and of the spectator as a site of tensions and dissensus. More specifically, three basic dimensions and categories of this “wayward” ground of film experience will be examined in their correspondences and connections: contingency, free association, and embodiment
Frequency-Sliding Generalized Cross-Correlation: A Sub-band Time Delay Estimation Approach
The generalized cross correlation (GCC) is regarded as the most popular
approach for estimating the time difference of arrival (TDOA) between the
signals received at two sensors. Time delay estimates are obtained by
maximizing the GCC output, where the direct-path delay is usually observed as a
prominent peak. Moreover, GCCs also play an important role in steered response
power (SRP) localization algorithms, where the SRP functional can be written as
an accumulation of the GCCs computed from multiple sensor pairs. Unfortunately,
the accuracy of TDOA estimates is affected by multiple factors, including
noise, reverberation and signal bandwidth. In this paper, a sub-band approach
for time delay estimation aimed at improving the performance of the
conventional GCC is presented. The proposed method is based on the extraction
of multiple GCCs corresponding to different frequency bands of the cross-power
spectrum phase in a sliding-window fashion. The major contributions of this
paper include: 1) a sub-band GCC representation of the cross-power spectrum
phase that, despite having a reduced temporal resolution, provides a more
suitable representation for estimating the true TDOA; 2) this matrix
representation is shown to be rank one in the ideal noiseless case, a property
that is exploited in more adverse scenarios to obtain a more robust and
accurate GCC; 3) we propose a set of low-rank approximation alternatives for
processing the sub-band GCC matrix, leading to better TDOA estimates and source
localization performance. An extensive set of experiments is presented to
demonstrate the validity of the proposed approach. Comment: Article accepted in
IEEE/ACM Transactions on Audio, Speech, and Language Processing
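
The ideas in this abstract can be sketched in a few lines of NumPy. The example below is a simplified illustration under stated assumptions, not the published algorithm: the function names, band width, hop size, and test signal are hypothetical, the delay is applied circularly so that the noiseless case is exact, and no sub-band windowing or noise handling is included. It computes a conventional GCC with phase transform, then a matrix of sub-band GCCs obtained by sliding a window over the cross-power spectrum phase, and finally reads the delay from the leading singular vector of that matrix.

```python
import numpy as np

def cross_power_spectrum_phase(x1, x2, n_fft):
    """Phase of the cross-power spectrum (PHAT weighting)."""
    X1, X2 = np.fft.rfft(x1, n_fft), np.fft.rfft(x2, n_fft)
    cps = X2 * np.conj(X1)            # peak lag > 0 means x2 lags x1
    return cps / np.maximum(np.abs(cps), 1e-12)

def gcc_phat(phase, n_fft, max_lag):
    """Full-band GCC: inverse transform of the phase, reordered by lag."""
    cc = np.fft.irfft(phase, n_fft)
    cc = np.concatenate([cc[-max_lag:], cc[:max_lag + 1]])
    return np.arange(-max_lag, max_lag + 1), cc

def sub_band_gcc_matrix(phase, band, hop):
    """One short inverse DFT per sliding frequency band (rows = sub-bands)."""
    starts = range(0, len(phase) - band + 1, hop)
    return np.array([np.fft.ifft(phase[s:s + band]) for s in starts])

def rank_one_delay(G, n_fft):
    """Delay read from the leading right singular vector of the matrix."""
    _, S, Vh = np.linalg.svd(G, full_matrices=False)
    band = G.shape[1]
    tau = int(np.argmax(np.abs(Vh[0])))
    if tau > band // 2:               # map wrapped indices to negative lags
        tau -= band
    return tau * n_fft / band, S      # each sub-band lag spans n_fft / band samples

rng = np.random.default_rng(0)
n_fft, true_delay = 1024, 16
x1 = rng.standard_normal(n_fft)
x2 = np.roll(x1, true_delay)          # circular delay: idealized noiseless case

phase = cross_power_spectrum_phase(x1, x2, n_fft)
lags, cc = gcc_phat(phase, n_fft, max_lag=64)
full_band_delay = int(lags[np.argmax(cc)])

G = sub_band_gcc_matrix(phase, band=128, hop=32)
sub_band_delay, S = rank_one_delay(G, n_fft)
print(full_band_delay, sub_band_delay, S[1] / S[0])
```

In this idealized setting every row of `G` is the same lag profile multiplied by a band-dependent complex constant, which is why the second singular value is negligible; the coarser lag grid (`n_fft / band` samples per bin) reflects the reduced temporal resolution mentioned in the abstract.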
Abuse of rights and legal interpretation
The essay presents and criticizes, in general terms, Atienza and Ruiz Manero's theory of atypical illicit acts. It then calls into question, more specifically, the normative theory that Atienza and Ruiz Manero have outlined on the abuse of rights. The author bases these criticisms on a skeptical theory of legal interpretation and on a non-objectivist meta-ethical approach