Search CORE

6,296 research outputs found

Listening for Sirens: Locating and Classifying Acoustic Alarms in City Scenes

Author: Marchegiani Letizia
Newman Paul
Publication venue
Publication date: 11/10/2018
Field of study

This paper is about alerting acoustic event detection and sound source localisation in an urban scenario. Specifically, we are interested in spotting the presence of horns, and sirens of emergency vehicles. In order to obtain a reliable system able to operate robustly despite the presence of traffic noise, which can be copious, unstructured and unpredictable, we propose to treat the spectrograms of incoming stereo signals as images, and apply semantic segmentation, based on a Unet architecture, to extract the target sound from the background noise. In a multi-task learning scheme, together with signal denoising, we perform acoustic event classification to identify the nature of the alerting sound. Lastly, we use the denoised signals to localise the acoustic source on the horizon plane, by regressing the direction of arrival of the sound through a CNN architecture. Our experimental evaluation shows an average classification rate of 94%, and a median absolute error on the localisation of 7.5{\deg} when operating on audio frames of 0.5s, and of 2.5{\deg} when operating on frames of 2.5s. The system offers excellent performance in particularly challenging scenarios, where the noise level is remarkably high.Comment: 6 pages, 9 figure

arXiv.org e-Print Archive

Archivio istituzionale della Ricerca - Università degli Studi di Parma

VBN

Component separation methods for the Planck mission

Author: A. Bonaldi
Ashdown
Bedini
Bedini
Bennett
Bennett
Bertin
Bobin
Bonaldi
Bonaldi
Bouchet
C. Baccigalupi
C. Dickinson
Colafrancesco
D. Herranz
Davies
de Zotti
Delabrouille
Dickinson
E. Martínez-González
E. Salerno
Eriksen
F. K. Hansen
F. Stivoli
Finkbeiner
G. de Zotti
G. Patanchon
González-Nuevo
González-Nuevo
Granato
Górski
H. K. Eriksen
Hansen
Haslam
Herranz
Hinshaw
Hinshaw
Hobson
Hobson
Hyvarinen
J. Bobin
J. Delabrouille
J. González-Nuevo
J. L. Sanz
J.-B. Melin
J.-F. Cardoso
J.-L. Starck
Jones
Komatsu
López-Caniego
López-Caniego
M. Betoule
M. Le Jeune
M. López-Caniego
M. Massardi
M.-A. Miville-Deschênes
Maino
Maino
Martínez-González
Masi
Melin
Miville-Deschênes
Negrello
P. Vielva
Poutanen
R. B. Barreiro
R. Stompor
Reinecke
S. M. Leach
S. Prunet
S. Ricciardi
Schlegel
Serjeant
Stolyarov
Stolyarov
Tegmark
V. Stolyarov
Publication venue: 'EDP Sciences'
Publication date: 01/01/2008
Field of study

The Planck satellite will map the full sky at nine frequencies from 30 to 857 GHz. The CMB intensity and polarization that are its prime targets are contaminated by foreground emission. The goal of this paper is to compare proposed methods for separating CMB from foregrounds based on their different spectral and spatial characteristics, and to separate the foregrounds into components of different physical origin. A component separation challenge has been organized, based on a set of realistically complex simulations of sky emission. Several methods including those based on internal template subtraction, maximum entropy method, parametric method, spatial and harmonic cross correlation methods, and independent component analysis have been tested. Different methods proved to be effective in cleaning the CMB maps from foreground contamination, in reconstructing maps of diffuse Galactic emissions, and in detecting point sources and thermal Sunyaev-Zeldovich signals. The power spectrum of the residuals is, on the largest scales, four orders of magnitude lower than that of the input Galaxy power spectrum at the foreground minimum. The CMB power spectrum was accurately recovered up to the sixth acoustic peak. The point source detection limit reaches 100 mJy, and about 2300 clusters are detected via the thermal SZ effect on two thirds of the sky. We have found that no single method performs best for all scientific objectives. We foresee that the final component separation pipeline for Planck will involve a combination of methods and iterations between processing steps targeted at different objectives such as diffuse component separation, spectral estimation and compact source extraction.Comment: Matches version accepted by A&A. A version with high resolution figures is available at http://people.sissa.it/~leach/compsepcomp.pd

EDP Sciences OAI-PMH repository (1.2.0)

Caltech Authors

Hal-Diderot

arXiv.org e-Print Archive

HAL-INSU

Resiliency Assessment and Enhancement of Intrinsic Fingerprinting

Author: Chuang Wei-Hong
Publication venue
Publication date: 01/01/2012
Field of study

Intrinsic fingerprinting is a class of digital forensic technology that can detect traces left in digital multimedia data in order to reveal data processing history and determine data integrity. Many existing intrinsic fingerprinting schemes have implicitly assumed favorable operating conditions whose validity may become uncertain in reality. In order to establish intrinsic fingerprinting as a credible approach to digital multimedia authentication, it is important to understand and enhance its resiliency under unfavorable scenarios. This dissertation addresses various resiliency aspects that can appear in a broad range of intrinsic fingerprints. The first aspect concerns intrinsic fingerprints that are designed to identify a particular component in the processing chain. Such fingerprints are potentially subject to changes due to input content variations and/or post-processing, and it is desirable to ensure their identifiability in such situations. Taking an image-based intrinsic fingerprinting technique for source camera model identification as a representative example, our investigations reveal that the fingerprints have a substantial dependency on image content. Such dependency limits the achievable identification accuracy, which is penalized by a mismatch between training and testing image content. To mitigate such a mismatch, we propose schemes to incorporate image content into training image selection and significantly improve the identification performance. We also consider the effect of post-processing against intrinsic fingerprinting, and study source camera identification based on imaging noise extracted from low-bit-rate compressed videos. While such compression reduces the fingerprint quality, we exploit different compression levels within the same video to achieve more efficient and accurate identification. The second aspect of resiliency addresses anti-forensics, namely, adversarial actions that intentionally manipulate intrinsic fingerprints. We investigate the cost-effectiveness of anti-forensic operations that counteract color interpolation identification. Our analysis pinpoints the inherent vulnerabilities of color interpolation identification, and motivates countermeasures and refined anti-forensic strategies. We also study the anti-forensics of an emerging space-time localization technique for digital recordings based on electrical network frequency analysis. Detection schemes against anti-forensic operations are devised under a mathematical framework. For both problems, game-theoretic approaches are employed to characterize the interplay between forensic analysts and adversaries and to derive optimal strategies. The third aspect regards the resilient and robust representation of intrinsic fingerprints for multiple forensic identification tasks. We propose to use the empirical frequency response as a generic type of intrinsic fingerprint that can facilitate the identification of various linear and shift-invariant (LSI) and non-LSI operations