
    A new variance-based approach for discriminative feature extraction in machine hearing classification using spectrogram features

    Machine hearing is an emerging research field that is analogous to machine vision in that it aims to equip computers with the ability to hear and recognise a variety of sounds. It is a key enabler of natural human–computer speech interfacing, as well as of applications such as automated security surveillance, environmental monitoring, and smart homes, buildings, and cities. Recent advances in machine learning allow current systems to accurately recognise a diverse range of sounds under controlled conditions. However, doing so in real-world noisy conditions remains a challenging task. Several front-end feature extraction methods have been used for machine hearing, employing speech recognition features like MFCC and PLP, as well as image-like features such as AIM and SIF. The best choice of feature is found to depend upon the noise environment and the machine learning techniques used. Machine learning methods such as deep neural networks have been shown capable of inferring discriminative classification rules from less structured front-end features in related domains. In the machine hearing field, spectrogram image features have recently shown good performance for noise-corrupted classification using deep neural networks. However, there are many ways of extracting features from spectrograms. This paper explores a novel data-driven feature extraction method that uses variance-based criteria to define spectral pooling of features from spectrograms. The proposed method, based on maximising the pooled spectral variance of foreground and background sound models, is shown to achieve very good performance for robust classification.
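
    As a rough illustration of the variance-driven pooling the abstract describes, the Python sketch below groups log-spectrogram bins into bands whose edges track cumulative spectral variance, so high-variance regions get finer pooling. It is a minimal sketch, not the paper's method: the function name variance_pooled_features and all parameter choices are hypothetical, and it uses the variance of a single signal rather than the paper's contrast between foreground and background sound models.

```python
# Minimal sketch of variance-based spectral pooling (illustrative names;
# the paper's criterion additionally contrasts foreground and background
# sound models, which is not reproduced here).
import numpy as np
from scipy.signal import spectrogram

def variance_pooled_features(x, fs, n_bands=20, nperseg=512):
    """Pool spectrogram bins into n_bands regions whose edges are placed
    at equal increments of cumulative spectral variance, so that
    high-variance (more discriminative) regions get finer pooling."""
    _, _, S = spectrogram(x, fs=fs, nperseg=nperseg)
    log_s = np.log(S + 1e-10)            # log-magnitude spectrogram
    var = log_s.var(axis=1)              # per-bin variance over time
    cum = np.cumsum(var) / var.sum()     # normalised cumulative variance
    edges = np.searchsorted(cum, np.linspace(0.0, 1.0, n_bands + 1)[1:-1])
    edges = np.concatenate(([0], edges, [log_s.shape[0]]))
    for i in range(1, len(edges)):       # keep every band non-empty
        edges[i] = max(edges[i], edges[i - 1] + 1)
    # Mean-pool the log-spectrogram within each variance-defined band.
    return np.stack([log_s[edges[i]:edges[i + 1]].mean(axis=0)
                     for i in range(n_bands)])   # (n_bands, n_frames)
```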

    Acoustic Event Classification using spectral band selection and Non-Negative Matrix Factorization-based features

    Feature extraction methods for sound events have traditionally been based on parametric representations specifically developed for speech signals, such as the well-known Mel Frequency Cepstrum Coefficients (MFCC). However, the discrimination capabilities of these features for Acoustic Event Classification (AEC) tasks could be enhanced by taking into account the spectro-temporal structure of acoustic event signals. In this paper, a new front-end for AEC that incorporates this specific information is proposed. It consists of two different stages: short-time feature extraction and temporal feature integration. The first module aims at providing a better spectral representation of the different acoustic events on a frame-by-frame basis, by means of the automatic selection of the optimal set of frequency bands from which cepstral-like features are extracted. The second stage is designed to capture the most relevant temporal information in the short-time features, through the application of Non-Negative Matrix Factorization (NMF) to their periodograms computed over long audio segments. The whole front-end has been evaluated in clean and noisy conditions. Experiments show that the removal of certain frequency bands (mainly located in the medium region of the spectrum for clean conditions and in low frequencies for noisy environments) in the short-time feature computation, in conjunction with the NMF technique for temporal feature integration, significantly improves the performance of a Support Vector Machine (SVM)-based AEC system with respect to conventional MFCCs. This work has been partially supported by the Spanish Government Grant TEC2014-53390-P. Financial support from the Fundación Carolina and Universidad Católica San Pablo, Arequipa (Jimmy Ludeña-Choez) is thankfully acknowledged.
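
    The Python sketch below illustrates the second stage (temporal feature integration) under simplifying assumptions: equal-length segments, a fixed rather than automatically selected set of frequency bands, and hypothetical helper names (segment_periodogram, fit_front_end). It computes the periodogram of each short-time feature trajectory, factorises the training periodograms with scikit-learn's NMF, and trains an SVM on the resulting activations.

```python
# Illustrative sketch of the temporal-integration stage; names and
# parameters are assumptions, and the paper's automatic frequency-band
# selection (the first stage) is omitted for brevity.
import numpy as np
from sklearn.decomposition import NMF
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

def segment_periodogram(frame_feats):
    """frame_feats: (n_frames, n_coeffs) short-time cepstral-like features
    of one long segment (equal n_frames assumed across segments).
    Returns the flattened periodogram of each coefficient trajectory,
    i.e. its temporal modulation spectrum."""
    centred = frame_feats - frame_feats.mean(axis=0)
    return (np.abs(np.fft.rfft(centred, axis=0)) ** 2).ravel()

def fit_front_end(train_segments, train_labels, n_components=16):
    """Learn NMF bases on training periodograms and train an SVM on the
    resulting activations (the integrated segment-level features)."""
    X = np.stack([segment_periodogram(s) for s in train_segments])
    nmf = NMF(n_components=n_components, init="nndsvda", max_iter=500)
    H = nmf.fit_transform(X)             # (n_segments, n_components)
    clf = make_pipeline(StandardScaler(), SVC(kernel="rbf"))
    clf.fit(H, train_labels)
    return nmf, clf
```

    At test time, each segment's periodogram would be projected onto the learned bases with nmf.transform before being passed to the classifier, so train and test features share the same factorisation.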