Search CORE

46 research outputs found

Source separation with one ear : proposition for an anthropomorphic approach

Author: Pichevar Ramin
Rouat Jean
Publication venue
Publication date: 01/01/2005
Field of study

Abstract : We present an example of an anthropomorphic approach, in which auditory-based cues are combined with temporal correlation to implement a source separation system. The auditory features are based on spectral amplitudemodulation and energy information obtained through 256 cochlear filters. Segmentation and binding of auditory objects are performed with a two-layered spiking neural network. The first layer performs the segmentation of the auditory images into objects, while the second layer binds the auditory objects belonging to the same source. The binding is further used to generate a mask (binary gain) to suppress the undesired sources fromthe original signal. Results are presented for a double-voiced (2 speakers) speech segment and for sentences corrupted with different noise sources. Comparative results are also given using PESQ (perceptual evaluation of speech quality) scores. The spiking neural network is fully adaptive and unsupervised

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

Savoirs UdeS

A parallel supercomputer implementation of a biological inspired neural network and its use for pattern recognition

Author: Bergeron Jocelyn
De Ladurantaye Vincent
Lavoie Jean
Lu Huizhong
Parenteau Maxime
Pichevar Ramin
Rouat Jean
Publication venue
Publication date: 01/01/2012
Field of study

Abstract : A parallel implementation of a large spiking neural network is proposed and evaluated. The neural network implements the binding by synchrony process using the Oscillatory Dynamic Link Matcher (ODLM). Scalability, speed and performance are compared for 2 implementations: Message Passing Interface (MPI) and Compute Unified Device Architecture (CUDA) running on clusters of multicore supercomputers and NVIDIA graphical processing units respectively. A global spiking list that represents at each instant the state of the neural network is described. This list indexes each neuron that fires during the current simulation time so that the influence of their spikes are simultaneously processed on all computing units. Our implementation shows a good scalability for very large networks. A complex and large spiking neural network has been implemented in parallel with success, thus paving the road towards real-life applications based on networks of spiking neurons. MPI offers a better scalability than CUDA, while the CUDA implementation on a GeForce GTX 285 gives the best cost to performance ratio. When running the neural network on the GTX 285, the processing speed is comparable to the MPI implementation on RQCHP’s Mammouth parallel with 64 notes (128 cores)

Crossref

Savoirs UdeS

Source separation with one ear : proposition for an anthropomorphic approach

Author: Pichevar Ramin
Rouat Jean
Publication venue
Publication date: 01/01/2005
Field of study

Savoirs UdeS

New Trends in Biologically-Inspired Audio Coding

Author: Hassan Lahdili
Hossein Najaf-Zadeh
Louis Thibault
Ramin Pichevar
Publication venue: 'IntechOpen'
Publication date: 01/03/2010
Field of study

This book chapter deals with the generation of auditory-inspired spectro-temporal features aimed at audio coding. To do so, we first generate sparse audio representations we call spikegrams, using projections on gammatone or gammachirp kernels that generate neural spikes. Unlike Fourier-based representations, these representations are powerful at identifying auditory events, such as onsets, offsets, transients and harmonic structures. We show that the introduction of adaptiveness in the selection of gammachirp kernels enhances the compression rate compared to the case where the kernels are non-adaptive. We also integrate a masking model that helps reduce bitrate without loss of perceptible audio quality. We then quantize coding values using the genetic algorithm that is more optimal than uniform quantization for this framework. We finally propose a method to extract frequent auditory objects (patterns) in the aforementioned sparse representations. The extracted frequency-domain patterns (auditory objects) help us address spikes (auditory events) collectively rather than individually. When audio compression is needed, the different patterns are stored in a small codebook that can be used to efficiently encode audio materials in a lossless way. The approach is applied to different audio signals and results are discussed and compared. This work is a first step towards the design of a high-quality auditory-inspired \"object-based\" audio coder

IntechOpen

Crossref

Discriminative Lasso

Author: A Frank
AS Georghiades
B Efron
B He
C Mol De
CC Chang
Da Zhou
Edwin R. Hancock
G Davis
G Gan
H Wang
H Xu
H Zou
J Huang
J Xu
Jianbing Xiahou
JJ Hull
Liyan Chen
M Yuan
MR Osborne
MR Osborne
N Shervashidze
P Zhao
R Pichevar
R Tibshirani
Si-Bao Chen
WE Vinje
Y Yao
Zheng-Jian Bai
Zhihong Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 07/04/2016
Field of study

Crossref

White Rose Research Online

A parallel supercomputer implementation of a biological inspired neural network and its use for pattern recognition

Author: Bergeron Jocelyn
De Ladurantaye Vincent
Lavoie Jean
Lu Huizhong
Parenteau Maxime
Pichevar Ramin
Rouat Jean
Publication venue
Publication date: 01/01/2012
Field of study

Savoirs UdeS

Computational Models of Auditory Scene Analysis: A Review

Author: Akram
Akram
Alain
Alain
Alain
Andreou
Andreou
Bar
Barascud
Barniv
Bee
Bee
Bendixen
Bendixen
Bey
Beáta T. Szabó
Boes
Bregman
Bregman
Carlyon
Ciocca
Cooke
Cusack
Darwin
De Coensel
Deike
Deike
Denham
Denham
Denham
Denham
Ding
Dowling
Duifhuis
Elhilali
Elhilali
Elhilali
Erber
Farkas
Farkas
Fishman
Fishman
Friston
Gibson
Goswami
Gregory
Griffiths
Guinan
Gutschalk
Gutschalk
Hartmann
Haykin
Helfer
Helmholtz
Hupé
Hupé
Irvine
István Winkler
Kersten
Kidd
Kocsis
Kondo
Kondo
Kramer
Krishnan
Krumbholz
Kubovy
Kumar
Kumar
Köhler
Leaver
Leopold
Lipp
Ma
Mathys
McDermott
McDonald
McGurk
Micheyl
Mill
Mittag
Moore
Moore
Moore
Nix
Näätänen
O'Sullivan
Oldoni
Patterson
Pichevar
Pressnitzer
Rajendran
Rankin
Rasch
Roberts
Schadwinkel
Scholl
Schwartz
Shamma
Shamma
Simon
Snyder
Snyder
Steiger
Stoffregen
Susan L. Denham
Szalárdy
Szalárdy
Teki
Teki
Teki
Thakur
Tougas
Tóth
Ulanovsky
Ulanovsky
van Noorden
Wang
Wang
Wilson
Winkler
Winkler
Winkler
Winkler
Wrigley
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2016
Field of study

Auditory scene analysis (ASA) refers to the process(es) of parsing the complex acoustic input into auditory perceptual objects representing either physical sources or temporal sound patterns, such as melodies, which contributed to the sound waves reaching the ears. A number of new computational models accounting for some of the perceptual phenomena of ASA have been published recently. Here we provide a theoretically motivated review of these computational models, aiming to relate their guiding principles to the central issues of the theoretical framework of ASA. Specifically, we ask how they achieve the grouping and separation of sound elements and whether they implement some form of competition between alternative interpretations of the sound input. We consider the extent to which they include predictive processes, as important current theories suggest that perception is inherently predictive, and also how they have been evaluated. We conclude that current computational models of ASA are fragmentary in the sense that rather than providing general competing interpretations of ASA, they focus on assessing the utility of specific processes (or algorithms) for finding the causes of the complex acoustic signal. This leaves open the possibility for integrating complementary aspects of the models into a more comprehensive theory of ASA

Crossref

Directory of Open Access Journals

Frontiers - Publisher Connector

PubMed Central

PEARL (Univ. of Plymouth)