Search CORE

64 research outputs found

Contractive De-noising Auto-encoder

Author: C.C. Chang
C.J. Burges
D.E. Rumelhart
G.E. Hinton
G.E. Hinton
H. Bourlard
P. Vincent
Publication venue
Publication date: 01/01/2014
Field of study

Auto-encoder is a special kind of neural network based on reconstruction. De-noising auto-encoder (DAE) is an improved auto-encoder which is robust to the input by corrupting the original data first and then reconstructing the original input by minimizing the reconstruction error function. And contractive auto-encoder (CAE) is another kind of improved auto-encoder to learn robust feature by introducing the Frobenius norm of the Jacobean matrix of the learned feature with respect to the original input. In this paper, we combine de-noising auto-encoder and contractive auto- encoder, and propose another improved auto-encoder, contractive de-noising auto- encoder (CDAE), which is robust to both the original input and the learned feature. We stack CDAE to extract more abstract features and apply SVM for classification. The experiment result on benchmark dataset MNIST shows that our proposed CDAE performed better than both DAE and CAE, proving the effective of our method.Comment: Figures edite

arXiv.org e-Print Archive

Crossref

Processing and Linking Audio Events in Large Multimedia Archives: The EU inEvent Project

Author: Bell P.
Bourlard H.
Ferras M.
Guillemot M.
Ingram S.
McInnes F.
Pappas N.
Popescu-Belis A.
Renals S.
Publication venue
Publication date: 01/08/2013
Field of study

In the inEvent EU project [1], we aim at structuring, retrieving, and sharing large archives of networked, and dynamically changing, multimedia recordings, mainly consisting of meetings, videoconferences, and lectures. More specifically, we are developing an integrated system that performs audiovisual processing of multimedia recordings, and labels them in terms of interconnected “hyper-events ” (a notion inspired from hyper-texts). Each hyper-event is composed of simpler facets, including audio-video recordings and metadata, which are then easier to search, retrieve and share. In the present paper, we mainly cover the audio processing aspects of the system, including speech recognition, speaker diarization and linking (across recordings), the use of these features for hyper-event indexing and recommendation, and the search portal. We present initial results for feature extraction from lecture recordings using the TED talks. Index Terms: Networked multimedia events; audio processing: speech recognition; speaker diarization and linking; multimedia indexing and searching; hyper-events. 1

Infoscience - École polytechnique fédérale de Lausanne

CiteSeerX

Edinburgh Research Explorer

Representation of Time-Varying Stimuli by a Network Exhibiting Oscillations on a Faster Time Scale

Author: A Bragin
A Bruns
A Delorme
A Gunawardana
A Rokem
AL Giraud
BJ Rhodes
C Börgers
C Tallon-Baudry
CM Glaze
CM Gray
DV Buonomano
EM Izhikevich
FE Theunissen
G Buzsáki
G Laurent
GB Christianson
GB Ermentrout
H Bourlard
HK Hartline
I Nelken
J Beshel
J Fritz
JB Kruskal
JJ Hopfield
JM Palva
JP Donoghue
KJ de Jong
KJ Maloney
LC Osborne
M Bastiaansen
M Bazhenov
M Bazhenov
M Shamir
Maoz Shamir
MS Olufsen
N Brunel
N Brunel
Nancy Kopell
O Ghitza
O Jensen
Oded Ghitza
P Lakatos
P Tass
Peter E. Latham
R Gütig
R Van Rullen
R VanRullen
RC deCharms
RD Traub
RT Canolty
S Furukawa
S Greenberg
S Panzeri
SK Kuffler
SL Hooper
SM Chase
Steven Epstein
T Gruber
V Digilakis
W Maass
Y Loewenstein
ZN Aldworth
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/05/2009
Field of study

Sensory processing is associated with gamma frequency oscillations (30–80 Hz) in sensory cortices. This raises the question whether gamma oscillations can be directly involved in the representation of time-varying stimuli, including stimuli whose time scale is longer than a gamma cycle. We are interested in the ability of the system to reliably distinguish different stimuli while being robust to stimulus variations such as uniform time-warp. We address this issue with a dynamical model of spiking neurons and study the response to an asymmetric sawtooth input current over a range of shape parameters. These parameters describe how fast the input current rises and falls in time. Our network consists of inhibitory and excitatory populations that are sufficient for generating oscillations in the gamma range. The oscillations period is about one-third of the stimulus duration. Embedded in this network is a subpopulation of excitatory cells that respond to the sawtooth stimulus and a subpopulation of cells that respond to an onset cue. The intrinsic gamma oscillations generate a temporally sparse code for the external stimuli. In this code, an excitatory cell may fire a single spike during a gamma cycle, depending on its tuning properties and on the temporal structure of the specific input; the identity of the stimulus is coded by the list of excitatory cells that fire during each cycle. We quantify the properties of this representation in a series of simulations and show that the sparseness of the code makes it robust to uniform warping of the time scale. We find that resetting of the oscillation phase at stimulus onset is important for a reliable representation of the stimulus and that there is a tradeoff between the resolution of the neural representation of the stimulus and robustness to time-warp. Author Summary Sensory processing of time-varying stimuli, such as speech, is associated with high-frequency oscillatory cortical activity, the functional significance of which is still unknown. One possibility is that the oscillations are part of a stimulus-encoding mechanism. Here, we investigate a computational model of such a mechanism, a spiking neuronal network whose intrinsic oscillations interact with external input (waveforms simulating short speech segments in a single acoustic frequency band) to encode stimuli that extend over a time interval longer than the oscillation's period. The network implements a temporally sparse encoding, whose robustness to time warping and neuronal noise we quantify. To our knowledge, this study is the first to demonstrate that a biophysically plausible model of oscillations occurring in the processing of auditory input may generate a representation of signals that span multiple oscillation cycles.National Science Foundation (DMS-0211505); Burroughs Wellcome Fund; U.S. Air Force Office of Scientific Researc

CiteSeerX

Crossref

Boston University Institutional Repository (OpenBU)

Directory of Open Access Journals

PubMed Central

Recommended from our members

Pushing the Envelope—Aside

Author: Athineos Marios
Bourlard Hervé
Chen Barry
Doddington George
Ellis Daniel P. W.
Hermansky Hynek
Jain Pratibha
Morgan Nelson
Ostendorf Mari
Shinozaki Takahiro
Sivadas Sunil
Stolcke Andreas
Sönmez Kemal
Zhu Qifeng
Çetin Özgür
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2005
Field of study

Despite successes, there are still significant limitations to speech recognition performance, particularly for conversational speech and/or for speech with significant acoustic degradations from noise or reverberation. For this reason, authors have proposed methods that incorporate different (and larger) analysis windows, which are described in this article. Note in passing that we and many others have already taken advantage of processing techniques that incorporate information over long time ranges, for instance for normalization (by cepstral mean subtraction as stated in B. Atal (1974) or relative spectral analysis (RASTA) based in H. Hermansky and N. Morgan (1994)). They also have proposed features that are based on speech sound class posterior probabilities, which have good properties for both classification and stream combination

Columbia University Academic Commons

Principal component analysis of ensemble recordings reveals cell assemblies at high temporal resolution

Author: A Levina
A Mokeichev
A Peyrache
A Riehle
Adrien Peyrache
AK Lee
AM Sengupta
BL McNaughton
C Geisler
C Tracy
CM Bishop
D Ji
D Marr
DO Hebb
DR Euston
E Oja
EP Wigner
Francesco P. Battaglia
G Biroli
G Buzsáki
G Pipa
H Bourlard
HS Kudrimoti
JK Chapin
JM Beggs
JM Beggs
JP Imhof
K Louie
Karim Benchenane
KD Harris
L Nadel
LR Squire
M Steriade
MA Nicolelis
MA Wilson
Mehdi Khamassi
RI Wilson
S Fujisawa
S Ribeiro
Sidney I. Wiener
VA Marčenko
WB Scoville
Y Ikegaya
Z Nádasdy
Publication venue: Springer US
Publication date: 01/01/2009
Field of study

Simultaneous recordings of many single neurons reveals unique insights into network processing spanning the timescale from single spikes to global oscillations. Neurons dynamically self-organize in subgroups of coactivated elements referred to as cell assemblies. Furthermore, these cell assemblies are reactivated, or replayed, preferentially during subsequent rest or sleep episodes, a proposed mechanism for memory trace consolidation. Here we employ Principal Component Analysis to isolate such patterns of neural activity. In addition, a measure is developed to quantify the similarity of instantaneous activity with a template pattern, and we derive theoretical distributions for the null hypothesis of no correlation between spike trains, allowing one to evaluate the statistical significance of instantaneous coactivations. Hence, when applied in an epoch different from the one where the patterns were identified, (e.g. subsequent sleep) this measure allows to identify times and intensities of reactivation. The distribution of this measure provides information on the dynamics of reactivation events: in sleep these occur as transients rather than as a continuous process

Crossref

Springer - Publisher Connector

International Migration, Integration and Social Cohesion online publications

Vinorelbine plus trastuzumab combination as first-line therapy for HER 2-positive metastatic breast cancer patients: an international phase II trial

Author: A Chan
A Seidman
AD Seidman
CL Vogel
D Aubert
DJ Slamon
DJ Slamon
DJ Slamon
FJ Esteva
G Bernardo
G Villanova
H L Sommer
HJ Burstein
HJ Burstein
J Gasmi
JL Bayo-Calero
L Petruzelka
M G Gil
M Jahanzeb
M Martin
M Marty
M Pawlicki
M Pegram
M Untch
M Wojtukiewicz
MA Cobleigh
MD Pegram
MS Berger
N Battelli
P Kellokumpu-Lehtinen
PM Ravdin
T Bourlard
V Georgoulias
V Guillem-Porta
Publication venue
Publication date
Field of study

The aim of this international phase II trial was to determine the efficacy and safety profile of weekly vinorelbine plus trastuzumab as first-line chemotherapy for women with HER 2-overexpressing metastatic breast cancer. Sixty-nine patients with tumours overexpressing HER 2 received vinorelbine: 30 mg m−2 week−1 and trastuzumab: 4 mg kg−1 on day 1 as a loading dose followed by 2 mg kg−1 week−1 starting on day 8. Sixty-two patients were evaluable for response and 69 patients were evaluable for toxicity. The overall response rate was 62.9%. The median time to response was 8.4 weeks, the median duration of response was 17.5 months, the median progression-free survival was 9.9 months (95% CI, 5.6–12.1) and the one-year progression-free survival was 39.1%. The median survival for all patients was 23.7 months (95% CI, 18.4–32.6). This regimen was safe: grade 3–4 neutropenia were observed over 17.7% of courses in 83.8% of patients, with only two episodes of febrile neutropenia (0.1%) in two patients (2.9%). Only one patient discontinued treatment due to grade 3 symptomatic cardiac dysfunction that resolved with therapy. Vinorelbine plus trastuzumab is one of the most active treatment regimens for patients with HER 2-positive metastatic breast cancer and demonstrates a very favourable safety profile allowing prolonged treatment with long-term survival. This study has been presented in part at the following conferences: The San Antonio Breast Cancer Symposium, San Antonio, TX, USA, 2003; The American Society of Clinical Oncology, Orlando, FL, USA, 2005

Crossref

PubMed Central