Deep Multimodal Learning for Audio-Visual Speech Recognition
In this paper, we present methods in deep multimodal learning for fusing
speech and visual modalities for Audio-Visual Automatic Speech Recognition
(AV-ASR). First, we study an approach where uni-modal deep networks are trained
separately and their final hidden layers fused to obtain a joint feature space
in which another deep network is built. While the audio network alone achieves
a phone error rate (PER) of under the clean condition on the IBM large-vocabulary
audio-visual studio dataset, this fusion model achieves a PER of ,
demonstrating the tremendous value of the visual channel in phone
classification even in audio with a high signal-to-noise ratio. Second, we
present a new deep network architecture that uses a bilinear softmax layer to
account for class specific correlations between modalities. We show that
combining the posteriors from the bilinear networks with those from the fused
model mentioned above results in a further significant phone error rate
reduction, yielding a final PER of .
Comment: ICASSP 201
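The two fusion strategies described above can be sketched in a few lines of numpy. Everything here is illustrative, not taken from the paper: the dimensions, the single-layer joint network standing in for the deep network built on the fused features, and the simple averaging used to combine posteriors are all assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

# Hypothetical dimensions (illustrative only, not from the paper).
d_a, d_v, n_classes = 64, 32, 10

a = rng.standard_normal(d_a)   # audio network's final hidden layer
v = rng.standard_normal(d_v)   # visual network's final hidden layer

# Feature-level fusion: concatenate the two final hidden layers and feed
# the joint vector to another network (a single layer in this sketch).
W_joint = 0.01 * rng.standard_normal((n_classes, d_a + d_v))
p_fused = softmax(W_joint @ np.concatenate([a, v]))

# Bilinear softmax: one weight matrix per class captures class-specific
# correlations between modalities, score_c = a^T W_c v.
W_bi = 0.01 * rng.standard_normal((n_classes, d_a, d_v))
p_bilinear = softmax(np.einsum('i,cij,j->c', a, W_bi, v))

# Combining the posteriors of the two models (a plain average here).
p_combined = 0.5 * (p_fused + p_bilinear)
```

The einsum contracts the audio vector, the per-class bilinear weights, and the visual vector into one score per class, so the softmax operates over class-specific audio-visual interactions rather than over a shared joint representation.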
A Novel Approach to Face Recognition using Image Segmentation based on SPCA-KNN Method
In this paper we propose a novel method for face recognition using a hybrid SPCA-KNN (SIFT-PCA-KNN) approach. The proposed method consists of three parts. The first part preprocesses face images using a graph-based algorithm and the SIFT (Scale Invariant Feature Transform) descriptor; the graph-based topology is used for matching two face images. In the second part, eigenvalues and eigenvectors are extracted from each input face image. The goal is to extract the important information from the face data and represent it as a set of new orthogonal variables called principal components. In the final part, a nearest-neighbor classifier is designed for classifying the face images based on the SPCA-KNN algorithm. The algorithm has been tested on 100 different subjects (15 images for each class). The experimental results show that the proposed method has a positive effect on overall face recognition performance and outperforms the other examined methods.
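A minimal stand-in for the PCA and nearest-neighbor stages of the pipeline above. The graph-based preprocessing and SIFT descriptors are omitted; the random "descriptors", the class count, and the subspace size are hypothetical choices made to keep the sketch self-contained.

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy stand-in for face descriptors (the paper uses SIFT features after
# graph-based preprocessing; random class-shifted vectors are used here).
n_per_class, n_classes, dim = 15, 4, 50
X = np.vstack([rng.standard_normal((n_per_class, dim)) + 3 * c
               for c in range(n_classes)])
y = np.repeat(np.arange(n_classes), n_per_class)

# PCA: project onto the top principal components, i.e. the leading
# right singular vectors of the mean-centred data matrix.
mean = X.mean(axis=0)
Xc = X - mean
_, _, Vt = np.linalg.svd(Xc, full_matrices=False)
k = 10
Z = Xc @ Vt[:k].T          # training data in the PCA subspace

def classify(x):
    """1-nearest-neighbour classification in the PCA subspace."""
    z = (x - mean) @ Vt[:k].T
    return y[np.argmin(np.linalg.norm(Z - z, axis=1))]

# A lightly perturbed copy of a class-0 sample should map back to class 0.
pred = classify(X[0] + 0.1 * rng.standard_normal(dim))
```

Projecting both the gallery and the probe through the same mean and eigenvector basis is what makes the nearest-neighbor distances comparable.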
Deep Multi-Modal Classification of Intraductal Papillary Mucinous Neoplasms (IPMN) with Canonical Correlation Analysis
Pancreatic cancer has the poorest prognosis among all cancer types.
Intraductal Papillary Mucinous Neoplasms (IPMNs) are radiographically
identifiable precursors to pancreatic cancer; hence, early detection and
precise risk assessment of IPMN are vital. In this work, we propose a
Convolutional Neural Network (CNN) based computer aided diagnosis (CAD) system
to perform IPMN diagnosis and risk assessment by utilizing multi-modal MRI. In
our proposed approach, we use minimum and maximum intensity projections to ease
the annotation variations among different slices and type of MRIs. Then, we
present a CNN to obtain deep feature representation corresponding to each MRI
modality (T1-weighted and T2-weighted). At the final step, we employ canonical
correlation analysis (CCA) to perform a fusion operation at the feature level,
leading to discriminative canonical correlation features. Extracted features
are used for classification. Our results indicate significant improvements over
other potential approaches to this important problem. The proposed
approach does not require explicit sample balancing in cases of imbalance
between positive and negative examples. To the best of our knowledge, our study
is the first to automatically diagnose IPMN using multi-modal MRI.
Comment: Accepted for publication in the IEEE International Symposium on Biomedical Imaging (ISBI) 201
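The CCA-based feature-level fusion described above can be sketched as follows. The "deep features" for the two MRI modalities are random stand-ins with a planted shared latent structure, and all dimensions are illustrative assumptions, not values from the paper.

```python
import numpy as np

rng = np.random.default_rng(2)

# Toy stand-ins for CNN features of the two modalities (T1- and
# T2-weighted MRI), driven by a shared latent signal plus noise.
n, d1, d2, k = 200, 20, 15, 5
shared = rng.standard_normal((n, k))
X1 = shared @ rng.standard_normal((k, d1)) + 0.1 * rng.standard_normal((n, d1))
X2 = shared @ rng.standard_normal((k, d2)) + 0.1 * rng.standard_normal((n, d2))

def cca(X, Y, k):
    """Canonical correlation analysis via per-view whitening + SVD."""
    X = X - X.mean(0)
    Y = Y - Y.mean(0)
    Ux, _, _ = np.linalg.svd(X, full_matrices=False)   # whitened view 1
    Uy, _, _ = np.linalg.svd(Y, full_matrices=False)   # whitened view 2
    # Canonical correlations are the singular values of Ux^T Uy.
    U, S, Vt = np.linalg.svd(Ux.T @ Uy)
    # Canonical variates: maximally correlated projections of each view.
    return Ux @ U[:, :k], Uy @ Vt[:k].T, S[:k]

Zx, Zy, corrs = cca(X1, X2, k)
# Feature-level fusion: concatenate the canonical variates of both views
# to form the discriminative features fed to a classifier.
fused = np.concatenate([Zx, Zy], axis=1)
```

Because both views share a strong latent signal, the leading canonical correlations come out close to 1, and the concatenated canonical variates emphasize exactly that shared, discriminative variation.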
Common and Distinct Components in Data Fusion
In many areas of science multiple sets of data are collected pertaining to
the same system. Examples are food products which are characterized by
different sets of variables, bio-processes which are on-line sampled with
different instruments, or biological systems of which different genomics
measurements are obtained. Data fusion is concerned with analyzing such sets of
data simultaneously to arrive at a global view of the system under study. One
of the upcoming areas of data fusion is exploring whether the data sets have
something in common or not. This gives insight into common and distinct
variation in each data set, thereby facilitating understanding the
relationships between the data sets. Unfortunately, research on methods to
distinguish common and distinct components is fragmented, both in terminology
and in methods: there is no common ground, which hampers comparing
methods and understanding their relative merits. This paper provides a unifying
framework for this subfield of data fusion by using rigorous arguments from
linear algebra. The most frequently used methods for distinguishing common and
distinct components are explained in this framework and some practical examples
are given of these methods in the areas of (medical) biology and food science.
Comment: 50 pages, 12 figures
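One way to make the common/distinct distinction concrete is through principal angles between the column spaces of two data blocks measured on the same samples: a cosine near 1 marks a direction of common variation. In this toy sketch the block sizes, planted common subspace, and the 0.95 threshold are all assumptions.

```python
import numpy as np

rng = np.random.default_rng(3)

# Two data blocks on the same 100 samples: two planted common columns
# plus block-specific (distinct) random columns.
n = 100
common = rng.standard_normal((n, 2))
X1 = np.hstack([common, rng.standard_normal((n, 3))])  # common + distinct
X2 = np.hstack([common, rng.standard_normal((n, 4))])  # common + distinct

def principal_angle_cosines(X, Y):
    """Cosines of the principal angles between the column spaces of the
    mean-centred blocks X and Y (1 = fully shared direction)."""
    Qx, _ = np.linalg.qr(X - X.mean(0))
    Qy, _ = np.linalg.qr(Y - Y.mean(0))
    return np.linalg.svd(Qx.T @ Qy, compute_uv=False)

cos = principal_angle_cosines(X1, X2)
n_common = int(np.sum(cos > 0.95))   # estimated number of common components
```

The two planted common directions give cosines of essentially 1, while the distinct random subspaces produce much smaller cosines, so thresholding the cosines separates common from distinct variation in this idealized setting.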