Search CORE

4,208 research outputs found

Seeing voices and hearing voices: learning discriminative embeddings using cross-modal self-supervision

Author: Chung Joon Son
Chung Soo-Whan
Kang Hong Goo
Publication venue: 'International Speech Communication Association'
Publication date: 06/05/2020
Field of study

The goal of this work is to train discriminative cross-modal embeddings without access to manually annotated data. Recent advances in self-supervised learning have shown that effective representations can be learnt from natural cross-modal synchrony. We build on earlier work to train embeddings that are more discriminative for uni-modal downstream tasks. To this end, we propose a novel training strategy that not only optimises metrics across modalities, but also enforces intra-class feature separation within each of the modalities. The effectiveness of the method is demonstrated on two downstream tasks: lip reading using the features trained on audio-visual synchronisation, and speaker recognition using the features trained for cross-modal biometric matching. The proposed method outperforms state-of-the-art self-supervised baselines by a signficant margin.Comment: Under submission as a conference pape

arXiv.org e-Print Archive

Crossref

Predictive biometrics: A review and analysis of predicting personal characteristics from biometric data

Author: Abreu M.C.C.
Abreu M.C.C.
Abreu M.C.D.C.
Chang T.‐Y.
Chen Y.‐L.
Dobry G.
Gao Y.
Geng X.
Giot R.
Idrus S.
Jain A.K.
Leon S.
Li C.
Li S.Z.
Likert R.
Livingstone S.R.
Lu H.
Matta F.
Mutalib S.
Pan L.
Pervouchine V.
Pisani P.H.
Proenca H.
Ricanek K.
Rodrigues R.N.
Roli F.
Santos O.C.
Schuller B.
Tapia J.E.
Teh P.S.
Wang Z.‐H.
Yan H.
Publication venue: 'Institution of Engineering and Technology (IET)'
Publication date: 16/11/2017
Field of study

Interest in the exploitation of soft biometrics information has continued to develop over the last decade or so. In comparison with traditional biometrics, which focuses principally on person identification, the idea of soft biometrics processing is to study the utilisation of more general information regarding a system user, which is not necessarily unique. There are increasing indications that this type of data will have great value in providing complementary information for user authentication. However, the authors have also seen a growing interest in broadening the predictive capabilities of biometric data, encompassing both easily definable characteristics such as subject age and, most recently, `higher level' characteristics such as emotional or mental states. This study will present a selective review of the predictive capabilities, in the widest sense, of biometric data processing, providing an analysis of the key issues still adequately to be addressed if this concept of predictive biometrics is to be fully exploited in the future

Crossref

Sheffield Hallam University Research Archive