Search CORE

281 research outputs found

Automatically estimating emotion in music with deep long-short term memory recurrent neural networks

Author: Coutinho E
Schuller B
Trigeorgis G
Zafeiriou S
Publication venue
Publication date: 01/01/2015
Field of study

In this paper we describe our approach for the MediaEval's "Emotion in Music" task. Our method consists of deep Long-Short Term Memory Recurrent Neural Networks (LSTM-RNN) for dynamic Arousal and Valence regression, using acoustic and psychoacoustic features extracted from the songs that have been previously proven as effective for emotion prediction in music. Results on the challenge test demonstrate an excellent performance for Arousal estimation (r = 0.613 ± 0.278), but not for Valence (r = 0.026 ± 0.500). Issues regarding the quality of the test set annotations' reliability and distributions are indicated as plausible justifications for these results. By using a subset of the development set that was left out for performance estimation, we could determine that the performance of our approach may be underestimated for Valence (Arousal: r = 0.596 ± 0.386; Valence: r = 0.458 ± 0.551)

University of Liverpool Repository

3D face morphable models "In-The-Wild"

Author: Antonakos E.
Antonakos E.
Booth J.
Booth J.
Panagakis Y.
Panagakis Y.
Ploumpis S.
Ploumpis S.
Trigeorgis G.
Trigeorgis G.
Zafeiriou S.
Zafeiriou S.
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/01/2017
Field of study

3D Morphable Models (3DMMs) are powerful statistical models of 3D facial shape and texture, and among the state-of-the-art methods for reconstructing facial shape from single images. With the advent of new 3D sensors, many 3D facial datasets have been collected containing both neutral as well as expressive faces. However, all datasets are captured under controlled conditions. Thus, even though powerful 3D facial shape models can be learnt from such data, it is difficult to build statistical texture models that are sufficient to reconstruct faces captured in unconstrained conditions (in-the-wild). In this paper, we propose the first, to the best of our knowledge, in-the-wild 3DMM by combining a powerful statistical model of facial shape, which describes both identity and expression, with an in-the-wild texture model. We show that the employment of such an in-the-wild texture model greatly simplifies the fitting procedure, because there is no need to optimise with regards to the illumination parameters. Furthermore, we propose a new fast algorithm for fitting the 3DMM in arbitrary images. Finally, we have captured the first 3D facial database with relatively unconstrained conditions and report quantitative evaluations with state-of-the-art performance. Complementary qualitative reconstruction results are demonstrated on standard in-the-wild facial databases

Middlesex University Research Repository

The 3D Menpo Facial Landmark Tracking Challenge

Author: Chrysos G
Deng J
Roussos A
Trigeorgis G
Ververas E
Zafeiriou S
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 08/03/2018
Field of study

This is the final version of the article. It is the open access version, provided by the Computer Vision Foundation. Except for the watermark, it is identical to the IEEE published version. Available from IEEE via the DOI in this record.Test descriptionRecently, deformable face alignment is synonymous to the task of locating a set of 2D sparse landmarks in intensity images. Currently, discriminatively trained Deep Convolutional Neural Networks (DCNNs) are the state-of-the-art in the task of face alignment. DCNNs exploit large amount of high quality annotations that emerged the last few years. Nevertheless, the provided 2D annotations rarely capture the 3D structure of the face (this is especially evident in the facial boundary). That is, the annotations neither provide an estimate of the depth nor correspond to the 2D projections of the 3D facial structure. This paper summarises our efforts to develop (a) a very large database suitable to be used to train 3D face alignment algorithms in images captured "in-the-wild" and (b) to train and evaluate new methods for 3D face landmark tracking. Finally, we report the results of the first challenge in 3D face tracking "in-the-wild".The work of S. Zafeiriou and A. Roussos has been partially funded by the EPSRC Project EP/N007743/

Open Research Exeter

Value Uncertainty

Author: Bali Turan G.
Del Viva Luca
Hefnawy Menatalla El
Trigeorgis Lenos
Publication venue: Institute for Operations Research and Management Sciences
Publication date: 01/01/2022
Field of study

We examine how time-series volatility of book-to-market (UNC) is priced in equity returns and the relative contributions of its book volatility (variations in earnings and book value) and market volatility components (shocks in required return). UNC captures valuation risk, so stocks with high valuation risk earn higher return. An investment strategy long in high-UNC and short in low-UNC firms generates 8.5% annual risk-adjusted return. UNC valuation risk premium is driven by outperformance of high-UNC firms facing higher information risk and is not explained by established risk factors and firm characteristics

Durham Research Online

The ICL-TUM-PASSAU approach for the MediaEval 2015 "affective impact of movies" task

Author: Coutinho E
Marchi E
Ringeval F
Schuller B
Trigeorgis G
Zafeiriou S
Publication venue
Publication date: 01/01/2015
Field of study

In this paper we describe the Imperial College London, Technische Universitat München and University of Passau (ICL+TUM+PASSAU) team approach to the MediaEval's "Affective Impact of Movies" challenge, which consists in the automatic detection of affective (arousal and valence) and violent content in movie excerpts. In addition to the baseline features, we computed spectral and energy related acoustic features, and the probability of various objects being present in the video. Random Forests, AdaBoost and Support Vector Machines were used as classification methods. Best results show that the dataset is highly challenging for both affect and violence detection tasks, mainly because of issues in inter-rater agreement and data scarcity

University of Liverpool Repository

DenseReg: fully convolutional dense shape regression in-the-wild

Author: Antonakos E
Guler R
Kokkinos I
Snape P
Trigeorgis G
Zafeiriou S
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 03/03/2017
Field of study

In this paper we propose to learn a mapping from image pixels into a dense template grid through a fully convolutional network. We formulate this task as a regression problem and train our network by leveraging upon manually annotated facial landmarks “in-the-wild”. We use such landmarks to establish a dense correspondence field between a three-dimensional object template and the input image, which then serves as the ground-truth for training our regression system. We show that we can combine ideas from semantic segmentation with regression networks, yielding a highly-accurate ‘quantized regression’ architecture. Our system, called DenseReg, allows us to estimate dense image-to-template correspondences in a fully convolutional manner. As such our network can provide useful correspondence information as a stand-alone system, while when used as an initialization for Statistical Deformable Models we obtain landmark localization results that largely outperform the current state-of-the-art on the challenging 300W benchmark. We thoroughly evaluate our method on a host of facial analysis tasks, and demonstrate its use for other correspondence estimation tasks, such as the human body and the human ear. DenseReg code is made available at http://alpguler.com/DenseReg.html along with supplementary materials

Spiral - Imperial College Digital Repository

Face Normals "in-the- wild" using Fully Convolutional Networks

Author: Kokkinos I
Snape P
Trigeorgis G
Zafeiriou S
Publication venue: 30th IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Publication date: 21/07/2017
Field of study

In this work we pursue a data-driven approach to the problem of estimating surface normals from a single intensity image, focusing in particular on human faces. We introduce new methods to exploit the currently available facial databases for dataset construction and tailor a deep convolutional neural network to the task of estimating facial surface normals in-the-wild. We train a fully convolutional network that can accurately recover facial normals from images including a challenging variety of expressions and facial poses. We compare against state-of-the-art face Shape-from-Shading and 3D reconstruction techniques and show that the proposed network can recover substantially more accurate and realistic normals. Furthermore, in contrast to other existing face-specific surface recovery methods, we do not require the solving of an explicit alignment step due to the fully convolutional nature of our network

Crossref

UCL Discovery

Deep Canonical Time Warping for simultaneous alignment and representation learning of sequences

Author: Nicolaou M. A.
Schuller B.
Trigeorgis G.
Zafeiriou S.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 11/04/2017
Field of study

Machine learning algorithms for the analysis of time-series often depend on the assumption that utilised data are temporally aligned. Any temporal discrepancies arising in the data is certain to lead to ill-generalisable models, which in turn fail to correctly capture properties of the task at hand. The temporal alignment of time-series is thus a crucial challenge manifesting in a multitude of applications. Nevertheless, the vast majority of algorithms oriented towards temporal alignment are either applied directly on the observation space or simply utilise linear projections - thus failing to capture complex, hierarchical non-linear representations that may prove beneficial, especially when dealing with multi-modal data (e.g., visual and acoustic information). To this end, we present Deep Canonical Time Warping (DCTW), a method that automatically learns non-linear representations of multiple time-series that are (i) maximally correlated in a shared subspace, and (ii) temporally aligned. Furthermore, we extend DCTW to a supervised setting, where during training, available labels can be utilised towards enhancing the alignment process. By means of experiments on four datasets, we show that the representations learnt significantly outperform state-of-the-art methods in temporal alignment, elegantly handling scenarios with heterogeneous feature sets, such as the temporal alignment of acoustic and visual information

OPUS Augsburg

Goldsmiths Research Online

Crossref

University of Oulu Repository - Jultika

Spiral - Imperial College Digital Repository

A Hedged Monte Carlo Approach to Real Option Pricing

Author: A. Borison
A. Dixit
A. Dixit
A.G.N. Novaes
C.C. Chen
E. Gobet
F. Hubalek
F.A. Longstaff
G. Mittal
G.H. Golub
H. Pham
J. Oldenburg
J.A. Primbs
J.E. Ingersoll
J.F. Shapiro
J.L. Paddock
J.P. Lemor
L. Moro
L. Trigeorgis
L. Trigeorgis
L.G. Papageorgiou
L.V. Gastel
M. Musiela
M. Potters
M. Schweizer
M. Schweizer
M.J. Brennan
M.R. Grasselli
M.R. Grasselli
N. Karoui El
N. Sahinidis
O. Bobrovnytska
P. Glasserman
R. McDonald
S. Mathews
S. Titman
S.C. Myers
T. Copeland
T. Copeland
V. Henderson
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

In this work we are concerned with valuing optionalities associated to invest or to delay investment in a project when the available information provided to the manager comes from simulated data of cash flows under historical (or subjective) measure in a possibly incomplete market. Our approach is suitable also to incorporating subjective views from management or market experts and to stochastic investment costs. It is based on the Hedged Monte Carlo strategy proposed by Potters et al (2001) where options are priced simultaneously with the determination of the corresponding hedging. The approach is particularly well-suited to the evaluation of commodity related projects whereby the availability of pricing formulae is very rare, the scenario simulations are usually available only in the historical measure, and the cash flows can be highly nonlinear functions of the prices.Comment: 25 pages, 14 figure

arXiv.org e-Print Archive

Crossref

End-to-End Multimodal Emotion Recognition using Deep Neural Networks

Author: Nicolaou M. A.
Schuller B.
Trigeorgis G.
Tzirakis P.
Zafeiriou S.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2017
Field of study

Automatic affect recognition is a challenging task due to the various modalities emotions can be expressed with. Applications can be found in many domains including multimedia retrieval and human computer interaction. In recent years, deep neural networks have been used with great success in determining emotional states. Inspired by this success, we propose an emotion recognition system using auditory and visual modalities. To capture the emotional content for various styles of speaking, robust features need to be extracted. To this purpose, we utilize a Convolutional Neural Network (CNN) to extract features from the speech, while for the visual modality a deep residual network (ResNet) of 50 layers. In addition to the importance of feature extraction, a machine learning algorithm needs also to be insensitive to outliers while being able to model the context. To tackle this problem, Long Short-Term Memory (LSTM) networks are utilized. The system is then trained in an end-to-end fashion where - by also taking advantage of the correlations of the each of the streams - we manage to significantly outperform the traditional approaches based on auditory and visual handcrafted features for the prediction of spontaneous and natural emotions on the RECOLA database of the AVEC 2016 research challenge on emotion recognition

arXiv.org e-Print Archive

OPUS Augsburg

Goldsmiths Research Online

Crossref

University of Oulu Repository - Jultika

Spiral - Imperial College Digital Repository