6 research outputs found
Perspectives on Bayesian Optimization for HCI
In this position paper we discuss optimization in the HCI
domain based on our experiences with Bayesian methods
for modeling and optimization of audio systems, including
challenges related to evaluating, designing, and optimizing
such interfaces. We outline and demonstrate how a
combined Bayesian modeling and optimization approach
provides a flexible framework for integrating various user
and content attributes, while also supporting model-based
optimization of HCI systems. Finally, we discuss current
and future research direction and applications, such as
inferring user needs and optimizing interfaces for
computer assisted teaching
Perspectives on Bayesian Optimization for HCI
In this position paper we discuss optimization in the HCI
domain based on our experiences with Bayesian methods
for modeling and optimization of audio systems, including
challenges related to evaluating, designing, and optimizing
such interfaces. We outline and demonstrate how a
combined Bayesian modeling and optimization approach
provides a flexible framework for integrating various user
and content attributes, while also supporting model-based
optimization of HCI systems. Finally, we discuss current
and future research direction and applications, such as
inferring user needs and optimizing interfaces for
computer assisted teaching
Ordered Preference Elicitation Strategies for Supporting Multi-Objective Decision Making
In multi-objective decision planning and learning, much attention is paid to
producing optimal solution sets that contain an optimal policy for every
possible user preference profile. We argue that the step that follows, i.e,
determining which policy to execute by maximising the user's intrinsic utility
function over this (possibly infinite) set, is under-studied. This paper aims
to fill this gap. We build on previous work on Gaussian processes and pairwise
comparisons for preference modelling, extend it to the multi-objective decision
support scenario, and propose new ordered preference elicitation strategies
based on ranking and clustering. Our main contribution is an in-depth
evaluation of these strategies using computer and human-based experiments. We
show that our proposed elicitation strategies outperform the currently used
pairwise methods, and found that users prefer ranking most. Our experiments
further show that utilising monotonicity information in GPs by using a linear
prior mean at the start and virtual comparisons to the nadir and ideal points,
increases performance. We demonstrate our decision support framework in a
real-world study on traffic regulation, conducted with the city of Amsterdam.Comment: AAMAS 2018, Source code at
https://github.com/lmzintgraf/gp_pref_elici
Learning Combinations of Multiple Feature Representations for Music Emotion Prediction
Music consists of several structures and patterns evolving through time which greatly influences the human decoding of higher-level cognitive aspects of music like the emotions expressed in music. For tasks, such as genre, tag and emotion recognition, these structures have often been identified and used as individual and non-temporal features and representations. In this work, we address the hypothesis whether using multiple temporal and non-temporal representations of different features is beneficial for modeling music structure with the aim to predict the emotions expressed in music. We test this hypothesis by representing temporal and non-temporal structures using generative models of multiple audio features. The representations are used in a discriminative setting via the Product Probability Kernel and the Gaussian Process model enabling Multiple Kernel Learning, finding optimized combinations of both features and temporal/ non-temporal representations. We show the increased predictive performance using the combination of different features and representations along with the great interpretive prospects of this approach
Predictive modeling of expressed emotions in music using pairwise comparisons
We introduce a two-alternative forced-choice (2AFC) experimental paradigm to quantify expressed emotions in music using the arousal and valence (AV) dimensions. A wide range of well-known audio features are investigated for predicting the expressed emotions in music using learning curves and essential baselines. We furthermore investigate the scalability issues of using 2AFC in quantifying emotions expressed in music on large-scale music databases. The possibility of dividing the annotation task between multiple individuals, while pooling individuals’ comparisons is investigated by looking at the subjective differences of ranking emotion in the AV space. We find this to be problematic due to the large variation in subjects’ rankings of excerpts. Finally, solving scalability issues by reducing the number of pairwise comparisons is analyzed. We compare two active learning schemes to selecting comparisons at random by using learning curves. We show that a suitable predictive model of expressed valence in music can be achieved from only 15% of the total number of comparisons when using the Expected Value of Information (EVOI) active learning scheme. For the arousal dimension we require 9% of the total number of comparisons