22 research outputs found

Equity and Fairness of Bayesian Knowledge Tracing

Author: Knobelsdorf M.
Singla A.
Tschiatschek S.
Publication date: 01/01/2022

We consider the equity and fairness of curricula derived from Knowledge Tracing models. We begin by defining a unifying notion of an equitable tutoring system as a system that achieves maximum possible knowledge in minimal time for each student interacting with it. Realizing perfect equity requires tutoring systems that can provide individualized curricula per student. In particular, we investigate the design of equitable tutoring systems that derive their curricula from Knowledge Tracing models. We first show that many existing models, including classical Bayesian Knowledge Tracing (BKT) and Deep Knowledge Tracing (DKT), and their derived curricula can fall short of achieving equitable tutoring. To overcome this issue, we then propose a novel model, Bayesian-Bayesian Knowledge Tracing (BBKT), that naturally enables online individualization and, thereby, more equitable tutoring. We demonstrate that curricula derived from our model are more effective and equitable than those derived from classical BKT models. Furthermore, we highlight that improving models with a focus on the fairness of next-step predictions might be insufficient to develop equitable tutoring systems.

MPG.PuRe

Learning User Preferences to Incentivize Exploration in the Sharing Economy

Author: Hirnschall C.
Krause A.
Singla A.
Tschiatschek S.
Publication date: 01/01/2017

We study platforms in the sharing economy and discuss the need for incentivizing users to explore options that otherwise would not be chosen. For instance, rental platforms such as Airbnb typically rely on customer reviews to provide users with relevant information about different options. Yet, often a large fraction of options does not have any reviews available. Such options are frequently neglected as viable choices, and in turn are unlikely to be evaluated, creating a vicious cycle. Platforms can encourage users to deviate from their preferred choice by offering monetary incentives for choosing a different option instead. To efficiently learn the optimal incentives to offer, we consider structural information in user preferences and introduce a novel algorithm, Coordinated Online Learning (CoOL), for learning with structural information modeled as convex constraints. We provide formal guarantees on the performance of our algorithm and test the viability of our approach in a user study with data of apartments on Airbnb. Our findings suggest that our approach is well-suited to learn appropriate incentives and increase exploration on the investigated platform.

MPG.PuRe

Hybrid generative-discriminative training of Gaussian mixture models

Author: Peharz R
Pernkopf F
Roth W
Tschiatschek S
Publication venue: Pattern Recognition Letters
Publication date: 08/06/2018

Recent work has shown substantial performance improvements of discriminative probabilistic models over their generative counterparts. However, since discriminative models do not capture the input distribution of the data, their use in missing data scenarios is limited. To utilize the advantages of both paradigms, we present an approach to train Gaussian mixture models (GMMs) in a hybrid generative-discriminative way. This is accomplished by optimizing an objective that trades off between a generative likelihood term and either a discriminative conditional likelihood term or a large margin term using stochastic optimization. Our model substantially improves the performance of classical maximum likelihood optimized GMMs while at the same time allowing for both a consistent treatment of missing features by marginalization and the use of additional unlabeled data in a semi-supervised setting. For the covariance matrices, we employ a diagonal plus low-rank matrix structure to model important correlations while keeping the number of parameters small. We show that a non-diagonal matrix structure is crucial to achieve good performance and that the proposed structure can be utilized to considerably reduce classification time in the case of missing features. The capabilities of our model are demonstrated in extensive experiments on real-world data.

Apollo (Cambridge)

CUED - Cambridge University Engineering Department

Social sensemaking with AI: Designing an open-ended AI experience with a blind child

Author: Cutrell E.
Grayson M.
Longden C.
Marques R. F.
Morrison C.
Roumen G.
Sellen A.
Taylor A.
Thieme A.
Tschiatschek S.
Publication venue: Association for Computing Machinery
Publication date: 06/05/2021

AI technologies are often used to aid people in performing discrete tasks with well-defined goals (e.g., recognising faces in images). Emerging technologies that provide continuous, real-time information enable more open-ended AI experiences. In partnership with a blind child, we explore the challenges and opportunities of designing human-AI interaction for a system intended to support social sensemaking. Adopting a research-through-design perspective, we reflect upon working with the uncertain capabilities of AI systems in the design of this experience. We contribute: (i) a concrete example of an open-ended AI system that enabled a blind child to extend his own capabilities; (ii) an illustration of the delta between imagined and actual use, highlighting how capabilities derive from the human-AI interaction and not the AI system alone; and (iii) a discussion of design choices to craft an ongoing human-AI interaction that addresses the challenge of uncertain outputs of AI systems.

City Research Online

Learning Constraints From Human Stop-Feedback in Reinforcement Learning

Author: Poletti S.
Testolin A.
Tschiatschek S.
Publication date: 01/01/2023

We investigate an approach for enabling a reinforcement learning agent to learn about dangerous states or constraints from stop-feedback preventing the agent from taking any further, potentially dangerous, actions. Such feedback could be provided by human supervisors overseeing the RL agent's behavior while carrying out some complex tasks. To enable the RL agent to learn from the supervisor's feedback, we propose a probabilistic model for approximating how the supervisor's feedback could have been generated and consider a Bayesian approach for inferring dangerous states. We evaluated our approach using an OpenAI Safety Gym environment and demonstrated that our agent can effectively infer the imposed safety constraints. Furthermore, we conducted a user study to validate our human-inspired feedback model and to obtain insights into the human provision of stop-feedback.

Archivio istituzionale della ricerca - Università di Padova

Adaptive Scaffolding in Block-Based Programming via Synthesizing New Tasks as Pop Quizzes

Author: Devlin S.
Ghosh A.
Singla A.
Tschiatschek S.
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2022

MPG.PuRe

The most generative maximum margin Bayesian networks

Author: Peharz R
Pernkopf F
Tschiatschek S
Publication date: 01/01/2013

Although discriminative learning in graphical models generally improves classification results, the generative semantics of the model are compromised. In this paper, we introduce a novel approach of hybrid generative-discriminative learning for Bayesian networks. We use an SVM-type large margin formulation for discriminative training, introducing a likelihood-weighted ℓ1-norm for the SVM-norm-penalization. This simultaneously optimizes the data likelihood and therefore partly maintains the generative character of the model. For many network structures, our method can be formulated as a convex problem, guaranteeing a globally optimal solution. In terms of classification, the resulting models outperform state-of-the-art generative and discriminative learning methods for Bayesian networks, and are comparable with linear and kernelized SVMs. Furthermore, the models achieve likelihoods close to the maximum likelihood solution and show robust behavior in classification experiments with missing features.

CUED - Cambridge University Engineering Department

Teaching Inverse Reinforcement Learners via Features and Demonstrations

Author: Haug L.
Singla A.
Tschiatschek S.
Publication date: 01/01/2018

Learning near-optimal behaviour from an expert's demonstrations typically relies on the assumption that the learner knows the features that the true reward function depends on. In this paper, we study the problem of learning from demonstrations in the setting where this is not the case, i.e., where there is a mismatch between the worldviews of the learner and the expert. We introduce a natural quantity, the teaching risk, which measures the potential suboptimality of policies that look optimal to the learner in this setting. We show that bounds on the teaching risk guarantee that the learner is able to find a near-optimal policy using standard algorithms based on inverse reinforcement learning. Based on these findings, we suggest a teaching scheme in which the expert can decrease the teaching risk by updating the learner's worldview, and thus ultimately enable her to find a near-optimal policy.

MPG.PuRe

Learning User Preferences to Incentivize Exploration in the Sharing Economy

Author: Hirnschall C.
Krause A.
Singla A.
Tschiatschek S.
Publication date: 01/01/2018

MPG.PuRe