Search CORE

7 research outputs found

Evaluating Visual Conversational Agents via Cooperative Human-AI Games

Author: Batra Dhruv
Chandrasekaran Arjun
Chattopadhyay Prithvijit
Das Abhishek
Lee Stefan
Parikh Devi
Prabhu Viraj
Yadav Deshraj
Publication venue
Publication date: 16/08/2017
Field of study

As AI continues to advance, human-AI teams are inevitable. However, progress in AI is routinely measured in isolation, without a human in the loop. It is crucial to benchmark progress in AI, not just in isolation, but also in terms of how it translates to helping humans perform certain tasks, i.e., the performance of human-AI teams. In this work, we design a cooperative game - GuessWhich - to measure human-AI team performance in the specific context of the AI being a visual conversational agent. GuessWhich involves live interaction between the human and the AI. The AI, which we call ALICE, is provided an image which is unseen by the human. Following a brief description of the image, the human questions ALICE about this secret image to identify it from a fixed pool of images. We measure performance of the human-ALICE team by the number of guesses it takes the human to correctly identify the secret image after a fixed number of dialog rounds with ALICE. We compare performance of the human-ALICE teams for two versions of ALICE. Our human studies suggest a counterintuitive trend - that while AI literature shows that one version outperforms the other when paired with an AI questioner bot, we find that this improvement in AI-AI performance does not translate to improved human-AI performance. This suggests a mismatch between benchmarking of AI in isolation and in the context of human-AI teams.Comment: HCOMP 201

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

On the Influence of Explainable AI on Automation Bias

Author: Benz Carina
Kuehl Niklas
Satzger Gerhard
Schemmer Max
Publication venue: AIS Electronic Library (AISeL)
Publication date: 19/04/2022
Field of study

Artificial intelligence (AI) is gaining momentum, and its importance for the future of work in many areas, such as medicine and banking, is continuously rising. However, insights on the effective collaboration of humans and AI are still rare. Typically, AI supports humans in decision-making by addressing human limitations. However, it may also evoke human bias, especially in the form of automation bias as an over-reliance on AI advice. We aim to shed light on the potential to influence automation bias by explainable AI (XAI). In this pre-test, we derive a research model and describe our study design. Subsequentially, we conduct an online experiment with regard to hotel review classifications and discuss first results. We expect our research to contribute to the design and development of safe hybrid intelligence systems

arXiv.org e-Print Archive

AIS Electronic Library (AISeL)

Recommended from our members

Lets Talk about Race: Identity, Chatbots, and AI

Author: Ahmed Sara
Akten Memo
Ananny Mike
Baldwin Alyx
Baldwin James
Bartlett Jamie
Bonilla-Silva Eduardo
Bright Peter
Caliskan-Islam Aylin
Chiel Ethan
Christin Angèle
Coates Rodney
Deleuze Gilles
Diakopoulos Nicholas
Diakopoulos Nicholas
Dourish Paul
Eisenstein Jacob
Erhan Dumitru
Foley Mary Jo
Haraway Donna
Haraway Donna J.
Haraway Donna J.
Horton Helena
Jaakkola David
Jaques Natasha
Jeong Sarah
Kazemi Darius
Kushner David
Lee Heeyoung
Lee Peter
Marcus Mitchell P.
Marino Mark C.
Mikolov Tomas
Nakamura Lisa
Perez Sarah
Powazek Derek
Russell Stuart
Sinders Caroline
Tay
Tufekci Zeynep
Whitehead Keving A.
Winner Langdon
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 21/04/2018
Field of study

Why is it so hard for chatbots to talk about race? This work explores how the biased contents of databases, the syntactic focus of natural language processing, and the opaque nature of deep learning algorithms cause chatbots difficulty in handling race-talk. In each of these areas, the tensions between race and chatbots create new opportunities for people and machines. By making the abstract and disparate qualities of this problem space tangible, we can develop chatbots that are more capable of handling race-talk in its many forms. Our goal is to provide the HCI community with ways to begin addressing the question, how can chatbots handle race-talk in new and improved ways

City Research Online

Crossref

OpenWorm: an open-science approach to modeling Caenorhabditis elegans.

Author: Cantarelli M
Currie M
Gleeson P
Hokanson J
Idili G
Khayrulin S
Larson S
Palyanov A
Szigeti B
Vella M
Publication venue
Publication date: 01/01/2014
Field of study

OpenWorm is an international collaboration with the aim of understanding how the behavior of Caenorhabditis elegans (C. elegans) emerges from its underlying physiological processes. The project has developed a modular simulation engine to create computational models of the worm. The modularity of the engine makes it possible to easily modify the model, incorporate new experimental data and test hypotheses. The modeling framework incorporates both biophysical neuronal simulations and a novel fluid-dynamics-based soft-tissue simulation for physical environment-body interactions. The project's open-science approach is aimed at overcoming the difficulties of integrative modeling within a traditional academic environment. In this article the rationale is presented for creating the OpenWorm collaboration, the tools and resources developed thus far are outlined and the unique challenges associated with the project are discussed

UCL Discovery

Frontiers - Publisher Connector

PubMed Central

Recommended from our members

Fly with me : algorithms and methods for influencing a flock

Author: Genter Kathryn Long
Publication venue
Publication date: 02/10/2017
Field of study

As robots become more affordable, they will begin to exist in the world in greater quantities. Some of these robots will likely be designed to act as components in specific teams. These teams could work on tasks that are too large or complex for a single robot - or that are merely more efficiently accomplished by a team - such as surveillance in a large building or product delivery to packers in a warehouse. Multiagent systems research studies how these teams are formed and how they work together. Ad hoc teamwork, a newer area of multiagent systems research, studies how new robots can join these pre-existing teams and assist the team in accomplishing its goal. This dissertation extends and applies research in ad hoc teamwork towards the general area of flocking, which is an emergent swarm behavior. In particular, the work in this dissertation considers how ad hoc agents - called influencing agents in this dissertation - can join a flock, be recognized by the rest of the flock as part of the flock, influence the flock towards particular behaviors through their own behavior, and then separate from the flock. Specifically, the primary research question addressed in this dissertation is How can influencing agents be utilized in various types of flocks to influence the flock towards a particular behavior? In order to address this research question, this dissertation makes six main types of contributions. First, this dissertation formalizes the problem of using influencing agents to influence a flock. Second, this dissertation contributes and analyzes algorithms for influencing a flock to a desired orientation. Third, this dissertation presents methods for determining how to best add influencing agents to a flock. Fourth, this dissertation provides methods by which influencing agents can join and then leave a flock in motion. Fifth, this dissertation evaluates some of the influencing agent algorithms on a robot platform. Sixth, although the majority of this dissertation assumes the influencing agents will join a flock that behaves similarly to European starlings, this dissertation also provides insight into when and how its algorithms are generalizable to other types of flocks as well as to general teamwork and coordination research. All of the methods presented in this dissertation are empirically evaluated using a simulator that can support large flocks.Computer Science

Texas ScholarWorks