
    Probabilistic Modeling of Human Teams to Infer False Beliefs

    We develop a probabilistic graphical model (PGM) for artificially intelligent (AI) agents to infer human beliefs during a simulated urban search and rescue (USAR) scenario executed in a Minecraft environment with a team of three players. The PGM approach makes observable states and actions explicit, as well as beliefs and intentions grounded by evidence about what players see and do over time. This approach also supports inferring the effect of interventions, which are vital if AI agents are to assist human teams. The experiment incorporates manipulations of players' knowledge, and the virtual Minecraft-based testbed provides access to several streams of information, including the objects in the players' field of view. The participants are equipped with a set of marker blocks that can be placed near room entrances to signal the presence or absence of victims in the rooms to their teammates. In each team, one of the members is given a different legend for the markers than the other two, which may mislead them about the state of the rooms; that is, they will hold a false belief. We extend previous work in this field by introducing ToMCAT, an AI agent that can reason about individual and shared mental states. We find that the players' behaviors are affected by what they see in their in-game field of view, their beliefs about the meaning of the markers, and their beliefs about which meaning the team decided to adopt. In addition, we show that ToMCAT's beliefs are consistent with the players' actions and that it can infer false beliefs with accuracy significantly better than chance and comparable to inferences made by human observers.
    Comment: 8 pages, 7 figures, presented at the 2021 AAAI Fall Symposium.
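    As a concrete illustration of the kind of inference the abstract describes, here is a minimal Bayesian-update sketch: an observer maintains a posterior over which marker legend a player holds and updates it from the player's actions at marked doors. The legends, probabilities, and action set are hypothetical stand-ins, not ToMCAT's actual model.

```python
# A minimal sketch of belief inference over marker legends, in the spirit of
# the paper's PGM approach (hypothetical probabilities and variable names;
# not ToMCAT's actual model).

# Two candidate legends a player might hold for a marker block:
#   "A": marker means the room contains a victim
#   "B": marker means the room is empty
LEGENDS = ("A", "B")
prior = {"A": 0.5, "B": 0.5}

# Assumed likelihoods P(action | legend) for a player who sees a marker at a
# door: a player who believes a victim is inside is more likely to enter.
likelihood = {
    ("enter", "A"): 0.8, ("skip", "A"): 0.2,
    ("enter", "B"): 0.3, ("skip", "B"): 0.7,
}

def update(belief, action):
    """One Bayesian update of P(legend | actions) after observing an action."""
    unnorm = {lg: belief[lg] * likelihood[(action, lg)] for lg in LEGENDS}
    z = sum(unnorm.values())
    return {lg: p / z for lg, p in unnorm.items()}

belief = dict(prior)
for action in ["skip", "skip", "enter", "skip"]:  # observed at marked doors
    belief = update(belief, action)
print(belief)  # posterior mass shifts toward legend "B"
```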

    Fiber bundle imaging resolution enhancement using deep learning

    We propose a deep learning based method to estimate high-resolution images from multiple fiber bundle images. Our approach first aligns raw fiber bundle image sequences with a motion estimation neural network and then applies a 3D convolutional neural network to learn a mapping from aligned fiber bundle image sequences to their ground truth images. Evaluations on lens tissue samples and a 1951 USAF resolution target suggest that our proposed method can significantly improve spatial resolution for fiber bundle imaging systems.
    Funding: National Institute of Biomedical Imaging and Bioengineering (NIBIB) [R21EB022378].
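    A small PyTorch sketch may help make the second stage concrete: a 3D CNN that maps a stack of already-aligned low-resolution frames to a single restored frame. The motion-estimation network is assumed to have run beforehand, and the layer widths and the FusionNet3D name are illustrative, not the paper's architecture.

```python
# A minimal PyTorch sketch of the fusion stage: aligned frame stacks go in,
# one fused frame comes out. Layer sizes are illustrative only.
import torch
import torch.nn as nn

class FusionNet3D(nn.Module):
    """Maps an aligned frame stack (N, 1, T, H, W) to one image (N, 1, H, W)."""
    def __init__(self, t=8):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv3d(1, 16, kernel_size=3, padding=1), nn.ReLU(),
            nn.Conv3d(16, 16, kernel_size=3, padding=1), nn.ReLU(),
            nn.Conv3d(16, 1, kernel_size=3, padding=1),
        )
        self.collapse = nn.Conv2d(t, 1, kernel_size=1)  # fuse temporal slices

    def forward(self, x):
        x = self.body(x)              # (N, 1, T, H, W)
        x = x.squeeze(1)              # (N, T, H, W)
        return self.collapse(x)       # (N, 1, H, W)

frames = torch.randn(2, 1, 8, 64, 64)  # 8 motion-aligned low-res frames
print(FusionNet3D(t=8)(frames).shape)  # torch.Size([2, 1, 64, 64])
```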

    Investigations into Multi-Scale Retinex

    The main thrust of this paper is to investigate the multi-scale retinex (MSR) approach to image enhancement to explain the effect of the processing from a theoretical standpoint. This leads to a new algorithm with fewer arbitrary parameters that is more flexible, maintains colour fidelity, and still preserves the contrast-enhancement benefits of the original MSR method. To accomplish this we identify the explicit and implicit processing goals of MSR. By decoupling the MSR operations from one another, we build an algorithm composed of independent steps that separates out the issues of gamma adjustment, colour balance, dynamic range compression, and colour enhancement, which are all jumbled together in the original MSR method. We then extend MSR with colour constancy and chromaticity-preserving contrast enhancement.
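    For readers unfamiliar with MSR itself, the following is a minimal sketch of the classic computation the paper analyzes: a weighted sum of single-scale retinex outputs, each the log ratio of a channel to a Gaussian-blurred copy of it. The scales and equal weights below are common illustrative choices, not the paper's recommended parameters.

```python
# A minimal sketch of classic multi-scale retinex on one colour channel:
# MSR(x) = sum_n w_n * (log(I + eps) - log(G_sigma_n * I + eps)).
import numpy as np
from scipy.ndimage import gaussian_filter

def multi_scale_retinex(channel, sigmas=(15, 80, 250), eps=1.0):
    channel = channel.astype(np.float64)
    out = np.zeros_like(channel)
    for sigma in sigmas:
        blurred = gaussian_filter(channel, sigma)  # surround estimate
        out += np.log(channel + eps) - np.log(blurred + eps)
    return out / len(sigmas)  # equal weights w_n = 1/N

img = np.random.rand(128, 128) * 255  # stand-in for one colour channel
enhanced = multi_scale_retinex(img)
print(enhanced.min(), enhanced.max())
```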

    Probabilistic web image gathering

    We propose a new method for automated large scale gathering of Web images relevant to specified concepts. Our main goal is to build a knowledge base associated with as many concepts as possible for large scale object recognition studies. A second goal is supporting the building of more accurate text-based indexes for Web images. In our method, good quality candidate sets of images for each keyword are gathered as a function of analysis of the surrounding HTML text. The gathered images are then segmented into regions, and a model for the probability distribution of regions for the concept is computed using an iterative algorithm based on the previous work on statistical image annotation. The learned model is then applied to identify which images are visually relevant to the concept implied by the keyword. Implicitly, which regions of the images are relevant is also determined. Our experiments reveal that the new method performs much better than Google Image Search and a simple method based on more standard content-based image retrieval methods.
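    The iterative refinement loop can be sketched in a few lines: fit a probability model to region features from the current candidate set, score each image by the likelihood of its regions, and keep the likelier images for the next fit. The Gaussian mixture, synthetic features, and median cutoff here are illustrative stand-ins for the paper's segmentation and annotation model.

```python
# A minimal sketch of iterative relevance refinement over region features.
# Feature extraction, mixture model, and cutoff are illustrative stand-ins.
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
# Hypothetical region features: each image contributes a few feature vectors.
images = [rng.normal(loc=rng.choice([0.0, 3.0]), size=(5, 8)) for _ in range(60)]

kept = list(range(len(images)))     # start from all text-gathered candidates
for _ in range(3):                  # a few refinement iterations
    feats = np.vstack([images[i] for i in kept])
    model = GaussianMixture(n_components=2, random_state=0).fit(feats)
    # Score every image by its mean region log-likelihood under the model.
    scores = [model.score(img) for img in images]
    cutoff = np.median(scores)
    kept = [i for i, s in enumerate(scores) if s >= cutoff]  # likelier half
print(f"{len(kept)} images kept as visually relevant")
```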

    Fully Point-wise Convolutional Neural Network for Modeling Statistical Regularities in Natural Images

    Modeling statistical regularity plays an essential role in ill-posed image processing problems. Recently, deep learning based methods have been presented to implicitly learn statistical representation of pixel distributions in natural images and leverage it as a constraint to facilitate subsequent tasks, such as color constancy and image dehazing. However, the existing CNN architecture is prone to variability and diversity of pixel intensity within and between local regions, which may result in inaccurate statistical representation. To address this problem, this paper presents a novel fully point-wise CNN architecture for modeling statistical regularities in natural images. Specifically, we propose to randomly shuffle the pixels in the original images and leverage the shuffled image as input to make the CNN more concerned with the statistical properties. Moreover, since the pixels in the shuffled image are independent and identically distributed, we can replace all the large convolution kernels in the CNN with point-wise (1×1) convolution kernels while maintaining the representation ability. Experimental results on two applications, color constancy and image dehazing, demonstrate the superiority of our proposed network over the existing architectures, i.e., using 1/10 to 1/100 of the network parameters and computational cost while achieving comparable performance.
    Comment: 9 pages, 7 figures. To appear in ACM MM 2018.
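    The two ingredients are easy to picture in code. Below is a minimal PyTorch sketch, assuming illustrative layer widths: pixel locations are randomly permuted to destroy spatial structure, and the network then uses only 1×1 kernels.

```python
# A minimal sketch of pixel shuffling plus a purely point-wise (1x1) CNN.
# Layer widths are illustrative, not the paper's network.
import torch
import torch.nn as nn

def shuffle_pixels(x):
    """Randomly permute pixel locations within each image (N, C, H, W)."""
    n, c, h, w = x.shape
    flat = x.reshape(n, c, h * w)
    perm = torch.randperm(h * w)
    return flat[:, :, perm].reshape(n, c, h, w)

pointwise_net = nn.Sequential(          # every kernel is 1x1
    nn.Conv2d(3, 32, kernel_size=1), nn.ReLU(),
    nn.Conv2d(32, 32, kernel_size=1), nn.ReLU(),
    nn.Conv2d(32, 3, kernel_size=1),
)

x = torch.randn(4, 3, 64, 64)
y = pointwise_net(shuffle_pixels(x))
print(y.shape)  # torch.Size([4, 3, 64, 64])
```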

    Attention as Activation

    Activation functions and attention mechanisms are typically treated as having different purposes and have evolved differently. However, both concepts can be formulated as a non-linear gating function. Inspired by their similarity, we propose a novel type of activation units called attentional activation (ATAC) units as a unification of activation functions and attention mechanisms. In particular, we propose a local channel attention module for the simultaneous non-linear activation and element-wise feature refinement, which locally aggregates point-wise cross-channel feature contexts. By replacing the well-known rectified linear units with such ATAC units in convolutional networks, we can construct fully attentional networks that perform significantly better with a modest number of additional parameters. We conducted detailed ablation studies on the ATAC units using several host networks with varying network depths to empirically verify the effectiveness and efficiency of the units. Furthermore, we compared the performance of the ATAC units against existing activation functions as well as other attention mechanisms on the CIFAR-10, CIFAR-100, and ImageNet datasets. Our experimental results show that networks constructed with the proposed ATAC units generally yield performance gains over their competitors given a comparable number of parameters.
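    A minimal PyTorch sketch of an ATAC-style unit, as the abstract describes it: a non-linear gate produced by a local channel attention module built from point-wise convolutions, used in place of a ReLU. The bottleneck ratio and exact layer layout are illustrative, not the paper's released design.

```python
# A minimal sketch of an attentional activation unit: x * sigmoid(gate(x)),
# where the gate aggregates local cross-channel context with 1x1 convs.
import torch
import torch.nn as nn

class ATAC(nn.Module):
    def __init__(self, channels, reduction=4):
        super().__init__()
        mid = max(channels // reduction, 1)
        self.gate = nn.Sequential(   # local (per-pixel) cross-channel context
            nn.Conv2d(channels, mid, kernel_size=1),
            nn.BatchNorm2d(mid),
            nn.ReLU(),
            nn.Conv2d(mid, channels, kernel_size=1),
            nn.BatchNorm2d(channels),
            nn.Sigmoid(),
        )

    def forward(self, x):
        return x * self.gate(x)  # element-wise gating = activation + attention

x = torch.randn(2, 16, 32, 32)
print(ATAC(16)(x).shape)  # torch.Size([2, 16, 32, 32])
```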

    Attentional Feature Fusion

    Feature fusion, the combination of features from different layers or branches, is an omnipresent part of modern network architectures. It is often implemented via simple operations, such as summation or concatenation, but this might not be the best choice. In this work, we propose a uniform and general scheme, namely attentional feature fusion, which is applicable for most common scenarios, including feature fusion induced by short and long skip connections as well as within Inception layers. To better fuse features of inconsistent semantics and scales, we propose a multi-scale channel attention module, which addresses issues that arise when fusing features given at different scales. We also demonstrate that the initial integration of feature maps can become a bottleneck and that this issue can be alleviated by adding another level of attention, which we refer to as iterative attentional feature fusion. With fewer layers or parameters, our models outperform state-of-the-art networks on both CIFAR-100 and ImageNet datasets, which suggests that more sophisticated attention mechanisms for feature fusion hold great potential to consistently yield better results compared to their direct counterparts. Our code and trained models are available online.
    Comment: Accepted by WACV 2021.
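    A minimal PyTorch sketch of the fusion scheme the abstract describes: two feature maps X and Y are blended by a soft mask from a multi-scale channel attention module that mixes a global (pooled) branch with a local point-wise branch. The layer sizes follow the abstract's description, not the authors' released code.

```python
# A minimal sketch of attentional feature fusion:
# fused = M * X + (1 - M) * Y, with M = MSCAM(X + Y).
import torch
import torch.nn as nn

class MSCAM(nn.Module):
    """Multi-scale channel attention: global + local channel contexts."""
    def __init__(self, channels, reduction=4):
        super().__init__()
        mid = max(channels // reduction, 1)
        def branch():
            return nn.Sequential(
                nn.Conv2d(channels, mid, 1), nn.ReLU(),
                nn.Conv2d(mid, channels, 1),
            )
        self.local = branch()                              # per-pixel context
        self.glob = nn.Sequential(nn.AdaptiveAvgPool2d(1), branch())

    def forward(self, x):
        return torch.sigmoid(self.local(x) + self.glob(x))  # broadcast add

class AFF(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.attn = MSCAM(channels)

    def forward(self, x, y):
        m = self.attn(x + y)           # initial integration by summation
        return m * x + (1.0 - m) * y   # soft selection between branches

x, y = torch.randn(2, 16, 32, 32), torch.randn(2, 16, 32, 32)
print(AFF(16)(x, y).shape)  # torch.Size([2, 16, 32, 32])
```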