3,180 research outputs found
Shepherding Slots to Objects: Towards Stable and Robust Object-Centric Learning
Object-centric learning (OCL) aspires general and compositional understanding
of scenes by representing a scene as a collection of object-centric
representations. OCL has also been extended to multi-view image and video
datasets to apply various data-driven inductive biases by utilizing geometric
or temporal information in the multi-image data. Single-view images carry less
information about how to disentangle a given scene than videos or multi-view
images do. Hence, owing to the difficulty of applying inductive biases, OCL for
single-view images remains challenging, resulting in inconsistent learning of
object-centric representation. To this end, we introduce a novel OCL framework
for single-view images, SLot Attention via SHepherding (SLASH), which consists
of two simple-yet-effective modules on top of Slot Attention. The new modules,
Attention Refining Kernel (ARK) and Intermediate Point Predictor and Encoder
(IPPE), respectively, prevent slots from being distracted by the background
noise and indicate locations for slots to focus on to facilitate learning of
object-centric representation. We also propose a weak semi-supervision approach
for OCL, whilst our proposed framework can be used without any assistant
annotation during the inference. Experiments show that our proposed method
enables consistent learning of object-centric representation and achieves
strong performance across four datasets. Code is available at
\url{https://github.com/object-understanding/SLASH}
Leveraging Image Augmentation for Object Manipulation: Towards Interpretable Controllability in Object-Centric Learning
The binding problem in artificial neural networks is actively explored with
the goal of achieving human-level recognition skills through the comprehension
of the world in terms of symbol-like entities. Especially in the field of
computer vision, object-centric learning (OCL) is extensively researched to
better understand complex scenes by acquiring object representations or slots.
While recent studies in OCL have made strides with complex images or videos,
the interpretability and interactivity over object representation remain
largely uncharted, still holding promise in the field of OCL. In this paper, we
introduce a novel method, Slot Attention with Image Augmentation (SlotAug), to
explore the possibility of learning interpretable controllability over slots in
a self-supervised manner by utilizing an image augmentation strategy. We also
devise the concept of sustainability in controllable slots by introducing
iterative and reversible controls over slots with two proposed submethods:
Auxiliary Identity Manipulation and Slot Consistency Loss. Extensive empirical
studies and theoretical validation confirm the effectiveness of our approach,
offering a novel capability for interpretable and sustainable control of object
representations
Identification of Correlated Damage Parameters Under Noise and Bias Using
ABSTRACT This paper presents statistical model parameter identification using Bayesian inference when parameters are correlated and observed data have noise and bias. The method is explained using the Paris model that describes crack growth in a plate under mode I loading. It is assumed the observed data are obtained through structural health monitoring systems, which may have random noise and deterministic bias. It was found that strong correlation exists (a) between two model parameters of the Paris model, and (b) between initially measured crack size and bias. As the level of noise increases, the Bayesian inference was not able to identify the correlated parameters. However, the remaining useful life was predicted accurately because the identification errors in correlated parameters were compensated by each other
Identification of Correlated Damage Parameters Under Noise and Bias Using
Abstract This article presents statistical model parameter identification using Bayesian inference when parameters are correlated and observed data have noise and bias. The method is explained using the Paris model that describes crack growth in a plate under mode I loading. It is assumed that the observed data are obtained through structural health monitoring systems, which may have random noise and deterministic bias. It was found that a strong correlation exists (a) between two parameters of the Paris model, and (b) between initially measured crack size and bias. As the level of noise increases, the Bayesian inference was not able to identify the correlated parameters. However, the remaining useful life was predicted accurately because the identification errors in correlated parameters were compensated by each other. It was also found that the Bayesian identification process converges slowly when the level of noise is high
Echo Path Transfer Function Estimation for Spectral Subtraction-based Acoustic Echo Suppression
In this study, we propose a novel technique for spectral subtraction (SS)-based acoustic echo suppression (AES). Conventional AES methods based on SS apply fixed weights to the estimated echo path transfer function (EPTF) at the current signal segment and to the EPTF estimated until the previous time interval. We propose a new EPTF estimation approach that adaptively updates the weight parameters in response to abrupt changes in the acoustic environment. From the experiments, we conclude that the developed techniques can be successfully used for the SS-based AES systems
- …