3,180 research outputs found

    Shepherding Slots to Objects: Towards Stable and Robust Object-Centric Learning

    Full text link
    Object-centric learning (OCL) aspires general and compositional understanding of scenes by representing a scene as a collection of object-centric representations. OCL has also been extended to multi-view image and video datasets to apply various data-driven inductive biases by utilizing geometric or temporal information in the multi-image data. Single-view images carry less information about how to disentangle a given scene than videos or multi-view images do. Hence, owing to the difficulty of applying inductive biases, OCL for single-view images remains challenging, resulting in inconsistent learning of object-centric representation. To this end, we introduce a novel OCL framework for single-view images, SLot Attention via SHepherding (SLASH), which consists of two simple-yet-effective modules on top of Slot Attention. The new modules, Attention Refining Kernel (ARK) and Intermediate Point Predictor and Encoder (IPPE), respectively, prevent slots from being distracted by the background noise and indicate locations for slots to focus on to facilitate learning of object-centric representation. We also propose a weak semi-supervision approach for OCL, whilst our proposed framework can be used without any assistant annotation during the inference. Experiments show that our proposed method enables consistent learning of object-centric representation and achieves strong performance across four datasets. Code is available at \url{https://github.com/object-understanding/SLASH}

    Leveraging Image Augmentation for Object Manipulation: Towards Interpretable Controllability in Object-Centric Learning

    Full text link
    The binding problem in artificial neural networks is actively explored with the goal of achieving human-level recognition skills through the comprehension of the world in terms of symbol-like entities. Especially in the field of computer vision, object-centric learning (OCL) is extensively researched to better understand complex scenes by acquiring object representations or slots. While recent studies in OCL have made strides with complex images or videos, the interpretability and interactivity over object representation remain largely uncharted, still holding promise in the field of OCL. In this paper, we introduce a novel method, Slot Attention with Image Augmentation (SlotAug), to explore the possibility of learning interpretable controllability over slots in a self-supervised manner by utilizing an image augmentation strategy. We also devise the concept of sustainability in controllable slots by introducing iterative and reversible controls over slots with two proposed submethods: Auxiliary Identity Manipulation and Slot Consistency Loss. Extensive empirical studies and theoretical validation confirm the effectiveness of our approach, offering a novel capability for interpretable and sustainable control of object representations

    Identification of Correlated Damage Parameters Under Noise and Bias Using

    Get PDF
    ABSTRACT This paper presents statistical model parameter identification using Bayesian inference when parameters are correlated and observed data have noise and bias. The method is explained using the Paris model that describes crack growth in a plate under mode I loading. It is assumed the observed data are obtained through structural health monitoring systems, which may have random noise and deterministic bias. It was found that strong correlation exists (a) between two model parameters of the Paris model, and (b) between initially measured crack size and bias. As the level of noise increases, the Bayesian inference was not able to identify the correlated parameters. However, the remaining useful life was predicted accurately because the identification errors in correlated parameters were compensated by each other

    Identification of Correlated Damage Parameters Under Noise and Bias Using

    Get PDF
    Abstract This article presents statistical model parameter identification using Bayesian inference when parameters are correlated and observed data have noise and bias. The method is explained using the Paris model that describes crack growth in a plate under mode I loading. It is assumed that the observed data are obtained through structural health monitoring systems, which may have random noise and deterministic bias. It was found that a strong correlation exists (a) between two parameters of the Paris model, and (b) between initially measured crack size and bias. As the level of noise increases, the Bayesian inference was not able to identify the correlated parameters. However, the remaining useful life was predicted accurately because the identification errors in correlated parameters were compensated by each other. It was also found that the Bayesian identification process converges slowly when the level of noise is high

    Echo Path Transfer Function Estimation for Spectral Subtraction-based Acoustic Echo Suppression

    Get PDF
    In this study, we propose a novel technique for spectral subtraction (SS)-based acoustic echo suppression (AES). Conventional AES methods based on SS apply fixed weights to the estimated echo path transfer function (EPTF) at the current signal segment and to the EPTF estimated until the previous time interval. We propose a new EPTF estimation approach that adaptively updates the weight parameters in response to abrupt changes in the acoustic environment. From the experiments, we conclude that the developed techniques can be successfully used for the SS-based AES systems
    • …
    corecore