516 research outputs found

    CBA: Improving Online Continual Learning via Continual Bias Adaptor

    Full text link
    Online continual learning (CL) aims to learn new knowledge and consolidate previously learned knowledge from non-stationary data streams. Due to the time-varying training setting, the model learned from a changing distribution easily forgets the previously learned knowledge and biases toward the newly received task. To address this problem, we propose a Continual Bias Adaptor (CBA) module to augment the classifier network to adapt to catastrophic distribution change during training, such that the classifier network is able to learn a stable consolidation of previously learned tasks. In the testing stage, CBA can be removed which introduces no additional computation cost and memory overhead. We theoretically reveal the reason why the proposed method can effectively alleviate catastrophic distribution shifts, and empirically demonstrate its effectiveness through extensive experiments based on four rehearsal-based baselines and three public continual learning benchmarks.Comment: Accepted by ICCV 202

    Learning to Purify Noisy Labels via Meta Soft Label Corrector

    Full text link
    Recent deep neural networks (DNNs) can easily overfit to biased training data with noisy labels. Label correction strategy is commonly used to alleviate this issue by designing a method to identity suspected noisy labels and then correct them. Current approaches to correcting corrupted labels usually need certain pre-defined label correction rules or manually preset hyper-parameters. These fixed settings make it hard to apply in practice since the accurate label correction usually related with the concrete problem, training data and the temporal information hidden in dynamic iterations of training process. To address this issue, we propose a meta-learning model which could estimate soft labels through meta-gradient descent step under the guidance of noise-free meta data. By viewing the label correction procedure as a meta-process and using a meta-learner to automatically correct labels, we could adaptively obtain rectified soft labels iteratively according to current training problems without manually preset hyper-parameters. Besides, our method is model-agnostic and we can combine it with any other existing model with ease. Comprehensive experiments substantiate the superiority of our method in both synthetic and real-world problems with noisy labels compared with current SOTA label correction strategies.Comment: 12 pages,6 figure

    The Long-Baseline Neutrino Experiment: Exploring Fundamental Symmetries of the Universe

    Get PDF
    The preponderance of matter over antimatter in the early Universe, the dynamics of the supernova bursts that produced the heavy elements necessary for life and whether protons eventually decay --- these mysteries at the forefront of particle physics and astrophysics are key to understanding the early evolution of our Universe, its current state and its eventual fate. The Long-Baseline Neutrino Experiment (LBNE) represents an extensively developed plan for a world-class experiment dedicated to addressing these questions. LBNE is conceived around three central components: (1) a new, high-intensity neutrino source generated from a megawatt-class proton accelerator at Fermi National Accelerator Laboratory, (2) a near neutrino detector just downstream of the source, and (3) a massive liquid argon time-projection chamber deployed as a far detector deep underground at the Sanford Underground Research Facility. This facility, located at the site of the former Homestake Mine in Lead, South Dakota, is approximately 1,300 km from the neutrino source at Fermilab -- a distance (baseline) that delivers optimal sensitivity to neutrino charge-parity symmetry violation and mass ordering effects. This ambitious yet cost-effective design incorporates scalability and flexibility and can accommodate a variety of upgrades and contributions. With its exceptional combination of experimental configuration, technical capabilities, and potential for transformative discoveries, LBNE promises to be a vital facility for the field of particle physics worldwide, providing physicists from around the globe with opportunities to collaborate in a twenty to thirty year program of exciting science. In this document we provide a comprehensive overview of LBNE's scientific objectives, its place in the landscape of neutrino physics worldwide, the technologies it will incorporate and the capabilities it will possess.Comment: Major update of previous version. This is the reference document for LBNE science program and current status. Chapters 1, 3, and 9 provide a comprehensive overview of LBNE's scientific objectives, its place in the landscape of neutrino physics worldwide, the technologies it will incorporate and the capabilities it will possess. 288 pages, 116 figure

    Influence of Natural Gas Hydrate Distribution Patterns on the Macroscale–Mesoscale Mechanical Properties of Hydrate-Bearing Sediments

    Get PDF
    Studying the mechanical characteristics of hydrate-bearing sediments (HBS) contributes to the comprehensive understanding of the mechanical behavior in environments with natural gas hydrate (NGH) occurrences. Simultaneously, the distribution patterns of hydrates significantly influence the strength, deformation, and stability of HBS. Therefore, this paper employs particle flow code (PFC) to conduct biaxial discrete element simulations on specimens of HBS with different hydrate distribution patterns, revealing the macroscale–mesoscale mechanical properties, evolution patterns, and destructive mechanisms. The results indicate that the strain-softening behavior of HBS specimens strengthens with the increase in hydrate layer thickness, leading to higher peak strength and E50 values. During the gradual movement of the hydrate layer position (Ay) from both ends to the center of the specimen (Ay = 0.40 mm → Ay = 20 mm), the strain-softening behavior weakens. However, when Ay = 20 mm, the specimen exhibits evident strain-softening behavior again. Moreover, with an increase in the angle between the hydrate layer and the horizontal direction (α) greater than 20°, the peak strength of the specimen increases, while E50 shows an overall decreasing trend. The influence of axial loads on the hydrate layer in specimens varies with α, with larger contact forces and fewer cracks observed for higher α values.Journal of Marine Science and Engineering, 12(1), art. no. 20; 2023journal articl

    VP-LLM: Text-Driven 3D Volume Completion with Large Language Models through Patchification

    Full text link
    Recent conditional 3D completion works have mainly relied on CLIP or BERT to encode textual information, which cannot support complex instruction. Meanwhile, large language models (LLMs) have shown great potential in multi-modal understanding and generation tasks. Inspired by the recent advancements of LLM, we present Volume Patch LLM (VP-LLM), which leverages LLMs to perform conditional 3D completion in a single-forward pass. To integrate a 3D model into the LLM tokenization configuration, the incomplete 3D object is first divided into small patches that can be encoded independently. These encoded patches are then fed into an LLM along with the text prompt, instructing the LLM to capture the relations between these patches as well as injecting semantic meanings into the 3D object. Our results demonstrate a strong ability of LLMs to interpret complex text instructions and understand 3D objects, surpassing state-of-the-art diffusion-based 3D completion models in generation quality.Comment: 27pages, 16 figure
    corecore