516 research outputs found
CBA: Improving Online Continual Learning via Continual Bias Adaptor
Online continual learning (CL) aims to learn new knowledge and consolidate
previously learned knowledge from non-stationary data streams. Due to the
time-varying training setting, the model learned from a changing distribution
easily forgets the previously learned knowledge and biases toward the newly
received task. To address this problem, we propose a Continual Bias Adaptor
(CBA) module to augment the classifier network to adapt to catastrophic
distribution change during training, such that the classifier network is able
to learn a stable consolidation of previously learned tasks. In the testing
stage, CBA can be removed which introduces no additional computation cost and
memory overhead. We theoretically reveal the reason why the proposed method can
effectively alleviate catastrophic distribution shifts, and empirically
demonstrate its effectiveness through extensive experiments based on four
rehearsal-based baselines and three public continual learning benchmarks.Comment: Accepted by ICCV 202
Learning to Purify Noisy Labels via Meta Soft Label Corrector
Recent deep neural networks (DNNs) can easily overfit to biased training data
with noisy labels. Label correction strategy is commonly used to alleviate this
issue by designing a method to identity suspected noisy labels and then correct
them. Current approaches to correcting corrupted labels usually need certain
pre-defined label correction rules or manually preset hyper-parameters. These
fixed settings make it hard to apply in practice since the accurate label
correction usually related with the concrete problem, training data and the
temporal information hidden in dynamic iterations of training process. To
address this issue, we propose a meta-learning model which could estimate soft
labels through meta-gradient descent step under the guidance of noise-free meta
data. By viewing the label correction procedure as a meta-process and using a
meta-learner to automatically correct labels, we could adaptively obtain
rectified soft labels iteratively according to current training problems
without manually preset hyper-parameters. Besides, our method is model-agnostic
and we can combine it with any other existing model with ease. Comprehensive
experiments substantiate the superiority of our method in both synthetic and
real-world problems with noisy labels compared with current SOTA label
correction strategies.Comment: 12 pages,6 figure
The Long-Baseline Neutrino Experiment: Exploring Fundamental Symmetries of the Universe
The preponderance of matter over antimatter in the early Universe, the
dynamics of the supernova bursts that produced the heavy elements necessary for
life and whether protons eventually decay --- these mysteries at the forefront
of particle physics and astrophysics are key to understanding the early
evolution of our Universe, its current state and its eventual fate. The
Long-Baseline Neutrino Experiment (LBNE) represents an extensively developed
plan for a world-class experiment dedicated to addressing these questions. LBNE
is conceived around three central components: (1) a new, high-intensity
neutrino source generated from a megawatt-class proton accelerator at Fermi
National Accelerator Laboratory, (2) a near neutrino detector just downstream
of the source, and (3) a massive liquid argon time-projection chamber deployed
as a far detector deep underground at the Sanford Underground Research
Facility. This facility, located at the site of the former Homestake Mine in
Lead, South Dakota, is approximately 1,300 km from the neutrino source at
Fermilab -- a distance (baseline) that delivers optimal sensitivity to neutrino
charge-parity symmetry violation and mass ordering effects. This ambitious yet
cost-effective design incorporates scalability and flexibility and can
accommodate a variety of upgrades and contributions. With its exceptional
combination of experimental configuration, technical capabilities, and
potential for transformative discoveries, LBNE promises to be a vital facility
for the field of particle physics worldwide, providing physicists from around
the globe with opportunities to collaborate in a twenty to thirty year program
of exciting science. In this document we provide a comprehensive overview of
LBNE's scientific objectives, its place in the landscape of neutrino physics
worldwide, the technologies it will incorporate and the capabilities it will
possess.Comment: Major update of previous version. This is the reference document for
LBNE science program and current status. Chapters 1, 3, and 9 provide a
comprehensive overview of LBNE's scientific objectives, its place in the
landscape of neutrino physics worldwide, the technologies it will incorporate
and the capabilities it will possess. 288 pages, 116 figure
Influence of Natural Gas Hydrate Distribution Patterns on the Macroscale–Mesoscale Mechanical Properties of Hydrate-Bearing Sediments
Studying the mechanical characteristics of hydrate-bearing sediments (HBS) contributes to the comprehensive understanding of the mechanical behavior in environments with natural gas hydrate (NGH) occurrences. Simultaneously, the distribution patterns of hydrates significantly influence the strength, deformation, and stability of HBS. Therefore, this paper employs particle flow code (PFC) to conduct biaxial discrete element simulations on specimens of HBS with different hydrate distribution patterns, revealing the macroscale–mesoscale mechanical properties, evolution patterns, and destructive mechanisms. The results indicate that the strain-softening behavior of HBS specimens strengthens with the increase in hydrate layer thickness, leading to higher peak strength and E50 values. During the gradual movement of the hydrate layer position (Ay) from both ends to the center of the specimen (Ay = 0.40 mm → Ay = 20 mm), the strain-softening behavior weakens. However, when Ay = 20 mm, the specimen exhibits evident strain-softening behavior again. Moreover, with an increase in the angle between the hydrate layer and the horizontal direction (α) greater than 20°, the peak strength of the specimen increases, while E50 shows an overall decreasing trend. The influence of axial loads on the hydrate layer in specimens varies with α, with larger contact forces and fewer cracks observed for higher α values.Journal of Marine Science and Engineering, 12(1), art. no. 20; 2023journal articl
VP-LLM: Text-Driven 3D Volume Completion with Large Language Models through Patchification
Recent conditional 3D completion works have mainly relied on CLIP or BERT to
encode textual information, which cannot support complex instruction.
Meanwhile, large language models (LLMs) have shown great potential in
multi-modal understanding and generation tasks. Inspired by the recent
advancements of LLM, we present Volume Patch LLM (VP-LLM), which leverages LLMs
to perform conditional 3D completion in a single-forward pass. To integrate a
3D model into the LLM tokenization configuration, the incomplete 3D object is
first divided into small patches that can be encoded independently. These
encoded patches are then fed into an LLM along with the text prompt,
instructing the LLM to capture the relations between these patches as well as
injecting semantic meanings into the 3D object. Our results demonstrate a
strong ability of LLMs to interpret complex text instructions and understand 3D
objects, surpassing state-of-the-art diffusion-based 3D completion models in
generation quality.Comment: 27pages, 16 figure
- …
