574 research outputs found
Angle-based hierarchical classification using exact label embedding
Hierarchical classification problems are commonly seen in practice. However,
most existing methods do not fully utilize the hierarchical information among
class labels. In this paper, a novel label embedding approach is proposed,
which keeps the hierarchy of labels exactly, and reduces the complexity of the
hypothesis space significantly. Based on the newly proposed label embedding
approach, a new angle-based classifier is developed for hierarchical
classification. Moreover, to handle massive data, a new (weighted) linear loss
is designed, which has a closed form solution and is computationally efficient.
Theoretical properties of the new method are established and intensive
numerical comparisons with other methods are conducted. Both simulations and
applications in document categorization demonstrate the advantages of the
proposed method
Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model Control
Pretrained language models have demonstrated extraordinary capabilities in
language generation. However, real-world tasks often require controlling the
distribution of generated text in order to mitigate bias, promote fairness, and
achieve personalization. Existing techniques for controlling the distribution
of generated text only work with quantified distributions, which require
pre-defined categories, proportions of the distribution, or an existing corpus
following the desired distributions. However, many important distributions,
such as personal preferences, are unquantified. In this work, we tackle the
problem of generating text following arbitrary distributions (quantified and
unquantified) by proposing Nano, a few-shot human-in-the-loop training
algorithm that continuously learns from human feedback. Nano achieves
state-of-the-art results on single topic/attribute as well as quantified
distribution control compared to previous works. We also show that Nano is able
to learn unquantified distributions, achieves personalization, and captures
differences between different individuals' personal preferences with high
sample efficiency.Comment: Accepted to ACL Findings 202
X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation
Recent advancements in automatic 3D avatar generation guided by text have
made significant progress. However, existing methods have limitations such as
oversaturation and low-quality output. To address these challenges, we propose
X-Oscar, a progressive framework for generating high-quality animatable avatars
from text prompts. It follows a sequential Geometry->Texture->Animation
paradigm, simplifying optimization through step-by-step generation. To tackle
oversaturation, we introduce Adaptive Variational Parameter (AVP), representing
avatars as an adaptive distribution during training. Additionally, we present
Avatar-aware Score Distillation Sampling (ASDS), a novel technique that
incorporates avatar-aware noise into rendered images for improved generation
quality during optimization. Extensive evaluations confirm the superiority of
X-Oscar over existing text-to-3D and text-to-avatar approaches. Our anonymous
project page: https://xmu-xiaoma666.github.io/Projects/X-Oscar/.Comment: ICML202
MultiZoo & MultiBench: A Standardized Toolkit for Multimodal Deep Learning
Learning multimodal representations involves integrating information from
multiple heterogeneous sources of data. In order to accelerate progress towards
understudied modalities and tasks while ensuring real-world robustness, we
release MultiZoo, a public toolkit consisting of standardized implementations
of > 20 core multimodal algorithms and MultiBench, a large-scale benchmark
spanning 15 datasets, 10 modalities, 20 prediction tasks, and 6 research areas.
Together, these provide an automated end-to-end machine learning pipeline that
simplifies and standardizes data loading, experimental setup, and model
evaluation. To enable holistic evaluation, we offer a comprehensive methodology
to assess (1) generalization, (2) time and space complexity, and (3) modality
robustness. MultiBench paves the way towards a better understanding of the
capabilities and limitations of multimodal models, while ensuring ease of use,
accessibility, and reproducibility. Our toolkits are publicly available, will
be regularly updated, and welcome inputs from the community.Comment: JMLR Open Source Software 2023, Code available at
https://github.com/pliang279/MultiBenc
Problematic Internet Use Among Residential College Students During the COVID-19 Lockdown: A Social Network Analysis Approach
Glucagon-like peptide-1 receptor agonists as a disease-modifying therapy for knee osteoarthritis mediated by weight loss:Findings from the Shanghai Osteoarthritis Cohort
Objective: Obesity is a risk factor for knee osteoarthritis (KOA) development and progression. Glucagon-like peptide-1 receptor agonists (GLP-1RAs) are indicated for type 2 diabetes mellitus (T2DM) and obesity. However, whether KOA patients can benefit from GLP-1RA therapies has not been sufficiently investigated, especially in the long term. Methods: The Shanghai Osteoarthritis Cohort study is a prospective, observational, multicentre study of >40 000 adults with clinically diagnosed osteoarthritis aged >45 years in Shanghai. We identified all KOA participants with comorbid T2DM enrolled from 1 January 2011 to 1 January 2017. Primary outcome was incidence of knee surgery after enrolment. Secondary outcomes included pain-relieving medication use, number of intra-articular therapies, Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) and medial femorotibial joint cartilage thickness. To evaluate the effects of GLP-1RA, we performed before-and-after comparison and comparison with participants who had no GLP-1RA exposure. Results: For an intergroup comparison (non-GLP-1RA vs GLP-1RA), more weight loss (adjusted mean difference in weight change from baseline-7.29 kg (95% CI-8.07 to-6.50 kg), p<0.001) and lower incidence of knee surgery (93/1574 (5.9%) vs 4/233 (1.7%), adjusted p=0.014) were observed in the GLP-1RA group. Statistically significant differences in mean change from baseline for the WOMAC total and pain subscale scores were observed (adjusted mean difference in WOMAC total score-1.46 (95% CI-2.84 to-0.08), p=0.038; adjusted mean difference in WOMAC pain subscore-3.37 (95% CI-5.79 to-0.94), p=0.007). Cartilage-loss velocity of the medial femorotibial joint was significantly lower in the GLP-1RA group postadjustment for baseline characteristics (adjusted mean difference-0.02 mm (95% CI-0.03 to-0.002 mm), p=0.004). For the before-and-after comparison within the GLP-1RA group, we observed a significant decrease of symptom-relieving medication consumption and cartilage loss velocity of medial femorotibial joint (after-treatment vs before-treatment:-0.03±0.05 vs-0.05±0.07 mm/year, p<0.001). The association between GLP-1RA exposure and decreased incidence of knee surgery was mediated by weight reduction (mediation proportion: 32.1%), instead of glycaemic control (too small to calculate). Conclusion: With sufficient treatment duration, GLP-1RA therapies might be disease-modifying for KOA patients with comorbid T2DM, possibly mediated by weight loss. Further investigation is needed to elucidate effects of GLP-1RA on disease process, joint structure and patient-reported outcomes of osteoarthritis.</p
The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge
Discrete speech tokens have been more and more popular in multiple speech
processing fields, including automatic speech recognition (ASR), text-to-speech
(TTS) and singing voice synthesis (SVS). In this paper, we describe the systems
developed by the SJTU X-LANCE group for the TTS (acoustic + vocoder), SVS, and
ASR tracks in the Interspeech 2024 Speech Processing Using Discrete Speech Unit
Challenge. Notably, we achieved 1st rank on the leaderboard in the TTS track
both with the whole training set and only 1h training data, with the highest
UTMOS score and lowest bitrate among all submissions.Comment: 5 pages, 3 figures. Report of a challeng
MLIP: Efficient Multi-Perspective Language-Image Pretraining with Exhaustive Data Utilization
Contrastive Language-Image Pretraining (CLIP) has achieved remarkable
success, leading to rapid advancements in multimodal studies. However, CLIP
faces a notable challenge in terms of inefficient data utilization. It relies
on a single contrastive supervision for each image-text pair during
representation learning, disregarding a substantial amount of valuable
information that could offer richer supervision. Additionally, the retention of
non-informative tokens leads to increased computational demands and time costs,
particularly in CLIP's ViT image encoder. To address these issues, we propose
Multi-Perspective Language-Image Pretraining (MLIP). In MLIP, we leverage the
frequency transform's sensitivity to both high and low-frequency variations,
which complements the spatial domain's sensitivity limited to low-frequency
variations only. By incorporating frequency transforms and token-level
alignment, we expand CILP's single supervision into multi-domain and
multi-level supervision, enabling a more thorough exploration of informative
image features. Additionally, we introduce a token merging method guided by
comprehensive semantics from the frequency and spatial domains. This allows us
to merge tokens to multi-granularity tokens with a controllable compression
rate to accelerate CLIP. Extensive experiments validate the effectiveness of
our design.Comment: ICML 202
Polyamine-Mediated Ferroptosis Amplification Acts as a Targetable Vulnerability in Cancer
Targeting ferroptosis, an iron-dependent form of regulated cell death triggered by the lethal overload of lipid peroxides, in cancer therapy is impeded by our limited understanding of the intersection of tumour’s metabolic feature and ferroptosis vulnerability. In the present study, arginine is identified as a ferroptotic promoter using a metabolites library. This effect is mainly achieved through arginine’s conversion to polyamines, which exerts their potent ferroptosis-promoting property in an H2O2-dependent manner. Notably, the expression of ornithine decarboxylase 1 (ODC1), the critical enzyme catalysing polyamine synthesis, is significantly activated by the ferroptosis signal——iron overload——through WNT/MYC signalling, as well as the subsequent elevated polyamine synthesis, thus forming a ferroptosis-iron overload-WNT/MYC-ODC1-polyamine-H2O2 positive feedback loop that amplifies ferroptosis. Meanwhile, we notice that ferroptotic cells release enhanced polyamine-containing extracellular vesicles into the microenvironment, thereby further sensitizing neighbouring cells to ferroptosis and accelerating the “spread” of ferroptosis in the tumour region. Besides, polyamine supplementation also sensitizes cancer cells or xenograft tumours to radiotherapy or chemotherapy through inducing ferroptosis. Considering that cancer cells are often characterized by elevated intracellular polyamine pools, our results indicate that polyamine metabolism exposes a targetable vulnerability to ferroptosis and represents an exciting opportunity for therapeutic strategies for cancer
- …