574 research outputs found

    Angle-based hierarchical classification using exact label embedding

    Get PDF
    Hierarchical classification problems are commonly seen in practice. However, most existing methods do not fully utilize the hierarchical information among class labels. In this paper, a novel label embedding approach is proposed, which keeps the hierarchy of labels exactly, and reduces the complexity of the hypothesis space significantly. Based on the newly proposed label embedding approach, a new angle-based classifier is developed for hierarchical classification. Moreover, to handle massive data, a new (weighted) linear loss is designed, which has a closed form solution and is computationally efficient. Theoretical properties of the new method are established and intensive numerical comparisons with other methods are conducted. Both simulations and applications in document categorization demonstrate the advantages of the proposed method

    Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model Control

    Full text link
    Pretrained language models have demonstrated extraordinary capabilities in language generation. However, real-world tasks often require controlling the distribution of generated text in order to mitigate bias, promote fairness, and achieve personalization. Existing techniques for controlling the distribution of generated text only work with quantified distributions, which require pre-defined categories, proportions of the distribution, or an existing corpus following the desired distributions. However, many important distributions, such as personal preferences, are unquantified. In this work, we tackle the problem of generating text following arbitrary distributions (quantified and unquantified) by proposing Nano, a few-shot human-in-the-loop training algorithm that continuously learns from human feedback. Nano achieves state-of-the-art results on single topic/attribute as well as quantified distribution control compared to previous works. We also show that Nano is able to learn unquantified distributions, achieves personalization, and captures differences between different individuals' personal preferences with high sample efficiency.Comment: Accepted to ACL Findings 202

    X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation

    Full text link
    Recent advancements in automatic 3D avatar generation guided by text have made significant progress. However, existing methods have limitations such as oversaturation and low-quality output. To address these challenges, we propose X-Oscar, a progressive framework for generating high-quality animatable avatars from text prompts. It follows a sequential Geometry->Texture->Animation paradigm, simplifying optimization through step-by-step generation. To tackle oversaturation, we introduce Adaptive Variational Parameter (AVP), representing avatars as an adaptive distribution during training. Additionally, we present Avatar-aware Score Distillation Sampling (ASDS), a novel technique that incorporates avatar-aware noise into rendered images for improved generation quality during optimization. Extensive evaluations confirm the superiority of X-Oscar over existing text-to-3D and text-to-avatar approaches. Our anonymous project page: https://xmu-xiaoma666.github.io/Projects/X-Oscar/.Comment: ICML202

    MultiZoo & MultiBench: A Standardized Toolkit for Multimodal Deep Learning

    Full text link
    Learning multimodal representations involves integrating information from multiple heterogeneous sources of data. In order to accelerate progress towards understudied modalities and tasks while ensuring real-world robustness, we release MultiZoo, a public toolkit consisting of standardized implementations of > 20 core multimodal algorithms and MultiBench, a large-scale benchmark spanning 15 datasets, 10 modalities, 20 prediction tasks, and 6 research areas. Together, these provide an automated end-to-end machine learning pipeline that simplifies and standardizes data loading, experimental setup, and model evaluation. To enable holistic evaluation, we offer a comprehensive methodology to assess (1) generalization, (2) time and space complexity, and (3) modality robustness. MultiBench paves the way towards a better understanding of the capabilities and limitations of multimodal models, while ensuring ease of use, accessibility, and reproducibility. Our toolkits are publicly available, will be regularly updated, and welcome inputs from the community.Comment: JMLR Open Source Software 2023, Code available at https://github.com/pliang279/MultiBenc

    Glucagon-like peptide-1 receptor agonists as a disease-modifying therapy for knee osteoarthritis mediated by weight loss:Findings from the Shanghai Osteoarthritis Cohort

    Get PDF
    Objective: Obesity is a risk factor for knee osteoarthritis (KOA) development and progression. Glucagon-like peptide-1 receptor agonists (GLP-1RAs) are indicated for type 2 diabetes mellitus (T2DM) and obesity. However, whether KOA patients can benefit from GLP-1RA therapies has not been sufficiently investigated, especially in the long term. Methods: The Shanghai Osteoarthritis Cohort study is a prospective, observational, multicentre study of &gt;40 000 adults with clinically diagnosed osteoarthritis aged &gt;45 years in Shanghai. We identified all KOA participants with comorbid T2DM enrolled from 1 January 2011 to 1 January 2017. Primary outcome was incidence of knee surgery after enrolment. Secondary outcomes included pain-relieving medication use, number of intra-articular therapies, Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) and medial femorotibial joint cartilage thickness. To evaluate the effects of GLP-1RA, we performed before-and-after comparison and comparison with participants who had no GLP-1RA exposure. Results: For an intergroup comparison (non-GLP-1RA vs GLP-1RA), more weight loss (adjusted mean difference in weight change from baseline-7.29 kg (95% CI-8.07 to-6.50 kg), p&lt;0.001) and lower incidence of knee surgery (93/1574 (5.9%) vs 4/233 (1.7%), adjusted p=0.014) were observed in the GLP-1RA group. Statistically significant differences in mean change from baseline for the WOMAC total and pain subscale scores were observed (adjusted mean difference in WOMAC total score-1.46 (95% CI-2.84 to-0.08), p=0.038; adjusted mean difference in WOMAC pain subscore-3.37 (95% CI-5.79 to-0.94), p=0.007). Cartilage-loss velocity of the medial femorotibial joint was significantly lower in the GLP-1RA group postadjustment for baseline characteristics (adjusted mean difference-0.02 mm (95% CI-0.03 to-0.002 mm), p=0.004). For the before-and-after comparison within the GLP-1RA group, we observed a significant decrease of symptom-relieving medication consumption and cartilage loss velocity of medial femorotibial joint (after-treatment vs before-treatment:-0.03±0.05 vs-0.05±0.07 mm/year, p&lt;0.001). The association between GLP-1RA exposure and decreased incidence of knee surgery was mediated by weight reduction (mediation proportion: 32.1%), instead of glycaemic control (too small to calculate). Conclusion: With sufficient treatment duration, GLP-1RA therapies might be disease-modifying for KOA patients with comorbid T2DM, possibly mediated by weight loss. Further investigation is needed to elucidate effects of GLP-1RA on disease process, joint structure and patient-reported outcomes of osteoarthritis.</p

    The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge

    Full text link
    Discrete speech tokens have been more and more popular in multiple speech processing fields, including automatic speech recognition (ASR), text-to-speech (TTS) and singing voice synthesis (SVS). In this paper, we describe the systems developed by the SJTU X-LANCE group for the TTS (acoustic + vocoder), SVS, and ASR tracks in the Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge. Notably, we achieved 1st rank on the leaderboard in the TTS track both with the whole training set and only 1h training data, with the highest UTMOS score and lowest bitrate among all submissions.Comment: 5 pages, 3 figures. Report of a challeng

    MLIP: Efficient Multi-Perspective Language-Image Pretraining with Exhaustive Data Utilization

    Full text link
    Contrastive Language-Image Pretraining (CLIP) has achieved remarkable success, leading to rapid advancements in multimodal studies. However, CLIP faces a notable challenge in terms of inefficient data utilization. It relies on a single contrastive supervision for each image-text pair during representation learning, disregarding a substantial amount of valuable information that could offer richer supervision. Additionally, the retention of non-informative tokens leads to increased computational demands and time costs, particularly in CLIP's ViT image encoder. To address these issues, we propose Multi-Perspective Language-Image Pretraining (MLIP). In MLIP, we leverage the frequency transform's sensitivity to both high and low-frequency variations, which complements the spatial domain's sensitivity limited to low-frequency variations only. By incorporating frequency transforms and token-level alignment, we expand CILP's single supervision into multi-domain and multi-level supervision, enabling a more thorough exploration of informative image features. Additionally, we introduce a token merging method guided by comprehensive semantics from the frequency and spatial domains. This allows us to merge tokens to multi-granularity tokens with a controllable compression rate to accelerate CLIP. Extensive experiments validate the effectiveness of our design.Comment: ICML 202

    Polyamine-Mediated Ferroptosis Amplification Acts as a Targetable Vulnerability in Cancer

    Get PDF
    Targeting ferroptosis, an iron-dependent form of regulated cell death triggered by the lethal overload of lipid peroxides, in cancer therapy is impeded by our limited understanding of the intersection of tumour’s metabolic feature and ferroptosis vulnerability. In the present study, arginine is identified as a ferroptotic promoter using a metabolites library. This effect is mainly achieved through arginine’s conversion to polyamines, which exerts their potent ferroptosis-promoting property in an H2O2-dependent manner. Notably, the expression of ornithine decarboxylase 1 (ODC1), the critical enzyme catalysing polyamine synthesis, is significantly activated by the ferroptosis signal——iron overload——through WNT/MYC signalling, as well as the subsequent elevated polyamine synthesis, thus forming a ferroptosis-iron overload-WNT/MYC-ODC1-polyamine-H2O2 positive feedback loop that amplifies ferroptosis. Meanwhile, we notice that ferroptotic cells release enhanced polyamine-containing extracellular vesicles into the microenvironment, thereby further sensitizing neighbouring cells to ferroptosis and accelerating the “spread” of ferroptosis in the tumour region. Besides, polyamine supplementation also sensitizes cancer cells or xenograft tumours to radiotherapy or chemotherapy through inducing ferroptosis. Considering that cancer cells are often characterized by elevated intracellular polyamine pools, our results indicate that polyamine metabolism exposes a targetable vulnerability to ferroptosis and represents an exciting opportunity for therapeutic strategies for cancer
    • …
    corecore