3,152 research outputs found

    Parameter-efficient is not sufficient: Exploring Parameter, Memory, and Time Efficient Adapter Tuning for Dense Predictions

    Full text link
    Pre-training & fine-tuning is a prevalent paradigm in computer vision (CV). Recently, parameter-efficient transfer learning (PETL) methods have shown promising performance in adapting to downstream tasks with only a few trainable parameters. Despite their success, the existing PETL methods in CV can be computationally expensive and require large amounts of memory and time cost during training, which limits low-resource users from conducting research and applications on large models. In this work, we propose Parameter, Memory, and Time Efficient Visual Adapter (E3VA\mathrm{E^3VA}) tuning to address this issue. We provide a gradient backpropagation highway for low-rank adapters which eliminates the need for expensive backpropagation through the frozen pre-trained model, resulting in substantial savings of training memory and training time. Furthermore, we optimise the E3VA\mathrm{E^3VA} structure for CV tasks to promote model performance. Extensive experiments on COCO, ADE20K, and Pascal VOC benchmarks show that E3VA\mathrm{E^3VA} can save up to 62.2% training memory and 26.2% training time on average, while achieving comparable performance to full fine-tuning and better performance than most PETL methods. Note that we can even train the Swin-Large-based Cascade Mask RCNN on GTX 1080Ti GPUs with less than 1.5% trainable parameters.Comment: 14 pages, 4 figures, 5 tables, Submitted to NeurIPS202

    Isospin breaking decays as a diagnosis of the hadronic molecular structure of the Pc(4457)P_c(4457)

    Full text link
    The LHCb Collaboration announced the observation of three narrow structures consistent with hidden-charm pentaquark states. They are candidates of hadronic molecules formed of a pair of a charmed baryon and an anticharmed meson. Among them, the Pc(4457)P_c(4457) mass is consistent with earlier predictions of a ΣcDˉ∗\Sigma_c\bar D^* molecule with I=1/2I=1/2. We point out that if such a picture were true, one would have B(Pc(4457)→J/ψΔ+)/B(Pc(4457)→J/ψp)\mathcal{B}(P_c(4457)\to J/\psi \Delta^+)/\mathcal{B}(P_c(4457)\to J/\psi p) at the level ranging from a few percent to about 30%. Such a large isospin breaking decay ratio is two to three orders of magnitude larger than that for normal hadron resonances. It is a unique feature of the ΣcDˉ∗\Sigma_c\bar D^* molecular model, and can be checked by LHCb.Comment: Version to be published as a Rapid Communication in Phys. Rev. D, selected as Editors' Suggestio

    Experimental Test of Tracking the King Problem

    Full text link
    In quantum theory, the retrodiction problem is not as clear as its classical counterpart because of the uncertainty principle of quantum mechanics. In classical physics, the measurement outcomes of the present state can be used directly for predicting the future events and inferring the past events which is known as retrodiction. However, as a probabilistic theory, quantum-mechanical retrodiction is a nontrivial problem that has been investigated for a long time, of which the Mean King Problem is one of the most extensively studied issues. Here, we present the first experimental test of a variant of the Mean King Problem, which has a more stringent regulation and is termed "Tracking the King". We demonstrate that Alice, by harnessing the shared entanglement and controlled-not gate, can successfully retrodict the choice of King's measurement without knowing any measurement outcome. Our results also provide a counterintuitive quantum communication to deliver information hidden in the choice of measurement.Comment: 16 pages, 5 figures, 2 table
    • …
    corecore