
    Energy-latency manipulation of multi-modal large language models via verbose samples

    Despite the exceptional performance of multi-modal large language models (MLLMs), their deployment requires substantial computational resources. If malicious users induce high energy consumption and latency (energy-latency cost), they can exhaust computational resources and harm service availability. In this paper, we investigate this vulnerability for MLLMs, particularly image-based and video-based ones, and aim to induce high energy-latency cost during inference by crafting an imperceptible perturbation. We find that high energy-latency cost can be induced by maximizing the length of generated sequences, which motivates us to propose verbose samples, including verbose images and videos. Concretely, two modality-nonspecific losses are proposed: a loss to delay the end-of-sequence (EOS) token and an uncertainty loss to increase the uncertainty over each generated token. In addition, improving diversity is important for encouraging longer responses by increasing complexity, which inspires the following modality-specific losses. For verbose images, a token diversity loss is proposed to promote diverse hidden states. For verbose videos, a frame feature diversity loss is proposed to increase the feature diversity among frames. To balance these losses, we propose a temporal weight adjustment algorithm. Experiments demonstrate that our verbose samples can largely extend the length of generated sequences.

    Mice with a Mutation in the Mdm2 Gene That Interferes with MDM2/Ribosomal Protein Binding Develop a Defect in Erythropoiesis

    MDM2, an E3 ubiquitin ligase, is an important negative regulator of tumor suppressor p53. In turn, the Mdm2 gene is a transcriptional target of p53, forming a negative feedback loop that is important in cell cycle control. It has recently become apparent that the ubiquitination of p53 by MDM2 can be inhibited when certain ribosomal proteins, including RPL5 and RPL11, bind to MDM2. This inhibition, and the resulting increase in p53 levels, has been proposed to be responsible for the red cell aplasia seen in Diamond-Blackfan anemia (DBA) and in 5q- myelodysplastic syndrome (MDS). DBA and 5q- MDS are associated with inherited (DBA) or acquired (5q- MDS) haploinsufficiency of ribosomal proteins. A mutation in Mdm2 causing a C305F amino acid substitution blocks the binding of ribosomal proteins. Mice harboring this mutation (Mdm2C305F) retain a normal p53 response to DNA damage, but lack the p53 response to perturbations in ribosome biogenesis. While studying the interaction between RP haploinsufficiency and the Mdm2C305F mutation, we noticed that Mdm2C305F homozygous mice had altered hematopoiesis. These mice developed a mild macrocytic anemia with reticulocytosis. In the bone marrow (BM), these mice showed a significant decrease in Ter119hi cells compared to wild type (WT) littermates, while no decrease in the number of mature erythroid cells (Ter119hiCD71low) was found in the spleen, which showed compensated bone marrow hematopoiesis. In methylcellulose cultures, BFU-E colonies from the mutant mice were slightly reduced in number and there was a significant reduction in CFU-E colony numbers in mutant mice compared with WT controls (p < 0.01). This erythropoietic defect was abrogated by concomitant p53 deficiency (Trp53ko/ko). Further investigation revealed that in Mdm2C305F animals, there was a decrease in Lin-Sca-1+c-Kit+ (LSK) cells, accompanied by significant decreases in multipotent progenitor (MPP) cells (p < 0.01). Competitive BM repopulation experiments showed that donor BM harboring the Mdm2C305F mutation possessed decreased repopulation capacity compared to WT BM, suggesting a functional stem cell deficit. These results suggest that there is a finely tuned balance in the interaction of ribosomal proteins with the MDM2/p53 axis which is important in normal hematopoiesis.

    Physics-informed neural network for friction-involved nonsmooth dynamics problems

    Friction-induced vibration (FIV) is very common in engineering. Analysing the dynamic behaviour of systems containing a multiple-contact point frictional interface is an important topic. However, accurately simulating nonsmooth/discontinuous dynamic behaviour due to friction is challenging. This paper presents a new physics-informed neural network approach for solving nonsmooth friction-induced vibration or friction-involved vibration problems. In contrast to schemes of the conventional time-stepping methodology, in this new computational framework the theoretical formulations of nonsmooth multibody dynamics are transformed and embedded in the training process of the neural network. Major findings include that the new framework can not only perform accurate simulation of nonsmooth dynamic behaviour, but also eliminate the need for the extremely small time steps typically associated with the conventional time-stepping methodology for multibody systems, thus saving much computation work while maintaining high accuracy. Specifically, four kinds of high-accuracy PINN-based methods are proposed: (1) single PINN; (2) dual PINN; (3) advanced single PINN; (4) advanced dual PINN. Two typical dynamics problems with nonsmooth contact are tested: one is a 1-dimensional contact problem with stick-slip, and the other is a 2-dimensional contact problem considering separation-reattachment and stick-slip oscillation. Both single and dual PINN methods show their advantages in dealing with the 1-dimensional stick-slip problem, outperforming conventional methods across friction models that are difficult to simulate by the conventional time-stepping method. For the 2-dimensional problem, the capability of the advanced single and advanced dual PINN on accuracy improvement is shown, and they provide good results even in cases when conventional methods fail. Comment: 38 pages, 24 figures

    Ratio of Hadronic Decay Rates of J/ψ and ψ(2S) and the ρπ Puzzle

    The so-called ρπ puzzle of J/ψ and ψ(2S) decays is examined using the experimental data available to date. Two different approaches were taken to estimate the ratio of J/ψ and ψ(2S) hadronic decay rates. While one of the estimates could not yield the exact ratio of ψ(2S) to J/ψ inclusive hadronic decay rates, the other, based on a computation of the inclusive ggg decay rate for ψ(2S) (J/ψ) by subtracting other decay rates from the total decay rate, differs by two standard deviations from the naive prediction of perturbative QCD, even though its central value is nearly twice as large as what was naively expected. A comparison between this ratio, upon making corrections for specific exclusive two-body decay modes, and the corresponding experimental data confirms the puzzles in J/ψ and ψ(2S) decays. We find from our analysis that the exclusively reconstructed hadronic decays of the ψ(2S) account for only a small fraction of its total decays, and a ratio exceeding the above estimate should be expected to occur for a considerable number of the remaining decay channels. We also show that the recent new results from the BES experiment provide crucial tests of various theoretical models proposed to explain the puzzle. Comment: 8 pages, no figures, 4 tables

    Multipurpose watermarking approach for copyright and integrity of steganographic autoencoder models

    With the great achievements of deep learning technology, neural network models have emerged as a new type of intellectual property. Neural network models’ design and training require considerable computational resources and time. Watermarking is a potential solution for achieving copyright protection and integrity of neural network models without excessively compromising the models’ accuracy and stability. In this work, we develop a multipurpose watermarking method for securing the copyright and integrity of a steganographic autoencoder referred to as “HiDDen.” This autoencoder model is used to hide different kinds of watermark messages in digital images. Copyright information is embedded with imperceptibly modified model parameters, and integrity is verified by embedding the hash value generated from the model parameters. Experimental results show that the proposed multipurpose watermarking method can reliably identify copyright ownership and localize tampered parts of the model parameters. Furthermore, the accuracy and robustness of the autoencoder model are perfectly preserved.