91 research outputs found

    Ultrafast Video Attention Prediction with Coupled Knowledge Distillation

    Get PDF
    Large convolutional neural network models have recently demonstrated impressive performance on video attention prediction. Conventionally, these models are with intensive computation and large memory. To address these issues, we design an extremely light-weight network with ultrafast speed, named UVA-Net. The network is constructed based on depth-wise convolutions and takes low-resolution images as input. However, this straight-forward acceleration method will decrease performance dramatically. To this end, we propose a coupled knowledge distillation strategy to augment and train the network effectively. With this strategy, the model can further automatically discover and emphasize implicit useful cues contained in the data. Both spatial and temporal knowledge learned by the high-resolution complex teacher networks also can be distilled and transferred into the proposed low-resolution light-weight spatiotemporal network. Experimental results show that the performance of our model is comparable to ten state-of-the-art models in video attention prediction, while it costs only 0.68 MB memory footprint, runs about 10,106 FPS on GPU and 404 FPS on CPU, which is 206 times faster than previous models

    CFAD: A Chinese Dataset for Fake Audio Detection

    Full text link
    Fake audio detection is a growing concern and some relevant datasets have been designed for research. However, there is no standard public Chinese dataset under complex conditions.In this paper, we aim to fill in the gap and design a Chinese fake audio detection dataset (CFAD) for studying more generalized detection methods. Twelve mainstream speech-generation techniques are used to generate fake audio. To simulate the real-life scenarios, three noise datasets are selected for noise adding at five different signal-to-noise ratios, and six codecs are considered for audio transcoding (format conversion). CFAD dataset can be used not only for fake audio detection but also for detecting the algorithms of fake utterances for audio forensics. Baseline results are presented with analysis. The results that show fake audio detection methods with generalization remain challenging. The CFAD dataset is publicly available at: https://zenodo.org/record/8122764.Comment: FAD renamed as CFA

    Expanding Language-Image Pretrained Models for General Video Recognition

    Full text link
    Contrastive language-image pretraining has shown great success in learning visual-textual joint representation from web-scale data, demonstrating remarkable "zero-shot" generalization ability for various image tasks. However, how to effectively expand such new language-image pretraining methods to video domains is still an open problem. In this work, we present a simple yet effective approach that adapts the pretrained language-image models to video recognition directly, instead of pretraining a new model from scratch. More concretely, to capture the long-range dependencies of frames along the temporal dimension, we propose a cross-frame attention mechanism that explicitly exchanges information across frames. Such module is lightweight and can be plugged into pretrained language-image models seamlessly. Moreover, we propose a video-specific prompting scheme, which leverages video content information for generating discriminative textual prompts. Extensive experiments demonstrate that our approach is effective and can be generalized to different video recognition scenarios. In particular, under fully-supervised settings, our approach achieves a top-1 accuracy of 87.1% on Kinectics-400, while using 12 times fewer FLOPs compared with Swin-L and ViViT-H. In zero-shot experiments, our approach surpasses the current state-of-the-art methods by +7.6% and +14.9% in terms of top-1 accuracy under two popular protocols. In few-shot scenarios, our approach outperforms previous best methods by +32.1% and +23.1% when the labeled data is extremely limited. Code and models are available at https://aka.ms/X-CLIPComment: Accepted by ECCV2022, Ora

    A New Two-Port Network Model-Based Pilot Protection for AC Transmission Lines

    Get PDF

    Efficient perpendicular magnetization switching by a magnetic spin Hall effect in a noncollinear antiferromagnet

    Get PDF
    Current induced spin-orbit torques driven by the conventional spin Hall effect are widely used to manipulate the magnetization. This approach, however, is nondeterministic and inefficient for the switching of magnets with perpendicular magnetic anisotropy that are demanded by the high-density magnetic storage and memory devices. Here, we demonstrate that this limitation can be overcome by exploiting a magnetic spin Hall effect in noncollinear antiferromagnets, such as Mn3Sn. The magnetic group symmetry of Mn3Sn allows generation of the out-of-plane spin current carrying spin polarization collinear to its direction induced by an in-plane charge current. This spin current drives an out-of-plane anti-damping torque providing the deterministic switching of the perpendicular magnetization of an adjacent Ni/Co multilayer. Due to being odd with respect to time reversal symmetry, the observed magnetic spin Hall effect and the resulting spin-orbit torque can be reversed with reversal of the antiferromagnetic order. Contrary to the conventional spin-orbit torque devices, the demonstrated magnetization switching does not need an external magnetic field and requires much lower current density which is useful for low power spintronics

    Aberrant Expression of N-Methylpurine-DNA Glycosylase Influences Patient Survival in Malignant Gliomas

    Get PDF
    Aim. To examine the expression of N-methylpurine-DNA glycosylase (MPG) gene and protein in glioma samples with different WHO grades and its association with patients' survival. Methods. Immunohistochemistry assay, quantitative real-time PCR and Western blot analysis were carried out to investigate the expression of MPG gene and protein in 128 glioma and 10 non-neoplastic brain tissues. Results. MPG gene expression level in glioma tissues was significantly higher than that in non-neoplastic brain tissues (P < 0.001). Immunohistochemistry also showed that MPG protein was over-expressed in glioma tissues, which was consistent with the resutls of Western blot analysis. Additionally, the expression levels of MPG gene and protein both increase from grade I to grade IV glioma according to the results of real-time PCR, immunohistochemistry and western blot analysis. Moreover, the survival rate of MPG-positive patients was significantly lower than that of MPG-negative patients (P < 0.001). We further confirmed that the over-expression of MPG was a significant and independent prognostic indicator in glioma by multivariate analysis (P < 0.001). Conclusions. Our data showed the over-expression of MPG gene and protein in human gliomas, and also suggested for the first time that MPG be an unfavorable independent prognostic indicator for glioma patients

    Unraveling the Role of the rssC Gene of Serratia marcescens by Atomic Force Microscopy

    Get PDF
    100學年度研究獎補助論文[[abstract]]The product and direct role of the rssC gene of Serratia marcescens is unknown. For unraveling the role of the rssC gene, atomic force microscopy has been used to identify the surfaces of intact S. marcescens wild-type CH-1 cells and rssC mutant CH-1ΔC cells. The detailed surface topographies were directly visualized, and quantitative measurements of the physical properties of the membrane structures were provided. CH-1 and CH-1ΔC cells were observed before and after treatment with lysozyme, and their topography-related parameters, e.g., a valley-to-peak distance, mean height, surface roughness, and surface root-mean-square values, were defined and compared. The data obtained suggest that the cellular surface topography of mutant CH-1ΔC becomes rougher and more precipitous than that of wild-type CH-1 cells. Moreover, it was found that, compared with native wild-type CH-1, the cellular surface topography of lysozyme-treated CH-1 was not changed profoundly. The product of the rssC gene is thus predicted to be mainly responsible for fatty-acid biosynthesis of the S. marcescens outer membrane. This study represents the first direct observation of the structural changes in membranes of bacterial mutant cells and offers a new prospect for predicting gene expression in bacterial cells.[[journaltype]]國外[[incitationindex]]SCI[[booktype]]紙本[[countrycodes]]GB
    corecore