41 research outputs found

    H-VFI: Hierarchical Frame Interpolation for Videos with Large Motions

    Full text link
    Capitalizing on the rapid development of neural networks, recent video frame interpolation (VFI) methods have achieved notable improvements. However, they still fall short for real-world videos containing large motions. Complex deformation and/or occlusion caused by large motions make it an extremely difficult problem in video frame interpolation. In this paper, we propose a simple yet effective solution, H-VFI, to deal with large motions in video frame interpolation. H-VFI contributes a hierarchical video interpolation transformer (HVIT) to learn a deformable kernel in a coarse-to-fine strategy in multiple scales. The learnt deformable kernel is then utilized in convolving the input frames for predicting the interpolated frame. Starting from the smallest scale, H-VFI updates the deformable kernel by a residual in succession based on former predicted kernels, intermediate interpolated results and hierarchical features from transformer. Bias and masks to refine the final outputs are then predicted by a transformer block based on interpolated results. The advantage of such a progressive approximation is that the large motion frame interpolation problem can be decomposed into several relatively simpler sub-tasks, which enables a very accurate prediction in the final results. Another noteworthy contribution of our paper consists of a large-scale high-quality dataset, YouTube200K, which contains videos depicting a great variety of scenarios captured at high resolution and high frame rate. Extensive experiments on multiple frame interpolation benchmarks validate that H-VFI outperforms existing state-of-the-art methods especially for videos with large motions

    Disentangling superconducting and magnetic orders in NaFe_1-xNi_xAs using muon spin rotation

    Full text link
    Muon spin rotation and relaxation studies have been performed on a "111" family of iron-based superconductors NaFe_1-xNi_xAs. Static magnetic order was characterized by obtaining the temperature and doping dependences of the local ordered magnetic moment size and the volume fraction of the magnetically ordered regions. For x = 0 and 0.4 %, a transition to a nearly-homogeneous long range magnetically ordered state is observed, while for higher x than 0.4 % magnetic order becomes more disordered and is completely suppressed for x = 1.5 %. The magnetic volume fraction continuously decreases with increasing x. The combination of magnetic and superconducting volumes implies that a spatially-overlapping coexistence of magnetism and superconductivity spans a large region of the T-x phase diagram for NaFe_1-xNi_xAs . A strong reduction of both the ordered moment size and the volume fraction is observed below the superconducting T_C for x = 0.6, 1.0, and 1.3 %, in contrast to other iron pnictides in which one of these two parameters exhibits a reduction below TC, but not both. The suppression of magnetic order is further enhanced with increased Ni doping, leading to a reentrant non-magnetic state below T_C for x = 1.3 %. The reentrant behavior indicates an interplay between antiferromagnetism and superconductivity involving competition for the same electrons. These observations are consistent with the sign-changing s-wave superconducting state, which is expected to appear on the verge of microscopic coexistence and phase separation with magnetism. We also present a universal linear relationship between the local ordered moment size and the antiferromagnetic ordering temperature TN across a variety of iron-based superconductors. We argue that this linear relationship is consistent with an itinerant-electron approach, in which Fermi surface nesting drives antiferromagnetic ordering.Comment: 20 pages, 14 figures, Correspondence should be addressed to Prof. Yasutomo Uemura: [email protected]

    An analog of the adjugate matrix for the outer inverse AT,S-(2)

    No full text

    Multi-Attention Module for Dynamic Facial Emotion Recognition

    No full text
    Video-based dynamic facial emotion recognition (FER) is a challenging task, as one must capture and distinguish tiny facial movements representing emotional changes while ignoring the facial differences of different objects. Recent state-of-the-art studies have usually adopted more complex methods to solve this task, such as large-scale deep learning models or multimodal analysis with reference to multiple sub-models. According to the characteristics of the FER task and the shortcomings of existing methods, in this paper we propose a lightweight method and design three attention modules that can be flexibly inserted into the backbone network. The key information for the three dimensions of space, channel, and time is extracted by means of convolution layer, pooling layer, multi-layer perception (MLP), and other approaches, and attention weights are generated. By sharing parameters at the same level, the three modules do not add too many network parameters while enhancing the focus on specific areas of the face, effective feature information of static images, and key frames. The experimental results on CK+ and eNTERFACE’05 datasets show that this method can achieve higher accuracy

    Multi-Attention Module for Dynamic Facial Emotion Recognition

    No full text
    Video-based dynamic facial emotion recognition (FER) is a challenging task, as one must capture and distinguish tiny facial movements representing emotional changes while ignoring the facial differences of different objects. Recent state-of-the-art studies have usually adopted more complex methods to solve this task, such as large-scale deep learning models or multimodal analysis with reference to multiple sub-models. According to the characteristics of the FER task and the shortcomings of existing methods, in this paper we propose a lightweight method and design three attention modules that can be flexibly inserted into the backbone network. The key information for the three dimensions of space, channel, and time is extracted by means of convolution layer, pooling layer, multi-layer perception (MLP), and other approaches, and attention weights are generated. By sharing parameters at the same level, the three modules do not add too many network parameters while enhancing the focus on specific areas of the face, effective feature information of static images, and key frames. The experimental results on CK+ and eNTERFACE’05 datasets show that this method can achieve higher accuracy

    Targeting CDK9 for the Treatment of Glioblastoma

    No full text
    Glioblastoma is the most common and aggressive primary malignant brain tumor, and more than two-thirds of patients with glioblastoma die within two years of diagnosis. The challenges of treating this disease mainly include genetic and microenvironmental features that often render the tumor resistant to treatments. Despite extensive research efforts, only a small number of drugs tested in clinical trials have become therapies for patients. Targeting cyclin-dependent kinase 9 (CDK9) is an emerging therapeutic approach that has the potential to overcome the challenges in glioblastoma management. Here, we discuss how CDK9 inhibition can impact transcription, metabolism, DNA damage repair, epigenetics, and the immune response to facilitate an anti-tumor response. Moreover, we discuss small-molecule inhibitors of CDK9 in clinical trials and future perspectives on the use of CDK9 inhibitors in treating patients with glioblastoma
    corecore