41 research outputs found
H-VFI: Hierarchical Frame Interpolation for Videos with Large Motions
Capitalizing on the rapid development of neural networks, recent video frame
interpolation (VFI) methods have achieved notable improvements. However, they
still fall short for real-world videos containing large motions. Complex
deformation and/or occlusion caused by large motions make it an extremely
difficult problem in video frame interpolation. In this paper, we propose a
simple yet effective solution, H-VFI, to deal with large motions in video frame
interpolation. H-VFI contributes a hierarchical video interpolation transformer
(HVIT) to learn a deformable kernel in a coarse-to-fine strategy in multiple
scales. The learnt deformable kernel is then utilized in convolving the input
frames for predicting the interpolated frame. Starting from the smallest scale,
H-VFI updates the deformable kernel by a residual in succession based on former
predicted kernels, intermediate interpolated results and hierarchical features
from transformer. Bias and masks to refine the final outputs are then predicted
by a transformer block based on interpolated results. The advantage of such a
progressive approximation is that the large motion frame interpolation problem
can be decomposed into several relatively simpler sub-tasks, which enables a
very accurate prediction in the final results. Another noteworthy contribution
of our paper consists of a large-scale high-quality dataset, YouTube200K, which
contains videos depicting a great variety of scenarios captured at high
resolution and high frame rate. Extensive experiments on multiple frame
interpolation benchmarks validate that H-VFI outperforms existing
state-of-the-art methods especially for videos with large motions
Disentangling superconducting and magnetic orders in NaFe_1-xNi_xAs using muon spin rotation
Muon spin rotation and relaxation studies have been performed on a "111"
family of iron-based superconductors NaFe_1-xNi_xAs. Static magnetic order was
characterized by obtaining the temperature and doping dependences of the local
ordered magnetic moment size and the volume fraction of the magnetically
ordered regions. For x = 0 and 0.4 %, a transition to a nearly-homogeneous long
range magnetically ordered state is observed, while for higher x than 0.4 %
magnetic order becomes more disordered and is completely suppressed for x = 1.5
%. The magnetic volume fraction continuously decreases with increasing x. The
combination of magnetic and superconducting volumes implies that a
spatially-overlapping coexistence of magnetism and superconductivity spans a
large region of the T-x phase diagram for NaFe_1-xNi_xAs . A strong reduction
of both the ordered moment size and the volume fraction is observed below the
superconducting T_C for x = 0.6, 1.0, and 1.3 %, in contrast to other iron
pnictides in which one of these two parameters exhibits a reduction below TC,
but not both. The suppression of magnetic order is further enhanced with
increased Ni doping, leading to a reentrant non-magnetic state below T_C for x
= 1.3 %. The reentrant behavior indicates an interplay between
antiferromagnetism and superconductivity involving competition for the same
electrons. These observations are consistent with the sign-changing s-wave
superconducting state, which is expected to appear on the verge of microscopic
coexistence and phase separation with magnetism. We also present a universal
linear relationship between the local ordered moment size and the
antiferromagnetic ordering temperature TN across a variety of iron-based
superconductors. We argue that this linear relationship is consistent with an
itinerant-electron approach, in which Fermi surface nesting drives
antiferromagnetic ordering.Comment: 20 pages, 14 figures, Correspondence should be addressed to Prof.
Yasutomo Uemura: [email protected]
Multi-Attention Module for Dynamic Facial Emotion Recognition
Video-based dynamic facial emotion recognition (FER) is a challenging task, as one must capture and distinguish tiny facial movements representing emotional changes while ignoring the facial differences of different objects. Recent state-of-the-art studies have usually adopted more complex methods to solve this task, such as large-scale deep learning models or multimodal analysis with reference to multiple sub-models. According to the characteristics of the FER task and the shortcomings of existing methods, in this paper we propose a lightweight method and design three attention modules that can be flexibly inserted into the backbone network. The key information for the three dimensions of space, channel, and time is extracted by means of convolution layer, pooling layer, multi-layer perception (MLP), and other approaches, and attention weights are generated. By sharing parameters at the same level, the three modules do not add too many network parameters while enhancing the focus on specific areas of the face, effective feature information of static images, and key frames. The experimental results on CK+ and eNTERFACE’05 datasets show that this method can achieve higher accuracy
Multi-Attention Module for Dynamic Facial Emotion Recognition
Video-based dynamic facial emotion recognition (FER) is a challenging task, as one must capture and distinguish tiny facial movements representing emotional changes while ignoring the facial differences of different objects. Recent state-of-the-art studies have usually adopted more complex methods to solve this task, such as large-scale deep learning models or multimodal analysis with reference to multiple sub-models. According to the characteristics of the FER task and the shortcomings of existing methods, in this paper we propose a lightweight method and design three attention modules that can be flexibly inserted into the backbone network. The key information for the three dimensions of space, channel, and time is extracted by means of convolution layer, pooling layer, multi-layer perception (MLP), and other approaches, and attention weights are generated. By sharing parameters at the same level, the three modules do not add too many network parameters while enhancing the focus on specific areas of the face, effective feature information of static images, and key frames. The experimental results on CK+ and eNTERFACE’05 datasets show that this method can achieve higher accuracy
Targeting CDK9 for the Treatment of Glioblastoma
Glioblastoma is the most common and aggressive primary malignant brain tumor, and more than two-thirds of patients with glioblastoma die within two years of diagnosis. The challenges of treating this disease mainly include genetic and microenvironmental features that often render the tumor resistant to treatments. Despite extensive research efforts, only a small number of drugs tested in clinical trials have become therapies for patients. Targeting cyclin-dependent kinase 9 (CDK9) is an emerging therapeutic approach that has the potential to overcome the challenges in glioblastoma management. Here, we discuss how CDK9 inhibition can impact transcription, metabolism, DNA damage repair, epigenetics, and the immune response to facilitate an anti-tumor response. Moreover, we discuss small-molecule inhibitors of CDK9 in clinical trials and future perspectives on the use of CDK9 inhibitors in treating patients with glioblastoma