268 research outputs found

    RADIO: Reference-Agnostic Dubbing Video Synthesis

    Full text link
    One of the most challenging problems in audio-driven talking head generation is achieving high-fidelity detail while ensuring precise synchronization. Given only a single reference image, extracting meaningful identity attributes becomes even more challenging, often causing the network to mirror the facial and lip structures too closely. To address these issues, we introduce RADIO, a framework engineered to yield high-quality dubbed videos regardless of the pose or expression in reference images. The key is to modulate the decoder layers using latent space composed of audio and reference features. Additionally, we incorporate ViT blocks into the decoder to emphasize high-fidelity details, especially in the lip region. Our experimental results demonstrate that RADIO displays high synchronization without the loss of fidelity. Especially in harsh scenarios where the reference frame deviates significantly from the ground truth, our method outperforms state-of-the-art methods, highlighting its robustness. Pre-trained model and codes will be made public after the review.Comment: Under revie

    Improving the Expressiveness of Deep Learning Frameworks with Recursion

    Full text link
    Recursive neural networks have widely been used by researchers to handle applications with recursively or hierarchically structured data. However, embedded control flow deep learning frameworks such as TensorFlow, Theano, Caffe2, and MXNet fail to efficiently represent and execute such neural networks, due to lack of support for recursion. In this paper, we add recursion to the programming model of existing frameworks by complementing their design with recursive execution of dataflow graphs as well as additional APIs for recursive definitions. Unlike iterative implementations, which can only understand the topological index of each node in recursive data structures, our recursive implementation is able to exploit the recursive relationships between nodes for efficient execution based on parallel computation. We present an implementation on TensorFlow and evaluation results with various recursive neural network models, showing that our recursive implementation not only conveys the recursive nature of recursive neural networks better than other implementations, but also uses given resources more effectively to reduce training and inference time.Comment: Appeared in EuroSys 2018. 13 pages, 11 figure

    Behavioral analysis of Pacific abalone, Haliotis discus hannai, reveals its feeding preference and attraction potential for brown alga, Sargassum horneri

    Get PDF
    The Pacific abalone, Haliotis discus hannai, is a highly valued and industrially important aquaculture species with growing demands of the expanding abalone aquaculture industry. To explore the feasibility of using the brown alga, Sargassum horneri, as a potential substitute for abalone feed, it is important to identify the feed preference and attractant effect of S. horneri on Pacific abalone. Our experiments indicated that the feeding-associated movement of abalone could be detected using a video tracking system under indirect illumination with dim red light. To further analyze the attraction potentials of various test materials, preference analysis was performed using Avicel-coated glass plates with ground powders of various seaweeds (e.g., S. horneri, Saccharina japonica, and Undaria pinnatifida) and commercial abalone feed, together with coffee waste. Heat map analysis indicated greater attraction by the kelp S. japonica than by S. horneri and commercial feed, which showed similar preference levels. Feeding preference based on the area of Avicel eaten by abalone showed a significant preference for U. pinnatifida over S. horneri (feeding area: 68.6 ± 20.1% vs. 37.5 ± 22.4%, p < 0.05). Additionally, the feeding area was significantly greater for plates with S. ja-ponica than for plates with S. horneri (44.0 ± 16.6% vs. 22.6 ± 15.4%, p < 0.05). There was no significant difference in feeding area between commercial feed and S. horneri (31.7 ± 11.6% vs. 31.6 ± 20.2%, p > 0.05). The methanol extracts attracted abalone in the following order: U. pinnatifida > S. horneri > S. japonica > commercial feed > coffee waste. To determine the attractive effects of the components of methanol extracts, mixtures of methanol extracts of commercial feed with increasing amounts of S. horneri were examined. The results showed a significant increase in feeding preference upon addition of S. horneri up to 50% and 75%, suggesting its potential for use as an appetite-enhancing feed additive. This study identified conditions that can be successfully used to monitor the movement of Pacific abalone; the results of preference analysis confirmed that abalone exhibited similar attraction and feeding preference for S. horneri, compared with commercial feed
    • …
    corecore