31 research outputs found

    Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis

    Full text link
    Recent works in implicit representations, such as Neural Radiance Fields (NeRF), have advanced the generation of realistic and animatable head avatars from video sequences. These implicit methods are still confronted by visual artifacts and jitters, since the lack of explicit geometric constraints poses a fundamental challenge in accurately modeling complex facial deformations. In this paper, we introduce Dynamic Tetrahedra (DynTet), a novel hybrid representation that encodes explicit dynamic meshes by neural networks to ensure geometric consistency across various motions and viewpoints. DynTet is parameterized by the coordinate-based networks which learn signed distance, deformation, and material texture, anchoring the training data into a predefined tetrahedra grid. Leveraging Marching Tetrahedra, DynTet efficiently decodes textured meshes with a consistent topology, enabling fast rendering through a differentiable rasterizer and supervision via a pixel loss. To enhance training efficiency, we incorporate classical 3D Morphable Models to facilitate geometry learning and define a canonical space for simplifying texture learning. These advantages are readily achievable owing to the effective geometric representation employed in DynTet. Compared with prior works, DynTet demonstrates significant improvements in fidelity, lip synchronization, and real-time performance according to various metrics. Beyond producing stable and visually appealing synthesis videos, our method also outputs the dynamic meshes which is promising to enable many emerging applications.Comment: CVPR 202

    Visually Grounded Commonsense Knowledge Acquisition

    Full text link
    Large-scale commonsense knowledge bases empower a broad range of AI applications, where the automatic extraction of commonsense knowledge (CKE) is a fundamental and challenging problem. CKE from text is known for suffering from the inherent sparsity and reporting bias of commonsense in text. Visual perception, on the other hand, contains rich commonsense knowledge about real-world entities, e.g., (person, can_hold, bottle), which can serve as promising sources for acquiring grounded commonsense knowledge. In this work, we present CLEVER, which formulates CKE as a distantly supervised multi-instance learning problem, where models learn to summarize commonsense relations from a bag of images about an entity pair without any human annotation on image instances. To address the problem, CLEVER leverages vision-language pre-training models for deep understanding of each image in the bag, and selects informative instances from the bag to summarize commonsense entity relations via a novel contrastive attention mechanism. Comprehensive experimental results in held-out and human evaluation show that CLEVER can extract commonsense knowledge in promising quality, outperforming pre-trained language model-based methods by 3.9 AUC and 6.4 mAUC points. The predicted commonsense scores show strong correlation with human judgment with a 0.78 Spearman coefficient. Moreover, the extracted commonsense can also be grounded into images with reasonable interpretability. The data and codes can be obtained at https://github.com/thunlp/CLEVER.Comment: Accepted by AAAI 202

    Low-Temperature Gas Plasma Combined with Antibiotics for the Reduction of Methicillin-Resistant \u3ci\u3eStaphylococcus aureus\u3c/i\u3e Biofilm Both in Vitro and in Vivo

    Get PDF
    Biofilm infections in wounds seriously delay the healing process, and methicillin-resistant Staphylococcus aureus is a major cause of wound infections. In addition to inactivating micro-organisms, low-temperature gas plasma can restore the sensitivity of pathogenic microbes to antibiotics. However, the combined treatment has not been applied to infectious diseases. In this study, low-temperature gas plasma treatment promoted the effects of different antibiotics on the reduction of S. aureus biofilms in vitro. Low-temperature gas plasma combined with rifampicin also effectively reduced the S. aureus cells in biofilms in the murine wound infection model. The blood and histochemical analysis demonstrated the biosafety of the combined treatment. Our findings demonstrated that low-temperature gas plasma combined with antibiotics is a promising therapeutic strategy for wound infections

    Pre‐symptomatic transmission of novel coronavirus in community settings

    Full text link
    We used contact tracing to document how COVID‐19 was transmitted across 5 generations involving 10 cases, starting with an individual who became ill on January 27. We calculated the incubation period of the cases as the interval between infection and development of symptoms. The median incubation period was 6.0 days (interquartile range, 3.5‐9.5 days). The last two generations were infected in public places, 3 and 4 days prior to the onset of illness in their infectors. Both had certain underlying conditions and comorbidity. Further identification of how individuals transmit prior to being symptomatic will have important consequences.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/163478/2/irv12773.pdfhttp://deepblue.lib.umich.edu/bitstream/2027.42/163478/1/irv12773_am.pd

    Exploration in Mapping Kernel-Based Home Range Models from Remote Sensing Imagery with Conditional Adversarial Networks

    No full text
    Kernel-based home range models are widely-used to estimate animal habitats and develop conservation strategies. They provide a probabilistic measure of animal space use instead of assuming the uniform utilization within an outside boundary. However, this type of models estimates the home ranges from animal relocations, and the inadequate locational data often prevents scientists from applying them in long-term and large-scale research. In this paper, we propose an end-to-end deep learning framework to simulate kernel home range models. We use the conditional adversarial network as a supervised model to learn the home range mapping from time-series remote sensing imagery. Our approach enables scientists to eliminate the persistent dependence on locational data in home range analysis. In experiments, we illustrate our approach by mapping the home ranges of Bar-headed Geese in Qinghai Lake area. The proposed framework outperforms all baselines in both qualitative and quantitative evaluations, achieving visually recognizable results and high mapping accuracy. The experiment also shows that learning the mapping between images is a more effective way to map such complex targets than traditional pixel-based schemes

    Investigating Home Range, Movement Pattern, and Habitat Selection of Bar-headed Geese during Breeding Season at Qinghai Lake, China

    No full text
    The Bar-headed Goose is the only true goose species or Anserinae to migrate solely within the Central Asian Flyway, and thus, it is an ideal species for observing the effects of both land use and climate change throughout the flyway. In this paper, we investigate the home range, movement pattern, and habitat selection of Bar-headed Geese (Anser indicus) during the breeding season at Qinghai Lake, which is one of their largest breeding areas and a major migration staging area in the flyway. We identified several areas used by the geese during the breeding season along the shoreline of Qinghai Lake and found that most geese had more than one core use area and daily movements that provided insight into their breeding activity. We also observed the intensive use of specific wetlands and habitats near Qinghai Lake. These data provide interesting insights into the movement ecology of this important species and also provide critical information for managers seeking to understand and respond to conservation concerns threatening Bar-headed Geese, such as landscape and habitat changes

    Exploiting Time-Series Image-to-Image Translation to Expand the Range of Wildlife Habitat Analysis

    No full text
    Characterizing wildlife habitat is one of the main topics in animal ecology. Locational data obtained from radio tracking and field observation are widely used in habitat analysis. However, such sampling methods are costly and laborious, and insufficient relocations often prevent scientists from conducting large-range and long-term research. In this paper, we innovatively exploit the image-to-image translation technology to expand the range of wildlife habitat analysis. We proposed a novel approach for implementing time-series imageto-image translation via metric embedding. A siamese neural network is used to learn the Euclidean temporal embedding from the image space. This embedding produces temporal vectors which bring time information into the adversarial network. The well-trained framework could effectively map the probabilistic habitat models from remote sensing imagery, helping scientists get rid of the persistent dependence on animal relocations. We illustrate our approach in a real-world application for mapping the habitats of Bar-headed Geese at Qinghai Lake breeding ground. We compare our model against several baselines and achieve promising results
    corecore