43 research outputs found

    Object Level Depth Reconstruction for Category Level 6D Object Pose Estimation From Monocular RGB Image

    Full text link
    Recently, RGBD-based category-level 6D object pose estimation has achieved promising improvement in performance, however, the requirement of depth information prohibits broader applications. In order to relieve this problem, this paper proposes a novel approach named Object Level Depth reconstruction Network (OLD-Net) taking only RGB images as input for category-level 6D object pose estimation. We propose to directly predict object-level depth from a monocular RGB image by deforming the category-level shape prior into object-level depth and the canonical NOCS representation. Two novel modules named Normalized Global Position Hints (NGPH) and Shape-aware Decoupled Depth Reconstruction (SDDR) module are introduced to learn high fidelity object-level depth and delicate shape representations. At last, the 6D object pose is solved by aligning the predicted canonical representation with the back-projected object-level depth. Extensive experiments on the challenging CAMERA25 and REAL275 datasets indicate that our model, though simple, achieves state-of-the-art performance.Comment: 19 pages, 7 figures, 4 table

    EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation

    Full text link
    Speech-driven 3D face animation aims to generate realistic facial expressions that match the speech content and emotion. However, existing methods often neglect emotional facial expressions or fail to disentangle them from speech content. To address this issue, this paper proposes an end-to-end neural network to disentangle different emotions in speech so as to generate rich 3D facial expressions. Specifically, we introduce the emotion disentangling encoder (EDE) to disentangle the emotion and content in the speech by cross-reconstructed speech signals with different emotion labels. Then an emotion-guided feature fusion decoder is employed to generate a 3D talking face with enhanced emotion. The decoder is driven by the disentangled identity, emotional, and content embeddings so as to generate controllable personal and emotional styles. Finally, considering the scarcity of the 3D emotional talking face data, we resort to the supervision of facial blendshapes, which enables the reconstruction of plausible 3D faces from 2D emotional data, and contribute a large-scale 3D emotional talking face dataset (3D-ETF) to train the network. Our experiments and user studies demonstrate that our approach outperforms state-of-the-art methods and exhibits more diverse facial movements. We recommend watching the supplementary video: https://ziqiaopeng.github.io/emotalkComment: Accepted by ICCV 202

    Strain localisation and failure at twin-boundary complexions in nickel-based superalloys

    Get PDF
    From Springer Nature via Jisc Publications RouterHistory: received 2020-06-22, accepted 2020-08-21, registration 2020-09-04, pub-electronic 2020-09-29, online 2020-09-29, collection 2020-12Publication status: PublishedAbstract: Twin boundaries (TBs) in Ni-based superalloys are vulnerable sites for failure in demanding environments, and a current lack of mechanistic understanding hampers the reliable lifetime prediction and performance optimisation of these alloys. Here we report the discovery of an unexpected γ″ precipitation mechanism at TBs that takes the responsibility for alloy failure in demanding environments. Using multiscale microstructural and mechanical characterisations (from millimetre down to atomic level) and DFT calculations, we demonstrate that abnormal γ″ precipitation along TBs accounts for the premature dislocation activities and pronounced strain localisation associated with TBs during mechanical loading, which serves as a precursor for crack initiation. We clarify the physical origin of the TBs-related cracking at the atomic level of γ″-strengthened Ni-based superalloys in a hydrogen containing environment, and provide practical methods to mitigate the adverse effect of TBs on the performance of these alloys

    On the use of deep learning for phase recovery

    Full text link
    Phase recovery (PR) refers to calculating the phase of the light field from its intensity measurements. As exemplified from quantitative phase imaging and coherent diffraction imaging to adaptive optics, PR is essential for reconstructing the refractive index distribution or topography of an object and correcting the aberration of an imaging system. In recent years, deep learning (DL), often implemented through deep neural networks, has provided unprecedented support for computational imaging, leading to more efficient solutions for various PR problems. In this review, we first briefly introduce conventional methods for PR. Then, we review how DL provides support for PR from the following three stages, namely, pre-processing, in-processing, and post-processing. We also review how DL is used in phase image processing. Finally, we summarize the work in DL for PR and outlook on how to better use DL to improve the reliability and efficiency in PR. Furthermore, we present a live-updating resource (https://github.com/kqwang/phase-recovery) for readers to learn more about PR.Comment: 82 pages, 32 figure

    Relationship between freight accessibility and logistics employment in US counties

    Get PDF
    This paper analyzes the relationship between freight accessibility and logistics employment in the US. It develops an accessibility measure relevant for logistics companies based on a gravity model. This allows for an analysis of the accessibility of US counties focusing on four different modes of transportation: road, rail, air, and maritime. Using a Partial Least Squares model, these four different freight accessibility measures are combined into two constructs, continental and intercontinental freight accessibility, and related to logistics employment. Results show that highly accessible counties attract more logistics employment than other counties. The analyses show that it is very important to control for the effect of the county population on both freight accessibility and logistics employment. While county population explains the most variation in the logistics employment per county, there is a significant relationship between freight accessibility and logistics employment, when controlling for this effect
    corecore