43 research outputs found
Object Level Depth Reconstruction for Category Level 6D Object Pose Estimation From Monocular RGB Image
Recently, RGBD-based category-level 6D object pose estimation has achieved
promising improvement in performance, however, the requirement of depth
information prohibits broader applications. In order to relieve this problem,
this paper proposes a novel approach named Object Level Depth reconstruction
Network (OLD-Net) taking only RGB images as input for category-level 6D object
pose estimation. We propose to directly predict object-level depth from a
monocular RGB image by deforming the category-level shape prior into
object-level depth and the canonical NOCS representation. Two novel modules
named Normalized Global Position Hints (NGPH) and Shape-aware Decoupled Depth
Reconstruction (SDDR) module are introduced to learn high fidelity object-level
depth and delicate shape representations. At last, the 6D object pose is solved
by aligning the predicted canonical representation with the back-projected
object-level depth. Extensive experiments on the challenging CAMERA25 and
REAL275 datasets indicate that our model, though simple, achieves
state-of-the-art performance.Comment: 19 pages, 7 figures, 4 table
EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation
Speech-driven 3D face animation aims to generate realistic facial expressions
that match the speech content and emotion. However, existing methods often
neglect emotional facial expressions or fail to disentangle them from speech
content. To address this issue, this paper proposes an end-to-end neural
network to disentangle different emotions in speech so as to generate rich 3D
facial expressions. Specifically, we introduce the emotion disentangling
encoder (EDE) to disentangle the emotion and content in the speech by
cross-reconstructed speech signals with different emotion labels. Then an
emotion-guided feature fusion decoder is employed to generate a 3D talking face
with enhanced emotion. The decoder is driven by the disentangled identity,
emotional, and content embeddings so as to generate controllable personal and
emotional styles. Finally, considering the scarcity of the 3D emotional talking
face data, we resort to the supervision of facial blendshapes, which enables
the reconstruction of plausible 3D faces from 2D emotional data, and contribute
a large-scale 3D emotional talking face dataset (3D-ETF) to train the network.
Our experiments and user studies demonstrate that our approach outperforms
state-of-the-art methods and exhibits more diverse facial movements. We
recommend watching the supplementary video:
https://ziqiaopeng.github.io/emotalkComment: Accepted by ICCV 202
Strain localisation and failure at twin-boundary complexions in nickel-based superalloys
From Springer Nature via Jisc Publications RouterHistory: received 2020-06-22, accepted 2020-08-21, registration 2020-09-04, pub-electronic 2020-09-29, online 2020-09-29, collection 2020-12Publication status: PublishedAbstract: Twin boundaries (TBs) in Ni-based superalloys are vulnerable sites for failure in demanding environments, and a current lack of mechanistic understanding hampers the reliable lifetime prediction and performance optimisation of these alloys. Here we report the discovery of an unexpected γ″ precipitation mechanism at TBs that takes the responsibility for alloy failure in demanding environments. Using multiscale microstructural and mechanical characterisations (from millimetre down to atomic level) and DFT calculations, we demonstrate that abnormal γ″ precipitation along TBs accounts for the premature dislocation activities and pronounced strain localisation associated with TBs during mechanical loading, which serves as a precursor for crack initiation. We clarify the physical origin of the TBs-related cracking at the atomic level of γ″-strengthened Ni-based superalloys in a hydrogen containing environment, and provide practical methods to mitigate the adverse effect of TBs on the performance of these alloys
On the use of deep learning for phase recovery
Phase recovery (PR) refers to calculating the phase of the light field from
its intensity measurements. As exemplified from quantitative phase imaging and
coherent diffraction imaging to adaptive optics, PR is essential for
reconstructing the refractive index distribution or topography of an object and
correcting the aberration of an imaging system. In recent years, deep learning
(DL), often implemented through deep neural networks, has provided
unprecedented support for computational imaging, leading to more efficient
solutions for various PR problems. In this review, we first briefly introduce
conventional methods for PR. Then, we review how DL provides support for PR
from the following three stages, namely, pre-processing, in-processing, and
post-processing. We also review how DL is used in phase image processing.
Finally, we summarize the work in DL for PR and outlook on how to better use DL
to improve the reliability and efficiency in PR. Furthermore, we present a
live-updating resource (https://github.com/kqwang/phase-recovery) for readers
to learn more about PR.Comment: 82 pages, 32 figure
Relationship between freight accessibility and logistics employment in US counties
This paper analyzes the relationship between freight accessibility and logistics employment in the US. It develops an accessibility measure relevant for logistics companies based on a gravity model. This allows for an analysis of the accessibility of US counties focusing on four different modes of transportation: road, rail, air, and maritime. Using a Partial Least Squares model, these four different freight accessibility measures are combined into two constructs, continental and intercontinental freight accessibility, and related to logistics employment. Results show that highly accessible counties attract more logistics employment than other counties. The analyses show that it is very important to control for the effect of the county population on both freight accessibility and logistics employment. While county population explains the most variation in the logistics employment per county, there is a significant relationship between freight accessibility and logistics employment, when controlling for this effect