61 research outputs found
Implicit Neural Representation for Physics-driven Actuated Soft Bodies
Active soft bodies can affect their shape through an internal actuation
mechanism that induces a deformation. Similar to recent work, this paper
utilizes a differentiable, quasi-static, and physics-based simulation layer to
optimize for actuation signals parameterized by neural networks. Our key
contribution is a general and implicit formulation to control active soft
bodies by defining a function that enables a continuous mapping from a spatial
point in the material space to the actuation value. This property allows us to
capture the signal's dominant frequencies, making the method discretization
agnostic and widely applicable. We extend our implicit model to mandible
kinematics for the particular case of facial animation and show that we can
reliably reproduce facial expressions captured with high-quality capture
systems. We apply the method to volumetric soft bodies, human poses, and facial
expressions, demonstrating artist-friendly properties, such as simple control
over the latent space and resolution invariance at test time.Comment: Accepted to SIGGRAPH 2022. Project page:
https://studios.disneyresearch.com/2022/07/24/implicit-neural-representation-for-physics-driven-actuated-soft-bodies/
Video: https://www.youtube.com/watch?v=9EERe_CTaz
SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
Owe to the powerful generative priors, the pre-trained text-to-image (T2I)
diffusion models have become increasingly popular in solving the real-world
image super-resolution problem. However, as a consequence of the heavy quality
degradation of input low-resolution (LR) images, the destruction of local
structures can lead to ambiguous image semantics. As a result, the content of
reproduced high-resolution image may have semantic errors, deteriorating the
super-resolution performance. To address this issue, we present a
semantics-aware approach to better preserve the semantic fidelity of generative
real-world image super-resolution. First, we train a degradation-aware prompt
extractor, which can generate accurate soft and hard semantic prompts even
under strong degradation. The hard semantic prompts refer to the image tags,
aiming to enhance the local perception ability of the T2I model, while the soft
semantic prompts compensate for the hard ones to provide additional
representation information. These semantic prompts can encourage the T2I model
to generate detailed and semantically accurate results. Furthermore, during the
inference process, we integrate the LR images into the initial sampling noise
to mitigate the diffusion model's tendency to generate excessive random
details. The experiments show that our method can reproduce more realistic
image details and hold better the semantics
MonoHair: High-Fidelity Hair Modeling from a Monocular Video
Undoubtedly, high-fidelity 3D hair is crucial for achieving realism, artistic
expression, and immersion in computer graphics. While existing 3D hair modeling
methods have achieved impressive performance, the challenge of achieving
high-quality hair reconstruction persists: they either require strict capture
conditions, making practical applications difficult, or heavily rely on learned
prior data, obscuring fine-grained details in images. To address these
challenges, we propose MonoHair,a generic framework to achieve high-fidelity
hair reconstruction from a monocular video, without specific requirements for
environments. Our approach bifurcates the hair modeling process into two main
stages: precise exterior reconstruction and interior structure inference. The
exterior is meticulously crafted using our Patch-based Multi-View Optimization
(PMVO). This method strategically collects and integrates hair information from
multiple views, independent of prior data, to produce a high-fidelity exterior
3D line map. This map not only captures intricate details but also facilitates
the inference of the hair's inner structure. For the interior, we employ a
data-driven, multi-view 3D hair reconstruction method. This method utilizes 2D
structural renderings derived from the reconstructed exterior, mirroring the
synthetic 2D inputs used during training. This alignment effectively bridges
the domain gap between our training data and real-world data, thereby enhancing
the accuracy and reliability of our interior structure inference. Lastly, we
generate a strand model and resolve the directional ambiguity by our hair
growth algorithm. Our experiments demonstrate that our method exhibits
robustness across diverse hairstyles and achieves state-of-the-art performance.
For more results, please refer to our project page
https://keyuwu-cs.github.io/MonoHair/.Comment: Accepted by IEEE CVPR 202
Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection
Long-tailed object detection (LTOD) aims to handle the extreme data imbalance
in real-world datasets, where many tail classes have scarce instances. One
popular strategy is to explore extra data with image-level labels, yet it
produces limited results due to (1) semantic ambiguity -- an image-level label
only captures a salient part of the image, ignoring the remaining rich
semantics within the image; and (2) location sensitivity -- the label highly
depends on the locations and crops of the original image, which may change
after data transformations like random cropping. To remedy this, we propose
RichSem, a simple but effective method, which is robust to learn rich semantics
from coarse locations without the need of accurate bounding boxes. RichSem
leverages rich semantics from images, which are then served as additional soft
supervision for training detectors. Specifically, we add a semantic branch to
our detector to learn these soft semantics and enhance feature representations
for long-tailed object detection. The semantic branch is only used for training
and is removed during inference. RichSem achieves consistent improvements on
both overall and rare-category of LVIS under different backbones and detectors.
Our method achieves state-of-the-art performance without requiring complex
training and testing procedures. Moreover, we show the effectiveness of our
method on other long-tailed datasets with additional experiments. Code is
available at \url{https://github.com/MengLcool/RichSem}.Comment: Accepted by NeurIPS202
Application of Fractal to Evaluate the Drying Shrinkage Behavior of Soil Composites from Recycled Waste Clay Brick
Soil drying cracking is the most common natural phenomenon affecting soil stability. Due to the complexity of the geometric shapes of soil cracks during the cracking process, it has become a major problem in engineering science. The extremely irregular and complex crack networks formed in civil engineering materials can be quantitatively investigated using fractal theory. In this paper, fractal dimension is proposed to characterize the drying cracking characteristics of composite soil by adding recycled waste brick micro-powder. At the same time, the concept of the probability entropy of cracking is introduced to quantify the ordered state of crack development. Correspondingly, the endpoint value of probability entropy was solved mathematically, and the meaning of the probability entropy of cracking was clarified. In this study, the fracture fractal characteristics of composite soil mixed with different materials were first investigated. Then, five groups of composite soil-saturated muds with added recycled waste brick micro-powder of different contents were prepared in the laboratory. Using the evaporation test under constant temperature and humidity, the change rules of the fractal dimensions, probability entropy, crack ratios, and water contents of cracks during the cracking process of the soil samples were obtained. The results show that: (1) on the whole, the fractal dimensions of the soil samples added with recycled waste brick micro-powder of different contents increased over time, and the fractal dimensions of the soil samples without recycled waste brick micro-powder were obviously larger than those of the soil samples with recycled waste brick micro-powder. With the increase in the content of recycled waste brick micro-powder, the maximum fractal dimension decreased in turn. The maximum fractal dimensions of the five groups of soil samples were 1.74, 1.68, 1.62, 1.57, and 1.45. (2) The change trends of the probability entropy and fractal dimensions were similar; both of them showed an upward trend over time, and the probability entropy of the soil samples without recycled waste brick micro-powder was greater than that of the soil samples with recycled waste brick micro-powder. With the increase in the contents of recycled waste brick micro-powder, the probability entropy decreased in turn. The maximum values of the crack probability entropy of the five groups of soil samples were 0.99, 0.92, 0.87, 0.83, and 0.80. (3) Under the action of continuous evaporation, the moisture contents of the soil samples gradually decreased over time, while the crack ratios increased over time. To sum up, both from the perspective of the development process of the cracks of the soil samples and from the perspective of the final stable crack networks of the soil samples, the geometric shapes of the cracks of the soil samples without recycled waste brick micro-powder were the most complex. With the increase in the content of recycled waste brick micro-powder, the fractal characteristics of the cracks gradually changed from complex to simple
High-Dimensional Covariance Estimation via Constrained <i>L<sub>q</sub></i>-Type Regularization
High-dimensional covariance matrix estimation is one of the fundamental and important problems in multivariate analysis and has a wide range of applications in many fields. In practice, it is common that a covariance matrix is composed of a low-rank matrix and a sparse matrix. In this paper we estimate the covariance matrix by solving a constrained Lq-type regularized optimization problem. We establish the first-order optimality conditions for this problem by using proximal mapping and the subspace method. The proposed stationary point degenerates to the first-order stationary points of the unconstrained Lq regularized sparse or low-rank optimization problems. A smoothing alternating updating method is proposed to find an estimator for the covariance matrix. We establish the convergence of the proposed calculation method. The numerical simulation results show the effectiveness of the proposed approach for high-dimensional covariance estimation
Optimization of Solidification and Stabilization Efficiency of Heavy Metal Contaminated Sediment Based on Response Surface Methodology
Solidification and stabilization (S/S) by agents and stabilizers is an effective way to treat heavy metal-contaminated sediments. Optimization of curing condition is crucial to minimize the consumption of reagents on the base of effective S/S. In this work, the synergistic effects of cement and stabilizer on mechanical strength and leaching toxicity of contaminated sediments were investigated, and the S/S conditions were optimized using response surface methodology. On the basis of a single-factor test, multi-factor experiments were conducted to fit the relationship between the S/S effect of contaminated sediments and the amount of cement and stabilizer. The mechanism of stabilization was investigated by the results from the revised BCR method. The results indicate that the optimal curing conditions were 44.29% of cement content with 2.05% of trimercapto-s-triazine trisodium salt (TMT). After 28 days of curing, the compressive strength reached 2.07 MPa and the leaching concentrations of Cd, Cu, and Pb were 0.094 mg/L, 0.031 mg/L, and 0.173 mg/L, respectively, which met the requirement of in-situ resource recycling standard. The stability of heavy metals was significantly improved as a result of the removal of acid extractable fraction (15.58~69.92%) and an increase in the residual fraction (18.27~49.07%)
Research progress of nanoplastics in freshwater
With the mass production and use of plastic products, which leads to their continuous entry into the water environment, the problem of environmental pollution has been paid more and more attention by scholars from different countries. In recent years, a large number of studies have focused on microplastics, but few on nanoplastics (NPs). However, NPs are smaller in size, have a higher affinity for cells, and surface and volume ratios are higher than those of microplastics. NPs may also enter biological tissues, blood and cells, which may cause greater potential harm to organisms. In this paper, firstly, the environmental fate of NPs accumulation and deposition is summarized, and further research is needed in the future; secondly, the current techniques for NPs extraction and characterization of NPs extraction and characterization are summarized. At present, the analytical methods of NPs are in the primary stage, and lack of standardized and accurate methods; finally, the toxic effects of NPs on biological morphology, behavior and reproduction are discussed. It has been found that the small size and high surface area of NPs make them more toxic to organisms than microplastics. However, most of the current toxicological studies of NPs on freshwater organisms could not be simulated in real environment. (C) 2020 Elsevier B.V. All rights reserved
Implicit neural representation for physics-driven actuated soft bodies
Active soft bodies can affect their shape through an internal actuation mechanism that induces a deformation. Similar to recent work, this paper utilizes a differentiable, quasi-static, and physics-based simulation layer to optimize for actuation signals parameterized by neural networks. Our key contribution is a general and implicit formulation to control active soft bodies by defining a function that enables a continuous mapping from a spatial point in the material space to the actuation value. This property allows us to capture the signal's dominant frequencies, making the method discretization agnostic and widely applicable. We extend our implicit model to mandible kinematics for the particular case of facial animation and show that we can reliably reproduce facial expressions captured with high-quality capture systems. We apply the method to volumetric soft bodies, human poses, and facial expressions, demonstrating artist-friendly properties, such as simple control over the latent space and resolution invariance at test time.ISSN:0730-0301ISSN:1557-736
- …