61 research outputs found

    Implicit Neural Representation for Physics-driven Actuated Soft Bodies

    Full text link
    Active soft bodies can affect their shape through an internal actuation mechanism that induces a deformation. Similar to recent work, this paper utilizes a differentiable, quasi-static, and physics-based simulation layer to optimize for actuation signals parameterized by neural networks. Our key contribution is a general and implicit formulation to control active soft bodies by defining a function that enables a continuous mapping from a spatial point in the material space to the actuation value. This property allows us to capture the signal's dominant frequencies, making the method discretization agnostic and widely applicable. We extend our implicit model to mandible kinematics for the particular case of facial animation and show that we can reliably reproduce facial expressions captured with high-quality capture systems. We apply the method to volumetric soft bodies, human poses, and facial expressions, demonstrating artist-friendly properties, such as simple control over the latent space and resolution invariance at test time.Comment: Accepted to SIGGRAPH 2022. Project page: https://studios.disneyresearch.com/2022/07/24/implicit-neural-representation-for-physics-driven-actuated-soft-bodies/ Video: https://www.youtube.com/watch?v=9EERe_CTaz

    SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution

    Full text link
    Owe to the powerful generative priors, the pre-trained text-to-image (T2I) diffusion models have become increasingly popular in solving the real-world image super-resolution problem. However, as a consequence of the heavy quality degradation of input low-resolution (LR) images, the destruction of local structures can lead to ambiguous image semantics. As a result, the content of reproduced high-resolution image may have semantic errors, deteriorating the super-resolution performance. To address this issue, we present a semantics-aware approach to better preserve the semantic fidelity of generative real-world image super-resolution. First, we train a degradation-aware prompt extractor, which can generate accurate soft and hard semantic prompts even under strong degradation. The hard semantic prompts refer to the image tags, aiming to enhance the local perception ability of the T2I model, while the soft semantic prompts compensate for the hard ones to provide additional representation information. These semantic prompts can encourage the T2I model to generate detailed and semantically accurate results. Furthermore, during the inference process, we integrate the LR images into the initial sampling noise to mitigate the diffusion model's tendency to generate excessive random details. The experiments show that our method can reproduce more realistic image details and hold better the semantics

    MonoHair: High-Fidelity Hair Modeling from a Monocular Video

    Full text link
    Undoubtedly, high-fidelity 3D hair is crucial for achieving realism, artistic expression, and immersion in computer graphics. While existing 3D hair modeling methods have achieved impressive performance, the challenge of achieving high-quality hair reconstruction persists: they either require strict capture conditions, making practical applications difficult, or heavily rely on learned prior data, obscuring fine-grained details in images. To address these challenges, we propose MonoHair,a generic framework to achieve high-fidelity hair reconstruction from a monocular video, without specific requirements for environments. Our approach bifurcates the hair modeling process into two main stages: precise exterior reconstruction and interior structure inference. The exterior is meticulously crafted using our Patch-based Multi-View Optimization (PMVO). This method strategically collects and integrates hair information from multiple views, independent of prior data, to produce a high-fidelity exterior 3D line map. This map not only captures intricate details but also facilitates the inference of the hair's inner structure. For the interior, we employ a data-driven, multi-view 3D hair reconstruction method. This method utilizes 2D structural renderings derived from the reconstructed exterior, mirroring the synthetic 2D inputs used during training. This alignment effectively bridges the domain gap between our training data and real-world data, thereby enhancing the accuracy and reliability of our interior structure inference. Lastly, we generate a strand model and resolve the directional ambiguity by our hair growth algorithm. Our experiments demonstrate that our method exhibits robustness across diverse hairstyles and achieves state-of-the-art performance. For more results, please refer to our project page https://keyuwu-cs.github.io/MonoHair/.Comment: Accepted by IEEE CVPR 202

    Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection

    Full text link
    Long-tailed object detection (LTOD) aims to handle the extreme data imbalance in real-world datasets, where many tail classes have scarce instances. One popular strategy is to explore extra data with image-level labels, yet it produces limited results due to (1) semantic ambiguity -- an image-level label only captures a salient part of the image, ignoring the remaining rich semantics within the image; and (2) location sensitivity -- the label highly depends on the locations and crops of the original image, which may change after data transformations like random cropping. To remedy this, we propose RichSem, a simple but effective method, which is robust to learn rich semantics from coarse locations without the need of accurate bounding boxes. RichSem leverages rich semantics from images, which are then served as additional soft supervision for training detectors. Specifically, we add a semantic branch to our detector to learn these soft semantics and enhance feature representations for long-tailed object detection. The semantic branch is only used for training and is removed during inference. RichSem achieves consistent improvements on both overall and rare-category of LVIS under different backbones and detectors. Our method achieves state-of-the-art performance without requiring complex training and testing procedures. Moreover, we show the effectiveness of our method on other long-tailed datasets with additional experiments. Code is available at \url{https://github.com/MengLcool/RichSem}.Comment: Accepted by NeurIPS202

    Application of Fractal to Evaluate the Drying Shrinkage Behavior of Soil Composites from Recycled Waste Clay Brick

    No full text
    Soil drying cracking is the most common natural phenomenon affecting soil stability. Due to the complexity of the geometric shapes of soil cracks during the cracking process, it has become a major problem in engineering science. The extremely irregular and complex crack networks formed in civil engineering materials can be quantitatively investigated using fractal theory. In this paper, fractal dimension is proposed to characterize the drying cracking characteristics of composite soil by adding recycled waste brick micro-powder. At the same time, the concept of the probability entropy of cracking is introduced to quantify the ordered state of crack development. Correspondingly, the endpoint value of probability entropy was solved mathematically, and the meaning of the probability entropy of cracking was clarified. In this study, the fracture fractal characteristics of composite soil mixed with different materials were first investigated. Then, five groups of composite soil-saturated muds with added recycled waste brick micro-powder of different contents were prepared in the laboratory. Using the evaporation test under constant temperature and humidity, the change rules of the fractal dimensions, probability entropy, crack ratios, and water contents of cracks during the cracking process of the soil samples were obtained. The results show that: (1) on the whole, the fractal dimensions of the soil samples added with recycled waste brick micro-powder of different contents increased over time, and the fractal dimensions of the soil samples without recycled waste brick micro-powder were obviously larger than those of the soil samples with recycled waste brick micro-powder. With the increase in the content of recycled waste brick micro-powder, the maximum fractal dimension decreased in turn. The maximum fractal dimensions of the five groups of soil samples were 1.74, 1.68, 1.62, 1.57, and 1.45. (2) The change trends of the probability entropy and fractal dimensions were similar; both of them showed an upward trend over time, and the probability entropy of the soil samples without recycled waste brick micro-powder was greater than that of the soil samples with recycled waste brick micro-powder. With the increase in the contents of recycled waste brick micro-powder, the probability entropy decreased in turn. The maximum values of the crack probability entropy of the five groups of soil samples were 0.99, 0.92, 0.87, 0.83, and 0.80. (3) Under the action of continuous evaporation, the moisture contents of the soil samples gradually decreased over time, while the crack ratios increased over time. To sum up, both from the perspective of the development process of the cracks of the soil samples and from the perspective of the final stable crack networks of the soil samples, the geometric shapes of the cracks of the soil samples without recycled waste brick micro-powder were the most complex. With the increase in the content of recycled waste brick micro-powder, the fractal characteristics of the cracks gradually changed from complex to simple

    High-Dimensional Covariance Estimation via Constrained <i>L<sub>q</sub></i>-Type Regularization

    No full text
    High-dimensional covariance matrix estimation is one of the fundamental and important problems in multivariate analysis and has a wide range of applications in many fields. In practice, it is common that a covariance matrix is composed of a low-rank matrix and a sparse matrix. In this paper we estimate the covariance matrix by solving a constrained Lq-type regularized optimization problem. We establish the first-order optimality conditions for this problem by using proximal mapping and the subspace method. The proposed stationary point degenerates to the first-order stationary points of the unconstrained Lq regularized sparse or low-rank optimization problems. A smoothing alternating updating method is proposed to find an estimator for the covariance matrix. We establish the convergence of the proposed calculation method. The numerical simulation results show the effectiveness of the proposed approach for high-dimensional covariance estimation

    Optimization of Solidification and Stabilization Efficiency of Heavy Metal Contaminated Sediment Based on Response Surface Methodology

    No full text
    Solidification and stabilization (S/S) by agents and stabilizers is an effective way to treat heavy metal-contaminated sediments. Optimization of curing condition is crucial to minimize the consumption of reagents on the base of effective S/S. In this work, the synergistic effects of cement and stabilizer on mechanical strength and leaching toxicity of contaminated sediments were investigated, and the S/S conditions were optimized using response surface methodology. On the basis of a single-factor test, multi-factor experiments were conducted to fit the relationship between the S/S effect of contaminated sediments and the amount of cement and stabilizer. The mechanism of stabilization was investigated by the results from the revised BCR method. The results indicate that the optimal curing conditions were 44.29% of cement content with 2.05% of trimercapto-s-triazine trisodium salt (TMT). After 28 days of curing, the compressive strength reached 2.07 MPa and the leaching concentrations of Cd, Cu, and Pb were 0.094 mg/L, 0.031 mg/L, and 0.173 mg/L, respectively, which met the requirement of in-situ resource recycling standard. The stability of heavy metals was significantly improved as a result of the removal of acid extractable fraction (15.58~69.92%) and an increase in the residual fraction (18.27~49.07%)

    Research progress of nanoplastics in freshwater

    No full text
    With the mass production and use of plastic products, which leads to their continuous entry into the water environment, the problem of environmental pollution has been paid more and more attention by scholars from different countries. In recent years, a large number of studies have focused on microplastics, but few on nanoplastics (NPs). However, NPs are smaller in size, have a higher affinity for cells, and surface and volume ratios are higher than those of microplastics. NPs may also enter biological tissues, blood and cells, which may cause greater potential harm to organisms. In this paper, firstly, the environmental fate of NPs accumulation and deposition is summarized, and further research is needed in the future; secondly, the current techniques for NPs extraction and characterization of NPs extraction and characterization are summarized. At present, the analytical methods of NPs are in the primary stage, and lack of standardized and accurate methods; finally, the toxic effects of NPs on biological morphology, behavior and reproduction are discussed. It has been found that the small size and high surface area of NPs make them more toxic to organisms than microplastics. However, most of the current toxicological studies of NPs on freshwater organisms could not be simulated in real environment. (C) 2020 Elsevier B.V. All rights reserved

    Implicit neural representation for physics-driven actuated soft bodies

    No full text
    Active soft bodies can affect their shape through an internal actuation mechanism that induces a deformation. Similar to recent work, this paper utilizes a differentiable, quasi-static, and physics-based simulation layer to optimize for actuation signals parameterized by neural networks. Our key contribution is a general and implicit formulation to control active soft bodies by defining a function that enables a continuous mapping from a spatial point in the material space to the actuation value. This property allows us to capture the signal's dominant frequencies, making the method discretization agnostic and widely applicable. We extend our implicit model to mandible kinematics for the particular case of facial animation and show that we can reliably reproduce facial expressions captured with high-quality capture systems. We apply the method to volumetric soft bodies, human poses, and facial expressions, demonstrating artist-friendly properties, such as simple control over the latent space and resolution invariance at test time.ISSN:0730-0301ISSN:1557-736
    corecore