35 research outputs found

    Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On

    Full text link
    Fabricating and designing 3D garments has become extremely demanding with the increasing need for synthesizing realistic dressed persons for a variety of applications, e.g. 3D virtual try-on, digitalization of 2D clothes into 3D apparel, and cloth animation. It thus necessitates a simple and straightforward pipeline to obtain high-quality texture from simple input, such as 2D reference images. Since traditional warping-based texture generation methods require a significant number of control points to be manually selected for each type of garment, which can be a time-consuming and tedious process. We propose a novel method, called Cloth2Tex, which eliminates the human burden in this process. Cloth2Tex is a self-supervised method that generates texture maps with reasonable layout and structural consistency. Another key feature of Cloth2Tex is that it can be used to support high-fidelity texture inpainting. This is done by combining Cloth2Tex with a prevailing latent diffusion model. We evaluate our approach both qualitatively and quantitatively and demonstrate that Cloth2Tex can generate high-quality texture maps and achieve the best visual effects in comparison to other methods. Project page: tomguluson92.github.io/projects/cloth2tex/Comment: 15 pages, 15 figure

    FetusMapV2: Enhanced Fetal Pose Estimation in 3D Ultrasound

    Full text link
    Fetal pose estimation in 3D ultrasound (US) involves identifying a set of associated fetal anatomical landmarks. Its primary objective is to provide comprehensive information about the fetus through landmark connections, thus benefiting various critical applications, such as biometric measurements, plane localization, and fetal movement monitoring. However, accurately estimating the 3D fetal pose in US volume has several challenges, including poor image quality, limited GPU memory for tackling high dimensional data, symmetrical or ambiguous anatomical structures, and considerable variations in fetal poses. In this study, we propose a novel 3D fetal pose estimation framework (called FetusMapV2) to overcome the above challenges. Our contribution is three-fold. First, we propose a heuristic scheme that explores the complementary network structure-unconstrained and activation-unreserved GPU memory management approaches, which can enlarge the input image resolution for better results under limited GPU memory. Second, we design a novel Pair Loss to mitigate confusion caused by symmetrical and similar anatomical structures. It separates the hidden classification task from the landmark localization task and thus progressively eases model learning. Last, we propose a shape priors-based self-supervised learning by selecting the relatively stable landmarks to refine the pose online. Extensive experiments and diverse applications on a large-scale fetal US dataset including 1000 volumes with 22 landmarks per volume demonstrate that our method outperforms other strong competitors.Comment: 16 pages, 11 figures, accepted by Medical Image Analysis(2023

    Segment Anything Model for Medical Images?

    Full text link
    The Segment Anything Model (SAM) is the first foundation model for general image segmentation. It designed a novel promotable segmentation task, ensuring zero-shot image segmentation using the pre-trained model via two main modes including automatic everything and manual prompt. SAM has achieved impressive results on various natural image segmentation tasks. However, medical image segmentation (MIS) is more challenging due to the complex modalities, fine anatomical structures, uncertain and complex object boundaries, and wide-range object scales. SAM has achieved impressive results on various natural image segmentation tasks. Meanwhile, zero-shot and efficient MIS can well reduce the annotation time and boost the development of medical image analysis. Hence, SAM seems to be a potential tool and its performance on large medical datasets should be further validated. We collected and sorted 52 open-source datasets, and build a large medical segmentation dataset with 16 modalities, 68 objects, and 553K slices. We conducted a comprehensive analysis of different SAM testing strategies on the so-called COSMOS 553K dataset. Extensive experiments validate that SAM performs better with manual hints like points and boxes for object perception in medical images, leading to better performance in prompt mode compared to everything mode. Additionally, SAM shows remarkable performance in some specific objects and modalities, but is imperfect or even totally fails in other situations. Finally, we analyze the influence of different factors (e.g., the Fourier-based boundary complexity and size of the segmented objects) on SAM's segmentation performance. Extensive experiments validate that SAM's zero-shot segmentation capability is not sufficient to ensure its direct application to the MIS.Comment: 23 pages, 14 figures, 12 table

    Success in books: a big data approach to bestsellers

    No full text
    Abstract Reading remains the preferred leisure activity for most individuals, continuing to offer a unique path to knowledge and learning. As such, books remain an important cultural product, consumed widely. Yet, while over 3 million books are published each year, very few are read widely and less than 500 make it to the New York Times bestseller lists. And once there, only a handful of authors can command the lists for more than a few weeks. Here we bring a big data approach to book success by investigating the properties and sales trajectories of bestsellers. We find that there are seasonal patterns to book sales with more books being sold during holidays, and even among bestsellers, fiction books sell more copies than nonfiction books. General fiction and biographies make the list more often than any other genre books, and the higher a book’s initial place in the rankings, the longer the book stays on the list as well. Looking at patterns characterizing authors, we find that fiction writers are more productive than nonfiction writers, commonly achieving bestseller status with multiple books. Additionally, there is no gender disparity among bestselling fiction authors but nonfiction, most bestsellers are written by male authors. Finally we find that there is a universal pattern to book sales. Using this universality we introduce a statistical model to explain the time evolution of sales. This model not only reproduces the entire sales trajectory of a book but also predicts the total number of copies it will sell in its lifetime, based on its early sales numbers. The analysis of the bestseller characteristics and the discovery of the universal nature of sales patterns with its driving forces are crucial for our understanding of the book industry, and more generally, of how we as a society interact with cultural products

    Crystal structure of (E)-4-bromo-N′-(3-chloro-2-hydroxybenzylidene)benzohydrazide, C14H10BrClN2O2

    No full text
    C14H10BrClN2O2, orthorhombic, Pca21 (no. 29), a = 32.4137(12) Å, b = 4.6683(2) Å, c = 8.8873(3) Å, V = 1344.80(9) Å3, Z = 4, Rgt(F) = 0.0239, wRref(F2) = 0.0611, T = 150(2) K

    Exploration of a deep learning-based mechanism for predicting the work competence of community caregivers

    No full text
    To predict the workability of community nursing staff and provide corresponding training strategies based on the results. In this study, a nursing staff workability prediction model based on R-GCN-GRU was constructed. In the process of community nursing staff workability feature extraction, the attention mechanism is introduced, combined with the degree of association between the captured nodes of the R-GCN network and the long-term memory capacity of the GRU network, and the model optimization is carried out using the cross-entropy loss function. Finally, the workability of community caregivers in a city in Guangdong Province was predicted to verify the accuracy of the model from multiple perspectives. The results showed that clinical handling ability, keen observation ability, and communication ability were more valued by most caregivers, and their selection rates all reached 98.4%. On the other hand, clinical research, organizational management, and innovation abilities were relatively low. In the ability prediction of individual characteristics, the highest income personnel’s working ability was second only to the lowest salary personnel reaching 44.61±6.03. The working ability of older age and higher-position nursing staff, and nursing staff with more than 25 years of service reached 45.62±6.14, 48.30±5.22, and 45.86±5.52, respectively

    Crystal structure of 4-bromo-N′-[(3-bromo-2-hydroxyphenyl)methylidene]benzohydrazide methanol solvate, C15H14Br2N2O3

    No full text
    C15H14Br2N2O3, triclinic, P1‾P‾{1} (no. 2), a = 7.2748(3) Å, b = 9.0786(4) Å, c = 12.8613(5) Å, α = 99.458(2)°, β = 100.401(2)°, γ = 99.499(2)°, V = 807.35(6) Å3, Z = 2, Rgt(F) = 0.0253, wRref(F2) = 0.0673, T = 150.0 K
    corecore