4 research outputs found

    DiffComplete: Diffusion-based Generative 3D Shape Completion

    Full text link
    We introduce a new diffusion-based approach for shape completion on 3D range scans. Compared with prior deterministic and probabilistic methods, we strike a balance between realism, multi-modality, and high fidelity. We propose DiffComplete by casting shape completion as a generative task conditioned on the incomplete shape. Our key designs are two-fold. First, we devise a hierarchical feature aggregation mechanism to inject conditional features in a spatially-consistent manner. So, we can capture both local details and broader contexts of the conditional inputs to control the shape completion. Second, we propose an occupancy-aware fusion strategy in our model to enable the completion of multiple partial shapes and introduce higher flexibility on the input conditions. DiffComplete sets a new SOTA performance (e.g., 40% decrease on l_1 error) on two large-scale 3D shape completion benchmarks. Our completed shapes not only have a realistic outlook compared with the deterministic methods but also exhibit high similarity to the ground truths compared with the probabilistic alternatives. Further, DiffComplete has strong generalizability on objects of entirely unseen classes for both synthetic and real data, eliminating the need for model re-training in various applications.Comment: Project Page: https://ruihangchu.com/diffcomplete.htm

    Proof and Application of Discriminating Ocean Oil Spills and Seawater Based on Polarization Ratio Using Quad-Polarization Synthetic Aperture Radar

    No full text
    This paper focuses on the proof and application of discriminating between oil spills and seawater (including the “look-alikes”, named low wind areas) based on the polarization ratio. A new relative polarization ratio (PRr) method is proposed, which is based on the difference between the scattering mechanism and the dielectric constant for oil spills compared to that of seawater. The case study found that (1) PRr numerically amplifies the contrast between oil spills and seawater, reduces the difference between low wind areas and ordinary seawater, and exhibits better details of the image; (2) the threshold method based on Euclidean distance can obtain the highest classification overall accuracy within the allowable error range, and can be widely used in the study of different incidence angles and environmental conditions; and (3) the identification of oil spills and seawater by the proposed methods can largely avoid the misjudgment of low wind areas as oil spills. Considering visual interpretation as the reference ‘ground truth’, the overall classification accuracy of all cases is more than 95%; only the edge of the diffuse thin oil slick and oil–water mixture is difficult to identify. This method can serve as an effective supplement to existing oil spill detection methods

    A Survey of Reasoning with Foundation Models

    No full text
    https://github.com/reasoning-survey/Awesome-Reasoning-Foundation-Models Reasoning, a crucial ability for complex problem-solving, plays a pivotal role in various real-world settings such as negotiation, medical diagnosis, and criminal investigation. It serves as a fundamental methodology in the field of Artificial General Intelligence (AGI). With the ongoing development of foundation models, there is a growing interest in exploring their abilities in reasoning tasks. In this paper, we introduce seminal foundation models proposed or adaptable for reasoning, highlighting the latest advancements in various reasoning tasks, methods, and benchmarks. We then delve into the potential future directions behind the emergence of reasoning abilities within foundation models. We also discuss the relevance of multimodal learning, autonomous agents, and super alignment in the context of reasoning. By discussing these future research directions, we hope to inspire researchers in their exploration of this field, stimulate further advancements in reasoning with foundation models, and contribute to the development of AGI
    corecore