11 research outputs found

    Internal-External Boundary Attention Fusion for Glass Surface Segmentation

    Full text link
    Glass surfaces of transparent objects and mirrors are not able to be uniquely and explicitly characterized by their visual appearances because they contain the visual appearance of other reflected or transmitted surfaces as well. Detecting glass regions from a single-color image is a challenging task. Recent deep-learning approaches have paid attention to the description of glass surface boundary where the transition of visual appearances between glass and non-glass surfaces are observed. In this work, we analytically investigate how glass surface boundary helps to characterize glass objects. Inspired by prior semantic segmentation approaches with challenging image types such as X-ray or CT scans, we propose separated internal-external boundary attention modules that individually learn and selectively integrate visual characteristics of the inside and outside region of glass surface from a single color image. Our proposed method is evaluated on six public benchmarks comparing with state-of-the-art methods showing promising results

    Enhanced Boundary Learning for Glass-like Object Segmentation

    Full text link
    Glass-like objects such as windows, bottles, and mirrors exist widely in the real world. Sensing these objects has many applications, including robot navigation and grasping. However, this task is very challenging due to the arbitrary scenes behind glass-like objects. This paper aims to solve the glass-like object segmentation problem via enhanced boundary learning. In particular, we first propose a novel refined differential module that outputs finer boundary cues. We then introduce an edge-aware point-based graph convolution network module to model the global shape along the boundary. We use these two modules to design a decoder that generates accurate and clean segmentation results, especially on the object contours. Both modules are lightweight and effective: they can be embedded into various segmentation models. In extensive experiments on three recent glass-like object segmentation datasets, including Trans10k, MSD, and GDD, our approach establishes new state-of-the-art results. We also illustrate the strong generalization properties of our method on three generic segmentation datasets, including Cityscapes, BDD, and COCO Stuff. Code and models is available at \url{https://github.com/hehao13/EBLNet}.Comment: ICCV-2021 Code is availabe at https://github.com/hehao13/EBLNe

    NeRRF: 3D Reconstruction and View Synthesis for Transparent and Specular Objects with Neural Refractive-Reflective Fields

    Full text link
    Neural radiance fields (NeRF) have revolutionized the field of image-based view synthesis. However, NeRF uses straight rays and fails to deal with complicated light path changes caused by refraction and reflection. This prevents NeRF from successfully synthesizing transparent or specular objects, which are ubiquitous in real-world robotics and A/VR applications. In this paper, we introduce the refractive-reflective field. Taking the object silhouette as input, we first utilize marching tetrahedra with a progressive encoding to reconstruct the geometry of non-Lambertian objects and then model refraction and reflection effects of the object in a unified framework using Fresnel terms. Meanwhile, to achieve efficient and effective anti-aliasing, we propose a virtual cone supersampling technique. We benchmark our method on different shapes, backgrounds and Fresnel terms on both real-world and synthetic datasets. We also qualitatively and quantitatively benchmark the rendering results of various editing applications, including material editing, object replacement/insertion, and environment illumination estimation. Codes and data are publicly available at https://github.com/dawning77/NeRRF

    Korean mask-dance and Aristotle\u27s poetics

    Full text link
    Korean mask-dance is the traditional theatre of Korea. It was formerly the country\u27s most well-known form of drama among traditional theatrical entertainments. This study explores the theatrical structure of Korean mask-dance as well as its historical background. The rise of Korean mask-dance may be traced back to the shamanistic village ritual which gradually became similar to the extant form after absorbing aspects of the Buddhism festival through the Goryeo Dynasty, which lasted from 918-1392). During the Joseon Dynasty (1392-1910), the mask-dance had acquired its basic form with aspects of professional theatrical entertainment. The mask-dances have been performed during traditional holidays and festivals over the past three hundred years. Four types of the mask-dance continue to exist today, all being derived from their geographic origins. Many scholar and artists have explained the value of Korean mask-dance and its own esthetic level. Dance, song, music, masks, costumes, props, stage, and audience participation of Korean mask-dance are obvious theatrical elements and have their own separate meaning. The main purpose of this study is to examine the weak parts of the dramatic structure of the art form and attempt to analyze it using Aristotle\u27s Poetics (384-322 BCE). When Korean mask-dance is analyzed by Aristotle\u27s concept of drama, the mask-dance exactly reverses this order of importance of the dramatic elements. Through recognition of both the uniqueness of Korean mask-dance and the dramatic standards of Aristotle\u27s concept, this study should enable scholars and artists to embrace more fully the universal nature of theatre

    Mirror Position Detection in a Catoptric Surface

    Get PDF
    The Catoptric Surface research project is a pioneering exploration of controlling daylight effects within built environments. In this thesis, we focus on the mirror position detection problem, which plays a vital role in achieving dynamic control over the direction of reflected light within a space. To address the challenge of mirror position detection, we employ computer vision techniques, specifically edge detection and the RANdom SAmple Consensus (RANSAC) algorithm. Edge detection is utilized to identify significant changes in intensity or color, corresponding to object boundaries, while RANSAC is applied for ellipse fitting. By iteratively selecting minimal subsets of points and fitting ellipses that meet geometric constraints, we attempt to accurately determine the position and geometry of mirrors in the catoptric array. We evaluate two different RANSAC libraries for ellipse fitting, and our findings show that the skimage library in Python provides superior results compared to other alternatives. Additionally, we leverage the multiprocessing package to enable parallel processing, improving the efficiency of mirror detection. We conclude that it is possible to detect single steps of mirror movement, however, reliable operation is highly sensitive to parameter settings within the computational pipeline
    corecore