77,955 research outputs found

    ImGeoNet: Image-induced Geometry-aware Voxel Representation for Multi-view 3D Object Detection

    Full text link
    We propose ImGeoNet, a multi-view image-based 3D object detection framework that models a 3D space by an image-induced geometry-aware voxel representation. Unlike previous methods which aggregate 2D features into 3D voxels without considering geometry, ImGeoNet learns to induce geometry from multi-view images to alleviate the confusion arising from voxels of free space, and during the inference phase, only images from multiple views are required. Besides, a powerful pre-trained 2D feature extractor can be leveraged by our representation, leading to a more robust performance. To evaluate the effectiveness of ImGeoNet, we conduct quantitative and qualitative experiments on three indoor datasets, namely ARKitScenes, ScanNetV2, and ScanNet200. The results demonstrate that ImGeoNet outperforms the current state-of-the-art multi-view image-based method, ImVoxelNet, on all three datasets in terms of detection accuracy. In addition, ImGeoNet shows great data efficiency by achieving results comparable to ImVoxelNet with 100 views while utilizing only 40 views. Furthermore, our studies indicate that our proposed image-induced geometry-aware representation can enable image-based methods to attain superior detection accuracy than the seminal point cloud-based method, VoteNet, in two practical scenarios: (1) scenarios where point clouds are sparse and noisy, such as in ARKitScenes, and (2) scenarios involve diverse object classes, particularly classes of small objects, as in the case in ScanNet200.Comment: ICCV'23; project page: https://ttaoretw.github.io/imgeonet

    Feature detection from echocardiography images using local phase information

    Get PDF
    Ultrasound images are characterized by their special speckle appearance, low contrast, and low signal-to-noise ratio. It is always challenging to extract important clinical information from these images. An important step before formal analysis is to transform the image to significant features of interest. Intensity based methods do not perform particularly well on ultrasound images. However, it has been previously shown that these images respond well to local phase-based methods which are theoretically intensity-invariant and thus suitable for ultrasound images. We extend the previous local phase-based method to detect features using the local phase computed from monogenic signal which is an isotropic extension of the analytic signal. We apply our method of multiscale feature-asymmetry measurement and local phase-gradient computation to cardiac ultrasound (echocardiography) images for the detection of endocardial, epicardial and myocardial centerline
    • …
    corecore