309 research outputs found

    The Unbalanced Gromov Wasserstein Distance: Conic Formulation and Relaxation

    Full text link
    Comparing metric measure spaces (i.e. a metric space endowed with aprobability distribution) is at the heart of many machine learning problems. The most popular distance between such metric measure spaces is theGromov-Wasserstein (GW) distance, which is the solution of a quadratic assignment problem. The GW distance is however limited to the comparison of metric measure spaces endowed with a probability distribution.To alleviate this issue, we introduce two Unbalanced Gromov-Wasserstein formulations: a distance and a more tractable upper-bounding relaxation.They both allow the comparison of metric spaces equipped with arbitrary positive measures up to isometries. The first formulation is a positive and definite divergence based on a relaxation of the mass conservation constraint using a novel type of quadratically-homogeneous divergence. This divergence works hand in hand with the entropic regularization approach which is popular to solve large scale optimal transport problems. We show that the underlying non-convex optimization problem can be efficiently tackled using a highly parallelizable and GPU-friendly iterative scheme. The second formulation is a distance between mm-spaces up to isometries based on a conic lifting. Lastly, we provide numerical experiments onsynthetic examples and domain adaptation data with a Positive-Unlabeled learning task to highlight the salient features of the unbalanced divergence and its potential applications in ML

    Multiscale neighborhood-wise decision fusion for redundancy detection in image pairs

    Get PDF
    SIAM Journal Multiscale Modeling & SimulationTo develop better image change detection algorithms, new models able to capture spatio-temporal regularities and geometries present in an image pair are needed. In this paper, we propose a multiscale formulation for modeling semi-local inter-image interactions and detecting local or regional changes in an image pair. By introducing dissimilarity measures to compare patches and binary local decisions, we design collaborative decision rules that use the total number of detections obtained from the neighboring pixels, for different patch sizes. We study the statistical properties of the non-parametric detection approach that guarantees small probabilities of false alarms. Experimental results on several applications demonstrate that the detection algorithm (with no optical flow computation) performs well at detecting occlusions and meaningful changes for a variety of illumination conditions and signal-to-noise ratios. The number of control parameters of the algorithm is small and the adjustment is intuitive in most cases

    Municipal Road Infrastructure Assessment Using Street View Images

    Get PDF
    Road quality assessment is a crucial part in Municipalities' work to maintain their infrastructure, plan upgrades, and manage their budgets. Properly maintaining this infrastructure relies heavily on consistently monitoring its condition and deterioration over time. This can be a challenge, especially in larger towns and cities where there is a lot of city property to keep an eye on. Municipalities rely on surveyors to keep them up to date on the condition of their infrastructure to prevent this failure before it happens. This is both to prevent injuries and further damage from occurring as a result of infrastructure failure, and since it is can be more cost effective to maintain property rather than have to replace it. Surveying can either be done manually or automatically, but it is not done frequently as it is expensive and also time consuming. Manual surveying can be inaccurate, while a large portion of automatic surveying techniques rely on expensive equipment. To solve this problem, we propose an automated infrastructure assessment method that relies on Street View images for its input and uses various computer vision and pattern recognition methods to generate its assessments. First, we segment the image into 'road' and 'background' regions. We propose a road segmentation algorithm specifically aimed at segmenting roads from street view images. We use Fisher vectors calculated on SIFT descriptors to encode small windows extracted from the main image at multiple scales. Then we classify these patches using an SVM and utilize a Gaussian voting scheme to obtain a segmentation. We additionally utilize a spatial prior to improve this segmentation. Optionally, we improve the segmentation further by making use of a weighted contour map calculated on a shadow-free intrinsic image, and a find an optimal segmentation by utilizing a purity tree. Our algorithm performs well and outputs a good segmentation for further use in road evaluation. We test our method on the KITTI road dataset, and compare it to the state-of-the-art on this dataset, along with a manually annotated subset of Google Street View. After segmenting the road, we describe an algorithm aimed at identifying distressed road regions and pinpointing cracks within them. We predict distressed regions by re-using the computed Fisher vectors and classifying them with a different SVM trained to distinguish between road qualities. We follow this step with a comparison to the weighed contour map within these distressed regions to identify exact crack and defect locations, and use the contour weights to predict the crack severity. Promising results are obtained on our manually annotated dataset, which indicate the viability of using this cost-effective system to perform road quality assessment at a municipal level
    • …
    corecore