Search CORE

182,180 research outputs found

Mutual Information based Semi-Global Stereo Matching on the GPU

Author: D. Scharstein
H. Hirschmüller
H. Hirschmüller
J. Owens
S. Birchfield
T. Scheuermann
Publication venue
Publication date: 01/01/2008
Field of study

Institute of Transport Research:Publications

Crossref

Multi-layer Architecture For Storing Visual Data Based on WCF and Microsoft SQL Server Database

Author: A. Biniaz
C.L. Zitnick
D.G. Lowe
H. Bay
H. Bay
J. Śmietański
J.L. Chu
K. Łapa
M. Bazarganigilani
M. Chen
M. Chromiak
M. Zalasiński
M.R. Ogiela
M.R. Ogiela
P. Drozda
P. Drozda
R. Grycuk
R. Grycuk
R. Grycuk
R. Hirschheim
R.C. Veltkamp
S. Makinana
S. Mallik
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

In this paper we present a novel architecture for storing visual data. Effective storing, browsing and searching collections of images is one of the most important challenges of computer science. The design of architecture for storing such data requires a set of tools and frameworks such as SQL database management systems and service-oriented frameworks. The proposed solution is based on a multi-layer architecture, which allows to replace any component without recompilation of other components. The approach contains five components, i.e. Model, Base Engine, Concrete Engine, CBIR service and Presentation. They were based on two well-known design patterns: Dependency Injection and Inverse of Control. For experimental purposes we implemented the SURF local interest point detector as a feature extractor and

K

-means clustering as indexer. The presented architecture is intended for content-based retrieval systems simulation purposes as well as for real-world CBIR tasks.Comment: Accepted for the 14th International Conference on Artificial Intelligence and Soft Computing, ICAISC, June 14-18, 2015, Zakopane, Polan

arXiv.org e-Print Archive

Crossref

Retrieval of shape from silhouettes

Author: Bottino Andrea Giuseppe
Laurentini Aldo
Publication venue: ACADEMIC PRESS
Publication date: 01/01/2006
Field of study

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino

A graph-based mathematical morphology reader

Author: Cousty Jean
Najman Laurent
Publication venue: 'Elsevier BV'
Publication date: 01/01/2014
Field of study

This survey paper aims at providing a "literary" anthology of mathematical morphology on graphs. It describes in the English language many ideas stemming from a large number of different papers, hence providing a unified view of an active and diverse field of research

arXiv.org e-Print Archive

CiteSeerX

Crossref

HAL Descartes

HAL-Ecole des Ponts ParisTech

HAL - UPEC / UPEM

Anytime Stereo Image Depth Estimation on Mobile Devices

Author: Campbell Mark
Huang Gao
Lai Zihang
van der Maaten Laurens
Wang Brian H.
Wang Yan
Weinberger Kilian Q.
Publication venue
Publication date: 05/03/2019
Field of study

Many applications of stereo depth estimation in robotics require the generation of accurate disparity maps in real time under significant computational constraints. Current state-of-the-art algorithms force a choice between either generating accurate mappings at a slow pace, or quickly generating inaccurate ones, and additionally these methods typically require far too many parameters to be usable on power- or memory-constrained devices. Motivated by these shortcomings, we propose a novel approach for disparity prediction in the anytime setting. In contrast to prior work, our end-to-end learned approach can trade off computation and accuracy at inference time. Depth estimation is performed in stages, during which the model can be queried at any time to output its current best estimate. Our final model can process 1242

\times

375 resolution images within a range of 10-35 FPS on an NVIDIA Jetson TX2 module with only marginal increases in error -- using two orders of magnitude fewer parameters than the most competitive baseline. The source code is available at https://github.com/mileyan/AnyNet .Comment: Accepted by ICRA201

arXiv.org e-Print Archive

Crossref

Recurrent Saliency Transformation Network: Incorporating Multi-Stage Visual Cues for Small Organ Segmentation

Author: Fishman Elliot K.
Wang Yan
Xie Lingxi
Yu Qihang
Yuille Alan L.
Zhou Yuyin
Publication venue
Publication date: 07/04/2018
Field of study

We aim at segmenting small organs (e.g., the pancreas) from abdominal CT scans. As the target often occupies a relatively small region in the input image, deep neural networks can be easily confused by the complex and variable background. To alleviate this, researchers proposed a coarse-to-fine approach, which used prediction from the first (coarse) stage to indicate a smaller input region for the second (fine) stage. Despite its effectiveness, this algorithm dealt with two stages individually, which lacked optimizing a global energy function, and limited its ability to incorporate multi-stage visual cues. Missing contextual information led to unsatisfying convergence in iterations, and that the fine stage sometimes produced even lower segmentation accuracy than the coarse stage. This paper presents a Recurrent Saliency Transformation Network. The key innovation is a saliency transformation module, which repeatedly converts the segmentation probability map from the previous iteration as spatial weights and applies these weights to the current iteration. This brings us two-fold benefits. In training, it allows joint optimization over the deep networks dealing with different input scales. In testing, it propagates multi-stage visual information throughout iterations to improve segmentation accuracy. Experiments in the NIH pancreas segmentation dataset demonstrate the state-of-the-art accuracy, which outperforms the previous best by an average of over 2%. Much higher accuracies are also reported on several small organs in a larger dataset collected by ourselves. In addition, our approach enjoys better convergence properties, making it more efficient and reliable in practice.Comment: Accepted to CVPR 2018 (10 pages, 6 figures

arXiv.org e-Print Archive

Crossref