Search CORE

10,088 research outputs found

Convolutional Patch Networks with Spatial Prior for Road Detection and Urban Scene Understanding

Author: Brust Clemens-Alexander
Denzler Joachim
Rodner Erik
Sickert Sven
Simon Marcel
Publication venue
Publication date: 01/01/2015
Field of study

Classifying single image patches is important in many different applications, such as road detection or scene understanding. In this paper, we present convolutional patch networks, which are convolutional networks learned to distinguish different image patches and which can be used for pixel-wise labeling. We also show how to incorporate spatial information of the patch as an input to the network, which allows for learning spatial priors for certain categories jointly with an appearance model. In particular, we focus on road detection and urban scene understanding, two application areas where we are able to achieve state-of-the-art results on the KITTI as well as on the LabelMeFacade dataset. Furthermore, our paper offers a guideline for people working in the area and desperately wandering through all the painstaking details that render training CNs on image patches extremely difficult.Comment: VISAPP 2015 pape

arXiv.org e-Print Archive

CiteSeerX

Crossref

Vessel tractography using an intensity based tensor model

Author: Cetin Suheyla
Degertekin Muzaffer
Demir Ali
Değertekin Muzaffer
Unal Gozde
Yezzi Anthony Joseph
Çetin Süheyla
Ünal Gözde
Publication venue: Miccai Conference
Publication date: 01/01/2011
Field of study

In this paper, we propose a novel tubular structure segmen- tation method, which is based on an intensity-based tensor that fits to a vessel. Our model is initialized with a single seed point and it is ca- pable of capturing whole vessel tree by an automatic branch detection algorithm. The centerline of the vessel as well as its thickness is extracted. We demonstrated the performance of our algorithm on 3 complex contrast varying tubular structured synthetic datasets for quantitative validation. Additionally, extracted arteries from 10 CTA (Computed Tomography An- giography) volumes are qualitatively evaluated by a cardiologist expert’s visual scores

Sabanci University Research Database

Automatic Environmental Sound Recognition: Performance versus Computational Cost

Author: Krstulovic Sacha
Plumbley Mark D.
Sigtia Siddharth
Stark Adam M.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 15/07/2016
Field of study

In the context of the Internet of Things (IoT), sound sensing applications are required to run on embedded platforms where notions of product pricing and form factor impose hard constraints on the available computing power. Whereas Automatic Environmental Sound Recognition (AESR) algorithms are most often developed with limited consideration for computational cost, this article seeks which AESR algorithm can make the most of a limited amount of computing power by comparing the sound classification performance em as a function of its computational cost. Results suggest that Deep Neural Networks yield the best ratio of sound classification accuracy across a range of computational costs, while Gaussian Mixture Models offer a reasonable accuracy at a consistently small cost, and Support Vector Machines stand between both in terms of compromise between accuracy and computational cost

arXiv.org e-Print Archive

Surrey Research Insight

Log-Euclidean Bag of Words for Human Action Recognition

Author: Bhatia R.
Conrad Sanderson
Lazebnik S.
Masoud Faraki
Maziar Palhang
Wong Y.
Publication venue: 'Institution of Engineering and Technology (IET)'
Publication date: 01/01/2015
Field of study

Representing videos by densely extracted local space-time features has recently become a popular approach for analysing actions. In this paper, we tackle the problem of categorising human actions by devising Bag of Words (BoW) models based on covariance matrices of spatio-temporal features, with the features formed from histograms of optical flow. Since covariance matrices form a special type of Riemannian manifold, the space of Symmetric Positive Definite (SPD) matrices, non-Euclidean geometry should be taken into account while discriminating between covariance matrices. To this end, we propose to embed SPD manifolds to Euclidean spaces via a diffeomorphism and extend the BoW approach to its Riemannian version. The proposed BoW approach takes into account the manifold geometry of SPD matrices during the generation of the codebook and histograms. Experiments on challenging human action datasets show that the proposed method obtains notable improvements in discrimination accuracy, in comparison to several state-of-the-art methods

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

Queensland University of Technology ePrints Archive

University of Queensland eSpace

Polygonal Building Segmentation by Frame Field Learning

Author: Girard Nicolas
Smirnov Dmitriy
Solomon Justin
Tarabalka Yuliya
Publication venue
Publication date: 31/03/2021
Field of study

While state of the art image segmentation models typically output segmentations in raster format, applications in geographic information systems often require vector polygons. To help bridge the gap between deep network output and the format used in downstream tasks, we add a frame field output to a deep segmentation model for extracting buildings from remote sensing images. We train a deep neural network that aligns a predicted frame field to ground truth contours. This additional objective improves segmentation quality by leveraging multi-task learning and provides structural information that later facilitates polygonization; we also introduce a polygonization algorithm that utilizes the frame field along with the raster segmentation. Our code is available at https://github.com/Lydorn/Polygonization-by-Frame-Field-Learning.Comment: CVPR 2021 - IEEE Conference on Computer Vision and Pattern Recognition, Jun 2021, Pittsburg / Virtual, United State

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server