Search CORE

8,970 research outputs found

3D ShapeNets: A Deep Representation for Volumetric Shapes

Author: Khosla Aditya
Song Shuran
Tang Xiaoou
Wu Zhirong
Xiao Jianxiong
Yu Fisher
Zhang Linguang
Publication venue
Publication date: 01/01/2015
Field of study

3D shape is a crucial but heavily underutilized cue in today's computer vision systems, mostly due to the lack of a good generic shape representation. With the recent availability of inexpensive 2.5D depth sensors (e.g. Microsoft Kinect), it is becoming increasingly important to have a powerful 3D shape representation in the loop. Apart from category recognition, recovering full 3D shapes from view-based 2.5D depth maps is also a critical part of visual understanding. To this end, we propose to represent a geometric 3D shape as a probability distribution of binary variables on a 3D voxel grid, using a Convolutional Deep Belief Network. Our model, 3D ShapeNets, learns the distribution of complex 3D shapes across different object categories and arbitrary poses from raw CAD data, and discovers hierarchical compositional part representations automatically. It naturally supports joint object recognition and shape completion from 2.5D depth maps, and it enables active object recognition through view planning. To train our 3D deep learning model, we construct ModelNet -- a large-scale 3D CAD model dataset. Extensive experiments show that our 3D deep representation enables significant performance improvement over the-state-of-the-arts in a variety of tasks.Comment: to be appeared in CVPR 201

arXiv.org e-Print Archive

Princeton University Open Access Repository

Crossref

ZENODO

Geometric deep learning: going beyond Euclidean data

Author: Bronstein Michael M.
Bruna Joan
LeCun Yann
Szlam Arthur
Vandergheynst Pierre
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 28/11/2016
Field of study

Many scientific fields study data with an underlying structure that is a non-Euclidean space. Some examples include social networks in computational social sciences, sensor networks in communications, functional networks in brain imaging, regulatory networks in genetics, and meshed surfaces in computer graphics. In many applications, such geometric data are large and complex (in the case of social networks, on the scale of billions), and are natural targets for machine learning techniques. In particular, we would like to use deep neural networks, which have recently proven to be powerful tools for a broad range of problems from computer vision, natural language processing, and audio analysis. However, these tools have been most successful on data with an underlying Euclidean or grid-like structure, and in cases where the invariances of these structures are built into networks used to model them. Geometric deep learning is an umbrella term for emerging techniques attempting to generalize (structured) deep neural models to non-Euclidean domains such as graphs and manifolds. The purpose of this paper is to overview different examples of geometric deep learning problems and present available solutions, key difficulties, applications, and future research directions in this nascent field

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Geometric deep learning

Author: Andreux M.
Boscaini D.
Bruna J.
Choromanska A.
Clevert D.
Cosmo L.
Dechter R.
Erhan D.
Glorot X.
Goodfellow I.
Gregor K.
Han X.
Hochreiter S.
Jaderberg M.
Kawaguchi K.
Kingma D. P.
Krizhevsky A.
Lähner Z.
Mallat S.
Masci J.
Mnih V.
Radford A.
Rodolà E.
Rusinkiewicz S.
Srivastava R. K.
Srivastava R. K.
Stollenga M.
Wang S.
Werbos P. J.
Wu Z.
Xu K.
Yu F.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2016
Field of study

The goal of these course notes is to describe the main mathematical ideas behind geometric deep learning and to provide implementation details for several applications in shape analysis and synthesis, computer vision and computer graphics. The text in the course materials is primarily based on previously published work. With these notes we gather and provide a clear picture of the key concepts and techniques that fall under the umbrella of geometric deep learning, and illustrate the applications they enable. We also aim to provide practical implementation details for the methods presented in these works, as well as suggest further readings and extensions of these ideas

Crossref

Archivio della ricerca - Fondazione Bruno Kessler

Archivio della ricerca- Università di Roma La Sapienza

Discrete spherical means of directional derivatives and Veronese maps

Author: Alexander Belyaev
Arnold
Aubert
Bannai
Bickley
Birkhoff
Boris Khesin
Bradford
Childress
Colonius
Delsarte
Friedman
Gonzalez
Gordon
Hamming
Harris
Hong
Jähne
Kamgar-Parsi
Kantorovich
Koenderink
Langer
Natterer
Patra
Perona
Perona
Petrila
Pottmann
Pratt
Schneider
Serge Tabachnikov
Seymour
Stroud
Weickert
Weickert
Weickert
Zhong
Publication venue: 'Elsevier BV'
Publication date: 01/01/2011
Field of study

We describe and study geometric properties of discrete circular and spherical means of directional derivatives of functions, as well as discrete approximations of higher order differential operators. For an arbitrary dimension we present a general construction for obtaining discrete spherical means of directional derivatives. The construction is based on using the Minkowski's existence theorem and Veronese maps. Approximating the directional derivatives by appropriate finite differences allows one to obtain finite difference operators with good rotation invariance properties. In particular, we use discrete circular and spherical means to derive discrete approximations of various linear and nonlinear first- and second-order differential operators, including discrete Laplacians. A practical potential of our approach is demonstrated by considering applications to nonlinear filtering of digital images and surface curvature estimation

arXiv.org e-Print Archive

Heriot Watt Pure

Crossref

MPG.PuRe

A multiresolution framework for local similarity based image denoising

Author: Aharon
Barash
Black
Bosch
Brox
Buades
Burt
Chang
Chen
Dabov
Demanet
Do
Do
Donoho
Donoho
Farbman
Foi
Garnett
Hou
Irfan Butt
Jung
Katkovnik
Lu
Mahmoudi
Mallat
Meyer
Nasir Rajpoot
Otsu
Perona
Rajpoot
Rajpoot
Scherzer
Simoncelli
Starck
Tasdizen
Tomasi
van de Ville
Villemoes
Wang
Xie
Yao
Yu
Zhang
Publication venue: 'Elsevier BV'
Publication date: 01/08/2012
Field of study

In this paper, we present a generic framework for denoising of images corrupted with additive white Gaussian noise based on the idea of regional similarity. The proposed framework employs a similarity function using the distance between pixels in a multidimensional feature space, whereby multiple feature maps describing various local regional characteristics can be utilized, giving higher weight to pixels having similar regional characteristics. An extension of the proposed framework into a multiresolution setting using wavelets and scale space is presented. It is shown that the resulting multiresolution multilateral (MRM) filtering algorithm not only eliminates the coarse-grain noise but can also faithfully reconstruct anisotropic features, particularly in the presence of high levels of noise

Crossref

Warwick Research Archives Portal Repository

Graph Signal Processing: Overview, Challenges and Applications

Author: Frossard Pascal
Kovačević Jelena
Moura José M. F.
Ortega Antonio
Vandergheynst Pierre
Publication venue
Publication date: 26/03/2018
Field of study

Research in Graph Signal Processing (GSP) aims to develop tools for processing data defined on irregular graph domains. In this paper we first provide an overview of core ideas in GSP and their connection to conventional digital signal processing. We then summarize recent developments in developing basic GSP tools, including methods for sampling, filtering or graph learning. Next, we review progress in several application areas using GSP, including processing and analysis of sensor network data, biological data, and applications to image processing and machine learning. We finish by providing a brief historical perspective to highlight how concepts recently developed in GSP build on top of prior research in other areas.Comment: To appear, Proceedings of the IEE

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Pointwise Convolutional Neural Networks

Author: Hua Binh-Son
Tran Minh-Khoi
Yeung Sai-Kit
Publication venue
Publication date: 29/03/2018
Field of study

Deep learning with 3D data such as reconstructed point clouds and CAD models has received great research interests recently. However, the capability of using point clouds with convolutional neural network has been so far not fully explored. In this paper, we present a convolutional neural network for semantic segmentation and object recognition with 3D point clouds. At the core of our network is pointwise convolution, a new convolution operator that can be applied at each point of a point cloud. Our fully convolutional network design, while being surprisingly simple to implement, can yield competitive accuracy in both semantic segmentation and object recognition task.Comment: 10 pages, 6 figures, 10 tables. Paper accepted to CVPR 201

arXiv.org e-Print Archive

Crossref

Multi-view Convolutional Neural Networks for 3D Shape Recognition

Author: Kalogerakis Evangelos
Learned-Miller Erik
Maji Subhransu
Su Hang
Publication venue
Publication date: 27/09/2015
Field of study

A longstanding question in computer vision concerns the representation of 3D shapes for recognition: should 3D shapes be represented with descriptors operating on their native 3D formats, such as voxel grid or polygon mesh, or can they be effectively represented with view-based descriptors? We address this question in the context of learning to recognize 3D shapes from a collection of their rendered views on 2D images. We first present a standard CNN architecture trained to recognize the shapes' rendered views independently of each other, and show that a 3D shape can be recognized even from a single view at an accuracy far higher than using state-of-the-art 3D shape descriptors. Recognition rates further increase when multiple views of the shapes are provided. In addition, we present a novel CNN architecture that combines information from multiple views of a 3D shape into a single and compact shape descriptor offering even better recognition performance. The same architecture can be applied to accurately recognize human hand-drawn sketches of shapes. We conclude that a collection of 2D views can be highly informative for 3D shape recognition and is amenable to emerging CNN architectures and their derivatives.Comment: v1: Initial version. v2: An updated ModelNet40 training/test split is used; results with low-rank Mahalanobis metric learning are added. v3 (ICCV 2015): A second camera setup without the upright orientation assumption is added; some accuracy and mAP numbers are changed slightly because a small issue in mesh rendering related to specularities is fixe

arXiv.org e-Print Archive

CiteSeerX

Crossref