On orthogonal projections for dimension reduction and applications in augmented target loss functions for learning problems

Author: Breger Anna
Orlando Jose Ignacio
Harar Pavol
Dörfler Monika
Klimscha Sophie
Grechenig Christoph
Gerendas Bianca S.
Schmidt-Erfurth Ursula
Ehler Martin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009

The use of orthogonal projections on high-dimensional input and target data in learning frameworks is studied. First, we investigate the relations between two standard objectives in dimension reduction: preservation of variance and preservation of pairwise relative distances. Investigations of their asymptotic correlation, as well as numerical experiments, show that a projection usually does not satisfy both objectives at once. In a standard classification problem, we determine projections on the input data that balance the objectives and compare subsequent results. Next, we extend our application of orthogonal projections to deep learning tasks and introduce a general framework of augmented target loss functions. These loss functions integrate additional information via transformations and projections of the target data. In two supervised learning problems, clinical image segmentation and music information classification, the application of our proposed augmented target loss functions increases the accuracy
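The two objectives named in this abstract can be made concrete with a small numerical sketch. The code below is an illustration only, not the authors' method: it draws a random orthogonal projection (via Gram-Schmidt), applies it to synthetic Gaussian data, and measures the fraction of total variance kept and the spread of pairwise distance ratios. All function names, the data, and the dimensions are assumptions chosen for the example.

```python
import math
import random


def orthonormal_basis(k, d, rng):
    """Return k orthonormal vectors in R^d (Gram-Schmidt on Gaussian draws)."""
    basis = []
    while len(basis) < k:
        v = [rng.gauss(0.0, 1.0) for _ in range(d)]
        for q in basis:
            dot = sum(a * b for a, b in zip(v, q))
            v = [a - dot * b for a, b in zip(v, q)]
        norm = math.sqrt(sum(a * a for a in v))
        if norm > 1e-12:  # skip the (unlikely) degenerate draw
            basis.append([a / norm for a in v])
    return basis


def project(x, basis):
    """Coordinates of x in the subspace spanned by the orthonormal basis."""
    return [sum(a * b for a, b in zip(x, q)) for q in basis]


def total_variance(points):
    """Sum of per-coordinate variances (trace of the covariance matrix)."""
    n, dim = len(points), len(points[0])
    means = [sum(p[j] for p in points) / n for j in range(dim)]
    return sum((p[j] - means[j]) ** 2 for p in points for j in range(dim)) / n


def distance_ratios(points, projected):
    """Projected distance divided by original distance, for every pair."""
    ratios = []
    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            d_full = math.dist(points[i], points[j])
            d_proj = math.dist(projected[i], projected[j])
            ratios.append(d_proj / d_full)
    return ratios


# Hypothetical setup: 50 Gaussian points in R^20, projected to 5 dimensions.
rng = random.Random(0)
d, k, n = 20, 5, 50
points = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(n)]
basis = orthonormal_basis(k, d, rng)
projected = [project(p, basis) for p in points]

# Objective 1: fraction of total variance preserved by the projection.
variance_kept = total_variance(projected) / total_variance(points)

# Objective 2: how uniformly pairwise distances shrink (smaller spread
# means relative distances are better preserved).
ratios = distance_ratios(points, projected)
ratio_spread = max(ratios) - min(ratios)

print(f"variance kept: {variance_kept:.3f}")
print(f"distance ratio spread: {ratio_spread:.3f}")
```

Comparing `variance_kept` and `ratio_spread` across several candidate projections is one simple way to see that a projection scoring well on one measure need not score well on the other.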

arXiv.org e-Print Archive

VU Research Portal

Permanent Hosting, Archiving and Indexing of Digital Resources and Assets

Digital library of Brno University of Technology

Res2Net: A New Multi-scale Backbone Architecture

Author: Cheng Ming-Ming
Gao Shang-Hua
Torr Philip
Yang Ming-Hsuan
Zhang Xin-Yu
Zhao Kai
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 30/08/2019

Representing features at multiple scales is of great importance for numerous vision tasks. Recent advances in backbone convolutional neural networks (CNNs) continually demonstrate stronger multi-scale representation ability, leading to consistent performance gains on a wide range of applications. However, most existing methods represent the multi-scale features in a layer-wise manner. In this paper, we propose a novel building block for CNNs, namely Res2Net, by constructing hierarchical residual-like connections within one single residual block. The Res2Net represents multi-scale features at a granular level and increases the range of receptive fields for each network layer. The proposed Res2Net block can be plugged into state-of-the-art backbone CNN models, e.g., ResNet, ResNeXt, and DLA. We evaluate the Res2Net block on all these models and demonstrate consistent performance gains over baseline models on widely used datasets, e.g., CIFAR-100 and ImageNet. Ablation studies and experimental results on representative computer vision tasks, i.e., object detection, class activation mapping, and salient object detection, further verify the superiority of the Res2Net over the state-of-the-art baseline methods. The source code and trained models are available at https://mmcheng.net/res2net/.
Comment: 11 pages, 7 figure
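The "hierarchical residual-like connections within one single residual block" can be sketched in a few lines. The abstract does not spell out the wiring, so the scheme below is an assumption based on the commonly described Res2Net design: the channels are split into s groups, the first group passes through unchanged, and each later group is transformed after adding the previous group's output. The function name and the stand-in transforms (plain callables instead of 3x3 convolutions) are hypothetical.

```python
def res2net_style_merge(x, transforms):
    """Chain channel groups in the hierarchical, residual-like pattern.

    x          -- flat list of channel values (stand-in for a feature map)
    transforms -- s-1 callables, one per group after the first; in a real
                  block each would be a 3x3 convolution (assumption)
    """
    s = len(transforms) + 1  # number of channel groups ("scale")
    if len(x) % s != 0:
        raise ValueError("channel count must be divisible by the scale")
    width = len(x) // s
    groups = [x[i * width:(i + 1) * width] for i in range(s)]

    outputs = [groups[0]]  # y1 = x1: first group is passed through as-is
    prev = None
    for group, transform in zip(groups[1:], transforms):
        # y2 = K2(x2); for i > 2, yi = Ki(xi + y_{i-1})
        inp = group if prev is None else [a + b for a, b in zip(group, prev)]
        prev = transform(inp)
        outputs.append(prev)

    # Concatenate all group outputs back into one channel vector.
    return [value for group in outputs for value in group]


# Usage with a toy transform that doubles every value.
double = lambda values: [2 * v for v in values]
merged = res2net_style_merge([1, 2, 3, 4, 5, 6, 7, 8], [double, double, double])
print(merged)  # → [1, 2, 6, 8, 22, 28, 58, 72]
```

Because each group's output feeds the next group's input, later groups see the result of more stacked transforms, which is how a single block can mix several effective receptive-field sizes.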

arXiv.org e-Print Archive

Oxford University Research Archive

Distributed Low-rank Subspace Segmentation

Author: Chang Shih-Fu
Jordan Michael I.
Mackey Lester
Mu Yadong
Talwalkar Ameet
Publication date: 15/10/2013

Vision problems ranging from image clustering to motion segmentation to semi-supervised learning can naturally be framed as subspace segmentation problems, in which one aims to recover multiple low-dimensional subspaces from noisy and corrupted input data. Low-Rank Representation (LRR), a convex formulation of the subspace segmentation problem, is provably and empirically accurate on small problems but does not scale to the massive sizes of modern vision datasets. Moreover, past work aimed at scaling up low-rank matrix factorization is not applicable to LRR given its non-decomposable constraints. In this work, we propose a novel divide-and-conquer algorithm for large-scale subspace segmentation that can cope with LRR's non-decomposable constraints and maintains LRR's strong recovery guarantees. This has immediate implications for the scalability of subspace segmentation, which we demonstrate on a benchmark face recognition dataset and in simulations. We then introduce novel applications of LRR-based subspace segmentation to large-scale semi-supervised learning for multimedia event detection, concept detection, and image tagging. In each case, we obtain state-of-the-art results and order-of-magnitude speedups

arXiv.org e-Print Archive

Crossref

Towards automatic pulmonary nodule management in lung cancer screening with deep learning

Author: Chung Kaman
Ciompi Francesco
Gerke Paul K.
Jacobs Colin
Marchiano Alfonso
Pastorino Ugo
Prokop Mathias
Schaefer-Prokop Cornelia
Scholten Ernst Th.
Setio Arnaud Arindra Adiyoso
van Ginneken Bram
van Riel Sarah J.
Wille Mathilde M. W.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017

The introduction of lung cancer screening programs will produce an unprecedented number of chest CT scans in the near future, which radiologists will have to read in order to decide on a patient follow-up strategy. According to the current guidelines, the workup of screen-detected nodules strongly relies on nodule size and nodule type. In this paper, we present a deep learning system based on multi-stream multi-scale convolutional networks, which automatically classifies all nodule types relevant for nodule workup. The system processes raw CT data containing a nodule without the need for any additional information such as nodule segmentation or nodule size, and learns a representation of 3D data by analyzing an arbitrary number of 2D views of a given nodule. The deep learning system was trained with data from the Italian MILD screening trial and validated on an independent set of data from the Danish DLCST screening trial. We analyze the advantage of processing nodules at multiple scales with a multi-stream convolutional network architecture, and we show that the proposed deep learning system achieves performance at classifying nodule type that surpasses that of classical machine learning approaches and is within the inter-observer variability among four experienced human observers.
Comment: Published on Scientific Report

arXiv.org e-Print Archive

Radboud Repository