179 research outputs found
Deep learning: an introduction for applied mathematicians
Multilayered artificial neural networks are becoming a pervasive tool in a
host of application fields. At the heart of this deep learning revolution are
familiar concepts from applied and computational mathematics; notably, in
calculus, approximation theory, optimization and linear algebra. This article
provides a very brief introduction to the basic ideas that underlie deep
learning from an applied mathematics perspective. Our target audience includes
postgraduate and final year undergraduate students in mathematics who are keen
to learn about the area. The article may also be useful for instructors in
mathematics who wish to enliven their classes with references to the
application of deep learning techniques. We focus on three fundamental
questions: What is a deep neural network? How is a network trained? What is the
stochastic gradient method? We illustrate the ideas with a short MATLAB code
that sets up and trains a network. We also show the use of state-of-the-art
software on a large-scale image classification problem. We finish with
references to the current literature.
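The abstract's three questions can be sketched together in a few lines: a tiny two-layer network trained with the stochastic gradient method, where each update uses one randomly chosen training example. This is a minimal illustrative sketch in NumPy (the paper's own example is in MATLAB); the XOR data, layer sizes, and learning rate here are assumptions chosen for brevity, not taken from the article.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy task: learn XOR with a 2 -> 8 -> 1 sigmoid network.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

W1 = rng.normal(size=(2, 8)); b1 = np.zeros(8)
W2 = rng.normal(size=(8, 1)); b2 = np.zeros(1)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

lr = 1.0
for step in range(5000):
    i = rng.integers(len(X))            # stochastic gradient: one random sample
    x, t = X[i:i + 1], y[i:i + 1]
    # Forward pass through the two layers.
    h = sigmoid(x @ W1 + b1)
    p = sigmoid(h @ W2 + b2)
    # Backward pass: chain rule on the squared error (p - t)^2 / 2.
    dp = (p - t) * p * (1 - p)
    dh = (dp @ W2.T) * h * (1 - h)
    # Gradient step on all weights and biases.
    W2 -= lr * h.T @ dp; b2 -= lr * dp.sum(axis=0)
    W1 -= lr * x.T @ dh; b1 -= lr * dh.sum(axis=0)

preds = sigmoid(sigmoid(X @ W1 + b1) @ W2 + b2)
```

After training, `preds` holds the network's outputs on the four inputs; because the updates are stochastic, the exact values depend on the random seed.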
Discriminative learning with latent variables for cluttered indoor scene understanding
Color-to-Grayscale: Does the Method Matter in Image Recognition?
In image recognition it is often assumed the method used to convert color images to grayscale has little impact on recognition performance. We compare thirteen different grayscale algorithms with four types of image descriptors and demonstrate that this assumption is wrong: not all color-to-grayscale algorithms work equally well, even when using descriptors that are robust to changes in illumination. These methods are tested using a modern descriptor-based image recognition framework, on face, object, and texture datasets, with relatively few training instances. We identify a simple method that generally works best for face and object recognition, and two that work well for recognizing textures.
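To see why the conversion choice can matter, compare two common color-to-grayscale mappings on the same pixel. The two methods below (unweighted channel average, and a luminance-weighted sum using the ITU-R BT.601 coefficients) are generic illustrations, not necessarily among the thirteen algorithms the paper evaluates.

```python
import numpy as np

def average_gray(rgb):
    """Unweighted mean of the R, G, B channels."""
    return rgb.mean(axis=-1)

def luminance_gray(rgb):
    """Weighted sum approximating perceived brightness (ITU-R BT.601)."""
    weights = np.array([0.299, 0.587, 0.114])
    return rgb @ weights

# A saturated red pixel: the two conversions disagree.
pixel = np.array([[1.0, 0.0, 0.0]])
print(average_gray(pixel))    # [0.33333333]
print(luminance_gray(pixel))  # [0.299]
```

Even this small gap shifts intensity gradients, and hence the values of edge- and gradient-based descriptors computed downstream, which is the effect the paper measures.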
Shape Retrieval of Non-rigid 3D Human Models
3D models of humans are commonly used within computer graphics and vision, and so the ability to distinguish between body shapes is an important shape retrieval problem. We extend our recent paper which provided a benchmark for testing non-rigid 3D shape retrieval algorithms on 3D human models. This benchmark provided a far stricter challenge than previous shape benchmarks. We have added 145 new models for use as a separate training set, in order to standardise the training data used and provide a fairer comparison. We have also included experiments with the FAUST dataset of human scans. All participants of the previous benchmark study have taken part in the new tests reported here, many providing updated results using the new data. In addition, further participants have also taken part, and we provide extra analysis of the retrieval results. A total of 25 different shape retrieval methods are compared.
Unsupervised segmentation of noisy electron microscopy images using salient watersheds and region merging
Region-based progressive localization of cell nuclei in microscopic images with data adaptive modeling
I have seen enough: Transferring parts across categories
The recent successes of deep learning have been possible due to the availability of increasingly large quantities of annotated data. A natural question, therefore, is whether further progress can be indefinitely sustained by annotating more data, or whether there is a saturation point beyond which a problem is essentially solved, or the capacity of a model is saturated. In this paper we examine this question from the viewpoint of learning shareable semantic parts, a fundamental building block for generalizing visual knowledge between object categories. We ask two research questions often neglected: whether semantic parts are also visually shareable between classes, and how many annotations are required to learn them. In order to answer these questions, we collect 15,000 images of 100 animal classes and annotate them with parts. We then thoroughly test active learning and domain adaptation techniques to generalize parts learned from a limited number of classes and example images to unseen classes. Our experiments show that, for a majority of the classes, part annotations transfer well, and that performance reaches 98% of the accuracy of the fully annotated scenario by providing only a few thousand examples.