Search CORE

151 research outputs found

Recovering 6D Object Pose: A Review and Multi-modal Analysis

Author: A Tejani
C Sahin
D Hoiem
E Brachmann
H Azizpour
M Everingham
M Everingham
MY Liu
N Correll
O Russakovsky
S Hinterstoisser
S Hinterstoisser
T Hodaň
U Bonde
W Kehl
Publication venue
Publication date: 15/08/2018
Field of study

A large number of studies analyse object detection and pose estimation at visual level in 2D, discussing the effects of challenges such as occlusion, clutter, texture, etc., on the performances of the methods, which work in the context of RGB modality. Interpreting the depth data, the study in this paper presents thorough multi-modal analyses. It discusses the above-mentioned challenges for full 6D object pose estimation in RGB-D images comparing the performances of several 6D detectors in order to answer the following questions: What is the current position of the computer vision community for maintaining "automation" in robotic manipulation? What next steps should the community take for improving "autonomy" in robotics while handling objects? Our findings include: (i) reasonably accurate results are obtained on textured-objects at varying viewpoints with cluttered backgrounds. (ii) Heavy existence of occlusion and clutter severely affects the detectors, and similar-looking distractors is the biggest challenge in recovering instances' 6D. (iii) Template-based methods and random forest-based learning algorithms underlie object detection and 6D pose estimation. Recent paradigm is to learn deep discriminative feature representations and to adopt CNNs taking RGB images as input. (iv) Depending on the availability of large-scale 6D annotated depth datasets, feature representations can be learnt on these datasets, and then the learnt representations can be customized for the 6D problem

arXiv.org e-Print Archive

Crossref

Smarter irrigation scheduling in the sugarcane farming system using the Internet of Things

Author: Attard S.
Everingham Y.
Linton A.L.
McGlinchey M.
Philippa B.
Wang E.
Xiang W.
Publication venue: Australian Society of Sugar Cane Technologists
Publication date: 01/01/2019
Field of study

Better irrigation practices can lead to improved yields through less water stress and reduced water usage to deliver economic benefits for farmers. More and more sugarcane growers are transitioning to automated irrigation in the Burdekin and other regions. Automated irrigation systems can save farmers a significant amount of time by remotely turning on and off pumps and valves. However, the system could be improved if it could be integrated with tools that factor in the weather, crop growing conditions, water deficit, and crop stress, to improve irrigation use efficiency. IrrigWeb is a decision-support tool that is turned to as a solution to this problem. IrrigWeb uses CANEGRO to help farmers decide when to irrigate and how much to apply. Farmers can then use this information to plan their irrigation management. However, managing irrigation is a considerable time investment for Burdekin farmers. A tool is needed to integrate the auto-irrigation system (e.g., WiSA) and IrrigWeb to provide a smarter irrigation solution. An uplink program (WiSA to IrrigWeb) has been successfully developed and implemented as part of a pilot study. It saves farmers a significant amount of time by uploading irrigation and rainfall data automatically instead of the farmer having to input them manually. This paper focuses on developing a smarter irrigation-scheduling tool that connects IrrigWeb to WiSA. A downlink program was developed to download, calculate and apply irrigation schedules automatically. In this process, sugarcane irrigators will spend less time manually setting up irrigation schedules as it will happen automatically. The simulation results demonstrated that the downlink program could improve the scheduling by incorporating practical limitations, such as pumping capacity or pumping time constraints, that are found on the farm

ResearchOnline at James Cook University

Linguistic Structure Guided Context Modeling for Referring Image Segmentation

Author: E Margffoy-Tuay
H Shi
L Yu
LC Chen
M Everingham
R Hu
S Hochreiter
S Qiu
TY Lin
Publication venue
Publication date: 05/10/2020
Field of study

Referring image segmentation aims to predict the foreground mask of the object referred by a natural language sentence. Multimodal context of the sentence is crucial to distinguish the referent from the background. Existing methods either insufficiently or redundantly model the multimodal context. To tackle this problem, we propose a "gather-propagate-distribute" scheme to model multimodal context by cross-modal interaction and implement this scheme as a novel Linguistic Structure guided Context Modeling (LSCM) module. Our LSCM module builds a Dependency Parsing Tree suppressed Word Graph (DPT-WG) which guides all the words to include valid multimodal context of the sentence while excluding disturbing ones through three steps over the multimodal feature, i.e., gathering, constrained propagation and distributing. Extensive experiments on four benchmarks demonstrate that our method outperforms all the previous state-of-the-arts.Comment: Accepted by ECCV 2020. Code is available at https://github.com/spyflying/LSCM-Refse

arXiv.org e-Print Archive

Crossref

Deep Burst Denoising

Author: A Foi
C Dong
C Dong
Chih-Yuan Yang
CRA Chaitanya
E Shelhamer
F Heide
F Heide
H Zhao
J Yang
JJ Hopfield
K Dabov
K Nasrollahi
K Zhang
M Everingham
M Gharbi
M Maggioni
O Ronneberger
PJ Werbos
S Farsiu
SW Hasinoff
Y Chen
Z Liu
Publication venue
Publication date: 15/12/2017
Field of study

Noise is an inherent issue of low-light image capture, one which is exacerbated on mobile devices due to their narrow apertures and small sensors. One strategy for mitigating noise in a low-light situation is to increase the shutter time of the camera, thus allowing each photosite to integrate more light and decrease noise variance. However, there are two downsides of long exposures: (a) bright regions can exceed the sensor range, and (b) camera and scene motion will result in blurred images. Another way of gathering more light is to capture multiple short (thus noisy) frames in a "burst" and intelligently integrate the content, thus avoiding the above downsides. In this paper, we use the burst-capture strategy and implement the intelligent integration via a recurrent fully convolutional deep neural net (CNN). We build our novel, multiframe architecture to be a simple addition to any single frame denoising model, and design to handle an arbitrary number of noisy input frames. We show that it achieves state of the art denoising results on our burst dataset, improving on the best published multi-frame techniques, such as VBM4D and FlexISP. Finally, we explore other applications of image enhancement by integrating content from multiple frames and demonstrate that our DNN architecture generalizes well to image super-resolution

arXiv.org e-Print Archive

Crossref

Detecting Faces, Visual Medium Types, and Gender in Historical Advertisements, 1950–1995

Author: BC Russell
CB Ng
E Eidinger
E Goffman
G Antipov
G Kuipers
JE Schroeder
K Lindner
K Parkin
M Everingham
M Wevers
ME Kang
P Belknap
P Bell
P Bell
P Fyfe
P Scranton
P van der Hoeven
P Viola
R Goldman
R Marchand
S Zafeiriou
TD Conley
W Schreurs
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

Crossref

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Leaf segmentation in plant phenotyping: a collation study

Author: A Hartmann
A Walter
A Walter
Andrew P. French
B Biskup
B Wu
B Yanikoglu
C Granier
C Kalyoncu
C Klukas
C Nieuwenhuis
Christian Klukas
D Martin
D Martin
D Ziou
Danijela Vukadinovic
David M. Kramer
E Aksoy
F Kurugollu
G Cerutti
G Heijden van der
Gerrit Polder
Hanno Scharr
Imanol Luengo
J Canny
J Jin
J Vylder De
J Wang
Jean-Michel Pape
JVB Soares
K Nagel
L Grady
L Quan
L Silva
L Vincent
M Everingham
M Jansen
M Minervini
M Minervini
M Müller-Linow
M Polak
Massimo Minervini
R Achanta
R Adams
S Arvidsson
S Bansal
S Beucher
Sotirios A. Tsaftaris
WK Pratt
Xi Yin
Xiaoming Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Image-based plant phenotyping is a growing application area of computer vision in agriculture. A key task is the segmentation of all individual leaves in images. Here we focus on the most common rosette model plants, Arabidopsis and young tobacco. Although leaves do share appearance and shape characteristics, the presence of occlusions and variability in leaf shape and pose, as well as imaging conditions, render this problem challenging. The aim of this paper is to compare several leaf segmentation solutions on a unique and first-of-its-kind dataset containing images from typical phenotyping experiments. In particular, we report and discuss methods and findings of a collection of submissions for the first Leaf Segmentation Challenge of the Computer Vision Problems in Plant Phenotyping workshop in 2014. Four methods are presented: three segment leaves by processing the distance transform in an unsupervised fashion, and the other via optimal template selection and Chamfer matching. Overall, we find that although separating plant from background can be accomplished with satisfactory accuracy (>>90 % Dice score), individual leaf segmentation and counting remain challenging when leaves overlap. Additionally, accuracy is lower for younger leaves. We find also that variability in datasets does affect outcomes. Our findings motivate further investigations and development of specialized algorithms for this particular application, and that challenges of this form are ideally suited for advancing the state of the art. Data are publicly available (online at http://www.plant-phenotyping.org/datasets) to support future challenges beyond segmentation within this application domain

Nottingham ePrints

Nottingham eTheses

Crossref

Repository@Nottingham

Edinburgh Research Explorer

Wageningen University & Research Publications

Juelich Shared Electronic Resources

IMT Institutional Repository

Family Histories and Women's Retirement: The Role of Childbearing and Marital Experiences

Author: A C Liefbroer
A E Fasang
A M O&apos
A M Pienta
A M Pienta
A M Pienta
A.-R Poortman
B A Simmons
B De Vroom
C A Price
C Dewilde
C Everingham
D B Smith
D Eisenhower
D Hyllegard
D M Blau
D M Blau
D Price
E Skirboll
F R Addo
F Steele
G Guo
G H Elder
H Van Solinge
J Byles
J C Henretta
J D Vlasblom
J De Jong Gierveld
J Ginn
J Ginn
J Ginn
J Hendricks
J M Raymo
J M Raymo
J Wilmoth
K Denaeghel
K F Slevin
K Hank
K Henkens
K Henkens
K L Brewster
Kene Henkens
L Gonz�lez
L Mcdonald
L Zimmerman
M Coleman
M D Hayward
M D Hayward
M Damman
M E Szinovacz
M E Szinovacz
M E Szinovacz
M E Szinovacz
M Honig
M Jansen
M Kalmijn
M Mills
M Mills
Marleen Damman
Matthijs Kalmijn
N G Choi
Oecd
P Frericks
P M De Graaf
P Moen
R A August
R A Settersten
R Boss�
R Schalk
S Arber
S Arber
S Drobni?
S Gustafsson
S T Yabiku
Sudman
T A Beehr
T Fokkema
T H Brown
T Jefferson
T Vartanian
Uk: Aldershot
V E Richardson
Publication venue: 'Elsevier BV'
Publication date: 01/01/2014
Field of study

Crossref

Using Multi-view Recognition and Meta-data Annotation to Guide a Robot's Attention

Author: Alexander Thomas
Bastian Leibe
Bay H.
Belongie S.
Brostow G.
Cheng L.
Cornelis N.
Everingham M.
Everingham M.
Ferrari V.
Fraundorfer F.
Goedemé T.
Han F.
Hassner T.
Hoiem D.
Hoiem D.
Hoiem D.
Kushal A.
Leibe B.
Leibe B.
Leibe B.
Liebelt J.
Liu C.
Lowe D.G.
Luc Van Gool
Mumford D.
Munoz D.
Pantofaru C.
Posner I.
Russell B.
Savarese S.
Saxena A.
Saxena A.
Seemann E.
Segvic S.
Thomas A.
Thomas A.
Thomas A.
Tinne Tuytelaars
Vittorio Ferrari
Yan P.
Publication venue: 'SAGE Publications'
Publication date: 01/01/2009
Field of study

In the transition from industrial to service robotics, robots will have to deal with increasingly unpredictable and variable environments. We present a system that is able to recognize objects of a certain class in an image and to identify their parts for potential interactions. The method can recognize objects from arbitrary viewpoints and generalizes to instances that have never been observed during training, even if they are partially occluded and appear against cluttered backgrounds. Our approach builds on the implicit shape model of Leibe et al. We extend it to couple recognition to the provision of meta-dat

Lirias

CiteSeerX

Crossref

Edinburgh Research Explorer

Publikationsserver der RWTH Aachen University