
    A Methodology for Extracting Human Bodies from Still Images

    Monitoring and surveillance of humans is one of the most prominent applications today, and it is expected to become part of many aspects of future life, for safety, assisted living and other purposes. Many efforts have been made towards automatic and robust solutions, but the general problem is very challenging and still remains open. In this PhD dissertation we examine the problem from many perspectives. First, we study the performance of a hardware architecture designed for large-scale surveillance systems. Then, we focus on the general problem of human activity recognition, present an extensive survey of methodologies that deal with this subject, and propose a maturity metric to evaluate them. Image segmentation is one of the most popular image-processing algorithms in the field, and we propose a blind metric to evaluate segmentation results with respect to the activity in local regions. Finally, we propose a fully automatic system for segmenting and extracting human bodies from challenging single images, which is the main contribution of the dissertation. Our methodology is a novel bottom-up approach relying mostly on anthropometric constraints and is facilitated by our research in the fields of face, skin and hand detection. Experimental results and comparison with state-of-the-art methodologies demonstrate the success of our approach.

    Appearance-based localization for mobile robots using digital zoom and visual compass

    This paper describes a localization system for mobile robots moving in dynamic indoor environments, which uses probabilistic integration of visual appearance and odometry information. The approach is based on a novel image-matching algorithm for appearance-based place recognition that integrates digital zooming, to extend the area of application, and a visual compass. Ambiguous information used for recognizing places is resolved with multiple hypothesis tracking and a selection procedure inspired by Markov localization. This enables the system to deal with perceptual aliasing or the absence of reliable sensor data. The system has been implemented on a robot operating in an office scenario, and the robustness of the approach has been demonstrated experimentally.
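
    The abstract does not spell out the update equations, but a selection procedure "inspired by Markov localization" suggests a standard Bayes-filter update over a discrete set of place hypotheses. Below is a minimal Python sketch under that assumption; the place set, transition matrix and likelihood values are purely hypothetical.

```python
import numpy as np

def markov_localization_step(belief, transition, likelihood):
    """One predict-update cycle over a discrete set of places.

    belief     -- prior probability over places, shape (n,)
    transition -- motion model P(place_t | place_{t-1}), shape (n, n)
    likelihood -- appearance-matching score per place for the current image, shape (n,)
    """
    predicted = transition.T @ belief   # prediction from the motion/odometry model
    posterior = likelihood * predicted  # weight hypotheses by image-matching evidence
    return posterior / posterior.sum()  # normalize to a probability distribution

# Hypothetical three-place example: corridor, office, kitchen.
belief = np.array([0.6, 0.3, 0.1])
transition = np.array([[0.7, 0.2, 0.1],
                       [0.2, 0.7, 0.1],
                       [0.1, 0.1, 0.8]])
likelihood = np.array([0.1, 0.8, 0.1])  # the image matcher favors "office"
print(markov_localization_step(belief, transition, likelihood))
```

    Keeping a full distribution over hypotheses rather than a single best match is what lets such a system survive perceptual aliasing: an ambiguous observation merely reshapes the belief instead of committing to a wrong place.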

    VIBE: Video Inference for Human Body Pose and Shape Estimation

    Human motion is fundamental to understanding behavior. Despite progress on single-image 3D pose and shape estimation, existing video-based state-of-the-art methods fail to produce accurate and natural motion sequences due to a lack of ground-truth 3D motion data for training. To address this problem, we propose Video Inference for Body Pose and Shape Estimation (VIBE), which makes use of an existing large-scale motion capture dataset (AMASS) together with unpaired, in-the-wild 2D keypoint annotations. Our key novelty is an adversarial learning framework that leverages AMASS to discriminate between real human motions and those produced by our temporal pose and shape regression networks. We define a temporal network architecture and show that adversarial training, at the sequence level, produces kinematically plausible motion sequences without in-the-wild ground-truth 3D labels. We perform extensive experimentation to analyze the importance of motion and demonstrate the effectiveness of VIBE on challenging 3D pose estimation datasets, achieving state-of-the-art performance. Code and pretrained models are available at https://github.com/mkocabas/VIBE (CVPR 2020, camera ready).
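
    The exact architecture is specified in the paper and repository; purely as an illustration of the sequence-level adversarial idea, a minimal PyTorch sketch might look as follows. The GRU-based critic and all dimensions below are assumptions for the sketch, not VIBE's actual design.

```python
import torch
import torch.nn as nn

class MotionDiscriminator(nn.Module):
    """Scores a whole pose sequence as real (mocap) or generated (regressed)."""
    def __init__(self, pose_dim=72, hidden=256):
        super().__init__()
        self.gru = nn.GRU(pose_dim, hidden, batch_first=True)
        self.head = nn.Linear(hidden, 1)

    def forward(self, poses):               # poses: (batch, time, pose_dim)
        _, h = self.gru(poses)
        return self.head(h[-1])             # one realism logit per sequence

disc = MotionDiscriminator()
real = torch.randn(4, 16, 72)               # stand-in for AMASS mocap sequences
fake = torch.randn(4, 16, 72)               # stand-in for regressed pose sequences
bce = nn.BCEWithLogitsLoss()
d_loss = bce(disc(real), torch.ones(4, 1)) + bce(disc(fake), torch.zeros(4, 1))
g_loss = bce(disc(fake), torch.ones(4, 1))  # the regressor is rewarded for fooling the critic
```

    The regressor minimizes g_loss while the discriminator minimizes d_loss, which is how plausible dynamics can be learned from AMASS without 3D labels for the in-the-wild videos.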

    Autonomous Navigation of Mobile Robots in Complex Dynamic Environments

    Most future robots will be mobile, and the main challenge is to develop algorithms for their autonomous navigation as well as for human-robot interaction. The Laboratory for Autonomous Systems and Mobile Robotics (LAMOR) at the Faculty of Electrical Engineering and Computing of the University of Zagreb researches such mobile robotic systems and currently participates in a number of related international and national research projects. This paper addresses the issue of autonomous navigation of mobile robots in complex dynamic environments, providing the state of the art of the domain and LAMOR's major contributions to it. Finally, we present an application example of the autonomous navigation technologies in flexible warehouses, which we have been developing within the Horizon 2020 project SafeLog.

    Tracking more than 100 arbitrary objects at 25 FPS through deep learning

    Most video analytics applications rely on object detectors to localize objects in frames. However, when real-time operation is a requirement, running the detector on every frame is usually not possible. This is somewhat circumvented by instantiating visual object trackers between detector calls, but this does not scale with the number of objects. To tackle this problem, we present SiamMT, a new deep-learning multiple visual object tracking solution that applies single-object tracking principles to multiple arbitrary objects in real time. To achieve this, SiamMT reuses feature computations, implements a novel crop-and-resize operator, and defines a new and efficient pairwise similarity operator. SiamMT naturally scales up to several dozen targets, reaching 25 fps with 122 simultaneous objects in VGA videos, or up to 100 simultaneous objects in HD720 video. SiamMT has been validated on five large real-time benchmarks, achieving leading performance against current state-of-the-art trackers. This research was partially funded by the Spanish Ministerio de Ciencia e Innovación [grant numbers PID2020-112623GB-I00, RTI2018-097088-B-C32] and the Galician Consellería de Cultura, Educación e Universidade [grant numbers ED431C 2018/29, ED431C 2017/69, accreditation 2016–2019, ED431G/08]. These grants are co-funded by the European Regional Development Fund (ERDF). Lorenzo Vaquero is supported by the Spanish Ministerio de Universidades under the FPU national plan (FPU18/03174).
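
    The paper defines its own crop-and-resize and pairwise similarity operators; as a rough illustration of the underlying idea, a Siamese-style similarity between one shared frame feature map and many per-object templates can be computed as a single batched cross-correlation. The sketch below is generic, not SiamMT's actual operator, and all shapes are hypothetical.

```python
import torch
import torch.nn.functional as F

def pairwise_similarity(frame_feat, exemplars):
    """Correlate one shared frame feature map against many object templates.

    frame_feat -- features of the full frame, shape (1, C, H, W), computed once
    exemplars  -- per-object template features, shape (N, C, h, w)

    Returns one response map per object; each map's peak indicates
    that object's most likely location in the frame.
    """
    return F.conv2d(frame_feat, exemplars)  # shape (1, N, H-h+1, W-w+1)

frame = torch.randn(1, 64, 60, 80)          # hypothetical backbone output for one frame
templates = torch.randn(122, 64, 8, 8)      # 122 tracked objects, as in the paper
responses = pairwise_similarity(frame, templates)
print(responses.shape)                      # torch.Size([1, 122, 53, 73])
```

    Computing the frame features once and correlating all templates in a single call is the kind of feature reuse that lets the cost grow slowly with the number of tracked objects.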

    Evaluating indoor positioning systems in a shopping mall: the lessons learned from the IPIN 2018 competition

    The Indoor Positioning and Indoor Navigation (IPIN) conference holds an annual competition in which indoor localization systems from different research groups worldwide are evaluated empirically. The objective of this competition is to establish a systematic evaluation methodology with rigorous metrics, both for real-time (on-site) and post-processing (off-site) situations, in a realistic environment unfamiliar to the prototype developers. For the IPIN 2018 conference, this competition was held on September 22nd, 2018, in Atlantis, a large shopping mall in Nantes (France). Four competition tracks (two on-site and two off-site) were designed. They consisted of several 1 km routes traversing several floors of the mall. Along these paths, 180 points were topographically surveyed with 10 cm accuracy, to serve as ground-truth landmarks, combining theodolite measurements, differential global navigation satellite system (GNSS) and 3D scanner systems. In total, 34 teams competed. The accuracy score corresponds to the third quartile (75th percentile) of an error metric that combines the horizontal positioning error and the floor detection. The best results for the on-site tracks showed an accuracy score of 11.70 m (Track 1) and 5.50 m (Track 2), while the best results for the off-site tracks showed an accuracy score of 0.90 m (Track 3) and 1.30 m (Track 4). These results show that it is possible to obtain highly accurate indoor positioning solutions in large, realistic environments using wearable lightweight sensors, without deploying any beacons. This paper describes the organization of the tracks, analyzes the methodology used to quantify the results, reviews the lessons learned from the competition, and discusses its future.
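
    In code, the accuracy score described above reduces to a percentile of a combined per-landmark error. The sketch below assumes a fixed metre penalty for each floor of floor-detection mismatch; the penalty constant and the sample values are assumptions for illustration, with the exact definition given by the competition rules.

```python
import numpy as np

FLOOR_PENALTY_M = 15.0  # assumed penalty per floor of mismatch; see the official rules

def accuracy_score(horizontal_err_m, true_floor, est_floor):
    """Third quartile (75th percentile) of the combined error.

    Each landmark's error is its horizontal positioning error plus a
    fixed penalty for every floor of floor-detection mismatch.
    """
    combined = np.asarray(horizontal_err_m, dtype=float) + FLOOR_PENALTY_M * np.abs(
        np.asarray(true_floor) - np.asarray(est_floor))
    return float(np.percentile(combined, 75))

# Hypothetical evaluation over four ground-truth landmarks.
print(accuracy_score([1.2, 0.8, 3.5, 2.0], [0, 1, 1, 2], [0, 1, 2, 2]))
```

    Using the third quartile rather than the mean rewards systems that are consistently accurate and prevents a few lucky landmarks from masking large floor-detection failures.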

    Computer vision

    The field of computer vision is surveyed and assessed, key research issues are identified, and possibilities for a future vision system are discussed. The problems of describing two- and three-dimensional worlds are discussed. The representation of features such as texture, edges, curves, and corners is detailed. Recognition methods are described in which cross-correlation coefficients are maximized or numerical values for a set of features are measured. Object tracking is discussed in terms of the robust matching algorithms that must be devised. Stereo vision, camera control and calibration, and hardware and systems architecture are discussed.
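
    As a concrete instance of the correlation-based recognition mentioned above, the normalized cross-correlation coefficient between an image patch and a template can be computed as follows. This is a textbook formulation, not tied to any specific system in the survey.

```python
import numpy as np

def ncc(patch, template):
    """Normalized cross-correlation coefficient between a patch and a template.

    Returns a value in [-1, 1]; recognition slides the template over the
    image and picks the patch position that maximizes this coefficient.
    """
    p = patch - patch.mean()
    t = template - template.mean()
    denom = np.sqrt((p ** 2).sum() * (t ** 2).sum())
    return float((p * t).sum() / denom) if denom > 0 else 0.0

# Toy example: a template matches itself perfectly.
template = np.array([[1.0, 2.0], [3.0, 4.0]])
print(ncc(template, template))  # 1.0
```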