3,461 research outputs found

    Key-Pose Prediction in Cyclic Human Motion

    In this paper we study the problem of estimating inner-cyclic time intervals within repetitive motion sequences of top-class swimmers in a swimming channel. Interval limits are given by temporal occurrences of key-poses, i.e. distinctive postures of the body. A key-pose is defined by means of only one or two specific features of the complete posture. It is often difficult to detect such subtle features directly. We therefore propose the following method: given that we observe the swimmer from the side, we build a pictorial structure of poselets to robustly identify random support poses within the regular motion of a swimmer. We formulate a maximum likelihood model which predicts a key-pose given the occurrences of multiple support poses within one stroke. The maximum likelihood model can be extended with prior knowledge about the temporal location of a key-pose in order to improve the prediction recall. We experimentally show that our models reliably and robustly detect key-poses with high precision and that their performance can be improved by extending the framework with additional camera views. Comment: Accepted at WACV 2015, 8 pages, 3 figures.
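
    The maximum-likelihood formulation itself is not reproduced in the abstract. As a rough illustration only: if one assumes each detected support pose carries a (learned) Gaussian temporal offset to the key-pose, the ML prediction of the key-pose time reduces to an inverse-variance weighted mean, and adding a Gaussian temporal prior turns it into a MAP estimate. The sketch below encodes that simplified reading; the Gaussian assumption, the function name, and the numbers are illustrative, not the paper's actual model.

```python
import numpy as np

def predict_key_pose_time(support_times, offset_means, offset_stds,
                          prior_mean=None, prior_std=None):
    """Estimate a key-pose time from detected support-pose times.

    Simplifying assumption (not from the paper): each support pose i implies
    key_time ~ N(support_times[i] + offset_means[i], offset_stds[i]**2).
    The ML estimate is then an inverse-variance weighted mean; an optional
    Gaussian prior on the key-pose location turns it into a MAP estimate.
    """
    mu = np.asarray(support_times, float) + np.asarray(offset_means, float)
    w = 1.0 / np.asarray(offset_stds, float) ** 2
    if prior_mean is not None and prior_std is not None:
        mu = np.append(mu, prior_mean)        # the prior acts like one more observation
        w = np.append(w, 1.0 / prior_std ** 2)
    return float(np.sum(w * mu) / np.sum(w))

# Hypothetical stroke: support poses detected at frames 12, 18 and 25, with
# assumed mean offsets to the key-pose of +10, +4 and -3 frames.
t_ml = predict_key_pose_time([12, 18, 25], [10, 4, -3], [2.0, 1.5, 2.5])
t_map = predict_key_pose_time([12, 18, 25], [10, 4, -3], [2.0, 1.5, 2.5],
                              prior_mean=21.0, prior_std=3.0)
```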

    Essential updates 2020/2021: Current topics of simulation and navigation in hepatectomy

    With the development of three-dimensional (3D) simulation software, preoperative simulation technology is almost completely established. The remaining issue is how to recognize anatomy three-dimensionally. Extended reality is a newly developed technology with several merits for surgical application: no requirement for a sterilized display monitor, better spatial awareness, and the ability to share 3D images among all surgeons. Various technologies and devices for intraoperative navigation have also been developed to support the safety and certainty of liver surgery. Consensus recommendations regarding indocyanine green fluorescence were determined in 2021. Extended reality has also been applied to intraoperative navigation, and artificial intelligence (AI) is one of the topics of real-time navigation. AI might overcome the problem of liver deformity with automatic registration. Covering the issues described above, this article focuses on recent advances in simulation and navigation in liver surgery from 2020 to 2021.
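
    As background to the "automatic registration" mentioned above: aligning a preoperative 3D model with intraoperative data classically starts from a rigid landmark alignment (Kabsch/Procrustes), on top of which deformable, possibly AI-driven, registration is added. The sketch below shows only that rigid building block under illustrative names; it is not the AI-based deformable registration discussed in the article.

```python
import numpy as np

def rigid_register(source, target):
    """Kabsch/Procrustes: least-squares rotation R and translation t that map
    corresponding 3D landmarks source -> target (both N x 3 arrays), i.e.
    minimize || source @ R.T + t - target ||^2."""
    src_c, tgt_c = source.mean(axis=0), target.mean(axis=0)
    H = (source - src_c).T @ (target - tgt_c)      # 3x3 cross-covariance
    U, _, Vt = np.linalg.svd(H)
    d = np.sign(np.linalg.det(Vt.T @ U.T))         # guard against reflections
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = tgt_c - R @ src_c
    return R, t

# Usage (hypothetical landmark sets): corresponding points picked on the
# preoperative liver model and localized intraoperatively.
# R, t = rigid_register(preop_landmarks, intraop_landmarks)
```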

    Overview of contextual tracking approaches in information fusion

    Proceedings of: Geospatial InfoFusion III, 2-3 May 2013, Baltimore, Maryland, United States. Many information fusion solutions work well in the intended scenarios, but the applications, supporting data, and capabilities change over varying contexts. One example is weather data for electro-optical target trackers, for which standards have evolved over decades. Operating conditions such as technology changes, sensor/target variations, and the contextual environment can inhibit performance if they are not included in the initial system design. In this paper, we seek to define and categorize different types of contextual information. We describe five contextual information categories that support target tracking: (1) domain knowledge from a user to aid the information fusion process through selection, cueing, and analysis; (2) environment-to-hardware processing for sensor management; (3) known distribution of entities for situation/threat assessment; (4) historical traffic behavior for situation-awareness patterns of life (POL); and (5) road information for target tracking and identification. Appropriate characterization and representation of contextual information is needed for future high-level information fusion system design to take advantage of the large data content available for a priori knowledge-based target tracking algorithm construction, implementation, and application.
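
    As a small organizational sketch (class and field names are illustrative, not from the paper), the five categories could be modeled as an enumerated context store that a tracking pipeline queries for a priori knowledge:

```python
from dataclasses import dataclass, field
from enum import Enum, auto
from typing import Any, Dict, List

class ContextCategory(Enum):
    """The five contextual-information categories listed in the abstract."""
    DOMAIN_KNOWLEDGE = auto()         # (1) user knowledge: selection, cueing, analysis
    ENVIRONMENT_TO_HARDWARE = auto()  # (2) sensor management (e.g. weather for EO trackers)
    ENTITY_DISTRIBUTION = auto()      # (3) known distribution of entities (situation/threat)
    HISTORICAL_TRAFFIC = auto()       # (4) patterns of life from historical traffic
    ROAD_INFORMATION = auto()         # (5) road networks for tracking and identification

@dataclass
class ContextStore:
    """Minimal queryable container of a priori context for a tracker."""
    items: Dict[ContextCategory, List[Any]] = field(default_factory=dict)

    def add(self, category: ContextCategory, item: Any) -> None:
        self.items.setdefault(category, []).append(item)

    def get(self, category: ContextCategory) -> List[Any]:
        return self.items.get(category, [])

# Example: register a road segment so a road-constrained tracker can use it.
ctx = ContextStore()
ctx.add(ContextCategory.ROAD_INFORMATION, {"road_id": "R12", "speed_limit_mps": 25.0})
```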

    Ontological representation of context knowledge for visual data fusion

    8 pages, 4 figures. Contributed to: 12th International Conference on Information Fusion (FUSION '09), Seattle, Washington, US, Jul 6-9, 2009. Context knowledge is essential to achieve successful information fusion, especially at high JDL levels. Context can be used to interpret the perceived situation, which is required for accurate assessment. Both types of knowledge, contextual and perceptual, can be represented with formal languages such as ontologies, which support the creation of readable representations and reasoning with them. In this paper, we present an ontology-based model compliant with JDL to represent knowledge in cognitive visual data fusion systems. We illustrate the use of the model with a surveillance example and show that such a model promotes system extensibility and facilitates the incorporation of humans in the fusion loop. This work was supported in part by Projects CICYT TIN2008-06742-C02-02/TSI, CICYT TEC2008-06732-C02-02/TEC, SINPROB, CAM MADRINET S-0505/TIC/0255 and DPS2008-07029-C02-02.
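
    The paper's JDL-compliant ontology is not reproduced here, but the general idea of representing contextual and perceptual knowledge in one formal model and reasoning over both can be sketched with RDF triples. The sketch below assumes the rdflib Python package and uses purely illustrative class and property names:

```python
from rdflib import Graph, Namespace, RDF

EX = Namespace("http://example.org/fusion#")
g = Graph()
g.bind("ex", EX)

# Contextual knowledge: scene layout of the surveillance site.
g.add((EX.Zone1, RDF.type, EX.RestrictedZone))

# Perceptual knowledge: a tracked object reported by the visual front end.
g.add((EX.Track42, RDF.type, EX.Person))
g.add((EX.Track42, EX.locatedIn, EX.Zone1))

# Situation assessment: combine both kinds of knowledge in one query.
query = """
SELECT ?track WHERE {
    ?track a ex:Person .
    ?track ex:locatedIn ?zone .
    ?zone a ex:RestrictedZone .
}
"""
for row in g.query(query, initNs={"ex": EX}):
    print(f"Alert: {row.track} is inside a restricted zone")
```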

    TAPIR: tracking any point with per-frame initialization and temporal refinement

    We present a novel model for Tracking Any Point (TAP) that effectively tracks any queried point on any physical surface throughout a video sequence. Our approach employs two stages: (1) a matching stage, which independently locates a suitable candidate point match for the query point on every other frame, and (2) a refinement stage, which updates both the trajectory and query features based on local correlations. The resulting model surpasses all baseline methods by a significant margin on the TAP-Vid benchmark, as demonstrated by an approximate 20% absolute average Jaccard (AJ) improvement on DAVIS. Our model facilitates fast inference on long and high-resolution video sequences. On a modern GPU, our implementation has the capacity to track points faster than real-time. Given the high-quality trajectories extracted from a large dataset, we demonstrate a proof-of-concept diffusion model which generates trajectories from static images, enabling plausible animations. Visualizations, source code, and pretrained models can be found at https://deepmind-tapir.github.io
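
    The two-stage structure (independent per-frame matching, then refinement from local correlations) can be illustrated with a toy NumPy sketch. This is only a schematic of the idea, not the published model: TAPIR uses learned feature networks, also refines the query features, and estimates occlusion and uncertainty, none of which appear below.

```python
import numpy as np

def match_stage(query_feat, frame_feats):
    """Stage 1 (toy): independently pick, per frame, the location whose
    feature best matches the query feature (argmax of a dot-product cost)."""
    T, H, W, C = frame_feats.shape
    cost = frame_feats.reshape(T, H * W, C) @ query_feat        # (T, H*W)
    idx = cost.argmax(axis=1)
    return np.stack([idx // W, idx % W], axis=1).astype(float)  # (T, 2) row/col

def refine_stage(traj, query_feat, frame_feats, radius=3, iters=4):
    """Stage 2 (toy): repeatedly update each point with a soft-argmax over
    correlations inside a small window around the current estimate."""
    T, H, W, _ = frame_feats.shape
    offs = np.stack(np.meshgrid(np.arange(-radius, radius + 1),
                                np.arange(-radius, radius + 1),
                                indexing="ij"), axis=-1).reshape(-1, 2)
    traj = traj.copy()
    for _ in range(iters):
        for t in range(T):
            win = np.clip(np.round(traj[t]) + offs, [0, 0], [H - 1, W - 1]).astype(int)
            corr = frame_feats[t, win[:, 0], win[:, 1]] @ query_feat
            w = np.exp(corr - corr.max())
            traj[t] = (w[:, None] * win).sum(axis=0) / w.sum()  # local soft-argmax
    return traj

# Usage with random stand-in features (a real system would use learned features):
rng = np.random.default_rng(0)
feats = rng.standard_normal((8, 32, 32, 16)).astype(np.float32)
query = feats[0, 10, 12]                      # feature of the query point in frame 0
trajectory = refine_stage(match_stage(query, feats), query, feats)
```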

