Search CORE

72 research outputs found

Determinazione della massa del buco nero supermassiccio al centro della galassia NGC 5377

Author: Garattoni Alessia
Publication venue
Publication date: 08/04/2022
Field of study

La galassia a spirale barrata NGC 5377 ospita un buco nero di massa Mº < 1.2 10 8 Mo. Questo valore è stato determinato dallo studio della distribuzione e della cinematica del gas ionizzato misurate analizzando gli spettri del nucleo della galassia ottenuti dal telescopio spaziale Hubbleope

Padua Thesis and Dissertation Archive

Self-Attention Temporal Convolutional Network for Long-Term Daily Living Activity Detection

Author: Bremond François
Dai Rui
Francesca Gianpiero
Garattoni Lorenzo
Minciullo Luca
Publication venue: HAL CCSD
Publication date: 18/09/2019
Field of study

International audienceIn this paper, we address the detection of daily living activities in long-term untrimmed videos. The detection of daily living activities is challenging due to their long temporal components, low inter-class variation and high intra-class variation. To tackle these challenges, recent approaches based on Temporal Convolutional Networks (TCNs) have been proposed. Such methods can capture long-term temporal patterns using a hierarchy of temporal convolutional filters, pooling and up sampling steps. However, as one of the important features of con-volutional networks, TCNs process a local neighborhood across time which leads to inefficiency in modeling the long-range dependencies between these temporal patterns of the video. In this paper, we propose Self-Attention-Temporal Convolutional Network (SA-TCN), which is able to capture both complex activity patterns and their dependencies within long-term untrimmed videos. We evaluate our proposed model on DAily Home LIfe Activity Dataset (DAHLIA) and Breakfast datasets. Our proposed method achieves state-of-the-art performance on both DAHLIA and Breakfast dataset

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

LAC: Latent Action Composition for Skeleton-based Action Segmentation

Author: Bremond Francois
Dantcheva Antitza
Francesca Gianpiero
Garattoni Lorenzo
Kong Quan
Wang Yaohui
Yang Di
Publication venue
Publication date: 31/08/2023
Field of study

Skeleton-based action segmentation requires recognizing composable actions in untrimmed videos. Current approaches decouple this problem by first extracting local visual features from skeleton sequences and then processing them by a temporal model to classify frame-wise actions. However, their performances remain limited as the visual features cannot sufficiently express composable actions. In this context, we propose Latent Action Composition (LAC), a novel self-supervised framework aiming at learning from synthesized composable motions for skeleton-based action segmentation. LAC is composed of a novel generation module towards synthesizing new sequences. Specifically, we design a linear latent space in the generator to represent primitive motion. New composed motions can be synthesized by simply performing arithmetic operations on latent representations of multiple input skeleton sequences. LAC leverages such synthesized sequences, which have large diversity and complexity, for learning visual representations of skeletons in both sequence and frame spaces via contrastive learning. The resulting visual encoder has a high expressive power and can be effectively transferred onto action segmentation tasks by end-to-end fine-tuning without the need for additional temporal models. We conduct a study focusing on transfer-learning and we show that representations learned from pre-trained LAC outperform the state-of-the-art by a large margin on TSU, Charades, PKU-MMD datasets.Comment: ICCV 202

arXiv.org e-Print Archive

Self-Attention Temporal Convolutional Network for Long-Term Daily Living Activity Detection

Author: Bremond François
Dai Rui
Francesca Gianpiero
Garattoni Lorenzo
Minciullo Luca
Publication venue: HAL CCSD
Publication date: 18/09/2019
Field of study

INRIA a CCSD electronic archive server

PDAN: Pyramid Dilated Attention Network for Action Detection

Author: Bremond Francois,
Dai Rui
Das Srijan
Francesca Gianpiero
Garattoni Lorenzo
Minciullo Luca
Publication venue: HAL CCSD
Publication date: 05/01/2021
Field of study

International audienceHandling long and complex temporal information is an important challenge for action detection tasks. This challenge is further aggravated by densely distributed actions in untrimmed videos. Previous action detection methods fail in selecting the key temporal information in long videos. To this end, we introduce the Dilated Attention Layer (DAL). Compared to previous temporal convolution layer, DAL allocates attentional weights to local frames in the kernel, which enables it to learn better local representation across time. Furthermore, we introduce Pyramid Dilated Attention Network (PDAN) which is built upon DAL. With the help of multiple DALs with different dilation rates, PDAN can model short-term and long-term temporal relations simultaneously by focusing on local segments at the level of low and high temporal receptive fields. This property enables PDAN to handle complex temporal relations between different action instances in long untrimmed videos. To corroborate the effectiveness and robustness of our method, we evaluate it on three densely annotated, multi-label datasets: MultiTHUMOS, Charades and Toyota Smarthome Untrimmed (TSU) dataset. PDAN is able to outperform previous state-of-the-art methods on all these datasets

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

Human-Scene Network: A Novel Baseline with Self-rectifying Loss for Weakly supervised Video Anomaly Detection

Author: Bremond Francois,
Dai Rui
Francesca Gianpiero
Garattoni Lorenzo
Kong Quan
Majhi Snehashis
Publication venue: HAL CCSD
Publication date: 19/01/2023
Field of study

Video anomaly detection in surveillance systems with only video-level labels (i.e. weakly-supervised) is challenging. This is due to, (i) complex integration of human and scene based anomalies comprising of subtle and sharp spatio-temporal cues in real-world scenarios, (ii) non-optimal optimization between normal and anomaly instances under weak-supervision. In this paper, we propose a Human-Scene Network to learn discriminative representations by capturing both subtle and strong cues in a dissociative manner. In addition, a self-rectifying loss is also proposed that dynamically computes the pseudo temporal-annotations from video-level labels for optimizing the Human-Scene Network effectively. The proposed Human-Scene Network optimized with self-rectifying loss is validated on three publicly available datasets i.e. UCF-Crime, ShanghaiTech and IITB-Corridor, outperforming recently reported state-of-the-art approaches on five out of the six scenarios considered

INRIA a CCSD electronic archive server

HouseCat6D -- A Large-Scale Multi-Modal Category Level 6D Object Pose Dataset with Household Objects in Realistic Scenarios

Author: Busam Benjamin
Garattoni Lorenzo
Jung HyunJun
Meier Sven
Navab Nassir
Rizzoli Giulia
Roth Daniel
Ruhkamp Patrick
Schieber Hannah
Wang Pengyuan
Wu Shun-Cheng
Zhai Guangyao
Zhao Hongcheng
Publication venue
Publication date: 26/04/2023
Field of study

Estimating the 6D pose of objects is a major 3D computer vision problem. Since the promising outcomes from instance-level approaches, research heads also move towards category-level pose estimation for more practical application scenarios. However, unlike well-established instance-level pose datasets, available category-level datasets lack annotation quality and provided pose quantity. We propose the new category-level 6D pose dataset HouseCat6D featuring 1) Multi-modality of Polarimetric RGB and Depth (RGBD+P), 2) Highly diverse 194 objects of 10 household object categories including 2 photometrically challenging categories, 3) High-quality pose annotation with an error range of only 1.35 mm to 1.74 mm, 4) 41 large-scale scenes with extensive viewpoint coverage and occlusions, 5) Checkerboard-free environment throughout the entire scene, and 6) Additionally annotated dense 6D parallel-jaw grasps. Furthermore, we also provide benchmark results of state-of-the-art category-level pose estimation networks

arXiv.org e-Print Archive

Toyota Smarthome: Real-World Activities of Daily Living

Author: Bremond François
Dai Rui
Das Srijan
Francesca Gianpiero
Garattoni Lorenzo
Koperski Michal
Minciullo Luca
Publication venue: HAL CCSD
Publication date: 27/10/2019
Field of study

International audienceThe performance of deep neural networks is strongly influenced by the quantity and quality of annotated data. Most of the large activity recognition datasets consist of data sourced from the web, which does not reflect challenges that exist in activities of daily living. In this paper, we introduce a large real-world video dataset for activities of daily living: Toyota Smarthome. The dataset consists of 16K RGB+D clips of 31 activity classes, performed by seniors in a smarthome. Unlike previous datasets, videos were fully unscripted. As a result, the dataset poses several challenges: high intra-class variation, high class imbalance, simple and composite activities, and activities with similar motion and variable duration. Activities were annotated with both coarse and fine-grained labels. These characteristics differentiate Toyota Smarthome from other datasets for activity recognition. As recent activity recognition approaches fail to address the challenges posed by Toyota Smarthome, we present a novel activity recognition method with attention mechanism. We propose a pose driven spatio-temporal attention mechanism through 3D ConvNets. We show that our novel method outperforms state-of-the-art methods on benchmark datasets, as well as on the Toyota Smarthome dataset. We release the dataset for research use

Crossref

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

ARDebug: An Augmented Reality Tool for Analysing and Debugging Swarm Robotic Systems

Author: Antoun
Azuma
Brambilla
Farrow
Garattoni
Garrido-Jurado
Ghiringhelli
Hilder
Hoenig
Krasner
Liu
McLurkin
Millard
Millard
Mondada
Quigley
Reina
Reina
Rubenstein
Rule
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2018
Field of study

Despite growing interest in collective robotics over the past few years, analysing and debugging the behaviour of swarm robotic systems remains a challenge due to the lack of appropriate tools. We present a solution to this problem—ARDebug: an open-source, cross-platform, and modular tool that allows the user to visualise the internal state of a robot swarm using graphical augmented reality techniques. In this paper we describe the key features of the software, the hardware required to support it, its implementation, and usage examples. ARDebug is specifically designed with adoption by other institutions in mind, and aims to provide an extensible tool that other researchers can easily integrate with their own experimental infrastructure

University of Lincoln Institutional Repository

Crossref

Directory of Open Access Journals

Frontiers - Publisher Connector

Ulster University's Research Portal

White Rose Research Online

Supplemental Information 1: Physiological data and SAM questionnaire data.

Author: Amraii
Bashyal
Bethel
Boucsein
Cacioppo
Carroll
De la Croix
Dorigo
Dorigo
Fasola
Garattoni
Gutiérrez
Hegel
Kidd
Kolling
Kolling
Lang
Leite
Li
Mehrabian
Mondada
Nagavalli
Nagi
Nagi
Nunnally
Pendleton
Pereira
Podevijn
Podevijn
Powers
R Core Team
Setter
Sweetser
Wainer
Walker
Walker
Wrobel
Publication venue: 'PeerJ'
Publication date
Field of study

Crossref