Automatic Emotion Recognition from Mandarin Speech

Author: Gu Yu
Publication venue: [s.n.]
Publication date: 01/01/2018

Tilburg University Repository

Semantics-aware image understanding

Author: PASINI ANDREA
Publication venue: country:Italy
Publication date: 19/10/2021

The abstract is in the attachment.

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

Cobot Programming for Collaborative Industrial Tasks: An Overview

Author: Banziger, Bauer, Benzeghiba, Bicchi, Busch, Calinon, Cao, Chandrasekaran, Cheng, Cherubini, Commission, de Gea Fernandez, Ding, Duque, Faber, Gaz, Ghalamzan, Giuliani, Gleeson, Gombolay, Green, Gu, Gustavsson, Haddadin, Hangl, Hangl, Heess, Hu, Huang, Johannsmeier, Kim, Kobayashi, Koch, Kouris, Kumicakova, Lafleche, Lasota, Lee, Li, Liu, Luo, Maeda, Matsas, Maurice, Maurtua, Meziane, Mohamed Marei, Mohan, Muller, Munzer, Nikolaidis, Noohi, Pedersen, Pellegrinelli, Peternel, Pohlt, Rajeswaran, Realyvasquez-Vargas, Reyes, Rozo, Rude, Schmidt, Schou, Schou, Schulz, Sheng, Shirine El Zaatari, Srimal, Steinmetz, Sylla, Tang, Wang, Weidong Li, Winkelmann, Wojtara, Wongphati, Yang, Zahid Usman, Zhu, Zidek
Publication venue: 'Elsevier BV'
Publication date: 01/06/2019

Grounded Semantic Reasoning for Robotic Interaction with Real-World Objects

Author: Liu Weiyu
Publication venue: Georgia Institute of Technology
Publication date: 10/01/2023

Robots are increasingly transitioning from specialized, single-task machines to general-purpose systems that operate in unstructured environments, such as homes, offices, and warehouses. In these real-world domains, robots need to manipulate novel objects while adapting to changes in environments and goals. Semantic knowledge, which concisely describes target domains with symbols, can potentially reveal the meaningful patterns shared between problems and environments. However, existing robots are yet to effectively reason about semantic data encoding complex relational knowledge or jointly reason about symbolic semantic data and multimodal data pertinent to robotic manipulation (e.g., object point clouds, 6-DoF poses, and attributes detected with multimodal sensing). This dissertation develops semantic reasoning frameworks capable of modeling complex semantic knowledge grounded in robot perception and action. We show that grounded semantic reasoning enables robots to more effectively perceive, model, and interact with objects in real-world environments. 
Specifically, this dissertation makes the following contributions: (1) a survey providing a unified view for the diversity of works in the field by formulating semantic reasoning as the integration of knowledge sources, computational frameworks, and world representations; (2) a method for predicting missing relations in large-scale knowledge graphs by leveraging type hierarchies of entities, effectively avoiding ambiguity while maintaining generalization of multi-hop reasoning patterns; (3) a method for predicting unknown properties of objects in various environmental contexts, outperforming prior knowledge graph and statistical relational learning methods due to the use of n-ary relations for modeling object properties; (4) a method for purposeful robotic grasping that accounts for a broad range of contexts (including object visual affordance, material, state, and task constraint), outperforming existing approaches in novel contexts and for unknown objects; (5) a systematic investigation into the generalization of task-oriented grasping that includes a benchmark dataset of 250k grasps, and a novel graph neural network that incorporates semantic relations into end-to-end learning of 6-DoF grasps; (6) a method for rearranging novel objects into semantically meaningful spatial structures based on high-level language instructions, more effectively capturing multi-object spatial constraints than existing pairwise spatial representations; (7) a novel planning-inspired approach that iteratively optimizes placements of partially observed objects subject to both physical constraints and semantic constraints inferred from language instructions.
Ph.D.

Expressive movement generation with machine learning

Author: Alemi Omid
Publication date: 25/03/2021

Movement is an essential aspect of our lives. Not only do we move to interact with our physical environment, but we also express ourselves and communicate with others through our movements. In an increasingly computerized world where various technologies and devices surround us, our movements are essential parts of our interaction with and consumption of computational devices and artifacts. In this context, incorporating an understanding of our movements within the design of the technologies surrounding us can significantly improve our daily experiences. This need has given rise to the field of movement computing – developing computational models of movement that can perceive, manipulate, and generate movements. In this thesis, we contribute to the field of movement computing by building machine-learning-based solutions for automatic movement generation. In particular, we focus on using machine learning techniques and motion capture data to create controllable, generative movement models. We also contribute to the field by creating datasets, tools, and libraries that we have developed during our research. We start our research by reviewing the works on building automatic movement generation systems using machine learning techniques and motion capture data. Our review covers background topics such as high-level movement characterization, training data, feature representation, machine learning models, and evaluation methods. Building on our literature review, we present WalkNet, an interactive walking-movement controller for animated agents based on neural networks. The expressivity of virtual, animated agents plays an essential role in their believability. Therefore, WalkNet integrates controlling the expressive qualities of movement with the goal-oriented behaviour of an animated virtual agent. It allows us to control the generation based on the valence and arousal levels of affect, the movement’s walking direction, and the mover’s movement signature in real-time.
Following WalkNet, we look at controlling movement generation using more complex stimuli such as music represented by audio signals (i.e., non-symbolic music). Music-driven dance generation involves a highly non-linear mapping between temporally dense stimuli (i.e., the audio signal) and movements, which makes it a more challenging movement modelling problem. To this end, we present GrooveNet, a real-time machine learning model for music-driven dance generation