3,092 research outputs found
Deep Affordance-grounded Sensorimotor Object Recognition
It is well-established by cognitive neuroscience that human perception of
objects constitutes a complex process, where object appearance information is
combined with evidence about the so-called object "affordances", namely the
types of actions that humans typically perform when interacting with them. This
fact has recently motivated the "sensorimotor" approach to the challenging task
of automatic object recognition, where both information sources are fused to
improve robustness. In this work, the aforementioned paradigm is adopted,
surpassing current limitations of sensorimotor object recognition research.
Specifically, the deep learning paradigm is introduced to the problem for the
first time, developing a number of novel neuro-biologically and
neuro-physiologically inspired architectures that utilize state-of-the-art
neural networks for fusing the available information sources in multiple ways.
The proposed methods are evaluated using a large RGB-D corpus, which is
specifically collected for the task of sensorimotor object recognition and is
made publicly available. Experimental results demonstrate the utility of
affordance information to object recognition, achieving an up to 29% relative
error reduction by its inclusion.Comment: 9 pages, 7 figures, dataset link included, accepted to CVPR 201
Locally Homogeneous Spaces, Induced Killing Vector Fields and Applications to Bianchi Prototypes
An answer to the question: Can, in general, the adoption of a given symmetry
induce a further symmetry, which might be hidden at a first level? has been
attempted in the context of differential geometry of locally homogeneous
spaces. Based on E. Cartan's theory of moving frames, a methodology for finding
all symmetries for any n dimensional locally homogeneous space is provided. The
analysis is applied to 3 dimensional spaces, whereby the embedding of them into
a 4 dimensional Lorentzian manifold is examined and special solutions to
Einstein's field equations are recovered. The analysis is mainly of local
character, since the interest is focused on local structures based on
differential equations (and their symmetries), rather than on the implications
of, e.g., the analytic continuation of their solution(s) and their dynamics in
the large.Comment: 27 pages, no figues, no tables, one reference added, spelling and
punctuation issues correcte
Killing spinors in supergravity with 4-fluxes
We study the spinorial Killing equation of supergravity involving a torsion
3-form \T as well as a flux 4-form \F. In dimension seven, we construct
explicit families of compact solutions out of 3-Sasakian geometries, nearly
parallel \G_2-geometries and on the homogeneous Aloff-Wallach space. The
constraint \F \cdot \Psi = 0 defines a non empty subfamily of solutions. We
investigate the constraint \T \cdot \Psi = 0, too, and show that it singles
out a very special choice of numerical parameters in the Killing equation,
which can also be justified geometrically
Visual inspection for illicit items in X-ray images using Deep Learning
Automated detection of contraband items in X-ray images can significantly
increase public safety, by enhancing the productivity and alleviating the
mental load of security officers in airports, subways, customs/post offices,
etc. The large volume and high throughput of passengers, mailed parcels, etc.,
during rush hours practically make it a Big Data problem. Modern computer
vision algorithms relying on Deep Neural Networks (DNNs) have proven capable of
undertaking this task even under resource-constrained and embedded execution
scenarios, e.g., as is the case with fast, single-stage object detectors.
However, no comparative experimental assessment of the various relevant DNN
components/methods has been performed under a common evaluation protocol, which
means that reliable cross-method comparisons are missing. This paper presents
exactly such a comparative assessment, utilizing a public relevant dataset and
a well-defined methodology for selecting the specific DNN components/modules
that are being evaluated. The results indicate the superiority of Transformer
detectors, the obsolete nature of auxiliary neural modules that have been
developed in the past few years for security applications and the efficiency of
the CSP-DarkNet backbone CNN.Comment: arXiv admin note: substantial text overlap with arXiv:2305.0193
Multimodal Explainable Artificial Intelligence: A Comprehensive Review of Methodological Advances and Future Research Directions
The current study focuses on systematically analyzing the recent advances in
the field of Multimodal eXplainable Artificial Intelligence (MXAI). In
particular, the relevant primary prediction tasks and publicly available
datasets are initially described. Subsequently, a structured presentation of
the MXAI methods of the literature is provided, taking into account the
following criteria: a) The number of the involved modalities, b) The stage at
which explanations are produced, and c) The type of the adopted methodology
(i.e. mathematical formalism). Then, the metrics used for MXAI evaluation are
discussed. Finally, a comprehensive analysis of current challenges and future
research directions is provided.Comment: 26 pages, 11 figure
Knowledge-based semantic annotation and retrieval of multimedia content
aceMedia is a 4 year EC part-funded FP6 Integrated Project, ending in December 2007. The project has developed tools to enable users to manage and share both personal and purchased content across PC, STB and mobile platforms. Knowledge-based analysis and ontologies have been successfully exploited in an end-to-end system to enable automated semantic annotation and retrieval of multimedia content. The paper briefly describes the objectives of aceMedia and the application of knowledge-based analysis in the project
- …