3,092 research outputs found

    Deep Affordance-grounded Sensorimotor Object Recognition

    Full text link
    It is well-established by cognitive neuroscience that human perception of objects constitutes a complex process, where object appearance information is combined with evidence about the so-called object "affordances", namely the types of actions that humans typically perform when interacting with them. This fact has recently motivated the "sensorimotor" approach to the challenging task of automatic object recognition, where both information sources are fused to improve robustness. In this work, the aforementioned paradigm is adopted, surpassing current limitations of sensorimotor object recognition research. Specifically, the deep learning paradigm is introduced to the problem for the first time, developing a number of novel neuro-biologically and neuro-physiologically inspired architectures that utilize state-of-the-art neural networks for fusing the available information sources in multiple ways. The proposed methods are evaluated using a large RGB-D corpus, which is specifically collected for the task of sensorimotor object recognition and is made publicly available. Experimental results demonstrate the utility of affordance information to object recognition, achieving an up to 29% relative error reduction by its inclusion.Comment: 9 pages, 7 figures, dataset link included, accepted to CVPR 201

    Locally Homogeneous Spaces, Induced Killing Vector Fields and Applications to Bianchi Prototypes

    Full text link
    An answer to the question: Can, in general, the adoption of a given symmetry induce a further symmetry, which might be hidden at a first level? has been attempted in the context of differential geometry of locally homogeneous spaces. Based on E. Cartan's theory of moving frames, a methodology for finding all symmetries for any n dimensional locally homogeneous space is provided. The analysis is applied to 3 dimensional spaces, whereby the embedding of them into a 4 dimensional Lorentzian manifold is examined and special solutions to Einstein's field equations are recovered. The analysis is mainly of local character, since the interest is focused on local structures based on differential equations (and their symmetries), rather than on the implications of, e.g., the analytic continuation of their solution(s) and their dynamics in the large.Comment: 27 pages, no figues, no tables, one reference added, spelling and punctuation issues correcte

    Killing spinors in supergravity with 4-fluxes

    Full text link
    We study the spinorial Killing equation of supergravity involving a torsion 3-form \T as well as a flux 4-form \F. In dimension seven, we construct explicit families of compact solutions out of 3-Sasakian geometries, nearly parallel \G_2-geometries and on the homogeneous Aloff-Wallach space. The constraint \F \cdot \Psi = 0 defines a non empty subfamily of solutions. We investigate the constraint \T \cdot \Psi = 0, too, and show that it singles out a very special choice of numerical parameters in the Killing equation, which can also be justified geometrically

    Visual inspection for illicit items in X-ray images using Deep Learning

    Full text link
    Automated detection of contraband items in X-ray images can significantly increase public safety, by enhancing the productivity and alleviating the mental load of security officers in airports, subways, customs/post offices, etc. The large volume and high throughput of passengers, mailed parcels, etc., during rush hours practically make it a Big Data problem. Modern computer vision algorithms relying on Deep Neural Networks (DNNs) have proven capable of undertaking this task even under resource-constrained and embedded execution scenarios, e.g., as is the case with fast, single-stage object detectors. However, no comparative experimental assessment of the various relevant DNN components/methods has been performed under a common evaluation protocol, which means that reliable cross-method comparisons are missing. This paper presents exactly such a comparative assessment, utilizing a public relevant dataset and a well-defined methodology for selecting the specific DNN components/modules that are being evaluated. The results indicate the superiority of Transformer detectors, the obsolete nature of auxiliary neural modules that have been developed in the past few years for security applications and the efficiency of the CSP-DarkNet backbone CNN.Comment: arXiv admin note: substantial text overlap with arXiv:2305.0193

    Multimodal Explainable Artificial Intelligence: A Comprehensive Review of Methodological Advances and Future Research Directions

    Full text link
    The current study focuses on systematically analyzing the recent advances in the field of Multimodal eXplainable Artificial Intelligence (MXAI). In particular, the relevant primary prediction tasks and publicly available datasets are initially described. Subsequently, a structured presentation of the MXAI methods of the literature is provided, taking into account the following criteria: a) The number of the involved modalities, b) The stage at which explanations are produced, and c) The type of the adopted methodology (i.e. mathematical formalism). Then, the metrics used for MXAI evaluation are discussed. Finally, a comprehensive analysis of current challenges and future research directions is provided.Comment: 26 pages, 11 figure

    Knowledge-based semantic annotation and retrieval of multimedia content

    Get PDF
    aceMedia is a 4 year EC part-funded FP6 Integrated Project, ending in December 2007. The project has developed tools to enable users to manage and share both personal and purchased content across PC, STB and mobile platforms. Knowledge-based analysis and ontologies have been successfully exploited in an end-to-end system to enable automated semantic annotation and retrieval of multimedia content. The paper briefly describes the objectives of aceMedia and the application of knowledge-based analysis in the project
    corecore