3 research outputs found
Visual Question Answering: A SURVEY
Visual Question Answering (VQA) has been an emerging field in computer vision and natural language processing that aims to enable machines to understand the content of images and answer natural language questions about them. Recently, there has been increasing interest in integrating Semantic Web technologies into VQA systems to enhance their performance and scalability. In this context, knowledge graphs, which represent structured knowledge in the form of entities and their relationships, have shown great potential in providing rich semantic information for VQA. This paper provides an abstract overview of the state-of-the-art research on VQA using Semantic Web technologies, including knowledge graph based VQA, medical VQA with semantic segmentation, and multi-modal fusion with recurrent neural networks. The paper also highlights the challenges and future directions in this area, such as improving the accuracy of knowledge graph based VQA, addressing the semantic gap between image content and natural language, and designing more effective multimodal fusion strategies. Overall, this paper emphasizes the importance and potential of using Semantic Web technologies in VQA and encourages further research in this exciting area
An Evaluation of Knowledge Graph Embeddings for Autonomous Driving Data: Experience and Practice
The autonomous driving (AD) industry is exploring the use of knowledge graphs
(KGs) to manage the vast amount of heterogeneous data generated from vehicular
sensors. The various types of equipped sensors include video, LIDAR and RADAR.
Scene understanding is an important topic in AD which requires consideration of
various aspects of a scene, such as detected objects, events, time and
location. Recent work on knowledge graph embeddings (KGEs) - an approach that
facilitates neuro-symbolic fusion - has shown to improve the predictive
performance of machine learning models. With the expectation that
neuro-symbolic fusion through KGEs will improve scene understanding, this
research explores the generation and evaluation of KGEs for autonomous driving
data. We also present an investigation of the relationship between the level of
informational detail in a KG and the quality of its derivative embeddings. By
systematically evaluating KGEs along four dimensions -- i.e. quality metrics,
KG informational detail, algorithms, and datasets -- we show that (1) higher
levels of informational detail in KGs lead to higher quality embeddings, (2)
type and relation semantics are better captured by the semantic transitional
distance-based TransE algorithm, and (3) some metrics, such as coherence
measure, may not be suitable for intrinsically evaluating KGEs in this domain.
Additionally, we also present an (early) investigation of the usefulness of
KGEs for two use-cases in the AD domain.Comment: 11 pages, To appear in AAAI 2020 Spring Symposium on Combining
Machine Learning and Knowledge Engineering in Practice (AAAI-MAKE 2020