8 research outputs found
HyperLearn: A Distributed Approach for Representation Learning in Datasets With Many Modalities
Multimodal datasets contain an enormous amount of relational information,
which grows exponentially with the introduction of new modalities. Learning
representations in such a scenario is inherently complex due to the presence of
multiple heterogeneous information channels. These channels can encode both (a)
inter-relations between the items of different modalities and (b)
intra-relations between the items of the same modality. Encoding multimedia
items into a continuous low-dimensional semantic space such that both types of
relations are captured and preserved is extremely challenging, especially if
the goal is a unified end-to-end learning framework. The two key challenges
that need to be addressed are: 1) the framework must be able to merge complex
intra and inter relations without losing any valuable information and 2) the
learning model should be invariant to the addition of new and potentially very
different modalities. In this paper, we propose a flexible framework which can
scale to data streams from many modalities. To that end we introduce a
hypergraph-based model for data representation and deploy Graph Convolutional
Networks to fuse relational information within and across modalities. Our
approach provides an efficient solution for distributing otherwise extremely
computationally expensive or even unfeasible training processes across
multiple-GPUs, without any sacrifices in accuracy. Moreover, adding new
modalities to our model requires only an additional GPU unit keeping the
computational time unchanged, which brings representation learning to truly
multimodal datasets. We demonstrate the feasibility of our approach in the
experiments on multimedia datasets featuring second, third and fourth order
relations
Visual Analytics for Temporal Hypergraph Model Exploration
Many processes, from gene interaction in biology to computer networks to
social media, can be modeled more precisely as temporal hypergraphs than by
regular graphs. This is because hypergraphs generalize graphs by extending
edges to connect any number of vertices, allowing complex relationships to be
described more accurately and predict their behavior over time. However, the
interactive exploration and seamless refinement of such hypergraph-based
prediction models still pose a major challenge. We contribute Hyper-Matrix, a
novel visual analytics technique that addresses this challenge through a tight
coupling between machine-learning and interactive visualizations. In
particular, the technique incorporates a geometric deep learning model as a
blueprint for problem-specific models while integrating visualizations for
graph-based and category-based data with a novel combination of interactions
for an effective user-driven exploration of hypergraph models. To eliminate
demanding context switches and ensure scalability, our matrix-based
visualization provides drill-down capabilities across multiple levels of
semantic zoom, from an overview of model predictions down to the content. We
facilitate a focused analysis of relevant connections and groups based on
interactive user-steering for filtering and search tasks, a dynamically
modifiable partition hierarchy, various matrix reordering techniques, and
interactive model feedback. We evaluate our technique in a case study and
through formative evaluation with law enforcement experts using real-world
internet forum communication data. The results show that our approach surpasses
existing solutions in terms of scalability and applicability, enables the
incorporation of domain knowledge, and allows for fast search-space traversal.
With the technique, we pave the way for the visual analytics of temporal
hypergraphs in a wide variety of domains.Comment: 11 pages, 6 figures, IEEE VIS VAST 2020 - Proceedings of IEEE
Conference on Visual Analytics Science and Technology (VAST), 202
Visual Analytics Framework for the Assessment of Temporal Hypergraph Prediction Models
Members of communities often share topics of interest. However, usually not all members are interested in all topics, and participation in topics changes over time. Prediction models based on temporal hypergraphs that—in contrast to state-of-the-art models—exploit group structures in the communication network can be used to anticipate changes of interests. In practice, there is a need to assess these models in detail. While loss functions used in the training process can provide initial cues on the model’s global quality, local quality can be investigated with visual analytics. In this paper, we present a visual analytics framework for the assessment of temporal hypergraph prediction models. We introduce its core components: a sliding window approach to prediction and an interactive visualization for partially fuzzy temporal hypergraphs.publishe
Visual Analytics for Temporal Hypergraph Model Exploration
Many processes, from gene interaction in biology to computer networks to social media, can be modeled more precisely as temporal hypergraphs than by regular graphs. This is because hypergraphs generalize graphs by extending edges to connect any number of vertices, allowing complex relationships to be described more accurately and predict their behavior over time. However, the interactive exploration and seamless refinement of such hypergraph-based prediction models still pose a major challenge. We contribute Hyper-Matrix, a novel visual analytics technique that addresses this challenge through a tight coupling between machine-learning and interactive visualizations. In particular, the technique incorporates a geometric deep learning model as a blueprint for problem-specific models while integrating visualizations for graph-based and category-based data with a novel combination of interactions for an effective user-driven exploration of hypergraph models. To eliminate demanding context switches and ensure scalability, our matrix-based visualization provides drill-down capabilities across multiple levels of semantic zoom, from an overview of model predictions down to the content. We facilitate a focused analysis of relevant connections and groups based on interactive user-steering for filtering and search tasks, a dynamically modifiable partition hierarchy, various matrix reordering techniques, and interactive model feedback. We evaluate our technique in a case study and through formative evaluation with law enforcement experts using real-world internet forum communication data. The results show that our approach surpasses existing solutions in terms of scalability and applicability, enables the incorporation of domain knowledge, and allows for fast search-space traversal. With the proposed technique, we pave the way for the visual analytics of temporal hypergraphs in a wide variety of domains.publishe
Interactive exploration of journalistic video footage through multimodal semantic matching
This demo presents a system for journalists to explore video footage for broadcasts. Daily news broadcasts contain multiple news items that consist of many video shots and searching for relevant footage is a labor intensive task. Without the need for annotated video shots, our system extracts semantics from footage and automatically matches these semantics to query terms from the journalist. The journalist can then indicate which aspects of the query term need to be emphasized, e.g. the title or its thematic meaning. The goal of this system is to support the journalists in their search process by encouraging interaction and exploration with the system.Web Information System