Search CORE

86,463 research outputs found

Interactive Visual Data Exploration with Subjective Feedback

Author: J Lijffijt
J Venna
JB Kruskal
JH Friedman
K Pearson
L Maaten van der
M Leeuwen
PJ Huber
ST Roweis
T Bie
T Ruotsalo
V Dzyuba
WS Torgerson
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Data visualization and iterative/interactive data mining are growing rapidly in attention, both in research as well as in industry. However, integrated methods and tools that combine advanced visualization and data mining techniques are rare, and those that exist are often specialized to a single problem or domain. In this paper, we introduce a novel generic method for interactive visual exploration of high-dimensional data. In contrast to most visualization tools, it is not based on the traditional dogma of manually zooming and rotating data. Instead, the tool initially presents the user with an ‘interesting’ projection of the data and then employs data randomization with constraints to allow users to flexibly and intuitively express their interests or beliefs using visual interactions that correspond to exactly defined constraints. These constraints expressed by the user are then taken into account by a projection-finding algorithm to compute a new ‘interesting’ projection, a process that can be iterated until the user runs out of time or finds that constraints explain everything she needs to find from the data. We present the tool by means of two case studies, one controlled study on synthetic data and another on real census data. The data and software related to this paper are available at http://www.interesting-patterns.net/forsied/interactive-visual-data-exploration-with-subjective-feedback/

Crossref

Ghent University Academic Bibliography

SIDE : a web app for interactive visual data exploration with subjective feedback

Author: De Bie Tijl
Kang Bo
Lijffijt Jefrey
Puolamäki Kai
Publication venue
Publication date: 01/01/2016
Field of study

Ghent University Academic Bibliography

Interactive visual data exploration with subjective feedback : an information-theoretic approach

Author: De Bie Tijl
Kang Bo
Lijffijt Jefrey
Oikarinen Emilia
Puolamäki Kai
Publication venue
Publication date: 01/01/2017
Field of study

Visual exploration of high-dimensional real-valued datasets is a fundamental task in exploratory data analysis (EDA). Existing methods use predefined criteria to choose the representation of data. There is a lack of methods that (i) elicit from the user what she has learned from the data and (ii) show patterns that she does not know yet. We construct a theoretical model where identified patterns can be input as knowledge to the system. The knowledge syntax here is intuitive, such as "this set of points forms a cluster", and requires no knowledge of maths. This background knowledge is used to find a Maximum Entropy distribution of the data, after which the system provides the user data projections in which the data and the Maximum Entropy distribution differ the most, hence showing the user aspects of the data that are maximally informative given the user's current knowledge. We provide an open source EDA system with tailored interactive visualizations to demonstrate these concepts. We study the performance of the system and present use cases on both synthetic and real data. We find that the model and the prototype system allow the user to learn information efficiently from various data sources and the system works sufficiently fast in practice. We conclude that the information theoretic approach to exploratory data analysis where patterns observed by a user are formalized as constraints provides a principled, intuitive, and efficient basis for constructing an EDA system

Ghent University Academic Bibliography

Interactive visual data exploration with subjective feedback : an information-theoretic approach

Author: Bie Tijl de
Kang Bo
Lijffijt Jefrey
Oikarinen Emilia
Puolamäki Kai
Publication venue
Publication date: 01/01/2020
Field of study

Visual exploration of high-dimensional real-valued datasets is a fundamental task in exploratory data analysis (EDA). Existing projection methods for data visualization use predefined criteria to choose the representation of data. There is a lack of methods that (i) use information on what the user has learned from the data and (ii) show patterns that she does not know yet. We construct a theoretical model where identified patterns can be input as knowledge to the system. The knowledge syntax here is intuitive, such as "this set of points forms a cluster", and requires no knowledge of maths. This background knowledge is used to find a maximum entropy distribution of the data, after which the user is provided with data projections for which the data and the maximum entropy distribution differ the most, hence showing the user aspects of data that are maximally informative given the background knowledge. We study the computational performance of our model and present use cases on synthetic and real data. We find that the model allows the user to learn information efficiently from various data sources and works sufficiently fast in practice. In addition, we provide an open source EDA demonstrator system implementing our model with tailored interactive visualizations. We conclude that the information theoretic approach to EDA where patterns observed by a user are formalized as constraints provides a principled, intuitive, and efficient basis for constructing an EDA system.Peer reviewe

Ghent University Academic Bibliography

Helsingin yliopiston digitaalinen arkisto

A Constrained Randomization Approach to Interactive Visual Data Exploration with Subjective Feedback

Author: Bie Tijl de
Kang Bo
Lijffijt Jefrey
Puolamäki Kai
Publication venue
Publication date: 01/01/2020
Field of study

Data visualization and iterative/interactive data mining are growing rapidly in attention, both in research as well as in industry. However, while there are a plethora of advanced data mining methods and lots of works in the field of visualization, integrated methods that combine advanced visualization and/or interaction with data mining techniques in a principled way are rare. We present a framework based on constrained randomization which lets users explore high-dimensional data via 'subjectively informative' two-dimensional data visualizations. The user is presented with 'interesting' projections, allowing users to express their observations using visual interactions that update a background model representing the user's belief state. This background model is then considered by a projection-finding algorithm employing data randomization to compute a new 'interesting' projection. By providing users with information that contrasts with the background model, we maximize the chance that the user encounters striking new information present in the data. This process can be iterated until the user runs out of time or until the difference between the randomized and the real data is insignificant. We present two case studies, one controlled study on synthetic data and another on census data, using the proof-of-concept tool SIDE that demonstrates the presented framework.Peer reviewe

Ghent University Academic Bibliography

Helsingin yliopiston digitaalinen arkisto

A tool for subjective and interactive visual data exploration

Author: C Ware
D Paurat
DJ Hand
J Lijffijt
T Bie De
T Ruotsalo
V Dzyuba
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

We present SIDE, a tool for Subjective and Interactive Visual Data Exploration, which lets users explore high dimensional data via subjectively informative 2D data visualizations. Many existing visual analytics tools are either restricted to specific problems and domains or they aim to find visualizations that align with user’s belief about the data. In contrast, our generic tool computes data visualizations that are surprising given a user’s current understanding of the data. The user’s belief state is represented as a set of projection tiles. Hence, this user-awareness offers users an efficient way to interactively explore yet-unknown features of complex high dimensional datasets

Crossref

Ghent University Academic Bibliography

Subjective information visualizations

Author: Blandford A.
Cairns P.
Faisal S.
Publication venue
Publication date: 01/01/2006
Field of study

Information Visualizations (InfoViz) are systems that require high levels of cognitive processing. They revolve around the notion of decoding and interpreting visual patterns in order to achieve certain goals. We argue that purely designing for the visual will not allow for optimum experiences since there is more to InfoViz than just the visual. Interaction is a key to achieving higher levels of knowledge. In this position paper we present a different perspective on the underlying meaning of interaction, where we describe it as incorporating both the visual and the physical activities. By physical activities we mean the physical actions upon the physical input device/s. We argue that interaction is the key element for supporting users’ subjective experiences hence these experiences should first be understood. All the discussions in this paper are based upon on going work in the field of visualizing the literature knowledge domain (LKDViz)

CiteSeerX

UCL Discovery

Design and Evaluation of a Probabilistic Music Projection Interface

Author: Boland Daniel
Murray-Smith Roderick
Steffensen Peter Berg
Vad Beatrix
Williamson John
Publication venue
Publication date: 01/01/2015
Field of study

We describe the design and evaluation of a probabilistic interface for music exploration and casual playlist generation. Predicted subjective features, such as mood and genre, inferred from low-level audio features create a 34- dimensional feature space. We use a nonlinear dimensionality reduction algorithm to create 2D music maps of tracks, and augment these with visualisations of probabilistic mappings of selected features and their uncertainty. We evaluated the system in a longitudinal trial in users’ homes over several weeks. Users said they had fun with the interface and liked the casual nature of the playlist generation. Users preferred to generate playlists from a local neighbourhood of the map, rather than from a trajectory, using neighbourhood selection more than three times more often than path selection. Probabilistic highlighting of subjective features led to more focused exploration in mouse activity logs, and 6 of 8 users said they preferred the probabilistic highlighting mode

Enlighten

Browsing through 3D representations of unstructured picture collections: an empirical study

Author: Carbonell Noëlle
Christmann Olivier
Publication venue
Publication date: 23/05/2006
Field of study

The paper presents a 3D interactive representation of fairly large picture collections which facilitates browsing through unstructured sets of icons or pictures. Implementation of this representation implies choosing between two visualization strategies: users may either manipulate the view (OV) or be immersed in it (IV). The paper first presents this representation, then describes an empirical study (17 participants) aimed at assessing the utility and usability of each view. Subjective judgements in questionnaires and debriefings were varied: 7 participants preferred the IV view, 4 the OV one, and 6 could not choose between the two. Visual acuity and visual exploration strategies seem to have exerted a greater influence on participants' preferences than task performance or feeling of immersion.Comment: 4 page

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

Learning what matters - Sampling interesting patterns

Author: M Bhuiyan
M Boley
M Leeuwen van
M Leeuwen van
S Chakraborty
S Shalev-Shwartz
T Calders
V Dzyuba
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

In the field of exploratory data mining, local structure in data can be described by patterns and discovered by mining algorithms. Although many solutions have been proposed to address the redundancy problems in pattern mining, most of them either provide succinct pattern sets or take the interests of the user into account-but not both. Consequently, the analyst has to invest substantial effort in identifying those patterns that are relevant to her specific interests and goals. To address this problem, we propose a novel approach that combines pattern sampling with interactive data mining. In particular, we introduce the LetSIP algorithm, which builds upon recent advances in 1) weighted sampling in SAT and 2) learning to rank in interactive pattern mining. Specifically, it exploits user feedback to directly learn the parameters of the sampling distribution that represents the user's interests. We compare the performance of the proposed algorithm to the state-of-the-art in interactive pattern mining by emulating the interests of a user. The resulting system allows efficient and interleaved learning and sampling, thus user-specific anytime data exploration. Finally, LetSIP demonstrates favourable trade-offs concerning both quality-diversity and exploitation-exploration when compared to existing methods.Comment: PAKDD 2017, extended versio

arXiv.org e-Print Archive

Crossref

Leiden University Scholary Publications