Search CORE

75,623 research outputs found

Approximated and User Steerable tSNE for Progressive Visual Analytics

Author: Eisemann Elmar
Höllt Thomas
Lelieveldt Boudewijn P. F.
Pezzotti Nicola
van der Maaten Laurens
Vilanova Anna
Publication venue
Publication date: 01/01/2016
Field of study

Progressive Visual Analytics aims at improving the interactivity in existing analytics techniques by means of visualization as well as interaction with intermediate results. One key method for data analysis is dimensionality reduction, for example, to produce 2D embeddings that can be visualized and analyzed efficiently. t-Distributed Stochastic Neighbor Embedding (tSNE) is a well-suited technique for the visualization of several high-dimensional data. tSNE can create meaningful intermediate results but suffers from a slow initialization that constrains its application in Progressive Visual Analytics. We introduce a controllable tSNE approximation (A-tSNE), which trades off speed and accuracy, to enable interactive data exploration. We offer real-time visualization techniques, including a density-based solution and a Magic Lens to inspect the degree of approximation. With this feedback, the user can decide on local refinements and steer the approximation level during the analysis. We demonstrate our technique with several datasets, in a real-world research scenario and for the real-time analysis of high-dimensional streams to illustrate its effectiveness for interactive data analysis

arXiv.org e-Print Archive

Leiden University Scholary Publications

Recommended from our members

ToScA North America (6 – 8 June 2017, The University of Texas, Austin, TX) Program

Author: Ahmed Farah
Maisano Jessie
Publication venue: Jackson School of Geosciences; The University of Texas at Austin
Publication date: 01/06/2017
Field of study

ToScA North America will address key areas of science, including Multi-modal Imaging, Geosciences, Forensics, Increasing Contrast, Educational Outreach, Data, Materials Science and Medical and Biological Science.University of Texas High-Resolution X-ray CT Facility (UTCT); Jackson School of Geosciences, The University of Texas at Austin; Natural History Museum (London); Royal Microscopical Society (Oxford, UK)Geological Science

Texas ScholarWorks

Simple and Effective Visual Models for Gene Expression Cancer Diagnostics

Author: Bratko Ivan
Leban Gregor
Mramor Minca
Zupan Blaz
Publication venue
Publication date: 01/01/2005
Field of study

In the paper we show that diagnostic classes in cancer gene expression data sets, which most often include thousands of features (genes), may be effectively separated with simple two-dimensional plots such as scatterplot and radviz graph. The principal innovation proposed in the paper is a method called VizRank, which is able to score and identify the best among possibly millions of candidate projections for visualizations. Compared to recently much applied techniques in the field of cancer genomics that include neural networks, support vector machines and various ensemble-based approaches, VizRank is fast and finds visualization models that can be easily examined and interpreted by domain experts. Our experiments on a number of gene expression data sets show that VizRank was always able to find data visualizations with a small number of (two to seven) genes and excellent class separation. In addition to providing grounds for gene expression cancer diagnosis, VizRank and its visualizations also identify small sets of relevant genes, uncover interesting gene interactions and point to outliers and potential misclassifications in cancer data sets

ePrints.FRI

Diffusion map for clustering fMRI spatial maps extracted by independent component analysis

Author: Alluri Vinoo
Brattico Elvira
Cong Fengyu
Nandi Asoke K.
Ristaniemi Tapani
Sipola Tuomo
Toiviainen Petri
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 27/09/2013
Field of study

Functional magnetic resonance imaging (fMRI) produces data about activity inside the brain, from which spatial maps can be extracted by independent component analysis (ICA). In datasets, there are n spatial maps that contain p voxels. The number of voxels is very high compared to the number of analyzed spatial maps. Clustering of the spatial maps is usually based on correlation matrices. This usually works well, although such a similarity matrix inherently can explain only a certain amount of the total variance contained in the high-dimensional data where n is relatively small but p is large. For high-dimensional space, it is reasonable to perform dimensionality reduction before clustering. In this research, we used the recently developed diffusion map for dimensionality reduction in conjunction with spectral clustering. This research revealed that the diffusion map based clustering worked as well as the more traditional methods, and produced more compact clusters when needed.Comment: 6 pages. 8 figures. Copyright (c) 2013 IEEE. Published at 2013 IEEE International Workshop on Machine Learning for Signal Processin

arXiv.org e-Print Archive

Crossref