7,523 research outputs found
VisIVO - Integrated Tools and Services for Large-Scale Astrophysical Visualization
VisIVO is an integrated suite of tools and services specifically designed for
the Virtual Observatory. This suite constitutes a software framework for
effective visual discovery in currently available (and next-generation) very
large-scale astrophysical datasets. VisIVO consists of VisiVO Desktop - a stand
alone application for interactive visualization on standard PCs, VisIVO Server
- a grid-enabled platform for high performance visualization and VisIVO Web - a
custom designed web portal supporting services based on the VisIVO Server
functionality. The main characteristic of VisIVO is support for
high-performance, multidimensional visualization of very large-scale
astrophysical datasets. Users can obtain meaningful visualizations rapidly
while preserving full and intuitive control of the relevant visualization
parameters. This paper focuses on newly developed integrated tools in VisIVO
Server allowing intuitive visual discovery with 3D views being created from
data tables. VisIVO Server can be installed easily on any web server with a
database repository. We discuss briefly aspects of our implementation of VisiVO
Server on a computational grid and also outline the functionality of the
services offered by VisIVO Web. Finally we conclude with a summary of our work
and pointers to future developments
14-08 Big Data Analytics to Aid Developing Livable Communities
In transportation, ubiquitous deployment of low-cost sensors combined with powerful computer hardware and high-speed network makes big data available. USDOT defines big data research in transportation as a number of advanced techniques applied to the capture, management and analysis of very large and diverse volumes of data. Data in transportation are usually well organized into tables and are characterized by relatively low dimensionality and yet huge numbers of records. Therefore, big data research in transportation has unique challenges on how to effectively process huge amounts of data records and data streams. The purpose of this study is to conduct research on the problems caused by large data volume and data streams and to develop applications for data analysis in transportation. To process large number of records efficiently, we have proposed to aggregate the data at multiple resolutions and to explore the data at various resolutions to balance between accuracy and speed. Techniques and algorithms in statistical analysis and data visualization have been developed for efficient data analytics using multiresolution data aggregation. Results will be helpful in setting up a primitive stage towards a rigorous framework for general analytical processing of big data in transportation
Visualization of multidimensional data with collocated paired coordinates and general line coordinates
Often multidimensional data are visualized by splitting n-D data to a set of low dimensional data. While it is useful it destroys integrity of n-D data, and leads to a shallow understanding complex n-D data. To mitigate this challenge a difficult perceptual task of assembling low-dimensional visualized pieces to the whole n-D vectors must be solved. Another way is a lossy dimension reduction by mapping n-D vectors to 2-D vectors (e.g., Principal Component Analysis). Such 2-D vectors carry only a part of information from n-D vectors, without a way to restore n-D vectors exactly from it. An alternative way for deeper understanding of n-D data is visual representations in 2-D that fully preserve n-D data. Methods of Parallel and Radial coordinates are such methods. Developing new methods that preserve dimensions is a long standing and challenging task that we address by proposing Paired Coordinates that is a new type of n-D data visual representation and by generalizing Parallel and Radial coordinates as a General Line coordinates. The important novelty of the concept of the Paired Coordinates is that it uses a single 2-D plot to represent n-D data as an oriented graph based on the idea of collocation of pairs of attributes. The advantage of the General Line Coordinates and Paired Coordinates is in providing a common framework that includes Parallel and Radial coordinates and generating a large number of new visual representations of multidimensional data without lossy dimension reduction
A Visual Modeling Method for Spatiotemporal and Multidimensional Features in Epidemiological Analysis: Applied COVID-19 Aggregated Datasets
The visual modeling method enables flexible interactions with rich graphical
depictions of data and supports the exploration of the complexities of
epidemiological analysis. However, most epidemiology visualizations do not
support the combined analysis of objective factors that might influence the
transmission situation, resulting in a lack of quantitative and qualitative
evidence. To address this issue, we have developed a portrait-based visual
modeling method called +msRNAer. This method considers the spatiotemporal
features of virus transmission patterns and the multidimensional features of
objective risk factors in communities, enabling portrait-based exploration and
comparison in epidemiological analysis. We applied +msRNAer to aggregate
COVID-19-related datasets in New South Wales, Australia, which combined
COVID-19 case number trends, geo-information, intervention events, and
expert-supervised risk factors extracted from LGA-based censuses. We perfected
the +msRNAer workflow with collaborative views and evaluated its feasibility,
effectiveness, and usefulness through one user study and three subject-driven
case studies. Positive feedback from experts indicates that +msRNAer provides a
general understanding of analyzing comprehension that not only compares
relationships between cases in time-varying and risk factors through portraits
but also supports navigation in fundamental geographical, timeline, and other
factor comparisons. By adopting interactions, experts discovered functional and
practical implications for potential patterns of long-standing community
factors against the vulnerability faced by the pandemic. Experts confirmed that
+msRNAer is expected to deliver visual modeling benefits with spatiotemporal
and multidimensional features in other epidemiological analysis scenarios
Progressive Analytics: A Computation Paradigm for Exploratory Data Analysis
Exploring data requires a fast feedback loop from the analyst to the system,
with a latency below about 10 seconds because of human cognitive limitations.
When data becomes large or analysis becomes complex, sequential computations
can no longer be completed in a few seconds and data exploration is severely
hampered. This article describes a novel computation paradigm called
Progressive Computation for Data Analysis or more concisely Progressive
Analytics, that brings at the programming language level a low-latency
guarantee by performing computations in a progressive fashion. Moving this
progressive computation at the language level relieves the programmer of
exploratory data analysis systems from implementing the whole analytics
pipeline in a progressive way from scratch, streamlining the implementation of
scalable exploratory data analysis systems. This article describes the new
paradigm through a prototype implementation called ProgressiVis, and explains
the requirements it implies through examples.Comment: 10 page
Abstract Data Visualisation in Mobile VR Platforms
Data visualisation, as a key tool in data understanding, is widely used in science and everyday life. In order data visualisation to be effective, perceptual factors and the characteristics of the display interface play a crucial role. Virtual Reality is nowadays accepted as a valid medium for scientific visualisation, because of its inherent characteristics of real-world emulation and intuitive interaction. However, the use of VR in abstract data visualisation is still limited. In this research, I investigate the use and suitability of mobile phone-based Virtual Reality as a medium for abstract data visualisation. I develop a prototype VR Android application and visualise data using the Scatterplot and Parallel Coordinates methods. After that, I conduct a user study to compare the effectiveness of the mobile VR application compared to a similar screen-based one by implementing some data exploration scenarios. The study results, while not being statistically significant, show improved accuracy and speed in the mobile VR visualisation application. The main conclusions are two-fold: Virtual Reality is beneficial for abstract data visualisation, even in the case of limited processing power and display resolution. Mobile VR, an affordable alternative to expensive desktop VR set-ups can be utilized as a data visualisation platform
Parallel Coordinates for Discovery of Interpretable Machine Learning Models
This work uses visual knowledge discovery in parallel coordinates to advance
methods of interpretable machine learning. The graphic data representation in
parallel coordinates made the concepts of hypercubes and hyperblocks (HBs)
simple to understand for end users. It is suggested to use mixed and pure
hyperblocks in the proposed data classifier algorithm Hyper. It is shown that
Hyper models generalize decision trees. The algorithm is presented in several
settings and options to discover interactively or automatically overlapping or
non-overlapping hyperblocks. Additionally, the use of hyperblocks in
conjunction with language descriptions of visual patterns is demonstrated. The
benchmark data from the UCI ML repository were used to evaluate the Hyper
algorithm. It enabled the discovery of mixed and pure HBs evaluated using
10-fold cross validation. Connections among hyperblocks, dimension reduction
and visualization have been established. The capability of end users to find
and observe hyperblocks, as well as the ability of side-by-side visualizations
to make patterns evident, are among major advantages ofhyperblock technology
and the Hyper algorithm. A new method to visualize incomplete n-D data with
missing values is proposed, while the traditional parallel coordinates do not
support it. The ability of HBs to better prevent both overgeneralization and
overfitting of data over decision trees is demonstrated as another benefit of
the hyperblocks. The features of VisCanvas 2.0 software tool that implements
Hyper technology are presented.Comment: 32 pages, 30 figures, 7 tables. arXiv admin note: substantial text
overlap with arXiv:2106.0747
mTreeIllustrator: A Mixed-Initiative Framework for Visual Exploratory Analysis of Multidimensional Hierarchical Data
Multidimensional hierarchical (mTree) data are very common in daily life and scientific research. However, mTree data exploration is a laborious and time-consuming process due to its structural complexity and large dimension combination space. To address this problem, we present mTreeIllustrator, a mixed-initiative framework for exploratory analysis of multidimensional hierarchical data with faceted visualizations. First, we propose a recommendation pipeline for the automatic selection and visual representation of important subspaces of mTree data. Furthermore, we design a visual framework and an interaction schema to couple automatic recommendations with human specifications to facilitate progressive exploratory analysis. Comparative experiments and user studies demonstrate the usability and effectiveness of our framework
- …