5 research outputs found

    Design and Interpretability of Contour Lines for Visualizing Multivariate Data

    Get PDF
    Multivariate geospatial data are commonly visualized using contour plots, where the plots for various attributes are often examined side by side, or using color blending. As the number of attributes grows, however, these approaches become less efficient. This limitation motivated the use of glyphs, where different attributes are mapped to different pre-attentive features of the glyphs. Since both contour plot overlays and glyphs clutter the underlying map, in this paper we examine whether contour lines, which are already present in map space, can be leveraged to visualize multivariate geospatial data. We present five different designs for stylizing contour lines, and investigate their interpretability using three crowdsourced studies. We evaluated the designs through a set of common geospatial data analysis tasks on a four-dimensional dataset. Our first two studies examined how the contour line width and the number of contour intervals affect interpretability, using synthetic datasets where we controlled the underlying data distribution. Study 1 revealed that the increase of width improves the task performance in most of the designs, specially in completion time, except some scenarios where reducing width does not affect performance where the visibility of the background is critical. In Study 2, we found out that fewer contour intervals lead to less visual clutter, hence improved performance. We then compared the designs in a third study that used both synthetic and real-life meteorological data. The study revealed that the results found using synthetic data were generalizable to the real-life data, as hypothesized. Moreover, we formulated a design recommendation table tuned to give users task- and category-specific design suggestions under various environment constraints. At last, we discuss the comparison between the lab and online versions of study 1 with respect to display size (lab study was done on big screen and vice versa). Our studies show the effectiveness of stylizing contour lines to represent multivariate data, reveal trade-offs among design parameters, and provide designers with important insights into the factors that influence multivariate interpretability. We also show some real-life scenarios where our visualization approach may improve decision making

    Set-Stat-Map: Visualizing Spatial Data with Mixed Numeric and Categorical Attributes

    Get PDF
    Multi-attribute datasets are common and appear in many important scenarios for data analytics. Such data can be complex and thus difficult to understand directly without using visualization techniques. Existing visualizations for multi-attribute datasets are often designed based on attribute types, i.e., whether the attributes are categorical or numerical. Parallel Coordinates and Parallel Sets are two well-known techniques to visualize numerical and categorical data, respectively. However, visualization for mixed data types appears to be challenging. A common strategy to visualize mixed data is to use multiple information-linked views, e.g., Parallel Coordinates are often augmented with maps to explore spatial data with numeric attributes. In this paper, we design visualizations for mixed data types, where the dataset may include numerical, categorical, and spatial attributes. The proposed solution Set-Stat-Map is a harmonious combination of three interactive components: Parallel Sets (visualizes sets determined by the combination of categories or numeric ranges), statistics columns (visualizes numerical summaries of the sets), and a dataset-specified map view (geospatial map view for spatial information, heatmap for pairwise information, etc.). We also augment the Parallel Sets view in two main ways: First, we impose textures on top of colors, which are spread into the other views, to enhance users' capability of analyzing distributions of pairs of attribute combinations. Second, we limit the number of sets for each axis to a small number by merging some of them into one and limit the sizes of the merged sets to improve the rendering performance as well as to reduce users' cognitive loads. We demonstrate the use of Set-Stat-Map using different types of datasets: a meteorological dataset (CFSR), an online vacation rental dataset (Airbnb), and a software developer community dataset (StackOverflow). We provide design guidelines based on the results of the analysis of the performance from both visual analytics and scalability aspects. To examine the usability of the system, we collaborated with meteorologists, which reveals both challenges and opportunities for Set-Stat-Map to be used for real-life visual analytics

    A Fast and Scalable System to Visualize Contour Gradient from Spatio-temporal Data

    Get PDF
    Changes in geological processes that span over the years may often go unnoticed due to their inherent noise and variability. Natural phenomena such as riverbank erosion, and climate change in general, is invisible to humans unless appropriate measures are taken to analyze the underlying data. Visualization helps geological sciences to generate scientific insights into such long-term geological events. Commonly used approaches such as side-by-side contour plots and spaghetti plots do not provide a clear idea about the historical spatial trends. To overcome this challenge, we propose an image-gradient based approach called ContourDiff. ContourDiff overlays gradient vector over contour plots to analyze the trends of change across spatial regions and temporal domain. Our approach first aggregates for each location, its value differences from the neighboring points over the temporal domain, and then creates a vector field representing the prominent changes. Finally, it overlays the vectors (differential trends) along the contour paths, revealing the differential trends that the contour lines (isolines) experienced over time. We designed an interface, where users can interact with the generated visualization to reveal changes and trends in geospatial data. We evaluated our system using real-life datasets, consisting of millions of data points, where the visualizations were generated in less than a minute in a single-threaded execution. We show the potential of the system in detecting subtle changes from almost identical images, describe implementation challenges, speed-up techniques, and scope for improvements. Our experimental results reveal that ContourDiff can reliably visualize the differential trends, and provide a new way to explore the change pattern in spatiotemporal data. The expert evaluation of our system using real-life WRF (Weather Research and Forecasting) model output reveals the potential of our technique to generate useful insights on the spatio-temporal trends of geospatial variables

    Visual Analysis of Variability and Features of Climate Simulation Ensembles

    Get PDF
    This PhD thesis is concerned with the visual analysis of time-dependent scalar field ensembles as occur in climate simulations. Modern climate projections consist of multiple simulation runs (ensemble members) that vary in parameter settings and/or initial values, which leads to variations in the resulting simulation data. The goal of ensemble simulations is to sample the space of possible futures under the given climate model and provide quantitative information about uncertainty in the results. The analysis of such data is challenging because apart from the spatiotemporal data, also variability has to be analyzed and communicated. This thesis presents novel techniques to analyze climate simulation ensembles visually. A central question is how the data can be aggregated under minimized information loss. To address this question, a key technique applied in several places in this work is clustering. The first part of the thesis addresses the challenge of finding clusters in the ensemble simulation data. Various distance metrics lend themselves for the comparison of scalar fields which are explored theoretically and practically. A visual analytics interface allows the user to interactively explore and compare multiple parameter settings for the clustering and investigate the resulting clusters, i.e. prototypical climate phenomena. A central contribution here is the development of design principles for analyzing variability in decadal climate simulations, which has lead to a visualization system centered around the new Clustering Timeline. This is a variant of a Sankey diagram that utilizes clustering results to communicate climatic states over time coupled with ensemble member agreement. It can reveal several interesting properties of the dataset, such as: into how many inherently similar groups the ensemble can be divided at any given time, whether the ensemble diverges in general, whether there are different phases in the time lapse, maybe periodicity, or outliers. The Clustering Timeline is also used to compare multiple climate simulation models and assess their performance. The Hierarchical Clustering Timeline is an advanced version of the above. It introduces the concept of a cluster hierarchy that may group the whole dataset down to the individual static scalar fields into clusters of various sizes and densities recording the nesting relationship between them. One more contribution of this work in terms of visualization research is, that ways are investigated how to practically utilize a hierarchical clustering of time-dependent scalar fields to analyze the data. To this end, a system of different views is proposed which are linked through various interaction possibilities. The main advantage of the system is that a dataset can now be inspected at an arbitrary level of detail without having to recompute a clustering with different parameters. Interesting branches of the simulation can be expanded to reveal smaller differences in critical clusters or folded to show only a coarse representation of the less interesting parts of the dataset. The last building block of the suit of visual analysis methods developed for this thesis aims at a robust, (largely) automatic detection and tracking of certain features in a scalar field ensemble. Techniques are presented that I found can identify and track super- and sub-levelsets. And I derive “centers of action” from these sets which mark the location of extremal climate phenomena that govern the weather (e.g. Icelandic Low and Azores High). The thesis also presents visual and quantitative techniques to evaluate the temporal change of the positions of these centers; such a displacement would be likely to manifest in changes in weather. In a preliminary analysis with my collaborators, we indeed observed changes in the loci of the centers of action in a simulation with increased greenhouse gas concentration as compared to pre-industrial concentration levels
    corecore