12 research outputs found

    Measuring Categorical Perception in Color-Coded Scatterplots

    Full text link
    Scatterplots commonly use color to encode categorical data. However, as datasets increase in size and complexity, the efficacy of these channels may vary. Designers lack insight into how robust different design choices are to variations in category numbers. This paper presents a crowdsourced experiment measuring how the number of categories and choice of color encodings used in multiclass scatterplots influences the viewers' abilities to analyze data across classes. Participants estimated relative means in a series of scatterplots with 2 to 10 categories encoded using ten color palettes drawn from popular design tools. Our results show that the number of categories and color discriminability within a color palette notably impact people's perception of categorical data in scatterplots and that the judgments become harder as the number of categories grows. We examine existing palette design heuristics in light of our results to help designers make robust color choices informed by the parameters of their data.Comment: The paper has been accepted to the ACM CHI 2023. 14 pages, 7 figure

    Out of the plane: Flower vs. star glyphs to support high-dimensional exploration in two-dimensional embeddings

    Get PDF
    Exploring high-dimensional data is a common task in many scientific disciplines. To address this task, two-dimensional embeddings, such as tSNE and UMAP, are widely used. While these determine the 2D position of data items, effectively encoding the first two dimensions, suitable visual encodings can be employed to communicate higher-dimensional features. To investigate such encodings, we have evaluated two commonly used glyph types, namely flower glyphs and star glyphs. To evaluate their capabilities for communicating higher-dimensional features in two-dimensional embeddings, we ran a large set of crowd-sourced user studies using real-world data obtained from data.gov. During these studies, participants completed a broad set of relevant tasks derived from related research. This paper describes the evaluated glyph designs, details our tasks, and the quantitative study setup before discussing the results. Finally, we will present insights and provide guidance on the choice of glyph encodings when exploring high-dimensional data.Peer ReviewedPostprint (published version

    A Multi-Faceted Approach for Evaluating Visualization Recommendation Algorithms

    Get PDF
    Data visualizations allow analysts to quickly understand data trends, outliers, and patterns. However, designing the "best" visualizations for a given dataset is complicated. Multiple factors need to be considered, such as the data size, data types, target analysis tasks being supported, and even how the visualization needs to be personalized to the audience. In response, many visualization recommendation algorithms are being proposed to reduce user effort by automatically making some or all of these design decisions for analysts. However, existing visualization recommendation algorithms are evaluated in isolation, or the comparisons do not measure user performance. In other words, existing algorithms are not tested in a way that aligns with how they are used in practice. The lack of evaluation approaches makes it impossible to know how functional an algorithm is compared to another across various analysis tasks, hindering our ability to design new algorithms that provide significantly more benefits than the existing ones.This dissertation contributes a multi-faceted approach for evaluating visualization recommendation algorithms to investigate factors affecting an algorithm's performance and ways to improve it. It first proposes an evaluation-focused framework and then demonstrates how the framework can evaluate strategic behaviors and user performance among a broad range of existing algorithms. The case study results show that newly proposed algorithms might not significantly improve user performance. One way to improve the algorithm performance is by integrating more established theoretical rules or empirical results on how people perceive different visualization designs, i.e., graphical perception guidelines, to guide the recommendation ranking process. Thus, this dissertation next presents a thorough literature review of existing graphical perception literature that can inform visualization recommendation algorithms. It contributes a systematic dataset that collates existing theoretical and experimental visualization comparison results and summarizes key study outcomes. Further, this dissertation conducts an exploratory analysis to investigate the influence of each piece of graphical perception study in changing a visualization recommendation algorithm's behavior and outputs. The analysis results show that some graphical perception studies can alter the behavior of visualization recommendation algorithms dominantly, while others have little influence. Based on the analysis findings, this dissertation opens avenues at the intersection of graphical perception and visualization research, like how to evaluate the effectiveness of new graphical perception work in guiding visualization recommendations
    corecore