441 research outputs found

    Animated interval scatter-plot views for the exploratory analysis of large scale microarray time-course data.

    Get PDF
    Microarray technologies are a relatively new development that allow biologists to monitor the activity of thousands of genes (normally around 8,000) in parallel across multiple stages of a biological process. While this new perspective on biological functioning is recognised as having the potential to have a significant impact on the diagnosis, treatment, and prevention of diseases, it is only through effective analysis of the data produced that biologists can begin to unlock this potential. A significant obstacle to achieving effective analysis of microarray time-course is the combined scale and complexity of the data. This inevitably makes it difficult to reveal certain significant patterns in the data. In particular, it is less dominant patterns and, specifically, patterns that occur over smaller intervals of an experiment's overall time-frame that are more difficult to find. While existing techniques are capable of finding either unexpected patterns of activity over the majority of an experiment's time-frame or expected patterns of activity over smaller intervals of the time-frame, there are no techniques, or combination of techniques, that are suitable for finding unsuspected patterns of activity over smaller intervals. In order to overcome this limitation we have developed the Time-series Explorer, which specifically supports biologists in their attempts to reveal these types of pattern by allowing them to control an animated interval scatter-plot view of their data. This paper discusses aspects of the technique that make such an animated overview viable and describes the results of a user evaluation assessing the practical utility of the technique within the wider context of microarray time-series analysis as a whole

    MaTSE: the gene expression time-series explorer.

    Get PDF
    Background High throughput gene expression time-course experiments provide a perspective on biological functioning recognized as having huge value for the diagnosis, treatment, and prevention of diseases. There are however significant challenges to properly exploiting this data due to its massive scale and complexity. In particular, existing techniques are found to be ill suited to finding patterns of changing activity over a limited interval of an experiments time frame. The Time-Series Explorer (TSE) was developed to overcome this limitation by allowing users to explore their data by controlling an animated scatter-plot view. MaTSE improves and extends TSE by allowing users to visualize data with missing values, cross reference multiple conditions, highlight gene groupings, and collaborate by sharing their findings. Results MaTSE was developed using an iterative software development cycle that involved a high level of user feedback and evaluation. The resulting software combines a variety of visualization and interaction techniques which work together to allow biologists to explore their data and reveal temporal patterns of gene activity. These include a scatter-plot that can be animated to view different temporal intervals of the data, a multiple coordinated view framework to support the cross reference of multiple experimental conditions, a novel method for highlighting overlapping groups in the scatter-plot, and a pattern browser component that can be used with scatter-plot box queries to support cooperative visualization. A final evaluation demonstrated the tools effectiveness in allowing users to find unexpected temporal patterns and the benefits of functionality such as the overlay of gene groupings and the ability to store patterns. Conclusions We have developed a new exploratory analysis tool, MaTSE, that allows users to find unexpected patterns of temporal activity in gene expression time-series data. Overall, the study acted well to demonstrate the benefits of an iterative software development life cycle and allowed us to investigate some visualization problems that are likely to be common in the field of bioinformatics. The subjects involved in the final evaluation were positive about the potential of MaTSE to help them find unexpected patterns in their data and characterized MaTSE as an exploratory tool valuable for hypothesis generation and the creation of new biological knowledge

    Visual Support for the Modeling and Simulation of Cell Biological Processes

    Get PDF
    This dissertation aims at bringing information visualization closer to the demands of analytical problem solving for the specific domain of modeling and simulating cell biological systems. To this end, main segments of visual support in the domain are identified. For one of these segments, the visual analysis of simulation data, new concepts are developed. First, this includes the visualization of simulation data in the context of data generation. Second, new multiple view techniques for large and complex simulation data are introduced.Diese Arbeit verfolgt das Ziel, Informationsvisualisierung nรคher an die Anforderungen des Analyseprozesses heranzufรผhren, mit Blick auf die konkrete Anwendung der Modellierung und Simulation zellbiologischer Systeme. Dazu werden wesentliche Teilbereiche der visuellen Unterstรผtzung identifiziert. Fรผr den Teilbereich der visuellen Analyse von Simulationsdaten werden neue Konzepte entwickelt. Dies beinhaltet zum einen die Visualisierung von Simulationsdaten im Kontext der Datengenerierung. Zum anderen werden neue Multiple-View-Techniken fรผr groรŸe und komplexe Simulationsdaten vorgestellt

    Knowledge visualization: From theory to practice

    Get PDF
    Visualizations have been known as efficient tools that can help users analyze com- plex data. However, understanding the displayed data and finding underlying knowl- edge is still difficult. In this work, a new approach is proposed based on understanding the definition of knowledge. Although there are many definitions used in different ar- eas, this work focuses on representing knowledge as a part of a visualization and showing the benefit of adopting knowledge representation. Specifically, this work be- gins with understanding interaction and reasoning in visual analytics systems, then a new definition of knowledge visualization and its underlying knowledge conversion processes are proposed. The definition of knowledge is differentiated as either explicit or tacit knowledge. Instead of directly representing data, the value of the explicit knowledge associated with the data is determined based on a cost/benefit analysis. In accordance to its importance, the knowledge is displayed to help the user under- stand the complex data through visual analytical reasoning and discovery

    ์˜ํ•™ ์—ฐ๊ตฌ์—์„œ์˜ ๊ณผํ•™์  ์ฆ๊ฑฐ์˜ ํ™œ์šฉ์„ ์œ„ํ•œ ์‹œ๊ฐ์  ๋ถ„์„ ์‹œ์Šคํ…œ ๋””์ž์ธ

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(๋ฐ•์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ์ปดํ“จํ„ฐ๊ณตํ•™๋ถ€, 2022. 8. ์„œ์ง„์šฑ.Evidence-based medicine, "the conscientious, explicit, and judicious use of current best evidence in healthcare and medical research" [98], is one of the most widely accepted medical paradigms of modern times. Searching, reviewing, and synthesizing reliable and high-quality scientific evidence is the key step for the paradigm. However, despite the widespread use of the EBM paradigm, challenges remain in applying Evidence-based medicine protocols to medical research. One of the barriers to applying the best scientific evidence to medical research is the severe literature and clinical data overload that causes the evidence-based tasks to be tremendous time-consuming tasks that require vast human effort. In this dissertation, we aim to employ visual analytics approaches to address the challenges of searching and reviewing massive scientific evidence in medical research. To overcome the burden and facilitate handling scientific evidence in medical research, we conducted three design studies and implemented novel visual analytics systems for laborious evidence-based tasks. First, we designed PLOEM, a novel visual analytics system to aid evidence synthesis, an essential step in Evidence-Based medicine, and generate an Evidence Map in a standardized method. We conducted a case study with an oncologist with years of evidence-based medicine experience. In the second study, we conducted a preliminary survey with 76 medical doctors to derive the design requirements for a biomedical literature search. Based on the results, We designed EEEVis, an interactive visual analytic system for biomedical literature search tasks. The system enhances the PubMed search result with several bibliographic visualizations and PubTator annotations. We performed a user study to evaluate the designs with 24 medical doctors and presented the design guidelines and challenges for a biomedical literature search system design. The third study presents GeneVis, a visual analytics system to identify and analyze gene expression signatures across major cancer types. A task that cancer researchers utilize to discover biomarkers in precision medicine. We conducted four case studies with domain experts in oncology and genomics. The study results show that the system can facilitate the task and provide new insights from the data. Based on the three studies of this dissertation, we conclude that carefully designed visual analytics approaches can provide an enhanced understanding and support medical researchers for laborious evidence-based tasks in medical research.๊ทผ๊ฑฐ์ค‘์‹ฌ์˜ํ•™(Evidence-Based Medicine)์ด๋ž€ "์ž„์ƒ ์น˜๋ฃŒ ๋ฐ ์˜ํ•™ ์—ฐ๊ตฌ์—์„œ ํ˜„์žฌ ์กด์žฌํ•˜๋Š” ์ตœ๊ณ ์˜ ์ฆ๊ฑฐ๋ฅผ ์–‘์‹ฌ์ ์ด๊ณ , ๋ช…๋ฐฑํ•˜๋ฉฐ, ๋ถ„๋ณ„ ์žˆ๊ฒŒ ์ด์šฉํ•˜๋Š” ๋ฐฉ๋ฒ•๋ก "์ด๋ฉฐ [98], ํ˜„๋Œ€ ์˜ํ•™์—์„œ ๊ฐ€์žฅ ๋„๋ฆฌ ๋ฐ›์•„๋“ค์—ฌ์ง€๋Š” ์˜ํ•™ ํŒจ๋Ÿฌ๋‹ค์ž„์ด๋‹ค. ์‹ ๋ขฐํ•  ์ˆ˜ ์žˆ๋Š” ๊ณ ์ˆ˜์ค€์˜ ๊ณผํ•™์  ๊ทผ๊ฑฐ๋ฅผ ๊ฒ€์ƒ‰, ๊ฒ€ํ† , ํ•ฉ์„ฑํ•˜๋Š” ๊ฒƒ์ด์•ผ ๋ง๋กœ ๊ทผ๊ฑฐ์ค‘์‹ฌ์˜ํ•™์˜ ํ•ต์‹ฌ์ด๋‹ค. ํ•˜์ง€๋งŒ, ๊ทผ๊ฑฐ์ค‘์‹ฌ์˜ํ•™์ด ์ด๋ฏธ ๊ด‘๋ฒ”์œ„ํ•˜๊ฒŒ ์‚ฌ์šฉ๋˜๊ณ  ์žˆ์Œ์—๋„ ๋ถˆ๊ตฌํ•˜๊ณ , ์˜ํ•™ ์—ฐ๊ตฌ์— ๊ทผ๊ฑฐ์ค‘์‹ฌ์˜ํ•™์˜ ํ”„๋กœํ† ์ฝœ์„ ์‹ค์ฒœํ•˜๋Š” ๋ฐ์—๋Š” ์—ฌ์ „ํžˆ ๋งŽ์€ ์–ด๋ ค์›€์ด ๋”ฐ๋ฅธ๋‹ค. ์˜๋ฃŒ ๋ฌธํ—Œ ์ •๋ณด, ์ž„์ƒ ์ •๋ณด ๋ฐ ์œ ์ „์ฒดํ•™ ์ •๋ณด๊นŒ์ง€ ์—ฐ๊ตฌ์ž๊ฐ€ ๊ฒ€ํ† ํ•ด์•ผ ํ•  ๊ทผ๊ฑฐ์˜ ์–‘์€ ๋ฐฉ๋Œ€ํ•˜๋ฉฐ ๊ด‘๋ฒ”์œ„ํ•˜๋‹ค. ๋˜ํ•œ ์˜ํ•™๊ณผ ๊ธฐ์ˆ ์˜ ๋ฐœ์ „์œผ๋กœ ์ธํ•ด ์ ์ฐจ ๋” ๋น ๋ฅธ ์†๋„๋กœ ๋Š˜์–ด๋‚˜๊ณ  ์žˆ๊ธฐ์—, ์ด๋ฅผ ๋ชจ๋‘ ์—„๋ฐ€ํžˆ ๊ฒ€ํ† ํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ๋ง‰๋Œ€ํ•œ ์–‘์˜ ์‹œ๊ฐ„๊ณผ ์ธ๋ ฅ์ด ์žˆ์–ด์•ผ ํ•œ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์€ ์‹œ๊ฐ์  ๋ถ„์„ ๋ฐฉ๋ฒ•๋ก ์„ ์ ‘๋ชฉํ•˜์—ฌ ์˜ํ•™ ์—ฐ๊ตฌ์—์„œ ๋ฐฉ๋Œ€ํ•œ ๊ณผํ•™์  ์ฆ๊ฑฐ๋ฅผ ๊ฒ€์ƒ‰ํ•˜๊ณ  ๊ฒ€ํ† ํ•  ์‹œ ๋ฐœ์ƒํ•˜๋Š” ๋ง‰๋Œ€ํ•œ ์ธ์  ์ž์›์˜ ๊ณผ๋ถ€ํ•˜ ๋ฌธ์ œ๋ฅผ ์™„ํ™”ํ•˜๊ณ ์ž ํ•œ๋‹ค. ์ด๋ฅผ ์œ„ํ•˜์—ฌ ๊ทผ๊ฑฐ์ค‘์‹ฌ์˜ํ•™์˜ ์ ˆ์ฐจ ์ค‘ ํŠนํžˆ ์ธ๋ ฅ ์†Œ๋ชจ๊ฐ€ ๋ง‰์‹ฌํ•œ ์ ˆ์ฐจ๋“ค์„ ์„ ์ •ํ•˜๊ณ , ์ด๋Ÿฌํ•œ ๋‚œ๊ด€์„ ๊ทน๋ณตํ•˜๊ณ  ๋ณด๋‹ค ํšจ์œจ์ ์ด๊ณ  ํšจ๊ณผ์ ์œผ๋กœ ๋ฐ์ดํ„ฐ์—์„œ ์œ ์˜๋ฏธํ•œ ์ •๋ณด๋ฅผ ๋„์ถœํ•  ์ˆ˜ ์žˆ๊ฒŒ๋” ๋ณด์กฐํ•˜๋Š” ์„ธ ๊ฐ€์ง€ ์‹œ๊ฐ์  ๋ถ„์„ ์‹œ์Šคํ…œ๋“ค์„ ๊ตฌํ˜„ํ•˜์˜€์œผ๋ฉฐ, ๊ฐ๊ฐ์˜ ์‹œ์Šคํ…œ์— ๊ด€ํ•œ ๋””์ž์ธ ์—ฐ๊ตฌ๋ฅผ ์ˆ˜ํ–‰ํ•˜์˜€๋‹ค. ์šฐ์„  ์ฒซ ๋””์ž์ธ ์—ฐ๊ตฌ์—์„œ๋Š” ๊ทผ๊ฑฐ์ค‘์‹ฌ์˜ํ•™ ์—ฐ๊ตฌ์— ์žˆ์–ด ํ•„์ˆ˜์  ๋‹จ๊ณ„์ธ ๊ทผ๊ฑฐ ํ•ฉ์„ฑ ๋ฐฉ๋ฒ•๋ก ์˜ ํ•˜๋‚˜์ธ ๊ทผ๊ฑฐ ๋งคํ•‘(Evidence Mapping) ๊ณผ์ •์„ ์ง€์›ํ•˜๊ธฐ ์œ„ํ•œ ์‹œ๊ฐ์  ๋ถ„์„ ์‹œ์Šคํ…œ PLOEM์„ ์„ค๊ณ„ํ–ˆ๋‹ค. ๊ทธ๋ฆฌ๊ณ  ์ด๋ฅผ ๊ฒ€์ฆํ•˜๊ธฐ ์œ„ํ•ด ๋‹ค๋…„๊ฐ„์˜ ๊ทผ๊ฑฐ ๊ธฐ๋ฐ˜ ์˜๋ฃŒ ๊ฒฝํ—˜์ด ์žˆ๋Š” ์ข…์–‘ํ•™์ž์™€ ํ•จ๊ป˜ ์‚ฌ๋ก€ ์—ฐ๊ตฌ๋ฅผ ์ˆ˜ํ–‰ํ–ˆ๋‹ค. ๋‘ ๋ฒˆ์งธ ๋””์ž์ธ ์—ฐ๊ตฌ์—์„œ๋Š” ์˜ํ•™ ๋ฌธํ—Œ ๊ฒ€์ƒ‰ ์‹œ์Šคํ…œ์˜ ์š”๊ตฌ์‚ฌํ•ญ ๋ถ„์„์„ ์œ„ํ•ด ์ด 76๋ช…์˜ ์˜์‚ฌ๋ฅผ ์ƒ๋Œ€๋กœ ์„ค๋ฌธ์กฐ์‚ฌ๋ฅผ ์ง„ํ–‰ํ•˜์˜€๊ณ , ์ด๋Ÿฌํ•œ ๋ถ„์„์„ ๋ฐ”ํƒ•์œผ๋กœ ๋Œ€ํ™”ํ˜• ์‹œ๊ฐ์  ๋ถ„์„ ์‹œ์Šคํ…œ์ธ EEEVis๋ฅผ ์„ค๊ณ„ํ–ˆ๋‹ค. ์ด ์‹œ์Šคํ…œ์€ ์—ฌ๋Ÿฌ ์ข…์˜ ์„œ์ง€ ์ •๋ณด ์‹œ๊ฐํ™” ์ธํ„ฐํŽ˜์ด์Šค์™€ PubTator์˜ ์ฃผ์„ ์ •๋ณด๋ฅผ ํ™œ์šฉํ•˜์—ฌ PubMed ๊ฒ€์ƒ‰ ์—”์ง„์˜ ๊ฒ€์ƒ‰ ๊ฒฐ๊ณผ๋ฅผ ์ฆ๊ฐ•ํ•˜๋Š” ์‹œ์Šคํ…œ์ด๋ฉฐ, ์ด๋ฅผ ํ‰๊ฐ€ํ•˜๊ธฐ ์œ„ํ•ด ์ด 24๋ช…์˜ ์˜์‚ฌ์™€ ํ•จ๊ป˜ ์‚ฌ์šฉ์ž ์—ฐ๊ตฌ๋ฅผ ์ˆ˜ํ–‰ํ•˜์˜€๋‹ค. ์ด ์—ฐ๊ตฌ ๊ฒฐ๊ณผ๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ์˜ํ•™ ๋ฌธํ—Œ ๊ฒ€์ƒ‰ ์‹œ์Šคํ…œ์— ๋Œ€ํ•œ ์„ค๊ณ„ ์ง€์นจ๊ณผ ๊ณผ์ œ๋ฅผ ์ œ์‹œํ•œ๋‹ค. ๋งˆ์ง€๋ง‰์œผ๋กœ ์„ธ ๋ฒˆ์งธ ๋””์ž์ธ ์—ฐ๊ตฌ์—์„œ๋Š” ์ž„์˜์˜ ์œ ์ „์ž๊ตฐ์˜ ์œ ์ „์ž ๋ฐœํ˜„ ํŒจํ„ด์„ ์ฃผ์š” ์•” ์œ ํ˜•์— ๋”ฐ๋ผ ์‹œ๊ฐํ™”ํ•˜๊ณ  ๋ถ„์„ํ•  ์ˆ˜ ์žˆ๋Š” ์‹œ์Šคํ…œ์ธ GeneVis๋ฅผ ์„ค๊ณ„ํ•˜์˜€๋‹ค. ์•” ์œ ํ˜•์— ๋”ฐ๋ฅธ ์œ ์ „์ž ๋ฐœํ˜„ ํŒจํ„ด์˜ ๋ถ„์„๊ณผ ๋น„๊ต๋Š” ์•” ์—ฐ๊ตฌ์ž๋“ค์ด ์ •๋ฐ€ ์˜ํ•™์—์„œ ์ƒ์ฒด ์ง€ํ‘œ(Biomarker)๋ฅผ ๋ฐœ๊ฒฌํ•˜๊ธฐ ์œ„ํ•ด ๋นˆ๋ฒˆํžˆ ์ˆ˜ํ–‰ํ•˜๋Š” ์ž‘์—…์ด๋‹ค. ์šฐ๋ฆฌ๋Š” ์ข…์–‘ํ•™ ์ „๋ฌธ๊ฐ€ ๋ฐ ์œ ์ „์ฒดํ•™ ์ „๋ฌธ๊ฐ€ ์ด 4์ธ์„ ๋Œ€์ƒ์œผ๋กœ ์‚ฌ๋ก€ ์—ฐ๊ตฌ๋ฅผ ์ง„ํ–‰ํ•˜์˜€๊ณ , ๊ทธ ๊ฒฐ๊ณผ GeneVis๊ฐ€ ํ•ด๋‹น ์ž‘์—…์„ ๋” ์ˆ˜์›”ํ•˜๊ฒŒ ์ˆ˜ํ–‰ํ•˜๋Š” ๊ฒƒ๊ณผ ๊ธฐ์กด์˜ ๋ฐ์ดํ„ฐ์—์„œ ์ƒˆ๋กœ์šด ์ •๋ณด๋ฅผ ๋„์ถœํ•˜๋Š” ๊ฒƒ์— ๋„์›€์ด ๋˜์—ˆ์Œ์„ ํ™•์ธํ•˜์˜€๋‹ค. ์œ„์˜ ์„ธ ๋””์ž์ธ ์—ฐ๊ตฌ์˜ ๊ฒฐ๊ณผ๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ, ๋ณธ ๋…ผ๋ฌธ์€ ์‚ฌ์šฉ์ž ๋ถ„์„๊ณผ ์ž‘์—… ๋ถ„์„์„ ๋™๋ฐ˜ํ•œ ์‹œ๊ฐ์  ๋ถ„์„ ๋ฐฉ๋ฒ•๋ก ์ด ์˜ํ•™ ์—ฐ๊ตฌ์˜ ๊ทผ๊ฑฐ ๊ด€๋ จ ์ž‘์—…์˜ ์–ด๋ ค์›€์„ ํ•ด์†Œํ•˜๊ณ , ๋ถ„์„ ๋ฐ์ดํ„ฐ์— ๋Œ€ํ•œ ๋ณด๋‹ค ๋‚˜์€ ์ดํ•ด๋ฅผ ์ œ๊ณตํ•˜๋Š” ๊ฒƒ์ด ๊ฐ€๋Šฅํ•˜๋‹ค๊ณ  ๊ฒฐ๋ก  ๋‚ด๋ฆฐ๋‹ค.CHAPTER1 Introduction 1 1.1 Background and Motivation 1 1.2 Dissertation Outline 5 CHAPTER2 Related Work 7 2.1 Evidence Mapping: Graphical Representation for a Scientific Evidence Landscape 7 2.2 Scientific Literature Visualizations and Bibliography Visualizations 9 2.3 Visual Anlytics Systems for Genomics Data sets and Research Tasks 10 CHAPTER3 PLOEM: An Interactive Visualization Tool for Effective Evidence Mapping with Biomedical literature 12 3.1 Introduction 12 3.2 Visual Representations and Interactions of PLOEM 14 3.2.1 Overview of the PICO Criteria 14 3.2.2 Trend Visualization with the Timeline view 17 3.2.3 Representing the PICO Co-occurrence with the Relation view 20 3.2.4 Study detail view 22 3.3 Usage Scenarios: Visualizing Various Study Sizes with PLOEM 23 3.4 Conclusion 24 CHAPTER4 EEEvis: Efficacy improvement in searching MEDLINE database using a novel PubMed visual analytic system 26 4.1 Introduction 26 4.1.1 Motivation 26 4.1.2 Preliminary Survey: A Questionnaire on conventional literature search methods 28 4.1.3 Design Requirements for Biomedical Literature Search Systems 36 4.2 System and Interface Implementation of EEEVis 37 4.2.1 System Overview 37 4.2.2 Bibliography Filters 40 4.2.3 Timeline View 41 4.2.4 Co-authorship Network View 43 4.2.5 Article List and Detail View 44 4.3 User Study 46 4.3.1 Participants 46 4.3.2 Procedures 48 4.3.3 Results and Observations 50 4.4 Discussion 54 4.4.1 Design Implications 56 4.4.2 Limitations and Future Work 57 4.5 Conclusions 59 CHAPTER5 GeneVis: A Visual Analytics Systemfor Gene Signature Analysis in Cancers 68 5.1 Motivation 68 5.2 System and Interface Implementation 69 5.2.1 System Overview 69 5.2.2 Gene Expression Detail View 71 5.2.3 Gene Vector Projection View 72 5.2.4 Gene x Cancer Type Heatmap view 74 5.2.5 User Interaction in Multiple Coordinated Views 76 5.3 Case Studies 76 5.3.1 Participants 76 5.3.2 Task and Procedures 76 5.3.3 Case1: Identifying SimilarGeneSignatures with TGFB1in Hallmark Gene Sets 80 5.3.4 Case2: Identifying Cluster Patterns in the HRD data set 81 5.3.5 Results 82 5.4 Summary 85 CHAPTER6 Conclusion and future work 86 6.1 Conclusion 86 6.2 Future Work 87 Abstract (Korean) . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102๋ฐ•

    Neural-Network Enhanced Visualization of High-Dimensional Data

    Get PDF

    A visual analytics approach to feature discovery and subspace exploration in protein flexibility matrices

    Get PDF
    The vast amount of information generated by domain scientists makes the transi- tion from data to knowledge difficult and often impedes important discoveries. For example, the knowledge gained from protein flexibility data sets can speed advances in genetic therapies and drug discovery. However, these models generate so much data that large scale analysis by traditional methods is almost impossible. This hinders biomedical advances. Visual analytics is a new field that can help alleviate this problem. Visual analytics attempts to seamlessly integrate human abilities in pattern recognition, domain knowledge, and synthesis with automatic analysis techniques. I propose a novel, visual analytics pipeline and prototype which eases discovery, com- parison, and exploration in the outputs of complex computational biology datasets. The approach utilizes automatic feature extraction by image segmentation to locate regions of interest in the data, visually presents the features to users in an intuitive way, and provides rich interactions for multi-resolution visual exploration. Functional- ity is also provided for subspace exploration based on automatic similarity calculation and comparative visualizations. The effectiveness of feature discovery and subspace exploration is shown through a user study and user scenarios. Feedback from analysts confirms the suitability of the proposed solution to domain tasks

    Epiviz: Integrative Visual Analysis Software for Genomics

    Get PDF
    Computational and visual data analysis for genomics has traditionally involved a combination of tools and resources, of which the most ubiquitous consist of genome browsers, focused mainly on integrative visualization of large numbers of big datasets, and computational environments, focused on data modeling of a small number of moderately sized datasets. Workflows that involve the integration and exploration of multiple heterogeneous data sources, small and large, public and user specific have been poorly addressed by these tools. Commonly, the data visualized in these tools is the output of analyses performed in powerful computing environments like R/Bioconductor or Python. Two essential aspects of data analysis are usually treated as distinct, in spite of being part of the same exploratory process: algorithmic analysis and interactive visualization. In current technologies these are not integrated within one tool, but rather, one precedes the other. Recent technological advances in web-based data visualization have made it possible for interactive visualization tools to tightly integrate with powerful algorithmic tools, without being restricted to one such tool in particular. We introduce Epiviz (http://epiviz.cbcb.umd.edu), an integrative visualization tool that bridges the gap between the two types of tools, simplifying genomic data analysis workflows. Epiviz is the first genomics interactive visualization tool to provide tight-knit integration with computational and statistical modeling and data analysis. We discuss three ways in which Epiviz advances the field of genomic data analysis: 1) it brings code to interactive visualizations at various different levels; 2) takes the first steps in the direction of collaborative data analysis by incorporating user plugins from source control providers, as well as by allowing analysis states to be shared among the scientific community; 3) combines established analysis features that have never before been available simultaneously in a visualization tool for genomics. Epiviz can be used in multiple branches of genomics data analysis for various types of datasets, of which we detail two: functional genomics data, aligned to a continuous coordinate such as the genome, and metagenomics, organized according to volatile hierarchical coordinate spaces. We also present security implications of the current design, performance benchmarks, a series of limitations and future research steps

    Epiviz: a view inside the design of an integrated visual analysis software for genomics

    Get PDF
    Computational and visual data analysis for genomics has traditionally involved a combination of tools and resources, of which the most ubiquitous consist of genome browsers, focused mainly on integrative visualization of large numbers of big datasets, and computational environments, focused on data modeling of a small number of moderately sized datasets. Workflows that involve the integration and exploration of multiple heterogeneous data sources, small and large, public and user specific have been poorly addressed by these tools. In our previous work, we introduced Epiviz, which bridges the gap between the two types of tools, simplifying these workflows. In this paper we expand on the design decisions behind Epiviz, and introduce a series of new advanced features that further support the type of interactive exploratory workflow we have targeted. We discuss three ways in which Epiviz advances the field of genomic data analysis: 1) it brings code to interactive visualizations at various different levels; 2) takes the first steps in the direction of collaborative data analysis by incorporating user plugins from source control providers, as well as by allowing analysis states to be shared among the scientific community; 3) combines established analysis features that have never before been available simultaneously in a genome browser. In our discussion section, we present security implications of the current design, as well as a series of limitations and future research steps. Since many of the design choices of Epiviz are novel in genomics data analysis, this paper serves both as a document of our own approaches with lessons learned, as well as a start point for future efforts in the same direction for the genomics community.https://doi.org/10.1186/1471-2105-16-S11-S
    • โ€ฆ
    corecore