2,108 research outputs found

    A virtual workspace for hybrid multidimensional scaling algorithms

    Get PDF
    In visualising multidimensional data, it is well known that different types of algorithms to process them. Data sets might be distinguished according to volume, variable types and distribution, and each of these characteristics imposes constraints upon the choice of applicable algorithms for their visualization. Previous work has shown that a hybrid algorithmic approach can be successful in addressing the impact of data volume on the feasibility of multidimensional scaling (MDS). This suggests that hybrid combinations of appropriate algorithms might also successfully address other characteristics of data. This paper presents a system and framework in which a user can easily explore hybrid algorithms and the data flowing through them. Visual programming and a novel algorithmic architecture let the user semi-automatically define data flows and the co-ordination of multiple views

    A visual workspace for constructing hybrid MDS algorithms and coordinating multiple views

    Get PDF
    Data can be distinguished according to volume, variable types and distribution, and each of these characteristics imposes constraints upon the choice of applicable algorithms for their visualisation. This has led to an abundance of often disparate algorithmic techniques. Previous work has shown that a hybrid algorithmic approach can be successful in addressing the impact of data volume on the feasibility of multidimensional scaling (MDS). This paper presents a system and framework in which a user can easily explore algorithms as well as their hybrid conjunctions and the data flowing through them. Visual programming and a novel algorithmic architecture let the user semi-automatically define data flows and the co-ordination of multiple views of algorithmic and visualisation components. We propose that our approach has two main benefits: significant improvements in run times of MDS algorithms can be achieved, and intermediate views of the data and the visualisation program structure can provide greater insight and control over the visualisation process

    Visualisation techniques for users and designers of layout algorithms

    Get PDF
    Visualisation systems consisting of a set of components through which data and interaction commands flow have been explored by a number of researchers. Such hybrid and multistage algorithms can be used to reduce overall computation time, and to provide views of the data that show intermediate results and the outputs of complementary algorithms. In this paper we present work on expanding the range and variety of such components, with two new techniques for analysing and controlling the performance of visualisation processes. While the techniques presented are quite different, they are unified within HIVE: a visualisation system based upon a data-flow model and visual programming. Embodied within this system is a framework for weaving together our visualisation components to better afford insight into data and also deepen understanding of the process of the data's visualisation. We describe the new components and offer short case studies of their application. We demonstrate that both analysts and visualisation designers can benefit from a rich set of components and integrated tools for profiling performance

    A Survey of CUDA-based Multidimensional Scaling on GPU Architecture

    Get PDF
    The need to analyze large amounts of multivariate data raises the fundamental problem of dimensionality reduction which is defined as a process of mapping data from high-dimensional space into low-dimensional. One of the most popular methods for handling this problem is multidimensional scaling. Due to the technological advances, the dimensionality of the input data as well as the amount of processed data is increasing steadily but the requirement of processing these data within a reasonable time frame still remains an open problem. Recent development in graphics hardware allows to perform generic parallel computations on powerful hardware and provides an opportunity to solve many time-constrained problems in both graphical and non-graphical domain. The purpose of this survey is to describe and analyze recent implementations of multidimensional scaling algorithms on graphics processing units and present the applicability of these algorithms on such architectures based on the experimental results which show a decrease of execution time for multi-level approaches

    Visualizing Profiles of Large Datasets of Weighted and Mixed Data

    Get PDF
    This work provides a procedure with which to construct and visualize profiles, i.e., groups of individuals with similar characteristics, for weighted and mixed data by combining two classical multivariate techniques, multidimensional scaling (MDS) and the k-prototypes clustering algorithm. The well-known drawback of classical MDS in large datasets is circumvented by selecting a small random sample of the dataset, whose individuals are clustered by means of an adapted version of the k-prototypes algorithm and mapped via classical MDS. Gower’s interpolation formula is used to project remaining individuals onto the previous configuration. In all the process, Gower’s distance is used to measure the proximity between individuals. The methodology is illustrated on a real dataset, obtained from the Survey of Health, Ageing and Retirement in Europe (SHARE), which was carried out in 19 countries and represents over 124 million aged individuals in Europe. The performance of the method was evaluated through a simulation study, whose results point out that the new proposal solves the high computational cost of the classical MDS with low error.This research was funded by the Spanish Ministry of Economy and Competitiveness, grant number MTM2014-56535-R; and the V Regional Plan for Scientific Research and Technological Innovation 2016-2020 of the Community of Madrid, an agreement with Universidad Carlos III de Madrid in the action of "Excellence for University Professors.

    The Energy Landscape, Folding Pathways and the Kinetics of a Knotted Protein

    Get PDF
    The folding pathway and rate coefficients of the folding of a knotted protein are calculated for a potential energy function with minimal energetic frustration. A kinetic transition network is constructed using the discrete path sampling approach, and the resulting potential energy surface is visualized by constructing disconnectivity graphs. Owing to topological constraints, the low-lying portion of the landscape consists of three distinct regions, corresponding to the native knotted state and to configurations where either the N- or C-terminus is not yet folded into the knot. The fastest folding pathways from denatured states exhibit early formation of the N-terminus portion of the knot and a rate-determining step where the C-terminus is incorporated. The low-lying minima with the N-terminus knotted and the C-terminus free therefore constitute an off-pathway intermediate for this model. The insertion of both the N- and C-termini into the knot occur late in the folding process, creating large energy barriers that are the rate limiting steps in the folding process. When compared to other protein folding proteins of a similar length, this system folds over six orders of magnitude more slowly.Comment: 19 page

    Coordinating views for data visualisation and algorithmic profiling

    Get PDF
    A number of researchers have designed visualisation systems that consist of multiple components, through which data and interaction commands flow. Such multistage (hybrid) models can be used to reduce algorithmic complexity, and to open up intermediate stages of algorithms for inspection and steering. In this paper, we present work on aiding the developer and the user of such algorithms through the application of interactive visualisation techniques. We present a set of tools designed to profile the performance of other visualisation components, and provide further functionality for the exploration of high dimensional data sets. Case studies are provided, illustrating the application of the profiling modules to a number of data sets. Through this work we are exploring ways in which techniques traditionally used to prepare for visualisation runs, and to retrospectively analyse them, can find new uses within the context of a multi-component visualisation system

    Mercury concentration and speciation in coastal rainwater

    Get PDF
    Mercury exists in mainly two oxidation states in the atmosphere, Hg0 and Hg(II). Inorganic divalent mercury, Hg(II), has a greater solubility; therefore is in higher concentration in rainwater, than Hg0. The toxic species, methylmercury is an organic form of Hg(II) and is present in low concentration. Mercury is released into the atmosphere by natural and anthropogenic sources. Rainwater is thought to be a main removal mechanism for atmospheric mercury. The concentration and speciation of mercury were determined in rainwater from Wilmington, NC, from September 1, 2003 to September 30, 2005. Volume weighted averages for total Hg in unfiltered rainwater, total dissolved Hg, particulate Hg, dissolved gaseous Hg (Hg0) and methyl-Hg were 52.9 ± 4.7 pM, 40.6 ± 4.0 pM, 13.7± 1.5 pM, 4.3 ± 0.9 pM and 1.1 ± 0.1 pM, respectively. All mercury species were present in all seasons with no significant difference in concentrations between summer and winter, except dissolved gaseous mercury concentration was higher in the winter, with a higher ratio of Hg(II)/Hg(0) in summer relative to winter events. Diurnal variation was seen where Hg(II) decreased during the day into the night, suggesting photochemical reduction of Hg(II). All Hg concentrations were higher in continental storms relative to coastal rain events. Both total mercury species (UFHg and TDHg) were positively correlated with particulate mercury. Total mercury species were washed out of the atmosphere by rainwater with lower concentrations for larger rain events. A weak positive correlation was observed between TDHg and NO3 -, TDHg and SO4 2-, DGHg and Cl-, and Hgpart and DOC. The photochemistry of mercury from Wilmington was also investigated. UFHg, TDHg, and DGHg were generally produced upon irradiation of rainwater samples by simulated sunlight. Particulate Hg concentrations generally declined upon irradiation and MMHg concentrations showed no pattern, in some instances, increasing, decreasing or remaining the same. Positive correlation was observed between, production of UFHg and Hgpart and a negative correlation was observed between production of TDHg and Hgpart. Continental events increased in Hgpart while decreasing in TDHg, whereas coastal events increased in TDHg while decreasing in Hgpart after irradiation. Seasonal differences between Hg species were similar with an increase in TDHg and DGHg, while decreasing in Hgpart and greater changes were observed during the winter. Diurnal variations of Hg(II)/Hg(0) ratio increased during the afternoon and decreased there after. Atmospheric global inputs by natural and anthropogenic sources, 4.1 x 106 kg or 2.0 x 107 mol per year, were in good agreement with calculated total flux of mercury removed via wet deposition, 3.8 x 106 kg or 1.9 x 107 mol per year, suggesting that essentially all mercury released into the atmosphere is removed via rain

    Changes in the distribution and density of Florida Bay macrophytes: 1995-2004

    Get PDF
    Benthic macrophyte cover and distribution data have been collected in ten basins within Florida Bay since 1995 as part of the Florida Bay Fisheries Habitat Assessment Program (FHAP). A weighted average for the most prevalent macrophytes during each sampling event since spring 1995 was calculated. Results indicate that the three most common seagrasses observed in Florida Bay, Thalassia testudinum, Halodule wrightii, and Syringodium filiforme, have increased in distribution since 1995. Halodule wrightii and Syringodium filiforme exhibited an increase in both frequency and cover at the bay-scale, an occurrence driven by their dramatic increases in the western-most FHAP study basins: Johnson and Rabbit Key Basins. Acetabularia, Batophora, Halimeda, and Penicillus also exhibited increases in both frequency and cover since spring 1995. The drift Rhodophytes increased in density and distribution between spring 1995 and spring 1998, but did not increase further after that time. Batophora proved to be the most ubiquitous macroalgae throughout Florida Bay and Acetabularia and the drift Rhodophytes showed the strongest seasonal fluctuations, as they were both much more abundant and widespread during spring samplings. Spearman rank order correlation analysis of the Braun-Blanquet cover data showed that Thalassia was generally negatively correlated to all other macrophytes, while Halodule, Halophila, and Syringodium were positively correlated to one another on most occasions. These seagrasses fluctuated between positive and negative correlations with the macroalgae, and as a group, the macroalgae were positively correlated with one another on most occasions, although exceptions did apply. Non-metric multidimensional scaling was used to create ordination plots of the ~ 315 sample stations. Density overlays were used in conjunction with these ordination plots, and together these showed that total seagrass cover and total macroalgal cover were generally mutually exclusive. Spearman rank order correlation analysis was further used to determine if this spatial separation of the two macrophyte groups was statistically significant at the bay-scale, and it was found that 11 of the 18 bi-annual sampling events yielded a statistically significant negative correlation between total seagrass cover and total macroalgae cover. Canonical Correspondence Analysis (CCA) was used to determine which, if any, of the environmental/physical variables, collected as part of the FHAP data set, had a significant effect on macrophyte distribution within Florida Bay. Significance of these effects was determined using Monte Carlo Permutation Tests. CCA showed that depth and visibility were the initial driving forces in macrophyte distribution. During fall 2000, however, a spike in salinity was observed and by spring 2001 this became the most significant variable affecting macrophyte distribution, and it remained so, along with depth, throughout the duration of FHAP
    corecore