768 research outputs found

    Search Computing

    Get PDF

    Explorative search of distributed bio-data to answer complex biomedical questions

    Get PDF
    Background The huge amount of biomedical-molecular data increasingly produced is providing scientists with potentially valuable information. Yet, such data quantity makes difficult to find and extract those data that are most reliable and most related to the biomedical questions to be answered, which are increasingly complex and often involve many different biomedical-molecular aspects. Such questions can be addressed only by comprehensively searching and exploring different types of data, which frequently are ordered and provided by different data sources. Search Computing has been proposed for the management and integration of ranked results from heterogeneous search services. Here, we present its novel application to the explorative search of distributed biomedical-molecular data and the integration of the search results to answer complex biomedical questions. Results A set of available bioinformatics search services has been modelled and registered in the Search Computing framework, and a Bioinformatics Search Computing application (Bio-SeCo) using such services has been created and made publicly available at http://www.bioinformatics.deib.polimi.it/bio-seco/seco/. It offers an integrated environment which eases search, exploration and ranking-aware combination of heterogeneous data provided by the available registered services, and supplies global results that can support answering complex multi-topic biomedical questions. Conclusions By using Bio-SeCo, scientists can explore the very large and very heterogeneous biomedical-molecular data available. They can easily make different explorative search attempts, inspect obtained results, select the most appropriate, expand or refine them and move forward and backward in the construction of a global complex biomedical query on multiple distributed sources that could eventually find the most relevant results. Thus, it provides an extremely useful automated support for exploratory integrated bio search, which is fundamental for Life Science data driven knowledge discovery

    Initialization and Restart in Stochastic Local Search: Computing a Most Probable Explanation in Bayesian Networks

    Get PDF
    For hard computational problems, stochastic local search has proven to be a competitive approach to finding optimal or approximately optimal problem solutions. Two key research questions for stochastic local search algorithms are: Which algorithms are effective for initialization? When should the search process be restarted? In the present work we investigate these research questions in the context of approximate computation of most probable explanations (MPEs) in Bayesian networks (BNs). We introduce a novel approach, based on the Viterbi algorithm, to explanation initialization in BNs. While the Viterbi algorithm works on sequences and trees, our approach works on BNs with arbitrary topologies. We also give a novel formalization of stochastic local search, with focus on initialization and restart, using probability theory and mixture models. Experimentally, we apply our methods to the problem of MPE computation, using a stochastic local search algorithm known as Stochastic Greedy Search. By carefully optimizing both initialization and restart, we reduce the MPE search time for application BNs by several orders of magnitude compared to using uniform at random initialization without restart. On several BNs from applications, the performance of Stochastic Greedy Search is competitive with clique tree clustering, a state-of-the-art exact algorithm used for MPE computation in BNs

    Numerical Modeling of Multiphase, Turbulent Galactic Disks with Star Formation Feedback

    Full text link
    Star formation is self-regulated by its feedback that drives turbulence and heats the gas. In equilibrium, the star formation rate (SFR) should be directly related to the total (thermal plus turbulent) midplane pressure and hence the total weight of the diffuse gas if energy balance and vertical dynamical equilibrium hold simultaneously. To investigate this quantitatively, we utilize numerical hydrodynamic simulations focused on outer-disk regions where diffuse atomic gas dominates. By analyzing gas properties at saturation, we obtain relationships between the turbulence driving and dissipation rates, heating and cooling rates, the total midplane pressure and the total weight of gas, and the SFR and the total midplane pressure. We find a nearly linear relationship between the SFR and the midplane pressure consistent with the theoretical prediction.Comment: 2 pages, 1 figure. To appear in the proceeding of the IAU GA XXVIII, Special Session 12: Modern Views of the Interstellar Mediu

    Understanding grapevine-microbiome interactions: implications for viticulture industry.

    Get PDF
    Until recently, the analysis of complex communities such as that of the grapevine-microbe holobiont has been limited by the fact that most microbes are not culturable under laboratory conditions (less than 1%). However, metagenomics, the study of the genetic material recovered directly from environmental samples without the need for enrichment or of culturing, has led to open an unprecedented era in the field of microbiology. Importantly, this technological advance has now become so pervasive that it is being regularly applied to explore soils and plants of agricultural interest. Interestingly, many large companies are taking notice, with significant financial investment being used to exploring ways to manipulate the productivity, disease resistance and stress tolerance for crops by influencing the microbiome. To understand which microbes one needs to manipulate to influence this valuable characteristics, we need to sequence the microbiome and capture the genetic and hence functional metabolic information contained therein. For viticulture and other agricultural fields where the crop is also associated to particular flavor properties that may also be manipulated, understanding how the bacteria, fungi and viruses influence the development and hence chemical makeup of the crop is essential

    Materialization strategies for web based search computing applications

    Get PDF
    In the thesis we provide a characterization of view materialization in the context of multi domain heterogeneous search application. Web data view materialization is presented as a solution for technical constraints and problems implied by the unknown structure of the web data sources. The web data materialization model extends the search computing (SeCo) multi-layered model, where the search services are registered in a service repository that describes the functional (e.g. invocation end-point, input and output attributes) information of data end-points. Our first research goal is to solve the problem of finding a sequence of access patterns, which when executed produces a materialization output. For the first research goal we make the following novel contributions: 1) Formulation of the building blocks for the materialization feasibility analysis; 2) Definition of the materialization feasibility analysis method and the accompanying algorithms; 3) A detailed empirical study conducted on a set of materialization tasks ranging in their schema dependency complexity. Our second research goal is the optimization of the materialization process so that the most optimal sequence in terms of materialization output efficiency and quality, executes at all times. For this goal we make the following novel contributions: 1) Formulation of a set of performance dimensions and their metrics for web source materialization; 2) A cost model that utilizes optimization metrics in order to qualitatively differentiate between web services in terms of materialization time; 3) A query optimization procedure that explores the characteristics of the underlying source data domain in order to prioritize the execution of the most productive queries in terms of their data harvesting power; 4) Materialization process optimization strategies based on the web source performance dimension metrics and query optimization procedure; 5) A detailed empirical study conducted on several relevant web based data sources that clearly shows the effectiveness of the proposed solution

    Ground-State Energy and Spin Gap of Spin-1/2 Kagome Heisenberg Antiferromagnetic Clusters: Large Scale Exact Diagonalization Results

    Full text link
    We present a comprehensive list of ground state energies and spin gaps of finite kagome clusters with up to 42 spins obtained using large-scale exact diagonalization techniques. This represents the current limit of this exact approach. For a fixed number of spins N we study several cluster shapes under periodic boundary conditions in both directions resulting in a toroidal geometry. The clusters are characterized by their side length and diagonal as well as the shortest "Manhattan" diameter of the torii. A finite-size scaling analysis of the ground state energy as well as the spin gap is then performed in terms of the shortest toroidal diameter as well as the shortest "Manhattan" diameter. The structure of the spin-spin correlations further supports the importance of short loops wrapping around the torii.Comment: 4 pages, 4 figures, added one referenc
    • …
    corecore