41 research outputs found

    A Comparison of Blocking Methods for Record Linkage

    Full text link
    Record linkage seeks to merge databases and to remove duplicates when unique identifiers are not available. Most approaches use blocking techniques to reduce the computational complexity associated with record linkage. We review traditional blocking techniques, which typically partition the records according to a set of field attributes, and consider two variants of a method known as locality sensitive hashing, sometimes referred to as "private blocking." We compare these approaches in terms of their recall, reduction ratio, and computational complexity. We evaluate these methods using different synthetic datafiles and conclude with a discussion of privacy-related issues.Comment: 22 pages, 2 tables, 7 figure

    Linking Discrete and Stochastic Models: The Chemical Master Equation as a Bridge between Process Hitting and Proper Generalized Decomposition

    Get PDF
    Modeling frameworks bring structure and analysis tools to large and non-intuitive systems but come with certain inherent assumptions and limitations, sometimes to an inhibitive extent. By building bridges in existing models, we can exploit the advantages of each, widening the range of analysis possible for larger, more detailed models of gene regulatory networks. In this paper, we create just such a link between Process Hitting [6,7,8], a recently introduced discrete framework, and the Chemical Master Equation in such a way that allows the application of powerful numerical techniques, namely Proper Generalized Decomposition [1,2,3], to overcome the curse of dimensionality. With these tools in hand, one can exploit the formal analysis of discrete models without sacrificing the ability to obtain a full space state solution, widening the scope of analysis and interpretation possible. As a demonstration of the utility of this methodology, we have applied it here to the p53-mdm2 network [4,5], a widely studied biological regulatory network

    Characterization of Reachable Attractors Using Petri Net Unfoldings

    Get PDF
    International audienceAttractors of network dynamics represent the long-term behaviours of the modelled system. Their characterization is therefore crucial for understanding the response and differentiation capabilities of a dynamical system. In the scope of qualitative models of interaction networks, the computation of attractors reachable from a given state of the network faces combinatorial issues due to the state space explosion. In this paper, we present a new algorithm that exploits the concurrency between transitions of parallel acting components in order to reduce the search space. The algorithm relies on Petri net unfoldings that can be used to compute a compact representation of the dynamics. We illustrate the applicability of the algorithm with Petri net models of cell signalling and regulation networks, Boolean and multi-valued. The proposed approach aims at being complementary to existing methods for deriving the attractors of Boolean models, while being %so far more generic since it applies to any safe Petri net

    Transactional Support for Visual Instance Search

    Get PDF
    International audienceThis article addresses the issue of dynamicity and durability for scalable indexing of very large and rapidly growing collections of local features for visual instance retrieval. By extending the NV-tree, a scalable disk-based high-dimensional index, we show how to implement the ACID properties of transactions which ensure both dynamicity and durability. We present a detailed performance evaluation of the transactional NV-tree, showing that the insertion throughput is excellent despite the effort to enforce the ACID properties

    BioSimulators: a central registry of simulation engines and services for recommending specific tools

    Get PDF
    Computational models have great potential to accelerate bioscience, bioengineering, and medicine. However, it remains challenging to reproduce and reuse simulations, in part, because the numerous formats and methods for simulating various subsystems and scales remain siloed by different software tools. For example, each tool must be executed through a distinct interface. To help investigators find and use simulation tools, we developed BioSimulators (https://biosimulators.org), a central registry of the capabilities of simulation tools and consistent Python, command-line and containerized interfaces to each version of each tool. The foundation of BioSimulators is standards, such as CellML, SBML, SED-ML and the COMBINE archive format, and validation tools for simulation projects and simulation tools that ensure these standards are used consistently. To help modelers find tools for particular projects, we have also used the registry to develop recommendation services. We anticipate that BioSimulators will help modelers exchange, reproduce, and combine simulations

    Dynamical properties of Discrete Reaction Networks.

    No full text
    Reaction networks are commonly used to model the dynamics of populations subject to transformations that follow an imposed stoichiometry. This paper focuses on the efficient characterisation of dynamical properties of Discrete Reaction Networks (DRNs). DRNs can be seen as modeling the underlying discrete nondeterministic transitions of stochastic models of reaction networks. In that sense, a proof of non-reachability in a given DRN has immediate implications for any concrete stochastic model based on that DRN, independent of the choice of kinetic laws and constants. Moreover, if we assume that stochastic kinetic rates are given by the mass-action law (or any other kinetic law that gives non-vanishing probability to each reaction if the required number of interacting substrates is present), then reachability properties are equivalent in the two settings. The analysis of two types of global dynamical properties of DRNs is addressed: irreducibility, i.e., the ability to reach any discrete state from any other state; and recurrence, i.e., the ability to return to any initial state. Our results consider both the verification of such properties when species are present in a large copy number, and in the general case. The necessary and sufficient conditions obtained involve algebraic conditions on the network reactions which in most cases can be verified using linear programming. Finally, the relationship of DRN irreducibility and recurrence with dynamical properties of stochastic and continuous models of reaction networks is discussed

    Locality Sensitive Hashing Using GMM

    No full text

    Une nouvelle transition ordre-désordre dans Fe-Ni (50-50 )

    No full text
    Neutrons irradiation experiments under magnetic fields on Fe-Ni alloys (50-50) have led to discovery of a new order-disorder transition of the type Au Cu of which the critical temperature is 320 °C. The diffusion coefficient at températures of the order of 300 °C being very low, the ordering can only be established under irradiation. The directing effect of thé applied magnetic field due to the magnetic coupling of the bonds with the magnetizsation favours the development of the tetragpnal axis in the direction of the field for the crystallites whose [100] axis lies in the neighbourhood of the direction of the field. A study on the single crystal shows that this tetragonal axis is the only axis of easy magnetization in the ordered structure.Des expériences d'irradiation aux neutrons sous champ magnétique d'alliages Fe-Ni (50-50) ont permis de découvrir une transition ordre-désordre du type Au Cu dont la température critique est 320 °C. L'ordre ne peut s'établir que sous irradiation étant donné la valeur trop faible du coefficient de diffusion auteur de 300°C. Un effet directeur du champ magnétique appliqué, dû aux couplages magnétiques des liaisons avec l'aimantation favorise le développement de l'axe tétragonal dans la direction du champ pour les cristallites dont un axe [100] est voisin de la direction du champ. Une étude sur des monocristaux montre que cet axe tétragonal est le seul axe de facile aimantation de la structure ordonnée
    corecore