41 research outputs found
A Comparison of Blocking Methods for Record Linkage
Record linkage seeks to merge databases and to remove duplicates when unique
identifiers are not available. Most approaches use blocking techniques to
reduce the computational complexity associated with record linkage. We review
traditional blocking techniques, which typically partition the records
according to a set of field attributes, and consider two variants of a method
known as locality sensitive hashing, sometimes referred to as "private
blocking." We compare these approaches in terms of their recall, reduction
ratio, and computational complexity. We evaluate these methods using different
synthetic datafiles and conclude with a discussion of privacy-related issues.Comment: 22 pages, 2 tables, 7 figure
Linking Discrete and Stochastic Models: The Chemical Master Equation as a Bridge between Process Hitting and Proper Generalized Decomposition
Modeling frameworks bring structure and analysis tools to large and non-intuitive systems but come with certain inherent assumptions and limitations, sometimes to an inhibitive extent. By building bridges in existing models, we can exploit the advantages of each, widening the range of analysis possible for larger, more detailed models of gene regulatory networks. In this paper, we create just such a link between Process Hitting [6,7,8], a recently introduced discrete framework, and the Chemical Master Equation in such a way that allows the application of powerful numerical techniques, namely Proper Generalized Decomposition [1,2,3], to overcome the curse of dimensionality. With these tools in hand, one can exploit the formal analysis of discrete models without sacrificing the ability to obtain a full space state solution, widening the scope of analysis and interpretation possible. As a demonstration of the utility of this methodology, we have applied it here to the p53-mdm2 network [4,5], a widely studied biological regulatory network
Characterization of Reachable Attractors Using Petri Net Unfoldings
International audienceAttractors of network dynamics represent the long-term behaviours of the modelled system. Their characterization is therefore crucial for understanding the response and differentiation capabilities of a dynamical system. In the scope of qualitative models of interaction networks, the computation of attractors reachable from a given state of the network faces combinatorial issues due to the state space explosion. In this paper, we present a new algorithm that exploits the concurrency between transitions of parallel acting components in order to reduce the search space. The algorithm relies on Petri net unfoldings that can be used to compute a compact representation of the dynamics. We illustrate the applicability of the algorithm with Petri net models of cell signalling and regulation networks, Boolean and multi-valued. The proposed approach aims at being complementary to existing methods for deriving the attractors of Boolean models, while being %so far more generic since it applies to any safe Petri net
Transactional Support for Visual Instance Search
International audienceThis article addresses the issue of dynamicity and durability for scalable indexing of very large and rapidly growing collections of local features for visual instance retrieval. By extending the NV-tree, a scalable disk-based high-dimensional index, we show how to implement the ACID properties of transactions which ensure both dynamicity and durability. We present a detailed performance evaluation of the transactional NV-tree, showing that the insertion throughput is excellent despite the effort to enforce the ACID properties
BioSimulators: a central registry of simulation engines and services for recommending specific tools
Computational models have great potential to accelerate bioscience, bioengineering, and medicine. However, it remains challenging to reproduce and reuse simulations, in part, because the numerous formats and methods for simulating various subsystems and scales remain siloed by different software tools. For example, each tool must be executed through a distinct interface. To help investigators find and use simulation tools, we developed BioSimulators (https://biosimulators.org), a central registry of the capabilities of simulation tools and consistent Python, command-line and containerized interfaces to each version of each tool. The foundation of BioSimulators is standards, such as CellML, SBML, SED-ML and the COMBINE archive format, and validation tools for simulation projects and simulation tools that ensure these standards are used consistently. To help modelers find tools for particular projects, we have also used the registry to develop recommendation services. We anticipate that BioSimulators will help modelers exchange, reproduce, and combine simulations
Dynamical properties of Discrete Reaction Networks.
Reaction networks are commonly used to model the dynamics of populations subject to transformations that follow an imposed stoichiometry. This paper focuses on the efficient characterisation of dynamical properties of Discrete Reaction Networks (DRNs). DRNs can be seen as modeling the underlying discrete nondeterministic transitions of stochastic models of reaction networks. In that sense, a proof of non-reachability in a given DRN has immediate implications for any concrete stochastic model based on that DRN, independent of the choice of kinetic laws and constants. Moreover, if we assume that stochastic kinetic rates are given by the mass-action law (or any other kinetic law that gives non-vanishing probability to each reaction if the required number of interacting substrates is present), then reachability properties are equivalent in the two settings. The analysis of two types of global dynamical properties of DRNs is addressed: irreducibility, i.e., the ability to reach any discrete state from any other state; and recurrence, i.e., the ability to return to any initial state. Our results consider both the verification of such properties when species are present in a large copy number, and in the general case. The necessary and sufficient conditions obtained involve algebraic conditions on the network reactions which in most cases can be verified using linear programming. Finally, the relationship of DRN irreducibility and recurrence with dynamical properties of stochastic and continuous models of reaction networks is discussed
Une nouvelle transition ordre-désordre dans Fe-Ni (50-50 )
Neutrons irradiation experiments under magnetic fields on Fe-Ni alloys (50-50) have led to discovery of a new order-disorder transition of the type Au Cu of which the critical temperature is 320 °C. The diffusion coefficient at températures of the order of 300 °C being very low, the ordering can only be established under irradiation. The directing effect of thé applied magnetic field due to the magnetic coupling of the bonds with the magnetizsation favours the development of the tetragpnal axis in the direction of the field for the crystallites whose [100] axis lies in the neighbourhood of the direction of the field. A study on the single crystal shows that this tetragonal axis is the only axis of easy magnetization in the ordered structure.Des expériences d'irradiation aux neutrons sous champ magnétique d'alliages Fe-Ni (50-50) ont permis de découvrir une transition ordre-désordre du type Au Cu dont la température critique est 320 °C. L'ordre ne peut s'établir que sous irradiation étant donné la valeur trop faible du coefficient de diffusion auteur de 300°C. Un effet directeur du champ magnétique appliqué, dû aux couplages magnétiques des liaisons avec l'aimantation favorise le développement de l'axe tétragonal dans la direction du champ pour les cristallites dont un axe [100] est voisin de la direction du champ. Une étude sur des monocristaux montre que cet axe tétragonal est le seul axe de facile aimantation de la structure ordonnée