4 research outputs found
Ligand-Based Virtual Screening Using Bayesian Inference Network and Reweighted Fragments
Many of the similarity-based virtual screening approaches assume that molecular fragments that are not related to the biological activity carry the same weight as the important ones. This was the reason that led to the use of Bayesian networks as an alternative to existing tools for similarity-based virtual screening. In our recent work, the retrieval performance of the Bayesian inference network (BIN) was observed to improve significantly when molecular fragments were reweighted using the relevance feedback information. In this paper, a set of active reference structures were used to reweight the fragments in the reference structure. In this approach, higher weights were assigned to those fragments that occur more frequently in the set of active reference structures while others were penalized. Simulated virtual screening experiments with MDL Drug Data Report datasets showed that the proposed approach significantly improved the retrieval effectiveness of ligand-based virtual screening, especially when the active molecules being sought had a high degree of structural heterogeneity
Recommended from our members
Inferring structures, free energy differences, and kinetic rates of biological macromolecular assemblies by integrative modeling
Biological macromolecular assemblies play crucial roles in most cellular processes. The determination of their structures, thermodynamics, and kinetics is essential to understand their function, evolution, modulation, and design. Determining such models, however, remains challenging. One particularly powerful approach to constructing models in general is integrative modeling. Integrative modeling aims to maximize the accuracy, precision, and completeness of models, by simultaneously utilizing all available information, including experimental data, physical principles, statistical analyses, and other prior models. The goal of this thesis is to expand the scope of integrative modeling to the inference of spatial, thermodynamic, and kinetic aspects of macromolecular assemblies. In Chapter I, I introduce the integrative modeling framework for spatiotemporal modeling of biological macromolecular assemblies. In Chapter II, I demonstrate how the synergy between multi-chemistry cross-linking mass spectrometry and integrative modeling can map the structural dynamics of macromolecular assemblies, by application to the human Cop9 signalosome complex. In Chapter III, I present a method for determining structures, free energy differences, and kinetic rates of macromolecular assemblies along their functional cycle, mainly from negative stain electron microscopy (EM). We apply the method to the yeast Hsp90 to estimate the free energy differences and kinetic parameters along its nucleotide hydrolysis cycle, which includes open and closed states of Hsp90. In Chapter IV, I describe a validation of stochastic sampling in integrative modeling. The remaining chapters describe applications of integrative modeling to assemblies of various sizes and scales, using various sources of information, thus illustrating the flexibility of the integrative modeling approach. Specifically, I apply integrative modeling to the human ECM29-Proteasome assembly under oxidative stress (Chapter V), the yeast nuclear pore complex (NPC) cytoplasmic mRNA export platform (Chapter VI), the major membrane ring component of the yeast NPC (Chapter VII), the entire yeast NPC (Chapter VIII), and the reconstruction of 3D structures of MET antibodies (Chapter IX)
A lightweight, graph-theoretic model of class-based similarity to support object-oriented code reuse.
The work presented in this thesis is principally concerned with the development of a method and set of tools designed to support the identification of class-based similarity in collections of object-oriented code. Attention is focused on enhancing the potential for software reuse in situations where a reuse process is either absent or informal, and the characteristics of the organisation are unsuitable, or resources unavailable, to promote and sustain a systematic approach to reuse. The approach builds on the definition of a formal, attributed, relational model that captures the inherent structure of class-based, object-oriented code. Based on code-level analysis, it relies solely on the structural characteristics of the code and the peculiarly object-oriented features of the class as an organising principle: classes, those entities comprising a class, and the intra and inter-class relationships existing between them, are significant factors in defining a two-phase similarity measure as a basis for the comparison process. Established graph-theoretic techniques are adapted and applied via this model to the problem of determining similarity between classes. This thesis illustrates a successful transfer of techniques from the domains of molecular chemistry and computer vision. Both domains provide an existing template for the analysis and comparison of structures as graphs. The inspiration for representing classes as attributed relational graphs, and the application of graph-theoretic techniques and algorithms to their comparison, arose out of a well-founded intuition that a common basis in graph-theory was sufficient to enable a reasonable transfer of these techniques to the problem of determining similarity in object-oriented code. The practical application of this work relates to the identification and indexing of instances of recurring, class-based, common structure present in established and evolving collections of object-oriented code. A classification so generated additionally provides a framework for class-based matching over an existing code-base, both from the perspective of newly introduced classes, and search "templates" provided by those incomplete, iteratively constructed and refined classes associated with current and on-going development. The tools and techniques developed here provide support for enabling and improving shared awareness of reuse opportunity, based on analysing structural similarity in past and ongoing development, tools and techniques that can in turn be seen as part of a process of domain analysis, capable of stimulating the evolution of a systematic reuse ethic