
    Large scale study of multiple-molecule queries

    <p>Abstract</p> <p>Background</p> <p>In ligand-based screening, as well as in other chemoinformatics applications, one seeks to search large repositories of molecules effectively in order to retrieve molecules that are similar, typically, to a single lead molecule. However, in some cases, multiple molecules from the same family are available to seed the query and search for other members of the same family.</p> <p>Multiple-molecule query methods have been less studied than single-molecule query methods. Furthermore, previous studies have relied on proprietary data and sometimes have not used proper cross-validation methods to assess the results. In contrast, here we develop and compare multiple-molecule query methods using several large publicly available data sets and background data sets. We also create a framework based on a strict cross-validation protocol to allow unbiased benchmarking for direct comparison in future studies across several performance metrics.</p> <p>Results</p> <p>Fourteen different multiple-molecule query methods were defined and benchmarked using: (1) 41 publicly available data sets of related molecules with similar biological activity; and (2) publicly available background data sets consisting of up to 175,000 molecules randomly extracted from the ChemDB database and other sources. Eight of the fourteen methods were parameter free, and the other six fit one or two free parameters to the data using a careful cross-validation protocol. All the methods were assessed and compared for their ability to retrieve members of the same family against the background data set by using several performance metrics, including the Area Under the Accumulation Curve (AUAC), Area Under the Curve (AUC), F1-measure, and BEDROC metrics.</p> <p>Consistent with the previous literature, the best parameter-free methods are the MAX-SIM and MIN-RANK methods, which score a molecule against a family by the maximum similarity, or minimum ranking, obtained across the family.
Two new parameterized methods introduced in this study, the Exponential Tanimoto Discriminant (ETD) and the Tanimoto Power Discriminant (TPD), and one previously defined method, the Binary Kernel Discriminant (<b>BKD</b>), outperform most other methods but are more complex, requiring one or two parameters to be fit to the data.</p> <p>Conclusion</p> <p>Fourteen methods for multiple-molecule querying of chemical databases, including the novel ETD and TPD methods, are validated using publicly available data sets, standard cross-validation protocols, and established metrics. The best results are obtained with ETD, TPD, BKD, MAX-SIM, and MIN-RANK. These results can be replicated and compared with the results of future studies using data freely downloadable from <url>http://cdb.ics.uci.edu/</url>.</p>
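    The two parameter-free scores named in this abstract can be illustrated with a minimal sketch. It assumes fingerprints are stored as sets of "on" bit positions and that Tanimoto similarity is the underlying measure; the abstract does not fix either choice, and the function names `tanimoto`, `max_sim`, and `min_rank` are hypothetical, not taken from the paper.

```python
from typing import FrozenSet, List, Sequence

# Assumed representation: a fingerprint is the set of its "on" bit positions.
Fingerprint = FrozenSet[int]


def tanimoto(a: Fingerprint, b: Fingerprint) -> float:
    """Tanimoto similarity |a & b| / |a | b|; defined as 0.0 for two empty sets."""
    if not a and not b:
        return 0.0
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def max_sim(candidate: Fingerprint, family: Sequence[Fingerprint]) -> float:
    """MAX-SIM style score: the highest similarity to any query molecule."""
    return max(tanimoto(candidate, query) for query in family)


def min_rank(database: Sequence[Fingerprint],
             family: Sequence[Fingerprint]) -> List[int]:
    """MIN-RANK style score: rank the database once per query molecule
    (rank 1 = most similar) and keep each molecule's best rank."""
    n = len(database)
    best = [n] * n
    for query in family:
        sims = [tanimoto(mol, query) for mol in database]
        # Stable sort by decreasing similarity; ties keep database order.
        order = sorted(range(n), key=lambda i: -sims[i])
        for rank, idx in enumerate(order, start=1):
            best[idx] = min(best[idx], rank)
    return best
```

    Under these assumptions, a higher `max_sim` value or a lower `min_rank` value marks a database molecule as a more likely member of the query family.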

    The role of manufacturing and marketing managers in strategy development: lessons from three companies

    According to researchers and managers, there is a lack of agreement between marketing and manufacturing managers on critical strategic issues. However, most of the literature on the subject is anecdotal, and little formal empirical research has been done. Three companies are investigated to study the extent of agreement or disagreement between manufacturing and marketing managers on strategy content and process. A novel method permits the study of agreement between the two groups of functional managers on the process of developing strategy. The findings consistently show that manufacturing managers operate under a wider range of strategic priorities than marketing managers, and that manufacturing managers participate less than marketing managers in the strategy development process. Further, both marketing and manufacturing managers show higher involvement in the strategy development process in the latter stages of the Hayes and Wheelwright four-stage model of manufacturing’s strategic role.

    Discovery of novel reductive elimination pathway for 10-hydroxywarfarin

    Coumadin (R/S-warfarin) anticoagulant therapy is highly efficacious in preventing the formation of blood clots; however, significant inter-individual variations in response risk over- or under-dosing, resulting in adverse bleeding events or ineffective therapy, respectively. Levels of pharmacologically active forms of the drug and its metabolites depend on a diversity of metabolic pathways. Cytochromes P450 play a major role in oxidizing R- and S-warfarin to 6-, 7-, 8-, 10-, and 4'-hydroxywarfarin, and warfarin alcohols form through a minor metabolic pathway involving reduction at the C11 position. We hypothesized that, due to structural similarities with warfarin, hydroxywarfarins undergo reduction, possibly impacting their pharmacological activity and elimination. We modeled reduction reactions and carried out experimental steady-state reactions with human liver cytosol for conversion o

    Bioactivation of isoxazole-containing bromodomain and extra-terminal domain (BET) inhibitors

    The 3,5-dimethylisoxazole motif has become a useful and popular acetyl-lysine mimic employed in isoxazole-containing bromodomain and extra-terminal (BET) inhibitors but may introduce the potential for bioactivations into toxic reactive metabolites. As a test, we coupled deep neural models for quinone formation, metabolite structures, and biomolecule reactivity to predict bioactivation pathways for 32 BET inhibitors and validate the bioactivation of select inhibitors experimentally. Based on model predictions, inhibitors were more likely to undergo bioactivation than reported non-bioactivated molecules containing isoxazoles. The model outputs varied with substituents indicating the ability to scale their impact on bioactivation. We selected OXFBD02, OXFBD04, and I-BET151 for more in-depth analysis. OXFBD's bioactivations were evenly split between traditional quinones and novel extended quinone-methides involving the isoxazole yet strongly favored the latter quinones. Subsequent experimental studies confirmed the formation of both types of quinones for OXFBD molecules, yet traditional quinones were the dominant reactive metabolites. Modeled I-BET151 bioactivations led to extended quinone-methides, which were not verified experimentally. The differences in observed and predicted bioactivations reflected the need to improve overall bioactivation scaling. Nevertheless, our coupled modeling approach predicted BET inhibitor bioactivations including novel extended quinone methides, and we experimentally verified those pathways highlighting potential concerns for toxicity in the development of these new drug leads.

    OrChem - An open source chemistry search engine for Oracle®

    <p>Abstract</p> <p>Background</p> <p>Registration, indexing and searching of chemical structures in relational databases is one of the core areas of cheminformatics. However, little detail has been published on the inner workings of search engines and their development has been mostly closed-source. We decided to develop an open source chemistry extension for Oracle, the de facto database platform in the commercial world.</p> <p>Results</p> <p>Here we present OrChem, an extension for the Oracle 11G database that adds registration and indexing of chemical structures to support fast substructure and similarity searching. The cheminformatics functionality is provided by the Chemistry Development Kit. OrChem provides similarity searching with response times on the order of seconds for databases with millions of compounds, depending on a given similarity cut-off. For substructure searching, it can make use of multiple processor cores on today's powerful database servers to provide fast response times in equally large data sets.</p> <p>Availability</p> <p>OrChem is free software and can be redistributed and/or modified under the terms of the GNU Lesser General Public License as published by the Free Software Foundation. All software is available via <url>http://orchem.sourceforge.net</url>.</p>

    Performance evaluation of flexible manufacturing systems under uncertain and dynamic situations

    The present era demands the efficient modelling of any manufacturing system to enable it to cope with unforeseen situations on the shop floor. One of the complex issues affecting the performance of manufacturing systems is the scheduling of part types. In this paper, the authors have attempted to overcome the impact of uncertainties such as machine breakdowns, deadlocks, etc., by inserting slack that can absorb these disruptions without affecting the other scheduled activities. The impact of the flexibilities in this scenario is also investigated. The objective functions have been formulated in such a manner that a better trade-off between the uncertainties and flexibilities can be established. Consideration of automated guided vehicles (AGVs) in this scenario helps in the loading or unloading of part types in a better manner. A comprehensive survey of the recent literature revealed the supremacy of random search algorithms in evaluating the performance of these types of dynamic manufacturing systems. The authors have used a metaheuristic known as the quick convergence simulated annealing (QCSA) algorithm, and employed it to resolve the dynamic manufacturing scenario. The metaheuristic encompasses a Cauchy distribution function as a probability function that helps in escaping the local minima in a better manner. Various machine breakdown scenarios are generated. A ‘heuristic gap’ is measured, and it indicates the effectiveness of the performance of the proposed methodology with the varying problem complexities. Statistical validation is also carried out, which helps in authenticating the effectiveness of the proposed approach. The efficacy of the proposed approach is also compared with deterministic priority rules.
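    The idea of a Cauchy-based probability function in simulated annealing can be illustrated with a minimal sketch. The abstract does not give the QCSA formulas, so everything specific here is an assumption: the Cauchy-shaped acceptance rule 1 / (1 + (delta / T)^2), the geometric cooling schedule, the swap neighbourhood, and the names `accept_prob` and `anneal` are illustrative and are not the authors' algorithm.

```python
import random
from typing import Callable, List, Sequence, Tuple


def accept_prob(delta: float, temperature: float) -> float:
    """Assumed Cauchy-shaped acceptance rule (not the published QCSA formula).
    Improving moves are always accepted; worsening moves are accepted with
    probability 1 / (1 + (delta / T)^2), which decays more slowly than the
    usual exp(-delta / T), so large uphill moves stay possible."""
    if delta <= 0:
        return 1.0
    return 1.0 / (1.0 + (delta / temperature) ** 2)


def anneal(cost: Callable[[Sequence[int]], float],
           initial: Sequence[int],
           steps: int = 2000,
           t0: float = 10.0,
           cooling: float = 0.995,
           seed: int = 0) -> Tuple[List[int], float]:
    """Simulated annealing over job sequences using pairwise swaps."""
    rng = random.Random(seed)
    current = list(initial)
    current_cost = cost(current)
    best, best_cost = list(current), current_cost
    temperature = t0
    for _ in range(steps):
        i, j = rng.sample(range(len(current)), 2)  # two distinct positions
        candidate = list(current)
        candidate[i], candidate[j] = candidate[j], candidate[i]
        candidate_cost = cost(candidate)
        if rng.random() < accept_prob(candidate_cost - current_cost, temperature):
            current, current_cost = candidate, candidate_cost
            if current_cost < best_cost:
                best, best_cost = list(current), current_cost
        temperature *= cooling  # geometric cooling (assumed schedule)
    return best, best_cost


# Toy single-machine objective (hypothetical data): sum of job completion times.
PROCESSING = [4, 2, 7, 1, 3]


def total_completion_time(order: Sequence[int]) -> float:
    elapsed, total = 0, 0
    for job in order:
        elapsed += PROCESSING[job]
        total += elapsed
    return total
```

    Calling `anneal(total_completion_time, [0, 1, 2, 3, 4])` returns the best sequence found and its cost. The toy objective stands in for the paper's scheduling objectives, which the abstract does not specify.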

    WENDI: A tool for finding non-obvious relationships between compounds and biological properties, genes, diseases and scholarly publications

    <p>Abstract</p> <p>Background</p> <p>In recent years, there has been a huge increase in the amount of publicly-available and proprietary information pertinent to drug discovery. However, there is a distinct lack of data mining tools available to harness this information, and in particular for knowledge discovery across multiple information sources. At Indiana University we have an ongoing project with Eli Lilly to develop web-service based tools for integrative mining of chemical and biological information. In this paper, we report on the first of these tools, called WENDI (Web Engine for Non-obvious Drug Information) that attempts to find non-obvious relationships between a query compound and scholarly publications, biological properties, genes and diseases using multiple information sources.</p> <p>Results</p> <p>We have created an aggregate web service that takes a query compound as input, calls multiple web services for computation and database search, and returns an XML file that aggregates this information. We have also developed a client application that provides an easy-to-use interface to this web service. Both the service and client are publicly available.</p> <p>Conclusions</p> <p>Initial testing indicates this tool is useful in identifying potential biological applications of compounds that are not obvious, and in identifying corroborating and conflicting information from multiple sources. We encourage feedback on the tool to help us refine it further. We are now developing further tools based on this model.</p>

    A constructive approach for discovering new drug leads: Using a kernel methodology for the inverse-QSAR problem

    <p>Abstract</p> <p>Background</p> <p>The inverse-QSAR problem seeks to find a new molecular descriptor from which one can recover the structure of a molecule that possesses a desired activity or property. Surprisingly, there are very few papers providing solutions to this problem. It is a difficult problem because the molecular descriptors involved with the inverse-QSAR algorithm must adequately address the forward QSAR problem for a given biological activity if the subsequent recovery phase is to be meaningful. In addition, one should be able to construct a feasible molecule from such a descriptor. The difficulty of recovering the molecule from its descriptor is the major limitation of most inverse-QSAR methods.</p> <p>Results</p> <p>In this paper, we describe the reversibility of our previously reported descriptor, the vector space model molecular descriptor (VSMMD), which is based on a vector space model that is suitable for kernel studies in QSAR modeling. Our inverse-QSAR approach can be described using five steps: (1) generate the VSMMD for the compounds in the training set; (2) map the VSMMD in the input space to the kernel feature space using an appropriate kernel function; (3) design or generate a new point in the kernel feature space using a kernel feature space algorithm; (4) map the feature space point back to the input space of descriptors using a pre-image approximation algorithm; (5) build the molecular structure template using our VSMMD molecule recovery algorithm.</p> <p>Conclusion</p> <p>The empirical results reported in this paper show that our strategy of using kernel methodology for an inverse-Quantitative Structure-Activity Relationship is sufficiently powerful to find a meaningful solution for practical problems.</p>