Search CORE

200 research outputs found

Large scale study of multiple-molecule queries

Author: Baldi Pierre F
Nasr Ramzi J
Swamidass S Joshua
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background In ligand-based screening, as well as in other chemoinformatics applications, one seeks to effectively search large repositories of molecules in order to retrieve molecules that are similar typically to a single molecule lead. However, in some case, multiple molecules from the same family are available to seed the query and search for other members of the same family. Multiple-molecule query methods have been less studied than single-molecule query methods. Furthermore, the previous studies have relied on proprietary data and sometimes have not used proper cross-validation methods to assess the results. In contrast, here we develop and compare multiple-molecule query methods using several large publicly available data sets and background. We also create a framework based on a strict cross-validation protocol to allow unbiased benchmarking for direct comparison in future studies across several performance metrics. Results Fourteen different multiple-molecule query methods were defined and benchmarked using: (1) 41 publicly available data sets of related molecules with similar biological activity; and (2) publicly available background data sets consisting of up to 175,000 molecules randomly extracted from the ChemDB database and other sources. Eight of the fourteen methods were parameter free, and six of them fit one or two free parameters to the data using a careful cross-validation protocol. All the methods were assessed and compared for their ability to retrieve members of the same family against the background data set by using several performance metrics including the Area Under the Accumulation Curve (AUAC), Area Under the Curve (AUC), F1-measure, and BEDROC metrics. Consistent with the previous literature, the best parameter-free methods are the MAX-SIM and MIN-RANK methods, which score a molecule to a family by the maximum similarity, or minimum ranking, obtained across the family. One new parameterized method introduced in this study and two previously defined methods, the Exponential Tanimoto Discriminant (ETD), the Tanimoto Power Discriminant (TPD), and the Binary Kernel Discriminant (BKD), outperform most other methods but are more complex, requiring one or two parameters to be fit to the data. Conclusion Fourteen methods for multiple-molecule querying of chemical databases, including novel methods, (ETD) and (TPD), are validated using publicly available data sets, standard cross-validation protocols, and established metrics. The best results are obtained with ETD, TPD, BKD, MAX-SIM, and MIN-RANK. These results can be replicated and compared with the results of future studies using data freely downloadable from <url>http://cdb.ics.uci.edu/</url>.</p

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

The role of manufacturing and market managers in strategy development:lessons from three companies

Author: Baines T.S.
Ghose S.
Hayes R.H.
Hill T.J.
Mills J.
Neil Darlow
Paul M. Swamidass
Porter M.E.
Shapiro B.P.
Skinner W.
Skinner W.
Swamidass P.M.
Swamidass P.M.
Tim Baines
Voss C.A.
Wheelwright S.C.
Publication venue: 'Emerald'
Publication date: 01/01/2001
Field of study

According to researchers and managers, there is a lack of agreement between marketing and manufacturing managers on critical strategic issues. However, most of the literature on the subject is anecdotal and little formal empirical research has been done. Three companies are investigated to study the extent of agreement/disagreement between manufacturing and marketing managers on strategy content and process. A novel method permits the study of agreement between the two different functional managers on the process of developing strategy. The findings consistently show that manufacturing managers operate under a wider range of strategic priorities than marketing managers, and that manufacturing managers participate less than marketing managers in the strategy development process. Further, both marketing and manufacturing managers show higher involvement in the strategy development process in the latter stages of the Hayes and Wheelwright four-stage model of manufacturing’s strategic role

Crossref

Aston Publications Explorer

Discovery of novel reductive elimination pathway for 10-hydroxywarfarin

Author: Barnette Dustyn A
Flynn Noah R
Hendrickson Howard P
Miller Grover P
Phillips Sarah J
Pouncey Dakota L
Sinnott Riley W
Swamidass S Joshua
Publication venue: Digital Commons@Becker
Publication date: 01/01/2021
Field of study

Coumadin (R/S-warfarin) anticoagulant therapy is highly efficacious in preventing the formation of blood clots; however, significant inter-individual variations in response risks over or under dosing resulting in adverse bleeding events or ineffective therapy, respectively. Levels of pharmacologically active forms of the drug and metabolites depend on a diversity of metabolic pathways. Cytochromes P450 play a major role in oxidizing R- and S-warfarin to 6-, 7-, 8-, 10-, and 4\u27-hydroxywarfarin, and warfarin alcohols form through a minor metabolic pathway involving reduction at the C11 position. We hypothesized that due to structural similarities with warfarin, hydroxywarfarins undergo reduction, possibly impacting their pharmacological activity and elimination. We modeled reduction reactions and carried out experimental steady-state reactions with human liver cytosol for conversion o

Digital Commons@Becker

PubMed Central

Bioactivation of isoxazole-containing bromodomain and extra-terminal domain (BET) inhibitors

Author: Boysen Gunnar
Conway Stuart J
Farmer Rohit
Flynn Noah R
Laurin Corentine M C
Miller Grover P
Schleiff Mary A
Swamidass S Joshua
Ward Michael D
Publication venue: Digital Commons@Becker
Publication date: 01/01/2021
Field of study

The 3,5-dimethylisoxazole motif has become a useful and popular acetyl-lysine mimic employed in isoxazole-containing bromodomain and extra-terminal (BET) inhibitors but may introduce the potential for bioactivations into toxic reactive metabolites. As a test, we coupled deep neural models for quinone formation, metabolite structures, and biomolecule reactivity to predict bioactivation pathways for 32 BET inhibitors and validate the bioactivation of select inhibitors experimentally. Based on model predictions, inhibitors were more likely to undergo bioactivation than reported non-bioactivated molecules containing isoxazoles. The model outputs varied with substituents indicating the ability to scale their impact on bioactivation. We selected OXFBD02, OXFBD04, and I-BET151 for more in-depth analysis. OXFBD\u27s bioactivations were evenly split between traditional quinones and novel extended quinone-methides involving the isoxazole yet strongly favored the latter quinones. Subsequent experimental studies confirmed the formation of both types of quinones for OXFBD molecules, yet traditional quinones were the dominant reactive metabolites. Modeled I-BET151 bioactivations led to extended quinone-methides, which were not verified experimentally. The differences in observed and predicted bioactivations reflected the need to improve overall bioactivation scaling. Nevertheless, our coupled modeling approach predicted BET inhibitor bioactivations including novel extended quinone methides, and we experimentally verified those pathways highlighting potential concerns for toxicity in the development of these new drug leads

Directory of Open Access Journals

Digital Commons@Becker

OrChem - An open source chemistry search engine for Oracle®

Author: AR Leach
C Steinbeck
C Steinbeck
C Steinbeck
C Steinbeck
Christoph Steinbeck
E Sayers
J Barnard
J Frome
L Cordella
Mark Rijnbeek
P Willett
R Guha
S Swamidass
T Hagadone
T Hagadone
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Registration, indexing and searching of chemical structures in relational databases is one of the core areas of cheminformatics. However, little detail has been published on the inner workings of search engines and their development has been mostly closed-source. We decided to develop an open source chemistry extension for Oracle, the de facto database platform in the commercial world. Results Here we present OrChem, an extension for the Oracle 11G database that adds registration and indexing of chemical structures to support fast substructure and similarity searching. The cheminformatics functionality is provided by the Chemistry Development Kit. OrChem provides similarity searching with response times in the order of seconds for databases with millions of compounds, depending on a given similarity cut-off. For substructure searching, it can make use of multiple processor cores on today's powerful database servers to provide fast response times in equally large data sets. Availability OrChem is free software and can be redistributed and/or modified under the terms of the GNU Lesser General Public License as published by the Free Software Foundation. All software is available via <url>http://orchem.sourceforge.net</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Performance evaluation of flexible manufacturing systems under uncertain and dynamic situations

Author: Benjaafar S.
Bhaskaran K.
Chan F. T. S
Choi R. H.
Ettlie J. E.
Falkner C. H.
Gupta D.
Jaikumar R.
Sabuncuoglu I.
Sabuncuoglu I.
Sethi A. K.
Suresh N. C.
Swamidass P. M.
Tenenbaum A.
Veilleux R. F.
Publication venue: 'SAGE Publications'
Publication date: 01/01/2008
Field of study

The present era demands the efficient modelling of any manufacturing system to enable it to cope with unforeseen situations on the shop floor. One of the complex issues affecting the performance of manufacturing systems is the scheduling of part types. In this paper, the authors have attempted to overcome the impact of uncertainties such as machine breakdowns, deadlocks, etc., by inserting slack that can absorb these disruptions without affecting the other scheduled activities. The impact of the flexibilities in this scenario is also investigated. The objective functions have been formulated in such a manner that a better trade-off between the uncertainties and flexibilities can be established. Consideration of automated guided vehicles (AGVs) in this scenario helps in the loading or unloading of part types in a better manner. In the recent past, a comprehensive literature survey revealed the supremacy of random search algorithms in evaluating the performance of these types of dynamic manufacturing system. The authors have used a metaheuristic known as the quick convergence simulated annealing (QCSA) algorithm, and employed it to resolve the dynamic manufacturing scenario. The metaheuristic encompasses a Cauchy distribution function as a probability function that helps in escaping the local minima in a better manner. Various machine breakdown scenarios are generated. A ‘heuristic gap’ is measured, and it indicates the effectiveness of the performance of the proposed methodology with the varying problem complexities. Statistical validation is also carried out, which helps in authenticating the effectiveness of the proposed approach. The efficacy of the proposed approach is also compared with deterministic priority rules

Crossref

Irish Universities

DCU Online Research Access Service

WENDI: A tool for finding non-obvious relationships between compounds and biological properties, genes, diseases and scholarly publications

Author: B Chen
David J Wild
DJ Wild
DJ Wild
E Willighagen
F Belleau
GM Cramer
H Wang
J Hur
JL Durant
MA Johnson
Michael S Lajiness
PJ Ballester
Qian Zhu
R Mullin
SJ Swamidass
X Dong
X Dong
Ying Ding
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background In recent years, there has been a huge increase in the amount of publicly-available and proprietary information pertinent to drug discovery. However, there is a distinct lack of data mining tools available to harness this information, and in particular for knowledge discovery across multiple information sources. At Indiana University we have an ongoing project with Eli Lilly to develop web-service based tools for integrative mining of chemical and biological information. In this paper, we report on the first of these tools, called WENDI (Web Engine for Non-obvious Drug Information) that attempts to find non-obvious relationships between a query compound and scholarly publications, biological properties, genes and diseases using multiple information sources. Results We have created an aggregate web service that takes a query compound as input, calls multiple web services for computation and database search, and returns an XML file that aggregates this information. We have also developed a client application that provides an easy-to-use interface to this web service. Both the service and client are publicly available. Conclusions Initial testing indicates this tool is useful in identifying potential biological applications of compounds that are not obvious, and in identifying corroborating and conflicting information from multiple sources. We encourage feedback on the tool to help us refine it further. We are now developing further tools based on this model.</p

Crossref

Springer - Publisher Connector

IUScholarWorks (University of Indiana)

Directory of Open Access Journals

PubMed Central

Standard operating procedure for somatic variant refinement of sequencing data with paired tumor and normal samples

Author: Ainscough Benjamin J.
Barnell Erica K.
Campbell Katie M.
Cotto Kelsy C.
Danos Arpad M.
Gomez Felicia
Griffith Malachi
Griffith Obi L.
Hundal Jasreet
Krysiak Kilannin
Kunisaki Jason
Matlock Matthew
Pema Shahil P.
Ramirez Cody
Richters Megan
Ronning Peter
Schmidt Alina D.
Sediqzad Malik S.
Sheta Lana M.
Skidmore Zachary L.
Spies Nicholas C.
Swamidass S. Joshua
Trani Lee
Wagner Alex H.
Publication venue: Digital Commons@Becker
Publication date: 01/01/2019
Field of study

Digital Commons@Becker

Prediction of chemical compounds properties using a deep learning model

Author: A Agrawal
A Agrawal
A Koutsoukas
A Lusci
A Mayr
A Shivanyuk
AG Gagorik
AP Bento
AP Bradley
B Cox
C Zhang
D Bajusz
D Butina
D Weininger
D Wishart
FP Miller
HH Aghdam
J Irwin
J Ker
M Davies
M Klose
M Mozaffar
M Popova
MD Hoffman
O Kramer
R Gómez-Bombarelli
S Simplified
SG Rohrer
SJ Swamidass
SM Kearnes
T Dietterich
Y Zhang
Z Wu
Z Wu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 31/10/2021
Field of study

Crossref

Ulster University's Research Portal

A constructive approach for discovering new drug leads: Using a kernel methodology for the inverse-QSAR problem

Author: A Tatsuya
A Tatsuya
AC Good
AC Good
B Mak
BB Masek
C Steinbeck
C Steinbeck
CA Azencott
CJ Churchwell
DB Reitz
FJ Burkowski
Forbes J Burkowski
GH Bakir
HC Huang
J Shawe-Taylor
JJ Sutherland
JL Faulon
JL Faulon
JL Faulon
JL Faulon
JTY Kwok
JW Robin
K-R Müller
KA Sharp
L Ralaivola
LB Kier
LH Hall
LH Hall
MI Skvortsova
N Brown
P Chavatte
P Mahe
P Mahe
PA Pevzner
R Todeschini
RA Lewis
RC Glenn
RP Sheridan
S Mika
SJ Swamidass
V Kvasnicka
V Venkatasubramanian
VJ Gillet
William WL Wong
X Leval
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background The inverse-QSAR problem seeks to find a new molecular descriptor from which one can recover the structure of a molecule that possess a desired activity or property. Surprisingly, there are very few papers providing solutions to this problem. It is a difficult problem because the molecular descriptors involved with the inverse-QSAR algorithm must adequately address the forward QSAR problem for a given biological activity if the subsequent recovery phase is to be meaningful. In addition, one should be able to construct a feasible molecule from such a descriptor. The difficulty of recovering the molecule from its descriptor is the major limitation of most inverse-QSAR methods. Results In this paper, we describe the reversibility of our previously reported descriptor, the vector space model molecular descriptor (VSMMD) based on a vector space model that is suitable for kernel studies in QSAR modeling. Our inverse-QSAR approach can be described using five steps: (1) generate the VSMMD for the compounds in the training set; (2) map the VSMMD in the input space to the kernel feature space using an appropriate kernel function; (3) design or generate a new point in the kernel feature space using a kernel feature space algorithm; (4) map the feature space point back to the input space of descriptors using a pre-image approximation algorithm; (5) build the molecular structure template using our VSMMD molecule recovery algorithm. Conclusion The empirical results reported in this paper show that our strategy of using kernel methodology for an inverse-Quantitative Structure-Activity Relationship is sufficiently powerful to find a meaningful solution for practical problems.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central