4 research outputs found

OPERA models for predicting physicochemical properties and environmental fate endpoints

Author: Antony J. Williams
Chris M. Grulke
Kamel Mansouri
Richard S. Judson
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/03/2018

Abstract: The collection of chemical structure information and associated experimental data for quantitative structure–activity/property relationship (QSAR/QSPR) modeling is facilitated by an increasing number of public databases containing large amounts of useful data. However, the performance of QSAR models depends heavily on the quality of the data and the modeling methodology used. This study aims to develop robust QSAR/QSPR models for chemical properties of environmental interest that can be used for regulatory purposes. It primarily uses data from the publicly available PHYSPROP database, consisting of a set of 13 common physicochemical and environmental fate properties. These datasets underwent extensive curation using an automated workflow to select only high-quality data, and the chemical structures were standardized prior to calculation of the molecular descriptors. The modeling procedure was developed based on the five Organization for Economic Cooperation and Development (OECD) principles for QSAR models. A weighted k-nearest neighbor approach was adopted using a minimum number of required descriptors calculated using PaDEL, an open-source software package. Genetic algorithms selected only the most pertinent and mechanistically interpretable descriptors (2–15, with an average of 11 descriptors). The sizes of the modeled datasets varied from 150 chemicals for biodegradability half-life to 14,050 chemicals for logP, with an average of 3222 chemicals across all endpoints. The optimal models were built on randomly selected training sets (75%) and validated using fivefold cross-validation (CV) and test sets (25%). The CV Q² of the models ranged from 0.72 to 0.95 (average 0.86), and the test-set R² ranged from 0.71 to 0.96 (average 0.82). Modeling and performance details are described in QSAR model reporting format and were validated by the European Commission’s Joint Research Center to be OECD compliant.
All models are freely available as an open-source, command-line application called OPEn structure–activity/property Relationship App (OPERA). OPERA models were applied to more than 750,000 chemicals to produce freely available predicted data on the U.S. Environmental Protection Agency’s CompTox Chemistry Dashboard.
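
The validation scheme described in the abstract (weighted k-nearest neighbor regression, a random 75/25 train/test split, and fivefold cross-validation) can be illustrated with a minimal sketch. This is not the OPERA implementation: the inverse-distance weighting, the value k = 5, the synthetic two-descriptor data, and all function names are assumptions made for illustration, and Q² is computed here with one common definition (1 minus the ratio of residual to total sum of squares).

```python
import math
import random


def weighted_knn_predict(train_X, train_y, x, k=5, eps=1e-9):
    """Inverse-distance-weighted mean of the k nearest training values (assumed weighting)."""
    nearest = sorted((math.dist(row, x), y) for row, y in zip(train_X, train_y))[:k]
    weights = [1.0 / (d + eps) for d, _ in nearest]
    return sum(w * y for w, (_, y) in zip(weights, nearest)) / sum(weights)


def r_squared(y_true, y_pred):
    """1 - SS_res / SS_tot; used for both the CV Q2 and the test-set R2 in this sketch."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def five_fold_q2(X, y, k=5, folds=5, seed=0):
    """Predict every sample from a model built without its own fold, then score."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    preds = [0.0] * len(X)
    for f in range(folds):
        held_out = set(idx[f::folds])
        fit_X = [X[i] for i in idx if i not in held_out]
        fit_y = [y[i] for i in idx if i not in held_out]
        for i in held_out:
            preds[i] = weighted_knn_predict(fit_X, fit_y, X[i], k)
    return r_squared(y, preds)


# Synthetic stand-in for descriptors and an endpoint (hypothetical data).
rng = random.Random(42)
X = [(rng.random(), rng.random()) for _ in range(200)]
y = [2.0 * a + b for a, b in X]

# Random 75/25 split, as in the abstract.
order = list(range(len(X)))
rng.shuffle(order)
cut = int(0.75 * len(X))
train_X = [X[i] for i in order[:cut]]
train_y = [y[i] for i in order[:cut]]
test_X = [X[i] for i in order[cut:]]
test_y = [y[i] for i in order[cut:]]

cv_q2 = five_fold_q2(train_X, train_y)
test_pred = [weighted_knn_predict(train_X, train_y, x) for x in test_X]
test_r2 = r_squared(test_y, test_pred)
print(f"CV Q2 = {cv_q2:.2f}, test R2 = {test_r2:.2f}")
```

Keeping the test set out of the cross-validation loop means the two reported statistics measure different things: Q² reflects internal robustness on the training data, while the test-set R² reflects performance on chemicals never used for fitting.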

Directory of Open Access Journals

“MS-Ready” structures for non-targeted high-resolution mass spectrometry screening studies

Author: AB Lindstrom
AD McEachran
AJ Williams
AM Richard
Andrew D. McEachran
Antony J. Williams
B Warth
C Ruttkies
Chris Grulke
Christoph Ruttkies
D Fourches
D Young
EL Schymanski
Emma L. Schymanski
H Horai
HE Pence
I Blaženović
J Hollender
J Little
JE Rager
JR Sobus
K Karapetyan
K Mansouri
K Phillips
Kamel Mansouri
KL Dionisio
M Krauss
M Sitzmann
M Sun
N Sakurai
R Aalizadeh
R Bade
S Heller
S Newton
S Stein
SR Newton
T Kind
X Trier
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Open-source QSAR models for pKa prediction using multiple machine learning approaches

Author: AC Lee
AD McEachran
Alexandru Korotcov
AM Richard
Antony J. Williams
B Obama
BA Wetmore
C Angermueller
C Cortes
C Hansch
C Liao
C Yang
Catherine S. Sprankle
CC Chang
Chris M. Grulke
CL Strope
CW Yap
D Ballabio
D Castelvecchi
D Fourches
David Allen
DT Manallack
EH Kerns
F Sahigara
G Cruciani
GB Goh
I Sushko
IV Tetko
J Liu
J Ma
JF Wambaugh
K Mansouri
Kamel Mansouri
M Kuhn
M Rupp
National Research Council
Neal F. Cariello
Nicole C. Kleinstreuer
P Mamoshina
R Leardi
R Todeschini
RP Sheridan
T Sander
TB Hughes
V Consonni
Valery Tkachenko
W Jones
W Klöpffer
Warren M. Casey
Publication venue: 'Springer Science and Business Media LLC'

Crossref

CoMPARA: Collaborative Modeling Project for Androgen Receptor Activity

Author: Abdelaziz Ahmed M.
Alberga Domenico
Alves Vinicius M.
Andersson Patrik L.
Andrade Carolina H.
Bai Fang
Balabin Ilya
Ballabio Davide
Benfenati Emilio
Bhhatarai Barun
Boyer Scott
Chen Jingwen
Consonni Viviana
Farag Sherif
Fourches Denis
García-Sosa Alfonso T.
Gramatica Paola
Grisoni Francesca
Grulke Chris M.
Hong Huixiao
Horvath Dragos
Hu Xin
Huang Ruili
Jeliazkova Nina
Judson Richard S.
Kleinstreuer Nicole
Li Jiazhong
Li Xuehua
Liu Huanxiang
Manganelli Serena
Mangiatordi Giuseppe F.
Mansouri Kamel
Maran Uko
Marcou Gilles
Martin Todd
Muratov Eugene
Nguyen Dac-Trung
Nicolotti Orazio
Nikolov Nikolai G.
Norinder Ulf
Papa Ester
Petitjean Michel
Pogodin Pavel
Poroikov Vladimir
Pür Geven
Qiao Xianliang
Richard Ann M.
Roncaglioni Alessandra
Ruiz Patricia
Rupakheti Chetan
Sakkiah Sugunadevi
Sangion Alessandro
Schramm Karl-Werner
Selvaraj Chandrabose
Shah Imran
Sild Sulev
Sun Lixia
Taboureau Olivier
Tang Yun
Tetko Igor V.
Todeschini Roberto
Tong Weida
Trisciuzzi Daniela
Tropsha Alexander
Van Den Driessche George
Varnek Alexandre
Wang Zhongyu
Wedebye Eva B.
Williams Antony J.
Xie Hongbin
Zakharov Alexey V.
Zheng Ziye
Publication venue: 'Environmental Health Perspectives'
Publication date: 01/01/2020

BACKGROUND: Endocrine disrupting chemicals (EDCs) are xenobiotics that mimic the interaction of natural hormones and alter synthesis, transport, or metabolic pathways. The prospect of EDCs causing adverse health effects in humans and wildlife has led to the development of scientific and regulatory approaches for evaluating bioactivity. This need is being addressed using high-throughput screening (HTS) in vitro approaches and computational modeling.
OBJECTIVES: In support of the Endocrine Disruptor Screening Program, the U.S. Environmental Protection Agency (EPA) led two worldwide consortiums to virtually screen chemicals for their potential estrogenic and androgenic activities. Here, we describe the Collaborative Modeling Project for Androgen Receptor Activity (CoMPARA) efforts, which follow the steps of the Collaborative Estrogen Receptor Activity Prediction Project (CERAPP).
METHODS: The CoMPARA list of screened chemicals built on CERAPP's list of 32,464 chemicals to include additional chemicals of interest, as well as simulated ToxCast™ metabolites, totaling 55,450 chemical structures. Computational toxicology scientists from 25 international groups contributed 91 predictive models for binding, agonist, and antagonist activity predictions. Models were underpinned by a common training set of 1,746 chemicals compiled from a combined data set of 11 ToxCast™/Tox21 HTS in vitro assays.
RESULTS: The resulting models were evaluated using curated literature data extracted from different sources. To overcome the limitations of single-model approaches, CoMPARA predictions were combined into consensus models that provided an average predictive accuracy of approximately 80% for the evaluation set.
DISCUSSION: The strengths and limitations of the consensus predictions were discussed with example chemicals; then, the models were implemented in the free and open-source OPERA application to enable screening of new chemicals with a defined applicability domain and accuracy assessment. This implementation was used to screen the entire EPA DSSTox database of approximately 875,000 chemicals, and the predicted AR activities have been made available on the EPA CompTox Chemicals Dashboard and the National Toxicology Program's Integrated Chemical Environment.
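
The idea of combining many individual models into a consensus call can be sketched with a simple majority vote. This is only an illustration under stated assumptions: the abstract does not specify the combination rule, so the unweighted vote, the treatment of `None` as "outside a model's applicability domain", the tie-breaking toward inactive, and the function names are all hypothetical.

```python
from typing import List, Optional


def majority_consensus(calls: List[List[Optional[int]]]) -> List[Optional[int]]:
    """Combine per-model 0/1 calls per chemical; None means the model made no call."""
    consensus = []
    for j in range(len(calls[0])):
        votes = [model[j] for model in calls if model[j] is not None]
        if not votes:
            consensus.append(None)  # no model covers this chemical
        else:
            # Active only on a strict majority; ties fall to inactive (assumed rule).
            consensus.append(1 if 2 * sum(votes) > len(votes) else 0)
    return consensus


def accuracy(pred: List[Optional[int]], truth: List[int]) -> float:
    """Fraction of correct calls among chemicals that received a consensus call."""
    scored = [(p, t) for p, t in zip(pred, truth) if p is not None]
    return sum(p == t for p, t in scored) / len(scored)


# Three hypothetical models, four hypothetical chemicals.
model_calls = [
    [1, 0, 1, None],
    [1, 1, 0, None],
    [0, 0, 1, None],
]
consensus_calls = majority_consensus(model_calls)
print(consensus_calls)
print(accuracy(consensus_calls, [1, 0, 0, 1]))
```

Excluding chemicals with no covering model from the accuracy calculation mirrors the notion of an applicability domain: the consensus is only scored where at least one model is willing to predict.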

Crossref

Publikationer från Umeå universitet

HAL Descartes

Digitala Vetenskapliga Arkivet - Academic Archive On-line

univOAK

Online Research Database In Technology

Hal-Diderot