5 research outputs found
Wikipedia on the CompTox Chemicals Dashboard:Connecting Resources to Enrich Public Chemical Data
The online encyclopedia Wikipedia aggregates a large amount of data on chemistry, encompassing well over 20,000 individual Wikipedia pages and serves the general public as well as the chemistry community. Many other chemical databases and services utilize these data, and previous projects have focused on methods to index, search, and extract it for review and use. We present a comprehensive effort that combines bulk automated data extraction over tens of thousands of pages, semiautomated data extraction over hundreds of pages, and fine-grained manual extraction of individual lists and compounds of interest. We then correlate these data with the existing contents of the U.S. Environmental Protection Agency's (EPA) Distributed Structure-Searchable Toxicity (DSSTox) database. This was performed with a number of intentions including ensuring as complete a mapping as possible between the Dashboard and Wikipedia so that relevant snippets of the article are loaded for the user to review. Conflicts between Dashboard content and Wikipedia in terms of, for example, identifiers such as chemical registry numbers, names, and InChIs and structure-based collisions such as SMILES were identified and used as the basis of curation of both DSSTox and Wikipedia. This work also allowed us to evaluate available data for sets of chemicals of interest to the Agency, such as synthetic cannabinoids, and expand the content in DSSTox as appropriate. This work also led to improved bidirectional linkage of the detailed chemistry and usage information from Wikipedia with expert-curated structure and identifier data from DSSTox for a new list of nearly 20,000 chemicals. All of this work ultimately enhances the data mappings that allow for the display of the introduction of the Wikipedia article in the community-accessible web-based EPA Comptox Chemicals Dashboard, enhancing the user experience for the thousands of users per day accessing the resource
ToxCast Chemical Landscape: Paving the Road to 21st Century Toxicology
The
U.S. Environmental Protection Agency’s (EPA) ToxCast
program is testing a large library of Agency-relevant chemicals using <i>in vitro</i> high-throughput screening (HTS) approaches to support
the development of improved toxicity prediction models. Launched in
2007, Phase I of the program screened 310 chemicals, mostly pesticides,
across hundreds of ToxCast assay end points. In Phase II, the ToxCast
library was expanded to 1878 chemicals, culminating in the public
release of screening data at the end of 2013. Subsequent expansion
in Phase III has resulted in more than 3800 chemicals actively undergoing
ToxCast screening, 96% of which are also being screened in the multi-Agency
Tox21 project. The chemical library unpinning these efforts plays
a central role in defining the scope and potential application of
ToxCast HTS results. The history of the phased construction of EPA’s
ToxCast library is reviewed, followed by a survey of the library contents
from several different vantage points. CAS Registry Numbers are used
to assess ToxCast library coverage of important toxicity, regulatory,
and exposure inventories. Structure-based representations of ToxCast
chemicals are then used to compute physicochemical properties, substructural
features, and structural alerts for toxicity and biotransformation.
Cheminformatics approaches using these varied representations are
applied to defining the boundaries of HTS testability, evaluating
chemical diversity, and comparing the ToxCast library to potential
target application inventories, such as used in EPA’s Endocrine
Disruption Screening Program (EDSP). Through several examples, the
ToxCast chemical library is demonstrated to provide comprehensive
coverage of the knowledge domains and target inventories of potential
interest to EPA. Furthermore, the varied representations and approaches
presented here define local chemistry domains potentially worthy of
further investigation (e.g., not currently covered in the testing
library or defined by toxicity “alerts”) to strategically
support data mining and predictive toxicology modeling moving forward
Comprehensive Assessment of a Chlorinated Drinking Water Concentrate in a Rat Multigenerational Reproductive Toxicity Study
Some epidemiological
studies report associations between drinking
water disinfection byproducts (DBPs) and adverse reproductive/developmental
effects, e.g., low birth weight, spontaneous abortion, stillbirth,
and birth defects. Using a multigenerational rat bioassay, we evaluated
an environmentally relevant “whole” mixture of DBPs
representative of chlorinated drinking water, including unidentified
DBPs as well as realistic proportions of known DBPs at low-toxicity
concentrations. Source water from a water utility was concentrated
136-fold, chlorinated, and provided as drinking water to Sprague–Dawley
rats. Timed-pregnant females (P<sub>0</sub> generation) were exposed
during gestation and lactation. Weanlings (F<sub>1</sub> generation)
continued exposures and were bred to produce an F<sub>2</sub> generation.
Large sample sizes enhanced statistical power, particularly for pup
weight and prenatal loss. No adverse effects were observed for pup
weight, prenatal loss, pregnancy rate, gestation length, puberty onset
in males, growth, estrous cycles, hormone levels, immunological end
points, and most neurobehavioral
end points. Significant, albeit slight, effects included delayed puberty
for F<sub>1</sub> females, reduced caput epidydimal sperm counts in
F<sub>1</sub> adult males, and increased incidences of thyroid follicular
cell hypertrophy in adult females. These results highlight areas for
future research, while the largely negative findings, particularly
for pup weight and prenatal loss, are notable