Search CORE

108 research outputs found

Consolidating metabolite identifiers to enable contextual and multi-platform metabolomics data analysis

Author: Atsushi Fukushima
B Usadel
C Knox
D Smedley
DS Wishart
DW Huang
EM Lenz
F Carrari
Fumio Matsuda
H Li
H Pages
H Redestig
Henning Redestig
J Zhang
K Akiyama
K Urano
Kazuki Saito
M Sud
Masanori Arita
Miyako Kusano
MJvan der Werf
MP van Iersel
N Schauer
P Romero
P Shannon
PD Karp
Plant Metabolic Network
PubChem
PubChem
Q Cui
R Development Core Team
R Gentleman
R Taguchi
R Williams
RG Côté
T Kind
T Tokimatsu
VJ Carey
VK Mootha
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Analysis of data from high-throughput experiments depends on the availability of well-structured data that describe the assayed biomolecules. Procedures for obtaining and organizing such meta-data on genes, transcripts and proteins have been streamlined in many data analysis packages, but are still lacking for metabolites. Chemical identifiers are notoriously incoherent, encompassing a wide range of different referencing schemes with varying scope and coverage. Online chemical databases use multiple types of identifiers in parallel but lack a common primary key for reliable database consolidation. Connecting identifiers of analytes found in experimental data with the identifiers of their parent metabolites in public databases can therefore be very laborious. Results Here we present a strategy and a software tool for integrating metabolite identifiers from local reference libraries and public databases that do not depend on a single common primary identifier. The program constructs groups of interconnected identifiers of analytes and metabolites to obtain a local metabolite-centric SQLite database. The created database can be used to map in-house identifiers and synonyms to external resources such as the KEGG database. New identifiers can be imported and directly integrated with existing data. Queries can be performed in a flexible way, both from the command line and from the statistical programming environment R, to obtain data set tailored identifier mappings. Conclusions Efficient cross-referencing of metabolite identifiers is a key technology for metabolomics data analysis. We provide a practical and flexible solution to this task and an open-source program, the metabolite masking tool (MetMask), available at <url>http://metmask.sourceforge.net</url>, that implements our ideas.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Effects of 17β-estradiol (E2) on aqueous organisms and its treatment problem: a review

Author: Pubchem
Pubchem
Publication venue: 'Walter de Gruyter GmbH'
Publication date
Field of study

Crossref

Quantitative weight of evidence assessment of higher tier studies on the toxicity and risks of neonicotinoids in honeybees. 3. Clothianidin

Author: [PubChem] PubChem
EFSA
Gladys L. Stephenson
Keith R. Solomon
Marzaro M.
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref

The Effect Acetic Acid has on Poly( N

Author: Laukkanen A.
PubChem.
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref

Weighted Feature Significance: A Simple, Interpretable Model of Compound Toxicity Based on the Statistical Enrichment of Structural Features

Author: Ajit Jadhav
Anonymous
Christopher P. Austin
Dac-Trung Nguyen
James Inglese
Menghang Xia
Ming-Hsuang Cho
Noel Southall
PubChem
PubChem
PubChem
PubChem
Raymond R. Tice
RTECS
Ruili Huang
Publication venue: Oxford University Press
Publication date
Field of study

In support of the U.S. Tox21 program, we have developed a simple and chemically intuitive model we call weighted feature significance (WFS) to predict the toxicological activity of compounds, based on the statistical enrichment of structural features in toxic compounds. We trained and tested the model on the following: (1) data from quantitative high–throughput screening cytotoxicity and caspase activation assays conducted at the National Institutes of Health Chemical Genomics Center, (2) data from Salmonella typhimurium reverse mutagenicity assays conducted by the U.S. National Toxicology Program, and (3) hepatotoxicity data published in the Registry of Toxic Effects of Chemical Substances. Enrichments of structural features in toxic compounds are evaluated for their statistical significance and compiled into a simple additive model of toxicity and then used to score new compounds for potential toxicity. The predictive power of the model for cytotoxicity was validated using an independent set of compounds from the U.S. Environmental Protection Agency tested also at the National Institutes of Health Chemical Genomics Center. We compared the performance of our WFS approach with classical classification methods such as Naive Bayesian clustering and support vector machines. In most test cases, WFS showed similar or slightly better predictive power, especially in the prediction of hepatotoxic compounds, where WFS appeared to have the best performance among the three methods. The new algorithm has the important advantages of simplicity, power, interpretability, and ease of implementation

Crossref

PubMed Central