Search CORE

28 research outputs found

The EDKB: an established knowledge base for endocrine disrupting chemicals

Author: AM Richard
AM Richard
Council NNSaT
CR Tyler
Don Ding
Edward D Bearden
FS Collins
FS vom Saal
GC Fonger
H Fang
H Fang
H Fang
H Hong
H Hong
Hong Fang
Huixiao Hong
JD Walker
Lei Xu
Leming Shi
LM Shi
LM Shi
MD Anway
MK Skinner
P Wexler
R Blair
RB Fitzpatrick
RJ Kavlock
RJ Kavlock
Roger Perkins
RR Young
Steve Harris
W Tong
Weida Tong
WS Branham
YB Wetherill
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Endocrine disruptors (EDs) and their broad range of potential adverse effects in humans and other animals have been a concern for nearly two decades. Many putative EDs are widely used in commercial products regulated by the Food and Drug Administration (FDA) such as food packaging materials, ingredients of cosmetics, medical and dental devices, and drugs. The Endocrine Disruptor Knowledge Base (EDKB) project was initiated in the mid 1990’s by the FDA as a resource for the study of EDs. The EDKB database, a component of the project, contains data across multiple assay types for chemicals across a broad structural diversity. This paper demonstrates the utility of EDKB database, an integral part of the EDKB project, for understanding and prioritizing EDs for testing. Results The EDKB database currently contains 3,257 records of over 1,800 EDs from different assays including estrogen receptor binding, androgen receptor binding, uterotropic activity, cell proliferation, and reporter gene assays. Information for each compound such as chemical structure, assay type, potency, etc. is organized to enable efficient searching. A user-friendly interface provides rapid navigation, Boolean searches on EDs, and both spreadsheet and graphical displays for viewing results. The search engine implemented in the EDKB database enables searching by one or more of the following fields: chemical structure (including exact search and similarity search), name, molecular formula, CAS registration number, experiment source, molecular weight, etc. The data can be cross-linked to other publicly available and related databases including TOXNET, Cactus, ChemIDplus, ChemACX, Chem Finder, and NCI DTP. Conclusion The EDKB database enables scientists and regulatory reviewers to quickly access ED data from multiple assays for specific or similar compounds. The data have been used to categorize chemicals according to potential risks for endocrine activity, thus providing a basis for prioritizing chemicals for more definitive but expensive testing. The EDKB database is publicly available and can be found online at <url>http://edkb.fda.gov/webstart/edkb/index.html</url>. Disclaimer:<it>The views presented in this article do not necessarily reflect those of the US Food and Drug Administration.</it></p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method

Author: A Kleespies
AH Wu
CH Lee
D Fallin
DJ Schaid
H Hong
H Hong
HJ Cordell
Huixiao Hong
J R Votano
LM Brown
LP Zhao
Luke D Ratnasinghe
Nan Hu
PC Enzinger
PC Sabeti
Philip R Taylor
Qian Xie
RM Hubley
Roger Perkins
SS Li
ST Sherry
W Tong
W Tong
Weida Tong
WN Venables
Ze-Zhong Tang
Publication venue: BioMed Central
Publication date: 01/01/2005
Field of study

BACKGROUND: Systematic evaluation and study of single nucleotide polymorphisms (SNPs) made possible by high throughput genotyping technologies and bioinformatics promises to provide breakthroughs in the understanding of complex diseases. Understanding how the millions of SNPs in the human genome are involved in conferring susceptibility or resistance to disease, or in rendering a drug efficacious or toxic in the individual is a major goal of the relatively new fields of pharmacogenomics. Esophageal squamous cell carcinoma is a high-mortality cancer with complex etiology and progression involving both genetic and environmental factors. We examined the association between esophageal cancer risk and patterns of 61 SNPs in a case-control study for a population from Shanxi Province in North Central China that has among the highest rates of esophageal squamous cell carcinoma in the world. METHODS: High-throughput Masscode mass spectrometry genotyping was done on genomic DNA from 574 individuals (394 cases and 180 age-frequency matched controls). SNPs were chosen from among genes involving DNA repair enzymes, and Phase I and Phase II enzymes. We developed a novel adaptation of the Decision Forest pattern recognition method named Decision Forest for SNPs (DF-SNPs). The method was designated to analyze the SNP data. RESULTS: The classifier in separating the cases from the controls developed with DF-SNPs gave concordance, sensitivity and specificity, of 94.7%, 99.0% and 85.1%, respectively; suggesting its usefulness for hypothesizing what SNPs or combinations of SNPs could be involved in susceptibility to esophageal cancer. Importantly, the DF-SNPs algorithm incorporated a randomization test for assessing the relevance (or importance) of individual SNPs, SNP types (Homozygous common, heterozygous and homozygous variant) and patterns of SNP types (SNP patterns) that differentiate cases from controls. For example, we found that the different genotypes of SNP GADD45B E1122 are all associated with cancer risk. CONCLUSION: The DF-SNPs method can be used to differentiate esophageal squamous cell carcinoma cases from controls based on individual SNPs, SNP types and SNP patterns. The method could be useful to identify potential biomarkers from the SNP data and complement existing methods for genotype analyses

Crossref

Springer - Publisher Connector

PubMed Central

Two new ArrayTrack libraries for personalized biomedical research

Author: A Adeyemo
B Rhead
Baitang Ning
C Wise
Carolyn Wise
Consortium IHGS
EI Park
G Peng
H Fang
Hong Fang
Huixiao Hong
J Kaput
J Kaput
J Kaput
Jim Kaput
Joshua Xu
KA Frazer
L Tappy
S Myles
S Myles
SN Twigger
T Illig
The International HapMap C
Vijayalakshmi Varma
W Tong
Weida Tong
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Recent advances in high-throughput genotyping technology are paving the way for research in personalized medicine and nutrition. However, most of the genetic markers identified from association studies account for a small contribution to the total risk/benefit of the studied phenotypic trait. Testing whether the candidate genes identified by association studies are causal is critically important to the development of personalized medicine and nutrition. An efficient data mining strategy and a set of sophisticated tools are necessary to help better understand and utilize the findings from genetic association studies. Description SNP (single nucleotide polymorphism) and QTL (quantitative trait locus) libraries were constructed and incorporated into ArrayTrack, with user-friendly interfaces and powerful search features. Data from several public repositories were collected in the SNP and QTL libraries and connected to other domain libraries (genes, proteins, metabolites, and pathways) in ArrayTrack. Linking the data sets within ArrayTrack allows searching of SNP and QTL data as well as their relationships to other biological molecules. The SNP library includes approximately 15 million human SNPs and their annotations, while the QTL library contains publically available QTLs identified in mouse, rat, and human. The QTL library was developed for finding the overlap between the map position of a candidate or metabolic gene and QTLs from these species. Two use cases were included to demonstrate the utility of these tools. The SNP and QTL libraries are freely available to the public through ArrayTrack at <url>http://www.fda.gov/ArrayTrack</url>. Conclusions These libraries developed in ArrayTrack contain comprehensive information on SNPs and QTLs and are further cross-linked to other libraries. Connecting domain specific knowledge is a cornerstone of systems biology strategies and allows for a better understanding of the genetic and biological context of the findings from genetic association studies. </p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Microarray scanner calibration curves: characteristics and implications

Author: AM Dudley
Axon
BA Rosenzweig
D Hekstra
EP Hoffman
F Naef
Federico M Goodsaid
Felix W Frueh
GA Held
H Bengtsson
H Lyng
H Yue
Hong Fang
Huixiao Hong
IV Yang
J Fuscoe
J Quackenbush
James C Fuscoe
James J Chen
Jing Han
JN Weinstein
K Dobbin
K Dobbin
L Shi
L Shi
LE Dodd
Lei Guo
Leming Shi
MJ Martinez
N Raghavachari
Qian Xie
Raj K Puri
Roger G Perkins
S Pickett
Stephen C Harris
T Yuen
Tao Han
VG Cheung
VG Desai
W Tong
W Tong
Weida Tong
William S Branham
WR Foster
Y Zong
YH Yang
Z Alex Xu
Zhenqiang Su
Publication venue: BioMed Central
Publication date: 01/01/2005
Field of study

BACKGROUND: Microarray-based measurement of mRNA abundance assumes a linear relationship between the fluorescence intensity and the dye concentration. In reality, however, the calibration curve can be nonlinear. RESULTS: By scanning a microarray scanner calibration slide containing known concentrations of fluorescent dyes under 18 PMT gains, we were able to evaluate the differences in calibration characteristics of Cy5 and Cy3. First, the calibration curve for the same dye under the same PMT gain is nonlinear at both the high and low intensity ends. Second, the degree of nonlinearity of the calibration curve depends on the PMT gain. Third, the two PMTs (for Cy5 and Cy3) behave differently even under the same gain. Fourth, the background intensity for the Cy3 channel is higher than that for the Cy5 channel. The impact of such characteristics on the accuracy and reproducibility of measured mRNA abundance and the calculated ratios was demonstrated. Combined with simulation results, we provided explanations to the existence of ratio underestimation, intensity-dependence of ratio bias, and anti-correlation of ratios in dye-swap replicates. We further demonstrated that although Lowess normalization effectively eliminates the intensity-dependence of ratio bias, the systematic deviation from true ratios largely remained. A method of calculating ratios based on concentrations estimated from the calibration curves was proposed for correcting ratio bias. CONCLUSION: It is preferable to scan microarray slides at fixed, optimal gain settings under which the linearity between concentration and intensity is maximized. Although normalization methods improve reproducibility of microarray measurements, they appear less effective in improving accuracy

Crossref

Springer - Publisher Connector

PubMed Central

Cross-platform comparability of microarray technology: Intra-platform consistency and appropriate data analysis procedures are essential

Author: A Barczak
AK Jarvinen
AT Rogojina
BH Mecham
CL Yauk
Daniel A Casciano
DF Ransohoff
DF Ransohoff
E Marshall
EF Petricoin 3rd
Federico M Goodsaid
Felix W Frueh
FW Frueh
GP Page
H Van Bakel
Hong Fang
Huixiao Hong
James C Fuscoe
James J Chen
Jing Han
JL Hackett
L Shi
L Shi
Lei Guo
Leming Shi
M Bakay
MD Piper
N Mah
N Raikhel
PK Tan
Qian Xie
R Breitling
R Shippy
Raj K Puri
Roger G Perkins
T Barrett
T Mehta
T Yuen
Tao Han
TR Hughes
Tucker A Patterson
Uwe Scherf
VG Tusher
Weida Tong
WP Kuo
Y Woo
Z aAlex Xu
Zhenqiang Su
Publication venue: BioMed Central
Publication date: 01/01/2005
Field of study

BACKGROUND: The acceptance of microarray technology in regulatory decision-making is being challenged by the existence of various platforms and data analysis methods. A recent report (E. Marshall, Science, 306, 630–631, 2004), by extensively citing the study of Tan et al. (Nucleic Acids Res., 31, 5676–5684, 2003), portrays a disturbingly negative picture of the cross-platform comparability, and, hence, the reliability of microarray technology. RESULTS: We reanalyzed Tan's dataset and found that the intra-platform consistency was low, indicating a problem in experimental procedures from which the dataset was generated. Furthermore, by using three gene selection methods (i.e., p-value ranking, fold-change ranking, and Significance Analysis of Microarrays (SAM)) on the same dataset we found that p-value ranking (the method emphasized by Tan et al.) results in much lower cross-platform concordance compared to fold-change ranking or SAM. Therefore, the low cross-platform concordance reported in Tan's study appears to be mainly due to a combination of low intra-platform consistency and a poor choice of data analysis procedures, instead of inherent technical differences among different platforms, as suggested by Tan et al. and Marshall. CONCLUSION: Our results illustrate the importance of establishing calibrated RNA samples and reference datasets to objectively assess the performance of different microarray platforms and the proficiency of individual laboratories as well as the merits of various data analysis procedures. Thus, we are progressively coordinating the MAQC project, a community-wide effort for microarray quality control

Crossref

Springer - Publisher Connector

PubMed Central

Very Important Pool (VIP) genes – an application for microarray-based molecular signatures

Author: A Ben-Dor
A Bhattacharjee
A Butte
A Rosenwald
AK Jain
AL Bluma
B Liu
C Ambroise
C Ding
C Lai
D Singh
DG Beer
DJ Lockhart
EJ Yeoh
EK Tang
GJ Gordon
H Hackl
HH Zhang
Hong Fang
Huixiao Hong
IM Gana Dresen
InfoMetrix
J Dopazoa
J Gould
J Quackenbush
J Quackenbush
JG Zhang
JJ Chen
KE Lee
L Brehelin
L Breiman
L Ein-Dor
L Li
L Shi
L Shi
L Shi
L Wang
Leming Shi
LF Wessels
LJ van 't Veer
M Dettling
M Schena
MA Shipp
R Diaz-Uriarte
R Simon
Roger Perkins
S Dudoit
S Michiels
S Mukherjee
S Wold
SE Jarvis
SJ Raudys
SL Pomeroy
U Alon
U Lutz
VN Vapnik
W Jiang
Weida Tong
WJ Fu
X Chen
Y Peng
Y Wang
Z Su
Zhenqiang Su
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Advances in DNA microarray technology portend that molecular signatures from which microarray will eventually be used in clinical environments and personalized medicine. Derivation of biomarkers is a large step beyond hypothesis generation and imposes considerably more stringency for accuracy in identifying informative gene subsets to differentiate phenotypes. The inherent nature of microarray data, with fewer samples and replicates compared to the large number of genes, requires identifying informative genes prior to classifier construction. However, improving the ability to identify differentiating genes remains a challenge in bioinformatics. Results A new hybrid gene selection approach was investigated and tested with nine publicly available microarray datasets. The new method identifies a Very Important Pool (VIP) of genes from the broad patterns of gene expression data. The method uses a bagging sampling principle, where the re-sampled arrays are used to identify the most informative genes. Frequency of selection is used in a repetitive process to identify the VIP genes. The putative informative genes are selected using two methods, t-statistic and discriminatory analysis. In the t-statistic, the informative genes are identified based on p-values. In the discriminatory analysis, disjoint Principal Component Analyses (PCAs) are conducted for each class of samples, and genes with high discrimination power (DP) are identified. The VIP gene selection approach was compared with the p-value ranking approach. The genes identified by the VIP method but not by the p-value ranking approach are also related to the disease investigated. More importantly, these genes are part of the pathways derived from the common genes shared by both the VIP and p-ranking methods. Moreover, the binary classifiers built from these genes are statistically equivalent to those built from the top 50 p-value ranked genes in distinguishing different types of samples. Conclusion The VIP gene selection approach could identify additional subsets of informative genes that would not always be selected by the p-value ranking method. These genes are likely to be additional true positives since they are a part of pathways identified by the p-value ranking method and expected to be related to the relevant biology. Therefore, these additional genes derived from the VIP method potentially provide valuable biological insights.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Homology modeling, molecular docking, and molecular dynamics simulations elucidated α-fetoprotein binding modes

Author: A Bujacz
A Onufriev
A Raval
A-S Yang
AA Terentiev
AC Kruse
B Rost
CE Willett
D Gitlin
DA Case
DC Chan
EA Nunez
F Heitz
F Hérve
G Vallette
GJ Mizejewski
GJ Mizejewski
GJ Mizejewski
GP Daston
H Hong
Hong Fang
Huixiao Hong
J Shen
J Uriel
J Wang
J Weiser
J-P Ryckaert
JD Yager
Jie Shen
K Lindorff-Larsen
M Wormke
MR Wilson
PA Kollman
R Nygaard
Roger Perkins
RT Zoeller
S Nishi
S Nishi
S Piana
S Safe
SF Altschul
SF Sousa
SW Law
T Cheng
T Darden
T Halgren
TU Consortium
W Li
Weida Tong
Wenqian Zhang
Y Duan
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref