Search CORE

353 research outputs found

Two new ArrayTrack libraries for personalized biomedical research

Author: A Adeyemo
B Rhead
Baitang Ning
C Wise
Carolyn Wise
Consortium IHGS
EI Park
G Peng
H Fang
Hong Fang
Huixiao Hong
J Kaput
J Kaput
J Kaput
Jim Kaput
Joshua Xu
KA Frazer
L Tappy
S Myles
S Myles
SN Twigger
T Illig
The International HapMap C
Vijayalakshmi Varma
W Tong
Weida Tong
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Recent advances in high-throughput genotyping technology are paving the way for research in personalized medicine and nutrition. However, most of the genetic markers identified from association studies account for a small contribution to the total risk/benefit of the studied phenotypic trait. Testing whether the candidate genes identified by association studies are causal is critically important to the development of personalized medicine and nutrition. An efficient data mining strategy and a set of sophisticated tools are necessary to help better understand and utilize the findings from genetic association studies. Description SNP (single nucleotide polymorphism) and QTL (quantitative trait locus) libraries were constructed and incorporated into ArrayTrack, with user-friendly interfaces and powerful search features. Data from several public repositories were collected in the SNP and QTL libraries and connected to other domain libraries (genes, proteins, metabolites, and pathways) in ArrayTrack. Linking the data sets within ArrayTrack allows searching of SNP and QTL data as well as their relationships to other biological molecules. The SNP library includes approximately 15 million human SNPs and their annotations, while the QTL library contains publically available QTLs identified in mouse, rat, and human. The QTL library was developed for finding the overlap between the map position of a candidate or metabolic gene and QTLs from these species. Two use cases were included to demonstrate the utility of these tools. The SNP and QTL libraries are freely available to the public through ArrayTrack at <url>http://www.fda.gov/ArrayTrack</url>. Conclusions These libraries developed in ArrayTrack contain comprehensive information on SNPs and QTLs and are further cross-linked to other libraries. Connecting domain specific knowledge is a cornerstone of systems biology strategies and allows for a better understanding of the genetic and biological context of the findings from genetic association studies. </p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

T1DBase: update 2011, organization and presentation of large-scale data sets for type 1 diabetes research

Author: Barrett
Chen
Clayton
Cooper
Dowell
E. C. Adlem
Frazer
Gentleman
Homer
J. A. Todd
Kutlu
M. Christensen
O. S. Burren
P. Achuthan
R. M. R. Coulson
Sherry
Smedley
Stein
Todd
Wallace
Zeller
Publication venue: Oxford University Press
Publication date
Field of study

T1DBase (http://www.t1dbase.org) is web platform, which supports the type 1 diabetes (T1D) community. It integrates genetic, genomic and expression data relevant to T1D research across mouse, rat and human and presents this to the user as a set of web pages and tools. This update describes the incorporation of new data sets, tools and curation efforts as well as a new website design to simplify site use. New data sets include curated summary data from four genome-wide association studies relevant to T1D, HaemAtlas—a data set and tool to query gene expression levels in haematopoietic cells and a manually curated table of human T1D susceptibility loci, incorporating genetic overlap with other related diseases. These developments will continue to support T1D research and allow easy access to large and complex T1D relevant data sets

Crossref

PubMed Central

Advanced Genomic Data Mining

Author: A Kasprzyk
B Giardine
D Diez
D Hull
D Karolchik
D Maglott
E Birney
E Segal
Ewan Birney
Fran Lewitter
G Alonso
I Vastrik
J Pratap
J Taylor
JL Ashurst
KD Pruitt
LA Davidson
M Ashburner
MB Eisen
N Chen
N de la Cruz
P Jaiswal
P Rice
R Ihaka
R Ramakrishnan
RA Becker
RC Gentleman
RD Dowell
RM Kuhn
SN Twigger
ST Sherry
TJ Hubbard
TR Golub
TW Harris
Xosé M. Fernández-Suárez
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

As data banks increase their size, one of the current challenges in bioinformatics is to be able to query them in a sensible way. Information is contained in differen

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

PhenoGO: an integrated resource for the multiscale mining of clinical and biological data

Author: Blake Judith
Friedman Carol
Li Jianrong
Lussier Yves A
Mendonça Eneida A
Sam Lee T
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

The evolving complexity of genome-scale experiments has increasingly centralized the role of a highly computable, accurate, and comprehensive resource spanning multiple biological scales and viewpoints. To provide a resource to meet this need, we have significantly extended the PhenoGO database with gene-disease specific annotations and included an additional ten species. This a computationally-derived resource is primarily intended to provide phenotypic context (cell type, tissue, organ, and disease) for mining existing associations between gene products and GO terms specified in the Gene Ontology Databases Automated natural language processing (BioMedLEE) and computational ontology (PhenOS) methods were used to derive these relationships from the literature, expanding the database with information from ten additional species to include over 600,000 phenotypic contexts spanning eleven species from five GO annotation databases. A comprehensive evaluation evaluating the mappings (n = 300) found precision (positive predictive value) at 85%, and recall (sensitivity) at 76%. Phenotypes are encoded in general purpose ontologies such as Cell Ontology, the Unified Medical Language System, and in specialized ontologies such as the Mouse Anatomy and the Mammalian Phenotype Ontology. A web portal has also been developed, allowing for advanced filtering and querying of the database as well as download of the entire dataset

Crossref

The Jackson Laboratory: The Mouseion at the JAXlibrary

Springer - Publisher Connector

Columbia University Academic Commons

PubMed Central

Deep Blue Documents at the University of Michigan

genenames.org: the HGNC resources in 2011

Author: Bruford
Doms
E. A. Bruford
Goodstadt
Hoffmann
Horaitis
Lestrade
M. J. Lush
M. W. Wright
R. L. Seal
S. M. Gordon
Shows
Weinreich
Publication venue: Oxford University Press
Publication date: 01/01/2011
Field of study

The HUGO Gene Nomenclature Committee (HGNC) aims to assign a unique gene symbol and name to every human gene. The HGNC database currently contains almost 30 000 approved gene symbols, over 19 000 of which represent protein-coding genes. The public website, www.genenames.org, displays all approved nomenclature within Symbol Reports that contain data curated by HGNC editors and links to related genomic, phenotypic and proteomic information. Here we describe improvements to our resources, including a new Quick Gene Search, a new List Search, an integrated HGNC BioMart and a new Statistics and Downloads facility

CiteSeerX

Crossref

PubMed Central

Disease Ontology: a backbone for disease semantic integration

Author: Amberger
Bug
C. Arze
Ceusters
Cote
Feng
G. Feng
L. M. Schriml
M. Mazaitis
Osborne
Robinson
Rosse
Rupprecht
S. Nadendla
Sioutos
Smith
V. Felix
W. A. Kibbe
Y.-W. W. Chang
Publication venue: Oxford University Press
Publication date
Field of study

The Disease Ontology (DO) database (http://disease-ontology.org) represents a comprehensive knowledge base of 8043 inherited, developmental and acquired human diseases (DO version 3, revision 2510). The DO web browser has been designed for speed, efficiency and robustness through the use of a graph database. Full-text contextual searching functionality using Lucene allows the querying of name, synonym, definition, DOID and cross-reference (xrefs) with complex Boolean search strings. The DO semantically integrates disease and medical vocabularies through extensive cross mapping and integration of MeSH, ICD, NCI's thesaurus, SNOMED CT and OMIM disease-specific terms and identifiers. The DO is utilized for disease annotation by major biomedical databases (e.g. Array Express, NIF, IEDB), as a standard representation of human disease in biomedical ontologies (e.g. IDO, Cell line ontology, NIFSTD ontology, Experimental Factor Ontology, Influenza Ontology), and as an ontological cross mappings resource between DO, MeSH and OMIM (e.g. GeneWiki). The DO project (http://diseaseontology.sf.net) has been incorporated into open source tools (e.g. Gene Answers, FunDO) to connect gene and disease biomedical data through the lens of human disease. The next iteration of the DO web browser will integrate DO's extended relations and logical definition representation along with these biomedical resource cross-mappings

Crossref

PubMed Central

IUPHAR-DB: new receptors and tools for easy searching and visualization of pharmacological data

Author: Anthony J. Harmar
Bart Staels
Berman
Bruford
Bult
Catherine Dacquet
Chidochangu P. Mpamhanga
de Matos
Ertl
Germain
Harmar
Hopkins
Joanna L. Sharman
Klein
Lipinski
McKenna
Michael Spedding
Pierre Germain
Sayers
Steinbeck
Twigger
Vincent Laudet
Wang
Wang
Wishart
Publication venue: Oxford University Press
Publication date: 01/01/2011
Field of study

The IUPHAR database is an established online reference resource for several important classes of human drug targets and related proteins. As well as providing recommended nomenclature, the database integrates information on the chemical, genetic, functional and pathophysiological properties of receptors and ion channels, curated and peer-reviewed from the biomedical literature by a network of experts. The database now includes information on 616 gene products from four superfamilies in human and rodent model organisms: G protein-coupled receptors, voltage- and ligand-gated ion channels and, in a recent update, 49 nuclear hormone receptors (NHRs). New data types for NHRs include details on co-regulators, DNA binding motifs, target genes and 3D structures. Other recent developments include curation of the chemical structures of approximately 2000 ligand molecules, providing electronic descriptors, identifiers, link-outs and calculated molecular properties, all available via enhanced ligand pages. The interface now provides intelligent tools for the visualization and exploration of ligand structure-activity relationships and the structural diversity of compounds active at each target. The database is freely available at http://www.iuphar-db.org

Crossref

PubMed Central

Edinburgh Research Explorer

GeneWeaver: a web-based system for integrative functional genomics

Author: Austin
Baker
Barrett
Blake
Bruford
Cherry
Chesler
Davis
Dennis
Elissa J. Chesler
Erich J. Baker
Gardner
Guan
Guo
Harris
Jason A. Bubier
Jeremy J. Jay
Jonquet
Le-Niculescu
Li
Li
Mattingly
McGary
Meehan
Michael A. Langston
Mnaimneh
Mulligan
Neely
Nissenbaum
Osborne
Shannon
Smith
Smith
Sprague
The Gene Ontology Consortium
Tweedie
Twigger
Wang
Zhang
Publication venue: Oxford University Press
Publication date: 01/01/2012
Field of study

High-throughput genome technologies have produced a wealth of data on the association of genes and gene products to biological functions. Investigators have discovered value in combining their experimental results with published genome-wide association studies, quantitative trait locus, microarray, RNA-sequencing and mutant phenotyping studies to identify gene-function associations across diverse experiments, species, conditions, behaviors or biological processes. These experimental results are typically derived from disparate data repositories, publication supplements or reconstructions from primary data stores. This leaves bench biologists with the complex and unscalable task of integrating data by identifying and gathering relevant studies, reanalyzing primary data, unifying gene identifiers and applying ad hoc computational analysis to the integrated set. The freely available GeneWeaver (http://www.GeneWeaver.org) powered by the Ontological Discovery Environment is a curated repository of genomic experimental results with an accompanying tool set for dynamic integration of these data sets, enabling users to interactively address questions about sets of biological functions and their relations to sets of genes. Thus, large numbers of independently published genomic results can be organized into new conceptual frameworks driven by the underlying, inferred biological relationships rather than a pre-existing semantic framework. An empirical ‘ontology’ is discovered from the aggregate of experimental knowledge around user-defined areas of biological inquiry

CiteSeerX

Crossref

The Jackson Laboratory: The Mouseion at the JAXlibrary

PubMed Central

A Gene Wiki for Community Annotation of Gene Function

Author: Anderson
Andrew I Su
Barabasi
Barabasi
Bult
Camilo Orozco
Chunlei Wu
Dowell
Faramarz Valafar
Flicek
Giles
Giles
Giles
James Goodale
Jeong
Jon W Huss
Kamps
Mons
Pan
Salzberg
Serge Batalov
Su
Tim J Vickers
Twigger
Wagner
Wang
Wheeler
Wilson
Yook
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

This manuscript describes the creation of comprehensive gene wiki, seeded with data from public domain sources, which will enable and encourage community annotation of gene function

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The Novartis Repository

The UCSC Genome Browser Database: update 2009

Author: A. Pohl
A. S. Hinrichs
A. S. Zweig
B. Giardine
B. J. Raney
B. Rhead
Bellen
Blanchette
D. Haussler
D. Karolchik
F. Hsu
G. P. Barber
H. Clawson
Hinrichs
Hsu
Iafrate
K. E. Smith
K. R. Rosenbloom
Karolchik
Karolchik
Kent
L. Meyer
M. Diekhans
M. Pheasant
Mattes
Nord
P. Fujita
R. A. Harte
R. M. Kuhn
Sherry
T. Dreszer
T. Wang
The ENCODE Project Consortium
The MGC Project Team
W. J. Kent
Yang
Zhu
Publication venue: Oxford University Press
Publication date
Field of study

The UCSC Genome Browser Database (GBD, http://genome.ucsc.edu) is a publicly available collection of genome assembly sequence data and integrated annotations for a large number of organisms, including extensive comparative-genomic resources. In the past year, 13 new genome assemblies have been added, including two important primate species, orangutan and marmoset, bringing the total to 46 assemblies for 24 different vertebrates and 39 assemblies for 22 different invertebrate animals. The GBD datasets may be viewed graphically with the UCSC Genome Browser, which uses a coordinate-based display system allowing users to juxtapose a wide variety of data. These data include all mRNAs from GenBank mapped to all organisms, RefSeq alignments, gene predictions, regulatory elements, gene expression data, repeats, SNPs and other variation data, as well as pairwise and multiple-genome alignments. A variety of other bioinformatics tools are also provided, including BLAT, the Table Browser, the Gene Sorter, the Proteome Browser, VisiGene and Genome Graphs

CiteSeerX

Crossref

PubMed Central