Search CORE

24 research outputs found

Genome sequences and great expectations

Author: Andrade M.A.
Audit B.
Iliopoulos I.
Janssen P.
Leroy C.
Ouzounis C.A.
Sander C.
Tramontano A.
Tsoka S.
Valencia A.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2001
Field of study

To assess how automatic function assignment will contribute to genome annotation in the next five years, we have performed an analysis of 31 available genome sequences. An emerging pattern is that function can be predicted for almost two-thirds of the 73,500 genes that were analyzed. Despite progress in computational biology, there will always be a great need for large-scale experimental determination of protein function

MDC Repository

Genome sequences and great expectations

Author: Andrade Miguel A
Audit Benjamin
Iliopoulos Ioannis
Janssen Paul
Leroy Christophe
Ouzounis Christos A
Sander Chris
Tramontano Anna
Tsoka Sophia
Valencia Alfonso
Publication venue: BioMed Central
Publication date: 01/01/2000
Field of study

PubMed Central

King's Research Portal

MDC Repository

The Genomes On Line Database (GOLD) v.2: a monitor of genome projects worldwide

Author: Hugenholtz Philip
Kyrpides Nikos C.
Liolios Konstantinos
Tavernarakis Nektarios
Publication venue: Oxford University Press
Publication date: 28/12/2005
Field of study

The Genomes On Line Database (GOLD) is a web resource for comprehensive access to information regarding complete and ongoing genome sequencing projects worldwide. The database currently incorporates information on over 1500 sequencing projects, of which 294 have been completed and the data deposited in the public databases. GOLD v.2 has been expanded to provide information related to organism properties such as phenotype, ecotype and disease. Furthermore, project relevance and availability information is now included. GOLD is available at . It is also mirrored at the Institute of Molecular Biology and Biotechnology, Crete, Greece a

CiteSeerX

Crossref

PubMed Central

University of Queensland eSpace

The Genomes On Line Database (GOLD) in 2007: status of genomic and metagenomic projects and their associated metadata

Author: Fenner Marsha W
Kyrpides Nikos C.
Liolios Konstantinos
Mavromatis Konstantinos
Tavernarakis Nektarios
Publication venue: Oxford University Press
Publication date: 31/12/2007
Field of study

The Genomes On Line Database (GOLD) is a comprehensive resource that provides information on genome and metagenome projects worldwide. Complete and ongoing projects and their associated metadata can be accessed in GOLD through pre-computed lists and a search page. As of September 2007, GOLD contains information on more than 2900 sequencing projects, out of which 639 have been completed and their sequence data deposited in the public databases. GOLD continues to expand with the goal of providing metadata information related to the projects and the organisms/environments towards the Minimum Information about a Genome Sequence’ (MIGS) guideline. GOLD is available at http://www.genomesonline.org and has a mirror site at the Institute of Molecular Biology and Biotechnology, Crete, Greece at http://gold.imbb.forth.gr

PubMed Central

UNT Digital Library

Recommended from our members

Towards a standards-compliant genomic and metagenomic publication record

Author: Anguiloi Samuel
Cole James R.
Fenner Marsha W
Field Dawn
Garrity George M.
Glockner Frank Oliver
Hirschman Lynette
Kolker Eugene
Kowaluchuk George
Kyrpides Nikos
Moran Mary Ann
San-sone Susanna-Assunta
Ussery Dave
White Owen
Publication venue: Lawrence Berkeley National Laboratory
Publication date: 01/04/2008
Field of study

Increasingly we are aware as a community of the growing need to manage the avalanche of genomic and metagenomic data, in addition to related data types like ribosomal RNA and barcode sequences, in a way that tightly integrates contextual data with traditional literature in a machine-readable way. It is for this reason that the Genomic Standards Consortium (GSC) formed in 2005. Here we suggest that we move beyond the development of standards and tackle standards-compliance and improved data capture at the level of the scientific publication. We are supported in this goal by the fact that the scientific community is in the midst of a publishing revolution. This revolution is marked by a growing shift away from a traditional dichotomy between 'journal articles' and 'database entries' and an increasing adoption of hybrid models of collecting and disseminating scientific information. With respect to genomes and metagenomes and related data types, we feel the scientific community would be best served by the immediate launch of a central repository of short, highly structured 'Genome Notes' that must be standards-compliant. This could be done in the context of an existing journal, but we also suggest the more radical solution of launching a new journal. Such a journal could be designed to cater to a wide range of standards-related content types that are not currently centralized in the published literature. It could also support the demand for centralizing aspects of the 'gray literature' (documents developed by institutions or communities) such as the call by the GSCl for a central repository of Standard Operating Procedures describing the genomic annotation pipelines of the major sequencing centers. We argue that such an 'eJournal', published under the Open Access paradigm by the GSC, could be an attractive publishing forum for a broader range of standardization initiatives within, and beyond, the GSC and thereby fill an unoccupied yet increasingly important niche within the current research landscape

UNT Digital Library

The Genomes On Line Database (GOLD) in 2009: status of genomic and metagenomic projects and their associated metadata

Author: Benson
Bernal
Birney
Dawyndt
Field
Garrity
Hirschman
I-Min A. Chen
Konstantinos Liolios
Konstantinos Mavromatis
Kulikova
Kyrpides
Kyrpides
Liolios
Liolios
Markowitz
Markowitz
Markowitz
Meyer
Nektarios Tavernarakis
Nikos C. Kyrpides
Okubo
Philip Hugenholtz
Seshadri
Victor M. Markowitz
Publication venue: Oxford University Press
Publication date
Field of study

The Genomes On Line Database (GOLD) is a comprehensive resource for centralized monitoring of genome and metagenome projects worldwide. Both complete and ongoing projects, along with their associated metadata, can be accessed in GOLD through precomputed tables and a search page. As of September 2009, GOLD contains information for more than 5800 sequencing projects, of which 1100 have been completed and their sequence data deposited in a public repository. GOLD continues to expand, moving toward the goal of providing the most comprehensive repository of metadata information related to the projects and their organisms/environments in accordance with the Minimum Information about a (Meta)Genome Sequence (MIGS/MIMS) specification. GOLD is available at: http://www.genomesonline.org and has a mirror site at the Institute of Molecular Biology and Biotechnology, Crete, Greece, at: http://gold.imbb.forth.gr

Crossref

PubMed Central

Microbial genotype–phenotype mapping by class association rule mining

Author: Agrawal
Benjamini
Bowers
Butte
Clark
Cover
Eisen
Goh
Jedrzejas
Jeong
Jim
Kanehisa
Klus
Korbel
Kyrpides
Liu
Liu
Liu
Madigan
Makio Tamura
Markowitz
Moore
Moriyama
Ogata
Overbeek
Patrik D'haeseleer
Pellegrini
Pounds
Quinlan
Ravasz
Shaffer
Slonim
Storey
Tatusov
Tatusov
Von Mering
Von Mering
Zhang
Publication venue: Oxford University Press
Publication date
Field of study

Motivation: Microbial phenotypes are typically due to the concerted action of multiple gene functions, yet the presence of each gene may have only a weak correlation with the observed phenotype. Hence, it may be more appropriate to examine co-occurrence between sets of genes and a phenotype (multiple-to-one) instead of pairwise relations between a single gene and the phenotype. Here, we propose an efficient class association rule mining algorithm, netCAR, in order to extract sets of COGs (clusters of orthologous groups of proteins) associated with a phenotype from COG phylogenetic profiles and a phenotype profile. netCAR takes into account the phylogenetic co-occurrence graph between COGs to restrict hypothesis space, and uses mutual information to evaluate the biconditional relation

Crossref

PubMed Central

The Genomes OnLine Database (GOLD) v.4: status of genomic and metagenomic projects and their associated metadata

Author: B. Nosrat
Bernal
Field
Glass
I. Pagani
I.-M. A. Chen
Ivanova
J. Jansson
K. Liolios
Kyrpides
Kyrpides
Liolios
N. C. Kyrpides
T. Smirnova
The Human Microbiome Jumpstart Reference Strains C
V. M. Markowitz
Wu
Yilmaz
Publication venue: Oxford University Press
Publication date
Field of study

The Genomes OnLine Database (GOLD, http://www.genomesonline.org/) is a comprehensive resource for centralized monitoring of genome and metagenome projects worldwide. Both complete and ongoing projects, along with their associated metadata, can be accessed in GOLD through precomputed tables and a search page. As of September 2011, GOLD, now on version 4.0, contains information for 11 472 sequencing projects, of which 2907 have been completed and their sequence data has been deposited in a public repository. Out of these complete projects, 1918 are finished and 989 are permanent drafts. Moreover, GOLD contains information for 340 metagenome studies associated with 1927 metagenome samples. GOLD continues to expand, moving toward the goal of providing the most comprehensive repository of metadata information related to the projects and their organisms/environments in accordance with the Minimum Information about any (x) Sequence specification and beyond

Crossref

PubMed Central