Search CORE

80 research outputs found

NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins

Author: Maglott Donna R.
Pruitt Kim D.
Tatusova Tatiana
Publication venue: Oxford University Press
Publication date: 27/11/2006
Field of study

NCBI's reference sequence (RefSeq) database () is a curated non-redundant collection of sequences representing genomes, transcripts and proteins. The database includes 3774 organisms spanning prokaryotes, eukaryotes and viruses, and has records for 2 879 860 proteins (RefSeq release 19). RefSeq records integrate information from multiple sources, when additional data are available from those sources and therefore represent a current description of the sequence and its features. Annotations include coding regions, conserved domains, tRNAs, sequence tagged sites (STS), variation, references, gene and protein product names, and database cross-references. Sequence is reviewed and features are added using a combined approach of collaboration and other input from the scientific community, prediction, propagation from GenBank and curation by NCBI staff. The format of all RefSeq records is validated, and an increasing number of tests are being applied to evaluate the quality of sequence and annotation, especially in the context of complete genomic sequence

Crossref

PubMed Central

Entrez Gene: gene-centered information at NCBI

Author: Maglott Donna
Ostell Jim
Pruitt Kim D.
Tatusova Tatiana
Publication venue: Oxford University Press
Publication date: 05/12/2006
Field of study

Entrez Gene () is NCBI's database for gene-specific information. Entrez Gene includes records from genomes that have been completely sequenced, that have an active research community to contribute gene-specific information or that are scheduled for intense sequence analysis. The content of Entrez Gene represents the result of both curation and automated integration of data from NCBI's Reference Sequence project (RefSeq), from collaborating model organism databases and from other databases within NCBI. Records in Entrez Gene are assigned unique, stable and tracked integers as identifiers. The content (nomenclature, map location, gene products and their attributes, markers, phenotypes and links to citations, sequences, variation details, maps, expression, homologs, protein domains and external databases) is provided via interactive browsing through NCBI's Entrez system, via NCBI's Entrez programing utilities (E-Utilities), and for bulk transfer by ftp

CiteSeerX

Crossref

PubMed Central

Entrez Gene: gene-centered information at NCBI

Author: Maglott Donna
Ostell Jim
Pruitt Kim D.
Tatusova Tatiana
Publication venue: Oxford University Press
Publication date
Field of study

Entrez Gene (http://www.ncbi.nlm.nih.gov/gene) is National Center for Biotechnology Information (NCBI)’s database for gene-specific information. Entrez Gene maintains records from genomes which have been completely sequenced, which have an active research community to submit gene-specific information, or which are scheduled for intense sequence analysis. The content represents the integration of curation and automated processing from NCBI’s Reference Sequence project (RefSeq), collaborating model organism databases, consortia such as Gene Ontology and other databases within NCBI. Records in Entrez Gene are assigned unique, stable and tracked integers as identifiers. The content (nomenclature, genomic location, gene products and their attributes, markers, phenotypes and links to citations, sequences, variation details, maps, expression, homologs, protein domains and external databases) is available via interactive browsing through NCBI’s Entrez system, via NCBI’s Entrez programming utilities (E-Utilities) and for bulk transfer by FTP

Crossref

PubMed Central

A QTL Resource and Comparison Tool for Pigs: PigQTLDB

Author: Bastiaansen John
Dracheva Svetlana
Hu Zhi-Liang
Jang Wonhee
Maglott Donna
Reecy James
Reecy James
Rothschild Max
Rothschild Max
Publication venue: Iowa State University Digital Repository
Publication date: 01/10/2005
Field of study

During the past decade, efforts to map quantitative trait loci (QTL) in pigs have resulted in hundreds of QTL being reported for growth, meat quality, reproduction, disease resistance, and other traits. It is a challenge to locate, interpret, and compare QTL results from different studies. We have developed a pig QTL database (PigQTLdb) that integrates available pig QTL data in the public domain, thus, facilitating the use of this QTL data in future studies. We also developed a pig trait classification system to standardize names of traits and to simplify organization and searching of the trait data. These steps made it possible to compare primary data from diverse sources and methods. We used existing pig map databases and other publicly available data resources (such as PubMed) to avoid redundant developmental work. The PigQTLdb was also designed to include data representing major genes and markers associated with a large effect on economically important traits. To date, over 790 QTL from 73 publications have been curated into the database. Those QTL cover more than 300 different traits. The data have been submitted to the Entrez Gene and the Map Viewer resources at NCBI, where the information about markers was matched to marker records in NCBI’s UniSTS database. Having these data in a public resource like NCBI allows regularly updated automatic matching of markers to public sequence data by e-PCR. The submitted data, and the results of these calculations, are retrievable from NCBI via Entrez Gene, Map Viewer, and UniSTS. Efforts were undertaken to improve the integrated functional genomics resources for pigs

Digital Repository @ Iowa State University (ISU)

The chicken gene nomenclature committee report

Author: Antin Parker B
Burgess Shane C
Burt David W
Carrë Wilfrid
Fell Mark
Law Andy S
Maglott Donna R
McCarthy Fiona M
Schmidt Carl J
Weber Janet A
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Comparative genomics is an essential component of the post-genomic era. The chicken genome is the first avian genome to be sequenced and it will serve as a model for other avian species. Moreover, due to its unique evolutionary niche, the chicken genome can be used to understand evolution of functional elements and gene regulation in mammalian species. However comparative biology both within avian species and within amniotes is hampered due to the difficulty of recognising functional orthologs. This problem is compounded as different databases and sequence repositories proliferate and the names they assign to functional elements proliferate along with them. Currently, genes can be published under more than one name and one name sometimes refers to unrelated genes. Standardized gene nomenclature is necessary to facilitate communication between scientists and genomic resources. Moreover, it is important that this nomenclature be based on existing nomenclature efforts where possible to truly facilitate studies between different species. We report here the formation of the Chicken Gene Nomenclature Committee (CGNC), an international and centralized effort to provide standardized nomenclature for chicken genes. The CGNC works in conjunction with public resources such as NCBI and Ensembl and in consultation with existing nomenclature committees for human and mouse. The CGNC will develop standardized nomenclature in consultation with the research community and relies on the support of the research community to ensure that the nomenclature facilitates comparative and genomic studies

Crossref

Springer - Publisher Connector

PubMed Central

Edinburgh Research Explorer

The University of Arizona

University of Queensland eSpace

Locus Reference Genomic sequences: an improved basis for describing human DNA variants

Author: Astashyn Alex
Birney Ewan
Brookes Anthony J
Béroud Christophe
Chen Yuan
Cunningham Fiona
Dalgleish Raymond
den Dunnen Johan T
Devereau Andrew
Dobson Glen
Flicek Paul
Larsson Pontus
Lehväslaiho Heikki
Maglott Donna R
McLaren William M
Proctor Glenn
Taschner Peter EM
Tully Raymond E
Vaughan Brendan W
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

As our knowledge of the complexity of gene architecture grows, and we increase our understanding of the subtleties of gene expression, the process of accurately describing disease-causing gene variants has become increasingly problematic. In part, this is due to current reference DNA sequence formats that do not fully meet present needs. Here we present the Locus Reference Genomic (LRG) sequence format, which has been designed for the specific purpose of gene variant reporting. The format builds on the successful National Center for Biotechnology Information (NCBI) RefSeqGene project and provides a single-file record containing a uniquely stable reference DNA sequence along with all relevant transcript and protein sequences essential to the description of gene variants. In principle, LRGs can be created for any organism, not just human. In addition, we recognize the need to respect legacy numbering systems for exons and amino acids and the LRG format takes account of these. We hope that widespread adoption of LRGs - which will be created and maintained by the NCBI and the European Bioinformatics Institute (EBI) - along with consistent use of the Human Genome Variation Society (HGVS)-approved variant nomenclature will reduce errors in the reporting of variants in the literature and improve communication about variants affecting human health. Further information can be found on the LRG web site: http://www.lrg-sequence.org

Crossref

Springer - Publisher Connector

HAL-Inserm

PubMed Central

HAL Descartes

Leiden University Scholary Publications

Leicester Research Archive

HGVS recommendations for the description of sequence variants: 2016 update

Author: Anne-Francoise Roux
Donna R Maglott
Jean Mcgowan-Jordan
Johan T Den Dunnen
Marc S Greenblatt
Peter E M Taschner
Raymond Dalgleish
Reece K Hart
Stylianos E Antonarakis
Timothy Smith
Publication venue
Publication date: 23/04/2020
Field of study

CiteSeerX

Database resources of the National Center for Biotechnology Information

Author: Barrett Tanya
Benson Dennis A.
Bryant Stephen H.
Canese Kathi
Church Deanna M.
DiCuccio Michael
Edgar Ron
Federhen Scott
Helmberg Wolfgang
Kenton David L.
Khovayko Oleg
Lipman David J.
Madden Thomas L.
Maglott Donna R.
Ostell James
Pontius Joan U.
Pruitt Kim D.
Schriml Lynn M.
Schuler Gregory D.
Sequeira Edwin
Sherry Steven T.
Sirotkin Karl
Starchenko Grigory
Suzek Tugba O.
Tatusov Roman
Tatusova Tatiana A.
Wagner Lukas
Wheeler David L.
Yaschenko Eugene
Publication venue: Oxford University Press
Publication date: 17/12/2004
Field of study

In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data retrieval systems and computational resources for the analysis of data in GenBank and other biological data made available through NCBI's website. NCBI resources include Entrez, Entrez Programming Utilities, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups (COGs), Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, SAGEmap, Gene Expression Omnibus (GEO), Online Mendelian Inheritance in Man (OMIM), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD) and the Conserved Domain Architecture Retrieval Tool (CDART). Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized datasets. All of the resources can be accessed through the NCBI home page at http://www.ncbi.nlm.nih.gov

Crossref

PubMed Central

ClinGen — The Clinical Genome Resource

Author: Berg Jonathan S.
Brooks Lisa D.
Bustamante Carlos D.
De Backer Julie
Evans James P.
Gen Clin
Landrum Melissa J.
Ledbetter David H.
Maglott Donna R.
Martin Christa Lese
Nussbaum Robert L.
Plon Sharon E.
Ramos Erin M.
Rehm Heidi L.
Sherry Stephen T.
Watson Michael S.
Publication venue
Publication date: 01/01/2015
Field of study

On autopsy, a patient is found to have hypertrophic cardiomyopathy. The patient’s family pursues genetic testing that shows a “likely pathogenic” variant for the condition on the basis of a study in an original research publication. Given the dominant inheritance of the condition and the risk of sudden cardiac death, other family members are tested for the genetic variant to determine their risk. Several family members test negative and are told that they are not at risk for hypertrophic cardiomyopathy and sudden cardiac death, and those who test positive are told that they need to be regularly monitored for cardiomyopathy on echocardiography. Five years later, during a routine clinic visit of one of the genotype-positive family members, the cardiologist queries a database for current knowledge on the genetic variant and discovers that the variant is now interpreted as “likely benign” by another laboratory that uses more recently derived population-frequency data. A newly available testing panel for additional genes that are implicated in hypertrophic cardiomyopathy is initiated on an affected family member, and a different variant is found that is determined to be pathogenic. Family members are retested, and one member who previously tested negative is now found to be positive for this new variant. An immediate clinical workup detects evidence of cardiomyopathy, and an intracardiac defibrillator is implanted to reduce the risk of sudden cardiac death

Ghent University Academic Bibliography

PubMed Central

Carolina Digital Repository

Archivsystem Ask23

eScholarship - University of California