47 research outputs found

NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins

Author: Maglott Donna R.
Pruitt Kim D.
Tatusova Tatiana
Publication venue: Oxford University Press
Publication date: 27/11/2006

NCBI's reference sequence (RefSeq) database is a curated non-redundant collection of sequences representing genomes, transcripts and proteins. The database includes 3774 organisms spanning prokaryotes, eukaryotes and viruses, and has records for 2 879 860 proteins (RefSeq release 19). RefSeq records integrate information from multiple sources when additional data are available from those sources, and therefore represent a current description of the sequence and its features. Annotations include coding regions, conserved domains, tRNAs, sequence tagged sites (STS), variation, references, gene and protein product names, and database cross-references. Sequence is reviewed and features are added using a combined approach of collaboration and other input from the scientific community, prediction, propagation from GenBank and curation by NCBI staff. The format of all RefSeq records is validated, and an increasing number of tests are being applied to evaluate the quality of sequence and annotation, especially in the context of complete genomic sequence.

Crossref

PubMed Central

Entrez Gene: gene-centered information at NCBI

Author: Maglott Donna
Ostell Jim
Pruitt Kim D.
Tatusova Tatiana
Publication venue: Oxford University Press
Publication date: 05/12/2006

Entrez Gene is NCBI's database for gene-specific information. Entrez Gene includes records from genomes that have been completely sequenced, that have an active research community to contribute gene-specific information or that are scheduled for intense sequence analysis. The content of Entrez Gene represents the result of both curation and automated integration of data from NCBI's Reference Sequence project (RefSeq), from collaborating model organism databases and from other databases within NCBI. Records in Entrez Gene are assigned unique, stable and tracked integers as identifiers. The content (nomenclature, map location, gene products and their attributes, markers, phenotypes and links to citations, sequences, variation details, maps, expression, homologs, protein domains and external databases) is provided via interactive browsing through NCBI's Entrez system, via NCBI's Entrez programming utilities (E-Utilities), and for bulk transfer by FTP.

CiteSeerX

Crossref

PubMed Central

Entrez Gene: gene-centered information at NCBI

Author: Maglott Donna
Ostell Jim
Pruitt Kim D.
Tatusova Tatiana
Publication venue: Oxford University Press

Entrez Gene (http://www.ncbi.nlm.nih.gov/gene) is National Center for Biotechnology Information (NCBI)’s database for gene-specific information. Entrez Gene maintains records from genomes which have been completely sequenced, which have an active research community to submit gene-specific information, or which are scheduled for intense sequence analysis. The content represents the integration of curation and automated processing from NCBI’s Reference Sequence project (RefSeq), collaborating model organism databases, consortia such as Gene Ontology and other databases within NCBI. Records in Entrez Gene are assigned unique, stable and tracked integers as identifiers. The content (nomenclature, genomic location, gene products and their attributes, markers, phenotypes and links to citations, sequences, variation details, maps, expression, homologs, protein domains and external databases) is available via interactive browsing through NCBI’s Entrez system, via NCBI’s Entrez programming utilities (E-Utilities) and for bulk transfer by FTP.

Crossref

PubMed Central

Database resources of the National Center for Biotechnology Information

Author: Barrett Tanya
Benson Dennis A.
Bryant Stephen H.
Canese Kathi
Church Deanna M.
DiCuccio Michael
Edgar Ron
Federhen Scott
Helmberg Wolfgang
Kenton David L.
Khovayko Oleg
Lipman David J.
Madden Thomas L.
Maglott Donna R.
Ostell James
Pontius Joan U.
Pruitt Kim D.
Schriml Lynn M.
Schuler Gregory D.
Sequeira Edwin
Sherry Steven T.
Sirotkin Karl
Starchenko Grigory
Suzek Tugba O.
Tatusov Roman
Tatusova Tatiana A.
Wagner Lukas
Wheeler David L.
Yaschenko Eugene
Publication venue: Oxford University Press
Publication date: 17/12/2004

In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data retrieval systems and computational resources for the analysis of data in GenBank and other biological data made available through NCBI's website. NCBI resources include Entrez, Entrez Programming Utilities, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups (COGs), Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, SAGEmap, Gene Expression Omnibus (GEO), Online Mendelian Inheritance in Man (OMIM), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD) and the Conserved Domain Architecture Retrieval Tool (CDART). Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized datasets. All of the resources can be accessed through the NCBI home page at http://www.ncbi.nlm.nih.gov.

Crossref

PubMed Central

ClinGen — The Clinical Genome Resource

Author: Berg Jonathan S.
Brooks Lisa D.
Bustamante Carlos D.
De Backer Julie
Evans James P.
ClinGen
Landrum Melissa J.
Ledbetter David H.
Maglott Donna R.
Martin Christa Lese
Nussbaum Robert L.
Plon Sharon E.
Ramos Erin M.
Rehm Heidi L.
Sherry Stephen T.
Watson Michael S.
Publication date: 01/01/2015

On autopsy, a patient is found to have hypertrophic cardiomyopathy. The patient’s family pursues genetic testing that shows a “likely pathogenic” variant for the condition on the basis of a study in an original research publication. Given the dominant inheritance of the condition and the risk of sudden cardiac death, other family members are tested for the genetic variant to determine their risk. Several family members test negative and are told that they are not at risk for hypertrophic cardiomyopathy and sudden cardiac death, and those who test positive are told that they need to be regularly monitored for cardiomyopathy on echocardiography. Five years later, during a routine clinic visit of one of the genotype-positive family members, the cardiologist queries a database for current knowledge on the genetic variant and discovers that the variant is now interpreted as “likely benign” by another laboratory that uses more recently derived population-frequency data. A newly available testing panel for additional genes that are implicated in hypertrophic cardiomyopathy is initiated on an affected family member, and a different variant is found that is determined to be pathogenic. Family members are retested, and one member who previously tested negative is now found to be positive for this new variant. An immediate clinical workup detects evidence of cardiomyopathy, and an intracardiac defibrillator is implanted to reduce the risk of sudden cardiac death.

Ghent University Academic Bibliography

PubMed Central

Carolina Digital Repository

Archivsystem Ask23

eScholarship - University of California

Database resources of the National Center for Biotechnology Information

Author: Alexandre Souvorov
Altschul
Amberger
Anna Panchenko
Aron Marchler-Bauer
Barrett
Benson
Berman
Blumenfeld
Brazma
Crosby
David J. Lipman
David Landsman
Deanna M. Church
Dennis A. Benson
Donna R. Maglott
Douglas Slotta
Edwin Sequeira
Eppig
Eric W. Sayers
Eugene Yaschenko
Evan Bolton
Finn
Fu
Geer
Geschwind
Ghedin
Gibrat
Gong
Gregory D. Schuler
Grigory Starchenko
Haft
Heintz
Helmberg
Hong
Ilene Mizrachi
James Ostell
Ji
Jian Ye
Kanehisa
Kapustin
Karl Sirotkin
Kathi Canese
Keseler
Kim D. Pruitt
Klimke
Knutsen
Lenffer
Letunic
Lewis Y. Geer
Lukas Wagner
Ma
Madej
Maglott
Manolio
Marchler-Bauer
Martin Shumway
Michael DiCuccio
Michael Feolo
Mitelman
Needleman
Pagon
Papadopoulos
Pruitt
Schuler
Scott Federhen
Sequeira
Sewell
Sherry
Shumway
Sprague
Stephen H. Bryant
Stephen T. Sherry
Tanya Barrett
Tatiana A. Tatusova
Tatusov
Tatusova
Thomas L. Madden
Tom Madej
Vadim Miller
Vyacheslav Chetvernin
W. John Wilbur
Waggoner
Wang
Whetzel
Wolfgang Helmberg
Yanli Wang
Ye
Yuri Kapustin
Zhang
Zhiyong Lu
Publication venue: Oxford University Press
Publication date: 01/01/2010

In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, Splign, Reference Sequence, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus, Entrez Probe, GENSAT, Online Mendelian Inheritance in Man, Online Mendelian Inheritance in Animals, the Molecular Modeling Database, the Conserved Domain Database, the Conserved Domain Architecture Retrieval Tool, Biosystems, Peptidome, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the web applications are custom implementations of the BLAST program optimized to search specialized data sets. All these resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.

CiteSeerX

Crossref

PubMed Central

The Consensus Coding Sequence (CCDS) Project: Identifying a Common Protein-Coding Gene Set for the Human and Mouse Genomes

Effective use of the human and mouse genomes requires reliable identification of genes and their products. Although multiple public resources provide annotation, different methods are used that can result in similar but not identical representation of genes, transcripts, and proteins. The collaborative consensus coding sequence (CCDS) project tracks identical protein annotations on the reference mouse and human genomes with a stable identifier (CCDS ID), and ensures that they are consistently represented on the NCBI, Ensembl, and UCSC Genome Browsers. Importantly, the project coordinates on manually reviewing inconsistent protein annotations between sites, as well as annotations for which new evidence suggests a revision is needed, to progressively converge on a complete protein-coding set for the human and mouse reference genomes, while maintaining a high standard of reliability and biological accuracy. To date, the project has identified 20,159 human and 17,707 mouse consensus coding regions from 17,052 human and 16,893 mouse genes. Three evaluation methods indicate that the entries in the CCDS set are highly likely to represent real proteins, more so than annotations from contributing groups not included in CCDS. The CCDS database thus centralizes the function of identifying well-supported, identically-annotated, protein-coding regions.

Funding: National Human Genome Research Institute (U.S.) (Grant number 1U54HG004555-01); Wellcome Trust (London, England) (Grant number WT062023); Wellcome Trust (London, England) (Grant number WT077198)

DSpace@MIT

PubMed Central

King's Research Portal

Systematic documentation and analysis of human genetic variation in hemoglobinopathies using the microattribution approach

Author: A Nazli Basak
Adamantia Papachatzopoulou
Alain Francina
Alex E Felice
Barnaby Clark
Belinda Giardine
Belinda K Singleton
BK Singleton
Branka Zukic
C Yu
Cathy Riemer
Claudia Wiemann
Cornelis L Harteveld
David H K Chui
David J Anstee
Donna Maglott
Douglas R Higgs
DP Heruth
DP Steensma
Emmanuel Kanavakis
Flavia C Costa
George P Patrinos
GP Patrinos
Halyna Fedosyuk
Henri Wajcman
I Amoyal
IF Fokkema
Iris Schrijver
J Borg
J Xu
James D Hoyer
Jan Traeger-Synodinos
John Old
John S Waye
Joseph Borg
K Moradkhani
Kamran Moradkhani
Kenneth R Peterson
KR Peterson
L Arnaud
Lucia Perseu
M Siatecka
Maja Stojiljkovic
Manoussos N Papadakis
Marianthi Georgitsi
Martin Jarvis
MH Steinberg
Milena Radmilovic
MN Papadakis
Monica V E Gallivan
Panagoula Kollia
Paula Faustino
Petros Papadopoulos
Philippe Joly
Piero C Giordano
Q Ma
R Drissen
Ray Tully
RC Hardison
Renzo Galanello
Richard J Gibbons
RJ Gibbons
RM Böhmer
Ross C Hardison
S Harju
S Harju-Baker
S Menzel
Sjaak Philipsen
SL Thein
Sonja Pavlovic
Stefania Satta
Stephan Menzel
Swee Lay Thein
Takahito Wada
V Viprakasit
VG Sankaran
Webb Miller
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2011

We developed a series of interrelated locus-specific databases to store all published and unpublished genetic variation related to hemoglobinopathies and thalassemia and implemented microattribution to encourage submission of unpublished observations of genetic variation to these public repositories. A total of 1,941 unique genetic variants in 37 genes, encoding globins and other erythroid proteins, are currently documented in these databases, with reciprocal attribution of microcitations to data contributors. Our project provides the first example of implementing microattribution to incentivise submission of all known genetic variation in a defined system. It has demonstrably increased the reporting of human variants, leading to a comprehensive online resource for systematically describing human genetic variation in the globin genes and other genes contributing to hemoglobinopathies and thalassemias. The principles established here will serve as a model for other systems and for the analysis of other common and/or complex human genetic diseases

Archivio istituzionale della ricerca - Università di Cagliari

Leiden University Scholary Publications

Erasmus University Digital Repository

Crossref

PubMed Central

Oxford University Research Archive

King's Research Portal

imagine

Repositório Científico do Instituto Nacional de Saúde

Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse

Author: A Morgulis
A Spiess
A Touré
A Valouev
A Varki
Ana C. Marques
B Charlesworth
B Oh
Brian Teague
Carol J. Bult
Chris P. Ponting
Christopher Churas
CM Laukaitis
D Kipling
D Nguyen
D Söderlund
Daniel Forrest
David C. Schwartz
Deanna M. Church
Donna Maglott
E Whitelaw
EC Salido
ER Liman
ET Dimalanta
Evan E. Eichler
FS Collins
H Huang
H Iida
H Skaletsky
IA Maksakova
J Eid
J Perry
J Ponjavic
J Rossant
JA Bailey
James Amos-Landgraf
JC Stevens
JC Venter
JHM Lammers
Jill Herschleb
JL Mueller
JM Young
Joshua L. Cherry
K Lindblad-Toh
KD Pruitt
Kerstin Lindblad-Toh
Konstantinos Potamousis
L Armengol
L Chittenden
L Goodstadt
LaDeana W. Hillier
Leo Goodstadt
LL Jacobs
LN Reynard
M Clamp
M Jackson
MF Bolliger
Michael C. Zody
Michael DiCuccio
Michael Place
MJ Justice
MM Abd El-Aziz
MT Ross
P Carninci
P Pevzner
Peter Meric
PJI Ellis
RA Gibbs
RD Emes
RD Martin
RH Waterston
Richa Agarwala
Richard J. Roberts
Ron Runnheim
S Aluru
S Dadé
S Griffiths-Jones
S Ohno
S Rouquier
S Tu
S Zhou
SC Grubb
SF Altschul
SG Gregory
Shiguo Zhou
Steve Goldstein
T Marques-Bonet
Tina Graves
TJ Hudson
TJ Nicholas
TS Mikkelsen
WF Dietrich
WJ Murphy
Wratko Hlavina
X She
Xinwei She
Y Okazaki
Yuri Kapustin
Z Birtle
Ze Cheng
Zoë Birtle
Publication venue: Public Library of Science
Publication date: 01/01/2009

A finished clone-based assembly of the mouse genome reveals extensive sequence duplication during recent evolution and rodent-specific expansion of certain gene families. Newly assembled duplications contain protein-coding genes that are mostly involved in reproductive function.

CiteSeerX

Crossref

Cold Spring Harbor Laboratory Institutional Repository

The Jackson Laboratory: The Mouseion at the JAXlibrary

Directory of Open Access Journals

PubMed Central

UCL Discovery

Edinburgh Research Explorer

Oxford University Research Archive