Skip to main content
Article thumbnail
Location of Repository

SNP500Cancer: a public resource for sequence validation, assay development, and frequency analysis for genetic variation in candidate genes

By Bernice R. Packer, Meredith Yeager, Laura Burdett, Robert Welch, Michael Beerman, Liqun Qi, Hugues Sicotte, Brian Staats, Mekhala Acharya, Andrew Crenshaw, Andrew Eckert, Vinita Puri, Daniela S. Gerhard and Stephen J. Chanock


The SNP500Cancer database provides sequence and genotype assay information for candidate SNPs useful in mapping complex diseases, such as cancer. The database is an integral component of the NCI Cancer Genome Anatomy Project (). SNP500Cancer reports sequence analysis of anonymized control DNA samples (n = 102 Coriell samples representing four self-described ethnic groups: African/African-American, Caucasian, Hispanic and Pacific Rim). The website is searchable by gene, chromosome, gene ontology pathway, dbSNP ID and SNP500Cancer SNP ID. As of October 2005, the database contains >13 400 SNPs, 9124 of which have been sequenced in the SNP500Cancer population. For each analysed SNP, gene location and >200 bp of surrounding annotated sequence (including nearby SNPs) are provided, with frequency information in total and per subpopulation as well as calculation of Hardy–Weinberg equilibrium for each subpopulation. The website provides the conditions for validated sequencing and genotyping assays, as well as genotype results for the 102 samples, in both viewable and downloadable formats. A subset of sequence validated SNPs with minor allele frequency >5% are entered into a high-throughput pipeline for genotyping analysis to determine concordance for the same 102 samples. In addition, the results of genotype analysis for select validated SNP assays (defined as 100% concordance between sequence analysis and genotype results) are posted for an additional 280 samples drawn from the Human Diversity Panel (HDP). SNP500Cancer provides an invaluable resource for investigators to select SNPs for analysis, design genotyping assays using validated sequence data, choose selected assays already validated on one or more genotyping platforms, and select reference standards for genotyping assays. The SNP500Cancer database is freely accessible via the web page at

Topics: Article
Publisher: Oxford University Press
OAI identifier:
Provided by: PubMed Central
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://www.pubmedcentral.nih.g... (external link)
  • Suggested articles


    1. (2002). A human genome diversity cell line panel.
    2. and Yeager,M.(2005) Genewindow: an interactive tool for visualization of genomic variation.
    3. (2005). Comparison of yield and genotyping performance of multiple displacement amplification and Omniplex whole genome amplified DNA generated from multiple DNA sources.
    4. Consortium (2000), Gene Ontology: tool for the unification of biology.
    5. (2005). Cyberinfrastructure: empowering a ‘‘third way’’ in biomedical research.
    6. (2005). Database resources of the National Center for Biotechnology Information.
    7. (2001). dbSNP: the NCBI database of genetic variation.
    8. (2005). Effects of natural selection on inter-population divergence at polymorphic sites in human protein-coding loci.
    9. (2005). Ensembl
    10. (2005). Entrez Gene: gene-centered information at NCBI.
    11. (1996). Genetic Data Analysis II: Methods for Discrete Population Genetics Data. Sinauer,
    12. (2005). Genetic variation, nucleotide diversity, and linkage disequilibrium in seven telomere stability genes suggest that these genes may be under constraint.
    13. (2005). Haploview: analysis and visualization of LD and haplotype maps.
    14. (2003). Modeling and E-M estimation of haplotype-specific relative risks from D620
    15. (2000). Primer3 on the WWW for general users and for biologist programmers.
    16. (2004). Selecting a maximally informative set of SNPs forassociationanalysesusinglinkagedisequilibrium.Am.J.Hum.Genet.,
    17. (2004). Sequence analysis of the mannose-binding lectin (MBL2) gene reveals a high degree of heterozygosity with evidence of selection.
    18. (2004). SNP500Cancer: a public resource for sequence validation and assay developmentforgeneticvariationincandidategenes.NucleicAcidsRes.,
    19. Strausberg,R.L.,Simpson,A.J.G.andWooster,R.(2003)Sequence-based cancer genomics: progress, lessons, and opportunities.
    20. (2005). The International HapMap Project.
    21. (2003). Widespread purifying selection at polymorphic sites in human protein-coding loci.

    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.