Skip to main content
Article thumbnail
Location of Repository

Semantic Web-Based Integration of Cancer Pathways and Allele Frequency Data

By Matthew E. Holford, Haseena Rajeevan, Hongyu Zhao, Kenneth K. Kidd and Kei-Hoi Cheung


We demonstrate the use of Semantic Web technology to integrate the ALFRED allele frequency database and the Starpath pathway resource. The linking of population-specific genotype data with cancer-related pathway data is potentially useful given the growing interest in personalized medicine and the exploitation of pathway knowledge for cancer drug discovery. We model our data using the Web Ontology Language (OWL), drawing upon ideas from existing standard formats BioPAX for pathway data and PML for allele frequency data. We store our data within an Oracle database, using Oracle Semantic Technologies. We then query the data using Oracle’s rule-based inference engine and SPARQL-like RDF query language. The ability to perform queries across the domains of population genetics and pathways offers the potential to answer a number of cancer-related research questions. Among the possibilities is the ability to identify genetic variants which are associated with cancer pathways and whose frequency varies significantly between ethnic groups. This sort of information could be useful for designing clinical studies and for providing background data in personalized medicine. It could also assist with the interpretation of genetic analysis results such as those from genome-wide association studies

Topics: Methodology
Publisher: Libertas Academica
OAI identifier:
Provided by: PubMed Central

Suggested articles


  1. (2002). a new tool for viewing and analyzing microarray data on biological pathways. Nature Genetics.
  2. A reference ontology for biomedical informatics: the foundational model of anatomy.
  3. (2003). A Semantic Web Primer.
  4. A Semantic Web Toolkit.
  5. A view from the Web: BioCarta.
  6. (2006). Abstract 4110: Allele Frequencies of Functional SNPs in Candidate Genes for Cardiovascular Disease Differ by Ethnicity. Genetic Epidemiology.
  7. ALFRED—The Allele FREquency Database [online].
  8. (2003). ALFRED—the ALlele FREquency Database—update. Nucleic Acids Res.
  9. ALFRED: an allele frequency database for diverse populations and DNA polymorphisms.
  10. (2008). An ontology-driven semantic mashup of gene and biological pathway information: Application to the domain of nicotine dependence.
  11. Biological Pathways Exchange [online].
  12. (2007). Biomedical ontologies: a functional perspective. Briefi ngs In Bioinformatics.
  13. caGrid: design and implementation of the core architecture of the cancer biomedical informatics grid.
  14. Co-evolution of Proteins with their Interaction Partners.
  15. (2000). dbSNP: a database of single nucleotide polymorphisms. Nucleic Acids Res.
  16. (2008). dom4j: the fl exible XML framework for Java [online].
  17. (2004). E-neuroscience: challenges and triumphs in integrating distributed data from molecules to brains. Nat Neurosci.
  18. (1978). Evolution and the genetics of populations,
  19. Gene ontology: tool for the unifi cation of biology.
  20. Gramene: a growing plant comparative genomics resource.
  21. (2008). Health Care and Life Sciences (HCLS) Interest Group [online].
  22. (2008). Human Genome Diversity Project [online].
  23. Implementing an Inference Engine for RDFS/OWL Constructs and User-Defi ned Rules in Oracle.
  24. (2008). Kazuki project at SemWebCentral [online].
  25. (1999). KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res.
  26. (2008). Language for RDF [online].
  27. (2005). Large-scale SNP analysis reveals clustered and continuous patterns of human genetic variation. Hum Genomics.
  28. Markup Language [online].
  29. (2005). Modeling a description logic vocabulary for cancer research,
  30. Ontology Language Reference [online].
  31. (2008). Semantic Technologies Center [online].
  32. (2007). Semantic Web Services: Concepts, Technologies, and Applications.
  33. semCDI: A Query Formulation for Semantic Data Integration in caBIG.
  34. (2008). Starpath—Tools to analyze pathways [online].
  35. (2008). State of the nation in data integration for bioinformatics. J Biomed Inform. in press.
  36. TAMBIS: transparent access to multiple bioinformatics information sources.
  37. The Cancer Biomedical Informatics Grid: pioneering an expansive network of information and tools for collaborative cancer research.
  38. (2008). The Cancer Cell. Map [online].
  39. The evolution of Protege: an environment for knowledge-based systems development.
  40. (2005). The Internatonal HapMap Project Web site. Genome Res.
  41. The OBO foundry: coordinated evolution of ontologies to support biomedical data integration.
  42. The semantic web revisited.
  43. The Unifi ed Medical Language System (UMLS): integrating biomedical terminology.
  44. (2008). yOWL: An ontology-driven knowledge base for yeast biologists. J Biomed Inform. in press.

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.