Combinatorial Information Theoretical Measurement of the Semantic Significance of Semantic Graph Motifs

By Cliff Joslyn, David Haglin, Sinan Al-saffar and Lawrence Holder

Abstract

Given a semantic graph dataset, perhaps one lacking an explicit ontology, we wish first to identify its significant semantic structures, and then to measure the extent of their significance. Casting a semantic graph dataset as an edge-labeled, directed graph, this task can be built on the ability to mine frequent labeled subgraphs in edge-labeled, directed graphs. We begin by considering the enumerative combinatorics of subgraph motif structures in edge-labeled, directed graphs. We identify frequent labeled, directed subgraph motif patterns, and measure the significance of the resulting motifs by the information gain relative to the expected value of the motif based on the empirical frequency distribution of the link types which compose them, assuming independence. We illustrate on a small test graph, and discuss results obtained for small linear motifs (link type bigrams and trigrams) in the Billion Triple Challenge triplestore.
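
The significance measure described in the abstract can be illustrated with a minimal sketch for link type bigrams. The abstract does not give the exact formula, so the score used here, the base-2 log ratio of a bigram's observed relative frequency to the product of its link types' empirical frequencies, is an assumption and not necessarily the authors' definition. The function name `bigram_information_gain` and the toy triples are hypothetical and are not the paper's test graph.

```python
from collections import Counter, defaultdict
from math import log2


def bigram_information_gain(triples):
    """Score each link type bigram (a, b), i.e. each path x -a-> y -b-> z.

    Assumed score (not taken from the paper): log2(observed / expected),
    where `expected` treats the two link types as independent draws from
    the empirical link type distribution.
    """
    triples = list(triples)

    # Empirical frequency distribution of link types (edge labels).
    label_counts = Counter(label for _, label, _ in triples)
    total_edges = sum(label_counts.values())

    # Index the labels of outgoing edges by their source node.
    out_labels = defaultdict(list)
    for subject, label, _ in triples:
        out_labels[subject].append(label)

    # Count every directed two-edge path by its pair of labels.
    bigram_counts = Counter()
    for _, a, obj in triples:
        for b in out_labels.get(obj, ()):
            bigram_counts[(a, b)] += 1
    total_bigrams = sum(bigram_counts.values())

    scores = {}
    for (a, b), count in bigram_counts.items():
        observed = count / total_bigrams
        expected = (label_counts[a] / total_edges) * (label_counts[b] / total_edges)
        scores[(a, b)] = log2(observed / expected)
    return scores


# Hypothetical toy graph as (subject, link type, object) triples.
toy_triples = [
    ("alice", "knows", "bob"),
    ("bob", "worksAt", "acme"),
    ("carol", "knows", "dave"),
    ("dave", "worksAt", "acme"),
    ("acme", "locatedIn", "seattle"),
]

for bigram, score in sorted(bigram_information_gain(toy_triples).items()):
    print(bigram, round(score, 3))
```

A positive score means the bigram occurs more often than independence of its link types would predict. The same counting extends to trigrams by following one more outgoing edge, at the cost of a larger enumeration.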

Year: 2013
OAI identifier: oai:CiteSeerX.psu:10.1.1.352.4044
Provided by: CiteSeerX
Full text may be available at the following location(s):
  • http://citeseerx.ist.psu.edu/v... (external link)
  • http://cass-mt.pnnl.gov/docs/p... (external link)