Search CORE

40 research outputs found

Surrogate Parenthood: Protected and Informative Graphs

Author: Allen M. David
Blaustein Barbara
Chapman Adriane
Rosenthal Arnon
Seligman Len
Publication venue
Publication date: 01/01/2011
Field of study

Many applications, including provenance and some analyses of social networks, require path-based queries over graph-structured data. When these graphs contain sensitive information, paths may be broken, resulting in uninformative query results. This paper presents innovative techniques that give users more informative graph query results; the techniques leverage a common industry practice of providing what we call surrogates: alternate, less sensitive versions of nodes and edges releasable to a broader community. We describe techniques for interposing surrogate nodes and edges to protect sensitive graph components, while maximizing graph connectivity and giving users as much information as possible. In this work, we formalize the problem of creating a protected account G' of a graph G. We provide a utility measure to compare the informativeness of alternate protected accounts and an opacity measure for protected accounts, which indicates the likelihood that an attacker can recreate the topology of the original graph from the protected account. We provide an algorithm to create a maximally useful protected account of a sensitive graph, and show through evaluation with the PLUS prototype that using surrogates and protected accounts adds value for the user, with no significant impact on the time required to generate results for graph queries.Comment: VLDB201

arXiv.org e-Print Archive

CiteSeerX

Southampton (e-Prints Soton)

A Metadata Resource to Promote Data Integration

Author: Arnon Rosenthal
Len Seligman
Publication venue
Publication date
Field of study

Data integration is expensive, largely due to the difficulty of gathering relevant metadata. We observe that a database may participate in multiple integration efforts, and that much of the same metadata is required, regardless of which kind of effort is being undertaken. This presents a tremendous opportunity to reduce the long-term development and maintenance cost of interoperable systems, by gathering relevant metadata only once and representing it in ways that are amenable to reuse. We present metadata management strategies that promote reuse and describe a repository-based vision for the development of interoperable systems

CiteSeerX

The challenge of “quick and dirty” information quality

Author: Chapman Adriane P.
Rosenthal Arnon
Seligman Len
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 17/02/2016
Field of study

Southampton (e-Prints Soton)

Crossref

Time Constrained Information Quality

Author: Rosenthal Arnon
Seligman Len
Chapman Adriane P
Publication venue
Publication date: 01/01/2012
Field of study

Southampton (e-Prints Soton)

Biblioteca Digital de la Comunidad de Madrid