3 research outputs found

Union Types for Semistructured Data

Author: A. J. Kfoury
F. Barbanera
F. M. Damm
M. Dezani-Ciancaglini
P. Buneman
S. Abiteboul
S. Hayashi
Serge Abiteboul and Richard Hull. IFO
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/1999

Semistructured databases are treated as dynamically typed: they come equipped with no independent schema or type system to constrain the data. Query languages that are designed for semistructured data, even when used with structured data, typically ignore any type information that may be present. The consequences of this are what one would expect from using a dynamic type system with complex data: fewer guarantees on the correctness of applications. For example, a query that would cause a type error in a statically typed query language will return the empty set when applied to a semistructured representation of the same data. Much semistructured data originates in structured data. A semistructured representation is useful when one wants to add data that does not conform to the original type or when one wants to combine sources of different types. However, the deviations from the prescribed types are often minor, and we believe that a better strategy than throwing away all typ..
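The empty-result behaviour described in this abstract can be illustrated with a minimal Python sketch. The data, the `select` helper, and the label names are all hypothetical and are not taken from the paper; the sketch only assumes that a semistructured database can be modelled as nested dictionaries and lists. A misspelled label silently matches nothing, and the `phone` field shows the kind of minor deviation from a prescribed type (a string in one record, a list of strings in another) that the abstract mentions.

```python
# Hypothetical semistructured data: nested dicts and lists, no schema attached.
db = {
    "person": [
        {"name": "Ann", "phone": "555-0100"},
        {"name": "Bob", "phone": ["555-0101", "555-0102"]},  # deviates: a list
    ]
}


def select(node, path):
    """Follow a path of labels; labels that do not occur contribute nothing."""
    nodes = [node]
    for label in path:
        matched = []
        for n in nodes:
            # Treat a list as a collection of children, anything else as one child.
            items = n if isinstance(n, list) else [n]
            for item in items:
                if isinstance(item, dict) and label in item:
                    matched.append(item[label])
        nodes = matched
    # Flatten list-valued results so callers always get a flat list of values.
    result = []
    for n in nodes:
        result.extend(n if isinstance(n, list) else [n])
    return result


names = select(db, ["person", "name"])    # well-formed query
typo = select(db, ["person", "fone"])     # misspelled label: no error is raised
phones = select(db, ["person", "phone"])  # mixes string and list-of-string values
```

Here `typo` is simply an empty list, so the mistake goes unnoticed; a statically typed query language could reject the unknown label before the query runs.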

CiteSeerX

Crossref

Edinburgh Research Explorer

ScholarlyCommons@Penn

Regular Expression Types for XML

Author: Hosoya Haruo
Pierce Benjamin C.
Vouillon Jerome
Publication venue: ScholarlyCommons
Publication date: 01/01/2005

We propose regular expression types as a foundation for statically typed XML processing languages. Regular expression types, like most schema languages for XML, introduce regular expression notations such as repetition (*), alternation (|), etc., to describe XML documents. The novelty of our type system is a semantic presentation of subtyping, as inclusion between the sets of documents denoted by two types. We give several examples illustrating the usefulness of this form of subtyping in XML processing. The decision problem for the subtype relation reduces to the inclusion problem between tree automata, which is known to be EXPTIME-complete. To avoid this high complexity in typical cases, we develop a practical algorithm that, unlike classical algorithms based on determinization of tree automata, checks the inclusion relation by a top-down traversal of the original type expressions. The main advantage of this algorithm is that it can exploit the property that type expressions being compared often share portions of their representations. Our algorithm is a variant of Aiken and Murphy's set-inclusion constraint solver, to which are added several new implementation techniques, correctness proofs, and preliminary performance measurements on some small programs in the domain of typed XML processing.
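The idea of subtyping as set inclusion can be sketched in a much simpler setting. The Python code below is not the paper's algorithm: it works on flat sequences of tags (word languages) rather than XML trees, and it uses Brzozowski derivatives, a swapped-in textbook technique, to decide inclusion. All names (`sym`, `alt`, `cat`, `star`, `is_subtype`) are hypothetical. Like the approach in the abstract, it works directly on the type expressions instead of building and determinizing an automaton first.

```python
EMPTY = ("empty",)  # denotes no sequence at all
EPS = ("eps",)      # denotes only the empty sequence


def sym(a):
    return ("sym", a)


def alt(*rs):
    # Alternation kept as a set, so order and duplicates never matter.
    parts = set()
    for r in rs:
        if r[0] == "alt":
            parts |= r[1]
        elif r != EMPTY:
            parts.add(r)
    if not parts:
        return EMPTY
    if len(parts) == 1:
        return next(iter(parts))
    return ("alt", frozenset(parts))


def cat(r, s):
    if r == EMPTY or s == EMPTY:
        return EMPTY
    if r == EPS:
        return s
    if s == EPS:
        return r
    return ("cat", r, s)


def star(r):
    if r in (EMPTY, EPS):
        return EPS
    return r if r[0] == "star" else ("star", r)


def nullable(r):
    """True if the type accepts the empty sequence."""
    tag = r[0]
    if tag in ("eps", "star"):
        return True
    if tag == "cat":
        return nullable(r[1]) and nullable(r[2])
    if tag == "alt":
        return any(nullable(x) for x in r[1])
    return False


def deriv(r, a):
    """Type of what may follow once the tag `a` has been read."""
    tag = r[0]
    if tag == "sym":
        return EPS if r[1] == a else EMPTY
    if tag == "cat":
        rest = deriv(r[2], a) if nullable(r[1]) else EMPTY
        return alt(cat(deriv(r[1], a), r[2]), rest)
    if tag == "alt":
        return alt(*(deriv(x, a) for x in r[1]))
    if tag == "star":
        return cat(deriv(r[1], a), r)
    return EMPTY


def symbols(r):
    tag = r[0]
    if tag == "sym":
        return {r[1]}
    if tag == "cat":
        return symbols(r[1]) | symbols(r[2])
    if tag == "alt":
        return set().union(*(symbols(x) for x in r[1]))
    if tag == "star":
        return symbols(r[1])
    return set()


def is_subtype(r, s):
    """Decide whether every sequence denoted by r is also denoted by s."""
    alphabet = symbols(r) | symbols(s)
    seen, todo = set(), [(r, s)]
    while todo:
        pair = todo.pop()
        if pair in seen:
            continue
        seen.add(pair)
        a, b = pair
        if nullable(a) and not nullable(b):
            return False  # some sequence is accepted by r but not by s
        todo.extend((deriv(a, c), deriv(b, c)) for c in alphabet)
    return True


# Hypothetical types: "a followed by any number of b" versus "(a | b)*".
t1 = cat(sym("a"), star(sym("b")))
t2 = star(alt(sym("a"), sym("b")))
```

With these definitions, `is_subtype(t1, t2)` holds while `is_subtype(t2, t1)` does not, since `t2` accepts the empty sequence and `t1` does not. The `seen` set lets pairs of types that recur during the traversal be checked only once, which loosely echoes the abstract's remark about exploiting shared portions of type expressions.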

HAL Descartes

ScholarlyCommons@Penn

Hal-Diderot

Regular expression types for XML

Author: Aiken A.
Benjamin C. Pierce
Brandt M.
Buneman P.
Chawathe S. S.
Clark J.
Cluet S.
Damm F. M.
Fernández M. F.
Freeman T.
Frisch A.
Gapeyev V.
Gilleron R.
Goldman R.
Haruo Hosoya
Hornung T.
Jérôme Vouillon
Klarlund N.
Kuper G. M.
Milo T.
Papakonstantinou Y.
Shields M.
Wallace M.
Publication venue: Association for Computing Machinery (ACM)

Crossref