2 research outputs found

    A graphical environment for change detection in structured documents

    Change detection in structured documents (e.g. SGML) is important in data warehousing, digital libraries and Internet databases. This thesis presents a graphical environment for detecting changes in structured documents. We represent each document by an ordered labeled tree based on the underlying markup language. We then compare two documents by invoking previously developed algorithms for approximate pattern matching and pattern discovery in trees. Several operators are developed to support the comparison of the documents; graphical devices are provided to facilitate the use of the operators. We believe the proposed tool is useful not only for document management but also for software maintenance, particularly configuration management and version control, where programs are represented as parse trees and detecting changes in the trees provides a way to find the syntactic differences between two program versions.
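
    To illustrate the idea of comparing two documents as ordered labeled trees, here is a minimal Python sketch. It is not the thesis's algorithm: the abstract does not say which approximate matching algorithms are invoked, so this sketch substitutes a simple top-down edit distance in which relabeling a node costs 1 and inserting or deleting a child subtree costs its node count. The names `Node`, `size` and `top_down_distance` are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One element of a document, e.g. an SGML tag (hypothetical type)."""
    label: str
    children: list = field(default_factory=list)


def size(t):
    """Number of nodes in the subtree rooted at t."""
    return 1 + sum(size(c) for c in t.children)


def top_down_distance(a, b):
    """Simple top-down edit distance between two ordered labeled trees.

    Assumed cost model (not from the source): relabel = 1,
    insert or delete a whole child subtree = its node count.
    """
    relabel = 0 if a.label == b.label else 1
    m, n = len(a.children), len(b.children)
    # d[i][j]: cost of turning the first i children of a
    # into the first j children of b, preserving order.
    d = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        d[i][0] = d[i - 1][0] + size(a.children[i - 1])
    for j in range(1, n + 1):
        d[0][j] = d[0][j - 1] + size(b.children[j - 1])
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            d[i][j] = min(
                d[i - 1][j] + size(a.children[i - 1]),   # delete subtree
                d[i][j - 1] + size(b.children[j - 1]),   # insert subtree
                d[i - 1][j - 1]
                + top_down_distance(a.children[i - 1], b.children[j - 1]),
            )
    return relabel + d[m][n]


old = Node("doc", [Node("title"), Node("para")])
new = Node("doc", [Node("title"), Node("para"), Node("para")])
print(top_down_distance(old, new))
```

    A distance of zero means the two trees are identical under this cost model; a small nonzero value points to a localized change such as one added paragraph.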

    Version Management in Structured Document Retrieval Systems

    A structured document retrieval system (SDRS) is composed of management control, structured documents and indexes. A version control mechanism is needed to support the management of versions of document content, structure and indexes. In this paper, we propose a version control model to represent, create, store and manipulate versions of documents and indexes. We store only the differences between two versions of a document or a composite element. We use indexes with weighting factors to support ranked queries. The index storage model also stores only the differences between the indexes of two versions. In addition, we discuss a mapping table approach to reduce redundant index updates. 1 Introduction Recent research on the storage and retrieval of structured documents, whose structure is represented using markup tags, has indicated the necessity of version management for the design of a structured document retrieval system. Version control mechanisms have been applied to versioning for CAD, the d..
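
    The idea of storing only the differences between two versions can be sketched in Python as follows. This is an illustration under stated assumptions, not the paper's storage model: it treats a document as a list of lines and uses the standard library's `difflib.SequenceMatcher` to compute the delta. The function names `make_delta` and `apply_delta` are hypothetical.

```python
import difflib


def make_delta(old, new):
    """Return only the changed spans needed to turn `old` into `new`.

    Each entry is (start, end, replacement): replace old[start:end]
    with the replacement lines. Unchanged spans are not stored.
    """
    matcher = difflib.SequenceMatcher(a=old, b=new, autojunk=False)
    return [
        (i1, i2, new[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]


def apply_delta(old, delta):
    """Rebuild the newer version from the older one plus its delta."""
    out, pos = [], 0
    for start, end, replacement in delta:
        out.extend(old[pos:start])   # copy the unchanged span
        out.extend(replacement)      # splice in the changed lines
        pos = end
    out.extend(old[pos:])
    return out


v1 = ["<doc>", "<p>a</p>", "</doc>"]
v2 = ["<doc>", "<p>a</p>", "<p>b</p>", "</doc>"]
delta = make_delta(v1, v2)
print(apply_delta(v1, delta) == v2)
```

    Only `v1` and `delta` need to be kept; `v2` is reconstructed on demand. The same pattern could in principle apply to index entries, although the abstract does not describe how index differences are encoded.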