40 research outputs found

    Document Type De�nition (DTD) Metrics

    Get PDF
    In this paper, we present two complexity metrics for the assessment of schema quality written in Document Type De�finition (DTD) language. Both "Entropy (E) metric: E(DTD)" and "Distinct Structured Element Repetition Scale (DSERS) metric: DSERS(DTD)" are intended to measure the structural complexity of schemas in DTD language. These metrics exploit a directed graph representation of schema document and consider the complexity of schema due to its similar structured elements and the occurrences of these elements. The empirical and theoretical validations of these metrics prove the robustness of the metrics

    Everything You Ever Wanted to Know About DTDs, But Were Afraid to Ask (Extended Abstract)

    No full text

    Keys for XML

    Get PDF
    this paper. The proposal extends the key speci cation of XML Data by allowing one to specify keys in terms of XPath [24] expressions. There are a number of technical problems in connection with XPath. XPath is a relatively complex language in which one can not only move down the document tree, but also sideways or upwards, not to mention that predicates and functions can be embedded as well. The problem with XPath is that questions about equivalence or inclusion of XPath expressions are, as far as the authors are aware, unresolved; and these issues are importantifwewant to reason about keys as wedo in relational databases. Yet until we know how to determine the equivalence of XPath expressions, there is no general method of saying whether two such speci cations are equivalent. Another technical issue is value equality. XML Schema restricts equality to text, but the authors have encountered cases in whichkeys are not so restricted. See Section 7.1 for a more detailed discussion. However, the main reason for writing this paper is that none of the existing key proposals address the issue of hierarchical keys, which appear to be ubiquitous in hierarchically structured databases, especially in scienti c data formats. A top-level key may be used to identify components of a document, and within each component a secondary key is used to identify sub-components, and so on. Moreover, the authors believe that the use of keys for citing parts of a document is suciently important that it is appropriate to consider key speci cation independently of other proposals for constraining the structure of XML document

    Xml Beyond The Tags

    Get PDF
    XML is quickly being utilized in the field of technical communication to transfer information from database to person and company to company. Often communicators will structure information without a second thought of how or why certain tags are used to mark up the information. Because the company or a manual says to use those tags, the communicator does so. However, if professionals want to unlock the true potential of XML for better sharing of information across platforms, they need to understand the effects the technology using XML as well as political and cultural factors have on the tags being used. This thesis reviewed literature from multiple fields utilizing XML to find how tag choices can be influenced. XML allows for the sharing of information across multiple platforms and databases. Because of this efficiency, XML is utilized by many technologies. Often communicators must tag information so that the technologies can find the marked up information; therefore, technologies like single sourcing, data mining, and knowledge management influence the types of tags created. Additionally, cultural and political influences are analyzed to see how they play a role in determining what tags are used and created for specific documents. The thesis concludes with predictions on the future of XML and the technological, political, and cultural influences associated with XML tag sets based on information found within the thesis

    Literary texts in an electronic age: Scholarly implications and library services [papers presented at the 1994 Clinic on Library applications of Data Processing, April 10-12, 1994]

    Get PDF
    Authors and readers in an age of electronic texts / Jay David Bolter -- Electronic texts in the humanities : a coming of age / Susan Hockey -- The Text Encoding Initiative : electronic text markup for research / C.M. Sperberg-McQueen -- Electronic texts and multimedia in the academic library : a view from the front line / Anita K. Lowry -- Humanizing information technology : cultural evolution and the institutionalization of electronic text processing / Mark Tyler Day -- Cohabiting with copyright on the nets / Mary Brandt Jensen -- The role of the scholarly publisher in an electronic environment / Lorrie LeJeune -- The feasibility of wide-area textual analysis systems in libraries : a practical analysis / John Price-Wilkin -- The scholar and his library in the computer age / James W. Marchand -- The challenges of electronic texts in the library : bibliographic control and access / Rebecca S. Guenther -- Durkheim???s imperative : the role of humanities faculty in the information technologies revolution / Robert Alun Jones -- The materiality of the book : another turn of the screw / Terry Belanger.published or submitted for publicatio

    From paper to digital documents : Challenging and improving the SGML approach

    Get PDF
    This research has been initiated on the basis of practical experiences in developing a relatively large SGML system at the University of Oslo. This thesis contributes to the field of information systems, with a particular focus on document systems. The aim of this work is to inform the design of document systems by considering the transformation from paper to digital documents in organizations. The Standard Generalized Markup Language (SGML, ISO 8879) approach is emphasized. The SGML approach takes the documents' structure and content as the starting point in design, and regards the document as a collection of structured information. This approach is challenged and tentatively improved by empirical studies of documents in use and theoretical considerations of artifacts at work. The research approach has been an Action Case, as defined by Vidgen and Braa (1997). The interpretation of the transformation process from paper to digital documents is based mainly on an in-depth case study that was conducted at a Norwegian news agency from January 1996 to March 1998. The empirical findings are discussed according to theoretical concepts that emphasize the significance of artifacts at work to illuminate the various roles of documents at work. Concepts from the Actor Network Theory (ANT) (for example, see Callon, 1986; Latour, 1987; Law, 1986) are applied to emphasize the interrelations of humans and artifacts, as well as the importance of artifacts' properties in these relations. The concepts of 'boundary object' (Star and Griesemer; 1989) and 'borderline issues' (Brown and Duguid, 1994) are applied to get various perspectives on the actor-network. The study illustrates that it is challenging to substitute paper documents with SGML documents. Firstly, two different types of technology, with different properties and features, are exchanged. By removing paper documents, we also remove resources that go beyond the canonical meaning of the artifact. These resources are related to paper as a technology. Secondly, the document perspective in SGML is too restricted in relation to the various perspectives on documents in practical use. The emphasis on structure complicates the production of documents. Thirdly, the application of shared document models across work practices turns the various heterogeneous actor-networks into one network, which requires a common objective among the actors involved. The dilemma of "who does the job and who gets the benefits" (Grudin, 1989; 1994) arises as well. The study indicates that an investigation of the actor-networks that include documents provides an insight into the more hidden aspects of work. By regarding documents' central, peripheral, local and shared properties, one can gain an understanding of how documents are embedded in work, including the importance of documents and related artifacts to aspects such as awareness, articulation and coordination of work. The properties determine how things become interrelated into heterogeneous networks. The research shows how a document's properties or inscriptions are essential to its production and application in use. Insight into these prerequisites helps us to understand how the computer system can fit into work practices, even if we do have no guarantees that it will be used in the way that we expect. According to design, work practices are improved by changing the technical properties or the technical fundamentals, by adding various inscriptions into the system. This thesis describes how an existing system was improved by the use of 'gateways'. In the design of the gateways, the idea has been to keep the technical possibilities that SGML provides, and at the same time take into account our knowledge about the paperwork

    Gallup Herald, 03-24-1923

    Get PDF
    https://digitalrepository.unm.edu/gallup_herald_news/1093/thumbnail.jp

    Wittgenstein and Contemporary Theories of Language

    Get PDF
    Papers read at the French-Norwegian seminar in Skjolden, 23-26 May 199
    corecore