11,842 research outputs found

Evoluce XML schémat (Evolution of XML Schemas)

Author: Malý Jakub
Publication venue: Univerzita Karlova, Matematicko-fyzikální fakulta
Publication date: 01/01/2012

In this work we study the evolution of XML data, and in particular the reasons for and consequences of XML schema evolution. The thesis contains a survey of existing approaches to this problem. The approach presented here extends the XSem conceptual model with support for multiple versions of the model. Thanks to this extension, it is possible to define a set of changes between two versions of a schema. The thesis also describes an algorithm that compares two versions of a schema and produces a revalidation script in XSL. Department of Software Engineering, Faculty of Mathematics and Physics.

CU Digital Repository

Evoluce XML schémat (Evolution of XML Schemas)

Author: Malý Jakub
Publication venue: Univerzita Karlova, Matematicko-fyzikální fakulta
Publication date: 01/01/2010

In this work we study the evolution of XML data, and in particular the reasons for and consequences of XML schema evolution. The thesis contains a survey of existing approaches to this problem. The approach presented here extends the XSem conceptual model with support for multiple versions of the model. Thanks to this extension, it is possible to define a set of changes between two versions of a schema. The thesis also describes an algorithm that compares two versions of a schema and produces a revalidation script in XSL. Department of Software Engineering, Faculty of Mathematics and Physics.

CU Digital Repository

A Call to Arms: Revisiting Database Design

Author: Antonio Badia
Berners-Lee T.
Chauduri S.
Daniel Lemire
Golab L.
Helland P.
Kiely G.
Nagarajan S.
Olivé A.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2011

Good database design is crucial to obtain a sound, consistent database, and - in turn - good database design methodologies are the best way to achieve the right design. These methodologies are taught to most Computer Science undergraduates, as part of any Introduction to Database class. They can be considered part of the "canon", and indeed, the overall approach to database design has been unchanged for years. Moreover, none of the major database research assessments identify database design as a strategic research direction. Should we conclude that database design is a solved problem? Our thesis is that database design remains a critical unsolved problem. Hence, it should be the subject of more research. Our starting point is the observation that traditional database design is not used in practice - and if it were used it would result in designs that are not well adapted to current environments. In short, database design has failed to keep up with the times. In this paper, we put forth arguments to support our viewpoint, analyze the root causes of this situation and suggest some avenues of research.

arXiv.org e-Print Archive

CiteSeerX

R-libre

Crossref

XML content warehousing: Improving sociological studies of mailing lists and web data

Author: Colazzo Dario
Dudouet François-Xavier
Manolescu Ioana
Nguyen Benjamin
Senellart Pierre
Vion Antoine
Publication date: 01/01/2011

In this paper, we present the guidelines for an XML-based approach for the sociological study of Web data such as the analysis of mailing lists or databases available online. The use of an XML warehouse is a flexible solution for storing and processing this kind of data. We propose an implemented solution and show possible applications with our case study of profiles of experts involved in W3C standard-setting activity. We illustrate the sociological use of semi-structured databases by presenting our XML Schema for mailing-list warehousing. An XML Schema allows many adjunctions or crossings of data sources, without modifying existing data sets, while allowing possible structural evolution. We also show that the existence of hidden data implies increased complexity for traditional SQL users. XML content warehousing allows both exhaustive warehousing and recursive queries through contents, with far less dependence on the initial storage. We finally present the possibility of exporting the data stored in the warehouse to commonly-used advanced software devoted to sociological analysis.

arXiv.org e-Print Archive

HAL-CentraleSupelec

Base de publications de l'université Paris-Dauphine

HAL AMU

INRIA a CCSD electronic archive server

NOSQL design for analytical workloads: Variability matters

Author: Abelló Gamazo Alberto
Herrero Otal Víctor
Romero Moral Óscar
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016

Big Data has recently gained popularity and has strongly questioned relational databases as universal storage systems, especially in the presence of analytical workloads. As a result, co-relational alternatives, commonly known as NOSQL (Not Only SQL) databases, are extensively used for Big Data. As the primary focus of NOSQL is on performance, NOSQL databases are directly designed at the physical level, and consequently the resulting schema is tailored to the dataset and access patterns of the problem at hand. However, we believe that NOSQL design can also benefit from traditional design approaches. In this paper we present a method to design databases for analytical workloads. Starting from the conceptual model and adopting the classical 3-phase design used for relational databases, we propose a novel design method considering the new features brought by NOSQL and encompassing relational and co-relational design altogether.

UPCommons. Portal del coneixement obert de la UPC

Requirement-driven creation and deployment of multidimensional and ETL designs

Author: Abelló Gamazo Alberto
Jovanovic Petar
Romero Moral Óscar
Simitsis Alkis
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012

We present our tool for assisting designers in the error-prone and time-consuming tasks carried out at the early stages of a data warehousing project. Our tool semi-automatically produces multidimensional (MD) and ETL conceptual designs from a given set of business requirements (like SLAs) and data source descriptions. Subsequently, our tool translates both the MD and ETL conceptual designs produced into physical designs, so they can be further deployed on a DBMS and an ETL engine. In this paper, we describe the system architecture and present our demonstration proposal by means of an example.

UPCommons. Portal del coneixement obert de la UPC

Ontology technology for the development and deployment of learning technology systems - a survey

Author: Holohan Edmond
Pahl Claus
Publication venue: 'Association for the Advancement of Computing in Education (AACE)'
Publication date: 01/06/2004

The World-Wide Web is undergoing dramatic changes at the moment. The Semantic Web is an initiative to bring meaning to the Web. The Semantic Web is based on ontology technology – a knowledge representation framework – at its core. We illustrate the importance of this evolutionary development. We survey five scenarios demonstrating different forms of applications of ontology technologies in the development and deployment of learning technology systems. Ontology technologies are highly useful to organise, personalise, and publish learning content and to discover, generate, and compose learning objects.

Irish Universities

DCU Online Research Access Service

Approaches to Semantic Web Services: An Overview and Comparison

Author: B. Omelayenko
D. Fensel
D. Wu
E. Motta
M. Paolucci
S. McIlraith
T. Berners-Lee
Publication date: 01/01/2004

The next Web generation promises to deliver Semantic Web Services (SWS): services that are self-described and amenable to automated discovery, composition and invocation. A prerequisite to this, however, is the emergence and evolution of the Semantic Web, which provides the infrastructure for the semantic interoperability of Web Services. Web Services will be augmented with rich formal descriptions of their capabilities, such that they can be utilized by applications or other services without human assistance or highly constrained agreements on interfaces or protocols. Thus, Semantic Web Services have the potential to change the way knowledge and business services are consumed and provided on the Web. In this paper, we survey the state of the art of current enabling technologies for Semantic Web Services. In addition, we characterize the infrastructure of Semantic Web Services along three orthogonal dimensions: activities, architecture and service ontology. Further, we examine and contrast three current approaches to SWS according to the proposed dimensions.

CiteSeerX

Southampton (e-Prints Soton)

Crossref

Open Research Online (The Open University)

Encoding models for scholarly literature

Author: Holmes Martin
Romary Laurent
Publication venue: 'IGI Global'
Publication date: 03/06/2009

We examine the issue of digital formats for document encoding, archiving and publishing, through the specific example of "born-digital" scholarly journal articles. We will begin by looking at the traditional workflow of journal editing and publication, and how these practices have made the transition into the online domain. We will examine the range of different file formats in which electronic articles are currently stored and published. We will argue strongly that, despite the prevalence of binary and proprietary formats such as PDF and MS Word, XML is a far superior encoding choice for journal articles. Next, we look at the range of XML document structures (DTDs, Schemas) which are in common use for encoding journal articles, and consider some of their strengths and weaknesses. We will suggest that, despite the existence of specialized schemas intended specifically for journal articles (such as NLM), and more broadly-used publication-oriented schemas such as DocBook, there are strong arguments in favour of developing a subset or customization of the Text Encoding Initiative (TEI) schema for the purpose of journal-article encoding; TEI is already in use in a number of journal publication projects, and the scale and precision of the TEI tagset makes it particularly appropriate for encoding scholarly articles. We will outline the document structure of a TEI-encoded journal article, and look in detail at suggested markup patterns for specific features of journal articles.

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server