19 research outputs found

    An XML Query Engine for Network-Bound Data

    Get PDF
    XML has become the lingua franca for data exchange and integration across administrative and enterprise boundaries. Nearly all data providers are adding XML import or export capabilities, and standard XML Schemas and DTDs are being promoted for all types of data sharing. The ubiquity of XML has removed one of the major obstacles to integrating data from widely disparate sources –- namely, the heterogeneity of data formats. However, general-purpose integration of data across the wide area also requires a query processor that can query data sources on demand, receive streamed XML data from them, and combine and restructure the data into new XML output -- while providing good performance for both batch-oriented and ad-hoc, interactive queries. This is the goal of the Tukwila data integration system, the first system that focuses on network-bound, dynamic XML data sources. In contrast to previous approaches, which must read, parse, and often store entire XML objects before querying them, Tukwila can return query results even as the data is streaming into the system. Tukwila is built with a new system architecture that extends adaptive query processing and relational-engine techniques into the XML realm, as facilitated by a pair of operators that incrementally evaluate a query’s input path expressions as data is read. In this paper, we describe the Tukwila architecture and its novel aspects, and we experimentally demonstrate that Tukwila provides better overall query performance and faster initial answers than existing systems, and has excellent scalability

    РЕАЛИЗАЦИЯ БАЗЫ ЗНАНИЙ СТРУКТУРНЫХ ОБЪЕКТОВ

    Get PDF
    The problem concerning storage of knowledge fragments presented as XML documents in relation DBMS is discussed in the paper. The paper presents a database scheme that can store «disassembled» XML documents. The paper describes methodology for searching such structure in the data base. A mechanism for creation of user’s demands is given in the paperОбсуждается проблема хранения фрагментов знаний, представленных в виде XML-документов, в реляционной СУБД. Представлена схема БД, в которой можно хранить «разобранные» XML-документы. Описаны методики осуществления поиска в БД такой структуры, а также механизм создания пользовательских запросов

    ОРГАНИЗАЦИЯ ХРАНЕНИЯ СТРУКТУРНЫХ ОБЪЕКТОВ, ПРЕДСТАВЛЕННЫХ В ВИДЕ XML-ДОКУМЕНТОВ, В РЕЛЯЦИОННЫХ БАЗАХ ДАННЫХ

    Get PDF
    Обсуждается проблема хранения фрагментов знаний, представленных в виде XML-доку­ментов, в реляционной системе управления базой данных. Представляется схема базы данных, в которой можно хранить разобранные XML-документы. Описываются методики осуществления поиска в базе данных  такой структуры, а также механизм создания пользовательских запросов

    СИНТЕЗ СТРУКТУР СИСТЕМ СБОРА ДАННЫХ, УСТОЙЧИВЫХ К ОТКАЗАМ ОДНОТИПНЫХ ЭЛЕМЕНТОВ

    Get PDF
    The paper considers method for structural synthesis of a distributed hierarchical system for data collection that makes it possible to determine a structure which is resistant to failures of its separate elements.Рассматривается метод структурного синтеза распределенной иерархической системы сбора данных, позволяющий определить структуру, устойчивую к отказам ее отдельных элементо

    The XL Web Service Language : Concepts and Implementation

    Get PDF
    The XL programming language has been build on two very simple premises. First, XML is the forthcoming language used to describe and communicate complex data. Second, services provided via the internet are complex but loosely coupled and use XML. Services are neither bound to a certain platform, computer, or application scenario. The interaction between service provider and consumer is based on availability and reliability of interface descriptions and the coherence to internet standards like HTTP and XML. The XL language provides the means to easily describe complex services based on the XML data model, the XML query language XQuery and an XML storage model. In the following, the ideas behind the XL language, the language itself, and the XL runtime engine used as a prototype will be described in detail. Furthermore, different statement processing concepts, different usage scenarios and the non-functional requirements of the runtime engine itself are discusse

    Internet based molecular collaborative and publishing tools

    No full text
    The scientific electronic publishing model has hitherto been an Internet based delivery of electronic articles that are essentially replicas of their paper counterparts. They contain little in the way of added semantics that may better expose the science, assist the peer review process and facilitate follow on collaborations, even though the enabling technologies have been around for some time and are mature. This thesis will examine the evolution of chemical electronic publishing over the past 15 years. It will illustrate, which the help of two frameworks, how publishers should be exploiting technologies to improve the semantics of chemical journal articles, namely their value added features and relationships with other chemical resources on the Web. The first framework is an early exemplar of structured and scalable electronic publishing where a Web content management system and a molecular database are integrated. It employs a test bed of articles from several RSC journals and supporting molecular coordinate and connectivity information. The value of converting 3D molecular expressions in chemical file formats, such as the MOL file, into more generic 3D graphics formats, such as Web3D, is assessed. This exemplar highlights the use of metadata management for bidirectional hyperlink maintenance in electronic publishing. The second framework repurposes this metadata management concept into a Semantic Web application called SemanticEye. SemanticEye demonstrates how relationships between chemical electronic articles and other chemical resources are established. It adapts the successful semantic model used for digital music metadata management by popular applications such as iTunes. Globally unique identifiers enable relationships to be established between articles and other resources on the Web and SemanticEye implements two: the Document Object Identifier (DOI) for articles and the IUPAC International Chemical Identifier (InChI) for molecules. SemanticEye’s potential as a framework for seeding collaborations between researchers, who have hitherto never met, is explored using FOAF, the friend-of-a-friend Semantic Web standard for social networks

    Earth Observation Open Science and Innovation

    Get PDF
    geospatial analytics; social observatory; big earth data; open data; citizen science; open innovation; earth system science; crowdsourced geospatial data; citizen science; science in society; data scienc
    corecore