984 research outputs found

    XML: aplicações e tecnologias associadas: 6th National Conference

    Get PDF
    This volume contains the papers presented at the Sixth Portuguese XML Conference, called XATA (XML, Aplicações e Tecnologias Associadas), held in Évora, Portugal, 14-15 February, 2008. The conference followed on from a successful series held throughout Portugal in the last years: XATA2003 was held in Braga, XATA2004 was held in Porto, XATA2005 was held in Braga, XATA2006 was held in Portalegre and XATA2007 was held in Lisboa. Dued to research evaluation criteria that are being used to evaluate researchers and research centers national conferences are becoming deserted. Many did not manage to gather enough submissions to proceed in this scenario. XATA made it through. However with a large decrease in the number of submissions. In this edition a special meeting will join the steering committee with some interested attendees to discuss XATA's future: internationalization, conference model, ... We think XATA is important in the national context. It has succeeded in gathering and identifying a comunity that shares the same research interests and has promoted some colaborations. We want to keep "the wheel spinning"... This edition has its program distributed by first day's afternoon and next day's morning. This way we are facilitating travel arrangements and we will have one night to meet

    XMPP for cloud computing in bioinformatics supporting discovery and invocation of asynchronous web services

    Get PDF
    Background: Life sciences make heavily use of the web for both data provision and analysis. However, the increasing amount of available data and the diversity of analysis tools call for machine accessible interfaces in order to be effective. HTTP-based Web service technologies, like the Simple Object Access Protocol (SOAP) and REpresentational State Transfer (REST) services, are today the most common technologies for this in bioinformatics. However, these methods have severe drawbacks, including lack of discoverability, and the inability for services to send status notifications. Several complementary workarounds have been proposed, but the results are ad-hoc solutions of varying quality that can be difficult to use. Results: We present a novel approach based on the open standard Extensible Messaging and Presence Protocol (XMPP), consisting of an extension (IO Data) to comprise discovery, asynchronous invocation, and definition of data types in the service. That XMPP cloud services are capable of asynchronous communication implies that clients do not have to poll repetitively for status, but the service sends the results back to the client upon completion. Implementations for Bioclipse and Taverna are presented, as are various XMPP cloud services in bio- and cheminformatics. Conclusion: XMPP with its extensions is a powerful protocol for cloud services that demonstrate several advantages over traditional HTTP-based Web services: 1) services are discoverable without the need of an external registry, 2) asynchronous invocation eliminates the need for ad-hoc solutions like polling, and 3) input and output types defined in the service allows for generation of clients on the fly without the need of an external semantics description. The many advantages over existing technologies make XMPP a highly interesting candidate for next generation online services in bioinformatics

    A unified view of data-intensive flows in business intelligence systems : a survey

    Get PDF
    Data-intensive flows are central processes in today’s business intelligence (BI) systems, deploying different technologies to deliver data, from a multitude of data sources, in user-preferred and analysis-ready formats. To meet complex requirements of next generation BI systems, we often need an effective combination of the traditionally batched extract-transform-load (ETL) processes that populate a data warehouse (DW) from integrated data sources, and more real-time and operational data flows that integrate source data at runtime. Both academia and industry thus must have a clear understanding of the foundations of data-intensive flows and the challenges of moving towards next generation BI environments. In this paper we present a survey of today’s research on data-intensive flows and the related fundamental fields of database theory. The study is based on a proposed set of dimensions describing the important challenges of data-intensive flows in the next generation BI setting. As a result of this survey, we envision an architecture of a system for managing the lifecycle of data-intensive flows. The results further provide a comprehensive understanding of data-intensive flows, recognizing challenges that still are to be addressed, and how the current solutions can be applied for addressing these challenges.Peer ReviewedPostprint (author's final draft

    Workflow Standards and XML

    Get PDF

    SemLinker: automating big data integration for casual users

    Get PDF
    A data integration approach combines data from different sources and builds a unified view for the users. Big data integration inherently is a complex task, and the existing approaches are either potentially limited or invariably rely on manual inputs and interposition from experts or skilled users. SemLinker, an ontology-based data integration system, is part of a metadata management framework for personal data lake (PDL), a personal store-everything architecture. PDL is for casual and unskilled users, therefore SemLinker adopts an automated data integration workflow to minimize manual input requirements. To support the flat architecture of a lake, SemLinker builds and maintains a schema metadata level without involving any physical transformation of data during integration, preserving the data in their native formats while, at the same time, allowing them to be queried and analyzed. Scalability, heterogeneity, and schema evolution are big data integration challenges that are addressed by SemLinker. Large and real-world datasets of substantial heterogeneities are used in evaluating SemLinker. The results demonstrate and confirm the integration efficiency and robustness of SemLinker, especially regarding its capability in the automatic handling of data heterogeneities and schema evolutions

    Metainformationssysteme – Backbone der Anwendungssystemkopplung

    Full text link
    Die Kopplung von Anwendungssystemen ist als komplexes Entwicklungsproblem im Sinne der Wirtschaftsinformatik zu begreifen. Der Beitrag ordnet aktuelle Standards und Technologien den Entwicklungsphasen der Informationssystementwicklung als Gestaltungsoptionen zu. Anhand von Terminologien und Nachrichtenstandards wird die Bedeutung von Metainformationssystemen gezeigt und es wird die Architektur der Terminologischen Klammer zur Kopplung von Anwendungssystemen eingeführt. Mittels der Kombination von Entwicklungsphasen und Abstraktionsebenen wird ein Rahmenmodell zur Kopplung von Anwendungssystemen eingeführt, welches der Strukturierung von Entwicklungsaufgaben und Beziehungen von Metainformationssystemen bei der Anwendungssystemkopplung dient. <br/

    An E-Learning Semantic Grid for Life science Education

    Get PDF
    There are a lot of life science databases and services on the Internet nowadays, especially in life science e-science. In this paper, we will present an E-Learning Semantic Grid that integrates these resources provided by both teachers and scientists for life science education. It uses domain ontologies to integrate these heterogeneous life science database and service resources, and supports ontology-based e-learning data-sharing and service-coordination for life science teachers and students in an e-learning virtual organization. Our system provides life science students with semantically superior experience in learning activities, and also extends the function of life science e-science. It has a promising future in the domain of life science education

    A Contribution to the E-Framework: a Specification of a Programming Exercise Evaluation Service

    Get PDF
    This work is a contribution to the e-Framework, arguably the mostprominent e-learning framework today, and consists of the definition ofa service for the automatic evaluation of programming exercises. Thisevaluation domain differs from trivial evaluations modelled bylanguages such as the IMS Question & Test Interoperability (QTI)specification. Complex evaluation domains justify the development ofspecialized evaluators that participate in several business processes.These business processes can combine other type of systems such asProgramming Contest Management Systems, Learning ManagementSystems, Integrated Development Environments and Learning ObjectRepositories where programming exercises are stored as LearningObjects. This contribution describes the implementation approachesused, more precisely, behaviours & requests, use & interactions,applicable standards, interface definition and usage scenarios
    • …
    corecore