25 research outputs found

    Project DEGREE: Bringing Grid into the Earth Science.

    No full text

    Data Management in Flood Prediction

    No full text

    Distributed web-scale infrastructure for crawling, indexing and search with semantic support

    No full text
    In this paper, we describe our work in progress on web-scale information extraction and information retrieval using distributed computing. We present a distributed architecture built on top of the MapReduce paradigm for information retrieval, information processing and intelligent search supported by spatial capabilities. The proposed architecture focuses on crawling documents in several different formats, information extraction, lightweight semantic annotation of the extracted information, indexing of the extracted information and, finally, indexing of documents based on the geo-spatial information found in a document. We demonstrate the architecture on two use cases, the first being search in job offers retrieved from the LinkedIn portal and the second being search in BBC news feeds, and discuss several problems we had to face during the implementation. We also discuss spatial search applications for both cases, because both LinkedIn job offer pages and BBC news feeds contain a lot of spatial information to extract and process.

    Services for replica consistency handling in data grids

    No full text

    Improving inter-enterprise collaboration with recommendation tool based on lightweight semantics in emails

    No full text
    In the current era of web and mobile applications, classic email is still one of the most popular means of communication over the Internet. Although beset by many problems such as spam and information overload, it yields significant benefits, especially to enterprise users when communicating, collaborating or solving business tasks. In addition, current groupware frameworks still do not adequately support collaboration between large enterprises (LEs) and small and medium-sized enterprises (SMEs). In this paper, we present an approach that addresses the challenges of inter-enterprise collaboration, combining email as a tool for information sharing with lightweight semantics to automate the processing of the knowledge extracted from emails and their attachments. The building blocks of our approach are: connectors to legacy applications that retain current working styles, use of email for collaboration especially on the SME side, information extraction enabling semantic search and recommendation in inter-enterprise collaborations, automatic tagging of documents, and integration of existing business process models. To evaluate our approach we use three different scenarios: new product development, software development supply chain, and supply chain collaboration between SMEs and LEs. Evaluation results suggest that our solution benefits the collaborating enterprises.

    Information searching for an experience management platform of the EU Pellucid project

    No full text
    The EU Pellucid project is developing an experience management system for public organizations with staff mobility. The paper presents an activity within the project focused on searching for information in repositories of documents. The project's background and the process of information searching are described. Ontological methods such as semantic annotation and similarity searching, as well as ontology- and full-text-based searching, are presented. Monitoring of organizational repositories is discussed.

    DDG Task Recovery for Cluster Computing

    No full text
    This paper presents a solution to the problem of transparent recovery of asynchronous distributed computation on clusters of workstations when a fault occurs on a node. If the system has fault-tolerant features, it can survive the fault and continue its computations. Performance degradation is unavoidable when hardware redundancies are not available. It is a large advantage if a long-running application can restart from a checkpoint instead of restarting the whole computation. This paper presents the fault-tolerant feature of the DDG environment, which is oriented to cluster systems without hardware spares.

    Mapping and load balancing on distributed memory systems

    No full text
    Two kinds of tools are necessary to optimise the use of available resources during the execution of parallel programs on distributed memory systems: mapping and load balancing tools. A mapping tool is well suited to programs whose behaviour is predictable, while for many "real applications" it needs to be complemented by a dynamic load balancing tool. Both tools are currently being investigated for inclusion in the programming environment designed by the SEPP COPERNICUS project.

    Data Mining and Integration for Environmental Data Archives

    No full text