Search CORE

21,869 research outputs found

No users no dataspaces! Query-driven dataspace orchestration

Author: Fletcher George H. L.
MANDREOLI Federica
Publication venue: country:ITA
Publication date: 01/01/2016
Field of study

Data analysis in rich spaces of heterogeneous data sources is an increasingly common activity. Examples include querying the web of linked data and personal information management. Such analytics on dataspaces is often iterative and dynamic, in an open-ended interaction between discovery and data orchestration. The current state of the art in integration and orchestration in dataspaces is primarily geared towards close-ended analysis, targeting the discovery of stable data mappings or one-time, pay-as-you-go ad hoc data mappings. The perspective here is dataspace-centric. In this paper, we propose a shift to a user-centric perspective on dataspace orchestration. We outline basic conceptual and technical challenges in supporting data analytics which is open-ended and always evolving, as users respond to new discoveries and connections

Pure OAI Repository

Archivio istituzionale della ricerca - Università di Modena e Reggio Emilia

Hypermedia-based discovery for source selection using low-cost linked data interfaces

Author: Colpaert Pieter
Dimou Anastasia
Mannens Erik
Vander Sande Miel
Verborgh Ruben
Publication venue: 'IGI Global'
Publication date: 01/01/2016
Field of study

Evaluating federated Linked Data queries requires consulting multiple sources on the Web. Before a client can execute queries, it must discover data sources, and determine which ones are relevant. Federated query execution research focuses on the actual execution, while data source discovery is often marginally discussed-even though it has a strong impact on selecting sources that contribute to the query results. Therefore, the authors introduce a discovery approach for Linked Data interfaces based on hypermedia links and controls, and apply it to federated query execution with Triple Pattern Fragments. In addition, the authors identify quantitative metrics to evaluate this discovery approach. This article describes generic evaluation measures and results for their concrete approach. With low-cost data summaries as seed, interfaces to eight large real-world datasets can discover each other within 7 minutes. Hypermedia-based client-side querying shows a promising gain of up to 50% in execution time, but demands algorithms that visit a higher number of interfaces to improve result completeness

Ghent University Academic Bibliography

The Semantic Web MIDI Tape: An Interface for Interlinking MIDI and Context Metadata

Author: Bechhofer Sean
Daquino Marilena
David
Harte Christopher
Jacobson Kurt
Keil Klaus
Kent-Muller Anna
Meroño-Peñuela Albert
Meroño-Peñuela Albert
Raffel Colin
Raimond Yves
Urbano Julián
Urbano Julián
Urbano Julián
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 09/10/2018
Field of study

The Linked Data paradigm has been used to publish a large number of musical datasets and ontologies on the Semantic Web, such as MusicBrainz, AcousticBrainz, and the Music Ontology. Recently, the MIDI Linked Data Cloud has been added to these datasets, representing more than 300,000 pieces in MIDI format as Linked Data, opening up the possibility for linking fine-grained symbolic music representations to existing music metadata databases. Despite the dataset making MIDI resources available in Web data standard formats such as RDF and SPARQL, the important issue of finding meaningful links between these MIDI resources and relevant contextual metadata in other datasets remains. A fundamental barrier for the provision and generation of such links is the difficulty that users have at adding new MIDI performance data and metadata to the platform. In this paper, we propose the Semantic Web MIDI Tape, a set of tools and associated interface for interacting with the MIDI Linked Data Cloud by enabling users to record, enrich, and retrieve MIDI performance data and related metadata in native Web data standards. The goal of such interactions is to find meaningful links between published MIDI resources and their relevant contextual metadata. We evaluate the Semantic Web MIDI Tape in various use cases involving user-contributed content, MIDI similarity querying, and entity recognition methods, and discuss their potential for finding links between MIDI resources and metadata

Southampton (e-Prints Soton)

VU Research Portal

Crossref

Open Research Online (The Open University)

FigShare

RegenBase: a knowledge base of spinal cord injury biology for translational research.

Author: Abeyruwan Saminda W
Al-Ali Hassan
Bixby John L
Callahan Alison
Ferguson Adam R
Lemmon Vance P
Popovich Phillip G
Sakurai Kunie
Shah Nigam H
Visser Ubbo
Publication venue: eScholarship, University of California
Publication date: 01/01/2016
Field of study

Spinal cord injury (SCI) research is a data-rich field that aims to identify the biological mechanisms resulting in loss of function and mobility after SCI, as well as develop therapies that promote recovery after injury. SCI experimental methods, data and domain knowledge are locked in the largely unstructured text of scientific publications, making large scale integration with existing bioinformatics resources and subsequent analysis infeasible. The lack of standard reporting for experiment variables and results also makes experiment replicability a significant challenge. To address these challenges, we have developed RegenBase, a knowledge base of SCI biology. RegenBase integrates curated literature-sourced facts and experimental details, raw assay data profiling the effect of compounds on enzyme activity and cell growth, and structured SCI domain knowledge in the form of the first ontology for SCI, using Semantic Web representation languages and frameworks. RegenBase uses consistent identifier schemes and data representations that enable automated linking among RegenBase statements and also to other biological databases and electronic resources. By querying RegenBase, we have identified novel biological hypotheses linking the effects of perturbagens to observed behavioral outcomes after SCI. RegenBase is publicly available for browsing, querying and download.Database URL:http://regenbase.org

PubMed Central

eScholarship - University of California

Recommended from our members

Extracting and re-using research data from chemistry e-theses: the SPECTRa-T project

Author: Downing Jim
Harvey Matt
Morgan Peter
Murray-Rust Peter
Rzepa Henry S
Stewart Diana
Tonge Alan
Townsend Joseph A
Publication venue: 11th International Symposium on Electronic Theses and Dissertations
Publication date: 01/06/2008
Field of study

Scientific e-theses are data-rich resources, but much of the information they contain is not readily accessible. For chemistry, the SPECTRa-T project has addressed this problem by developing data-mining techniques to extract experimental data, creating RDF (Resource Description Framework) triples for exposure to sophisticated Semantic Web searches. We used OSCAR3, an Open Source chemistry text-mining tool, to parse and extract data from theses in PDF, and from theses in Office Open XML document format. Theses in PDF suffered data corruption and a loss of formatting that prevented the identification of chemical objects. Theses in .docx yielded semantically rich SciXML that enabled the additional extraction of associated data. Chemical objects were placed in a data repository, and RDF triples deposited in a triplestore. Data-mining from chemistry e-theses is both desirable and feasible; but the use of PDF, the de facto format standard for deposit in most repositories, prevents the optimal extraction of data for semantic querying. In order to facilitate this, we recommend that universities also require deposition of chemistry e-theses in an XML document format. Further work is required to clarify the complex IPR issues and ensure that they do not become an unwarranted barrier to data extraction and re-use

Apollo (Cambridge)

Recommended from our members

Telecommunication mashups using RESTful services

Author: Alvaro Rey Guillermo
Davies John
Domingue John
Duke Alistair
Lecue Freddy
Liu Dong
Maleshkova Maria
Mehandjiev Nikolay
Pedrinaci Carlos
Stincic Sandra
Publication venue
Publication date: 01/01/2010
Field of study

Open Research Online (The Open University)

The University of Manchester - Institutional Repository