Search CORE

9,603 research outputs found

DataHub: Collaborative Data Science & Dataset Version Management at Scale

Author: Bhardwaj Anant
Bhattacherjee Souvik
Chavan Amit
Deshpande Amol
Elmore Aaron J.
Madden Samuel
Parameswaran Aditya G.
Publication venue
Publication date: 02/09/2014
Field of study

Relational databases have limited support for data collaboration, where teams collaboratively curate and analyze large datasets. Inspired by software version control systems like git, we propose (a) a dataset version control system, giving users the ability to create, branch, merge, difference and search large, divergent collections of datasets, and (b) a platform, DataHub, that gives users the ability to perform collaborative data analysis building on this version control system. We outline the challenges in providing dataset version control at scale.Comment: 7 page

arXiv.org e-Print Archive

CiteSeerX

DSpace@MIT

Analysing imperfect temporal information in GIS using the Triangular Model

Author: De Maeyer Philippe
De Tré Guy
Delafontaine Matthias
Neutens Tijs
Qiang Yi
Stichelbaut Birger
Van de Weghe Nico
Publication venue: 'Maney Publishing'
Publication date: 01/01/2012
Field of study

Rough set and fuzzy set are two frequently used approaches for modelling and reasoning about imperfect time intervals. In this paper, we focus on imperfect time intervals that can be modelled by rough sets and use an innovative graphic model [i.e. the triangular model (TM)] to represent this kind of imperfect time intervals. This work shows that TM is potentially advantageous in visualizing and querying imperfect time intervals, and its analytical power can be better exploited when it is implemented in a computer application with graphical user interfaces and interactive functions. Moreover, a probabilistic framework is proposed to handle the uncertainty issues in temporal queries. We use a case study to illustrate how the unique insights gained by TM can assist a geographical information system for exploratory spatio-temporal analysis

Ghent University Academic Bibliography

State-of-the-art on evolution and reactivity

Author: Alferes José Júlio
Bailey James
Berndtsson Mikael
Bry François
Dietrich Jens
Kozlenkov Alexander
May Wolfgang
Patrânjan Paula Lavinia
Pinto Alexandre
Schroeder Michael
Wagner Gerd
Publication venue
Publication date: 05/08/2004
Field of study

This report starts by, in Chapter 1, outlining aspects of querying and updating resources on the Web and on the Semantic Web, including the development of query and update languages to be carried out within the Rewerse project. From this outline, it becomes clear that several existing research areas and topics are of interest for this work in Rewerse. In the remainder of this report we further present state of the art surveys in a selection of such areas and topics. More precisely: in Chapter 2 we give an overview of logics for reasoning about state change and updates; Chapter 3 is devoted to briefly describing existing update languages for the Web, and also for updating logic programs; in Chapter 4 event-condition-action rules, both in the context of active database systems and in the context of semistructured data, are surveyed; in Chapter 5 we give an overview of some relevant rule-based agents frameworks

Open Access LMU

The design and implementation of an infrastructure for multimedia digital libraries

Author: Eberman B.
Kovalcin D.E.
Vries A.P. de
Publication venue
Publication date: 01/01/1998
Field of study

We develop an infrastructure for managing, indexing and serving multimedia content in digital libraries. This infrastructure follows the model of the Web, and thereby is distributed in nature. We discuss the design of the Librarian, the component that manages meta data about the content. The management of meta data has been separated from the media servers that manage the content itself. Also, the extraction of the meta data is largely independent of the Librarian. We introduce our extensible data model and the daemon paradigm that are the core pieces of this architecture. We evaluate our initial implementation using a relational database. We conclude with a discussion of the lessons we learned in building this system, and proposals for improving the flexibility, reliability, and performance of the syste

CiteSeerX

CWI's Institutional Repository

University of Twente Research Information

RegenBase: a knowledge base of spinal cord injury biology for translational research.

Author: Abeyruwan Saminda W
Al-Ali Hassan
Bixby John L
Callahan Alison
Ferguson Adam R
Lemmon Vance P
Popovich Phillip G
Sakurai Kunie
Shah Nigam H
Visser Ubbo
Publication venue: eScholarship, University of California
Publication date: 01/01/2016
Field of study

Spinal cord injury (SCI) research is a data-rich field that aims to identify the biological mechanisms resulting in loss of function and mobility after SCI, as well as develop therapies that promote recovery after injury. SCI experimental methods, data and domain knowledge are locked in the largely unstructured text of scientific publications, making large scale integration with existing bioinformatics resources and subsequent analysis infeasible. The lack of standard reporting for experiment variables and results also makes experiment replicability a significant challenge. To address these challenges, we have developed RegenBase, a knowledge base of SCI biology. RegenBase integrates curated literature-sourced facts and experimental details, raw assay data profiling the effect of compounds on enzyme activity and cell growth, and structured SCI domain knowledge in the form of the first ontology for SCI, using Semantic Web representation languages and frameworks. RegenBase uses consistent identifier schemes and data representations that enable automated linking among RegenBase statements and also to other biological databases and electronic resources. By querying RegenBase, we have identified novel biological hypotheses linking the effects of perturbagens to observed behavioral outcomes after SCI. RegenBase is publicly available for browsing, querying and download.Database URL:http://regenbase.org

Crossref

PubMed Central

eScholarship - University of California

University of Miami: Scholarship Miami

Time-Aware Probabilistic Knowledge Graphs

Author: Chekol Melisachew Wudage
Stuckenschmidt Heiner
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 26th International Symposium on Temporal Representation and Reasoning (TIME 2019)
Publication date: 01/01/2019
Field of study

The emergence of open information extraction as a tool for constructing and expanding knowledge graphs has aided the growth of temporal data, for instance, YAGO, NELL and Wikidata. While YAGO and Wikidata maintain the valid time of facts, NELL records the time point at which a fact is retrieved from some Web corpora. Collectively, these knowledge graphs (KG) store facts extracted from Wikipedia and other sources. Due to the imprecise nature of the extraction tools that are used to build and expand KG, such as NELL, the facts in the KG are weighted (a confidence value representing the correctness of a fact). Additionally, NELL can be considered as a transaction time KG because every fact is associated with extraction date. On the other hand, YAGO and Wikidata use the valid time model because they maintain facts together with their validity time (temporal scope). In this paper, we propose a bitemporal model (that combines transaction and valid time models) for maintaining and querying bitemporal probabilistic knowledge graphs. We study coalescing and scalability of marginal and MAP inference. Moreover, we show that complexity of reasoning tasks in atemporal probabilistic KG carry over to the bitemporal setting. Finally, we report our evaluation results of the proposed model

MAnnheim DOCument Server

Dagstuhl Research Online Publication Server

Mapping an ancient historian in a digital age: the Herodotus Encoded Space-Text-Image Archive (HESTIA)

Author: Barker Elton
Bouzarovski Stefan
Isaksen Leif
Pelling Chris
Publication venue
Publication date: 01/01/2010
Field of study

HESTIA (the Herodotus Encoded Space-Text-Imaging Archive) employs the latest digital technology to develop an innovative methodology to the study of spatial data in Herodotus' Histories. Using a digital text of Herodotus, freely available from the Perseus on-line library, to capture all the place-names mentioned in the narrative, we construct a database to house that information and represent it in a series of mapping applications, such as GIS, GoogleEarth and GoogleMap Timeline. As a collaboration of academics from the disciplines of Classics, Geography, and Archaeological Computing, HESTIA has the twin aim of investigating the ways geography is represented in the Histories and of bringing Herodotus' world into people's homes

Southampton (e-Prints Soton)

University of Birmingham Research Portal

Open Research Online (The Open University)

The University of Manchester - Institutional Repository

Bipolar fuzzy querying of temporal databases

Author: Billiet Christophe
De Tré Guy
Matthé Tom
Pons Capote Olga
Pons Jose Enrique
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

Temporal databases handle temporal aspects of the objects they describe with an eye to maintaining consistency regarding these temporal aspects. Several techniques have allowed these temporal aspects, along with the regular aspects of the objects, to be defined and queried in an imprecise way. In this paper, a new technique is proposed, which allows using both positive and negative -possibly imprecise- information in querying relational temporal databases. The technique is discussed and the issues which arise are dealt with in a consistent way

Ghent University Academic Bibliography