Search CORE

31,684 research outputs found

Recommendations for Evolving Relational Databases

Author: Anquetil Nicolas
Delplanque Julien
Ducasse Stéphane
Etien Anne
Publication venue: HAL CCSD
Publication date: 08/06/2020
Field of study

International audienceRelational databases play a central role in many information systems. Their schemas contain structural and behavioral entity descriptions. Databases must continuously be adapted to new requirements of a world in constant change while: (1) relational database management systems (RDBMS) do not allow inconsistencies in the schema; (2) stored procedure bodies are not meta-described in RDBMS such as PostgreSQL that consider their bodies as plain text. As a consequence , evaluating the impact of an evolution of the database schema is cumbersome , being essentially manual. We present a semi-automatic approach based on recommendations that can be compiled into a SQL patch fulfilling RDBMS constraints. To support recommendations, we designed a meta-model for relational databases easing computation of change impact. We performed an experiment to validate the approach by reproducing a real evolution on a database. The results of our experiment show that our approach can set the database in the same state as the one produced by the manual evolution in 75% less time

INRIA a CCSD electronic archive server

Recommendations for Evolving Relational Databases: Technical Report

Author: Anquetil Nicolas
Delplanque Julien
Ducasse Stéphane
Etien Anne
Publication venue: HAL CCSD
Publication date: 11/03/2020
Field of study

This report contains technical details that could not be included in the article "Recommendations for Evolving Legacy Databases" submitted to the 32nd International Conference on Advanced Information Systems Engineering (CAISE'20)

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

Theory and Practice of Data Citation

Author: Silvello Gianmaria
Publication venue: 'Wiley'
Publication date: 24/06/2017
Field of study

Citations are the cornerstone of knowledge propagation and the primary means of assessing the quality of research, as well as directing investments in science. Science is increasingly becoming "data-intensive", where large volumes of data are collected and analyzed to discover complex patterns through simulations and experiments, and most scientific reference works have been replaced by online curated datasets. Yet, given a dataset, there is no quantitative, consistent and established way of knowing how it has been used over time, who contributed to its curation, what results have been yielded or what value it has. The development of a theory and practice of data citation is fundamental for considering data as first-class research objects with the same relevance and centrality of traditional scientific products. Many works in recent years have discussed data citation from different viewpoints: illustrating why data citation is needed, defining the principles and outlining recommendations for data citation systems, and providing computational methods for addressing specific issues of data citation. The current panorama is many-faceted and an overall view that brings together diverse aspects of this topic is still missing. Therefore, this paper aims to describe the lay of the land for data citation, both from the theoretical (the why and what) and the practical (the how) angle.Comment: 24 pages, 2 tables, pre-print accepted in Journal of the Association for Information Science and Technology (JASIST), 201

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Università di Padova

Curriculum Guidelines for Undergraduate Programs in Data Science

Author: Agarwal Mahesh
Averett Maia
Baumer Benjamin
Bray Andrew
Bressoud Thomas
Bryant Lance
Cheng Lei
De Veaux Richard
Francis Amanda
Gould Robert
Kim Albert Y.
Kretchmar Matt
Lu Qin
Moskol Ann
Nolan Deborah
Pelayo Roberto
Raleigh Sean
Sethi Ricky J.
Sondjaja Mutiara
Tiruviluamala Neelesh
Uhlig Paul
Washington Talitha
Wesley Curtis
White David
Ye Ping
Publication venue: 'Annual Reviews'
Publication date: 01/01/2017
Field of study

The Park City Math Institute (PCMI) 2016 Summer Undergraduate Faculty Program met for the purpose of composing guidelines for undergraduate programs in Data Science. The group consisted of 25 undergraduate faculty from a variety of institutions in the U.S., primarily from the disciplines of mathematics, statistics and computer science. These guidelines are meant to provide some structure for institutions planning for or revising a major in Data Science

arXiv.org e-Print Archive

Smith College: Smith ScholarWorks

Virtual HR Departments: Getting Out of the Middle

Author: Lepak David P.
Snell Scott A.
Stueber Donna
Publication venue: DigitalCommons@ILR
Publication date: 01/03/2001
Field of study

In this chapter, we explore the notion of virtual HR departments: a network-based organization built on partnerships and mediated by information technologies in order to be simultaneously strategic, flexible, cost-efficient, and service-oriented. We draw on experiences and initiatives at Merck Pharmaceuticals in order to show how information technology in establishing an infrastructure for virtual HR. Then, we present a model for mapping the architecture of HR activities that includes both internal and external sourcing options. We conclude by offering some recommendations for management practice as well as future research

DigitalCommons@ILR

eCommons@Cornell

Query-based Schema Evolution Recommendations for Hybrid Polystores

Author: Benats Pol
Cleve Anthony
Meurice Loup
Publication venue
Publication date: 01/01/2022
Field of study

Repository of the University of Namur

Query-based Schema Evolution Recommendations for Hybrid Polystores

Author: Benats Pol
Cleve Anthony
Meurice Loup
Publication venue
Publication date: 01/01/2022
Field of study

Repository of the University of Namur

Fast Search for Dynamic Multi-Relational Graphs

Author: Chin George
Choudhury Sutanay
Feo John
Holder Lawrence
Publication venue
Publication date: 01/01/2013
Field of study

Acting on time-critical events by processing ever growing social media or news streams is a major technical challenge. Many of these data sources can be modeled as multi-relational graphs. Continuous queries or techniques to search for rare events that typically arise in monitoring applications have been studied extensively for relational databases. This work is dedicated to answer the question that emerges naturally: how can we efficiently execute a continuous query on a dynamic graph? This paper presents an exact subgraph search algorithm that exploits the temporal characteristics of representative queries for online news or social media monitoring. The algorithm is based on a novel data structure called the Subgraph Join Tree (SJ-Tree) that leverages the structural and semantic characteristics of the underlying multi-relational graph. The paper concludes with extensive experimentation on several real-world datasets that demonstrates the validity of this approach.Comment: SIGMOD Workshop on Dynamic Networks Management and Mining (DyNetMM), 201

arXiv.org e-Print Archive

Crossref

Publishing Linked Data - There is no One-Size-Fits-All Formula

Author: Corcho Oscar
Garijo Daniel
Gómez-Pérez A.
Mora Jose
Poveda-Villalon Maria
Vila-Suero Daniel
Vilches-Blazquez LM.
Villazón-Terrazas Boris
Publication venue: Facultad de Informática (UPM)
Publication date: 01/06/2012
Field of study

Publishing Linked Data is a process that involves several design decisions and technologies. Although some initial guidelines have been already provided by Linked Data publishers, these are still far from covering all the steps that are necessary (from data source selection to publication) or giving enough details about all these steps, technologies, intermediate products, etc. Furthermore, given the variety of data sources from which Linked Data can be generated, we believe that it is possible to have a single and uni�ed method for publishing Linked Data, but we should rely on di�erent techniques, technologies and tools for particular datasets of a given domain. In this paper we present a general method for publishing Linked Data and the application of the method to cover di�erent sources from di�erent domains

Archivo Digital UPM