Search CORE

874 research outputs found

Dias: Dynamic Rewriting of Pandas Code

Author: Baziotis Stefanos
Kang Daniel
Mendis Charith
Publication venue
Publication date: 28/03/2023
Field of study

In recent years, dataframe libraries, such as pandas have exploded in popularity. Due to their flexibility, they are increasingly used in ad-hoc exploratory data analysis (EDA) workloads. These workloads are diverse, including custom functions which can span libraries or be written in pure Python. The majority of systems available to accelerate EDA workloads focus on bulk-parallel workloads, which contain vastly different computational patterns, typically within a single library. As a result, they can introduce excessive overheads for ad-hoc EDA workloads due to their expensive optimization techniques. Instead, we identify program rewriting as a lightweight technique which can offer substantial speedups while also avoiding slowdowns. We implemented our techniques in Dias, which rewrites notebook cells to be more efficient for ad-hoc EDA workloads. We develop techniques for efficient rewrites in Dias, including dynamic checking of preconditions under which rewrites are correct and just-in-time rewrites for notebook environments. We show that Dias can rewrite individual cells to be 57

\times

faster compared to pandas and 1909

\times

faster compared to optimized systems such as modin. Furthermore, Dias can accelerate whole notebooks by up to 3.6

\times

compared to pandas and 26.4

\times

compared to modin.Comment: 16 pages, 22 figure

arXiv.org e-Print Archive

Requirements Catalog for Business Process Modeling Recommender Systems

Author: Fellmann Michael
Koschmider Agnes
Metzger Dirk
Zarvic Novica
Publication venue: AIS Electronic Library (AISeL)
Publication date: 04/03/2015
Field of study

The manual construction of business process models is a time-consuming and error-prone task. To improve the quality of business process models, several modeling support techniques have been suggested spanning from strict auto-completion of a business process model with pre-defined model elements to suggesting closely matching recommendations. While recommendation systems are widely used and auto-completion functions are a standard feature of programming tools, such techniques have not been exploited for business process modeling although implementation strategies have already been suggested. Therefore, this paper collects requirements from different perspectives (literature and empirical studies) of how to effectively and efficiently assist process modelers in their modeling task. The condensation of requirements represents a comprehensive catalog, which constitutes a solid foundation to implement effective and efficient Process Modeling Recommender Systems (PMRSs). We expect that our contribution will fertilize the field of modeling support techniques to make them a common feature of BPM tools

AIS Electronic Library (AISeL)

Using semantic web technologies for exploratory OLAP: A survey

Author: Abelló Gamazo Alberto
Aramburu María José
Berlanga Rafael
Nebot Victoria
Pedersen Torben
Romero Moral Óscar
Simitsis Alkis
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2015
Field of study

Peer ReviewedPostprint (author’s final draft

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

TOWARDS HARNESSING COMPUTATIONAL WORKFLOW PROVENANCE FOR EXPERIMENT REPORTING

Author: Alper Pinar
Publication venue
Publication date: 01/08/2016
Field of study

The University of Manchester - Institutional Repository

Neuere Entwicklungen der deklarativen KI-Programmierung : proceedings

Author: Boley Harold
Bry François
Geske Ulrich
Publication venue: Sonstige Einrichtungen. DFKI Deutsches Forschungszentrum für Künstliche Intelligenz
Publication date: 01/01/1993
Field of study

The field of declarative AI programming is briefly characterized. Its recent developments in Germany are reflected by a workshop as part of the scientific congress KI-93 at the Berlin Humboldt University. Three tutorials introduce to the state of the art in deductive databases, the programming language Gödel, and the evolution of knowledge bases. Eleven contributed papers treat knowledge revision/program transformation, types, constraints, and type-constraint combinations

Universaar

Acronym

Implementing OBDA for an end-user query answering service on an educational ontology

Author: Cueva Rueda Rodrigo
Publication venue: Universitat Politècnica de Catalunya
Publication date: 01/01/2016
Field of study

In the age where productivity of society is no longer defined by the amount of information generated, but from the quality and assertiveness that a set of data may potentially hold, the right questions to do depends on the semantic awareness capability that an information system could evolve into. To address this challenge, in the last decade, exhaustive research has been done in the Ontology Based Data Access (OBDA) paradigm. A conspectus of the most promising technologies with data integration capabilities and the foundations where they rely are documented in this memory as a point of reference for choosing tools that supports the incorporation of a conceptual model under a OBDA method. The present study provides a practical approach for implementing an ontology based data access service, to educational context users of a Learning Analytics initiative, by means of allowing them to formulate intuitive enquiries with a familiar domain terminology on top of a Learning Management System. The ontology used was completely transformed to semantic linked data standards and some data mappings for testing were included. Semantic Linked Data technologies exposed in this document may exert modernization to environments in which object oriented and relational paradigms may propagate heterogeneous and contradictory requirements. Finally, to validate the implementation, a set of queries were constructed emulating the most relevant dynamics of the model regarding the dataset nature

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Proceedings of the First Karlsruhe Service Summit Workshop - Advances in Service Research, Karlsruhe, Germany, February 2015 (KIT Scientific Reports ; 7692)

Author: Bertsch Valentin
Caton Simon
Feldmann Niels
Görlitz Roland
Jochem Patrick
Maleshkova Maria
Reuter-Oppermann Melanie
Publication venue: KIT Scientific Publishing, Karlsruhe
Publication date: 01/01/2015
Field of study

Since April 2008 KSRI fosters interdisciplinary research in order to support and advance the progress in the service domain. KSRI brings together academia and industry while serving as a European research hub with respect to service science. For KSS2015 Research Workshop, we invited submissions of theoretical and empirical research dealing with the relevant topics in the context of services including energy, mobility, health care, social collaboration, and web technologies

KITopen

Data Vaults: Database Technology for Scientific File Repositories

Author: Ivanova M.G. (Milena)
Kargin Y. (Yagiz)
Kersten M.L. (Martin)
Manegold S. (Stefan)
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2013
Field of study

Current data-management systems and analysis tools fail to meet scientists’ data-intensive needs. A "data vault" approach lets researchers effectively and efficiently explore and analyze information

CWI's Institutional Repository

International Migration, Integration and Social Cohesion online publications

Working notes of the KI\u2795 Workshop : KRDB-95 - Reasoning about structured objects : knowledge representation meets databases ; Bielefeld, Germany, Sept. 11-12, 1995

Author: Baader Franz
Buchheit Martin
Jeusfeld Manfred A.
Nutt Werner
Publication venue: Sonstige Einrichtungen. DFKI Deutsches Forschungszentrum für Künstliche Intelligenz
Publication date: 01/01/1994
Field of study

Universaar

Acronym