Dias: Dynamic Rewriting of Pandas Code
In recent years, dataframe libraries such as pandas have exploded in
popularity. Due to their flexibility, they are increasingly used in ad-hoc
exploratory data analysis (EDA) workloads. These workloads are diverse,
including custom functions which can span libraries or be written in pure
Python. The majority of systems available to accelerate EDA workloads focus on
bulk-parallel workloads, which contain vastly different computational patterns,
typically within a single library. As a result, they can introduce excessive
overheads for ad-hoc EDA workloads due to their expensive optimization
techniques. Instead, we identify program rewriting as a lightweight technique
which can offer substantial speedups while also avoiding slowdowns. We
implemented our techniques in Dias, which rewrites notebook cells to be more
efficient for ad-hoc EDA workloads. We develop techniques for efficient
rewrites in Dias, including dynamic checking of preconditions under which
rewrites are correct and just-in-time rewrites for notebook environments. We
show that Dias can rewrite individual cells to be 57× faster compared to
pandas and 1909× faster compared to optimized systems such as modin.
Furthermore, Dias can accelerate whole notebooks by up to 3.6× compared
to pandas and 26.4× compared to modin.
Comment: 16 pages, 22 figures
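The idea of a rewrite guarded by a dynamically checked precondition can be illustrated with a minimal sketch. This is not Dias's implementation and it deliberately avoids pandas so that it runs with the standard library alone (Python 3.9 or later is assumed). The names `checked_min`, `SortedHeadRewriter`, and `run_cell` are hypothetical. The sketch rewrites the pattern `sorted(X)[0]` in a cell to a call that takes a fast path only when a runtime check shows the fast path is equivalent, and otherwise runs the original expression.

```python
import ast


def checked_min(xs):
    """Fast path guarded by a precondition that is checked at run time."""
    # Precondition (an assumption of this sketch): a non-empty built-in list
    # of plain ints, for which min(xs) equals sorted(xs)[0].
    if type(xs) is list and xs and all(type(x) is int for x in xs):
        return min(xs)
    # Precondition not met: evaluate the original expression unchanged.
    return sorted(xs)[0]


class SortedHeadRewriter(ast.NodeTransformer):
    """Rewrite `sorted(X)[0]` into `checked_min(X)`.

    The match is purely syntactic; it assumes the cell has not rebound
    the name `sorted` to something else.
    """

    def visit_Subscript(self, node):
        self.generic_visit(node)
        call, index = node.value, node.slice
        matches = (
            isinstance(node.ctx, ast.Load)
            and isinstance(index, ast.Constant)
            and type(index.value) is int
            and index.value == 0
            and isinstance(call, ast.Call)
            and isinstance(call.func, ast.Name)
            and call.func.id == "sorted"
            and len(call.args) == 1
            and not isinstance(call.args[0], ast.Starred)
            and not call.keywords
        )
        if not matches:
            return node
        new_call = ast.Call(
            func=ast.Name(id="checked_min", ctx=ast.Load()),
            args=[call.args[0]],
            keywords=[],
        )
        return ast.copy_location(new_call, node)


def run_cell(source, namespace):
    """Rewrite one cell just before executing it; return the rewritten source."""
    tree = SortedHeadRewriter().visit(ast.parse(source))
    ast.fix_missing_locations(tree)
    namespace.setdefault("checked_min", checked_min)
    exec(compile(tree, "<cell>", "exec"), namespace)
    return ast.unparse(tree)


if __name__ == "__main__":
    ns = {"values": [5, 3, 9], "names": ["b", "a"]}
    print(run_cell("lo = sorted(values)[0]\nfirst = sorted(names)[0]", ns))
    print(ns["lo"], ns["first"])
```

In this sketch both expressions are rewritten syntactically, but only the integer list takes the fast path; the list of strings fails the runtime check and falls back to the original `sorted(...)[0]`, so the result is unchanged either way.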
Requirements Catalog for Business Process Modeling Recommender Systems
The manual construction of business process models is a time-consuming and error-prone task. To improve the quality of business process models, several modeling support techniques have been suggested, ranging from strict auto-completion of a business process model with pre-defined model elements to suggesting closely matching recommendations. While recommendation systems are widely used and auto-completion functions are a standard feature of programming tools, such techniques have not been exploited for business process modeling, although implementation strategies have already been suggested. Therefore, this paper collects requirements from different perspectives (literature and empirical studies) on how to effectively and efficiently assist process modelers in their modeling task. The condensed requirements form a comprehensive catalog, which constitutes a solid foundation for implementing effective and efficient Process Modeling Recommender Systems (PMRSs). We expect that our contribution will stimulate the field of modeling support techniques and help make them a common feature of BPM tools.
Using semantic web technologies for exploratory OLAP: A survey
Peer Reviewed. Postprint (author’s final draft)
Neuere Entwicklungen der deklarativen KI-Programmierung : proceedings
The field of declarative AI programming is briefly characterized. Its recent developments in Germany are reflected by a workshop held as part of the scientific congress KI-93 at the Berlin Humboldt University. Three tutorials introduce the state of the art in deductive databases, the programming language Gödel, and the evolution of knowledge bases. Eleven contributed papers treat knowledge revision/program transformation, types, constraints, and type-constraint combinations.
Implementing OBDA for an end-user query answering service on an educational ontology
In an age where the productivity of society is no longer defined by the amount of information
generated, but by the quality and assertiveness that a set of data may hold,
asking the right questions depends on the semantic awareness that an
information system can develop. To address this challenge, extensive research
has been done in the last decade on the Ontology Based Data Access (OBDA)
paradigm.
A survey of the most promising technologies with data integration capabilities and
the foundations on which they rely is documented in this report as a point of reference
for choosing tools that support the incorporation of a conceptual model under an OBDA
method. The present study provides a practical approach for implementing an ontology
based data access service for users in the educational context of a Learning Analytics initiative,
allowing them to formulate intuitive queries in familiar domain
terminology on top of a Learning Management System. The ontology used was
completely transformed to semantic linked data standards, and some data mappings for
testing were included. The Semantic Linked Data technologies presented in this document may
help modernize environments in which object oriented and relational paradigms
may propagate heterogeneous and contradictory requirements. Finally, to validate the
implementation, a set of queries was constructed emulating the most relevant dynamics
of the model with respect to the nature of the dataset.
Proceedings of the First Karlsruhe Service Summit Workshop - Advances in Service Research, Karlsruhe, Germany, February 2015 (KIT Scientific Reports ; 7692)
Since April 2008, KSRI has fostered interdisciplinary research in order to support and advance progress in the service domain. KSRI brings together academia and industry while serving as a European research hub for service science. For the KSS2015 Research Workshop, we invited submissions of theoretical and empirical research dealing with relevant topics in the context of services, including energy, mobility, health care, social collaboration, and web technologies.
Data Vaults: Database Technology for Scientific File Repositories
Current data-management systems and analysis tools fail to meet scientists’ data-intensive needs. A "data vault" approach lets researchers effectively and efficiently explore and analyze information