    Data generator for evaluating ETL process quality

    Obtaining the right set of data for evaluating the fulfillment of different quality factors in extract-transform-load (ETL) process design is rather challenging. First, the real data might be out of reach due to privacy constraints, while manually providing a synthetic dataset is a labor-intensive task that must take various combinations of process parameters into account. More importantly, a single dataset usually does not represent the evolution of data throughout the complete process lifespan, and hence misses a plethora of possible test cases. To facilitate this demanding task, in this paper we propose an automatic data generator (Bijoux). Starting from a given ETL process model, Bijoux extracts the semantics of data transformations, analyzes the constraints they imply over input data, and automatically generates testing datasets. Bijoux is highly modular and configurable, enabling end-users to generate datasets for a variety of interesting test scenarios (e.g., evaluating specific parts of an input ETL process design with different input dataset sizes, data distributions, and operation selectivities). We have developed a running prototype that implements the functionality of our data generation framework, and we report experimental findings showing the effectiveness and scalability of our approach.
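    To make the idea of generating data against an operation's constraints concrete, here is a minimal sketch (not Bijoux's actual API; all names and the strategy are illustrative): for a filter operation with predicate `age > threshold`, we generate rows so that a requested fraction of them, the operation's selectivity, satisfies the predicate.

```python
import random

def generate_filter_dataset(n_rows, selectivity, lo=0, hi=100,
                            threshold=30, seed=0):
    """Generate test rows for a filter `age > threshold` so that
    roughly `selectivity` of them pass the predicate."""
    rng = random.Random(seed)
    n_pass = round(n_rows * selectivity)
    rows = []
    for i in range(n_rows):
        if i < n_pass:
            age = rng.randint(threshold + 1, hi)   # satisfies the predicate
        else:
            age = rng.randint(lo, threshold)       # violates the predicate
        rows.append({"id": i, "age": age})
    rng.shuffle(rows)                              # avoid ordering artifacts
    return rows

data = generate_filter_dataset(1000, 0.25)
passed = sum(r["age"] > 30 for r in data)          # exactly 250 rows pass
```

    A real generator would derive `threshold` and the value domains from the constraints extracted from the ETL process model rather than take them as parameters.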

    On the relation between the base of an EI algebra and word graphs

    This paper is an attempt to investigate possible links between algebraic fuzzy set theory and the theory of word graphs. Both theories study concepts, and these concepts can be set in correspondence, which enables the use of algebraic results in the context of word graph theory.

    Categories in Control

    Control theory uses "signal-flow diagrams" to describe processes where real-valued functions of time are added, multiplied by scalars, differentiated and integrated, duplicated and deleted. These diagrams can be seen as string diagrams for the symmetric monoidal category FinVect_k of finite-dimensional vector spaces over the field of rational functions k = R(s), where the variable s acts as differentiation and the monoidal structure is direct sum rather than the usual tensor product of vector spaces. For any field k we give a presentation of FinVect_k in terms of the generators used in signal-flow diagrams. A broader class of signal-flow diagrams also includes "caps" and "cups" to model feedback. We show these diagrams can be seen as string diagrams for the symmetric monoidal category FinRel_k, where objects are still finite-dimensional vector spaces but the morphisms are linear relations. We also give a presentation for FinRel_k. The relations say, among other things, that the 1-dimensional vector space k has two special commutative dagger-Frobenius structures, such that the multiplication and unit of either one and the comultiplication and counit of the other fit together to form a bimonoid. This sort of structure, but with tensor product replacing direct sum, is familiar from the "ZX-calculus" obeyed by a finite-dimensional Hilbert space with two mutually unbiased bases.
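    The correspondence between diagrams and matrices can be sketched in a few lines. This is an illustrative toy, not the paper's formalism: we take k = Q (Python fractions) instead of R(s), represent morphisms of FinVect_k as matrices over k, composition of diagrams as matrix multiplication, and side-by-side placement as the direct sum.

```python
from fractions import Fraction as F

def compose(g, f):
    """Matrix product g.f: run diagram f, then diagram g."""
    return [[sum(g[i][k] * f[k][j] for k in range(len(f)))
             for j in range(len(f[0]))] for i in range(len(g))]

def direct_sum(a, b):
    """Block-diagonal matrix: diagrams a and b side by side."""
    ca, cb = len(a[0]), len(b[0])
    return ([row + [F(0)] * cb for row in a] +
            [[F(0)] * ca + row for row in b])

# Generators of signal-flow diagrams as matrices:
duplicate = [[F(1)], [F(1)]]    # copy one wire onto two (comultiplication)
add       = [[F(1), F(1)]]      # sum two wires onto one (multiplication)
scale     = lambda c: [[F(c)]]  # multiply a signal by the scalar c

# Duplicating a signal and then adding the two copies doubles it:
doubled  = compose(add, duplicate)          # the 1x1 matrix [[2]]
parallel = direct_sum(scale(2), scale(3))   # 2x2 block-diagonal matrix
```

    Over the field k = R(s) of the paper, the same picture holds with rational-function entries, where multiplication by s is differentiation.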

    Simple and Effective Type Check Removal through Lazy Basic Block Versioning

    Dynamically typed programming languages such as JavaScript and Python defer type checking to run time. In order to maximize performance, dynamic language VM implementations must attempt to eliminate redundant dynamic type checks. However, type inference analyses are often costly and involve tradeoffs between compilation time and resulting precision. This has led to the creation of increasingly complex multi-tiered VM architectures. This paper introduces lazy basic block versioning, a simple JIT compilation technique which effectively removes redundant type checks from critical code paths. This novel approach lazily generates type-specialized versions of basic blocks on the fly while propagating context-dependent type information. It does not require costly program analyses, is not restricted by the precision limitations of traditional type analyses, and avoids the implementation complexity of speculative optimization techniques. We have implemented intraprocedural lazy basic block versioning in a JavaScript JIT compiler and compared it with a classical flow-based type analysis. Lazy basic block versioning performs as well as or better on all benchmarks. On average, 71% of type tests are eliminated, yielding speedups of up to 50%. We also show that our implementation generates more efficient machine code than TraceMonkey, a tracing JIT compiler for JavaScript, on several benchmarks. The combination of implementation simplicity, low algorithmic complexity and good run-time performance makes basic block versioning attractive for baseline JIT compilers.
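    The core mechanism can be sketched in miniature (this is an illustrative toy, not the paper's JavaScript implementation): a block is "compiled" the first time it is reached under a given type context, the specialized version omits the type test that the context already guarantees, and a cache keyed on (block, type context) reuses that version on later executions.

```python
# Cache of specialized block versions, keyed on (block name, type context).
versions = {}
compile_count = 0  # how many times we actually "compiled" a version

def generic_inc(x):
    """Unspecialized block: performs a dynamic type test on every run."""
    if not isinstance(x, int):
        raise TypeError("expected int")
    return x + 1

def get_version(block_name, type_context):
    """Return a block version specialized to `type_context`,
    compiling it lazily on first use."""
    global compile_count
    key = (block_name, type_context)
    if key not in versions:
        compile_count += 1
        if type_context is int:
            # Specialized version: the type test has been removed,
            # since the context already proves x is an int.
            versions[key] = lambda x: x + 1
        else:
            versions[key] = generic_inc
    return versions[key]

# Executing the block three times with ints compiles one version,
# which is then reused; the redundant type tests never run.
for v in (1, 2, 3):
    inc = get_version("inc", type(v))
    result = inc(v)
```

    The real technique operates on basic blocks of JIT-compiled machine code and propagates the type context from block to block, but the laziness and the per-context version cache are the same idea.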