1,515 research outputs found
Performance Measurement Based on Coloured Petri Net Simulation of Scalable Business Processes
Business process is also a complex area which receives much attention in recent years especially in increasing productivity and saving cost. Meanwhile, situation at the company allows existing business processes to be enlarged. This paper proposed the performance measurement based on coloured petri net simulation of scalable business processes, which has purpose to compare the performance of scalable business processes. For experiments, this paper uses real-world business processes. Then compare it to some business processes that have been enlarged. The result shows that scalable business processes influence the performance of business process. This paper provides feedback to business process developers for determine appropriate business processes based on the performance through coloured petri net simulation
Fast Knowledge Graph Completion using Graphics Processing Units
Knowledge graphs can be used in many areas related to data semantics such as
question-answering systems, knowledge based systems. However, the currently
constructed knowledge graphs need to be complemented for better knowledge in
terms of relations. It is called knowledge graph completion. To add new
relations to the existing knowledge graph by using knowledge graph embedding
models, we have to evaluate vector operations, where
is the number of entities and is the number of relation types. It is very
costly.
In this paper, we provide an efficient knowledge graph completion framework
on GPUs to get new relations using knowledge graph embedding vectors. In the
proposed framework, we first define "transformable to a metric space" and then
provide a method to transform the knowledge graph completion problem into the
similarity join problem for a model which is "transformable to a metric space".
After that, to efficiently process the similarity join problem, we derive
formulas using the properties of a metric space. Based on the formulas, we
develop a fast knowledge graph completion algorithm. Finally, we experimentally
show that our framework can efficiently process the knowledge graph completion
problem
Efficient Joinable Table Discovery in Data Lakes: A High-Dimensional Similarity-Based Approach
Finding joinable tables in data lakes is key procedure in many applications
such as data integration, data augmentation, data analysis, and data market.
Traditional approaches that find equi-joinable tables are unable to deal with
misspellings and different formats, nor do they capture any semantic joins. In
this paper, we propose PEXESO, a framework for joinable table discovery in data
lakes. We embed textual values as high-dimensional vectors and join columns
under similarity predicates on high-dimensional vectors, hence to address the
limitations of equi-join approaches and identify more meaningful results. To
efficiently find joinable tables with similarity, we propose a block-and-verify
method that utilizes pivot-based filtering. A partitioning technique is
developed to cope with the case when the data lake is large and the index
cannot fit in main memory. An experimental evaluation on real datasets shows
that our solution identifies substantially more tables than equi-joins and
outperforms other similarity-based options, and the join results are useful in
data enrichment for machine learning tasks. The experiments also demonstrate
the efficiency of the proposed method.Comment: Full version of paper in ICDE 202
Diversity in similarity joins
With the increasing ability of current applications to produce and consume more complex data, such as images and geographic information, the similarity join has attracted considerable attention. However, this operator does not consider the relationship among the elements in the answer, generating results with many pairs similar among themselves, which does not add value to the final answer. Result diversification methods are intended to retrieve elements similar enough to satisfy the similarity conditions, but also considering the diversity among the elements in the answer, producing a more heterogeneous result with smaller cardinality, which improves the meaning of the answer. Still, diversity have been studied only when applied to unary operations. In this paper, we introduce the concept of diverse similarity joins: a similarity join operator that ensures a smaller, more diversified and useful answers. The experiments performed on real and synthetic datasets show that our proposal allows exploiting diversity in similarity joins without diminish their performance whereas providing elements that cover the same data space distribution of the non-diverse answers.FAPESPCNPQCAPESRescuer (EU Commission Grant 614154 and CNPQ/MCTI Grant 490084/2013-3)International Conference on Similarity Search and Applications - SISAP (8. 2015 Glasgow
Ideals and hereditary subalgebras in operator algebras
This paper may be viewed as having two aims. First, we continue our study of
algebras of operators on a Hilbert space which have a contractive approximate
identity, this time from a more Banach algebraic point of view. Namely, we
mainly investigate topics concerned with the ideal structure, and hereditary
subalgebras (HSA's), which are in some sense generalization of ideals. Second,
we study properties of operator algebras which are hereditary subalgebras in
their bidual, or equivalently which are `weakly compact'. We also give several
examples answering natural questions that arise in such an investigation.Comment: 24 page
Implementation for spatial data of the shared nearest neighbour with metric data structures
Dissertação para obtenção do Grau de Mestre em
Engenharia InformĂĄtic
- âŠ