Search CORE

5 research outputs found

Ensemble Clustering for Biological Datasets

Author: Harun Pirim
Şadi Evren Şeker
Publication venue: 'IntechOpen'
Publication date: 28/11/2012
Field of study

MaxPart: An Efficient Search-Space Pruning Approach to Vertical Partitioning

Author: Bouakkaz Mustapha
Ouinten Youcef
Ziani Benameur
Publication venue: Institute of Informatics, Slovak Academy of Sciences
Publication date: 07/11/2018
Field of study

Vertical partitioning is the process of subdividing the attributes of a relation into groups, creating fragments. It represents an effective way of improving performance in the database systems where a significant percentage of query processing time is spent on the full scans of tables. Most of proposed approaches for vertical partitioning in databases use a pairwise affinity to cluster the attributes of a given relation. The affinity measures the frequency of accessing simultaneously a pair of attributes. The attributes having high affinity are clustered together so as to create fragments containing a maximum of attributes with a strong connectivity. However, such fragments can directly and efficiently be achieved by the use of maximal frequent itemsets. This technique of knowledge engineering reflects better the closeness or affinity when more than two attributes are involved. The partitioning process can be done faster and more accurately with the help of such knowledge discovery technique of data mining. In this paper, an approach based on maximal frequent itemsets to vertical partitioning is proposed to efficiently search for an optimized solution by judiciously pruning the potential search space. Moreover, we propose an analytical cost model to evaluate the produced partitions. Experimental studies show that the cost of the partitioning process can be substantially reduced using only a limited set of potential fragments. They also demonstrate the effectiveness of our approach in partitioning small and large tables

Computing and Informatics (E-Journal - Institute of Informatics, SAS, Bratislava)

Bioinformatics

Author
Publication venue: 'IntechOpen'
Publication date: 20/04/2021
Field of study

This book is divided into different research areas relevant in Bioinformatics such as biological networks, next generation sequencing, high performance computing, molecular modeling, structural bioinformatics, molecular modeling and intelligent data analysis. Each book section introduces the basic concepts and then explains its application to problems of great relevance, so both novice and expert readers can benefit from the information and research works presented here

Directory of Open Access Books (DOAB)

NEW OPTIMIZATION MODELS FOR DATA MINING

Author: FRED W. GLOVER
GARY KOCHENBERGER
Publication venue
Publication date
Field of study

In recent years modern methods of optimization have contributed greatly to the advances in data mining and related areas. These contributions continue today and promise to further advance the state of the art both in terms of modeling innovations and new solution methodologies. In this paper, we present a new modeling and solution methodology for unsupervised clustering. Preliminary computational experience is given to illustrate the approach. This methodology is part of our current research and offers considerable opportunity for additional investigation to be conducted by other researchers.Clustering, MIP, Tabu search, metaheuristics

Research Papers in Economics

NEW OPTIMIZATION MODELS FOR DATA MINING

Author: Dorndorf U.
FRED W. GLOVER
GARY KOCHENBERGER
Glover F.
Mirkin B.
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date
Field of study

Crossref