Search CORE

1,592 research outputs found

Methods for fast and reliable clustering

Author: Kärkkäinen Ismo
Publication venue: University of Joensuu
Publication date
Field of study

A Bibliographic View on Constrained Clustering

Author: Hennessey Samuel
Kuncheva Ludmila
Williams Francis
Publication venue
Publication date: 22/09/2022
Field of study

A keyword search on constrained clustering on Web-of-Science returned just under 3,000 documents. We ran automatic analyses of those, and compiled our own bibliography of 183 papers which we analysed in more detail based on their topic and experimental study, if any. This paper presents general trends of the area and its sub-topics by Pareto analysis, using citation count and year of publication. We list available software and analyse the experimental sections of our reference collection. We found a notable lack of large comparison experiments. Among the topics we reviewed, applications studies were most abundant recently, alongside deep learning, active learning and ensemble learning.Comment: 18 pages, 11 figures, 177 reference

arXiv.org e-Print Archive

Online ensembles of local recommendation models

Author: Lúcia Raquel Brandão Moreira
Publication venue
Publication date: 14/07/2021
Field of study

Repositório Aberto da Universidade do Porto

Identification of Moving Bottlenecks in Production Systems

Author: Adeyinka Funmilayo Mofoluwasola
Publication venue: The Research Repository @ WVU
Publication date: 01/01/2021
Field of study

Manufacturing sector have been plagued by bottlenecks from time immemorial, leading to loss of productivity and profitability, various research effort has been expended towards identifying and mitigating the effects of bottlenecks on production lines. However, traditional approaches often fail in identifying moving bottlenecks. The current data boom and giant strides made in the machine learning field proffers an alternative means of using the large volume of data generated by machines in identifying bottlenecks. In this study, a hierarchical agglomerative clustering algorithm is used in identifying potential groups of bottlenecks within a serial production line. A serial production line with five workstations and zero buffer was simulated in ARENA® with data regarding blocked, producing and starvation time extracted. The extracted data was preprocessed using Python 3.7 to obtain a matrix of ones and zeros. The resultant matrix was fed into a complete linkage hierarchical agglomerative clustering algorithm to obtain clusters containing potential bottleneck workstations. Results obtained was validated using results obtained from simulation and an Elbow plot

The Research Repository @ WVU (West Virginia University)

Recommended from our members

Statistical clustering of data

Author: Zhang Lihao
Publication venue
Publication date: 16/11/2015
Field of study

textCluster analysis aims at segmenting objects into groups with similar members and, therefore helps to discover distribution of properties and correlations in large datasets. Data clustering has been widely studied as it arises in many domains in marketing, engineering, and social sciences. Especially, the occurrence of transactional and experimental datasets in large scale in recent years significantly increased the necessity of clustering techniques to reduce the size of the existing objects, to achieve a better knowledge of the data. This report introduced fundamental concepts related to cluster analysis, addressed the similarity and dissimilarity measurements for cluster definition, and clarified three major clustering algorithms-hierarchical clustering, K-means clustering and Gaussian mixture model fitted by Expectation-Maximization (EM) algorithm-theoretically and experimentally to illustrate the process of clustering. Finally, methods of determining the number of clusters and validating the clustering were presented as for clustering evaluation.Statistic

Texas ScholarWorks

Comparison of K-Means and Fuzzy C-Means Algorithms on Different Cluster Structures

Author: Cebeci Zeynel
Yildiz Figen
Publication venue: 'Journal of Agricultural Informatics'
Publication date: 01/01/2015
Field of study

Repository of the Academy's Library

An exploration of methodologies to improve semi-supervised hierarchical clustering with knowledge-based constraints

Author: Abeer Aljohani (1257537)
Publication venue
Publication date: 01/01/2019
Field of study

Clustering algorithms with constraints (also known as semi-supervised clustering algorithms) have been introduced to the field of machine learning as a significant variant to the conventional unsupervised clustering learning algorithms. They have been demonstrated to achieve better performance due to integrating prior knowledge during the clustering process, that enables uncovering relevant useful information from the data being clustered. However, the research conducted within the context of developing semi-supervised hierarchical clustering techniques are still an open and active investigation area. Majority of current semi-supervised clustering algorithms are developed as partitional clustering (PC) methods and only few research efforts have been made on developing semi-supervised hierarchical clustering methods. The aim of this research is to enhance hierarchical clustering (HC) algorithms based on prior knowledge, by adopting novel methodologies. [Continues.

Loughborough University Institutional Repository

Text Classification Aided by Clustering: a Literature Review

Author: Kyriakopoulou Antonia
Publication venue: 'IntechOpen'
Publication date: 01/08/2008
Field of study

IntechOpen

Crossref