1,592 research outputs found

    Methods for fast and reliable clustering

    Get PDF

    A Bibliographic View on Constrained Clustering

    Full text link
    A keyword search on constrained clustering on Web-of-Science returned just under 3,000 documents. We ran automatic analyses of those, and compiled our own bibliography of 183 papers which we analysed in more detail based on their topic and experimental study, if any. This paper presents general trends of the area and its sub-topics by Pareto analysis, using citation count and year of publication. We list available software and analyse the experimental sections of our reference collection. We found a notable lack of large comparison experiments. Among the topics we reviewed, applications studies were most abundant recently, alongside deep learning, active learning and ensemble learning.Comment: 18 pages, 11 figures, 177 reference

    Identification of Moving Bottlenecks in Production Systems

    Get PDF
    Manufacturing sector have been plagued by bottlenecks from time immemorial, leading to loss of productivity and profitability, various research effort has been expended towards identifying and mitigating the effects of bottlenecks on production lines. However, traditional approaches often fail in identifying moving bottlenecks. The current data boom and giant strides made in the machine learning field proffers an alternative means of using the large volume of data generated by machines in identifying bottlenecks. In this study, a hierarchical agglomerative clustering algorithm is used in identifying potential groups of bottlenecks within a serial production line. A serial production line with five workstations and zero buffer was simulated in ARENA® with data regarding blocked, producing and starvation time extracted. The extracted data was preprocessed using Python 3.7 to obtain a matrix of ones and zeros. The resultant matrix was fed into a complete linkage hierarchical agglomerative clustering algorithm to obtain clusters containing potential bottleneck workstations. Results obtained was validated using results obtained from simulation and an Elbow plot

    Comparison of K-Means and Fuzzy C-Means Algorithms on Different Cluster Structures

    Get PDF

    An exploration of methodologies to improve semi-supervised hierarchical clustering with knowledge-based constraints

    Get PDF
    Clustering algorithms with constraints (also known as semi-supervised clustering algorithms) have been introduced to the field of machine learning as a significant variant to the conventional unsupervised clustering learning algorithms. They have been demonstrated to achieve better performance due to integrating prior knowledge during the clustering process, that enables uncovering relevant useful information from the data being clustered. However, the research conducted within the context of developing semi-supervised hierarchical clustering techniques are still an open and active investigation area. Majority of current semi-supervised clustering algorithms are developed as partitional clustering (PC) methods and only few research efforts have been made on developing semi-supervised hierarchical clustering methods. The aim of this research is to enhance hierarchical clustering (HC) algorithms based on prior knowledge, by adopting novel methodologies. [Continues.

    Text Classification Aided by Clustering: a Literature Review

    Get PDF
    corecore