1,592 research outputs found
A Bibliographic View on Constrained Clustering
A keyword search on constrained clustering on Web-of-Science returned just
under 3,000 documents. We ran automatic analyses of those, and compiled our own
bibliography of 183 papers which we analysed in more detail based on their
topic and experimental study, if any. This paper presents general trends of the
area and its sub-topics by Pareto analysis, using citation count and year of
publication. We list available software and analyse the experimental sections
of our reference collection. We found a notable lack of large comparison
experiments. Among the topics we reviewed, applications studies were most
abundant recently, alongside deep learning, active learning and ensemble
learning.Comment: 18 pages, 11 figures, 177 reference
Identification of Moving Bottlenecks in Production Systems
Manufacturing sector have been plagued by bottlenecks from time immemorial, leading to loss of productivity and profitability, various research effort has been expended towards identifying and mitigating the effects of bottlenecks on production lines. However, traditional approaches often fail in identifying moving bottlenecks. The current data boom and giant strides made in the machine learning field proffers an alternative means of using the large volume of data generated by machines in identifying bottlenecks. In this study, a hierarchical agglomerative clustering algorithm is used in identifying potential groups of bottlenecks within a serial production line.
A serial production line with five workstations and zero buffer was simulated in ARENA® with data regarding blocked, producing and starvation time extracted. The extracted data was preprocessed using Python 3.7 to obtain a matrix of ones and zeros. The resultant matrix was fed into a complete linkage hierarchical agglomerative clustering algorithm to obtain clusters containing potential bottleneck workstations. Results obtained was validated using results obtained from simulation and an Elbow plot
Recommended from our members
Statistical clustering of data
textCluster analysis aims at segmenting objects into groups with similar members and, therefore helps to discover distribution of properties and correlations in large datasets. Data clustering has been widely studied as it arises in many domains in marketing, engineering, and social sciences. Especially, the occurrence of transactional and experimental datasets in large scale in recent years significantly increased the necessity of clustering techniques to reduce the size of the existing objects, to achieve a better knowledge of the data. This report introduced fundamental concepts related to cluster analysis, addressed the similarity and dissimilarity measurements for cluster definition, and clarified three major clustering algorithms-hierarchical clustering, K-means clustering and Gaussian mixture model fitted by Expectation-Maximization (EM) algorithm-theoretically and experimentally to illustrate the process of clustering. Finally, methods of determining the number of clusters and validating the clustering were presented as for clustering evaluation.Statistic
An exploration of methodologies to improve semi-supervised hierarchical clustering with knowledge-based constraints
Clustering algorithms with constraints (also known as semi-supervised clustering algorithms) have been introduced to the field of machine learning as a significant variant to the conventional unsupervised clustering learning algorithms. They have been demonstrated to achieve better performance due to integrating prior knowledge during the clustering process, that enables uncovering relevant useful information from the data being clustered. However, the research conducted within the context of developing semi-supervised hierarchical clustering techniques are still an open and active investigation area. Majority of current semi-supervised clustering algorithms are developed as partitional clustering (PC) methods and only few research efforts have been made on developing semi-supervised hierarchical clustering methods. The aim of this research is to enhance hierarchical clustering (HC) algorithms based on prior knowledge, by adopting novel methodologies. [Continues.
- …