5,985 research outputs found

Machine-Part cell formation through visual decipherable clustering of Self Organizing Map

Author: A Flexer, AK Jain, AM Mukattash, CH Chu, Cing Tay Joc, CO Malave, CT Mosier, E Jeffrey Schaller, F Guerrero, GA Carpenter, H Seifoddini, Haibo Wang, HM Chan, J Balakrishnan, J Li, J-SR Jang, JF Goncalves, JL Burbidge, KN Balasubramanian, KR Kumar, L Stanfel, M Murugan, Manojit Chattopadhyay, MP Chandrasekharan, MS Yang, MY Kiang, N Laitinen, N Singh, NC Suresh, P Venkumar, PH Waghodekar, Pranab K. Dan, R Tavakkoli-Moghaddam, RG Askin, S Brosse, S Carrie, S Kaski, S Mancuso, SAh Kioon, Shih-Wei Lin, SK Mukhopadhyay, Surajit Chattopadhyay, T Kohonen, U Wemmerlov, V Pillai, V Venugopal, Y Kao, Yin Yong
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 06/05/2011

Machine-part cell formation is used in cellular manufacturing to handle a large variety of parts while improving quality, lowering work-in-process levels, and reducing manufacturing lead time and customer response time, all while retaining flexibility for new products. The fundamental problem in cellular manufacturing is the formation of part families and machine cells, and this paper presents a new approach for obtaining them. It uses the Self Organizing Map (SOM), an unsupervised learning algorithm from artificial intelligence, as a visually decipherable clustering tool for machine-part cell formation. The objective is to cluster the binary machine-part matrix through visually decipherable SOM clusters, using color-coding and labelling of the SOM map nodes, so that each part family is processed in its corresponding machine cell. The U-matrix, component planes, principal component projection, scatter plot and histogram of the SOM are reported to visualize the machine-part cell formation. Computational results for the proposed algorithm on a set of group technology problems available in the literature are also presented. The proposed SOM approach produced solutions with a grouping efficacy at least as good as any result previously reported in the literature, improved the grouping efficacy for 70% of the problems, and was found useful to both industry practitioners and researchers.
Comment: 18 pages, 3 tables, 4 figures
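The abstract evaluates solutions by grouping efficacy without defining it. A minimal sketch follows, assuming the definition commonly used in the group technology literature: (e - e_out) / (e + e_void), where e is the number of 1s in the machine-part matrix, e_out the number of 1s outside the diagonal blocks, and e_void the number of 0s inside them. The matrix, the cell assignments and the function name are hypothetical illustrations, not data from the paper.

```python
def grouping_efficacy(matrix, machine_cell, part_family):
    """Grouping efficacy = (e - e_out) / (e + e_void)."""
    total_ones = 0    # e: all operations (1s) in the matrix
    exceptional = 0   # e_out: 1s that fall outside the diagonal blocks
    voids = 0         # e_void: 0s that fall inside the diagonal blocks
    for i, row in enumerate(matrix):
        for j, value in enumerate(row):
            inside = machine_cell[i] == part_family[j]
            if value == 1:
                total_ones += 1
                if not inside:
                    exceptional += 1
            elif inside:
                voids += 1
    return (total_ones - exceptional) / (total_ones + voids)


# Hypothetical 4-machine x 5-part incidence matrix (rows are machines).
incidence = [
    [1, 1, 0, 0, 0],
    [1, 1, 0, 0, 1],
    [0, 0, 1, 1, 1],
    [0, 0, 1, 0, 1],
]
machine_cell = [0, 0, 1, 1]      # cell index of each machine
part_family = [0, 0, 1, 1, 1]    # family index of each part
efficacy = grouping_efficacy(incidence, machine_cell, part_family)
```

In this toy case there are 10 operations, one exceptional element and one void, so the value is 9/11. Any clustering method, SOM or otherwise, can be scored this way once it has produced the two assignment lists.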

arXiv.org e-Print Archive

Crossref

Fast Algorithm and Implementation of Dissimilarity Self-Organizing Maps

Author: Atkinson, Aïcha El Golli, Bahlmann, Brieuc Conan-Guez, Buhmann, El Golli, Fabrice Rossi, Graepel, Hammer, Hofmann, Kaufman, Kohonen, Levenshtein, MacQueen, Porter, Seo, Wei
Publication venue: 'Elsevier BV'
Publication date: 01/01/2006

In many real-world applications, data cannot be accurately represented by vectors. In those situations, one possible solution is to rely on dissimilarity measures that enable sensible comparison between observations. Kohonen's Self-Organizing Map (SOM) has been adapted to data described only through their dissimilarity matrix. This algorithm provides both nonlinear projection and clustering of non-vector data. Unfortunately, the algorithm suffers from a high cost that makes it quite difficult to use with voluminous data sets. In this paper, we propose a new algorithm that substantially reduces the theoretical cost of the dissimilarity SOM without changing its outcome (the results are exactly the same as those obtained with the original algorithm). Moreover, we introduce implementation methods that result in very short running times. Improvements deduced from the theoretical cost model are validated on simulated and real-world data (a word list clustering problem). We also demonstrate that the proposed implementation methods reduce the running time of the fast algorithm by a factor of up to 3 over a standard implementation.
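To make the setting concrete, here is a minimal sketch of a batch SOM that works only from a dissimilarity matrix. It is the straightforward exhaustive-search variant, not the accelerated algorithm the paper proposes. The 1-D chain of units, the Gaussian neighbourhood, the linear radius schedule and the even initialisation are all assumptions made for illustration.

```python
import math


def dissimilarity_som(dissim, n_units, n_epochs=10, radius0=1.0):
    """Batch SOM on a 1-D chain of units, using only pairwise dissimilarities.

    Each prototype is the index of an observation, not a vector.
    """
    n = len(dissim)
    # Assumed initialisation: spread the prototypes evenly over the data.
    protos = [(u * n) // n_units for u in range(n_units)]

    def assign(current):
        # Best matching unit of each observation = its closest prototype.
        return [min(range(n_units), key=lambda u: dissim[i][current[u]])
                for i in range(n)]

    for epoch in range(n_epochs):
        radius = max(radius0 * (1.0 - epoch / n_epochs), 0.1)
        bmu = assign(protos)
        new_protos = []
        for u in range(n_units):
            # Gaussian neighbourhood weight between each observation's unit and u.
            weights = [math.exp(-((bmu[i] - u) ** 2) / (2.0 * radius ** 2))
                       for i in range(n)]
            # Exhaustive search: the observation with the smallest
            # neighbourhood-weighted sum of dissimilarities becomes prototype u.
            best = min(range(n),
                       key=lambda k: sum(weights[i] * dissim[i][k]
                                         for i in range(n)))
            new_protos.append(best)
        protos = new_protos
    return protos, assign(protos)


# Toy data: two well-separated groups, described only by |a - b|.
points = [0, 1, 2, 50, 51, 52]
toy_dissim = [[abs(a - b) for b in points] for a in points]
prototypes, units = dissimilarity_som(toy_dissim, n_units=2)
```

The exhaustive prototype search costs on the order of N squared operations per unit and per epoch for N observations, which illustrates the kind of cost that makes the method hard to use on voluminous data sets.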

arXiv.org e-Print Archive

CiteSeerX

Crossref

INRIA a CCSD electronic archive server

Self-organizing maps could improve the classification of Spanish mutual funds.

Author: Marco Paulina, Moreno David, Olmeda Ignacio

In this paper, we apply nonlinear techniques (Self-Organizing Maps, k-nearest neighbors and the k-means algorithm) to evaluate the official Spanish mutual funds classification. The methodology that we propose allows us to identify which mutual funds are misclassified, in the sense that their historical performance does not conform to the investment objectives established for their official category. From this, we conclude that, on average, over 40% of mutual funds could be misclassified. We then propose an alternative classification, based on a double-step methodology, and find that it achieves a significantly lower rate of misclassification. The portfolios obtained from this alternative classification also attain better performance in terms of return/risk and include a smaller number of assets.
Keywords: Finance; Mutual funds; Clustering; Self-organizing map (SOM); Investment analysis

Research Papers in Economics

A survey of outlier detection methodologies

Author: Austin J., Hodge V.J.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2004

Outlier detection has been used for centuries to detect and, where appropriate, remove anomalous observations from data. Outliers arise due to mechanical faults, changes in system behaviour, fraudulent behaviour, human error, instrument error or simply through natural deviations in populations. Their detection can identify system faults and fraud before they escalate with potentially catastrophic consequences. It can also identify errors and remove their contaminating effect on the data set, thereby purifying the data for processing. The original outlier detection methods were arbitrary, but principled and systematic techniques are now used, drawn from the full gamut of Computer Science and Statistics. In this paper, we introduce a survey of contemporary techniques for outlier detection. We identify their respective motivations and distinguish their advantages and disadvantages in a comparative review.

CiteSeerX

Crossref

White Rose Research Online

Batch kernel SOM and related Laplacian methods for social network analysis

Author: Alpert, Andras, Aronszajn, Auber, Berlinet, Bertrand Jouve, Bornholdt, Clauset, Conan-Guez, Cristianini, Di Battista, Donetti, Fabrice Rossi, Faloutsos, Filippone, Graepel, Hammer, Herman, Kaski, Kohonen, Kondor, Mac Donald, Miikkulainen, Mohar, Mossa, Nathalie Villa, Neville, Newman, Palla, Pons, Radicchi, Romain Boulet, Schaeffer, Schölkopf, Smola, Strogatz, Ultsch, van den Heuvel, Vert, Villa, Watts, Zhou
Publication date: 01/01/2008

Large graphs are natural mathematical models for describing the structure of data in a wide variety of fields, such as web mining, social networks, information retrieval and biological networks. For all these applications, automatic tools are required to get a synthetic view of the graph and to reach a good understanding of the underlying problem. In particular, discovering groups of tightly connected vertices and understanding the relations between those groups is very important in practice. This paper shows how a kernel version of the batch Self Organizing Map can be used to achieve these goals via kernels derived from the Laplacian matrix of the graph, especially when it is used in conjunction with more classical methods based on the spectral analysis of the graph. The proposed method is used to explore the structure of a medieval social network modeled through a weighted graph that has been directly built from a large corpus of agrarian contracts.
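As an illustration of a kernel derived from the graph Laplacian, the sketch below builds the combinatorial Laplacian L = D - W from a weighted adjacency matrix and then a diffusion (heat) kernel exp(-beta * L). The abstract does not name the kernel it uses, so the heat kernel is an assumption here, chosen as one standard Laplacian-based option. The truncated Taylor series, the function names and the toy graph are also illustrative choices.

```python
def laplacian(weights):
    """Combinatorial Laplacian L = D - W of a symmetric weighted adjacency matrix."""
    n = len(weights)
    return [[(sum(weights[i]) if i == j else 0.0) - weights[i][j]
             for j in range(n)] for i in range(n)]


def matmul(a, b):
    """Plain product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def heat_kernel(weights, beta, n_terms=30):
    """Truncated Taylor series of exp(-beta * L); adequate for small graphs only."""
    n = len(weights)
    lap = laplacian(weights)
    scaled = [[-beta * lap[i][j] for j in range(n)] for i in range(n)]
    term = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    kernel = [row[:] for row in term]
    for order in range(1, n_terms):
        # After this step, term holds (-beta * L)^order / order!
        product = matmul(term, scaled)
        term = [[product[i][j] / order for j in range(n)] for i in range(n)]
        kernel = [[kernel[i][j] + term[i][j] for j in range(n)]
                  for i in range(n)]
    return kernel


# Hypothetical weighted path graph on three vertices: 0 -(2)- 1 -(1)- 2.
toy_weights = [
    [0.0, 2.0, 0.0],
    [2.0, 0.0, 1.0],
    [0.0, 1.0, 0.0],
]
toy_laplacian = laplacian(toy_weights)
toy_kernel = heat_kernel(toy_weights, beta=0.5)
```

The resulting matrix is symmetric and can be passed to any kernel method in place of a vector representation of the vertices. For graphs of realistic size, a matrix exponential from a numerical library would replace the Taylor series.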

arXiv.org e-Print Archive

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

INRIA a CCSD electronic archive server

HAL-INSA Toulouse

Scalable aggregation predictive analytics: a query-driven machine learning approach

Author: Anagnostopoulos Christos, Savva Fotis, Triantafillou Peter
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/09/2018

We introduce a predictive modeling solution that provides high-quality predictive analytics over aggregation queries in Big Data environments. Our predictive methodology is generally applicable in environments in which large-scale data owners may or may not restrict access to their data and allow only aggregation operators like COUNT to be executed over their data. In this context, our methodology is based on historical queries and their answers to accurately predict ad-hoc queries’ answers. We focus on the widely used set-cardinality, i.e., COUNT, aggregation query, as COUNT is a fundamental operator for both internal data system optimizations and for aggregation-oriented data exploration and predictive analytics. We contribute a novel, query-driven Machine Learning (ML) model whose goals are to: (i) learn the query-answer space from past issued queries, (ii) associate the query space with local linear regression and associative function estimators, (iii) define query similarity, and (iv) predict the cardinality of the answer set of unseen incoming queries, referred to as the Set Cardinality Prediction (SCP) problem. Our ML model incorporates incremental ML algorithms for ensuring high-quality prediction results. The significance of our contribution lies in that it (i) is the only query-driven solution applicable over general Big Data environments, which include restricted-access data, (ii) offers incremental learning adjusted for arriving ad-hoc queries, which is well suited for query-driven data exploration, and (iii) offers a performance (in terms of scalability, SCP accuracy, processing time, and memory requirements) that is superior to data-centric approaches. We provide a comprehensive performance evaluation of our model, evaluating its sensitivity, scalability and efficiency for quality predictive analytics. In addition, we report on the development and incorporation of our ML model in Spark, showing its superior performance compared to Spark’s COUNT method.
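The idea of answering a COUNT query from past queries alone, without touching the data, can be illustrated with a deliberately simpler stand-in: averaging the answers of the k most similar past queries, rather than the local linear regression and incremental learning the abstract describes. The encoding of a range query as a (center, radius) pair, the Euclidean similarity, the function name and all numbers are assumptions made for this sketch.

```python
import math


def predict_count(past_queries, past_counts, query, k=3):
    """Average the answers of the k past queries most similar to `query`."""
    # Query similarity (assumed): Euclidean distance between query vectors.
    ranked = sorted(range(len(past_queries)),
                    key=lambda i: math.dist(past_queries[i], query))
    nearest = ranked[:k]
    return sum(past_counts[i] for i in nearest) / len(nearest)


# Hypothetical history of 1-D range queries, encoded as (center, radius),
# together with the COUNT each one returned.
history = [(0.0, 1.0), (5.0, 1.0), (5.5, 1.0), (10.0, 2.0)]
answers = [12, 40, 44, 7]

# Predict the answer of an unseen query without accessing the data.
estimate = predict_count(history, answers, (5.2, 1.0), k=2)
```

Here the two closest past queries returned 40 and 44, so the estimate is 42.0. A model closer to the abstract would fit a local regression around groups of similar queries and update it as new query-answer pairs arrive.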

Warwick Research Archives Portal Repository

Enlighten