94 research outputs found

    Self organizing maps for outlier detection

    Get PDF
    In this paper we address the problem of multivariate outlier detection using the (unsupervised) self-organizing map (SOM) algorithm introduced by Kohonen. We examine a number of techniques, based on summary statistics and graphics derived from the trained SOM, and conclude that they work well in cooperation with each other. Useful tools include the median interneuron distance matrix and the projection ofthe trained map (via Sammon's projection). SOM quantization errors provide an important complementary source of information for certain type of outlying behavior. Empirical results are reported on both artificial and real data

    Vowel recognition using Kohonen\u27s self-organizing feature maps

    Get PDF
    An important organizing principle observed in the sensory pathways in the brain is the orderly placement of neurons. Although the neurons are structurally identical, the specialized role played by each unit is determined by its internal parameters that are made to change during early learning processes. In the human auditory system, the nerve cells and fibres are arranged in a manner that would elicit maximum response from the neurons when they are activated. Although most of this organization is genetically determined, some of the high level organization is created due to algorithms that promote self-organization. Kohonen\u27s self-organizing feature map is a neural net model that produces feature maps similar to the ones produced in the brain. These maps are capable of describing topological relationships of input signals using a one or two dimensional representation. This technique uses unlabeled data and requires no training as in supervised learning algorithms. It is hence immensely useful in speech and vision applications. This neutral net has been implemented for the recognition of vowels in the American English language. The net has been trained and tested with vowel data. The formation of internal clusters or categories has been observed and closely reflects the tonotopic relationships between the vowels. An analysis of the results has been carried out and the performance has been compared to other classification techniques. A graphical user interface has also been developed using Xview to help visualize the formation of the maps during the training and testing processes

    Simulating sensorimotor systems with cortical topology

    Get PDF
    Due to the character of the original source materials and the nature of batch digitization, quality control issues may be present in this document. Please report any quality issues you encounter to [email protected], referencing the URI of the item.Includes bibliographical references.Not availabl

    Cutting tool condition monitoring of the turning process using artificial intelligence

    Get PDF
    This thesis relates to the application of Artificial Intelligence to tool wear monitoring. The main objective is to develop an intelligent condition monitoring system able to detect when a cutting tool is worn out. To accomplish this objective it is proposed to use a combined Expert System and Neural Network able to process data coming from external sensors and combine this with information from the knowledge base and thereafter estimate the wear state of the tool. The novelty of this work is mainly associatedw ith the configurationo f the proposeds ystem.W ith the combination of sensor-baseidn formation and inferencer ules, the result is an on-line system that can learn from experience and can update the knowledge base pertaining to information associated with different cutting conditions. Two neural networks resolve the problem of interpreting the complex sensor inputs while the Expert System, keeping track of previous successe, stimatesw hich of the two neuraln etworks is more reliable. Also, mis-classificationsa re filtered out through the use of a rough but approximate estimator, the Taylor's tool life equation. In this study an on-line tool wear monitoring system for turning processesh as been developed which can reliably estimate the tool wear under common workshop conditions. The system's modular structurem akesi t easyt o updatea s requiredb y different machinesa nd/or processesT. he use of Taylor's tool life equation, although weak as a tool life estimator, proved to be crucial in achieving higher performance levels. The application of the Self Organizing Map to tool wear monitoring is, in itself, new and proved to be slightly more reliable then the Adaptive Resonance Theory neural network

    GTM: the generative topographic mapping

    Get PDF
    This thesis describes the Generative Topographic Mapping (GTM) --- a non-linear latent variable model, intended for modelling continuous, intrinsically low-dimensional probability distributions, embedded in high-dimensional spaces. It can be seen as a non-linear form of principal component analysis or factor analysis. It also provides a principled alternative to the self-organizing map --- a widely established neural network model for unsupervised learning --- resolving many of its associated theoretical problems. An important, potential application of the GTM is visualization of high-dimensional data. Since the GTM is non-linear, the relationship between data and its visual representation may be far from trivial, but a better understanding of this relationship can be gained by computing the so-called magnification factor. In essence, the magnification factor relates the distances between data points, as they appear when visualized, to the actual distances between those data points. There are two principal limitations of the basic GTM model. The computational effort required will grow exponentially with the intrinsic dimensionality of the density model. However, if the intended application is visualization, this will typically not be a problem. The other limitation is the inherent structure of the GTM, which makes it most suitable for modelling moderately curved probability distributions of approximately rectangular shape. When the target distribution is very different to that, theaim of maintaining an `interpretable' structure, suitable for visualizing data, may come in conflict with the aim of providing a good density model. The fact that the GTM is a probabilistic model means that results from probability theory and statistics can be used to address problems such as model complexity. Furthermore, this framework provides solid ground for extending the GTM to wider contexts than that of this thesis

    Text mining with the WEBSOM

    Get PDF
    The emerging field of text mining applies methods from data mining and exploratory data analysis to analyzing text collections and to conveying information to the user in an intuitive manner. Visual, map-like displays provide a powerful and fast medium for portraying information about large collections of text. Relationships between text items and collections, such as similarity, clusters, gaps and outliers can be communicated naturally using spatial relationships, shading, and colors. In the WEBSOM method the self-organizing map (SOM) algorithm is used to automatically organize very large and high-dimensional collections of text documents onto two-dimensional map displays. The map forms a document landscape where similar documents appear close to each other at points of the regular map grid. The landscape can be labeled with automatically identified descriptive words that convey properties of each area and also act as landmarks during exploration. With the help of an HTML-based interactive tool the ordered landscape can be used in browsing the document collection and in performing searches on the map. An organized map offers an overview of an unknown document collection helping the user in familiarizing herself with the domain. Map displays that are already familiar can be used as visual frames of reference for conveying properties of unknown text items. Static, thematically arranged document landscapes provide meaningful backgrounds for dynamic visualizations of for example time-related properties of the data. Search results can be visualized in the context of related documents. Experiments on document collections of various sizes, text types, and languages show that the WEBSOM method is scalable and generally applicable. Preliminary results in a text retrieval experiment indicate that even when the additional value provided by the visualization is disregarded the document maps perform at least comparably with more conventional retrieval methods.reviewe

    Self-organising maps : statistical analysis, treatment and applications.

    Get PDF
    This thesis presents some substantial theoretical analyses and optimal treatments of Kohonen's self-organising map (SOM) algorithm, and explores the practical application potential of the algorithm for vector quantisation, pattern classification, and image processing. It consists of two major parts. In the first part, the SOM algorithm is investigated and analysed from a statistical viewpoint. The proof of its universal convergence for any dimensionality is obtained using a novel and extended form of the Central Limit Theorem. Its feature space is shown to be an approximate multivariate Gaussian process, which will eventually converge and form a mapping, which minimises the mean-square distortion between the feature and input spaces. The diminishing effect of the initial states and implicit effects of the learning rate and neighbourhood function on its convergence and ordering are analysed and discussed. Distinct and meaningful definitions, and associated measures, of its ordering are presented in relation to map's fault-tolerance. The SOM algorithm is further enhanced by incorporating a proposed constraint, or Bayesian modification, in order to achieve optimal vector quantisation or pattern classification. The second part of this thesis addresses the task of unsupervised texture-image segmentation by means of SOM networks and model-based descriptions. A brief review of texture analysis in terms of definitions, perceptions, and approaches is given. Markov random field model-based approaches are discussed in detail. Arising from this a hierarchical self-organised segmentation structure, which consists of a local MRF parameter estimator, a SOM network, and a simple voting layer, is proposed and is shown, by theoretical analysis and practical experiment, to achieve a maximum likelihood or maximum a posteriori segmentation. A fast, simple, but efficient boundary relaxation algorithm is proposed as a post-processor to further refine the resulting segmentation. The class number validation problem in a fully unsupervised segmentation is approached by a classical, simple, and on-line minimum mean-square-error method. Experimental results indicate that this method is very efficient for texture segmentation problems. The thesis concludes with some suggestions for further work on SOM neural networks
    • …
    corecore