4 research outputs found

    An谩lisis Interactivo de Datos: Mapas Autoorganizados

    Get PDF
    Los fen贸menos f铆sicos, as铆 como los procesos industriales, producen vol煤menes cada vez m谩s cuantiosos de datos, a menudo de dif铆cil tratamiento. Se hace preceptiva la generaci贸n de sistemas y procedimientos que permitan extraer, en una primera etapa visual de an谩lisis, la informaci贸n subyacente a estos datos, orientando as铆 los c谩lculos y estudios posteriores que puedan aplicarse. El an谩lisis visual ser谩 reforzado si se complementa con elementos interactivos que permitan al usuario dirigirse hacia los focos de su inter茅s. Dentro de las t茅cnicas de visualizaci贸n de datos para los fines expuestos destacan los Mapas Autoorganizados (tambi茅n llamados SOM, acr贸nimo de Self-Organizing Maps), un tipo de redes neuronales visuales cuya efectividad ser谩 potenciada si son dotados de interactividad. Las aportaciones destacadas del presente trabajo son: 1. Desarrollo de una herramienta de c贸digo abierto de ayuda en entornos tecnol贸gicos, acad茅micos e industriales, que incorpora entrenamiento de Mapas Autoorganizados y sistema interactivo de visualizaci贸n de resultados. Dicha herramienta constituye un prototipo de c贸digo abierto, f谩cil de modificar y escalar, y compatible con los sistemas operativos de uso m谩s habitual en el mercado. Se realiza el entrenamiento de diferentes modelos de Mapas Autoorganizados: GSOM (Growing SOM), GHSOM (Growing Hierarchical SOM) y una nueva propuesta: GCHSOM. Los entrenamientos permiten diversidad de par谩metros y se realizan por lotes, almacenando los resultados que presenten mejores mediciones en sus medidas de calidad (error de cuantificaci贸n, error topogr谩fico e 铆ndice de Kaski y Lagus). 2. Presentaci贸n de los Mapas Autoorganizados como herramienta de visualizaci贸n de datos para su empleo en fases iniciales de an谩lisis de informaci贸n. Sus propiedades de cuantificaci贸n de datos y proyecci贸n de las relaciones existentes entre 茅stos en espacios de bajas dimensiones los convierten en 煤tiles muy eficaces para el an谩lisis visual de informaci贸n. 3. Mejora de los Mapas Autoorganizados mediante la adici贸n de interactividad, como respuesta a las necesidades actuales de an谩lisis visual de datos. Entre otras opciones, se destaca la realizaci贸n de una selecci贸n de distintos modos de color (secuencial, escala de grises, bipolares secuenciales, escalas de pseudocolores) que por sus caracter铆sticas facilitar谩n en algunos casos, la discriminaci贸n, y en otros, la cuantificaci贸n. Adem谩s se realizan agrupamientos de la informaci贸n que facilitan su comprensi贸n. Las opciones interactivas de tratamiento de los datos facilitan la discriminaci贸n de la informaci贸n, permitiendo as铆 la visualizaci贸n de muchas capas. 4. Optimizaciones en el proceso de entrenamiento de grupos de Mapas Autoorganizados para un mismo conjunto de datos para, posteriormente, seleccionar de entre 茅stos al que re煤na mejores condiciones. Se realizan una serie de mejoras sobre el algoritmo base de entrenamiento (paralelismo, cacheado de c谩lculos) que facilitan la realizaci贸n de experimentos. 5. Valoraci贸n de modelos de entrenamiento y propuesta de variantes en el 谩mbito de visualizaci贸n de datos. Concretamente se propone una variante de Mapa Autoorganizado bautizada como GCHSOM (Growing Cluster Hierarchical SOM) que consiste en una estructura de Mapas Autoorganizados de tipo GSOM. Esta variante permite que los datos a analizar se muestren gr谩ficamente en una primera instancia con alto nivel de detalle, permitiendo adem谩s al investigador realizar nuevas consultas visuales siguiendo una estructura jer谩rquica que le guiar谩 permitiendo el acceso a diferentes subconjuntos o contextos de los datos, proporcionando nuevos matices sobre cada uno de estos contextos.Physical phenomena, as well as industrial processes, produce increasingly large volumes of data, often of dificult treatment. It makes mandatory the generation of systems and procedures that allow to extract, in the first analysis stage, the underlying information to this data, facing this way the calculations and later studies that could be applied. Visual analysis will be strengthened if it is supplemented with interactive elements that allow the user to move towards the foci of interest. Among the techniques of visualization of data to the exposed ends are the Self-Organizing Maps (SOM), a kind of neural network whose effectiveness will be enhanced if they are endowed with interactivity. The contributions of this work are: 1. Development of an open source tool to aid in technological, academic and industrial environments, which incorporates training of Self-Organizing Maps and an interactive display of visualization of results. It is worth developing as it is difficult to find free software that can be ported to different operating systems. As it is open source, it can be modified to accomplish different purposes. Also, as its interface is made on the Processing framework, it is near-direct portable to Linux, Mac OS X and Windows, and makes use of Processing powerful graphics library. This software is of particular interest as it allows navigation through structures formed by different neural networks, compared to flat representations of the same. It can train several SOM models: GSOM (Growing SOM), GHSOM (Growing Hierarchical SOM) and a new proposal: GCHSOM. It allows diversity of training parameters and can perform batch, storing the resulting nets that present better measures in their measures of quality (quantization error, topographic error, and Kaski and Lagus rate). The users can choose the training options via the client interface, and select the data to be treated. Then they choose the number of times to repeat the experiment. According to those inputs, this piece of code provides the best SOM networks based on three qualitycriteria: a) quantization error b) topographical error. c) Kaski and Lagus Index 2. Presentation of Self-Organizing Maps as a data visualization tool for use in early stages of analysis. Its properties of data quantification and projection of the relationships between data on low-dimensional spaces make them very effective useful for visual analysis. 3. Improvement of Self-Organizing Maps by adding interactivity, in response to the current needs of visual data analysis. The user is offered the following options: a) Saving and loading the different experiments that have been conducted. b) Selecting the network that complies with the quality criteria for each training. c) Request new clusters of data for the calculated network. d) Select the color map used (sequential, grayscale, sequential bipolar scales, pseudo colors) whose characteristics provide in some cases, discrimination, and other, quantification. e) Manipulate the results (hiding on, pan, and zoom). f) Different partial selection modes for the display of summary information. g) The ability to re-train with subsets of information results. In short, the graphic interface serves not only as a bridge between the user and the training libraries, but also provides a number of functions that facilitate interactive data analysis and navigation through the most complex hierarchical structures. 4. Optimizations in the process of batch training Self-Organizing Maps groups for the same set of data to select from among them the ones which meet best conditions. A number of improvements on the training base (parallelism, caching calculations) that facilitate algorithm performing experiments are implemented. 5. Assessment of training models and variants proposed in the field of data visualization. A new proposal has been made, variant from GHSOM: GCHSOM (Hierarchical Cluster Growing SOM) consisting of a hierarchical structure of GSOM. This option allows data to be analyzed is graphically displayed at first sight with high level of detail, also allowing the researchers to make new visual queries following a hierarchical structure that will guide them, allowing the access to different subsets of data, providing new nuances of each of these contexts

    Nonsmooth optimization models and algorithms for data clustering and visualization

    Get PDF
    Cluster analysis deals with the problem of organization of a collection of patterns into clusters based on a similarity measure. Various distance functions can be used to define this measure. Clustering problems with the similarity measure defined by the squared Euclidean distance have been studied extensively over the last five decades. However, problems with other Minkowski norms have attracted significantly less attention. The use of different similarity measures may help to identify different cluster structures of a data set. This in turn may help to significantly improve the decision making process. High dimensional data visualization is another important task in the field of data mining and pattern recognition. To date, the principal component analysis and the self-organizing maps techniques have been used to solve such problems. In this thesis we develop algorithms for solving clustering problems in large data sets using various similarity measures. Such similarity measures are based on the squared LDoctor of Philosoph

    IEEE Transactions On Neural Networks And Learning Systems : Vol. 24, No. 8, August 2013

    No full text
    1. Sampled-data exponential synchronization of complex dynamical networks with time-varying coupling delay. 2. Dictionary learning-based subspace structure identification in spectral clustering. 3. Knowledge-leverage-based TSK Fuzzy system modeling. 4. A Cognitive fault diagnosis system for distributed sensor networks. 5. Boundedness and complete stability of complex-valued neural networks with time delay. 6. Fast neuromimetric object recognition using FPGA outperforms GPU implementations. 7. Improving the quality of self-organizing maps by self-intersection avoidance. 8. Quantum-based algorithm for optimizing artificial neural networks. 9. Hinging hyperplanes for time-series segmentation. 10. Ranking graph embedding for learning to rerank. 11. Feasibility and finite convergence analysis for accurate on-line v-support vector machine. 12. Exponential synchronization of coupled switched neural networks with mode-dependent impulsive effects, 13.Analysis of boundedness and convergence of online gradient method for two-layer feedforward neural networks. 14. Phase-noise-induced resonance in arrays of coupled excitable neural models. 15. Call for pappers: WCCI 2014 Etc

    IEEE Transactions On Neural Networks And Learning Systems : Vol. 24, No. 8, August 2013

    No full text
    1. Sampled-data exponential synchorinization of complex dynamical networks with time-varying coupling delay. 2. Dictionary learning-based subspace structure identification in spectral clustering. 3. Knowledge-leverage-based TSK Fuzzy System modeling. 4. A Cognitive fault diagnosis system for distributed sensor networks. 5. Boundedness and complete stability of complex-valued neural networks with time delay. 6. Fast neuromimetic object recognition using FPGA outperforms GPU implementations. 7. Improving the quality of self-organizing maps by self-intersection avoidance. 8. Quantum-based algorithm for optimizing artificial neural networks. 9. Hinging hyperplanes for time-series segmentation. 10. Ranking graph embedding for learning to rerank. 11. Feasibility abd finite convergence analysis for accurate on-line v-support vector machine. 12. Exponential synchronization of coupled switched neural networks with mode-dependent impulse effects. 13. Analysis of boudedness and convergence of online gradient method for two-layer feedforward neural networks. 14. Phase-noise-indeced resonance in arrays of coupled excitable neural models
    corecore