1,378 research outputs found
Asymptotic Level Density of the Elastic Net Self-Organizing Feature Map
Whileas the Kohonen Self Organizing Map shows an asymptotic level density
following a power law with a magnification exponent 2/3, it would be desired to
have an exponent 1 in order to provide optimal mapping in the sense of
information theory. In this paper, we study analytically and numerically the
magnification behaviour of the Elastic Net algorithm as a model for
self-organizing feature maps. In contrast to the Kohonen map the Elastic Net
shows no power law, but for onedimensional maps nevertheless the density
follows an universal magnification law, i.e. depends on the local stimulus
density only and is independent on position and decouples from the stimulus
density at other positions.Comment: 8 pages, 10 figures. Link to publisher under
http://link.springer.de/link/service/series/0558/bibs/2415/24150939.ht
Self-Organising Networks for Classification: developing Applications to Science Analysis for Astroparticle Physics
Physics analysis in astroparticle experiments requires the capability of
recognizing new phenomena; in order to establish what is new, it is important
to develop tools for automatic classification, able to compare the final result
with data from different detectors. A typical example is the problem of Gamma
Ray Burst detection, classification, and possible association to known sources:
for this task physicists will need in the next years tools to associate data
from optical databases, from satellite experiments (EGRET, GLAST), and from
Cherenkov telescopes (MAGIC, HESS, CANGAROO, VERITAS)
Recommended from our members
Soft topographic map for clustering and classification of bacteria
In this work a new method for clustering and building a
topographic representation of a bacteria taxonomy is presented. The method is based on the analysis of stable parts of the genome, the so-called “housekeeping genes”. The proposed method generates topographic maps of the bacteria taxonomy, where relations among different
type strains can be visually inspected and verified. Two well known DNA alignement algorithms are applied to the genomic sequences. Topographic maps are optimized to represent the similarity among the sequences according to their evolutionary distances. The experimental analysis is carried out on 147 type strains of the Gammaprotebacteria
class by means of the 16S rRNA housekeeping gene. Complete sequences of the gene have been retrieved from the NCBI public database. In the experimental tests the maps show clusters of homologous type strains and present some singular cases potentially due to incorrect classification
or erroneous annotations in the database
Phase transitions in optimal unsupervised learning
We determine the optimal performance of learning the orientation of the
symmetry axis of a set of P = alpha N points that are uniformly distributed in
all the directions but one on the N-dimensional sphere. The components along
the symmetry breaking direction, of unitary vector B, are sampled from a
mixture of two gaussians of variable separation and width. The typical optimal
performance is measured through the overlap Ropt=B.J* where J* is the optimal
guess of the symmetry breaking direction. Within this general scenario, the
learning curves Ropt(alpha) may present first order transitions if the clusters
are narrow enough. Close to these transitions, high performance states can be
obtained through the minimization of the corresponding optimal potential,
although these solutions are metastable, and therefore not learnable, within
the usual bayesian scenario.Comment: 9 pages, 8 figures, submitted to PRE, This new version of the paper
contains one new section, Bayesian versus optimal solutions, where we explain
in detail the results supporting our claim that bayesian learning may not be
optimal. Figures 4 of the first submission was difficult to understand. We
replaced it by two new figures (Figs. 4 and 5 in this new version) containing
more detail
JET ANALYSIS BY NEURAL NETWORKS IN HIGH ENERGY HADRON-HADRON COLLISIONS
We study the possibility to employ neural networks to simulate jet clustering
procedures in high energy hadron-hadron collisions. We concentrate our analysis
on the Fermilab Tevatron energy and on the algorithm. We consider both
supervised multilayer feed-forward network trained by the backpropagation
algorithm and unsupervised learning, where the neural network autonomously
organizes the events in clusters.Comment: 9 pages, latex, 2 figures not included
Probabilistic Quantum Memories
Typical address-oriented computer memories cannot recognize incomplete or
noisy information. Associative (content-addressable) memories solve this
problem but suffer from severe capacity shortages. I propose a model of a
quantum memory that solves both problems. The storage capacity is exponential
in the number of qbits and thus optimal. The retrieval mechanism for incomplete
or noisy inputs is probabilistic, with postselection of the measurement result.
The output is determined by a probability distribution on the memory which is
peaked around the stored patterns closest in Hamming distance to the input.Comment: Revised version to appear in Phys. Rev. Let
Mapping differential responses to cognitive training using machine learning.
We used two simple unsupervised machine learning techniques to identify differential trajectories of change in children who undergo intensive working memory (WM) training. We used self-organizing maps (SOMs)-a type of simple artificial neural network-to represent multivariate cognitive training data, and then tested whether the way tasks are represented changed as a result of training. The patterns of change we observed in the SOM weight matrices implied that the processes drawn upon to perform WM tasks changed following training. This was then combined with K-means clustering to identify distinct groups of children who respond to the training in different ways. Firstly, the K-means clustering was applied to an independent large sample (N = 616, Mage = 9.16 years, range = 5.16-17.91 years) to identify subgroups. We then allocated children who had been through cognitive training (N = 179, Mage = 9.00 years, range = 7.08-11.50 years) to these same four subgroups, both before and after their training. In doing so, we were able to map their improvement trajectories. Scores on a separate measure of fluid intelligence were predictive of a child's improvement trajectory. This paper provides an alternative approach to analysing cognitive training data that go beyond considering changes in individual tasks. This proof-of-principle demonstrates a potentially powerful way of distinguishing task-specific from domain-general changes following training and of establishing different profiles of response to training
Selection of radio pulsar candidates using artificial neural networks
Radio pulsar surveys are producing many more pulsar candidates than can be
inspected by human experts in a practical length of time. Here we present a
technique to automatically identify credible pulsar candidates from pulsar
surveys using an artificial neural network. The technique has been applied to
candidates from a recent re-analysis of the Parkes multi-beam pulsar survey
resulting in the discovery of a previously unidentified pulsar.Comment: Accepted for publication in Monthly Notices of the Royal Astronomical
Society. 9 pages, 7 figures, and 1 tabl
Clustering of gene expression data: performance and similarity analysis
BACKGROUND: DNA Microarray technology is an innovative methodology in experimental molecular biology, which has produced huge amounts of valuable data in the profile of gene expression. Many clustering algorithms have been proposed to analyze gene expression data, but little guidance is available to help choose among them. The evaluation of feasible and applicable clustering algorithms is becoming an important issue in today's bioinformatics research. RESULTS: In this paper we first experimentally study three major clustering algorithms: Hierarchical Clustering (HC), Self-Organizing Map (SOM), and Self Organizing Tree Algorithm (SOTA) using Yeast Saccharomyces cerevisiae gene expression data, and compare their performance. We then introduce Cluster Diff, a new data mining tool, to conduct the similarity analysis of clusters generated by different algorithms. The performance study shows that SOTA is more efficient than SOM while HC is the least efficient. The results of similarity analysis show that when given a target cluster, the Cluster Diff can efficiently determine the closest match from a set of clusters. Therefore, it is an effective approach for evaluating different clustering algorithms. CONCLUSION: HC methods allow a visual, convenient representation of genes. However, they are neither robust nor efficient. The SOM is more robust against noise. A disadvantage of SOM is that the number of clusters has to be fixed beforehand. The SOTA combines the advantages of both hierarchical and SOM clustering. It allows a visual representation of the clusters and their structure and is not sensitive to noises. The SOTA is also more flexible than the other two clustering methods. By using our data mining tool, Cluster Diff, it is possible to analyze the similarity of clusters generated by different algorithms and thereby enable comparisons of different clustering methods
Neuromorphic Detection of Vowel Representation Spaces
In this paper a layered architecture to spot and characterize vowel segments in running speech is presented. The detection process is based on neuromorphic principles, as is the use of Hebbian units in layers to implement lateral inhibition, band probability estimation and mutual exclusion. Results are presented showing how the association between the acoustic set of patterns and the phonologic set of symbols may be created. Possible applications of this methodology are to be found in speech event spotting, in the study of pathological voice and in speaker biometric characterization, among others
- …