Semantic HMC for Big Data Analysis
Analyzing Big Data can help corporations improve their efficiency. In
this work we present a new vision to derive Value from Big Data using a
Semantic Hierarchical Multi-label Classification called Semantic HMC based on a
non-supervised Ontology learning process. We also propose a Semantic HMC
process, using scalable Machine-Learning techniques and Rule-based reasoning
Extracting the hierarchical organization of complex systems
Extracting understanding from the growing "sea" of biological and
socio-economic data is one of the most pressing scientific challenges facing
us. Here, we introduce and validate an unsupervised method that is able to
accurately extract the hierarchical organization of complex biological, social,
and technological networks. We define an ensemble of hierarchically nested
random graphs, which we use to validate the method. We then apply our method to
real-world networks, including the air-transportation network, an electronic
circuit, an email exchange network, and metabolic networks. We find that our
method enables us to obtain accurate multi-scale descriptions of a complex
system.
Comment: Figures in screen resolution. Version with full resolution figures
available at
http://amaral.chem-eng.northwestern.edu/Publications/Papers/sales-pardo-2007.pd
Hierarchical information clustering by means of topologically embedded graphs
We introduce a graph-theoretic approach to extract clusters and hierarchies
in complex data-sets in an unsupervised and deterministic manner, without the
use of any prior information. This is achieved by building topologically
embedded networks containing the subset of most significant links and analyzing
the network structure. For a planar embedding, this method provides both the
intra-cluster hierarchy, which describes the way clusters are composed, and the
inter-cluster hierarchy which describes how clusters gather together. We
discuss performance, robustness and reliability of this method by first
investigating several artificial data-sets, finding that it can significantly
outperform other established approaches. Then we show that our method can
successfully differentiate meaningful clusters and hierarchies in a variety of
real data-sets. In particular, we find that the application to gene expression
patterns of lymphoma samples uncovers biologically significant groups of genes
which play key roles in diagnosis, prognosis and treatment of some of the most
relevant human lymphoid malignancies.
Comment: 33 Pages, 18 Figures, 5 Table
From Data Topology to a Modular Classifier
This article describes an approach to designing a distributed and modular
neural classifier. This approach introduces a new hierarchical clustering that
enables one to determine reliable regions in the representation space by
exploiting supervised information. A multilayer perceptron is then associated
with each of these detected clusters and charged with recognizing elements of
the associated cluster while rejecting all others. The resulting global
classifier comprises a set of cooperating neural networks and is completed
by a K-nearest neighbor classifier charged with treating elements rejected by
all the neural networks. Experimental results for the handwritten digit
recognition problem and comparison with neural and statistical nonmodular
classifiers are given