6,088 research outputs found
Ensemble clustering for result diversification
This paper describes the participation of the University of Twente in the Web track of TREC 2012. Our baseline approach uses the Mirex toolkit, an open source tool that sequantially scans all the documents. For result diversification, we experimented with improving the quality of clusters through ensemble clustering. We combined clusters obtained by different clustering methods (such as LDA and K-means) and clusters obtained by using different types of data (such as document text and anchor text). Our two-layer ensemble run performed better than the LDA based diversification and also better than a non-diversification run
Real time clustering of time series using triangular potentials
Motivated by the problem of computing investment portfolio weightings we
investigate various methods of clustering as alternatives to traditional
mean-variance approaches. Such methods can have significant benefits from a
practical point of view since they remove the need to invert a sample
covariance matrix, which can suffer from estimation error and will almost
certainly be non-stationary. The general idea is to find groups of assets which
share similar return characteristics over time and treat each group as a single
composite asset. We then apply inverse volatility weightings to these new
composite assets. In the course of our investigation we devise a method of
clustering based on triangular potentials and we present associated theoretical
results as well as various examples based on synthetic data.Comment: AIFU1
Diversity, Stability, Recursivity, and Rule Generation in Biological System: Intra-inter Dynamics Approach
Basic problems for the construction of a scenario for the Life are discussed.
To study the problems in terms of dynamical systems theory, a scheme of
intra-inter dynamics is presented. It consists of internal dynamics of a unit,
interaction among the units, and the dynamics to change the dynamics itself,
for example by replication (and death) of units according to their internal
states. Applying the dynamics to cell differentiation, isologous
diversification theory is proposed. According to it, orbital instability leads
to diversified cell behaviors first. At the next stage, several cell types are
formed, first triggered by clustering of oscillations, and then as attracting
states of internal dynamics stabilized by the cell-to-cell interaction. At the
third stage, the differentiation is determined as a recursive state by cell
division. At the last stage, hierarchical differentiation proceeds, with the
emergence of stochastic rule for the differentiation to sub-groups, where
regulation of the probability for the differentiation provides the diversity
and stability of cell society. Relevance of the theory to cell biology is
discussed.Comment: 19 pages, Int.J. Mod. Phes. B (in press
Coupled Maps with Growth and Death: An Approach to Cell Differentiation
An extension of coupled maps is given which allows for the growth of the
number of elements, and is inspired by the cell differentiation problem. The
growth of elements is made possible first by clustering the phases, and then by
differentiating roles. The former leads to the time sharing of resources, while
the latter leads to the separation of roles for the growth. The mechanism of
the differentiation of elements is studied. An extension to a model with
several internal phase variables is given, which shows differentiation of
internal states. The relevance of interacting dynamics with internal states
(``intra-inter" dynamics) to biological problems is discussed with an emphasis
on heterogeneity by clustering, macroscopic robustness by partial
synchronization and recursivity with the selection of initial conditions and
digitalization.Comment: LatexText,figures are not included. submitted to PhysicaD
(1995,revised 1996 May
Multivariate Approaches to Classification in Extragalactic Astronomy
Clustering objects into synthetic groups is a natural activity of any
science. Astrophysics is not an exception and is now facing a deluge of data.
For galaxies, the one-century old Hubble classification and the Hubble tuning
fork are still largely in use, together with numerous mono-or bivariate
classifications most often made by eye. However, a classification must be
driven by the data, and sophisticated multivariate statistical tools are used
more and more often. In this paper we review these different approaches in
order to situate them in the general context of unsupervised and supervised
learning. We insist on the astrophysical outcomes of these studies to show that
multivariate analyses provide an obvious path toward a renewal of our
classification of galaxies and are invaluable tools to investigate the physics
and evolution of galaxies.Comment: Open Access paper.
http://www.frontiersin.org/milky\_way\_and\_galaxies/10.3389/fspas.2015.00003/abstract\>.
\<10.3389/fspas.2015.00003 \&g
Global divergence of microbial genome sequences mediated by propagating fronts
We model the competition between recombination and point mutation in
microbial genomes, and present evidence for two distinct phases, one uniform,
the other genetically diverse. Depending on the specifics of homologous
recombination, we find that global sequence divergence can be mediated by
fronts propagating along the genome, whose characteristic signature on genome
structure is elucidated, and apparently observed in closely-related {\it
Bacillus} strains. Front propagation provides an emergent, generic mechanism
for microbial "speciation", and suggests a classification of microorganisms on
the basis of their propensity to support propagating fronts
Theory of Robustness of Irreversible Differentiation in a Stem Cell System: Chaos hypothesis
Based on extensive study of a dynamical systems model of the development of a
cell society, a novel theory for stem cell differentiation and its regulation
is proposed as the ``chaos hypothesis''. Two fundamental features of stem cell
systems - stochastic differentiation of stem cells and the robustness of a
system due to regulation of this differentiation - are found to be general
properties of a system of interacting cells exhibiting chaotic intra-cellular
reaction dynamics and cell division, whose presence does not depend on the
detail of the model. It is found that stem cells differentiate into other cell
types stochastically due to a dynamical instability caused by cell-cell
interactions, in a manner described by the Isologous Diversification theory.
This developmental process is shown to be stable not only with respect to
molecular fluctuations but also with respect to removal of cells. With this
developmental process, the irreversible loss of multipotency accompanying the
change from a stem cell to a differentiated cell is shown to be characterized
by a decrease in the chemical diversity in the cell and of the complexity of
the cellular dynamics. The relationship between the division speed and this
loss of multipotency is also discussed. Using our model, some predictions that
can be tested experimentally are made for a stem cell system.Comment: 31 pages, 10 figures, submitted to Jour. Theor. Bio
- …