5,595 research outputs found
Faster k-Medoids Clustering: Improving the PAM, CLARA, and CLARANS Algorithms
Clustering non-Euclidean data is difficult, and one of the most used
algorithms besides hierarchical clustering is the popular algorithm
Partitioning Around Medoids (PAM), also simply referred to as k-medoids. In
Euclidean geometry the mean-as used in k-means-is a good estimator for the
cluster center, but this does not hold for arbitrary dissimilarities. PAM uses
the medoid instead, the object with the smallest dissimilarity to all others in
the cluster. This notion of centrality can be used with any (dis-)similarity,
and thus is of high relevance to many domains such as biology that require the
use of Jaccard, Gower, or more complex distances.
A key issue with PAM is its high run time cost. We propose modifications to
the PAM algorithm to achieve an O(k)-fold speedup in the second SWAP phase of
the algorithm, but will still find the same results as the original PAM
algorithm. If we slightly relax the choice of swaps performed (at comparable
quality), we can further accelerate the algorithm by performing up to k swaps
in each iteration. With the substantially faster SWAP, we can now also explore
alternative strategies for choosing the initial medoids. We also show how the
CLARA and CLARANS algorithms benefit from these modifications. It can easily be
combined with earlier approaches to use PAM and CLARA on big data (some of
which use PAM as a subroutine, hence can immediately benefit from these
improvements), where the performance with high k becomes increasingly
important.
In experiments on real data with k=100, we observed a 200-fold speedup
compared to the original PAM SWAP algorithm, making PAM applicable to larger
data sets as long as we can afford to compute a distance matrix, and in
particular to higher k (at k=2, the new SWAP was only 1.5 times faster, as the
speedup is expected to increase with k)
VEGF(164)-mediated inflammation is required for pathological, but not physiological, ischemia-induced retinal neovascularization
Hypoxia-induced VEGF governs both physiological retinal vascular development and pathological retinal neovascularization. In the current paper, the mechanisms of physiological and pathological neovascularization are compared and contrasted. During pathological neovascularization, both the absolute and relative expression levels for VEGF(164) increased to a greater degree than during physiological neovascularization. Furthermore, extensive leukocyte adhesion was observed at the leading edge of pathological, but not physiological, neovascularization. When a VEGF(164)-specific neutralizing aptamer was administered, it potently suppressed the leukocyte adhesion and pathological neovascularization, whereas it had little or no effect on physiological neovascularization. In parallel experiments, genetically altered VEGF(164)-deficient (VEGF(120/188)) mice exhibited no difference in physiological neovascularization when compared with wild-type (VEGF(+/+)) controls. In contrast, administration of a VEGFk-1/Fc fusion protein, which blocks all VEGF isoforms, led to significant suppression of both pathological and physiological neovascularization. In addition, the targeted inactivation of monocyte lineage cells with clodronate-liposomes led to the suppression of pathological neovascularization. Conversely, the blockade of T lymphocyte-mediated immune responses with an anti-CD2 antibody exacerbated pathological neovascularization. These data highlight important molecular and cellular differences between physiological and pathological retinal neovascularization. During pathological neovascularization, VEGF(164) selectively induces inflammation and cellular immunity. These processes provide positive and negative angiogenic regulation, respectively. Together, new therapeutic approaches for selectively targeting pathological, but not physiological, retinal neovascularization are outlined
Repeated, Delayed Torque Variations Following X-Ray Flux Enhancements in the Magnetar 1E 1048.1-5937
published_or_final_versio
Review of Recent Developments in the Random-Field Ising Model
A lot of progress has been made recently in our understanding of the
random-field Ising model thanks to large-scale numerical simulations. In
particular, it has been shown that, contrary to previous statements: the
critical exponents for different probability distributions of the random fields
and for diluted antiferromagnets in a field are the same. Therefore, critical
universality, which is a perturbative renormalization-group prediction, holds
beyond the validity regime of perturbation theory. Most notably, dimensional
reduction is restored at five dimensions, i.e., the exponents of the
random-field Ising model at five dimensions and those of the pure Ising
ferromagnet at three dimensions are the same.Comment: 11 pages, 4 figures, updated and extended version, to be published in
J. Stat. Phy
Ab initio prediction of Boron compounds arising from Borozene: Structural and electronic properties
Structure and electronic properties of two unusual boron clusters obtained by
fusion of borozene rings has been studied by means of first principles
calculations, based on the generalized-gradient approximation of the density
functional theory, and the semiempirical tight-binding method was used for the
transport calculations. The role of disorder has also been considered with
single vacancies and substitutional atoms. Results show that the pure boron
clusters are topologically planar and characterized by (3c-2e) bonds, which can
explain, together with the aromaticity (estimated by means of NICS), the
remarkable cohesive energy values obtained. Such feature makes these systems
competitive with the most stable boron clusters to date. On the contrary, the
introduction of impurities compromises stability and planarity in both cases.
The energy gap values indicate that these clusters possess a semiconducting
character, while when the larger system is considered, zero-values of the
density of states are found exclusively within the HOMO-LUMO gap. Electron
transport calculations within the Landauer formalism confirm these indications,
showing semiconductor-like low bias differential conductance for these
stuctures. Differences and similarities with Carbon clusters are highlighted in
the discussion.Comment: 10 pages, 2 tables, 5 figure
Recommended from our members
Epigenetic memory in induced pluripotent stem cells.
Somatic cell nuclear transfer and transcription-factor-based reprogramming revert adult cells to an embryonic state, and yield pluripotent stem cells that can generate all tissues. Through different mechanisms and kinetics, these two reprogramming methods reset genomic methylation, an epigenetic modification of DNA that influences gene expression, leading us to hypothesize that the resulting pluripotent stem cells might have different properties. Here we observe that low-passage induced pluripotent stem cells (iPSCs) derived by factor-based reprogramming of adult murine tissues harbour residual DNA methylation signatures characteristic of their somatic tissue of origin, which favours their differentiation along lineages related to the donor cell, while restricting alternative cell fates. Such an 'epigenetic memory' of the donor tissue could be reset by differentiation and serial reprogramming, or by treatment of iPSCs with chromatin-modifying drugs. In contrast, the differentiation and methylation of nuclear-transfer-derived pluripotent stem cells were more similar to classical embryonic stem cells than were iPSCs. Our data indicate that nuclear transfer is more effective at establishing the ground state of pluripotency than factor-based reprogramming, which can leave an epigenetic memory of the tissue of origin that may influence efforts at directed differentiation for applications in disease modelling or treatment
The actin-myosin regulatory MRCK kinases: regulation, biological functions and associations with human cancer
The contractile actin-myosin cytoskeleton provides much of the force required for numerous cellular activities such as motility, adhesion, cytokinesis and changes in morphology. Key elements that respond to various signal pathways are the myosin II regulatory light chains (MLC), which participate in actin-myosin contraction by modulating the ATPase activity and consequent contractile force generation mediated by myosin heavy chain heads. Considerable effort has focussed on the role of MLC kinases, and yet the contributions of the myotonic dystrophy-related Cdc42-binding kinases (MRCK) proteins in MLC phosphorylation and cytoskeleton regulation have not been well characterized. In contrast to the closely related ROCK1 and ROCK2 kinases that are regulated by the RhoA and RhoC GTPases, there is relatively little information about the CDC42-regulated MRCKα, MRCKβ and MRCKγ members of the AGC (PKA, PKG and PKC) kinase family. As well as differences in upstream activation pathways, MRCK and ROCK kinases apparently differ in the way that they spatially regulate MLC phosphorylation, which ultimately affects their influence on the organization and dynamics of the actin-myosin cytoskeleton. In this review, we will summarize the MRCK protein structures, expression patterns, small molecule inhibitors, biological functions and associations with human diseases such as cancer
Identification of key parameters controlling demographically structured vegetation dynamics in a land surface model: CLM4.5(FATES)
Vegetation plays an important role in regulating global carbon cycles and is a key component of the Earth system models (ESMs) that aim to project Earth's future climate. In the last decade, the vegetation component within ESMs has witnessed great progress from simple "big-leaf" approaches to demographically structured approaches, which have a better representation of plant size, canopy structure, and disturbances. These demographically structured vegetation models typically have a large number of input parameters, and sensitivity analysis is needed to quantify the impact of each parameter on the model outputs for a better understanding of model behavior. In this study, we conducted a comprehensive sensitivity analysis to diagnose the Community Land Model coupled to the Functionally Assembled Terrestrial Simulator, or CLM4.5(FATES). Specifically, we quantified the first- and second-order sensitivities of the model parameters to outputs that represent simulated growth and mortality as well as carbon fluxes and stocks for a tropical site with an extent of 1×1°. While the photosynthetic capacity parameter (Vc;max25) is found to be important for simulated carbon stocks and fluxes, we also show the importance of carbon storage and allometry parameters, which determine survival and growth strategies within the model. The parameter sensitivity changes with different sizes of trees and climate conditions. The results of this study highlight the importance of understanding the dynamics of the next generation of demographically enabled vegetation models within ESMs to improve model parameterization and structure for better model fidelity
Paternal obesity is associated with IGF2 hypomethylation in newborns: results from a Newborn Epigenetics Study (NEST) cohort
Data from epidemiological and animal model studies suggest that nutrition during pregnancy may affect the health status of subsequent generations. These transgenerational effects are now being explained by disruptions at the level of the epigenetic machinery. Besides in vitro environmental exposures, the possible impact on the reprogramming of methylation profiles at imprinted genes at a much earlier time point, such as during spermatogenesis or oogenesis, has not previously been considered. In this study, our aim was to determine associations between preconceptional obesity and DNA methylation profiles in the offspring, particularly at the differentially methylated regions (DMRs) of the imprinted Insulin-like Growth Factor 2 (IGF2) gene
- …
