8,324 research outputs found
A rigorous sequential update strategy for parallel kinetic Monte Carlo simulation
The kinetic Monte Carlo (kMC) method is used in many scientific fields in
applications involving rare-event transitions. Due to its discrete stochastic
nature, efforts to parallelize kMC approaches often produce unbalanced time
evolutions requiring complex implementations to ensure correct statistics. In
the context of parallel kMC, the sequential update technique has shown promise
by generating high quality distributions with high relative efficiencies for
short-range systems. In this work, we provide an extension of the sequential
update method in a parallel context that rigorously obeys detailed balance,
which guarantees exact equilibrium statistics for all parallelization settings.
Our approach also preserves nonequilibrium dynamics with minimal error for many
parallelization settings, and can be used to achieve highly precise sampling
Supervised Topical Key Phrase Extraction of News Stories using Crowdsourcing, Light Filtering and Co-reference Normalization
Fast and effective automated indexing is critical for search and personalized
services. Key phrases that consist of one or more words and represent the main
concepts of the document are often used for the purpose of indexing. In this
paper, we investigate the use of additional semantic features and
pre-processing steps to improve automatic key phrase extraction. These features
include the use of signal words and freebase categories. Some of these features
lead to significant improvements in the accuracy of the results. We also
experimented with 2 forms of document pre-processing that we call light
filtering and co-reference normalization. Light filtering removes sentences
from the document, which are judged peripheral to its main content.
Co-reference normalization unifies several written forms of the same named
entity into a unique form. We also needed a "Gold Standard" - a set of labeled
documents for training and evaluation. While the subjective nature of key
phrase selection precludes a true "Gold Standard", we used Amazon's Mechanical
Turk service to obtain a useful approximation. Our data indicates that the
biggest improvements in performance were due to shallow semantic features, news
categories, and rhetorical signals (nDCG 78.47% vs. 68.93%). The inclusion of
deeper semantic features such as Freebase sub-categories was not beneficial by
itself, but in combination with pre-processing, did cause slight improvements
in the nDCG scores.Comment: In 8th International Conference on Language Resources and Evaluation
(LREC 2012
The Casimir spectrum revisited
We examine the mathematical and physical significance of the spectral density
sigma(w) introduced by Ford in Phys. Rev. D38, 528 (1988), defining the
contribution of each frequency to the renormalised energy density of a quantum
field. Firstly, by considering a simple example, we argue that sigma(w) is well
defined, in the sense of being regulator independent, despite an apparently
regulator dependent definition. We then suggest that sigma(w) is a spectral
distribution, rather than a function, which only produces physically meaningful
results when integrated over a sufficiently large range of frequencies and with
a high energy smooth enough regulator. Moreover, sigma(w) is seen to be simply
the difference between the bare spectral density and the spectral density of
the reference background. This interpretation yields a simple `rule of thumb'
to writing down a (formal) expression for sigma(w) as shown in an explicit
example. Finally, by considering an example in which the sign of the Casimir
force varies, we show that the spectrum carries no manifest information about
this sign; it can only be inferred by integrating sigma(w).Comment: 10 pages, 4 figure
Hall-Effect Sign Anomaly and Small-Polaronic Conduction in (La_{1-x}Gd_x)_{0.67}Ca_{0.33}MnO_3
The Hall coefficient of Gd-doped La_{2/3}Ca_{1/3}MnO_3 exhibits Arrhenius
behavior over a temperature range from 2T_c to 4T_c, with an activation energy
very close to 2/3 that of the electrical conductivity. Although both the doping
level and thermoelectric coefficient indicate hole-like conduction, the Hall
coefficient is electron-like. This unusual result provides strong evidence in
favor of small-polaronic conduction in the paramagnetic regime of the
manganites.Comment: 11 pages, 4 figures, uses revtex.st
Smoothing Proximal Gradient Method for General Structured Sparse Learning
We study the problem of learning high dimensional regression models
regularized by a structured-sparsity-inducing penalty that encodes prior
structural information on either input or output sides. We consider two widely
adopted types of such penalties as our motivating examples: 1) overlapping
group lasso penalty, based on the l1/l2 mixed-norm penalty, and 2) graph-guided
fusion penalty. For both types of penalties, due to their non-separability,
developing an efficient optimization method has remained a challenging problem.
In this paper, we propose a general optimization approach, called smoothing
proximal gradient method, which can solve the structured sparse regression
problems with a smooth convex loss and a wide spectrum of
structured-sparsity-inducing penalties. Our approach is based on a general
smoothing technique of Nesterov. It achieves a convergence rate faster than the
standard first-order method, subgradient method, and is much more scalable than
the most widely used interior-point method. Numerical results are reported to
demonstrate the efficiency and scalability of the proposed method.Comment: arXiv admin note: substantial text overlap with arXiv:1005.471
- …