684 research outputs found
How Many Topics? Stability Analysis for Topic Models
Topic modeling refers to the task of discovering the underlying thematic
structure in a text corpus, where the output is commonly presented as a report
of the top terms appearing in each topic. Despite the diversity of topic
modeling algorithms that have been proposed, a common challenge in successfully
applying these techniques is the selection of an appropriate number of topics
for a given corpus. Choosing too few topics will produce results that are
overly broad, while choosing too many will result in the "over-clustering" of a
corpus into many small, highly-similar topics. In this paper, we propose a
term-centric stability analysis strategy to address this issue, the idea being
that a model with an appropriate number of topics will be more robust to
perturbations in the data. Using a topic modeling approach based on matrix
factorization, evaluations performed on a range of corpora show that this
strategy can successfully guide the model selection process.Comment: Improve readability of plots. Add minor clarification
Generating asymptotically plane wave spacetimes
In an attempt to study asymptotically plane wave spacetimes which admit an
event horizon, we find solutions to vacuum Einstein's equations in arbitrary
dimension which have a globally null Killing field and rotational symmetry. We
show that while such solutions can be deformed to include ones which are
asymptotically plane wave, they do not posses a regular event horizon. If we
allow for additional matter, such as in supergravity theories, we show that it
is possible to have extremal solutions with globally null Killing field, a
regular horizon, and which, in addition, are asymptotically plane wave. In
particular, we deform the extremal M2-brane solution in 11-dimensional
supergravity so that it behaves asymptotically as a 10-dimensional vacuum plane
wave times a real line.Comment: 23 pages, 1 eps figure; harvmac; v2:refs added; v3:minor comments
adde
Two-phase behavior in strained thin films of hole-doped manganites
We present a study of the effect of biaxial strain on the electrical and
magnetic properties of thin films of manganites. We observe that manganite
films grown under biaxial compressive strain exhibit island growth morphology
which leads to a non-uniform distribution of the strain. Transport and magnetic
properties of these films suggest the coexistence of two different phases, a
metallic ferromagnet and an insulating antiferromagnet. We suggest that the
high strain regions are insulating while the low strain regions are metallic.
In such non-uniformly strained samples, we observe a large magnetoresistance
and a field-induced insulator to metal transition.Comment: 5 pages ReVTeX, 5 figures included, Figures 3, 4 and 5 low
resolution, high resolution figures available on request from authors,
submitted to Phys. Rev.
Universal Correlations in Pion-less EFT with the Resonating Group Model: Three and Four Nucleons
The Effective Field Theory "without pions" at next-to-leading order is used
to analyze universal bound state and scattering properties of the 3- and
4-nucleon system. Results of a variety of phase shift equivalent nuclear
potentials are presented for bound state properties of 3H and 4He, and for the
singlet S-wave 3He-neutron scattering length a_0(3He-n). The calculations are
performed with the Refined Resonating Group Method and include a full treatment
of the Coulomb interaction and the leading-order 3-nucleon interaction. The
results compare favorably with data and values from AV18(+UIX) model
calculations. A new correlation between a_0(3He-n) and the 3H binding energy is
found. Furthermore, we confirm at next-to-leading order the correlations,
already found at leading-order, between the 3H binding energy and the 3H charge
radius, and the Tjon line. With the 3H binding energy as input, we get
predictions of the Effective Field Theory "without pions" at next-to-leading
order for the root mean square charge radius of 3H of (1.6\pm 0.2) fm, for the
4He binding energy of (28\pm 2.5) MeV, and for Re(a_0(3He-n)) of (7.5\pm
0.6)fm. Including the Coulomb interaction, the splitting in binding energy
between 3H and 3He is found to be (0.66\pm 0.03) MeV. The discrepancy to data
of (0.10\mp 0.03) MeV is model independently attributed to higher order charge
independence breaking interactions. We also demonstrate that different results
for the same observable stem from higher order effects, and carefully assess
that numerical uncertainties are negligible. Our results demonstrate the
convergence and usefulness of the pion-less theory at next-to-leading order in
the 4He channel. We conclude that no 4-nucleon interaction is needed to
renormalize the theory at next-to-leading order in the 4-nucleon sector.Comment: 24 pages revtex4, including 8 figures as .eps files embedded with
includegraphicx, leading-order results added, calculations include the LO
three-nucleon interaction explicitly, comment on Wigner bound added, minor
modification
Landau-Ginzburg Description of Boundary Critical Phenomena in Two Dimensions
The Virasoro minimal models with boundary are described in the
Landau-Ginzburg theory by introducing a boundary potential, function of the
boundary field value. The ground state field configurations become non-trivial
and are found to obey the soliton equations. The conformal invariant boundary
conditions are characterized by the reparametrization-invariant data of the
boundary potential, that are the number and degeneracies of the stationary
points. The boundary renormalization group flows are obtained by varying the
boundary potential while keeping the bulk critical: they satisfy new selection
rules and correspond to real deformations of the Arnold simple singularities of
A_k type. The description of conformal boundary conditions in terms of boundary
potential and associated ground state solitons is extended to the N=2
supersymmetric case, finding agreement with the analysis of A-type boundaries
by Hori, Iqbal and Vafa.Comment: 42 pages, 13 figure
Diet and feeding in the Cape Cobra, Naja nivea
It is widely reported that African cobras (genus Naja) are generalist predators, but examination of formal literature yields relatively little evidence of this for most species. Here, we review the diet of cape cobras (Naja nivea) based on examination of museum specimens, an extensive literature review, and social media reported natural history observations. We quantify dietary niche breadth and test the hypothesis that cape cobras are generalist predators. Additionally, we test the hypotheses that diet varies significantly (1) across biomes, and (2) across winter, summer and year-round rainfall regions. We gathered 101 feeding records for cape cobras and based on family-level prey frequencies calculated Levinsâ measure of niche breadth as B = 6.57, which we standardised to BA = 0.29. We found no association between prey classes and biomes or rainfall region. Our data indicate that cape cobras consume a wide range of vertebrate prey from all four tetrapod classes, including a large number and diversity of snakes. Our findings suggest that cape cobras may be involved in several important ecological processes. Moreover, our study demonstrates the utility and value of collating natural history observations reported on social media platforms
Phylogeny of snakes (Serpentes): combining morphological and molecular data in likelihood Bayesian and parsimony analyses
Copyright © 2007 The Natural history MuseumThe phylogeny of living and fossil snakes is assessed using likelihood and parsimony approaches and a dataset combining 263 morphological characters with mitochondrial (2693 bp) and nuclear (1092 bp) gene sequences. The âno common mechanismâ (NCMr) and âMarkovianâ (Mkv) models were employed for the morphological partition in likelihood analyses; likelihood scores in the NCMr model were more closely correlated with parsimony tree lengths. Both models accorded relatively less weight to the molecular data than did parsimony, with the effect being milder in the NCMr model. Partitioned branch and likelihood support values indicate that the mtDNA and nuclear gene partitions agree more closely with each other than with morphology. Despite differences between data partitions in phylogenetic signal, analytic models, and relative weighting, the parsimony and likelihood analyses all retrieved the following widely accepted groups: scolecophidians, alethinophidians, cylindrophiines, macrostomatans (sensu lato) and caenophidians. Anilius alone emerged as the most basal alethinophidian; the combined analyses resulted in a novel and stable position of uropeltines and cylindrophiines as the second-most basal clade of alethinophidians. The limbed marine pachyophiids, along with Dinilysia and Wonambi, were always basal to all living snakes. Other results stable in all combined analyses include: Xenopeltis and Loxocemus were sister taxa (fide morphology) but clustered with pythonines (fide molecules), and Ungaliophis clustered with a boine-erycine clade (fide molecules). Tropidophis remains enigmatic; it emerges as a basal alethinophidian in the parsimony analyses (fide molecules) but a derived form in the likelihood analyses (fide morphology), largely due to the different relative weighting accorded to data partitions.Michael S. Y. Lee, Andrew F. Hugall, Robin Lawson & John D. Scanlo
- âŠ