1,209 research outputs found
Large-scale Hierarchical Alignment for Data-driven Text Rewriting
We propose a simple unsupervised method for extracting pseudo-parallel
monolingual sentence pairs from comparable corpora representative of two
different text styles, such as news articles and scientific papers. Our
approach does not require a seed parallel corpus, but instead relies solely on
hierarchical search over pre-trained embeddings of documents and sentences. We
demonstrate the effectiveness of our method through automatic and extrinsic
evaluation on text simplification from the normal to the Simple Wikipedia. We
show that pseudo-parallel sentences extracted with our method not only
supplement existing parallel data, but can even lead to competitive performance
on their own.Comment: RANLP 201
Infinite dimensional Lie algebras in 4D conformal quantum field theory
The concept of global conformal invariance (GCI) opens the way of applying
algebraic techniques, developed in the context of 2-dimensional chiral
conformal field theory, to a higher (even) dimensional space-time. In
particular, a system of GCI scalar fields of conformal dimension two gives rise
to a Lie algebra of harmonic bilocal fields, V_m(x,y), where the m span a
finite dimensional real matrix algebra M closed under transposition. The
associative algebra M is irreducible iff its commutant M' coincides with one of
the three real division rings. The Lie algebra of (the modes of) the bilocal
fields is in each case an infinite dimensional Lie algebra: a central extension
of sp(infty,R) corresponding to the field R of reals, of u(infty,infty)
associated to the field C of complex numbers, and of so*(4 infty) related to
the algebra H of quaternions. They give rise to quantum field theory models
with superselection sectors governed by the (global) gauge groups O(N), U(N),
and U(N,H)=Sp(2N), respectively.Comment: 16 pages, with minor improvements as to appear in J. Phys.
Character-level Chinese-English Translation through ASCII Encoding
Character-level Neural Machine Translation (NMT) models have recently
achieved impressive results on many language pairs. They mainly do well for
Indo-European language pairs, where the languages share the same writing
system. However, for translating between Chinese and English, the gap between
the two different writing systems poses a major challenge because of a lack of
systematic correspondence between the individual linguistic units. In this
paper, we enable character-level NMT for Chinese, by breaking down Chinese
characters into linguistic units similar to that of Indo-European languages. We
use the Wubi encoding scheme, which preserves the original shape and semantic
information of the characters, while also being reversible. We show promising
results from training Wubi-based models on the character- and subword-level
with recurrent as well as convolutional models.Comment: 7 pages, 3 figures, 3rd Conference on Machine Translation (WMT18),
201
2MASSJ22560844+5954299: the newly discovered cataclysmic star with the deepest eclipse
Context: The SW Sex stars are assumed to represent a distinguished stage in
CV evolution, making it especially important to study them. Aims: We discovered
a new cataclysmic star and carried out prolonged and precise photometric
observations, as well as medium-resolution spectral observations. Modelling
these data allowed us to determine the psysical parameters and to establish its
peculiarities. Results: The newly discovered vataclysmic variable
2MASSJ22560844+5954299 shows the deepest eclipse amongst the known nova-like
stars. It was reproduced by totally covering a very luminous accretion disk by
a red secondary component. The temperature distribution of the disk is flatter
than that of steady-state disk. The target is unusual with the combination of a
low mass ratio q~1.0 (considerably below the limit q=1.2 of stable mass
transfer of CVs) and an M-star secondary. The intensity of the observed three
emission lines, H_alpha, He 5875, and He 6678, sharply increases around phase
0.0, accompanied by a Doppler jump to the shorter wavelength. The absence of
eclipses of the emission lines and their single-peaked profiles means that they
originate mainly in a vertically extended hot-spot halo. The emission H_alpha
line reveals S-wave wavelength shifts with semi-amplitude of around 210 km/s
and phase lag of 0.03. Conclusions: The non-steady-state emission of the
luminous accretion disk of 2MASSJ22560844+5954299 was attributed to the low
viscosity of the disk matter caused by its unusually high temperature. The star
shows all spectral properties of an SW Sex variable apart from the 0.5 central
absorption.Comment: Accepted for publication in Astronomy & Astrophysics. 12 pages, 11
figures, 6 table
Identification of Distinct Bacillus thuringiensis 4A4 Nematicidal Factors Using the Model Nematodes Pristionchus pacificus and Caenorhabditis elegans
Bacillus thuringiensis has been extensively used for the biological control of insect pests. Nematicidal B. thuringiensis strains have also been identified; however, virulence factors of such strains are poorly investigated. Here, we describe virulence factors of the nematicidal B. thuringiensis 4A4 strain, using the model nematodes Pristionchus pacificus and Caenorhabditis elegans. We show that B. thuringiensis 4A4 kills both nematodes via intestinal damage. Whole genome sequencing of B. thuringiensis 4A4 identified Cry21Ha, Cry1Ba, Vip1/Vip2 and β-exotoxin as potential nematicidal factors. Only Cry21Ha showed toxicity to C. elegans, while neither Cry nor Vip toxins were active against P. pacificus, when expressed in E. coli. Purified crystals also failed to intoxicate P. pacificus, while autoclaved spore-crystal mixture of B. thuringiensis 4A4 retained toxicity, suggesting that primary β-exotoxin is responsible for P. pacificus killing. In support of this, we found that a β-exotoxin-deficient variant of B. thuringiensis 4A4, generated by plasmid curing lost virulence to the nematodes. Thus, using two model nematodes we revealed virulence factors of the nematicidal strain B. thuringiensis 4A4 and showed the multifactorial nature of its virulence
Refined physical properties and g',r',i',z',J,H,K transmission spectrum of WASP-23b from the ground
Multi-band observations of planetary transits using the telescope defocus
technique may yield high-quality light curves suitable for refining the
physical properties of exoplanets even with small or medium size telescopes.
Such observations can be used to construct a broad-band transmission spectrum
of transiting planets and search for the presence of strong absorbers. We have
thoroughly characterised the orbital ephemeris and physical properties of the
transiting planet and host star in the WASP-23b system, constructed a
broad-band transmission spectrum of WASP-23b and performed a comparative
analysis with theoretical models of hot Jupiters. We observed a complete
transit of WASP-23b in seven bands simultaneously, using the GROND instrument
on the MPG/ESO 2.2m telescope at La Silla Observatory and telescope
defocussing. The optical data were taken in the Sloan g',r',i' and z' bands.
The resulting light curves are of high quality, with a root-mean-square scatter
of the residual as low as 330ppm in the z'-band, with a cadence of 90s.
Near-infrared data were obtained in the JHK bands. We performed MCMC analysis
of our photometry plus existing radial velocity data to refine measurements of
the ephemeris and physical properties of the WASP-23. We constructed a
broad-band transmission spectrum of WASP-23b and compared it with a theoretical
transmission spectrum of a Hot Jupiter. We measured the central transit time
with a precision about 8s. From this and earlier observations we obtain an
orbital period of P=2.9444300+/-0.0000011d. Our analysis also yielded a larger
radius and mass for the planet (Rp=1.067+0.045-0.038 RJup and,
Mp=0.917+0.040-0.039MJup). The transmission spectrum is marginally flat, given
the limited precision of the measurements for the planet radius and poor
spectral resolution of the data.Comment: 8 pages, 5 figures, accepted for publication in Astronomy &
Astrophysic
Jacobi Identity for Vertex Algebras in Higher Dimensions
Vertex algebras in higher dimensions provide an algebraic framework for
investigating axiomatic quantum field theory with global conformal invariance.
We develop further the theory of such vertex algebras by introducing formal
calculus techniques and investigating the notion of polylocal fields. We derive
a Jacobi identity which together with the vacuum axiom can be taken as an
equivalent definition of vertex algebra.Comment: 35 pages, references adde
- …