Large Margin Neural Language Model
We propose a large margin criterion for training neural language models.
Conventionally, neural language models are trained by minimizing perplexity
(PPL) on grammatical sentences. However, we demonstrate that PPL may not be the
best metric to optimize in some tasks, and further propose a large margin
formulation. The proposed method aims to enlarge the margin between the "good"
and "bad" sentences in a task-specific sense. It is trained end-to-end and can
be widely applied to tasks that involve re-scoring of generated text. Compared
with minimum-PPL training, our method gains up to 1.1 WER reduction for speech
recognition and 1.0 BLEU increase for machine translation.
Comment: 9 pages. Accepted as a long paper in EMNLP201
Some Computations on Instanton Knot Homology
In a recent paper, the first author and his collaborator developed a method
to compute an upper bound of the dimension of instanton Floer homology via
Heegaard diagrams of 3-manifolds. For a knot inside S^3, we further develop an
algorithm that can compute an upper bound of the dimension of instanton knot
homology from knot diagrams. We test the effectiveness of the algorithm and
find that for all knots up to seven crossings, the algorithm provides sharp
bounds. In the second half of the paper, we show that, if the instanton knot
Floer homology of a knot has a specified form, then the knot must be an instanton
L-space knot.
Discovery of gamma-ray emission from a strongly lobe-dominated quasar 3C 275.1
We systematically analyze the 6-year Fermi/LAT data of the
lobe-dominated quasars (LDQs) in the complete LDQ sample from the 3CRR survey and
report the discovery of high-energy γ-ray emission from 3C 275.1. The
γ-ray emission of 3C 207 is confirmed and significant variability of the
lightcurve is identified. We do not find statistically significant γ-ray
emission from other LDQs. 3C 275.1 is the known γ-ray quasar with the
lowest core dominance parameter (i.e., ). We also show that both the
northern radio hotspot and parsec jet models can reasonably reproduce the
γ-ray data. The parsec jet model, however, is favored by the potential
γ-ray variability at the timescale of months. We suggest that some
dimmer γ-ray LDQs will be detected in the future and LDQs could
contribute non-negligibly to the extragalactic γ-ray background.
Comment: 26 pages, 10 figures, 3 tables; ApJ in press
Configuring innovative societies: the crossvergent role of cultural and institutional varieties
The study aims to explore why some societies are more innovative than others in high-technology sectors. Following a crossvergence perspective, we generate nine causal conditions by accommodating both cultural and institutional varieties: uncertainty avoidance, masculinity, individualism and power distance as culture indicators, and union density, skill development, market capitalization to credit, prevalence of cluster and state dominance as institutional indicators. Applying the configurational approach, we conducted fuzzy-set qualitative comparative analysis (fsQCA) on Organisation for Economic Co-operation and Development (OECD) member countries. We confirm the equal importance of both cultural and institutional mechanisms as contributors to national innovativeness, and identify equifinal configurations of cultural and institutional varieties as leading to a high-tech society. The implication is that a society can adjust or develop various cultural and/or institutional conditions to maintain or create leadership in innovation.