14,139 research outputs found
Density Matching for Bilingual Word Embedding
Recent approaches to cross-lingual word embedding have generally been based
on linear transformations between the sets of embedding vectors in the two
languages. In this paper, we propose an approach that instead expresses the two
monolingual embedding spaces as probability densities defined by a Gaussian
mixture model, and matches the two densities using a method called normalizing
flow. The method requires no explicit supervision, and can be learned with only
a seed dictionary of words that have identical strings. We argue that this
formulation has several intuitively attractive properties, particularly with
the respect to improving robustness and generalization to mappings between
difficult language pairs or word pairs. On a benchmark data set of bilingual
lexicon induction and cross-lingual word similarity, our approach can achieve
competitive or superior performance compared to state-of-the-art published
results, with particularly strong results being found on etymologically distant
and/or morphologically rich languages.Comment: Accepted by NAACL-HLT 201
Modeling Collaboration in Academia: A Game Theoretic Approach
In this work, we aim to understand the mechanisms driving academic
collaboration. We begin by building a model for how researchers split their
effort between multiple papers, and how collaboration affects the number of
citations a paper receives, supported by observations from a large real-world
publication and citation dataset, which we call the h-Reinvestment model. Using
tools from the field of Game Theory, we study researchers' collaborative
behavior over time under this model, with the premise that each researcher
wants to maximize his or her academic success. We find analytically that there
is a strong incentive to collaborate rather than work in isolation, and that
studying collaborative behavior through a game-theoretic lens is a promising
approach to help us better understand the nature and dynamics of academic
collaboration.Comment: Presented at the 1st WWW Workshop on Big Scholarly Data (2014). 6
pages, 5 figure
Results of the WMT19 metrics shared task: segment-level and strong MT systems pose big challenges
This paper presents the results of the WMT19 Metrics Shared Task. Participants were asked to score the outputs of the translations systems competing in the WMT19 News Translation Task with automatic metrics. 13 research groups submitted 24 metrics, 10 of which are reference-less "metrics" and constitute submissions to the joint task with WMT19 Quality Estimation Task, "QE as a Metric". In addition, we computed 11 baseline metrics, with 8 commonly applied baselines (BLEU, SentBLEU, NIST, WER, PER, TER, CDER, and chrF) and 3 reimplementations (chrF+, sacreBLEU-BLEU, and sacreBLEU-chrF). Metrics were evaluated on the system level, how well a given metric correlates with the WMT19 official manual ranking, and segment level, how well the metric correlates with human judgements of segment quality. This year, we use direct assessment (DA) as our only form of manual evaluation
Has the Ultra Low Emission Zone in London improved air quality?
London introduced the world's most stringent emissions zone, the Ultra Low Emission Zone (ULEZ), in April 2019 to reduce air pollutant emissions from road transport and accelerate compliance with the EU air quality standards. Combining meteorological normalisation, change point detection, and a regression discontinuity design with time as the forcing variable, we provide an ex-post causal analysis of air quality improvements attributable to the London ULEZ. We observe that the ULEZ caused only small improvements in air quality in the context of a longer-term downward trend in London's air pollution levels. Structural changes in nitrogen dioxide (NO2) and ozone (O3) concentrations were detected at 70% and 24% of the (roadside and background) monitoring sites and amongst the sites that showed a response, the relative changes in air pollution ranged from −9% to 6% for NO2, −5% to 4% for O3, and −6% to 4% for particulate matter with an aerodynamic diameter less than 2.5 μm (PM2.5). Aggregating the responses across London, we find an average reduction of less than 3% for NO2 concentrations, and insignificant effects on O3 and PM2.5 concentrations. As other cities consider implementing similar schemes, this study implies that the ULEZ on its own is not an effective strategy in the sense that the marginal causal effects were small. On the other hand, the ULEZ is one of many policies implemented to tackle air pollution in London, and in combination these have led to improvements in air quality that are clearly observable. Thus, reducing air pollution requires a multi-faceted set of policies that aim to reduce emissions across sectors with coordination among local, regional and national government
- …