431 research outputs found
Increasing Social Integration in an Interdisciplinary MA Programme through Group Work
In an interdisciplinary MA programme, it is especially important that the students get socially integrated from the beginning. Most often not only the place and people will be new but also the field of study. This can be difficult to handle without a network. In this project I will investigate using group work to help initiate social integration. In this context, I will also reflect on the different group work I conducted
The dynamic effects of tax audits
Understanding causes of and solutions to non-compliance is important for a tax authority. In this paper we study how and why audits affect reported tax in the years after audit – the dynamic effect – for individual income taxpayers. We exploit data from a random audit program covering more than 53,000 income tax self assessment returns in the UK, combined with data on the population of tax filers between 1999 and 2012. We first document that there is substantial
non-compliance in this population. One in three filers underreports the tax owed. Third party information on an income source does not predict whether a taxpayer is non-compliant on that income source, though it does predict the extent of underreporting. Using the random nature of the audits, we provide evidence of dynamic effects. Audits raise reported tax liabilities for at least five years after audit, implying an additional yield 1.5 times the direct revenue raised from the audit. The magnitude of the impact falls over time, and this decline is faster for
less autocorrelated income sources. Taking an event study approach, we further show that the change in reporting behaviour comes only from those found to have made errors in their tax report. Finally, using an extension of the Allingham-Sandmo (1972) model, we show that these results are best explained by audits providing the tax authority with information, which then constrains taxpayers’ ability to misreport
syntactic recordering in statistical machine translation
Reordering has been an important topic in statistical machine translation
(SMT) as long as SMT has been around. State-of-the-art SMT systems such
as Pharaoh (Koehn, 2004a) still employ a simplistic model of the reordering
process to do non-local reordering. This model penalizes any reordering no
matter the words. The reordering is only selected if it leads to a translation
that looks like a much better sentence than the alternative.
Recent developments have, however, seen improvements in translation
quality following from syntax-based reordering. One such development
is the pre-translation approach that adjusts the source sentence to resemble
target language word order prior to translation. This is done based on
rules that are either manually created or automatically learned from word
aligned parallel corpora.
We introduce a novel approach to syntactic reordering. This approach
provides better exploitation of the information in the reordering rules and
eliminates problematic biases of previous approaches. Although the approach
is examined within a pre-translation reordering framework, it easily
extends to other frameworks. Our approach significantly outperforms a
state-of-the-art phrase-based SMT system and previous approaches to pretranslation
reordering, including (Li et al., 2007; Zhang et al., 2007b; Crego
& Mari˜ no, 2007). This is consistent both for a very close language pair,
English-Danish, and a very distant language pair, English-Arabic.
We also propose automatic reordering rule learning based on a rich set
of linguistic information. As opposed to most previous approaches that
extract a large set of rules, our approach produces a small set of predominantly
general rules. These provide a good reflection of the main reordering
issues of a given language pair. We examine the influence of several
parameters that may have influence on the quality of the rules learned.
Finally, we provide a new approach for improving automatic word alignment.
This word alignment is used in the above task of automatically learning
reordering rules. Our approach learns from hand aligned data how to
combine several automatic word alignments to one superior word alignment.
The automatic word alignments are created from the same data that
has been preprocessed with different tokenization schemes. Thus utilizing
the different strengths that different tokenization schemes exhibit in word
alignment. We achieve a 38% error reduction for the automatic word alignmen
Mesoproterozoic paleogeography: Supercontinent and beyond
A set of global paleogeographic reconstructions for the 1770–1270 Ma time interval is presented here through a compilation of reliable paleomagnetic data (at the 2009 Nordic Paleomagnetic Workshop in Luleå, Sweden) and geological constraints. Although currently available paleomagnetic results do not rule out the possibility of the formation of a supercontinent as early as ca. 1750 Ma, our synthesis suggests that the supercontinent Nuna/Columbia was assembled by at least ca. 1650–1580 Ma through joining at least two stable continental landmasses formed by ca. 1.7 Ga: West Nuna (Laurentia, Baltica and possibly India) and East Nuna (North, West and South Australia, Mawson craton of Antarctica and North China). It is possible, but not convincingly proven, that Siberia and Congo/São Francisco were combined as a third rigid continental entity and collided with Nuna at ca.1500 Ma. Nuna is suggested to have broken up at ca. 1450–1380 Ma. West Nuna, Siberia and possibly Congo/São Francisco were rigidly connected until after 1270 Ma. East Nuna was deformed during the breakup, and North China separated from it. There is currently no strong evidence indicating that Amazonia, West Africa and Kalahari were parts of Nuna
Incremental Re-training for Post-editing SMT
A method is presented for incremental retraining
of an SMT system, in which a local
phrase table is created and incrementally updated
as a file is translated and post-edited.
It is shown that translation data from within
the same file has higher value than other
domain-specific data. In two technical domains,
within-file data increases BLEU score
by several full points. Furthermore, a strong
recency effect is documented; nearby data
within the file has greater value than more
distant data. It is also shown that the value
of translation data is strongly correlated with
a metric defined over new occurrences of ngrams.
Finally, it is argued that the incremental
re-training prototype could serve as the basis
for a practical system which could be interactively
updated in real time in a post-editing
setting. Based on the results here, such an interactive
system has the potential to dramatically
improve translation quality
Paleomagnetism, magnetic anisotropy and U-Pb baddeleyite geochronology of the early Neoproterozoic Blekinge-Dalarna dolerite dykes, Sweden
Correction DOI: 10.1016/j.precamres.2018.10.013Peer reviewe
Who needs an implantable cardioverter-defibrillator? Controversies and opportunities after DANISH
No abstract available
- …