Search CORE

14 research outputs found

Integrating source-language context into log-linear models of statistical machine translation

Author: Haque Rejwanul
Publication venue: Dublin City University. School of Computing
Publication date: 01/11/2011
Field of study

The translation features typically used in state-of-the-art statistical machine translation (SMT) model dependencies between the source and target phrases, but not among the phrases in the source language themselves. A swathe of research has demonstrated that integrating source context modelling directly into log-linear phrase-based SMT (PB-SMT) and hierarchical PB-SMT (HPB-SMT), and can positively influence the weighting and selection of target phrases, and thus improve translation quality. In this thesis we present novel approaches to incorporate source-language contextual modelling into the state-of-the-art SMT models in order to enhance the quality of lexical selection. We investigate the effectiveness of use of a range of contextual features, including lexical features of neighbouring words, part-of-speech tags, supertags, sentence-similarity features, dependency information, and semantic roles. We explored a series of language pairs featuring typologically different languages, and examined the scalability of our research to larger amounts of training data. While our results are mixed across feature selections, language pairs, and learning curves, we observe that including contextual features of the source sentence in general produces improvements. The most significant improvements involve the integration of long-distance contextual features, such as dependency relations in combination with part-of-speech tags in Dutch-to-English subtitle translation, the combination of dependency parse and semantic role information in English-to-Dutch parliamentary debate translation, supertag features in English-to-Chinese translation, or combination of supertag and lexical features in English-to-Dutch subtitle translation. Furthermore, we investigate the applicability of our lexical contextual model in another closely related NLP problem, namely machine transliteration

Irish Universities

DCU Online Research Access Service

Improving the role of language model in statistical machine translation (Indonesian-Javanese)

Author: Sujaini Herry
Publication venue: Institute of Advanced Engineering and Science
Publication date: 01/04/2020
Field of study

The statistical machine translation (SMT) is widely used by researchers and practitioners in recent years. SMT works with quality that is determined by several important factors, two of which are language and translation model. Research on improving the translation model has been done quite a lot, but the problem of optimizing the language model for use on machine translators has not received much attention. On translator machines, language models usually use trigram models as standard. In this paper, we conducted experiments with four strategies to analyze the role of the language model used in the Indonesian-Javanese translation machine and show improvement compared to the baseline system with the standard language model. The results of this research indicate that the use of 3-gram language models is highly recommended in SMT

ZENODO

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Institute of Advanced Engineering and Science

Unification-based constraints for statistical machine translation

Author: Williams Philip James
Publication venue: The University of Edinburgh
Publication date: 27/11/2014
Field of study

Morphology and syntax have both received attention in statistical machine translation research, but they are usually treated independently and the historical emphasis on translation into English has meant that many morphosyntactic issues remain under-researched. Languages with richer morphologies pose additional problems and conventional approaches tend to perform poorly when either source or target language has rich morphology. In both computational and theoretical linguistics, feature structures together with the associated operation of unification have proven a powerful tool for modelling many morphosyntactic aspects of natural language. In this thesis, we propose a framework that extends a state-of-the-art syntax-based model with a feature structure lexicon and unification-based constraints on the target-side of the synchronous grammar. Whilst our framework is language-independent, we focus on problems in the translation of English to German, a language pair that has a high degree of syntactic reordering and rich target-side morphology. We first apply our approach to modelling agreement and case government phenomena. We use the lexicon to link surface form words with grammatical feature values, such as case, gender, and number, and we use constraints to enforce feature value identity for the words in agreement and government relations. We demonstrate improvements in translation quality of up to 0.5 BLEU over a strong baseline model. We then examine verbal complex production, another aspect of translation that requires the coordination of linguistic features over multiple words, often with long-range discontinuities. We develop a feature structure representation of verbal complex types, using constraint failure as an indicator of translation error and use this to automatically identify and quantify errors that occur in our baseline system. A manual analysis and classification of errors informs an extended version of the model that incorporates information derived from a parse of the source. We identify clause spans and use model features to encourage the generation of complete verbal complex types. We are able to improve accuracy as measured using precision and recall against values extracted from the reference test sets. Our framework allows for the incorporation of rich linguistic information and we present sketches of further applications that could be explored in future work

Edinburgh Research Archive

Recommended from our members

The Roles of Language Models and Hierarchical Models in Neural Sequence-to-Sequence Prediction

Author: Stahlberg Felix
Publication venue: University of Cambridge
Publication date: 17/02/2020
Field of study

With the advent of deep learning, research in many areas of machine learning is converging towards the same set of methods and models. For example, long short-term memory networks are not only popular for various tasks in natural language processing (NLP) such as speech recognition, machine translation, handwriting recognition, syntactic parsing, etc., but they are also applicable to seemingly unrelated fields such as robot control, time series prediction, and bioinformatics. Recent advances in contextual word embeddings like BERT boast with achieving state-of-the-art results on 11 NLP tasks with the same model. Before deep learning, a speech recognizer and a syntactic parser used to have little in common as systems were much more tailored towards the task at hand. At the core of this development is the tendency to view each task as yet another data mapping problem, neglecting the particular characteristics and (soft) requirements tasks often have in practice. This often goes along with a sharp break of deep learning methods with previous research in the specific area. This work can be understood as an antithesis to this paradigm. We show how traditional symbolic statistical machine translation models can still improve neural machine translation (NMT) while reducing the risk for common pathologies of NMT such as hallucinations and neologisms. Other external symbolic models such as spell checkers and morphology databases help neural grammatical error correction. We also focus on language models that often do not play a role in vanilla end-to-end approaches and apply them in different ways to word reordering, grammatical error correction, low-resource NMT, and document-level NMT. Finally, we demonstrate the benefit of hierarchical models in sequence-to-sequence prediction. Hand-engineered covering grammars are effective in preventing catastrophic errors in neural text normalization systems. Our operation sequence model for interpretable NMT represents translation as a series of actions that modify the translation state, and can also be seen as derivation in a formal grammar.EPSRC grant EP/L027623/1 EPSRC Tier-2 capital grant EP/P020259/

Apollo (Cambridge)

A Formal Model of Ambiguity and its Applications in Machine Translation

Author: Dyer Christopher
Publication venue
Publication date: 01/01/2010
Field of study

Systems that process natural language must cope with and resolve ambiguity. In this dissertation, a model of language processing is advocated in which multiple inputs and multiple analyses of inputs are considered concurrently and a single analysis is only a last resort. Compared to conventional models, this approach can be understood as replacing single-element inputs and outputs with weighted sets of inputs and outputs. Although processing components must deal with sets (rather than individual elements), constraints are imposed on the elements of these sets, and the representations from existing models may be reused. However, to deal efficiently with large (or infinite) sets, compact representations of sets that share structure between elements, such as weighted finite-state transducers and synchronous context-free grammars, are necessary. These representations and algorithms for manipulating them are discussed in depth in depth. To establish the effectiveness and tractability of the proposed processing model, it is applied to several problems in machine translation. Starting with spoken language translation, it is shown that translating a set of transcription hypotheses yields better translations compared to a baseline in which a single (1-best) transcription hypothesis is selected and then translated, independent of the translation model formalism used. More subtle forms of ambiguity that arise even in text-only translation (such as decisions conventionally made during system development about how to preprocess text) are then discussed, and it is shown that the ambiguity-preserving paradigm can be employed in these cases as well, again leading to improved translation quality. A model for supervised learning that learns from training data where sets (rather than single elements) of correct labels are provided for each training instance and use it to learn a model of compound word segmentation is also introduced, which is used as a preprocessing step in machine translation

Digital Repository at the University of Maryland

Extending Hiero Decoding in Moses with Cube Growing

Author: Huang
Huang
Huang
Huang
Koehn
Koehn
Vilar
Vilar
Vilar
Vilar
Publication venue: 'Walter de Gruyter GmbH'
Publication date
Field of study

Crossref

Time and the quest for knowledge in the poetry of William Blake: a discussion of Tiriel, the Book of Urizen, the Song of Los and the Four Zoas

Author: Kittel Harald Alfred
Publication venue
Publication date: 01/01/1977
Field of study

The physical appearances and specific behaviour of the characters in Tiriel , even the subtly ironical choice of names, suggest Blake's persistent opposition to the prevalent materialist-determinist philosophy of his day and to any form of dogmatism. This opposition accounts for the imaginative assimilation of originally unrelated literary material within a new symbolic context. Human misery does not originate from innate limitations or from a primordial fall from Divine Grace. It is caused by the immanent phenomenon of legalism in thought,, ethics and aesthetics. Physical, intellectual and emotional oppression deformation and corruption begin in childhood and are primarily perpetrated and perpetuated by repressive methods of education. Har and Tiriel are self-centred promulgators and, together with the other members of their family, warped products of Natural Law and Natural Religion. Tiriel's quest demonstrates that an increase in empirical knowledge is not necessarily accompanied by spiritual progress, nor does it improve the human condition. The complex vagueness of aspects of the poem contributes toward a more definite shaping of Blake's thought and symbolism in his later 'prophecies.' Portions of The Book of Urizen may be read as satire directed against the philosophic premises of seventeenth and eighteenth-century rationalism in general, and of Locke's theory of knowledge,. in particular. Theme,, structure and symbolism of the poem reflect this opposition and implicitly affirm Blake's own idealist metaphysics of reality. Abstracted from Eternity, Urizen's monolithic world has no extrinsic cause. It is a projection of his limited self-awareness. However, his solipsism fails to resolve the persistent contradiction between ideality and reality, thought and thing, subject and object. Los imposes temporal order and physical form on Urizen's disorganised thoughts. The limited anthropomorphic universe, produced by this intervention is a prison for mind and body, thought and desire. Henceforth, sensation and reflection determine the will to act. Man has rendered himself dependent on the fictitious 'substance' of matter, and on an equally mysterious remote deity. Both are only known by their 'accidents.' Natural science and Natural Religion are their respective rationalised form of worship. Both the pursuits of knowledge and of happiness require the suspension of desire. In The Song of Los Blake adopts a supra-historical perspective. Representative personages from biblical history, the history of religions generally,, philosophy and science are associated by their common failure to sustain their visionary powers. Blake incorporates into his poetic typology of decline,, structural elements derived from biblical, classical and modern conceptions of history without adopting their respective philosophical backgrounds. The notion of scientific progress and the advance of civilisation, concurrent with linear historical process, are dismissed. The achievements of empirical science, organised religion and autocratic government--synonymous with intellectual and physical oppression--kindle Orcls "thought creating fires." Despite its apocalyptic connotations, his violent outburst is of a highly ambivalent nature. The Four Zoas adumbrates the spiritual history of mankind. The poem is also a complex epic phenomenology of the human mind. Eden is an aspect of ideal reality where natural and human organisms are identified, and where life is sustained by loving self-sacrifice. After the Han's Fall elemental uproar reflects the mind's regression to the level of a perturbed oceanic consciousness which can no longer integrate the dissociated phenomena of the generative world into a living human form, thriving on love and understanding. Nature is transformed into a self-engendering monster. The human mind is englobed by the illusion of reality conceived as external and material, and by a fatalistic view of temporal process. Nevertheless, both misconceptions impose a degree of stability and order on the anarchic forces released by the cosmic catastrophe. Man's Fall is due to the dissociation of reason and affection. "Mental forms" are externalised and idolised. Eventually, under Urizen's control, imaginative energy in forced into rigid geometric form and regular motion. The beautiful illusion of the pseudo-Platonic "Mundane Shell" reflects the essential structure of Urizen's intelligence. however, it does not provide a lasting solution to the human dilemma. after the Fall. After the collapse of his creation, Urizen explores his alien environment by empirical means. he is a prisoner of his own restricted conception of reality. Unexpectedly, in Night VII(a), the Spectre of Urthona and Los are transformed into labourers of the Apocalypse. Regenoration starts with the annihilation of 'self.' Aware of his responsibilities, Los builds Golgonoozat the city of art. Emulating Christ's self-sacrifice, visionary activity is a form of self-denial. Time becomes a function of imaginative creativity. The imaginative world created by Los incorporates visionary time and space. Natural existence is realised as being endowed with regenerative qualities. Los no longer rejects Orc but sublimates his energies. Orc's destructive powers become an integral aspect Of the Last Judgment. Throughout Night VIII the providential and redemptive character of mortal life is stressed. Plunging into "the river of space" is a baptismal, if painful, experience. Although guided by Divine Providence, individual man has to work for his own salvation. In Night IX prophetic and apocalyptic views are fused as Los acts in a temporal context when tearing down the material, social and metaphysical barriers to vision erected by Urizen. The symbolism of Revelation is employed to adumbrate the artist's ultimate task in history. History is not beyond human control. Submission to the "Divine Vision" is an active ethical achievement capable of generating a powerful social dynamic, rather than tentatively removing it. Tyranny is overthrown because once the visionary poet has revealed its deceptions, mankind follows his example and removes it physically. This optimistic vision of the Last Judgment is an affirmation of the poet's absolute faith in the power of inspired vision to regenerate and humanize all aspects of life in this world

Glasgow Theses Service

Writing as Material Practice: Substance, Surface and Medium

Author
Publication venue: 'Ubiquity Press, Ltd.'
Publication date: 01/01/2013
Field of study

Writing as Material Practice grapples with the issue of writing as a form of material culture in its ancient and more recent manifestations, and in the contexts of production and consumption. Fifteen case studies explore the artefactual nature of writing — the ways in which materials, techniques, colour, scale, orientation and visibility inform the creation of inscribed objects and spaces, as well as structure subsequent engagement, perception and meaning making. Covering a temporal span of some 5000 years, from c.3200 BCE to the present day, and ranging in spatial context from the Americas to the Near East, the chapters in this volume bring a variety of perspectives which contribute to both specific and broader questions of writing materialities. The authors also aim to place past graphical systems in their social contexts so they can be understood in relation to the people who created and attributed meaning to writing and associated symbolic modes through a diverse array of individual and wider social practices

UCL Discovery

Kenyon College Catalog 2017-2018

Author
Publication venue: Digital Kenyon: Research, Scholarship, and Creative Exchange
Publication date: 01/01/2017
Field of study

https://digital.kenyon.edu/coursecatalogs/1208/thumbnail.jp

Kenyon College: Digital Kenyon - Research, Scholarship, and Creative Exchange