27 research outputs found

    Building a resource for studying translation shifts

    Full text link
    This paper describes an interdisciplinary approach which brings together the fields of corpus linguistics and translation studies. It presents ongoing work on the creation of a corpus resource in which translation shifts are explicitly annotated. Translation shifts denote departures from formal correspondence between source and target text, i.e. deviations that have occurred during the translation process. A resource in which such shifts are annotated in a systematic way will make it possible to study those phenomena that need to be addressed if machine translation output is to resemble human translation. The resource described in this paper contains English source texts (parliamentary proceedings) and their German translations. The shift annotation is based on predicate-argument structures and proceeds in two steps: first, predicates and their arguments are annotated monolingually in a straightforward manner. Then, the corresponding English and German predicates and arguments are aligned with each other. Whenever a shift - mainly grammatical or semantic -has occurred, the alignment is tagged accordingly.Comment: 6 pages, 1 figur

    名詞項構造付与データの構築

    Get PDF
    会議名: 言語資源活用ワークショップ2016, 開催地: 国立国語研究所, 会期: 2017年3月7日-8日, 主催: 国立国語研究所 コーパス開発センター含意認識タスクなど言語処理での文間の表現を取り扱う際,名詞の意味的な関係を捉える必要がある。言語学の分析から名詞の中には名詞の意味を補完する外部情報が必要なものが分かっており,生成語彙における特質構造(クオリア構造) として記述することが提案されている。また言語資源ではNomBank に代表されるように名詞の項構造を事例とともに構築されている。本研究では,先行研究で提案された特質構造を利用した名詞の項構造データを基に言語処理の観点からより形式化した構築法を提案する。具体的には名詞の項構造の例文を構築するとともに,項を同定し,述語との関係を項構造を通して結び付ける記述枠組である。述語のデータとして述語項構造シソーラスを利用し,NTCIR のRITE-2 で出現した名詞を対象に項構造の例文および対応する述語と項の関係を記述したデータを構築した。本稿では,記述枠組,および具体的に構築した名詞項構造データの事例を説明すると共に,付与での問題点や現状について記述する

    Tree Alignment through Semantic Role Annotation Projection

    Get PDF
    Proceedings of the Workshop on Annotation and Exploitation of Parallel Corpora AEPC 2010. Editors: Lars Ahrenberg, Jörg Tiedemann and Martin Volk. NEALT Proceedings Series, Vol. 10 (2010), 73-82. © 2010 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/15893

    Because Syntax does Matter: Improving Predicate-Argument Structures Parsing Using Syntactic Features

    Get PDF
    International audienceParsing full-fledged predicate-argument structures in a deep syntax framework requires graphs to be predicted. Using the DeepBank (Flickinger et al., 2012) and the Predicate-Argument Structure treebank (Miyao and Tsujii, 2005) as a test field, we show how transition-based parsers, extended to handle connected graphs, benefit from the use of topologically different syntactic features such as dependencies, tree fragments, spines or syntactic paths, bringing a much needed context to the parsing models, improving notably over long distance dependencies and elided coordinate structures. By confirming this positive impact on an accurate 2nd-order graph-based parser (Martins and Almeida, 2014), we establish a new state-of-the-art on these data sets

    Nominalization and Alternations in Biomedical Language

    Get PDF
    Background: This paper presents data on alternations in the argument structure of common domain-specific verbs and their associated verbal nominalizations in the PennBioIE corpus. Alternation is the term in theoretical linguistics for variations in the surface syntactic form of verbs, e.g. the different forms of stimulate in FSH stimulates follicular development and follicular development is stimulated by FSH. The data is used to assess the implications of alternations for biomedical text mining systems and to test the fit of the sublanguage model to biomedical texts. Methodology/Principal Findings: We examined 1,872 tokens of the ten most common domain-specific verbs or their zerorelated nouns in the PennBioIE corpus and labelled them for the presence or absence of three alternations. We then annotated the arguments of 746 tokens of the nominalizations related to these verbs and counted alternations related to the presence or absence of arguments and to the syntactic position of non-absent arguments. We found that alternations are quite common both for verbs and for nominalizations. We also found a previously undescribed alternation involving an adjectival present participle. Conclusions/Significance: We found that even in this semantically restricted domain, alternations are quite common, and alternations involving nominalizations are exceptionally diverse. Nonetheless, the sublanguage model applies to biomedica

    El bilingüismo y la enseñanza por proyectos de investigación en las aulas de primaria

    Get PDF
    La incorporación de la metodología por investigación y experimentación en la didáctica es una tarea fundamental que el educando tiene que realizar en su trabajo preparatorio como docente en el área que le corresponde. En la elaboración de ella, especialmente en la didáctica de inglés a través de la asignatura de Ciencias, el profesor tiene que emplear no solo las competencias que vienen elaboradas en el Currículo nacional, en el Currículo Regional y en el PEC, pero también con el conocimiento profundo de cada alumno y las familias, que el profesor debe tener, para poder así concluir con materiales que no solo ayudan el alumnado a dominar la materia, sino también motivarle y hacer que la experiencia del aprendizaje y autoaprendizaje resulta de la manera más agradable e inspirador, y la enseñanza a través de la investigación prueba a ser un método con resultados que aspiran en el caso del aprendizaje de una segunda idioma, lo más cerca posible a un nivel nativo.Investigation and experimentation in the field of language-learning for the Primary Education purposes falls in the category of everyday tasks for a teacher. In the subject of Science It is where all the work of a teacher as an organiser is focused with relation to the area corresponding to teaching. In the elaboration of it, the teacher must take into account and employ not only the Competencies that come established in the National, Regional Curriculum and ECP, but as well negotiate with a profound knowledge of the students that make up a classroom, their specifications and family background which they are part of. It is with this knowledge from where a teacher has to start from and conclude for the right materials and methodology, and teaching through investigation in Science has proven to be a method that that not only will help students to dominate the subject, but also motivate and make the learning experience a positive, joyful and inspirational one aiming closest to a native level of competence.Departamento de Filología InglesaGrado en Educación Primari

    Proceedings

    Get PDF
    Proceedings of the Workshop on Annotation and Exploitation of Parallel Corpora AEPC 2010. Editors: Lars Ahrenberg, Jörg Tiedemann and Martin Volk. NEALT Proceedings Series, Vol. 10 (2010), 98 pages. © 2010 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/15893
    corecore