Search CORE

4,971 research outputs found

BILINGUALISM AND THE MAINTENANCE OF THE MOTHER TONGUE IN MULTILINGUAL SINGAPORE

Author: Bakar Mukhlis Abu
Publication venue
Publication date: 09/08/2016
Field of study

Two issues have been identified that act against the development of a strong bilingual and biliteracy proficiency among pupils in Singapore schools: 1. English is perceived as having a much higher status than the Mother Tongue (MT) and the language in which success is primarily measured given its primacy as the language of instruction in all subjects except the MT. Consequently, children’s use of their MT has been observed to decline in favour of English; 2. English and the MT are taught in effect as a form of double monolingualism, in watertight compartments, with no opportunity for bilingual learning or reflection in class on the relationship between the two languages. This artificially blocks pupils’ access to the other language and prevents the development of a more robust bilingual. In this presentation, I will describe the ways in which the MT is maintained in Singapore, and in particular focus on the research that has recently been completed on the use of dual language books to rejuvenate the interest and ability to read in Malay in bilingual Malay children

Diponegoro University Institutional Repository

Cross-Lingual Adaptation using Structural Correspondence Learning

Author: Prettenhofer Peter
Stein Benno
Publication venue
Publication date: 25/08/2010
Field of study

Cross-lingual adaptation, a special case of domain adaptation, refers to the transfer of classification knowledge between two languages. In this article we describe an extension of Structural Correspondence Learning (SCL), a recently proposed algorithm for domain adaptation, for cross-lingual adaptation. The proposed method uses unlabeled documents from both languages, along with a word translation oracle, to induce cross-lingual feature correspondences. From these correspondences a cross-lingual representation is created that enables the transfer of classification knowledge from the source to the target language. The main advantages of this approach over other approaches are its resource efficiency and task specificity. We conduct experiments in the area of cross-language topic and sentiment classification involving English as source language and German, French, and Japanese as target languages. The results show a significant improvement of the proposed method over a machine translation baseline, reducing the relative error due to cross-lingual adaptation by an average of 30% (topic classification) and 59% (sentiment classification). We further report on empirical analyses that reveal insights into the use of unlabeled data, the sensitivity with respect to important hyperparameters, and the nature of the induced cross-lingual correspondences

arXiv.org e-Print Archive

CiteSeerX

Building a sign language corpus for use in machine translation

Author: Dandapat Sandipan
Gilchrist Shane
Morrissey Sara
Smith Robert
Somers Harold
Publication venue
Publication date: 01/01/2010
Field of study

In recent years data-driven methods of machine translation (MT) have overtaken rule-based approaches as the predominant means of automatically translating between languages. A pre-requisite for such an approach is a parallel corpus of the source and target languages. Technological developments in sign language (SL) capturing, analysis and processing tools now mean that SL corpora are becoming increasingly available. With transcription and language analysis tools being mainly designed and used for linguistic purposes, we describe the process of creating a multimedia parallel corpus specifically for the purposes of English to Irish Sign Language (ISL) MT. As part of our larger project on localisation, our research is focussed on developing assistive technology for patients with limited English in the domain of healthcare. Focussing on the first point of contact a patient has with a GP’s office, the medical secretary, we sought to develop a corpus from the dialogue between the two parties when scheduling an appointment. Throughout the development process we have created one parallel corpus in six different modalities from this initial dialogue. In this paper we discuss the multi-stage process of the development of this parallel corpus as individual and interdependent entities, both for our own MT purposes and their usefulness in the wider MT and SL research domains

CiteSeerX

Irish Universities

DCU Online Research Access Service

Marketing and Advertising Translation: Humans vs Machines in the field of cosmetics

Author: Alonso del Caño Carmen
Publication venue
Publication date: 01/01/2019
Field of study

This undergraduate thesis focuses on a very specific field of specialized translation: advertising and marketing translation. Indeed, the high degree of specialization involved in this activity provides a testing ground for a reconsideration of the importance of the human translator and a reformulation of their role. The constant development of new technologies creates ever more sophisticated translation programs, which in turn revives the long-standing machine vs human translation debate. The aim of this project is to conduct a practical exercise targeted at verifying whether specialization in translation always requires the supervision of humans equipped with the relevant linguistic knowledge and technical background, or whether, on the contrary, machine translation can at present provide valid enough results and a sufficient level of reliability.El presente Trabajo de Fin de Grado se centra en un campo muy concreto de la traducción especializada: la traducción para la publicidad y la mercadotecnia. De hecho, el alto grado de especialización que implica esta actividad proporciona un campo de pruebas para una reconsideración de la importancia del traductor humano y una reformulación de su papel. El desarrollo creciente e ininterrumpido de las nuevas tecnologías está produciendo programas de traducción cada vez más sofisticados, lo que a su vez reaviva el viejo debate que confronta la traducción humana y la traducción automática. El objetivo de este proyecto es llevar a cabo un ejercicio práctico destinado a verificar si la especialización en la traducción siempre requiere la supervisión de personas con la formación lingüística y los conocimientos técnicos pertinentes, o si, por el contrario, la traducción automática puede en la actualidad proporcionar por si sola resultados suficientes y un nivel suficiente de fiabilidad.Grado en Estudios Inglese

Repositorio Documental de la Universidad de Valladolid

Comparison of SYSTRAN and Google Translate for English→ Portuguese

Author: Anastasiou Dimitra
Gomes de Oliveira Rodrigo
Publication venue: 'Universitat Autonoma de Barcelona'
Publication date: 01/01/2011
Field of study

Two machine translation (MT) systems, a statistical MT (SMT) system and a hybrid system (rule-based and SMT) were tested in order to compare various MT performances. The source language was English (EN) and the target language Portuguese (PT). The SMT tool gave much fewer errors than the hybrid system. Major problem areas of both systems concerned the transfer of verb systems from source to target language, and of the hybrid system the word-to-word translation, since its resources are mainly dictionaries and not corpora.Dos sistemes de traducció automàtica (MT), un sistema estadístic (SMT) i un sistema híbrid (sistema basat en regles i SMT) es van posar a prova per tal de comparar divereses resultats en MT. La llengua d'orígen era l'anglès (EN) i a llengua de destí el portuguès (PT). L'eina SMT va donar força menys errors que el sistema híbrid. La major part dels problemes d'ambdes eines se centraven en el sistema de transferència del verb de la llengua origen a la llengua destí, i en el cas del sistema híbrid en la traducció paraula a paraula, ja que els seus recursos són sobretot diccionaris i no pas corpus.Se pusieron a prueba dos sistemas de traducción automática (MT), uno de de ellos estadístico (SMT) y el otro híbrido (basado en reglas y SMT) a fin de comparar diversos resultados en MT. La lengua de origen era el inglés (EN) y la lengua de destino el portugués (PT). La herramienta SMT generó bastantes menos errores que el sistema híbrido. La mayor parte de los problemas de ambos instrumentos se centraron en el método de transferencia del verbo de la lengua origen a la lengua destino, y en el caso del sistema híbrido, en la traducción palabra a palabra, ya que sus recursos son sobre todo diccionarios y no corpus

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Diposit Digital de Documents de la UAB

Natural Language Processing at the School of Information Studies for Africa

Author: Eriksson Gunnar
Fourla Athanassia
Gambäck Björn
Publication venue
Publication date: 01/01/2005
Field of study

The lack of persons trained in computational linguistic methods is a severe obstacle to making the Internet and computers accessible to people all over the world in their own languages. The paper discusses the experiences of designing and teaching an introductory course in Natural Language Processing to graduate computer science students at Addis Ababa University, Ethiopia, in order to initiate the education of computational linguists in the Horn of Africa region

Crossref

RISE – Research Institutes of Sweden

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Swedish Institute of Computer Science Publications Database

Natural language processing and advanced information management

Author: Hoard James E.
Publication venue
Publication date
Field of study

Integrating diverse information sources and application software in a principled and general manner will require a very capable advanced information management (AIM) system. In particular, such a system will need a comprehensive addressing scheme to locate the material in its docuverse. It will also need a natural language processing (NLP) system of great sophistication. It seems that the NLP system must serve three functions. First, it provides an natural language interface (NLI) for the users. Second, it serves as the core component that understands and makes use of the real-world interpretations (RWIs) contained in the docuverse. Third, it enables the reasoning specialists (RSs) to arrive at conclusions that can be transformed into procedures that will satisfy the users' requests. The best candidate for an intelligent agent that can satisfactorily make use of RSs and transform documents (TDs) appears to be an object oriented data base (OODB). OODBs have, apparently, an inherent capacity to use the large numbers of RSs and TDs that will be required by an AIM system and an inherent capacity to use them in an effective way

NASA Technical Reports Server