Search CORE

2 research outputs found

Survey on Hinglish to English Translation and Classification Techniques

Author: D’Souza Nicole
Patel Devarsh
Rao Ashwini
Saravta Jigyashu
Publication venue: Auricle Global Society of Education and Research
Publication date: 01/09/2023
Field of study

Code-mixing is the process of using many languages in one sentence and has a widespread occurrence in multilingual communities. It is particularly prevalent in texts on social media. Due to the widespread usage of social networking sites, a substantial amount of unstructured text is produced. Hinglish, i.e. code-mixed Hindi and English, is a frequent occurrence in everyday language use in India. Hence, a translation process is required to help monolingual users and to aid in the comprehension of language processing models. In this paper, we study the effective techniques for classification and translation tasks and also find gaps and challenges in the current research domain. After comparing a few existing methodologies for machine translation, a framework which showed an improvement in task of translation over the previous methods is proposed. &nbsp

International Journal on Recent and Innovation Trends in Computing and Communication

Modeling Classifier for Code Mixed Cross Script Questions.

Author: Akshit Bhatia
Rupal Bhargava
Shubham Khandelwal
Yashvardhan Sharma
Publication venue
Publication date: 01/01/2016
Field of study

ABSTRACT With a boom in the internet, the social media text had been increasing day by day and the user generated content (such as tweets and blogs) in Indian languages are written using Roman script due to various socio-cultural and technological reasons. A majority of these posts are multilingual in nature and many involve code mixing where lexical items and grammatical features from two languages appear in one sentence. Focusing on this current multilingual scenario, code-mixed cross-script (i.e., non-native script) data gives rise to a new problem and presents serious challenges to automatic Question Answering (QA) and for this question classification will be required which is an important step towards QA. This paper proposes an approach to handle cross script question classification as it is an important task of question analysis which detects the category of the question

CiteSeerX