Search CORE

420 research outputs found

A Comprehensive Review of Sentiment Analysis on Indian Regional Languages: Techniques, Challenges, and Trends

Author: Kale Sunil D.
Mahalle Parikshit N.
Mane Deepak T.
Potdar Girish P.
Prasad Rajesh
Upadhye Gopal D.
Publication venue: Auricle Global Society of Education and Research
Publication date: 31/08/2023
Field of study

Sentiment analysis (SA) is the process of understanding emotion within a text. It helps identify the opinion, attitude, and tone of a text categorizing it into positive, negative, or neutral. SA is frequently used today as more and more people get a chance to put out their thoughts due to the advent of social media. Sentiment analysis benefits industries around the globe, like finance, advertising, marketing, travel, hospitality, etc. Although the majority of work done in this field is on global languages like English, in recent years, the importance of SA in local languages has also been widely recognized. This has led to considerable research in the analysis of Indian regional languages. This paper comprehensively reviews SA in the following major Indian Regional languages: Marathi, Hindi, Tamil, Telugu, Malayalam, Bengali, Gujarati, and Urdu. Furthermore, this paper presents techniques, challenges, findings, recent research trends, and future scope for enhancing results accuracy

International Journal on Recent and Innovation Trends in Computing and Communication

Exploration of Corpus Augmentation Approach for English-Hindi Bidirectional Statistical Machine Translation System

Author: Gupta Deepa
Jaya K.
Publication venue: 'Institute of Advanced Engineering and Science'
Publication date: 01/06/2016
Field of study

Even though lot of Statistical Machine Translation(SMT) research work is happening for English-Hindi language pair, there is no effort done to standardize the dataset. Each of the research work uses different dataset, different parameters and different number of sentences during various phases of translation resulting in varied translation output. So comparing these models, understand the result of these models, to get insight into corpus behavior for these models, regenerating the result of these research work becomes tedious. This necessitates the need for standardization of dataset and to identify the common parameter for the development of model. The main contribution of this paper is to discuss an approach to standardize the dataset and to identify the best parameter which in combination gives best performance. It also investigates a novel corpus augmentation approach to improve the translation quality of English-Hindi bidirectional statistical machine translation system. This model works well for the scarce resource without incorporating the external parallel data corpus of the underlying language. This experiment is carried out using Open Source phrase-based toolkit Moses. Indian Languages Corpora Initiative (ILCI) Hindi-English tourism corpus is used. With limited dataset, considerable improvement is achieved using the corpus augmentation approach for the English-Hindi bidirectional SMT system

Crossref

Institute of Advanced Engineering and Science

International Conference on Human-Informed Translation and Interpreting Technology (HiT-IT 2023) Proceedings

Author: Corpas Pastor G.
Mitkov R.
Monti J.
Orăsan C.
Publication venue: Incoma Ltd.
Publication date: 01/01/2023
Field of study

Università degli Studi di Napoli L'Orientale: CINECA IRIS

Competing narratives on Bible translation in India:Missionary linguistics, postcolonial criticism and translation studies

Author: Israel Hephzibah
Publication venue: 'Oxford University Press (OUP)'
Publication date: 09/07/2020
Field of study

Edinburgh Research Explorer