Search CORE

22 research outputs found

Cross-domain opinion word extraction model

Author: Chetviorkin Ilia
Loukachevitch Natalia
Publication venue
Publication date: 01/01/2012
Field of study

In this paper we consider a new approach for domain-specific opinion word extraction in Russian. We propose a set of statistical features and algorithm combination that can discriminate opinion words in a particular domain. The extraction model is trained in a movie domain and then applied to four other domains. We evaluate the quality of obtained sentiment lexicons intrinsically. Finally, our method is adapted to a movie domain in English and demonstrates comparable results

Institutional repository of Ural Federal University named after the first President of Russia B.N.Yeltsin

Domain-Specific Sentiment Lexicon for Classification

Author: Kay Thi Yar
Khine Khine Htwe
Nyein Thwet Thwet Aung
Su Su Htay
Thet Thet Zin
Win Win Thant
Publication venue
Publication date: 02/11/2017
Field of study

Nowadays people express their opinions about products, government policies, schemes and programs over social media sites using web or mobile. At the present time, in our country, government changes policies in every sector and people follow with the eyes or the mind on these policies and express their opinion by writing comments on social media especially using Facebook news media pages. Therefore, our research group intends to do sentiment analysis on new articles. Domain-specific sentiment lexicon has played an important role in opinion mining system. Due to the ubiquitous domain diversity and absence of domain-specific prior knowledge, construction of domain-specific lexicon has become a challenging research topic in recent year. In this paper, lexicon construction for sentiment analysis is described. In this work, there are two main steps: (1) pre-processing on raw data comments that are extracted from Facebook news media pages and (2) constructing lexicon for coming classification work. The word correlation and chi-square statistic are applied to construct lexicon as desired. Experimental results on comments datasets demonstrate that proposed approach is suitable for construction the domain-specific lexicon

MERAL Portal

Task-specific Word Identification from Short Texts Using a Convolutional Neural Network

Author: Wu Xintao
Xiang Yang
Yuan Shuhan
Publication venue
Publication date: 02/06/2017
Field of study

Task-specific word identification aims to choose the task-related words that best describe a short text. Existing approaches require well-defined seed words or lexical dictionaries (e.g., WordNet), which are often unavailable for many applications such as social discrimination detection and fake review detection. However, we often have a set of labeled short texts where each short text has a task-related class label, e.g., discriminatory or non-discriminatory, specified by users or learned by classification algorithms. In this paper, we focus on identifying task-specific words and phrases from short texts by exploiting their class labels rather than using seed words or lexical dictionaries. We consider the task-specific word and phrase identification as feature learning. We train a convolutional neural network over a set of labeled texts and use score vectors to localize the task-specific words and phrases. Experimental results on sentiment word identification show that our approach significantly outperforms existing methods. We further conduct two case studies to show the effectiveness of our approach. One case study on a crawled tweets dataset demonstrates that our approach can successfully capture the discrimination-related words/phrases. The other case study on fake review detection shows that our approach can identify the fake-review words/phrases.Comment: accepted by Intelligent Data Analysis, an International Journa

arXiv.org e-Print Archive

ScholarWorks@UARK

Crossref

UARK (University of Arkansas )

Identifying Entity Aspects in Microblog Posts

Author: M Breuss
Portland
Usa M T Oregon
Publication venue
Publication date: 24/04/2020
Field of study

Identifying Entity Aspects in Microblog Posts Spina, D.; Meij, E.J.; de Rijke, M.; Oghina, A.; Bui, M.T.; Breuss, M. General rights It is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), other than for strictly personal, individual use, unless the work is under an open content license (like Creative Commons). Disclaimer/Complaints regulations If you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, stating your reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Ask the Library: https://uba.uva.nl/en/contact, or a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam, The Netherlands. You will be contacted as soon as possible. ABSTRACT Online reputation management is about monitoring and handling the public image of entities (such as companies) on the Web. An important task in this area is identifying aspects of the entity of interest (such as products, services, competitors, key people, etc.) given a stream of microblog posts referring to the entity. In this paper we compare different IR techniques and opinion target identification methods for automatically identifying aspects and find that (i) simple statistical methods such as TF.IDF are a strong baseline for the task, significantly outperforming opinion-oriented methods, and (ii) only considering terms tagged as nouns improves the results for all the methods analyzed

CiteSeerX

Análisis de sentimientos de reseñas para determinar la acogida de un producto utilizando técnicas de machine learning y data mining

Author: Espitaleta Julián
García Kelly
Maza Jose
Publication venue: Barranquilla, Universidad del Norte, 2022
Publication date: 30/11/2022
Field of study

Leer múltiples reseñas de productos puede resultar tedioso, y concluir si un producto ha gustado o no a sus consumidores es complicado, por lo que es necesario implementar una herramienta que analice todas las reseñas de un producto y determine su polaridad. Lo anterior con el fin de agilizar y mejorar la toma de decisiones sobre un producto por parte de los interesados, así como la relación cliente-empresa, evaluando las reseñas bajo un mismo críterio. Durante el desarrollo del proyecto se diseñó e implementó la estrategia utilizando técnicas de Machine learning y Data mining para solucionar el problema planteado. Como resultado se implemento un modelo por medio de un dataset, luego se aplicó web scrapping a la página web de Amazon, un reconocido E-commerce, con el fin de extraer las reseñas de un producto dado, se visualizaron las reseñas de este a través de librerías de Python para luego ser procesadas y así realizar un analisis de sentimientos. Lo anterior permitió concluir la polaridad de un producto dado haciendo uso de tecnicas de machine learning y data mining.Reading multiple product reviews can be tedious, and concluding whether or not consumers liked a product is complicated, so it is necessary to implement a tool that analyzes all reviews of a product and determines their polarity. The foregoing in order to streamline and improve decision-making about a product by the interested parties, as well as the client-company relationship, evaluating the reviews under the same criteria. During the development of the project, the strategy was developed and implemented using Machine learning and Data mining techniques to solve the problem posed. As a result, a model was implemented through a data set, then web scrapping was applied to the Amazon website, a recognized E-commerce, in order to extract the reviews of a given product, the reviews of this product were displayed. through Python libraries to later be processed and thus carry out a sentiment analysis. The above concluded the polarity of a given product making use of machine learning and data mining techniques

Repositorio Digital de la Universidad del Norte

Построение модели для извлечения оценочной лексики в различных предметных областях

Author: I. Chetviorkin I.
N. Loukachevitch V.
Илья Четвёркин Игоревич
Наталья Лукашевич Валентиновна
Publication venue: 'P.G. Demidov Yaroslavl State University'
Publication date: 20/04/2013
Field of study

In this paper we consider a new approach for domain-specific opinion word extraction in the Russian language. We propose a set of statistical features and an algorithm combination that can extract opinion words in a particular domain. The extraction model was trained in the movie domain and then applied to four other domains. The quality of the obtained sentiment lexicons was evaluated intrinsically on the base of an expert markup and remained on the high level during the model transfer to various domains. Finally, our method is adapted to the movie domain in English and it demonstrated good results.В данной работе предлагается новый подход к извлечению оценочных слов для различных предметных областей. В рамках этого подхода была разработана модель, включающая набор характеристик и комбинацию алгоритмов, которые позволяют извлекать оценочные слова в конкретной предметной области. Данная модель была обучена в предметной области о фильмах и затем применена в четырёх других областях. Качество работы метода оценивалось на основании разметки экспертов и оставалось на высоком уровне при переносе модели на различные предметные области. Кроме того, созданная модель была использована в предметной области о фильмах на английском языке и продемонстрировала высокое качество извлечения оценочных слов

Modeling and Analysis of Information Systems / Моделирование и анализ информационных систем (МАИС)

Generating, Refining and Using Sentiment Lexicons

Author: Ackermans P.
de Rijke M.
Geleijnse G.
Jijkoun V.
Laan F.
Weerkamp W.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/11/2012
Field of study

Crossref

Springer - Publisher Connector

UvA-DARE

International Migration, Integration and Social Cohesion online publications

Exploiting External Collections for Query Expansion

Author: Amati G.
Arguello J.
Balog K.
Cartright M.-A.
Elsas J.
Ernsting B. J.
Fautsch C.
Hawking D.
Java A.
Jijkoun V.
Krisztian Balog
Kwok K. L.
Maarten de Rijke
Macdonald C.
Ounis I.
Ounis I.
Rocchio J.
Weerkamp W.
Westerveld T.
Wouter Weerkamp
Zhang W.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref