Search CORE

5 research outputs found

Task-specific Word Identification from Short Texts Using a Convolutional Neural Network

Author: Wu Xintao
Xiang Yang
Yuan Shuhan
Publication venue
Publication date: 02/06/2017
Field of study

Task-specific word identification aims to choose the task-related words that best describe a short text. Existing approaches require well-defined seed words or lexical dictionaries (e.g., WordNet), which are often unavailable for many applications such as social discrimination detection and fake review detection. However, we often have a set of labeled short texts where each short text has a task-related class label, e.g., discriminatory or non-discriminatory, specified by users or learned by classification algorithms. In this paper, we focus on identifying task-specific words and phrases from short texts by exploiting their class labels rather than using seed words or lexical dictionaries. We consider the task-specific word and phrase identification as feature learning. We train a convolutional neural network over a set of labeled texts and use score vectors to localize the task-specific words and phrases. Experimental results on sentiment word identification show that our approach significantly outperforms existing methods. We further conduct two case studies to show the effectiveness of our approach. One case study on a crawled tweets dataset demonstrates that our approach can successfully capture the discrimination-related words/phrases. The other case study on fake review detection shows that our approach can identify the fake-review words/phrases.Comment: accepted by Intelligent Data Analysis, an International Journa

arXiv.org e-Print Archive

ScholarWorks@UARK

Crossref

UARK (University of Arkansas )

Developing Real Estate Automated Valuation Models by Learning from Heterogeneous Data Sources

Author: Bergadano Francesco
Bertilone Roberto
Paolotti Daniela
Ruffo Giancarlo
Publication venue: 'Penerbit UTM Press'
Publication date: 01/01/2021
Field of study

In this paper we propose a data acquisition methodology, and a Machine Learning solution for the partially automated evaluation of real estate properties. The novelty and importance of the approach lies in two aspects: (1) when compared to Automated Valuation Models (AVMs) as available to real estate operators, it is highly adaptive and non-parametric, and integrates diverse data sources; (2) when compared to Machine Learning literature that has addressed real estate applications, it is more directly linked to the actual business processes of appraisal companies: in this context prices that are advertised online are normally not the most relevant source of information, while an appraisal document must be proposed by an expert and approved by a validator, possibly with the help of technological tools. We describe a case study using a set of 7988 appraisal documents for residential properties in Turin, Italy. Open data were also used, including location, nearby points of interest, comparable property prices, and the Italian revenue service area code. The observed mean error as measured on an independent test set was around 21 K€, for an average property value of about 190 K€. The AVM described here can help the stakeholders in this process (experts, appraisal company) to provide a reference price to be used by the expert, to allow the appraisal company to validate their evaluations in a faster and cheaper way, to help the expert in listing a set of comparable properties, that need to be included in the appraisal document

International Journal of Real Estate Studies

Archivio Istituzionale della Ricerca- Università del Piemonte Orientale

Developing Real Estate Automated Valuation Models by Learning from Heterogeneous Data Sources

Author: Bergadano Francesco
Bertilone Roberto
Paolotti Daniela
Ruffo Giancarlo
Publication venue: 'Penerbit UTM Press'
Publication date: 23/06/2021
Field of study

International Journal of Real Estate Studies

Stochastic Modeling of Semantic Structures of Online Movie Reviews

Author: Pu Limeng
Publication venue: LSU Digital Commons
Publication date: 01/01/2015
Field of study

Facing the enormous volumes of data available nowadays, we try to extract useful information from the data by properly modeling and characterizing the data. In this thesis, we focus on one particular type of semantic data --- online movie reviews, which can be found on all major movie websites. Our objective is mining movie review data to seek quantifiable patterns between reviews on the same movie, or reviews from the same reviewer. A novel approach is presented in this thesis to achieve this goal. The key idea is converting a movie review text into a list of tuples, where each tuple contains four elements: feature word, category of feature word, opinion word and polarity of opinion word. Then we further convert each tuple into an 18-dimension vector. Given a multinomial distribution representing a movie review, we can systematically and consistently quantify the similarity and dependence between reviews made by the same or different reviewers using metrics including KL distance and distance correlation, respectively. Such comparisons allow us to find reviewers sharing similarity in generated multinomial distributions, or demonstrating correlation patterns to certain extent. Among the identified pairs of frequent reviewers, we further investigate the category-wise dependency relationships between two reviewers, which are further captured by our proposed ordinary least square estimation models. The proposed data processing approaches, as well as the corresponding modeling framework, could be further leveraged to develop classification, prediction, and common randomness extraction algorithms for semantic movie review data

Louisiana State University

Incorporating appraisal expression patterns into topic modeling for aspect and sentiment word identification

Author: Baccianella
Blei
Bloom
Bloom
Brody
Bross
Bunescu
Carenini
Dave
De Marneffe
Ding
Fang
Ganapathibhotla
Ghani
Glorot
Griffiths
Guo
Hofmann
Hu
Hu
Jakob
Jo
Kim
Kobayashi
Kwei-Jay Lin
Lin
Lin
Lin
Liu
Liu
Liu
Lu
Lu
Mei
Meina Song
Miyoshi
Moghaddam
Moghaddam
Pan
Pang
Pang
Pang
Popescu
Qiu
Quirk
Riloff
Shaikh
Shi
Sim
Snow
Stoyanov
Su
Titov
Turney
Wang
Wang
Wogenstein
Xiaolin Zheng
Xiaowei Wang
Yu
Zhang
Zhao
Zhen Lin
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref