1,790 research outputs found

    A Review of Text Corpus-Based Tourism Big Data Mining

    Get PDF
    With the massive growth of the Internet, text data has become one of the main formats of tourism big data. As an effective expression means of tourists’ opinions, text mining of such data has big potential to inspire innovations for tourism practitioners. In the past decade, a variety of text mining techniques have been proposed and applied to tourism analysis to develop tourism value analysis models, build tourism recommendation systems, create tourist profiles, and make policies for supervising tourism markets. The successes of these techniques have been further boosted by the progress of natural language processing (NLP), machine learning, and deep learning. With the understanding of the complexity due to this diverse set of techniques and tourism text data sources, this work attempts to provide a detailed and up-to-date review of text mining techniques that have been, or have the potential to be, applied to modern tourism big data analysis. We summarize and discuss different text representation strategies, text-based NLP techniques for topic extraction, text classification, sentiment analysis, and text clustering in the context of tourism text mining, and their applications in tourist profiling, destination image analysis, market demand, etc. Our work also provides guidelines for constructing new tourism big data applications and outlines promising research areas in this field for incoming years

    A Review of Text Corpus-Based Tourism Big Data Mining

    Get PDF
    With the massive growth of the Internet, text data has become one of the main formats of tourism big data. As an effective expression means of tourists’ opinions, text mining of such data has big potential to inspire innovations for tourism practitioners. In the past decade, a variety of text mining techniques have been proposed and applied to tourism analysis to develop tourism value analysis models, build tourism recommendation systems, create tourist profiles, and make policies for supervising tourism markets. The successes of these techniques have been further boosted by the progress of natural language processing (NLP), machine learning, and deep learning. With the understanding of the complexity due to this diverse set of techniques and tourism text data sources, this work attempts to provide a detailed and up-to-date review of text mining techniques that have been, or have the potential to be, applied to modern tourism big data analysis. We summarize and discuss different text representation strategies, text-based NLP techniques for topic extraction, text classification, sentiment analysis, and text clustering in the context of tourism text mining, and their applications in tourist profiling, destination image analysis, market demand, etc. Our work also provides guidelines for constructing new tourism big data applications and outlines promising research areas in this field for incoming years

    Analyzing Tourism Online Reviews: An Extended Approach to Hierarchical Topic Detection Using Keyword Clustering

    Get PDF
    Tourism managers are increasingly turning to the online sphere to gain relevant customer insights. However, current approaches to analyzing vast and rapidly changing user-generated content (UGC) face several limitations. Supervised approaches require significant effort to provide pre-tagged training data and cannot dynamically identify topics mentioned in UGC. On the other hand, unsupervised approaches typically do not support different abstraction levels or enable a successive refinement of analysis in a drill-down manner, which is often expected as a practical requirement of tourism and destination management. Our research objective is, therefore, to extend current supervised approaches for identifying predefined topics by adopting unsupervised approaches using cluster analysis. The results emphasize that unsupervised approaches can (1) detect non-predefined topics dynamically with an accuracy similar to supervised approaches, thus demonstrating the potential to replace them and avoid the necessity of providing pre-tagged training data. (2) To build a topic hierarchy, unsupervised approaches sense more fine-grained topics as an enhancement of predefined topics on a lower level of abstraction, enabling more powerful drill-down-like analyses. Overall, the proposed extended approach to topic detection promises to support tourism management by meaningfully analyzing the increasing mass of visitors’ online feedback

    Sentiment Analysis of Textual Content in Social Networks. From Hand-Crafted to Deep Learning-Based Models

    Get PDF
    Aquesta tesi proposa diversos mètodes avançats per analitzar automàticament el contingut textual compartit a les xarxes socials i identificar les opinions, emocions i sentiments a diferents nivells d’anàlisi i en diferents idiomes. Comencem proposant un sistema d’anàlisi de sentiments, anomenat SentiRich, basat en un conjunt ric d’atributs, inclosa la informació extreta de lèxics de sentiments i models de word embedding pre-entrenats. A continuació, proposem un sistema basat en Xarxes Neurals Convolucionals i regressors XGboost per resoldre una sèrie de tasques d’anàlisi de sentiments i emocions a Twitter. Aquestes tasques van des de les tasques típiques d’anàlisi de sentiments fins a determinar automàticament la intensitat d’una emoció (com ara alegria, por, ira, etc.) i la intensitat del sentiment dels autors a partir dels seus tweets. També proposem un nou sistema basat en Deep Learning per solucionar el problema de classificació de les emocions múltiples a Twitter. A més, es va considerar el problema de l’anàlisi del sentiment depenent de l’objectiu. Per a aquest propòsit, proposem un sistema basat en Deep Learning que identifica i extreu l'objectiu dels tweets. Tot i que alguns idiomes, com l’anglès, disposen d’una àmplia gamma de recursos per permetre l’anàlisi del sentiment, a la majoria de llenguatges els hi manca. Per tant, utilitzem la tècnica d'anàlisi de sentiments entre idiomes per desenvolupar un sistema nou, multilingüe i basat en Deep Learning per a llenguatges amb pocs recursos lingüístics. Proposem combinar l’ajuda a la presa de decisions multi-criteri i anàlisis de sentiments per desenvolupar un sistema que permeti als usuaris la possibilitat d’explotar tant les opinions com les seves preferències en el procés de classificació d’alternatives. Finalment, vam aplicar els sistemes desenvolupats al camp de la comunicació de les marques de destinació a través de les xarxes socials. Amb aquesta finalitat, hem recollit tweets de persones locals, visitants i els gabinets oficials de Turisme de diferents destinacions turístiques i es van analitzar les opinions i les emocions compartides en ells. En general, els mètodes proposats en aquesta tesi milloren el rendiment dels enfocaments d’última generació i mostren troballes apassionants.Esta tesis propone varios métodos avanzados para analizar automáticamente el contenido textual compartido en las redes sociales e identificar opiniones, emociones y sentimientos, en diferentes niveles de análisis y en diferentes idiomas. Comenzamos proponiendo un sistema de análisis de sentimientos, llamado SentiRich, que está basado en un conjunto rico de características, que incluyen la información extraída de léxicos de sentimientos y modelos de word embedding previamente entrenados. Luego, proponemos un sistema basado en redes neuronales convolucionales y regresores XGboost para resolver una variedad de tareas de análisis de sentimientos y emociones en Twitter. Estas tareas van desde las típicas tareas de análisis de sentimientos hasta la determinación automática de la intensidad de una emoción (como alegría, miedo, ira, etc.) y la intensidad del sentimiento de los autores de los tweets. También proponemos un novedoso sistema basado en Deep Learning para abordar el problema de clasificación de emociones múltiples en Twitter. Además, consideramos el problema del análisis de sentimientos dependiente del objetivo. Para este propósito, proponemos un sistema basado en Deep Learning que identifica y extrae el objetivo de los tweets. Si bien algunos idiomas, como el inglés, tienen una amplia gama de recursos para permitir el análisis de sentimientos, la mayoría de los idiomas carecen de ellos. Por lo tanto, utilizamos la técnica de Análisis de Sentimiento Inter-lingual para desarrollar un sistema novedoso, multilingüe y basado en Deep Learning para los lenguajes con pocos recursos lingüísticos. Proponemos combinar la Ayuda a la Toma de Decisiones Multi-criterio y el análisis de sentimientos para desarrollar un sistema que brinde a los usuarios la capacidad de explotar las opiniones junto con sus preferencias en el proceso de clasificación de alternativas. Finalmente, aplicamos los sistemas desarrollados al campo de la comunicación de las marcas de destino a través de las redes sociales. Con este fin, recopilamos tweets de personas locales, visitantes, y gabinetes oficiales de Turismo de diferentes destinos turísticos y analizamos las opiniones y las emociones compartidas en ellos. En general, los métodos propuestos en esta tesis mejoran el rendimiento de los enfoques de vanguardia y muestran hallazgos interesa.This thesis proposes several advanced methods to automatically analyse textual content shared on social networks and identify people’ opinions, emotions and feelings at a different level of analysis and in different languages. We start by proposing a sentiment analysis system, called SentiRich, based on a set of rich features, including the information extracted from sentiment lexicons and pre-trained word embedding models. Then, we propose an ensemble system based on Convolutional Neural Networks and XGboost regressors to solve an array of sentiment and emotion analysis tasks on Twitter. These tasks range from the typical sentiment analysis tasks, to automatically determining the intensity of an emotion (such as joy, fear, anger, etc.) and the intensity of sentiment (aka valence) of the authors from their tweets. We also propose a novel Deep Learning-based system to address the multiple emotion classification problem on Twitter. Moreover, we considered the problem of target-dependent sentiment analysis. For this purpose, we propose a Deep Learning-based system that identifies and extracts the target of the tweets. While some languages, such as English, have a vast array of resources to enable sentiment analysis, most low-resource languages lack them. So, we utilise the Cross-lingual Sentiment Analysis technique to develop a novel, multi-lingual and Deep Learning-based system for low resource languages. We propose to combine Multi-Criteria Decision Aid and sentiment analysis to develop a system that gives users the ability to exploit reviews alongside their preferences in the process of alternatives ranking. Finally, we applied the developed systems to the field of communication of destination brands through social networks. To this end, we collected tweets of local people, visitors, and official brand destination offices from different tourist destinations and analysed the opinions and the emotions shared in these tweets

    Using sentiment analysis in tourism research: A systematic, bibliometric, and integrative review

    Full text link
    Purpose: Sentiment analysis is built from the information provided through text (reviews) to help understand the social sentiment toward their brand, product, or service. The main purpose of this paper is to draw an overview of the topics and the use of the sentiment analysis approach in tourism research. Methods: The study is a bibliometric analysis (VOSviewer), with a systematic and integrative review. The search occurred in March 2021 (Scopus) applying the search terms "sentiment analysis" and "tourism" in the title, abstract, or keywords, resulting in a final sample of 111 papers. Results: This analysis pointed out that China (35) and the United States (24) are the leading countries studying sentiment analysis with tourism. The first paper using sentiment analysis was published in 2012; there is a growing interest in this topic, presenting qualitative and quantitative approaches. The main results present four clusters to understand this subject. Cluster 1 discusses sentiment analysis and its application in tourism research, searching how online reviews can impact decision-making. Cluster 2 examines the resources used to make sentiment analysis, such as social media. Cluster 3 argues about methodological approaches in sentiment analysis and tourism, such as deep learning and sentiment classification, to understand the user-generated content. Cluster 4 highlights questions relating to the internet and tourism. Implications: The use of sentiment analysis in tourism research shows that government and entrepreneurship can draw and enhance communication strategies, reduce cost, and time, and mainly contribute to the decision-making process and understand consumer behavior

    Sentiment classification from reviews for tourism analytics

    Get PDF
    User-generated content is critical for tourism destination management as it could help them identify their customers' opinions and come up with solutions to upgrade their tourism organizations as it could help them identify customer opinions. There are many reviews on social media and it is difficult for these organizations to analyse the reviews manually. By applying sentiment classification, reviews can be classified into several classes and help ease decision-making. The reviews contain noisy contents, such as typos and emoticons, which could affect the accuracy of the classifiers. This study evaluates the reviews using Support Vector Machine and Random Forest models to identify a suitable classifier. The main phases in this study are data collection, data preparation, data labelling and modelling phases. The reviews are labelled into three sentiments; positive, neutral, and negative. During pre-processing, steps such as removing the missing value, tokenization, case folding, stop words removal, stemming, and applying n-grams are performed. The result of this research is evaluated by looking at the performance of the models based on accuracy where the result with the highest accuracy is chosen as the solution. In this study, data is data from TripAdvisor and Google reviews using web scraping tools. The findings show that the Support Vector Machine model with 5-fold cross-validation the most suitable classifier with an accuracy of 67.97% compared to Naive Bayes with 61.33% accuracy and Random Forest classifier with 63.55% accuracy. In conclusion, the result of this paper could provide important information in tourism besides determining the suitable algorithm to be used for Sentiment Analysis related to the tourism domain

    Application of an opinion consensus aggregation model based on OWA operators to the recommendation of tourist sites

    Get PDF
    Given the growth in tourism online data as a result of a large number of users posting their personal opinions in social networks and other online platforms with the idea to help other visitants, many authors have proposed a huge variety of ways to classify the sentiments contained in these opinions in order to recommend services (hotels, restaurants, etc.) and destinations to the users with the intention of facilitating their trip planning. In this paper, the authors propose a model to rank tourist sites of a city, based on OWA operators, with the objective of being used as a recommender system.The authors would like to acknowledge the financial support from the EU project H2020-MSCA-IF-2016- DeciTrustNET-746398. This paper has been elaborated with the financing of FEDER funds in the Spanish National research project TIN2016-75850-R

    Artificial Intelligence and Visual Analytics: A Deep-Learning Approach to Analyze Hotel Reviews & Responses

    Get PDF
    With a growing number of online reviews, consumers often rely on these reviews to make purchase decisions. However, little is known about managerial responses to online hotel reviews. This paper reports on a framework to integrate visual analytics and machine learning techniques to investigate whether hotel managers respond to positive and negative reviews differently and how to use a deep-learning approach to prioritize responses. In this study, forty 4- and 5-star hotels in London with 91,051 reviews and 70,397 responses were collected and analyzed. Visual analyses and machine learning were conducted. The results indicate most hotels (72.5%) showing no preference to respond to positive and negative reviews. Our proposed deep-learning approach outperformed existing algorithms to prioritize responses
    corecore