1,151 research outputs found

    都市の持続可能性に向けた旅行行動と知的移動データ統合に関する包括的研究

    Get PDF
    過去数十年にわたり世界中で都市の持続可能性がトレンドとなり研究対象となっている.人々は,非効率な天然資源の消費や社会経済活動による環境破壊など,地球環境に有害な活動を行い,これには都市計画や交通計画を始め,多くの分野が密接に関係している.現在では,これらを解決する新技術の開発や応用が広範囲な研究分野で日々取り組まれている.本研究では観光に関する問題を,交通と都市の研究の観点からさまざまなビックデータを使用し,持続可能な都市開発を目標とした具体的な解決策を示した.本研究では都市や地域の持続可能性に資するデータの活用方法として,Wi-Fiパケットセンサーを使用した旅行者にとって魅力的な観光目的地マネジメントに関する研究,およびETCプローブデータを使用した旅行時間の信頼性の観測における天候の影響に関する分析を組み合わせて示した.本論文では,都市の移動性の認知に対して以下に示す3つの研究から,特徴的な結果と有効な分析手法を確立した.1)Wi-Fiパッケージセンシング調査を使用した,広域観光エリアでの周遊パターンのマイニングベースの関連法則の調査,2)Wi-Fi追跡データでの大規模な観光地の持続可能な開発に向けた魅力的な目的地の抽出,3)ETC2.0プローブデータを使用して,様々な道路タイプを考慮した旅行の信頼性に対する降雪の影響の評価.以上の研究から,複数視点の考察を積み重ね,包括的な評価と提案を行い,いくつかの重要な結果が得られた.この論文の貢献は,より良い社会への問題解決への糸口となり,今後の政策立案者にとって有意義な内容となるだろう.According to sustainability, the trend is spreading out around the world for past decades. There are many area subjects involved, such as city planning, transportation planning, and so on, because people realized human activities harmful to the environment by consuming natural resources with less efficiency process or damage environment by social and economic movements. Currently, emerging technologies considered for the proactive procedure in extensive study areas regarding new technology application and knowledge based. In term of transport and urban study, including tourism concerns, we used intelligent data from deferent sources to be demonstrating the possible solutions which involve sustainable urban development concept. In this study, as a method of utilizing data that contributes to the sustainability of cities and regions, consideration of attractive destination management for tourists by using wireless probe data, and the weather impact on travel time reliability observation by using electronic toll collection probe data, it represented as combination experiments throughout comprehensive study. This dissertation addressed three contribution studies to the composed acknowledgment of urban mobility, and it obtained the intelligent data and specific method of research-based. It consists of; 1) an association rule mining-based exploration of travel patterns in wide tourism areas using a Wi-Fi package sensing survey, 2) Attractive destinations mining towards massive tourism area sustainable development on Wi-Fi tracking data, and 3) Assessment of the impact of snowfall on travel reliability considering different road types using ETC2.0 probe data. Hence, a stack of varying viewpoints researches provided a comprehensive review and suggestion throughout significant results. The contribution of this dissertation could be an advantage substance for strategy and policies planner to recognize alternative solutions leading to a better society.室蘭工業大学 (Muroran Institute of Technology)博士(工学

    Association Rule Mining Tourist-Attractive Destinations for the Sustainable Development of a Large Tourism Area in Hokkaido Using Wi-Fi Tracking Data

    Get PDF
    The rise of radiofrequency scanner technology has led to its potential application in the observation of people’s movements. This study used aWi-Fi scanner device to track tourists’ traveling behavior inHokkaido’s tourismarea,whichoccupies a large regionthat features auniquenatural landscape. Inbound tourists have significantly increased in recent years; thus, tourism’s sustainability is considered to be important formaintaining the tourismatmosphere in the long term. Using internet-enabled technology to conduct extensive area surveys can overcome the limitations imposed by conventional methods. This study aims to use digital footprint data to describe and understand traveler mobility in a large tourism area in Hokkaido. Association rule mining (ARM)—a machine learning methodology—was performed on a large dataset of transactions to identify the rules that link destinations visited by tourists. This process resulted in the discovery of traveling patterns that revealed the association rules between destinations, and the attractiveness of the destinations was scored on the basis of visiting frequency, with both inbound and outbound movements considered. A visualization method was used to illustrate the relationships between destinations and simplify the mathematical descriptions of traveler mobility in an attractive tourism area. Hence, mining the attractiveness of destinations in a large tourism area using an ARMmethod integrated with aWi-Fi mobility tracking approach can provide accurate information that forms a basis for developing sustainable destination management and tourism policies

    A study on text-score disagreement in online reviews

    Get PDF
    In this paper, we focus on online reviews and employ artificial intelligence tools, taken from the cognitive computing field, to help understanding the relationships between the textual part of the review and the assigned numerical score. We move from the intuitions that 1) a set of textual reviews expressing different sentiments may feature the same score (and vice-versa); and 2) detecting and analyzing the mismatches between the review content and the actual score may benefit both service providers and consumers, by highlighting specific factors of satisfaction (and dissatisfaction) in texts. To prove the intuitions, we adopt sentiment analysis techniques and we concentrate on hotel reviews, to find polarity mismatches therein. In particular, we first train a text classifier with a set of annotated hotel reviews, taken from the Booking website. Then, we analyze a large dataset, with around 160k hotel reviews collected from Tripadvisor, with the aim of detecting a polarity mismatch, indicating if the textual content of the review is in line, or not, with the associated score. Using well established artificial intelligence techniques and analyzing in depth the reviews featuring a mismatch between the text polarity and the score, we find that -on a scale of five stars- those reviews ranked with middle scores include a mixture of positive and negative aspects. The approach proposed here, beside acting as a polarity detector, provides an effective selection of reviews -on an initial very large dataset- that may allow both consumers and providers to focus directly on the review subset featuring a text/score disagreement, which conveniently convey to the user a summary of positive and negative features of the review target.Comment: This is the accepted version of the paper. The final version will be published in the Journal of Cognitive Computation, available at Springer via http://dx.doi.org/10.1007/s12559-017-9496-

    From Data to Actions in Intelligent Transportation Systems: A Prescription of Functional Requirements for Model Actionability

    Get PDF
    Advances in Data Science permeate every field of Transportation Science and Engineering, resulting in developments in the transportation sector that are data-driven. Nowadays, Intelligent Transportation Systems (ITS) could be arguably approached as a “story” intensively producing and consuming large amounts of data. A diversity of sensing devices densely spread over the infrastructure, vehicles or the travelers’ personal devices act as sources of data flows that are eventually fed into software running on automatic devices, actuators or control systems producing, in turn, complex information flows among users, traffic managers, data analysts, traffic modeling scientists, etc. These information flows provide enormous opportunities to improve model development and decision-making. This work aims to describe how data, coming from diverse ITS sources, can be used to learn and adapt data-driven models for efficiently operating ITS assets, systems and processes; in other words, for data-based models to fully become actionable. Grounded in this described data modeling pipeline for ITS, we define the characteristics, engineering requisites and challenges intrinsic to its three compounding stages, namely, data fusion, adaptive learning and model evaluation. We deliberately generalize model learning to be adaptive, since, in the core of our paper is the firm conviction that most learners will have to adapt to the ever-changing phenomenon scenario underlying the majority of ITS applications. Finally, we provide a prospect of current research lines within Data Science that can bring notable advances to data-based ITS modeling, which will eventually bridge the gap towards the practicality and actionability of such models.This work was supported in part by the Basque Government for its funding support through the EMAITEK program (3KIA, ref. KK-2020/00049). It has also received funding support from the Consolidated Research Group MATHMODE (IT1294-19) granted by the Department of Education of the Basque Government

    NSL-BP: A Meta Classifier Model Based Prediction of Amazon Product Reviews

    Get PDF
    In machine learning, the product rating prediction based on the semantic analysis of the consumers' reviews is a relevant topic. Amazon is one of the most popular online retailers, with millions of customers purchasing and reviewing products. In the literature, many research projects work on the rating prediction of a given review. In this research project, we introduce a novel approach to enhance the accuracy of rating prediction by machine learning methods by processing the reviewed text. We trained our model by using many methods, so we propose a combined model to predict the ratings of products corresponding to a given review content. First, using k-means and LDA, we cluster the products and topics so that it will be easy to predict the ratings having the same kind of products and reviews together. We trained low, neutral, and high models based on clusters and topics of products. Then, by adopting a stacking ensemble model, we combine Naïve Bayes, Logistic Regression, and SVM to predict the ratings. We will combine these models into a two-level stack. We called this newly introduced model, NSL model, and compared the prediction performance with other methods at state of the art

    Spatio-temporal Databases in Urban Transportation

    Get PDF

    Probabilistic Personalized Recommendation Models For Heterogeneous Social Data

    Get PDF
    Content recommendation has risen to a new dimension with the advent of platforms like Twitter, Facebook, FriendFeed, Dailybooth, and Instagram. Although this uproar of data has provided us with a goldmine of real-world information, the problem of information overload has become a major barrier in developing predictive models. Therefore, the objective of this The- sis is to propose various recommendation, prediction and information retrieval models that are capable of leveraging such vast heterogeneous content. More specifically, this Thesis focuses on proposing models based on probabilistic generative frameworks for the following tasks: (a) recommending backers and projects in Kickstarter crowdfunding domain and (b) point of interest recommendation in Foursquare. Through comprehensive set of experiments over a variety of datasets, we show that our models are capable of providing practically useful results for recommendation and information retrieval tasks

    Self-disclosure model for classifying & predicting text-based online disclosure

    Full text link
    Les médias sociaux et les sites de réseaux sociaux sont devenus des babillards numériques pour les internautes à cause de leur évolution accélérée. Comme ces sites encouragent les consommateurs à exposer des informations personnelles via des profils et des publications, l'utilisation accrue des médias sociaux a généré des problèmes d’invasion de la vie privée. Des chercheurs ont fait de nombreux efforts pour détecter l'auto-divulgation en utilisant des techniques d'extraction d'informations. Des recherches récentes sur l'apprentissage automatique et les méthodes de traitement du langage naturel montrent que la compréhension du sens contextuel des mots peut entraîner une meilleure précision que les méthodes d'extraction de données traditionnelles. Comme mentionné précédemment, les utilisateurs ignorent souvent la quantité d'informations personnelles publiées dans les forums en ligne. Il est donc nécessaire de détecter les diverses divulgations en langage naturel et de leur donner le choix de tester la possibilité de divulgation avant de publier. Pour ce faire, ce travail propose le « SD_ELECTRA », un modèle de langage spécifique au contexte. Ce type de modèle détecte les divulgations d'intérêts, de données personnelles, d'éducation et de travail, de relations, de personnalité, de résidence, de voyage et d'accueil dans les données des médias sociaux. L'objectif est de créer un modèle linguistique spécifique au contexte sur une plate-forme de médias sociaux qui fonctionne mieux que les modèles linguistiques généraux. De plus, les récents progrès des modèles de transformateurs ont ouvert la voie à la formation de modèles de langage à partir de zéro et à des scores plus élevés. Les résultats expérimentaux montrent que SD_ELECTRA a surpassé le modèle de base dans toutes les métriques considérées pour la méthode de classification de texte standard. En outre, les résultats montrent également que l'entraînement d'un modèle de langage avec un corpus spécifique au contexte de préentraînement plus petit sur un seul GPU peut améliorer les performances. Une application Web illustrative est conçue pour permettre aux utilisateurs de tester les possibilités de divulgation dans leurs publications sur les réseaux sociaux. En conséquence, en utilisant l'efficacité du modèle suggéré, les utilisateurs pourraient obtenir un apprentissage en temps réel sur l'auto-divulgation.Social media and social networking sites have evolved into digital billboards for internet users due to their rapid expansion. As these sites encourage consumers to expose personal information via profiles and postings, increased use of social media has generated privacy concerns. There have been notable efforts from researchers to detect self-disclosure using Information extraction (IE) techniques. Recent research on machine learning and natural language processing methods shows that understanding the contextual meaning of the words can result in better accuracy than traditional data extraction methods. Driven by the facts mentioned earlier, users are often ignorant of the quantity of personal information published in online forums, there is a need to detect various disclosures in natural language and give them a choice to test the possibility of disclosure before posting. For this purpose, this work proposes "SD_ELECTRA," a context-specific language model to detect Interest, Personal, Education and Work, Relationship, Personality, Residence, Travel plan, and Hospitality disclosures in social media data. The goal is to create a context-specific language model on a social media platform that performs better than the general language models. Moreover, recent advancements in transformer models paved the way to train language models from scratch and achieve higher scores. Experimental results show that SD_ELECTRA has outperformed the base model in all considered metrics for the standard text classification method. In addition, the results also show that training a language model with a smaller pre-training context-specific corpus on a single GPU can improve its performance. An illustrative web application designed allows users to test the disclosure possibilities in their social media posts. As a result, by utilizing the efficiency of the suggested model, users would be able to get real-time learning on self-disclosure
    corecore