12,283 research outputs found

    A Survey of Location Prediction on Twitter

    Full text link
    Locations, e.g., countries, states, cities, and point-of-interests, are central to news, emergency events, and people's daily lives. Automatic identification of locations associated with or mentioned in documents has been explored for decades. As one of the most popular online social network platforms, Twitter has attracted a large number of users who send millions of tweets on daily basis. Due to the world-wide coverage of its users and real-time freshness of tweets, location prediction on Twitter has gained significant attention in recent years. Research efforts are spent on dealing with new challenges and opportunities brought by the noisy, short, and context-rich nature of tweets. In this survey, we aim at offering an overall picture of location prediction on Twitter. Specifically, we concentrate on the prediction of user home locations, tweet locations, and mentioned locations. We first define the three tasks and review the evaluation metrics. By summarizing Twitter network, tweet content, and tweet context as potential inputs, we then structurally highlight how the problems depend on these inputs. Each dependency is illustrated by a comprehensive review of the corresponding strategies adopted in state-of-the-art approaches. In addition, we also briefly review two related problems, i.e., semantic location prediction and point-of-interest recommendation. Finally, we list future research directions.Comment: Accepted to TKDE. 30 pages, 1 figur

    A Personalized Travel Recommendation System Using Social Media Analysis

    Get PDF
    Personalization of recommender systems enables customized services to users. Social media is one resource that aids personalization. This study explores the use of twitter data to personalize travel recommendations. A machine learning classification model is used to identify travel related tweets. The travel tweets are then used to personalize recommendations regarding places of interest for the user. Places of interest are categorized as: historical buildings, museums, parks, and restaurants. To better personalize the model, travel tweets of the user\u27s friends and followers are also mined. Volunteer twitter users were asked to provide their twitter handle as well as rank their travel category preferences in a survey. We evaluated our model by comparing the predictions made by our model with the users choices in the survey. The evaluations show 68% prediction accuracy. The accuracy can be improved with a better travel-tweet training dataset as well as a better travel category identification technique using machine learning. The travel categories can be increased to include items like sports venues, musical events, entertainment, etc. and thereby fine-tune the recommendations. The proposed model lists \u27n\u27 places of interest from each category in proportion to the travel category score generated by the model

    Connecting Software Metrics across Versions to Predict Defects

    Full text link
    Accurate software defect prediction could help software practitioners allocate test resources to defect-prone modules effectively and efficiently. In the last decades, much effort has been devoted to build accurate defect prediction models, including developing quality defect predictors and modeling techniques. However, current widely used defect predictors such as code metrics and process metrics could not well describe how software modules change over the project evolution, which we believe is important for defect prediction. In order to deal with this problem, in this paper, we propose to use the Historical Version Sequence of Metrics (HVSM) in continuous software versions as defect predictors. Furthermore, we leverage Recurrent Neural Network (RNN), a popular modeling technique, to take HVSM as the input to build software prediction models. The experimental results show that, in most cases, the proposed HVSM-based RNN model has a significantly better effort-aware ranking effectiveness than the commonly used baseline models
    • …
    corecore