388 research outputs found

    Content Reuse and Interest Sharing in Tagging Communities

    Full text link
    Tagging communities represent a subclass of a broader class of user-generated content-sharing online communities. In such communities users introduce and tag content for later use. Although recent studies advocate and attempt to harness social knowledge in this context by exploiting collaboration among users, little research has been done to quantify the current level of user collaboration in these communities. This paper introduces two metrics to quantify the level of collaboration: content reuse and shared interest. Using these two metrics, this paper shows that the current level of collaboration in CiteULike and Connotea is consistently low, which significantly limits the potential of harnessing the social knowledge in communities. This study also discusses implications of these findings in the context of recommendation and reputation systems.Comment: 6 pages, 6 figures, AAAI Spring Symposium on Social Information Processin

    A study of feature exraction techniques for classifying topics and sentiments from news posts

    Get PDF
    Recently, many news channels have their own Facebook pages in which news posts have been released in a daily basis. Consequently, these news posts contain temporal opinions about social events that may change over time due to external factors as well as may use as a monitor to the significant events happened around the world. As a result, many text mining researches have been conducted in the area of Temporal Sentiment Analysis, which one of its most challenging tasks is to detect and extract the key features from news posts that arrive continuously overtime. However, extracting these features is a challenging task due to post’s complex properties, also posts about a specific topic may grow or vanish overtime leading in producing imbalanced datasets. Thus, this study has developed a comparative analysis on feature extraction Techniques which has examined various feature extraction techniques (TF-IDF, TF, BTO, IG, Chi-square) with three different n-gram features (Unigram, Bigram, Trigram), and using SVM as a classifier. The aim of this study is to discover the optimal Feature Extraction Technique (FET) that could achieve optimum accuracy results for both topic and sentiment classification. Accordingly, this analysis is conducted on three news channels’ datasets. The experimental results for topic classification have shown that Chi-square with unigram have proven to be the best FET compared to other techniques. Furthermore, to overcome the problem of imbalanced data, this study has combined the best FET with OverSampling technology. The evaluation results have shown an improvement in classifier’s performance and has achieved a higher accuracy at 93.37%, 92.89%, and 91.92 for BBC, Al-Arabiya, and Al-Jazeera, respectively, compared to what have been obtained on original datasets. Similarly, same combination (Chi-square+Unigram) has been used for sentiment classification and obtained accuracies at rates of 81.87%, 70.01%, 77.36%. However, testing the recognized optimal FET on unseen randomly selected news posts has shown a relatively very low accuracies for both topic and sentiment classification due to the changes of topics and sentiments over time

    How can Big Data from Social Media be used in Emergency Management? A case study of Twitter during the Paris attacks

    Get PDF
    Postponed access: the file will be accessible after 2019-06-11Over the past years, social media have impacted emergency management and disaster response in numerous ways. The access to live, continuous updates from the public brings new opportunities when it comes to detecing, coordinating and aiding in an emergency situation. The thesis present a research of social media during an emergency situation. The goal of the study is to discover how data from social media can be used for emergency management and determine if existing analysis services can be proven useful for the same occasion. To achieve the goal, a dataset from Twitter during the Paris attacks 2015 was collected. The dataset was analyzed using three different analysis tools; IBM Watson Discovery service, Microsoft Azure Text Analytics and an own developed Keyword Frequency Script. The results indicate that data from social media can be used for emergency management, in form of detecting and providing important information. Additional testing with larger datasets is needed to fully demonstrate the usefulness, in addition to interviews with emergency responders and social media users.Masteroppgave i informasjonsvitenskapINFO39

    Data analytics to support social media marketing: challenges and opportunities

    Get PDF
    Social media technologies and services have empowered individuals and organisations by revolutionalising the way people communicate, socialise and conduct business. Two examples of social media services are Facebook and Twitter. Many organisations have been quick to realise the potential benefits of social media and have adopted social media marketing practices. Social media analytics are conducted by organisations to analyse social media data in order to determine the influence of their marketing activities as well as those of their competitors. The discussion in this paper stresses the widespread adoption of social media on the African continent and the usage of social media by marketing departments in organisations. An assessment is conducted on the usefulness of online social media analytics services, and finally, the paper identifies some challenges and opportunities for social network data analytics
    • …
    corecore