1,111 research outputs found

    Social Media Text Classification by Enhancing Well-Formed Text Trained Model

    Get PDF
    Social media are a powerful communication tool in our era of digital information. The large amount of user-generated data is a useful novel source of data, even though it is not easy to extract the treasures from this vast and noisy trove. Since classification is an important part of text mining, many techniques have been proposed to classify this kind of information. We developed an effective technique of social media text classification by semi-supervised learning utilizing an online news source consisting of well-formed text. The computer first automatically extracts news categories, well-categorized by publishers, as classes for topic classification. A bag of words taken from news articles provides the initial keywords related to their category in the form of word vectors. The principal task is to retrieve a set of new productive keywords. Term Frequency-Inverse Document Frequency weighting (TF-IDF) and Word Article Matrix (WAM) are used as main methods. A modification of WAM is recomputed until it becomes the most effective model for social media text classification. The key success factor was enhancing our model with effective keywords from social media. A promising result of 99.50% accuracy was achieved, with more than 98.5% of Precision, Recall, and F-measure after updating the model three times

    Online Misinformation: Challenges and Future Directions

    Get PDF
    Misinformation has become a common part of our digital media environments and it is compromising the ability of our societies to form informed opinions. It generates misperceptions, which have affected the decision making processes in many domains, including economy, health, environment, and elections, among others. Misinformation and its generation, propagation, impact, and management is being studied through a variety of lenses (computer science, social science, journalism, psychology, etc.) since it widely affects multiple aspects of society. In this paper we analyse the phenomenon of misinformation from a technological point of view.We study the current socio-technical advancements towards addressing the problem, identify some of the key limitations of current technologies, and propose some ideas to target such limitations. The goal of this position paper is to reflect on the current state of the art and to stimulate discussions on the future design and development of algorithms, methodologies, and applications

    Developing Deployable Spoken Language Translation Systems given Limited Resources

    Get PDF
    Approaches are presented that support the deployment of spoken language translation systems. Newly developed methods allow low cost portability to new language pairs. Proposed translation model pruning techniques achieve a high translation performance even in low memory situations. The named entity and specialty vocabulary coverage, particularly on small and mobile devices, is targeted to an individual user by translation model personalization

    A Sea of Stories: Islands as Shima in Rattawut Lapcharoensap’s Sightseeing

    Get PDF

    Korean-popular Facebook fan page analytics in Thailand

    Get PDF
    The purpose of this study was to gather, analyze, and elaborate big data on Facebook’s essential information, with a specific focus on the information obtained from Korean-popular (K-Pop) fan pages on the social networking site. For this analysis, a total of 3,531,736 comments by Korean-pop fans were gathered from various K-pop Facebook pages. In order to interpret how 11 extremely popular Facebook pages shape Thai fans’ enthusiasm for the South Korean music industry, descriptive statistics and visualization analysis were employed. Finally, data analytics and correlation analysis were used to evaluate the essential understanding of the Facebook pages. The research revealed three key findings: i) K-pop fan pages provide more opportunities for Thai fans to express their support for K-pop artists and advocate for causes, ii) K-pop fan pages provide more opportunities for Thai fans to communicate with K-pop artists, and iii) K-pop fan pages build opportunities for Thai fans to establish a more glamorous online presence despite limitations concerning financial resources, foreign language skills, and opportunities. In the future, the research outcomes may be valuable for academic studies and practice

    TriggerCit: Early Flood Alerting using Twitter and Geolocation - A Comparison with Alternative Sources

    Get PDF
    Rapid impact assessment in the immediate aftermath of a natural disaster is essential to provide adequate information to international organisations, local authorities, and first responders. Social media can support emergency response with evidence-based content posted by citizens and organisations during ongoing events. In the paper, we propose TriggerCit: an early flood alerting tool with a multilanguage approach focused on timeliness and geolocation. The paper focuses on assessing the reliability of the approach as a triggering system, comparing it with alternative sources for alerts, and evaluating the quality and amount of complementary information gathered. Geolocated visual evidence extracted from Twitter by TriggerCit was analysed in two case studies on floods in Thailand and Nepal in 2021.Comment: 12 pages Keywords Social Media, Disaster management, Early Alertin
    • …
    corecore