421 research outputs found
Scalable Privacy-Compliant Virality Prediction on Twitter
The digital town hall of Twitter becomes a preferred medium of communication
for individuals and organizations across the globe. Some of them reach
audiences of millions, while others struggle to get noticed. Given the impact
of social media, the question remains more relevant than ever: how to model the
dynamics of attention in Twitter. Researchers around the world turn to machine
learning to predict the most influential tweets and authors, navigating the
volume, velocity, and variety of social big data, with many compromises. In
this paper, we revisit content popularity prediction on Twitter. We argue that
strict alignment of data acquisition, storage and analysis algorithms is
necessary to avoid the common trade-offs between scalability, accuracy and
privacy compliance. We propose a new framework for the rapid acquisition of
large-scale datasets, high accuracy supervisory signal and multilanguage
sentiment prediction while respecting every privacy request applicable. We then
apply a novel gradient boosting framework to achieve state-of-the-art results
in virality ranking, already before including tweet's visual or propagation
features. Our Gradient Boosted Regression Tree is the first to offer
explainable, strong ranking performance on benchmark datasets. Since the
analysis focused on features available early, the model is immediately
applicable to incoming tweets in 18 languages.Comment: AffCon@AAAI-19 Best Paper Award; Presented at AAAI-19 W1: Affective
Content Analysi
Developing a Framework for Stigmergic Human Collaboration with Technology Tools: Cases in Emergency Response
Information and Communications Technologies (ICTs), particularly social media and geographic information systems (GIS), have become a transformational force in emergency response. Social media enables ad hoc collaboration, providing timely, useful information dissemination and sharing, and helping to overcome limitations of time and place. Geographic information systems increase the level of situation awareness, serving geospatial data using interactive maps, animations, and computer generated imagery derived from sophisticated global remote sensing systems. Digital workspaces bring these technologies together and contribute to meeting ad hoc and formal emergency response challenges through their affordances of situation awareness and mass collaboration. Distributed ICTs that enable ad hoc emergency response via digital workspaces have arguably made traditional top-down system deployments less relevant in certain situations, including emergency response (Merrill, 2009; Heylighen, 2007a, b). Heylighen (2014, 2007a, b) theorizes that human cognitive stigmergy explains some self-organizing characteristics of ad hoc systems. Elliott (2007) identifies cognitive stigmergy as a factor in mass collaborations supported by digital workspaces. Stigmergy, a term from biology, refers to the phenomenon of self-organizing systems with agents that coordinate via perceived changes in the environment rather than direct communication. In the present research, ad hoc emergency response is examined through the lens of human cognitive stigmergy. The basic assertion is that ICTs and stigmergy together make possible highly effective ad hoc collaborations in circumstances where more typical collaborative methods break down. The research is organized into three essays: an in-depth analysis of the development and deployment of the Ushahidi emergency response software platform, a comparison of the emergency response ICTs used for emergency response during Hurricanes Katrina and Sandy, and a process model developed from the case studies and relevant academic literature is described
Alexandria: Extensible Framework for Rapid Exploration of Social Media
The Alexandria system under development at IBM Research provides an
extensible framework and platform for supporting a variety of big-data
analytics and visualizations. The system is currently focused on enabling rapid
exploration of text-based social media data. The system provides tools to help
with constructing "domain models" (i.e., families of keywords and extractors to
enable focus on tweets and other social media documents relevant to a project),
to rapidly extract and segment the relevant social media and its authors, to
apply further analytics (such as finding trends and anomalous terms), and
visualizing the results. The system architecture is centered around a variety
of REST-based service APIs to enable flexible orchestration of the system
capabilities; these are especially useful to support knowledge-worker driven
iterative exploration of social phenomena. The architecture also enables rapid
integration of Alexandria capabilities with other social media analytics
system, as has been demonstrated through an integration with IBM Research's
SystemG. This paper describes a prototypical usage scenario for Alexandria,
along with the architecture and key underlying analytics.Comment: 8 page
A rule dynamics approach to event detection in Twitter with its application to sports and politics
The increasing popularity of Twitter as social network tool for opinion expression as well as informa- tion retrieval has resulted in the need to derive computational means to detect and track relevant top- ics/events in the network. The application of topic detection and tracking methods to tweets enable users to extract newsworthy content from the vast and somehow chaotic Twitter stream. In this paper, we ap- ply our technique named Transaction-based Rule Change Mining to extract newsworthy hashtag keywords present in tweets from two different domains namely; sports (The English FA Cup 2012) and politics (US Presidential Elections 2012 and Super Tuesday 2012). Noting the peculiar nature of event dynamics in these two domains, we apply different time-windows and update rates to each of the datasets in order to study their impact on performance. The performance effectiveness results reveal that our approach is able to accurately detect and track newsworthy content. In addition, the results show that the adaptation of the time-window exhibits better performance especially on the sports dataset, which can be attributed to the usually shorter duration of football events
Social Media Analytics in Disaster Response: A Comprehensive Review
Social media has emerged as a valuable resource for disaster management,
revolutionizing the way emergency response and recovery efforts are conducted
during natural disasters. This review paper aims to provide a comprehensive
analysis of social media analytics for disaster management. The abstract begins
by highlighting the increasing prevalence of natural disasters and the need for
effective strategies to mitigate their impact. It then emphasizes the growing
influence of social media in disaster situations, discussing its role in
disaster detection, situational awareness, and emergency communication. The
abstract explores the challenges and opportunities associated with leveraging
social media data for disaster management purposes. It examines methodologies
and techniques used in social media analytics, including data collection,
preprocessing, and analysis, with a focus on data mining and machine learning
approaches. The abstract also presents a thorough examination of case studies
and best practices that demonstrate the successful application of social media
analytics in disaster response and recovery. Ethical considerations and privacy
concerns related to the use of social media data in disaster scenarios are
addressed. The abstract concludes by identifying future research directions and
potential advancements in social media analytics for disaster management. The
review paper aims to provide practitioners and researchers with a comprehensive
understanding of the current state of social media analytics in disaster
management, while highlighting the need for continued research and innovation
in this field.Comment: 11 page
Facilitating Efficient Information Seeking in Social Media
abstract: Online social media is popular due to its real-time nature, extensive connectivity and a large user base. This motivates users to employ social media for seeking information by reaching out to their large number of social connections. Information seeking can manifest in the form of requests for personal and time-critical information or gathering perspectives on important issues. Social media platforms are not designed for resource seeking and experience large volumes of messages, leading to requests not being fulfilled satisfactorily. Designing frameworks to facilitate efficient information seeking in social media will help users to obtain appropriate assistance for their needs
and help platforms to increase user satisfaction.
Several challenges exist in the way of facilitating information seeking in social media. First, the characteristics affecting the user’s response time for a question are not known, making it hard to identify prompt responders. Second, the social context in which the user has asked the question has to be determined to find personalized responders. Third, users employ rhetorical requests, which are statements having the
syntax of questions, and systems assisting information seeking might be hindered from focusing on genuine questions. Fouth, social media advocates of political campaigns employ nuanced strategies to prevent users from obtaining balanced perspectives on
issues of public importance.
Sociological and linguistic studies on user behavior while making or responding to information seeking requests provides concepts drawing from which we can address these challenges. We propose methods to estimate the response time of the user for a given question to identify prompt responders. We compute the question specific social context an asker shares with his social connections to identify personalized responders. We draw from theories of political mobilization to model the behaviors arising from the strategies of people trying to skew perspectives. We identify rhetorical questions by modeling user motivations to post them.Dissertation/ThesisDoctoral Dissertation Electrical Engineering 201
- …