312,648 research outputs found
Understanding the topics and opinions from social media content
Social media has become one indispensable part of people’s daily life, as it records and reflects people’s opinions and events of interest, as well as influences people’s perceptions. As the most commonly employed and easily accessed data format on social media, a great deal of the social media textual content is not only factual and objective, but also rich in opinionated information. Thus, besides the topics Internet users are talking about in social media textual content, it is also of great importance to understand the opinions they are expressing. In this thesis, I present my broadly applicable text mining approaches, in order to understand the topics and opinions of user-generated texts on social media, to provide insights about the thoughts of Internet users on entities, events, etc. Specifically, I develop approaches to understand the semantic differences between language-specific editions of Wikipedia, when discussing certain entities from the related topical aspects perspective and the aggregated sentiment bias perspective. Moreover, I employ effective features to detect the reputation-influential sentences for person and company entities in Wikipedia articles, which lead to the detected sentiment bias. Furthermore, I propose neural network models with different levels of attention mechanism, to detect the stances of tweets towards any given target. I also introduce an online timeline generation approach, to detect and summarise the relevant sub-topics in the tweet stream, in order to provide Internet users with some insights about the evolution of major events they are interested in
Recommended from our members
Using social media to inform policy making: to whom are we listening?
Domination of social media is giving today’s web users a venue for expressing their views and sharing their experiences with others. With well over a billion active users, social networking sites (SNS) have become dynamic sources of information on peoples’ interests, needs and opinions and are considered an extremely rich source of content to reach out to many millions of people. This is creating a revolutionary opportunity for governments to learn about the citizens and to engage with them more effectively. The potential is there for eParticipation applications to go from simply informing the public to unprecedented levels of interaction and engagement between Policy Makers (PMs) and the community, involving the public in deliberation processes leading to legislation.
Despite its great potential, several concerns arise from the exploitation of social media, especially when used to inform policy making. Among these issues we can highlight the lack of awareness of the characteristics of those citizens discussing policy topics in social media, and lack of awareness of the characteristics of their discussions. Although some studies have emerged in the last few years that aim to capture the demographics of social media users (e.g., gender, age, geographical locations) they tend not to focus on those specific users participating in policy discussions. Understanding who are the users discussing policy in social media and how policy topics are debated could help assessing how their views and opinions should be weighted and considered to inform policy making.
Aiming to provide a step forward in this direction, this paper investigates the characteristics of over 8K users involved in policy discussions in Twitter. These discussions were collected by monitoring, for one week, 42 different political topics selected by sixteen PMs from different political institutions in Germany. Our results indicate that: (i) a high volume of conversations around policy topics does not come from citizens, but from news agencies and other organisations, (ii) the average user discussing policy topics in Twitter is more active, popular and engaged than the average Twitter user and, (iii) users engaged in social media conversations around policy topics tend to be geographically concentrated in constituencies with high population density. Regarding the analysed conversations, a small subset of topics is extensively discussed while the majority go relatively unnoticed
Exploring Destination Loyalty: Application of Social Media Analytics in a Nature-based Tourism Setting
User-generated content across social media platforms is playing an increasingly important role in the tourism context. Understanding tourists’ experiences and opinions about tourism destinations has led to numerous opportunities to provide tourism providers and decision-makers with greater insight. Identifying sentiments, detecting topics of interest, and exploring loyalty behaviors from user-generated content can provide valuable direction for managerial decisions. Few if any studies on social media analytics have demonstrated the support for strategic decision-making. This paper presents a novel and inclusive approach that uses different analytical techniques such as sentiment analysis and topic modeling to extract sentiments and topics of interest from tourists’ conversational data on TripAdvisor from 2002 to 2019, and also explore destination loyalty statements using a keyword clustering approach. Previous destination loyalty literature was used to develop a keyword list that was applied to search for expression of loyalty in online reviews. The robustness of loyalty clusters and optimal number of clusters was also assessed prior to final analysis. Four leading loyalty-focused categories of destination offerings were observed: glaciers, waterfalls, lakes and islands, and hiking and trails. Prioritization of visitor experience enhancements relating to these loyalty inducing destination components are discussed
A network model of mass media opinion dynamics
The coexistence of diverse opinions is necessary for a pluralistic society in which people can confront ideas and make informed choices. The media functions as a primary source of information, and diversity across news sources in the media forms the basis for wider discourse in the public. However, due to numerous economic and social pressures, news sources frequently co-orient their content through what is known as intermedia agenda-setting. Past research on the subject has examined relationships between individual news sources. However, to understand emergent behaviour such as opinion diversity, we cannot simply analyse individual relationships in isolation, but instead need to view the media as a complex system of many interacting entities. The aim of this thesis is to develop and empirically test a method for understanding the network effects that intermedia agenda-setting has on the diversity of expressed opinions within the media. Utilising latent signals extracted from news articles, we put forward a methodology for inferring networks that capture how agendas propagate between news sources via the opinions they express on various topics. By applying this approach to a large dataset of news articles published by globally and locally prominent news organisations, we identify how the structure of intermedia networks is indicative of the level of opinion diversity across various topics. We then develop a theoretical model of opinion dynamics in noisy domains that is motivated by the empirical observations of intermedia agenda formation. From this, we derive a general analytical expression for opinion diversity that holds for any network and depends on the network's topology through its spectral properties alone. Finally, we validate the analytical expression in a linear model against empirical data. This thesis aids our understanding of how to model emergent behaviour of the media and promote diversity
Sensing Human Sentiment via Social Media Images: Methodologies and Applications
abstract: Social media refers computer-based technology that allows the sharing of information and building the virtual networks and communities. With the development of internet based services and applications, user can engage with social media via computer and smart mobile devices. In recent years, social media has taken the form of different activities such as social network, business network, text sharing, photo sharing, blogging, etc. With the increasing popularity of social media, it has accumulated a large amount of data which enables understanding the human behavior possible. Compared with traditional survey based methods, the analysis of social media provides us a golden opportunity to understand individuals at scale and in turn allows us to design better services that can tailor to individuals’ needs. From this perspective, we can view social media as sensors, which provides online signals from a virtual world that has no geographical boundaries for the real world individual's activity.
One of the key features for social media is social, where social media users actively interact to each via generating content and expressing the opinions, such as post and comment in Facebook. As a result, sentiment analysis, which refers a computational model to identify, extract or characterize subjective information expressed in a given piece of text, has successfully employs user signals and brings many real world applications in different domains such as e-commerce, politics, marketing, etc. The goal of sentiment analysis is to classify a user’s attitude towards various topics into positive, negative or neutral categories based on textual data in social media. However, recently, there is an increasing number of people start to use photos to express their daily life on social media platforms like Flickr and Instagram. Therefore, analyzing the sentiment from visual data is poise to have great improvement for user understanding.
In this dissertation, I study the problem of understanding human sentiments from large scale collection of social images based on both image features and contextual social network features. We show that neither
visual features nor the textual features are by themselves sufficient for accurate sentiment prediction. Therefore, we provide a way of using both of them, and formulate sentiment prediction problem in two scenarios: supervised and unsupervised. We first show that the proposed framework has flexibility to incorporate multiple modalities of information and has the capability to learn from heterogeneous features jointly with sufficient training data. Secondly, we observe that negative sentiment may related to human mental health issues. Based on this observation, we aim to understand the negative social media posts, especially the post related to depression e.g., self-harm content. Our analysis, the first of its kind, reveals a number of important findings. Thirdly, we extend the proposed sentiment prediction task to a general multi-label visual recognition task to demonstrate the methodology flexibility behind our sentiment analysis model.Dissertation/ThesisDoctoral Dissertation Computer Science 201
AI approaches to understand human deceptions, perceptions, and perspectives in social media
Social media platforms have created virtual space for sharing user generated information, connecting, and interacting among users. However, there are research and societal challenges: 1) The users are generating and sharing the disinformation 2) It is difficult to understand citizens\u27 perceptions or opinions expressed on wide variety of topics; and 3) There are overloaded information and echo chamber problems without overall understanding of the different perspectives taken by different people or groups.
This dissertation addresses these three research challenges with advanced AI and Machine Learning approaches. To address the fake news, as deceptions on the facts, this dissertation presents Machine Learning approaches for fake news detection models, and a hybrid method for topic identification, whether they are fake or real.
To understand the user\u27s perceptions or attitude toward some topics, this study analyzes the sentiments expressed in social media text. The sentiment analysis of posts can be used as an indicator to measure how topics are perceived by the users and how their perceptions as a whole can affect decision makers in government and industry, especially during the COVID-19 pandemic. It is difficult to measure the public perception of government policies issued during the pandemic. The citizen responses to the government policies are diverse, ranging from security or goodwill to confusion, fear, or anger. This dissertation provides a near real-time approach to track and monitor public reactions toward government policies by continuously collecting and analyzing Twitter posts about the COVID-19 pandemic.
To address the social media\u27s overwhelming number of posts, content echo-chamber, and information isolation issue, this dissertation provides a multiple view-based summarization framework where the same contents can be summarized according to different perspectives. This framework includes components of choosing the perspectives, and advanced text summarization approaches.
The proposed approaches in this dissertation are demonstrated with a prototype system to continuously collect Twitter data about COVID-19 government health policies and provide analysis of citizen concerns toward the policies, and the data is analyzed for fake news detection and for generating multiple-view summaries
Understanding misinformation on Twitter in the context of controversial issues
Social media is slowly supplementing, or even replacing, traditional media outlets such as television, newspapers, and radio. However, social media presents some drawbacks when it comes to circulating information. These drawbacks include spreading false information, rumors, and fake news. At least three main factors create these drawbacks: The filter bubble effect, misinformation, and information overload. These factors make gathering accurate and credible information online very challenging, which in turn may affect public trust in online information. These issues are even more challenging when the issue under discussion is a controversial topic. In this thesis, four main controversial topics are studied, each of which comes from a different domain. This variation of domains can give a broad view of how misinformation is manifested in social media, and how it is manifested differently in different domains.
This thesis aims to understand misinformation in the context of controversial issue discussions. This can be done through understanding how misinformation is manifested in social media as well as by understanding people’s opinions towards these controversial issues. In this thesis, three different aspects of a tweet are studied. These aspects are 1) the user sharing the information, 2) the information source shared, and 3) whether specific linguistic cues can help in assessing the credibility of information on social media. Finally, the web application tool TweetChecker is used to allow online users to have a more in-depth understanding of the discussions about five different controversial health issues. The results and recommendations of this study can be used to build solutions for the problem of trustworthiness of user-generated content on different social media platforms, especially for controversial issues
COVID-19 and the 5G conspiracy theory: social network analysis of twitter data
Background: Since the beginning of December 2019, the coronavirus disease (COVID-19) has spread rapidly around the world, which has led to increased discussions across online platforms. These conversations have also included various conspiracies shared by social media users. Amongst them, a popular theory has linked 5G to the spread of COVID-19, leading to misinformation and the burning of 5G towers in the United Kingdom. The understanding of the drivers of fake news and quick policies oriented to isolate and rebate misinformation are keys to combating it. Objective: The aim of this study is to develop an understanding of the drivers of the 5G COVID-19 conspiracy theory and strategies to deal with such misinformation. Methods: This paper performs a social network analysis and content analysis of Twitter data from a 7-day period (Friday, March 27, 2020, to Saturday, April 4, 2020) in which the #5GCoronavirus hashtag was trending on Twitter in the United Kingdom. Influential users were analyzed through social network graph clusters. The size of the nodes were ranked by their betweenness centrality score, and the graph's vertices were grouped by cluster using the Clauset-Newman-Moore algorithm. The topics and web sources used were also examined. Results: Social network analysis identified that the two largest network structures consisted of an isolates group and a broadcast group. The analysis also revealed that there was a lack of an authority figure who was actively combating such misinformation. Content analysis revealed that, of 233 sample tweets, 34.8% (n=81) contained views that 5G and COVID-19 were linked, 32.2% (n=75) denounced the conspiracy theory, and 33.0% (n=77) were general tweets not expressing any personal views or opinions. Thus, 65.2% (n=152) of tweets derived from nonconspiracy theory supporters, which suggests that, although the topic attracted high volume, only a handful of users genuinely believed the conspiracy. This paper also shows that fake news websites were the most popular web source shared by users; although, YouTube videos were also shared. The study also identified an account whose sole aim was to spread the conspiracy theory on Twitter. Conclusions: The combination of quick and targeted interventions oriented to delegitimize the sources of fake information is key to reducing their impact. Those users voicing their views against the conspiracy theory, link baiting, or sharing humorous tweets inadvertently raised the profile of the topic, suggesting that policymakers should insist in the efforts of isolating opinions that are based on fake news. Many social media platforms provide users with the ability to report inappropriate content, which should be used. This study is the first to analyze the 5G conspiracy theory in the context of COVID-19 on Twitter offering practical guidance to health authorities in how, in the context of a pandemic, rumors may be combated in the future
Tweeting the Mind and Instagramming the Heart: Exploring Differentiated Content Sharing on Social Media
Understanding the usage of multiple OSNs (Online Social Networks) has been of
significant research interest as it helps in identifying the unique and
distinguishing trait in each social media platform that contributes to its
continued existence. The comparison between the OSNs is insightful when it is
done based on the representative majority of the users holding active accounts
on all the platforms. In this research, we collected a set of user profiles
holding accounts on both Twitter and Instagram, these platforms being of
prominence among a majority of users. An extensive textual and visual analysis
on the media content posted by these users revealed that both these platforms
are indeed perceived differently at a fundamental level with Instagram engaging
more of the users' heart and Twitter capturing more of their mind. These
differences got reflected in almost every microscopic analysis done upon the
linguistic, topical and visual aspects.Comment: 4 pages, 8 figure
- …