482 research outputs found
On predicting religion labels in microblogging networks
National Research Foundation (NRF) Singapore under International Research Centres in Singapore Funding Initiativ
U.S. Religious Landscape on Twitter
Religiosity is a powerful force shaping human societies, affecting domains as
diverse as economic growth or the ability to cope with illness. As more
religious leaders and organizations as well as believers start using social
networking sites (e.g., Twitter, Facebook), online activities become important
extensions to traditional religious rituals and practices. However, there has
been lack of research on religiosity in online social networks. This paper
takes a step toward the understanding of several important aspects of
religiosity on Twitter, based on the analysis of more than 250k U.S. users who
self-declared their religions/belief, including Atheism, Buddhism,
Christianity, Hinduism, Islam, and Judaism. Specifically, (i) we examine the
correlation of geographic distribution of religious people between Twitter and
offline surveys. (ii) We analyze users' tweets and networks to identify
discriminative features of each religious group, and explore supervised methods
to identify believers of different religions. (iii) We study the linkage
preference of different religious groups, and observe a strong preference of
Twitter users connecting to others sharing the same religion.Comment: 10 page
Multi-class machine classification of suicide-related communication on Twitter
The World Wide Web, and online social networks in particular, have increased connectivity between people such that information can spread to millions of people in a matter of minutes. This form of online collective contagion has provided many benefits to society, such as providing reassurance and emergency management in the immediate aftermath of natural disasters. However, it also poses a potential risk to vulnerable Web users who receive this information and could subsequently come to harm. One example of this would be the spread of suicidal ideation in online social networks, about which concerns have been raised. In this paper we report the results of a number of machine classifiers built with the aim of classifying text relating to suicide on Twitter. The classifier distinguishes between the more worrying content, such as suicidal ideation, and other suicide-related topics such as reporting of a suicide, memorial, campaigning and support. It also aims to identify flippant references to suicide. We built a set of baseline classifiers using lexical, structural, emotive and psychological features extracted from Twitter posts. We then improved on the baseline classifiers by building an ensemble classifier using the Rotation Forest algorithm and a Maximum Probability voting classification decision method, based on the outcome of base classifiers. This achieved an F-measure of 0.728 overall (for 7 classes, including suicidal ideation) and 0.69 for the suicidal ideation class. We summarise the results by reflecting on the most significant predictive principle components of the suicidal ideation class to provide insight into the language used on Twitter to express suicidal ideation. Finally, we perform a 12-month case study of suicide-related posts where we further evaluate the classification approach - showing a sustained classification performance and providing anonymous insights into the trends and demographic profile of Twitter users posting content of this type
Are you Charlie or Ahmed? Cultural pluralism in Charlie Hebdo response on Twitter
We study the response to the Charlie Hebdo shootings of January 7, 2015 on
Twitter across the globe. We ask whether the stances on the issue of freedom of
speech can be modeled using established sociological theories, including
Huntington's culturalist Clash of Civilizations, and those taking into
consideration social context, including Density and Interdependence theories.
We find support for Huntington's culturalist explanation, in that the
established traditions and norms of one's "civilization" predetermine some of
one's opinion. However, at an individual level, we also find social context to
play a significant role, with non-Arabs living in Arab countries using
#JeSuisAhmed ("I am Ahmed") five times more often when they are embedded in a
mixed Arab/non-Arab (mention) network. Among Arabs living in the West, we find
a great variety of responses, not altogether associated with the size of their
expatriate community, suggesting other variables to be at play.Comment: International AAAI Conference on Web and Social Media (ICWSM), 201
Twitter user geolocation using web country noun searches
Several Web and social media analytics require user geolocation data. Although Twitter is a powerful source for social media analytics, its user geolocation is a nontrivial task. This paper presents a purely word distribution method for Twitter user country geolocation. In particular, we focus on the frequencies of tweet nouns and their statistical matches with Google Trends world country distributions (GTN method). Several experiments were conducted, using a recently created dataset of 744,830 tweets produced by 3298 users from 54 countries and written in 48 languages. Overall, the proposed GTN approach is competitive when compared with a state-of-the-art world distribution geolocation method. To reduce the number of Google Trends queries, we also tested a machine learning variant (GTN2) that is capable of matching the GTN responses with an 80% accuracy while being much faster than GTN.Research carried out with the support of resources of Big and Open Data Innovation Laboratory (BODaI-Lab), University of Brescia, granted by Fondazione Cariplo and Regione Lombardia. The work of P. Cortez was supported by FCT - Fundacao para a Ciencia e Tecnologia within the Project Scope UID/CEC/00319/2019. We would also like to thank the anonymous reviewers for their helpful suggestions
An analysis of the user occupational class through Twitter content
Social media content can be used as a complementary source to the traditional
methods for extracting and studying collective social attributes. This study focuses on the prediction of the occupational class for a public user profile. Our analysis is conducted on a new annotated corpus of Twitter users, their respective job titles, posted textual content and platform-related attributes. We frame our task as classification using latent feature representations such as word clusters and embeddings. The employed linear and, especially, non-linear methods can predict a userās occupational class with strong accuracy for the coarsest level of a standard occupation taxonomy which includes nine classes. Combined with a qualitative assessment, the derived results confirm the feasibility of our approach in inferring a new user attribute that can be embedded in a multitude of downstream applications
- ā¦