Search CORE

482 research outputs found

On predicting religion labels in microblogging networks

Author: Al Zamal F.
Bergsma S.
Bhargava A.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/07/2014
Field of study

National Research Foundation (NRF) Singapore under International Research Centres in Singapore Funding Initiativ

Crossref

Institutional Knowledge at Singapore Management University

U.S. Religious Landscape on Twitter

Author: Chen Lu
Okulicz-Kozaryn Adam
Weber Ingmar
Publication venue
Publication date: 30/09/2014
Field of study

Religiosity is a powerful force shaping human societies, affecting domains as diverse as economic growth or the ability to cope with illness. As more religious leaders and organizations as well as believers start using social networking sites (e.g., Twitter, Facebook), online activities become important extensions to traditional religious rituals and practices. However, there has been lack of research on religiosity in online social networks. This paper takes a step toward the understanding of several important aspects of religiosity on Twitter, based on the analysis of more than 250k U.S. users who self-declared their religions/belief, including Atheism, Buddhism, Christianity, Hinduism, Islam, and Judaism. Specifically, (i) we examine the correlation of geographic distribution of religious people between Twitter and offline surveys. (ii) We analyze users' tweets and networks to identify discriminative features of each religious group, and explore supervised methods to identify believers of different religions. (iii) We study the linkage preference of different religious groups, and observe a strong preference of Twitter users connecting to others sharing the same religion.Comment: 10 page

arXiv.org e-Print Archive

CORE

Multi-class machine classification of suicide-related communication on Twitter

Author: Amery Rosie
Burnap Pete
Colombo Gualtiero
Hodorog Andrei
Scourfield Jonathan
Publication venue: 'Elsevier BV'
Publication date: 01/08/2017
Field of study

The World Wide Web, and online social networks in particular, have increased connectivity between people such that information can spread to millions of people in a matter of minutes. This form of online collective contagion has provided many benefits to society, such as providing reassurance and emergency management in the immediate aftermath of natural disasters. However, it also poses a potential risk to vulnerable Web users who receive this information and could subsequently come to harm. One example of this would be the spread of suicidal ideation in online social networks, about which concerns have been raised. In this paper we report the results of a number of machine classifiers built with the aim of classifying text relating to suicide on Twitter. The classifier distinguishes between the more worrying content, such as suicidal ideation, and other suicide-related topics such as reporting of a suicide, memorial, campaigning and support. It also aims to identify flippant references to suicide. We built a set of baseline classifiers using lexical, structural, emotive and psychological features extracted from Twitter posts. We then improved on the baseline classifiers by building an ensemble classifier using the Rotation Forest algorithm and a Maximum Probability voting classification decision method, based on the outcome of base classifiers. This achieved an F-measure of 0.728 overall (for 7 classes, including suicidal ideation) and 0.69 for the suicidal ideation class. We summarise the results by reflecting on the most significant predictive principle components of the suicidal ideation class to provide insight into the language used on Twitter to express suicidal ideation. Finally, we perform a 12-month case study of suicide-related posts where we further evaluate the classification approach - showing a sustained classification performance and providing anonymous insights into the trends and demographic profile of Twitter users posting content of this type

Crossref

Online Research @ Cardiff

Are you Charlie or Ahmed? Cultural pluralism in Charlie Hebdo response on Twitter

Author: An Jisun
De Oger Sonia Alonso Saenz
Fortes Braulio Gomez
Kwak Haewoon
Mejova Yelena
Publication venue
Publication date: 02/03/2016
Field of study

We study the response to the Charlie Hebdo shootings of January 7, 2015 on Twitter across the globe. We ask whether the stances on the issue of freedom of speech can be modeled using established sociological theories, including Huntington's culturalist Clash of Civilizations, and those taking into consideration social context, including Density and Interdependence theories. We find support for Huntington's culturalist explanation, in that the established traditions and norms of one's "civilization" predetermine some of one's opinion. However, at an individual level, we also find social context to play a significant role, with non-Arabs living in Arab countries using #JeSuisAhmed ("I am Ahmed") five times more often when they are embedded in a mixed Arab/non-Arab (mention) network. Among Arabs living in the West, we find a great variety of responses, not altogether associated with the size of their expatriate community, suggesting other variables to be at play.Comment: International AAAI Conference on Web and Social Media (ICWSM), 201

arXiv.org e-Print Archive

Institutional Knowledge at Singapore Management University

Association for the Advancement of Artificial Intelligence: AAAI Publications

Twitter user geolocation using web country noun searches

Author: Carpita Maurizio
Cortez Paulo
Zola Paola
Publication venue: 'Elsevier BV'
Publication date: 01/01/2019
Field of study

Several Web and social media analytics require user geolocation data. Although Twitter is a powerful source for social media analytics, its user geolocation is a nontrivial task. This paper presents a purely word distribution method for Twitter user country geolocation. In particular, we focus on the frequencies of tweet nouns and their statistical matches with Google Trends world country distributions (GTN method). Several experiments were conducted, using a recently created dataset of 744,830 tweets produced by 3298 users from 54 countries and written in 48 languages. Overall, the proposed GTN approach is competitive when compared with a state-of-the-art world distribution geolocation method. To reduce the number of Google Trends queries, we also tested a machine learning variant (GTN2) that is capable of matching the GTN responses with an 80% accuracy while being much faster than GTN.Research carried out with the support of resources of Big and Open Data Innovation Laboratory (BODaI-Lab), University of Brescia, granted by Fondazione Cariplo and Regione Lombardia. The work of P. Cortez was supported by FCT - Fundacao para a Ciencia e Tecnologia within the Project Scope UID/CEC/00319/2019. We would also like to thank the anonymous reviewers for their helpful suggestions

Universidade do Minho: RepositoriUM

Archivio istituzionale della ricerca - Università di Brescia

User Behavior Mining in Microblogging

Author: HOANG Tuan Anh
Publication venue: Singapore Management University
Publication date: 01/06/2016
Field of study

Institutional Knowledge at Singapore Management University

An analysis of the user occupational class through Twitter content

Author: Aletras N
Lampos V
Preoţiuc-Pietro D
Publication venue: The Association for Computational Linguistics
Publication date: 01/07/2015
Field of study

Social media content can be used as a complementary source to the traditional methods for extracting and studying collective social attributes. This study focuses on the prediction of the occupational class for a public user profile. Our analysis is conducted on a new annotated corpus of Twitter users, their respective job titles, posted textual content and platform-related attributes. We frame our task as classification using latent feature representations such as word clusters and embeddings. The employed linear and, especially, non-linear methods can predict a user’s occupational class with strong accuracy for the coarsest level of a standard occupation taxonomy which includes nine classes. Combined with a qualitative assessment, the derived results confirm the feasibility of our approach in inferring a new user attribute that can be embedded in a multitude of downstream applications

UCL Discovery