14 research outputs found
#greysanatomy vs. #yankees: Demographics and Hashtag Use on Twitter
Demographics, in particular, gender, age, and race, are a key predictor of
human behavior. Despite the significant effect that demographics plays, most
scientific studies using online social media do not consider this factor,
mainly due to the lack of such information. In this work, we use
state-of-the-art face analysis software to infer gender, age, and race from
profile images of 350K Twitter users from New York. For the period from
November 1, 2014 to October 31, 2015, we study which hashtags are used by
different demographic groups. Though we find considerable overlap for the most
popular hashtags, there are also many group-specific hashtags.Comment: This is a preprint of an article appearing at ICWSM 201
Gender prediction from tweets: Improving neural representations with hand-crafted features
Author profiling is the characterization of an author through some key attributes
such as gender, age, and language. In this paper, a RNN model with Attention
(RNNwA) is proposed to predict the gender of a twitter user using their tweets.
Both word level and tweet level attentions are utilized to learn ’where to look’.
This model1 is improved by concatenating LSA-reduced n-gram features with the
learned neural representation of a user. Both models are tested on three languages:
English, Spanish, Arabic. The improved version of the proposed model (RNNwA
+ n-gram) achieves state-of-the-art performance on English and has competitive
results on Spanish and Arabic
Inferring Social Media Users’ Demographics from Profile Pictures: A Face++ Analysis on Twitter Users
In this research, we evaluate the applicability of using facial recognition of social media account profile pictures to infer the demographic attributes of gender, race, and age of the account owners leveraging a commercial and well-known image service, specifically Face++. Our goal is to determine the feasibility of this approach for actual system implementation. Using a dataset of approximately 10,000 Twitter profile pictures, we use Face++ to classify this set of images for gender, race, and age. We determine that about 30% of these profile pictures contain identifiable images of people using the current state-of-the-art automated means. We then employ human evaluations to manually tag both the set of images that were determined to contain faces and the set that was determined not to contain faces, comparing the results to Face++. Of the thirty percent that Face++ identified as containing a face, about 80% are more likely than not the account holder based on our manual classification, with a variety of issues in the remaining 20%. Of the images that Face++ was unable to detect a face, we isolate a variety of likely issues preventing this detection, when a face actually appeared in the image. Overall, we find the applicability of automatic facial recognition to infer demographics for system development to be problematic, despite the reported high accuracy achieved for image test collection
What are Your Pronouns? Examining Gender Pronoun Usage on Twitter
Stating your gender pronouns, along with your name, is becoming the new norm
of self-introductions at school, at the workplace, and online. The increasing
prevalence and awareness of nonconforming gender identities put discussions of
developing gender-inclusive language at the forefront. This work presents the
first empirical research on gender pronoun usage on large-scale social media.
Leveraging a Twitter dataset of over 2 billion tweets collected continuously
over two years, we find that the public declaration of gender pronouns is on
the rise, with most people declaring as using she series pronouns, followed by
he series pronouns, and a smaller but considerable amount of non-binary
pronouns. From analyzing Twitter posts and sharing activities, we can discern
users who use gender pronouns from those who do not and also distinguish users
of various gender identities. We further illustrate the relationship between
explicit forms of social network exposure to gender pronouns and their eventual
gender pronoun adoption. This work carries crucial implications for
gender-identity studies and initiates new research directions in gender-related
fairness and inclusion, as well as support against online harassment and
discrimination on social media.Comment: 23 pages, 11 figures, 2 table