2,021 research outputs found
Do Linguistic Style and Readability of Scientific Abstracts affect their Virality?
Reactions to textual content posted in an online social network show
different dynamics depending on the linguistic style and readability of the
submitted content. Do similar dynamics exist for responses to scientific
articles? Our intuition, supported by previous research, suggests that the
success of a scientific article depends on its content, rather than on its
linguistic style. In this article, we examine a corpus of scientific abstracts
and three forms of associated reactions: article downloads, citations, and
bookmarks. Through a class-based psycholinguistic analysis and readability
indices tests, we show that certain stylistic and readability features of
abstracts clearly concur in determining the success and viral capability of a
scientific article.Comment: Proceedings of the Sixth International AAAI Conference on Weblogs and
Social Media (ICWSM 2012), 4-8 June 2012, Dublin, Irelan
Profiling a set of personality traits of text author: what our words reveal about us
Authorship profiling, i.e. revealing information about an unknown author by analyzing their text, is a task of growing importance. One of the most urgent problems of authorship profiling (AP) is selecting text parameters which may correlate to an author’s personality. Most researchers’ selection of these is not underpinned by any theory. This article proposes an approach to AP which applies neuroscience data. The aim of the study is to assess the probability of self-destructive behaviour of an individual via formal parameters of their texts. Here we have used the “Personality Corpus”, which consists of Russian-language texts. A set of correlations between scores on the Freiburg Personality Inventory scales that are known to be indicative of self-destructive behaviour (“Spontaneous Aggressiveness”, “Depressiveness”, “Emotional Lability”, and “Composedness”) and text variables (average sentence length, lexical diversity etc.) has been calculated. Further, a mathematical model which predicts the probability of self-destructive behaviour has been obtained
Language Use Matters: Analysis of the Linguistic Structure of Question Texts Can Characterize Answerability in Quora
Quora is one of the most popular community Q&A sites of recent times.
However, many question posts on this Q&A site often do not get answered. In
this paper, we quantify various linguistic activities that discriminates an
answered question from an unanswered one. Our central finding is that the way
users use language while writing the question text can be a very effective
means to characterize answerability. This characterization helps us to predict
early if a question remaining unanswered for a specific time period t will
eventually be answered or not and achieve an accuracy of 76.26% (t = 1 month)
and 68.33% (t = 3 months). Notably, features representing the language use
patterns of the users are most discriminative and alone account for an accuracy
of 74.18%. We also compare our method with some of the similar works (Dror et
al., Yang et al.) achieving a maximum improvement of ~39% in terms of accuracy.Comment: 1 figure, 3 tables, ICWSM 2017 as poste
This Just In: Fake News Packs a Lot in Title, Uses Simpler, Repetitive Content in Text Body, More Similar to Satire than Real News
The problem of fake news has gained a lot of attention as it is claimed to
have had a significant impact on 2016 US Presidential Elections. Fake news is
not a new problem and its spread in social networks is well-studied. Often an
underlying assumption in fake news discussion is that it is written to look
like real news, fooling the reader who does not check for reliability of the
sources or the arguments in its content. Through a unique study of three data
sets and features that capture the style and the language of articles, we show
that this assumption is not true. Fake news in most cases is more similar to
satire than to real news, leading us to conclude that persuasion in fake news
is achieved through heuristics rather than the strength of arguments. We show
overall title structure and the use of proper nouns in titles are very
significant in differentiating fake from real. This leads us to conclude that
fake news is targeted for audiences who are not likely to read beyond titles
and is aimed at creating mental associations between entities and claims.Comment: Published at The 2nd International Workshop on News and Public
Opinion at ICWS
The Impact of Crowds on News Engagement: A Reddit Case Study
Today, users are reading the news through social platforms. These platforms
are built to facilitate crowd engagement, but not necessarily disseminate
useful news to inform the masses. Hence, the news that is highly engaged with
may not be the news that best informs. While predicting news popularity has
been well studied, it has not been studied in the context of crowd
manipulations. In this paper, we provide some preliminary results to a longer
term project on crowd and platform manipulations of news and news popularity.
In particular, we choose to study known features for predicting news popularity
and how those features may change on reddit.com, a social platform used
commonly for news aggregation. Along with this, we explore ways in which users
can alter the perception of news through changing the title of an article. We
find that news on reddit is predictable using previously studied sentiment and
content features and that posts with titles changed by reddit users tend to be
more popular than posts with the original article title.Comment: Published at The 2nd International Workshop on News and Public
Opinion at ICWSM 201
- …