Search CORE

190,901 research outputs found

Words are Malleable: Computing Semantic Shifts in Political and Media Discourse

Author: Gallie W. B.
Jin P.
Kusner M. J.
Le Q. V.
Mikolov T.
Mikolov T.
Mikolov T.
Reese S. D.
Publication venue
Publication date: 01/01/2017
Field of study

Recently, researchers started to pay attention to the detection of temporal shifts in the meaning of words. However, most (if not all) of these approaches restricted their efforts to uncovering change over time, thus neglecting other valuable dimensions such as social or political variability. We propose an approach for detecting semantic shifts between different viewpoints--broadly defined as a set of texts that share a specific metadata feature, which can be a time-period, but also a social entity such as a political party. For each viewpoint, we learn a semantic space in which each word is represented as a low dimensional neural embedded vector. The challenge is to compare the meaning of a word in one space to its meaning in another space and measure the size of the semantic shifts. We compare the effectiveness of a measure based on optimal transformations between the two spaces with a measure based on the similarity of the neighbors of the word in the respective spaces. Our experiments demonstrate that the combination of these two performs best. We show that the semantic shifts not only occur over time, but also along different viewpoints in a short period of time. For evaluation, we demonstrate how this approach captures meaningful semantic shifts and can help improve other tasks such as the contrastive viewpoint summarization and ideology detection (measured as classification accuracy) in political texts. We also show that the two laws of semantic change which were empirically shown to hold for temporal shifts also hold for shifts across viewpoints. These laws state that frequent words are less likely to shift meaning while words with many senses are more likely to do so.Comment: In Proceedings of the 26th ACM International on Conference on Information and Knowledge Management (CIKM2017

arXiv.org e-Print Archive

Crossref

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Goal-setting And Achievement In Activity Tracking Apps: A Case Study Of MyFitnessPal

Author: Althoff Tim
Gordon Mitchell L.
Leskovec Jure
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 04/04/2019
Field of study

Activity tracking apps often make use of goals as one of their core motivational tools. There are two critical components to this tool: setting a goal, and subsequently achieving that goal. Despite its crucial role in how a number of prominent self-tracking apps function, there has been relatively little investigation of the goal-setting and achievement aspects of self-tracking apps. Here we explore this issue, investigating a particular goal setting and achievement process that is extensive, recorded, and crucial for both the app and its users' success: weight loss goals in MyFitnessPal. We present a large-scale study of 1.4 million users and weight loss goals, allowing for an unprecedented detailed view of how people set and achieve their goals. We find that, even for difficult long-term goals, behavior within the first 7 days predicts those who ultimately achieve their goals, that is, those who lose at least as much weight as they set out to, and those who do not. For instance, high amounts of early weight loss, which some researchers have classified as unsustainable, leads to higher goal achievement rates. We also show that early food intake, self-monitoring motivation, and attitude towards the goal are important factors. We then show that we can use our findings to predict goal achievement with an accuracy of 79% ROC AUC just 7 days after a goal is set. Finally, we discuss how our findings could inform steps to improve goal achievement in self-tracking apps

arXiv.org e-Print Archive

Crossref

Measuring, Predicting and Visualizing Short-Term Change in Word Representation and Usage in VKontakte Social Network

Author: Arendt Dustin
Bell Eric
Stewart Ian
Volkova Svitlana
Publication venue
Publication date: 20/03/2017
Field of study

Language in social media is extremely dynamic: new words emerge, trend and disappear, while the meaning of existing words can fluctuate over time. Such dynamics are especially notable during a period of crisis. This work addresses several important tasks of measuring, visualizing and predicting short term text representation shift, i.e. the change in a word's contextual semantics, and contrasting such shift with surface level word dynamics, or concept drift, observed in social media streams. Unlike previous approaches on learning word representations from text, we study the relationship between short-term concept drift and representation shift on a large social media corpus - VKontakte posts in Russian collected during the Russia-Ukraine crisis in 2014-2015. Our novel contributions include quantitative and qualitative approaches to (1) measure short-term representation shift and contrast it with surface level concept drift; (2) build predictive models to forecast short-term shifts in meaning from previous meaning as well as from concept drift; and (3) visualize short-term representation shift for example keywords to demonstrate the practical use of our approach to discover and track meaning of newly emerging terms in social media. We show that short-term representation shift can be accurately predicted up to several weeks in advance. Our unique approach to modeling and visualizing word representation shifts in social media can be used to explore and characterize specific aspects of the streaming corpus during crisis events and potentially improve other downstream classification tasks including real-time event detection

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Deep Temporal-Recurrent-Replicated-Softmax for Topical Trends over Time

Author: Andrassy Bernt
Gupta Pankaj
Rajaram Subburam
Schütze Hinrich
Publication venue
Publication date: 01/01/2018
Field of study

Dynamic topic modeling facilitates the identification of topical trends over time in temporal collections of unstructured documents. We introduce a novel unsupervised neural dynamic topic model named as Recurrent Neural Network-Replicated Softmax Model (RNNRSM), where the discovered topics at each time influence the topic discovery in the subsequent time steps. We account for the temporal ordering of documents by explicitly modeling a joint distribution of latent topical dependencies over time, using distributional estimators with temporal recurrent connections. Applying RNN-RSM to 19 years of articles on NLP research, we demonstrate that compared to state-of-the art topic models, RNNRSM shows better generalization, topic interpretation, evolution and trends. We also introduce a metric (named as SPAN) to quantify the capability of dynamic topic model to capture word evolution in topics over time.Comment: In Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2018

arXiv.org e-Print Archive

Crossref

Thematic Annotation: extracting concepts out of documents

Author: Andrews Pierre
Rajman Martin
Publication venue
Publication date: 29/12/2004
Field of study

Contrarily to standard approaches to topic annotation, the technique used in this work does not centrally rely on some sort of -- possibly statistical -- keyword extraction. In fact, the proposed annotation algorithm uses a large scale semantic database -- the EDR Electronic Dictionary -- that provides a concept hierarchy based on hyponym and hypernym relations. This concept hierarchy is used to generate a synthetic representation of the document by aggregating the words present in topically homogeneous document segments into a set of concepts best preserving the document's content. This new extraction technique uses an unexplored approach to topic selection. Instead of using semantic similarity measures based on a semantic resource, the later is processed to extract the part of the conceptual hierarchy relevant to the document content. Then this conceptual hierarchy is searched to extract the most relevant set of concepts to represent the topics discussed in the document. Notice that this algorithm is able to extract generic concepts that are not directly present in the document.Comment: Technical report EPFL/LIA. 81 pages, 16 figure

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Tracking the reflexivity of the (dis)engaged citizen: some methodological reflections

Author: Bird E.
Buckingham D.
Campbell A.
Couldry N.
Croteau D.
De Grazia S.
Gamson W.A.
Geuss R.
Hirschauer S.
Isin E.
Joseph Rowntree Charitable Trust.
LeBlanc R.
Lefebvre H.
Marcus G.
Nick Couldry
Stensland P.
Stevenson N.
Szalai A.
Taylor C.
Tim Markham
Touraine A.
Touraine A.
Weintraub J.
Williams R.
Publication venue: 'SAGE Publications'
Publication date: 01/01/2007
Field of study

The relationship between governments and citizens in many contemporary democracies is haunted by uncertainty and sociologists face the task of listening effectively to citizens’ own reflections on this uncertain relationship. This article reflects on the qualitative methodology of a recently completed UK project which used a combination of diary and multiple interviews/ focus groups to track over a fieldwork period of up to a year citizens’ reflections on their relationship to a public world and the contribution to this of their media consumption. In particular, the article considers how the project’s multiple methods enabled multiple angles on the inevitable artificiality and performative dimension of the diary process, resulting in rich data on people’s complex reflections on the uncertain position of the contemporary citizen

Crossref

LSE Research Online

Birkbeck Institutional Research Online

Recommended from our members

Situated cognition in implementation : what teacher professional development looks like from a socio-psychological perspective

Author: Warner Jayce Richard
Publication venue
Publication date: 06/11/2015
Field of study

textMuch has been documented regarding the characteristics of effective professional development, but there is a conspicuous lack of research that attends to the ways that situational factors influence its implementation. Identifying and interpreting these factors can have important implications for designers and evaluators of professional development programs, especially if we are to understand knowledge construction as being culturally mediated, agentic, and situated within local contexts. This study seeks to uncover the social and psychological factors that mediate the way local actors implement professional development by analyzing how facilitators and teachers enact a large-scale professional development program in the absence of strict fidelity expectations.Educational Psycholog

Texas ScholarWorks

AUGUR: Forecasting the Emergence of New Research Topics

Author: Duvvuru A.
Erten C.
Leydesdorff L.
Osborne F.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 23/05/2018
Field of study

Being able to rapidly recognise new research trends is strategic for many stakeholders, including universities, institutional funding bodies, academic publishers and companies. The literature presents several approaches to identifying the emergence of new research topics, which rely on the assumption that the topic is already exhibiting a certain degree of popularity and consistently referred to by a community of researchers. However, detecting the emergence of a new research area at an embryonic stage, i.e., before the topic has been consistently labelled by a community of researchers and associated with a number of publications, is still an open challenge. We address this issue by introducing Augur, a novel approach to the early detection of research topics. Augur analyses the diachronic relationships between research areas and is able to detect clusters of topics that exhibit dynamics correlated with the emergence of new research topics. Here we also present the Advanced Clique Percolation Method (ACPM), a new community detection algorithm developed specifically for supporting this task. Augur was evaluated on a gold standard of 1,408 debutant topics in the 2000-2011 interval and outperformed four alternative approaches in terms of both precision and recall

Crossref

Open Research Online (The Open University)

Stability and sensitivity of Learning Analytics based prediction models

Author: Giesbers B.
Rienties B.
Tempelaar D. T.
Publication venue: 'Scitepress'
Publication date: 01/01/2015
Field of study

Learning analytics seek to enhance the learning processes through systematic measurements of learning related data and to provide informative feedback to learners and educators. Track data from Learning Management Systems (LMS) constitute a main data source for learning analytics. This empirical contribution provides an application of Buckingham Shum and Deakin Crick’s theoretical framework of dispositional learning analytics: an infrastructure that combines learning dispositions data with data extracted from computer-assisted, formative assessments and LMSs. In two cohorts of a large introductory quantitative methods module, 2049 students were enrolled in a module based on principles of blended learning, combining face-to-face Problem-Based Learning sessions with e-tutorials. We investigated the predictive power of learning dispositions, outcomes of continuous formative assessments and other system generated data in modelling student performance and their potential to generate informative feedback. Using a dynamic, longitudinal perspective, computer-assisted formative assessments seem to be the best predictor for detecting underperforming students and academic performance, while basic LMS data did not substantially predict learning. If timely feedback is crucial, both use-intensity related track data from e-tutorial systems, and learning dispositions, are valuable sources for feedback generation

Crossref

Open Research Online (The Open University)