Search CORE

9 research outputs found

Guided Probabilistic Topic Models for Agenda-setting and Framing

Author: Nguyen Viet An
Publication venue
Publication date: 01/01/2015
Field of study

Probabilistic topic models are powerful methods to uncover hidden thematic structures in text by projecting each document into a low dimensional space spanned by a set of topics. Given observed text data, topic models infer these hidden structures and use them for data summarization, exploratory analysis, and predictions, which have been applied to a broad range of disciplines. Politics and political conflicts are often captured in text. Traditional approaches to analyze text in political science and other related fields often require close reading and manual labeling, which is labor-intensive and hinders the use of large-scale collections of text. Recent work, both in computer science and political science, has used automated content analysis methods, especially topic models to substantially reduce the cost of analyzing text at large scale. In this thesis, we follow this approach and develop a series of new probabilistic topic models, guided by additional information associated with the text, to discover and analyze agenda-setting (i.e., what topics people talk about) and framing (i.e., how people talk about those topics), a central research problem in political science, communication, public policy and other related fields. We first focus on study agendas and agenda control behavior in political debates and other conversations. The model we introduce, Speaker Identity for Topic Segmentation (SITS), is able to discover what topics that are talked about during the debates, when these topics change, and a speaker-specific measure of agenda control. To make the analysis process more effective, we build Argviz, an interactive visualization which leverages SITS's outputs to allow users to quickly grasp the conversational topic dynamics, discover when the topic changes and by whom, and interactively visualize the conversation's details on demand. We then analyze policy agendas in a more general setting of political text. We present the Label to Hierarchy (L2H) model to learn a hierarchy of topics from multi-labeled data, in which each document is tagged with multiple labels. The model captures the dependencies among labels using an interpretable tree-structured hierarchy, which helps provide insights about the political attentions that policymakers focus on, and how these policy issues relate to each other. We then go beyond just agenda-setting and expand our focus to framing--the study of how agenda issues are talked about, which can be viewed as second-level agenda-setting. To capture this hierarchical views of agendas and frames, we introduce the Supervised Hierarchical Latent Dirichlet Allocation (SHLDA) model, which jointly captures a collection of documents, each is associated with a continuous response variable such as the ideological position of the document's author on a liberal-conservative spectrum. In the topic hierarchy discovered by SHLDA, higher-level nodes map to more general agenda issues while lower-level nodes map to issue-specific frames. Although qualitative analysis shows that the topic hierarchies learned by SHLDA indeed capture the hierarchical view of agenda-setting and framing motivating the work, interpreting the discovered hierarchy still incurs moderately high cost due to the complex and abstract nature of framing. Motivated by improving the hierarchy, we introduce Hierarchical Ideal Point Topic Model (HIPTM) which jointly models a collection of votes (e.g., congressional roll call votes) and both the text associated with the voters (e.g., members of Congress) and the items (e.g., congressional bills). Customized specifically for capturing the two-level view of agendas and frames, HIPTM learns a two-level hierarchy of topics, in which first-level nodes map to an interpretable policy issue and second-level nodes map to issue-specific frames. In addition, instead of using pre-computed response variable, HIPTM also jointly estimates the ideological positions of voters on multiple interpretable dimensions

Digital Repository at the University of Maryland

Health Capital: An Empirical Study of Danish Healthcare Professionals’ Bodily Investments

Author: Harsløf Ivan
Hindhede Anette Lykke
Højbjerg Karin
Larsen Kristian
Publication venue
Publication date: 01/07/2018
Field of study

Copenhagen University Research Information System

VBN

The Whitworthian 2007-2008

Author: Whitworth University
Publication venue: Whitworth University
Publication date: 01/01/2008
Field of study

The Whitworthian student newspaper, September 2007-April 2008.https://digitalcommons.whitworth.edu/whitworthian/1092/thumbnail.jp

Whitworth University

The Whitworthian 2008-2009

Author: Whitworth University
Publication venue: Whitworth University
Publication date: 01/01/2009
Field of study

The Whitworthian student newspaper, September 2008-May 2009.https://digitalcommons.whitworth.edu/whitworthian/1093/thumbnail.jp

Whitworth University

Bowdoin Orient v.138, no.1-25 (2008-2009)

Author: The Bowdoin Orient
Publication venue: Bowdoin Digital Commons
Publication date: 10/01/2009
Field of study

https://digitalcommons.bowdoin.edu/bowdoinorient-2000s/1009/thumbnail.jp

Bowdoin College

Metal specific modulation of community permissiveness towards broad host range plasmids through stress

Author: Brandt K.K.
Dechesne Arnaud
Klümper Uli
Riber Leise
Smets Barth F.
Sørensen S.
Publication venue
Publication date: 01/01/2015
Field of study

Online Research Database In Technology

Metagenomic analysis of microbial communities in rapid sand filter treating groundwater. Community diversity and metabolic potential

Author: Palomo Alejandro
Rasmussen S.
Sicheritz-Pontén Thomas
Smets Barth F.
Publication venue
Publication date: 01/01/2015
Field of study

Online Research Database In Technology

Bowdoin Orient v.137, no.1-25 (2007-2008)

Author: The Bowdoin Orient
Publication venue: Bowdoin Digital Commons
Publication date: 09/01/2008
Field of study

https://digitalcommons.bowdoin.edu/bowdoinorient-2000s/1008/thumbnail.jp

Bowdoin College

Maritime expressions:a corpus based exploration of maritime metaphors

Author: Isserlis Simon Jonathon
Publication venue
Publication date
Field of study

This study uses a purpose-built corpus to explore the linguistic legacy of Britain’s maritime history found in the form of hundreds of specialised ‘Maritime Expressions’ (MEs), such as TAKEN ABACK, ANCHOR and ALOOF, that permeate modern English. Selecting just those expressions commencing with ’A’, it analyses 61 MEs in detail and describes the processes by which these technical expressions, from a highly specialised occupational discourse community, have made their way into modern English. The Maritime Text Corpus (MTC) comprises 8.8 million words, encompassing a range of text types and registers, selected to provide a cross-section of ‘maritime’ writing. It is analysed using WordSmith analytical software (Scott, 2010), with the 100 million-word British National Corpus (BNC) as a reference corpus. Using the MTC, a list of keywords of specific salience within the maritime discourse has been compiled and, using frequency data, concordances and collocations, these MEs are described in detail and their use and form in the MTC and the BNC is compared. The study examines the transformation from ME to figurative use in the general discourse, in terms of form and metaphoricity. MEs are classified according to their metaphorical strength and their transference from maritime usage into new registers and domains such as those of business, politics, sports and reportage etc. A revised model of metaphoricity is developed and a new category of figurative expression, the ‘resonator’, is proposed. Additionally, developing the work of Lakov and Johnson, Kovesces and others on Conceptual Metaphor Theory (CMT), a number of Maritime Conceptual Metaphors are identified and their cultural significance is discussed

Aston Publications Explorer