Search CORE

180,754 research outputs found

TopicViz: Semantic Navigation of Document Collections

Author: Chau Duen Horng "Polo"
Eisenstein Jacob
Kittur Aniket
Xing Eric P.
Publication venue
Publication date: 03/11/2011
Field of study

When people explore and manage information, they think in terms of topics and themes. However, the software that supports information exploration sees text at only the surface level. In this paper we show how topic modeling -- a technique for identifying latent themes across large collections of documents -- can support semantic exploration. We present TopicViz, an interactive environment for information exploration. TopicViz combines traditional search and citation-graph functionality with a range of novel interactive visualizations, centered around a force-directed layout that links documents to the latent themes discovered by the topic model. We describe several use scenarios in which TopicViz supports rapid sensemaking on large document collections

arXiv.org e-Print Archive

CiteSeerX

Exploratory topic modeling with distributional semantics

Author: A Treisman
DA Keim
DM Blei
J Risch
L Barth
M Bostock
S Fortunato
S Lohmann
S Palmer
Y Bengio
Publication venue
Publication date: 16/07/2015
Field of study

As we continue to collect and store textual data in a multitude of domains, we are regularly confronted with material whose largely unknown thematic structure we want to uncover. With unsupervised, exploratory analysis, no prior knowledge about the content is required and highly open-ended tasks can be supported. In the past few years, probabilistic topic modeling has emerged as a popular approach to this problem. Nevertheless, the representation of the latent topics as aggregations of semi-coherent terms limits their interpretability and level of detail. This paper presents an alternative approach to topic modeling that maps topics as a network for exploration, based on distributional semantics using learned word vectors. From the granular level of terms and their semantic similarity relations global topic structures emerge as clustered regions and gradients of concepts. Moreover, the paper discusses the visual interactive representation of the topic map, which plays an important role in supporting its exploration.Comment: Conference: The Fourteenth International Symposium on Intelligent Data Analysis (IDA 2015

arXiv.org e-Print Archive

Crossref

Iterative Seed Word Generation for Interactive Topic Modelling: a Mixed Text Processing and Qualitative Content Analysis Approach

Author: Akhlaghpour Saeed
Boyce James
Namvar Morteza
Sharifi Khajedehi Salma
Publication venue: AIS Electronic Library (AISeL)
Publication date: 12/12/2022
Field of study

Topic models have great potential for helping researchers and practitioners understand the electronic word of mouth (eWoM). This potential is thwarted by their purely unsupervised nature, which often leads to topics that are not entirely explainable. We develop a novel method to iteratively generate seed words to guide the interactive topic models. We assess the validity and applicability of the proposed method by investigating the critical phenomenon of Contact Tracing Mobile Applications (CTMAs) post-adoption during a time of the COVID-19 pandemic. The results show that constructs developed through our interactive topic modeling can capture primary research variables related to the phenomenon. Compared to existing topic modeling methods, our approach shows superior performance in explaining users’ satisfaction with CTMAs

AIS Electronic Library (AISeL)

Final report TransForum WP-046 : images of sustainable development of Dutch agriculture and green space

Author: Apeldoorn D.F., van
Beers P.J.
Hermans F.L.P.
Veldkamp T.
Vervoort J.M.
Publication venue: S.n.
Publication date
Field of study

In the project “Images of sustainable development of Dutch agriculture and green space” three PhD candidates studied the topic of images in sustainable development. Frans Hermans focused on the topic of societal images and their role and influence in innovation projects. The title of his subproject was “Social learning for sustainability in dynamic agricultural innovation networks.” Joost Vervoort explored the topic of “visualisation”, that is, using and producing images for specific purposes, in the context of innovation projects and programmes, in a subproject called “Step into the system: interactive media strategies for the exchange of insights on social-ecological change.” Finally, Dirk van Apeldoorn took a complex adaptive systems approach to images. He modelled various agro-ecosystems to compare images of those systems with the behaviour of those systems. His subproject was called “Modeling resilience of agro-ecosystems.

Wageningen University & Research Publications

The Design of an Interactive Topic Modeling Application for Media Content

Author: Ham Laura
Publication venue
Publication date: 18/08/2020
Field of study

Topic Modeling has been widely used by data scientists to analyze the increasing amount of text documents. Documents can be assigned to a distribution of topics with techniques like LDA or NMF, that are related to unsupervised soft clustering but consider text semantics. More recently, Interactive Topic Modeling (ITM) has been introduced to incorporate human expertise in the modeling process. This enables real-time hyperparameter optimization and topic manipulation on document and keyword level. However, current ITM applications are mostly accessible to experienced data scientists, who lack domain knowledge. Domain experts, on the other hand, usually lack the data science expertise to build and use ITM applications. This thesis presents an Interactive Topic Modeling application accessible to non-technical data analysts in the broadcasting domain. The application allows domain experts, like journalists, to explore themes in various produced media content in a dynamic, intuitive and efficient manner. An interactive interface, with an embedded NMF topic model, enables users to filter on various data sources, configure and refine the topic model, interpret and evaluate the output by visualizations, and analyze the data in wider context. This application was designed in collaboration with domain experts in focus group sessions, according to human-centered design principles. An evaluation study with ten participants shows that journalists and data analysts without any natural language processing knowledge agree that the application is not only usable, but also very user-friendly, effective and efficient. A SUS score of 81 was received, and user experience and user perceptions of control questionnaires both received an average of 4.1 on a five-point Likert scale. The ITM application thus enables this specific user group to extract meaningful topics from their produced media content, and use these results in broader perspective to perform exploratory data analysis. The success of the final application design presented in this thesis shows that the knowledge gap between data scientists and domain experts in the broadcasting field has been filled. In bigger perspective; machine learning applications can be made more accessible by translating hidden low-level details of complex models into high-level model interactions, presented in a user interface

Aaltodoc Publication Archive

Automated construction and analysis of political networks via open government and media sources

Author: Arias Vicente Marta
García-Olano Diego
Larriba Pey Josep
Publication venue
Publication date: 01/01/2016
Field of study

We present a tool to generate real world political networks from user provided lists of politicians and news sites. Additional output includes visualizations, interactive tools and maps that allow a user to better understand the politicians and their surrounding environments as portrayed by the media. As a case study, we construct a comprehensive list of current Texas politicians, select news sites that convey a spectrum of political viewpoints covering Texas politics, and examine the results. We propose a ”Combined” co-occurrence distance metric to better reflect the relationship between two entities. A topic modeling technique is also proposed as a novel, automated way of labeling communities that exist within a politician’s ”extended” network.Peer ReviewedPostprint (author's final draft

UPCommons. Portal del coneixement obert de la UPC