124,135 research outputs found
Term-community-based topic detection with variable resolution
Network-based procedures for topic detection in huge text collections offer
an intuitive alternative to probabilistic topic models. We present in detail a
method that is especially designed with the requirements of domain experts in
mind. Like similar methods, it employs community detection in term
co-occurrence graphs, but it is enhanced by including a resolution parameter
that can be used for changing the targeted topic granularity. We also establish
a term ranking and use semantic word-embedding for presenting term communities
in a way that facilitates their interpretation. We demonstrate the application
of our method with a widely used corpus of general news articles and show the
results of detailed social-sciences expert evaluations of detected topics at
various resolutions. A comparison with topics detected by Latent Dirichlet
Allocation is also included. Finally, we discuss factors that influence topic
interpretation.Comment: 31 pages, 6 figure
Network of the Day: Aggregating and Visualizing Entity Networks from Online Sources
This software demonstration paper presents a project on the interactive visualization of social media data. The data presentation fuses German Twitter data and a social relation network extracted from German online news. Such fusion allows for comparative analysis of the two types of media. Our system will additionally enable users to explore relationships between named entities, and to investigate events as they develop over time. Cooperative tagging of relationships is enabled through the active involvement of users. The system is available online for a broad user audience
Bank Networks from Text: Interrelations, Centrality and Determinants
In the wake of the still ongoing global financial crisis, bank
interdependencies have come into focus in trying to assess linkages among banks
and systemic risk. To date, such analysis has largely been based on numerical
data. By contrast, this study attempts to gain further insight into bank
interconnections by tapping into financial discourse. We present a
text-to-network process, which has its basis in co-occurrences of bank names
and can be analyzed quantitatively and visualized. To quantify bank importance,
we propose an information centrality measure to rank and assess trends of bank
centrality in discussion. For qualitative assessment of bank networks, we put
forward a visual, interactive interface for better illustrating network
structures. We illustrate the text-based approach on European Large and Complex
Banking Groups (LCBGs) during the ongoing financial crisis by quantifying bank
interrelations and centrality from discussion in 3M news articles, spanning
2007Q1 to 2014Q3.Comment: Quantitative Finance, forthcoming in 201
Automated construction and analysis of political networks via open government and media sources
We present a tool to generate real world political networks from user provided lists of politicians and news sites. Additional output includes visualizations, interactive tools and maps that allow a user to better understand the politicians and their surrounding environments as portrayed by the media. As a case study, we construct a comprehensive list of current Texas politicians, select news sites that convey a spectrum of political viewpoints covering Texas politics, and examine the results. We propose a ”Combined” co-occurrence distance metric to better reflect the relationship between two entities. A topic modeling technique is also proposed as a novel, automated way of labeling communities that exist within a politician’s ”extended” network.Peer ReviewedPostprint (author's final draft
- …