1,989 research outputs found
Visualizing Contextual Information in Aggregated Web Content Repositories
Understanding stakeholder perceptions and the impact of campaigns are key insights for communication experts and policy makers. A structured analysis of Web content can help answer these questions, particularly if this analysis involves the ability to extract, disambiguate and visualize contextual information. After summarizing methods used for acquiring and annotating Web content repositories, we present visualization techniques to explore the lexical, geospatial and relational context of entities in these repositories. The examples stem from the Media Watch on Climate Change, a publicly available Web portal that aggregates environmental resources from various online sources
Information Extraction in Illicit Domains
Extracting useful entities and attribute values from illicit domains such as
human trafficking is a challenging problem with the potential for widespread
social impact. Such domains employ atypical language models, have `long tails'
and suffer from the problem of concept drift. In this paper, we propose a
lightweight, feature-agnostic Information Extraction (IE) paradigm specifically
designed for such domains. Our approach uses raw, unlabeled text from an
initial corpus, and a few (12-120) seed annotations per domain-specific
attribute, to learn robust IE models for unobserved pages and websites.
Empirically, we demonstrate that our approach can outperform feature-centric
Conditional Random Field baselines by over 18\% F-Measure on five annotated
sets of real-world human trafficking datasets in both low-supervision and
high-supervision settings. We also show that our approach is demonstrably
robust to concept drift, and can be efficiently bootstrapped even in a serial
computing environment.Comment: 10 pages, ACM WWW 201
Overcoming Language Dichotomies: Toward Effective Program Comprehension for Mobile App Development
Mobile devices and platforms have become an established target for modern
software developers due to performant hardware and a large and growing user
base numbering in the billions. Despite their popularity, the software
development process for mobile apps comes with a set of unique, domain-specific
challenges rooted in program comprehension. Many of these challenges stem from
developer difficulties in reasoning about different representations of a
program, a phenomenon we define as a "language dichotomy". In this paper, we
reflect upon the various language dichotomies that contribute to open problems
in program comprehension and development for mobile apps. Furthermore, to help
guide the research community towards effective solutions for these problems, we
provide a roadmap of directions for future work.Comment: Invited Keynote Paper for the 26th IEEE/ACM International Conference
on Program Comprehension (ICPC'18
MACE: Deliverable D7.6 - Report on user interface design and community experiments
This deliverable presents the progress of the user interface design and community
building experiments within the MACE project.
In Chapter 2 we generally present the interface of the MACE portal, which is a
platform to discover and enrich architectural resources and, at the same time, to
support the community formed around architectural topics. Besides the advanced
search, the portal provides various visual tools for metadata based search and
browsing, tailored to architectural needs (see Chapter 3). Different metadata widgets
are used to visualize and access multiple dimensions of each resource, as presented
in Chapter 4. These widgets not only establish meaningful cross–connections
between resources, but also invite to add and edit metadata effortlessly.
In order to generate a critical mass of metadata and ensure sustainability of projects’
outcomes, supporting community and fostering end user contributions are critical. In
Chapter 5, we present the components deploied in this direction as well as an
analytical framework for incentive mechanisms.
Within the dissemination strategy, the MACE project has got a unique chance to
raise its public awareness at La Biennale of architecture in Venice, 2008. In this
context we designed an interactive installation, demonstrating, in an exhibition
setting, the benefits of resource interconnection via metadata (see Chapter 6).
Chapter 7 presents our preliminary conclusions and an overview of planned future
activities
Multiple Coordinated Views for Searching and Navigating Web Content Repositories
The advantages and positive effects of multiple coordinated views on search performance have been documented in several studies. This paper describes the implementation of multiple coordinated views within the Media Watch on Climate Change, a domain-specific news aggregation portal available at
www.ecoresearch.net/climate that combines a portfolio of semantic services with a visual information exploration and retrieval interface. The system builds contextualized information spaces by enriching the content repository with geospatial, semantic and temporal annotations, and by applying semi-automated ontology learning to create a controlled vocabulary for structuring the stored information. Portlets visualize the different dimensions of the contextualized information spaces, providing the user with multiple views on the latest news media coverage. Context information facilitates access to complex datasets and helps users navigate large repositories of Web documents. Currently, the system synchronizes information landscapes, domain ontologies, geographic maps, tag clouds and just-in-time information retrieval agents that suggest similar topics and nearby locations
Enhancing Scholarly Publications: Developing Hybrid Monographs in the Humanities and Social Sciences
Enhancing publications has a long history but is gaining acceleration as authors and publishers explore electronic tablets as devices for dissemination and presentation. Enhancement of scholarly publications, in contrast, more often takes place in a Web environment and is coupled with presentation of supplementary materials related to research. The approach to enhancing scholarly publications presented in this article goes a step further and involves the interlinking of the “objects” of a document: datasets, supplementary materials, secondary analyses, and post-publication interventions. This approach connects the user-centricity of Web 2.0 with the Semantic Web. It aims at facilitating long-term content structure through standardized formats intended to improve interoperability between concepts and terms within and across knowledge domains. We explored this conception of enhancement on a small set of books prepared for traditional academic publishers. While the project was primarily an exercise in development, the conclusion section of the article reflects on areas where conceptual and empirical studies could be initiated to complement this new direction in scholarly publishing. 
Usage Bibliometrics
Scholarly usage data provides unique opportunities to address the known
shortcomings of citation analysis. However, the collection, processing and
analysis of usage data remains an area of active research. This article
provides a review of the state-of-the-art in usage-based informetric, i.e. the
use of usage data to study the scholarly process.Comment: Publisher's PDF (by permission). Publisher web site:
books.infotoday.com/asist/arist44.shtm
- …