Search CORE

1,989 research outputs found

Visualizing Contextual Information in Aggregated Web Content Repositories

Author: Fischl Daniel
Jones Alistair
Kamolov Ruslan
Rafelsberger Walter
Scharl Arno
Publication venue
Publication date: 24/10/2014
Field of study

Understanding stakeholder perceptions and the impact of campaigns are key insights for communication experts and policy makers. A structured analysis of Web content can help answer these questions, particularly if this analysis involves the ability to extract, disambiguate and visualize contextual information. After summarizing methods used for acquiring and annotating Web content repositories, we present visualization techniques to explore the lexical, geospatial and relational context of entities in these repositories. The examples stem from the Media Watch on Climate Change, a publicly available Web portal that aggregates environmental resources from various online sources

webLyzard technology gmbh

Information Extraction in Illicit Domains

Author: Banko M.
Bauer F.
Chakrabarti S.
Kushmerick N.
Mikolov T.
Sahlgren M.
Wick M.
Zouaq A.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 08/03/2017
Field of study

Extracting useful entities and attribute values from illicit domains such as human trafficking is a challenging problem with the potential for widespread social impact. Such domains employ atypical language models, have `long tails' and suffer from the problem of concept drift. In this paper, we propose a lightweight, feature-agnostic Information Extraction (IE) paradigm specifically designed for such domains. Our approach uses raw, unlabeled text from an initial corpus, and a few (12-120) seed annotations per domain-specific attribute, to learn robust IE models for unobserved pages and websites. Empirically, we demonstrate that our approach can outperform feature-centric Conditional Random Field baselines by over 18\% F-Measure on five annotated sets of real-world human trafficking datasets in both low-supervision and high-supervision settings. We also show that our approach is demonstrably robust to concept drift, and can be efficiently bootstrapped even in a serial computing environment.Comment: 10 pages, ACM WWW 201

arXiv.org e-Print Archive

Crossref

Overcoming Language Dichotomies: Toward Effective Program Comprehension for Mobile App Development

Author: Bavota G.
Gu T.
Gui J.
Jabbarvand R.
Lelli V.
Linares-Vásquez M.
Liu Y.
Moran K.
Palomba F.
Robillard M. P.
Wan M.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 18/07/2018
Field of study

Mobile devices and platforms have become an established target for modern software developers due to performant hardware and a large and growing user base numbering in the billions. Despite their popularity, the software development process for mobile apps comes with a set of unique, domain-specific challenges rooted in program comprehension. Many of these challenges stem from developer difficulties in reasoning about different representations of a program, a phenomenon we define as a "language dichotomy". In this paper, we reflect upon the various language dichotomies that contribute to open problems in program comprehension and development for mobile apps. Furthermore, to help guide the research community towards effective solutions for these problems, we provide a roadmap of directions for future work.Comment: Invited Keynote Paper for the 26th IEEE/ACM International Conference on Program Comprehension (ICPC'18

arXiv.org e-Print Archive

Crossref

MACE: Deliverable D7.6 - Report on user interface design and community experiments

Author: Börner Dirk
Nagel Till
Specht Marcus
Stefaner Moritz
Wolpers Martin
Publication venue
Publication date: 02/02/2009
Field of study

This deliverable presents the progress of the user interface design and community building experiments within the MACE project. In Chapter 2 we generally present the interface of the MACE portal, which is a platform to discover and enrich architectural resources and, at the same time, to support the community formed around architectural topics. Besides the advanced search, the portal provides various visual tools for metadata based search and browsing, tailored to architectural needs (see Chapter 3). Different metadata widgets are used to visualize and access multiple dimensions of each resource, as presented in Chapter 4. These widgets not only establish meaningful cross–connections between resources, but also invite to add and edit metadata effortlessly. In order to generate a critical mass of metadata and ensure sustainability of projects’ outcomes, supporting community and fostering end user contributions are critical. In Chapter 5, we present the components deploied in this direction as well as an analytical framework for incentive mechanisms. Within the dissemination strategy, the MACE project has got a unique chance to raise its public awareness at La Biennale of architecture in Venice, 2008. In this context we designed an interactive installation, demonstrating, in an exhibition setting, the benefits of resource interconnection via metadata (see Chapter 6). Chapter 7 presents our preliminary conclusions and an overview of planned future activities

Open University of the Netherlands Research Portal

Exploring the characteristics of issue-related behaviors in GitHub using visualization techniques

Author: Chen Zhijie
Fan Xiaoping
He Dayu
Liao Zhifang
Liu Shengzong
Zhang Yan
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018
Field of study

Crossref

ResearchOnline@GCU

Multiple Coordinated Views for Searching and Navigating Web Content Repositories

Author: Hubmann-Haidvogel Alexander
Scharl Arno
Weichselbraun Albert
Publication venue: 'Elsevier BV'
Publication date: 01/01/2009
Field of study

The advantages and positive effects of multiple coordinated views on search performance have been documented in several studies. This paper describes the implementation of multiple coordinated views within the Media Watch on Climate Change, a domain-speciﬁc news aggregation portal available at www.ecoresearch.net/climate that combines a portfolio of semantic services with a visual information exploration and retrieval interface. The system builds contextualized information spaces by enriching the content repository with geospatial, semantic and temporal annotations, and by applying semi-automated ontology learning to create a controlled vocabulary for structuring the stored information. Portlets visualize the different dimensions of the contextualized information spaces, providing the user with multiple views on the latest news media coverage. Context information facilitates access to complex datasets and helps users navigate large repositories of Web documents. Currently, the system synchronizes information landscapes, domain ontologies, geographic maps, tag clouds and just-in-time information retrieval agents that suggest similar topics and nearby locations

CiteSeerX

Crossref

webLyzard technology gmbh

Enhancing Scholarly Publications: Developing Hybrid Monographs in the Humanities and Social Sciences

Author: Jankowski Nicholas
Scharnhorst Andrea
Tatum Clifford
Tatum Zuotian
Publication venue: 'CISP Journal Services'
Publication date: 19/12/2012
Field of study

Enhancing publications has a long history but is gaining acceleration as authors and publishers explore electronic tablets as devices for dissemination and presentation. Enhancement of scholarly publications, in contrast, more often takes place in a Web environment and is coupled with presentation of supplementary materials related to research. The approach to enhancing scholarly publications presented in this article goes a step further and involves the interlinking of the “objects” of a document: datasets, supplementary materials, secondary analyses, and post-publication interventions. This approach connects the user-centricity of Web 2.0 with the Semantic Web. It aims at facilitating long-term content structure through standardized formats intended to improve interoperability between concepts and terms within and across knowledge domains. We explored this conception of enhancement on a small set of books prepared for traditional academic publishers. While the project was primarily an exercise in development, the conclusion section of the article reflects on areas where conceptual and empirical studies could be initiated to complement this new direction in scholarly publishing.&nbsp

Scholarly and Research Communication (E-Journal)

Measuring metadata quality

Author: Király Péter
Publication venue
Publication date: 24/06/2019
Field of study

Georg-August-University Göttingen

Usage Bibliometrics

Author: Abt
Accomazzi
Aggarwal
Baldi
Bar-Ilan
Bensman
Bertot
Blecic
Bollen
Bollen
Bollen
Bollen
Bollen
Bollen
Bollen
Bonitz
Borgman
Boyack
Boyack
Brin
Broadus
Broadus
Brody
Brody
Brookes
Burton
Börner
Börner
Castellano
Chen
Cooper
Craig
Cronin
Cronin
Cronin
Darmoni
Davis
Davis
Davis
Davis
Davis
Drott
Duy
Eason
Egghe
Eichhorn
Eysenbach
Eysenbach
Fortunato
Freire
Galvin
Gardner
Garfield
Garfield
Garfield
Gargouri
Georgakopoulos
Ginsparg
Ginsparg
Ginsparg
Ginsparg
Goldberg
Gosnell
Grant
Gross
Hajjem
Harnad
Harnad
Harnad
He
Henneken
Henneken
Henneken
Hider
Hood
Huntington
Jamali
Jansen
Jansen
Kaplan
King
King
King
King
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Ladwig
Lawrence
Leydesdorff
Leydesdorff
Leydesdorff
Line
Line
Liu
Ludascher
Luther
MacRoberts
May
Mayr
McDonald
Meadows
Merton
Moed
Moed
Moed
Moya-Anegón
Nicholas
Norris
Pan
Parker
Peters
Pinski
Pirolli
Price
Price
Price
Rice
Rosvall
Rowlands
Rowlands
Scales
Shepherd
Small
Stankus
Szalay
Szalay
Tenopir
Tenopir
Tonta
Trimble
Trimble
Tsay
Tsay
Van de Sompel
Van de Sompel
Van de Sompel
Walter
Wang
Wasserman
White
Wilson
York
Publication venue: 'Wiley'
Publication date: 14/02/2011
Field of study

Scholarly usage data provides unique opportunities to address the known shortcomings of citation analysis. However, the collection, processing and analysis of usage data remains an area of active research. This article provides a review of the state-of-the-art in usage-based informetric, i.e. the use of usage data to study the scholarly process.Comment: Publisher's PDF (by permission). Publisher web site: books.infotoday.com/asist/arist44.shtm

arXiv.org e-Print Archive

Crossref

Visualisation of the organisation knowledge structure evolution

Author: Mostashari Ali
Stanković Tino
Štorga Mario
Publication venue: 'Emerald'
Publication date: 01/01/2013
Field of study

FAMENA Repository