8,629 research outputs found
Towards Cleaning-up Open Data Portals: A Metadata Reconciliation Approach
This paper presents an approach for metadata reconciliation, curation and
linking for Open Governamental Data Portals (ODPs). ODPs have been lately the
standard solution for governments willing to put their public data available
for the society. Portal managers use several types of metadata to organize the
datasets, one of the most important ones being the tags. However, the tagging
process is subject to many problems, such as synonyms, ambiguity or
incoherence, among others. As our empiric analysis of ODPs shows, these issues
are currently prevalent in most ODPs and effectively hinders the reuse of Open
Data. In order to address these problems, we develop and implement an approach
for tag reconciliation in Open Data Portals, encompassing local actions related
to individual portals, and global actions for adding a semantic metadata layer
above individual portals. The local part aims to enhance the quality of tags in
a single portal, and the global part is meant to interlink ODPs by establishing
relations between tags.Comment: 8 pages,10 Figures - Under Revision for ICSC201
Smart Signs: Showing the way in Smart Surroundings
This paper presents a context-aware guidance and messaging
system for large buildings and surrounding venues. Smart Signs are
a new type of electronic door- and way-sign based on wireless sensor networks.
Smart Signs present in-situ personalized guidance and messages,
are ubiquitous, and easy to understand. They combine the easiness of
use of traditional static signs with the flexibility and reactiveness of navigation
systems. The Smart Signs system uses context information such
as userâs mobility limitations, the weather, and possible emergency situations
to improve guidance and messaging.
Minimal infrastructure requirements and a simple deployment tool make
it feasible to easily deploy a Smart Signs system on demand.
An important design issue of the Smart Signs system is privacy: the
system secures communication links, does not track users, allow almost
complete anonymous use, and prevent the system to be used as a tool
for spying on users
SWI-Prolog and the Web
Where Prolog is commonly seen as a component in a Web application that is
either embedded or communicates using a proprietary protocol, we propose an
architecture where Prolog communicates to other components in a Web application
using the standard HTTP protocol. By avoiding embedding in external Web servers
development and deployment become much easier. To support this architecture, in
addition to the transfer protocol, we must also support parsing, representing
and generating the key Web document types such as HTML, XML and RDF.
This paper motivates the design decisions in the libraries and extensions to
Prolog for handling Web documents and protocols. The design has been guided by
the requirement to handle large documents efficiently. The described libraries
support a wide range of Web applications ranging from HTML and XML documents to
Semantic Web RDF processing.
To appear in Theory and Practice of Logic Programming (TPLP)Comment: 31 pages, 24 figures and 2 tables. To appear in Theory and Practice
of Logic Programming (TPLP
RePEc and S-WoPEc: Internet access to electronic preprints in Economics
The first electronic Economics preprint appeared in 1993. Since then the growth has been dramatic as the use of the World Wide Web has exploded. RePEc has been instrumental in facilitating access to Economics preprints and in bringing order to the chaos that the WWW frequently represents. In a related effort S-WoPEc provides user friendly tools for adding data to the RePEc system. While this is significant in itself it has also been instrumental in fulfilling S-WoPEc's second goal--to provide increased exposure to Swedish Economics research
Connected Information Management
Society is currently inundated with more information than ever, making efficient management
a necessity. Alas, most of current information management suffers from several
levels of disconnectedness: Applications partition data into segregated islands,
small notes donât fit into traditional application categories, navigating the data is different
for each kind of data; data is either available at a certain computer or only online,
but rarely both. Connected information management (CoIM) is an approach to information
management that avoids these ways of disconnectedness. The core idea of
CoIM is to keep all information in a central repository, with generic means for organization
such as tagging. The heterogeneity of data is taken into account by offering
specialized editors.
The central repository eliminates the islands of application-specific data and is formally
grounded by a CoIM model. The foundation for structured data is an RDF repository.
The RDF editing meta-model (REMM) enables form-based editing of this data,
similar to database applications such as MS access. Further kinds of data are supported
by extending RDF, as follows. Wiki text is stored as RDF and can both contain
structured text and be combined with structured data. Files are also supported by the
CoIM model and are kept externally. Notes can be quickly captured and annotated with
meta-data. Generic means for organization and navigation apply to all kinds of data.
Ubiquitous availability of data is ensured via two CoIM implementations, the web application
HYENA/Web and the desktop application HYENA/Eclipse. All data can be
synchronized between these applications. The applications were used to validate the
CoIM ideas
Automated construction and analysis of political networks via open government and media sources
We present a tool to generate real world political networks from user provided lists of politicians and news sites. Additional output includes visualizations, interactive tools and maps that allow a user to better understand the politicians and their surrounding environments as portrayed by the media. As a case study, we construct a comprehensive list of current Texas politicians, select news sites that convey a spectrum of political viewpoints covering Texas politics, and examine the results. We propose a âCombinedâ co-occurrence distance metric to better reflect the relationship between two entities. A topic modeling technique is also proposed as a novel, automated way of labeling communities that exist within a politicianâs âextendedâ network.Peer ReviewedPostprint (author's final draft
- âŠ