Search CORE

10,678 research outputs found

Doc2RDFa: Semantic Annotation for Web Documents

Author: Beno Martin
Filtz Erwin
Kirrane Sabrina
Polleres Axel
Publication venue: Published by its editors
Publication date: 01/01/2019
Field of study

Ever since its conception, the amount of data published on the worldwide web has been rapidly growing to the point where it has become an important source of both general and domain specific information. However, the majority of documents published online are not machine readable by default. Many researchers believe that the answer to this problem is to semantically annotate these documents, and thereby contribute to the linked "Web of Data". Yet, the process of annotating web documents remains an open challenge. While some efforts towards simplifying this process have been made in the recent years, there is still a lack of semantic content creation tools that integrate well with information worker toolsets. Towards this end, we introduce Doc2RDFa, an HTML rich text processor with the ability to automatically and manually annotate domain-specific Content

Elektronische Publikationen der Wirtschaftsuniversität Wien

Link Before You Share: Managing Privacy Policies through Blockchain

Author: Banerjee Agniva
Joshi Karuna Pande
Publication venue
Publication date: 15/10/2017
Field of study

With the advent of numerous online content providers, utilities and applications, each with their own specific version of privacy policies and its associated overhead, it is becoming increasingly difficult for concerned users to manage and track the confidential information that they share with the providers. Users consent to providers to gather and share their Personally Identifiable Information (PII). We have developed a novel framework to automatically track details about how a users' PII data is stored, used and shared by the provider. We have integrated our Data Privacy ontology with the properties of blockchain, to develop an automated access control and audit mechanism that enforces users' data privacy policies when sharing their data across third parties. We have also validated this framework by implementing a working system LinkShare. In this paper, we describe our framework on detail along with the LinkShare system. Our approach can be adopted by Big Data users to automatically apply their privacy policy on data operations and track the flow of that data across various stakeholders.Comment: 10 pages, 6 figures, Published in: 4th International Workshop on Privacy and Security of Big Data (PSBD 2017) in conjunction with 2017 IEEE International Conference on Big Data (IEEE BigData 2017) December 14, 2017, Boston, MA, US

arXiv.org e-Print Archive

Crossref

Recommended from our members

A conceptual model for semantically-based e-government portals

Author: Cabral Liliana
Domingue John
Gugliotta Alessio
Roberto Vito
Publication venue
Publication date: 01/01/2005
Field of study

Issues of semantic interoperability and service integration for e-government portals are the domain of interest of the present paper. We propose a Conceptual Model for One-Stop e-Government Portals based on the Semantic Web Service technology. We describe our research into building the three basic ontologies and their integration with standard ontologies. The result is a project-independent reusable model. At the same time, we outline a simple methodology for applying the proposed conceptual model into a specific scenario

Open Research Online (The Open University)

Recommended from our members

Extracting and re-using research data from chemistry e-theses: the SPECTRa-T project

Author: Downing Jim
Harvey Matt
Morgan Peter
Murray-Rust Peter
Rzepa Henry S
Stewart Diana
Tonge Alan
Townsend Joseph A
Publication venue: 11th International Symposium on Electronic Theses and Dissertations
Publication date: 01/06/2008
Field of study

Scientific e-theses are data-rich resources, but much of the information they contain is not readily accessible. For chemistry, the SPECTRa-T project has addressed this problem by developing data-mining techniques to extract experimental data, creating RDF (Resource Description Framework) triples for exposure to sophisticated Semantic Web searches. We used OSCAR3, an Open Source chemistry text-mining tool, to parse and extract data from theses in PDF, and from theses in Office Open XML document format. Theses in PDF suffered data corruption and a loss of formatting that prevented the identification of chemical objects. Theses in .docx yielded semantically rich SciXML that enabled the additional extraction of associated data. Chemical objects were placed in a data repository, and RDF triples deposited in a triplestore. Data-mining from chemistry e-theses is both desirable and feasible; but the use of PDF, the de facto format standard for deposit in most repositories, prevents the optimal extraction of data for semantic querying. In order to facilitate this, we recommend that universities also require deposition of chemistry e-theses in an XML document format. Further work is required to clarify the complex IPR issues and ensure that they do not become an unwarranted barrier to data extraction and re-use

Apollo (Cambridge)

Towards a service-oriented e-infrastructure for multidisciplinary environmental research

Author: Athanasiadis I.N.
Beulens A.J.M.
Kassahun A.
Krause A.
Makowski M.
Rizzoli A.E.
Scholten H.
Publication venue: iEMSs
Publication date: 01/07/2010
Field of study

Research e-infrastructures are considered to have generic and thematic parts. The generic part provids high-speed networks, grid (large-scale distributed computing) and database systems (digital repositories and data transfer systems) applicable to all research commnities irrespective of discipline. Thematic parts are specific deployments of e-infrastructures to support diverse virtual research communities. The needs of a virtual community of multidisciplinary envronmental researchers are yet to be investigated. We envisage and argue for an e-infrastructure that will enable environmental researchers to develop environmental models and software entirely out of existing components through loose coupling of diverse digital resources based on the service-oriented achitecture. We discuss four specific aspects for consideration for a future e-infrastructure: 1) provision of digital resources (data, models & tools) as web services, 2) dealing with stateless and non-transactional nature of web services using workflow management systems, 3) enabling web servce discovery, composition and orchestration through semantic registries, and 4) creating synergy with existing grid infrastructures

Wageningen University & Research Publications

International Institute for Applied Systems Analysis (IIASA)

A framework for deriving semantic web services

Author: A. Sheth
Antonio Merico
B. Medjahed
C. Partridge
D. Fensel
D. Martin
David Bell
F. Curbera
J. Blythe
J. Cardoso
J. Hendler
J. Nunamaker
K. Sycara
M. Burstein
M. Paolucci
Mark Lycett
Nicola Iacovelli
P. Fremantle
S. A. McIlraith
S. Kleijnen
S. March
S. Staab
S. Vinoski
Sergio de Cesare
T. Berners-Lee
T. Honderich
T. R. Gruber
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 19/12/2006
Field of study

Web service-based development represents an emerging approach for the development of distributed information systems. Web services have been mainly applied by software practitioners as a means to modularize system functionality that can be offered across a network (e.g., intranet and/or the Internet). Although web services have been predominantly developed as a technical solution for integrating software systems, there is a more business-oriented aspect that developers and enterprises need to deal with in order to benefit from the full potential of web services in an electronic market. This ‘ignored’ aspect is the representation of the semantics underlying the services themselves as well as the ‘things’ that the services manage. Currently languages like the Web Services Description Language (WSDL) provide the syntactic means to describe web services, but lack in providing a semantic underpinning. In order to harvest all the benefits of web services technology, a framework has been developed for deriving business semantics from syntactic descriptions of web services. The benefits of such a framework are two-fold. Firstly, the framework provides a way to gradually construct domain ontologies from previously defined technical services. Secondly, the framework enables the migration of syntactically defined web services toward semantic web services. The study follows a design research approach which (1) identifies the problem area and its relevance from an industrial case study and previous research, (2) develops the framework as a design artifact and (3) evaluates the application of the framework through a relevant scenario

Crossref

Brunel University Research Archive