
22 research outputs found

Metadata for Big Data: A preliminary investigation of metadata quality issues in research data repositories

Author
Publication venue: 'IOS Press'
Publication date
Field of study

Enhancing Access to Legal Data through Ontology-based Representation: A Case Study with Brazilian Judicial Appeals

Author: Almeida João Paulo A.
Bourguet Jean Rémi
Costa Melissa Zorzanelli
Guizzardi Giancarlo
Vieira Thiago Baiense Peçanha
Publication venue
Publication date: 01/01/2023
Field of study

In Brazil, legal requirements for public information access, as mandated by Law No. 12.527/2011, have amplified the role of open data portals in disseminating data of collective and general interest. Despite legal provisions, there are persistent difficulties in presenting data in first-class semantic formats, which ultimately creates obstacles for digital citizens to fully exercise their newfound rights to information access. These obstacles can be addressed by building semantic data warehouses to enhance the use of open data through computational ontologies. In this paper, we demonstrate the use of a well-founded legal ontology for representing data from legal decisions extracted from a Brazilian judicial organ website. We focused our approach on a specific type of appeal in the Brazilian legal system, the Request for Standardization (RS) of interpretation of federal law, which seeks to standardize the understanding of the Appeals Panels of Federal Special Courts. Employing web scraping techniques, we built a complete ETL (Extract, Transform, Load) process to triplify data on RS appeals and their rulings. We used a gUFO-based OWL rendering of a previously developed OntoUML ontology (called OntoRS) to transform the extracted data into a suitable RDF format and populate a Virtuoso triple store. Thus, the OntoRS ontology allowed us to perform SPARQL queries to obtain new insights, metrics and small RDF graphs.
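The "triplify" step described in this abstract can be illustrated with a minimal sketch. It assumes a scraped appeal arrives as a flat Python dictionary and serializes it as N-Triples lines. The namespace `http://example.org/rs/`, the `Appeal` class and the property names are hypothetical placeholders, not terms from OntoRS or gUFO, and the paper's actual pipeline may work quite differently.

```python
# Hypothetical namespace; NOT taken from OntoRS or gUFO.
BASE = "http://example.org/rs/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def escape_literal(value: str) -> str:
    """Escape backslashes, quotes and line breaks for an N-Triples literal."""
    return (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def triplify(record: dict) -> list[str]:
    """Turn one extracted appeal record into a list of N-Triples lines."""
    subject = f"<{BASE}appeal/{record['id']}>"
    # First triple types the resource with a (hypothetical) Appeal class.
    triples = [f"{subject} <{RDF_TYPE}> <{BASE}Appeal> ."]
    for key, value in record.items():
        if key == "id":
            continue  # the id is already encoded in the subject IRI
        literal = escape_literal(str(value))
        triples.append(f'{subject} <{BASE}{key}> "{literal}" .')
    return triples


# Illustrative record; the field names are assumptions.
record = {"id": "0001", "ruling": "granted", "year": 2020}
for line in triplify(record):
    print(line)
```

Lines produced this way could then be bulk-loaded into a triple store and queried with SPARQL. A production pipeline would more likely use an RDF library and typed literals.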

University of Twente Research Information

Bibliographic records from catalogue cards to the semantic web: conditions for functionality [Bibliografski zapisi od kataložnih listića do semantičkog weba : uvjeti za funkcionalnost]

Author: Ivan Miočić
Publication venue: 'Croatian Library Association'
Publication date: 01/01/2017
Field of study

Purpose. The paper gives a brief overview of the nature of bibliographic records throughout history, from records written by hand or typed on cards to those created in the semantic web environment. Approach. Based on a literature review, insight is gained into the capabilities that records had in the era of catalogue cards and those they have in the current electronic era, in whose contemporary phase their greater potential can come to the fore. Findings. At the intersection of the literature review and a consideration of current international examples of good practice, the paper notes the need to abandon traditional cataloguing methods, modernize cataloguing rules, develop a unified metadata infrastructure, and establish institutional cooperation in the form of mutual metadata exchange. It also notes the need to synthesize library catalogues and commercial web search engines in order to build a relevant information universe. Originality/value. The paper considers practical ways to improve the quality and availability of bibliographic data in order to generate new services that would make libraries more visible in the modern networked society. Accordingly, it proposes possibilities for developing applications based on bibliographic data, which would promote this segment of library work more successfully.

HRČAK - Portal of Croatian Scientific and Professional Journals

Hrčak - Portal of scientific journals of Croatia

Recommended from our members

OptiqueVQS: A visual query system over ontologies for industry

Author: Arenas
Bevan
Bobed
Brunetti
Calvanese
Catarci
Civili
Cuenca Grau
Damljanovic
Dey
Epstein
Giese
Glimm
Hogan
Kaufmann
Khalili
Kharlamov
Kogalovsky
López
Marchionini
Nikolaou
Poggi
Qiu
Romero
Schraefel
Shneiderman
Skjæveland
Soylu
Soylu
Spanos
Sutcliffe
ter Hofstede
Tran
Uren
Vega-Gorgojo
Publication venue: 'IOS Press'
Publication date: 01/01/2017
Field of study

An important application of semantic technologies in industry has been the formalisation of information models using OWL 2 ontologies and the use of RDF for storing and exchanging application data. Moreover, legacy data can be virtualised as RDF using ontologies following the ontology-based data access (OBDA) approach. In all these applications, it is important to provide domain experts with query formulation tools for expressing their information needs in terms of queries over ontologies. In this work, we present such a tool, OptiqueVQS, which is designed based on our experience with OBDA applications in Statoil and Siemens and on best HCI practices for interdisciplinary engineering environments. OptiqueVQS implements a number of unique techniques distinguishing it from analogous query formulation systems. In particular, it exploits ontology projection techniques to enable graph-based navigation over an ontology during query construction. Secondly, while OptiqueVQS is primarily ontology driven, it exploits sampled data to enhance selection of data values for some data attributes. Finally, OptiqueVQS is built on well-grounded requirements, design rationale, and quality attributes. We evaluated OptiqueVQS with both domain experts and casual users and qualitatively compared our system against prominent visual systems for ontology-driven query formulation and exploration of semantic data. OptiqueVQS is available online and can be downloaded together with an example OBDA scenario

City Research Online

Crossref

NORA - Norwegian Open Research Archives

Measuring metadata quality

Author: Király Péter
Publication venue
Publication date: 24/06/2019
Field of study

Georg-August-University Göttingen

Content Enrichment of Digital Libraries: Methods, Technologies and Implementations

Author: Hajra Arben
Publication venue
Publication date: 01/01/2020
Field of study

Parallel to the establishment of the concept of a "digital library", there have been rapid developments in the fields of semantic technologies, information retrieval and artificial intelligence. The idea is to make use of these three fields to crosslink bibliographic data, i.e., library content, and to enrich it "intelligently" with additional, especially non-library, information. By linking the contents of a library, it is possible to offer users access to semantically similar contents of different digital libraries. For instance, a list of semantically similar publications from completely different subject areas and from different digital libraries can be made accessible. In addition, the user is able to see a wider profile about authors, enriched with information such as biographical details, name alternatives, images, job titles, institute affiliations, etc. This information comes from a wide variety of sources, most of which are not library sources. In order to make such scenarios a reality, this dissertation follows two approaches. The first approach is about crosslinking digital library content in order to offer semantically similar publications based on additional information for a publication. Hence, this approach uses publication-related metadata as a basis. The aligned terms between linked open data repositories/thesauri are considered an important starting point, taking into account narrower, broader and related concepts through semantic data models such as SKOS. Information retrieval methods are applied to identify publications with high semantic similarity. For this purpose, approaches of vector space models and "word embedding" are applied and analyzed comparatively. The analyses are performed in digital libraries with different thematic focuses (e.g. economy and agriculture). Using machine learning techniques, metadata is enriched, e.g. with synonyms for content keywords, in order to further improve similarity calculations.
To ensure quality, the proposed approaches will be analyzed comparatively with different metadata sets, which will be assessed by experts. Through the combination of different information retrieval methods, the quality of the results can be further improved. This is especially true when user interactions offer possibilities for adjusting the search properties. In the second approach, which this dissertation pursues, author-related data are harvested in order to generate a comprehensive author profile for a digital library. For this purpose, non-library sources, such as linked data repositories (e.g. WIKIDATA), and library sources, such as authority data, are used. If such different sources are used, the disambiguation of author names via the use of already existing persistent identifiers becomes necessary. To this end, we offer an algorithmic approach to disambiguate authors, which makes use of authority data such as the Virtual International Authority File (VIAF). With respect to computer science, the methodological value of this dissertation lies in the combination of semantic technologies with methods of information retrieval and artificial intelligence to increase the interoperability between digital libraries and between libraries and non-library sources. By positioning this dissertation as an application-oriented contribution to improve interoperability, two major contributions are made in the context of digital libraries: (1) The retrieval of information from different digital libraries can be made possible via a single access point. (2) Existing information about authors is collected from different sources and aggregated into one author profile.
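The vector space approach mentioned in this abstract can be illustrated with a minimal sketch: each publication is reduced to a bag of keywords, and two publications are compared by the cosine of the angle between their term-count vectors. The function names and sample keywords below are illustrative assumptions. The dissertation's actual weighting scheme, thesaurus alignment and word embeddings are not reproduced here.

```python
import math
from collections import Counter


def term_vector(keywords: list[str]) -> Counter:
    """Build a term-count vector from a publication's keywords (case-folded)."""
    return Counter(k.lower() for k in keywords)


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity of two sparse term-count vectors, in [0, 1]."""
    # Only terms present in both vectors contribute to the dot product.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty keyword list is similar to nothing
    return dot / (norm_a * norm_b)


# Hypothetical keyword sets for two publications.
pub_a = term_vector(["trade", "policy"])
pub_b = term_vector(["Trade", "tariff"])
print(cosine_similarity(pub_a, pub_b))
```

Enrichment with synonyms or SKOS broader/narrower concepts, as the abstract describes, would amount to adding further terms to each vector before comparison.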

MACAU: Open Access Repository of Kiel University

Modular structure of web-based decision support systems for integrated pest management. A review

Author: A Abdullah
A Johnen
A Khaliq
A Maanen Van
A McBratney
A Perini
A Rehman
AB Deraman
AB Hearn
AEA Stephens
AJ Haverkort
AK Bregt
AW Ferguson
B Hannon
B Kleinhenz
BA Croft
BR Barlet
BS Nietschke
Cross
D Dent
D Plénet
D Tilman
Damos
DC Kontodimas
DH Gent
DM Cairns
E Bouma
E Jorg
E Lichtfouse
F Zalom
FJ Orellana
FK Evert Van
FW Nutter
G Bourgeois
GA Carlson
GA Norton
GF Backoulou
GN Agrios
GP Fitt
H Amano
HS Saini
HW Beck
IB Waheed
J Andrade-Piedra
J Frahm
J Samietz
J Sun
J Youen
JA Logan
JD Knight
JF Strand
JH Brown
JS Bery
K Kos
K Phelps
KA Garret
KP Pruess
L Cowell
L Willocquet
L Xinxing
LE Ehler
LG Higley
LK Branting
M d’Aquin
M Lefebvre
M Sporelder
M Trnka
MA Altieri
ML Flint
MM Taye
MP Bange
N Coulson
N McRoberts
NC Elliott
NC Nguyen
P Damos
P Kamalak
P Rousse
PA Magarey
PB Goodell
Petros Damos
PJH Sharpe
PS Teng
R Cheke
R Don Wauchope
R Kaundal
R Philomine
R Prasad
RD Magarey
RE Plant
RM Strickland
RN Coulson
RW Sutherst
S Chakrabarti
S Kohle
S Zhang
SD Beck
SH Liao
SJ Yelapure
The Plant Ontology Consortium
TL Wagner
TTN Chi
VI Pajunen
VM Stern
VP Jonew
W Pavan
W Zhang
WD Batchelor
WI Bajwa
WJ Lewis
X Xu
Y Huang
Y Zheng
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

User Interfaces to the Web of Data based on Natural Language Generation

Author: Ell Basil
Publication venue: KIT Scientific Publishing, Karlsruhe
Publication date: 01/01/2017
Field of study

We explore how Virtual Research Environments based on Semantic Web technologies support research interactions with RDF data in various stages of corpus-based analysis, analyze the Web of Data in terms of human readability, derive labels from variables in SPARQL queries, apply Natural Language Generation to improve user interfaces to the Web of Data by verbalizing SPARQL queries and RDF graphs, and present a method to automatically induce RDF graph verbalization templates via distant supervision

KITopen

Directory of Open Access Books (DOAB)

Artificial intelligence for ocean science data integration: current state, gaps, and way forward

Author: Bar Koby
Lehahn Yoav
Sagi Tomer
Publication venue: 'University of California Press'
Publication date: 15/05/2020
Field of study

VBN