Doc2RDFa: Semantic Annotation for Web Documents
Since the inception of the World Wide Web, the amount of data published on it
has grown rapidly, to the point where the Web has become an important
source of both general and domain-specific information. However, the majority
of documents published online are not machine-readable by default. Many researchers
believe that the answer to this problem is to semantically annotate these
documents, and thereby contribute to the linked "Web of Data". Yet, the process
of annotating web documents remains an open challenge. While some efforts towards
simplifying this process have been made in recent years, there is still a
lack of semantic content creation tools that integrate well with information worker
toolsets. To this end, we introduce Doc2RDFa, an HTML rich text processor
with the ability to automatically and manually annotate domain-specific Content
Events Matter: Extraction of Events from Court Decisions
The analysis of court decisions and associated events is part of the daily life of many legal practitioners. Unfortunately, since court decision texts are often long and complex, putting all events relating to a case in order, so as to understand their connections and durations, is a time-consuming task. Automated court decision timeline generation could provide a visual overview of what happened throughout a case by representing the main legal events, together with relevant temporal information. However, tools and technologies for extracting events from court decisions are still underdeveloped. To this end, in this paper we compare the effectiveness of three different extraction mechanisms, namely deep learning, conditional random fields, and a rule-based method, to facilitate automated extraction of events and their components (i.e., the event type, who was involved, and when it happened). In addition, we provide a corpus of manually annotated decisions of the European Court of Human Rights, which shall serve as a gold standard not only for our own evaluation, but also for the research community for comparison and further experiments.
The linked legal data landscape: linking legal data across different countries
The European Union is working towards harmonizing legislation across Europe in order to improve the cross-border interchange of legal information. This goal is supported, for instance, by standards such as the European Law Identifier (ELI) and the European Case Law Identifier (ECLI), which provide technical specifications for Web identifiers and suggestions for vocabularies to be used to describe metadata pertaining to legal documents in a machine-readable format. Notably, the ECLI and ELI metadata standards adhere to the RDF data format, which forms the basis of Linked Data, and therefore have the potential to form a basis for a pan-European legal knowledge graph. Unfortunately, to date these specifications have only been partially adopted by EU member states. In this paper we describe a methodology for transforming the existing legal information system used in Austria into such a legal knowledge graph, covering the different steps from modeling nation-specific aspects, to population, and finally to the integration of legal data from other countries through Linked Data. We demonstrate the usefulness of this approach by exemplifying practical use cases from legal information search that have so far not been possible in an automated fashion.