Search CORE

184 research outputs found

Digital Parliamentary Data in Action (DiPaDA 2022): Introduction

Author: Hyvönen Eero
La Mela Matti
Norén Fredrik
Publication venue: CEUR
Publication date: 01/01/2022
Field of study

Peer reviewe

Publikationer från Umeå universitet

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Helsingin yliopiston digitaalinen arkisto

Analysis of legal networks

Author: Boer A.
Trompper M.
Winkels R.
Publication venue
Publication date: 01/01/2016
Field of study

This report describes the main electronically available sources of law in the three target countries of the Openlaws.eu project: Austria, the Netherlands and the United Kingdom, plus those of the EU. It describes their strengths and weaknesses in terms of available data, formats and licensing. Since the world is dynamic, especially that of electronic data, the document was originally set up as a set of spreadsheets and a web site that is easier to maintain and update. This deliverable contains a snapshot of the status of these documents at the end of December 2014

ZENODO

UvA-DARE

PTPARL-D: Annotated Corpus of 44 years of Portuguese Parliament debates

Author: Almeida Paulo
Gonçalves-Sá Joana
Marques-Pita Manuel
Publication venue
Publication date: 26/04/2020
Field of study

In a representative democracy, some decide in the name of the rest, and these elected officials are commonly gathered in public assemblies, such as parliaments, where they discuss policies, legislate, and vote on fundamental initiatives. A core aspect of such democratic processes are the plenary debates, where important public discussions take place. Many parliaments around the world are increasingly keeping the transcripts of such debates, and other parliamentary data, in digital formats accessible to the public, increasing transparency and accountability. Furthermore, some parliaments are bringing old paper transcripts to semi-structured digital formats. However, these records are often only provided as raw text or even as images, with little to no annotation, and inconsistent formats, making them difficult to analyze and study, reducing both transparency and public reach. Here, we present PTPARL-D, an annotated corpus of debates in the Portuguese Parliament, from 1976 to 2019, covering the entire period of Portuguese democracy

arXiv.org e-Print Archive

The ParlaMint corpora of parliamentary proceedings

Author: Agnoloni Tommaso
Barkarson Starkaður
Coole Matthew
Darǵis Roberts
de Does Jesse
de Macedo Luciana D.
Depuydt Katrien
Erjavec Tomaž
Fišer Darja
Kopp Matyáš
Krilavičius Tomas
Ljubešić Nikola
Luxardo Giancarlo
Marx Maarten
Morkevičius Vaidas
Navarretta Costanza
Ogrodniczuk Maciej
Osenova Petya
Pančur Andrej
Pérez María Calzada
Rayson Paul
Ring Orsolya
Rudolf Michał
Simov Kiril
Steingrímsson Steinþór
van Heusden Ruben
Venturi Giulia
Çöltekin Çağrı
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022
Field of study

This paper presents the ParlaMint corpora containing transcriptions of the sessions of the 17 European national parliaments with half a billion words. The corpora are uniformly encoded, contain rich meta-data about 11 thousand speakers, and are linguistically annotated following the Universal Dependencies formalism and with named entities. Samples of the corpora and conversion scripts are available from the project’s GitHub repository, and the complete corpora are openly available via the CLARIN.SI repository for download, as well as through the NoSketch Engine and KonText concordancers and the Parlameter interface for on-line exploration and analysis

Copenhagen University Research Information System

Repositori Institucional de la Universitat Jaume I

Lancaster E-Prints

Finnish Parliament on the Semantic Web: Using ParliamentSampo Data Service and Semantic Portal for Studying Political Culture and Language

Author: Drobac Senka
Elo Kimmo
Hyvönen Eero
Ikkala Esko
Kesäniemi Joonas
Koho Mikko
La Mela Matti
Leal Rafael
Leskinen Petri
Sinikallio Laura
Tamper Minna
Tuominen Jouni
Publication venue: CEUR
Publication date: 02/05/2022
Field of study

Peer reviewe

Helsingin yliopiston digitaalinen arkisto

Finnish Parliament on the Semantic Web: Using ParliamentSampo Data Service and Semantic Portal for Studying Political Culture and Language

Author: Drobac Senka
Elo Kimmo
Hyvönen Eero
Ikkala Esko
Kesäniemi Joonas
Koho Mikko
La Mela Matti
Leal Rafael
Leskinen Petri
Sinikallio Laura
Tamper Minna
Tuominen Jouni
Publication venue: CEUR-WS.org
Publication date: 01/01/2022
Field of study

Peer reviewe

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Helsingin yliopiston digitaalinen arkisto

The Parla-CLARIN Recommendations for Encoding Corpora of Parliamentary Proceedings

Author: Erjavec Tomaž
Pančur Andrej
Publication venue: Journal of the Text Encoding Initiative
Publication date: 26/09/2022
Field of study

Parliamentary proceedings are a rich source of data that can be used by scholars in various humanities and social sciences disciplines. Unlike the sources of most other language corpora, parliamentary proceedings are not subject to copyright or personal privacy protections, and are typically available online, thus making them ideal for compilation into corpora and for open distribution. For these reasons many countries have already produced corpora of parliamentary proceedings, but each typically in their own encoding, limiting their comparability and utilization in a multilingual setting. In this paper we propose an encoding schema which could serve as an interchange format for parliamentary corpora compiled for the purposes of scholarly investigations. The schema, called Parla-CLARIN, was developed within the CLARIN research infrastructure, and is written as a TEI ODD which includes a TEI customization and prose guidelines with examples of use. We discuss the coverage and choices made in designing the recommendations, and give an overview of the guidelines. We also discuss two other standard schemas for encoding parliamentary data, Akoma Ntoso and RDF, and their relation to Parla-CLARIN. We conclude by presenting corpora already encoded in Parla-CLARIN and discussing further work, especially the provision of a set of example documents and of transformation scripts that would make the proposed encoding more usable

OpenEdition

CLARIN

Author
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 30/01/2023
Field of study

The book provides a comprehensive overview of the Common Language Resources and Technology Infrastructure – CLARIN – for the humanities. It covers a broad range of CLARIN language resources and services, its underlying technological infrastructure, the achievements of national consortia, and challenges that CLARIN will tackle in the future. The book is published 10 years after establishing CLARIN as an Europ. Research Infrastructure Consortium