Search CORE

70 research outputs found

ELAN Audio Playback

Author: Russel A.
Trilsbeek P.
Publication venue
Publication date: 01/01/2004
Field of study

FLAT: A CLARIN-compatible repository solution based on Fedora Commons

Author: Trilsbeek P.
Windhouwer M.
Publication venue
Publication date: 21/11/2016
Field of study

This paper describes the development of a CLARIN-compatible repository solution that fulfils both the long-term preservation requirements as well as the current day discoverability and usability needs of an online data repository of language resources. The widely used Fedora Commons open source repository framework, combined with the Islandora discovery layer, forms the basis of the solution. On top of this existing solution, additional modules and tools are developed to make it suitable for the types of data and metadata that are used by the participating partners

MPG.PuRe

The Language Archiving Technology domain

Author: Koenig A.
Ringersma J.
Trilsbeek P.
Publication venue
Publication date: 06/11/2009
Field of study

The Max Planck Institute for Psycholinguistics (MPI) manages an archive of linguistic research data with a current size of almost 20 Terabytes. Apart from in-house researchers other projects also store their data in the archive, most notably the Documentation of Endangered Languages (DoBeS) projects. The archive is available online and can be accessed by anybody with Internet access. To be able to manage this large amount of data the MPI's technical group has developed a software suite called Language Archiving Technology (LAT) that on the one hand helps researchers and archive managers to manage the data and on the other hand helps users in enriching their primary data with additional layers. All the MPI software is Java-based and developed according to open source principles (GNU, 2007). All three major operating systems (Windows, Linux, MacOS) are supported and the software works similarly on all of them. As the archive is online, many of the tools, especially the ones for accessing the data, are browser based. Some of these browser-based tools make use of Adobe Flex to create nice-looking GUIs. The LAT suite is a complete set of management and enrichment tools, and given the interaction between the tools the result is a complete LAT software domain. Over the last 10 years, this domain has proven its functionality and use, and is being deployed to servers in other institutions. This deployment is an important step in getting the archived resources back to the members of the speech communities whose languages are documented. In the paper we give an overview of the tools of the LAT suite and we describe their functionality and role in the integrated process of archiving, management and enrichment of linguistic data

MPG.PuRe

New DoBeS web site: Contents & functions

Author: Skiba R.
Trilsbeek P.
Wittenburg F.
Publication venue
Publication date: 01/01/2004
Field of study

MPG.PuRe

The Language Archiving Technology domain

Author: Koenig A.
Ringersma J.
Trilsbeek P.
Publication venue
Publication date
Field of study

MPG.PuRe

Language documentation and digital humanities: The (DoBeS) Language Archive

Author: Broeder D.
Drude S.
Trilsbeek P.
Publication venue
Publication date
Field of study

MPG.PuRe

Best practices in the creation, archiving and dissemination of speech corpora at the Language Archive

Author: Broeder D.
Drude S.
Sloetjes H.
Trilsbeek P.
Publication venue
Publication date: 01/01/2014
Field of study

MPG.PuRe

FAIR Practices in Europe

Author: Baldauf C.
Lautenschlager M.
Thiemann H.
Trilsbeek P.
Wittenburg P.
Publication venue: 'MIT Press - Journals'
Publication date: 01/01/2020
Field of study

Institutions driving fundamental research at the cutting edge such as for example from the Max Planck Society (MPS) took steps to optimize data management and stewardship to be able to address new scientific questions. In this paper we selected three institutes from the MPS from the areas of humanities, environmental sciences and natural sciences as examples to indicate the efforts to integrate large amounts of data from collaborators worldwide to create a data space that is ready to be exploited to get new insights based on data intensive science methods. For this integration the typical challenges of fragmentation, bad quality and also social differences had to be overcome. In all three cases, well-managed repositories that are driven by the scientific needs and harmonization principles that have been agreed upon in the community were the core pillars. It is not surprising that these principles are very much aligned with what have now become the FAIR principles. The FAIR principles confirm the correctness of earlier decisions and their clear formulation identified the gaps which the projects need to address

MPG.PuRe

The Language Archive - a new hub for language resources

Author: Broeder D.
Drude S.
Trilsbeek P.
Wittenburg P.
Publication venue
Publication date
Field of study

MPG.PuRe

Language-sites: Accessing and presenting language resources via geographic information systems

Author: Dukers A.
Ringersma J.
Trilsbeek P.
Van Uytvanck D.
Publication venue
Publication date: 01/01/2008
Field of study

The emerging area of Geographic Information Systems (GIS) has proven to add an interesting dimension to many research projects. Within the language-sites initiative we have brought together a broad range of links to digital language corpora and resources. Via Google Earth's visually appealing 3D-interface users can spin the globe, zoom into an area they are interested in and access directly the relevant language resources. This paper focuses on several ways of relating the map and the online data (lexica, annotations, multimedia recordings, etc.). Furthermore, we discuss some of the implementation choices that have been made, including future challenges. In addition, we show how scholars (both linguists and anthropologists) are using GIS tools to fulfill their specific research needs by making use of practical examples. This illustrates how both scientists and the general public can benefit from geography-based access to digital language dat

MPG.PuRe