41 research outputs found
Citing on-line language resources
Although the possibility of referring or citing on-line data from publications is seen at least theoretically as an important means to provide immediate testable proof or simple illustration of a line of reasoning, the practice has not been wide-spread yet and no extensive experience has been gained about the possibilities and problems of referring to raw data-sets. This paper makes a case to investigate the possibility and need of persistent data visualization services that facilitate the inspection and evaluation of the cited data
Standardizing a component metadata infrastructure
This paper describes the status of the standardization efforts of a Component Metadata approach for describing Language Resources with metadata. Different linguistic and Language & Technology communities as CLARIN, META-SHARE and NaLiDa use this component approach and see its standardization of as a matter for cooperation that has the possibility to create a large interoperable domain of joint metadata. Starting with an overview of the component metadata approach together with the related semantic interoperability tools and services as the ISOcat data category registry and the relation registry we explain the standardization plan and efforts for component metadata within ISO TC37/SC4. Finally, we present information about uptake and plans of the use of component metadata within the three mentioned linguistic and L&T communities
Language-sites: Accessing and presenting language resources via geographic information systems
The emerging area of Geographic Information Systems (GIS) has proven to add an interesting dimension to many research projects. Within the language-sites initiative we have brought together a broad range of links to digital language corpora and resources. Via Google Earth's visually appealing 3D-interface users can spin the globe, zoom into an area they are interested in and access directly the relevant language resources. This paper focuses on several ways of relating the map and the online data (lexica, annotations, multimedia recordings, etc.). Furthermore, we discuss some of the implementation choices that have been made, including future challenges. In addition, we show how scholars (both linguists and anthropologists) are using GIS tools to fulfill their specific research needs by making use of practical examples. This illustrates how both scientists and the general public can benefit from geography-based access to digital language dat
Virtual language observatory: The portal to the language resources and technology universe
Over the years, the field of Language Resources and Technology (LRT) hasdeveloped a tremendous amount of resources and tools. However, there is noready-to-use map that researchers could use to gain a good overview andsteadfast orientation when searching for, say corpora or software tools tosupport their studies. It is rather the case that information is scatteredacross project- or organisation-specific sites, which makes it hard if notimpossible for less-experienced researchers to gather all relevant material.Clearly, the provision of metadata is central to resource and softwareexploration. However, in the LRT field, metadata comes in many forms, tastesand qualities, and therefore substantial harmonization and curation efforts arerequired to provide researchers with metadata-based guidance. To address thisissue a broad alliance of LRT providers (CLARIN, the Linguist List, DOBES,DELAMAN, DFKI, ELRA) have initiated the Virtual Language Observatory portal toprovide a low-barrier, easy-to-follow entry point to language resources andtools; it can be accessed via http://www.clarin.eu/vl
Semantic metadata mapping in practice: The Virtual Language Observatory
In this paper we present the Virtual Language Observatory (VLO), a metadata-based portal for language resources. It is completely based on the Component Metadata (CMDI) and ISOcat standards. This approach allows for the use of heterogeneous metadata schemas while maintaining the semantic compatibility. We describe the metadata harvesting process, based on OAI-PMH, and the conversion from several formats (OLAC, IMDI and the CLARIN LRT inventory) to their CMDI counterpart profiles. Then we focus on some post-processing steps to polish the harvested records. Next, the ingestion of the CMDI files into the VLO facet browser is described. We also include an overview of the changes since the first version of the VLO, based on user feedback from the CLARIN community. Finally there is an overview of additional ideas and improvements for future versions of the VLO
Impacts of Land Abandonment on Vegetation: Successional Pathways in European Habitats
Changes in traditional agricultural systems in Europe in recent decades have led to widespread abandonment and colonization of various habitats by shrubs and trees. We combined several vegetation databases to test whether patterns of changes in plant diversity after land abandonment in different habitats followed similar pathways. The impacts of land abandonment and subsequent woody colonization on vegetation composition and plant traits were studied in five semi-natural open habitats and two arable habitats in six regions of Europe. For each habitat, vegetation surveys were carried out in different stages of succession using either permanent or non-permanent plots. Consecutive stages of succession were defined on a physiognomic basis from initial open stages to late woody stages. Changes in vegetation composition, species richness, numbers of species on Red Lists, plant strategy types, Ellenberg indicator values of the vegetation, Grime CSR strategy types and seven ecological traits were assessed for each stage of the successional pathway. Abandonment of agro-pastoral land-use and subsequent woody colonization were associated with changes in floristic composition. Plant richness varied according to the different habitats and stages of succession, but semi-natural habitats differed from arable fields in several ecological traits and vegetation responses. Nevertheless, succession occurred along broadly predictable pathways. Vegetation in abandoned arable fields was characterized by a decreasing importance of R-strategists, annuals, seed plants with overwintering green leaves, insect-pollinated plants with hemi-rosette morphology and plants thriving in nutrient-rich conditions, but an increase in species considered as endangered according to the Red Lists. Conversely, changes in plant traits with succession within the initially-open semi-natural habitats showed an increase in plants thriving in nutrient-rich conditions, stress-tolerant plants and plants with sexual and vegetative reproduction, but a sharp decrease in protected species. In conclusion, our study showed a set of similarities in responses of the vegetation in plant traits after land abandonment, but we also highlighted differences between arable fields and semi-natural habitats, emphasizing the importance of land-use legacy