873 research outputs found

    Creating Lexical Resources in TEI P5 : a Schema for Multi-purpose Digital Dictionaries

    Get PDF
    Although most of the relevant dictionary productions of the recent past have relied on digital data and methods, there is little consensus on formats and standards. The Institute for Corpus Linguistics and Text Technology (ICLTT) of the Austrian Academy of Sciences has been conducting a number of varied lexicographic projects, both digitising print dictionaries and working on the creation of genuinely digital lexicographic data. This data was designed to serve varying purposes: machine-readability was only one. A second goal was interoperability with digital NLP tools. To achieve this end, a uniform encoding system applicable across all the projects was developed. The paper describes the constraints imposed on the content models of the various elements of the TEI dictionary module and provides arguments in favour of TEI P5 as an encoding system not only being used to represent digitised print dictionaries but also for NLP purposes

    Design and implementation of a research infrastructure for a corpus of spoken ELF

    Get PDF
    Die vorliegende Arbeit behandelt den Aufbau und die informationstechnische Umsetzung einer sprachwissenschaftlichen Ressource zur Erforschung von Englisch als Lingua Franca. In den einzelnen Kapiteln werden verschiedene Schwerpunkte gesetzt, die ein breites Spektrum von konzeptuellen und theoretischen Überlegungen bis hin zu Aspekten der direkten Umsetzung abdeckt. Die Hauptschwerpunkte der Arbeit liegen in einer grundlegenden Betrachtung sprach- wissenschaftlicher Annotationssysteme (siehe 2. Annotation p.4), der Beschreibung der formalen Struktur und der Entstehung des VOICE Transkriptionssystems, eines Überblicks über zusätzlich erforderliche Daten (d.h. Meta-Daten) bis hin zu einer Diskussion des verwendeten Dokumentenformates. In der Arbeit wird in den einzelnen Teilbereichen immer wieder auf das Spannungsfeld zwischen dem wissenschaftlich Wünschenswerten und dem informationstechnisch Mach- baren hingewiesen. Das beschriebene Transkriptionssystem vereint kognitive und formale Aspekte in einer Auszeichnungssprache um ein geeignetes Eingabeformat bereit zu stellen. Die darauf aufbauenden Formate, wie zum Beispiel das Korpusformat in XML, sind in dieser Arbeit konzeptionell und in wichtigen Implementationsdetails dokumentiert. In diesem Sinne ist der Text auch als Dokumentation für eine technische Annäherung an die Korpusresource VOICE zu lesen

    Creating Lexical Resources in TEI P5

    Get PDF
    Although most of the relevant dictionary productions of the recent past have relied on digital data and methods, there is little consensus on formats and standards. The Institute for Corpus Linguistics and Text Technology (ICLTT) of the Austrian Academy of Sciences has been conducting a number of varied lexicographic projects, both digitising print dictionaries and working on the creation of genuinely digital lexicographic data. This data was designed to serve varying purposes: machine-readability was only one. A second goal was interoperability with digital NLP tools. To achieve this end, a uniform encoding system applicable across all the projects was developed. The paper describes the constraints imposed on the content models of the various elements of the TEI dictionary module and provides arguments in favour of TEI P5 as an encoding system not only being used to represent digitised print dictionaries but also for NLP purposes

    Stellar over-densities in the halo: the extent of the Virgo over-density

    Full text link
    We map the three dimensional extent of the Virgo Over-density by combining distance information from RR Lyrae variables and projected spatial information from SEKBO (Keller et al. 2008) and Sloan Digital Sky Survey (SDSS) DR6 photometry. The Virgo Over-density is seen to comprise two filaments 14.5 x 3 degrees and 10 x 3 degrees and a circular structure 3 degrees in diameter. Together the three features span 38 degrees of right ascension and declinations of +2 to -15 degrees. RR Lyrae variables place the two filamentary features at heliocentric distances of 20 and 17 kpc respectively, with projected dimensions of 5 x 1 kpc and 3 x 1 kpc.Comment: 6 pages, 5 figures, MNRAS accepte

    Machine learning coarse-grained potentials of protein thermodynamics

    Get PDF
    A generalized understanding of protein dynamics is an unsolved scientific problem, the solution of which is critical to the interpretation of the structure-function relationships that govern essential biological processes. Here, we approach this problem by constructing coarse-grained molecular potentials based on artificial neural networks and grounded in statistical mechanics. For training, we build a unique dataset of unbiased all-atom molecular dynamics simulations of approximately 9 ms for twelve different proteins with multiple secondary structure arrangements. The coarse-grained models are capable of accelerating the dynamics by more than three orders of magnitude while preserving the thermodynamics of the systems. Coarse-grained simulations identify relevant structural states in the ensemble with comparable energetics to the all-atom systems. Furthermore, we show that a single coarse-grained potential can integrate all twelve proteins and can capture experimental structural features of mutated proteins. These results indicate that machine learning coarse-grained potentials could provide a feasible approach to simulate and understand protein dynamics

    The Chemistry of the Trailing arm of the Sagittarius Dwarf Galaxy

    Full text link
    We present abundances of C, O, Ti, and Fe for eleven M-giant stars in the trailing tidal arm of the Sagittarius dwarf (Sgr). The abundances were derived by comparing synthetic spectra with high-resolution infrared spectra obtained with the Phoenix spectrograph on the Gemini South telescope. The targeted stars are drawn from two regions of the Sgr trailing arm separated by 66 degrees (5 stars) and 132 degrees (6 stars) from the main body of Sgr. The trailing arm provides a more direct diagnostic of the chemical evolution of Sgr compared to the extensively phase-mixed leading arm. Within our restricted sample of ~2-3 Gyr old stars, we find that the stream material exhibits a significant metallicity gradient of -(2.4\pm0.3)x10^{-3} dex / degree (-(9.4\pm1.1)x10^{-4} dex / kpc) away from the main body of Sgr. The tidal disruption of Sgr is a relatively recently event. We therefore interpret the presence of a metallicity gradient in the debris as indicative of a similar gradient in the progenitor. The fact that such a metallicity gradient survived for almost a Hubble time indicates that the efficiency of radial mixing was very low in the Sgr progenitor. No significant gradient is seen to exist in the [alpha/Fe] abundance ratio along the trailing arm. Our results may be accounted for by a radial decrease in star formation efficiency and/or radial increase in the efficiency of galactic wind-driven metal loss in the chemical evolution of the Sgr progenitor. The [Ti/Fe] and [O/Fe] abundance ratios observed within the stream are distinct from those of the Galactic halo. We conclude that the fraction of the intermediate to metal-rich halo population contributed by the recent dissolution (<3 Gyr) of Sgr-like dwarf galaxies can not be substantial.Comment: 22 pages, 7 figures, ApJ accepte

    Kinematics & Chemistry of Halo Substructures: The Vicinity of the Virgo Over-Density

    Get PDF
    We present observations obtained with the AAT's 2dF wide field spectrograph AAOmega of K-type stars located within a region of the sky which contains the Virgo Over-Density and the leading arm of the Sagittarius Stream. On the basis of the resulting velocity histogram we isolate halo substructures in these overlapping regions including Sagittarius and previously discovered Virgo groups. Through comparisons with N-body models of the Galaxy-Sagittarius interaction, we find a tri-axial dark matter halo is favoured and we exclude a prolate shape. This result is contradictory with other observations along the Sagittarius leading arm, which typically favour prolate models. We have also uncovered K-giant members of Sagittarius that are notably more metal poor ([Fe/H] = -1.7 +/- 0.3 dex) than previous studies. This suggests a significantly wider metallicity distribution exists in the Sagittarius Stream than formerly considered. We also present data on five carbon stars which were discovered in our sample.Comment: accepted to A

    Recommendations for the Visibility of Open Access Publications in the Search Engine of the Austrian Library Network: Report of the OBV Working Group “Repositories in the Network”

    Get PDF
    Im vorliegenden Beitrag werden die Ergebnisse der OBV-Arbeitsgruppe „Repositorien im Verbund“ präsentiert. Die AG verfolgte das Ziel, einen Leitfaden für die Erfassung von Metadaten für Objekte in Repositorien, der dazu beiträgt, einheitliche Standards in dieser Hinsicht zu entwickeln, mit dessen Hilfe es in weiterer Folge ermöglicht werden soll, Repositorienbestände ohne Erzeugung von Dubletten in Alma bzw. im Verbundkatalog nachzuweisen. Weitere Ziele waren die Erarbeitung von Empfehlungen für eine zentrale Bereitstellung von Metadaten von Open Access-Publikationen zur Vereinfachung der lokalen Workflows (analog zum DFG-geförderten Projekt DeepGreen) mittels Teilautomatisierung sowie von Empfehlungen für eine Etablierung eines Reiters für Open Access-Materialien in der Suchmaschine des Österreichischen Bibliothekenverbundes (analog zu den Reitern „Fachliteratur“, „Hochschulschriften“ und „Nachlässe / Handschriften“).This paper presents the results of the OBV working group “Repositories in the Austrian Library Network”. The aim of the working group was to develop a guideline for the registration of metadata for objects in repositories, which would contribute to the development of uniform standards in this regard, and with the help of which it should subsequently be possible to identify repository holdings in Alma or in the Austrian Union Catalogue without creating duplicates. Further goals were the development of recommendations for a central provision of metadata of open access publications to simplify local workflows (analogous to the DFG-funded project DeepGreen) by means of partial automation, as well as recommendations for establishing a tab for open access materials in the search engine of the Austrian Library Network (analogous to the tabs “Literature”, “Theses and Dissertations” and “Bequests / Autographs”)
    • …
    corecore