873 research outputs found
Creating Lexical Resources in TEI P5 : a Schema for Multi-purpose Digital Dictionaries
Although most of the relevant dictionary productions of the recent past have relied on digital data and methods, there is little consensus on formats and standards. The Institute for Corpus Linguistics and Text Technology (ICLTT) of the Austrian Academy of Sciences has been conducting a number of varied lexicographic projects, both digitising print dictionaries and working on the creation of genuinely digital lexicographic data. This data was designed to serve varying purposes: machine-readability was only one. A second goal was interoperability with digital NLP tools. To achieve this end, a uniform encoding system applicable across all the projects was developed. The paper describes the constraints imposed on the content models of the various elements of the TEI dictionary module and provides arguments in favour of TEI P5 as an encoding system not only being used to represent digitised print dictionaries but also for NLP purposes
Design and implementation of a research infrastructure for a corpus of spoken ELF
Die vorliegende Arbeit behandelt den Aufbau und die informationstechnische Umsetzung
einer sprachwissenschaftlichen Ressource zur Erforschung von Englisch als Lingua Franca.
In den einzelnen Kapiteln werden verschiedene Schwerpunkte gesetzt, die ein breites
Spektrum von konzeptuellen und theoretischen Ăśberlegungen bis hin zu Aspekten der
direkten Umsetzung abdeckt.
Die Hauptschwerpunkte der Arbeit liegen in einer grundlegenden Betrachtung sprach-
wissenschaftlicher Annotationssysteme (siehe 2. Annotation p.4), der Beschreibung der
formalen Struktur und der Entstehung des VOICE Transkriptionssystems, eines Ăśberblicks
über zusätzlich erforderliche Daten (d.h. Meta-Daten) bis hin zu einer Diskussion des
verwendeten Dokumentenformates.
In der Arbeit wird in den einzelnen Teilbereichen immer wieder auf das Spannungsfeld
zwischen dem wissenschaftlich WĂĽnschenswerten und dem informationstechnisch Mach-
baren hingewiesen. Das beschriebene Transkriptionssystem vereint kognitive und formale
Aspekte in einer Auszeichnungssprache um ein geeignetes Eingabeformat bereit zu stellen.
Die darauf aufbauenden Formate, wie zum Beispiel das Korpusformat in XML, sind in
dieser Arbeit konzeptionell und in wichtigen Implementationsdetails dokumentiert. In
diesem Sinne ist der Text auch als Dokumentation für eine technische Annäherung an die
Korpusresource VOICE zu lesen
Creating Lexical Resources in TEI P5
Although most of the relevant dictionary productions of the recent past have relied on digital data and methods, there is little consensus on formats and standards. The Institute for Corpus Linguistics and Text Technology (ICLTT) of the Austrian Academy of Sciences has been conducting a number of varied lexicographic projects, both digitising print dictionaries and working on the creation of genuinely digital lexicographic data. This data was designed to serve varying purposes: machine-readability was only one. A second goal was interoperability with digital NLP tools. To achieve this end, a uniform encoding system applicable across all the projects was developed. The paper describes the constraints imposed on the content models of the various elements of the TEI dictionary module and provides arguments in favour of TEI P5 as an encoding system not only being used to represent digitised print dictionaries but also for NLP purposes
Stellar over-densities in the halo: the extent of the Virgo over-density
We map the three dimensional extent of the Virgo Over-density by combining
distance information from RR Lyrae variables and projected spatial information
from SEKBO (Keller et al. 2008) and Sloan Digital Sky Survey (SDSS) DR6
photometry. The Virgo Over-density is seen to comprise two filaments 14.5 x 3
degrees and 10 x 3 degrees and a circular structure 3 degrees in diameter.
Together the three features span 38 degrees of right ascension and declinations
of +2 to -15 degrees. RR Lyrae variables place the two filamentary features at
heliocentric distances of 20 and 17 kpc respectively, with projected dimensions
of 5 x 1 kpc and 3 x 1 kpc.Comment: 6 pages, 5 figures, MNRAS accepte
Machine learning coarse-grained potentials of protein thermodynamics
A generalized understanding of protein dynamics is an unsolved scientific problem, the solution of which is critical to the interpretation of the structure-function relationships that govern essential biological processes. Here, we approach this problem by constructing coarse-grained molecular potentials based on artificial neural networks and grounded in statistical mechanics. For training, we build a unique dataset of unbiased all-atom molecular dynamics simulations of approximately 9 ms for twelve different proteins with multiple secondary structure arrangements. The coarse-grained models are capable of accelerating the dynamics by more than three orders of magnitude while preserving the thermodynamics of the systems. Coarse-grained simulations identify relevant structural states in the ensemble with comparable energetics to the all-atom systems. Furthermore, we show that a single coarse-grained potential can integrate all twelve proteins and can capture experimental structural features of mutated proteins. These results indicate that machine learning coarse-grained potentials could provide a feasible approach to simulate and understand protein dynamics
The Chemistry of the Trailing arm of the Sagittarius Dwarf Galaxy
We present abundances of C, O, Ti, and Fe for eleven M-giant stars in the
trailing tidal arm of the Sagittarius dwarf (Sgr). The abundances were derived
by comparing synthetic spectra with high-resolution infrared spectra obtained
with the Phoenix spectrograph on the Gemini South telescope. The targeted stars
are drawn from two regions of the Sgr trailing arm separated by 66 degrees (5
stars) and 132 degrees (6 stars) from the main body of Sgr. The trailing arm
provides a more direct diagnostic of the chemical evolution of Sgr compared to
the extensively phase-mixed leading arm.
Within our restricted sample of ~2-3 Gyr old stars, we find that the stream
material exhibits a significant metallicity gradient of -(2.4\pm0.3)x10^{-3}
dex / degree (-(9.4\pm1.1)x10^{-4} dex / kpc) away from the main body of Sgr.
The tidal disruption of Sgr is a relatively recently event. We therefore
interpret the presence of a metallicity gradient in the debris as indicative of
a similar gradient in the progenitor. The fact that such a metallicity gradient
survived for almost a Hubble time indicates that the efficiency of radial
mixing was very low in the Sgr progenitor.
No significant gradient is seen to exist in the [alpha/Fe] abundance ratio
along the trailing arm. Our results may be accounted for by a radial decrease
in star formation efficiency and/or radial increase in the efficiency of
galactic wind-driven metal loss in the chemical evolution of the Sgr
progenitor. The [Ti/Fe] and [O/Fe] abundance ratios observed within the stream
are distinct from those of the Galactic halo. We conclude that the fraction of
the intermediate to metal-rich halo population contributed by the recent
dissolution (<3 Gyr) of Sgr-like dwarf galaxies can not be substantial.Comment: 22 pages, 7 figures, ApJ accepte
Kinematics & Chemistry of Halo Substructures: The Vicinity of the Virgo Over-Density
We present observations obtained with the AAT's 2dF wide field spectrograph
AAOmega of K-type stars located within a region of the sky which contains the
Virgo Over-Density and the leading arm of the Sagittarius Stream. On the basis
of the resulting velocity histogram we isolate halo substructures in these
overlapping regions including Sagittarius and previously discovered Virgo
groups. Through comparisons with N-body models of the Galaxy-Sagittarius
interaction, we find a tri-axial dark matter halo is favoured and we exclude a
prolate shape. This result is contradictory with other observations along the
Sagittarius leading arm, which typically favour prolate models. We have also
uncovered K-giant members of Sagittarius that are notably more metal poor
([Fe/H] = -1.7 +/- 0.3 dex) than previous studies. This suggests a
significantly wider metallicity distribution exists in the Sagittarius Stream
than formerly considered. We also present data on five carbon stars which were
discovered in our sample.Comment: accepted to A
Recommendations for the Visibility of Open Access Publications in the Search Engine of the Austrian Library Network: Report of the OBV Working Group “Repositories in the Network”
Im vorliegenden Beitrag werden die Ergebnisse der OBV-Arbeitsgruppe „Repositorien im Verbund“ präsentiert. Die AG verfolgte das Ziel, einen Leitfaden für die Erfassung von Metadaten für Objekte in Repositorien, der dazu beiträgt, einheitliche Standards in dieser Hinsicht zu entwickeln, mit dessen Hilfe es in weiterer Folge ermöglicht werden soll, Repositorienbestände ohne Erzeugung von Dubletten in Alma bzw. im Verbundkatalog nachzuweisen. Weitere Ziele waren die Erarbeitung von Empfehlungen für eine zentrale Bereitstellung von Metadaten von Open Access-Publikationen zur Vereinfachung der lokalen Workflows (analog zum DFG-geförderten Projekt DeepGreen) mittels Teilautomatisierung sowie von Empfehlungen für eine Etablierung eines Reiters für Open Access-Materialien in der Suchmaschine des Österreichischen Bibliothekenverbundes (analog zu den Reitern „Fachliteratur“, „Hochschulschriften“ und „Nachlässe / Handschriften“).This paper presents the results of the OBV working group “Repositories in the Austrian Library Network”. The aim of the working group was to develop a guideline for the registration of metadata for objects in repositories, which would contribute to the development of uniform standards in this regard, and with the help of which it should subsequently be possible to identify repository holdings in Alma or in the Austrian Union Catalogue without creating duplicates. Further goals were the development of recommendations for a central provision of metadata of open access publications to simplify local workflows (analogous to the DFG-funded project DeepGreen) by means of partial automation, as well as recommendations for establishing a tab for open access materials in the search engine of the Austrian Library Network (analogous to the tabs “Literature”, “Theses and Dissertations” and “Bequests / Autographs”)
- …