470 research outputs found
Pascal Sidler, Schwarzröcke, Jakobiner, Patrioten: Revolution, Kontinuität und Widerstand im konfessionell gemischten Toggenburg 1795-1803, 2013
No abstract available
Prefixal articles across domains: Syntactic licensing in Albanian
In this article we venture to elucidate the origin of the Albanian subjunctive marker të-. We contend that this marker is historically linked to a morphosyntactic device which is traditionally described as linking article and which licenses nominal syntactic units as constituents of larger syntactic units. Based on the observation that there is a substantial distributional, functional and semantic overlap between nonfinite verbal forms marked with të- and finite subjunctive predicates, we propose that the subjunctive marker spread across host classes from nominals to nonfinite predicates and to finite subjunctive predicates. The spread into the finite verbal domain is areally fostered, while the licensing device itself is an independent Albanian development that possibly picks up a vertical, Indo-European signal
Decoding Middle Welsh clauses or "Avoid Ambiguity”
Middle Welsh is a language with a restricted set of morphosyntactic distinctions for grammatical relations and with relatively free word order in positive main declarative causes. However, syntactic ambiguity rarely, if ever, arises in natural texts. The present article shows in a corpus-based study how syntactic ambiguity is prevented and how morphological features interact with two referential properties, namely animacy and accessibility, in order to successfully identify grammatical relations in Middle Welsh. Further lower-tier factors are the semantics of the verb and the wider narrative context. The article complements recent insights suggesting that subject-verb agreement is not only determined by wordorder patterns, but also by referential properties of subject
Areal and phylogenetic dimensions of word order variation in Indo-European languages
Both areal and phylogenetic affiliation have been discussed as driving factors of the distribution of word order in the languages of the world. However, disentangling the interaction of these two factors is challenging. Here we take Indo-European as a test case. Word order in this family is largely homogeneous both within areas and within branches, which makes it difficult to assess which factor was more important in shaping the present-day distribution. To break out of this impasse we turn to corpus data and explicit statistical modeling. Building on a parallel corpus of movie subtitles, we investigate word order on the sentence level under stable pragmatic conditions. We measure the similarity of word order variation between pairs of languages with an information-theoretic distance metric. Using cluster analysis and variation partitioning methods these distance metrics show that phylogenetic distance predicts more variation than geographical distance, but the most important predictor is the shared fraction where phylogeny and area overlap. We conclude that word order has evolved along both dimensions and cannot be reduced to a single one
Advancing Natural-Language Based Audio Retrieval with PaSST and Large Audio-Caption Data Sets
This work presents a text-to-audio-retrieval system based on pre-trained text
and spectrogram transformers. Our method projects recordings and textual
descriptions into a shared audio-caption space in which related examples from
different modalities are close. Through a systematic analysis, we examine how
each component of the system influences retrieval performance. As a result, we
identify two key components that play a crucial role in driving performance:
the self-attention-based audio encoder for audio embedding and the utilization
of additional human-generated and synthetic data sets during pre-training. We
further experimented with augmenting ClothoV2 captions with available keywords
to increase their variety; however, this only led to marginal improvements. Our
system ranked first in the 2023's DCASE Challenge, and it outperforms the
current state of the art on the ClothoV2 benchmark by 5.6 pp. [email protected]: submitted to DCASE Workshop 202
Atharvaveda Paippalāda manuscripts (1.0.0) [Dataset]
The collection of manuscript photographs presented here was provided by Arlo Griffiths, who assembled them during a series of field trips to various locations in Odisha (eastern India) in the years 1998 to 2001 and later added to.
The data were prepared for publication and supplemented by reference tables by Paul Widmer and Thomas Zehnder
Energy-aware Service Allocation for Cloud Computing
Energy efficiency has become an important managerial variable of IT management. Whereas cloud computing promises significantly higher levels of energy efficiency, it is still not known, if and to what extent outsourcing of software applications to cloud service providers affects the overall energy efficiency. This research is concerned with the allocation of cloud services from providers to customers and addresses the problem of energy-aware service allocation. The distributed nature of the problem, i.e., the multiple loci of control, entails the failure of centralised solutions. Hence, we approach this problem from a multiagent system perspective, which preserves the distributed setting of multiple service providers and customers. The contribution of our research is a game-theoretic framework for analysing service provider and customer interactions and a novel distributed allocation mechanism based on this framework to approximate energy-efficient, optimal allocations. We demonstrate the usefulness and efficacy of the proposed artifact in several simulation experiments
Pairing peers and pears: Changing conventions of Gheg Albanian heritage speakers
Migration events splitting speaker communities and establishing novel contact situations are among the major drivers of language variation and change. While the precise processes that lead to change cannot usually be determined for past events with any certainty, the study of minority and heritage language usage in apparent time may provide insight into the contribution of the linguistic behavior underlying the dynamics. We capitalize on this and compare parts of speech usage in Pear Story renarrations across Gheg Albanian speakers of three generations in German-speaking environments, applying methods from information theory. The results suggest that the changing conventions in parts of speech usage across generations and places of residence can be attributed to changing linguistic behavior within the speaker community in the migration setting. These findings highlight the impact of changing sociocultural embedding and the roles of vertical and horizontal transmission in language change
- …