Ergativity and depth of analysis
In this paper, I argue that "depth of analysis" does not deserve the prestige that it is sometimes given in general linguistics. While language description should certainly be as detailed as possible, general linguistics must rely on worldwide comparison of languages, and this cannot be based on language-particular analyses. Rigorous quantitative comparison requires uniform measurement, and this implies abstracting away from many language-particular peculiarities. I will illustrate this on the basis of ergative patterns, starting out from I.A. Mel'čuk's (1981) proposal for Lezgian. This proposal was not successful, but why not? And why is Baker's (2015) theory of dependent case likewise unsuccessful? By contrast, quantitative worldwide research has found striking similarities of ergative coding patterns, which can be explained by the efficiency theory of asymmetric coding. I will argue that this success is due to a more cautious approach to understanding Human Language, which does not rely on the Mendeleyevian vision for grammar (that all grammars are made from the same innate building blocks).
On the accuracy of language trees
Historical linguistics aims at inferring the most likely language
phylogenetic tree starting from information concerning the evolutionary
relatedness of languages. The available information typically consists of lists of
homologous (lexical, phonological, syntactic) features or characters for many
different languages.
From this perspective the reconstruction of language trees is an example of
inverse problems: starting from present, incomplete and often noisy,
information, one aims at inferring the most likely past evolutionary history. A
fundamental issue in inverse problems is the evaluation of the inference made.
A standard way of dealing with this question is to generate data with
artificial models in order to have full access to the evolutionary process one
is going to infer. This procedure presents an intrinsic limitation: when
dealing with real data sets, one typically does not know which model of
evolution is the most suitable for them. A possible way out is to compare
algorithmic inference with expert classifications. This is the point of view we
take here by conducting a thorough survey of the accuracy of reconstruction
methods as compared with the Ethnologue expert classifications. We focus in
particular on state-of-the-art distance-based methods for phylogeny
reconstruction using worldwide linguistic databases.
In order to assess the accuracy of the inferred trees we introduce and
characterize two generalizations of standard definitions of distances between
trees. Based on these scores we quantify the relative performances of the
distance-based algorithms considered. Further we quantify how the completeness
and the coverage of the available databases affect the accuracy of the
reconstruction. Finally we draw some conclusions about where the accuracy of
the reconstructions in historical linguistics stands and about the leading
directions to improve it. Comment: 36 pages, 14 figures
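The abstract does not say which standard tree distances it generalizes. As a purely illustrative sketch, the following assumes a Robinson-Foulds-style distance on rooted trees, which counts the clades found in one tree but not the other. The nested-tuple tree encoding, the function names, and the four-language toy trees are assumptions made for this example and are not taken from the paper or from Ethnologue data.

```python
def clades(tree):
    """Return (leaf set, set of clades) for a rooted tree.

    A tree is either a leaf name (str) or a tuple of subtrees.
    Each clade is the frozenset of leaf names below one internal node.
    """
    if isinstance(tree, str):
        return frozenset([tree]), set()
    leaves = frozenset()
    found = set()
    for child in tree:
        child_leaves, child_clades = clades(child)
        leaves |= child_leaves
        found |= child_clades
    found.add(leaves)  # the clade rooted at this internal node
    return leaves, found


def rf_distance(tree_a, tree_b):
    """Number of clades present in exactly one of the two trees."""
    leaves_a, clades_a = clades(tree_a)
    leaves_b, clades_b = clades(tree_b)
    if leaves_a != leaves_b:
        raise ValueError("trees must be built over the same set of languages")
    return len(clades_a ^ clades_b)


# Toy reference grouping (Romance vs. Germanic) and a hypothetical inferred tree.
reference = (("Spanish", "Italian"), ("German", "English"))
inferred = (("Spanish", "German"), ("Italian", "English"))

same = rf_distance(reference, reference)
different = rf_distance(reference, inferred)
```

A distance of zero means the inferred tree reproduces every grouping of the reference. Larger values mean more groupings disagree, which is the kind of score an accuracy survey can aggregate over many language families.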
Information structure in linguistic theory and in speech production : validation of a cross-linguistic data set
The aim of this paper is to validate a dataset collected by means of production experiments which are part of the Questionnaire on Information Structure. The experiments generate a range of information structure contexts that have been observed in the literature to induce specific constructions. This paper compares the speech production results from a subset of these experiments with specific claims about the reflexes of information structure in four different languages. The results allow us to evaluate and in most cases validate the efficacy of our elicitation paradigms, to identify potentially fruitful avenues of future research, and to highlight issues involved in interpreting speech production data of this kind.
Token-based typology and word order entropy: A study based on universal dependencies
The present paper discusses the benefits and challenges of token-based typology, which takes into account the frequencies of words and constructions in language use. This approach makes it possible to introduce new criteria for language classification, which would be difficult or impossible to achieve with the traditional, type-based approach. This point is illustrated by several quantitative studies of word order variation, which can be measured as entropy at different levels of granularity. I argue that this variation can be explained by general functional mechanisms and pressures, which manifest themselves in language use, such as optimization of processing (including avoidance of ambiguity) and grammaticalization of predictable units occurring in chunks. The case studies are based on multilingual corpora, which have been parsed using the Universal Dependencies annotation scheme.
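To make "word order variation measured as entropy" concrete, here is a minimal sketch that assumes Shannon entropy over the relative frequencies of the observed orders of one construction. The function name, the labels "VO" and "OV", and the token counts are hypothetical and do not come from the paper or from any Universal Dependencies treebank.

```python
import math
from collections import Counter


def order_entropy(orders):
    """Shannon entropy (in bits) of a sequence of observed order labels."""
    counts = Counter(orders)
    total = sum(counts.values())
    # Sum p * log2(1/p) over the orders that actually occur.
    return sum((c / total) * math.log2(total / c) for c in counts.values())


# Hypothetical token counts for verb-object order in three toy corpora.
rigid = ["VO"] * 100                    # only one order is attested
mixed = ["VO"] * 75 + ["OV"] * 25       # one order dominates
flexible = ["VO"] * 50 + ["OV"] * 50    # both orders equally frequent

h_rigid = order_entropy(rigid)
h_mixed = order_entropy(mixed)
h_flexible = order_entropy(flexible)
```

With two possible orders the value ranges from 0 bits (fully rigid order) to 1 bit (both orders equally frequent). Counting tokens rather than types is what makes this a token-based measure.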
Model-checking Quantitative Alternating-time Temporal Logic on One-counter Game Models
We consider quantitative extensions of the alternating-time temporal logics
ATL/ATLs called quantitative alternating-time temporal logics (QATL/QATLs) in
which the value of a counter can be compared to constants using equality,
inequality and modulo constraints. We interpret these logics in one-counter
game models which are infinite duration games played on finite control graphs
where each transition can increase or decrease the value of an unbounded
counter. That is, the state space of these games is, generally, infinite. We
consider the model-checking problem of the logics QATL and QATLs on one-counter
game models with VASS semantics for which we develop algorithms and provide
matching lower bounds. Our algorithms are based on reductions of the
model-checking problems to model-checking games. This approach makes it quite
simple for us to deal with extensions of the logical languages as well as the
infinite state spaces. The framework generalizes on one hand qualitative
problems such as ATL/ATLs model-checking of finite-state systems,
model-checking of the branching-time temporal logics CTL and CTLs on
one-counter processes and the realizability problem of LTL specifications. On
the other hand the model-checking problem for QATL/QATLs generalizes
quantitative problems such as the fixed-initial credit problem for energy games
(in the case of QATL) and energy parity games (in the case of QATLs). Our
results are positive as we show that the generalizations are not too costly
with respect to complexity. As a byproduct we obtain new results on the
complexity of model-checking CTLs in one-counter processes and show that
deciding the winner in one-counter games with LTL objectives is
2ExpSpace-complete. Comment: 22 pages, 12 figures
A Multivariate Study of T/V Forms in European Languages Based on a Parallel Corpus of Film Subtitles
The present study investigates the cross-linguistic differences in the use of so-called T/V forms (e.g. French tu and vous, German du and Sie, Russian ty and vy) in ten European languages from different language families and genera. These constraints represent an elusive object of investigation because they depend on a large number of subtle contextual features and social distinctions, which should be cross-linguistically matched. Film subtitles in different languages offer a convenient solution because the situations of communication between film characters can serve as comparative concepts. I selected more than two hundred contexts that contain the pronouns you and yourself in the original English versions, which are then coded for fifteen contextual variables that describe the Speaker and the Hearer, their relationships and different situational properties. The creators of subtitles in the other languages have to choose between T and V when translating from English, where the T/V distinction is not expressed grammatically. On the basis of these situations translated in ten languages, I perform multivariate analyses using the method of conditional inference trees in order to identify the most relevant contextual variables that constrain the T/V variation in each language.
Exploiting microvariation: How to make the best of your incomplete data
In this article we discuss the use of big corpora or databases as a first step for qualitative analysis of linguistic data. We concentrate on ASIt, the Syntactic Atlas of Italy, and take into consideration the different types of dialectal data that can be collected from similar corpora and databases. We analyze all the methodological problems derived from the necessary compromise between the strict requirements imposed by a scientific inquiry and the management of big amounts of data. As a possible solution, we propose that the type of variation is per se a tool to derive meaningful generalizations. To implement this idea, we examine three different types of variation patterns that can be used in the study of morpho-syntax: the geographical distribution of properties (and their total or partial overlapping, or complementary distribution), the so-called leopard spots variation, and the lexical variation index, which can be used to determine the internal complexity of functional items.
Many-valued logics. A mathematical and computational introduction.
2nd edition. Many-valued logics are those logics that have more than the two classical truth values, to wit, true and false; in fact, they can have from three to infinitely many truth values. This property, together with truth-functionality, provides a powerful formalism to reason in settings where classical logic, as well as other non-classical logics, is of no avail. Indeed, originally motivated by philosophical concerns, these logics soon proved relevant for a plethora of applications ranging from switching theory to cognitive modeling, and they are today in more demand than ever, due to the realization that inconsistency and vagueness in knowledge bases and information processes are not only inevitable and acceptable, but also perhaps welcome.
The main modern applications of (any) logic are to be found in the digital computer, and we thus require the practical knowledge how to computerize (which also means automate) decisions (i.e. reasoning) in many-valued logics. This, in turn, necessitates a mathematical foundation for these logics. This book provides both this mathematical foundation and this practical knowledge in a rigorous, yet accessible, text, while at the same time situating these logics in the context of the satisfiability problem (SAT) and automated deduction.
The main text is complemented with a large selection of exercises, a plus for the reader wishing to not only learn about, but also do something with, many-valued logics.
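As a small illustration of truth-functionality with more than two truth values, the sketch below uses Łukasiewicz three-valued logic. This is one well-known many-valued system chosen here as an example, and the blurb does not say which systems the book covers. The function and variable names are assumptions made for this sketch.

```python
from fractions import Fraction

# Truth values of the three-valued Lukasiewicz logic: false, "half-true", true.
HALF = Fraction(1, 2)
VALUES = (Fraction(0), HALF, Fraction(1))


def neg(x):
    """Negation: 1 - x."""
    return 1 - x


def conj(x, y):
    """Conjunction: the minimum of the two values."""
    return min(x, y)


def disj(x, y):
    """Disjunction: the maximum of the two values."""
    return max(x, y)


def impl(x, y):
    """Lukasiewicz implication: min(1, 1 - x + y)."""
    return min(Fraction(1), 1 - x + y)


# Evaluate two classical tautologies at every truth value.
excluded_middle = [disj(p, neg(p)) for p in VALUES]  # p or not p
self_implication = [impl(p, p) for p in VALUES]      # p implies p
```

Each connective is a function of the truth values of its arguments alone, which is what truth-functionality means. In this system "p or not p" takes the value 1/2 when p is 1/2, so it is no longer a tautology, while "p implies p" still takes the value 1 everywhere.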