31 research outputs found

    On the Effect of Anticipation on Reading Times

    Full text link
    Over the past two decades, numerous studies have demonstrated how less predictable (i.e., higher surprisal) words take more time to read. In general, these studies have implicitly assumed the reading process is purely responsive: Readers observe a new word and allocate time to process it as required. We argue that prior results are also compatible with a reading process that is at least partially anticipatory: Readers could make predictions about a future word and allocate time to process it based on their expectation. In this work, we operationalize this anticipation as a word's contextual entropy. We assess the effect of anticipation on reading by comparing how well surprisal and contextual entropy predict reading times on four naturalistic reading datasets: two self-paced and two eye-tracking. Experimentally, across datasets and analyses, we find substantial evidence for effects of contextual entropy over surprisal on a word's reading time (RT): in fact, entropy is sometimes better than surprisal in predicting a word's RT. Spillover effects, however, are generally not captured by entropy, but only by surprisal. Further, we hypothesize four cognitive mechanisms through which contextual entropy could impact RTs -- three of which we are able to design experiments to analyze. Overall, our results support a view of reading that is not just responsive, but also anticipatory.Comment: This is a pre-MIT Press publication version of the paper. Code is available in https://github.com/rycolab/anticipation-on-reading-time

    Testing the Predictions of Surprisal Theory in 11 Languages

    Full text link
    A fundamental result in psycholinguistics is that less predictable words take a longer time to process. One theoretical explanation for this finding is Surprisal Theory (Hale, 2001; Levy, 2008), which quantifies a word's predictability as its surprisal, i.e. its negative log-probability given a context. While evidence supporting the predictions of Surprisal Theory have been replicated widely, most have focused on a very narrow slice of data: native English speakers reading English texts. Indeed, no comprehensive multilingual analysis exists. We address this gap in the current literature by investigating the relationship between surprisal and reading times in eleven different languages, distributed across five language families. Deriving estimates from language models trained on monolingual and multilingual corpora, we test three predictions associated with surprisal theory: (i) whether surprisal is predictive of reading times; (ii) whether expected surprisal, i.e. contextual entropy, is predictive of reading times; (iii) and whether the linking function between surprisal and reading times is linear. We find that all three predictions are borne out crosslinguistically. By focusing on a more diverse set of languages, we argue that these results offer the most robust link to-date between information theory and incremental language processing across languages.Comment: This is a pre-MIT Press publication version of the pape

    Revisiting the Uniform Information Density Hypothesis

    Full text link
    The uniform information density (UID) hypothesis posits a preference among language users for utterances structured such that information is distributed uniformly across a signal. While its implications on language production have been well explored, the hypothesis potentially makes predictions about language comprehension and linguistic acceptability as well. Further, it is unclear how uniformity in a linguistic signal -- or lack thereof -- should be measured, and over which linguistic unit, e.g., the sentence or language level, this uniformity should hold. Here we investigate these facets of the UID hypothesis using reading time and acceptability data. While our reading time results are generally consistent with previous work, they are also consistent with a weakly super-linear effect of surprisal, which would be compatible with UID's predictions. For acceptability judgments, we find clearer evidence that non-uniformity in information density is predictive of lower acceptability. We then explore multiple operationalizations of UID, motivated by different interpretations of the original hypothesis, and analyze the scope over which the pressure towards uniformity is exerted. The explanatory power of a subset of the proposed operationalizations suggests that the strongest trend may be a regression towards a mean surprisal across the language, rather than the phrase, sentence, or document -- a finding that supports a typical interpretation of UID, namely that it is the byproduct of language users maximizing the use of a (hypothetical) communication channel

    A Cross-Linguistic Pressure for Uniform Information Density in Word Order

    Full text link
    While natural languages differ widely in both canonical word order and word order flexibility, their word orders still follow shared cross-linguistic statistical patterns, often attributed to functional pressures. In the effort to identify these pressures, prior work has compared real and counterfactual word orders. Yet one functional pressure has been overlooked in such investigations: the uniform information density (UID) hypothesis, which holds that information should be spread evenly throughout an utterance. Here, we ask whether a pressure for UID may have influenced word order patterns cross-linguistically. To this end, we use computational models to test whether real orders lead to greater information uniformity than counterfactual orders. In our empirical study of 10 typologically diverse languages, we find that: (i) among SVO languages, real word orders consistently have greater uniformity than reverse word orders, and (ii) only linguistically implausible counterfactual orders consistently exceed the uniformity of real orders. These findings are compatible with a pressure for information uniformity in the development and usage of natural languages

    Is Hamlet Scandinavian Crime Fiction?

    Get PDF
    This paper combines two of the suggested topics: “law as an instrument of ideology” and “ideological interpellation through law” as it explores whether or not it is possible to use literary fiction as part of an argument in legal argumentation. The use of such an argument is strongly connected with an attitude of state to art, culture and values included both in art and law. Art as a part of a socio-cultural system is one of the material sources of law so it would be natural to admit it in just judicial decision. Unfortunately it is not typical for law to use such relationships between art and law to make judicial decisions better or at least more persuasive. It may be caused by the fact that the choice of what literary fiction is suitable for legal argumentation and which is not can be seen as a kind of ideology. Therefore the state determines which art is “good enough” to be a part of legal reasoning. Usually it differs between “high art” and “mass culture.” It results in a form of “labelling” of art. Unfortunately, by evaluating art in such a manner each state manifests itself as almost a totalitarian one. Socialist realism was a very expressive example. So it is not a matter for the democratic state to decide which piece of art is capable to influence law, is it? In this contribution I will emphasize Žižek’s critique of ideology in order to deal with ideology in argumentation by literary fiction in law. Besides that I will draw inspiration from Law and Literature movements. The aim of my paper is to explain how literary fiction can be used as a legal argument in a proper way as a necessary social appeal through law.This paper combines two of the suggested topics: “law as an instrument of ideology” and “ideological interpellation through law” as it explores whether or not it is possible to use literary fiction as part of an argument in legal argumentation. The use of such an argument is strongly connected with an attitude of state to art, culture and values included both in art and law. Art as a part of a socio-cultural system is one of the material sources of law so it would be natural to admit it in just judicial decision. Unfortunately it is not typical for law to use such relationships between art and law to make judicial decisions better or at least more persuasive. It may be caused by the fact that the choice of what literary fiction is suitable for legal argumentation and which is not can be seen as a kind of ideology. Therefore the state determines which art is “good enough” to be a part of legal reasoning. Usually it differs between “high art” and “mass culture.” It results in a form of “labelling” of art. Unfortunately, by evaluating art in such a manner each state manifests itself as almost a totalitarian one. Socialist realism was a very expressive example. So it is not a matter for the democratic state to decide which piece of art is capable to influence law, is it? In this contribution I will emphasize Žižek’s critique of ideology in order to deal with ideology in argumentation by literary fiction in law. Besides that I will draw inspiration from Law and Literature movements. The aim of my paper is to explain how literary fiction can be used as a legal argument in a proper way as a necessary social appeal through law

    Navigating New Landscapes: The Contribution of Socio-Legal Scholarship in Mapping the Plurality of International Economic Law and Locating Power in International Economic Relations

    Get PDF
    The evolution of international economic law in the past two decades has been characterised by the growth and diversification of international economic actors, the expansion in the substantive areas governed by international law, and, crucially, the proliferation of multiple sites of international economic governance. This web of multi-layered international economic governance is, in turn, underpinned by complex dynamics of power which structure the legal and economic relations between the subjects of international economic law and other actors impacted by international legal rules and regulation. The challenge for international legal scholarship lay not only in mapping the multiple sites of international economic governance but also in unmasking the power dynamics inherent in international economic relations. Locating and analysing power relations underlying international economic law is to crucial to understanding the cause and effect of international economic rules and institutions for rulemaking. Conventional legal scholarship with its doctrinal focus, while useful in providing the foundational basis for analysis, cannot adequately capture the complexity of contemporary international economic law. Socio-legal approaches may be able to overcome these epistemological limitations by supplying: a) the methodologies to study international economic law beyond a focus on rules and institutions; and b) the critical theoretical lens to understand the power dynamics inherent in international legal relations. The objective of this paper is twofold: firstly, it will seek to identify the contributions of socio-legal approaches to the study of international economic law; and secondly, it will explore how socio-legal scholarship can provide a methodological and theoretical framework to construct an understanding of the pluralistic nature of international economic regulatory regimes and their underlying dynamics of power. In doing so, the paper will also consider the value of juxtaposing an empirical methodology for mapping legal regimes with a critical normative approach for analysing power relations in international economic law

    Sosiologi hukum

    No full text

    A Plug-and-Play Method for Controlled Text Generation

    No full text
    Large pre-trained language models have repeatedly shown their ability to produce fluent text. Yet even when starting from a prompt, generation can continue in many plausible directions. Current decoding methods with the goal of controlling generation, e.g., to ensure specific words are included, either require additional models or fine-tuning, or work poorly when the task at hand is semantically unconstrained, e.g., story generation. In this work, we present a plug-and-play decoding method for controlled language generation that is so simple and intuitive, it can be described in a single sentence: given a topic or keyword, we add a shift to the probability distribution over our vocabulary towards semantically similar words. We show how annealing this distribution can be used to impose hard constraints on language generation, something no other plug-and-play method is currently able to do with SOTA language generators. Despite the simplicity of this approach, we see it works incredibly well in practice: decoding from GPT-2 leads to diverse and fluent sentences while guaranteeing the appearance of given guide words. We perform two user studies, revealing that (1) our method outperforms competing methods in human evaluations; and (2) forcing the guide words to appear in the generated text has no impact on the fluency of the generated text
    corecore