5 research outputs found

    QueryForm: A Simple Zero-shot Form Entity Query Framework

    Full text link
    Zero-shot transfer learning for document understanding is a crucial yet under-investigated scenario to help reduce the high cost involved in annotating document entities. We present a novel query-based framework, QueryForm, that extracts entity values from form-like documents in a zero-shot fashion. QueryForm contains a dual prompting mechanism that composes both the document schema and a specific entity type into a query, which is used to prompt a Transformer model to perform a single entity extraction task. Furthermore, we propose to leverage large-scale query-entity pairs generated from form-like webpages with weak HTML annotations to pre-train QueryForm. By unifying pre-training and fine-tuning into the same query-based framework, QueryForm enables models to learn from structured documents containing various entities and layouts, leading to better generalization to target document types without the need for target-specific training data. QueryForm sets new state-of-the-art average F1 score on both the XFUND (+4.6%~10.1%) and the Payment (+3.2%~9.5%) zero-shot benchmark, with a smaller model size and no additional image input.Comment: Accepted to Findings of ACL 202

    Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding

    Full text link
    Table-based reasoning with large language models (LLMs) is a promising direction to tackle many table understanding tasks, such as table-based question answering and fact verification. Compared with generic reasoning, table-based reasoning requires the extraction of underlying semantics from both free-form questions and semi-structured tabular data. Chain-of-Thought and its similar approaches incorporate the reasoning chain in the form of textual context, but it is still an open question how to effectively leverage tabular data in the reasoning chain. We propose the Chain-of-Table framework, where tabular data is explicitly used in the reasoning chain as a proxy for intermediate thoughts. Specifically, we guide LLMs using in-context learning to iteratively generate operations and update the table to represent a tabular reasoning chain. LLMs can therefore dynamically plan the next operation based on the results of the previous ones. This continuous evolution of the table forms a chain, showing the reasoning process for a given tabular problem. The chain carries structured information of the intermediate results, enabling more accurate and reliable predictions. Chain-of-Table achieves new state-of-the-art performance on WikiTQ, FeTaQA, and TabFact benchmarks across multiple LLM choices.Comment: Accepted to ICLR 202

    FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction

    Full text link
    The recent advent of self-supervised pre-training techniques has led to a surge in the use of multimodal learning in form document understanding. However, existing approaches that extend the mask language modeling to other modalities require careful multi-task tuning, complex reconstruction target designs, or additional pre-training data. In FormNetV2, we introduce a centralized multimodal graph contrastive learning strategy to unify self-supervised pre-training for all modalities in one loss. The graph contrastive objective maximizes the agreement of multimodal representations, providing a natural interplay for all modalities without special customization. In addition, we extract image features within the bounding box that joins a pair of tokens connected by a graph edge, capturing more targeted visual cues without loading a sophisticated and separately pre-trained image embedder. FormNetV2 establishes new state-of-the-art performance on FUNSD, CORD, SROIE and Payment benchmarks with a more compact model size.Comment: Accepted to ACL 202

    Emerging stability of forest productivity by mixing two species buffers temperature destabilizing effect

    Get PDF
    The increasing disturbances in monocultures around the world are testimony to their instability under global change. Many studies have claimed that temporal stability of productivity increases with species richness, although the ecological fundamentals have mainly been investigated through diversity experiments. To adequately manage forest ecosystems, it is necessary to have a comprehensive understanding of the effect of mixing species on the temporal stability of productivity and the way in which it is influenced by climate conditions across large geographical areas. Here, we used a unique dataset of 261 stands combining pure and two-species mixtures of four relevant tree species over a wide range of climate conditions in Europe to examine the effect of species mixing on the level and temporal stability of productivity. Structural equation modelling was employed to further explore the direct and indirect influence of climate, overyielding, species asynchrony and additive effect (i.e. temporal stability expected from the species growth in monospecific stands) on temporal stability in mixed forests. We showed that by adding only one tree species to monocultures, the level (overyielding: +6%) and stability (temporal stability: +12%) of stand growth increased significantly. We identified the key effect of temperature on destabilizing stand growth, which may be mitigated by mixing species. We further confirmed asynchrony as the main driver of temporal stability in mixed stands, through both the additive effect and species interactions, which modify between-species asynchrony in mixtures in comparison to monocultures. Synthesis and applications. This study highlights the emergent properties associated with mixing two species, which result in resource efficient and temporally stable production systems. We reveal the negative impact of mean temperature on temporal stability of forest productivity and how the stabilizing effect of mixing two species can counterbalance this impact. The overyielding and temporal stability of growth addressed in this paper are essential for ecosystem services closely linked with the level and rhythm of forest growth. Our results underline that mixing two species can be a realistic and effective nature-based climate solution, which could contribute towards meeting EU climate target policies
    corecore