Search CORE

5 research outputs found

QueryForm: A Simple Zero-shot Form Entity Query Framework

Author: Devlin Jacob
Dy Jennifer
Lee Chen-Yu
Perot Vincent
Pfister Tomas
Su Guolong
Wang Zifeng
Zhang Hao
Zhang Zizhao
Publication venue
Publication date: 27/06/2023
Field of study

Zero-shot transfer learning for document understanding is a crucial yet under-investigated scenario to help reduce the high cost involved in annotating document entities. We present a novel query-based framework, QueryForm, that extracts entity values from form-like documents in a zero-shot fashion. QueryForm contains a dual prompting mechanism that composes both the document schema and a specific entity type into a query, which is used to prompt a Transformer model to perform a single entity extraction task. Furthermore, we propose to leverage large-scale query-entity pairs generated from form-like webpages with weak HTML annotations to pre-train QueryForm. By unifying pre-training and fine-tuning into the same query-based framework, QueryForm enables models to learn from structured documents containing various entities and layouts, leading to better generalization to target document types without the need for target-specific training data. QueryForm sets new state-of-the-art average F1 score on both the XFUND (+4.6%~10.1%) and the Payment (+3.2%~9.5%) zero-shot benchmark, with a smaller model size and no additional image input.Comment: Accepted to Findings of ACL 202

arXiv.org e-Print Archive

Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding

Author: Eisenschlos Julian Martin
Fujii Yasuhisa
Lee Chen-Yu
Li Chun-Liang
Miculicich Lesly
Perot Vincent
Pfister Tomas
Shang Jingbo
Wang Zifeng
Wang Zilong
Zhang Hao
Publication venue
Publication date: 18/01/2024
Field of study

Table-based reasoning with large language models (LLMs) is a promising direction to tackle many table understanding tasks, such as table-based question answering and fact verification. Compared with generic reasoning, table-based reasoning requires the extraction of underlying semantics from both free-form questions and semi-structured tabular data. Chain-of-Thought and its similar approaches incorporate the reasoning chain in the form of textual context, but it is still an open question how to effectively leverage tabular data in the reasoning chain. We propose the Chain-of-Table framework, where tabular data is explicitly used in the reasoning chain as a proxy for intermediate thoughts. Specifically, we guide LLMs using in-context learning to iteratively generate operations and update the table to represent a tabular reasoning chain. LLMs can therefore dynamically plan the next operation based on the results of the previous ones. This continuous evolution of the table forms a chain, showing the reasoning process for a given tabular problem. The chain carries structured information of the intermediate results, enabling more accurate and reliable predictions. Chain-of-Table achieves new state-of-the-art performance on WikiTQ, FeTaQA, and TabFact benchmarks across multiple LLM choices.Comment: Accepted to ICLR 202

arXiv.org e-Print Archive

FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction

Author: Ainslie Joshua
Dozat Timothy
Fujii Yasuhisa
Glushnev Nikolai
Hua Nan
Lee Chen-Yu
Li Chun-Liang
Long Shangbang
Perot Vincent
Pfister Tomas
Qin Siyang
Sohn Kihyuk
Su Guolong
Wang Renshen
Zhang Hao
Zhang Xiang
Publication venue
Publication date: 13/06/2023
Field of study

The recent advent of self-supervised pre-training techniques has led to a surge in the use of multimodal learning in form document understanding. However, existing approaches that extend the mask language modeling to other modalities require careful multi-task tuning, complex reconstruction target designs, or additional pre-training data. In FormNetV2, we introduce a centralized multimodal graph contrastive learning strategy to unify self-supervised pre-training for all modalities in one loss. The graph contrastive objective maximizes the agreement of multimodal representations, providing a natural interplay for all modalities without special customization. In addition, we extract image features within the bounding box that joins a pair of tokens connected by a graph edge, capturing more targeted visual cues without loading a sophisticated and separately pre-trained image embedder. FormNetV2 establishes new state-of-the-art performance on FUNSD, CORD, SROIE and Payment benchmarks with a more compact model size.Comment: Accepted to ACL 202

arXiv.org e-Print Archive

Emerging stability of forest productivity by mixing two species buffers temperature destabilizing effect

The increasing disturbances in monocultures around the world are testimony to their instability under global change. Many studies have claimed that temporal stability of productivity increases with species richness, although the ecological fundamentals have mainly been investigated through diversity experiments. To adequately manage forest ecosystems, it is necessary to have a comprehensive understanding of the effect of mixing species on the temporal stability of productivity and the way in which it is influenced by climate conditions across large geographical areas. Here, we used a unique dataset of 261 stands combining pure and two-species mixtures of four relevant tree species over a wide range of climate conditions in Europe to examine the effect of species mixing on the level and temporal stability of productivity. Structural equation modelling was employed to further explore the direct and indirect influence of climate, overyielding, species asynchrony and additive effect (i.e. temporal stability expected from the species growth in monospecific stands) on temporal stability in mixed forests. We showed that by adding only one tree species to monocultures, the level (overyielding: +6%) and stability (temporal stability: +12%) of stand growth increased significantly. We identified the key effect of temperature on destabilizing stand growth, which may be mitigated by mixing species. We further confirmed asynchrony as the main driver of temporal stability in mixed stands, through both the additive effect and species interactions, which modify between-species asynchrony in mixtures in comparison to monocultures. Synthesis and applications. This study highlights the emergent properties associated with mixing two species, which result in resource efficient and temporally stable production systems. We reveal the negative impact of mean temperature on temporal stability of forest productivity and how the stabilizing effect of mixing two species can counterbalance this impact. The overyielding and temporal stability of growth addressed in this paper are essential for ecosystem services closely linked with the level and rhythm of forest growth. Our results underline that mixing two species can be a realistic and effective nature-based climate solution, which could contribute towards meeting EU climate target policies

Epsilon Open Archive

Autologous Stem Cell Transplantation in Acute Myelocytic Leukemia

Author: Adler
Allieri
Almici
Appelbaum
Appelbaum
Ashwood-Smith
Ball
Barnes
Belkacemi
Benyunes
Brenner
Brenner
Brugger
Buckner
Bullock
Burnett
Burnett
Busca
Cahn
Cahn
Capizzi
Carella
Carella
Carlo-Stella
Cassileth
Cassileth
Cavins
Chang
Chao
Charak
Coulombel
Crump
Dann
Degos
Deisseroth
Dexter
Dicke
Dicke
Douay
Douay
Douay
Douay
Douay
Douay
Douay
Dufoir
Dusenbery
Dusenbery
Epstein
Ericson
Esteve
Fenaux
Ferrant
Frassoni
Giralt
Gondo
Gorin
Gorin
Gorin
Gorin
Gorin
Gorin
Gorin
Gorin
Gorin
Gorin
Gribben
Grimwade
Gulati
Harousseau
Hermans
Herrmann
Horowitz
Kanamaru
Kaneko
Klingebiel
Korbling
Lapidot
Laporte
Laporte
Linker
Lopez
Lowdell
Lowenberg
Mandelli
Mandelli
Martens
Mayer
Mc Millan
Mehta
Mehta
Meloni
Meloni
Meloni
Meloni
Miggiano
Miller
Miller
Mitus
Motta
N.C. Gorin
Nemunaitis
Newman
Notter
Olivieri
Perot
Petersen
Polge
Porter
Prentice
Ravindranath
Reading
Reiffers
Reiffers
Ringden
Ringden
Rizzoli
Robertson
Rowley
Rowley
Rowley
Salooja
San Miguel
Sanders
Santos
Sanz
Schaefer
Schiffman
Schultz
Selvaggi
Shah
Sharkis
Shpall
Sierra
Sierra
Sierra
Skorski
Snyder
Stein
Straetmans
Suciu
Swedish Council on Technology Assessment in Health Care
Testa
Thomas
Tiedemann
To
Tobal
Tomas
Tomas
Toren
Traweek
Tutschka
Van Den Akker
Verdonck
Vignetti
Visani
Vogler
Wahlin
Welch
Whitlock
Wiley
Woods
Yeager
Yeager
Yeager
Zhong
Zittoun
Publication venue: 'American Society of Hematology'
Publication date
Field of study

Crossref