56 research outputs found

    Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution

    Full text link
    Visual information extraction (VIE) has attracted considerable attention recently owing to its various advanced applications such as document understanding, automatic marking and intelligent education. Most existing works decoupled this problem into several independent sub-tasks of text spotting (text detection and recognition) and information extraction, which completely ignored the high correlation among them during optimization. In this paper, we propose a robust visual information extraction system (VIES) towards real-world scenarios, which is a unified end-to-end trainable framework for simultaneous text detection, recognition and information extraction by taking a single document image as input and outputting the structured information. Specifically, the information extraction branch collects abundant visual and semantic representations from text spotting for multimodal feature fusion and conversely, provides higher-level semantic clues to contribute to the optimization of text spotting. Moreover, regarding the shortage of public benchmarks, we construct a fully-annotated dataset called EPHOIE (https://github.com/HCIILAB/EPHOIE), which is the first Chinese benchmark for both text spotting and visual information extraction. EPHOIE consists of 1,494 images of examination paper head with complex layouts and background, including a total of 15,771 Chinese handwritten or printed text instances. Compared with the state-of-the-art methods, our VIES shows significant superior performance on the EPHOIE dataset and achieves a 9.01% F-score gain on the widely used SROIE dataset under the end-to-end scenario.Comment: 8 pages, 5 figures, to be published in AAAI 202

    The community-centered freshwater biogeochemistry model unified RIVE v1.0: a unified version for water column

    Get PDF
    Research on mechanisms of organic matter degradation, bacterial activities, phytoplankton dynamics, and other processes has led to the development of numerous sophisticated water quality models. The earliest model, dating back to 1925, was based on first-order kinetics for organic matter degradation. The community-centered freshwater biogeochemistry model RIVE was initially developed in 1994 and has subsequently been integrated into several software programs such as Seneque-Riverstrahler, pyNuts-Riverstrahler, ProSe/ProSe-PA, and Barman. After 30 years of research, the use of different programming languages including QBasic, Visual Basic, Fortran, ANSI C, and Python, as well as parallel evolution and the addition of new formalisms, raises questions about their comparability. This paper presents a unified version of the RIVE model for the water column, including formalisms for bacterial communities (heterotrophic and nitrifying), primary producers, zooplankton, nutrients, inorganic carbon, and dissolved oxygen cycles. The unified RIVE model is open-source and implemented in Python 3 to create pyRIVE 1.0 and in ANSI C to create C-RIVE 0.32. The organic matter degradation module is validated by simulating batch experiments. The comparability of the pyRIVE 1.0 and C-RIVE 0.32 software is verified by modeling a river stretch case study. The case study considers the full biogeochemical cycles (microorganisms, nutrients, carbon, and oxygen) in the water column, as well as the effects of light and water temperature. The results show that the simulated concentrations of all state variables, including microorganisms and chemical species, are very similar for pyRIVE 1.0 and C-RIVE 0.32. This open-source project highly encourages contributions from the freshwater biogeochemistry community to further advance the project and achieve common objectives.</p

    Assessing water and energy fluxes in a regional hydrosystem: case study of the Seine basin

    Get PDF
    While it is well accepted that climate change and growing water needs affect long-term sustainable water resources management, performing accurate simulations of water cycle and energy balance dynamics at regional scale remains a challenging task.Traditional Soil-Vegetation-Atmosphere-Transfer (SVAT) models are used for numerical surface water and energy simulations. These models, by conception, do not account for the groundwater lower boundary that permits a full hydrosystem representation. Conversely, while addressing important features such as subsurface heterogeneity and river–aquifer exchanges, groundwater models often integrate overly simplified upper boundary conditions ignoring soil heating and the impacts of vegetation processes on radiation fluxes and root-zone uptakes. In this paper, one of the first attempts to jointly model water and energy fluxes with a special focus on both surface and groundwater at the regional scale is proposed on the Seine hydrosystem (78,650 km2^{2}), which overlays one of the main multi-aquifer systems of Europe.This study couples the SVAT model ORCHIDEE and the process-based hydrological–hydrogeological model CaWaQS, which describes water fluxes, via a one-way coupling approach from ORCHIDEE toward CaWaQS based on the blueprint published by [de Marsily et al., 1978]. An original transport library based on the resolution of the diffusion/advection transport equation was developed in order to simulate heat transfer in both 1D-river networks and pseudo-3D aquifer systems. In addition, an analytical solution is used to simulate heat transport through aquitards and streambeds. Simulated ORCHIDEE surface water and energy fluxes feed fast surface runoff and slow recharge respectively and then is used as CaWaQS forcings to compute river discharges, hydraulic heads and temperature dynamics through space and time, within each of the hydrosystem compartments. The tool makes it possible to establish a fully consistent water and energy budget over a period of 17 years. It also simulates temperature evolution in each aquifer and evaluates that river thermal regulation mostly relies by order of importance on short wave radiations (109.3 W⋅{\cdot }m−2^{-2}), groundwater fluxes (48.1 W⋅{\cdot }m−2^{-2}) and surface runoff (22.7 W⋅{\cdot }m−2^{-2})

    Simulation du métabolisme de la Seine par assimilation de données en continu

    Get PDF
    The aim of the thesis is to implement a data assimilation scheme in the hydro-biogeochemical model ProSe, in order to assimilate continuous measurements of dissolved oxygen in the water column and to determine the temporal evolution of the physiological properties of the communities of living species. First, a new parallel version of ProSe, ProSe-P, is developed coupling the three packages: hydrodynamic, transport and biogeochemical (C-RIVE). Second, a sensitivity analysis of the C-RIVE model allows the identification of a limited number of influentiel parameters controlling the dissolved oxygen concentrations. Based on the selection, a particle filtering algorithm is implemented in order to assimilate sequentially the high frequency oxygen data. The coupling ProSe-P-particle filtre, ProSe-PA is then applied on a synthetic case to tune the numerical settings for the data assimilation and to test the efficiency of the particle filter in river water quality models. Finally, the continuous measurements of dissolved oxygen of the year 2011 in the Seine River are assimilated by ProSe-PA. The results show that ProSe-PA improves significantly the simulation of the dissolved oxygen concentrations, especially the dynamics of algal blooms periods and the fast chute of O2 for the critical periods. This application to the real oxygen data reveals however some limits of the developed approach, especially the sensitivity to the boundary conditions. Some ideas are proposed to improve the performances of ProSe-PA.Cette thĂšse a pour objectif d'implĂ©menter un schĂ©ma d'assimilation de donnĂ©es dans le modĂšle hydro-biogĂ©ochimique ProSe, afin d’assimiler les mesures en continu d’oxygĂšne dissous de la colonne d’eau et de dĂ©terminer l’évolution temporelle des propriĂ©tĂ©s physiologiques des communautĂ©s vivantes. Dans un premier temps, une nouvelle version parallĂ©lisĂ©e de ProSe, ProSe-P, est dĂ©veloppĂ©e en couplant les librairies hydraulique, de transport et biogĂ©ochimique (C-RIVE). Dans un deuxiĂšme temps, une analyse de sensibilitĂ© du module C-RIVE permet d'identifier un nombre restreint de paramĂštres influençant fortement les concentrations en oxygĂšne dissous. BasĂ© sur cette sĂ©lection, un algorithme de filtrage particulaire est implĂ©mentĂ© afin d'assimiler sĂ©quentiellement les donnĂ©es haute frĂ©quence d'oxygĂšne dissous. Le couple ProSe-P-filtre particulaire, ProSe-PA, est ensuite appliquĂ© sur un cas synthĂ©tique afin d'identifier les paramĂštres numĂ©riques pertinents et de valider l'efficacitĂ© du filtre particulaire pour les modĂšles de qualitĂ© de l'eau en riviĂšre. Enfin, les mesures en continu d'O2 dissous de l'annĂ©e 2011 en Seine sont assimilĂ©es par ProSe-PA. Les rĂ©sultats montrent que ProSe-PA amĂ©liore significativement la simulation des concentrations en oxygĂšne dissous, notamment les dynamiques alguales et les chutes d'oxygĂšne pendant les pĂ©riodes de crise. L'application aux donnĂ©es rĂ©elles rĂ©vĂšle cependant les limites de l'approche dĂ©veloppĂ©e, notamment la sensibilitĂ© aux conditions aux limites. Plusieurs pistes sont proposĂ©es afin d'amĂ©liorer les performances de ProSe-PA

    Simulation of metabolism of Seine River by continuous data assimilation

    No full text
    Cette thĂšse a pour objectif d'implĂ©menter un schĂ©ma d'assimilation de donnĂ©es dans le modĂšle hydro-biogĂ©ochimique ProSe, afin d’assimiler les mesures en continu d’oxygĂšne dissous de la colonne d’eau et de dĂ©terminer l’évolution temporelle des propriĂ©tĂ©s physiologiques des communautĂ©s vivantes. Dans un premier temps, une nouvelle version parallĂ©lisĂ©e de ProSe, ProSe-P, est dĂ©veloppĂ©e en couplant les librairies hydraulique, de transport et biogĂ©ochimique (C-RIVE). Dans un deuxiĂšme temps, une analyse de sensibilitĂ© du module C-RIVE permet d'identifier un nombre restreint de paramĂštres influençant fortement les concentrations en oxygĂšne dissous. BasĂ© sur cette sĂ©lection, un algorithme de filtrage particulaire est implĂ©mentĂ© afin d'assimiler sĂ©quentiellement les donnĂ©es haute frĂ©quence d'oxygĂšne dissous. Le couple ProSe-P-filtre particulaire, ProSe-PA, est ensuite appliquĂ© sur un cas synthĂ©tique afin d'identifier les paramĂštres numĂ©riques pertinents et de valider l'efficacitĂ© du filtre particulaire pour les modĂšles de qualitĂ© de l'eau en riviĂšre. Enfin, les mesures en continu d'O2 dissous de l'annĂ©e 2011 en Seine sont assimilĂ©es par ProSe-PA. Les rĂ©sultats montrent que ProSe-PA amĂ©liore significativement la simulation des concentrations en oxygĂšne dissous, notamment les dynamiques alguales et les chutes d'oxygĂšne pendant les pĂ©riodes de crise. L'application aux donnĂ©es rĂ©elles rĂ©vĂšle cependant les limites de l'approche dĂ©veloppĂ©e, notamment la sensibilitĂ© aux conditions aux limites. Plusieurs pistes sont proposĂ©es afin d'amĂ©liorer les performances de ProSe-PA.The aim of the thesis is to implement a data assimilation scheme in the hydro-biogeochemical model ProSe, in order to assimilate continuous measurements of dissolved oxygen in the water column and to determine the temporal evolution of the physiological properties of the communities of living species. First, a new parallel version of ProSe, ProSe-P, is developed coupling the three packages: hydrodynamic, transport and biogeochemical (C-RIVE). Second, a sensitivity analysis of the C-RIVE model allows the identification of a limited number of influentiel parameters controlling the dissolved oxygen concentrations. Based on the selection, a particle filtering algorithm is implemented in order to assimilate sequentially the high frequency oxygen data. The coupling ProSe-P-particle filtre, ProSe-PA is then applied on a synthetic case to tune the numerical settings for the data assimilation and to test the efficiency of the particle filter in river water quality models. Finally, the continuous measurements of dissolved oxygen of the year 2011 in the Seine River are assimilated by ProSe-PA. The results show that ProSe-PA improves significantly the simulation of the dissolved oxygen concentrations, especially the dynamics of algal blooms periods and the fast chute of O2 for the critical periods. This application to the real oxygen data reveals however some limits of the developed approach, especially the sensitivity to the boundary conditions. Some ideas are proposed to improve the performances of ProSe-PA
    • 

    corecore