174 research outputs found
Multilingual simultaneous sentence end and punctuation prediction
This paper describes the model and its corresponding setup, proposed by the Unbabel & INESC-ID team for the 1st Shared Task on Sentence End and Punctuation Prediction in NLG Text (SEPP-NLG 2021). The shared task covers 4 languages (English, German, French and Italian) and includes two subtasks: Subtask 1 - detecting the end of a sentence, and subtask 2 - predicting a range of punctuation marks. Our team proposes a single multilingual and multitask model that is able to produce suitable results for all the languages and subtasks involved. The results show that it is possible to achieve state-of-the-art results using one single multilingual model for both tasks and multiple languages. Using a single multilingual model to solve the task for multiple languages is of particular importance, since training a different model for each language is a cumbersome and time-consuming process.info:eu-repo/semantics/publishedVersio
Chatbots’ greetings to human-computer communication
In the last years, chatbots have gained new attention, due to the interest showed by widely known personalities and companies. The concept is broad, and, in this paper we target the work developed by the (old) community that is typically associated with chatbot’s competitions. In our opinion, they contribute with very interesting know-how, but specially with large-scale corpora, gathered by interactions with real people, an invaluable resource considering the renewed interest in Deep Nets.info:eu-repo/semantics/publishedVersio
L2F/INESC-ID at SemEval-2019 Task 2: unsupervised lexical semantic frame induction using contextualized word representations
Building large datasets annotated with semantic information, such as FrameNet, is an expensive process. Consequently, such resources are unavailable for many languages and specific domains. This problem can be alleviated by using unsupervised approaches to induce the frames evoked by a collection of documents. That is the objective of the second task of SemEval 2019, which comprises three subtasks: clustering of verbs that evoke the same frame and clustering of arguments into both frame-specific slots and semantic roles. We approach all the subtasks by applying a graph clustering algorithm on contextualized embedding representations of the verbs and arguments. Using such representations is appropriate in the context of this task, since they provide cues for word-sense disambiguation. Thus, they can be used to identify different frames evoked by the same words. Using this approach we were able to outperform all of the baselines reported for the task on the test set in terms of Purity F1, as well as in terms of BCubed F1 in most cases.info:eu-repo/semantics/publishedVersio
Analysis of ozone and nitric acid in spring and summer Arctic pollution using aircraft, ground-based, satellite observations and MOZART-4 model: source attribution and partitioning
In this paper, we analyze tropospheric O_3 together with HNO_3 during the POLARCAT (Polar Study using Aircraft, Remote Sensing, Surface Measurements and Models, of Climate, Chemistry, Aerosols, and Transport) program, combining observations and model results. Aircraft observations from the NASA ARCTAS (Arctic Research of the Composition of the Troposphere from Aircraft and Satellites) and NOAA ARCPAC (Aerosol, Radiation and Cloud Processes affecting Arctic Climate) campaigns during spring and summer of 2008 are used together with the Model for Ozone and Related Chemical Tracers, version 4 (MOZART-4) to assist in the interpretation of the observations in terms of the source attribution and transport of O_3 and HNO_3 into the Arctic (north of 60° N). The MOZART-4 simulations reproduce the aircraft observations generally well (within 15%), but some discrepancies in the model are identified and discussed. The observed correlation of O_3 with HNO_3 is exploited to evaluate the MOZART-4 model performance for different air mass types (fresh plumes, free troposphere and stratospheric-contaminated air masses).
Based on model simulations of O_3 and HNO_3 tagged by source type and region, we find that the anthropogenic pollution from the Northern Hemisphere is the dominant source of O3 and HNO3 in the Arctic at pressures greater than 400 hPa, and that the stratospheric influence is the principal contribution at pressures less 400 hPa. During the summer, intense Russian fire emissions contribute some amount to the tropospheric columns of both gases over the American sector of the Arctic. North American fire emissions (California and Canada) also show an important impact on tropospheric ozone in the Arctic boundary layer.
Additional analysis of tropospheric O_3 measurements from ground-based FTIR and from the IASI satellite sounder made at the Eureka (Canada) and Thule (Greenland) polar sites during POLARCAT has been performed using the tagged contributions. It demonstrates the capability of these instruments for observing pollution at northern high latitudes. Differences between contributions from the sources to the tropospheric columns as measured by FTIR and IASI are discussed in terms of vertical sensitivity associated with these instruments. The first analysis of O_3 tropospheric columns observed by the IASI satellite instrument over the Arctic is also provided. Despite its limited vertical sensitivity in the lowermost atmospheric layers, we demonstrate that IASI is capable of detecting low-altitude pollution transported into the Arctic with some limitations
Measurements of hydrogen cyanide (HCN) and acetylene (C2H2) from the Infrared Atmospheric Sounding Interferometer (IASI)
Hydrogen cyanide (HCN) and acetylene (C2H2) are ubiquitous atmospheric trace gases with medium lifetime, which are frequently used as indicators of combustion sources and as tracers for atmospheric transport and chemistry. Because of their weak infrared absorption, overlapped by the CO2 Q branch near 720 cm−1, nadir sounders have up to now failed to measure these gases routinely. Taking into account CO2 line mixing, we provide for the first time extensive measurements of HCN and C2H2 total columns at Reunion Island (21° S, 55° E) and Jungfraujoch (46° N, 8° E) in 2009–2010 using observations from the Infrared Atmospheric Sounding Interferometer (IASI). A first order comparison with local ground-based Fourier transform InfraRed (FTIR) measurements has been carried out allowing tests of seasonal consistency which is reasonably captured, except for HCN at Jungfraujoch. The IASI data shows a greater tendency to high C2H2 values. We also examine a nonspecific biomass burning plume over austral Africa and show that the emission ratios with respect to CO agree with previously reported values
Cross-validation of IASI/MetOp derived tropospheric δD with TES and ground-based FTIR observations
International audienceThe Infrared Atmospheric Sounding Interferometer (IASI) flying on-board MetOpA and MetOpB is able to capture fine isotopic variations of the HDO to H2O ratio (δD) in the troposphere. Such observations at the high spatio temporal resolution of the sounder are of great interest to improve our understanding of the mechanisms controlling humidity in the troposphere. In this study we aim to empirically assess the validity of our error estimation previously evaluated theoretically. To achieve this, we compare IASI δD retrieved profiles with other available profiles of δD, from the TES infrared sounder onboard AURA and from three ground-based FTIR stations produced within the MUSICA project: the NDACC (Network for the Detection of Atmospheric Composition Change) sites Kiruna and Izana, and the TCCON site Karlsruhe, which in addition to near-infrared TCCON spectra also records mid-infrared spectra. We describe the achievable level of agreement between the different retrievals and show that these theoretical errors are in good agreement with empirical differences. The comparisons are made at different locations from tropical to Arctic latitudes, above sea and above land. Generally IASI and TES are similarly sensitive to δD in the free troposphere which allows to compare their measurements directly. At tropical latitudes where IASI's sensitivity is lower than that of TES, we show that the agreement improves when taking into account the sensitivity of IASI in the TES retrieval. For the comparison IASI-FTIR only direct comparisons are performed because of similar sensitivities. We identify a quasi negligible bias in the free troposphere (−3‰) between IASI retrieved δD with the TES one, which are bias corrected, but an important with the ground-based FTIR reaching −47‰. We also suggest that model-satellite observations comparisons could be optimized with IASI thanks to its high spatial and temporal sampling
Importance of the Saharan heat low in controlling the North Atlantic free tropospheric humidity budget deduced from IASI <i>δ</i>D observations
The isotopic composition of water vapour in the North Atlantic free
troposphere is investigated with Infrared Atmospheric Sounding Interferometer
(IASI) measurements of the D ∕ H ratio (δD) above the ocean. We
show that in the vicinity of West Africa, the seasonality of δD is
particularly strong (130 ‰), which is related with the influence of
the Saharan heat low (SHL) during summertime. The SHL indeed largely
influences the dynamic in that region by producing deep turbulent mixing
layers, yielding a specific water vapour isotopic footprint. The influence of
the SHL on the isotopic budget is analysed on various time and space scales
and is shown to be large, highlighting the importance of the SHL dynamics on
the moistening and the HDO enrichment of the free troposphere over the North
Atlantic. The potential influence of the SHL is also investigated on the
inter-annual scale as we also report important variations in δD above
the Canary archipelago region. We interpret the variability in the
enrichment, using backward trajectory analyses, in terms of the ratio of air
masses coming from the North Atlantic and air masses coming from the African
continent. Finally, the interest of IASI high sampling capabilities is
further illustrated by presenting spatial distributions of δD and
humidity above the North Atlantic from which we show that the different
sources and dehydration pathways controlling the humidity can be disentangled
thanks to the added value of δD observations. More generally, our
results demonstrate the utility of δD observations obtained from the
IASI sounder to gain insight into the hydrological cycle processes in the
West African region
Decoherence of matter waves by thermal emission of radiation
Emergent quantum technologies have led to increasing interest in decoherence
- the processes that limit the appearance of quantum effects and turn them into
classical phenomena. One important cause of decoherence is the interaction of a
quantum system with its environment, which 'entangles' the two and distributes
the quantum coherence over so many degrees of freedom as to render it
unobservable. Decoherence theory has been complemented by experiments using
matter waves coupled to external photons or molecules, and by investigations
using coherent photon states, trapped ions and electron interferometers. Large
molecules are particularly suitable for the investigation of the
quantum-classical transition because they can store much energy in numerous
internal degrees of freedom; the internal energy can be converted into thermal
radiation and thus induce decoherence. Here we report matter wave
interferometer experiments in which C70 molecules lose their quantum behaviour
by thermal emission of radiation. We find good quantitative agreement between
our experimental observations and microscopic decoherence theory. Decoherence
by emission of thermal radiation is a general mechanism that should be relevant
to all macroscopic bodies.Comment: 5 pages, 4 figure
Tropospheric methanol observations from space: retrieval evaluation and constraints on the seasonality of biogenic emissions
Methanol retrievals from nadir-viewing space-based sensors offer powerful new information for quantifying methanol emissions on a global scale. Here we apply an ensemble of aircraft observations over North America to evaluate new methanol measurements from the Tropospheric Emission Spectrometer (TES) on the Aura satellite, and combine the TES data with observations from the Infrared Atmospheric Sounding Interferometer (IASI) on the MetOp-A satellite to investigate the seasonality of methanol emissions from northern midlatitude ecosystems. Using the GEOS-Chem chemical transport model as an intercomparison platform, we find that the TES retrieval performs well when the degrees of freedom for signal (DOFS) are above 0.5, in which case the model : TES regressions are generally consistent with the model : aircraft comparisons. Including retrievals with DOFS below 0.5 degrades the comparisons, as these are excessively influenced by the a priori. The comparisons suggest DOFS > 0.5 as a minimum threshold for interpreting retrievals of trace gases with a weak tropospheric signal. We analyze one full year of satellite observations and find that GEOS-Chem, driven with MEGANv2.1 biogenic emissions, underestimates observed methanol concentrations throughout the midlatitudes in springtime, with the timing of the seasonal peak in model emissions 1-2 months too late. We attribute this discrepancy to an underestimate of emissions from new leaves in MEGAN, and apply the satellite data to better quantify the seasonal change in methanol emissions for midlatitude ecosystems. The derived parameters (relative emission factors of 11.0, 1.0, 0.05 and 8.6 for new, growing, mature, and old leaves, respectively, plus a leaf area index activity factor of 0.75 for expanding canopies with leaf area index < 2.0) provide a more realistic simulation of seasonal methanol concentrations in midlatitudes on the basis of IASI, TES, and ground-based measurements
Acetylene (C2H2) and hydrogen cyanide (HCN) from IASI satellite observations: Global distributions, validation, and comparison with model
We present global distributions of CH and hydrogen cyanide (HCN) total columns derived from the Infrared Atmospheric Sounding Interferometer (IASI) for the years 2008–2010. These distributions are obtained with a fast method allowing to retrieve CH abundance globally with a 5% precision and HCN abundance in the tropical (subtropical) belt with a 10% (25 %) precision. IASI data are compared for validation purposes with ground-based Fourier transform infrared (FTIR) spectrometer measurements at four selected stations.We show that there is an overall agreement between the ground-based and space measurements with correlation coefficients for daily mean measurements ranging from 0.28 to 0.81, depending on the site. Global CH and subtropical HCN abundances retrieved from IASI spectra show the expected seasonality linked to variations in the anthropogenic emissions and seasonal biomass burning activity, as well as exceptional events, and are in good agreement with previous spaceborne studies. Total columns simulated by the Model for Ozone and Related Chemical Tracers, version 4 (MOZART-4) are compared to the ground-based FTIR measurements at the four selected stations. The model is able to capture the seasonality in the two species in most of the cases, with correlation coefficients for daily mean measurements ranging from 0.50 to 0.86, depending on the site. IASI measurements are also compared to the distributions from MOZART-4. Seasonal cycles observed from satellite data are reasonably well reproduced by the model with correlation coefficients ranging from -0.31 to 0.93 for CH daily means, and from 0.09 to 0.86 for HCN daily means, depending on the considered region. However, the anthropogenic (biomass burning) emissions used in the model seem to be overestimated (underestimated), and a negative global mean bias of 1% (16 %) of the model relative to the satellite observations was found for CH (HCN)
- …