142 research outputs found

    MTEB-French: Resources for French Sentence Embedding Evaluation and Analysis

    Recently, numerous embedding models have been made available and widely used for various NLP tasks. The Massive Text Embedding Benchmark (MTEB) has primarily simplified the process of choosing a model that performs well for several tasks in English, but extensions to other languages remain challenging. This is why we expand MTEB to propose the first massive benchmark of sentence embeddings for French. We gather 15 existing datasets in an easy-to-use interface and create three new French datasets for a global evaluation of 8 task categories. We compare 51 carefully selected embedding models on a large scale, conduct comprehensive statistical tests, and analyze the correlation between model performance and many of their characteristics. We find that, although no model is best on all tasks, large multilingual models pre-trained on sentence similarity perform exceptionally well. Our work comes with open-source code, new datasets and a public leaderboard.

    Social network data and epidemiological intelligence: A case study of avian influenza

    Purpose - Event Based Surveillance (EBS) systems detect and monitor diseases by analysing articles from online newspapers and reports from health organizations (e.g. FAO, OIE, etc.). However, they only partially integrate data from social networks, even though these data are present in large quantities on the web. The purpose of this study is to exploit social network data, such as Twitter and YouTube, to provide epidemiological and additional information for Avian Influenza surveillance. Methods & Materials - In this context, we propose new text-mining approaches combining lexical rules and statistical approaches in order to normalise textual data from social networks ('h5 n8' → 'H5N8') and to correct errors from YouTube transcriptions (e.g. 'birth flu' → 'bird flu'). Another challenge consists of extracting epidemiological events automatically by identifying spatial entities (Where?), thematic entities (What?), and temporal information (When?). For this, we extended Named Entity Recognition (NER) tools like spaCy. Results - We collected 100 automatic transcripts of YouTube videos and 268 tweets in English dealing with avian influenza with dedicated API. We obtain encouraging results (i.e. accuracy around 0.6) in automatically recognising epidemiological information (e.g. hosts, symptoms, etc.) in textual data contents. Extraction of spatial information obtains better results (i.e. accuracy around 0.7). Conclusion - The final objective of this study consists of linking social media data based on these entities with official information from health organisations for the improvement of epidemiological monitoring.

    Search for dark matter produced in association with bottom or top quarks in √s = 13 TeV pp collisions with the ATLAS detector

    A search for weakly interacting massive particle dark matter produced in association with bottom or top quarks is presented. Final states containing third-generation quarks and missing transverse momentum are considered. The analysis uses 36.1 fb⁻¹ of proton–proton collision data recorded by the ATLAS experiment at √s = 13 TeV in 2015 and 2016. No significant excess of events above the estimated backgrounds is observed. The results are interpreted in the framework of simplified models of spin-0 dark-matter mediators. For colour-neutral spin-0 mediators produced in association with top quarks and decaying into a pair of dark-matter particles, mediator masses below 50 GeV are excluded assuming a dark-matter candidate mass of 1 GeV and unitary couplings. For scalar and pseudoscalar mediators produced in association with bottom quarks, the search sets limits on the production cross-section of 300 times the predicted rate for mediators with masses between 10 and 50 GeV and assuming a dark-matter mass of 1 GeV and unitary coupling. Constraints on colour-charged scalar simplified models are also presented. Assuming a dark-matter particle mass of 35 GeV, mediator particles with mass below 1.1 TeV are excluded for couplings yielding a dark-matter relic density consistent with measurements.

    Structural and Spectroscopic Properties of Assemblies of Self-Replicating Peptide Macrocycles

    Self-replication at the molecular level is often seen as essential to the early origins of life. Recently a mechanism of self-replication has been discovered in which replicator self-assembly drives the process. We have studied one of the examples of such self-assembling self-replicating molecules to a high level of structural detail using a combination of computational and spectroscopic techniques. Molecular Dynamics simulations of self-assembled stacks of peptide-derived replicators provide insights into the structural characteristics of the system and serve as the basis for semiempirical calculations of the UV-vis, circular dichroism (CD) and infrared (IR) absorption spectra that reflect the chiral organization and peptide secondary structure of the stacks. Two proposed structural models are tested by comparing calculated spectra to experimental data from electron microscopy, CD and IR spectroscopy, resulting in a better insight into the specific supramolecular interactions that lead to self-replication. Specifically, we find a cooperative self-assembly process in which β-sheet formation leads to well-organized structures, while the aromatic core of the macrocycles also plays an important role in the stability of the resulting fibers.

    Rice APC/C^TE controls tillering by mediating the degradation of MONOCULM 1

    Rice MONOCULM 1 (MOC1) and its orthologues LS/LAS (lateral suppressor in tomato and Arabidopsis) are key promoting factors of shoot branching and tillering in higher plants. However, the molecular mechanisms regulating MOC1/LS/LAS have remained elusive. Here we show that the rice tiller enhancer (te) mutant displays a drastically increased tiller number. We demonstrate that TE encodes a rice homologue of Cdh1, and that TE acts as an activator of the anaphase-promoting complex/cyclosome (APC/C). We show that TE coexpresses with MOC1 in the axil of leaves, where the APC/C^TE complex mediates the degradation of MOC1 by the ubiquitin–26S proteasome pathway, and consequently downregulates the expression of the meristem identity gene Oryza sativa homeobox 1, thus repressing axillary meristem initiation and formation. We conclude that besides having a conserved role in regulating the cell cycle, APC/C^TE has a unique function in regulating the plant-specific postembryonic shoot branching and tillering, which are major determinants of plant architecture and grain yield.

    Ventral tegmental area dysfunction affects decision-making in patients with myotonic dystrophy type-1

    The clinical manifestations of Myotonic Dystrophy type-1 (DM1) are associated with a complex mixture of multisystem features including cognitive dysfunctions that strongly impact patients’ social and occupational functioning. Decision making, a function controlled by dopaminergic circuitry, is critical for succeeding in one’s social and professional life. We tested here the hypothesis that altered connectivity of the ventral tegmental area (VTA), one of the major sources of diffuse dopaminergic projections in the brain, might account for some higher-level dysfunctions observed in patients with DM1. In this case-control study, we recruited 31 patients with DM1 and 26 healthy controls who underwent the Iowa Gambling Task and resting-state functional MRI (RS-fMRI) at 3 T. Functional connectivity of the VTA was assessed using RS-fMRI. VTA connectivity was compared between 25 DM1 patients and all the controls, and the presence of associations between VTA connectivity and Iowa Gambling Task performance was also investigated. DM1 patients performed significantly worse than controls on the Iowa Gambling Task. A significant increase of functional connectivity was observed between VTA and the left supramarginal and superior temporal gyri in DM1 patients. Patients’ Iowa Gambling Task net-scores were strictly associated with VTA-driven functional connectivity in the bilateral supplementary motor area and right precentral gyrus. This study demonstrates a prominent deficit of decision-making in patients with DM1. It might be related to increased connectivity between VTA and brain areas critically involved in the reward/punishment system and social cognition. These findings indicate that dopaminergic function is a potential target for pharmacological and non-pharmacological interventions in DM1.

    Altimetry for the future: Building on 25 years of progress

    In 2018 we celebrated 25 years of development of radar altimetry, and the progress achieved by this methodology in the fields of global and coastal oceanography, hydrology, geodesy and cryospheric sciences. Many symbolic major events have celebrated these developments, e.g., in Venice, Italy, the 15th (2006) and 20th (2012) years of progress and more recently, in 2018, in Ponta Delgada, Portugal, 25 Years of Progress in Radar Altimetry. On this latter occasion it was decided to collect contributions of scientists, engineers and managers involved in the worldwide altimetry community to depict the state of altimetry and propose recommendations for the altimetry of the future. This paper summarizes contributions and recommendations that were collected and provides guidance for future mission design, research activities, and sustainable operational radar altimetry data exploitation. Recommendations provided are fundamental for optimizing further scientific and operational advances of oceanographic observations by altimetry, including requirements for spatial and temporal resolution of altimetric measurements, their accuracy and continuity. There are also new challenges and new openings mentioned in the paper that are particularly crucial for observations at higher latitudes, for coastal oceanography, for cryospheric studies and for hydrology. The paper starts with a general introduction followed by a section on Earth System Science including Ocean Dynamics, Sea Level, the Coastal Ocean, Hydrology, the Cryosphere and Polar Oceans and the “Green” Ocean, extending the frontier from biogeochemistry to marine ecology. Applications are described in a subsequent section, which covers Operational Oceanography, Weather, Hurricane Wave and Wind Forecasting, Climate projection. Instruments’ development and satellite missions’ evolutions are described in a fourth section. A fifth section covers the key observations that altimeters provide and their potential complements, from other Earth observation measurements to in situ data. Section 6 identifies the data and methods and provides some accuracy and resolution requirements for the wet tropospheric correction, the orbit and other geodetic requirements, the Mean Sea Surface, Geoid and Mean Dynamic Topography, Calibration and Validation, data accuracy, data access and handling (including the DUACS system). Section 7 brings a transversal view on scales, integration, artificial intelligence, and capacity building (education and training). Section 8 reviews the programmatic issues followed by a conclusion.

    The SIB Swiss Institute of Bioinformatics' resources: focus on curated databases

    Get PDF
    The SIB Swiss Institute of Bioinformatics (www.isb-sib.ch) provides world-class bioinformatics databases, software tools, services and training to the international life science community in academia and industry. These solutions allow life scientists to turn the exponentially growing amount of data into knowledge. Here, we provide an overview of SIB's resources and competence areas, with a strong focus on curated databases and SIB's most popular and widely used resources. In particular, SIB's Bioinformatics resource portal ExPASy features over 150 resources, including UniProtKB/Swiss-Prot, ENZYME, PROSITE, neXtProt, STRING, UniCarbKB, SugarBindDB, SwissRegulon, EPD, arrayMap, Bgee, SWISS-MODEL Repository, OMA, OrthoDB and other databases, which are briefly described in this article

    ATLAS Run 1 searches for direct pair production of third-generation squarks at the Large Hadron Collider

    This paper reviews and extends searches for the direct pair production of the scalar supersymmetric partners of the top and bottom quarks in proton-proton collisions collected by the ATLAS collaboration during the LHC Run 1. Most of the analyses use 20 fb⁻¹ of collisions at a centre-of-mass energy of √s = 8 TeV, although in some cases an additional 4.7 fb⁻¹ of collision data at √s = 7 TeV are used. New analyses are introduced to improve the sensitivity to specific regions of the model parameter space. Since no evidence of third-generation squarks is found, exclusion limits are derived by combining several analyses and are presented both in a simplified model framework, assuming simple decay chains, and within the context of more elaborate phenomenological supersymmetric models.

    Measurement of the charge asymmetry in top-quark pair production in the lepton-plus-jets final state in pp collision data at √s = 8 TeV with the ATLAS detector
