31 research outputs found

    TweetLID : a benchmark for tweet language identification

    Get PDF
    Language identification, as the task of determining the language a given text is written in, has progressed substantially in recent decades. However, three main issues remain still unresolved: (1) distinction of similar languages, (2) detection of multilingualism in a single document, and (3) identifying the language of short texts. In this paper, we describe our work on the development of a benchmark to encourage further research in these three directions, set forth an evaluation framework suitable for the task, and make a dataset of annotated tweets publicly available for research purposes. We also describe the shared task we organized to validate and assess the evaluation framework and dataset with systems submitted by seven different participants, and analyze the performance of these systems. The evaluation of the results submitted by the participants of the shared task helped us shed some light on the shortcomings of state-of-the-art language identification systems, and gives insight into the extent to which the brevity, multilingualism, and language similarity found in texts exacerbate the performance of language identifiers. Our dataset with nearly 35,000 tweets and the evaluation framework provide researchers and practitioners with suitable resources to further study the aforementioned issues on language identification within a common setting that enables to compare results with one another

    Validation of diabetes mellitus and hypertension diagnosis in computerized medical records in primary health care

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Computerized Clinical Records, which are incorporated in primary health care practice, have great potential for research. In order to use this information, data quality and reliability must be assessed to prevent compromising the validity of the results.</p> <p>The aim of this study is to validate the diagnosis of hypertension and diabetes mellitus in the computerized clinical records of primary health care, taking the diagnosis criteria established in the most prominently used clinical guidelines as the gold standard against which what measure the sensitivity, specificity, and determine the predictive values.</p> <p>The gold standard for diabetes mellitus was the diagnostic criteria established in 2003 American Diabetes Association Consensus Statement for diabetic subjects. The gold standard for hypertension was the diagnostic criteria established in the Joint National Committee published in 2003.</p> <p>Methods</p> <p>A cross-sectional multicentre validation study of diabetes mellitus and hypertension diagnoses in computerized clinical records of primary health care was carried out. Diagnostic criteria from the most prominently clinical practice guidelines were considered for standard reference.</p> <p>Sensitivity, specificity, positive and negative predictive values, and global agreement (with kappa index), were calculated. Results were shown overall and stratified by sex and age groups.</p> <p>Results</p> <p>The agreement for diabetes mellitus with the reference standard as determined by the guideline was almost perfect (κ = 0.990), with a sensitivity of 99.53%, a specificity of 99.49%, a positive predictive value of 91.23% and a negative predictive value of 99.98%.</p> <p>Hypertension diagnosis showed substantial agreement with the reference standard as determined by the guideline (κ = 0.778), the sensitivity was 85.22%, the specificity 96.95%, the positive predictive value 85.24%, and the negative predictive value was 96.95%. Sensitivity results were worse in patients who also had diabetes and in those aged 70 years or over.</p> <p>Conclusions</p> <p>Our results substantiate the validity of using diagnoses of diabetes and hypertension found within the computerized clinical records for epidemiologic studies.</p

    Investigating Ebola virus pathogenicity using Molecular Dynamics

    Get PDF
    Background: Ebolaviruses have been known to cause deadly disease in humans for 40 years and have recently been demonstrated in West Africa to be able to cause large outbreaks. Four Ebolavirus species cause severe disease associated with high mortality in humans. Reston viruses are the only Ebolaviruses that do not cause disease in humans. Conserved amino acid changes in the Reston virus protein VP24 compared to VP24 of other Ebolaviruses have been suggested to alter VP24 binding to host cell karyopherins resulting in impaired inhibition of interferon signalling, which may explain the difference in human pathogenicity. Here we used protein structural analysis and molecular dynamics to further elucidate the interaction between VP24 and KPNA5. Results: As a control experiment, we compared the interaction of wild-type and R137A-mutant (known to affect KPNA5 binding) Ebola virus VP24 with KPNA5. Results confirmed that the R137A mutation weakens direct VP24-KPNA5 binding and enables water molecules to penetrate at the interface. Similarly, Reston virus VP24 displayed a weaker interaction with KPNA5 than Ebola virus VP24, which is likely to reduce the ability of Reston virus VP24 to prevent host cell interferon signalling. Conclusion: Our results provide novel molecular detail on the interaction of Reston virus VP24 and Ebola virus VP24 with human KPNA5. The results indicate a weaker interaction of Reston virus VP24 with KPNA5 than Ebola virus VP24, which is probably associated with a decreased ability to interfere with the host cell interferon response. Hence, our study provides further evidence that VP24 is a key player in determining Ebolavirus pathogenicity

    The impact of stress on tournament entry

    Get PDF
    Individual willingness to enter competitive environments predicts career choices and labor market outcomes. Meanwhile, many people experience competitive contexts as stressful. We use two laboratory experiments to investigate whether factors related to stress can help explain individual differences in tournament entry. Experiment 1 studies whether stress responses (measured as salivary cortisol) to taking part in a mandatory tournament predict individual willingness to participate in a voluntary tournament. We find that competing increases stress levels. This cortisol response does not predict tournament entry for men but is positively and significantly correlated with choosing to enter the tournament for women. In Experiment 2, we exogenously induce physiological stress using the cold-pressor task. We find a positive causal effect of stress on tournament entry for women but no effect for men. Finally, we show that although the effect of stress on tournament entry differs between the genders, stress reactions cannot explain the well-documented gender difference in willingness to compete
    corecore