258 research outputs found

    SentiBench - a benchmark comparison of state-of-the-practice sentiment analysis methods

    Get PDF
    In the last few years thousands of scientific papers have investigated sentiment analysis, several startups that measure opinions on real data have emerged and a number of innovative products related to this theme have been developed. There are multiple methods for measuring sentiments, including lexical-based and supervised machine learning methods. Despite the vast interest on the theme and wide popularity of some methods, it is unclear which one is better for identifying the polarity (i.e., positive or negative) of a message. Accordingly, there is a strong need to conduct a thorough apple-to-apple comparison of sentiment analysis methods, \textit{as they are used in practice}, across multiple datasets originated from different data sources. Such a comparison is key for understanding the potential limitations, advantages, and disadvantages of popular methods. This article aims at filling this gap by presenting a benchmark comparison of twenty-four popular sentiment analysis methods (which we call the state-of-the-practice methods). Our evaluation is based on a benchmark of eighteen labeled datasets, covering messages posted on social networks, movie and product reviews, as well as opinions and comments in news articles. Our results highlight the extent to which the prediction performance of these methods varies considerably across datasets. Aiming at boosting the development of this research area, we open the methods' codes and datasets used in this article, deploying them in a benchmark system, which provides an open API for accessing and comparing sentence-level sentiment analysis methods

    Spatial Evaluation and Modeling of Dengue Seroprevalence and Vector Density in Rio de Janeiro, Brazil

    Get PDF
    Dengue is a major public health problem in many tropical regions of the world, including Brazil, where Aedes aegypti is the main vector. We present a household study that combines data on dengue fever seroprevalence, recent dengue infection, and vector density, in three neighborhoods of Rio de Janeiro, Brazil, during its most devastating dengue epidemic to date. This integrated entomological–serological survey showed evidence of silent transmission even during a severe epidemic. Also, past exposure to dengue virus was highly associated with age and living in areas of high movement of individuals and social/commercial activity. No association was observed between household infestation index and risk of dengue infection in these areas. Our findings are discussed in the light of current theories regarding transmission thresholds and relative role of mosquitoes and humans as vectors of dengue viruses

    Pathogen-Specific Epitopes as Epidemiological Tools for Defining the Magnitude of Mycobacterium leprae Transmission in Areas Endemic for Leprosy

    Get PDF
    During recent years, comparative genomic analysis has allowed the identification of Mycobacterium leprae-specific genes with potential application for the diagnosis of leprosy. In a previous study, 58 synthetic peptides derived from these sequences were tested for their ability to induce production of IFN-γ in PBMC from endemic controls (EC) with unknown exposure to M. leprae, household contacts of leprosy patients and patients, indicating the potential of these synthetic peptides for the diagnosis of sub- or preclinical forms of leprosy. In the present study, the patterns of IFN-γ release of the individuals exposed or non-exposed to M. leprae were compared using an Artificial Neural Network algorithm, and the most promising M. leprae peptides for the identification of exposed people were selected. This subset of M. leprae-specific peptides allowed the differentiation of groups of individuals from sites hyperendemic for leprosy versus those from areas with lower level detection rates. A progressive reduction in the IFN-γ levels in response to the peptides was seen when contacts of multibacillary (MB) patients were compared to other less exposed groups, suggesting a down modulation of IFN-γ production with an increase in bacillary load or exposure to M. leprae. The data generated indicate that an IFN-γ assay based on these peptides applied individually or as a pool can be used as a new tool for predicting the magnitude of M. leprae transmission in a given population

    Impact of TGF-ß1 -509C/T and 869T/C polymorphisms on glioma risk and patient prognosis

    Get PDF
    Transforming growth factor beta (TGF-ß) plays an important role in carcinogenesis. Two polymorphisms in the TGF-ß1 gene (-509C/T and 869T/C) were described to influence susceptibility to gastric and breast cancers. The 869T/C polymorphism was also associated with overall survival in breast cancer patients. In the present study, we investigated the relevance of these TGF-ß1 polymorphism in glioma risk and prognosis. A case-control study that included 114 glioma patients and 138 cancer-free controls was performed. Single nucleotide polymorphisms (SNPs) were evaluated by polymerase chain reaction followed by restriction fragment length polymorphism (PCR-RFLP). Univariate and multivariate logistic regression analyses were used to calculate odds ratio (OR) and 95 % confidence intervals (95 % CI). The influence of TGF-ß1 -509C/T and 869T/C polymorphisms on glioma patient survival was evaluated by a Cox regression model adjusted for patients' age and sex and represented in Kaplan-Meier curves. Our results demonstrated that TGF-ß1 gene polymorphisms -509C/T and 869T/C are not significantly associated with glioma risk. Survival analyses showed that the homozygous -509TT genotype associates with longer overall survival of glioblastoma (GBM) patients when compared with patients carrying CC + CT genotypes (OR, 2.41; 95 % CI, 1.06-5.50; p = 0.036). In addition, the homozygous 869CC genotype is associated with increased overall survival of GBM patients when compared with 869TT + TC genotypes (OR, 2.62; 95 % CI, 1.11-6.17; p = 0.027). In conclusion, this study suggests that TGF-ß1 -509C/T and 869T/C polymorphisms are not significantly associated with risk for developing gliomas but may be relevant prognostic biomarkers in GBM patients.This work was supported by Fundação para a Ciência e Tecnologia, Portugal (PTDC/SAU-GMG/113795/2009 and SFRH/BPD/33612/2009 to B.M.C.; SFRH/BD/88121/2012 to J.V.C.; SFRH/BD/92786/2013 to C.S.G.; PTDC/SAU-ONC/115513/2009 to R.R.)

    Monoculture of Leafcutter Ant Gardens

    Get PDF
    Background -- Leafcutter ants depend on the cultivation of symbiotic Attamyces fungi for food, which are thought to be grown by the ants in single-strain, clonal monoculture throughout the hundreds to thousands of gardens within a leafcutter nest. Monoculture eliminates cultivar-cultivar competition that would select for competitive fungal traits that are detrimental to the ants, whereas polyculture of several fungi could increase nutritional diversity and disease resistance of genetically variable gardens. Methodology/Principal Findings -- Using three experimental approaches, we assessed cultivar diversity within nests of Atta leafcutter ants, which are most likely among all fungus-growing ants to cultivate distinct cultivar genotypes per nest because of the nests' enormous sizes (up to 5000 gardens) and extended lifespans (10–20 years). In Atta texana and in A. cephalotes, we resampled nests over a 5-year period to test for persistence of resident cultivar genotypes within each nest, and we tested for genetic differences between fungi from different nest sectors accessed through excavation. In A. texana, we also determined the number of Attamyces cells carried as a starter inoculum by a dispersing queens (minimally several thousand Attamyces cells), and we tested for genetic differences between Attamyces carried by sister queens dispersing from the same nest. Except for mutational variation arising during clonal Attamyces propagation, DNA fingerprinting revealed no evidence for fungal polyculture and no genotype turnover during the 5-year surveys. Conclusions/Significance -- Atta leafcutter ants can achieve stable, fungal monoculture over many years. Mutational variation emerging within an Attamyces monoculture could provide genetic diversity for symbiont choice (gardening biases of the ants favoring specific mutational variants), an analog of artificial selection.The research was supported by National Science Foundation awards DEB-0920138, DEB-0639879, and DEB-0110073 to UGM; DEB-0949689 to T.R. Schultz, N. Mehdiabadi, and UGM; and a Fellowship (02/05) from the Conselho Nacional de Desenvolvimento Científico e Tecnológico to AR. The funding agencies had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.Biological Sciences, School o

    DNA Barcode Detects High Genetic Structure within Neotropical Bird Species

    Get PDF
    BACKGROUND: Towards lower latitudes the number of recognized species is not only higher, but also phylogeographic subdivision within species is more pronounced. Moreover, new genetically isolated populations are often described in recent phylogenies of Neotropical birds suggesting that the number of species in the region is underestimated. Previous COI barcoding of Argentinean bird species showed more complex patterns of regional divergence in the Neotropical than in the North American avifauna. METHODS AND FINDINGS: Here we analyzed 1,431 samples from 561 different species to extend the Neotropical bird barcode survey to lower latitudes, and detected even higher geographic structure within species than reported previously. About 93% (520) of the species were identified correctly from their DNA barcodes. The remaining 41 species were not monophyletic in their COI sequences because they shared barcode sequences with closely related species (N = 21) or contained very divergent clusters suggestive of putative new species embedded within the gene tree (N = 20). Deep intraspecific divergences overlapping with among-species differences were detected in 48 species, often with samples from large geographic areas and several including multiple subspecies. This strong population genetic structure often coincided with breaks between different ecoregions or areas of endemism. CONCLUSIONS: The taxonomic uncertainty associated with the high incidence of non-monophyletic species and discovery of putative species obscures studies of historical patterns of species diversification in the Neotropical region. We showed that COI barcodes are a valuable tool to indicate which taxa would benefit from more extensive taxonomic revisions with multilocus approaches. Moreover, our results support hypotheses that the megadiversity of birds in the region is associated with multiple geographic processes starting well before the Quaternary and extending to more recent geological periods
    • …
    corecore