9 research outputs found
Improving Sentiment Polarity Detection Through Target Identification
In an opinionated long review, there may be several targets described by different potential terms. Traditional review-level techniques for Persian sentiment analysis addressed the problem using a one-method-fits-all solution in which the overall polarity of a review is calculated using all its opinionated words without considering their target. In this article, a new method is proposed, which first decomposes a long review into its constituent sentences and then detects the main target of each sentence. In the next step, five policies, including most occurring first (MOF), most general first (MGF), most specific first (MSF), first occurring first (FOF), and last occurring first (LOF), are proposed to come up with the main target of the review. Finally, using the part-of-speech (POS) tags, potential terms in the sentences are specified and a comprehensive sentiment lexicon is employed to compute the polarity of the sentences. In order to evaluate the proposed method, three data sets of user reviews about different topics, including digital equipment, hotels, and movies, are created as no previous study addressed the problem of target identification in the Persian language. The results of comparing the proposed method with a state-of-the-art lexicon-based method show that specifying the main targets of reviews can improve the performance of the systems about 17% and 12% in terms of accuracy and F1-measure. Moreover, the proposed method using the MGF policy achieves the best performance in finding the main target of reviews, while for finding the ultimate polarity of reviews, the MOF outperforms other policies
PARSINLU: A Suite of Language Understanding Challenges for Persian
Despite the progress made in recent years in addressing natural language understanding (NLU) challenges, the majority of this progress remains to be concentrated on resource-rich languages like English. This work focuses on Persian language, one of the widely spoken languages in the world, and yet there are few NLU datasets available for this language. The availability of high-quality evaluation datasets is a necessity for reliable assessment of the progress on different NLU tasks and domains. We introduce PARSINLU, the first benchmark in Persian language that includes a range of language understanding tasks-reading comprehension, textual entailment, and so on. These datasets are collected in a multitude of ways, often involving manual annotations by native speakers. This results in over 14.5k new instances across 6 distinct NLU tasks. Additionally, we present the first results on state-of-the-art monolingual and multilingual pre-trained language models on this benchmark and compare them with human performance, which provides valuable insights into our ability to tackle natural language understanding challenges in Persian. We hope PARSINLU fosters further research and advances in Persian language understanding.(1
Time-dependent analysis of extra length of stay and mortality due to ventilator-associated pneumonia in intensive-care units of ten limited-resources countries: findings of the International Nosocomial Infection Control Consortium (INICC)
Ventilator-associated pneumonias (VAPs) are a worldwide problem that significantly increases patient morbidity, mortality, and length of stay (LoS), and their effects should be estimated to account for the timing of infection. The purpose of the study was to estimate extra LoS and mortality in an intensive-care unit (ICU) due to a VAP in a cohort of 69 248 admissions followed for 283 069 days in ICUs from 10 countries. Data were arranged according to the multi-state format. Extra LoS and increased risk of death were estimated independently in each country, and their results were combined using a random-effects meta-analysis. VAP prolonged LoS by an average of 2.03 days (95% CI 1.52-2.54 days), and increased the risk of death by 14% (95% CI 2-27). The increased risk of death due to VAP was explained by confounding with patient morbidity
International Nosocomial Infection Control Consortiu (INICC) report, data summary of 43 countries for 2007-2012. Device-associated module
We report the results of an International Nosocomial Infection Control Consortium (INICC) surveillance study from January 2007-December 2012 in 503 intensive care units (ICUs) in Latin America, Asia, Africa, and Europe. During the 6-year study using the Centers for Disease Control and Prevention's (CDC) U.S. National Healthcare Safety Network (NHSN) definitions for device-associated health care–associated infection (DA-HAI), we collected prospective data from 605,310 patients hospitalized in the INICC's ICUs for an aggregate of 3,338,396 days. Although device utilization in the INICC's ICUs was similar to that reported from ICUs in the U.S. in the CDC's NHSN, rates of device-associated nosocomial infection were higher in the ICUs of the INICC hospitals: the pooled rate of central line–associated bloodstream infection in the INICC's ICUs, 4.9 per 1,000 central line days, is nearly 5-fold higher than the 0.9 per 1,000 central line days reported from comparable U.S. ICUs. The overall rate of ventilator-associated pneumonia was also higher (16.8 vs 1.1 per 1,000 ventilator days) as was the rate of catheter-associated urinary tract infection (5.5 vs 1.3 per 1,000 catheter days). Frequencies of resistance of Pseudomonas isolates to amikacin (42.8% vs 10%) and imipenem (42.4% vs 26.1%) and Klebsiella pneumoniae isolates to ceftazidime (71.2% vs 28.8%) and imipenem (19.6% vs 12.8%) were also higher in the INICC's ICUs compared with the ICUs of the CDC's NHSN