23 research outputs found

    Irony Detection in Twitter with Imbalanced Class Distributions

    Irony detection is a non-trivial problem, and solving it can help improve natural language processing tasks such as sentiment analysis. When dealing with social media data in real scenarios, an important issue to address is data skew, i.e. the imbalance between the available ironic and non-ironic samples. In this work, the main objective is to address irony detection in Twitter considering various degrees of imbalanced distribution between classes. We rely on the emotIDM irony detection model, which we evaluated against both benchmark corpora and skewed Twitter datasets collected to simulate a realistic distribution of ironic tweets. We carry out a set of classification experiments aimed at determining the impact of class imbalance on detecting irony, and we evaluate the performance of irony detection under different scenarios. We experiment with a set of classifiers, applying class imbalance techniques to compensate for the skewed class distribution. Our results indicate that such techniques make it possible to improve the performance of irony detection in imbalanced class scenarios.
    The first author was funded by CONACYT project FC-2016/2410. Ronaldo Prati was supported by the São Paulo State (Brazil) research council FAPESP under project 2015/20606-6. Francisco Herrera was partially supported by the Spanish National Research Project TIN2017-89517-P. The work of Paolo Rosso was partially supported by the Spanish MICINN under the research project MISMIS (PGC2018-096212-B-C31) and by the Generalitat Valenciana under the grant PROMETEO/2019/121.
    Hernandez-Farias, DI.; Prati, R.; Herrera, F.; Rosso, P. (2020). Irony Detection in Twitter with Imbalanced Class Distributions. Journal of Intelligent & Fuzzy Systems. 39(2):2147-2163. https://doi.org/10.3233/JIFS-179880

    Assessment of the floatability of chalcopyrite, molybdenite and pyrite using biosolids and their main components as collectors for greening the froth flotation of copper sulphide ores

    Biosolids and representative compounds of their main components (humic acids, sugars, and proteins) have been tested as possible environment-friendly collectors and frothers for the flotation of copper sulphide ores. The floatability of chalcopyrite and molybdenite (both valuable sulphide minerals present in these ores), as well as that of non-valuable pyrite, was assessed through Hallimond tube flotation tests. Humic acids exhibit a collector ability for chalcopyrite and molybdenite similar to that of a commercial collector (Aero 6697 promoter). Biosolids show more affinity for pyrite. The copper recovery (85.9%) and copper grade (6.7%) of a rougher concentrate obtained using humic acids as the main collector for the flotation of a copper sulphide ore from Chile were very similar to those of a copper concentrate produced by froth flotation under the same conditions with a xanthate-type commercial collector. This new and feasible end use could make biosolids and humic acids environment-friendly organic froth flotation agents for greening the concentration of copper sulphide ores. Further research is needed to scale the current laboratory assays up to operational mining scale and to determine their efficiency at industrial scale.

    Impact of COVID-19 on cardiovascular testing in the United States versus the rest of the world

    Objectives: This study sought to quantify and compare the decline in volumes of cardiovascular procedures between the United States and non-US institutions during the early phase of the coronavirus disease-2019 (COVID-19) pandemic. Background: The COVID-19 pandemic has disrupted the care of many non-COVID-19 illnesses. Reductions in diagnostic cardiovascular testing around the world have led to concerns over the implications of reduced testing for cardiovascular disease (CVD) morbidity and mortality. Methods: Data were submitted to the INCAPS-COVID (International Atomic Energy Agency Non-Invasive Cardiology Protocols Study of COVID-19), a multinational registry comprising 909 institutions in 108 countries (including 155 facilities in 40 U.S. states), assessing the impact of the COVID-19 pandemic on volumes of diagnostic cardiovascular procedures. Data were obtained for April 2020 and compared with volumes of baseline procedures from March 2019. We compared laboratory characteristics, practices, and procedure volumes between U.S. and non-U.S. facilities and between U.S. geographic regions and identified factors associated with volume reduction in the United States. Results: Reductions in the volumes of procedures in the United States were similar to those in non-U.S. facilities (68% vs. 63%, respectively; p = 0.237), although U.S. facilities reported greater reductions in invasive coronary angiography (69% vs. 53%, respectively; p < 0.001). Significantly more U.S. facilities than non-U.S. facilities reported increased use of telehealth and of patient screening measures such as temperature checks, symptom screenings, and COVID-19 testing. Reductions in volumes of procedures differed between U.S. regions, with larger declines observed in the Northeast (76%) and Midwest (74%) than in the South (62%) and West (44%). Prevalence of COVID-19, staff redeployments, outpatient centers, and urban centers were associated with greater reductions in volume in U.S. facilities in a multivariable analysis. Conclusions: We observed marked reductions in U.S. cardiovascular testing in the early phase of the pandemic and significant variability between U.S. regions. The association between reductions of volumes and COVID-19 prevalence in the United States highlighted the need for proactive efforts to maintain access to cardiovascular testing in areas most affected by outbreaks of COVID-19 infection.

    Canagliflozin and renal outcomes in type 2 diabetes and nephropathy

    BACKGROUND Type 2 diabetes mellitus is the leading cause of kidney failure worldwide, but few effective long-term treatments are available. In cardiovascular trials of inhibitors of sodium–glucose cotransporter 2 (SGLT2), exploratory results have suggested that such drugs may improve renal outcomes in patients with type 2 diabetes. METHODS In this double-blind, randomized trial, we assigned patients with type 2 diabetes and albuminuric chronic kidney disease to receive canagliflozin, an oral SGLT2 inhibitor, at a dose of 100 mg daily or placebo. All the patients had an estimated glomerular filtration rate (GFR) of 30 to <90 ml per minute per 1.73 m² of body-surface area and albuminuria (ratio of albumin [mg] to creatinine [g], >300 to 5000) and were treated with renin–angiotensin system blockade. The primary outcome was a composite of end-stage kidney disease (dialysis, transplantation, or a sustained estimated GFR of <15 ml per minute per 1.73 m²), a doubling of the serum creatinine level, or death from renal or cardiovascular causes. Prespecified secondary outcomes were tested hierarchically. RESULTS The trial was stopped early after a planned interim analysis on the recommendation of the data and safety monitoring committee. At that time, 4401 patients had undergone randomization, with a median follow-up of 2.62 years. The relative risk of the primary outcome was 30% lower in the canagliflozin group than in the placebo group, with event rates of 43.2 and 61.2 per 1000 patient-years, respectively (hazard ratio, 0.70; 95% confidence interval [CI], 0.59 to 0.82; P=0.00001). The relative risk of the renal-specific composite of end-stage kidney disease, a doubling of the creatinine level, or death from renal causes was lower by 34% (hazard ratio, 0.66; 95% CI, 0.53 to 0.81; P<0.001), and the relative risk of end-stage kidney disease was lower by 32% (hazard ratio, 0.68; 95% CI, 0.54 to 0.86; P=0.002). The canagliflozin group also had a lower risk of cardiovascular death, myocardial infarction, or stroke (hazard ratio, 0.80; 95% CI, 0.67 to 0.95; P=0.01) and hospitalization for heart failure (hazard ratio, 0.61; 95% CI, 0.47 to 0.80; P<0.001). There were no significant differences in rates of amputation or fracture. CONCLUSIONS In patients with type 2 diabetes and kidney disease, the risk of kidney failure and cardiovascular events was lower in the canagliflozin group than in the placebo group at a median follow-up of 2.62 years.

    VIII Encuentro de Docentes e Investigadores en Historia del Diseño, la Arquitectura y la Ciudad

    Conference proceedings. The commemoration of the centenary of the University Reform of 1918 offered a fitting occasion to debate the role of history, theory, and criticism in the training and professional practice of designers, architects, and urban planners. In that context, the VIII Encuentro de Docentes e Investigadores en Historia del Diseño, la Arquitectura y la Ciudad provided a space for exchange and reflection, made possible by the collaboration between the Faculty of Architecture, Urbanism and Design of the Universidad Nacional and the Faculty of Architecture of the Universidad Católica de Córdoba, with the active participation of most of the Faculties, Centers, and Institutes of History of Architecture in the country and the region. Addressed to both teachers and students of Architecture and Industrial Design at all levels of the FAUD-UNC, it promoted the debate of ideas based on concrete experiences, in formats such as interdisciplinary thematic panels organized around the presentation of papers, among other activities. During the VIII Encuentro, held at the Ciudad Universitaria campus in Córdoba, numerous positions were set out on teaching, research, and training in the history, theory, and criticism of design, architecture, and the city, complemented by the lectures given by Ana Clarisa Agüero, Bibiana Cicutti, Fernando Aliata, and Alberto Petrina. The papers published in this UNC Repository are the result of two intense days of presentations, whose content made it possible to revisit old dilemmas and prompt new debates. The event was supported by the authorities of the FAUD-UNC, in particular the Research Secretariat and the Library of our institution, as well as by the Faculty of Architecture of the UCC; special thanks go to all of them.

    Diagnosis of peripheral neuropathies: some factors of relevance for diagnosis

    The present study was designed to evaluate the influence of clinical and complementary examinations on the diagnosis of peripheral neuropathies. Laboratory evaluation was performed in 81.8% of patients, electromyography in 47.4%, and biopsy in 22.5%. A syndromic diagnosis was made in 99.0%, a topographic diagnosis in 98.6%, and an etiological diagnosis in 73.2%. On average, 4.8 laboratory tests were requested per patient and, of the 93 different types of tests ordered, 36 were always normal. The importance of the findings is discussed.

    Diagnosis of peripheral neuropathies: profile of patients with non-established etiological diagnosis

    No etiological diagnosis could be established for 35 of the 209 patients with peripheral neuropathies who were studied. Mean patient age was 37.7 years and the preferentially affected age range was 20 to 50 years. Laboratory investigation was more extensive among these patients than among patients with a defined diagnosis. An 11:1 ratio was obtained between normal and abnormal laboratory tests. The major topographic diagnoses were: axonal polyneuropathies, 5 cases; multineuropathies and radiculopathies, 3 cases each. The present findings are discussed and compared with the literature concerning neuropathies of undefined diagnosis.

    Learning from imbalanced data sets
