58 research outputs found

    K-HATERS: A Hate Speech Detection Corpus in Korean with Target-Specific Ratings

    Full text link
    Numerous datasets have been proposed to combat the spread of online hate. Despite these efforts, a majority of these resources are English-centric, primarily focusing on overt forms of hate. This research gap calls for developing high-quality corpora in diverse languages that also encapsulate more subtle hate expressions. This study introduces K-HATERS, a new corpus for hate speech detection in Korean, comprising approximately 192K news comments with target-specific offensiveness ratings. This resource is the largest offensive language corpus in Korean and is the first to offer target-specific ratings on a three-point Likert scale, enabling the detection of hate expressions in Korean across varying degrees of offensiveness. We conduct experiments showing the effectiveness of the proposed corpus, including a comparison with existing datasets. Additionally, to address potential noise and bias in human annotations, we explore a novel idea of adopting the Cognitive Reflection Test, which is widely used in social science for assessing an individual's cognitive ability, as a proxy of labeling quality. Findings indicate that annotations from individuals with the lowest test scores tend to yield detection models that make biased predictions toward specific target groups and are less accurate. This study contributes to the NLP research on hate speech detection and resource construction. The code and dataset can be accessed at https://github.com/ssu-humane/K-HATERS.Comment: 15 pages, EMNLP 2023 (Findings

    Positivity Bias in Customer Satisfaction Ratings

    Full text link
    Customer ratings are valuable sources to understand their satisfaction and are critical for designing better customer experiences and recommendations. The majority of customers, however, do not respond to rating surveys, which makes the result less representative. To understand overall satisfaction, this paper aims to investigate how likely customers without responses had satisfactory experiences compared to those respondents. To infer customer satisfaction of such unlabeled sessions, we propose models using recurrent neural networks (RNNs) that learn continuous representations of unstructured text conversation. By analyzing online chat logs of over 170,000 sessions from Samsung's customer service department, we make a novel finding that while labeled sessions contributed by a small fraction of customers received overwhelmingly positive reviews, the majority of unlabeled sessions would have received lower ratings by customers. The data analytics presented in this paper not only have practical implications for helping detect dissatisfied customers on live chat services but also make theoretical contributions on discovering the level of biases in online rating platforms.Comment: This paper will be presented at WWW'18 conferenc

    Who Is Missing? Characterizing the Participation of Different Demographic Groups in a Korean Nationwide Daily Conversation Corpus

    Full text link
    A conversation corpus is essential to build interactive AI applications. However, the demographic information of the participants in such corpora is largely underexplored mainly due to the lack of individual data in many corpora. In this work, we analyze a Korean nationwide daily conversation corpus constructed by the National Institute of Korean Language (NIKL) to characterize the participation of different demographic (age and sex) groups in the corpus.Comment: Accepted in AAAI ICWSM'2

    Extension of the crRNA enhances Cpf1 gene editing in vitro and in vivo.

    Get PDF
    Engineering of the Cpf1 crRNA has the potential to enhance its gene editing efficiency and non-viral delivery to cells. Here, we demonstrate that extending the length of its crRNA at the 5 end can enhance the gene editing efficiency of Cpf1 both in cells and in vivo. Extending the 5 end of the crRNA enhances the gene editing efficiency of the Cpf1 RNP to induce non-homologous end-joining and homology-directed repair using electroporation in cells. Additionally, chemical modifications on the extended 5 end of the crRNA result in enhanced serum stability. Also, extending the 5 end of the crRNA by 59 nucleotides increases the delivery efficiency of Cpf1 RNP in cells and in vivo cationic delivery vehicles including polymer nanoparticle. Thus, 5 extension and chemical modification of the Cpf1 crRNA is an effective method for enhancing the gene editing efficiency of Cpf1 and its delivery in vivo

    Tracing out the Berry curvature dipole and multipoles in second harmonic Hall responses of time-reversal symmetric insulators

    Get PDF
    Various nonlinear characteristics of solid states, such as the circular photogalvanic effect of time-reversal symmetric insulators, the quantized photogalvanic effect of Weyl semimetals, and the nonlinear Hall effect of time-reversal symmetric metals, have been associated with the Berry curvature dipole (BCD). Here, we explore the question of whether the Berry curvature dipole and multipoles of time-reversal symmetric insulators can be traced in the nonlinear optical responses. We performed real-time time-dependent density functional theory calculations and examined the second harmonic generation susceptibility tensors. The two-band term of the susceptibility tensor is sharply proportional to the interband BCD, dominating over the Hall response once the cancellation effect of the multiple reflection symmetries is lifted. We suggest that the nonlinear Hall component of the second-harmonic spectra of insulators can also be utilized as an effective tool to extract the band structure geometry through Berry curvature dipole and possibly multipoles.Comment: main text: 21 pages with 3 figures; supplementary material: 11 pages with 3 figure

    Proceedings of IMECE2008

    Get PDF
    ABSTRACT Several healthcare products have been developed and marketed in recent times as a result of people's growing interest in personal health. Unstable shoes have been introduced to revitalize the muscles of the lower limbs and to modify the gait posture while walking. However, healthcare products for people should first be proved functional and safe, as some of those can sometimes result in severe injuries and side effects. Certification is, therefore, necessary in the case of unstable shoes. In this study, the functionality of unstable shoes was analyzed; it was proved that difference in pressure distribution resulting from the shape of the unstable shoes helps strengthen the muscles of the lower limbs. These analyses focused on the activation of the muscles by employing EMG (Electromyography). However, the approach involving EMG cannot carry out measurements on hidden muscles, and the noise involved is a source of potential error; therefore, this study utilizes the simulation software SIMM (Software for Interactive Musculoskeletal Modeling) for this purpose. We performed a biomechanical study using a full-body musculoskeletal model. Using the captured 3D motion data and ground reaction forces data, kinetic data was calculated in order to determine its influence on the adjacent segments. We captured the movements of six volunteers, all males in their twenties. The volunteers wore both unstable and normal shoes during each trial. This study focuses on the activation of muscles of the lower limbs when wearing unstable shoes. We inspected the muscles and analyzed the disparities between unstable and normal shoes. We observed from experimental results that most muscles of the lower limbs were revitalized. Further, we observed an improvement in the gait posture after unstable shoes were used for a period of 12 weeks. This analysis of inner muscles that cannot be examined by direct methods can help consumers make informed choices regarding healthcare products. Such analysis is made possible by simulation programs such as SIMM
    corecore