10,725 research outputs found
"How May I Help You?": Modeling Twitter Customer Service Conversations Using Fine-Grained Dialogue Acts
Given the increasing popularity of customer service dialogue on Twitter,
analysis of conversation data is essential to understand trends in customer and
agent behavior for the purpose of automating customer service interactions. In
this work, we develop a novel taxonomy of fine-grained "dialogue acts"
frequently observed in customer service, showcasing acts that are more suited
to the domain than the more generic existing taxonomies. Using a sequential
SVM-HMM model, we model conversation flow, predicting the dialogue act of a
given turn in real-time. We characterize differences between customer and agent
behavior in Twitter customer service conversations, and investigate the effect
of testing our system on different customer service industries. Finally, we use
a data-driven approach to predict important conversation outcomes: customer
satisfaction, customer frustration, and overall problem resolution. We show
that the type and location of certain dialogue acts in a conversation have a
significant effect on the probability of desirable and undesirable outcomes,
and present actionable rules based on our findings. The patterns and rules we
derive can be used as guidelines for outcome-driven automated customer service
platforms.Comment: 13 pages, 6 figures, IUI 201
Pattern Learning for Detecting Defect Reports and Improvement Requests in App Reviews
Online reviews are an important source of feedback for understanding
customers. In this study, we follow novel approaches that target this absence
of actionable insights by classifying reviews as defect reports and requests
for improvement. Unlike traditional classification methods based on expert
rules, we reduce the manual labour by employing a supervised system that is
capable of learning lexico-semantic patterns through genetic programming.
Additionally, we experiment with a distantly-supervised SVM that makes use of
noisy labels generated by patterns. Using a real-world dataset of app reviews,
we show that the automatically learned patterns outperform the manually created
ones, to be generated. Also the distantly-supervised SVM models are not far
behind the pattern-based solutions, showing the usefulness of this approach
when the amount of annotated data is limited.Comment: Accepted for publication in the 25th International Conference on
Natural Language & Information Systems (NLDB 2020), DFKI Saarbr\"ucken
Germany, June 24-26 202
Targeted gene next-generation sequencing panel in patients with advanced lung adenocarcinoma: Paving the way for clinical implementation
Identification of targetable molecular changes is essential for selecting appropriate treatment in patients with advanced lung adenocarcinoma. Methods: In this study, a Sanger sequencing plus Fluorescence In Situ Hybridization (FISH) sequential approach was compared with a Next-Generation Sequencing (NGS)-based approach for the detection of actionable genomic mutations in an experimental cohort (EC) of 117 patients with advanced lung adenocarcinoma. Its applicability was assessed in small biopsies and cytology specimens previously tested for epidermal growth factor receptor (EGFR) and anaplastic lymphoma kinase (ALK) mutational status, comparing the molecular changes identified and the impact on clinical outcomes. Subsequently, an NGS-based approach was applied and tested in an implementation cohort (IC) in clinical practice. Using Sanger and FISH, patients were classified as EGFR-mutated (n = 22, 18.8%), ALK-mutated (n = 9, 7.7%), and unclassifiable (UC) (n = 86, 73.5%). Retesting the EC with NGS led to the identification of at least one gene variant in 56 (47.9%) patients, totaling 68 variants among all samples. Still, in the EC, combining NGS plus FISH for ALK, patients were classified as 23 (19.7%) EGFR; 20 (17.1%) KRAS; five (4.3%) B-Raf proto-oncogene (BRAF); one (0.9%) Erb-B2 Receptor Tyrosine Kinase 2 (ERBB2); one (0.9%) STK11; one (0.9%) TP53, and nine (7.7%) ALK mutated. Only 57 (48.7%) remained genomically UC, reducing the UC rate by 24.8%. Fourteen (12.0%) patients presented synchronous alterations. Concordance between NGS and Sanger for EGFR status was very high (¿ = 0.972; 99.1%). In the IC, a combined DNA and RNA NGS panel was used in 123 patients. Genomic variants were found in 79 (64.2%). In addition, eight (6.3%) EML4-ALK, four (3.1%), KIF5B-RET, four (3.1%) CD74-ROS1, one (0.8%) TPM3-NTRK translocations and three (2.4%) exon 14 skipping MET Proto-Oncogene (MET) mutations were detected, and 36% were treatable alterations. Conclusions: This study supports the use of NGS as the first-line test for genomic profiling of patients with advanced lung adenocarcinoma.We acknowledge the work of the members of our department, especially doctors from the thoracic oncology unit, oncology and pulmonology nurses and, patients and their relatives. N. Martins would like to thank the Portuguese Foundation for Science and Technology (FCT-Portugal) for the Strategic project ref. UID/BIM/04293/2013 and "NORTE2020—Northern Regional Operational Program" (NORTE-01-0145-FEDER-000012)
Machine Learning for Actionable Warning Identification: A Comprehensive Survey
Actionable Warning Identification (AWI) plays a crucial role in improving the
usability of static code analyzers. With recent advances in Machine Learning
(ML), various approaches have been proposed to incorporate ML techniques into
AWI. These ML-based AWI approaches, benefiting from ML's strong ability to
learn subtle and previously unseen patterns from historical data, have
demonstrated superior performance. However, a comprehensive overview of these
approaches is missing, which could hinder researchers/practitioners from
understanding the current process and discovering potential for future
improvement in the ML-based AWI community. In this paper, we systematically
review the state-of-the-art ML-based AWI approaches. First, we employ a
meticulous survey methodology and gather 50 primary studies from 2000/01/01 to
2023/09/01. Then, we outline the typical ML-based AWI workflow, including
warning dataset preparation, preprocessing, AWI model construction, and
evaluation stages. In such a workflow, we categorize ML-based AWI approaches
based on the warning output format. Besides, we analyze the techniques used in
each stage, along with their strengths, weaknesses, and distribution. Finally,
we provide practical research directions for future ML-based AWI approaches,
focusing on aspects like data improvement (e.g., enhancing the warning labeling
strategy) and model exploration (e.g., exploring large language models for
AWI)
AUTOMATED META-ACTIONS DISCOVERY FOR PERSONALIZED MEDICAL TREATMENTS
Healthcare, among other domains, provides an attractive ground of work for knowl- edge discovery researchers. There exist several branches of health informatics and health data-mining from which we find actionable knowledge discovery is underserved. Actionable knowledge is best represented by patterns of structured actions that in- form decision makers about actions to take rather than providing static information that may or may not hint to actions. The Action rules model is a good example of active structured action patterns that informs us about the actions to perform to reach a desired outcome. It is augmented by the meta-actions model that rep- resents passive structured effects triggered by the application of an action. In this dissertation, we focus primarily on the meta-actions model that can be mapped to medical treatments and their effects in the healthcare arena. Our core contribution lies in structuring meta-actions and their effects (positive, neutral, negative, and side effects) along with mining techniques and evaluation metrics for meta-action effects. In addition to the mining techniques for treatment effects, this dissertation provides analysis and prediction of side effects, personalized action rules, alternatives for treat- ments with negative outcomes, evaluation for treatments success, and personalized recommendations for treatments. We used the tinnitus handicap dataset and the Healthcare Cost and Utilization Project (HCUP) Florida State Inpatient Databases (SID 2010) to validate our work. The results show the efficiency of our methods
- …