70 research outputs found

    A Review of Reinforcement Learning for Natural Language Processing, and Applications in Healthcare

    Reinforcement learning (RL) has emerged as a powerful approach for tackling complex medical decision-making problems such as treatment planning, personalized medicine, and optimizing the scheduling of surgeries and appointments. It has also gained significant attention in the field of Natural Language Processing (NLP) due to its ability to learn optimal strategies for tasks such as dialogue systems, machine translation, and question answering. This paper presents a review of RL techniques in NLP, highlighting key advancements, challenges, and applications in healthcare. The review begins with a roadmap of machine learning and its applications in healthcare, and then explores the integration of RL with NLP tasks. We examine dialogue systems in which RL enables the learning of conversational strategies, RL-based machine translation models, question-answering systems, text summarization, and information extraction. Additionally, ethical considerations and biases in RL-NLP systems are addressed.
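
    As an illustration of the policy-gradient idea underlying RL-based dialogue strategies, the following minimal sketch trains a tabular REINFORCE policy on a toy dialogue task; the states, actions, and reward function are illustrative placeholders and are not drawn from any system covered in the review.

```python
# Minimal sketch (not from the reviewed systems): tabular REINFORCE on a toy
# dialogue task. States, actions, and rewards are illustrative placeholders.
import numpy as np

rng = np.random.default_rng(0)

N_STATES, N_ACTIONS = 4, 3                # toy dialogue stages and response templates
theta = np.zeros((N_STATES, N_ACTIONS))   # policy logits, one row per stage

def softmax(logits):
    z = np.exp(logits - logits.max())
    return z / z.sum()

def step(state, action):
    """Hypothetical environment: action 0 is the 'good' reply at every stage."""
    reward = 1.0 if action == 0 else 0.0
    next_state = state + 1
    done = next_state >= N_STATES
    return next_state, reward, done

def run_episode():
    state, trajectory, episode_return = 0, [], 0.0
    done = False
    while not done:
        probs = softmax(theta[state])
        action = rng.choice(N_ACTIONS, p=probs)
        next_state, reward, done = step(state, action)
        trajectory.append((state, action))
        episode_return += reward
        state = next_state
    return trajectory, episode_return

ALPHA = 0.1
for _ in range(500):
    trajectory, episode_return = run_episode()
    for s, a in trajectory:               # REINFORCE update: grad log pi = one_hot(a) - probs
        probs = softmax(theta[s])
        grad_log_pi = -probs
        grad_log_pi[a] += 1.0
        theta[s] += ALPHA * episode_return * grad_log_pi

print("learned policy at stage 0:", np.round(softmax(theta[0]), 2))
```

    In practice the tabular policy would be replaced by a neural dialogue model and the hand-coded reward by task-success or user-satisfaction signals, but the update structure is the same.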

    Analysis and automatic identification of spontaneous emotions in speech from human-human and human-machine communication

    This research focuses on improving our understanding of human-human and human-machine interactions by analysing participants' emotional status. For this purpose, we have developed and enhanced Speech Emotion Recognition (SER) systems for both types of interaction in real-life scenarios, with explicit emphasis on the Spanish language. In this framework, we have conducted an in-depth analysis of how humans express emotions through speech when communicating with other people or with machines in actual situations. Thus, we have analysed and studied the way in which emotional information is expressed in a variety of true-to-life environments, which is a crucial aspect for the development of SER systems. This study aimed to comprehensively understand the challenge we wanted to address: identifying emotional information in speech using machine learning technologies. Neural networks have been demonstrated to be adequate tools for identifying events in speech and language. Most of the experiments aimed to make local comparisons between specific aspects; thus, the experimental conditions were tailored to each particular analysis. The experiments across the different articles (P1 to P19) are hardly comparable due to our continuous learning about the difficult task of identifying emotions in speech. In order to make a fair comparison, additional unpublished results are presented in the Appendix. These experiments were carried out under identical and rigorous conditions. This general comparison offers an overview of the advantages and disadvantages of the different methodologies for the automatic recognition of emotions in speech.
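
    For context, a conventional SER baseline pairs acoustic features with a lightweight classifier. The sketch below is a minimal illustration of that idea (time-averaged MFCCs fed to a small multi-layer perceptron), not the systems developed in the thesis; the file paths and labels are hypothetical placeholders.

```python
# Minimal SER baseline sketch (not the thesis pipeline): averaged MFCC features
# fed to a small neural classifier. File paths and labels are hypothetical.
import numpy as np
import librosa
from sklearn.neural_network import MLPClassifier
from sklearn.model_selection import train_test_split

def mfcc_features(path, n_mfcc=13):
    """Load an utterance and average its MFCCs over time into one vector."""
    y, sr = librosa.load(path, sr=16000)
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)
    return mfcc.mean(axis=1)

# Hypothetical labelled corpus: (wav path, emotion label) pairs.
corpus = [
    ("utt_001.wav", "neutral"),
    ("utt_002.wav", "anger"),
    ("utt_003.wav", "sadness"),
    # ...
]

X = np.stack([mfcc_features(path) for path, _ in corpus])
labels = [label for _, label in corpus]

X_train, X_test, y_train, y_test = train_test_split(
    X, labels, test_size=0.2, random_state=0
)
clf = MLPClassifier(hidden_layer_sizes=(32,), max_iter=1000, random_state=0)
clf.fit(X_train, y_train)
print("held-out accuracy:", clf.score(X_test, y_test))
```

    Modern systems typically replace the time-averaged features with sequence models, but the train-and-evaluate structure is the same.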

    Software-based dialogue systems: Survey, taxonomy and challenges

    The use of natural language interfaces in the field of human-computer interaction is undergoing intense study through dedicated scientific and industrial research. The latest contributions in the field, including deep learning approaches like recurrent neural networks, the potential of context-aware strategies, and user-centred design approaches, have brought the attention of the community back to software-based dialogue systems, generally known as conversational agents or chatbots. Nonetheless, given the novelty of the field, a generic, context-independent overview of the current state of research on conversational agents covering all research perspectives involved is missing. Motivated by this context, this paper reports a survey of the current state of research on conversational agents through a systematic literature review of secondary studies. The conducted research is designed to develop an exhaustive perspective through a clear presentation of the aggregated knowledge published in recent literature across a variety of domains, research focuses, and contexts. As a result, this research proposes a holistic taxonomy of the different dimensions involved in the conversational agents' field, which is expected to help researchers and to lay the groundwork for future research in the field of natural language interfaces. With the support of the Secretariat for Universities and Research of the Ministry of Business and Knowledge of the Government of Catalonia and the European Social Fund. The corresponding author gratefully acknowledges the Universitat Politècnica de Catalunya and Banco Santander for the financial support of his predoctoral grant FPI-UPC. This paper has been funded by the Spanish Ministerio de Ciencia e Innovación under project / funding scheme PID2020-117191RB-I00 / AEI/10.13039/501100011033.

    Confirmation Report: Modelling Interlocutor Confusion in Situated Human Robot Interaction

    Human-Robot Interaction (HRI) is an important but challenging field focused on improving the interaction between humans and robots so as to make the interaction more intelligent and effective. However, building natural conversational HRI is an interdisciplinary challenge for scholars, engineers, and designers. It is generally assumed that the pinnacle of human-robot interaction will be fluid, naturalistic conversational interaction that in important ways mimics how humans interact with each other. This is challenging at a number of levels, and in particular there are considerable difficulties when it comes to naturally monitoring and responding to the user's mental state. On the topic of mental states, one area that has received little attention to date is monitoring the user for possible confusion states. Confusion is a non-trivial mental state which can be seen as having at least two substates. These two confusion states can be thought of as being associated with either negative or positive emotions. When people are productively confused, they have a drive to resolve their current difficulties. Meanwhile, people who are in unproductive confusion may lose their engagement and motivation to overcome those difficulties, which in turn may even lead them to drop the current conversation. While there has been some research on confusion monitoring and detection, it has been limited, with most work focused on evaluating confusion states in online learning tasks. The central hypothesis of this research is that the monitoring and detection of confusion states in users is essential to fluid task-centric HRI, and that it should be possible to detect such confusion and adjust policies to mitigate it. In this report, I expand on this hypothesis and set out several research questions. I also provide a comprehensive literature review before outlining work done to date towards my research hypothesis, and I set out plans for future experimental work.
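
    As a purely hypothetical illustration of the detect-and-adapt loop proposed here, the sketch below classifies an interlocutor's state as productive confusion, unproductive confusion, or no confusion from placeholder interaction features and switches the robot's dialogue policy accordingly; the features, thresholds, and policy responses are invented for illustration and are not results of this research.

```python
# Hypothetical sketch of a confusion detect-and-adapt loop for HRI.
# Feature names, thresholds, and policy responses are invented placeholders.
from dataclasses import dataclass
from enum import Enum, auto

class ConfusionState(Enum):
    NONE = auto()
    PRODUCTIVE = auto()
    UNPRODUCTIVE = auto()

@dataclass
class InteractionFeatures:
    response_delay_s: float   # time before the user replies
    gaze_on_task: float       # fraction of time spent looking at the task
    repair_requests: int      # e.g. "what do you mean?" utterances

def classify(f: InteractionFeatures) -> ConfusionState:
    """Toy rule-based detector standing in for a learned classifier."""
    if f.repair_requests == 0 and f.response_delay_s < 2.0:
        return ConfusionState.NONE
    if f.gaze_on_task > 0.6:              # still engaged with the task
        return ConfusionState.PRODUCTIVE
    return ConfusionState.UNPRODUCTIVE

def adapt_policy(state: ConfusionState) -> str:
    """Pick a placeholder dialogue move intended to mitigate confusion."""
    return {
        ConfusionState.NONE: "continue current task instructions",
        ConfusionState.PRODUCTIVE: "offer a hint and a clarifying example",
        ConfusionState.UNPRODUCTIVE: "simplify the task and re-engage the user",
    }[state]

features = InteractionFeatures(response_delay_s=3.5, gaze_on_task=0.4, repair_requests=2)
state = classify(features)
print(state.name, "->", adapt_policy(state))
```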

    Ubiquitous Technologies for Emotion Recognition

    Emotions play a very important role in how we think and behave. As such, the emotions we feel every day can compel us to act and influence the decisions and plans we make about our lives. Being able to measure, analyze, and better comprehend how or why our emotions may change is thus of much relevance to understand human behavior and its consequences. Despite the great efforts made in the past in the study of human emotions, it is only now, with the advent of wearable, mobile, and ubiquitous technologies, that we can aim to sense and recognize emotions, continuously and in real time. This book brings together the latest experiences, findings, and developments regarding ubiquitous sensing, modeling, and the recognition of human emotions

    Development and evaluation of a haptic framework supporting telerehabilitation robotics and group interaction

    Telerehabilitation robotics has grown remarkably in the past few years. It can provide intensive training to people with special needs remotely while allowing therapists to observe the whole process. Telerehabilitation robotics is a promising solution for supporting routine care: it can help transform face-to-face, one-on-one treatment sessions, which require intensive human resources and are restricted to specialised care centres, into technology-based treatments with less human involvement that are easy to access remotely from anywhere. However, there are limitations such as network latency, jitter, and delay over the internet that can negatively affect user experience and the quality of a treatment session. Moreover, the lack of social interaction, since all treatments are performed over the internet, can reduce patients' motivation. As a result, these limitations make it very difficult to deliver an efficient recovery plan. This thesis developed and evaluated a new framework designed to facilitate telerehabilitation robotics. The framework integrates multiple cutting-edge technologies to generate playful activities that involve group interaction with binaural audio, visual, and haptic feedback, together with robot interaction, in a variety of environments. The research questions asked were: 1) Can activity mediated by technology motivate and influence the behaviour of users, so that they engage in the activity and sustain a good level of motivation? 2) Will working as a group enhance users' motivation and interaction? 3) Can we transfer a real-life activity involving group interaction to the virtual domain and deliver it reliably via the internet? There were three goals in this work: first, to compare people's behaviours and motivations while doing the task in a group and on their own; second, to determine whether group interaction in virtual and real environments differed in terms of performance, engagement, and strategy to complete the task; and finally, to test the effectiveness of the framework against benchmarks drawn from the socially assistive robotics literature. Three studies were conducted to achieve the first goal, two with healthy participants and one with seven autistic children. The first study observed how people react in a challenging group task, while the other two studies compared group and individual interactions. The results obtained from these studies showed that the group interactions were more enjoyable than individual interactions and most likely had more positive effects in terms of user behaviours. This suggests that the group interaction approach has the potential to motivate individuals to make more movements and be more active, and could be applied in the future for more serious therapy. Another study was conducted to measure group interaction performance in virtual and real environments and to identify which aspects influence users' strategies for dealing with the task. The results from this study helped to form a better understanding of how to predict a user's behaviour in a collaborative task. A simulation was run to compare the results generated by the predictor with the real data, showing that, with an appropriate training method, the predictor can perform very well. This thesis has demonstrated the feasibility of group interaction via the internet using robotic technology, which could be beneficial for people who require social interaction (e.g. stroke patients and autistic children) in their treatments without regular visits to clinical centres.
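
    The behaviour-prediction step mentioned above can be pictured as training a model on recorded motion and then comparing its predictions against held-out real data. The sketch below uses a synthetic one-dimensional trajectory and a simple autoregressive ridge regressor purely as a stand-in; the thesis's actual predictor, data, and training method are not specified here.

```python
# Illustrative only: predict the next sample of a motion trajectory from its
# recent history. The synthetic sine trajectory stands in for recorded task
# movements; it is not data from the thesis.
import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.default_rng(1)

# Synthetic 1-D trajectory with measurement noise.
t = np.linspace(0, 20, 2000)
traj = np.sin(t) + 0.05 * rng.standard_normal(t.size)

LAGS = 5                                         # history length fed to the model
X = np.stack([traj[i:i + LAGS] for i in range(traj.size - LAGS)])
y = traj[LAGS:]                                  # next sample to predict

split = int(0.8 * len(X))                        # train on early data, test on the rest
model = Ridge(alpha=1e-3).fit(X[:split], y[:split])
pred = model.predict(X[split:])

rmse = np.sqrt(np.mean((pred - y[split:]) ** 2))
print(f"held-out RMSE: {rmse:.4f}")
```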