Search CORE

167 research outputs found

Ontology-based Fuzzy Markup Language Agent for Student and Robot Co-Learning

Author: Chen Li-Chung
Huang Tzong-Xiang
Huang Yung-Ching
Hung Pi-Hsia
Kubota Naoyuki
Lee Chang-Shing
Tseng Chien-Hsun
Wang Mei-Hui
Yang Sheng-Chi
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 25/01/2018
Field of study

An intelligent robot agent based on domain ontology, machine learning mechanism, and Fuzzy Markup Language (FML) for students and robot co-learning is presented in this paper. The machine-human co-learning model is established to help various students learn the mathematical concepts based on their learning ability and performance. Meanwhile, the robot acts as a teacher's assistant to co-learn with children in the class. The FML-based knowledge base and rule base are embedded in the robot so that the teachers can get feedback from the robot on whether students make progress or not. Next, we inferred students' learning performance based on learning content's difficulty and students' ability, concentration level, as well as teamwork sprit in the class. Experimental results show that learning with the robot is helpful for disadvantaged and below-basic children. Moreover, the accuracy of the intelligent FML-based agent for student learning is increased after machine learning mechanism.Comment: This paper is submitted to IEEE WCCI 2018 Conference for revie

arXiv.org e-Print Archive

Crossref

FML-based Prediction Agent and Its Application to Game of Go

Author: Kao Chia-Hsiu
Kubota Naoyuki
Lee Chang-Shing
Nojima Yusuke
Saga Ryosuke
Shuo Nan
Wang Mei-Hui
Yang Sheng-Chi
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 16/04/2017
Field of study

In this paper, we present a robotic prediction agent including a darkforest Go engine, a fuzzy markup language (FML) assessment engine, an FML-based decision support engine, and a robot engine for game of Go application. The knowledge base and rule base of FML assessment engine are constructed by referring the information from the darkforest Go engine located in NUTN and OPU, for example, the number of MCTS simulations and winning rate prediction. The proposed robotic prediction agent first retrieves the database of Go competition website, and then the FML assessment engine infers the winning possibility based on the information generated by darkforest Go engine. The FML-based decision support engine computes the winning possibility based on the partial game situation inferred by FML assessment engine. Finally, the robot engine combines with the human-friendly robot partner PALRO, produced by Fujisoft incorporated, to report the game situation to human Go players. Experimental results show that the FML-based prediction agent can work effectively.Comment: 6 pages, 12 figures, Joint 17th World Congress of International Fuzzy Systems Association and 9th International Conference on Soft Computing and Intelligent Systems (IFSA-SCIS 2017), Otsu, Japan, Jun. 27-30, 201

arXiv.org e-Print Archive

Crossref

Entrainment Promoting Robot Based on Physiological Signals towards Human Cooperation

Author: Huang Tzong-Xiang
コウシュウショウ
黄宗祥
Publication venue
Publication date: 25/03/2023
Field of study

東京都立大学Tokyo Metropolitan University博士（情報科学）doctoral thesi

Tokyo Metropolitan University Institutional Repository Miyako-Dori / 首都大学東京機関リポジトリ

Personal and Personalized Conversations:Designing Agents Who Want To Connect With You

Author: van Waterschoot Jelte Barachia
Publication venue: University of Twente
Publication date: 02/12/2021
Field of study

University of Twente Research Information

Learning Icelandic in Virtual Reykjavik: Simulating real-life conversations with embodied conversational agents using multimodal clarification requests

Author: Bédi Branislav
Publication venue: Háskóli Íslands, Hugvísindasvið, Mála- og menningardeild
Publication date: 26/10/2020
Field of study

Þessi doktorsritgerð er hluti af verkefninu Icelandic Language and Culture Training in Virtual Reykjavik, þrívíddartölvuleik sem gerir þeim sem eru að læra íslensku sem annað mál kleift að æfa tal og hlustun. Markmið verkefnisins var að búa til tölvuleik með sýndarspjallverum (e. embodied conversational agents) sem byggju yfir raunsærri fjölþættri hegðun, með það langtímamarkmið að styðja við hagnýta kennslu á íslensku máli og menningu þar sem mál úr raunverulegum samskiptum er notað. Markmið doktorsverkefnisins beindist að því að rannsaka raunveruleg yrt og óyrt atriði í skýringarbeiðnum meðal Íslendinga (e. clarification requests, CRs). Lögð voru til sex fjölþætt líkön af skýringarbeiðnum sem áttu að stuðla að raunhæfari samspili manna og sýndarspjallvera í Virtual Reykjavik. Þróun doktorsritgerðar fór fram í þremur lotum. Fyrst var gerð stutt könnun til að komast að því hvaða væntingar notendur hefðu til Virtual Reykjavik-þrívíddarforritsins. Nemendurnir sögðust eiga í erfiðleikum með að æfa sig í að tala íslensku við þá sem hafa íslensku að móðurmáli og kynnu því að meta að fá sýndarnámsumhverfi til að æfa sig í tali. Kennslufræðilegur grunnur Virtual Reykjavik tekur mið af samskiptaaðferðum, námi á grundvelli verkefna og leikja og fjölþættum og einstaklingsmiðuðum aðferðum í tungumálanámi. Sýndarspjallverur Virtual Reykjavik búa yfir fjölþættri hegðun sem er í samræmi við íslenska menningu. Með því að taka þátt í leiknum komast notendur í tæri við íslenskt mál og menningu í sýndarnámsveruleika áður en þeir eiga samskipti við Íslendinga. Meginviðfang rannsóknarinnar var samskiptaþátturinn skýringarbeiðni (CR) en nauðsynlegt var að afmarka rannsóknina við einn samskiptaþátt svo unnt væri að nota fjölþætta greiningu sem dugði til að forrita sýndarverurnar. Skýringarbeiðni er ein algengasta tegund segða í samtölum (Purver, 2004). Hún hjálpar til við að skýra það sem áður hefur verið sagt en sem viðmælandi hefur af einhverjum sökum ekki skilið og stuðlar þannig að góðu samtalsflæði. Af þessum sökum eru skýringarbeiðnir mjög mikilvægar til þess að ná fram raunsæjum samskiptum milli notanda og sýndarspjallveru í kerfum eins og okkar sem sameina sjálfvirka talgreiningu og samtöl sem skipulögð eru fyrir fram. Í næstu lotu rannsóknarinnar var málgögnum safnað til þess að greina yrta og óyrta þætti í mismunandi tegundum af skýringarbeiðnum. Vegna þess hversu flókið talmál er og fjölbreytileg samtöl geta verið var aðeins safnað samtölum þar sem ókunnugir spurðu til vegar í miðbæ Reykjavíkur. Þetta endurspeglaðist svo í þeim verkefnum sem nemendur þyrftu að leysa í Virtual Reykjavik. Þar spyrja þeir sýndarspjallverur til vegar í miðbæ Reykjavíkur og verurnar nota skýringaraðferðir til að vísa til vegar á sem raunsæjastan hátt. Þó ber ekki að líta svo á að þetta sé tæmandi rannsókn á eðli skýringarbeiðna heldur fjölþætt lýsing á skýringarbeiðnum, notkun þeirra í sérstökum samræðuaðstæðum í leiknum og beitingu þeirra til að líkja eftir mannlegri hegðun. Sex mismunandi fjölþættar skýringarbeiðnategundir voru búnar til á grundvelli gagnagrunns með myndbandsupptökum af raunverulegum samtölum milli fólks með íslensku að móðurmáli og fólks sem ekki hefur íslensku að móðurmáli. Þetta voru í heild 165 upptökur, 1.59.02 klst. á lengd, 108 pör fólks þar sem annar aðilinn hefur íslensku að móðurmáli en hinn ekki og 57 pör þar sem báðir aðilar hafa íslensku að móðurmáli, karlmenn og konur. Aldur þeirra sem höfðu íslensku að móðurmáli var á bilinu 18–70 ár og meðalaldurinn u.þ.b. 35 ár en þeir sem ekki höfðu íslensku að móðurmáli voru á aldrinum 20–40 ára og meðalaldur þar u.þ.b. 30 ár. Úr þessum gagnagrunni var búinn til fjölþættur stofn skýringarbeiðna sem samanstóð af yrtum og óyrtum gögnum fyrir hverja tegund af skýringarbeiðni. Myndbandsupptökur voru greindar með ELAN merkingar- og skýringapakkanum. Í hverri greiningu var fjölþættum gögnum lýst. Fjölþættri nálgun við tungumál og fjölþættri greiningu á samskiptum var beitt til að greina yrta og óyrta þætti skýringarbeiðna. Vegna takmarka á umfangi rannsóknarinnar voru aðeins tvær gerðir beiðna notaðar, úrfelling og innskotsaðferð. Að lokum var framkvæmd notendakönnun til að komast að því hvernig nemendur skynjuðu fjölþætta hegðun sýndarspjallveranna í leiknum og hvort þeir tækju eftir þessum tveimur tegundum skýringarbeiðna í honum. Nemendum þótti innskotsaðferðin vera eðlilegust þótt þeim hefði fundist henni stundum vera beitt dálítið ruddalega eða hún verið notuð of mikið af sýndarspjallverunum. Það hversu spjallverurnar notuðu mikið skýringarbeiðnirnar var ekki mælt þar sem einblínt var á nemendur sem notendur í þessari frumútgáfu leiksins. Könnunin leiddi í ljós fjölda möguleika til að betrumbæta fjölþætta hegðun spjallveranna í framtíðarútgáfum leiksins. Sérstaklega bentu notendur á að ákveðin svipbrigði og að spjallverurnar gætu ekki brosað gerði það að verkum að þær virkuðu „óhugnanlegar“. Í stuttu máli eru færð rök fyrir því í ritgerðinni að þrívíddartölvuleikir nýtist vel til að kenna íslenska tungu og menningu, með sérstakri áherslu á að æfa talmálsfærni. Fjallað er um og stutt með kennslufræðilegum kenningum hvernig bæta megi námsupplifun og kalla fram alvörusamskipti í sýndarveruleika með raunsærri og fjölþættri hegðun spjallvera. Skoðaðar voru sex skýringaraðferðir sem fólk með íslensku að móðurmáli notaði til að vísa til vegar, annars vegar af fólki með íslensku að móðurmáli og hins vegar þeim sem ekki hafa íslensku að móðurmáli. Í ritgerðinni er einnig bent á hugsanlegar nýjar rannsóknir í sambandi við skýringarbeiðnir og Virtual Reykjavik. Skoða mætti frekar fjölþættar skýringarbeiðnir í samtölum við aðrar aðstæður og í öðrum tungumálum. Slíkt myndi gagnast við að betrumbæta þær skýringarbeiðnir sem sýndarspjallverur í Virtual Reykjavik nota. Ágætis byrjun á áframhaldandi vinnu væri að framkvæma nýja könnun með fullkomnari leiðbeiningum, námsefni og stoðbúnaði, talgreinikerfi sem virkar á allan hátt í Virtual Reykjavik og með sýndarspjallverum sem byggju yfir fleiri eiginleikum, gætu t.d. brosað.This thesis forms part of the project Icelandic Language and Culture Training in Virtual Reykjavik, a 3D computer game that enables learners of Icelandic to practise oral language and listening. The aim of the project was to build a computer game populated with embodied conversational agents (ECAs) endowed with realistic multimodal behaviour, with a long-term goal of supporting authentic teaching of Icelandic language and culture. The part of the project reported in this thesis focused on examining human verbal and non-verbal features in clarification requests (CRs). Six multimodal CR models were suggested for implementation, with the intention of promoting a more realistic human-agent interaction in Virtual Reykjavik. The research took place in three phases. First, a small survey was carried out, eliciting learners’ expectations from Virtual Reykjavik. It informed about learners’ expectations of a 3D application. Learners reported difficulties in practising spoken Icelandic with native speakers in real life and for this reason said they would appreciate a virtual learning environment for practising oral language. The pedagogical foundation of Virtual Reykjavik considers the communicative approach in language instruction, task- and game-based learning, and multimodal and individual language learning approaches. Virtual Reykjavik was populated with ECAs endowed with multimodal behaviour that is authentic to Icelandic culture. Engaging in the game provided learners with an opportunity to experience Icelandic language as it is spoken in the target culture but in a virtual learning environment, and prior to engaging with speakers in the real world. The communicative function CR was chosen as the main object of multimodal analysis, in order to narrow down the focus to a specific topic in natural language research. CR is one of the most commonly used utterance-types in spoken conversations (Purver, 2004); it helps to clarify what has previously been said but for whatever reason not understood by the recipient, and as such facilitates smooth conversational flow. For these reasons, CR is very important in achieving a realistic human-agent interaction in systems, like ours, which combine automatic speech recognition and pre-planned dialogues. In this second phase, natural language data was collected in order to analyse the verbal and non-verbal features in various types of CRs. Due to the complexity of spoken language and a wide range of possible conversational scenarios, data were collected only during first encounters asking for directions to a location in central Reykjavik. This in turn reflected the same task learners would need to do in Virtual Reykjavik - they would ask agents for directions in central Virtual Reykjavik and the agents would use clarification strategies in an authentic way. It should, however, not be seen as an exhaustive treatise about the nature of CRs but rather as a multimodal description of CRs, their use in a particular conversational scenario in the game, and their application to the development of human-like behaviour. Based on a database of video recordings of real-life conversations between native and non-native speakers of Icelandic, six different multimodal CR types were characterised. (165 recordings with total recorded time 1 hour, 59 minutes and 2 seconds; 108 native-non-native speaker pairs and 57 native-native speaker pairs, men and women; ages of native speakers between 18-70 with average age approximately 35 years, and ages of non-native speakers between 20-40 with average age approximately 30 years). Out of this database, a multimodal corpus of CRs was created, consisting of verbal and non-verbal data for each type of CR. Video recordings were analysed using the ELAN tagging and annotation package. Each analysis consisted of a description of multimodal data. The multimodal approach to language and the multimodal interaction analysis were used to analyse the verbal and non-verbal features of CRs. Due to resource constraints, only two types, the Ellipsis and the Fragment (Interjection Strategy), were implemented. Finally, a user response study was conducted in order to find out how learners perceived multimodal behaviour of ECAs in the game, and whether surveyed learners noticed the two implemented CRs. Learners perceived the CR Fragment (Interjection Strategy) as the most natural, despite its being perceived as slightly rude or used too frequently by the ECAs. The frequency of use of CRs by the ECAs was not measured, since the focus was on learners as users of this game prototype. The study revealed many possibilities for improving the multimodal behaviour of ECAs which could be implemented in future versions. In particular, certain facial expressions, and their lack of ability to smile, were commonly perceived by learners as “creepy”. In summary, this thesis presents the rationale for building a 3D computer game for teaching Icelandic language and culture, with a focus on practising oral language skills. It presents pedagogical background for including authentic features into the multimodal behaviour of ECAs in a computer game to achieve a more realistic human-agent interaction, and thus to contribute to an improved learning experience in an online virtual learning environment. Six clarification strategies used by native speakers of Icelandic were observed when they were approached by other native and non-native speakers asking for directions. The thesis also outlines points for future work on CRs and Virtual Reykjavik. Exploration of multimodal CRs in other conversational settings and languages would be useful for further improving ECA CRs used in Virtual Reykjavik. A good starting point for a continuation would be to conduct a new study with more complete instructions, learning materials and scaffolding, a fully functioning speech recognition system in Virtual Reykjavik, and ECAs endowed with additional features including smiling.RANNÍ

Opin visindi

Control and communication systems for automated vehicles cooperation and coordination

Author: Hussein Ahmed Ali Mahmoud
Publication venue
Publication date: 01/01/2018
Field of study

Mención Internacional en el título de doctorThe technological advances in the Intelligent Transportation Systems (ITS) are exponentially improving over the last century. The objective is to provide intelligent and innovative services for the different modes of transportation, towards a better, safer, coordinated and smarter transport networks. The Intelligent Transportation Systems (ITS) focus is divided into two main categories; the first is to improve existing components of the transport networks, while the second is to develop intelligent vehicles which facilitate the transportation process. Different research efforts have been exerted to tackle various aspects in the fields of the automated vehicles. Accordingly, this thesis is addressing the problem of multiple automated vehicles cooperation and coordination. At first, 3DCoAutoSim driving simulator was developed in Unity game engine and connected to Robot Operating System (ROS) framework and Simulation of Urban Mobility (SUMO). 3DCoAutoSim is an abbreviation for "3D Simulator for Cooperative Advanced Driver Assistance Systems (ADAS) and Automated Vehicles Simulator". 3DCoAutoSim was tested under different circumstances and conditions, afterward, it was validated through carrying-out several controlled experiments and compare the results against their counter reality experiments. The obtained results showed the efficiency of the simulator to handle different situations, emulating real world vehicles. Next is the development of the iCab platforms, which is an abbreviation for "Intelligent Campus Automobile". The platforms are two electric golf-carts that were modified mechanically, electronically and electrically towards the goal of automated driving. Each iCab was equipped with several on-board embedded computers, perception sensors and auxiliary devices, in order to execute the necessary actions for self-driving. Moreover, the platforms are capable of several Vehicle-to-Everything (V2X) communication schemes, applying three layers of control, utilizing cooperation architecture for platooning, executing localization systems, mapping systems, perception systems, and finally several planning systems. Hundreds of experiments were carried-out for the validation of each system in the iCab platform. Results proved the functionality of the platform to self-drive from one point to another with minimal human intervention.Los avances tecnológicos en Sistemas Inteligentes de Transporte (ITS) han crecido de forma exponencial durante el último siglo. El objetivo de estos avances es el de proveer de sistemas innovadores e inteligentes para ser aplicados a los diferentes medios de transporte, con el fin de conseguir un transporte mas eficiente, seguro, coordinado e inteligente. El foco de los ITS se divide principalmente en dos categorías; la primera es la mejora de los componentes ya existentes en las redes de transporte, mientras que la segunda es la de desarrollar vehículos inteligentes que hagan más fácil y eficiente el transporte. Diferentes esfuerzos de investigación se han llevado a cabo con el fin de solucionar los numerosos aspectos asociados con la conducción autónoma. Esta tesis propone una solución para la cooperación y coordinación de múltiples vehículos. Para ello, en primer lugar se desarrolló un simulador (3DCoAutoSim) de conducción basado en el motor de juegos Unity, conectado al framework Robot Operating System (ROS) y al simulador Simulation of Urban Mobility (SUMO). 3DCoAutoSim ha sido probado en diferentes condiciones y circunstancias, para posteriormente validarlo con resultados a través de varios experimentos reales controlados. Los resultados obtenidos mostraron la eficiencia del simulador para manejar diferentes situaciones, emulando los vehículos en el mundo real. En segundo lugar, se desarrolló la plataforma de investigación Intelligent Campus Automobile (iCab), que consiste en dos carritos eléctricos de golf, que fueron modificados eléctrica, mecánica y electrónicamente para darle capacidades autónomas. Cada iCab se equipó con diferentes computadoras embebidas, sensores de percepción y unidades auxiliares, con la finalidad de transformarlos en vehículos autónomos. Además, se les han dado capacidad de comunicación multimodal (V2X), se les han aplicado tres capas de control, incorporando una arquitectura de cooperación para operación en modo tren, diferentes esquemas de localización, mapeado, percepción y planificación de rutas. Innumerables experimentos han sido realizados para validar cada uno de los diferentes sistemas incorporados. Los resultados prueban la funcionalidad de esta plataforma para realizar conducción autónoma y cooperativa con mínima intervención humana.Programa Oficial de Doctorado en Ingeniería Eléctrica, Electrónica y AutomáticaPresidente: Francisco Javier Otamendi Fernández de la Puebla.- Secretario: Hanno Hildmann.- Vocal: Pietro Cerr

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Universidad Carlos III de Madrid e-Archivo