86 research outputs found
Symbol Emergence in Robotics: A Survey
Humans can learn the use of language through physical interaction with their
environment and semiotic communication with other people. It is very important
to obtain a computational understanding of how humans can form a symbol system
and obtain semiotic skills through their autonomous mental development.
Recently, many studies have been conducted on the construction of robotic
systems and machine-learning methods that can learn the use of language through
embodied multimodal interaction with their environment and other systems.
Understanding human social interactions and developing a robot that can
smoothly communicate with human users in the long term, requires an
understanding of the dynamics of symbol systems and is crucially important. The
embodied cognition and social interaction of participants gradually change a
symbol system in a constructive manner. In this paper, we introduce a field of
research called symbol emergence in robotics (SER). SER is a constructive
approach towards an emergent symbol system. The emergent symbol system is
socially self-organized through both semiotic communications and physical
interactions with autonomous cognitive developmental agents, i.e., humans and
developmental robots. Specifically, we describe some state-of-art research
topics concerning SER, e.g., multimodal categorization, word discovery, and a
double articulation analysis, that enable a robot to obtain words and their
embodied meanings from raw sensory--motor information, including visual
information, haptic information, auditory information, and acoustic speech
signals, in a totally unsupervised manner. Finally, we suggest future
directions of research in SER.Comment: submitted to Advanced Robotic
Learning and Leveraging Structured Knowledge from User-Generated Social Media Data
Knowledge has long been a crucial element in Artificial Intelligence (AI), which can be traced back to knowledge-based systems, or expert systems, in the 1960s. Knowledge provides contexts to facilitate machine understanding and improves the explainability and performance of many semantic-based applications. The acquisition of knowledge is, however, a complex step, normally requiring much effort and time from domain experts. In machine learning as one key domain of AI, the learning and leveraging of structured knowledge, such as ontologies and knowledge graphs, have become popular in recent years with the advent of massive user-generated social media data. The main hypothesis in this thesis is therefore that a substantial amount of useful knowledge can be derived from user-generated social media data. A popular, common type of social media data is social tagging data, accumulated from users' tagging in social media platforms. Social tagging data exhibit unstructured characteristics, including noisiness, flatness, sparsity, incompleteness, which prevent their efficient knowledge discovery and usage. The aim of this thesis is thus to learn useful structured knowledge from social media data regarding these unstructured characteristics. Several research questions have then been formulated related to the hypothesis and the research challenges. A knowledge-centred view has been considered throughout this thesis: knowledge bridges the gap between massive user-generated data to semantic-based applications. The study first reviews concepts related to structured knowledge, then focuses on two main parts, learning structured knowledge and leveraging structured knowledge from social tagging data. To learn structured knowledge, a machine learning system is proposed to predict subsumption relations from social tags. The main idea is to learn to predict accurate relations with features, generated with probabilistic topic modelling and founded on a formal set of assumptions on deriving subsumption relations. Tag concept hierarchies can then be organised to enrich existing Knowledge Bases (KBs), such as DBpedia and ACM Computing Classification Systems. The study presents relation-level evaluation, ontology-level evaluation, and the novel, Knowledge Base Enrichment based evaluation, and shows that the proposed approach can generate high quality and meaningful hierarchies to enrich existing KBs. To leverage structured knowledge of tags, the research focuses on the task of automated social annotation and propose a knowledge-enhanced deep learning model. Semantic-based loss regularisation has been proposed to enhance the deep learning model with the similarity and subsumption relations between tags. Besides, a novel, guided attention mechanism, has been proposed to mimic the users' behaviour of reading the title before digesting the content for annotation. The integrated model, Joint Multi-label Attention Network (JMAN), significantly outperformed the state-of-the-art, popular baseline methods, with consistent performance gain of the semantic-based loss regularisers on several deep learning models, on four real-world datasets. With the careful treatment of the unstructured characteristics and with the novel probabilistic and neural network based approaches, useful knowledge can be learned from user-generated social media data and leveraged to support semantic-based applications. This validates the hypothesis of the research and addresses the research questions. Future studies are considered to explore methods to efficiently learn and leverage other various types of structured knowledge and to extend current approaches to other user-generated data
A whole brain probabilistic generative model: Toward realizing cognitive architectures for developmental robots
Building a human-like integrative artificial cognitive system, that is, an artificial general intelligence (AGI), is the holy grail of the artificial intelligence (AI) field. Furthermore, a computational model that enables an artificial system to achieve cognitive development will be an excellent reference for brain and cognitive science. This paper describes an approach to develop a cognitive architecture by integrating elemental cognitive modules to enable the training of the modules as a whole. This approach is based on two ideas: (1) brain-inspired AI, learning human brain architecture to build human-level intelligence, and (2) a probabilistic generative model (PGM)-based cognitive architecture to develop a cognitive system for developmental robots by integrating PGMs. The proposed development framework is called a whole brain PGM (WB-PGM), which differs fundamentally from existing cognitive architectures in that it can learn continuously through a system based on sensory-motor information.In this paper, we describe the rationale for WB-PGM, the current status of PGM-based elemental cognitive modules, their relationship with the human brain, the approach to the integration of the cognitive modules, and future challenges. Our findings can serve as a reference for brain studies. As PGMs describe explicit informational relationships between variables, WB-PGM provides interpretable guidance from computational sciences to brain science. By providing such information, researchers in neuroscience can provide feedback to researchers in AI and robotics on what the current models lack with reference to the brain. Further, it can facilitate collaboration among researchers in neuro-cognitive sciences as well as AI and robotics
The Future of Humanoid Robots
This book provides state of the art scientific and engineering research findings and developments in the field of humanoid robotics and its applications. It is expected that humanoids will change the way we interact with machines, and will have the ability to blend perfectly into an environment already designed for humans. The book contains chapters that aim to discover the future abilities of humanoid robots by presenting a variety of integrated research in various scientific and engineering fields, such as locomotion, perception, adaptive behavior, human-robot interaction, neuroscience and machine learning. The book is designed to be accessible and practical, with an emphasis on useful information to those working in the fields of robotics, cognitive science, artificial intelligence, computational methods and other fields of science directly or indirectly related to the development and usage of future humanoid robots. The editor of the book has extensive R&D experience, patents, and publications in the area of humanoid robotics, and his experience is reflected in editing the content of the book
Cultural Heritage Storytelling, Engagement and Management in the Era of Big Data and the Semantic Web
The current Special Issue launched with the aim of further enlightening important CH areas, inviting researchers to submit original/featured multidisciplinary research works related to heritage crowdsourcing, documentation, management, authoring, storytelling, and dissemination. Audience engagement is considered very important at both sites of the CH productionâconsumption chain (i.e., push and pull ends). At the same time, sustainability factors are placed at the center of the envisioned analysis. A total of eleven (11) contributions were finally published within this Special Issue, enlightening various aspects of contemporary heritage strategies placed in todayâs ubiquitous society. The finally published papers are related but not limited to the following multidisciplinary topics:Digital storytelling for cultural heritage;Audience engagement in cultural heritage;Sustainability impact indicators of cultural heritage;Cultural heritage digitization, organization, and management;Collaborative cultural heritage archiving, dissemination, and management;Cultural heritage communication and education for sustainable development;Semantic services of cultural heritage;Big data of cultural heritage;Smart systems for Historical cities â smart cities;Smart systems for cultural heritage sustainability
Affective Computing
This book provides an overview of state of the art research in Affective Computing. It presents new ideas, original results and practical experiences in this increasingly important research field. The book consists of 23 chapters categorized into four sections. Since one of the most important means of human communication is facial expression, the first section of this book (Chapters 1 to 7) presents a research on synthesis and recognition of facial expressions. Given that we not only use the face but also body movements to express ourselves, in the second section (Chapters 8 to 11) we present a research on perception and generation of emotional expressions by using full-body motions. The third section of the book (Chapters 12 to 16) presents computational models on emotion, as well as findings from neuroscience research. In the last section of the book (Chapters 17 to 22) we present applications related to affective computing
ãã«ãã¢ãŒãã«æœåšçãã£ãªã¯ã¬é åæ³ã®å€å±€åã«ããç¥èã®ç¢ºççè¡šçŸ
è¿å¹ŽïŒãããããšäººã®å
±åãç®æãããã®ç 究ãçãã«è¡ãããŠããïŒçŸç¶ã®ããããæè¡ã«ãããŠïŒæ§ã
ãªãããããéçºãããŠãããïŒéãããç°å¢ã§ç¹å®ã®ã¿ã¹ã¯ãå®è¡ãããã®ãæ®ã©ã§ããïŒã¿ã¹ã¯ã«å¿
èŠãªè¡åãå
¥åãã¿ãŒã³ã«å¯Ÿããå¿çãªã©ã人ãå
šãŠäºåã«äžããªããã°ãªããªãïŒããããã人ãšèªç¶ã«æ®ããããã«ã¯ïŒäººã®èšèãç解ããå¿
èŠãããïŒãã®èšèã®èåŸã«ããæœåšçãªæå³ã解éããŠè¡åããªããã°ãªããªãïŒãŸãïŒã³ãã¥ãã±ãŒã·ã§ã³ã®ããã«ïŒããããèªèº«ã®æå³ãèšèªãšããŠåµåºããããšãæãŸããïŒæ§æ¥ã®äººå·¥ç¥èœã®ç 究ã§ã¯ïŒåèªãåãªãèšå·ãšããŠæ±ãïŒãã®èšå·ã§éããäžçã®äžã§èšèªãç解ããåªåãç¶ããŠããïŒèªç¶èšèªåŠçã»ç解ã¯ïŒãã®æµãã匷ãåããŠããïŒããã«å¯ŸããŠè¿å¹Žã®ãããã£ã¯ã¹ã»äººå·¥ç¥èœç 究ã§ã¯ïŒããããèšå·æ¥å°åé¡ãåºæ¬ãšããŠïŒèšèªã®æ¬è³ªçãªæå³ãæ±ãå§ããŠãããïŒæªã ã«èšèªã®ç解ãçæã®æ¬è³ªçãªè§£æ±ºã«ã¯é ãåã°ãªãïŒæ¬è«æã§ã¯ïŒãããããçµéšã«ãã£ãŠåŸããã«ãã¢ãŒãã«æ
å ±ã«åºã¥ããŠå€æ§ãªæŠå¿µã圢æãïŒãã®æŠå¿µãåºç€ãšããèšèªç解ã»çæãèããããšã§ãã®åé¡ã解決ããæ°ããªæ¹åæ§ã瀺ãïŒããã§ïŒæŠå¿µãšã¯ãã«ãã¢ãŒãã«ãªæ
å ±ãåé¡ããŠåœ¢æããããã«ããŽãªãã§ããïŒãã®æŠå¿µãéããŠæ§ã
ãªäºæž¬ãããããšããç解ãã§ãããšå®çŸ©ããïŒããã«èšèªã¯ïŒããããæŠå¿µãšçµã³ä»ããé³é»ã©ãã«ã§ããïŒäººãšã®èªç¶ãªã€ã³ã¿ã©ã¯ã·ã§ã³ã®äžã§ç²åŸããããšãå¯èœã§ããïŒã€ãŸãæ¬è«æã§ææ¡ããã¢ãã«ã¯ïŒãããããæ¥åžžã®æŽ»åã«ãã£ãŠåŸãããšã®ã§ããæ
å ±ãåºç€ã«æŠå¿µã圢æãïŒé³é»ã©ãã«ãšã®çµã³ä»ããèªã®é çªãæå³ããææ³ãããã ã¢ããã«ç²åŸããããšã§ïŒèšèªã®æå³ç解ãçæãå®çŸãããã®ã§ããïŒãããŸã§ïŒãã«ãã¢ãŒãã«æ
å ±ãçšããç©äœã®ã«ããŽãªåé¡ææ³ã¯äžæãã«ãã£ãŠææ¡ãããŠããïŒå®éã«ïŒãããããçµéšããããšã«ãã£ãŠåŸãæ
å ±ãã«ããŽãªåé¡ããããšã§ïŒäººéã®æèŠã«è¿ãç©äœæŠå¿µã®åœ¢æãå¯èœã§ããããšã瀺ããŠããïŒãŸãïŒåœ¢æãããæŠå¿µãå©çšããŠæªèŠ³æž¬æ
å ±ãäºæž¬ããããšãã§ãïŒããããã«ããç©äœã®ç解ãåè¿°ã®å®çŸ©ã®ç¯å²ã§å¯èœã§ãããšèšããïŒãããïŒãã人éã®ããã«æè»ãªç解ãããããã§å®çŸããããã«ã¯ïŒç©äœæŠå¿µã®ç²åŸã ãã§ã¯äžååã§ããããšã¯æããã§ããïŒãªããªãïŒã»ãšãã©ã®ç©äœã¯ããã䜿ã人ã䜿ã人ã®åãïŒäœ¿ãããå Žæãªã©ãé¢é£ããŠããïŒãããã®æ
å ±ãäºæž¬ã§ããªãéããã®ç©äœãç解ãããšã¯èšããªãããã§ããïŒã€ãŸãïŒç©äœæŠå¿µã®ã¿ãªãã人ã®åãæŠå¿µãå ŽææŠå¿µãªã©å€æ§ãªæŠå¿µãåŠç¿ãããšåæã«ïŒãããã®é¢ä¿æ§ãç²åŸããå¿
èŠãããïŒãã®ãããªå€æ§ãªæŠå¿µã®ç²åŸã¯ïŒãã«ãã¢ãŒãã«æ
å ±ã®éå±€çã«ããŽãªåé¡ãžãšçºå±ãããããšã§å®çŸããããšã§å¯èœã§ããïŒæçµçã«ã¯ãããããããã«ãããäºç©ã®çã®ç解ã®èšç®ã¢ãã«ããšãªãããšãæããã«ããïŒãããæ¬è«æã®ãŽãŒã«ã§ããïŒæ¬è«æã§ã¯ãŸãïŒç¬¬2ç« ã§ããããã家åºç°å¢ã§äœæ¥ããããšãèæ
®ãïŒãããŸã§èè
ãéçºãããã¥ãŒããã€ãã«ããæé€ã¿ã¹ã¯ãäžäŸãšããŠåãäžããïŒæé€ã¿ã¹ã¯ãè¡ãããã«ïŒãæé€ããå®çŸ©ããå¿
èŠãããïŒãã®å®çŸ©ã«åŸã£ãã¿ã¹ã¯ã®å®çŸã«å¿
èŠãªèŠèŠèªèã·ã¹ãã ãã¿ã¹ã¯ã®å¶åŸ¡ãªã©ãå®è£
ããïŒããã«ãã£ãŠå®çŸ©ç¯å²å
ã®ç©äœèªèãææè¡åãªã©ãå®çŸããããšãã§ãããïŒæªç¥ãªç°å¢ã«å¯ŸããŠæè»ã«ã¿ã¹ã¯ãè¡ãããšãã§ããªãïŒãã®çµæãèžãŸããŠïŒãæé€ãã®æ¬è³ªçãªæå³ãèå¯ããïŒäŸãã°ïŒãæé€æ©ããããããšããè¡åã¯æé€æ©ãæã£ãŠçŽ°ãããã¿ã®äžã§åããããšã§ãããšèãïŒãæé€æ©ããšããç©äœæŠå¿µïŒãäœãã®äžã§åããããšããåãæŠå¿µã®çžäºé¢ä¿ãã圢æãããæŠå¿µã§ãããšèããããšãã§ããïŒããªãã¡ïŒãæé€ããšã¯å€æ§ãªæŠå¿µã®éå±€çãªçžäºäŸåé¢ä¿ããæ§æãããæŠå¿µã§ãããšèããïŒããããå€æ§ãªæŠå¿µã®åœ¢æãšãããã®éå±€çãªæ§é ã®æ§ç¯ãããããã®ç¥èãšããŠéèŠã§ããïŒç¬¬2ç« ã§ã®è°è«ã«åºã¥ã第3ç« ã§ã¯ïŒããããã®ç¢ºççç¥èè¡šçŸã®ããã®ãã«ãã¢ãŒãã«æ
å ±ã®éå±€çã«ããŽãªåé¡ææ³ãææ¡ããïŒææ¡ææ³ã¯ïŒãã«ãã¢ãŒãã«æœåšçãã£ãªã¯ã¬é
åæ³ïŒMultimodal Latent Dirichlet AllocationïŒMLDAïŒãéå±€åããå€å±€ãã«ãã¢ãŒãã«æœåšçãã£ãªã¯ã¬é
åæ³ïŒmultilayered MLDAïŒmMLDAïŒã§ããïŒäžå±€ã®MLDAã§ã¯äžäœæŠå¿µã§ããïŒç©äœïŒåãïŒå ŽæïŒäººç©ã®æŠå¿µããããã圢æããïŒäžå±€ã®MLDA ã§ã¯ãããã®æŠå¿µãçµ±åããäžäœæŠå¿µã圢æãããïŒãã®ã¢ãã«ãçšããããšã§äŸãã°ïŒäžäœæŠå¿µãšããŠãžã¥ãŒã¹ãšããç©äœæŠå¿µãç©ãå£ã«éã¶ãšããåãæŠå¿µïŒãã€ãã³ã°ãšããå ŽææŠå¿µãªã©ã圢æãããïŒäžäœå±€ã§ã¯ãããã®é¢ä¿æ§ãåŠç¿ããïŒã飲ãããšããè¡åæŠå¿µã圢æãããïŒããã«ããïŒãžã¥ãŒã¹ãèŠãããšã§ãããå£ã«éã¶ã飲ãããšããè¡åãïŒãã®ã飲ãããšããè¡åãããã€ãã³ã°ããšããå Žæã§è¡ãªããããããšãã£ãæªèŠ³æž¬æ
å ±ã®äºæž¬ãè¡ãããšãå¯èœãšãªãïŒç¬¬4ç« ã§ã¯ïŒåœ¢æãããå€æ§ãªæŠå¿µãå©çšãïŒåæã«èªæãææ³ãç²åŸããããšã§ïŒèŠ³æž¬ããã·ãŒã³ãæç« ã§è¡šçŸããææ³ãæ€èšããïŒããã§æ±ãåé¡ã¯ïŒéå±€çãªæŠå¿µã«ãããèªæã®ç²åŸã§ããïŒã©ã®éå±€ã®ã©ã®æŠå¿µã«ã©ã®åèªãçµã³ä»ãããšããåé¡ã解ãå¿
èŠãããïŒæ¬è«æã§ã¯ïŒåèªãšæŠå¿µéã®çžäºæ
å ±éãçšããããšã§ïŒã©ã®åèªãæ¬æ¥ã©ã®æŠå¿µã«çµã³ä»ããŠããã®ããèªåçã«æšå®ããææ³ãææ¡ããïŒããã«ããåèªãšæŠå¿µã®çµã³ä»ããåŠç¿ããããšãå¯èœã§ããïŒååèªã«å¯Ÿå¿ããïŒç©äœïŒå Žæã人ãªã©ãšãã£ãæŠå¿µã¯ã©ã¹ã®æšå®ãå¯èœã§ããïŒåŸã£ãŠïŒæ瀺çºè©±ã«ãããæŠå¿µã¯ã©ã¹ã®çèµ·é ãåŠç¿ããããšã§ïŒæŠå¿µã¯ã©ã¹ã®é·ç§»ç¢ºçãšãã圢ã§è¡šçŸããã確çææ³ãåŠç¿ããããšãã§ããïŒããã«ãã£ãŠïŒããããã«ããèšèªã®æå³ç解ãçæãå®çŸããããšãå¯èœãšãªãïŒäžæ¹ïŒå®éã®ã³ãã¥ãã±ãŒã·ã§ã³ã¯ïŒèæ¯ç¥èãåšèŸºã®ç¶æ³ãªã©ãšãã£ãæèãèæ
®ããªããã°æç«ããªãïŒã€ãŸãïŒäºç©ã«å¯Ÿããç解ãããæè»ã«è¡ãããã«ã¯ïŒåŠãã§ããå€æ§ãªæŠå¿µã掻çšããäžã§ïŒæ§ã
ãªæèãèæ
®ããå¿
èŠãããïŒç¬¬5ç« ã§ã¯ïŒããããã人ãšç掻ããäžã§ïŒæ§ã
ãªæèã«ãããŠã©ã®ããã«è¡å決å®ããããè°è«ããïŒã€ãŸãïŒç²åŸããå€æ§ãªæŠå¿µãšæèãšçµ±åããããšã§ïŒé©åãªè¡åã決å®ããææ³ãææ¡ããïŒããã«ããäŸãã°ïŒäººãæ®æ®µãœãã¡ãŒã§ãã¬ããèŠãŠãããšãã«ïŒãèåãé£ã¹ãªãããè¶ã飲ãã§ãããšããããšãç¥ã£ãŠããã°ïŒäººãããèåãæã£ãŠããŠããšåœä»€ããéã®é³å£°èªèã«èª€ããçãããšããŠãïŒãã®ãšãã«ããœãã¡ãŒã§ãã¬ããèŠãŠããŠãè¶ã飲ãã§ããããšããæèãçšããããšã§ïŒãããããé©åã«å€æãããŠæ£ããè¡åããšãããšãã§ããå¯èœæ§ãããïŒç¬¬6ç« ã§ã¯ïŒæ¬è«æã®ãŸãšããšä»åŸã®èª²é¡ã«ã€ããŠè¿°ã¹ãïŒé»æ°é信倧åŠ201
Proceedings of the Eighth Italian Conference on Computational Linguistics CliC-it 2021
The eighth edition of the Italian Conference on Computational Linguistics (CLiC-it 2021) was held at Università degli Studi di Milano-Bicocca from 26th to 28th January 2022. After the edition of 2020, which was held in fully virtual mode due to the health emergency related to Covid-19, CLiC-it 2021 represented the first moment for the Italian research community of Computational Linguistics to meet in person after more than one year of full/partial lockdown
- âŠ