Search CORE

85 research outputs found

Logic Programming and Machine Ethics

Author: Costantini Stefania
Dyoub Abeer
Lisi Francesca A.
Publication venue: 'Open Publishing Association'
Publication date: 01/01/2020
Field of study

Transparency is a key requirement for ethical machines. Verified ethical behavior is not enough to establish justified trust in autonomous intelligent agents: it needs to be supported by the ability to explain decisions. Logic Programming (LP) has a great potential for developing such perspective ethical systems, as in fact logic rules are easily comprehensible by humans. Furthermore, LP is able to model causality, which is crucial for ethical decision making.Comment: In Proceedings ICLP 2020, arXiv:2009.09158. Invited paper for the ICLP2020 Panel on "Machine Ethics". arXiv admin note: text overlap with arXiv:1909.0825

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Università di Bari

Open Access Repository

How Fast Can We Play Tetris Greedily With Rectangular Pieces?

Author: Dallant Justin
Iacono John
Publication venue
Publication date: 01/01/2022
Field of study

Consider a variant of Tetris played on a board of width

w

and infinite height, where the pieces are axis-aligned rectangles of arbitrary integer dimensions, the pieces can only be moved before letting them drop, and a row does not disappear once it is full. Suppose we want to follow a greedy strategy: let each rectangle fall where it will end up the lowest given the current state of the board. To do so, we want a data structure which can always suggest a greedy move. In other words, we want a data structure which maintains a set of

O(n)

rectangles, supports queries which return where to drop the rectangle, and updates which insert a rectangle dropped at a certain position and return the height of the highest point in the updated set of rectangles. We show via a reduction to the Multiphase problem [P\u{a}tra\c{s}cu, 2010] that on a board of width

w=\Theta(n)

, if the OMv conjecture [Henzinger et al., 2015] is true, then both operations cannot be supported in time

O(n^{1/2-\epsilon})

simultaneously. The reduction also implies polynomial bounds from the 3-SUM conjecture and the APSP conjecture. On the other hand, we show that there is a data structure supporting both operations in

O(n^{1/2}\log^{3/2}n)

time on boards of width

n^{O(1)}

, matching the lower bound up to a

n^{o(1)}

factor.Comment: Correction of typos and other minor correction

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

DI-fusion

On the Necessity of Metalearning: Learning Suitable Parameterizations for Learning Processes

Author: Hamidi Massinissa
Osmani Aomar
Publication venue
Publication date: 31/12/2023
Field of study

In this paper we will discuss metalearning and how we can go beyond the current classical learning paradigm. We will first address the importance of inductive biases in the learning process and what is at stake: the quantities of data necessary to learn. We will subsequently see the importance of choosing suitable parameterizations to end up with well-defined learning processes. Especially since in the context of real-world applications, we face numerous biases due, e.g., to the specificities of sensors, the heterogeneity of data sources, the multiplicity of points of view, etc. This will lead us to the idea of exploiting the structuring of the concepts to be learned in order to organize the learning process that we published previously. We conclude by discussing the perspectives around parameter-tying schemes and the emergence of universal aspects in the models thus learned

arXiv.org e-Print Archive

Population-coding and Dynamic-neurons improved Spiking Actor Network for Reinforcement Learning

Author: Cheng Xiang
Jia Shuncheng
Xu Bo
Zhang Duzhen
Zhang Tielin
Publication venue
Publication date: 14/06/2021
Field of study

With the Deep Neural Networks (DNNs) as a powerful function approximator, Deep Reinforcement Learning (DRL) has been excellently demonstrated on robotic control tasks. Compared to DNNs with vanilla artificial neurons, the biologically plausible Spiking Neural Network (SNN) contains a diverse population of spiking neurons, making it naturally powerful on state representation with spatial and temporal information. Based on a hybrid learning framework, where a spike actor-network infers actions from states and a deep critic network evaluates the actor, we propose a Population-coding and Dynamic-neurons improved Spiking Actor Network (PDSAN) for efficient state representation from two different scales: input coding and neuronal coding. For input coding, we apply population coding with dynamically receptive fields to directly encode each input state component. For neuronal coding, we propose different types of dynamic-neurons (containing 1st-order and 2nd-order neuronal dynamics) to describe much more complex neuronal dynamics. Finally, the PDSAN is trained in conjunction with deep critic networks using the Twin Delayed Deep Deterministic policy gradient algorithm (TD3-PDSAN). Extensive experimental results show that our TD3-PDSAN model achieves better performance than state-of-the-art models on four OpenAI gym benchmark tasks. It is an important attempt to improve RL with SNN towards the effective computation satisfying biological plausibility.Comment: 27 pages, 11 figures, accepted by Journal of Neural Network

arXiv.org e-Print Archive

Educational Technology and Education Conferences, January to June 2016

Author: Wright Clayton R.
Publication venue: 'Ovid Technologies (Wolters Kluwer Health)'
Publication date
Field of study

ALT Open Access Repository

On learning history based policies for controlling Markov decision processes

Author: Mahajan Aditya
Patil Gandharv
Precup Doina
Publication venue
Publication date: 05/11/2022
Field of study

Reinforcementlearning(RL)folkloresuggeststhathistory-basedfunctionapproximationmethods,suchas recurrent neural nets or history-based state abstraction, perform better than their memory-less counterparts, due to the fact that function approximation in Markov decision processes (MDP) can be viewed as inducing a Partially observable MDP. However, there has been little formal analysis of such history-based algorithms, as most existing frameworks focus exclusively on memory-less features. In this paper, we introduce a theoretical framework for studying the behaviour of RL algorithms that learn to control an MDP using history-based feature abstraction mappings. Furthermore, we use this framework to design a practical RL algorithm and we numerically evaluate its effectiveness on a set of continuous control tasks

arXiv.org e-Print Archive

Stateful Memory-Augmented Transformers for Dialogue Modeling

Author: Wu Qingyang
Yu Zhou
Publication venue
Publication date: 15/09/2022
Field of study

Transformer encoder-decoder models have shown impressive performance in dialogue modeling. However, as Transformers are inefficient in processing long sequences, dialogue history length often needs to be truncated. To address this problem, we propose a new memory-augmented Transformer that is compatible with existing pre-trained encoder-decoder models and enables efficient preservation of history information. It incorporates a separate memory module alongside the pre-trained Transformer to effectively interchange information between the memory states and the current input context. We evaluate our model on three dialogue datasets and two language modeling datasets. Experimental results show that our method has achieved superior efficiency and performance compared to other pre-trained Transformer baselines

arXiv.org e-Print Archive

Educational Technology and Related Education Conferences for June to December 2015

Author: Wright Clayton R.
Publication venue: 'Ovid Technologies (Wolters Kluwer Health)'
Publication date
Field of study

The 33rd edition of the conference list covers selected events that primarily focus on the use of technology in educational settings and on teaching, learning, and educational administration. Only listings until December 2015 are complete as dates, locations, or Internet addresses (URLs) were not available for a number of events held from January 2016 onward. In order to protect the privacy of individuals, only URLs are used in the listing as this enables readers of the list to obtain event information without submitting their e-mail addresses to anyone. A significant challenge during the assembly of this list is incomplete or conflicting information on websites and the lack of a link between conference websites from one year to the next

ALT Open Access Repository