
9 research outputs found

Efficient model-free Q-factor approximation in value space via log-sum-exp neural networks

Author: Calafiore Giuseppe Carlo
Possieri Corrado
Publication venue: IFAC
Publication date: 01/01/2020

We propose an efficient technique for performing data-driven optimal control of discrete-time systems. In particular, we show that log-sum-exp (lse) neural networks, which are smooth and convex universal approximators of convex functions, can be efficiently used to approximate Q-factors arising from finite-horizon optimal control problems with continuous state space. The key advantage of these networks over classical approximation techniques is that they are convex and hence readily amenable to efficient optimization.

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

ART

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

Author: Li Jianxin
Peng Jieqi
Xiong Hui
Zhang Shanghang
Zhang Shuai
Zhang Wancai
Zhou Haoyi
Publication date: 28/03/2021

Many real-world applications require the prediction of long sequence time-series, such as electricity consumption planning. Long sequence time-series forecasting (LSTF) demands a high prediction capacity of the model, which is the ability to capture precise long-range dependency coupling between output and input efficiently. Recent studies have shown the potential of Transformer to increase the prediction capacity. However, there are several severe issues with Transformer that prevent it from being directly applicable to LSTF, including quadratic time complexity, high memory usage, and inherent limitation of the encoder-decoder architecture. To address these issues, we design an efficient transformer-based model for LSTF, named Informer, with three distinctive characteristics: (i) a ProbSparse self-attention mechanism, which achieves $O(L \log L)$ in time complexity and memory usage, and has comparable performance on sequences' dependency alignment. (ii) the self-attention distilling highlights dominating attention by halving cascading layer input, and efficiently handles extreme long input sequences. (iii) the generative style decoder, while conceptually simple, predicts the long time-series sequences at one forward operation rather than a step-by-step way, which drastically improves the inference speed of long-sequence predictions. Extensive experiments on four large-scale datasets demonstrate that Informer significantly outperforms existing methods and provides a new solution to the LSTF problem. Comment: 8 pages (main), 5 pages (appendix) and to be appeared in AAAI202

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Log-sum-exp neural networks and posynomial models for convex and log-log-convex data

Author: Calafiore Giuseppe C.
Gaubert Stéphane
Possieri Corrado
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 08/12/2018

We show that a one-layer feedforward neural network with exponential activation functions in the inner layer and logarithmic activation in the output neuron is a universal approximator of convex functions. Such a network represents a family of scaled log-sum exponential functions, here named $\mathrm{LSE}_T$. Under a suitable exponential transformation, the class of $\mathrm{LSE}_T$ functions maps to a family of generalized posynomials $\mathrm{GPOS}_T$, which we similarly show to be universal approximators for log-log-convex functions. A key feature of an $\mathrm{LSE}_T$ network is that, once it is trained on data, the resulting model is convex in the variables, which makes it readily amenable to efficient design based on convex optimization. Similarly, once a $\mathrm{GPOS}_T$ model is trained on data, it yields a posynomial model that can be efficiently optimized with respect to its variables by using geometric programming (GP). The proposed methodology is illustrated by two numerical examples, in which, first, models are constructed from simulation data of the two physical processes (namely, the level of vibration in a vehicle suspension system, and the peak power generated by the combustion of propane), and then optimization-based design is performed on these models.
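As a rough illustration of the network described in this abstract, the sketch below evaluates a scaled log-sum-exp function in NumPy and checks midpoint convexity numerically. The parameterization f_T(x) = T log Σ_k exp((a_k·x + b_k)/T), the function name `lse_t`, and the random weights `A`, `b` are assumptions made for illustration; the abstract does not give the authors' exact formulation or training procedure.

```python
import numpy as np

def lse_t(x, A, b, T=1.0):
    """Scaled log-sum-exp: T * log(sum_k exp((A[k] @ x + b[k]) / T)).

    Assumed form: an inner layer of exponential units applied to affine
    maps of x, followed by a logarithmic output neuron scaled by T.
    """
    z = (A @ x + b) / T
    m = z.max()  # subtract the maximum for numerical stability
    return T * (m + np.log(np.exp(z - m).sum()))

# Hypothetical parameters: 5 hidden units, 3 input variables.
rng = np.random.default_rng(0)
A = rng.normal(size=(5, 3))
b = rng.normal(size=5)
x, y = rng.normal(size=3), rng.normal(size=3)

# Midpoint convexity: f((x + y) / 2) <= (f(x) + f(y)) / 2.
mid = lse_t(0.5 * (x + y), A, b, T=0.5)
avg = 0.5 * (lse_t(x, A, b, T=0.5) + lse_t(y, A, b, T=0.5))
assert mid <= avg + 1e-12
```

Under this assumed form, the function is a smooth upper bound on the maximum of the affine maps a_k·x + b_k, and the gap is at most T log K for K hidden units, so smaller T gives a tighter, more piecewise-linear fit.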

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

HAL-Polytechnique

Log-Sum-Exp Neural Networks and Posynomial Models for Convex and Log-Log-Convex Data

Author: Calafiore G. C.
Gaubert S.
Possieri C.
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/01/2020

In this paper, we show that a one-layer feedforward neural network with exponential activation functions in the inner layer and logarithmic activation in the output neuron is a universal approximator of convex functions. Such a network represents a family of scaled log-sum exponential functions, here named log-sum-exp ($\mathrm{LSE}_T$). Under a suitable exponential transformation, the class of $\mathrm{LSE}_T$ functions maps to a family of generalized posynomials $\mathrm{GPOS}_T$, which we similarly show to be universal approximators for log-log-convex functions. A key feature of an $\mathrm{LSE}_T$ network is that, once it is trained on data, the resulting model is convex in the variables, which makes it readily amenable to efficient design based on convex optimization. Similarly, once a $\mathrm{GPOS}_T$ model is trained on data, it yields a posynomial model that can be efficiently optimized with respect to its variables by using geometric programming (GP). The proposed methodology is illustrated by two numerical examples, in which, first, models are constructed from simulation data of the two physical processes (namely, the level of vibration in a vehicle suspension system, and the peak power generated by the combustion of propane), and then optimization-based design is performed on these models.
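The exponential transformation mentioned in this abstract can be sketched as follows. The parameterization with weights $a_k$, offsets $b_k$, and $K$ hidden units is an assumption made for illustration and may differ from the notation used in the paper itself.

```latex
% Assumed scaled log-sum-exp form with K hidden units:
f_T(x) = T \log \sum_{k=1}^{K} \exp\!\left( \frac{a_k^\top x + b_k}{T} \right),
\qquad x \in \mathbb{R}^n,\; T > 0.

% Substituting x = \log z componentwise (z > 0) and exponentiating:
\psi_T(z) = \exp\bigl( f_T(\log z) \bigr)
          = \left( \sum_{k=1}^{K} e^{b_k / T} \prod_{i=1}^{n} z_i^{a_{ki}/T} \right)^{T}.
```

The inner sum has positive coefficients $e^{b_k/T}$ and real exponents $a_{ki}/T$, so it is a posynomial in $z$; raising it to the positive power $T$ gives a generalized posynomial, which is the kind of model that geometric programming can optimize.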

ART

Understanding and monitoring the evolution of the Covid-19 epidemic from medical emergency calls: the example of the Paris area

Author: Baptiste Colin
Caroline Télion
Christophe Leroy
David P. Parsons
Frédéric Adnet
Frédéric Lapostolle
Jean-Sébastien Marx
Laurent Goix
Laurent Massoulié
Laurent Tréluyer
Marianne Akian
Marin Boyet
Pierre Carli
Stéphane Gaubert
Thomas Loeb
Théotime Grohens
Xavier Allamigeon
Éric Lecarpentier
Érick Chanzy
Publication venue: Cellule MathDoc/CEDRAM
Publication date: 01/01/2020

Comptes Rendus Mathématique