Search CORE

537 research outputs found

Design of of model-based controllers via parametric programming

Author: Sakizlis Vassilis
Sakizlis Vassilis
Publication venue
Publication date: 01/01/2003
Field of study

Imperial Users onl

Spiral - Imperial College Digital Repository

Applications of linear estimation theory to chemical processes

Author: Goldmann Stephen Frederick
Goldmann Stephen Frederick
Publication venue
Publication date: 01/01/1969
Field of study

Imperial Users onl

Spiral - Imperial College Digital Repository

OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System

Author: Bian Rongcheng
Cai Bohua
Cao Qiong
Chen Guanpu
Chen Shixiang
Ding Liang
He Fengxiang
Li Chang
Li Jiaxing
Liu Daqing
Liu Dongkai
Liu Wei
Liu Xiangyang
Peng Xuyang
Shen Li
Tao Dacheng
Wang Chaoyue
Wang Zhenfang
Xie Shuai
Xue Chao
Yang Yibo
Zhan Yibing
Zhang Jing
Zhang Shijin
Zhang Yukang
Zhao Shanshan
Zhao Yiyan
Zheng Heliang
Publication venue
Publication date: 08/07/2023
Field of study

Automated machine learning (AutoML) seeks to build ML models with minimal human effort. While considerable research has been conducted in the area of AutoML in general, aiming to take humans out of the loop when building artificial intelligence (AI) applications, scant literature has focused on how AutoML works well in open-environment scenarios such as the process of training and updating large models, industrial supply chains or the industrial metaverse, where people often face open-loop problems during the search process: they must continuously collect data, update data and models, satisfy the requirements of the development and deployment environment, support massive devices, modify evaluation metrics, etc. Addressing the open-environment issue with pure data-driven approaches requires considerable data, computing resources, and effort from dedicated data engineers, making current AutoML systems and platforms inefficient and computationally intractable. Human-computer interaction is a practical and feasible way to tackle the problem of open-environment AI. In this paper, we introduce OmniForce, a human-centered AutoML (HAML) system that yields both human-assisted ML and ML-assisted human techniques, to put an AutoML system into practice and build adaptive AI in open-environment scenarios. Specifically, we present OmniForce in terms of ML version management; pipeline-driven development and deployment collaborations; a flexible search strategy framework; and widely provisioned and crowdsourced application algorithms, including large models. Furthermore, the (large) models constructed by OmniForce can be automatically turned into remote services in a few minutes; this process is dubbed model as a service (MaaS). Experimental results obtained in multiple search spaces and real-world use cases demonstrate the efficacy and efficiency of OmniForce

arXiv.org e-Print Archive

Reparameterized Policy Learning for Multimodal Trajectory Optimization

Author: Gan Chuang
Huang Zhiao
Li Xuanlin
Liang Litian
Ling Zhan
Su Hao
Publication venue
Publication date: 20/07/2023
Field of study

We investigate the challenge of parametrizing policies for reinforcement learning (RL) in high-dimensional continuous action spaces. Our objective is to develop a multimodal policy that overcomes limitations inherent in the commonly-used Gaussian parameterization. To achieve this, we propose a principled framework that models the continuous RL policy as a generative model of optimal trajectories. By conditioning the policy on a latent variable, we derive a novel variational bound as the optimization objective, which promotes exploration of the environment. We then present a practical model-based RL method, called Reparameterized Policy Gradient (RPG), which leverages the multimodal policy parameterization and learned world model to achieve strong exploration capabilities and high data efficiency. Empirical results demonstrate that our method can help agents evade local optima in tasks with dense rewards and solve challenging sparse-reward environments by incorporating an object-centric intrinsic reward. Our method consistently outperforms previous approaches across a range of tasks. Code and supplementary materials are available on the project page https://haosulab.github.io/RPG

arXiv.org e-Print Archive