291 research outputs found

    Safe and Efficient Switching Controller Design for Partially Observed Linear-Gaussian Systems

    Switching control strategies that combine a potentially high-performance but uncertified controller with a stabilizing but conservative controller have been shown to balance safety with efficiency, but they have been less studied under partial state observation. To address this gap, we propose a switching control strategy for partially observed linear-Gaussian systems with provable performance guarantees. We show that the proposed switching strategy is both safe and efficient, in the sense that: (1) the linear-quadratic cost of the system is always bounded, even if the original uncertified controller is destabilizing; (2) when the uncertified controller is stabilizing, the performance loss induced by the conservativeness of switching converges super-exponentially to zero. The effectiveness of the switching strategy is also demonstrated via numerical simulation on the Tennessee Eastman Process.

    Almost Surely $\sqrt{T}$ Regret Bound for Adaptive LQR

    The Linear-Quadratic Regulation (LQR) problem with unknown system parameters has been widely studied, but it has remained unclear whether $\tilde{\mathcal{O}}(\sqrt{T})$ regret, which is the best known dependence on time, can be achieved almost surely. In this paper, we propose an adaptive LQR controller with an almost sure $\tilde{\mathcal{O}}(\sqrt{T})$ regret upper bound. The controller features a circuit-breaking mechanism, which prevents potential safety breaches and guarantees the convergence of the system parameter estimate, but is shown to be triggered only finitely often and hence has a negligible effect on the asymptotic performance of the controller. The proposed controller is also validated via simulation on the Tennessee Eastman Process (TEP), a commonly used industrial process example.

    Research on the Evaluation of Enterprise-University-Research Cooperation Ability in Hubei Province

    Measuring the efficiency of enterprise-university-research cooperation is important for improving that efficiency, strengthening the effective integration of regional resources, enhancing regional innovation capacity, and promoting regional economic development. The paper applies data envelopment analysis (DEA) and the DEA-Malmquist productivity index to study the cooperation efficiency of Hubei in comparison with other provinces in China. The study finds that the technical efficiency index is 0.52 and that enterprise-university-research cooperation in Hubei is not DEA-efficient. To reach DEA efficiency, Hubei province would need to reduce inputs by 1652.596 R&D employees and 638.368 full-time-equivalent R&D employees, or increase new-product sales income by 137.89 billion yuan. Finally, the paper puts forward policy recommendations on existing problems to strengthen the standing of the cooperation, ensure that research results are applied effectively, and improve the management of enterprise-university-research cooperation efficiency.

    The Impact of Third-Party Payment on the Profitability of Commercial Banks

    This thesis draws on data from the annual financial reports of 15 commercial banks of different types in China from 2016 to 2019. The bank's return on total assets (ROA) and non-interest income ratio (NIIR) are treated as dependent variables, and the other variables as independent variables. The aim of this thesis is to examine the effect of third-party payment developments on the profitability of commercial banks and whether the effect differs across types of banks. The thesis closes with suggestions for how banks can withstand risks and improve supervision. JEL Classification: F12. Keywords: third-party payment, commercial bank profitability, bank supervision, regression. Title: The impact of third-party payment on the profitability of commercial banks. Institute of Economic Studies, Faculty of Social Sciences.

    The Impact of Third-Party Payment on the Profitability of Commercial Banks

    This article explores the influence of Chinese third-party payment transaction volume on the profitability of 15 Chinese commercial banks. The article starts from the operating model of third-party payment, and then builds four corresponding models along the four dimensions of deposits, loans, non-interest income, and return on assets to test the impact of third-party payment on the profitability of commercial banks. The article concludes with recommendations for commercial banks to mitigate risks, supervise third-party payment platforms, and collaborate with third-party payment platforms. JEL Classification: F12. Keywords: third-party payment, commercial bank profitability, bank supervision, regression. Title: The impact of third-party payment on the profitability of Chinese commercial banks. Institute of Economic Studies, Faculty of Social Sciences.

    Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation

    In real-world scenarios, the application of reinforcement learning is significantly challenged by complex non-stationarity. Most existing methods attempt to model changes in the environment explicitly, often requiring impractical prior knowledge. In this paper, we propose a new perspective, positing that non-stationarity can propagate and accumulate through complex causal relationships during state transitions, thereby compounding its sophistication and affecting policy learning. We believe that this challenge can be more effectively addressed by tracing the causal origin of non-stationarity. To this end, we introduce the Causal-Origin REPresentation (COREP) algorithm. COREP primarily employs a guided updating mechanism to learn a stable graph representation for states, termed the causal-origin representation. By leveraging this representation, the learned policy exhibits impressive resilience to non-stationarity. We supplement our approach with a theoretical analysis grounded in the causal interpretation of non-stationary reinforcement learning, supporting the validity of the causal-origin representation. Experimental results further demonstrate the superior performance of COREP over existing methods in tackling non-stationarity.