38 research outputs found

    Stochastic Nonlinear Control via Finite-dimensional Spectral Dynamic Embedding

    Full text link
    Optimal control is notoriously difficult for stochastic nonlinear systems. Ren et al. introduced Spectral Dynamics Embedding for developing reinforcement learning methods for controlling an unknown system. It uses an infinite-dimensional feature to linearly represent the state-value function and exploits finite-dimensional truncation approximation for practical implementation. However, the finite-dimensional approximation properties in control have not been investigated even when the model is known. In this paper, we provide a tractable stochastic nonlinear control algorithm that exploits the nonlinear dynamics upon the finite-dimensional feature approximation, Spectral Dynamics Embedding Control (SDEC), with an in-depth theoretical analysis to characterize the approximation error induced by the finite-dimension truncation and statistical error induced by finite-sample approximation in both policy evaluation and policy optimization. We also empirically test the algorithm and compare the performance with Koopman-based methods and iLQR methods on the pendulum swingup problem

    Divide and Conquer Partition for Fourier Reconstruction Sparse Inversion with its Applications

    Get PDF
    A partition method, with an efficient divide and conquer partition strategy, for the non-uniform sampling signal reconstruction based on Fourier reconstruction sparse inversion (FRSI) is developed. The novel partition FRSI(P-FRSI) is motivated by the observation that the partition processing of multi-dimensional signals can reduce the reconstruction difficulty and save the reconstruction time. Moreover, it is helpful to choose suitable reconstruction parameters. The P-FRSI employs divide and conquer strategy, and the signal is firstly partitioned into some blocks. Following that, traditional FRSI is applied to reconstruct signals in each block. We adopt linear or nonlinear superposition to determine the weight coefficients during integrating these blocks. Finally, P-FRSI is applied to two-dimensional seismic signal reconstruction. The superiority of the new method over conventional FRSI is demonstrated by numerical reconstruction experiments

    SEABED INFRASTRUCTURE DEFENSE ANALYSIS

    Get PDF
    Traditional fleet operations and technologies are not adequately suited to counter the growing threat to undersea infrastructure from autonomous undersea systems. A cost-effective unmanned and manned system of systems is required to provide defense of this seabed infrastructure. This paper proposes possible system architectures to defend against this emerging threat to include passive barriers and active defense systems. The effectiveness of those candidate systems is evaluated through multiple agent-based modeling simulations of UUV versus UUV engagements. Analysis resulted in two major findings. First, point defense of critical assets is more effective than barrier defense. Second, system design must focus on minimizing the time required to effectively engage and neutralize threats, either through improvement to defensive UUV speed or investment in more UUV docking stations and sensor arrays. Cost analysis suggests that acquisition and operations cost of the recommended defensive system is less than the projected financial impact of a successful attack.http://archive.org/details/seabedinfrastruc1094562767Lieutenant, United States NavyLieutenant, United States NavyLieutenant, United States NavyMajor, Israel Defence ForcesMajor, Republic of Singapore Air ForceMajor, Republic of Singapore Air ForceCaptain, Singapore ArmyLieutenant, United States NavyLieutenant, United States NavyLieutenant, United States NavyMajor, Republic of Singapore Air ForceCaptain, Singapore ArmyCivilian, Ministry of Defense, SingaporeLieutenant, United States NavyLieutenant Commander, United States NavyLieutenant Junior Grade, United States NavyCivilian, Ministry of Defense, SingaporeCivilian, Ministry of Defense, SingaporeMajor, Republic of Singapore Air ForceMajor, United States Marine CorpsMajor, Singapore ArmyApproved for public release; distribution is unlimited

    Delay-Adaptive Distributed Stochastic Optimization

    No full text
    In large-scale optimization problems, distributed asynchronous stochastic gradient descent (DASGD) is a commonly used algorithm. In most applications, there are often a large number of computing nodes asynchronously computing gradient information. As such, the gradient information received at a given iteration is often stale. In the presence of such delays, which can be unbounded, the convergence of DASGD is uncertain. The contribution of this paper is twofold. First, we propose a delay-adaptive variant of DASGD where we adjust each iteration's step-size based on the size of the delay, and prove asymptotic convergence of the algorithm on variationally coherent stochastic problems, a class of functions which properly includes convex, quasi-convex and star-convex functions. Second, we extend the convergence results of standard DASGD, used usually for problems with bounded domains, to problems with unbounded domains. In this way, we extend the frontier of theoretical guarantees for distributed asynchronous optimization, and provide new insights for practitioners working on large-scale optimization problems

    Research on the Deformation and Failure Characteristics and Control Technology of Mining Area Rises under the Influence of Mining Stress

    No full text
    Affected by mining stress, roadways surrounding rock face problems such as serious deformation and failure and difficult support. In this study, with the II2 mining area rise in Taoyuan Coal Mine taken as the engineering background, the evolution laws of stress, deformation and plastic zone area of the mining area rises during the advance process of the working face were explored with the aid of FLAC3D software. The results suggested that the stress, deformation and plastic zone area of the surrounding rock increase significantly when the distance between the working face and the track rise is less than 20 m. Based on this finding, it was further determined that the stopping line of the II8222 working face should be at least 20 m away from the track rise. Furthermore, in accordance with the deformation and failure characteristics of surrounding rock under the influence of mining stress, this paper conducted a simulation on four support schemes of mining area rises, and quantitatively analyzed the mechanical response of a roadway surrounding rock under these support schemes. The simulation results revealed that the support scheme of “bolt-mesh-spray-cable + grouting bolt” can effectively deal with the influence of mining stress on the working face. Meanwhile, an engineering application was carried out. By monitoring the surface displacement of the surrounding rock, it was found that the deformation of the roadway surrounding rock was effectively controlled, and a remarkable support effect was achieved. In short, the proposed support scheme greatly improved the stability and safety of surrounding rock in the mining area rise under the influence of mining stress

    Experimental Study on Relative Permeability Characteristics for CO2 in Sandstone under High Temperature and Overburden Pressure

    No full text
    In this study, CO2 seepage of sandstone samples from the Taiyuan-Shanxi Formation coal seam roof in Ordos Basin, China, under temperature-stress coupling was studied with the aid of the TAWD-2000 coal rock mechanics-seepage test system. Furthermore, the evolution law and influencing factors on permeability for CO2 in sandstone samples with temperature and axial pressure were systematically analyzed. The results disclose that the permeability of sandstone decreases with the increase in stress. The lower the stress is, the more sensitive the permeability is to stress variation. High stress results in a decrease in permeability, and when the sample is about to fail, the permeability surges. The permeability of sandstone falls first and then rises with the rise of temperature, which is caused by the coupling among the thermal expansion of sandstone, the desorption of CO2, and the evaporation of residual water in fractures. Finally, a quadratic function mathematical model with a fitting degree of 98.2% was constructed between the temperature-stress coupling effect and the permeability for CO2 in sandstone. The model provides necessary data support for subsequent numerical calculation and practical engineering application. The experimental study on the permeability characteristics for CO2 in sandstone under high temperature and overburden pressure is crucial for evaluating the storage potential and predicting the CO2 migration evolution in underground coal gasification coupling CO2 storage projects

    Research on the Deformation and Failure Characteristics and Control Technology of Mining Area Rises under the Influence of Mining Stress

    No full text
    Affected by mining stress, roadways surrounding rock face problems such as serious deformation and failure and difficult support. In this study, with the II2 mining area rise in Taoyuan Coal Mine taken as the engineering background, the evolution laws of stress, deformation and plastic zone area of the mining area rises during the advance process of the working face were explored with the aid of FLAC3D software. The results suggested that the stress, deformation and plastic zone area of the surrounding rock increase significantly when the distance between the working face and the track rise is less than 20 m. Based on this finding, it was further determined that the stopping line of the II8222 working face should be at least 20 m away from the track rise. Furthermore, in accordance with the deformation and failure characteristics of surrounding rock under the influence of mining stress, this paper conducted a simulation on four support schemes of mining area rises, and quantitatively analyzed the mechanical response of a roadway surrounding rock under these support schemes. The simulation results revealed that the support scheme of “bolt-mesh-spray-cable + grouting bolt” can effectively deal with the influence of mining stress on the working face. Meanwhile, an engineering application was carried out. By monitoring the surface displacement of the surrounding rock, it was found that the deformation of the roadway surrounding rock was effectively controlled, and a remarkable support effect was achieved. In short, the proposed support scheme greatly improved the stability and safety of surrounding rock in the mining area rise under the influence of mining stress
    corecore