In this paper, we investigate the power of online learning in stochastic
network optimization with unknown system statistics {\it a priori}. We are
interested in understanding how information and learning can be efficiently
incorporated into system control techniques, and what are the fundamental
benefits of doing so. We propose two \emph{Online Learning-Aided Control}
techniques, $\mathtt{OLAC}$ and $\mathtt{OLAC2}$, that explicitly utilize the
past system information in current system control via a learning procedure
called \emph{dual learning}. We prove strong performance guarantees of the
proposed algorithms: $\mathtt{OLAC}$ and $\mathtt{OLAC2}$ achieve the
near-optimal $[O(\epsilon), O([\log(1/\epsilon)]^2)]$ utility-delay tradeoff
and $\mathtt{OLAC2}$ possesses an $O(\epsilon^{-2/3})$ convergence time.
$\mathtt{OLAC}$ and $\mathtt{OLAC2}$ are probably the first algorithms that
simultaneously possess explicit near-optimal delay guarantee and sub-linear
convergence time. Simulation results also confirm the superior performance of
the proposed algorithms in practice. To the best of our knowledge, our attempt
is the first to explicitly incorporate online learning into stochastic network
optimization and to demonstrate its power in both theory and practice

Hao, Xiaohong

Huang, Longbo

Liu, Xin

English

arXiv

Longbo Huang

Xin Liu

Xiaohong Hao

MUCC (Crossref)

The power of online learning in stochastic network optimization

A method, based on ideas from control theory, is described for the synchronization of discrete time transmitter /receiver dynamics. Conceptually, the methodology consists of constructing observer-receiver dynamics that exploit at each time instant the drive signal and past values of the drive signal. In this way, the method can be viewed as a dynamic reconstruction mechanism. PACS numbers: 02.10.Jf 02.90.+p 05.45.+b 47.52.+j 89.90.+n 1 Introduction  Following Pecora and Caroll [14] a huge interest in the synchronization of two coupled systems has arisen. This research is partly motivated by its possible use in secure communications, cf. [6]. Often, like in [14] a drive/response, or transmitter/receiver, viewpoint is assumed. In a discrete-time context, this typically allows for a description of the transmitter as a n-dimensional dynamical system x 1 (k+1) = f 1 (x 1 (k); x 2 (k)) (1)  x 2 (k+1) = f 2 (x 1 (k); x 2 (k)) (2) where x 1 (\Delta) and x 2 (\Delta) are vectors of dimension m ..

Henri Huijberts

Torsten Lilge

Henk Nijmeijer

CiteSeerX

A control perspective on synchronization and the Takens-Aeyels-Sauer Reconstruction Theorem

Huijberts, H.J.C.

Lilge, T.

Nijmeijer, H.

Pure OAI Repository

Control perspective on synchronization and the Takens-Aeyels-Sauer reconstruction theorem

method, based on ideas from control theory, is described for the synchronization of discrete time transmitter and receiver dynamics. Conceptually, the methodology consists of constructing observer-receiver dynamics that exploit the drive signal and past values of the drive signal at each time instant. In this way, the method can be viewed as a dynamic reconstruction mechanism

Huijberts, Henri

Lilge, Torsten

Nijmeijer, Henk

University of Twente Research Information

landsPHYSICAL REVIEW E APRIL 1999VOLUME 59, NUMBER 4Control perspective on synchronization and the Takens-Aeyels-Sauer reconstruction theoremHenri Huijberts*Faculty of Mathematics and Computing Science, Eindhoven University of Technology, P.O. Box 513,5600 MB Eindhoven, The NetherlandsTorsten LilgeInstitut für Regelungstechnik, Universita¨t Hannover, Appelstrasse 11, D-30167 Hannover, GermanyHenk NijmeijerFaculty of Mathematical Sciences, University of Twente, P.O. Box 217, 7500 AE Enschede, The Netherlandsand Faculty of Mechanical Engineering, Eindhoven University of Technology, P.O. Box 513, 5600 MB Eindhoven, The Nether~Received 15 June 1998!A method, based on ideas from control theory, is described for the synchronization of discrete time trans-mitter and receiver dynamics. Conceptually, the methodology consists of constructing observer-receiver dy-namics that exploit the drive signal and past values of the drive signal at each time instant. In this way, themethod can be viewed as a dynamic reconstruction mechanism.@S1063-651X~99!04904-1#PACS number~s!: 05.45.2a, 07.05.Dz, 02.10.Jfteuar alEret,lydsarelyreenin--intionly.g.,I. INTRODUCTIONFollowing Pecora and Carroll@1#, a great deal of interesin the synchronization of two coupled systems has arisThis research is partly motivated by its possible use in seccommunications, cf. Ref.@2#. Often, as in Ref.@1#, a driveand response, or transmitter and receiver, viewpoint issumed. In a discrete-time context, this typically allows fodescription of the transmitter as ann-dimensional dynamicasystem,x1~k11!5 f 1„x1~k!,x2~k!…, ~1!x2~k11!5 f 2„x1~k!,x2~k!…, ~2!wherex1(•) and x2(•) are vectors of dimensionsm and l,with m1 l 5n andx(k)5„x1(k),x2(k)…. Givenx1(•) as thedrive signal, the receiver dynamics are taken as a copy of~2!:x̃2~k11!5 f 2„x1~k!,x̃2~k!…. ~3!Synchronization of the transmitter and receiver now corsponds to the asymptotic matching of Eqs.~2! and ~3!, thatis,limk→`ix2~k!2 x̃2~k!i50. ~4!Clearly Eq.~4! will not be satisfied in general and, in facconditions onf 1 and f 2 that guarantee this condition are onpartially known, cf. Ref.@3#. For that reason several methofor achieving synchronization of signals likex2(•) andx̃2(•) have been proposed. In particular, we wish to recthe ~reduced! observer viewpoint advocated in Ref.@4#,which basically admits the construction of dynamics*Author to whom correspondence should be addressed.PRE 591063-651X/99/59~4!/4691~4!/$15.00n.res-q.-llx̃2~k11!5 f̃ 2„x1~k!,x̃2~k!… ~5!such that Eq.~4! holds, whatever initial conditions Eqs.~1!,~2!, and~5! have. Although Eq.~5! supports the idea of usingthe copy@Eq. ~3!# for Eq. ~2!, there are many systems fowhich Eq.~4! will not be met, no matter howf̃ 2 in Eq. ~5! ischosen.There is, however, a natural generalization of Eq.~5!that consists of exploiting the drive signalx1(k) andx1(k21),...,x1(k2N) at each time instantk. Thus, as re-ceiver dynamics, we use the following system:x̃~k11!5 f̃ „x̃~k!,x1~k!,...,x1~k2N!…. ~6!Here x̃(•) is n dimensional, andf̃ (•,•) andN are such thatlimk→`ix~k!2 x̃~k!u50. ~7!The receiver@Eq. ~6!# acts as an ‘‘extended’’ observer for thsystem of equations~1! and ~2! in that past values of thedrive signalx1(•) are also used. It turns out that under fairweak conditions receiver dynamics@Eq. ~6!# exist such thatthe transmitter@Eqs.~1! and~2!# and Eq.~6! synchronize; seeSec. II. Actually, the necessary conditions involved aclosely related toglobal observability, cf. Ref. @5# or theTakens-Aeyels-Sauer reconstruction theorem~see Refs.@6–9,3#!. However, a crucial difference in our work with threconstruction theorem is that Eq.~6! forms adynamic‘‘in-version’’ for the statex(•), whereas in the reconstructiotheorem one computes the state at some time instant byverting the observability map, which determinesx2(k) fromx1(k),...,x1(k2N). It is interesting to note that an alternative using look-up tables for this procedure was proposedRef. @10#.The proposed transmitter and receiver synchronizausing a receiver of form~6! can be demonstrated numericalon several examples from the literature; see, e4691 ©1999 The American Physical Societys,sobobTonserita-ecaete--er4692 PRE 59BRIEF REPORTSRefs.@11,12#. In this paper, we will consider, among otherthe example from Ref.@11#. The organization of this paper ias follows. In Sec. II we present a design procedure forserver dynamics@Eq. ~6!#, whereN5n21. Section III pre-sents numerical simulations of some synchronization prlems where an observer presented in Sec. II is used.paper ends with some concluding remarks.II. OBSERVER DESIGNIn this section, we focus on an observer design for nlinear, discrete-time, autonomous, single output systemthe formsx~k11!5 f „x~k!…, y~k!5h„x~k!… ~8!for k50,1,2, . . . , wherex(•) is a vector of dimension andy(•) is a scalar. Assuming that the Jacobian ofh isnonzero—which implies that a nontrivial signal from thdynamics is transmitted—we can, at least locally, rewEq. ~8! in a form like Eqs.~1! and ~2!, with y(k)5x1(k)being one dimensional. Within the context of synchroniztion, it is desired to reconstruct~asymptotically! the(n21)-dimensionalx2(•) on the basis of the sequencex1(k)(k51,2, . . . ). Wewill do this using a suitably selected dynamics of form~6!, which basically means that we treat thsynchronization problem as a sort of observer problem;Ref. @4#. Without loss of generality we can assume thf (0)50 andh(0)50.For Eq. ~8! we define the so-calledobservability mapcbyc~x!ªF h~x!h+ f ~x!]h+ f n21~x!G , ~9!whereh+ f (x)ªh„f (x)…, f 1ª f , and f jª f + f j 21. System~8!is calledstrongly locally observablearoundx50 if the Jaco-bian (]c/]x)(0) is invertible.We now sketch a procedure to derive two different typof observers for the strongly locally observable system~8!.This procedure was proposed in Refs.@13,14#, and representsan extension of Refs.@15,16#. For clarity of presentation, wewill restrict ourselves to the case in whichn53. Extensionsto other cases are straightforward.Thus we consider a strongly locally observable system~8!with n53, and definesi(x)ªh+ fi 21(x) ( i 51, 2, and 3!.Since Eq. ~8! is strongly locally accessible, s5col(s1 ,s2 ,s3) forms a new set of coordinates for Eq.~8!aroundx50. In what follows, we will assume throughouthat s forms a new set of coordinatesglobally, i.e., c in Eq.~9! is a global diffeomorphism onRn. It is straightforwardlychecked that in these new coordinates the system~8! takesthe forms~k11!5F s2~k!s3~k!f s„s~k!…G , y~k!5s1~k!, ~10!where f s(s)ªh+ f3„c21(s)…. Next, define--he-ofe-f.tsz3~k!ªs1~k!,z2~k!ªs2~k!2 f s„y~k22!,y~k21!,s1~k!…,z3~k!ªs3~k!2 f s„y~k21!,s1~k!,s2~k!…. ~11!It then follows from Eqs. ~10! and ~11! that z5col(z1 ,z2 ,z3) satisfiesz~k11!5F 010001000Gz~k!1F 00f s„y~k22!,y~k21!,y~k!…G ,y~k!5z3~k! ~12!@where the first matrix isE and the second isF„y(k22),y(k21),y(k)…#.An observer of type 1now has the formẑ~k11!5Eẑ~k!1F„y~k22!,y~k21!,y~k!…1F q0q1q2G @y~k!2 ŷ~k!#,ŷ~k!5 ẑ3~k!, k>2, ~13!whereq0 , q1 , andq2 are still to be determined. Defining therror signaleª ẑ2z, we obtain the error dynamicse~k11!5F 0100012q02q12q2Ge~k!, ~14!where the matrix is represented byA. The characteristicpolynomial pA(l) of A is given by pA(l)5l31q2l21q1l1q0 . Choosingq0 , q1 , andq2 in such a way that alleigenvalues ofA are located within the unit circle, the observer errore(k) vanishes fork→`, and condition~7! ismet. From this it follows that the dynamics~13! initialized atan arbitrary pointẑ(0) will asymptotically ~even exponen-tially! match the transmitter dynamics~12!. Therefore, thereceiver dynamics~13! which is fed with the buffered transmitted signal „y(k22),y(k21),y(k)…, synchronizes withEq. ~12!.The derivation of anobserver of type 2starts from theobservation that the solutions of Eq.~12! satisfy z1(k)5z2(k)50 for k>2. This suggests that one should considan observer of the formẑ~k11!5F„y~k22!,y~k21!,y~k!…1F l1ẑ1~k!l2ẑ2~k!l3„ŷ~k!2y~k!…G ,ŷ~k!5 ẑ3~k!, k>2. ~15!Again defining the error signaleª ẑ2z, we now obtain theerror dynamicse~k11!5F l1000l2000l3Ge~ k̇! ~16!-havecoanorncerittas-imeves-b.raedeessd 4.-rverasallyhem isob-nceRef.noronsiza-gyPRE 59 4693BRIEF REPORTSfor k>2. The convergence rate of thei th component cannow be assigned byl i , without affecting the other components. As was the case with observer 1, here we againthat the receiver dynamics~15!, which is fed with the buff-ered transmitted signal„y(k22),y(k21),y(k)…, synchro-nizes with Eq.~12!.Comparing both observer types, we see that the congence rate of each of the components of observer type 2be assigned independently, while this is not the case forserver type 1. Thus observer type 2 will give a better trsient behavior than observer type 1. On the other hand, hever, observer type 1 with properly chosenq0 , q1 , andq2 isin general more robust to~measurement! noise than observetype 2; cf. Refs.@13,14#.III. EXAMPLESAs an example, consider the transmitter systemsx1~k11!5m~12e!x1~k!@12x1~k!#1ex2~k!,x2~k11!5m~12e!x2~k!@12x2~k!#1ex1~k! ~17!presented in Ref.@11#. Taking x1(k) as the drive signal (m5 l 51), Badola, Tambe, and Kulkarni investigated the sychronization ofx2(k) and the receiver signalx3(k) of whichthe dynamics were taken asx3~k11!5m~12e!x3~k!@12x3~k!#1ex1~k!. ~18!@In Ref. @11#, x2(k) is considered as the drive signal. Sinthe coupled system given by Eq.~17! is symmetric, we canexchangex1(k) andx2(k)#. Our aim is to apply an observepresented in Sec. II as the receiver dynamics for transm~17!. With y(k)5x1(k), it is possible to design observersin Sec. II in order to obtain the estimatesx̂1(k) andx̂2(k) forthe signalsx1(k) and x2(k). The resulting observer equations are omitted for reasons of space. For subsequent slations, the initial conditionsx1(0)50.2, x2(0)50.4, andx̂1(0)5 x̂2(0)50.7 and parametersm53.7 ande50.09 wereused. Following Ref.@11#, x2(k) andx3(k) do not synchro-nize for these parameters andx3(0)5 x̂2(0)50.7, while theobservers obtained here show satisfactory behavior. Explary simulations of the observer errors applying obsertypes 1 and 2 can be seen in Figs. 1 and 2 forl15l250.5~for observer type 1, this corresponds to the choicesq050.25 andq1521). Both observers provide very good etimations after 20 iterations with a maximum absolute oserver error less than 0.002. As already mentioned in Secobserver type 2 shows smaller observer errors during tsient time than observer type 1.As a second example, we want to extend system~17! tothe third order transmitter systemx1~k11!5m~12e!x1~k!@12x1~k!#1ex2~k!,x2~k11!5m~12e!x2~k!@12x2~k!#1ex3~k!,x3~k11!5m~12e!x3~k!@12x3~k!#1ex1~k!, ~19!with the drive signaly(k)5x1(k) (m51, l 52). In this case,observing the unknown signalsx2(k) andx3(k) is more dif-ficult becausex3(k) does not directly influence the measurver-anb--w--eru-m-r-II,n-drive signalx1(k), but only viax2(k). For this reason, thecoupling parametere was increased up to 0.35 while thsecond parameterm53.7 was not changed. Forx1(0)50.2,x2(0)50.4, x3(0)50.6, andx̂i(0)50.7, i 51, 2, and 3, andeigenvalues of the observer error dynamicsl i50.5, i 51, 2,and 3 ~for observer type 1, this corresponds to the choicq0520.125,q150.75, andq2521.5), the observer errorapplying observer types 1 and 2 are shown in Figs. 3 anIt can be seen thatue3(k)u reaches very high values~up to7500 with observer type 1! during the transient time. Nevertheless, after 20 iterations the maximum absolute obseerror is less than 0.007.The examples show the efficiency of observers takenreceiver dynamics in synchronization problems, especiwhen taking into consideration that synchronization of ttransmitter system and observer is guaranteed if the systeglobally observable. Moreover, the eigenvalues of theserver error dynamics, and consequently the convergerate, are selectable. For synchronization as presented in@11#, one is neither able to guarantee synchronizationable to influence the number of steps until synchronizatioccurs.IV. CONCLUDING REMARKSWe have presented a control perspective on synchrontion of discrete-time transmitter systems. The methodoloFIG. 1. Observer errorsei(k)5 x̂i(k)2xi(k) ( i 51 and 2! forsystem~17! and observer type 1@Eq. ~13!#.FIG. 2. Observer errorsei(k)5 x̂i(k)2xi(k) ( i 51 and 2! forsystem~17! and observer type 2@Eq. ~15!#.seoaerfdonemhaso-theasetheny4694 PRE 59BRIEF REPORTSof designing an observer as the receiver system enableexponential synchronization of the transmitter and receivand does not require any condition on conditional Lyapunexponents, as is often the case when identical transmitterreceiver systems are used. Essentially, the observer schthat is used in this paper exploits the lastn21 measurementsof the drive signaly(k),y(k21),...,y(k2n11) at eachtime instantk, with n being the dimension of the transmittedynamics, and can be viewed as a dynamic mechanismthe ~Takens-Aeyels-Sauer! econstruction theorem, providethe system satisfies a global observability condition. CFIG. 3. Observer errorsei(k)5 x̂i(k)2xi(k) ( i 51, 2, and 3! forsystem~19! and observer type 1@Eq. ~13!#.et.,rether,vndmeor-trary to Ref. @11#, our results are valid no matter how thinitial conditions are chosen.The observer viewpoint on the synchronization problehas also been advocated for continuous time systems~seeRef. @4#!, but the scheme we used here in discrete timeno direct analog in continuous time. An obvious way to prceed in continuous time, therefore, could exist in a~f st!sampling of the continuous time transmitter and thendesign of a discrete-time observer as receiver. In that cthe synchronization error becomes small—depending onsampling time—but not identically zero. However, in maapplications this will not be a big problem.FIG. 4. Observer errorsei(k)5 x̂i(k)2xi(k) ( i 51, 2, and 3! forsystem~19! and observer type 2@Eq. ~15!#..ol.on-@1# L. M. Pecora and T. L. Carroll, Phys. Rev. Lett.69, 821~1990!.@2# K. M. Cuomo and A. V. Oppenheim, Phys. Rev. Lett.71, 65~1993!.@3# Coping with Chaos, edited by E. Ott, T. Sauer, and J. A. York~Wiley, New York, 1994!.@4# H. Nijmeijer and I. M. Y. Mareels, IEEE Trans. Circuits SysI: Fundam. Theory Appl.44, 882 ~1997!.@5# H. Nijmeijer, Int. J. Control36, 867 ~1982!.@6# F. Takens, inDynamical Systems and Turbulence, edited by D.A. Rand and L. S. Young~Springer-Verlag, Berlin, 1981!.@7# D. Aeyels, SIAM J. Control Optim.19, 595 ~1981!.@8# D. Aeyels, Syst. Control Lett.1, 92 ~1981!.@9# T. Sauer, inTime Series Prediction: Forecasting the Futuand Understanding the Past, edited by A. S. Weigend and NA. Gershenfield~Addison-Wesley, Reading, MA, 1993!.@10# P. E. Moraal and J. W. Grizzle, IEEE Trans. Autom. Contr40, 395 ~1995!.@11# P. Badola, S. S. Tamble, and B. D. Kulkarni, Phys. Rev. A46,6735 ~1992!.@12# M. Loecher, and E. R. Hunt, Phys. Rev. Lett.79, 63 ~1997!.@13# T. Lilge, Eur. J. Control4, 306 ~1998!.@14# T. Lilge, in Proceedings of the 1997 COSY Workshop on Ctrol of Nonlinear and Uncertain Systems, edited by A. Isidoriand F. Allgöwer ~ETH, Zürich, 1997!, p. 202.@15# M. Brodmann, Beobachterentwurf fu¨r Nichtlineare Zeitdis-krete Systeme~VDI-Verlag, Düsseldorf, 1994!.@16# W. Lin and C. I. Byrnes, Syst. Control Lett.25, 31 ~1995!.

Huijberts, HJC Henri

Lilge, T

Nijmeijer, H Henk

Repository TU/e

A control perspective on synchronization and the Takens-Aeyels-Sauer reconstruction theorem

NARCIS 

The Power of Online Learning in Stochastic Network Optimization

The Power of Online Learning in Stochastic Network Optimization

Abstract

Similar works

Full text

Available Versions

MUCC (Crossref)

CiteSeerX

Pure OAI Repository

University of Twente Research Information

Pure OAI Repository

Repository TU/e

University of Twente Research Information

NARCIS

NARCIS