Abstract-In this brief, we propose a discrete-time framework for the modeling and studying of all-digital phase-locked loop (ADPLL) networks with applications in clock-generating systems. The framework is based on a set of nonlinear stochastic iterating maps and allows us to study a distributed ADPLL network of arbitrary topology. We determine the optimal set of control parameters for the reliable synchronous clocking regime, taking into account the intrinsic noise from both local and reference oscillators. The simulation results demonstrate very good agreement with experimental measurements of a 65-nm CMOS ADPLL network. This brief shows that an ADPLL network can be synchronized both in frequency and phase. We show that for a large Cartesian network the average network jitter increases insignificantly with the size of the system.
I. INTRODUCTION
A PHASE-Locked Loop (PLL) network is a spatially distributed array of controlled oscillators interacting with each other through the exchange of error signals. It has been shown in studies [1] - [6] that networks can be used for generating a distributed clock signal in Systems-on-a-Chip (SoCs). This approach has become immensely important as the complexity of microchips and microsystems has increased. Thus supplying them with a coherent clock signal, using conventional clock tree routing, has been proven to be less efficient. The major reason for this is that numerous buffers in clock trees consume significant energy and have propagation delays due to internal relaxation processes. This results in signal distortions with skew and jitter, which are the major parasitic effects in clock generating networks.
In the most common case, each oscillator (or PLL) of a network can be allocated as a node on a Cartesian grid as shown in Fig. 1(a) . Every oscillator has conventional components such as a voltage (or digitally) controlled oscillator (VCO or DCO respectively) that generates a signal with a given frequency, a control block (also known as a loop filter) Manuscript that provides a control signal for the controlled generator and a phase-frequency detector (PFD) that produces an error signal for the control block. The number of PFDs for each oscillator in a network is equal to the number of its neighbours. It has been shown in [3] that when at least one node in the network is connected to an external reference signal, the network can be synchronized with this signal both in frequency and phase. An ADPLL network has some common features with pulsecoupled oscillators for wireless sensor networks. Both types of networks can be seen as event-driven networks with promising potential for Internet of Things applications [7] , [8] . In a simple case, pulse-coupled networks consist of arrays of distributed phase oscillators having identical natural frequencies. The interaction between the oscillators occurs due to exchanging pulses at firing time instances, i.e., when the phase of an oscillator is a multiple of 2π . At the same time, the phase of other topologically interconnected oscillators instantaneously changes by a value determined by the phase response function. If the natural frequencies are non-identical, their phases may not be synchronized and additional optimisation techniques are required [9] . In ADPLL networks, the phase of each oscillator is a continuous function of time, but its frequency changes instantaneously at the beginning of a new clocking cycle with respect to the input control code.
ADPLLs are complex systems and the conventional method for their analysis is behavioural modelling using MATLAB Simulink or hardware/mixed-signal description languages. To date there are only a few analytical models that have been suggested: discrete-time models [4] , [10] , [11] and pulse-event driven models for synchronization in wireless networks [7] , [12] - [14] .
In this brief, we expand our model to the case of ADPLL networks having arbitrary topology. Based on this model, we investigate the dynamics of an ADPLL network by varying the parameters of the system. The validation of the mathematical model is carried out through a number of experiments made on chip that has been proposed in [15] .
This brief is organised as follows. Section II describes an ADPLL network in general. The description is extended in Section III where we use a graph and adjacency matrix to define the topology of a network. The experimental verification is given in Section IV, and key results obtained from the model are presented in Section V.
II. STATEMENT OF THE PROBLEM An ADPLL network is an array of DCOs topologically connected with each other via digital time detectors (DTDs) as shown in Fig. 1(a) . In this brief we describe a DCO as a single block DCO and its divider. The external reference signal (its period is T R , frequency is F R and T R = 1/F R ) is a part of the network and represented by a reference oscillator (see oscillator 1 in Fig. 1(a) ).
The network is described in terms of a directed graph G = (V, E) with vertices V and edges E. Each oscillator is located at a vertex of the graph, and we number the oscillators from 1 to N DCO + 1, where N DCO is the number of DCOs in the network. Every detector in the network we associate with an edge of the graph and number the edges from 1 to N DTD , where N DTD is the number of detectors in the network. We will then refer to a detector using the index k. For example, the detector DTD[k] connects two vertices (oscillators) numbered i and j.
A DTD has two inputs for the comparison of the time interval between clocking events of oscillators it connected to. We assume that the signal coming to DTD[k] from the oscillator whose number is minimal in the pair (i, j) is the reference (with respect to the k th detector) while the signal coming from the other oscillator is local. Every detector updates its state (intrinsic set of variables such as memory and error values) only when a rising edge of either the reference or local signal arrives at its input. At DTD[k] the time intervals to the closest local and reference events are denoted as L k and R k respectively. The sets of local and reference time events for all time detectors k = 1, . . . , N DTD are presented as L and R.
The output of a DTD represents the timing error ε k (also called the error signal in this brief) that characterises the mismatch in the DCO clocking events of adjacent oscillators, as shown in Fig. 1(b) . The error signal ε k controls the state of the local oscillator (see the red arrow in Fig. 1(a) ). The same error but with the opposite sign, −ε k , controls the reference oscillators (see the blue arrow in Fig. 1(a) ).
The timing error from several DTDs is averaged, and control of a DCO is done by passing the averaged timing error E j through a PI controller, with integral and proportional gains K i and K p respectively. As a result, we obtain the control code
where j is the accumulated error.
We employ a DTD introduced in [4] which consists of the combination of a finite-state machine and time-to-digital converter. It transforms the time differenceτ k between the rising edges of the reference and local signals into the error τ k H − → ε k represented as an integer number in the range [−7; −1] ∪ [1; 7] , where H is a quantization function of the DTD. A DCO generates a rectangular clock signal with the frequency f j = f 0 + f v j , where f 0 is the initial frequency and f is the frequency gain. The frequency lies within a range bounded by minimum and maximum values. The range depends on the integrator bit resolution and the frequency gain. If the control block is unable to provide a frequency value beyond the control range, it resets its integrator to zero and sets the DCO frequency to its initial value. The model of such an ADPLL was developed in [16] . It describes self-sampling, event-based frequency tuning, quantization of control signals and noise effects. We will use this model here, keeping in mind that the main focus of this brief is the investigation of the network. An interested reader can find the equations in the above reference.
III. DEFINING THE TOPOLOGY OF AN ADPLL
NETWORK AND SIMULATION ALGORITHM In this section we will represent the graph of an ADPLL network graph through an adjacency matrix A, where A i,j = 1 if the i th DCO affects the j th DCO, otherwise A i,j = 0. In our case the first column is all-zero due to the presence of an external reference signal, whilst for a purely bi-directional graph, the matrix is symmetric. Each DCO does not interact with itself, therefore, all diagonal elements of the matrix are equal to zero.
Having determined the adjacency matrix, we define a set D of DTDs in a straightforward manner. For every nonzero element in A we create a 2-tuple with corresponding matrix indexes sorted in ascending order. Next, we create the array D based on the tuples excluding repeated elements. The length of D is equal to the number of DTDs in the network N DTD . Each
According to the definition of the reference and local oscillators in Section II, the first element in D k corresponds to the index of its reference oscillator while the second element corresponds to the local oscillator. Now we assign the matrix W of weight coefficients shown in Fig. 1(b) . Here we assume that each DCO is tuned by the average error from its neighbours. The total number of neighbours for the j th oscillator is determined by the adja-
A i,j . Thus, the weight matrix is introduced as follows: if g j = 0 ⇒W i,j = A i,j /g j , otherwise W i,j = 0. Note that as it is costly to divide by 3 in digital implementation, the prototype presented in this brief uses division by 4.
Below, we describe the structural algorithm for the time evolution of the ADPLL network, where the average error E i that is supplied to the PI controller of a DCO in the network is calculated using Algorithm 1.
1) Generate an ADPLL network with a set of given initial conditions and system parameters. 
2) Find the minimum time to the next clocking event among all detectors: t min = min(R ∪ L). 3) Make time shift for all detectors: 
If new value appears to be outside generating range then reset its value to the initial DCO frequency f 0 and set integrator to zero i = 0. c) Generate frequency jitter by multiplying it with a log-normal random number:
Here N(0, σ ) is the Gaussian normal distribution with zero average and standard deviation σ . d) Update clocking time for detectors connected to selected oscillators:
Update state of all detectors and simulating time. 8) Go to step 2 while current simulation time is less than the target value. In this brief we assumed that all control and model parameters such as K p , K i , σ , f 0 , f , etc., are the same for every DCO.
IV. EXPERIMENTAL VALIDATION AND
COMPARISON WITH THE MODEL The aim of the experiment is to validate the framework proposed in Section III.
The validation consists of experimentally measured transient responses to a periodical frequency change of the reference signal in an ADPLL network. The experiment has been performed using the prototype of a 4 × 4 (can be reconfigured as 2×2) ADPLL network fabricated in 65nm CMOS technology. The architecture of the microchip is described in [5] . Fig. 1(a) . The simulated dynamics is shown by red. Because all the DCOs are synchronized, their waveforms overlap and cannot be clearly distinguished in the figure. The reference frequency is switched between 1000/6 MHz and 1000/6.5 MHz with 50% duty cycle in cases (a) and (b) for a 2 × 2 network, then between 950/6 MHz and 950/6.5 MHz with 50% duty cycle in case (c) for a 2 × 2 network and then between 1000/7 MHz and 1000/6.5 MHz with 75%-25% duty cycle in case (c) for a 4 × 4 network.
In order to observe a transient response, the reference signal generator was programmed to produce a rectangular signal, whose frequency was swapping between two values. This was achieved by the periodic changing of the generator's division coefficient (with period ≈ 17 μs) between two values chosen from the set {6, 6.5, 7}. The main frequency of the generator in the experiments (before applying the division coefficient) was either 1000 MHz or 950 MHz.
After each switching, the reference frequency remains constant for enough time to observe the frequency acquisition by the network. The frequency acquisition process is limited by the switching period (and does not always settle to a synchronous mode). The transient process lasts till the next switching. Because we apply a periodic signal of the modulated reference frequency, we observe a sequence of repeating similar transient waveforms.
This setup was designed to verify the framework from Section III for modelling the ADPLL network. The results are shown in Fig. 2 The parameters of the network were taken from the experiment in Fig. 2(e) .
quickly, just after the network is switched on. After that, the network behaves as a single oscillator whose frequency moves slowly toward the reference frequency.
We note that Fig. 2 (a)- Fig. 2(c) display the results for a 2 × 2 ADPLL network, and Fig. 2(d) shows the result for a 4 × 4 ADPLL network. Because its behaviour is qualitatively the same as in the 2 × 2 network case, we have chosen one experiment to demonstrate frequency dynamics. Also in this case we applied a "non-symmetrical" duty cycle of reference frequency modulation (75% / 25%) to prove that the model will capture the behaviour of the network even in this cases.
The comparison presented Fig. 2 allows us to conclude that the model is verified and describes various dynamics, transient processes and synchronized modes. It is also extremely accurate in the prediction of the frequency acquisition rate F . There is indeed a difference between the envelope width in steady state. This is clearly due to larger supply noise which negatively affects the jitter measured from the chip (since the output pad buffers are also supplied by noisy V DD ). The contribution of V DD is difficult to quantify precisely, given the complexity of the microchip. However, the fact that the model predicts consistent dynamical behaviour in transient mode is an indication of its correctness.
In addition, the figure gives rise to the hypothesis that the frequency acquisition rate likely does not depend on K p , and it decreases with the size of the network if other conditions remain the same. In the next section, we use our framework to show that this is indeed true. Moreover, we show the parameters domain where synchronization is possible and suggest how the overall network performance could be optimised.
V. PERFORMANCE OF THE NETWORK
In this section we consider dynamics of ADPLL networks based on the framework proposed in Section III. The simulation parameters correspond to the experimental setup described in Section IV. We note that these parameters have not been chosen through a fitting, but are taken "as is" from the experiment. Fig. 3 shows a typical synchronization pattern that appears in ADPLL networks given the proper choice of control parameters, such as K p and K i . From the picture all basic dynamical features of a PLL are clearly seen: phase-frequency acquisition, phase locking and phase tracking. We can see that for random phase and frequency initial conditions all DCOs in the network synchronize between themselves, and later they synchronise with the external reference signal both in frequency and phase.
In order to quantify the synchronization in ADPLL networks we shall use the averaged network jitter | τ | as the measure of synchronisity:
This measure represents the average through all the detectors and time errorτ k (t). It is equal to zero when the network operates synchronously and all rising edges come at the same time. In addition to this, we shall consider the relative average jitter that is normalised to the reference period: <| τ |>/T R · 100%.
Frequency acquisition: Fig. 4(a) shows the dependence of the frequency acquisition rate F on the number of DCOs in a network. Here, we have neglected the intrinsic cycle jitter of the reference signal and DCOs. We set the PI control parameters to be K p = 2 and K i = 0.2 (as in the experiment in Fig. 2(b) to ensure a synchronization in the network). We can see that F decreases following a power law as the number of DCOs in the network increases. We shall give a brief intuitive interpretation of this phenomenon, but a more precise explanation is an issue for future study.
When the frequencies of all DCOs significantly differ from the reference frequency, the average frequency acquisition rate is predominantly defined by the error signal between the reference clock and the first oscillator. As we add more oscillators to a network, this "driving" signal propagates within the whole network with a finite speed that depends on the DCO frequencies. For larger networks, it takes more time for the signal to propagate from the reference clock to the network's edges. Therefore, the average frequency acquisition rate decreases. 5 . The dependence of relative network jitter on the PI control parameters for 4 × 4 ADPLL network in a steady-state. The DCOs' intrinsic jitter is set to be 0.1%, and F R = 167 MHz. The orange line shows the optimal set of K p that minimises the average network jitter for a given K i . The dashed area corresponds to ≈ 0.5% network jitter. The plane contains 400 × 400 points. Fig. 4(b) shows the dependence of the frequency acquisition rate on the integral gain factor for different proportional gain values. We can see that for small K i (K i < 0.5 K p ) F depends on K i linearly, and does not depend on K p .
Network jitter: Fig. 4(c) shows the relative average jitter in a square Cartesian network. The intrinsic cycle jitter of both reference and local oscillators is set to zero and PI control parameters are K p = 2 and K i = 0.2. We can see that |τ | changes insignificantly when the DCOs number is large, therefore, the network tends to be stable when the number of oscillators increases. This behaviour is similar to the dynamics of a network of noisy Kuramoto-like oscillators, where, under certain conditions, the order parameter stays still close to unity even though the number of oscillators approaches to infinity. Fig. 4(d) shows the dependence of network jitter on the intrinsic noise of DCOs in a steady state. We can see that the dependence tends to be the same for any large network. The detailed analysis of the asymptotic behaviour of large ADPLL networks could be an interesting theoretical challenge for a future work.
Plane of parameters: Fig. 5 shows the relative averaged network jitter in the plane spanned by the PI control parameters. From this figure, we conclude that the system experiences dynamical instability when the integral gain K i exceeds ≈ 0.5K p . We can see that in this case, the relative network jitter equals ≈ 50%. This can be the case when all rising edges of DCOs in the network are independently and uniformly distributed in time within one period of the reference clock being desynchronized in phase. Moreover, for every K i there is a value K p where |τ | is minimal. Due to the fact that the frequency acquisition rate is proportional to K i (see Fig. 4(b) ), it appears to be possible to choose an optimal proportional gain K p that minimises the network jitter for given frequency acquisition rate.
VI. CONCLUSION Using the analysis developed in this brief we have proved the feasibility of ADPLL networks to generate a synchronous clocking signal for parallel computations, for example, in application to ASIC design. We have shown that the network synchronizes in phase and frequency for a given set of control parameters. No mode-locking was observed during experiments or numerical simulations. We have highlighted the possibility to minimise network jitter for a given frequency acquisition rate by the optimal choice of the proportional gain factor. We have shown that for large networks the average jitter increases insignificantly with respect to the oscillator's number. However, the frequency acquisition rate decreases in accordance to the power law. Finally, due to the discreet nature of the model, it works faster than existing behavioral models.
