Wideband spectrum sensing for cognitive radios requires very demanding analog-to-digital conversion (ADC) speed and dynamic range. In this paper, a mixed-signal parallel compressive sensing architecture is developed to realize wideband spectrum sensing for cognitive radios at sub-Nqyuist rates by exploiting the sparsity in current frequency usage. Overlapping windowed integrators are used for analog basis expansion, that provides flexible filter nulls for clock leakage spur rejection. A low-speed experimental system, built with off-the-shelf components, is presented. The impact of circuit nonidealities is considered in detail, providing insight for a future integrated circuit implementation.
Introduction
Cognitive Radio (CR), first proposed in [1] , provides a new paradigm to improve spectrum efficiency by enabling Dynamic Spectrum Access (DSA). In CR, spectrum holes that are unoccupied by primary users can be assigned to appropriate secondary users as long as the interference introduced by secondary users is not harmful to the primary users [2] [3] [4] . The design of cognitive radio networks is a complicated cross-layer procedure [5] . In this paper, we focus on the spectrum sensing problem in CR, in which sensing and detection of primary users is done in order to realize Dynamic Spectrum Access.
Spectrum sensing can be a very challenging task for CR due to many factors. First, for the sake of improving the frequency usage efficiency, the sensing bandwidth for CR can expand from hundreds of MHz to several GHz. Second, the sensing radio should be able to detect very weak primary users, which arise due to fading and the hidden terminal problem [5] . With traditional time-domain Nyquist sampling, sensors are needed with both wide bandwidth and high dynamic range, stressing technology, and demanding higher power [6, 7] . Conventional wideband sensing with a highspeed and high-resolution ADC becomes less appealing as the bandwidth becomes significant. Alternative approaches, such as a fixed bank of analog filters followed by parallel ADCs, impose strict requirements on the filter design.
It has been observed that today's spectrum usage presents some sparsity in the sense that only a small portion of the available frequency bands are heavily loaded while others are partially or rarely occupied [5] . This frequency usage sparsity can be exploited under the framework of Compressed Sensing (CS) [8, 9] to effectively reduce the sampling rate. The sparse signal can be captured via projection over a random basis that is incoherent with respect to the signal basis, and perfect signal reconstruction from these projections can be obtained with high probability, where the number of random projections is on the order of the signal's information rate rather than the Nyquist rate.
The idea of applying CS for wideband spectrum sensing was reported, for example, in [10] . However, this approach assumes full-rate analog-to-digital conversion which does not reduce the complexity of the spectrum sensing receiver. We have proposed a mixed-signal parallel segmented compressive sensing (PSCS) architecture for wideband spectrum sensing [11] , where the high-speed ADCs were avoided by 2 International Journal of Digital Multimedia Broadcasting carrying out an analog basis expansion in parallel before sampling. In this paper we elaborate on the idea of applying the PSCS front-end [11] , with special emphasis on implementation issues such as spurious frequency tones, timing, and other mismatches. First, we show that the proposed overlapping windowed integration in the PSCS architecture provides a scheme to mitigate the spurs due to clock leakage by setting the lowpass filter nulls flexibly, which is favorable for practical implementation. Second, a low-speed prototype built with off-the-shelf components is presented in detail from the overall system configuration to building blocks, in which practical constraining issues are addressed.
The remainder of the paper is organized as follows. A brief background on CS is provided in Section 2 and the spectral occupancy signal modeling is given in Section 3. Section 4 introduces the mixed-signal parallel compressive spectrum sensing scheme. Section 5 discusses the spurious frequency rejection schemes in the PSCS front-end. A lowspeed prototype is introduced in Section 6. Conclusions are made in Section 7. According to CS theory, a signal that is sparse or compressible over a known basis Ψ can be sampled and reconstructed at sub-Nyquist rate, and the sampling rate reduction depends on the signal's sparsity and the reconstruction algorithms. Specifically, the sub-Nyquist rate sampling is achieved by projecting the signal into a transform-domain over which the sampling operation occurs, which is different from the traditional way of sampling the signal in the timedomain. Mathematically, this procedure can be described as y = ΦΨa, where y are the collected samples, and Φ is incoherent with Ψ which is the basis for the transformdomain. The reconstruction of the original signal relies on the estimation of the coefficients a, which is obtained by solving the following l 1 -norm optimization problems, for which many convex optimization techniques or iterative greedy algorithms can be used:
Compressive Sensing Background

A signal r(t) that is spanned by S basis functions
(i) noiseless case:
(ii) noisy case:
where is the error due to the noise.
Note that, in this paper, we generally do not differentiate between sparse and compressible unless specifically noted. Figure 1 : Illustration of the multiband analog signal to the sensing radio.
Signal Modeling
The received signal r(t) is a modeled as a multiband analog signal whose spectrum is illustrated in Figure 1 . Specifically, we assume that r(t), with a frequency span from f l to f h , is the superposition of primary users, perhaps using W different wireless standards [5] . Each wireless standard occupies a certain finite frequency band which consists of multiple channels. According to the measurements done by FCC in the US [12] , in many cases the current frequency usage exhibits sparsity because only a part of the allocated channels is utilized at a given time.
Without loss of generality, we assume that r(t) is bandlimited to [0, f h ]; so r(t) can be written as
where R( f ) is the Fourier transform of r(t). The continuous-time analog signal can be captured with a finite dimensional model; for example, see [13, 14] . We directly approximate r(t) with a model of finite dimension as follows:
where Δ f = 1/T s is the resolution on the frequency axis and where n(t) is additive white Gaussian noise (AWGN),
, and a has only K S nonzero elements. Since Δ f is a scalar, for simplicity, we discard it in the rest of the paper. The spectrum hole detection, for example, energy or feature detection, is usually based on the observed signal spectrum R(sΔ f ), or equivalently, the estimation of the coefficients a s .
Wideband Parallel Compressive Spectrum Sensing
Wideband spectrum sensing is composed of several crucial steps: first, spectrum estimation; second, calculate the sufficient statistics, during which digital signal processing is needed to improve the front-end sensing sensitivity by processing gain and identification of the primary users based on knowledge of the signal characteristics [5] ; last, to decide whether there exist primary users based on the sufficient statistics. Here we focus on the wideband spectrum estimation step, that is, estimating the unknown coefficients a in (5).
Mixed-Signal Compressive Sensing Architecture.
The parallel segmented compressive sensing (PSCS) structure is shown in Figure 2 , which we first proposed in [11] . For the completeness of this paper, in this section we recap how the analog compressive sensing at sub-Nyquist rate is realized via the PSCS architecture.
In the PSCS architecture, the input signal r(t) is sent to N parallel paths. In the n th path, r(t) is mixed with a random basis function Φ n (t). A good choice for the random basis is to use PN (Pseudonoise) sequences because they can be conveniently generated by digital logic circuits. The output of the mixer is then sent to a sliding window with a width of T c and integrated. Two adjacent windows have an overlapping time T c − T m , which defines an overlapping percentage OVR = (T c − T m )/T c , as shown in Figure 3 . The output of the integrators is sampled and M samples are collected at each path. The m th sample of the n th branch is given by
There are a total of L = MN samples collected every T s seconds and these samples are organized into a vector as follows:
where
T is the vector consisting of the m th samples from all N branches.
Similarly, we can calculate the reconstruction matrix V = ΦΨ = {v i, j } L×S . The element at the mN + n row and the s column is given by
Therefore, we have y = Va. Then, we can estimate a by solving the problem in (1) and reconstruct the original signal using r = Ψ a.
A Wideband Spectrum Sensing Example.
To show the effectiveness of the proposed wideband PSCS architecture we present a simulation, where the input signal is modeled as a frequency-domain sparse multi-carrier signal as given in (5) . The mixed-signal compressive sensing based on the PSCS architecture given in Figure 2 is used for spectrum From top to bottom, the four plots represent the transmitted signal by primary users, the received primary users' signal at the sensing radio, the reconstructed signal from the time-domain samples via the Nyquist rate ADC, and the reconstructed signal from the transform-domain samples via mixed-CS at an NSR of 0.32. estimation. The sampling rate reduction is measured by the Normalized Sampling Rate (NSR), which is defined as
where f CS is the sampling rate required using the PSCS and f Nq is the corresponding Nyquist sampling rate. The signal reconstruction quality is evaluated by the normalized Mean Square Error (MSE), which is equal to
In the simulation, the input signal to the PSCS architecture is assumed to be a 17-sparse frequencydomain multi-carrier signal with 128 subcarriers, that is, The location of the spurious leakage frequency relative to the filter nulls with different overlapping ratio. With OVR = 0, the strongest clock leakage is close to the peak of the filter's 3rd sidelobe; with OVR = 0.1125, the strongest clock leakage is on the 4th null of the filter.
based on mixed-signal PSCS is more robust against noise than the traditional digital approach based on the DFT, because CS takes advantage of the knowledge of the signal structure and its sparsity.
Flexible Spur Rejection via the Overlapping Windowed Integration
In addition to the capability of sensing and reconstructing sparse signals at sub-Nqyuist rate, the PSCS architecture has many special characteristics. For example, the parallel architecture gives a design tradeoff between the sampling rate and the system complexity [11] . In this section, we focus on the PSCS architecture's spurious frequency rejection schemes. Since one critical type of spur in the PSCS architecture is the leakage of the clocks for the PN generators to the integrator, as illustrated in Figure 6 , we will focus on this particular type of spur in this section, although the rejection scheme applies more generally.
Recall that in Figure 2 , the output after the mixer is sent to a sliding window with a width of T c and integrated over T c seconds, and there is an overlap time of T c × OVR between two adjacent windows as illustrated in Figure 3 . The integrator, with a reset every T c seconds, provides a simple realization of a sinc type lowpass filter with nulls at frequencies of f 0 × k, where f 0 = 1/T c . By setting the random generator clock frequency equal to a harmonic of the reset frequency, the sinc nulls coincide with spur frequencies from the random generator clock and so filters them, where the overlapping scheme provides the flexibility on setting the locations of the nulls. In some cases, without the overlapping scheme, the objective of setting the clock frequency on the nulls of the sinc type lowpass filter may conflict with the sampling rate requirement which is determined by the signal's sparsity. In order to show this, consider the following example.
Let the input signal to the PSCS architecture be a 19-sparse frequency-domain multi-carrier signal with 128 subcarriers, that is, S = 128 and K = 19, which corresponds to a sparsity of 15%. The subcarrier spacing is Δ f = 1 GHz/128 = 7.8125 MHz and the symbol duration time is T s = 1/Δ f = 128 nanoseconds. The locations of the K active subcarriers are chosen randomly and changed every T s seconds. According to simulation results, the minimum NSR is 0. There may exist some leakage into the integrators from the clock signal, as illustrated in Figure 6 . According to the CS theory, the clock frequency is usually at the Nyquist frequency f Nq where f Nq = 1 GHz in this example. Because f Nq / f 0 ≈ 3.56, the spurs due to the clock leakages will fall near the 3rd sidelobe's peak of the sinc type lowpass filter and bring distortion to the reconstructed signal. With the overlapping scheme, we can choose T c = 4 ns and f 0 = 250 MHz by introducing an overlapping ratio of 11.43%, then f Nq / f 0 = 4 and the spurs due to the clock leakage can be filtered (considering the clock resolution requirement, an overlapping ratio of 11.25% is suggested in practice.). Based on Figure 3 , this can be mathematically expressed as
According to (14) , given a desired sampling rate, or equivalently speaking, a specific M, varying OVR will change the relative location of the leakage frequency to the filter nulls, as illustrated in Figure 7 .
Note that if we do not want to introduce any overlapping but still wish to null out the clock leakage, the only option in the above example is to increase the sampling rate and make f Nq / f 0 an integer no less than 3.56 . By introducing a nonzero OVR, we can conveniently make f Nq / f 0 an integer without increasing the sampling rate. Figure 8 shows the MSE of the reconstructed signal versus the overlapping ratio when there is some clock leakage into the integrators. Note that in the simulation the amplitude of each subcarrier is set to 1. Since K = 19, the signal's peak amplitude is 19. Allowing a 10 dB margin to account for the multi-carrier signal's large peakto-average ratio, the clock leakage with an amplitude of 0.1 (0.4) is roughly 35 dB (23 dB) below the signal's average power. As shown in Figure 8 , the flexibility of setting the null frequencies by the overlapping scheme can bring about 20 dB gain after filtering the spurs due to the clock leakage.
Note also that the overlap in the integration windows provides wider filter nulls than the sinc filter. Because of the existence of the phase noise on the clock signal in practice, even if we can set the clock on the null frequency, it is inevitable that remains some leakage due to the widening of the spurs spectrum. The wider nulls provides the possibility of further improving the harmonic rejection when the phase noise is significant.
Low-Speed Off-the-Shelf Component Prototype
As a proof of concept, we built a low-speed prototype using off-the-shelf components, where the input signal is a real BPSK modulated multi-carrier signal with 4 active subcarriers and the active subcarriers hop over the frequencies (i * 2 − 1) KHz (i = 1, 2, . . . , 100) every 500 microseconds.
Considering the system complexity, we employed 4 parallel paths for the prototype. Simulation shows that the signal can be reconstructed perfectly when each parallel path produces 16 samples every 500 microseconds, which corresponds to 32% of the Nyquist sampling rate.
Overall Configuration.
The overall configuration of the prototype is shown in Figure 9 , where the digital part is responsible for generating the input sparse signal, the triggering signal, the pseudorandom basis, and the clock. The analog part is used to realize the random basis projection that is essential for the signal reconstruction. The built-in ADC in the oscilloscope is used to collect the sampled data. Then, the collected data is sent to a PC and processed via Matlab code to reconstruct the signal. In the following sections, each building block will be introduced in detail.
Multicarrier Signal
Generator. An Agilent 33120A arbitrary waveform generator is used to generate the input multitone sparse signal. Specifically, the multitone signal is programmed in the PC first and then downloaded into the wave generator. The output port of the generator is triggered by the microcontroller in order to synchronize with the integrator clock that is also generated by the microcontroller. Figure 10 depicts the macromodel of one path in the prototype. As shown, the input signal is first translated into current by the OTA and then mixed with the pseudorandom signal. After mixing, the signal is integrated in the sampling capacitor with a timing window. In the sampling circuit the interleaving capacitor is employed. Finally the ADC yields digital output data. The OTA we employed is a TIOPA861 with Gm of 116mS
Mixers and Integrators.
International Journal of Digital Multimedia Broadcasting
S15
To ADC− Figure 11 : Circuit implementation of one parallel path. Figure 12 : Pin connection of the integrator in one parallel path. and all the switches are implemented with transmission gate CD4066BCN. The pseudorandom number (PN) is −1 or 1, whose spectrum is a sinc function. The main lobe is from 0 to 1/T clk , where T clk is the clock period of the PN generator. In our test bed T clk is 1μs. After the mixing, the signal is shaped by the embedded lowpass filter provided by the integration window. The frequency response of the LPF is a sinc function. The main lobe spans 1/T c , where T c is the integration time. In our test bed the 1/T c is roughly 30 KHz.
The random projection of the input analog signals is realized with mixers and integrators. Figure 11 gives the circuit implementation of one parallel path and Figure 12 gives the corresponding pin connection relationship for the integrator. The transconductance amplifier (G m stage) translates the signal voltage into current, which can be easily mixed with the pseudorandom numbers (1/−1) by the following passive switch mixer. After mixing, the signal is integrated with an overlapping window and then sampled by the ADC in each path. The circuit is built up differentially so that the system is more robust to supply noise, clock jitter, and even-order harmonics. The double balanced passive mixer does not introduce significant noise and distortions.
At each path, the mixer consists of transmission-gate switches controlled by PN sequences. The PN sequence is implemented with a linear feedback shift register (LFSR). In our prototype, the clock frequency is chosen to be 1 MHz, which is higher than the Nyquist sampling rate. Because the PN sequences are repeated every 500 μs and there are 4 parallel paths, we need 4 independent PN sequences with a length of 500. An 11-bit LFSR is used to generate a PN sequence with a length of 2047 and then divided into 4 segments. As a check, the autocorrelation function of the PN sequences is calculated to make sure that the four PN sequences are incoherent.
An overlapped time-interleaving charge-domain sampling integrator is chosen for the analog path. The integrator schematic is shown in Figure 13 , φ 1 and φ 1 are two integration switches for the left and right branches, respectively. φ o1 and φ o2 are readout switches; φ r1 and φ r2 are reset switches. By utilizing these six switches combined with the two integration capacitors C 1 and C 2 , according to the clock diagram shown in Figure 14 , we can realize a conventional time-interleaving charge-domain integrator without overlapping. Time interleaving means when the left branch is integrating while the right is reading out, and vice versa. By doing this, a complete sampling of the signal is achieved. In addition to time interleaving, a small overlapping time is introduced by one more capacitor C ov and two control switches φ 2 and φ 2 .
As shown in Figure 14 , phase1 and phase3 are to realize the overlapping through charge redistribution and sharing, and phase2 and phase4 are the readout times for the right and the left branches, respectively. During phase1, the input current charges both C 1 and C ov while C 2 is idle. Since all capacitors have the same value, the current splits equally by half into both capacitors. In the succeeding phase, C ov is switch-connected to C 2 and readout together, so that C ov is integrating for the right branch during phase1. Equivalently, as shown in the timing window diagram, the window splits by half during the overlapping time. The key point here is that both branches are integrating and no data is readout during window overlapping times.
Note that the overlapping windowing realized using the circuit in Figure 13 is somewhat different from the overlapping windowing in Figure 3 , as shown in Figure 15 . In Figure 3 , the charges accumulated during the current window period include 100% of the charges from the last T ov seconds of the previous windowing period but no charges from the next windowing period. In Figure 13 , the charges accumulated during the current windowing period include 50% of the charges from both the last T ov seconds of the previous windowing period and the first T ov seconds of the next windowing period, which is more realistic from the implementation perspective.
Data Collection and Signal Reconstruction.
For simplicity, we use the inherent ADC of the oscilloscope (Tectronix TDS 3054 500 MHz, 5 Gs/s) to sample the output of the integrators. The sampled data is transferred to the PC via the GBIP port. With the collected samples, the signal is reconstructed as described in Section 2.
Dealing with Circuit Nonidealities.
While implementing the prototype, it is inevitable that the system has some nonidealities such as the delay caused by each component, the gain variation, and the mismatch among parallel paths. Considering all the nonideal factors, the actual relationship between the collected samples y and the coefficients a becomes
where the element at the mN + n row and the s column of V is given by Here, δt 1 and δt 2 reflects the timing error on the slicing window, δ f reflects the frequency offset, α and θ reflects the gain and phase mismatches, and the δΦ n (t) reflects the error of the random basis which could be attributed to the jitter and nonzero response time.
Because the actual relationship between y and a is given by (15) , we need to replace V with V in (1) when estimating a; otherwise, some extra error will be introduced. In [15] , the authors discussed the impact of some circuit imperfections, such as the finite settling time of the PN sequences, and the timing uncertainty, and a background calibration algorithm based on LMS was proposed to compensate for the error due to these circuit nonideal factors. Because of the complexity of the background calibration, here we use a more simple approach based on direct training to deal with the circuit nonidealities. The direct training approach is illustrated in Figure 16 . During the training stage, we inject a single-tone signal one at a time to the prototype and collect the samples from the 4 parallel paths, so that these samples will fill one column of the reconstruction matrix V. After sending 100 single-tone signals, we obtain a complete matrix which will be used for signal reconstruction.
This pilot-based method is based on the assumption that the system is linear and time-invariant. Fortunately, our circuit level design ensures that the input signal swing is within the linear range of the system, and the microcontroller ensures that the system has the same initial condition for every run. Therefore, the linear time-variant assumption is reasonable. Implementing the background calibration for circuit imperfection compensation is part of our future work.
Testing Results.
The testing setup for the prototype is shown in Figure 17 . A series of experiments are done to test the functionality of the system. Table 1 summarizes the testing results, where + and − stand for the polarity of the BPSK modulation. Note that we scale the amplitude of each subcarrier according to the number of tones such that the amplitude of the multi-carrier signal is within the dynamic range of the system. From the testing results, the prototype achieves the design specification.
Conclusions
The Parallel Segmented Compressive Sensing (PSCS) frontend is able to sample and reconstruct analog sparse and compressive signals at sub-Nqyuist rate. The overlapping windowed integration in the PSCS front-end provides a spurious frequency rejection scheme by setting the lowpass filter nulls on the spurious frequencies without sacrificing the sampling rate requirement. A low-speed prototype is built with off-the-shelf components, which is able to sense sparse analog signals at sub-Nyquist rate.
