The use of digital predistortion for linearizing a millimeter-wave power amplifier (PA) is investigated. A PA operating at 38 GHz is designed using an accurate non-quasi-static transistor model, taking into account both short-and long-term memory effects. A realistic test signal is then used for the identification of a nonlinear auto-regressive moving average (NARMA) behavioral model of the PA. The NARMA-based digital predistorter is then derived and formulated in terms of basic predistortion cells, especially suitable for efficient implementation in an FPGA. The performance of the predistortion solution is preliminarily assessed by means of computer simulations. 
I . I N T R O D U C T I O N
Modern telecommunication standards make use of multilevel and/or multicarrier modulation schemes in order to accommodate high information rates in a limited bandwidth and to provide immunity to channel distortion and multipath effects. Spectral efficiency and immunity to channel conditions, however, are achieved at the expense of a high peak-to-average ratio (PAPR) transmitted signal and of a high sensitivity to intermodulation distortion (IMD). IMD is normally generated by nonlinear dynamic effects in the power amplifier (PA) at the end of the transmission chain. Moreover, the modulation process and PA nonlinearity generate out-of-band distortion, which must remain inside the mask specified by the communication standard. If no linearization scheme is adopted at the transmitter, the only way of meeting the in-band and the out-of-band distortion requirements for signals with high PAPRs is to operate the PA at a high level of backoff, which normally results in poor power efficiency. On the contrary, if the proper linearization scheme is used, significant improvements in system performance and costs can be obtained. In this paper, we address the linearization of a millimeter (mm)-wave PA using digital baseband predistortion [1] [2] [3] . Possible applications include current mm-wave high-bandwidth applications such as the bidirectional satellite service at Ka band, IEEE 802.16-SC, LMDS, Point-to-Point Microwave Service and Private Operational Fixed Service (i.e., TV Broadcast, 38.6-40.0 GHz), HAPS, Amateur Radio (licensed), and defense applications. To this aim, and to preliminarily test the validity of the approach, we employed a circuit-level model of a PA operating at 38 GHz, which is based on a simple yet accurate non-quasi-static model of a GaAs pHEMT device. Computer simulations were then performed in order to identify a nonlinear auto-regressive moving average (NARMA) behavioral model for the PA from which the predistorter was derived. The predistortion function, also of NARMA type, was expressed as a combination of basic predistortion cells (BPCs) in order to be efficiently implemented in an FPGA, which is normally already present in most digital transmitters. The performance of the proposed solution was then evaluated for different single-and multi-carrier modulation schemes using computer simulations.
The rest of the paper is organized as follows: section II presents a review of linearization techniques, section III describes the device model and the main considerations made in the design of the mm-wave PA chosen for linearization, section IV presents the NARMA behavioral model used to represent the PA in the predistortion solution, section V describes the predistortion solution, and section VI presents validation results obtained from computer simulations with different modulated test signals.
I I . P A L I N E A R I Z A T I O N
A number of linearization solutions have been proposed in the literature [4] . However, an optimal linearization technique does not exist and so the choice of the linearization scheme is strongly dependent on the application and on a number of factors such as the linearity requirements, the complexity added by the linearizer (and the trade-off between accuracy and complexity), the signal bandwidth, the technology used (e.g., analog or digital), the power efficiency requirements, the robustness against system changes, etc. Feedback-type linearizers (Envelope, Cartesian, Polar) have been extensively used in the past and are still used in specific applications (e.g., circuit level linearization). However, stability problems make them unsuitable for current high-bandwidth applications. In addition, feedback linearizers reduce overall system gain, which can affect system efficiency. On the other hand, they are less sensitive to disturbances or changes in the transmitter subsystem and therefore good IMD cancellation can be achieved for low or moderate signal bandwidths. Different from feedback linearizers, Envelope Elimination and Restoration and the outphasing techniques (LINC, CALLUM) have revived thanks to the use of current high-speed digital signal processors (DSPs), which have permitted one to overcome certain drawbacks associated with analog circuitry. One of the biggest challenges of these techniques is with respect to power efficiency, since power losses associated with the combination of the signal components can in some cases outweigh the benefits of employing highly efficient PAs. On the other hand, the processing speed of current DSPs limits the linearization bandwidth, although this may change in the future as processing speed improves. Feedforward together with digital predistortion linearization are the most suitable linearization techniques to cope with wideband signals. Feedforward linearization is unconditionally stable, can achieve effective cancelation of IMD, and is immune to PA memory effects. However, its open-loop nature makes it sensitive to possible unbalances in cancelation loops, effectively degrading the overall linearity performance whenever imperfect cancelation occurs. This can be compensated by means of a proper loop control, but it requires additional circuitry. Moreover, the error amplifier and the losses associated with the output combiner strongly impact on the IMD cancelation performance and on overall power efficiency. Analog predistortion is a simpler solution with respect to digital predistortion, but although it can cope with high bandwidths, it only gives moderate IMD suppression. Moreover, analog solutions operating at RF are very sensitive to the electrical lengths in the circuit, which become critical for mm-wave applications. A similar dependence on implementation maladjustments appears in LINC transmitters, due to the need for a good equilibrium between the PA in the two branches. Digital predistortion performed at baseband can achieve better correction capabilities (it is independent of the final carrier frequency of operation) than analog solutions and can be adaptive, thus allowing for corrections of possible unexpected unbalances. The main drawbacks are related to power consumption associated to the DSP device and bandwidth limitations due to the speed of the DSP. Since DSPs are currently present in most communications transmitters for codification and digital modulation purposes, digital baseband predistortion can be a good linearization solution for a high-bandwidth mm-wave application such as the one we are considering, especially when memory effects in the PA are of significant importance in IMD cancelation performance.
I I I . P A D E S I G N
To avoid the uncertainties associated with high-frequency prototyping and measurements, and thus to be able to concentrate on the performance of the PA behavioral model and predistortion scheme, it was decided to preliminarily validate the linearization approach using an accurate circuit-level description of the PA. Accordingly, the main goal in the design of the PA was to obtain a realistic mm-wave PA which was, at the same time, computationally efficient even with complex modulated signals.
This section describes the main characteristics of the PA and the considerations made during the design. The first part presents the non-quasi-static device model that was employed, whereas the second part illustrates the main features of the PA.
A) Non-quasi-static device modeling
To obtain a realistic circuit-level model of a high-frequency PA, an accurate device model is needed that is capable of accounting for short-memory (i.e., non-quasi-static) effects occurring at high frequencies [5] . Moreover, for non-constant envelope applications, low-frequency dispersive effects need to be accounted for. Besides accuracy in predicting the device behavior, the numerical efficiency of the model is also important, as it will be used with complex modulated signals and simulated in the mixed, time-frequency, envelope domain. This section briefly describes the nonquasi-static equivalent-voltage model (NQS-EVM) [5] of a FET device, which was used to account for nonlinear short memory effects at the intrinsic active device of the PA. Long-term memory effects were accounted for by the empirical model described in [6] .
The NQS-EVM model is an empirical electron device model featuring a reliable identification procedure and a simple CAD implementation. It also provides good prediction capabilities up to very high frequencies under mildly nonlinear operation. Although the model is technology independent and can be implemented using either look-up tables (LUTs) or equivalent circuits, only the LUT-based version for FET devices will be considered in the following.
The basic idea of the model is to use an associated QS device controlled by suitably defined equivalent voltages: in such a way, NQS effects arising at high frequencies are modeled by means of voltage perturbations imposed by the equivalent voltage sources.
In QS conditions, the behavior of an intrinsic FET device can be simply described by the well-known voltage-controlled equations:
where i and v are the vectors of the port currents and voltages, respectively, and F and C are purely algebraic functions. At high frequencies, NQS phenomena, which are ignored in (1), become important. We define suitable equivalent voltages
in order to provide the proper NQS currents in a virtually QS device described by (1) when the actual controlling voltages are replaced with the equivalent voltages:
In particular, the equivalent voltage domain can be obtained according to (1) by means of the voltage perturbations Dv d and Dv c , which take into account all the fast dynamics of the NQS phenomena.
General considerations and empirical evidence described in [5] suggest that a linear dependence can be assumed for Dv d and Dv c , that the two voltage perturbations are not mutually independent, and that in the case of a strictly intrinsic GaAs pHEMT the voltage perturbations appear only in series to the gate. In such a way, just two voltage corrections, linearly voltage dependent, account for all NQS effects, according to Fig. 1 .
Provided that the duration of memory effects is short with respect to the period of the operating signals, a "modified" dynamic Volterra-series approach [7] can be adopted to model the voltage perturbations, which, by discretizing the memory time in N D time slots Dt, can be written by means of linear discrete convolutions:
The dependence on the dynamic voltage deviations v(t 2 t) 2 v(t) allows for a vanishing contribution of (5) and (6) when the signal frequencies involved are not high enough for NQS effects to manifest.
B) Model identification and validation
The first part of model identification involved the derivation of the "strictly intrinsic" device measurements. These were obtained by de-embedding the measured S parameters at the extrinsic ports both from the extrinsic parasitic elements and from all the linear capacitances in parallel to the intrinsic device. Parasitic elements were, in this case, identified through standard cold-FET procedures in pinched-off conditions, and also was the purely capacitive parallel network. The strictly intrinsic measurements were then used in order to identify the quasi-static device and the non-quasi-static voltage corrections.
The chosen device was a 0.25 mm GaAs pHEMT having 10 fingers and 600 mm of total gate periphery. In this case, a LUT-based definition was chosen for the associated quasi-static device both for the conductive currents and for the displacement currents. The conductive currents include a low-frequency dispersive term that accounts for self-heating and surface trap level states, and is modeled using an empirical approach proposed in [6] . The quasi-static capacitive currents are modeled through nonlinear LUT-based capacitors that are identified from the imaginary part of the strictly intrinsic admittance parameters, in a frequency range where reactive effects are evident, but non-quasi-static phenomena are still negligible.
The second part of model identification concerns the identification of parameters a 1p , a 2p , and r, defining the non-quasi-static voltage corrections. This was achieved by resolving a simple overdetermined system of equations based on
which gives the strictly intrinsic bias-dependent admittance parameters in the frequency domain and which is derived by inspection of Fig. 1 and simple algebraic manipulations. For the device considered, a total memory length of 1.8 ps divided into three elementary delays gave good results. Equation 7 was evaluated over a reduced bias grid (V GS : 21.2 to 20.2 V; V DS : 2-10 V), so as to have maximum accuracy over the region defined by the dynamic load curve of the device in large-signal operation of the PA. Moreover, in order to properly identify the non-quasi-static corrections, (7) was evaluated in a frequency range covering both quasi-static and non-quasi-static phenomena (3.5 GHz , f , 76 GHz). Figure 2 shows some small-signal validation results that refer to the intrinsic admittance parameters Y 11 and Y 21 up to twice the fundamental frequency of the PA (76 GHz) with the device biased at the nominal quiescent point of the PA (V GS : 20.55 V; V DS : 6.5 V). The figures show the measured and modeled parameters, as well as the results that would have been obtained if a conventional quasi-static model had been employed. As can be seen from the figures, for this bias condition where the device is very conductive, the behavior at high frequencies is far from quasi-static and is accurately accounted for by the model thanks to the voltagecontrolled dynamic deviation generators. Similar results are obtained for Y 12 and Y 22 , which however are less affected by non-quasi-static phenomena. On the contrary, for bias conditions where the device channel is pinched-off, an almost quasi-static behavior is observed (not shown here due to lack of space) and the model is still able to predict it owing to the fact that it was identified at the strictly intrinsic ports. Figure 3 , instead, shows, for the fundamental frequency of the PA (38 GHz), the measured and modeled admittance parameters as a function of the bias voltages in the region where the model was identified. As can be seen from the figures, even if bias-independent coefficients were assumed for the dynamic deviation generators, the model is able to predict the measured parameters inside a relatively large area of operation for the device including off-as well as on-state conditions. behavioral modeling and linearization of a millimeter-wave power amplifier
C) Model implementation
After the identification procedure, the model was implemented in an Agilent ADS CAD environment, as shown in Fig. 4 for the intrinsic device. The conductive currents, including low-frequency dispersive effects due to traps and self-heating, are modeled using LUTs and the empirical low-frequency model described in [6] .
The displacement currents of the quasi-static device are modeled with nonlinear LUT-based capacitors and implemented using a symbolically defined device. The purely dynamic voltage corrections applied to both conduction and displacement parts of the quasi-static device are implemented through standard linear VCVSs, which depend on intrinsic port voltages and on model parameters a 1p , a 2p , r, Dt, and N D , which were previously identified. A linear parallel capacitive network separates the strictly intrinsic from the intrinsic device ports, and the standard lumped parasitic network depicted in Fig. 5 is embedded in order to obtain the extrinsic device ports.
D) Power Amplifier
This section describes the design of an integrated mm-wave PA using the non-quasi-static device model identified and implemented in the previous sections. In keeping with the objective of obtaining an efficient yet realistic circuit-level model of the amplifier, a single-stage single-cell PA was considered, which can represent the typical output stage of a real mm-wave PA. Due to linearity requirements, class-AB operation was selected, and the device was biased at 35% of IDSS. The load and source impedance were optimized taking into account IMD and gain, using a two-tone sourceand load-pull measurement setup and were also validated with simulations using the non-quasi-static device model [5] . Matching networks were synthesized using distributed networks, as would be the case in any integrated PA operating at such a frequency. Figure 6 shows a schematic of the designed PA and Table 1 reports its main performances.
High-frequency PAs typically exhibit both short-and longterm memory effects due to electrical and thermal phenomena [8] , which generally make their linearization more difficult. Any realistic PA model, therefore, needs to account for these effects as they will have an impact in the design and performance of the predistorter. In the application under consideration, however, and taking into account that the signal bandwidth even for very high bit rate communication standards is in the order of tens of MHz, and that therefore the relative bandwidth is small with respect to an mm-wave carrier, only long-term 1 memory effects become important. 1 Long with respect to the period of the carrier but not necessarily with respect to the period of the smallest frequency of the baseband envelope.
2 Short-term memory effects in the device must still be accurately modelled in order to predict PA performances related to the high-frequency carrier such as gain, output power, etc. This is so because the response of the matching networks and the device are substantially flat over such a small relative bandwidth and therefore they do not exhibit any dynamic effects. Long-term memory effects, instead, are generally associated with thermal phenomena, surface charge traps in the active devices, and low-frequency dispersive bias circuits. The first two were modeled by the low-frequency part of the NQS model whereas bias networks effects were accounted for by careful design of the bias circuit. In fact, if ideal bias networks are considered, long-term memory effects are hardly noticeable, and, as shown in Fig. 7 , the two-tone IM3 term is almost independent of tone spacing (for tone spacings up to 20 MHz). The same figure also shows that when a more realistic bias system is considered, not only the IM3 terms depend on tone spacing but also some asymmetry between the left and right IM3 terms appear, as is typically observed in real PAs. This variation dependence over such a narrow frequency range is a symptom of long-term memory effects. 4 . CAD implementation of the non-quasi-static device model at the intrinsic ports including displacement (top) and conductive low-frequency-dispersive currents (bottom). Non-quasi-static effects are accounted for by dynamic-deviation-based VCVSs. A purely capacitive linear network separates the "strictly intrinsic" ports from the intrinsic ones.
I V . B E H A V I O R A L M O D E L I N G
The PA is a nonlinear dynamic system whose memory effects can be of significant importance when wideband signals are considered. The digital predistortion (DPD) linearization technique can compensate for both nonlinear and dynamic PA unwanted effects. The DPD technique requires nonlinear dynamic behavioral models capable of reproducing the PA nonlinear behavior as well as its memory effects. In the last few years, a significant number of behavioral models for PAs have been proposed, as discussed in [9] . However, to be suitable for DPD purposes behavioral models not only need to be accurate enough to mimic PA behavior but also have to be invertible and computationally efficient.
For our particular application, a NARMA model, described in [10] , was chosen as a good compromise between accuracy and computational efficiency. The NARMA model can be seen as an extension of a simpler nonlinear moving average (NMA) model that incorporates a nonlinear auto-regressive block. The advantage of using a NARMA model is the introduction of a nonlinear feedback path (infinite impulse response -IIR -terms) that may allow one to relax the number of delayed samples considered to model the PA, in comparison with a model using only finite impulse response (FIR) terms.
One of the main weaknesses of the NARMA model regards its stability. The use of nonlinear feedback paths can derive in the overall system instability. Fortunately, the stability of a NARMA model can be studied and guaranteed using a technique based on the small-signal gain theory, which is advanced in [10] . Figure 8 depicts the block diagram that describes the NARMA model.
The general input-output mathematical expression in a NARMA model can be described aŝ
with f i ( . ) and g j ( . ) being nonlinear memoryless functions, and where t i (t 0 ¼ 0) and t j (t , N) are the most significant sparse delays of the input and output, respectively, contributing to the description of the PA memory effects. Further details on how to obtain the best sparse delays to characterize the PA dynamics can be found in [11, 12] . By considering memoryless nonlinear functions f i ( . ) and g j ( . ) characterized by polynomials of order P, 
Parameter Value
Small-signal gain 6.9 dB Bandwidth (22 dB) 37.5 GHz-38.5 GHz Output power (I/C: 238 dBc) 14.5 dBm Gain (I/C: 230 dBc) 6.5 dB Output power (1 dBc) 18.0 dBm Fig. 7 . PA two-tone IMD tests for different input powers and bias systems. Fig. 8 . NARMA model structure.
: (12) In order to extract the vector of parameters d, we performed a least-squares (LS) identification from input-output (x PA , y PA ) observations of the PA. The identification error is defined as
and the LS solution for d is thereforê
with
T being the vector of output observations of length L.
The normalized mean square error achieved using the NARMA model, just considering one delayed sample of the input and another delayed sample of the output in the NARMA structure (small influence of memory effects), was about 243 dB. Figure 9 presents a block diagram of the digital baseband predistortion scheme that was adopted. To derive the predistortion function from the original NARMA behavioral model, it is necessary to rewrite the equations to obtain x A (k) in terms of the desired linear output y D (k). A detailed explanation on this procedure can be found in [13] .
V . D I G I T A L P R E D I S T O R T I O N L I N E A R I Z A T I O N
To implement the DPD in an FPGA, it is necessary to express the predistortion function as a combination of fundamental building blocks called BPCs. A BPC requires simple hardware blocks: a complex multiplier, a dual port RAM memory block acting as a LUT, an address calculator, and two ports of control. Further details on the implementation in an FPGA device can be found in [14] . The structure of the predistortion function, in terms of BPCs, is depicted in Fig. 10 .
The predistortion function can be expressed in terms of combinations of BPCs as
where z(k) is defined as
where G LUT (for both ĝ j and f i ) are complex gains stored in their corresponding LUT, x A (k) is the output of the DPD, and y D (k) is the desired output defined as the signal to be transmitted (x T (k)) multiplied by a linear amplification (G linear ),
Then, the adaptive process followed by the digital adaptive DPD in order to perform the PA linearization consists in the following four steps: (1) to identify new nonlinear functions f i and ĝ j by monitoring current PA input and output samples, (2) to test the stability of both PA and DPD NARMA models using the method proposed in [10] , (3) to invert the nonlinear function f 0 to find the DPD output, and (4) to generate all necessary LUT contents to implement the predistorter function; and then back to step 1. The identification of the PA behavioral model is performed obtaining the LS solution presented in (14) . For this particular implementation, the LS takes complex data buffers of 2048 samples and a forgetting factor is adopted in order to prevent abrupt changes in the parameters that define the PA model.
In the case of the designed PA, good results were obtained with a predistorter function having only three BPCs to compensate for the nonlinear distortion and memory effects: in particular: the static BPC (associated to 
V I . V A L I D A T I O N R E S U L T S
In a first approach toward a final implementation, it is necessary to analyze the linearization capabilities shown by this NARMA-based DPD. The DPD linearizer has to be capable of compensating both in-band and out-of-band nonlinear distortion. To validate the functioning of the adaptive predistorter, an FPGA DPD simulator has been implemented in Matlab. This simulator emulates the DPD function as it is performed in an FPGA, that is, by means of a set of BPCs. Moreover, a NARMA behavioral model has been used to reproduce the nonlinear dynamics of the designed PA operating at 38 GHz. Both single carrier (SC) and OFDM 16-QAM test signals (roll-off ¼ 0.22) presenting PAPRs around 9-10 dB have been considered.
To ensure a fair comparison, it is necessary to take into account unlinearized backed-off signals presenting similar mean output power to the linearized output signal. Figure 11 shows the AM-AM characteristics for: unlinearized PA, unlinearized PA with backoff operation, inverse PA characteristic, and the linearized PA. It is possible to observe that the DPD is capable of counteracting nonlinear distortion and get into compression in a linear way. Moreover, Fig. 12 and Fig. 14 shows the in-band distortion compensation achieved by the DPD, measured in terms of EVM. The 16-QAM constellation of the backed-off signal presents an EVM of around 11% (SC) and 37% (OFDM), respectively, while thanks to predistortion linearization it can be reduced up to less than 1%. Finally, Figs 13 and 15 show the spectral regrowth reduction (out-of-band distortion compensation) of 6 dB achieved by the DPD in comparison to an unlinearized backed-off signal for both SC and OFDM configurations (Figs 13, 15) .
V I I . C O N C L U S I O N
This paper has addressed the linearization of an mm-wave PA by means of an adaptive digital predistorter operating at baseband. This DPD is based in a NARMA structure that has been used for both characterizing and compensating PA nonlinearities and memory effects. Simulation results have evidenced that by means of the adaptive DPD it is possible to counteract the PA nonlinear behavior and thus to compensate both in-band and out-of-band distortion.
A C K N O W L E D G E M E N T
This work was partly supported by the former EU network TARGET (IST-1-507893-NOE), by the Italian Ministry of University and Research (MIUR), and by the Spanish Government under project TEC2005-07985-C03-02. behavioral modeling and linearization of a millimeter-wave power amplifier
