Abstract-A new nonlinear charge-conservative scalable dynamic electro-thermal compact model for laterally defused MOS (LDMOS) RF power transistors is described in this paper. The transistor is characterized using pulsed I-V and S-parameter measurements, to ensure isothermal conditions. A new extrinsic network and extrinsic parameter-extraction methodology is developed for high-power RF LDMOS transistor modeling, using manifold deembedding by electromagnetic simulation, and optimization of the extrinsic network parameter values over a broad frequency range. The intrinsic model comprises controlled charge and current sources that have been implemented using artificial neural networks, designed to permit accurate extrapolation of the transistor's performance outside of the measured data domain. A thermal sub-circuit is coupled to the nonlinear model. Large-signal validation of this new model shows a very good agreement with measurements at 2.14 GHz.
I. INTRODUCTION

L
ATERALLY diffused MOS (LDMOS) field-effect transistors (FETs) are used almost exclusively for high power transistors for wireless infrastructure or base-station applications. They provide an unmatchable combination of performance and cost, and are capable of delivering hundreds of watts of RF power.
This combination of high powers and high frequencies in RF and microwave power amplifiers brings together a unique set of challenges for the device modeling engineer. The power transistors themselves are physically large, and may be a significant fraction of a wavelength wide, even at microwave frequencies. The electrical behavior in this distributed environment must be captured in the model. The device will generate a lot of heat, and the thermal effects on the transistor's electrical behavior will also need to be characterized and included in the model. Further, power amplifiers for wireless communications systems are tightly specified in terms of their linearity performance, bandwidth, etc., placing a premium on the availability of accurate nonlinear transistor models.
In this paper, we shall outline a new nonlinear scalable model for the LDMOS transistor, and its extraction and implementation. The transistor model architecture is shown in Fig. 1 . The intrinsic part of the model, shown in the center of this figure, describes the nonlinear behavior of the transistor, using voltagecontrolled current and charge sources: the model state functions or constitutive relations. This basic structure is no different in principle to those nonlinear models presented by [1] - [3] . A charge-conservative approach is adopted, as this is crucial to the accurate prediction of the low-level nonlinearities such as those described by intermodulation (IM) products, adjacent channel power, and so forth [4] . Included in this model are accurate continuously differentiable functions for the model currents and charges, and a self-consistent electro-thermal model coupled to these controlled sources.
These model functions are designed to compute, rapidly and accurately, values of the constitutive relations anywhere within the measured characterization plane. It is also necessary for these functions to be able to handle evaluations of the state functions outside of the measured data range; in other words, these functions must have predictable and controlled extrapolation properties. The reason for this is that in circuit simulation, as the simulation iterates towards the solution, the harmonic balance simulator (for example) may at some point require a solution for input voltages that lie outside the measurement space. Poor extrapolation behavior can lead to nonconvergence even if the final solution exists within the characterized region [5] . Here we also present a new formulation for approximating the state 0018-9480/$25.00 © 2009 IEEE functions for a measurement-based model that permits well-behaved and accurate extrapolation. The methodology is extended to include the thermal dependency of the state functions.
The extrinsic network describes the electrical behavior of the transistor layout; it accounts for the metallizations, substrate, and semiconductor that connect the intrinsic model to the outside world. How the electrical behavior of this network changes with device size determines how the extrinsic network component values scale. Scaling is a significant consideration: the RF power transistor may have a total gatewidth of over 100 mm: this is impossible to model in a traditional small-signal S-parameter environment-the RF powers are too high, and the device impedances are too small to permit measurements of sufficient accuracy for model extraction, if indeed the device is stable in a 50-environment. Instead, we measure and extract a model for a smaller device structure, and scale this model to the appropriate size.
II. TRANSISTOR CHARACTERIZATION
The transistor characterization is carried out on-wafer. The test transistor has the same basic structure as the production device, but is much smaller: it has fewer gate fingers. The RF calibration to the ground-signal-ground (GSG) probe tips is carried out using an impedance standard substrate. The current-voltage and S-parameter measurements are all made under pulsed conditions. A low pulse duty cycle is used to ensure that the data are captured isothermally. Using pulsed conditions also enables us to measure beyond the dc dissipation limit of the device, into realistic instantaneous voltage and current regimes. A dense pattern of the pulsed I-V and S-parameter measurements is made over the gate-drain voltage space of the transistor, bounded by the maximum drain current, breakdown voltage, and the maximum allowable power dissipation. Fig. 2 shows the metallization in a typical test structure used for transistor model development. The problem is segmented into two areas of analysis viz the GSG launches and the manifolds. The goal here is to remove the effects of the launch and manifolds, leaving only the response of the transistor including extrinsic elements. The electrical behavior of the GSG launches is determined from measurements of transmission line structures with care taken over the placement of the calibration reference plane. The GSG launches can then be modeled by error boxes placed at the measurement reference planes. The effects of the manifolds can then be determined, and the GSG measurements are then deembedded to the reference plane shown in Fig. 2 .
III. MANIFOLD MODELING AND DEEMBEDDING
The behavior of the manifolds in turn is obtained through electromagnetic simulation, using Sonnet's em. An accurate substrate definition is key to the manifold extraction. The substrate definition is found from careful analysis of measured S-parameters for transmission lines of various lengths and widths. By converting these S-parameters to the telegrapher's parameters for each line, the properties of each layer in the substrate and metal stack can be determined, and the appropriate metal model chosen. This is particularly challenging in LDMOS because the silicon substrate is quite lossy.
The substrate definition is then used in the simulation of S-parameters for each of the manifold structures. For the results presented here, the wafer used for transistor modeling also included structures for determination of the simulation substrate definition.
To validate the procedure, measured S-parameters of a loaded manifold were compared to a circuit simulator result for simulated manifolds plus measured loads. Large manifold two-port GSG structures were built, both with capacitive loads and unloaded. A typical structure is illustrated in Fig. 3 . The loads were shunt capacitors placed periodically along the manifold width with capacitance matching the effective capacitance of a typical LDMOS device under bias. S-parameters of a single such capacitor were also measured directly. Electromagnetic simulations of the manifolds were combined with the measured load S-parameters in a circuit simulator.
The two-port measurements of the loaded manifold structure, illustrated in Fig. 3 , are in good agreement with simulations. The transmission through the loaded manifold structure is plotted against measured results in Fig. 4 . Although not plotted, the remaining S-parameters are in similar agreement.
IV. EXTRINSIC NETWORK PARAMETER EXTRACTION
The traditional extrinsic network for a MESFET comprises a series resistor-inductor network connected to each gate and drain, and source nodes of the intrinsic model-the Z-shell, followed by shunt capacitors from gate and drain to ground, and between gate and drain-the Y-shell [6] .
For LDMOS transistors, we have augmented this simple network with a resistor in parallel with the series gate and drain inductance, to account for the frequency-dependent behavior of the silicon substrate material in an empirical manner. Adding these shunt resistors enables an improved broadband fit of the return losses. Further, we have observed coupling effects between the gate and drain metallizations in these power transistors. The unit gatewidths in these power transistors are quite significant, over 500 m, and there is measurable mutual inductance between these long metal traces.
We use the cold-FET method [6] - [8] as the basis for our extrinsic parameter extraction, using optimization over a broad frequency range. Our methodology is outlined below, and described in detail in [9] . Broadband S-parameters were measured under bias conditions of 0 V on gate and drain terminals of the transistor: this ensures the LDMOS transistor is biased below threshold, and is in the passive or "cold" condition. The manifolds were then deembedded from these data by using their S-parameter blocks determined as described in Section III. The cold-FET circuit for the intrinsic transistor is assumed to be a purely capacitive network. The extrinsic network and cold-FET circuit are illustrated in schematic form in Fig. 5 .
The extrinsic network parameter values were extracted from this deembedded data using the following sequence of actions: first, the extrinsic network components were deembedded to obtain intrinsic S-parameters; next, the cold-FET intrinsic parameter values were calculated from these intrinsic S-parameters; then this intrinsic model was re-embedded with the extrinsic network; and finally, these model S-, -, and -parameters were compared to the measured values (after deembedding of the manifolds). The extrinsic network parameters were adjusted until the measured and model data were suitably close over the broadband frequency range. We also ensured that the resulting small-signal intrinsic component parameters were physically realistic, e.g., the model resistances were nonnegative. This procedure was performed in Agilent-EEsof ADS, using the sequencer component and the tuning simulator. As a starting point, either nominal or zero values for the extrinsic network components can be used with equal success, indicating the robustness of the technique. The broadband S-parameters for the small-signal model and extrinsic network are compared to the measured and deembedded S-parameters in Fig. 6 , showing excellent agreement.
V. NONLINEAR INTRINSIC MODEL
The manifold structure and the extrinsic network are deembedded to obtain S-parameter data at the intrinsic model reference planes [9] . After converting to -parameters, the LDMOS transistor model current and charge state functions can then be obtained by integration of the small-signal voltage-dependent parameters using the following equations [3] : The integrations are carried out starting from the quiescent bias point, denoted by . Typically for LDMOS power transistors, this corresponds to class AB quiescent bias conditions V and mA/mm corresponding to V. For LDMOS, the gate current source is also set to zero.
A. Drain Current Model
The drain current expression given by (1) is a "high-frequency" drain current, and using this for the model at RF overcomes some of the dispersion issues that are present in some FET technologies, such as, for example, trap-related dispersion. Even though LDMOS does not suffer from significant trap-induced dispersion, we use the high-frequency drain current to derive the model so that the model generation process is a generic one that can be applied to different FET technologies.
In our nonlinear model implementation, we can choose from a number of possible drain current functions, all of which are analytical expressions that result in a smooth and differentiable relationship, enabling accurate prediction of high order nonlinearities, and better interpolation, or, more strictly, function approximation, between the measured data points.
One example of a drain current function approximation that we can use is the analytical expression due to Fager et al. [10] to fit the high-frequency drain current data [11] . This expression has an improved fit, compared with earlier FET models, to the drain current data in the near-threshold region, which is typically where the LDMOS power transistor is biased for power amplifier applications.
The focus of this study is the use of artificial neural networks (ANNs) to approximate the drain current data. This approach shows similar levels of accuracy as methods of function approximation using elementary functions with fitting parameters [3] . The function approximation using ANNs is described in more detail in Section VI-A in the context of controlled extrapolation. In addition to this ANN model, which is used to fit the data in the active on-state and threshold regions, we add functions to mimic the drain current behavior in the on-and off-state breakdown regions of the FET. These breakdown currents are modeled using diode-like expressions, as used in the Motorola electro-thermal (MET) model [12] . It is essential to include these effects in the model for large-signal applications.
B. Charge Model
One of the main objectives of this new nonlinear model development was to improve the charge model description over that of the existing MET model. The integral formulation for determining the charges described by (2) and (3) ensures a conservative charge formulation, which is essential for accurate prediction of low-level phase nonlinearity, and for convergence in time-domain simulations [3] , [13] . A charge-conservative model formulation, approximated by a smooth and differentiable function, and capable of including temperature dependence, was seen as an essential goal for this model.
The conservative gate and drain charges and of (2) and (3) were obtained through a Root model extraction from the measured intrinsic data using the commercial Agilent-EEsof IC-CAP function. In our model extraction, this produces tables of the charge data indexed by the intrinsic gate and drain voltages. These data are then approximated using ANNs, resulting in smooth and infinitely differentiable 2-D charge functions that are used directly in the model in Fig. 1 . It is also possible to obtain the neural network charge functions by fitting an adjoint neural net to the small-signal capacitances, avoiding the direct integration step in (2) and (3) [14] .
VI. EXTRAPOLATION CONSIDERATIONS FOR LARGE-SIGNAL DEVICE MODELING
Extrapolation is a major concern for high-power RF transistor modeling. The high current densities associated with power transistors, and the limited power and current capabilities of the measurement instrumentation often mean that the region of the device output characteristics that can be accessed and measured is much smaller than the region that is covered in RF operation of the transistor. This is illustrated in Fig. 7 , which shows a dynamic load line under RF drive superimposed on the measured dc I-V characteristics. While the dynamic loadline includes displacement currents, it can be seen from this illustration that the load-line trajectory can have a significant excursion into the unmeasured region. Hence, the extrapolation characteristics of the model state functions needs to be well behaved and accurate.
Extrapolation is the ability of the function approximation to return a value for input data that is outside the original training or characterization region. Generally, this statement is taken to mean that the extrapolated value returned by the function approximation is sufficiently close to the actual or true value of the data outside the training region. While this is one requirement of successful extrapolation, there is a further consideration for the simulation of large-signal models. The circuit simulator uses an optimization algorithm to determine the circuit voltages and currents, and for successful convergence there must be no local minima in the solution domain. The model nonlinear functions must, therefore, be extrapolated in such a way as to avoid these nonphysical local minima: the extrapolation of the function outside the characterization data set should be able to redirect the simulator into the characterization domain.
In Fig. 8 , we plot the measured characterization data, region I, and the regions of extrapolation corresponding to accurate function approximation, denoted as region II, and to redirection of the function, denoted as region III. In region I, the characterized region, the model used must approximate the measured data accurately, to within some acceptable level of error. In region II, accurate extrapolation is required since the load line of the transistor may pass through this region. In region III, the breakdown region, provided that the form of the breakdown currents is correctly modeled, the main emphasis is to ensure robust convergence of the simulator; fidelity to the actual drain current is less important.
A. ANNs for Function Approximation and Extrapolation
The universal approximation theorem was a key factor in establishing ANNs as a useful approximation method [15] . This theorem states that a network with a single hidden layer of neurons is capable of approximating any given function, with any degree of accuracy, provided that there are sufficient neurons, and that their activation functions are nonconstant, bounded, and monotonically increasing [16] , [17] . A single hidden-layer neural network function is expressed as (4) where is the data that is to be approximated, is the input data, and are the adjustable parameters of the function, is the number of neurons in the single hidden layer, and is the number of inputs-in our case, this would be two: the gate and drain voltage values. The nonlinear activation function of the neuron is typically a hyperbolic tangent function, and this is used in our model.
A known problem with a single hidden layer neural network is that the neurons tend to interact with each other globally: modifying the weights to make the approximation better at one point in the data will degrade it at another: the "water-bed" effect. This global interaction can be overcome through the use of two hidden layers. Neurons in the first layer tend to partition the space into sub-domains and approximate the function locally. The second layer fits more global features. That is, the outputs of the first layer tend to operate only on localized areas of the function range and output zero elsewhere. The second layer combines these localized regions [16] , [18] .
We use a two-hidden-layer neural network for good approximation and extrapolation of the state function data. Further, a number of other techniques are used in the training of the neural network to improve the generality of the function approximation. One technique is that of regularization, where the performance function of the approximation is modified from a simple mean square error estimation of the function approximation. Often this performance function is modified to include the mean of the sum of the squares of the weights and biases of the network. Bayesian regularization assumes that the weights and biases are random variables with given distributions, and the regularization is performed using statistical techniques. Another technique is that of "early stopping" and cross-validation, wherein the training data set is divided into several different subsets that can be used for training and validation of the neural network function approximation. The network is trained on one set of data, and the function approximation checked on another set of data: once the error in the approximation of the validation data set begins to increase, the training is terminated [19] .
We use The Mathworks' MATLAB Neural Network Toolbox as our basic tool for training the ANNs. We normalize the input and output data to ensure that the nonlinear neural network functions operate over their optimal range, and to prevent ill conditioning of the optimization process. The training also employs a combination of the above methods to obtain high accuracy of approximation, and good generalization of the neural network. Using these techniques, the network output can represent the data accurately within the training region, and be well behaved outside of the training region, as will be demonstrated later.
We have implemented a generic drain current expression using a combination of a neural network and a breakdown model in the following manner: (5) where is the temperature, and is a neural network capable of an accurate approximation of the data over the measured characterization region. The breakdown model for is of the following general exponential form: (6) where is a constant and is a temperature-dependent breakdown voltage parameter. In fact, the actual breakdown model has more parameters than this to ensure good global fit; details of this breakdown model can be found in the documentation for the MET model [12] .
This drain current formulation is very convenient to implement within a circuit simulator and the derivatives of can be readily computed as it is the product of two differentiable functions. The parameters of the breakdown model are set so that the function has no influence on the neural network for the drain current within the characterized region and minimal influence within the extrapolation region. Extrapolation is governed the two-layer feed-forward neural network and the breakdown model.
The extrapolation characteristics of the two-layer neural network are demonstrated using a set of drain current data obtained from measurement to train the network. The measurement system was limited to points with less than 15-W dissipation: this defines the boundary between regions I and II in Fig. 8 . The neural network was trained only on data from this region. The neural network is then used to predict the drain current in regions II and III, and the results are plotted in Fig. 9 , where it can be seen that the neural network predicts the drain current accurately within the training data range, and is well behaved in the extrapolation regions. The drain current, plotted as a surface in Fig. 10 , shows a smooth surface without local minima, and again, the extrapolation is well behaved. Typically, we find that using two hidden layers, each containing from 5 to 15 neurons, are sufficient for this level of accuracy and extrapolation performance.
VII. MODEL IMPLEMENTATION AND VERIFICATION
The model has been implemented in ADS using the Agilent-EEsof MINT modeling interface. A modular approach has been adopted allowing us to choose from and compare a variety of model implementations for the current and charge models: table-based models, MET expressions, neural networks, and the new model descriptions. This new model has been compared against the measured data, and existing MET and Root models extracted from the same data.
A 4.8-mm gate periphery LDMOS transistor was characterized using pulsed I-V and S-parameter measurements, and these data were used to train the neural networks used to model this device. The measurement region was defined by several factors: a maximum permissible power dissipation for the transistor, a maximum permissible drain current, and a gate-drain breakdown voltage, which has limits on the derivatives of to prevent the measurement procedure from taking the transistor into on-or off-state breakdown regions where the possibility of permanent damage to the transistor occurs. The measurement The drain current model, presented earlier in Fig. 9 , has very accurate approximation by the neural network within the measurement region, and reliable extrapolation beyond the measurement region boundary, giving confidence in the simulated drain current performance of this model.
The new model also displays excellent fidelity to the nonlinear device behavior at RF frequencies. This is shown in Fig. 11 , where we show that the new model predicts accurately the measured third-order intermodulation (IM3) over a wide range of input drive. Also shown is the prediction using a table model: while this is accurate at high powers, at lower powers, the interpolation can cause errors when the signal level is small. The new model also exhibits the correct asymptotic behavior of the IM products at low drive powers. This is also true for higher order IM products, as shown in Fig. 12 , in contrast to table-or data-based models, which are often limited by the order of the spline interpolation.
The approximation and extrapolation of the charge state functions are also well modeled by the neural networks. In Fig. 13 , we show the gate charge as a function of gate and drain voltages. The surface is well behaved with no local minima or sharp gradients, and the extrapolation is smooth and well behaved. The measured region used during the neural network training process is also indicated on this figure. The charges exhibit only small temperature dependence, and the model also extrapolates well as a function of temperature. Similar results are found for the drain charge surface. As an illustration of the robust extrapolation property of this neural network formulation, we show the model gate charge function over an extreme range of drain voltage. The function is smooth, well behaved, and convergent.
The derivatives predicted by the neural networks fit the numerical derivatives of the charge very well. In Fig. 14 , the derivatives of the gate charge, as a function of gate voltage, for various drain voltages are plotted. Fig. 12 . Simulation results of the IM distortion products for f = 2:14 GHz with a tone spacing of 100 kHz, as a function of the input power, at the intrinsic reference plane, comparing MET and Root models to the new model. Fig. 13 . Plot of Q versus V and V . Outside the measured region indicated by the thick line, the charge surface predicted by the neural network is smooth and very well behaved, even at extremely high voltages, which would never be experienced in practice, but may be used by the simulator during convergence. 
VIII. ELECTRO-THERMAL MODEL
Power transistors dissipate a lot of heat and thus it is necessary to include temperature dependence into the model param- eters. Further, the information signals used in modern wireless communications have large peak-to-average ratios of the signal voltage or power, therefore, the dynamical response of the transistor's thermal behavior to these inputs must also be captured in the model. An accurate dynamical electro-thermal model is required.
A simple way of including the thermal effects on the output current is to use a de-rating function on the drain current expression [3] , [11] , [20] .
The thermal variation of the threshold voltage acts in the opposite sense to the drain current de-rating, producing a point in the drain current-gate voltage relationship that is independent of temperature: the zero-temperature coefficient (ZTC) point [21] . This threshold voltage variation is accommodated by an additional parameter in the drain current model, as in the MET model [12] .
In addition to this accurate isothermal representation, the neural network model also returns accurate predictions of the drain current over temperature. In Fig. 15 , we show the drain current dependence on the gate voltage in the region close to threshold, where competing thermal effects on the drain current exist, resulting in an observed point of thermally independent drain current: the "zero temperature coefficient" current. The neural network predicts the nonlinear drain current behavior and its thermal dependences accurately with the modeled and measured data being virtually indistinguishable. The neural network also predicts well behaved currents at temperatures outside the measured range, maintaining the ZTC point as expected.
IX. VALIDATION OF THE MODEL
The correct prediction of large-signal performance by the nonlinear model is also crucial. In Fig. 16 , the drive-up characteristics of the model are compared with measurement at two different biases, showing very good agreement, even in compression. In Fig. 17 , we show the measured transducer gain contours for a 4.8-mm gatewidth LDMOS power transistor operating in class-AB bias, taken at 2.14 GHz. This device is of identical layout to the device used for the model extraction. The model matches the contours and peak gain point with excellent accuracy over the range of output powers and load impedances. The measured and predicted power-added efficiency load-pull contours for various output powers are shown in Fig. 18 . This is a stringent test of a model: the large-signal high-frequency nonlinear behavior and the dc conditions must be predicted accurately at the same time, and to do this over a range of load conditions and output powers indicates that the model is accurate, tracking the measured data very closely. This new model also executes in the simulator with no speed penalty, when compared with the industry-standard MET and Root models, as indicated in Table I . It should be noted that we have not yet optimized the new model for speed.
X. SCALING CONSIDERATIONS
The scaling behavior of the complete transistor model is heavily dependent on the scaling of the extrinsic network. Therefore, the scaling performance of the extrinsic network parameters is a good test of how well this electrical behavior is modeled over a range of transistor gate peripheries or widths. For RF power transistors, the unit gatewidth is generally maintained constant for a given application, and scaling of the device is simply by the number of gate fingers. However, the unit gatewidth will be different for different applications. For example, the driver transistors in a multistage integrated circuit (IC) power amplifier will generally be much shorter than the power transistor gatewidths, and the unit gatewidth will be larger for lower frequency applications, from considerations of overall die size. Moreover, from a modeling perspective, it is most useful to have one model kernel that can be used for all circuit and frequency applications; hence, the ability to scale accurately with transistor gate periphery is extremely beneficial.
In Figs. 19-21 , we illustrate the scaling behavior of the extrinsic resistance and inductance parameters. Four devices were chosen, having 4, 8, 16 , and 32 gate fingers and a constant unit gatewidth of 600 m, yielding total gate peripheries of 2.4, 4.8, 9.6, and 19.2 mm, respectively. These devices were measured under cold-FET conditions, as outlined in Section IV, and the extrinsic parameters were extracted: the series gate and drain inductances; the gate, drain, and source resistances; and the shunt gate and drain resistances. These extrinsic network parameters are seen to scale with total gatewidth in either a linear or inverse linear manner. The only outlier is the gate inductance of the 19.2-mm device: this is only one sample, and the higher than anticipated inductance may be a result of feeding a relatively wide device from a single point. Such simple and direct scaling indicates that the extrinsic network captures the electrical characteristics of the device layout very successfully. The extracted extrinsic capacitances are relatively independent of the gate periphery over this range. Further, as indicated in Fig. 5 , we have identified a mutual inductance as a component of the extrinsic network, modeling the inductive coupling between the long fingers that are characteristic of power transistors. In Fig. 22 , the extracted mutual inductance is shown as a function of the inverse of the total gate periphery for transistors with a unit gatewidth of 600 m. The mutual inductance is seen to scale inversely with gatewidth, or, alternatively, inversely with the number of gate fingers for a constant unit gatewidth.
XI. CONCLUSION
We have described the architecture, extraction, and implementation of a new nonlinear charge-conservative dynamic electro-thermal compact transistor model for RF power LDMOS FETs. This model includes a new extrinsic network for RF power LDMOS transistors that is designed to accommodate features attributable to the lossy silicon substrate and to the layout of the power transistor. The parameter-extraction procedure takes advantage of sequencer and tuning capabilities of the circuit simulator to identify the network parameter values over a broad frequency range. The new network was observed to scale with gatewidth effectively and simply. The intrinsic part of the model uses voltage-controlled current and charge sources, these later being defined in a correct charge-conservative manner. These sources are implemented using two hidden-layer artificial neural networks, and are shown to be accurate and to extrapolate well. The new Freescale Electro-Thermal LDMOS FET model is found to be more accurate than both the industry-standard MET model for power FETs and the isothermal Root model, which is another charge-conservative formulation. The new model is also at least as fast in simulation of typical power amplifier design applications, saving valuable design time. 
