I. INTRODUCTION
THE principles of the broad-band capability of distributed amplifiers (DAs) are well known [1]-[3]. The advantages of uniform gain, flat group delay, and low voltage standing-wave ratio (VSWR) over wide frequency ranges have made it possible to implement broad-band millimeter-wave receivers for digital optical communications [4]-[8] and other pulse applications. In addition to the conventional common-source topology, many other circuit topologies have been reported for high-performance DAs for digital optical communications, such as cascode [3], [4], [8], [10], dual-gate [11], matrix [12], differential [5], [13], attenuation compensation [14], twin-cascode [5], and cascade [9], [15]-[17], using GaAs, InP, SiGe, GaN, and CMOS foundry processes. However, the performance of the conventional distributed amplifier (CDA) is gain-bandwidth limited because of its optimum number of stages [2]. The cascode configuration (a common-source field-effect transistor (FET) connected to a common-gate FET), known for its high maximum available gain, wide bandwidth, improved input-output isolation, and variable gain control capability, has been used in many applications such as distributed mixers [18]-[20] and DAs [3], [4], [8], [10]. To make a very compact monolithic microwave integrated circuit (MMIC) DA design possible, the cascode FET gain cells are sometimes realized as a dual-gate structure. However, the dc power consumption of the cascode DA is higher, since the dc voltage across the cascode cell is double that in the CDA and the dc current also flows through the drain and gate termination resistors. The differential and twin-cascode DAs are promising topologies that offer better gain and lower noise than the CDA, but their chip size and dc power consumption are of concern.
The attenuation compensation technique can reduce the gate- and drain-line transmission losses and enhance the gain in the high-frequency band, but high-frequency stability and chip size become issues. The cascaded single-stage distributed amplifier (CSSDA), unlike the CDA, does not need to equalize the phase velocities on the gate and drain lines, but still needs to match the characteristic impedance of the interstage transmission lines. The CSSDA shows excellent performance with high gain, good gain flatness, low input and output VSWRs, flat group delay, and a low noise figure.
With more advanced MMIC processes, DAs fabricated on InP substrates [7]-[10] have achieved higher gain-bandwidth products (GBWs) than those fabricated on GaAs [4]-[6], [13], [14], [21], [22]. To obtain high gain, it is useful to cascade two identical DAs or to use the twin-cascode methodology [5], but the chip size and dc power consumption also increase. In this study, by adopting the proposed circuit topology on a GaAs substrate, a novel broad-band MMIC DA has been designed that demonstrates a GBW of 503 GHz, which is comparable with those of DAs using an InP-based high electron-mobility transistor (HEMT) MMIC process.
Several DA designs have added gain stages to improve gain/power capability, but most of them exhibit a bandpass gain response [23]-[25] rather than the full-band response of CDAs, so these broad-band amplifiers are band limited. To maintain the broad-band performance of the DA, traveling-wave gain stages in the form of CSSDAs [15], [16] are selected and designed as the broad-band gain stages. This paper, for the first time, proposes a novel low-pass filter (LPF) topology that combines the CDA and CSSDA on a single chip, called the CDA-CSSDA-2, to achieve high gain and wide bandwidth simultaneously. For proof of concept, a two-stage cascaded single-stage DA (2-CSSDA) and CDAs combined with one-/two-stage CSSDAs (i.e., CDA-CSSDA-1 and CDA-CSSDA-2) have been designed, fabricated, and tested. The 2-CSSDA shows band-limited performance, below 22 GHz. Adding a two-stage CDA in front of a one-stage CSSDA to form the CDA-CSSDA-1 extends the bandwidth up to 27 GHz. Finally, a CDA-CSSDA-2 with a seven-stage CDA and two-stage CSSDA provides high gain (22 dB) and broad-band performance up to 40 GHz. The CDA-CSSDA-2 demonstrates wide-band high gain with 1.5-dB ripple and flat group delay, which demonstrates the feasibility of this approach for broad-band applications.
II. DEVICE CHARACTERISTICS AND MMIC FABRICATIONS
These MMIC DAs were fabricated using a GaAs-based pseudomorphic high electron-mobility transistor (pHEMT) MMIC foundry process provided by TRW Inc., Redondo Beach, CA [26]. The active device is a 0.15-µm gate-length pHEMT with a unity current gain frequency and a maximum oscillation frequency of 81 and 120 GHz, respectively. The peak transconductance and the current at peak transconductance are 400 mS/mm and 200 mA/mm, respectively. The passive components include GaAs thin-film resistors, metal-insulator-metal (MIM) capacitors, inductors, and via-holes through a 100-µm GaAs substrate. The entire chip is protected by silicon-nitride passivation for reliability.
III. CIRCUIT DESIGN AND ANALYSIS

A. 2-CSSDA
The schematic representation and small-signal equivalent-circuit model of a two-stage cascaded single-stage distributed amplifier (2-CSSDA) are shown in Fig. 1. The transistor input and output capacitances are absorbed into synthetic transmission lines (LC ladder networks), and the equivalent capacitances of the transmission lines are also considered in the circuit. These line sections are much shorter than a wavelength and can be modeled with an LC-section equivalent circuit. The bandwidth of the amplifier is then limited by the cutoff frequency of the synthetic line, the frequency-dependent losses associated with the transistor input and output capacitances, and the corresponding inductances of the artificial transmission lines, as shown in Fig. 1(b). The amplifier uses high-impedance artificial transmission lines in conjunction with active devices, which together form second-order wide-band LPFs. Each stage of the amplifier is formed by a T-section network. Compared with the CDA, the 2-CSSDA demonstrates a significantly higher available gain [15] and improved gain flatness near the cutoff frequency.
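As an illustrative aside, the band-limiting role of the synthetic line can be sketched with the textbook constant-k low-pass ladder relations, Z0 = sqrt(L/C) and fc = 1/(pi*sqrt(L*C)). These relations, the function names, and the 0.1-pF shunt capacitance below are assumptions chosen for illustration; they are not equations or element values taken from this design.

```python
import math


def section_inductance(c_shunt, z0=50.0):
    """Series inductance (H) that gives characteristic impedance z0 (ohm)
    for a shunt capacitance c_shunt (F), using Z0 = sqrt(L / C)."""
    return z0 ** 2 * c_shunt


def cutoff_frequency(l_series, c_shunt):
    """Cutoff frequency (Hz) of a constant-k low-pass LC ladder,
    fc = 1 / (pi * sqrt(L * C))."""
    return 1.0 / (math.pi * math.sqrt(l_series * c_shunt))


if __name__ == "__main__":
    c = 0.1e-12  # hypothetical shunt capacitance per section (F)
    l = section_inductance(c, z0=50.0)
    fc = cutoff_frequency(l, c)
    print(f"L = {l * 1e9:.3f} nH, fc = {fc / 1e9:.2f} GHz")
```

Under these assumptions, a smaller absorbed capacitance raises the cutoff frequency for a fixed line impedance, which is one way to see why device size bounds the achievable bandwidth.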
The forward available gain for a 2-CSSDA with a lossless transmission line is given by (1) and (2) [15], [16]. The gain can be increased significantly by increasing the number of stages. The input and output impedances of the 2-CSSDA are derived under simplifying assumptions as follows:
where the symbols denote the normalized frequency, the transconductance of the active device, the inter-stage characteristic impedance, the gate-line characteristic impedance of the first stage, and the drain-line characteristic impedance of the last stage.
In the low-frequency approximation, the input and output impedances simplify to (6) and (7). At low frequencies, the termination resistors dominate the return losses, and the real part of the input impedance is larger than the imaginary part, as shown in Fig. 2(a), which means that input matching is easy to obtain below the cutoff frequency. Above the cutoff frequency, however, the imaginary part of the input impedance dominates because of the transmission-line effect, so it is difficult to obtain good input matching at high frequencies for the 2-CSSDA. In contrast, the imaginary part of the output impedance is comparable to the real part, as shown in Fig. 2(b), which means that output matching of the CSSDA requires more design effort than input matching. With an additional assumption, the expression can also be written in the form of (8).
By selecting proper values of the relevant HEMT parameters, good output impedance matching can be obtained below the cutoff frequency of the synthetic transmission line, as shown in Fig. 2(b) and (c). For a higher ratio, the output return loss will be decreased, but a high ratio is difficult to realize. In this design, the four-finger 80-µm device has a value of 515.6 for the first stage, and the four-finger 200-µm device has a value of 296.54 for the second stage. Above the cutoff frequency, the output return loss degrades because of the transmission-line effect.
The input and output stages of the CSSDA are designed to match the 50-Ω environment. To maintain broad-band matching, the input/output matching can be realized by equalizing the capacitor values as in (9). Other than at these two ports, all transmission lines have the same width but different lengths to achieve optimal overall performance.
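To make the matching discussion concrete, the following minimal sketch converts a port impedance into a reflection coefficient, return loss, and VSWR with respect to a 50-Ω reference, using the standard definitions. The function names and the example impedances are hypothetical and are not taken from the measured or simulated data of these amplifiers.

```python
import math


def reflection_coefficient(z_port, z0=50.0):
    """Complex reflection coefficient of impedance z_port against reference z0."""
    return (z_port - z0) / (z_port + z0)


def return_loss_db(z_port, z0=50.0):
    """Return loss in dB (positive number; larger means better match)."""
    gamma = abs(reflection_coefficient(z_port, z0))
    if gamma == 0.0:
        return math.inf  # perfect match
    return -20.0 * math.log10(gamma)


def vswr(z_port, z0=50.0):
    """Voltage standing-wave ratio corresponding to the same mismatch."""
    gamma = abs(reflection_coefficient(z_port, z0))
    if gamma >= 1.0:
        return math.inf  # total reflection
    return (1.0 + gamma) / (1.0 - gamma)


if __name__ == "__main__":
    # Hypothetical port impedances: mostly real, then strongly reactive.
    for z in (60 + 5j, 50 + 50j):
        print(z, f"RL = {return_loss_db(z):.2f} dB", f"VSWR = {vswr(z):.2f}")
```

A port whose imaginary part is comparable to its real part, as described for the output of the 2-CSSDA, gives a noticeably lower return loss than a mostly real impedance near 50 Ω.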
B. CDA-CSSDA-1 and CDA-CSSDA-2
The forward available gain of a lossless CDA is given in [1]-[3] as (10), in terms of the number of stages of the CDA, the transconductance of the active device, and the characteristic impedances of the drain and gate lines. In the CDA configuration, the cutoff frequency is limited by both the gate and drain capacitances and the corresponding inductances of the artificial transmission lines. The gain of the CDA is therefore limited to 7-14 dB [15], because the CDA is restricted to an optimum number of active devices, i.e., four or five for hybrid and six or eight for monolithic technologies [29], and there is also an instability problem as the frequency approaches the cutoff frequency of the gate or drain transmission lines.
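As a rough numerical illustration, the sketch below evaluates the commonly quoted low-frequency gain of a lossless distributed amplifier, G = n^2 * gm^2 * Zd * Zg / 4. This form, the symbol names, and the 32-mS transconductance are assumptions made for illustration; the expression is not claimed to be identical to (10), and losses, which set the optimum number of stages in practice, are ignored.

```python
import math


def cda_gain_db(n_stages, gm_siemens, z_drain=50.0, z_gate=50.0):
    """Lossless distributed-amplifier power gain in dB,
    assuming G = n^2 * gm^2 * Zd * Zg / 4."""
    gain_linear = (n_stages * gm_siemens) ** 2 * z_drain * z_gate / 4.0
    return 10.0 * math.log10(gain_linear)


if __name__ == "__main__":
    gm = 0.032  # hypothetical per-device transconductance (S)
    for n in (2, 4, 7):
        print(f"n = {n}: {cda_gain_db(n, gm):.2f} dB")
```

In this idealized model, doubling the number of stages adds about 6 dB; in a real CDA the line losses eventually cancel that benefit, which is consistent with the limited gain range quoted above.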
Because of its second-order LPF configuration, the 2-CSSDA is band limited compared with the CDA. The simulation results of the 2-CSSDA show a limited bandwidth (below 22 GHz). As the number of stages increases, the low-frequency gain also increases, so it is not easy to achieve flat gain in a multistage CSSDA. Therefore, a novel methodology for broad-band MMIC DAs is proposed that combines a CDA, used to extend the bandwidth, with a CSSDA/2-CSSDA as a gain stage, so as to achieve high gain over several octaves of bandwidth simultaneously. The small-signal equivalent circuit of the multistage CDA cascaded with the multistage CSSDA is shown in Fig. 3. In the proposed MMIC DA, the CDA is used as the first stage, and its design methodology is the same as that of the conventional CDA [1]-[3]. The amplified forward signal at the CDA output passes through the LPF; part of the signal travels on to the CSSDA, and the remainder is absorbed by the termination resistor, as shown in Fig. 3. The LPF topology is adopted between the CDA and CSSDA, and broad-band and high-gain performance is obtained by tuning the gain shapes of the CDA and CSSDA. To cover the same bandwidth as the drain line of the CDA, the device in the first stage of the CSSDA is chosen to be the same size as those in the CDA. The device in the second stage is approximately twice the size of that in the first stage for output power considerations. The CSSDA design follows (1)-(9).
The forward available gain of the proposed DA, which combines the CDA and the CSSDA, is given by (11). The proposed amplifier overcomes the drawbacks of the CDA and CSSDA for the following reasons.
1) The CDA used as the first stage extends the bandwidth, and the input VSWR of the proposed DA is low over a very wide frequency band.
2) There is no stability problem as the frequency approaches the cutoff frequency of the gate or drain line, because the length of the transmission line between the CDA and CSSDA and the capacitance of the first stage of the CSSDA can be controlled; this capacitance is absorbed into the artificial drain line of the CDA to form a broad-band LPF. The shunt capacitor can be optimized through the choice of device size.
3) The gain flatness of the proposed DA can be controlled, and extended to the low-frequency range, by tuning the gain shapes of the CDA and CSSDA.
4) The number of devices, dc power consumption, and chip size are smaller than with the approaches of cascading two DAs or using a twin-cascode DA.
C. MMIC Design and Simulation
The linear pHEMT model used in the simulation is provided by the foundry and is implemented in commercial computer-aided design (CAD) software (LIBRA from HP-EESOF). To demonstrate the concepts of the proposed DAs, three types of DAs, i.e., the 2-CSSDA, CDA-CSSDA-1, and CDA-CSSDA-2, are designed. The schematic diagrams of the proposed DAs are shown in Figs. 1(a) and 4, and the chip photographs are shown in Fig. 5. The 2-CSSDA is shown in Fig. 1(a). Its linear simulation results, shown in Fig. 6(a), give port return losses better than 5 dB and a small-signal gain of 19 ± 1 dB in the desired frequency band, although the bandwidth is limited. Fig. 5(a) shows the chip photograph of this MMIC, with a die size of 1.5 × 1 mm². The two-stage CDA cascaded with the one-stage CSSDA (CDA-CSSDA-1) is shown in Fig. 4(a). Its linear simulation results, shown in Fig. 6(b), give port return losses better than 8 dB and a small-signal gain of 19 ± 1 dB in the desired frequency band. The CDA-CSSDA-1 obtains a better input return loss than the 2-CSSDA because the CDA is used as the first stage. Fig. 5(b) shows the chip photograph of this MMIC, with a die size of 1.5 × 1 mm². To demonstrate the concept for millimeter-wave applications, the CDA-CSSDA-2, with a seven-stage CDA and two-stage CSSDA, is also designed, as shown in Fig. 4(b). Its linear simulation results, shown in Fig. 6(c), give port return losses better than 8 dB and a small-signal gain of 22 ± 1 dB in the desired frequency band. The CDA-CSSDA-2 also obtains a better return loss because the CDA is used as the first stage. Fig. 5(c) shows the chip photograph of this MMIC, with a die size of 1.5 × 2 mm².
Stability is a very important issue in DA design, especially as the frequency approaches the cutoff frequency of the synthetic transmission line. The stability factor of the complete CDA-CSSDA-2 is greater than one. The CDA-CSSDA-2 was also separated into two sub-circuits (the CDA and the CSSDA), and the stability factor and small-signal gain of each were inspected. As the frequency approaches the cutoff frequency of the gate and drain synthetic transmission lines, the stability factors of the individual CSSDA and CDA sub-circuits fall below one. However, checks of the loop-oscillation condition [27] in the CDA and of the inter-stage stability circles between the CDA and CSSDA show no stability problem in the complete CDA-CSSDA-2 circuit. This stability improvement arises because the cutoff frequency of the synthetic transmission line between the CDA and CSSDA can be controlled by adjusting the length of the transmission line and the input capacitance of the CSSDA.
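For readers who wish to reproduce this kind of check, the sketch below computes the Rollett stability factor K and |Δ| from two-port S-parameters, assuming that the stability factor referred to here is the usual K. The S-parameter values are hypothetical single-frequency examples, not data from these circuits, and a two-port K test does not replace the loop-oscillation and inter-stage stability-circle checks described above.

```python
import math


def rollett_k(s11, s12, s21, s22):
    """Return (K, |delta|) for a two-port described by complex S-parameters.

    K = (1 - |S11|^2 - |S22|^2 + |delta|^2) / (2 * |S12 * S21|),
    delta = S11 * S22 - S12 * S21.
    """
    delta = s11 * s22 - s12 * s21
    denom = 2.0 * abs(s12 * s21)
    if denom == 0.0:
        return math.inf, abs(delta)  # unilateral two-port
    k = (1.0 - abs(s11) ** 2 - abs(s22) ** 2 + abs(delta) ** 2) / denom
    return k, abs(delta)


def is_unconditionally_stable(s11, s12, s21, s22):
    """Unconditional stability requires K > 1 and |delta| < 1."""
    k, delta_mag = rollett_k(s11, s12, s21, s22)
    return k > 1.0 and delta_mag < 1.0


if __name__ == "__main__":
    # Hypothetical values: good isolation, then poor isolation near band edge.
    print(rollett_k(0.3, 0.01, 10.0, 0.3))
    print(is_unconditionally_stable(0.5, 0.1, 10.0, 0.5))
```

In practice the test would be evaluated at every frequency point of the simulated or measured S-parameters, since K typically dips near the cutoff frequency of the synthetic lines.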
IV. CIRCUIT PERFORMANCE
The three MMIC DAs, i.e., the 2-CSSDA, CDA-CSSDA-1, and CDA-CSSDA-2, were all measured via on-wafer probing. The small-signal S-parameters, group delays, and output power were evaluated. Fig. 6(a) shows a small-signal gain of 20 dB with 1-dB flatness and port return losses better than 5 dB for the 2-CSSDA in the frequency range of 0.5-22 GHz, with a total dc power consumption of 180 mW. Owing to the choice of optimal device parameters, the output return loss is better than 10 dB at high frequencies. For the CDA-CSSDA-1, the measured small-signal gain is 19 ± 1 dB and the port return losses are better than 8 dB in the frequency range of 0.5-27 GHz, as shown in Fig. 6(b), with a total dc power consumption of 324 mW. Using the CDA as the first stage of the DA also gives a better input return loss than that of the 2-CSSDA. For the CDA-CSSDA-2, the measured small-signal gain is 22 ± 1.5 dB and the port return losses are better than 5 dB in the frequency range of 0.1-40 GHz, as shown in Fig. 6(c), with a total dc power consumption of 484 mW. The measurement results agree well with the simulation results. The gain rolloff of these amplifiers is very gradual. This is desirable because a sharp gain rolloff, which is often observed in DAs, leads to excessive group-delay peaking and a deteriorated eye diagram [5]. Fig. 7 shows average group delays of 30 ± 10 ps for the CDA-CSSDA-2 and 12 ± 5 ps for the 2-CSSDA and CDA-CSSDA-1. The flat group delay of the proposed DA is very important for digital optical communications. The output power of all three DAs from 2 to 30 GHz is shown in Fig. 8, with values of 10, 11.5, and 12 dBm for the 2-CSSDA, CDA-CSSDA-1, and CDA-CSSDA-2, respectively. The output power versus input frequency of all three DAs is also presented in Fig. 9. An output power of 18-20 dBm for the 2-CSSDA and 22-20 dBm for the CDA-CSSDA-1 is obtained in the frequency range of 2-25 GHz. The output power of the CDA-CSSDA-1 is 2 dB higher than that of the 2-CSSDA at high frequencies. The output power is 22-20 dBm in the frequency range of 2-30 GHz for the CDA-CSSDA-2, which, operated at 30 GHz, also demonstrates a power gain of 22 dB for RF input power levels below -10 dBm.
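The group-delay figures quoted above are the kind of quantity obtained from the phase of the transmission coefficient as tau = -dphi/domega. A minimal sketch of that calculation with finite differences follows; the function names and the synthetic linear-phase data are hypothetical, and the phase is assumed to be already unwrapped and in radians.

```python
import math


def group_delay(freqs_hz, phase_rad):
    """Group delay (s) between adjacent frequency points,
    tau = -(delta phase) / (2 * pi * delta f), for unwrapped phase."""
    if len(freqs_hz) != len(phase_rad) or len(freqs_hz) < 2:
        raise ValueError("need two equal-length sequences with >= 2 points")
    delays = []
    for i in range(len(freqs_hz) - 1):
        d_phase = phase_rad[i + 1] - phase_rad[i]
        d_freq = freqs_hz[i + 1] - freqs_hz[i]
        delays.append(-d_phase / (2.0 * math.pi * d_freq))
    return delays


def delay_mean_and_ripple(delays):
    """Midpoint of the delay range and the +/- deviation about it."""
    lo, hi = min(delays), max(delays)
    return (hi + lo) / 2.0, (hi - lo) / 2.0


if __name__ == "__main__":
    tau = 30e-12  # hypothetical constant delay used to build test data
    freqs = [f * 1e9 for f in range(1, 41)]
    phase = [-2.0 * math.pi * f * tau for f in freqs]
    mean, ripple = delay_mean_and_ripple(group_delay(freqs, phase))
    print(f"{mean * 1e12:.1f} ps +/- {ripple * 1e12:.1f} ps")
```

A perfectly linear phase gives zero ripple; peaking in the computed delay near the band edge is the behavior associated above with a sharp gain rolloff.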
Regarding dc power consumption, a GaAs pHEMT cascode DA [28] with eight stages and a total device periphery of 1280 µm shows 15-dB gain up to 40 GHz with a dc bias of 7 V and 180 mA (1260 mW). By comparison, the CDA-CSSDA-2 in this paper achieves 40-GHz bandwidth with less than half the dc power consumption (484 mW) and much higher gain (22 dB). Table I summarizes the features and performance of previously published DAs and this study. Compared with the previously published results [4]-[9], the CDA-CSSDA-2 MMIC demonstrates the highest GBW among the MMICs using GaAs-based HEMTs and is also comparable with DAs using InP-based HEMT technologies.
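The comparison above can be checked with simple arithmetic. The sketch below assumes that the GBW is defined as the linear voltage gain multiplied by the bandwidth, which is an assumption about the definition used here, although it reproduces the reported figure of about 503 GHz from 22 dB and 40 GHz. The function names are hypothetical.

```python
def gbw_ghz(gain_db, bandwidth_ghz):
    """Gain-bandwidth product (GHz), assuming GBW = 10^(gain_dB / 20) * bandwidth."""
    return 10.0 ** (gain_db / 20.0) * bandwidth_ghz


def dc_power_mw(volts, milliamps):
    """DC power in mW from supply voltage (V) and current (mA)."""
    return volts * milliamps


if __name__ == "__main__":
    # CDA-CSSDA-2: 22 dB over 40 GHz at 484 mW (values from the text).
    print(f"CDA-CSSDA-2 GBW: {gbw_ghz(22.0, 40.0):.1f} GHz")
    # Cascode DA of [28]: 15 dB up to 40 GHz, biased at 7 V and 180 mA.
    cascode_power = dc_power_mw(7.0, 180.0)
    print(f"Cascode DA GBW: {gbw_ghz(15.0, 40.0):.1f} GHz")
    print(f"Power ratio: {484.0 / cascode_power:.2f}")
```

Under this definition, the 7-dB gain advantage at equal bandwidth corresponds to a GBW more than twice that of the cascode DA, at roughly 38% of its dc power.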
V. CONCLUSION
This paper has presented novel high-gain and broad-band MMIC DAs that combine a CDA and a CSSDA. The CDA-CSSDA-2 distributed amplifier achieves a GBW of 503 GHz, which is significantly higher than that of previously reported GaAs-based DAs. By adopting the LPF topology between the CDA and CSSDA and tuning the gain shapes of the CDA and CSSDA separately, broad-band performance is obtained. Detailed design equations are derived for the broad-band matching design of the CDA-CSSDA-2. To verify the design equations, two other MMICs, i.e., a two-stage CSSDA and a CDA-CSSDA-1 with a two-stage CDA, are also included in this paper. The number of devices, dc power consumption, and chip size are smaller than with the approaches of cascading two DAs or using a twin-cascode DA. The flat group-delay performance also demonstrates the feasibility of this approach, which is suitable for digital optical communication and broad-band pulse applications.
ACKNOWLEDGMENT
The MMIC foundry service was provided by TRW through the Chip Implementation Center (CIC), Taiwan, R.O.C. The authors would like to thank Dr. Y.-Z. Juang, CIC, for coordinating the foundry service.
