Abstract-One of the most important promises of capacitive micromachined ultrasonic transducer (CMUT) technology is integration with electronics. This approach is required to minimize the parasitic capacitances in the receive mode, especially in catheter-based volumetric imaging arrays, for which the elements must be small. Furthermore, optimization of the available silicon area and minimized number of connections occurs when the CMUTs are fabricated directly above the associated electronics. Here, we describe successful fabrication and performance evaluation of CMUT arrays for intravascular imaging on custom-designed CMOS receiver electronics from a commercial IC foundry. The CMUT-on-CMOS process starts with surface isolation and mechanical planarization of the CMOS electronics to reduce topography. The rest of the CMUT fabrication is achieved by modifying a low-temperature micromachining process through the addition of a single mask and developing a dry etching step to produce sloped sidewalls for simple and reliable CMUT-to-CMOS interconnection. This CMUT-to-CMOS interconnect method reduced the parasitic capacitance by a factor of 200 when compared with a standard wire-bonding method. Characterization experiments indicate that the CMUT-on-CMOS elements are uniform in frequency response and are similar to CMUTs simultaneously fabricated on standard silicon wafers without electronics integration. Experiments on a 1.6-mm-diameter dual-ring CMUT array with a center frequency of 15 MHz show that both the CMUTs and the integrated CMOS electronics are fully functional. The SNR measurements indicate that the performance is adequate for imaging chronic total occlusions located 1 cm from the CMUT array.
I. Introduction c atheter-based ultrasound imaging devices have been widely used for guiding interventions in blood vessels and in the heart [1] , [2] . most of these devices provide a cross section of an artery or a 2-D side view. This can be severely limiting when dealing with chronic total occlusions (cTos) [3] [4] [5] . a 2-D image of the internal structure of the heart may not be adequate for guiding certain types of interventions [6] . capacitive micromachined ultrasonic transducers (cmUTs) have been found especially suitable for these particular imaging applications because of the flexibility of different array structures at this small scale and most importantly because of the potential for electronics integration. Fig. 1 shows a schematic diagram of a forward-looking dual-ring annular array for volumetric imaging inside an artery and a micrograph of a typical array element [7] . The cmUT transducer element is approximately 70 × 70 μm in size and is comprised of four individual membranes. The outer membranes are 27 μm wide; the inner membrane is 24 μm wide. The center of the silicon device can be opened to allow for the use of a guide wire.
This paper focuses on process development and evaluation for intravascular imaging applications that take greater advantage of cmUT design flexibility and monolithic integration. In particular, we consider cmos integration with dual-ring cmUT annular arrays for forward-looking intravascular ultrasound (Fl-IVUs), which have a very small element size and overall area for electronics. These arrays have been previously fabricated and tested on bulk silicon for integration with an IVUs catheter with a diameter on the order of 1 to 2 mm [7] [8] [9] . Direct integration with electronics not only increases the snr of cmUTs by decreasing parasitic capacitances, but also significantly reduces the complexity of the design and optimizes the use of catheter-tip real estate. Estimated parasitic capacitances associated with a single receive (rx) element up to the point of connection to an amplifier, shown in Fig.  1 , are reduced by more than a factor of 200 compared with wire bonding by monolithic integration as numerically calculated (see Table I ). The transimpedance amplifier (TIa) input capacitance, which was used in this study, was simulated to be 400 fF. For an ideal case, the electronic and cmUT design parameters should not significantly affect each other. additionally, the process flow for the post-processing of cmUTs on electronics should keep additional steps to a minimum so as to preserve the yield. We attempted to keep these considerations in mind and have chosen to have custom cmos electronics fabricated using the commercial foundry on 200-mm (8-in) wafers using a 0.35-μm process [10] . The cmUTs were separately optimized for monolithic integration via post-processing of the cmos wafer. Fig. 2 shows a schematic cross section of an rx element integrated with a cmos wafer. The bottom electrode is connected to the amplifier with a common electrode above manuscript received July 24, 2011; accepted october 11, 2011 for the application of the dc bias. multiple rx elements, 8 or 16 depending upon the size of the array, are multiplexed to one of four output buffers and then connected to an external transmission line. The connections to the electronics are accomplished through the use of conformal metal sputtering through sloped sidewall etching, as shown at the TIa input and buffer output. The silicon-based fabrication methods employed are ideally suited for IVUs applications and easily integrate with cmos electronics. Well-established micromachining techniques allow for densely populated 2-D arrays capable of forward-looking volumetric imaging. The transducer density necessary for volumetric imaging is difficult to achieve with current piezoelectric fabrication techniques [6] .
single-and dual-ring arrays have been previously investigated and fabricated specifically for Fl-IVUs applications. Here, we briefly review the literature on cmUTelectonics integration to place the current work in the proper context. We then describe the fabrication process in detail, especially focusing on isolation and connection between the cmUT and cmos layers, which are both custom-designed. We then present the electrical and acoustic characterization results along with an initial volumetric imaging demonstration obtained from a 1.6-mm dual-ring cmUT-on-cmos array with 15 mHz center frequency.
II. cmUT-cmos Integration Techniques
Previous cmUT-electronics integration has been successful, but the complexity and tradeoffs of the various methods used can be so significant as to be prohibitive for particular applications. In the following subsections, we outline the main advantages and disadvantages of previously reported cmUT-electronics integration techniques.
A. CMUT-in-CMOS
Processing of the cmUTs can be simultaneously accomplished with the fabrication of electronics utilizing reliable and well-established cmos processing techniques [11] . because of the parallel processing, the overall fabrication time and processing costs are reduced. However, the design space for the cmUTs suffers from the limited cmos fabrication options in terms of material types, material properties, and layer thicknesses. Total device area also increases because the electronics and cmUTs must be positioned side-byside. Therefore, this is not an ideal scheme for Fl-IVUs applications, in which available space is limited.
B. Interleaved CMUT-CMOS
To improve upon the cmUT design space, it is possible to interrupt the electronics fabrication for outside processing [12] , [13] . This method has been utilized to generate smaller gap heights compared with the aforementioned cmUT-in-cmos processing. This technique increases cost and fabrication complexity and may experience resistance from commercial cmos foundries because of contamination concerns. In addition, even though cmUT performance can be improved, this process is still similar to the standard cmUT-in-cmos technique, in which available area is not optimally utilized.
C. Flip Chip Integration
To improve utilization of available device area, and to allow better optimization of cmUT material properties, separately fabricated cmos and cmUT chips can be elec- trically connected through flip chip bonding [14] , [15] . In this case, the cmUTs are not limited to low-temperature, cmos-compatible fabrication processes, nor are they limited to the materials and layer thicknesses designated by standardized foundry processes. However, this interconnect method requires several additional fabrication steps that include deep reactive ion etching (rIE), conformal metal and dielectric depositions, backside patterning, and the final wafer bonding step [16] . Furthermore, the packaging techniques associated with this technology are complex in nature, adding to the overall cost. This technique is attractive for larger cmUT arrays [17] , but may not be ideal for Fl-IVUs applications because the minimum solder bump pitch approaches the cmUT element pitch.
D. CMUT-on-CMOS
This paper will outline a process in which cmUTs are fabricated directly on top of pre-processed cmos wafers, similar to those described by noble et al. in [18] and [19] and Daft et al. in [20] . This interconnect approach minimizes parasitic capacitances associated with connection lines and introduces only one additional masking step. The total occupied area is minimized because the cmUTs can be located over cmos metal layers, similar to the flip chip bonding method, but without the complexity and spacing requirements associated with solder bump technology. In addition, the cmUTs are not constrained by the cmos processing steps, so it is possible to optimize material properties and produce thinner vacuum gaps for low-voltage operation and increased electromechanical coupling efficiency. a drawback to this method is that fabrication is limited to processes under 250°c, so as not to damage the cmos. However, we show that the performance of cmUTs fabricated with such low-temperature plasma-enhanced chemical vapor deposited (PEcVD) silicon nitride and oxide are not significantly affected.
The main design trade-offs associated with different cmos integration approaches are summarized in Table  II . For the design constraints associated with Fl-IVUs, cmUT-on-cmos fabrication was pursued as a good compromise, with advantages in optimal space utilization, minimal parasitic capacitances, and relative ease of fabrication and final assembly.
III. Fabrication and Processing of cmUTs on cmos
Integration of the cmUTs with the cmos electronics requires modification of the cmos surface before cmUT fabrication, and the process flow for the cmUTs includes additional steps for electrical connections. The cmos surface preparation, cmUT-cmos interconnection, and subsequent processing steps are discussed in the following sections.
A. CMOS Wafers
custom cmos electronics were fabricated through the Tsmc's 0.35-μm processing technology (Taiwan semiconductor manufacturing company, Hsinchu, Taiwan) on 200-mm silicon wafers with 48 repeated squares per wafer, as shown in Fig. 3 (a). Each square contained many dies with a variety of designs with multiplexers and TIas for the different cmUT designs. These wafers were subsequently diced yielding 6 rectangular blocks approximately 4 × 7 cm (3 × 2 die) to allow for cmUT fabrication using standard micromachining tools designed for 100-mm (4-in) wafers, as shown in Fig. 3(b) . no carrier wafers were used because the lithography tools utilized accepted rect- TablE II angular samples. because of its rectangular shape, processing uniformity is adversely affected in the upper right and lower left corners of the cmos block, as indicated in Fig. 3 (b). This result was attributed to the pooling of photoresist in the corners of the rectangular blocks during the photoresist spinning process. Through profilometer scans, it was found that layers from the cmos can protrude as much as 1 μm in critical regions where cmUTs are fabricated, as shown in Fig. 4 (a). subsequent cmUT processing on such topography could lead to problems such as open circuitry in the cmUT metallization layers as well as non-uniformity in cmUT geometry.
B. Improved CMUT-on-CMOS Integration 1) Polishing:
To address the cmos topography, planarization through mechanical polishing was pursued. a 3-μm layer of PEcVD sio 2 was deposited to electrically isolate the bottom electrodes of the cmUT devices from the cmos electronics beneath and to provide extra base surface material to be removed during polishing. Fig. 4(b) displays the profilometer scan of the cmos surface taken after the oxide deposition. The oxide layer was then planarized using a logitech Pm5 automated polisher (logitech ltd., Glasgow, scotland) with 1-μm aluminum oxide slurry (lEco corp., st. Joseph, mI) for approximately 3 h. Profilometer scans showed that the surface topography had been significantly reduced, and all previously observed steep slopes had been polished down to very gradual inclines of less than 0.5 degrees from the horizontal [see Fig. 4(c) ]. residual slurry contamination from the polishing process was removed in an ultrasonic cleaner with acetone and, if necessary, a dip in aluminum Etch Type a (Transene co. Inc., Danvers, ma), which attacks aluminum oxide but not the sio 2 layer. Further surface preparation included 0.2 μm of PEcVD silicon nitride deposition after polishing to improve stiction with the first aluminum layer in the cmUT process, which also forms the bond pad regions. In previous fabrication processes, the bottom electrode metal layer would often delaminate from the polished oxide surface during wire bonding. This new approach improved the wire bond success rate from approximately 10% to nearly 100%. The silicon nitride addition was preferred to oxide because previous testing revealed that it exhibited better bond pad adhesion, and etches twice as fast as oxide in subsequent processing (see Table III ).
2) CMUT Connections to CMOS Electronics:
We have previously reported on an interconnect scheme that uses sloped sidewall vias with conformal metal sputtering. This method is simple, only introducing one fabrication mask [21] , and uses the advanced Vacuum Vision 320 reactive ion etch (advanced Vacuum ab, lomma, sweden) with a modified recipe to produce sloped sidewalls (see Table  III ). scanning electron microscope images in Fig. 5 show the sidewall profile of 3-μm-diameter test vias.
because of the rectangular shape of the samples, spunon photoresists collect at opposing corners, forming a thicker region, and in crevices formed by the remaining cmos topography. Therefore, it was necessary to overexpose and overdevelop to ensure proper via opening across the entire sample. aluminum metal connections in the cmos layer act as an etch stop for the rIE process, pre- venting overetching of the isolation layer. Table III outlines the detailed cmos interconnect fabrication process parameters.
Electrical tests were performed with point probes, verifying the continuity through two vias with deposited aluminum bond pads connected through a metal cmos layer. The thickness of the metal trace on the cmos layer was not known, but with a width of 25 μm and utilizing all metal layers, it was assumed that the internal resistance of the trace would be small. a total resistance on the order of 10 Ω was measured across multiple samples, indicating that electrical connections were established through two separate vias simultaneously.
C. Post-CMOS Fabrication of CMUTs
Further cmUT processing follows well-defined recipes described in Knight et al. [22] and details are omitted here. Dimensions for the layer thicknesses can be found in Table  IV . In all lithography steps following the opening of the cmos connections, shipley 1813 (shipley co. Inc., marlboro, ma) was used instead of shipley 1827 with similar overexposure and development. The thinner photoresist coupled with increased exposure and development helped to prevent residual photoresist from collecting in the crevices of the remaining cmos topography and preventing shorting of the metal layers. some of the finer features of the cmUTs suffer from the over-developing in that the sharp corners of the square areas became slightly rounded similar to a chamfered corner. However, testing indicated no significant adverse effects to cmUT performance.
With the aid of in-house polishing, improved cmos connection vias, and photolithography modifications, we have successfully integrated different cmUT arrays with custom cmos electronics for intravascular ultrasound applications. Fig. 6 shows two examples of these arrays. The 1.6-mm-diameter dual-ring Fl-IVUs array on the left consists of 64 receiver elements on the outer ring and 24 transmitters in the inner ring. The cmos electronics contain one TIa for each receiver, and four 16 × 1 multiplexers to route 4 receiver channels to the 4 output buffers. The electronics also include the digital decoding and control lines to collect data in a sequential manner from all transmit-receive combinations. In this version of the electronics, the transmitters are driven by outside pulsers, but will be integrated into the same chip in future versions. In the Fl-IVUs array shown in Fig. 6(a) , the electronics are not located directly beneath the cmUTs because this was a proof-of-concept design. concurrently, other Fl-IVUs arrays with smaller dimensions were also fabricated with electronics directly below cmUT arrays without complication, Fig. 6(b) . These devices have transmit (Tx) elements connected to external bond pads for wire bonding and testing purposes.
D. CMOS-CMUT-Specific Design Issues
During the design phase of the cmos electronics and associated cmUT masks, certain considerations were kept in mind to ensure reliable integration. First, alignment marks in the shape of crosshairs were included in the cmos metal layers which could be observed on the finished electronics for cmUT mask alignment. second, the cmos electronic connection studs where the cmUT connects to the associated electronic input were defined to be on the top metal layer. The studs were larger than the connection etch holes to act as an etch stop during the via opening so that other cmos layers would not be accidentally etched. The stud and via locations were placed to the sides of the cmUTs because they introduced topography on the order of microns. If the cmUT membrane were to be directly above a via, it could interfere with the membrane performance. The cmUT mask for the metal layer following the via etching, which connected the cmUTs to the electronics, masked the entire via location with ~25% additional coverage to prevent accidental etching of the metal studs if misaligned. as mentioned previously, all 6 . (a) a fully fabricated cmUT-on-cmos dual-ring array with a 1.6-mm-diameter array, and (b) an 840-μm-diameter array fabricated directly on top of the cmos application-specific integrated circuit (asIc) for forward-looking intravascular ultrasound application.
processing steps were limited to temperatures no greater 250°c, so as not to adversely affect the cmos electronics. outside of these considerations, standard cmos and cmUT design rules were adhered to. Finally, the sio 2 isolation layer thickness should be greater than 1 μm to ensure that there is no dielectric breakdown between the high voltage (100 to 200 V) bias lines and the underlying low-voltage cmos and ground lines.
IV. Experimental characterization
For electrical and acoustical characterization of the cmUT-on-cmos devices, the dual-ring Fl-IVUs array shown in Fig. 6(a) was used because this array has larger area and larger number of array elements than the other Fl-IVUs arrays concurrently fabricated. Each element consists of 4 rectangular cmUT membranes, as seen in Fig 1. as shown, there are two differently sized membranes that are electrically connected to form each element. The motivation behind this design was to maximize the active transducer area in the circular transducer region available. smaller ring arrays for Fl-IVUs with cmos directly beneath cmUTs [shown in Fig. 6(b) ] have also been successfully tested as reported in Gurun et al. [23] , but those were designed with fewer elements and were not as ideal to demonstrate volumetric imaging in this frequency range.
because of the small membrane sizes and the number of Tx and rx elements, it is possible for misalignment and process variation to decrease the uniformity across an array. as a figure of merit, we investigated the uniformity of a cmUT-on-cmos array and an array with an identical mask design fabricated on a standard silicon wafer for comparison. additional experimental characterization included pulse-echo experiments to evaluate the frequency response and initial volumetric images obtained by the Fl-IVUs arrays.
A. Electrical Characterization
To characterize uniformity, a cmUT-on-cmos FlIVUs array was attached to a custom printed circuit board (Pcb) with each of the 24 Tx cmUT elements wirebonded to a separate connection pad. an agilent 8753Es network analyzer (agilent Technologies Inc., santa clara, ca) was used to measure the resonant frequency of the Tx elements at 70 V dc bias, well below the average collapse voltage of 96 V. It was not possible to directly test the rx elements because they were only connected to the cmos, with no external testing connections. Testing in air, as opposed to operation in a fluid medium, is beneficial for uniformity characterization because the differences in resonances are more apparent with the higher quality factor. because of the two different membrane sizes within a cmUT element, two distinct resonance peaks are seen in the real part of the electrical impedance for a single Tx element, as shown in Fig. 7 . For uniformity evaluation, impedance data were collected for each Tx element of the integrated cmUTon-cmos dual-ring array. The two distinct resonant peak frequencies for each Tx element across the 24-element array were measured with average resonances calculated as 13.7 and 16.7 mHz. numerical methods based on a thinplate approximation and a finite difference method [24] were used to calculate the resonances in air of the two differently sized membranes, which were found to be 13.7 and 16.7 mHz, in excellent agreement with measurements. The difference for each Tx element from the average resonance frequency does not exceed 6%, implying acceptable uniformity from the fabrication. most of this variation is likely a result of the pre-process polishing and geometrical variation introduced while working with rectangular cmos samples, including the associated misalignment errors. any non-uniformity in mechanical behavior resulting from the PEcVD nitride depositions should be minimal, considering that a given device occupies such a relatively small space on the wafer. similar data were obtained from cmUT arrays fabricated on a blank 100-mm silicon wafer using the cmUTon-cmos mask set. The resonances in air were measured to be, on average, 14.7 and 17.1 mHz, with the difference of each Tx element not exceeding 6%. The calculated resonances were 14.4 and 17.2 mHz. The higher resonances are attributed to a slight increase in the membrane thickness, ~0.1 μm, which occurred during fabrication. This indicates that the cmUT-on-cmos fabrication techniques implemented do not significantly affect uniformity of the cmUT operation from element to element. It should be noted that in a more recent cmUT-on-cmos fabrication, with better alignment and reduced surface roughness from improved polishing and optimized metal depositions, uniformity was measured to be within 1.5% of the mean. This validates the assumption that PEcVD nitride should exhibit minimal variation over the elements of a single device. 
B. Acoustic Characterization
a completed 1.6-mm-diameter cmUT-on-cmos array, as shown in Fig. 8 , was wire bonded in a ceramic dual inline package (DIP) (csb04075, spectrum semiconductor Inc., san Jose, ca) and coated with 3 μm of Parylene c (specialty coating systems, Indianapolis, In) for electrical isolation in water. a 60-mm-diameter, 15-mm-tall polystyrene Petri dish was modified by milling a square opening in the base and then using epoxy to attach it to the ceramic DIP. This formed a cavity that could then be filled with water. External pulsers were connected to the Tx elements and the reflections from the air-water interface were received by rx elements integrated with cmos amplifiers and multiplexers. The dc bias for the Tx and rx elements was set to 80 V, below the typical 95% of collapse voltage to avoid premature failure. a sample pulseecho signal from the water-air interface at 5 mm is shown in Fig. 9 with the corresponding frequency response, which was filtered from 5 to 30 mHz during post processing. The center frequency of the array elements is around 15 mHz, as desired. There are significant spurious features in the frequency response, including a dip at 7.5 mHz which is due to standing waves in the 550-μm-thick silicon substrate [25] , [26] . acoustic crosstalk in the array resulting from the initial transmit pulse is also significant and corrupts the echo signal with features occurring before the target echo signal. This issue must be considered in future designs with realistic doughnut-shaped geometries, but for the purposes of this paper, the pulse-echo response shows that the signal levels from single Tx-rx pairs are strong and uniform. The standard deviation in the pulse echo peak-to-peak amplitudes was 1.3 db when the output for all active receiver elements was analyzed while using the same single transmitter. a simple geometric analysis of the Tx-rx distance combinations shows that the difference in the distance varies by not more than 5% for an acceptance angle deviation less than 18° from the vertical. considering the small magnitude of these distance variations, the results from scanning all rx channels demonstrate consistent and uniform acoustic response in the receiver side.
To have an estimate of the achievable pulse-echo snr, the receive bias was increased to 86 V, the water-air interface distance was set to 1 cm and the Tx element was driven with a 160-Vpp bipolar pulse centered at 15 mHz. The resultant snr was calculated to be 41 db using the rms value of the received pulse divided by the rms of the received signal without transmission, i.e., background cmos electronic noise. The measurement bandwidth was 40 mHz, limited by the cmos amplifiers. To evaluate the feasibility of imaging cTos with this snr level, we performed a calculation including diffraction losses and attenuation in blood at 15 mHz with 0.14 db/mHz·cm frequency dependence. considering an acoustical impedance of 1.68 mrayl for blood and reflection from a typical fat tissue with impedance of 1.33 mrayl, the snr with 48 Tx elements and 64 rx elements at a distance of 1 cm was calculated to be 34 db, suitable for detecting cTos in front of a catheter [27] . simulations and measurements on a noise-optimized TIa implemented in cmUT-oncmos shows that the snr can be further improved by 15 db [23] .
To demonstrate that real-time imaging data can be acquired with the Fl-IVUs dual ring arrays, a custom data-collection setup was constructed. It consisted of a custom high-voltage pulser board to actuate the Tx elements, a field-programmable gate array board (Virtex 5, Xilinx, san Jose, ca) for Tx element selection and multiplexing of the on-chip rx elements, and a Pc with 2 digitizer cards (UltraFast m3i.4142, strategic Test corp., Woburn, ma) for data collection and storage. The synthetic phased-array method was used for image processing. The pulse-echo data for every Tx-rx element pair were acquired for 20 μs (~1.5 cm imaging depth) without any time gain compensation or averaging. This results in a real-time data acquisition rate of 130 frames/s considering 1536 Tx-rx combinations and 4 parallel receive channels, which is suitable for Fl-IVUs imaging. Delay-and-sum algorithms are then used to create a volumetric images using synthetic phased-array image reconstruction. a more detailed description of the imaging setup with sample images can be found in [28] .
V. conclusions
In this paper, we have described an improved fabrication process for monolithic cmUT-on-cmos integration with minimal additional processing. The surface preparation of the custom cmos electronics wafers has been shown to significantly reduce the surface topography and improve the bond pad metal stiction, yielding nearly perfect wirebond success.
The sloped sidewall vias have been shown to be a simple but reliable method for integrating cmUTs with cmos electronics while minimizing the parasitic capacitances.
We have successfully fabricated and tested dual-ring arrays for Fl-IVUs applications. Experimental uniformity characterization shows that cmUT performance has not been altered significantly by the improved fabrication process and is comparable to cmUTs processed on standard silicon wafers. Initial real-time data collection experiments at 100 frames/s data rate was performed using a customdesigned setup. The pulse echo data from array elements show that both Tx and rx elements are functional and successful integration with cmos electronics yields snr levels suitable for volumetric images of tissue-like reflectors in blood over an imaging depth of 1 cm. With proper electrical interconnect and integration to catheters, the single chip cmUT-on-cmos approach appears to be a promising technology for challenging volumetric intravascular imaging applications. 
