21 research outputs found

    A 90 nm CMOS 16 Gb/s Transceiver for Optical Interconnects

    Get PDF
    Interconnect architectures which leverage high-bandwidth optical channels offer a promising solution to address the increasing chip-to-chip I/O bandwidth demands. This paper describes a dense, high-speed, and low-power CMOS optical interconnect transceiver architecture. Vertical-cavity surface-emitting laser (VCSEL) data rate is extended for a given average current and corresponding reliability level with a four-tap current summing FIR transmitter. A low-voltage integrating and double-sampling optical receiver front-end provides adequate sensitivity in a power efficient manner by avoiding linear high-gain elements common in conventional transimpedance-amplifier (TIA) receivers. Clock recovery is performed with a dual-loop architecture which employs baud-rate phase detection and feedback interpolation to achieve reduced power consumption, while high-precision phase spacing is ensured at both the transmitter and receiver through adjustable delay clock buffers. A prototype chip fabricated in 1 V 90 nm CMOS achieves 16 Gb/s operation while consuming 129 mW and occupying 0.105 mm^2

    Equalization of multi-Gb/s chip-to-chip interconnects affected by manufacturing tolerances

    Get PDF
    Electrical chip-to-chip interconnects suffer from considerable intersymbol interference at multi-Gb/s data rates, due to the frequency-dependent attenuation. Hence, reliable communication at high data rates requires equalization, to compensate for the channel response. As these interconnects are prone to manufacturing tolerances, the equalizer must be adjusted to each specific channel realization to perform optimally. We adopt a reduced-complexity equalization scheme where (part of) the equalizer is fixed, by involving the channel statistics into the equalizer derivation. For a 10โ€ฏcm on-board microstrip interconnect with a 10% tolerance on its parameters, we point out that 2-PAM transmission using a fixed prefilter and an adjustable feedback filter, both with few taps, yields only a moderate bit error rate degradation, compared to the all-adjustable equalizer; at a bit error rate of 1e-12 these degradations are about 1.1โ€ฏ dB and 3โ€ฏ dB, when operating at 20 Gb/s and 80 Gb/s, respectively

    Technological University in 1984 and 1988, respectively

    Get PDF
    Abstract We present an analysis comparing multi-level signaling to standard NRZ signaling for module-to-module on-board electrical interconnects. To study on-board electrical performance, duobinary and PAM4 I/O models were created and compared to NRZ signaling in behavioral link-level simulations. A great variety of high-density, high-speed on-board module-to-module electrical links were analyzed, and specific interconnect channels were validated experimentally with programmable equalization transceiver chips communicating through a set of fabricated test structures. Link performance was measured with on-chip eye monitoring circuits and an oscilloscope. Simulation results show that NRZ signaling with FFE and DFE equalization offers the best electrical performance

    PAM4-๋ฐ”์ด๋„ˆ๋ฆฌ ๋ธŒ๋ฆฌ์ง€ ์นฉ์šฉ PAM4 ํŠธ๋žœ์Šค๋ฏธํ„ฐ ์„ค๊ณ„

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(์„์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ์ „๊ธฐยท์ •๋ณด๊ณตํ•™๋ถ€, 2022. 8. ์ •๋•๊ท .๊ณ ์„ฑ๋Šฅ ์ปดํ“จํŒ… ์‹œ์Šคํ…œ, ๋Œ€์šฉ๋Ÿ‰์˜ ๋ฐ์ดํ„ฐ ์„ผํ„ฐ, AI ๊ธฐ์ˆ ์˜ ๋ฐœ์ „์œผ๋กœ ์ธํ•ด ์œ ์„  ํ†ต์‹ ์˜ ๋Œ€์—ญํญ ์š”๊ตฌ ์ˆ˜์ค€์€ ๊ธฐํ•˜๊ธ‰์ˆ˜์ ์œผ๋กœ ์ฆ๊ฐ€ํ•˜๊ณ  ์žˆ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ I/O ํšŒ๋กœ์˜ ํ•€๋‹น ๋Œ€์—ญํญ์˜ ํ–ฅ์ƒ์€ ํ†ต์‹  ์ฑ„๋„์˜ ๋‹ค์–‘ํ•œ ํ•œ๊ณ„๋กœ ์ธํ•ด ์–ด๋ ค์›€์„ ๊ฒช๊ณ  ์žˆ๋‹ค. ์ด๋Š” ์ฐจ์„ธ๋Œ€ DRAM ๋ถ„์•ผ์—์„œ๋„ ์˜ˆ์™ธ๋Š” ์•„๋‹ˆ๋‹ค. ํ•€๋‹น ๋ฐ์ดํ„ฐ ์ „์†ก ์†๋„๋ฅผ ์ฆ๊ฐ€์‹œํ‚ค๋Š” ์—ฐ๊ตฌ ๋ฐฉํ–ฅ์—์„œ๋Š” ์–ด๋Š ์ •๋„ ํ•œ๊ณ„์— ๋ด‰์ฐฉํ•˜๋ฉด์„œ ์ตœ๊ทผ์—๋Š” High Bandwidth Memory (HBM)์™€ ๊ฐ™์ด ํ•€์˜ ๊ฐœ์ˆ˜๋ฅผ ๊ธ‰๊ฒฉํžˆ ๋Š˜๋ ค์„œ ๋Œ€์—ญํญ์„ ์ฆ๊ฐ€์‹œํ‚ค๋Š” ๊ธฐ์ˆ ๋„ ๋ฐœ์ „ํ•˜๊ณ  ์žˆ๋‹ค. ๋‹ค๋ฅธ ์ ‘๊ทผ ๋ฐฉ์‹ ์ค‘ ํ•œ๊ฐ€์ง€๊ฐ€ ๋‹ค์ค‘ ๋ ˆ๋ฒจ ์‹ ํ˜ธ ๋ฐฉ์‹์ด๋‹ค. ๊ธฐ์กด์˜ Non-Return-to-Zero (NRZ) ์‹ ํ˜ธ ๋Œ€์‹ ์— ๋‹ค์ค‘ ๋ ˆ๋ฒจ ์‹ ํ˜ธ ๋ฐฉ์‹์„ ์ด์šฉํ•˜๋ฉด ๋™์ผํ•œ Nyquist ์ฃผํŒŒ์ˆ˜์—์„œ ๋ฐ์ดํ„ฐ ์†๋„๋ฅผ ๋†’์ผ ์ˆ˜ ์žˆ๊ณ  ์ด๋Š” DRAM์˜ ์ฐจ์„ธ๋Œ€ ๊ณ ๋Œ€์—ญํญ I/O ์ธํ„ฐํŽ˜์ด์Šค์— ์ข‹์€ ์†”๋ฃจ์…˜์ด ๋  ์ˆ˜ ์žˆ์œผ๋ฉฐ ํ˜„์žฌ๊นŒ์ง€๋Š” 4๋ ˆ๋ฒจ ํŽ„์Šค ์ง„ํญ ๋ณ€์กฐ ๋ฐฉ์‹ (PAM-4)์ด ๋„๋ฆฌ ์ฑ„ํƒ๋˜์–ด ์žˆ๋‹ค. ํ•˜์ง€๋งŒ ํ˜„์žฌ PAM-4 ๋ฐฉ์‹ DRAM์ด ์–‘์‚ฐ ๋‹จ๊ณ„๊ฐ€ ์•„๋‹ˆ๊ธฐ ๋•Œ๋ฌธ์— PAM-4 ์ „์šฉ Memory Tester๊ฐ€ ์—†๋Š” ์ƒํ™ฉ์ด๋‹ค. ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ์ฐจ์„ธ๋Œ€ ๋ฉ”๋ชจ๋ฆฌ ํ…Œ์ŠคํŠธ๋ฅผ ์œ„ํ•œ 32 Gb/s PAM4 ๋ฐ”์ด๋„ˆ๋ฆฌ ๋ธŒ๋ฆฌ์ง€์—์„œ์˜ ํŠธ๋žœ์Šค๋ฏธํ„ฐ๋ฅผ ์ œ์•ˆํ•œ๋‹ค. NRZ ํ…Œ์Šคํ„ฐ์—์„œ ๋ธŒ๋ฆฌ์ง€๋กœ ์ „์†ก๋œ ์ €์† ๋ฐ์ดํ„ฐ๋Š” ๊ณ ์† PAM4 ๋ฐ์ดํ„ฐ๋กœ ๋ณ€ํ™˜๋˜์–ด ๋ฉ”๋ชจ๋ฆฌ๋กœ ์ „๋‹ฌ๋œ๋‹ค. ์ ‘์ง€ ์ข…๋‹จ PAM4 ๋“œ๋ผ์ด๋ฒ„๋Š” 2-ํƒญ ํ”ผ๋“œํฌ์›Œ๋“œ ์ดํ€„๋ผ์ด์ €๋กœ ์ถœ๋ ฅ ์ „๋ฅ˜๋ฅผ ์ œ์–ดํ•˜์—ฌ 0.95์˜ ๋ ˆ๋ฒจ ๋ถˆ์ผ์น˜ ๋น„์œจ (RLM)์„ ๋‹ฌ์„ฑํ•จ์œผ๋กœ์จ ๋‹จ์ผ ์ข…๋‹จ ์ถœ๋ ฅ์„ ์ œ๊ณตํ•œ๋‹ค. 40 nm CMOS ๊ธฐ์ˆ ๋กœ ์ œ์ž‘๋œ ๋ธŒ๋ฆฌ์ง€ ํŠธ๋žœ์Šค๋ฏธํ„ฐ๋Š” 0.57 mm2์˜ ํ™œ์„ฑ ์˜์—ญ์„ ์ฐจ์ง€ํ•˜๊ณ  102.1 mW์˜ ์ „๋ ฅ์„ ์†Œ๋ชจํ•œ๋‹ค.With the advancement of high-performance computing systems, large-capacity data centers, and AI technologies, the level of bandwidth demand for wired communication is increasing exponentially. However, the improvement of the bandwidth per pin in the I/O circuit compared to the required bandwidth level is difficult due to various limitations of the transmission channel. This is no exception in the next generation of DRAM. While facing limitations from the perspective of research that increases data transmission speed per pin, technologies that increase I/O bandwidth by rapidly increasing the number of pins, such as High Bandwidth Memory (HBM), have also recently developed. One of the other approaches is a multi-level signaling method. Using a multi-level signaling method instead of a conventional Non-Return-to-Zero (NRZ) signal can increase data speed at the same Nyquist frequency, which can be a good solution for the next-generation high-bandwidth I/O interface of DRAM, and so far, a four-level Pulse Amplitude Modulation (PAM-4) has been widely adopted. However, since PAM4 DRAM is not in the mass production stage yet, there is no memory tester dedicated to PAM4 signaling. This paper proposes a transmitter block on a 32 Gb/s PAM4 binary bridge for next-generation memory testing. The low-speed data transmitted from the NRZ tester to the bridge is converted into high-speed PAM4 data through half-rate clock control and transferred to the memory. The ground termination PAM4 driver provides a single-ended output by controlling the output current with a two-tap feed forward equalizer to achieve a Level separation Mismatch Ratio (RLM) of 0.95. Bridge transmitter manufactured with 40 nm CMOS technology occupies an active area of 0.57 mm2 and consumes 102.1 mW of power.ABSTRACT I CONTENTS โ…ข LIST OF FIGURES โ…ค LIST OF TABLES โ…ฆ CHAPTER 1 INTRODUCTION 1 1.1 MOTIVATION 1 1.2 THESIS ORGANIZATION 4 CHAPTER 2 BACKGROUNDS 5 2.1 OVERVIEW 5 2.2 BASIC OF MULTI LEVEL SIGNALING 7 2.3 NECESSITY OF PAM4-BINARY BRIDGE 11 CHAPTER 3 DESIGN OF PAM4 TRANSMITTER FOR PAM4-BINARY BRIDGE 14 3.1 DESIGN CONSIDERATION 14 3.2 OVERALL ARCHITECTURE 17 3.3 CIRCUIT IMPLEMENTATION 19 3.3.1 CLOCK GENERATOR 19 3.3.2 PARALLEL PRBS GENERATOR 23 3.3.3 DATA ALIGN / GRAY CODE ENDCODER 26 3.3.4 FFE CONTROL/ SERIALIZER 30 3.3.5 PAM4 DRIVER 33 CHAPTER 4 MEASUREMENT RESULTS 38 4.1 CHIP PHOTOMICROGRAPH 38 4.2 MEASUREMENT SETUP 39 4.3 MEASUREMENT RESULTS 40 4.4 PERFORMANCE SUMMARY 42 CHAPTER 5 CONCLUSION 46 BIBLIOGRAPHY 47 ์ดˆ ๋ก 50์„

    Feed Forward Equalization Simulation Model for High-Speed Channel Applications

    Get PDF
    Proyecto de Graduaciรณn (Licenciatura en Ingenierรญa Electrรณnica) Instituto Tecnolรณgico de Costa Rica, Escuela de Ingenierรญa Electrรณnica, 2016.This thesis describes the development of simulation models of FFE stages (Feed Forward Equalizer) in transmitters for high-speed channel analysis. The implementation of this equalizer was performed using two different tools: ADS (Advance Design System) by Keysight, which is a commercial tool and SPITDS (Signal and Power Integrity Time Domain Simulator), which is a simulator developed by the Institute of Electromagnetic Theory at the Technical University of Hamburg in Germany. Additionally, the LMS (Least Mean Square) algorithm was implemented in MATLAB and ADS to adjust the equalizer coefficients, in order to have a better signal recovery. Different channels were designed to make comparisons and verify the performance of different implementations of FFE with LMS. Both implementations of the Feed Forward Equalizer and LMS algorithm worked properly, which allows the application of the equalization FFE and the calculation of the equalizer coefficients for signal integrity analysis

    State of the art in chip-to-chip interconnects

    Get PDF
    This thesis presents a study of short-range links for chips mounted in the same package, on printed circuit boards or interposers. Implemented in CMOS technology between 7 and 250 nm, with links that operate at a data rate between 0,4 and 112 Gb/s/pin and with energy efficiencies from 0,3 to 67,7 pJ/bit. The links operate on channels with an attenuation lower than 50 dB. A comparison is made with graphical representations between the different articles that shows the correlation between the different essential metrics of chip-to-chip interconnects, as well as its evolution over the last 20 years.Esta tesis presenta un estudio de enlaces de corto alcance para chips montados en un mismo paquete, en placas de circuito impreso o intercaladores. Implementado en tecnologรญa CMOS entre 7 y 250 nm, con enlaces que operan a una velocidad de datos entre 0,4 y 112 Gb/s/pin y con eficiencias energรฉticas de 0,3 a 67,7 pJ/bit. Los enlaces operan en canales con una atenuaciรณn inferior a 50 dB. Se realiza una comparaciรณn con representaciones grรกficas entre los diferentes artรญculos que muestra la correlaciรณn entre las distintas mรฉtricas esenciales de las interconexiones chip a chip, asรญ como su evoluciรณn en los รบltimos 20 aรฑos.Aquesta tesi presenta un estudi d'enllaรงos de curt abast per a xips muntats en el mateix paquet, en plaques de circuits impresos o interposers. Implementat en tecnologia CMOS entre 7 i 250 nm, amb enllaรงos que funcionen a una velocitat de dades entre 0,4 i 112 Gb/s/pin i amb eficiรจncies energรจtiques de 0,3 a 67,7 pJ/bit. Els enllaรงos funcionen en canals amb una atenuaciรณ inferior a 50 dB. Es fa una comparaciรณ amb representacions grร fiques entre els diferents articles que mostra la correlaciรณ entre les diferents mรจtriques essencials d'interconnexions xip a xip, aixรญ com la seva evoluciรณ en els darrers 20 anys

    LPDDR5์˜ ์™ธ์žฅ ์ž๊ฐ€ ํ…Œ์ŠคํŠธ๋ฅผ ์œ„ํ•œ ๊ณ ์† ์†ก์‹ ๊ธฐ์˜ ์„ค๊ณ„

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(์„์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ์ „๊ธฐยท์ •๋ณด๊ณตํ•™๋ถ€, 2022.2. ์ •๋•๊ท .To overcome the speed gap between Automatic Test Equipment (ATE) and memory, the concept of Built-out Self-test (BOST) was introduced. This thesis presents the design of a transmitter for BOST of LPDDR5. It transmits high-speed DQS and WCK to DRAM while receiving low-speed clocks from ATE. Since they donโ€™t always have clock-toggle, a digital block generates some data patterns. Also, by phase interpolators, phases of the outputs are shifted by desired. The analog part of the transmitter consists of phase interpolators, serializers, and drivers. Phase interpolators and drivers are designed in a current mode to be resistant to supply noise. The divider of the serializer is newly proposed so that the timings of all outputs are the same. In addition, the time it takes to receive enabling signals from ATE and transmit outputs to DRAM is constant. As a result, the transmitter sends DQS and WCK with data patterns to DRAM at the desired timing. The proposed transmitter is fabricated in a 40 nm CMOS process. 1 TX lane consumes 31.4 mW and occupies 0.06 mm2. Measured DQS has a swing of 230 mV and an RMS jitter of 770 fs at 10 Gb/s with 50 ฮฉ termination. And WCK has a swing of 185 mV and an RMS jitter of 894 fs at 10 Gb/s with 40 ฮฉ termination.์ž๋™ ํ…Œ์ŠคํŠธ ์žฅ๋น„ (ATE)์™€ ๋ฉ”๋ชจ๋ฆฌ ๊ฐ„์˜ ์†๋„ ์ฐจ์ด๋ฅผ ๊ทน๋ณตํ•˜๊ธฐ ์œ„ํ•ด ์™ธ์žฅ ์ž๊ฐ€ ํ…Œ์ŠคํŠธ (BOST) ๊ฐœ๋…์ด ๋„์ž…๋˜์—ˆ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์€ LPDDR5์˜ BOST๋ฅผ ์œ„ํ•œ ์†ก์‹ ๊ธฐ ์„ค๊ณ„๋ฅผ ์ œ์‹œํ•œ๋‹ค. ์†ก์‹ ๊ธฐ๋Š” ATE์—์„œ ์ €์† ํด๋Ÿญ์„ ๋ฐ›์•„์„œ ๊ณ ์† DQS์™€ WCK๋ฅผ DRAM์— ์ „์†กํ•œ๋‹ค. ์ถœ๋ ฅ์— ํ•ญ์ƒ ํด๋Ÿญ ํ† ๊ธ€๋งŒ ์žˆ๋Š” ๊ฒƒ์€ ์•„๋‹ˆ๋ฏ€๋กœ ๋ฐ์ดํ„ฐ ํŒจํ„ด์ด ๋””์ง€ํ„ธ ๋ธ”๋ก์—์„œ ์ƒ์„ฑ๋œ๋‹ค. ๋˜ํ•œ ์œ„์ƒ ๋ณด๊ฐ„๊ธฐ๋กœ ์ถœ๋ ฅ์˜ ์œ„์ƒ์„ ์›ํ•˜๋Š” ๋Œ€๋กœ ์›€์ง์ธ๋‹ค. ์†ก์‹ ๊ธฐ์˜ ์•„๋‚ ๋กœ๊ทธ ๋ถ€๋ถ„์€ ์œ„์ƒ ๋ณด๊ฐ„๊ธฐ, ์‹œ๋ฆฌ์–ผ๋ผ์ด์ €, ๋“œ๋ผ์ด๋ฒ„๋กœ ๊ตฌ์„ฑ๋œ๋‹ค. ์œ„์ƒ ๋ณด๊ฐ„๊ธฐ์™€ ๋“œ๋ผ์ด๋ฒ„๋Š” ๊ณต๊ธ‰ ๋…ธ์ด์ฆˆ์— ๊ฒฌ๊ณ ํ•˜๋„๋ก ์ „๋ฅ˜ ๋ชจ๋“œ๋กœ ์„ค๊ณ„๋˜์—ˆ๋‹ค. ์‹œ๋ฆฌ์–ผ๋ผ์ด์ €์˜ ๋””๋ฐ”์ด๋”๊ฐ€ ์ƒˆ๋กญ๊ฒŒ ์ œ์•ˆ๋˜์–ด์„œ ๋ชจ๋“  ์ถœ๋ ฅ์˜ ํƒ€์ด๋ฐ์ด ๊ฐ™๋‹ค. ๋˜ํ•œ ATE์—์„œ ํ™œ์„ฑํ™” ์‹ ํ˜ธ๋ฅผ ๋ฐ›์•„์„œ DRAM์œผ๋กœ ์ถœ๋ ฅ์„ ์ „์†กํ•˜๋Š”๋ฐ ๊ฑธ๋ฆฌ๋Š” ์‹œ๊ฐ„๋„ ์ผ์ •ํ•˜๋‹ค. ๊ทธ ๊ฒฐ๊ณผ ์†ก์‹ ๊ธฐ๋Š” ๋ฐ์ดํ„ฐ ํŒจํ„ด์ด ์žˆ๋Š” DQS์™€ WCK๋ฅผ ์›ํ•˜๋Š” ํƒ€์ด๋ฐ์— DRAM์œผ๋กœ ์ „์†กํ•œ๋‹ค. ์ œ์•ˆ๋œ ์†ก์‹ ๊ธฐ๋Š” 40 nm CMOS ๊ณต์ •์œผ๋กœ ์ œ์ž‘๋˜์—ˆ๋‹ค. ์†ก์‹ ๊ธฐ์˜ ํ•˜๋‚˜์˜ ๋ ˆ์ธ์€ 31.4 mW๋ฅผ ์†Œ๋น„ํ•˜๊ณ  0.06mm2๋ฅผ ์ฐจ์ง€ํ•œ๋‹ค. ์ธก์ •๋œ DQS๋Š” 50 ฮฉ ํ„ฐ๋ฏธ๋„ค์ด์…˜์ผ ๋•Œ 10 Gb/s์—์„œ 230 mV์˜ ์Šค์œ™๊ณผ 770 fs์˜ RMS ์ง€ํ„ฐ๋ฅผ ๊ฐ€์ง„๋‹ค. ๊ทธ๋ฆฌ๊ณ  WCK๋Š” 40 ฮฉ ํ„ฐ๋ฏธ๋„ค์ด์…˜์ผ ๋•Œ 10 Gb/s์—์„œ 185 mV์˜ ์Šค์œ™๊ณผ 894 fs์˜ RMS ์ง€ํ„ฐ๋ฅผ ๊ฐ–๋Š”๋‹ค.ABSTRACT I CONTENTS II LIST OF FIGURES IV LIST OF TABLES VII CHAPTER 1 INTRODUCTION 1 1.1 MOTIVATION 1 1.2 THESIS ORGANIZATION 3 CHAPTER 2 BACKGROUND ON SERIAL LINK 4 2.1 OVERVIEW 4 2.2 BASIS OF MEMORY INTERFACE 7 2.3 BUILDING BLOCKS 9 2.3.1 PHASE INTERPOLATOR 9 2.3.2 SERIALIZER 14 2.3.3 DRIVER 18 CHAPTER 3 DESIGN OF TRANSMITTER FOR BOST 22 3.1 DESIGN CONSIDERATION 22 3.2 OVERALL ARCHITECTURE 24 3.3 CIRCUIT IMPLEMENTATION 26 3.3.1 CLOCK PATH 26 3.3.2 PHASE INTERPOLATOR 29 3.3.3 SERIALIZER 33 3.3.4 DRIVER 41 CHAPTER 4 MEASUREMENTS RESULTS 48 4.1 DIE PHOTOMICROGRAPH 48 4.2 MEASUREMENT SETUP 49 4.3 MEASUREMENT RESULTS 51 4.4 PERFORMANCE SUMMARY 57 CHAPTER 5 CONCLUSION 59 BIBLIOGRAPHY 60 ์ดˆ ๋ก 63์„

    Modeling, Optimization and Power Efficiency Comparison of High-speed Inter-chip Electrical and Optical Interconnect Architectures in Nanometer CMOS Technologies

    Get PDF
    Inter-chip input-output (I/O) communication bandwidth demand, which rapidly scaled with integrated circuit scaling, has leveraged equalization techniques to operate reliably on band-limited channels at additional power and area complexity. High-bandwidth inter-chip optical interconnect architectures have the potential to address this increasing I/O bandwidth. Considering future tera-scale systems, power dissipation of the high-speed I/O link becomes a significant concern. This work presents a design flow for the power optimization and comparison of high-speed electrical and optical links at a given data rate and channel type in 90 nm and 45 nm CMOS technologies. The electrical I/O design framework combines statistical link analysis techniques, which are used to determine the link margins at a given bit-error rate (BER), with circuit power estimates based on normalized transistor parameters extracted with a constant current density methodology to predict the power-optimum equalization architecture, circuit style, and transmit swing at a given data rate and process node for three different channels. The transmitter output swing is scaled to operate the link at optimal power efficiency. Under consideration for optical links are a near-term architecture consisting of discrete vertical-cavity surface-emitting lasers (VCSEL) with p-i-n photodetectors (PD) and three long-term integrated photonic architectures that use waveguide metal-semiconductor-metal (MSM) photodetectors and either electro-absorption modulator (EAM), ring resonator modulator (RRM), or Mach-Zehnder modulator (MZM) sources. The normalized transistor parameters are applied to jointly optimize the transmitter and receiver circuitry to minimize total optical link power dissipation for a specified data rate and process technology at a given BER. Analysis results shows that low loss channel characteristics and minimal circuit complexity, together with scaling of transmitter output swing, allows electrical links to achieve excellent power efficiency at high data rates. While the high-loss channel is primarily limited by severe frequency dependent losses to 12 Gb/s, the critical timing path of the first tap of the decision feedback equalizer (DFE) limits the operation of low-loss channels above 20 Gb/s. Among the optical links, the VCSEL-based link is limited by its bandwidth and maximum power levels to a data rate of 24 Gb/s whereas EAM and RRM are both attractive integrated photonic technologies capable of scaling data rates past 30 Gb/s achieving excellent power efficiency in the 45 nm node and are primarily limited by coupling and device insertion losses. While MZM offers robust operation due to its wide optical bandwidth, significant improvements in power efficiency must be achieved to become applicable for high density applications
    corecore