A10-Gb/slow-power adaptive continuous-time linear equalizer using asynchronous under-sampling histogram

Similar documents
An 8-Gb/s Inductorless Adaptive Passive Equalizer in µm CMOS Technology

A 10Gbps Analog Adaptive Equalizer and Pulse Shaping Circuit for Backplane Interface

A 10-Gb/s Multiphase Clock and Data Recovery Circuit with a Rotational Bang-Bang Phase Detector

A 5-8 Gb/s Low-Power Transmitter with 2-Tap Pre-Emphasis Based on Toggling Serialization

REDUCING power consumption and enhancing energy

To learn fundamentals of high speed I/O link equalization techniques.

A 5-Gb/s 156-mW Transceiver with FFE/Analog Equalizer in 90-nm CMOS Technology Wang Xinghua a, Wang Zhengchen b, Gui Xiaoyan c,

A 0.18µm CMOS Gb/s Digitally Controlled Adaptive Line Equalizer with Feed-Forward Swing Control for Backplane Serial Link

WITH the growth of data communication in internet, high

A Clock and Data Recovery Circuit With Programmable Multi-Level Phase Detector Characteristics and a Built-in Jitter Monitor

DESIGN AND IMPLEMENTATION OF A LOW VOLTAGE LOW POWER DOUBLE TAIL COMPARATOR

ISSCC 2003 / SESSION 10 / HIGH SPEED BUILDING BLOCKS / PAPER 10.8

A Variable-Frequency Parallel I/O Interface with Adaptive Power Supply Regulation

1-13GHz Wideband LNA utilizing a Transformer as a Compact Inter-stage Network in 65nm CMOS

Wide frequency range duty cycle correction circuit for DDR interface

Fractional- N PLL with 90 Phase Shift Lock and Active Switched- Capacitor Loop Filter

An All-digital Delay-locked Loop using a Lock-in Pre-search Algorithm for High-speed DRAMs

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 59, NO. 3, MARCH

International Journal of Modern Trends in Engineering and Research

WITH the rapid evolution of liquid crystal display (LCD)

A Random and Systematic Jitter Suppressed DLL-Based Clock Generator with Effective Negative Feedback Loop

A wide-range all-digital duty-cycle corrector with output clock phase alignment in 65 nm CMOS technology

/$ IEEE

Design of Low Power High Speed Fully Dynamic CMOS Latched Comparator

IN HIGH-SPEED wireline transceivers, a (DFE) is often

Ultra-high-speed Interconnect Technology for Processor Communication

Design of a 3.3-V 1-GHz CMOS Phase Locked Loop with a Two-Stage Self-Feedback Ring Oscillator

A High-Resolution Dual-Loop Digital DLL

Fast-lock all-digital DLL and digitally-controlled phase shifter for DDR controller applications

5Gbps Serial Link Transmitter with Pre-emphasis

A 3-10GHz Ultra-Wideband Pulser

A digital phase corrector with a duty cycle detector and transmitter for a Quad Data Rate I/O scheme

Single-Ended to Differential Converter for Multiple-Stage Single-Ended Ring Oscillators

Design technique of broadband CMOS LNA for DC 11 GHz SDR

PHASE-LOCKED loops (PLLs) are widely used in many

Design of a Low Voltage low Power Double tail comparator in 180nm cmos Technology

BER-optimal ADC for Serial Links

A 10-GHz CMOS LC VCO with Wide Tuning Range Using Capacitive Degeneration

A design of 16-bit adiabatic Microprocessor core

SINCE the performance of personal computers (PCs) has

Source Coding and Pre-emphasis for Double-Edged Pulse width Modulation Serial Communication

ISSCC 2004 / SESSION 26 / OPTICAL AND FAST I/O / 26.8

ISSCC 2003 / SESSION 20 / WIRELESS LOCAL AREA NETWORKING / PAPER 20.2

Conference Guide IEEE International Symposium on Circuits and Systems. Rio de Janeiro, May 15 18, 2011

Acounter-basedall-digital spread-spectrum clock generatorwithhighemi reductionin65nmcmos

A Low-Jitter Phase-Locked Loop Based on a Charge Pump Using a Current-Bypass Technique

A Low Power Single Ended Inductorless Wideband CMOS LNA with G m Enhancement and Noise Cancellation

10 Gb/s Radiation-Hard VCSEL Array Driver

DESIGN AND PERFORMANCE VERIFICATION OF CURRENT CONVEYOR BASED PIPELINE A/D CONVERTER USING 180 NM TECHNOLOGY

A 82.5% Power Efficiency at 1.2 mw Buck Converter with Sleep Control

RECENTLY, low-voltage and low-power circuit design

NEW WIRELESS applications are emerging where

ISSCC 2006 / SESSION 20 / WLAN/WPAN / 20.5

Design of low phase noise InGaP/GaAs HBT-based differential Colpitts VCOs for interference cancellation system

A Clock Generating System for USB 2.0 with a High-PSR Bandgap Reference Generator

A Low Power Single Phase Clock Distribution Multiband Network

A Frequency Synthesis of All Digital Phase Locked Loop

Due to the absence of internal nodes, inverter-based Gm-C filters [1,2] allow achieving bandwidths beyond what is possible

Accomplishment and Timing Presentation: Clock Generation of CMOS in VLSI

Phase interpolation technique based on high-speed SERDES chip CDR Meidong Lin, Zhiping Wen, Lei Chen, Xuewu Li

ISSCC 2004 / SESSION 26 / OPTICAL AND FAST I/O / 26.6

2008 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS

A low-if 2.4 GHz Integrated RF Receiver for Bluetooth Applications Lai Jiang a, Shaohua Liu b, Hang Yu c and Yan Li d

ALTHOUGH zero-if and low-if architectures have been

DISTRIBUTED amplification is a popular technique for

ECEN 720 High-Speed Links: Circuits and Systems. Lab3 Transmitter Circuits. Objective. Introduction. Transmitter Automatic Termination Adjustment

Publication [P3] By choosing to view this document, you agree to all provisions of the copyright laws protecting it.

A High Gain and Improved Linearity 5.7GHz CMOS LNA with Inductive Source Degeneration Topology

Design for MOSIS Educational Program (Research) Testing Report for Project Number 89742

A 5.4-Gb/s Clock and Data Recovery Circuit Using Seamless Loop Transition Scheme With Minimal Phase Noise Degradation

TIMING recovery (TR) is one of the most challenging receiver

Bootstrapped ring oscillator with feedforward inputs for ultra-low-voltage application

Design of 10-bit current steering DAC with binary and segmented architecture

A Reset-Free Anti-Harmonic Programmable MDLL- Based Frequency Multiplier

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2012

Gigabit Transmission in 60-GHz-Band Using Optical Frequency Up-Conversion by Semiconductor Optical Amplifier and Photodiode Configuration

Linearization Method Using Variable Capacitance in Inter-Stage Matching Networks for CMOS Power Amplifier

Simple odd number frequency divider with 50% duty cycle

A fully synthesizable injection-locked PLL with feedback current output DAC in 28 nm FDSOI

Highly linear common-gate mixer employing intrinsic second and third order distortion cancellation

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2012

LETTER A 1.25-Gb/s Burst-Mode Half-Rate Clock and Data Recovery Circuit Using Realigned Oscillation

Design of CMOS Based PLC Receiver

Delay-based clock generator with edge transmission and reset

The GBTIA, a 5 Gbit/s Radiation-Hard Optical Receiver for the SLHC Upgrades

High-Speed Circuits and Systems Laboratory B.M.Yu. High-Speed Circuits and Systems Lab.

Inductorless CMOS Receiver Front-End Circuits for 10-Gb/s Optical Communications

DESIGN AND VERIFICATION OF ANALOG PHASE LOCKED LOOP CIRCUIT

THE reference spur for a phase-locked loop (PLL) is generated

High-Speed Interconnect Technology for Servers

Design of Low Power Reduced Area Cyclic DAC

ISSN:

LSI and Circuit Technologies for the SX-8 Supercomputer

ECEN720: High-Speed Links Circuits and Systems Spring 2017

A 14-bit 2.5 GS/s DAC based on Multi-Clock Synchronization. Hegang Hou*, Zongmin Wang, Ying Kong, Xinmang Peng, Haitao Guan, Jinhao Wang, Yan Ren

IN RECENT years, low-dropout linear regulators (LDOs) are

Low-Power RF Integrated Circuit Design Techniques for Short-Range Wireless Connectivity

SV2C 28 Gbps, 8 Lane SerDes Tester

3Gb/s CMOS Adaptive Equalizer for Backplane Serial Links

A Comparative Study of Dynamic Latch Comparator

Transcription:

LETTER IEICE Electronics Express, Vol.10, No.4, 1 8 A10-Gb/slow-power adaptive continuous-time linear equalizer using asynchronous under-sampling histogram Wang-Soo Kim and Woo-Young Choi a) Department of Electrical and Electronic Engineering, Yonsei University, 134 Shinchon-dong, Seodaemoon-gu, Seoul, 120 749, Korea a) wchoi@yonsei.ac.kr Abstract: This paper presents a 10-Gb/s low-power adaptive continuous-time linear equalizer, which automatically determines the optimal equalization condition by searching for the equalization coefficient producing the largest peak value in histograms obtained with asynchronous under-sampling. To reduce the power consumption, the integrated digital controller turns off the circuit blocks used for the adaptation process once adaptation is complete. A prototype equalizer realized in 65-nm CMOS technology consumes 4.66 mw during adaptation and 2.49 mw after adaptation. For 10-Gb/s 2 31 1 PRBS data transmitted over 40-cm FR4 PCB trace, our equalizer achieves less than 10 13 BER and 26.6 ps peak-to-peak jitter. Keywords: low-power adaptive equalizer, continuous-time linear equalizer (CTLE), asynchronous under-sampling histogram Classification: Integrated circuits References [1] D. H. Shin, J. E. Jang, F. O Mahony, and C. P. Yue, A 1-mW 12-Gb/s continuous-time adaptive passive equalizer in 90-nm CMOS, IEEE Custom Integrated Circuits Conf Dig Tech Papers, pp. 117 120, Sept. 2009. [2] B. Kim, et al., A 10-Gb/s Compact Low-Power Serial I/O With DFE-IIR Equalization in 65-nm CMOS, IEEE J Solid State Circuits, vol. 44, pp. 3526 3538, Dec. 2009. [3] Y. Tomita, M. Kibune, J. Ogawa, W. W. Walker, H. Tamura, and T. Kuroda, A 10-Gb/s receiver with series equalizer and on-chip ISI monitor in 0.11- m CMOS, IEEE J Solid State Circuits, vol. 40, no. 4, pp. 986 993, April 2005. [4] Y. M. Lee, S. Sheikhaei, and S. Mirabbasi, A 10Gb/s active-inductor structure with peaking control in 90 nm CMOS, Proc ASSCC 2008, pp. 229 232, Nov. 2008. [5] J.-H. Lu, K.-H. Chen, and S.-I. Liu, A 10-Gb/s inductorless CMOS analog equalizer with an interleaved active feedback topology, IEEE Trans Circuits Syst II Exp Briefs, vol. 56, no. 2, pp. 97 101, Feb. 2009. [6] D. Lee, J. Han, G. Han, and S. M. Park, 10 Gbit/s 0.0065 mm 2 6mW analogue adaptive equaliser utilising negative capacitance, Electron Lett, 1

vol. 45, no. 17, pp. 863 865, Aug. 2009. [7] W.-S. Kim, C.-K. Seong, and W.-Y. Choi, A 5.4-Gb/s adaptive continuoustime linear equalizer using asynchronous undersampling histograms, IEEE Trans Circuits Syst II Exp Briefs, vol. 59, no. 9, pp. 553 557, Sept. 2012. 1. Introduction As the required data rate for wire-line communication increases, channel bandwidth limitation becomes a critical problem as inter-symbol interference (ISI) due to high-frequency channel loss greatly degrades the transmission performance. In order to compensate this, an equalizer is essential in high-speed serial links. In addition, adaptive equalization is required so that the equalizer can properly work even with channel condition variations and process-voltage-temperature (PVT) variations. In high-speed serial link applications, the demand for power consumption reduction is becoming stronger. Various low-power adaptive equalizers have been reported. A tunable passive filter with power comparison adaptation can achieve low-power consumption but further improvement is required in input impedance matching and output swing levels [1]. Decision feedback equalizers (DFEs) have been used in low-power transceivers [2]. For DFEs, however, clock recovery and distribution circuits are required, making it difficult to use these equalizers if received data eyes are initially closed. Adaptive Continuous-Time Linear Equalizers (CTLEs) are widely used due to their simple architecture and good linearity [3, 4, 5, 6]. In addition, they do not require synchronous clock signals for their operation. We have previously demonstrated a 5.4-Gb/s adaptive CTLE based on asynchronous under-sampling histograms [7]. Our adaptation algorithm is based on the simple observation that the clearest eye diagram produces the largest peak value in the histogram of received data amplitudes. Our CLTE scans all possible equalization coefficients and selects the one that produces a histogram having the largest peak value as the optimal equalization coefficient. In this paper, we report a 10-Gb/s low-power adaptive CTLE based on the same adaptation technique. We achieve low-power operation by turning off the circuit blocks used for adaptation after adaptation is accomplished. This paper is organized as follows. After introduction in Section 1, Section 2 describes details of circuit implementation and Section 3 gives measurement results. Conclusion is given in Section 4. 2. Circuit implementation Our design goal is 10-Gb/s data transmission over 40-cm FR4 PCB trace with low power consumption. The minimum power consumption of analog circuits is basically determined by the supply voltage, the signal peak-topeak amplitude and the required bandwidth. We set 250-mV peak-to-peak swing from 1-V supply as our design specification. Current-mode logic circuits are designed with small-size input pairs using low threshold voltage transistors for improvement in both power dissipation and circuit speed. Passive inductors are not used in order to reduce the silicon area. 2

The target channel has 10.48 db loss at 5 GHz. Our adaptive CTLE is designed to have up to 15 db equalizing gain at 5 GHz. Fig. 1 shows the overall configuration of the proposed low-power adaptive CTLE. It has a 3- bit variable CTLE filter, two unit-gain buffers, output buffers, a 4-phase clock divider, two digital-to-analog converters (DACs), four track-and-hold circuits, a clocked sense amplifier, and an integrated digital controller. The digital controller turns off the circuit blocks such as DACs, clocked sense amplifier, clock divider, and unit gain buffer with high threshold voltage transistor switches after the adaptation process is complete. Also, the dynamic power of a digital controller is reduced due to operating clock off. Fig. 1. Proposed low-power adaptive CTLE. Table I compares performances of previously reported low-power CTLE filters. An active inductor technique with low voltage headroom provides area-efficient alternative for passive inductive terminations and offers channel loss compensation through tunable peaking, while consuming a small amount of overhead power. However, it cannot fulfill our design target due to limited boosting gains [4]. An interleaved active feedback topology incorporated with additional capacitive and resistive source Table I. Performance comparison of the CTLE filters. 3

degeneration achieves enough boosting gain and bandwidth extension. But several cascading gain cells with interleaved active feedback cells cannot satisfy power constraint [5]. A filter using capacitive source degeneration and negative capacitance circuits as shown in Fig. 2 provides enough highfrequency boosting without sacrificing DC gain and lowest power consumption [6]. The 2-stage filters are used for our design specification. With these filters, adaptation is achieved by tuning the zero of negative capacitor. Our CTLE is designed to have 4.5-dB to 15-dB equalizing gain with 3-bit capacitor array, which provides 8-different gain levels with approximately 1.5-dB increment controlled by 3-bit equalization coefficients. It guarantees uniform output swing despite changing equalization coefficient so that histogram is properly calculated with the fixed DAC resolution. On-chip 50- resistors are integrated for input impedance matching. Fig. 2. The schematic of CTLE filter. The track-and-hold circuit is made up of a pass-gate logic consisting of NMOS and PMOS switches. The unit gain buffer prevents CTLE output from clock feed-through of track-and-hold circuits. A 5-bit DAC generates 32-level reference voltages with 15.625 mv resolution. The reference level is controlled from 500 mv to 1 V by PMOS load switches with 5-bit DAC coefficients. The 4-input differentially clocked sense amplifier compares equalized data with DAC references. The clock divider provides 107-MHz quadrature clocks from external 214-MHz clock which are asynchronous to the target data rate. These quadrature sampling clocks are sequentially provided to track-and-hold circuits, the comparator, and the digital controller for stable operation. The adaptive process in digital controller is shown in Fig. 3. Initially, a finite state machine (FSM) applies an equalization coefficient to the equalizing filter and the DAC coefficient to DAC. After that, a reference voltage, V DAC, is applied to the comparator. When the signal level, V EQ, is higher than V DAC at the sampling point, a clocked sense amplifier generates a high pulse, V COMP. A counter in FSM counts up by one and the register stores the accumulated value during 4096 samples. With a full scan of reference voltages, the resulting cumulative distribution function (CDF) for the given equalization coefficient is obtained. By differentiating the CDF, the probability density function (PDF) is obtained. The PDF values are stored in 32 registers. The maximum value selector detects the peak value 4

Fig. 3. The adaptive process in digital controller. of the PDF. When the current peak value is higher than the previous peak value, the maximum value register stores the current equalization coefficient and previous value register is updated. This process is repeated for each of 8-equalization coefficients. The equalization coefficients detector determines the digital coefficient having the largest peak value and chooses this as the optimal equalization coefficient. After adaptation, the FSM turns off all the circuit blocks needed for adaptation so that power consumption can be minimized. They can be turned on again when demanded by a higher level control (not implemented) if there is any degradation in received data quality. With 4096 samples used for each histogram, the total time required for adaptation is estimated 9.8 ms. 3. Measurement results Fig. 4 shows a die photograph of our adaptive CTLE chip fabricated in 65- nm CMOS technology. The core occupies 0.0386-mm 2 die area excluding output buffers. Fig. 4. Chip photograph of the designed adaptive CTLE. Fig. 5 shows the measurement setup. A pulse pattern generator (PPG) provides differential 10-Gb/s 2 31 1 pseudo random bit sequence (PRBS) data to the chip on a probe station through 40-cm FR4 PCB trace. A frequency synthesizer provides 214-MHz asynchronous clock. The equalized data is observed by an oscilloscope and a BER tester. The integrated digital controller can be operated in internal- or external-control mode. In the 5

Fig. 5. Measurement Setup. external-control mode, histogram data for any equalization coefficient can be delivered to a field-programmable gate array (FPGA) for testing purpose. In the internal control mode, the digital controller automatically determines the optimal equalization coefficient. As the first verification, we measured the eye diagram and the histogram of equalized output for each of 8-equalization coefficients when data are transmitted through 40-cm FR4 PCB trace with the controller in the external-control mode. Fig. 6 shows measured eye diagrams and histograms for 8 different equalization coefficients. As can be seen, equalization coefficient 100 can be easily identified as the one producing the clearest eye diagram and the histogram having the largest peak value. Although histograms usually have two peaks, one of the two is larger due to uncertainty in the sampling. The controller simply searches for the peak having a larger value. With the controller in the internal-control mode, the Fig. 6. Measured eye-diagram and histogram for each equalization coefficient. 6

controller correctly selects equalization coefficient 100 as the optimal coefficient. The power consumption is 4.66 mw without output buffers from 1-V supply in the external-control mode when all the adaptation circuits are on. Fig. 7 shows measured eye diagrams before and after equalization for 40- cm FR4 PCB trace with 10-Gb/s 2 31 1 PRBS data. Eyes are clearly open after equalization. For this eye measurement, the optimal equalization coefficient was adaptively determined without any external control. Our adaptive equalizer achieves 10-Gb/s data transmission with less than 10 13 BER. The measured maximum peak-to-peak jitters are 26.6ps. The power consumption is 2.49 mw after adaptation is completed. Fig. 7. Eye-diagrams of (a) before and (b) after equalization. Table II compares the performance of our CTLE reported in this work with previously reported low-power adaptive equalizers operating at around 10 Gb/s. Although the passive adaptive equalizer in [1] has the lowest power consumption, it should be noted that the equalizer in [1] does not include a limiting amplifier, which is required for boosting up the output to the desired level and can consume a substantial amount of power at 10 Gb/s. Although direct comparison of power consumption is difficult for CTLEs realized in different process technologies and having different amounts of peaking gain, our CTLE achieves the lowest power consumption. Furthermore, it has the advantage in that adaptation algorithm is 7

Table II. Performance comparison of the proposed equalizer. performed with the integrated digital block, which consumes about 1 mw little power, is robust against any PVT variations, and can be turned off when not needed. We believe our adaptive CTLE can be a promising solution for demanding high-speed wire-line applications, especially for systems whose channel conditions does not change frequently. 4. Conclusion We presented a 10-Gb/s low-power adaptive CTLE which automatically selects the optimal equalization coefficient among several pre-determined values by searching for the equalization coefficient that produces the largest peak value in histograms obtained with asynchronous undersampling. The blocks used for adaptation are turned off for power reduction after the optimal equalization condition is achieved. A prototype chip realized in 65-nm CMOS technology successfully achieves adaptive equalization for 10-Gb/s 2 31 1 PRBS data through 40-cm FR4 PCB trace. It consumes 4.66 mw during adaptation and 2.49 mw after adaptation from 1-V supply and occupies 0.0386 mm 2 of die area. Acknowledgments This work was supported by Samsung Electronics. We also acknowledge the support of IC Design Education Center (IDEC) for EDA software support, and Samsung Electronics for chip fabrication. 8