Phase interpolation technique based on high-speed SERDES chip CDR Meidong Lin, Zhiping Wen, Lei Chen, Xuewu Li

Similar documents
Research on Self-biased PLL Technique for High Speed SERDES Chips

A Variable-Frequency Parallel I/O Interface with Adaptive Power Supply Regulation

A 5-Gb/s 156-mW Transceiver with FFE/Analog Equalizer in 90-nm CMOS Technology Wang Xinghua a, Wang Zhengchen b, Gui Xiaoyan c,

ECEN620: Network Theory Broadband Circuit Design Fall 2014

DESIGN AND VERIFICATION OF ANALOG PHASE LOCKED LOOP CIRCUIT

ECEN620: Network Theory Broadband Circuit Design Fall 2012

ECEN720: High-Speed Links Circuits and Systems Spring 2017

Lecture 160 Examples of CDR Circuits in CMOS (09/04/03) Page 160-1

DESIGN OF MULTIPLYING DELAY LOCKED LOOP FOR DIFFERENT MULTIPLYING FACTORS

Delay-Locked Loop Using 4 Cell Delay Line with Extended Inverters

Integrated Circuit Design for High-Speed Frequency Synthesis

Fractional- N PLL with 90 Phase Shift Lock and Active Switched- Capacitor Loop Filter

Accomplishment and Timing Presentation: Clock Generation of CMOS in VLSI

This chapter discusses the design issues related to the CDR architectures. The

Dedication. To Mum and Dad

THE serial advanced technology attachment (SATA) is becoming

LETTER A 1.25-Gb/s Burst-Mode Half-Rate Clock and Data Recovery Circuit Using Realigned Oscillation

Self-Biased PLL/DLL. ECG minute Final Project Presentation. Wenlan Wu Electrical and Computer Engineering University of Nevada Las Vegas

A10-Gb/slow-power adaptive continuous-time linear equalizer using asynchronous under-sampling histogram

A Fully Integrated CMOS Phase-Locked Loop With 30MHz to 2GHz Locking Range and ±35 ps Jitter

Design of Low Noise 16-bit CMOS Digitally Controlled Oscillator

Optimization of Digitally Controlled Oscillator with Low Power

THE reference spur for a phase-locked loop (PLL) is generated

ISSN:

EE290C - Spring 2004 Advanced Topics in Circuit Design High-Speed Electrical Interfaces. Announcements

A PROCESS AND TEMPERATURE COMPENSATED RING OSCILLATOR

An Analog Phase-Locked Loop

A Low-Jitter Phase-Locked Loop Based on a Charge Pump Using a Current-Bypass Technique

Self Biased PLL/DLL. ECG 721 Memory Circuit Design (Spring 2017) Dane Gentry 4/17/17

ISSCC 2003 / SESSION 4 / CLOCK RECOVERY AND BACKPLANE TRANSCEIVERS / PAPER 4.3

/$ IEEE

Source Coding and Pre-emphasis for Double-Edged Pulse width Modulation Serial Communication

ECE1352. Term Paper Low Voltage Phase-Locked Loop Design Technique

A Random and Systematic Jitter Suppressed DLL-Based Clock Generator with Effective Negative Feedback Loop

THE SELF-BIAS PLL IN STANDARD CMOS

Comparison And Performance Analysis Of Phase Frequency Detector With Charge Pump And Voltage Controlled Oscillator For PLL In 180nm Technology

Analysis of phase Locked Loop using Ring Voltage Controlled Oscillator

A 14-bit 2.5 GS/s DAC based on Multi-Clock Synchronization. Hegang Hou*, Zongmin Wang, Ying Kong, Xinmang Peng, Haitao Guan, Jinhao Wang, Yan Ren

A Low-Jitter MHz DLL Based on a Simple PD and Common-Mode Voltage Level Corrected Differential Delay Elements

Design and Implementation of High-Speed CMOS Clock and Data Recovery Circuit for Optical Interconnection Applications. Seong-Jun Song. Dec.

THE DEMANDS of a high-bandwidth dynamic random access

5Gbps Serial Link Transmitter with Pre-emphasis

A 2.2GHZ-2.9V CHARGE PUMP PHASE LOCKED LOOP DESIGN AND ANALYSIS

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2012

Dual-Rate Fibre Channel Repeaters

Single-Ended to Differential Converter for Multiple-Stage Single-Ended Ring Oscillators

CHAPTER 6 PHASE LOCKED LOOP ARCHITECTURE FOR ADC

DESIGN AND ANALYSIS OF PHASE-LOCKED LOOP AND PERFORMANCE PARAMETERS

A CMOS Phase Locked Loop based PWM Generator using 90nm Technology Rajeev Pankaj Nelapati 1 B.K.Arun Teja 2 K.Sai Ravi Teja 3

A 5Gbit/s CMOS Clock and Data Recovery Circuit

on-chip Design for LAr Front-end Readout

High-Speed Interconnect Technology for Servers

A 1.25-Gb/s Digitally-Contolled Dual-Loop Clock and Data Recovery Circuit with Enhanced Phase Resolution

10 GHz Voltage Controlled Ring Oscillator for High Speed Application in 130nm CMOS Technology

A Fully Integrated 20 Gb/s Optoelectronic Transceiver Implemented in a Standard

Low Power Phase Locked Loop Design with Minimum Jitter

Design of Phase Locked Loop as a Frequency Synthesizer Muttappa 1 Akalpita L Kulkarni 2

Designing Nano Scale CMOS Adaptive PLL to Deal, Process Variability and Leakage Current for Better Circuit Performance

A New Phase-Locked Loop with High Speed Phase Frequency Detector and Enhanced Lock-in

A MASH ΔΣ time-todigital converter based on two-stage time quantization

ISSN:

A 3-10GHz Ultra-Wideband Pulser

Taheri: A 4-4.8GHz Adaptive Bandwidth, Adaptive Jitter Phase Locked Loop

Design and Characterization of a 10 Gb/s Clock and Data Recovery Circuit Implemented with Phase-Locked Loop

Electronic Circuits EE359A

Design of a 3.3-V 1-GHz CMOS Phase Locked Loop with a Two-Stage Self-Feedback Ring Oscillator

THE BASIC BUILDING BLOCKS OF 1.8 GHZ PLL

ECEN620: Network Theory Broadband Circuit Design Fall 2012

Energy Efficient and High Speed Charge-Pump Phase Locked Loop

LSI and Circuit Technologies for the SX-8 Supercomputer

DESIGN OF HIGH FREQUENCY CMOS FRACTIONAL-N FREQUENCY DIVIDER

Digital Controller Chip Set for Isolated DC Power Supplies

DOUBLE DATA RATE (DDR) technology is one solution

A 0.2-to-1.45GHz Subsampling Fractional-N All-Digital MDLL with Zero-Offset Aperture PD-Based Spur Cancellation and In-Situ Timing Mismatch Detection

Lecture 23: PLLs. Office hour on Monday moved to 1-2pm and 3:30-4pm Final exam next Wednesday, in class

Lecture 11: Clocking

A Wide-Range Delay-Locked Loop With a Fixed Latency of One Clock Cycle

Fast-lock all-digital DLL and digitally-controlled phase shifter for DDR controller applications

Digital Transceiver using H-Ternary Line Coding Technique

Analysis and Design of a 1GHz PLL for Fast Phase and Frequency Acquisition

Analysis and Design of High Speed Low Power Comparator in ADC

A Multiobjective Optimization based Fast and Robust Design Methodology for Low Power and Low Phase Noise Current Starved VCO Gaurav Sharma 1

A 5.4-Gb/s Clock and Data Recovery Circuit Using Seamless Loop Transition Scheme With Minimal Phase Noise Degradation

DESIGN OF MULTI-BIT DELTA-SIGMA A/D CONVERTERS

Marvell MV78200-A1 Discovery Innovation Series Embedded Processor

15.3 A 9.9G-10.8Gb/s Rate-Adaptive Clock and Data-Recovery with No External Reference Clock for WDM Optical Fiber Transmission.

Introduction to CMOS RF Integrated Circuits Design

A Fast Locking Digital Phase-Locked Loop using Frequency Difference Stage

A Radiation Tolerant 4.8 Gb/s Serializer for the Giga-Bit Transceiver

Sudatta Mohanty, Madhusmita Panda, Dr Ashis kumar Mal

A 10Gbps Analog Adaptive Equalizer and Pulse Shaping Circuit for Backplane Interface

6.976 High Speed Communication Circuits and Systems Lecture 21 MSK Modulation and Clock and Data Recovery Circuits

Design of an Efficient Phase Frequency Detector for a Digital Phase Locked Loop

Chapter 3 Novel Digital-to-Analog Converter with Gamma Correction for On-Panel Data Driver

Analysis and Design of High Speed Low Power Comparator in ADC

A wide-range all-digital duty-cycle corrector with output clock phase alignment in 65 nm CMOS technology

Synchronous Mirror Delays. ECG 721 Memory Circuit Design Kevin Buck

CLOCK AND DATA RECOVERY (CDR) circuits incorporating

Delay-based clock generator with edge transmission and reset

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2010

Transcription:

5th International Conference on Computer Sciences and Automation Engineering (ICCSAE 2015) Phase interpolation technique based on high-speed SERDES chip CDR Meidong Lin, Zhiping Wen, Lei Chen, Xuewu Li Beijing Microelectronics Tech. Institution (BMTI) 100076, China 415922977@qq.com Keywords: phase interpolation; speed; serdes chips; the broadband rate Introduction Abstract. This design combines the advantages of CDR CDR circuit two structures PID and PI-based clock data is based on the structure of semi-digital dual loop recovery system. Using TSMC-0.25μm CMOS process to achieve the PLL design, the operating frequency range of 1.6-2.7GHz, and successfully applied a SERDES chip. Small footprint annular VCO wide frequency adjustment range, and can easily produce the CDR SerDes required multi-phase clock. I. INTRODUCTION In the high-speed SERDES chip applications, in order to be able to transmit high-speed data, will clock information hidden in the serial data to be transmitted, and therefore, SERDES interface chip clock and data recovery circuit (Clock and Data Recovery: CDR) The main function is to receive an input signal and determines the phase of the high-speed serial signal to extract the clock information[1]. The actual system, when the high-speed signal transmission PCB board-level circuit or cable, since the path distance and parasitic capacitance and other factors, the signal arrives at the receiving end from the transmitting side, is already generating a random delay, coupled with the transmission in the process of introducing jitter and sending and receiving ends inherently synchronous clock frequency deviation will cause the received data in phase with the receiving end of the sampling clock error occurred[2]. A simple clock data recovery circuit diagram shown in Figure. Clock data recovery to complete two work, one is clock recovery, a data retiming, is to restore the data. Clock recovery mainly from the received NRZ (non-return to zero) code embedded clock information in the data extracted. CDR is generally an oscillator feedback loop, by adjusting the phase of the oscillation loop to track the input clock data embedded clock[3]. By analyzing the characteristics of the NRZ code may know, in the spectral density of the random binary data, it does not contain lines at a data rate that there is no direct information needed for clock extraction. To find the clock information, the general approach taken is edge-detection technology. In order to determine the final sampling clock phase, CDR must also have a phase error detecting circuit[4]. II. CIRCUIT DESIGN A. CDR implementations Profile To achieve synchronization, the receiver must restore circuit generates a synchronous sampling clock signal by the clock data. In order to obtain maximum timing margins, the receiver should be in the center of the eye pattern of data bits sampled. Clock edge with respect to the position of data bits determines the performance of the link. Typically there are three ways to realize the clock data recovery: The first and most common is to use a phase-locked loop, the structure via a feedback loop to detect and adjust the position of the sampling clock edge. The second method is based on the data oversampling phase picking technology. The third method is the phase interpolation techniques. 2016. The authors - Published by Atlantis Press 160

1) based on the phase-locked loop (PLL-based CDR) structures PLL-based CDR structure, the relationship between the oscillation frequency of the phase detector detects the input data and the VCO between generating a phase error associated with its pulse output, the phase drive charge pump, the phase error is converted into a control current through After the filter as the VCO control voltage Vctrl, controls the oscillation frequency of the VCO, until both reach the match that is locked. 2) oversampling (Oversampling CDR) architecture [36] [5] The technique of the data stream in each of a plurality of bit phase point sampling, the data stream to obtain said oversampling. Hopping information extracted from the data sampling information. Sample data based on the bit information data center transition, and the data can be the closest sample point as received data. Data based on the phase picking oversampling technology needs for data oversampling feature makes the technology more suitable for low data rate applications. 3) based on the phase interpolation (Phase Interpolator-based) structure of CDR CDR phase interpolation is based Rambus Inc. proposed a structure which uses as a reference loop PLL or DLL to generate multi-phase high-speed reference clock signal, the reference phase input to the CDR loops, CDR loop control circuit select from a and interpolation operation on the interpolated result as data recovery clock signal. The clock data recovery circuit structure many different structures for selection depends on the speed limit and the power supply voltage under different conditions, while also subject to the system of power and jitter requirements. Phase interpolation and oversampling clock data recovery structure to achieve is to use a digital implementation of clock and data alignment, belong based on digital CDR[6]. This type of CDR simple structure, low cost, but with limited ability to track the frequency changes, the jitter performance than PLL-based CDR. Phase interpolation and oversampling clock data recovery structure to achieve based on this design uses a CDR with a PI-based semi-digital dual-loop. B. The overall structure of the clock data recovery circuit This design is based CDR circuit digital dual loop half clock data recovery system structure, the overall structure shown in Figure 1. Figure1 CDR overall block diagram 161

Compared with such a circuit structure based on the structure of the traditional analog phase-locked loop clock and data recovery system, it has the following advantages[7]: (1) Most of the phase locked loop circuit is implemented by a digital circuit design, reducing the overall circuit sensitivity to the process, it has better reusability. (2) dual-loop structure so that the frequency locked loop and phase locked loop has a relatively high independence, the loop bandwidth can be selected easily satisfy loop stability and bandwidth requirements. (3) In the case of multi-channel serial communication between the passages can share a frequencylocked loop, can reduce overall power consumption and area of the system. System consists of a frequency locked loop and phase locked loop composition. Frequency locked loop is a digital-analog hybrid PLL, its reference clock input from the outside, the main role is to produce a set of multi-phase low-jitter clock. Phase-locked loop for receiving data mainly to complete phase tracking and data retiming, restore the data[8]. This structure makes the frequency and phase locked loop locked relatively independent loop PLL loop bandwidth can be designed to be wider, to more effectively suppress loop noise. To avoid the impact of the phase-locked loop PLL loop work, with the same four buffer the oscillator and a phase locked loop to isolate, to avoid interaction between the two loops[9]. Phase-locked loop, mainly by Bang-Bang phase detector, a digital low-pass filter, phase interpolation unit, and a digital control unit FSM (Finite State Machine) components. Relationship edge phase detector detects the input data with a multi-phase clock between the output phase advance (up) or delayed (dn) of the comparison result, digital filter phase detector output signal is filtered, and the output of the phase interpolator the control signal, the clock signal to change to a different weighting factor. FSM is a phase locked loop state machine, used to select the input phase interpolator clock signal, which produce phase zone interpolation. In order effectively filter out noise, can be phase-locked loop bandwidth is narrower than design, so by way of dual-loop, effectively solve the problem of single-loop bandwidth compromise, and in the use of multi-channel data communication can be shared between multiple channel frequency locked loop, effectively reducing the overall power consumption and circuit area. C. PI circuit design Before designed CPPLL constituted by the ring oscillator 4, the output signal is approximately sinusoidal oscillation, the phase selector selection, as the input clock signal PI. Because of its large rise time, and PI output node small time constant, the output voltage follows the input voltage can be a good signal to the PI has a higher linearity. For PI output has a good linearity, asked to enter phase two clock signals PI interval to be small. In this design, when four of the ring oscillator operating at 1.25GHz, the interval between the adjacent phase is 45, ie Δt = 100ps. The right to control the interpolation median 8bit, ie w = 1, 2,..., N (N = 16). In the ideal case, the interpolated output clock after the minimum step length interval of 100ps / 8 = 12.5ps. Figure 2 PI unit circuit 162

PI basic circuit structure shown in Figure 2, the clock input stage, the current weight control stage and load components. Uppermost four PMOS transistor structure remains symmetrical load, the bias voltage from the bias voltage generating circuit VCO. Differential input ω1_a, ω1_b and ω2_a, ω2_b, CPPLL generated from a set of adjacent differential clock input signal, w0 ~ w7 is 8bit phase interpolation weight control signal, the signal from the external digital control section, namely 8bit shift register shifter output signal for controlling the tail current flow in the circuit path, the internal structure of a simple differential pair. When the control bit w0 ~ w7 all set to 0, PI output clock phase to follow ω2_a, ω2_b change; when w0 ~ w7 full of 1:00, PI output clock follow ω1_a, ω1_b change; when the number w0 ~ w7 1 and 0 when the number of equal, that control the weight bit (0000,1111), or (0101,0101), when (0110,0110), etc., PI output clock phase between the aforementioned two phase clock signal, which is Since each weight control bit weight values are the same. In fact, because this design uses a bidirectional shift register structures to control, weight control output 0 and 1 will not occur at intervals appear, but the same number as long as weight control bits to 1 or 0, then the output After interpolation phase of the clock is the same. D. Design of the phase selection circuit The output signal input phase interpolator clock signal derived from the ring oscillator frequency locked loop, in order to avoid the influence of the phase locked loop of the VCO oscillation frequency, using the same buffer unit set the VCO and phase locked loop isolated. Phase interpolation input signal is a group of adjacent differential clock signals, and take into account the needs in the range of the interpolation of 360, the interpolation of the input clock, and to facilitate the generation of quadrature output clock signal, we need to use a phase selection circuit. Ring oscillator constituted by the delay unit 4, from the first stage to the fourth stage to complete the 180 phase shift of the phase, and therefore phase change between adjacent cells 45. Interpolating the input clock signal is phase interval of VCO output waveform of 45. If you need to operate in the 180 interpolation clock phase between 225 and can signal by 0 and 45 signal reversal, you can get a phase of 180, respectively, and 225 of the clock signal. Similarly, the other can get the desired phase signal. Phase selection circuit transistor level implementation shown in Figure 3, the uppermost portion of the circuit is still symmetrical PMOS transistor constituting the load 4, the oscillation clock of the VCO input ω1_a, ω1_b and ω2_a, ω2_b, VBN for the tail current source bias set voltage input. S1 ~ S4 of four control signal input, in any normal operating conditions, the four have one and only one is high, the other for low-level signals. For example: When S1 is high, S2 ~ S4 is low, the result can be equivalent to a pair of differential input clock signal from the VCO output after a differential amplifier; and when S2 is high, S1, S3, S4 is low, the inverted output equivalent to that of the clock signal by a differential amplifier later. By controlling the selection signals S1 ~ S4, can easily achieve the phase inversion of the input clock to meet the phase interpolator clock input signal covers the entire phase range of 0 ~ 360. Figure 3 phase selection (PS) circuit 163

III. LAYOUT VERIFICATION SERDES with full custom layout mode design, the overall layout meet TSMC 0.13um / 0.11um CMOS Logic Design Rule (C013G 1P4M + L-MARK) design rules. Map details see Table 1. Table 1 contains information on the overall layout Device Type Quantity PMOS 25151 NMOS 22215 Resistors 568 Diode 165 TSMC design rules in accordance with independent design in ESD design. IV. CONCLUSIONS From the design of wide-input range of the phase locked loop circuit, adopting self-biased technology, noise has a good inhibitory effect. Small footprint annular VCO wide frequency adjustment range, and can easily generate multi-phase clocks CDR serdes required, to meet the design requirements. High-speed SERDES chip die size 2433um * 2505um, specific layout shown in Figure 4. Figure 4 chip layout REFERENCES [1] PCI Express TM Jitter Modeling,Revision 1.0RD,http://www.pcisig.com,July 14th,2004 [2] Hyung-Wook Jang,Sung-Sop Lee,Jin-Ku Kang A Clock Recovery Circuit using Half-rate 4X oversampling PD [3] S. I. Ahmed,Tad A. Kwasniewski,Overview of oversampling clock and data recovery circuits [4] J. Lanmoureux and S. Wilton, FPGA clock network architecure: flexibility vs. area and power, Proceedings of the 2006 ACM/SIGDA 14th international symposium on Field-Programmable Logic and Applications, 2006, pp. 101-108. [5] Martin E. Heimann, Glitch-Free Clock Multiplexer, U.S. Patent 5,357,146, Oct. 18, 1994. [6] Markku Ruuskanen, Method And Circuit For Switching Between A Pair of Asynchronous Clock Signals, U.S. Patent 5,726,593, Mar. 10, 1998. [7] Alexander Bronfer, Svetlana Slotzkin, Self-Compensating Glitch Free Clock Switch, U.S. Patent 6,809,556, Oct. 26, 2004. 164

[8] Rafey Mahmud, Techniques to make clock switching glitch free EETIMES,[Online]. Jun. 26, 2003. Available: [9] B. Razavi,Monolithic Phase-Locked Loops and Clock Recovery Circuits: Theory and Design,New York,IEEE Press,1996 165