ECEN720: High-Speed Links Circuits and Systems Spring 2017

Similar documents
ECEN620: Network Theory Broadband Circuit Design Fall 2014

ECEN620: Network Theory Broadband Circuit Design Fall 2012

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2010

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2010

6.976 High Speed Communication Circuits and Systems Lecture 21 MSK Modulation and Clock and Data Recovery Circuits

High-speed Serial Interface

ECEN620: Network Theory Broadband Circuit Design Fall 2014

INF4420 Phase locked loops

CLOCK AND DATA RECOVERY (CDR) circuits incorporating

ECEN620: Network Theory Broadband Circuit Design Fall 2014

A 0.18µm SiGe BiCMOS Receiver and Transmitter Chipset for SONET OC-768 Transmission Systems

ISSCC 2003 / SESSION 4 / CLOCK RECOVERY AND BACKPLANE TRANSCEIVERS / PAPER 4.3

ECEN620: Network Theory Broadband Circuit Design Fall 2012

Phase interpolation technique based on high-speed SERDES chip CDR Meidong Lin, Zhiping Wen, Lei Chen, Xuewu Li

Lecture 15: Clock Recovery

A 5.4-Gb/s Clock and Data Recovery Circuit Using Seamless Loop Transition Scheme With Minimal Phase Noise Degradation

SV2C 28 Gbps, 8 Lane SerDes Tester

Jitter in Digital Communication Systems, Part 1

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2012

Short Course On Phase-Locked Loops and Their Applications Day 5, AM Lecture. Advanced PLL Examples (Part I)

Lecture 15: Clock Recovery

ECEN 620: Network Theory Broadband Circuit Design Fall 2012

EE290C - Spring 2004 Advanced Topics in Circuit Design High-Speed Electrical Interfaces. Announcements

Choosing Loop Bandwidth for PLLs

A fully digital clock and data recovery with fast frequency offset acquisition technique for MIPI LLI applications

Lecture 23: PLLs. Office hour on Monday moved to 1-2pm and 3:30-4pm Final exam next Wednesday, in class

ISSCC 2006 / SESSION 13 / OPTICAL COMMUNICATION / 13.2

To learn fundamentals of high speed I/O link equalization techniques.

Dual-Rate Fibre Channel Repeaters

This chapter discusses the design issues related to the CDR architectures. The

Lecture 160 Examples of CDR Circuits in CMOS (09/04/03) Page 160-1

Delay-Locked Loop Using 4 Cell Delay Line with Extended Inverters

A CMOS Multi-Gb/s 4-PAM Serial Link Transceiver*

A 2.2GHZ-2.9V CHARGE PUMP PHASE LOCKED LOOP DESIGN AND ANALYSIS

ISSN:

A 10-Gb/s Multiphase Clock and Data Recovery Circuit with a Rotational Bang-Bang Phase Detector

ECEN720: High-Speed Links Circuits and Systems Spring 2017

Introduction to CMOS RF Integrated Circuits Design

THE DEMANDS of a high-bandwidth dynamic random access

A 0.3-m CMOS 8-Gb/s 4-PAM Serial Link Transceiver

Circuit Design for a 2.2 GByte/s Memory Interface

Real Time Jitter Analysis

A Clock and Data Recovery Circuit With Programmable Multi-Level Phase Detector Characteristics and a Built-in Jitter Monitor

Dedication. To Mum and Dad

ECEN720: High-Speed Links Circuits and Systems Spring 2017

High-Speed Serial IO Testing: Jitter Extraction & Bit-Error Rate Estimation. Serial Signaling Speed Trend

Clock and Data Recovery Circuit

/$ IEEE

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2010

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2012

Sudatta Mohanty, Madhusmita Panda, Dr Ashis kumar Mal

Self-Biased PLL/DLL. ECG minute Final Project Presentation. Wenlan Wu Electrical and Computer Engineering University of Nevada Las Vegas

Phase Locked Loop Design for Fast Phase and Frequency Acquisition

Taheri: A 4-4.8GHz Adaptive Bandwidth, Adaptive Jitter Phase Locked Loop

6.976 High Speed Communication Circuits and Systems Lecture 17 Advanced Frequency Synthesizers

A Complete 64Gb/s/lane Active Electrical Repeater. Yue Lu, Jaeduk Han, Nicholas Sutardja Prof. Elad Alon January 23, 2014

Cost Effective Spread Spectrum Clock Generator Design Chulwoo Kim, Minyoung Song, Sewook Hwang

Noise Analysis of Phase Locked Loops

ECEN 720 High-Speed Links: Circuits and Systems

A Variable-Frequency Parallel I/O Interface with Adaptive Power Supply Regulation

ECEN 720 High-Speed Links Circuits and Systems

Tuesday, March 29th, 9:15 11:30

Synchronization. EE442 Lecture 17. All digital receivers must be synchronized to the incoming signal s(t).

Low-overhead solutions for clock generation and synchronization.

DESIGN OF MULTIPLYING DELAY LOCKED LOOP FOR DIFFERENT MULTIPLYING FACTORS

15.3 A 9.9G-10.8Gb/s Rate-Adaptive Clock and Data-Recovery with No External Reference Clock for WDM Optical Fiber Transmission.

A 5-Gb/s 156-mW Transceiver with FFE/Analog Equalizer in 90-nm CMOS Technology Wang Xinghua a, Wang Zhengchen b, Gui Xiaoyan c,

2284 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 43, NO. 10, OCTOBER /$ IEEE

TIMING recovery (TR) is one of the most challenging receiver

A HYBRID PHASE-LOCKED LOOP FOR CLOCK AND DATA RECOVERY APPLICATIONS. Mohammad Sadegh Jalali. B.Sc., The University of Tehran, 2008.

Lecture 3. FIR Design and Decision Feedback Equalization

A Random and Systematic Jitter Suppressed DLL-Based Clock Generator with Effective Negative Feedback Loop

Lecture 3. FIR Design and Decision Feedback Equalization

Considerations for CRU BW and Amount of Untracked Jitter

High-Speed Interconnect Technology for Servers

Overview of System Interfaces

Top-Down Design of a Low-Power Multi-Channel 2.5-Gbit/s/Channel Gated Oscillator Clock-Recovery Circuit

Integrated Circuit Design for High-Speed Frequency Synthesis

Other Effects in PLLs. Behzad Razavi Electrical Engineering Department University of California, Los Angeles

Jitter Fundamentals: Jitter Tolerance Testing with Agilent ParBERT. Application Note. Introduction

FFT Analysis, Simulation of Computational Model and Netlist Model of Digital Phase Locked Loop

OVERVIEW OF OVERSAMPLING CLOCK AND DATA RECOVERY CIRCUITS

Accomplishment and Timing Presentation: Clock Generation of CMOS in VLSI

f o Fig ECE 6440 Frequency Synthesizers P.E. Allen Frequency Magnitude Spectral impurity Frequency Fig010-03

VLSI Broadband Communication Circuits

ECEN620: Network Theory Broadband Circuit Design Fall 2014

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2012

DESIGN AND VERIFICATION OF ANALOG PHASE LOCKED LOOP CIRCUIT

Based on IEEE 802.3ae Draft 3.1 Howard Baumer, Jurgen van Engelen Broadcom Corp.

Multiple Reference Clock Generator

SHF Communication Technologies AG. Wilhelm-von-Siemens-Str. 23D Berlin Germany. Phone Fax

HIGH-SPEED LOW-POWER ON-CHIP GLOBAL SIGNALING DESIGN OVERVIEW. Xi Chen, John Wilson, John Poulton, Rizwan Bashirullah, Tom Gray

A High-Resolution Dual-Loop Digital DLL

Statistical Link Modeling

European Conference on Nanoelectronics and Embedded Systems for Electric Mobility

Jitter analysis with the R&S RTO oscilloscope

SERIALIZED data transmission systems are usually

Symbol Timing Recovery for Low-SNR Partial Response Recording Channels

Verilog-A Modeling of DFFsin CDRs

A 1.25 GHz Low Power Multi-phase PLL Using Phase Interpolation between Two Complementary Clocks

Transcription:

ECEN720: High-Speed Links Circuits and Systems Spring 2017 Lecture 12: CDRs Sam Palermo Analog & Mixed-Signal Center Texas A&M University

Announcements Project Preliminary Report #2 due Apr. 20 Expand upon Report 1 with more simulation results Email it to me by 5PM Exam 2 is on Thursday April 27 Comprehensive, but will focus on lectures 7-14 85 minutes 1 standard 8.5x11 note sheet (front & back) Bring your calculator 2

Agenda CDR overview CDR phase detectors Dual-loop CDRs CDR circuits Phase interpolators Delay-locked loops CDR jitter properties 3

Embedded Clock I/O Circuits TX PLL TX Clock Distribution CDR Per-channel PLL-based Dual-loop w/ Global PLL & Local DLL/PI Local Phase-Rotator PLLs Global PLL requires RX clock distribution to individual channels 4

Clock and Data Recovery [Razavi] A clock and data recovery system (CDR) produces the clocks to sample incoming data The clock(s) must have an effective frequency equal to the incoming data rate 10GHz for 10Gb/s data rate OR, multiple clocks spaced at 100ps Additional clocks may be used for phase detection Sampling clocks should have the proper phase relationship with the incoming data for sufficient timing margin to achieve the desired biterror-rate (BER) CDR should exhibit small effective jitter 5

Embedded Clocking (CDR) PLL-based CDR V CTRL Frequency Synthesis PLL Dual-Loop CDR V ctrl 5-stage coupled VCO CP PLL[0] PFD 4 800MHZ Ref Clk D in RX[n:0] early/ late proportional gain Loop Filter integral gain 5:1 5 Mux/ MUX Interpolator Pairs (16Gb/s) PLL[4:0] (3.2GHz) RX PD early/ late PLL[4:0] 5:1 MUX FSM sel 10 15 Phase-Recovery Loop Clock frequency and optimum phase position are extracted from incoming data Phase detection continuously running Jitter tracking limited by CDR bandwidth With technology scaling we can make CDRs with higher bandwidths and the jitter tracking advantages of source synchronous systems is diminished Possible CDR implementations Stand-alone PLL Dual-loop architecture with a PLL or DLL and phase interpolators (PI) Phase-rotator PLL 6

CDR Phase Detectors [Perrott] A primary difference between CDRs and PLLs is that the incoming data signal is not periodic like the incoming reference clock of a PLL A CDR phase detector must operate properly with missing transition edges in the input data sequence 7

CDR Phase Detectors CDR phase detectors compare the phase between the input data and the recovered clock sampling this data and provides information to adjust the sampling clocks phase Phase detectors can be linear or non-linear Linear phase detectors provide both sign and magnitude information regarding the sampling phase error Hogge Non-linear phase detectors provide only sign information regarding the sampling phase error Alexander or 2x-Oversampled or Bang-Bang Oversampling (>2) Baud-Rate 8

Hogge Phase Detector Late Tb/2 ref [Razavi] Late Tb/2 ref Linear phase detector With a data transition and assuming a full-rate clock The late signal produces a signal whose pulse width is proportional to the phase difference between the incoming data and the sampling clock A Tb/2 reference signal is produced with a Tb/2 delay If the clock is sampling early, the late signal will be shorter than Tb/2 and vice-versa 9

Hogge Phase Detector Late Tb/2 ref (Late Tb/2 ref) [Razavi] Late Tb/2 ref [Lee] -1 Average Output Amplitude 1 Average Output Amplitude K PD 1 TD For phase transfer 0rad is w.r.t optimal Tb/2 () spacing between sampling clock and data e = in clk TD is the transition density no transitions, no information A value of 0.5 can be assumed for random data 10

PLL-Based CDR with a Hogge PD [Razavi] XOR outputs can directly drive the charge pump Need a relatively high-speed charge pump 11

Alexander (2x-Oversampled) Phase Detector Most commonly used CDR phase detector Non-linear (Binary) Bang-Bang PD Only provides sign information of phase error (not magnitude) Phase detector uses 2 data samples and one edge sample Data transition necessary D n D n1 If edge sample is same as second bit (or different from first), then the clock is sampling late En D n If edge sample is same as first bit (or different from second), then the clock is sampling early E n D n1 E n E n [Sheikholeslami] 12

Alexander Phase Detector Characteristic (No Noise) (Late Early) [Lee] Phase detector only outputs phase error sign information in the form of a late OR early pulse whose width doesn t vary Phase detector gain is ideally infinite at zero phase error Finite gain will be present with noise, clock jitter, sampler metastability, ISI 13

Alexander Phase Detector Characteristic (With Noise) Total transfer characteristic is the convolution of the ideal PD transfer characteristic and the noise PDF Noise linearizes the phase detector over a phase region corresponding to the peak-to-peak jitter K PD 2 J PP TD TD is the transition density no transitions, no information A value of 0.5 can be assumed for random data Output Pulse Width -1 Average Output Amplitude 1 Average Output Amplitude Output Pulse Width [Lee] 14

Mueller-Muller Baud-Rate Phase Detector Baud-rate phase detector only requires one sample clock per symbol (bit) 1 [Musa] Mueller-Muller phase detector commonly used -1-1 Attempting to equalize the amplitude of samples taken before and after a pulse 15

Mueller-Muller Baud-Rate Phase Detector [Spagna ISSCC 2010] 16

Analog PLL-based CDR Linearized K PD [Lee] 17

Analog PLL-based CDR [Lee] CDR bandwidth will vary with input phase variation amplitude with a non-linear phase detector Final performance verification should be done with a time-domain non-linear model 18

Single-Loop CDR Issues PLL-based CDR V CTRL RX[n:0] proportional gain D in early/ late Loop Filter integral gain Phase detectors have limited frequency acquisition range Results in long lock times or not locking at all Can potentially lock to harmonics of correct clock frequency VCO frequency range variation with process, voltage, and temperature can exceed PLL lock range if only a phase detector is employed 19

Phase Interpolator (PI) Based CDR Frequency synthesis loop can be a global PLL Can be difficult to distribute multiple phases long distance Need to preserve phase spacing Clock distribution power increases with phase number If CDR needs more than 4 phases consider local phase generation 20

DLL Local Phase Generation Only differential clock is distributed from global PLL Delay-Locked Loop (DLL) locally generates the multiple clock phases for the phase interpolators DLL can be per-channel or shared by a small number (4) Same architecture can be used in a forwarded-clock system Replace frequency synthesis PLL with forwarded-clock signals 21

Phase Rotator PLL Phase interpolators can be expensive in terms of power and area Phase rotator PLL places one interpolator in PLL feedback to adjust all VCO output phases simultaneously Now frequency synthesis and phase recovery loops are coupled Need PLL bandwidth greater than phase loop Useful in filtering VCO noise 22

Phase Interpolators Phase interpolators realize digital-to-phase conversion (DPC) Produce an output clock that is a weighted sum of two input clock phases Common circuit structures Tail current summation interpolation Voltage-mode interpolation Interpolator code mapping techniques Sinusoidal Linear [Weinlader] [Bulzacchelli] 23

Sinusoidal Phase Interpolation X I Asin( t) X Q Asin( t / 2) Acos t Y Acos cos Asin t sint Asin cost X I sin X Q a1 X I a2 X Q 0 2 Arbitrary phase shift can be generated with linear summation of I/Q clock signal Y where 1 Asin t a cos a 2 1 a 2 2 a 1 and 1 X 1 a a 2 2 X sin Q 24

Sinusoidal vs Linear Phase Interpolation [Kreienkamp] It can be difficult to generate a circuit that implements sinusoidal weighting a 2 2 1 a2 In practice, a linear weighting is often used a 1 1 a2 1 25

Phase Interpolator Model [Weinlader] w/ ideal step inputs (worst case) small output Interpolation linearity is a function of the phase spacing, t, to output time constant, RC, ratio Important that interpolator output time constant is not too small (fast) for phase mixing quality large output 26

Phase Interpolator Model w/ ideal step inputs w/ finite input transition time Spice simulation w/ ideal step inputs: w/ finite input transition time: For more details see D. Weinlader s Stanford PhD thesis 27

Tail-Current Summation PI [Bulzacchelli JSSC 2006] Control of I/Q polarity allows for full 360 phase rotation with phase step determined by resolution of weighting DAC For linearity over a wide frequency range, important to control either input or output time constant (slew rate) 28

Voltage-Mode Summation PI [Joshi VLSI Symp 2009] For linearity over a wide frequency range, important to control either input or output time constant (slew rate) 29

Delay-Locked Loop (DLL) [Sidiropoulos JSSC 1997] DLLs lock delay of a voltage-controlled delay line (VCDL) Typically lock the delay to 1 or ½ input clock cycles If locking to ½ clock cycle the DLL is sensitive to clock duty cycle DLL does not self-generate the output clock, only delays the input clock 30

Voltage-Controlled Delay Line K DL [Sidiropoulos] 31

DLL Delay Transfer Function [Maneatis] First-order loop as delay line doesn t introduce a (low-frequency) pole The delay between reference and feedback signal is low-pass filtered Unconditionally stable as long as continuous-time approximation holds, i.e. n < ref /10 32

CDR Jitter Properties Jitter Transfer Jitter Generation Jitter Tolerance 33

CDR Jitter Model Linearized K PD [Lee] 34

Jitter Transfer Linearized K PD [Lee] Jitter transfer is how much input jitter transfers to the output If the PLL has any peaking in the phase transfer function, this jitter can actually be amplified 35

Jitter Transfer Measurement System recovered clock Clean Clock System input clock with sinusoidal phase modulation (jitter) Sinusoidal output voltage Sinusoidal input voltage for phase mod. [Walker] 36

Jitter Transfer Specification [Walker] 37

Jitter Generation [Mansuri] Jitter generation is how much jitter the CDR generates Assumed to be dominated by VCO Assumes jitter-free serial data input VCO Phase Noise: H n VCO out s 2 2 n VCO s 2 K Loop N 2 2 s s K Loop s 2ns n RCs N For CDR, N should be 1 38

Jitter Generation High-Pass Transfer Function Jitter accumulates up to time 1/PLL bandwidth 20log 10 out (s) vcon (s) SONET specification: rms output jitter 0.01 UI [McNeill] 39

Jitter Tolerance How much sinusoidal jitter can the CDR tolerate and still achieve a given BER? [Sheikholeslami] Maximum tolerable e e s 1 in out s s n. in s Timing Margin 2 [Lee] JTOL s 2 n. in s TM s out 1 s in 40

Jitter Tolerance Measurement [Lee] Random and sinusoidal jitter are added by modulating the BERT clock Deterministic jitter is added by passing the data through the channel For a given frequency, sinusoidal jitter amplitude is increased until the minimum acceptable BER (10-12 ) is recorded 41

Jitter Tolerance Measurement [Lee] (within CDR bandwidth) Flat region is beyond CDR bandwidth JTOL s 2 n. in s TM out s 1 s in 42

Next Time Forwarded-Clock Deskew Circuits Clock Distribution Techniques 43