Lecture 12: Introduction to Link Design

Similar documents
Lecture 15: Transmitter and Receiver Design

Lecture 12: Introduction to Link Design

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2012

EE290C - Spring 2004 Advanced Topics in Circuit Design High-Speed Electrical Interfaces. Outline

Circuit Design for a 2.2 GByte/s Memory Interface

ECEN 720 High-Speed Links: Circuits and Systems. Lab3 Transmitter Circuits. Objective. Introduction. Transmitter Automatic Termination Adjustment

5Gbps Serial Link Transmitter with Pre-emphasis

ECEN 720 High-Speed Links: Circuits and Systems

ECEN 720 High-Speed Links Circuits and Systems

High-speed Serial Interface

A Variable-Frequency Parallel I/O Interface with Adaptive Power Supply Regulation

Signal Technologies 1

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2012

ECEN720: High-Speed Links Circuits and Systems Spring 2017

A CMOS Multi-Gb/s 4-PAM Serial Link Transceiver*

Lecture 10. Circuit Pitfalls

A 5-Gb/s 156-mW Transceiver with FFE/Analog Equalizer in 90-nm CMOS Technology Wang Xinghua a, Wang Zhengchen b, Gui Xiaoyan c,

ECEN620: Network Theory Broadband Circuit Design Fall 2014

Lecture 9: Clocking for High Performance Processors

Low Jitter, Low Emission Timing Solutions For High Speed Digital Systems. A Design Methodology

Multi-gigabit signaling with CMOS

High-Performance Electrical Signaling

To learn fundamentals of high speed I/O link equalization techniques.

12 Bit 1.5 GS/s Return to Zero DAC

A Serial Link Transceiver Based on 8 GSa/s A/D and D/A Converters

HIGH-SPEED LOW-POWER ON-CHIP GLOBAL SIGNALING DESIGN OVERVIEW. Xi Chen, John Wilson, John Poulton, Rizwan Bashirullah, Tom Gray

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2010

20Gb/s 0.13um CMOS Serial Link

Lecture 11: Clocking

CHAPTER 6 PHASE LOCKED LOOP ARCHITECTURE FOR ADC

A 2-byte Parallel 1.25 Gb/s Interconnect I/O Interface with Self-configurable Link and Plesiochronous Clocking

Lecture 3 Switched-Capacitor Circuits Trevor Caldwell

High-Speed Links. Agenda : High Speed Links

EE290C - Spring 2004 Advanced Topics in Circuit Design High-Speed Electrical Interfaces. Announcements

IN the design of the fine comparator for a CMOS two-step flash A/D converter, the main design issues are offset cancelation

Low-Power Pipelined ADC Design for Wireless LANs

Analog CMOS Interface Circuits for UMSI Chip of Environmental Monitoring Microsystem

A Serial Link Transceiver Based on 8 GSa/s A/D and D/A Converters

Lecture 23: PLLs. Office hour on Monday moved to 1-2pm and 3:30-4pm Final exam next Wednesday, in class

ECEN720: High-Speed Links Circuits and Systems Spring 2017

ECE 497 JS Lecture - 22 Timing & Signaling

TOP VIEW. Maxim Integrated Products 1

ECEN620: Network Theory Broadband Circuit Design Fall 2012

Phil Lehwalder ECE526 Summer 2011 Dr. Chiang

A 4 GSample/s 8-bit ADC in. Ken Poulton, Robert Neff, Art Muto, Wei Liu, Andrew Burstein*, Mehrdad Heshami* Agilent Laboratories Palo Alto, California

10.1: A 4 GSample/s 8b ADC in 0.35-um CMOS

12 Bit 1.2 GS/s 4:1 MUXDAC

A Clock Generating System for USB 2.0 with a High-PSR Bandgap Reference Generator

Lecture 160 Examples of CDR Circuits in CMOS (09/04/03) Page 160-1

INL PLOT REFIN DAC AMPLIFIER DAC REGISTER INPUT CONTROL LOGIC, REGISTERS AND LATCHES

Self-Biased PLL/DLL. ECG minute Final Project Presentation. Wenlan Wu Electrical and Computer Engineering University of Nevada Las Vegas

Introduction to CMOS RF Integrated Circuits Design

ISSCC 2006 / SESSION 13 / OPTICAL COMMUNICATION / 13.2

10-Bit, Low-Power, Rail-to-Rail Voltage-Output Serial DAC in SOT23

ULTRASCALE DDR4 DE-EMPHASIS AND CTLE FEATURE OPTIMIZATION WITH STATISTICAL ENGINE FOR BER SPECIFICATION

To learn statistical bit-error-rate (BER) simulation, BER link noise budgeting and using ADS to model high speed I/O link circuits

Digital Systems Power, Speed and Packages II CMPE 650

Phase interpolation technique based on high-speed SERDES chip CDR Meidong Lin, Zhiping Wen, Lei Chen, Xuewu Li

A 14-bit 2.5 GS/s DAC based on Multi-Clock Synchronization. Hegang Hou*, Zongmin Wang, Ying Kong, Xinmang Peng, Haitao Guan, Jinhao Wang, Yan Ren

40 Gbps SiGe PATTERN GENERATOR IC WITH VARIABLE CLOCK SKEW AND OUTPUT LEVELS MATTHEW JOHN ZAHLLER

Lecture 15: Clock Recovery

Burst Mode Technology

10-Bit µp-compatible D/A converter

Lecture 15: Clock Recovery

A10-Gb/slow-power adaptive continuous-time linear equalizer using asynchronous under-sampling histogram

A 10Gbps Analog Adaptive Equalizer and Pulse Shaping Circuit for Backplane Interface

High-Frequency Programmable PECL Clock Generator

12 Bit 1.3 GS/s Master-Slave 4:1 MUXDAC. 12 BIT 4:1 MUX 1.3GS/s DAC, DIE Lead HSD Package 12 BIT 4:1 MUX 1.3GS/s DAC, 88 Lead QFP Package

A 0.8-pm CMOS 2.5 Gb/s Oversampling Receiver and Transmitter for Serial Links. ext-clk Loop. ~ Control I U e 1

Quad 12-Bit Digital-to-Analog Converter (Serial Interface)

Lecture 3. FIR Design and Decision Feedback Equalization

A Phase-Locked Loop with Embedded Analog-to-Digital Converter for Digital Control

L10: Analog Building Blocks (OpAmps,, A/D, D/A)

Lecture 3. FIR Design and Decision Feedback Equalization

On Chip Active Decoupling Capacitors for Supply Noise Reduction for Power Gating and Dynamic Dual Vdd Circuits in Digital VLSI

ECE 6770 FINAL PROJECT

2. ADC Architectures and CMOS Circuits

ECEN620: Network Theory Broadband Circuit Design Fall 2014

4 x 10 bit Free Run A/D 4 x Hi Comparator 4 x Low Comparator IRQ on Compare MX839. C-BUS Interface & Control Logic

Accomplishment and Timing Presentation: Clock Generation of CMOS in VLSI

Synchronous Mirror Delays. ECG 721 Memory Circuit Design Kevin Buck

Decoupling Technique for Reducing Sensitivity of Differential Pairs to Power-Supply-Induced Jitter

UNIT-II LOW POWER VLSI DESIGN APPROACHES

Transmitter Equalization for 4Gb/s Signalling

DESIGN OF MULTIPLYING DELAY LOCKED LOOP FOR DIFFERENT MULTIPLYING FACTORS

Lecture 19: Design for Skew

ENEE 359a Digital VLSI Design

INF4420 Switched capacitor circuits Outline

Dedication. To Mum and Dad

SWITCHED CAPACITOR CIRCUITS

12-Bit, Low-Power, Dual, Voltage-Output DAC with Serial Interface

EE 330 Lecture 43. Digital Circuits. Other Logic Styles Dynamic Logic Circuits

A 3-10GHz Ultra-Wideband Pulser

INF3430 Clock and Synchronization

10-Bit 5MHz Pipeline A/D Converter. Kannan Sockalingam and Rick Thibodeau

Advanced Operational Amplifiers

800Mbps LVDS/LVPECL-to-LVDS 2 x 2 Crosspoint Switch

Engineering the Power Delivery Network

ICS1885. High-Performance Communications PHYceiver TM. Integrated Circuit Systems, Inc. General Description. Pin Configuration.

LMV331TP / LMV393TP. General Purpose, 1.8V, RRI, Open-Drain Output Comparators. Features. Description. Applications. Pin Configuration (Top View)

Transcription:

Lecture 12: Introduction to Link Design Computer Systems Laboratory Stanford University horowitz@stanford.edu Copyright 2000 by Mark Horowitz EE371 Lecture 12-1 Horowitz

Overview Reading Chapter 19 - High Speed Link Design, by Ken Yang, Stefanos Sidiropoulos Introduction There has been an explosion of interest in high-speed IO over the past 10 years. It is now being used in products ranging from DRAMs to inteconnects in high-end servers and routers. This lecture will give an overview of the basic elements needed in a high-speed link, and will set up what we will discuss in the next few lectures. We start be looking at what makes driving external wires different from the work we have done driving internal wires. EE371 Lecture 12-2 Horowitz

Basic IO Design All external signal paths can be represented by three elements: Tx Channel Rx Transmitter Converts bits to an analog electrical signal to transmit on pin Channel Transmission media for the analog signal, which is sometimes pretty nasty to signal fidelity Receiver Convert the analog signals back to bits (quantized in voltage and time) It is the need to convert back to digital signals that can be a problem EE371 Lecture 12-3 Horowitz

Basic Issues Voltage Margins: making sure voltage quantization yields the correct result: R TERM R TERM Tx Channel Rx Timing Margins: knowing which bit is which: 1 0 0 1 0 1 0 t bit /2 For a high-speed signal, bits are pretty short (< 1ns) EE371 Lecture 12-4 Horowitz

What is the Problem Deal with analog signals all the time on chip. What is the issue with IO? Why are timing and voltage margins an issues for external signals? The speed of light is not infinite Wire connecting the gates is not an equipotential - It is not even an RC line So life gets complicated EE371 Lecture 12-5 Horowitz

Finite Speed of Light Signals on wires must experience delay in reaching destination -- T d = L/ν - Bit arrive at a different time then when they were sent - Must sample the data at the correct time - And the clocks to the two chips might not arrive at the same time Wires store energy Tx return current While the signal is travelling on the wire, - Current from Tx is initially set by the wire (can t see resistor at t=0). V/i for the line is its impedance, Z, and is set by the geometry of the wire - Signal is a pair of currents that propagates out from source (current and return) EE371 Lecture 12-6 Horowitz

Transmission Lines (What wires are called when you notice ν is not infinite) Two constraints govern behavior at any junction: Voltage at all components must be equal Electrically connected Power flow into junction must equal power flow out of junction Conservation of energy Leads to reflections Z2 Z1 2Z2 ------------------- ------------------- Z1 + Z2 Z1 + Z2 Z1 Z2 Signals can return to the source, as well as propagate forward Note: the signal return path is usually drawn; it is as important as the signal EE371 Lecture 12-7 Horowitz

Conventional Buses IC #1 IC #2 IC #N bus lines bus CLK t 1N t 12 t ck1 t ckn Have many problems Electrical distances from chip to chip vary, have stubs in the transmission lines - Make signal environment difficult EE371 Lecture 12-8 Horowitz

Stubs Can t connect to the middle of a transmission line without causing trouble: Z Z Unless stub is short, it will cause reflections, since energy will split and only part will go into each transmission like segment Add lots of noise to the signal Slow the signal propagation - Energy must reflect off all the stubs before settling down EE371 Lecture 12-9 Horowitz

What Length is Short Enough? Length is compared to distance light travels over what time? Related to the rise time of the signal, not frequency - If the signal does not change much by the time the reflection returns, you won t see the reflection -- the reflection settles during the transition - If stub is short, energy storage in line is not significant - Model by lumped parameters Since fewer transmission lines are better, we want to slow signal edge rates - Bit should be 1/3 rise, 1/3 high, 1/3 fall - Risetime should be 1/2 to 2/3 of bit time Nice edge rate for bits reflections are of small effective amplitude EE371 Lecture 12-10 Horowitz

High Speed Links ref CLK DLL/PLL data CLK ref Almost all high-speed links are point to point. - Sending a clock as a additional data bit helps determine timing if the data cable lengths are all matched - Called source synchronous links EE371 Lecture 12-11 Horowitz

Coupling All transmission lines need a current return path Current return And are really differential systems Voltage difference between input and return is equal to the voltage difference between output and the voltage of the return at that end. The two return need not be at exactly the same voltage. If return path is far away, another signal can see the signal too Coupled transmission lines - Vout = a Vin1 + b Vin2 Coupling on PC board, and coax cables are small Coupling on Twisted pairs and IC packages is significant EE371 Lecture 12-12 Horowitz

Metrics Need to measure links Look at a couple of metrics Performance Bit rate - Normalize out the fabrication technology As technology scales, how fast will link become? - Use FO4 Bit Error Rate (BER) - Link reliability - Signal to noise ratio - Should scale with technology, performance easy to predict EE371 Lecture 12-13 Horowitz

Bit Error Rate Receiver needs to convert analog signal to digital value Possible to make an error -- noise is greater than signal - Voltage noise - Timing noise Reduces the amount of signal available BER - Depends on signal to noise ratio (SNR) - Also depends on noise statistics overlay of bits bit stream eye diagram with voltage and timing noise EE371 Lecture 12-14 Horowitz

BER and SNR Many textbooks give plots like: Show BER exponentially related to SNR But assume gaussian noise Real noise is not gaussian - Small white (or colored noise) gaussian - Large self-induced noise Not true noise, but hard to calculate Calculated Bit Err Rate (log) -1.0-3.0-5.0-7.0-9.0-11.0-13.0-15.0-17.0 10.0 12.0 14.0 16.0 18.0 20.0 22.0 Signal/Noise Ratio (db) Take true signal (signal - self-induced noise) and compare to white noise Give effective SNR White noise is small (mvs, but hard to estimate) Can make BER for most electronic links (non-optical) very, very small EE371 Lecture 12-15 Horowitz

Summary Electronics need to deal with: Transmission line impedance - Need to have some method of dissipating the energy - Need to drive relatively low impedances (< 100 ohms) Noisy Signals - Some noise is proportional to signal amplitude Can t perfectly set impedance or resistance, will have reflections Coupling of other signals - Fundamental noise in analog world Line Delay - Need to extract timing from signal or some other reference. EE371 Lecture 12-16 Horowitz

Transmitter and Receiver Design Outline System Architectures - What does the system look like? Noise - What does the signal integrity engineer have to do? Drivers - How do I generate these 500-mV swing signals out of a 3.3-V chip? Receivers - How do I restore these 500-mV signals to 3.3-V? Bidirectional Signalling - What can I do to save pins and wires? EE371 Lecture 12-17 Horowitz

The Conventional Bus Bottleneck #1 #2 #N bus-clk Timing is uncertain: - Distances of data from chip to chip and from clock to any chip vary - -> So we need to slow down to have margins for the worst case Signals don t look that great either: - Multiple discontinuities on bus transmission line create reflections - Using a conventional buffer to drive a low impedance generates noise and burns a lot of power (3.3V to 50 Ohms ~ 210 mwatts!!) EE371 Lecture 12-18 Horowitz

Point-to-Point Parallel Links Source Synchronous /low-swing design: ref CLK DLL/PLL data CLK Transmitter timing Receiver timing ref CLK ref CLK data D0 D1 D2 D3 data D0 D1 D2 D3 CLK Bandwidth is set by delay uncertainty and not total delay through wires Uncertainty is created by: skew, jitter, rcv/xmit offsets, setup+hold time. PLL/DLL used to create the 90 o clock on the receiver side. Use small swing signals to minimize power and noise EE371 Lecture 12-19 Horowitz

High Speed Buses Rambus channel: talk only from master->slave, or slave->master SL-1 SL-2 Sl-N bus master data CK m-s CK s-m ck Same timing idea: make sure data & clock travel the same distance - Now both transmitter and receiver need to allign with the system clock More difficult environment than point-point: - Multiple discontinuities on transmission line are dealt with carefull package and board design Again PLL/DLL used for timing. More on these later... EE371 Lecture 12-20 Horowitz

Noise Need to send signals that can be distinguished from environment noise + = Independent noise - Gaussian (unbounded) but very small probability (< 10-20 ) for appreciable (1- mv) noise. - Unrelated power supply noise: background activity of the chip and other drivers switching unpredicrably. Proportional noise (scales with signal swing): - Self Induced di/dt noise (also called signal return noise) - Crosstalk/Coupling from other signals. - Mistermination -> reflections EE371 Lecture 12-21 Horowitz

Aside on Supply Noise On-chip switching Vdd Cd + Vss CL - Causes Vdd and Vss to droop out of phase. On chip Vdd-Vss capacitance can be used to minimize this effect by supplying the required charge. Off chip driving Vdd + Cd Vss Zl - Causes Vdd and Vss to move in phase. The on chip Vdd-Vss capacitance does not help minimize the noise. It prevents the supply from colapsing. EE371 Lecture 12-22 Horowitz

Noise: What can you do. Overpower it with large signal swings - Works great for Gaussian noise and unrelated bounded noise Cancel by using differential signalling - Works for self-induced di/dt noise crosstalk and unrelated PS noise - Pseudo-differential signalling works to a certain extent Minimize by carefull/conservative design - Don t route large swing signals close to low swing signals - Route differential signals close together Always do worst case estimation: E.g. N*L*dI/dt use max N, max L, FF corner to get the max di/dt EE371 Lecture 12-23 Horowitz

Output Drivers Output Impedance: High -> parallel terminated current source more power, better supply rejection Low -> series terminated voltage source lower power, poor supply rejection Output swing: 300 mv - 1 V (scalable with Vdd) Differential or Single-Ended Differential: more wires and pins but better noise immunity Single-Ended: Pure single ended has lots of problems due to unrelated PS noise. Usually generate a reference and share it among many pins. Still more problems with noise than fully-differential. EE371 Lecture 12-24 Horowitz

High Impedance Drivers in Single-ended A Zo Td V tt B Ro o Differential o Zo Zo in Vtt VIH Vtt-Zo*Idrv Vbias Td Keep current source in saturation region Vtt-Vswing > Vdsat of transistor Keep driver current constant: -> IR drops will shift the bias point: use thick Vss lines or current references -> can use feedback to set Vbias (or adjust tail-cs width) EE371 Lecture 12-25 Horowitz

Source Terminated Drivers in Push-pull Rs A B C Zo in Zd A Rs Open drain B Zo Td Rt Vtt C in Zd Vsw Rs A B Td Td Zo C Zd+Rs = Zo = Rt or Rs=0, Zd<<Zo=Rt in A Vtt*Zd/(Zd+Rs+Zo) in A B C Zd+Rs = Zo Vsw*Zd/(2*Zo) Vsw/2 Td Td Vsw B C Vtt*(Zd+Rs)/(Zd+Rs+Zo) Td You can use differential signalling by duplicating the drivers or generating a reference voltage. EE371 Lecture 12-26 Horowitz

Example: Push-pull signalling +1-V local CLK clk DLL x N data +1-V - + data-p data-n x N Reference voltage can be generated on-chip but noise tracking is limited Loading of reference on the receiver side is much larger than that of the signal EE371 Lecture 12-27 Horowitz

Driver Issues Driver Impedance/Current control use active circuits to compensate for process/supply/temp variations Drivers turn-on time is an issue (slew rate) If turn on is too fast it will increase the self-induced di/dt noise so we need to control the slew rate of the pre-driver. This is hard to do: if you compensate for the FF corner the SS corner will become too slow. EE371 Lecture 12-28 Horowitz

Driver Impedance/Current Control Need to match the driver impedance to the line impedance (Zd=Zo) or regulate the current to keep the swing constant. Adjust the width of the driver digitally control register df N binary sized devices sig S0 S1 d0 d1 df d0 d1 F w 2xw F should give Z max >Z o at FF corner (2 N -1)xW should give Z min <Z o at SS corner (S0=..=SN=1) EE371 Lecture 12-29 Horowitz

Driver Impedance Control (cont d) How do you set the value of the control register? Set it with scan at system power-up (what about variations?) Integrate a feedback mechanism with a replica driver Vswing Ro replica driver d[n:1] U/D cnt control register to real buffers Vref=Vswing/2 FSM LoadEn Move the value of the counter to the control register periodically Glitches when changing from 011... to 100... -> Assert LoadEn only when not transmitting -> Change from binary weights to thermometer-like code EE371 Lecture 12-30 Horowitz

Output Slew Rate Control Problem Sharp slew-rates introduces high-frequency components EMI issue at the output and reflections from parasitics on the channel So we need to control the slew rate of the pre-driver... but it is a hard problem. Slow down the pre-driver? min. data rate max. di/dt 70% SS process corners FF If you compensate for the FF corner the SS corner will become too slow and cause inter-symbol interference of the data. EE371 Lecture 12-31 Horowitz

Slew Rate Control Delay the turn on. Use RC delay (or buffer delays) [TI] out R V pre-driver δ δ δ Set the pre-driver slew-rate using a control voltage from a process indicator [6]. pre-driver out ctrl from process indicator (i.e. a VCO) time EE371 Lecture 12-32 Horowitz

Output Driver Summary R s Z o R o Z o R o Voltage-mode driver series-terminated voltage source lower power Worse supply rejection Current-mode driver parallel-terminated current source more power, less reflection noise better supply rejection Deal with process variations: control the current and output impedance using a feedback. control the slew rate using feedback Differential signalling reduces noise but uses 2x the number of pins. Are we done? Not yet. What s the bandwidth limitation? EE371 Lecture 12-33 Horowitz

Where is the Bandwidth Limit? clk t pw R o data D Q C pad predriver R o C pad at the output? No, usually very small since R o <= 50Ω. Minimum pulse width (t pw )? Maybe, 3x t prop-dly of predriver. Clock cycle-time? Yes, FO-4 buffer chain need clock period of 6-8 FO-4 delay. Solution: use more bits/cycle EE371 Lecture 12-34 Horowitz

Parallelism Use multiplexer to improve the bandwidth. clk data odd data even 50Ω data out Driver C pad clk data out datao datae 2:1 multiplexer has a bit-time limit of 2 FO-4. clk clkb 25 data O data E pulse width closure (%) 20 15 10 5 0 1.5 2 2.5 3 3.5 bit time (normalized to FO4) Clock is still limits bit-time (3-4 FO-4), but higher multiplexing is limited by mux EE371 Lecture 12-35 Horowitz

More Bits/Cycle Use low swings and higher fan-in mux. Convenient to mux at the output. (trades off larger output RC) D out sel 0 D 0 D 1 D 2 sel 0 sel 1 sel 2 x N sel 1 D out0 D out1 D out2 Multiplexer Limited by the minimum pulse width on-chip (2 FO-4), Use multiple phases and overlapping currents. Reach bit-time of 1 FO-4.[11] ck1 ck2 clock(ck3) Tx-PLL VCO ck0 ck1 ck2 ck3 D0 D1 D2 data(ck0) Current Pulse ck3 R TERM data R TERM out out x 8 data Amplitude reduction (%) fan-in = 8 25 20 15 10 5 0.60 0 0.70 0.80 0.90 1.00 bit-width (# FO-4) EE371 Lecture 12-36 Horowitz

Receiver Vi+ Vi- + - V os + - Clk Amplify and latch the signal stream into a digital bit sequence. Issues bandwidth resolution limited by noise and offset ensure good timing margin EE371 Lecture 12-37 Horowitz

Timing Margin Factors that degrade the margin: Sampling clock jitter: Data jitter: Transmitter clock t jc Receiver uncertainty window: offset, noise, metastability (t setup-hold ) t jd t sh Remaining: t margin = 0.5*(t bit - t jc - t jd - t sh ) EE371 Lecture 12-38 Horowitz

Receiver Design Differential vs single-ended: Every receiver has a reference voltage (implicit for single-ended) Differential receiver rejects common-mode noise can be used for singled-ended inputs (pseudo-differential). Try to use the reference information sent along with the signal. Circuit topology Vin+ + - clk D Q dout Vin+ Vinclk clk Vinclk Amplifier followed by a latch. Latching sense-amplifier structures EE371 Lecture 12-39 Horowitz

Amplifying receiver [1] V-/Vref V+ V o ck Self biased amplifier with medium/high input common mode self biasing improves P/N tracking. can use the dual structure if inputs have low common mode. Resolution input-referred offset: transistor random mismatch (V T, K P ) and systematic errors (V o_min from latch) Timing Errors The delay is sensitive to PS increase the uncertainty on the switching time of V o. Setup-hold time depends on latch (which can be poor.) Gain-bandwidth limitation introduces inter-symbol interference for high data rates. (4-6 FO-4) EE371 Lecture 12-40 Horowitz

Sampling receiver [7] ck ck Grey device show cross-coupled inverters that regenerate. Vo- Vo+ Need a latch at the output to hold th data for the full clock cycle. ck ck Vi+ ck Vick Vo+ S/H track input hold input LTC precharge regenerate No ISI because the outputs are equalized for each incoming bit. Slightly worse input offset than before: 50-100mV Setup/hold window of < 100ps Be careful about sampling noise and charge-kick back. Bit-time is limited by the cycle-time (to have enough gain) of 6-8 FO-4. EE371 Lecture 12-41 Horowitz

Sampling Receiver sample In Strong-Arm Latch Small Kick-back onto inputs Good gain EE371 Lecture 12-42 Horowitz

Demultiplexing Double the data bandwidth (bit-time of 3-4 FO-4) with 2:1 demultiplexing d in Rcv0 Rcv1 clk RX d in0 d in clk TX d in1 d in0 sample points ref clk RX d in1 Can extend to higher bandwidth (~ 0.5 FO-4) [11] Limit in data rate is really the sampling aperture of the samplers and not the cycle time of the latch. D in D 0 D 1 D 2 ck0 ck1 ck2 x N D in0 ck 0 ck 1 ck 2 D in1 D in2 Demultiplexer EE371 Lecture 12-43 Horowitz

Input Offset Correction Resolution is limited by offset (V T and K P ) between differential inputs, but it s a static offset. Statically trim the offset per latch can use digital correction (DAC) in + + _ in + + _ + _ DAC ctrl register Active offset cancellation: connect in a feedback [8]. EE371 Lecture 12-44 Horowitz

Parallel Link Example V tt d 0e d 0o W d 0 + - + d 0e d 0o x N - V tt V dd W/2 ref clk Current-mode driver Latching receiver Share the reference to save pins and wires. Sending reference along allows some tracking of driver side noise. But the noise tracking is limited, especially at the receiver... EE371 Lecture 12-45 Horowitz

Reference Noise is Different Reference is filtered differently from data (for multiple parallel inputs) so noise couples differently between signal and reference. R D L P 0 R D L P C IN C REF V SS V IN V REF Noise Amplitude 1.5 1.0 0.5 0.0 10 7 10 8 10 9 10 10 VSS Noise Frequency (MHz) So far we only take a single sample of the data noise can occur any time. EE371 Lecture 12-46 Horowitz

Integrating Receiver To increase robustness: Take multiple samples and do averaging [12] Integrate the input data and decide at the end [5]. C C φ V o φ V i I Noise does not affect polarity of V o. You can amplify and latch V o with a conventional receiver afterwards. EE371 Lecture 12-47 Horowitz

Receiver Summary Two types of receivers: amplify + latch: better offsets but bandwidth limited by amplifier sample + latching: no ISI but sampling noise. Bandwidth: Can reach 3-4 FO-4 easily using 1:2 demultiplexing. More demultiplex for better bandwidth: sampling bandwidth limits to 0.5 FO-4. Resolution: Static offsets: cancel with offset cancellation Differential to reduce noise. Reference noise: need to filter the input. What about timing noise? EE371 Lecture 12-48 Horowitz

Transmitter and Receiver References [1] B. Chappel, et. al. Fast CMOS ECL Receivers With 100 mv Sensitivity, IEEE Journal of Solid State Circuits, vol. 23, no. 1, Feb. 1988. [2] N. Kushiyama et. al., A 500Mbyte/sec Data-Rate 4.5M DRAM, IEEE Journal of Solid State Circuits, vol. 28, no. 4, April 1993 [3] A. DeHon et. al. Automatic Impedance Control, International Solid State Circuits Conference Digest of Technical Papers, pp. 164-165, Feb. 1993. [4] S. Kim et. al. A pseudo-synchronous skew-insensitive I/O scheme for high bandwidth memories, IEEE Symposium on VLSI Circuits, June 1994. [5] S. Sidiropoulos, M. Horowitz, A 700 Mbps/pin CMOS Signalling Interface Using Current Integrating Receivers, IEEE Symposium on VLSI Circuits, Jun. 1996. [6] K. Donelly et. al., A 660Mb/s Interface Megacell Portable Circuit in 0.35um-0.7um CMOS ASIC, International Solid State Circuits Conference Digest of Technical Papers, pp. 290-291, Feb. 1996. [7] A. Yukawa, et. al. A CMOS 8-bit high speed A/D converter IC. 1988 Proceedings of the Tenth European Solid-State Circuits Conference p. 193-6 [8] J.T. Wu, et. al. A 100-MHz pipelined CMOS comparator IEEE Journal of Solid-State Circuits, Jun. 1988, vol. 23, no.6, p. 1379-85 EE371 Lecture 12-49 Horowitz

[9] B. Gunning, et. al. A CMOS low-voltage-swing transmission-line transceiver, 1992 IEEE International Solid-State Circuits Conference Digest of Technical Papers, Feb. 1992, p. 58-9 [10] S. Sidiropoulos, et. al. A CMOS 500 Mbps/pin synchronous point to point link interface Proceedings of 1994 IEEE Symposium on VLSI Circuits. Digest of Technical Papers p. 43-4 [11] C.K. Yang, et. al. A 0.5-µm CMOS 4.0-Gbps Serial Link Transceiver with Data Recovery using Oversampling, IEEE Journal of Solid State Circuits, May 1998, vol.33, no.5, p. 713-22 [12] S. Kim, et. al. An 800Mbps Multi-Channel CMOS Serial Link with 3x Oversampling, IEEE 1995 Custom Integrated Circuits Conference Proceedings, pp. 451, Feb. 1995. [13] JEDEC, Stub Series Terminated Logic for 3.3V (SSTL_3), EIA/JESD8-8, www.jedec.org [14] JEDEC, High-speed Transceiver Logic (HSTL), EIA/JESD8-6, www.jedec.org EE371 Lecture 12-50 Horowitz