Reality Check: Challenges of mixed-signal VLSI design for high-speed optical communications

Similar documents
The Fujitsu 56GSa/s Analog-to-Digital Converter Enables 100GbE Transport

Micram DAC7201 and DAC GS/s Digital to Analog Converter Systems. Data Sheet

A 4 GSample/s 8-bit ADC in. Ken Poulton, Robert Neff, Art Muto, Wei Liu, Andrew Burstein*, Mehrdad Heshami* Agilent Laboratories Palo Alto, California

Signal Integrity Design of TSV-Based 3D IC

100 Gb/s: The High Speed Connectivity Race is On

Low Jitter, Low Emission Timing Solutions For High Speed Digital Systems. A Design Methodology

A 0.18µm SiGe BiCMOS Receiver and Transmitter Chipset for SONET OC-768 Transmission Systems

Measurement Results for a High Throughput MCM

ISSCC 2006 / SESSION 13 / OPTICAL COMMUNICATION / 13.2

A 10 bit, 1.8 GS/s Time Interleaved Pipeline ADC

Microcircuit Electrical Issues

Ultra-high-speed Interconnect Technology for Processor Communication

Development of a 20 GS/s Sampling Chip in 130nm CMOS Technology

FPGA based Prototyping of Next Generation Forward Error Correction

Signal Integrity Modeling and Measurement of TSV in 3D IC

Considerations in High-Speed High Performance Die-Package-Board Co-Design. Jenny Jiang Altera Packaging Department October 2014

FLYOVER QSFP APPLICATION DESIGN GUIDE

EE434 ASIC & Digital Systems. Partha Pande School of EECS Washington State University

on-chip Design for LAr Front-end Readout

10.1: A 4 GSample/s 8b ADC in 0.35-um CMOS

Design Challenges in Multi-GHz Microprocessors

The Practical Limitations of S Parameter Measurements and the Impact on Time- Domain Simulations of High Speed Interconnects

High-speed Serial Interface

Another way to implement a folding ADC

Electrical Characterization of a 64 Ball Grid Array Package

SV2C 28 Gbps, 8 Lane SerDes Tester

A Fully Integrated 20 Gb/s Optoelectronic Transceiver Implemented in a Standard

P14155A: 128 Channel Cross-correlator ASIC Datasheet Rev 2.1

Development of Radiation-Hard ASICs for the ATLAS Phase-1 Liquid Argon Calorimeter Readout Electronics Upgrade

Preliminary Product Overview

Architectures and circuits for timeinterleaved. Sandeep Gupta Teranetics, Santa Clara, CA

Transmission-Line-Based, Shared-Media On-Chip. Interconnects for Multi-Core Processors

Micram DAC10001 and DAC GS/s Digital to Analog Converter System. Data Sheet

Lecture 11: Clocking

Proposing. An Interpolated Pipeline ADC

The Design Challenge to Integrate High Performance Organic Packaging into High End ASIC Strategic Space Based Applications.

LSI and Circuit Technologies for the SX-8 Supercomputer

High Speed Characterization Report

Lecture 160 Examples of CDR Circuits in CMOS (09/04/03) Page 160-1

Source: Nanju Na Jean Audet David R Stauffer IBM Systems and Technology Group

High Speed Characterization Report

High Speed Characterization Report

Design Considerations for Highly Integrated 3D SiP for Mobile Applications

High-Performance Analog and RF Circuit Simulation using the Analog FastSPICE Platform at Columbia University. Columbia University

12 Bit 1.2 GS/s 4:1 MUXDAC

A 1.5 Gbps Transceiver Chipset in 0.13-mm CMOS for Serial Digital Interface

EVALUATION KIT AVAILABLE +3.3V, Low-Jitter Crystal to LVPECL Clock Generator QA_C. 125MHz QA QA. 125MHz MAX3679A QB0 QB MHz QB1 QB

Low Power DSP and Photonic Integration in Optical Networks. Atul Srivastava CTO, NTT Electronics - America. Market Focus ECOC 2014

AD9300 SPECIFICATIONS ELECTRICAL CHARACTERISTICS ( V S = 12 V 5%; C L = 10 pf; R L = 2 k, unless otherwise noted) COMMERCIAL 0 C to +70 C Test AD9300K

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2012

3D IC-Package-Board Co-analysis using 3D EM Simulation for Mobile Applications

Fractional- N PLL with 90 Phase Shift Lock and Active Switched- Capacitor Loop Filter

ICS2510C. 3.3V Phase-Lock Loop Clock Driver. Integrated Circuit Systems, Inc. General Description. Pin Configuration.

Synchronous Mirror Delays. ECG 721 Memory Circuit Design Kevin Buck

Basic Functional Analysis. Sample Report Richmond Road, Suite 500, Ottawa, ON K2H 5B7 Canada Tel:

Note Using the PXIe-5785 in a manner not described in this document might impair the protection the PXIe-5785 provides.

A 10Gbps Analog Adaptive Equalizer and Pulse Shaping Circuit for Backplane Interface

Pipeline vs. Sigma Delta ADC for Communications Applications

DESIGN OF A 500MHZ, 4-BIT LOW POWER ADC FOR UWB APPLICATION

High Speed Characterization Report

LSI and Circuit Technologies of the SX-9

High Speed Characterization Report

Synthesis of Optimal On-Chip Baluns

EA/MZ Modulator Driver PMCC_EAMD12G

/$ IEEE

ECEN 720 High-Speed Links: Circuits and Systems

Low-Jitter, Precision Clock Generator with Two Outputs

HIFAS: Wide-band spectrometer ASIC

A 4 Channel Waveform Sampling ASIC in 130 nm CMOS

PCB Design Guidelines for GPS chipset designs. Section 1. Section 2. Section 3. Section 4. Section 5

SiNANO-NEREID Workshop:

Accomplishment and Timing Presentation: Clock Generation of CMOS in VLSI

A 2-bit/step SAR ADC structure with one radix-4 DAC

77 GHz VCO for Car Radar Systems T625_VCO2_W Preliminary Data Sheet

UT90nHBD Hardened-by-Design (HBD) Standard Cell Data Sheet February

HA4600. Features. 480MHz, SOT-23, Video Buffer with Output Disable. Applications. Pinouts. Ordering Information. Truth Table

Multiple Reference Clock Generator

Noise on Vcc Webinar. Presenters: Vitali Penso, Applications Engineer Murat Eron, VP of Engineering. Page 1

IP Specification. 12-Bit 125 MSPS Duel ADC in SMIC40L IPS_S40L_ADC12X2_125M FEATURES APPLICATIONS GENERAL DESCRIPTION. Single Supply 1.

SERDES Reference Clock

Engineering the Power Delivery Network

Chapter 16 PCB Layout and Stackup

Power Distribution Network Design for Stratix IV GX and Arria II GX FPGAs

Section 1. Fundamentals of DDS Technology

FIRST TELECOM APPLICATION OF DIGITAL AND MIXED COMPONENT DEVELOPMENTS: 65NM ASIC AND DATA CONVERTERS

Phase-Locked Loops and Their Applications. Advanced PLL Examples (Part II)

5Gbps Serial Link Transmitter with Pre-emphasis

High Speed Characterization Report

Impact of On-Chip Multi-Layered Inductor on Signal and Power Integrity of Underlying Power-Ground Net

Clock Tree 101. by Linda Lua

Low-Jitter, 8kHz Reference Clock Synthesizer Outputs MHz

Relationship Between Signal Integrity and EMC

Reducing Development Risk in Communications Applications with High-Performance Oscillators

Maximizing GSPS ADC SFDR Performance: Sources of Spurs and Methods of Mitigation

1.25Gbps/2.5Gbps, +3V to +5.5V, Low-Noise Transimpedance Preamplifiers for LANs

High Speed Digital Design & Verification Seminar. Measurement fundamentals

A 42 fj 8-bit 1.0-GS/s folding and interpolating ADC with 1 GHz signal bandwidth

High Speed Characterization Report

7 GHz INTEGER N SYNTHESIZER CONTINUOUS (N = ), NON-CONTINUOUS (N = 16-54) Features

Analogue circuit design for RF immunity

Transcription:

Reality Check: Challenges of mixed-signal VLSI design for high-speed optical communications

Mixed-signal VLSI for 100G and beyond 100G optical transport system Why single-chip CMOS? So what is so difficult? CHAIS ADC On-chip noise coupling Package and PCB design Testing issues Future challenges ECOC2009 Towards the Shannon limit Fujitsu Microelectronics Europe - http://emea.fujitsu.com/microelectronics 2

100G Optical Transport system 100G MUX To system (Router) 100G client module for * 100G Ethernet or * OTU-4 or 10 * 10G client module SFI-S / MLD / XFI SFI-S / MLD / XFI OTU-4 Framer / FEC SFI-S SFI-S 10 * 11.1 Gbps SFI-S SFI-S Precoding DSP 10 to 4 MUX ADC ADC ADC ADC 4 * 28 Gbps Electrical -> Optical Optical -> Electrical To network OTU-4 112 Gbps 100G Coherent Receiver ADCDSP Optical Module ECOC2009 Towards the Shannon limit Fujitsu Microelectronics Europe - http://emea.fujitsu.com/microelectronics 3

Why single-chip CMOS for 100G? Massive data bandwidth between ADC/DAC and digital 4-channel 8b 56Gs/s ADC/DAC means 1.8Tb/s of data at interface Getting this from one chip to another costs power and chip area 10G SERDES link ~250mW/channel ~10W per ADC or DAC Critical performance factor is power efficiency, not just speed Discrete ADC/DAC (e.g SiGe) dissipating ~20W each (including I/O) are difficult to use Very high total power dissipation in package (>100W for multiple channels) Skew management/calibration problem (especially over temperature/lifetime) Single-chip CMOS solution is the Holy Grail Integrate on ASIC with >50M gates or memory (size limited by power dissipation) Leverage CMOS technology advances to drive down power and cost ADC and DAC get faster and lower power at the same rate as digital -- hopefully ECOC2009 Towards the Shannon limit Fujitsu Microelectronics Europe - http://emea.fujitsu.com/microelectronics 4

ADCDSP -- so what is so difficult? ADC is the biggest circuit design problem Ultra-high speed, low noise and jitter, low power consumption all at the same time Conventional techniques cannot easily deliver required performance Digital-analogue noise coupling Sampler/clock jitter ~100fs on same chip as DSP with >100A current spikes Wide bandwidth (>20GHz) and good S11 (up to >30GHz) Sampler, package, PCB design all very challenging with high pin count FCBGA On-chip DSP design is very out-of-the-ordinary (multiple TeraOPS) Extremely power-efficient use massive parallelism, not GHz clocks (Pentium 4 ) Test Performance verification challenges limits of test equipment Need at-speed performance verification in production, not just functional testing ECOC2009 Towards the Shannon limit Fujitsu Microelectronics Europe - http://emea.fujitsu.com/microelectronics 5

The ADC problem Wideband low-noise sampler + demultiplexer + interleaved ADC array Smaller CMOS geometries higher speed worse mismatch and noise Single 56Gs/s track/hold very difficult due to extreme speed <9ps to acquire, <9ps to transfer to following interleaved T/H stages Interleaved track/hold (e.g. 4-channel 14Gs/s) also very difficult Signal/clock delays must match to <<1ps how do you measure this? Noise, mismatch and power of cascaded circuits all adds up Multiple sampling capacitors, buffers, switches, demultiplexers Layout and interconnect extremely challenging Design the circuits, then find you can t actually connect everything up Interleaved ADC back-end is not so difficult (only in comparison!) Design for best power and area efficiency rather than highest speed Interleave as many as necessary to achieve required sampling rate 8 x 175Ms/s 8b SAR ADCs fit underneath 1 solder bump 45Gs/s per sq mm ECOC2009 Towards the Shannon limit Fujitsu Microelectronics Europe - http://emea.fujitsu.com/microelectronics 6

A 56Gs/s CMOS ADC solution CHArge-mode Interleaved Sampler (CHAIS) 14GHz VCO (1 per ADC pair) DEMUX 80 x ADC Inputs 80 x 8b ADC Outputs Clocks Input 4 Phase Sampler DEMUX A DEMUX B DEMUX C DEMUX D 80 80 80 80 ADCBANK A ADCBANK B ADCBANK C ADCBANK D 80 80 80 80 Digital Output 1024b 437.5MHz Trim Voltages Calibration ECOC2009 Towards the Shannon limit Fujitsu Microelectronics Europe - http://emea.fujitsu.com/microelectronics 7

Dual ADC layout (4mm x 4mm test chip) Sampler PLL Sampler Demux Bias Demux SAR ADC array ADC Refs ADC logic SAR ADC array Waveform memory ECOC2009 Towards the Shannon limit Fujitsu Microelectronics Europe - http://emea.fujitsu.com/microelectronics 8

Example of 100G coherent receiver ASIC Architecture: Single CMOS die Technology: Interconnect: Die size: Gate count: 65nm CMOS 12 layer metal 15 mm x 15 mm ~50 million gates Package: FCBGA, >1000 pins M/S macros: 4 channel 56 Gs/s ADC 24 channel CEI-11G TX ADC power : ~2W/channel ECOC2009 Towards the Shannon limit Fujitsu Microelectronics Europe - http://emea.fujitsu.com/microelectronics 9

The DSP problem Digital design tools (and designers) *really* don t like this type of DSP The tools (and designers) synthesize circuits, then worry about how to connect them up But interconnect capacitance causes ~90% of power dissipation, not circuits Massive data bus widths (4k bits at ADC outputs) massive interconnect problem Partitioning into usable size blocks may be more difficult than it appears Tools don t like doing flat designs with tens of millions of gates (turn-around time) OK, lets split that big DSP block into two and add some pipelining Erm, about this 16k bit wide data bus you ve just introduced Better system/architecture tools for this type of design are needed Should really design/optimise the data flow, then shovel the circuits in underneath Designers brains (and system-level design tools) don t really think this way ECOC2009 Towards the Shannon limit Fujitsu Microelectronics Europe - http://emea.fujitsu.com/microelectronics 10

On-chip noise coupling Reduce aggressor (DSP logic) noise generation Use intentional skew of clock timing within each block and between blocks Reduces peak current and spreads out in time >10x lower di/dt Lots of on-chip (~400nF) and ultra-low-inductance (~4pH) in-package decoupling Increase victim (ADC analogue) immunity Fewest possible noise/jitter sensitive circuits, all fully differential Lots of on-chip (~100nF) and low-inductance in-package decoupling Improve victim-aggressor isolation Avoid low-resistance epi substrate (short-circuit for substrate noise) Build nested walls of isolation with most sensitive circuits in the middle SAR ADCs (not jitter-sensitive) form the first line of defense Isolation walls through package and into chip form the next line Demux and other analogue circuits (calibration etc.) form the next line Sampler and PLL are hidden away inside all these layers of isolation Measurements show very little noise makes it past all the defenses ECOC2009 Towards the Shannon limit Fujitsu Microelectronics Europe - http://emea.fujitsu.com/microelectronics 11

Package and PCB design 1mm pitch FCBGA, >1000 pins, 19 internal layers, copper lid Use similar package for test chips as typical ASIC to get same performance Low-loss high-tce LTCC (12ppm/C) for improved second-level reliability Multiple power/ground regions and shields for noise isolation Ultra-low-inductance internal decoupling for supplies and bias/reference Multiple interleaved VDD/VSS planes connect chip to multi-terminal decouplers Noise dealt with inside package predictable (stops end user getting it wrong) Coaxial via and waveguide structures, <1dB loss at 20GHz Ground planes completely removed above signal balls to reduce capacitance Dual 100ohm balanced lines used to connect coaxial via structure to G-S-G pads Optimized launch to G-S-G coplanar waveguide on low-loss PCB Balls on row inside signal pins removed to reduce capacitance, grounds cut back Outer PCB layers use MEW Megtron 6 (very low loss, lead-free multilayer compatible) ECOC2009 Towards the Shannon limit Fujitsu Microelectronics Europe - http://emea.fujitsu.com/microelectronics 12

Package + PCB EM field simulations ECOC2009 Towards the Shannon limit Fujitsu Microelectronics Europe - http://emea.fujitsu.com/microelectronics 13

BATBOARD and ROBIN ECOC2009 Towards the Shannon limit Fujitsu Microelectronics Europe - http://emea.fujitsu.com/microelectronics 14

Bandwidth measurement using TDR step HP8665B Signal Generator Agilent DCA-J 86100C Picosecond Pulse Labs TDR heads <10ps risetime Model 4022 TDR/TDT Source Model 4020RPH-RP head REFCLK 1.75GHz Differential Reference Clock CH1 CH2 ADC_INP ADC_INM BATBOARD Colby Instruments delay line with 1ps resolution Model 4020RPH-RN head SPI Interface PC running MATLAB ECOC2009 Towards the Shannon limit Fujitsu Microelectronics Europe - http://emea.fujitsu.com/microelectronics 15

Frequency Response (test setup and ADC) Frequency response of test setup TDR step (measured) Batboard PCB (measured) ENIG not Ag finish (Ni is lossy!) Socket (estimated -1dB @ 20 GHz) Test setup loss similar to ADC response Corrected ADC frequency response accurate measurements are not easy ADC -3dB bandwidth ~ 15GHz very close to simulation and specification ECOC2009 Towards the Shannon limit Fujitsu Microelectronics Europe - http://emea.fujitsu.com/microelectronics 16

Production test Need proper performance verification, not just functionality Increased confidence that chip actually meets design specifications Make chip self-testing as far as possible and do at-speed performance tests Drive ADC inputs from wideband n-way power combiners Sum outputs of multiple CEI-11G channels with sinewave input(s) Enable and disable channels/clocks instead of switching (avoid 20GHz+ relays) Test ADC ENOB using sinewave input(s) Sampled data stored in on-chip RAMs then read out and analysed (ENOB) Signal source TBD (filtered DRO? phase locked to REF?) high quality essential Test CEI-11G outputs by looping back into 56Gs/s ADC inputs 5 samples per bit gives complete waveform analysis on *all* TX channels Full-speed measurement of eye opening and jitter ECOC2009 Towards the Shannon limit Fujitsu Microelectronics Europe - http://emea.fujitsu.com/microelectronics 17

Future challenges -- what obstacles are there to progress beyond 100Gb/s? Sampler noise/bandwidth/interleave skew/clock jitter Can be solved using new CMOS techniques instead of exotic technology CHAIS sampler/demux/adc is capable of >100Gs/s even in 65nm Bandwidth scales with clock rate (-3dB at ~0.3Fs) Input bandwidth increase and S11 improvement FBGA package modifications to optimize design for very high frequencies Smaller ball pitch conflicts with second-level reliability and PCB issues Power consumption DSP issue, ADC is ~2W/channel (65nm, scales like digital) DSP power is several times ADC power, especially with more complex systems Power increase (complexity) is outrunning power savings (process shrink) Layout (interconnect and floorplan) feasibility Everything wants to be on top of everything else with zero-length connections Could need unconventional layouts ADCs might look like dartboards ECOC2009 Towards the Shannon limit Fujitsu Microelectronics Europe - http://emea.fujitsu.com/microelectronics 18

ECOC2009 Towards the Shannon limit Fujitsu Microelectronics Europe - http://emea.fujitsu.com/microelectronics 19