Enabling Improved DSP Based Receivers for 100G Backplane

Similar documents
Precoding proposal for PAM4

56+ Gb/s Serial Transmission using Duobinary Signaling

Precoding and Signal Shaping for Digital Transmission

Performance Analysis of n Wireless LAN Physical Layer

High-Speed Circuits and Systems Laboratory B.M.Yu. High-Speed Circuits and Systems Lab.

SPACE TIME coding for multiple transmit antennas has attracted

Chapter 9. Digital Communication Through Band-Limited Channels. Muris Sarajlic

Transmission Proposal for 10GBASE-T G. Zimmerman, SolarFlare

QAM-Based Transceiver Solutions for Full-Duplex Gigabit Ethernet Over 4 Pairs of UTP-5 Cable. Motivation for Using QAM

ON SYMBOL TIMING RECOVERY IN ALL-DIGITAL RECEIVERS

Maximum Likelihood Sequence Detection (MLSD) and the utilization of the Viterbi Algorithm

Department of Electronics and Communication Engineering 1

Master s Thesis Defense

Notes 15: Concatenated Codes, Turbo Codes and Iterative Processing

MULTILEVEL CODING (MLC) with multistage decoding

A Differential Detection Scheme for Transmit Diversity

Channel Precoding for Indoor Radio Communications Using Dimension Partitioning. Yuk-Lun Chan and Weihua Zhuang, Member, IEEE

ECEN720: High-Speed Links Circuits and Systems Spring 2017

TABLE OF CONTENTS CHAPTER TITLE PAGE

International Journal of Scientific & Engineering Research Volume 9, Issue 3, March ISSN

Computer Exercises in. Communication Theory SMS016

DESIGN, IMPLEMENTATION AND OPTIMISATION OF 4X4 MIMO-OFDM TRANSMITTER FOR

Intro to coding and convolutional codes

IN MOST situations, the wireless channel suffers attenuation

Outline. Communications Engineering 1

QAM-Based 1000BASE-T Transceiver

A 5-Gb/s 156-mW Transceiver with FFE/Analog Equalizer in 90-nm CMOS Technology Wang Xinghua a, Wang Zhengchen b, Gui Xiaoyan c,

BER-optimal ADC for Serial Links

A Novel Joint Synchronization Scheme for Low SNR GSM System

Array Like Runtime Reconfigurable MIMO Detector for n WLAN:A design case study

10GBASE-T T Tutorial. SolarFlare Communications IEEE Kauai, Hawaii. November 11, 2002

Chapter 3 Convolutional Codes and Trellis Coded Modulation

Implementation of MLSE equalizer in OptSim and evaluation of its performance

THE computational complexity of optimum equalization of

A System-Level Description of a SOQPSK- TG Demodulator for FEC Applications

HARDWARE-EFFICIENT IMPLEMENTATION OF THE SOVA FOR SOQPSK-TG

Reduced Complexity by Incorporating Sphere Decoder with MIMO STBC HARQ Systems

THE idea behind constellation shaping is that signals with

1000BASE-RH PHY system simulations

Decoding of Block Turbo Codes

INFRARED (IR) radiation using intensity modulation with

6. FUNDAMENTALS OF CHANNEL CODER

Performance of Nonuniform M-ary QAM Constellation on Nonlinear Channels

Simulation Study for the Decoding of UHF RFID Signals

Know your Algorithm! Architectural Trade-offs in the Implementation of a Viterbi Decoder. Matthias Kamuf,

Partial Response Signaling for Backplane Applications

10Gb/s PMD Using PAM-5 Trellis Coded Modulation

FPGA Implementation of Viterbi Algorithm for Decoding of Convolution Codes

Disclaimer. Primer. Agenda. previous work at the EIT Department, activities at Ericsson

UNIVERSITY OF CALIFORNIA College of Engineering Department of Electrical Engineering and Computer Sciences EECS 121 FINAL EXAM

TSTE17 System Design, CDIO. General project hints. Behavioral Model. General project hints, cont. Lecture 5. Required documents Modulation, cont.

Performance Comparison of RAKE and Hypothesis Feedback Direct Sequence Spread Spectrum Techniques for Underwater Communication Applications

IN A TYPICAL indoor wireless environment, a transmitted

2005 Viterbi Conference. Applications of the Viterbi Algorithm in Data Storage Technology

Department of Electronic Engineering FINAL YEAR PROJECT REPORT

Embedded Orthogonal Space-Time Codes for High Rate and Low Decoding Complexity

Lecture #2. EE 471C / EE 381K-17 Wireless Communication Lab. Professor Robert W. Heath Jr.

Improved concatenated (RS-CC) for OFDM systems

Joint Viterbi Detector/Decoder for Satellite Comms.

VOL. 3, NO.11 Nov, 2012 ISSN Journal of Emerging Trends in Computing and Information Sciences CIS Journal. All rights reserved.

Beyond 25 Gbps: A Study of NRZ & Multi-Level Modulation in Alternative Backplane Architectures

THE rapid growth of the laptop and handheld computer

Implementation and Complexity Analysis of List Sphere Detector for MIMO-OFDM systems

n Based on the decision rule Po- Ning Chapter Po- Ning Chapter

CHAPTER 5 DIVERSITY. Xijun Wang

Impact of DFE Error Propagation on FEC-Based High-Speed I/O Links

An Iterative Noncoherent Relay Receiver for the Two-way Relay Channel

High Data Rate QPSK Modulator with CCSDS Punctured FEC channel Coding for Geo-Imaging Satellite

RADIO SYSTEMS ETIN15. Channel Coding. Ove Edfors, Department of Electrical and Information Technology

Performance Analysis of Maximum Likelihood Detection in a MIMO Antenna System

a) Abasebanddigitalcommunicationsystemhasthetransmitterfilterg(t) thatisshowninthe figure, and a matched filter at the receiver.

2002 IEEE International Solid-State Circuits Conference 2002 IEEE

A New Approach to Layered Space-Time Code Design

Multiple Input Multiple Output Dirty Paper Coding: System Design and Performance

Design Trade-offs in the VLSI Implementation of High-Speed Viterbi Decoders and their Application to MLSE in ISI Cancellation

Near-Optimal Low Complexity MLSE Equalization

An HARQ scheme with antenna switching for V-BLAST system

To learn fundamentals of high speed I/O link equalization techniques.

Wireless Communication Systems: Implementation perspective

Digital Communications I: Modulation and Coding Course. Term Catharina Logothetis Lecture 12

Combining Modern Codes and Set- Partitioning for Multilevel Storage Systems

EE290C - Spring 2004 Advanced Topics in Circuit Design High-Speed Electrical Interfaces

10 Mb/s Single Twisted Pair Ethernet Implementation Thoughts Proof of Concept Steffen Graber Pepperl+Fuchs

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2012

AN INTRODUCTION TO ERROR CORRECTING CODES Part 2

DFE Error Performance Under 1000BASE-T1 Noise Environments

A Sphere Decoding Algorithm for MIMO

Convolutional Coding Using Booth Algorithm For Application in Wireless Communication

Goa, India, October Question: 4/15 SOURCE 1 : IBM. G.gen: Low-density parity-check codes for DSL transmission.

Study of Turbo Coded OFDM over Fading Channel

EFFECTIVE CHANNEL CODING OF SERIALLY CONCATENATED ENCODERS AND CPM OVER AWGN AND RICIAN CHANNELS

UNIVERSITY OF SOUTHAMPTON

CHANNEL MEASUREMENT. Channel measurement doesn t help for single bit transmission in flat Rayleigh fading.

Implementation of Different Interleaving Techniques for Performance Evaluation of CDMA System

100 Gb/s: The High Speed Connectivity Race is On

Low Power Digital Receivers for Multi- Gb/s Wireline/Optical Communication

Using TCM Techniques to Decrease BER Without Bandwidth Compromise. Using TCM Techniques to Decrease BER Without Bandwidth Compromise. nutaq.

TDECQ update noise treatment and equalizer optimization (revision of king_3bs_02_0217_smf)

NOVEL 6-PSK TRELLIS CODES

Comparison of Linear and Non-Linear Equalizer using the Maltlab

Transcription:

Enabling Improved DSP Based Receivers for 100G Backplane Dariush Dabiri 802.3bj Task Force IEEE 802.3 Interim September 2011 1

Agenda Goals Introduction Partial Response Channel (PRC) Signaling Quasi-catastrophic Trellis Precoding MLSD vs Symbol-by-Symbol Detection Critical Path of the Viterbi Detector Double Terminated Trellis Simulation Results Conclusion 2

Goal It has been shown that the DSP based PAM4 solution can cover the vast majority of the installed base backplanes with 6dB SNR margin (parthasarathy_01a_0511.pdf). Our goal is to show that we can increase the link margin to about 9 db with a minor modification at the transmitter. Increase in SNR margin will allow more reliable operation over worst case channel and better power/performance tradeoffs. 3

Introduction For M-PAM signaling, Maximum Likelihood Sequence Detection (MLSD) will provide 3 db gain over symbol-by-symbol detection, if M is a small number. Both MLSD and symbol-by-symbol detection (e.g. DFE) suffer from error propagation for worst case BP channels. A simple, low cost 1/(1+D) pre-coding: mitigates error propagation from the DFE based solution. eliminates quasi-catastrophic paths from the MLSD trellis. We will show that a small modification in the transmitter allows a low power implementation of the MLSD. 4

Basic Partial Response Channel PAM4- Constellation Mapper Precoder 3 Tap Linear Deemphasis Filter/ Line Driver Back Plane Channel CTLF/ PGA ADC FFF MLSD/Symbol-by- Symbol Detector Equivalent PRC Combination of the de-emphasis filter, the backplane channel, the CTLF and the FFF (Feed Forward Filter) creates an equivalent channel which approximates a simple 1+D channel. At the output of the FFF one can employ either: Symbol-by-Symbol Detector: Implemented by a DFE with small tap weights. MLSD using Viterbi Detection A combination of both 5

MLSD Trellis 3/[6, 4, 2, 0] 1/[4, 2, 0, -2] -1/[2, 0, -2, -4] -3/[0, -2, -4, -6] Trellis Structure for Precoded MLSD Each state signifies the previous input to the channel. Each branch signifies the output of the channel. Each section of the trellis has 4 states and 16 branches. 6

Quasi-Catastrophic Trellis A trellis is quasi-catastrophic if it contains distinct states for which one or more of the output sequences that start from those states are identical for all subsequent time, but for which the total probability of all such non-distinguishable sequences is zero, Forney and Calderbank: Coset Codes for Partial Response Channels: or, Coset Codes with Spectral Nulls, IEEE Trans. IT, Nov 1989. The trellis of the (1 + D) channel is quasicatastrophic : all-zero output sequence can start from any state. For a precoded channel, the output sequence is congruent to the input of the precoder modulo M. 7

MLSD vs Symbol-by-Symbol Detection It is well known that MLSD offers approximately 3 db extra gain compare to the symbol-by-symbol detection for the 1+D channel. If the impulse response of the equivalent channel deviates from 1 + D, the MLSD gain will be less, but never worse than symbol-by-symbol detection. The MLSD requires implementing the Viterbi algorithm or one of its approximations. The Viterbi algorithm has the critical loop of Add- Compare-Select (ACS) which makes it hard to implement in high speed applications. 8

Viterbi Algorithm An example: One needs to recursively computes metrics for each state at each stage of the trellis: metric(s) = min(metric(s ) + metric(s s )) Only the path s s which takes on the minimum value survives. The logic implementation of the algorithm at 14G is challenging: linear speed up with number of processing units. 9

Block MLSD: Double Terminated Trellis In double terminated trellis the two ends of the trellis are known at the receiver: 3/[6, 4, 2, 0] 1/[4, 2, 0, -2] -1/[2, 0, -2, -4] -3/[0, -2, -4, -6] Viterbi Detection of Sequence: 0, 4, 0, -6, 6, 2,0 Note that the complete detection can be achieved by the information available within the block boundary defined by termination symbols. Therefore one can allow implementation of the detection of multiple blocks to overlap in time. 10

Complete Model Termination Symbols PRBS FEC Encoder PAM4- Constellation Mapper + Mod 3 Tap Linear Deemphasis Filter/ Line Driver D DFE Channel CTLF/ PGA ADC FFF + Slicer MLSD FEC Decoder Remove the termination symbol before FEC decoder 11

Termination Symbol The PRBS sequence is locked between the transmitter and receiver during the start-up: It is known to both ends. The termination channel symbol corresponds to exactly one PAM4 symbol before precoder: At the transmitter, one can add the termination symbol after the precoder, and remove it after the detector. End-to-end frame structure will not be affected by insertion and deletion of the termination symbol. One possible choice is to add one termination symbols for N = 255 symbols: Corresponds to 0.39% over-clocking SNR loss of around 0.05 db due to over-clocking. 12

Simulation Results Before FEC Decoder 13

Complexity Block processing allows an straight forward implementation of a 4-state Viterbi detector to work at 100G: Alternatively one has to unroll the ACS loop: Fettweis and Meyr, High-rate Viterbi processor: a systolic array solution, IEEE JSAC, Oct, 1990. Synthesis results indicates that the power/area of the detector is less or comparable with the FEC decoder Based on a 28nm node library. 14

Conclusion We can increase the performance of the DSP based implementations by about 3 db with a very minor modification in the transmitter and relatively small increase in power at the receiver. The proposed change can easily be adapted to all proposed FEC frame structures. 15

Thank you! 16