FPGA-Based Design and Implementation of a Multi-Gbps LDPC Decoder

Similar documents
FPGA-BASED DESIGN AND IMPLEMENTATION OF A MULTI-GBPS LDPC DECODER. Alexios Balatsoukas-Stimming and Apostolos Dollas

Low-complexity Low-Precision LDPC Decoding for SSD Controllers

A WiMAX/LTE Compliant FPGA Implementation of a High-Throughput Low-Complexity 4x4 64-QAM Soft MIMO Receiver

Digital Television Lecture 5

Vector-LDPC Codes for Mobile Broadband Communications

LDPC Decoding: VLSI Architectures and Implementations

FPGA Implementation Of An LDPC Decoder And Decoding. Algorithm Performance

A 32 Gbps 2048-bit 10GBASE-T Ethernet Energy Efficient LDPC Decoder with Split-Row Threshold Decoding Method

High-performance Parallel Concatenated Polar-CRC Decoder Architecture

XJ-BP: Express Journey Belief Propagation Decoding for Polar Codes

Performance Evaluation of Low Density Parity Check codes with Hard and Soft decision Decoding

Reduced-Complexity VLSI Architectures for Binary and Nonbinary LDPC Codes

LDPC Code Length Reduction

EE 435/535: Error Correcting Codes Project 1, Fall 2009: Extended Hamming Code. 1 Introduction. 2 Extended Hamming Code: Encoding. 1.

Constellation Shaping for LDPC-Coded APSK

The Case for Optimum Detection Algorithms in MIMO Wireless Systems. Helmut Bölcskei

FOR THE PAST few years, there has been a great amount

THE extension of binary Low-Density Parity-Check

Project. Title. Submitted Sources: {se.park,

On the reduced-complexity of LDPC decoders for ultra-high-speed optical transmission

POLAR codes [1] received a lot of attention in the recent. PolarBear: A 28-nm FD-SOI ASIC for Decoding of Polar Codes

Iterative Joint Source/Channel Decoding for JPEG2000

Observations on Polar Coding with CRC-Aided List Decoding

Decoding of Block Turbo Codes

Error Patterns in Belief Propagation Decoding of Polar Codes and Their Mitigation Methods

ARCHITECTURE AND FINITE PRECISION OPTIMIZATION FOR LAYERED LDPC DECODERS

Frequency-Hopped Spread-Spectrum

Implementation of Block Turbo Codes for High Speed Communication Systems

Optimized BPSK and QAM Techniques for OFDM Systems

Low Complexity, Flexible LDPC Decoders

Low-Complexity LDPC-coded Iterative MIMO Receiver Based on Belief Propagation algorithm for Detection

The throughput analysis of different IR-HARQ schemes based on fountain codes

Hardware-Efficient Node Processing Unit Architectures for Flexible LDPC Decoder Implementations

Coding & Signal Processing for Holographic Data Storage. Vijayakumar Bhagavatula

Q-ary LDPC Decoders with Reduced Complexity

Optimized Codes for the Binary Coded Side-Information Problem

Design High speed Reed Solomon Decoder on FPGA

White Paper Unlocking the Potential of LDPC, New FlexLDPC Coding from. Datum Systems. for PSM-500, 500L & 500LT Series Modems

Evaluation of flexible SPA based LDPC decoder using hardware friendly approximation methods

LDPC Communication Project

High-Throughput VLSI Implementations of Iterative Decoders and Related Code Construction Problems

Low Complexity Belief Propagation Polar Code Decoder

IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 50, NO. 1, JANUARY

Performance of Combined Error Correction and Error Detection for very Short Block Length Codes

Low Power LDPC Decoder design for ad standard

LDPC FEC PROPOSAL FOR EPOC. Richard S. Prodan Broadcom Corporation

n Based on the decision rule Po- Ning Chapter Po- Ning Chapter

FPGA based Prototyping of Next Generation Forward Error Correction

FORWARD ERROR CORRECTION PROPOSAL FOR EPOC PHY LAYER

Asymptotic Analysis And Design Of Iterative Receivers For Non Linear ISI Channels

Performance Enhancement of MC-CDMA System through Turbo Block Codes

A Novel High-Throughput, Low-Complexity Bit-Flipping Decoder for LDPC Codes

CHAPTER 4 LINK ADAPTATION USING NEURAL NETWORK

Design and implementation of LDPC decoder using time domain-ams processing

Joint Viterbi Detector/Decoder for Satellite Comms.

LDPC decoder architecture for DVB-S2 and DVB-S2X standards

3GPP TSG RAN WG1 Meeting #85 R Decoding algorithm** Max-log-MAP min-sum List-X

Performance Optimization of Hybrid Combination of LDPC and RS Codes Using Image Transmission System Over Fading Channels

A Survey of Advanced FEC Systems

An adaptive low-power LDPC decoder using SNR estimation

ITERATIVE decoding of classic codes has created much

p J Data bits P1 P2 P3 P4 P5 P6 Parity bits C2 Fig. 3. p p p p p p C9 p p p P7 P8 P9 Code structure of RC-LDPC codes. the truncated parity blocks, hig

On Path Memory in List Successive Cancellation Decoder of Polar Codes

Iterative Soft Decision Based Complex K-best MIMO Decoder

HARDWARE-EFFICIENT IMPLEMENTATION OF THE SOVA FOR SOQPSK-TG

DESIGN OF LOW POWER / HIGH SPEED MULTIPLIER USING SPURIOUS POWER SUPPRESSION TECHNIQUE (SPST)

Transmission Channel Noise Aware Energy Effective LDPC Decoding

Optimized Degree Distributions for Binary and Non-Binary LDPC Codes in Flash Memory

Chaotic Architectures for Secure Free-Space Optical Communication

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /ICCE.2012.

Rate-Adaptive LDPC Convolutional Coding with Joint Layered Scheduling and Shortening Design

Improving LDPC Decoders via Informed Dynamic Scheduling

Construction of Adaptive Short LDPC Codes for Distributed Transmit Beamforming

LOW POWER LOW-DENSITY PARITY-CHECKING (LDPC) CODES DECODER DESIGN USING DYNAMIC VOLTAGE AND FREQUENCY SCALING. A Thesis WEIHUANG WANG

IEEE C /02R1. IEEE Mobile Broadband Wireless Access <

Rekha S.M, Manoj P.B. International Journal of Engineering and Advanced Technology (IJEAT) ISSN: , Volume-2, Issue-6, August 2013

A Novel LDPC Decoder for DVB-S2 IP

Study on AR4JA Code in Deep Space Fading Channel

Improvement Of Block Product Turbo Coding By Using A New Concept Of Soft Hamming Decoder

Blind Detection of Polar Codes

SourceSync. Exploiting Sender Diversity

An Iterative BP-CNN Architecture for Channel Decoding

AREA EFFICIENT DISTRIBUTED ARITHMETIC DISCRETE COSINE TRANSFORM USING MODIFIED WALLACE TREE MULTIPLIER

LDPC codes for OFDM over an Inter-symbol Interference Channel

NONBINARY low-density parity-check (NB-LDPC)

Advanced channel coding : a good basis. Alexandre Giulietti, on behalf of the team

CprE 583 Reconfigurable Computing

Video Transmission over Wireless Channel

New Forward Error Correction and Modulation Technologies Low Density Parity Check (LDPC) Coding and 8-QAM Modulation in the CDM-600 Satellite Modem

Performance comparison of convolutional and block turbo codes

BER Mitigation in LDPC-COFDM System with Different Modulation Schemes over Generalized Gamma Fading Channel

Energy Consumption of Channel Decoders for OFDM-based UWB Systems

Hardware Implementation of BCH Error-Correcting Codes on a FPGA

Time Division Multiplexing for Green Broadcasting

Semi-Parallel Architectures For Real-Time LDPC Coding

Using TCM Techniques to Decrease BER Without Bandwidth Compromise. Using TCM Techniques to Decrease BER Without Bandwidth Compromise. nutaq.

End-To-End Communication Model based on DVB-S2 s Low-Density Parity-Check Coding

By Kung Chi Cinnati Loi. c Kung Chi Cinnati Loi, August All rights reserved.

INCREMENTAL REDUNDANCY LOW-DENSITY PARITY-CHECK CODES FOR HYBRID FEC/ARQ SCHEMES

Application of QAP in Modulation Diversity (MoDiv) Design

Transcription:

FPGA-Based Design and Implementation of a Multi-Gbps LDPC Decoder Alexios Balatsoukas-Stimming and Apostolos Dollas Technical University of Crete Dept. of Electronic and Computer Engineering August 30, 2012 FPL 2012, Oslo A. Balatsoukas-Stimming and A. Dollas FPL 12: FPGA-Based Multi-Gbps LDPC Decoder 1 / 18

1 Introduction 2 Channel Model LDPC Codes Decoding Algorithm 3 4 A. Balatsoukas-Stimming and A. Dollas FPL 12: FPGA-Based Multi-Gbps LDPC Decoder 2 / 18

LDPC Codes LDPC codes: Forward Error Correction (FEC) codes that exhibit excellent error correction performance. Adopted by many present and future standards. Hardware-friendly due to inherent parallelism of decoding algorithms. FPGAs can support multiple standards and rates via runtime reconfiguration. A. Balatsoukas-Stimming and A. Dollas FPL 12: FPGA-Based Multi-Gbps LDPC Decoder 3 / 18

Channel Model LDPC Codes Decoding Algorithm Channel Model Memoryless AWGN channel: y i = x i + n i, n i N (0, σ 2 ). Decoding consists of finding most likely x i, based on y, for all i = 1,..., n: ˆx i = arg max p(x i y) NP-hard! x i A. Balatsoukas-Stimming and A. Dollas FPL 12: FPGA-Based Multi-Gbps LDPC Decoder 4 / 18

LDPC Codes Introduction Channel Model LDPC Codes Decoding Algorithm Defined through a parity-check matrix H. Represented by a Tanner graph Decoding via Min-Sum message passing on the graph. A. Balatsoukas-Stimming and A. Dollas FPL 12: FPGA-Based Multi-Gbps LDPC Decoder 5 / 18

Channel Model LDPC Codes Decoding Algorithm Min-Sum Decoding Algorithm Variable-to-check: L ij = Check-to-variable: R ij = Initial LLR {}}{ 2y i /σ 2 + k V(i)/j k C(i)/j R ki. sign(l ki ) min L ki. k V(i)/j A maximum of k iterations is performed. A. Balatsoukas-Stimming and A. Dollas FPL 12: FPGA-Based Multi-Gbps LDPC Decoder 6 / 18

LDPC s Fully Parallel: Every VN and CN represented in hardware. Very high throughput. High hardware utilization, very complex routing. Serial: One VN and one CN. Low throughput. Very efficient hardware utilization, minimal routing. Partially Parallel: Compromise between the two. A. Balatsoukas-Stimming and A. Dollas FPL 12: FPGA-Based Multi-Gbps LDPC Decoder 7 / 18

(n, m) signed fixed point quantization: Total of n bits for each message. m bits are used for fractional part. (n 1, m 1 ) (n 2, m 2 ) hybrid quantization: (n 1, m 1 ) quantization for initial LLR messages. (n 2, m 2 ) quantization for variable-to-check and check-to-variable messages. By choosing n 2 < n 1, routing and processing unit complexity can be significantly reduced. A. Balatsoukas-Stimming and A. Dollas FPL 12: FPGA-Based Multi-Gbps LDPC Decoder 8 / 18

Effect of on Performance 10 10 2 Bit Error Rate 10 3 Uncoded 10 4 Min Sum (2,1) (3,1) 10 5 (4,1) (5,1) 10 6 (3,1) (2,1) (4,1) (2,1) (4,1) (3,1) 10 7 1 1.5 2 2.5 3 3.5 Eb/N0 (db) Comparison of (4,1) with hybrid: (4, 1) (3, 1): negligible loss, -25% wires, -45% LUTs. (3, 1) (2, 1): 0.75 db loss, -50% wires, -71% LUTs. A. Balatsoukas-Stimming and A. Dollas FPL 12: FPGA-Based Multi-Gbps LDPC Decoder 9 / 18

Straightforward combinational logic implementing the variable and check node update rules. Variable node. Check node. A. Balatsoukas-Stimming and A. Dollas FPL 12: FPGA-Based Multi-Gbps LDPC Decoder 10 / 18

Usually, one decoding iteration is considered to last one clock cycle high path delays. Idea: add registers to reduce path delays. Problem: decoding now takes twice as many cycles. Observation: at each cycle, either VNs or CNs are idle. Solution: decode two codewords simultaneously. A. Balatsoukas-Stimming and A. Dollas FPL 12: FPGA-Based Multi-Gbps LDPC Decoder 11 / 18

If after some iteration we have reached a valid codeword, decoding can halt. At high Eb/N0, this can lead to significant increase in average throughput. Significant I/O problems due to non-uniform distribution of required iterations. A. Balatsoukas-Stimming and A. Dollas FPL 12: FPGA-Based Multi-Gbps LDPC Decoder 12 / 18

Relative Frequency 0.7 0.6 0.5 0.4 0.3 0.2 0.1 Relative Frequency 0.35 0.3 0.25 0.2 0.15 0.1 0.05 0 1 2 3 4 5 6 7 8 9 10 Iterations 0 1 2 3 4 5 6 Iterations Eb/N0 = 2 db. Eb/N0 = 3.5 db. Idea: force decoder to perform at least k/2 iterations. Small impact on throughput. k guaranteed cycles for I/O. A. Balatsoukas-Stimming and A. Dollas FPL 12: FPGA-Based Multi-Gbps LDPC Decoder 13 / 18

Overall Datapath A. Balatsoukas-Stimming and A. Dollas FPL 12: FPGA-Based Multi-Gbps LDPC Decoder 14 / 18

Results This work Chandrasetty gain/ (4,1) (3,1) & Aziz (2011) loss Decoding Algorithm Min-Sum MMS Clock Frequency 154.30 MHz 149.00 MHz 3.5% Eb/N0 at 10 6 BER 3.5 db 4 db 0.50 db Av. Iter. at 10 6 BER 5.8 6.8 14.7% Av. Throughput 14.6 Gbps 12.6 Gbps 15.9% LUT Utilization 89.4% 98.5% 9.1% Max. Delay 220 ns 121 ns 81.8% A. Balatsoukas-Stimming and A. Dollas FPL 12: FPGA-Based Multi-Gbps LDPC Decoder 15 / 18

Results This work Chandrasetty gain/ (3,1) (2,1) & Aziz (2011) loss Decoding Algorithm Min-Sum MMS Clock Frequency 211.40 MHz 149.00 MHz 41.9% Eb/N0 at 10 6 BER 4.25 db 4 db 0.25 db Av. Iter. at 10 6 BER 5.6 6.8 17.6% Av. Throughput 21.6 Gbps 12.6 Gbps 71.4% LUT Utilization 47.6% 98.5% 50.9% Max. Delay 161 ns 121 ns 33.1% A. Balatsoukas-Stimming and A. Dollas FPL 12: FPGA-Based Multi-Gbps LDPC Decoder 16 / 18

Conclusion We presented an FPGA-based LDPC decoder architecture which: 1 Outperforms the state of the art by: 15.9% at a 0.50 db lower Eb/N0. 71.4% at a 0.25 db higher Eb/N0. 2 Requires 9.1% and 50.9% less logic, respectively. 3 Fully addresses the I/O problems due to early termination. A. Balatsoukas-Stimming and A. Dollas FPL 12: FPGA-Based Multi-Gbps LDPC Decoder 17 / 18

Thank you! Questions? A. Balatsoukas-Stimming and A. Dollas FPL 12: FPGA-Based Multi-Gbps LDPC Decoder 18 / 18