INTERNATIONAL JOURNAL OF PROFESSIONAL ENGINEERING STUDIES Volume VIII /Issue 1 / DEC 2016

Similar documents
Design and Analysis of Partially Parallel Encoder for 16-Bit Polar Codes

ENCODER ARCHITECTURE FOR LONG POLAR CODES

High-performance Parallel Concatenated Polar-CRC Decoder Architecture

On Path Memory in List Successive Cancellation Decoder of Polar Codes

Low Complexity Belief Propagation Polar Code Decoder

Error Patterns in Belief Propagation Decoding of Polar Codes and Their Mitigation Methods

XJ-BP: Express Journey Belief Propagation Decoding for Polar Codes

Polar Codes for Magnetic Recording Channels

Low Complexity List Successive Cancellation Decoding of Polar Codes

Low Power Error Correcting Codes Using Majority Logic Decoding

Hamming net based Low Complexity Successive Cancellation Polar Decoder

AN EFFICIENT DESIGN OF ROBA MULTIPLIERS 1 BADDI. MOUNIKA, 2 V. RAMA RAO M.Tech, Assistant professor

On the Construction and Decoding of Concatenated Polar Codes

A Novel High-Rate Polar-Staircase Coding Scheme

arxiv: v1 [cs.it] 31 Aug 2015

Modified Booth Encoding Multiplier for both Signed and Unsigned Radix Based Multi-Modulus Multiplier

Modified Booth Multiplier Based Low-Cost FIR Filter Design Shelja Jose, Shereena Mytheen

Design and Implementation of 64-bit MAC Unit for DSP Applications using verilog HDL

On Error-Correction Performance and Implementation of Polar Code List Decoders for 5G

JDT LOW POWER FIR FILTER ARCHITECTURE USING ACCUMULATOR BASED RADIX-2 MULTIPLIER

An Optimized Wallace Tree Multiplier using Parallel Prefix Han-Carlson Adder for DSP Processors

High performance Radix-16 Booth Partial Product Generator for 64-bit Binary Multipliers

FPGA Implementation of Area-Delay and Power Efficient Carry Select Adder

Design of Area and Power Efficient FIR Filter Using Truncated Multiplier Technique

Area Power and Delay Efficient Carry Select Adder (CSLA) Using Bit Excess Technique

Optimization of energy consumption in a NOC link by using novel data encoding technique

Postprint. This is the accepted version of a paper presented at IEEE International Microwave Symposium, Hawaii.

Design of Rate-Compatible Parallel Concatenated Punctured Polar Codes for IR-HARQ Transmission Schemes

Design and Implementation of Digit Serial Fir Filter

Capacity-Achieving Rateless Polar Codes

Decoding Distance-preserving Permutation Codes for Power-line Communications

Blind Detection of Polar Codes

An FPGA Implementation Architecture for Decoding of Polar Codes

S.Nagaraj 1, R.Mallikarjuna Reddy 2

FOR HIGH SPEED LOW POWER APPLICATIONS USING RADIX-4 MODIFIED BOOTH ENCODER

DESIGN & IMPLEMENTATION OF FIXED WIDTH MODIFIED BOOTH MULTIPLIER

Combination of SDC-SDF Architecture for I/O Pipelined Radix-2 FFT

Observations on Polar Coding with CRC-Aided List Decoding

Low-Complexity High-Order Vector-Based Mismatch Shaping in Multibit ΔΣ ADCs Nan Sun, Member, IEEE, and Peiyan Cao, Student Member, IEEE

High Speed Binary Counters Based on Wallace Tree Multiplier in VHDL

Implementation of Reed Solomon Decoder for Area Critical Applications

How (Information Theoretically) Optimal Are Distributed Decisions?

Efficient Implementation on Carry Select Adder Using Sum and Carry Generation Unit

Reconfigurable Sequential Minimal Optimization Algorithm for High- Throughput MIMO-OFDM Systems

MULTIPATH fading could severely degrade the performance

SQRT CSLA with Less Delay and Reduced Area Using FPGA

Cooperative Punctured Polar Coding (CPPC) Scheme Based on Plotkin s Construction

2 Assistant Professor, Dept of ECE, Universal College of Engineering & Technology, AP, India,

Design and Implementation of 128-bit SQRT-CSLA using Area-delaypower efficient CSLA

Polar Codes for Probabilistic Amplitude Shaping

R.S. ENCODERS OF LOW POWER DESIGN

VHDL based Design of Convolutional Encoder using Vedic Mathematics and Viterbi Decoder using Parallel Processing

An Efficient Design of Parallel Pipelined FFT Architecture

An Efficient Method for Implementation of Convolution

A Study of Polar Codes for MLC NAND Flash Memories

2 Assoc Prof, Dept of ECE, George Institute of Engineering & Technology, Markapur, AP, India,

Lecture 13 February 23

Irregular Polar Coding for Massive MIMO Channels

Area and Energy-Efficient Crosstalk Avoidance Codes for On-Chip Buses

Outline. Communications Engineering 1

THE ever-increasing demand to accommodate various

LOW POWER AND AREA EFFICIENT PARALLEL FIR DIGITAL FILTER STRUCTURE USING MODIFIED SQRT CARRY SELECT ADDER

UNEQUAL POWER ALLOCATION FOR JPEG TRANSMISSION OVER MIMO SYSTEMS. Muhammad F. Sabir, Robert W. Heath Jr. and Alan C. Bovik

A Survey on Power Reduction Techniques in FIR Filter

HIGH SPEED FIXED-WIDTH MODIFIED BOOTH MULTIPLIERS

Removal of High Density Salt and Pepper Noise through Modified Decision based Un Symmetric Trimmed Median Filter

Estimation of I/Q Imblance in Mimo OFDM System

Fixed Point Lms Adaptive Filter Using Partial Product Generator

A FPGA Implementation of Power Efficient Encoding Schemes for NoC with Error Detection

Bit-permuted coded modulation for polar codes

A High-Throughput Memory-Based VLC Decoder with Codeword Boundary Prediction

A High Speed Wallace Tree Multiplier Using Modified Booth Algorithm for Fast Arithmetic Circuits

Energy Efficient Decoder Architecture For Communication System

A Hardware Efficient FIR Filter for Wireless Sensor Networks

REVIEW OF COOPERATIVE SCHEMES BASED ON DISTRIBUTED CODING STRATEGY

DESIGN OF MULTIPLE CONSTANT MULTIPLICATION ALGORITHM FOR FIR FILTER

Methods for Reducing the Activity Switching Factor

Reversible Data Hiding in JPEG Images Based on Adjustable Padding

Design and Analysis of Approximate Compressors for Multiplication

Performance Analysis of Multipliers in VLSI Design

THE ERROR correcting code in a magnetic recording

Decoding of Block Turbo Codes

6. FUNDAMENTALS OF CHANNEL CODER

Low Power FIR Filter Design Based on Bitonic Sorting of an Hardware Optimized Multiplier S. KAVITHA POORNIMA 1, D.RAHUL.M.S 2

Architecture for Canonic RFFT based on Canonic Sign Digit Multiplier and Carry Select Adder

BER Performance of Polar Coded OFDM in Multipath Fading

An Analysis of Multipliers in a New Binary System

International Journal of Scientific & Engineering Research Volume 9, Issue 3, March ISSN

JDT EFFECTIVE METHOD FOR IMPLEMENTATION OF WALLACE TREE MULTIPLIER USING FAST ADDERS

Design and Performance Analysis of 64 bit Multiplier using Carry Save Adder and its DSP Application using Cadence

Study of Turbo Coded OFDM over Fading Channel

A Combined SDC-SDF Architecture for Normal I/O Pipelined Radix-2 FFT

An Design of Radix-4 Modified Booth Encoded Multiplier and Optimised Carry Select Adder Design for Efficient Area and Delay

Design of Area-Delay-Power Efficient Carry Select Adder Using Cadence Tool

An Optimized Design for Parallel MAC based on Radix-4 MBA

Design A Redundant Binary Multiplier Using Dual Logic Level Technique

Coding and Modulation

n Based on the decision rule Po- Ning Chapter Po- Ning Chapter

ISSN: X International Journal of Advanced Research in Electronics and Communication Engineering (IJARECE) Volume 1, Issue 5, November 2012

VLSI Implementation of Impulse Noise Suppression in Images

Transcription:

VLSI DESIGN OF A HIGH SPEED PARTIALLY PARALLEL ENCODER ARCHITECTURE THROUGH VERILOG HDL Pagadala Shivannarayana Reddy 1 K.Babu Rao 2 E.Rama Krishna Reddy 3 A.V.Prabu 4 pagadala1857@gmail.com 1,baburaokodavati@gmail.com 2,ece.rkreddy@usharama.in 3,prabu.deva@gmail.com 4 1 PG Scholar, Dept Of VLSI & Embedded systems,usha Rama College Of Engineering And Technology, Unguturu(M), Telaprolu(V), Krishna Dist., Andhra Pradesh 2 Associate Professor, Usha Rama College Of Engineering And Technology, Unguturu(M), Telaprolu(V), Krishna Dist., Andhra Pradesh 3 Assistant Professor, Usha Rama College Of Engineering And Technology, Unguturu(M), Telaprolu(V), Krishna Dist., Andhra Pradesh 4 Associate Professor, K.L University, Vaddeswaram,Guntur Dist., Andhra Pradesh Abstract:Due to the channel achieving property, the polar code has become one of the most favorable error-correcting codes. As the polar code achieves the property asymptotically, however, it should be long enough to have a good errorcorrecting performance. Although the previous fully parallel encoder is intuitive and easy to implement, it is not suitable for long polar codes because of the huge hardware complexity required. In this brief, we analyze the encoding process in the viewpoint of very-large-scale integration implementation and propose a new efficient encoder architecture that is adequate for long polar codes and effective in alleviating the hardware complexity. As the proposed encoder allows high-throughput encoding with small hardware complexity, it can be systematically applied to the design of any polar code and to any level of parallelism. Index Terms--Polar codes, polar encoder, very-large-scale integration (VLSI) optimization. I.Introduction The polar code is a new class of error correcting codes that provably achieves the capacity of the underlying channels. Though the polar code achieves the underlying channel capacity, the property is asymptotical, since a good error correcting performance is obtained when the code is sufficiently long. The polar code has been regarded as being associated with low complexity, such a long polar code suffers from severe hardware complexity and long latency. Therefore, an architecture that can efficiently deal with long polar codes is necessary to make the VLSI implementation feasible.among a few manuscripts dealing with the hardware implementation presented a straightforward encoding architecture that processes all the message bits in a fully parallel manner. The fully parallel architecture is intuitive and easy to implement, but it is not suitable for long polar codes due to the excessive hardware complexity. Hence we present the encoding process in the viewpoint of VLSI implementation and proposes a partially parallel architecture. The proposed encoder is highly attractive in implementing a long polar encoder, as it can achieve a high throughput with a small hardware complexity. II. Polar Encoding The polar code utilizes the channel polarization phenomenon that each channel approaches either a perfectly reliable or a completely noisy channel as the code length goes to infinity over a combined channel constructed with a set of N identical sub-channels. As the reliability of each sub-channel is known as priori, K most reliable sub-channels are set to

predetermined values to construct a polar (N,K) code. Since the polar code belongs to the class of linear block codes, the encoding process can be characterized by the generator matrix. The generator matrix GN for code length N (2n ) is obtained by applying the nth Kronecker power to the kernel matrix. Given the generator matrix, the code word is computed by x=ugn, where u and x represent information and code word vectors, respectively. Throughout the paper, we assume that information vector u is arranged in a natural order, whereas code word vector x is arranged in a bit reversed order.. III. Proposed Polar Encoder In this section, we propose a partially parallel structure to encode long polar codes efficiently. To clearly show the proposed approach and how to transform the architecture, a 4-parallel encoding architecture for the 16-bit polar code is exemplified in depth. The fully parallel encoding architecture is first transformed to a folded form. Given the 16-bit DFG, the 4-parallel folded architecture that processes 4 bits at a time can be realized with placing two functional units in each stage since the functional unit computes 2 bits at a time. Existing system A polar code with a length of 16 bits is implemented with 32 XOR gates and the processed with 4 stages as shown in fig.1. The fully parallel encoder is intuitively designed based on the generator matrix, but implementing such an encoder becomes a significant burden when a long polar code is used to achieve a good error correcting performance. The memory size and the number of XOR gates increase as the code length increases. Fig. 1. Fully parallel architecture for encoding a 16-bit polar code. Fig 2. Proposed 4-parallel folded architecture for encoding the polar (16, K) codes. In the folding transformation, determining a folding set, which represents the order of operations to be executed in a functional unit, is the most important design factor.to construct efficient folding sets, all operations in the fully parallel encoding are first classified as separate folding sets. Since the input is in a natural order, it is reasonable to alternatively distribute the operations in the consecutive order. Thus, each stage consists of two folding sets, each of which contains only odd or even operations to be performed by a separate unit.

Considering the four-parallel input sequence in a natural order, stage 1 has two folding sets of {A0, A2, A4, A6} and {A1, A3, A5, A7}. Each folding set contains four elements, and the position of an element represents the operational order in the corresponding functional unit. Two functional units for stage 1 execute A0 and A1 simultaneously at the beginning and A2 and 3 at the next cycle, and so forth. The folding sets of stage 2 have the same order as those of stage 1, i.e., { B0, B2, B4, B6} and {B1, B3, B5, B7}, since the four-parallel input sequence of stage 2 is equal to that of stage 1. Furthermore, to determine the folding sets of another stage s, the property that the functional unit processes a pair of inputs whose indices differ by 2s 1 is exploited. In the case of stage 3, two data whose indices differ by 4 are processed together, which implies that the operational distance of the corresponding data is two as the kernel functional unit computes two data at a time. For instance, w 2,0 and w2,4 that come from B0 and B2 are used as the inputs to C0. Since both inputs should be valid to be processed in a functional unit, the operations in stage 3 are aligned to the late input data. Cyclic shifting the folding sets right by one, which can be realized by inserting a delay of one time unit, is to enable full utilization of the functional units by overlapping adjacent iterations. As a result, the folding sets of stage 3 are determined to {C6, C0, C2, C4} and {C7, C1, C3, C5}, where C6 in the current iteration is overlapped with A0 and B0 in the next iteration. In the same manner, the property that the functional unit processes a pair of inputs whose indices differ by 8 is exploited in stage 4. The folding sets of stage 4 are {D2, D4, D6, D0} and {D3, D5, D7, D1}, which are obtained by cyclic shifting the previous folding sets of stage 3 by two. Generally speaking, a stage whose index s is less than or equal to log2 P, where P is the level of parallelism, has the same folding sets determined by evenly interleaving the operations in the consecutive order, and another stage whose index s is larger than log2 P has the folding sets obtained by cyclic shifting the previous folding sets of stage s 1 right by s log2 P. IV.Results: Simulation results: 4 bit Design summary:

Synthesis report: 32 bit Simulation results: V. Conclusion This brief has presented a new partially parallel encoder architecture developed for long polar codes. Many optimization techniques have been applied to derive the proposed architecture. Experimental results show that the proposed architecture can save the hardware by up to 73% compared with that of the fully parallel architecture. Finally, the relationship between the hardware complexity and the throughputs is analyzed to select the most suitable architecture for a given application. Therefore, the proposed architecture provides a practical solution for encoding a long polar code. REFERENCES [1] E. Arikan, Channel polarization: A method for constructing capacity achieving codes for symmetric binary-input memory less channels, IEEE Trans. Inf. Theory, vol. 55, no. 7, pp. 3051 3073, Jul. 2009. Design summary: [2] R. Mori and T. Tanaka, Performance of polar codes with the construction using density evolution, IEEE Commun. Lett., vol. 13, no. 7, pp. 519 521, Jul. 2009. [3] S. B. Korada, E. Sasoglu, and R. Urbanke, Polar codes: Characterization of exponent, bounds, constructions, IEEE Trans. Inf. Theory, vol. 56, no. 12, pp. 6253 6264, Dec. 2010. Synthesis report: [4] I. Tal and A. Vardy, List decoding of polar codes, inproc. IEEE ISIT, 2011, pp. 1 5. [5] K. Chen, K. Niu, and J. Lin, Improved successive cancellation decoding of polar codes, IEEE Trans. Commun., vol. 61, no. 8, pp. 3100 3107, Aug. 2013. [6] G. Sarkis and W. J. Gross, Polar codes for data storage applications, in Proc. ICNC, 2013, pp. 840 844.

[7] G. Sarkis, P. Giard, A. Vardy, C. Thibeault, and W. J. Gross, Fast polar decoders: Algorithm and implementation, IEEE J. Sel. Areas Commun., vol. 32, no. 5, pp. 946 957, May 2014. [8] G. Berhault, C. Leroux, C. Jego, and D. Dallet, Partial sums generation architecture for successive cancellation decoding of polar codes, inproc. IEEE Workshop SiPS, Oct. 2013, pp. 407 412. [9] B. Yuan and K. K. Parhi, Low-latency successive-cancellation polar decoder architectures using 2-bit decoding, IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 61, no. 4, pp. 1241 1254, Apr. 2014. [10] C. Leroux, A. J. Raymond, G. Sarkis, and W. J. Gross, A semi-parallel successive-cancellation decoder for polar codes, IEEE Trans. Signal Process., vol. 61, no. 2, pp. 289 299, Jan. 2013.