Design and Analysis of RNS Based FIR Filter Using Verilog Language

Similar documents
A New RNS 4-moduli Set for the Implementation of FIR Filters. Gayathri Chalivendra

JDT LOW POWER FIR FILTER ARCHITECTURE USING ACCUMULATOR BASED RADIX-2 MULTIPLIER

A Survey on Power Reduction Techniques in FIR Filter

Digital Finite Impulse Response Filter based on Residue Number System

Design and Analysis of CMOS Based DADDA Multiplier

An area optimized FIR Digital filter using DA Algorithm based on FPGA

Mahendra Engineering College, Namakkal, Tamilnadu, India.

Comparative Study and Analysis of Performances among RNS, DBNS, TBNS and MNS for DSP Applications

Reduced Complexity Wallace Tree Mulplier and Enhanced Carry Look-Ahead Adder for Digital FIR Filter

Design and Performance Analysis of 64 bit Multiplier using Carry Save Adder and its DSP Application using Cadence

Design of FIR Filter Using Modified Montgomery Multiplier with Pipelining Technique

A New High Speed Low Power Performance of 8- Bit Parallel Multiplier-Accumulator Using Modified Radix-2 Booth Encoded Algorithm

VLSI Implementation of Digital Down Converter (DDC)

Design of a High Speed FIR Filter on FPGA by Using DA-OBC Algorithm

Area Power and Delay Efficient Carry Select Adder (CSLA) Using Bit Excess Technique

Fast Fourier Transform utilizing Modified 4:2 & 7:2 Compressor

SINGLE MAC IMPLEMENTATION OF A 32- COEFFICIENT FIR FILTER USING XILINX

International Journal of Advanced Research in Computer Science and Software Engineering

Design of a Power Optimal Reversible FIR Filter ASIC Speech Signal Processing

Analysis Parameter of Discrete Hartley Transform using Kogge-stone Adder

Design and Performance Analysis of a Reconfigurable Fir Filter

Adder (electronics) - Wikipedia, the free encyclopedia

The Comparative Study of FPGA based FIR Filter Design Using Optimized Convolution Method and Overlap Save Method

[Devi*, 5(4): April, 2016] ISSN: (I2OR), Publication Impact Factor: 3.785

Power Efficient Weighted Modulo 2 n +1 Adder

Implementation of FPGA based Design for Digital Signal Processing

IMPLEMENTATION OF DIGITAL FILTER ON FPGA FOR ECG SIGNAL PROCESSING

International Journal of Advance Engineering and Research Development

Multiplier Design and Performance Estimation with Distributed Arithmetic Algorithm

Modified Design of High Speed Baugh Wooley Multiplier

Design of Digital FIR Filter using Modified MAC Unit

FPGA Implementation of Adaptive Noise Canceller

An Efficient Method for Implementation of Convolution

High performance Radix-16 Booth Partial Product Generator for 64-bit Binary Multipliers

Performance Analysis of a 64-bit signed Multiplier with a Carry Select Adder Using VHDL

Digital Signal Processing

Rapid Design of FIR Filters in the SDR- 500 Software Defined Radio Evaluation System using the ASN Filter Designer

Design and Implementation of Complex Multiplier Using Compressors

Modified Booth Encoding Multiplier for both Signed and Unsigned Radix Based Multi-Modulus Multiplier

An Extensive Review on Residue Number System for Improving Computer Arithmetic Operations

FIR Filter Design on Chip Using VHDL

SIGNED PIPELINED MULTIPLIER USING HIGH SPEED COMPRESSORS

Efficient Multi-Operand Adders in VLSI Technology

FPGA Implementation of Digital Modulation Techniques BPSK and QPSK using HDL Verilog

Resource Efficient Reconfigurable Processor for DSP Applications

ASIC Design and Implementation of SPST in FIR Filter

Digital Integrated CircuitDesign

Index Terms. Adaptive filters, Reconfigurable filter, circuit optimization, fixed-point arithmetic, least mean square (LMS) algorithms. 1.

FPGA Implementation of Serial and Parallel FIR Filters by using Vedic and Wallace tree Multiplier

Design of Multiplier Less 32 Tap FIR Filter using VHDL

VLSI IMPLEMENTATION OF MODIFIED DISTRIBUTED ARITHMETIC BASED LOW POWER AND HIGH PERFORMANCE DIGITAL FIR FILTER Dr. S.Satheeskumaran 1 K.

Area Efficient and Low Power Reconfiurable Fir Filter

An Optimized Implementation of CSLA and CLLA for 32-bit Unsigned Multiplier Using Verilog

International Journal Of Scientific Research And Education Volume 3 Issue 6 Pages June-2015 ISSN (e): Website:

Redundant Residue Number System Based Fault Tolerant Architecture over Wireless Network

Performance Analysis of FIR Filter Design Using Reconfigurable Mac Unit

II. QUATERNARY CONVERTER CIRCUITS

High Speed and Reduced Power Radix-2 Booth Multiplier

Low Power Approach for Fir Filter Using Modified Booth Multiprecision Multiplier

PERFORMANCE COMPARISON OF HIGHER RADIX BOOTH MULTIPLIER USING 45nm TECHNOLOGY

Globally Asynchronous Locally Synchronous (GALS) Microprogrammed Parallel FIR Filter

DESIGN OF HIGH SPEED 32 BIT UNSIGNED MULTIPLIER USING CLAA AND CSLA

Modified Booth Multiplier Based Low-Cost FIR Filter Design Shelja Jose, Shereena Mytheen

DESIGN OF AREA EFFICIENT TRUNCATED MULTIPLIER FOR DIGITAL SIGNAL PROCESSING APPLICATIONS

Implementation of Parallel Multiplier-Accumulator using Radix- 2 Modified Booth Algorithm and SPST

FPGA Implementation of Booth Encoded Multi-Modulus {2 n -1, 2 n, 2 n +1} RNS Multiplier

Innovative Approach Architecture Designed For Realizing Fixed Point Least Mean Square Adaptive Filter with Less Adaptation Delay

FPGA Implementation of Desensitized Half Band Filters

Tirupur, Tamilnadu, India 1 2

Comparative Analysis of Various Adders using VHDL

Design of Area and Power Efficient FIR Filter Using Truncated Multiplier Technique

DESIGN OF A HIGH SPEED MULTIPLIER BY USING ANCIENT VEDIC MATHEMATICS APPROACH FOR DIGITAL ARITHMETIC

Design and Implementation of Scalable Micro Programmed Fir Filter Using Wallace Tree and Birecoder

An Design of Radix-4 Modified Booth Encoded Multiplier and Optimised Carry Select Adder Design for Efficient Area and Delay

Implementation and Comparison of Low Pass FIR Filter on FPGA Using Different Techniques

A Fixed-Width Modified Baugh-Wooley Multiplier Using Verilog

DESIGN OF FIR FILTER ARCHITECTURE USING VARIOUS EFFICIENT MULTIPLIERS Indumathi M #1, Vijaya Bala V #2

Design and Implementation of High Speed Carry Select Adder Korrapatti Mohammed Ghouse 1 K.Bala. 2

Keywords: Adaptive filtering, LMS algorithm, Noise cancellation, VHDL Design, Signal to noise ratio (SNR), Convergence Speed.

Discrete Square Root. Çetin Kaya Koç Winter / 11

IJCSIET-- International Journal of Computer Science information and Engg., Technologies ISSN

An Optimized Design for Parallel MAC based on Radix-4 MBA

Area Efficient Fft/Ifft Processor for Wireless Communication

VLSI Implementation of Reconfigurable Low Power Fir Filter Architecture

CHAPTER 4 ANALYSIS OF LOW POWER, AREA EFFICIENT AND HIGH SPEED MULTIPLIER TOPOLOGIES

Channelization and Frequency Tuning using FPGA for UMTS Baseband Application

Design and Simulation of 16x16 Hybrid Multiplier based on Modified Booth algorithm and Wallace tree Structure

AREA AND DELAY EFFICIENT DESIGN FOR PARALLEL PREFIX FINITE FIELD MULTIPLIER

FPGA Implementation of Wallace Tree Multiplier using CSLA / CLA

IJCSIET--International Journal of Computer Science information and Engg., Technologies ISSN

International Journal of Scientific & Engineering Research Volume 3, Issue 12, December ISSN

IMPLEMENTATION OF AREA EFFICIENT MULTIPLIER AND ADDER ARCHITECTURE IN DIGITAL FIR FILTER

An Efficient Implementation of Downsampler and Upsampler Application to Multirate Filters

VLSI Design and FPGA Implementation of N Binary Multiplier Using N-1 Binary Multipliers

ADVANCES in NATURAL and APPLIED SCIENCES

Area and Power Efficient Booth s Multipliers Based on Non Redundant Radix-4 Signed- Digit Encoding

Optimized FIR filter design using Truncated Multiplier Technique

Design and Implementation of Digit Serial Fir Filter

Fixed Point Lms Adaptive Filter Using Partial Product Generator

Design of Efficient 64 Bit Mac Unit Using Vedic Multiplier

Transcription:

International Journal of Computational Engineering & Management, Vol. 16 Issue 6, November 2013 www..org 61 Design and Analysis of RNS Based FIR Filter Using Verilog Language P. Samundiswary 1, S. Kalpana 2 1 Department of Electronics, Pondicherry University, Puducherry, India 2 Department of Electronics, Pondicherry University, Puducherry, India Abstract Digital filter plays an important role in Very Large Scale Integration (VLSI) technology. The existing Finite Impulse Response (FIR) filter has long transient response which is the major limitation. To overcome this drawback, Residue Number System (RNS) based FIR filters is developed which is described in this paper. High-speed is obtained by introducing the residue arithmetic concept that permits the computation of the filter output by using N FIR sub filters of reduced dynamic range operating in parallel form. Three moduli sets are used in RNS based Filter. 4- tap Low Pass Filter (LPF) type of FIR filter and RNS based FIR filter with 4-tap LPF are designed using Verilog language and analyzed in this paper. The simulation is done by using Xilinx tool Integrated Software Environment (ISE)-13.1 Keywords: FIR Filter, Moduli, RNS Filter. In RNS, arithmetic operations on large integers are done by splitting them into smaller residues and performing the operations independently and parallel form, thereby speeding up the whole operation referred in paper [2, 3]. In this paper, an attempt has been made to design and simulate LPF type of RNS based FIR filters with the consideration of 4 tap using Verilog. The rest of the paper is organized as follows: Section 2 deals with the basic concepts of the FIR filter and their properties. Section 3 discusses about the mathematical fundamentals involved to derive RNS and also describes about the choice of moduli, forward and reverse conversion algorithms used in RNS. In Section 4 the analysis of simulation results of FIR filter and RNS based FIR filter is done and the conclusion is drawn in section 5. 1. Introduction In recent years, there has been significant development in the field of Digital Signal Processing (DSP) along with the advancement in VLSI technology. Various applications of DSP includes audio, image and video processing and consumer electronics [1]. FIR digital filters are widely used in digital signal processing by virtue of stability and easy implementation. The main drawback of FIR filters is the increased amount of computation needed to process a signal through the FIR filter. The advent of VLSI technology and DSP processors provides an opportunity for significantly increased efficiency of RNS to minimize the delay. RNS are becoming popular for designing high performance DSP processors because of their ability to offer carry free arithmetic operation. The carry free operations lead to concurrent execution of arithmetic operation on the residues. However in RNS, moduli selection is one of the most important parameter that determines bit efficiency, area, power consumption, speed etc. 2. FIR Filter The output y of a linear time invariant system is determined by convolving its input signal x with its impulse response b. For a discrete time FIR filter, the output is a weighted sum of the current and a finite number of previous values of the input. The operation is described by the following equation, which defines the output sequence y[n] in terms of its input sequence x[n]: where: is the input signal, is the output signal, (1)

International Journal of Computational Engineering & Management, Vol. 16 Issue 6, November 2013 www..org 62 are the filter coefficients, also known as tap weights, that make up the impulse response and is the filter order. An th-order filter has terms on the right-hand side. The in these terms are commonly referred to as taps, based on the structure of a tapped delay line that in many implementations or block diagrams provides the delayed inputs to the multiplication operations.. FIR digital filters have exactly linear phase response and suffer less from the effects of finite word length as compared with IIR digital filters. The main components of FIR filter are adder, multiplier and delay. The carry propagation delay is a limiting factor of the adder and multiplier. The delay of FIR filters is reduced by incorporating RNS based modulo adder and modulo multiplier in the simple FIR filter. Before the simulation of FIR filter, the coefficients are scaled by the following rules given below. 2.1 Scaling of Co-efficients The implementation of the filter uses a fixed point method to represent data. But the coefficients used in the design of the filter are going to be fractional in nature and also it may be negative. So the method of scaling of data has been used to represent fractions. The scaling was done using scaling factors of 2 s complement and then shifting the data to the left or to the right. For example if 0.375 is one of coefficient which is to be multiplied by 2 to get 0.75 as the answer, then first 0.375 is represented in digital as 0.0110. For doing the above operation first the data is shifted to the left by four bits giving 0110.0, this data is then multiplied by 2 which gives 1100.0. Now the data 1100.0 is again shifted to the right by four bits which gives 0.1100 this value is the same as the required answer of 0.75. An extra bit has been included for considering the overflow of the data. 3. RNS Based FIR Filter Researchers have discussed about the optimized RNS based FIR filter model [4, 5]. It is defined by a set of relatively prime integers called the moduli. The moduli set are denoted as where is the modulus. Each integer can be represented as a set of smaller integers called the residues. The residue-set is denoted as where is the residue. The residue is defined as the least positive remainder when X is divided by the moduli [5]. This relation can be notationally written based on the congruence and the equation is given below. (2) The same congruence can be written in an alternative notation as: (3) The RNS is capable of uniquely representing all integers X that lie in its dynamic range. The dynamic range is determined by the moduli-set {m 1,m 2,..,m n } and denoted as M where: (4) The RNS provides unique representation for all integers in the range between 0 and M-1. If the integer X is greater than M-1, the RNS representation repeats itself. Therefore, more than one integer might have the same residue representation. It is important to emphasize that the moduli have to be relatively prime to be able to exploit the full dynamic range M. The below figure 1 shows the general structure of RNS based FIR filter. X(t) and Y(t) are the input and output of this figure. The forward and reverse conversion is based on the special moduli set and the New Chinese Remainder Theorem (NCRT) and the three FIR filter blocks are used here to speed up the processes. 3.1 Choice of Moduli Figure 1: RNS based FIR filter The choice of moduli should satisfy the following conditions. They should be relatively prime. The moduli should be as small as possible so that operations modulo require minimum computational time. The moduli should imply simple weighted to RNS and RNS to weighted conversions as well as simple RNS

International Journal of Computational Engineering & Management, Vol. 16 Issue 6, November 2013 www..org 63 arithmetic. The moduli set should be of the forms and for simple conversions and simple arithmetic in RNS system. The product of the moduli should be large enough in order to implement the desired dynamic range. The moduli should create a balanced decomposition of the dynamic range. The speed and cost also depend on the moduli chosen.. Based on the above conditions, 3 moduli set is used in this design as. This set is used to design both forward and reverse conversion of the RNS based FIR filter. The forward conversion process is discussed below. 3.2 Forward Conversion of modulo structure is equal to the delay of two (n+1)- bit adders as well as the delay of one (n+1)-bit 2 1multiplexer referred in [8]. Figure 3: modulo 2 n -1 adder The forward conversion stage is of paramount importance as it is considered as an overhead in the overall RNS. Forward converters are usually classified into two categories based on the moduli used. The first category includes forward converters based on arbitrary moduli-sets. These converters are usually built using look-up tables. The second category includes forward converters based on special moduli-sets. The use of special moduli-sets simplifies the forward conversion algorithms and architectures referred in [6]. Usually, the special moduli-sets are referred to as low-cost moduli-sets. In this section, special moduli-set is focused as it is the most commonly used moduli-set used for this design. Researchers have discussed about the importance of special moduli-set forward converters. A typical architecture for the implementation of a forward converter from binary to RNS representation using the special moduli-set is shown in Figure 2. One way of implementing a residue adder for modulo m structure is composed of one n-bit adder referred in [7]. 3.3 Reverse Conversion Figure 4: modulo 2 n +1 adder Converting residue number to binary number is called reverse conversion and is presented in paper [9]. Reverse conversion algorithms in the literature are all based on three types. Chinese Remainder Theorem (CRT) New Chinese Remainder Theorem (NCRT) Mixed Radix Theorem (MRT) The new Chinese Remainder Theorems (CRTI) makes the computations faster and efficient without any extra overheads. But, New CRTs are hardware intensive as they require many inverse modulus operators, multipliers and dividers. Dividers and inverse modulus operators in turn needs many half and full adders and also subtractors. So, some kind of optimization is necessary to implement these theorems practically. Figure 2: Converter Forward The modulo 2 n -1 adder structure is shown in figure 3.This adder adds two n-bit numbers, X and Y in the first step. In the second step, the result is given to the multiplexer for 2 n -1 adder. In adder, the first stage is same as that of 2 n -1 adder but the second step is done with the two s complement of modulo m. The final result is selected between the two outputs according to the two output carries. Modulo adder in series method is shown in Figure 4 The delay The reverse conversion is one of the most difficult operations in RNS and has been a major, if not the major, limiting ratio factor to a wider use of RNS. In general, the realization of a VLSI implementation of R/B converters is still complex and costly. Based on the survey New Chinese Remainder Theorem (NCRT) is an optimized one used for this design. 3.4 New Chinese Remainder Theorems A new high-speed ROM-less residue-to binary converter for the three moduli residue number system of the form { 2" - 1,2", 2" + 1). Unlike any other converter, its delay involves the time of only one 1's complement addition of two 2n-bit

International Journal of Computational Engineering & Management, Vol. 16 Issue 6, November 2013 www..org 64 numbers which is only 2/3 of the binary range of the RNS equal to a as 3n. Thus, it is potentially the fastest known residue-to-binary converter for any nontrivial RNS for a given a. At present, R-to-B converter with the best hardware and time performance can be chosen from those referred in [10] depending on hardware versus time performance preferences. The new realization of the converter relies on the theory which is briefly presented here for completeness and is shown in figure 5. Theorems (New CRT) have been implemented. This theorem reduces the size of the modulo operation required by the CRT. Figure 6: Logic scheme of the (4, 15), MOMA using CSA/CPA with EAC. Figure 5: Detailed hardware realization of a reverse converter Let the three residues be denoted as,,. MSB are given first the decimal value of X and it can be computed as The decimal value of X can be computed by the below formulae where the variables A, B, and C are defined by the following 2n bit vector (5) (6) (7) (8) (9) (10) (11) (12) (13) This formulae reduces the size of the modulo operation from M to Ni at the expense that some part of the dynamic range [0, M) will not be useable. Recently, some alternative general conversion algorithms, namely the New Chinese Remainder The MOMA is built of two stages of 2n-bit carrysave adders (CSA's) with EAC and one 2n-bit 1's complement adder (the latter can be efficiently implemented as a 2n-bit CPA with EAC). For readers convenience, the logic diagram of a sample (4, 15) MOMA is given in Figure 6. The above figure represents the hardware realization 2n-bit CSA, with EAC (End Around Carry) block. 4. Simulation Results The design and simulation of LPF type of FIR filter and RNS based FIR filter with 4 tap is done by using Verilog language. Further, the forward (binary to RNS) and reverse (RNS to binary) conversion blocks of RNS based FIR filters have been designed for moduli set. Then the incorporation of forward and reverse conversion is done in FIR filter to develop 4 tap LPF type of RNS based FIR filter and the same is simulated by Spartan-6 Platform using ISE 13.1 tools with the help of Verilog language. Figure 7 & 8 represents the RTL view and the output waveform for 4 th order low pass FIR filter. and is the input and output of the FIR filter as shown in figure 8. Figure 9 and 10 represents the RTL view and the output waveform for forward conversion with as three binary inputs and as three residue outputs. Figure 11 and 12 represents the RTL view and the output waveform for reverse conversion with as residue inputs and as binary output for reverse conversion. Figure 13 and 14 represents RTL view and the output waveform for RNS based FIR filter.

International Journal of Computational Engineering & Management, Vol. 16 Issue 6, November 2013 www..org 65 Figure 7: RTL view of 4th order low pass FIR filter Figure 11: RTL view of reverse conversion Figure 8: Output waveform of 4th order low pass FIR filter Figure 12: Output waveform of reverse conversion Figure 9: RTL view for forward conversion Figure 13: RTL view of RNS based FIR filter Figure 10: Output waveform of forward conversion Figure 14: Output waveform of RNS based FIR filter

International Journal of Computational Engineering & Management, Vol. 16 Issue 6, November 2013 www..org 66 5. Conclusion This work mainly describes the design of FIR filter and RNS based FIR filter considering 4-tap LPF by using Verilog language. The forward (binary to RNS) and reverse (RNS to binary) conversion blocks of RNS based FIR filters have been designed for moduli and then incorporated in the RNS based FIR filter which is simulated and analysed. References [1] Richard Conway, John Nelson, Improved RNS FIR filter Architecture, IEEE Transaction on Circuits and Systems, vol.51, no.1, pp.26-28, January 2004. [2] Salvatore Pontraelli, Gian Carlo Cardarilli, Marco Re, Adelio Salsano, Optimized implementation of RNS FIR filters based on FPGAs", Springer Journal Signal Processing System, vol. 67, no.3,pp.201-212, September 2012. P.Samundiswary received the B.Tech degree (1997), M.Tech degree (2003) and Ph.D. (2011) in the department of Electronics and Communication Engineering from Pondicherry Engineering College affiliated to Pondicherry University, India. She is having 15 years of teaching experience. She is currently working as Assistant Professor in the Dept. of Electronics Engineering, School of Engineering and Technology, Pondicherry University, Pondicherry, India. Her research interests include Wireless Communication and Wireless Networks. S.Kalpana obtained her B.Tech degree (2010) in Electrical and Electronics Engineering from Regency Institute of Technology, Yanam affiliated to Pondicherry University. Recently, she has obtained her M.Tech degree (2013) in the Dept. of Electronics Engineering, School of Engineering and Technology, Pondicherry University, Pondicherry, India. [3] Pemmaraju V. Ananda Mohan, RNS to Binary converter for a Three moduli set {2 n+1-1,2 n, 2 n -1}, IEEE Transaction on Circuits and Systems, vol.54, no.9, pp.775-779, September 2007. [4] E. Grosswald, Topics from the theory of Numbers, Collier Macmillan Ltd., New York, 1996.. [5] A. Omondi and B. Premkumar, Residue Number System: Theory and implementation, Imperial College Press, 2007, (ISBN 978-1-86094-866-4). [6] Jean-Luc Beuchat, Some Modular Adder and Multipliers for Field Programmable Gate Arrays, Proceedings of the IEEE 17th International Symposium on Parallel and Distributed Processing, Washington DC, USA, pp-190.2, April 2003. [7] Somayeh Timarchi, Keivan Navi, Improved Modulo 2n +1 Adder Design, World Academy of Science, Engineering and Technology, 2001. [8] Dina Younes, Pavel Steffan, Novel modulo Subtractor And Multiplier, Proceedings of Sixth International Conference on Systems, pp.36-38, January, 2011. [9] Marco Re, Alberto Nannarelli, Gian Carlo Cardarilli, Roberto Lojacono, FPGA realization of RNS to Binary signed conversion architecture, Proceedings of IEEE International Symposium on Circuits and Systems, Sydney, Australia, vol. 4, pp.350-353, May 2001. [10] S.Piestrak, A high speed realization of a residue to binary number system Converter, IEEE Transactions on Circuits and systems-ii, vol. 42, no. 10, pp.661-663, October 1995.