An Efficient Reconfigurable Fir Filter based on Twin Precision Multiplier and Low Power Adder

Similar documents
Design and Performance Analysis of a Reconfigurable Fir Filter

Low Power Approach for Fir Filter Using Modified Booth Multiprecision Multiplier

A Reconfigurable FIR Filter Architecture to Trade Off Filter Performance for Dynamic Power Consumption

Design of Area and Power Efficient FIR Filter Using Truncated Multiplier Technique

VLSI Implementation of Reconfigurable Low Power Fir Filter Architecture

DESIGN OF MULTIPLE CONSTANT MULTIPLICATION ALGORITHM FOR FIR FILTER

Design and Implementation of Truncated Multipliers for Precision Improvement and Its Application to a Filter Structure

Modified Booth Encoding Multiplier for both Signed and Unsigned Radix Based Multi-Modulus Multiplier

NOVEL HIGH SPEED IMPLEMENTATION OF 32 BIT MULTIPLIER USING CSLA and CLAA

2 Assoc Prof, Dept of ECE, George Institute of Engineering & Technology, Markapur, AP, India,

Tirupur, Tamilnadu, India 1 2

Data Word Length Reduction for Low-Power DSP Software

Efficient FIR Filter Design Using Modified Carry Select Adder & Wallace Tree Multiplier

Efficient Carry Select Adder Using VLSI Techniques With Advantages of Area, Delay And Power

Optimized FIR filter design using Truncated Multiplier Technique

Index Terms: Low Power, CSLA, Area Efficient, BEC.

High performance Radix-16 Booth Partial Product Generator for 64-bit Binary Multipliers

Design and Implementation of High Speed Carry Select Adder

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 5, Sep-Oct 2014

An Design of Radix-4 Modified Booth Encoded Multiplier and Optimised Carry Select Adder Design for Efficient Area and Delay

International Journal of Scientific & Engineering Research, Volume 7, Issue 3, March-2016 ISSN

Area Power and Delay Efficient Carry Select Adder (CSLA) Using Bit Excess Technique

Design of Delay-Power Efficient Carry Select Adder using 3-T XOR Gate

Faster and Low Power Twin Precision Multiplier

A Highly Efficient Carry Select Adder

JDT LOW POWER FIR FILTER ARCHITECTURE USING ACCUMULATOR BASED RADIX-2 MULTIPLIER

Design and Characterization of 16 Bit Multiplier Accumulator Based on Radix-2 Modified Booth Algorithm

Design of a Power Optimal Reversible FIR Filter ASIC Speech Signal Processing

Design and Implementation of High Speed Carry Select Adder Korrapatti Mohammed Ghouse 1 K.Bala. 2

AN EFFICIENT APPROACH TO MINIMIZE POWER AND AREA IN CARRY SELECT ADDER USING BINARY TO EXCESS ONE CONVERTER

IMPLEMENTATION OF UNSIGNED MULTIPLIER USING MODIFIED CSLA

FPGA Implementation of Area-Delay and Power Efficient Carry Select Adder

Implementation of 32-Bit Unsigned Multiplier Using CLAA and CSLA

An Efficient Implementation of Downsampler and Upsampler Application to Multirate Filters

FOR HIGH SPEED LOW POWER APPLICATIONS USING RADIX-4 MODIFIED BOOTH ENCODER

INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY

High Speed Binary Counters Based on Wallace Tree Multiplier in VHDL

Design and Implementation of High Speed Area Efficient Carry Select Adder Using Spanning Tree Adder Technique

I. INTRODUCTION VANAPARLA ASHOK 1, CH.LAVANYA 2. KEYWORDS Low Area, Carry, Adder, Half-sum, Half-carry.

Design A Redundant Binary Multiplier Using Dual Logic Level Technique

128 BIT MODIFIED SQUARE ROOT CARRY SELECT ADDER

Low-Power Multipliers with Data Wordlength Reduction

Design and Implementation of Reconfigurable FIR Filter

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Design and Implementation of 64-bit MAC Unit for DSP Applications using verilog HDL

DESIGN AND IMPLEMENTATION OF AREA EFFICIENT, LOW-POWER AND HIGH SPEED 128-BIT REGULAR SQUARE ROOT CARRY SELECT ADDER

Mahendra Engineering College, Namakkal, Tamilnadu, India.

Low Power and Area EfficientALU Design

Efficient Implementation on Carry Select Adder Using Sum and Carry Generation Unit

Low Power FIR Filter Design Based on Bitonic Sorting of an Hardware Optimized Multiplier S. KAVITHA POORNIMA 1, D.RAHUL.M.S 2

DESIGN & IMPLEMENTATION OF FIXED WIDTH MODIFIED BOOTH MULTIPLIER

FPGA Implementation of Area Efficient and Delay Optimized 32-Bit SQRT CSLA with First Addition Logic

Area Efficient Carry Select Adder with Half-Sum and Half-Carry Method

An Optimized Wallace Tree Multiplier using Parallel Prefix Han-Carlson Adder for DSP Processors

DESIGN AND IMPLEMENTATION OF 64- BIT CARRY SELECT ADDER IN FPGA

SIGNED PIPELINED MULTIPLIER USING HIGH SPEED COMPRESSORS

Implementation of Parallel Multiplier-Accumulator using Radix- 2 Modified Booth Algorithm and SPST

A Novel Approach to 32-Bit Approximate Adder

ENHANCING SPEED AND REDUCING POWER OF SHIFT AND ADD MULTIPLIER

Design and Implementation of Carry Select Adder Using Binary to Excess-One Converter

Design of an optimized multiplier based on approximation logic

A Novel High-Speed, Higher-Order 128 bit Adders for Digital Signal Processing Applications Using Advanced EDA Tools

PUBLICATIONS OF PROBLEMS & APPLICATION IN ENGINEERING RESEARCH - PAPER CSEA2012 ISSN: ; e-issn:

Design and Analysis of CMOS based Low Power Carry Select Full Adder

Design of a Power Optimal Reversible FIR Filter for Speech Signal Processing

International Journal of Computer Engineering and Applications, Volume XI, Issue XI, Nov. 17, ISSN

Design and Implementation of Efficient Carry Select Adder using Novel Logic Algorithm

DESIGN OF FIR FILTER ARCHITECTURE USING VARIOUS EFFICIENT MULTIPLIERS Indumathi M #1, Vijaya Bala V #2

A VLSI Implementation of Fast Addition Using an Efficient CSLAs Architecture

A New High Speed Low Power Performance of 8- Bit Parallel Multiplier-Accumulator Using Modified Radix-2 Booth Encoded Algorithm

IJCSIET--International Journal of Computer Science information and Engg., Technologies ISSN

Design and Analysis of Row Bypass Multiplier using various logic Full Adders

Multiple Constant Multiplication for Digit-Serial Implementation of Low Power FIR Filters

Design and Implementation of 128-bit SQRT-CSLA using Area-delaypower efficient CSLA

Low Power FIR Filter Structure Design Using Reversible Logic Gates for Speech Signal Processing

Multiplier and Accumulator Using Csla

High Speed, Low power and Area Efficient Processor Design Using Square Root Carry Select Adder

A New network multiplier using modified high order encoder and optimized hybrid adder in CMOS technology

Design of 32-bit Carry Select Adder with Reduced Area

Design of 8-4 and 9-4 Compressors Forhigh Speed Multiplication

DESIGN OF AREA EFFICIENT TRUNCATED MULTIPLIER FOR DIGITAL SIGNAL PROCESSING APPLICATIONS

Implementation of Cmos Adder for Area & Energy Efficient Arithmetic Applications

DESIGN & FPGA IMPLEMENTATION OF RECONFIGURABLE FIR FILTER ARCHITECTURE FOR DSP APPLICATIONS

An Efficient VLSI Architecture of a Reconfigurable Pulse- Shaping FIR Interpolation Filter for Multi standard DUC

Efficient Optimization of Carry Select Adder

High Speed Vedic Multiplier Designs Using Novel Carry Select Adder

AN EFFICIENT DESIGN OF ROBA MULTIPLIERS 1 BADDI. MOUNIKA, 2 V. RAMA RAO M.Tech, Assistant professor

AN EFFICIENT MAC DESIGN IN DIGITAL FILTERS

Modified Booth Multiplier Based Low-Cost FIR Filter Design Shelja Jose, Shereena Mytheen

A VLSI Implementation of Fast Addition Using an Efficient CSLAs Architecture

IMPLEMENTATION OF AREA EFFICIENT AND LOW POWER CARRY SELECT ADDER USING BEC-1 CONVERTER

An Efficent Real Time Analysis of Carry Select Adder

Implementation of 256-bit High Speed and Area Efficient Carry Select Adder

A Novel Multiplier Design using Adaptive Hold Logic to Mitigate BTI Effect

Ajmer, Sikar Road Ajmer,Rajasthan,India. Ajmer, Sikar Road Ajmer,Rajasthan,India.

CHAPTER 4 ANALYSIS OF LOW POWER, AREA EFFICIENT AND HIGH SPEED MULTIPLIER TOPOLOGIES

HIGH SPEED FIXED-WIDTH MODIFIED BOOTH MULTIPLIERS

A Parallel Multiplier - Accumulator Based On Radix 4 Modified Booth Algorithms by Using Spurious Power Suppression Technique

REALIAZATION OF LOW POWER VLSI ARCHITECTURE FOR RECONFIGURABLE FIR FILTER USING DYNAMIC SWITCHING ACITIVITY OF MULTIPLIERS

Trade-Offs in Multiplier Block Algorithms for Low Power Digit-Serial FIR Filters

Transcription:

An Efficient Reconfigurable Fir Filter based on Twin Precision Multiplier and Low Power Adder Sony Sethukumar, Prajeesh R, Sri Vellappally Natesan College of Engineering SVNCE, Kerala, India. Manukrishna V R Vellore Institute of Technology - VIT, Tamil Nadu, India. Abstract This paper proposes a low power reconfigurable finite impulse response (FIR) filter using a twin-precision multiplier and a low power carry select adder. The multiplier is capable of performing two independent N/2-bit multiplications in parallel which reduces the power based on partial products reordering. Reordering of partial products is applied on High Performance Multiplication(HPM).The adder used is the low power and area efficient carry select adder(csla). The structure of this adder that there is scope for reducing the area and power consumption in the CSLA. The proposed design has reduced area and power as compared with the regular Reconfigurable FIR Filter. The results analysis shows that the proposed structure is better than the regular FIR structure. Keywords: Low power filter, reconfigurable design, twin precision, HPM, CSLA INTRODUCTION One of the most widely used operations performed in the mobile computing and various multimedia applications is finite impulse response(fir)filtering. Many efforts for reducing the power consumption of FIR filter is done by the optimization of filter coefficients. FIR filter operations performs the convolution sum and so by minimizing the number of additions and multiplications and accelerating the operations is one of the main goals of this research. The amount of computation and the corresponding power consumption of FIR filter are directly proportional to the filter order, if we change the filter order by turning off some of multipliers, significant power savings can be achieved. However, performance degradation should be carefully considered when we change the filter order. Multiplication is one of the most area consuming arithmetic operations in high-performance circuits. As a consequence many research works deal with low power design of high speed multipliers. Multiplication involves two basic operations, the generation of the partial products and their sum. In the fixed point arithmetic of FIR filter, full operand bit-widths of the multiplier outputs is not generally used. In other words, when the bit-widths of data inputs and coefficients are 16, the multiplier generates 32-bit outputs. However, considering the circuit area of the following adders, the LSBs of multipliers outputs are usually truncated or rounded off, which incurs quantization errors. When we turn off the multiplier in the FIR filter, if we can carefully select the input and coefficient amplitudes such that the multiplication of those two numbers is as small as the quantization error, filter performance degradation can be made negligible. Several studies has been done on operand bandwidth and it has been shown that for SPECint95 benchmarks more than 50% of the instructions are instructions where both operands are less than or equal to 16 bits. This property was explored to save power, through operand guarding n which the most significant bits of the operands are not switched and thus power is saved when narrow width operations are computed consecutively. Moreover in this paper we present a reconfigurable FIR filter architecture, where the filter order can be dynamically changed depending on the amplitude of both the filter coefficients and the inputs. In other words, when the data sample multiplied to the coefficient is so small as to mitigate the effect of partial sum in FIR filter, the multiplication operation can be simply canceled.so in effective to the operand guarding using the twin precision technique and the multiplier being turned off when the input coefficients are lower than the threshold, power saving can be done to a large extend compared to the earlier proposed FIR architecture. Addition operation has been performed in the proposed architecture by the area and power efficient low power carry select adder(csla) in place of the normal adder. In the conventional digital adders, the speed of addition is limited by the time required to propagate a carry through the adder. The sum for each bit position in an elementary adder is generated sequentially only after the previous bit position has been summed and a carry propagated into the next position. But in CSLA adder the problem of carry propagation delay is done by by independently generating multiple carries and then select a carry to generate the sum.it uses a Binary to Excess-1 Converter (BEC) instead of Ripple Carry Adder with C in = 1 in the regular CSLA to achieve lower area and power consumption [2] [4]. The main advantage of this BEC logic comes from the lesser number of logic gates than the n bit Full Adder (FA) structure. The details of the BEC logic are discussed in Section III. 5201

LOW POWER MULTIPLIER USING TWIN PRECISION TECHNIQUE USED IN THE PROPOSED FIR FILTER In twin precision multiplier it is possible to perform two N/2 bit multiplications in parallel in an N bit multiplier. One N/2 N/2 bit multiplication is performed using N/2 N/2 bit partial product bits in the least significant part of the multiplier and another N/2 N/2 bit multiplication using partial product bits in the most significant part. Twin precision multiplier mainly operates in three modes namely Full or Double Precision mode (FDP), Dual Single Precision mode (DSP) and Single Precision mode (SP). The FDP mode is used for normal N x N multiplications. In this mode nearly 100% of the twin precision circuit is used. In DSP mode, two N/2 multiplications can be performed concurrently. In this mode more than 50% of the circuit is not used. And in SP mode, only one N/2 Multiplication is performed and 75% of the circuit is not used. Figure 1. Partial product representation of an unsigned 8 bit multiplication where a 4-bit multiplication, shown in white, is computed in parallel with a second 4-bit multiplication, shown in black The illustration of 8-bit unsigned multiplication, by performing two (black and white) 4-bit multiplications in concurrent is given in Fig 1. The white and grey are the important partial products. When performing a 4-bit multiplication in a standard multiplier normally one quarter of the logic cells are performing any useful operations, which is depicted in the grey regions. The twin-precision technique rearranges logic to greatly reduce power dissipation and delay for 4-bit multiplications. Also, this technique allows the logic gates, which are not used in the 4-bit multiplication taking place in the grey region, to perform a second, independent 4- bit multiplication (the black regions). The basic operation of generating a partial product is that of a 1-bit multiplication using a 2-input AND gate, where one of the input signals is one bit of the multiplier and the second input signal is one bit of the multiplicand. A. HPM IMPLEMENTATION To obtain high speed and/or low-power implementations, a logarithmic reduction HPM tree [11] is preferred for summation of the partial products. A logarithmic reduction tree has the advantage that the logic depth is shorter. Further, a logarithmic tree has fewer glitches making it less power dissipating. Regarding the connectivity of cells, a triangular reduction tree is preferred rather than a rectangular. This is because the total wire length for an 8-bit multiplier is approximately 38%shorter for the triangle compared to the rectangle. The partial products which are needed in DSP and SP modes are named as important (IMP), which is represented by > (Figure 2). The unused IMP partial products are in DSP and SP are termed as unimportant (UIMP), which is represented as x. The reordering of partial products has been performed in three stages and this technique is applied in HPM implementation. In the first stage the two operands of the multiplier are multiplied and partial products are formed as parallelogram structure. (Fig 2(a)) In the second stage the parallelogram structure of partial products is rearranged into triangular by moving the higher height partial products to the lower height. (Fig 2(b)). This helps to reduce the total wire length in the cell placement compare to parallelogram structure. Finally, the reordering of partial products is done by moving all the IMP partial products to the higher height the IMP partial products to the higher height of the partial product tree.(fig2(c)) The partial products are formed by using two input AND gates for the IMP partial products. An extra control signal is used to control the UIMP partial products in twin precision multiplier, so 3-input AND gate is used in generating the UIMP partial products. Then the generated partial products are reordered. This reordered partial products causes to switch less number of half adder and full adders. In the proposed FIR architecture the multiplier is turned off by converting the 3 input AND gate into a 4 input AND gate where the 4 th input signal is used to turn of the multiplier by setting it as zero when the input coefficients are less than the threshold. The column compression is done using HPM. In the HPM algorithm, the height of the tree is reduced by one for each row. For example rows 8, 7, 6, 5, 4, 3, 2 for an 8-bit multiplier. By using this column compression technique the height of the tree is reduced to two. The Fig.3 shows the block diagram for unsigned 8-bit twin precision multiplier by using reordering of partial products and HPM reduction tree. The architecture shows that the transitions that are taking place in DSP and SP mode is reduced compared to basic HPM which leads to the reduction of switching power. In Fig 3, a 4-bit multiplication computed by most significant bits are represented using the full adders are shown in black. Another 4-bit multiplication which is computed by the least significant bits are shown in white. The adder blocks given in grey remains idle in the DSP mode. The white blocks are enabled only in SP mode. 5202

LOW POWER CARRY SELECT ADDER USED IN THE FIR ARCHITECTURE Addition operation has been performed in the proposed architecture by the area and power efficient low power carry select adder(csla) in place of the normal adder. In the conventional digital adders, the speed of addition is limited by the time required to propagate a carry through the adder. The sum for each bit position in an elementary adder is generated sequentially only after the previous bit position has been summed and a carry propagated into the next position. But in CSLA adder the problem of carry propagation delay is done by by independently generating multiple carries and then select a carry to generate the sum.it uses a Binary to Excess-1 Converter (BEC) instead of Ripple Carry Adder with C in = 1 in the regular CSLA to achieve lower area and power consumption [2] [4]. The main advantage of this BEC logic comes from the lesser number of logic gates than the n bit Full Adder (FA) structure. The basic function of the CSLA is obtained by using the 4-bit BEC together with the mux. One input of the 8:4 mux gets as it input (B3, B2, B1, and B0) and another input of the mux is the BEC output. This produces the two possible partial results in parallel and the mux is used to select either the BEC output or the direct inputs according to the control signal Cin. The importance of the BEC logic stems from the large silicon area reduction when the CSLA with large number of bits are designed. Figure 2. Reordering of partial products for lower switching activity Figure 4. BEC with 8:4 mux used in the adder Figure 3. Architecture of an unsigned 8-bit twin precision multiplier by using reordering of partial products and HPM reduction tree. Figure 5. Amplitude detection logic(ad) 5203

ARCHITECTURE OF PROPOSED MODIFIED FIR FILTER INCORPORATING THE TWIN PRECISION MULTIPLIER AND LOW POWER ADDER Figure 6. Modified FIR filter architecture using Twin precision technique and low power carry select adder A. AMPLITUDE DETECTION LOGIC AND CONTROL GENERATOR In order to monitor the amplitudes of input samples x(n) and cancel the right multiplication operations in the fir filter, amplitude detector (AD) in Fig. 5 is used. When the absolute value of input is smaller than the threshold, the output of AD is set to 1. AD can be implemented using a simple comparator. Fig. 7. shows the ctrl signal generator design. As an input smaller than the threshold comes in and AD output is set to 1, the counter is counting up. When the counter reaches m, the ctrl signal in the figure changes to 1, which indicates that consecutive m small inputs are monitored and the multipliers are turned off. In the proposed reconfigurable filter, if we turn off the multiplier by considering each of the input amplitude only, then, if the amplitude of input abruptly changes for every cycle, the multiplier will be turned on and off continuously, which incurs considerable switching activities.. Using ctrl signal generator, the number of input samples consecutively smaller than threshold are counted and the multipliers are turned off only when m consecutive input samples are smaller than threshold. SIMULATION RESULTS Figure7. Schematic of control signal generator 5204

Figure 8. Simulated power analysis Figure 9. Simulation waveforms Figure 10. RTL Schematic 5205

CONCLUSION In this paper a modified reconfigurable FIR filter is proposed by incorporating the twin precision multiplier and a low power carry select adder. The reordering of partial products technique using HPM implementation has less power consumption compared to the normal conventional multipliers used in the existing FIR filter. Also the power and area savings has been enhanced by replacing the normal adders by low power carry select adder in the proposed modified FIR filter. So the two major operations, multiplication and addition in the FIR filter are done at the cost of low power and area compared to the normal existing FIR architecture. Thus a low power reconfigurable FIR filter architecture for low power application is presented to allow better trade off between the filter performance and computational energy. REFERENCES [1] H. Samueli, An improved search algorithm for the design of multiplierless FIR filter with powers-oftwo coefficients, IEEE Trans. Circuits Syst., vol. 36, no. 7, pp. 1044 1047, Jul. 1989. [2] R. I. Hartley, Subexpression sharing in filters using canonical signed digit multipliers, IEEE Trans. Circuits Syst. II, Analog Digit. Signal Process., vol. 43, no. 10, pp. 677 688, Oct. 1996. [3] O. Gustafsson, A difference based adder graph heuristic for multiple constant multiplication problems, in Proc. IEEE Int. Symp. Circuits Syst., 2007, pp. 1097 1100. [4] S. H. Nawab, A. V. Oppenheim, A. P. Chandrakasan, J. M. Winograd, and J. T. Ludwig, Approximate signal processing, J. VLSI Signal Process., vol. 15, no. 1 2, pp. 177 200, Jan. 1997. [5] J. Ludwig, H. Nawab, and A. P. Chandrakasan, Low power digital filtering using approximate. processing, IEEE J. Solid-State Circuits, vol. 31, no. 3, pp. 395 400, Mar. 1996. [6] A. Sinha, A. Wang, and A. P. Chandrakasan, Energy scalable system design, IEEE Trans. Very Large Scale Integr. Syst., vol. 10, no. 2, pp. 135 145, Apr. 2002. [7] K.-H. Chen and T.-D. Chiueh, A low-power digitbased reconfigurable FIR filter, IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 53,no. 8, pp. 617 621, Dec. 2006. [8] R. Mahesh and A. P. Vinod, New reconfigurable architectures for implementing filters wit Des. Integr. Circuits Syst., vol. 29, no. 2, pp. 275 288, Feb. 2010. [9] Z. Yu, M.-L. Yu, K. Azadet, and A. N. Wilson Symp. Circuits Syst., 2008, pp. 81 84. [10] R. Mahesh and A. P. Vinod, Coefficient decimation approach for realizing reconfigurable finite impulse response filters, in Proc. IEEE Int. Symp. Circuits Syst., 2008, pp. 81 84. [11] J. Park and K. Roy, A low complexity reconfigurable DCT architecture to trade off image quality for power consumption, J. Signal Process. Syst., vol. 53, no. 3, pp. 399 410, Dec. 2008. [12] J. G. Proakis, Digital Communications, 3rd ed. New York: McGraw-Hill, 1995. [13] Synopsys, Inc., Nanosim Reference Guide, 2007. [14] S. Hwang, G. Han, S. Kang, and J. Kim, New distributed arithmetic algorithm for low-power FIR filter implementation, IEEE Signal Process. Lett., vol. 11, no. 5, pp. 463 466, May 2004. 5206