A MODIFIED ARCHITECTURE OF MULTIPLIER AND ACCUMULATOR USING SPURIOUS POWER SUPPRESSION TECHNIQUE

Similar documents
Design and Characterization of 16 Bit Multiplier Accumulator Based on Radix-2 Modified Booth Algorithm

Implementation of Parallel Multiplier-Accumulator using Radix- 2 Modified Booth Algorithm and SPST

A Parallel Multiplier - Accumulator Based On Radix 4 Modified Booth Algorithms by Using Spurious Power Suppression Technique

Design of Parallel MAC Based On Radix-4 & Radix-8 Modified Booth Algorithm

Design and Implementation of 64-bit MAC Unit for DSP Applications using verilog HDL

A New High Speed Low Power Performance of 8- Bit Parallel Multiplier-Accumulator Using Modified Radix-2 Booth Encoded Algorithm

IJCSIET--International Journal of Computer Science information and Engg., Technologies ISSN

An Optimized Design for Parallel MAC based on Radix-4 MBA

AN EFFICIENT MAC DESIGN IN DIGITAL FILTERS

/$ IEEE

Mahendra Engineering College, Namakkal, Tamilnadu, India.

Novel Architecture of High Speed Parallel MAC using Carry Select Adder

High Speed Vedic Multiplier Designs Using Novel Carry Select Adder

SIGNED PIPELINED MULTIPLIER USING HIGH SPEED COMPRESSORS

Modified Booth Multiplier Based Low-Cost FIR Filter Design Shelja Jose, Shereena Mytheen

Ajmer, Sikar Road Ajmer,Rajasthan,India. Ajmer, Sikar Road Ajmer,Rajasthan,India.

Modified Booth Encoding Multiplier for both Signed and Unsigned Radix Based Multi-Modulus Multiplier

Design and Implementation of FPGA Radix-4 Booth Multiplication Algorithm

An Design of Radix-4 Modified Booth Encoded Multiplier and Optimised Carry Select Adder Design for Efficient Area and Delay

JDT LOW POWER FIR FILTER ARCHITECTURE USING ACCUMULATOR BASED RADIX-2 MULTIPLIER

DESIGNING OF MODIFIED BOOTH ENCODER WITH POWER SUPPRESSION TECHNIQUE

Design of high speed multiplier using Modified Booth Algorithm with hybrid carry look-ahead adder

Review of Booth Algorithm for Design of Multiplier

Design and Implementation Radix-8 High Performance Multiplier Using High Speed Compressors

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

International Journal of Advanced Research in Computer Science and Software Engineering

FOR HIGH SPEED LOW POWER APPLICATIONS USING RADIX-4 MODIFIED BOOTH ENCODER

Design of an optimized multiplier based on approximation logic

DESIGN OF LOW POWER / HIGH SPEED MULTIPLIER USING SPURIOUS POWER SUPPRESSION TECHNIQUE (SPST)

Multiplier Design and Performance Estimation with Distributed Arithmetic Algorithm

Implementation of Efficient 16-Bit MAC Using Modified Booth Algorithm and Different Adders

HIGH PERFORMANCE BAUGH WOOLEY MULTIPLIER USING CARRY SKIP ADDER STRUCTURE

CHAPTER 4 ANALYSIS OF LOW POWER, AREA EFFICIENT AND HIGH SPEED MULTIPLIER TOPOLOGIES

Modified Partial Product Generator for Redundant Binary Multiplier with High Modularity and Carry-Free Addition

An Optimized Wallace Tree Multiplier using Parallel Prefix Han-Carlson Adder for DSP Processors

ISSN Vol.07,Issue.08, July-2015, Pages:

INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY

ISSN Vol.03,Issue.02, February-2014, Pages:

Design and Simulation of Convolution Using Booth Encoded Wallace Tree Multiplier

VLSI Designing of High Speed Parallel Multiplier Accumulator Based On Radix4 Booths Multiplier

Design of a Power Optimal Reversible FIR Filter ASIC Speech Signal Processing

Design and Implementation of Scalable Micro Programmed Fir Filter Using Wallace Tree and Birecoder

Design and Simulation of 16x16 Hybrid Multiplier based on Modified Booth algorithm and Wallace tree Structure

AN ADVANCED VLSI ARCHITECTURE OF PARALLEL MULTIPLIER BASED ON HIGHER ORDER MODIFIED BOOTH ALGORITHM

High performance Radix-16 Booth Partial Product Generator for 64-bit Binary Multipliers

MODIFIED BOOTH ALGORITHM FOR HIGH SPEED MULTIPLIER USING HYBRID CARRY LOOK-AHEAD ADDER

Compressors Based High Speed 8 Bit Multipliers Using Urdhava Tiryakbhyam Method

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET) STUDY ON COMPARISON OF VARIOUS MULTIPLIERS

A New network multiplier using modified high order encoder and optimized hybrid adder in CMOS technology

ADVANCES in NATURAL and APPLIED SCIENCES

An Optimized Implementation of CSLA and CLLA for 32-bit Unsigned Multiplier Using Verilog

A Novel High Performance 64-bit MAC Unit with Modified Wallace Tree Multiplier

Digital Integrated CircuitDesign

DESIGN OF FIR FILTER ARCHITECTURE USING VARIOUS EFFICIENT MULTIPLIERS Indumathi M #1, Vijaya Bala V #2

Performance Analysis of a 64-bit signed Multiplier with a Carry Select Adder Using VHDL

A High Speed Wallace Tree Multiplier Using Modified Booth Algorithm for Fast Arithmetic Circuits

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

IMPLEMENTATION OF AREA EFFICIENT MULTIPLIER AND ADDER ARCHITECTURE IN DIGITAL FIR FILTER

Implementation and Performance Analysis of different Multipliers

Implementation of Parallel MAC Unit in 8*8 Pre- Encoded NR4SD Multipliers

DESIGN OF AREA EFFICIENT TRUNCATED MULTIPLIER FOR DIGITAL SIGNAL PROCESSING APPLICATIONS

DESIGN AND IMPLEMENTATION OF AREA EFFICIENT, LOW-POWER AND HIGH SPEED 128-BIT REGULAR SQUARE ROOT CARRY SELECT ADDER

Design of High Speed Carry Select Adder using Spurious Power Suppression Technique

Design and Implementation of High Radix Booth Multiplier using Koggestone Adder and Carry Select Adder

Efficient FIR Filter Design Using Modified Carry Select Adder & Wallace Tree Multiplier

A Review on Different Multiplier Techniques

Design and Performance Analysis of 64 bit Multiplier using Carry Save Adder and its DSP Application using Cadence

ENHANCING SPEED AND REDUCING POWER OF SHIFT AND ADD MULTIPLIER

Tirupur, Tamilnadu, India 1 2

Design of Area and Power Efficient FIR Filter Using Truncated Multiplier Technique

Reduced Complexity Wallace Tree Mulplier and Enhanced Carry Look-Ahead Adder for Digital FIR Filter

Design of Digital FIR Filter using Modified MAC Unit

PERFORMANCE COMPARISON OF HIGHER RADIX BOOTH MULTIPLIER USING 45nm TECHNOLOGY

Low-Power Multipliers with Data Wordlength Reduction

Design A Redundant Binary Multiplier Using Dual Logic Level Technique

Design of Efficient 64 Bit Mac Unit Using Vedic Multiplier

DESIGN OF MULTIPLE CONSTANT MULTIPLICATION ALGORITHM FOR FIR FILTER

Design and Implementation of Truncated Multipliers for Precision Improvement and Its Application to a Filter Structure

A Survey on Power Reduction Techniques in FIR Filter

ASIC Design and Implementation of SPST in FIR Filter

International Journal of Advanced Research in Biology Engineering Science and Technology (IJARBEST)

Optimized FIR filter design using Truncated Multiplier Technique

Design and Simulation of Low Power and Area Efficient 16x16 bit Hybrid Multiplier

Implementation of FPGA based Design for Digital Signal Processing

High Performance 128 Bits Multiplexer Based MBE Multiplier for Signed-Unsigned Number Operating at 1GHz

Design and Performance Analysis of a Reconfigurable Fir Filter

FPGA Implementation of Wallace Tree Multiplier using CSLA / CLA

Design of a Power Optimal Reversible FIR Filter for Speech Signal Processing

IJCSIET-- International Journal of Computer Science information and Engg., Technologies ISSN

Design and Analysis of Row Bypass Multiplier using various logic Full Adders

A Survey on A High Performance Approximate Adder And Two High Performance Approximate Multipliers

Low Power Approach for Fir Filter Using Modified Booth Multiprecision Multiplier

International Journal Of Scientific Research And Education Volume 3 Issue 6 Pages June-2015 ISSN (e): Website:

S.Nagaraj 1, R.Mallikarjuna Reddy 2

Low Power FIR Filter Structure Design Using Reversible Logic Gates for Speech Signal Processing

Design and Implementation of Carry Select Adder Using Binary to Excess-One Converter

A Fixed-Width Modified Baugh-Wooley Multiplier Using Verilog

High Speed Binary Counters Based on Wallace Tree Multiplier in VHDL

Design of a High Speed FIR Filter on FPGA by Using DA-OBC Algorithm

High Speed and Reduced Power Radix-2 Booth Multiplier

Transcription:

A MODIFIED ARCHITECTURE OF MULTIPLIER AND ACCUMULATOR USING SPURIOUS POWER SUPPRESSION TECHNIQUE R.Mohanapriya #1, K. Rajesh*² # PG Scholar (VLSI Design), Knowledge Institute of Technology, Salem * Assistant Professor, Department of ECE, Knowledge Institute of Tehnology, Salem mohanapriya488@gmail.com krece@kiot.ac.in Abstract----High speed and low power Multiplier and Accumulator (MAC) unit is at most requirement of today s VLSI systems and digital signal processing (DSP) applications like FFT, Finite Impulse response filters, convolution etc. In this modified architecture, Radix-4 Modified Booth Encoding (MBE) is used to produce the partial products. In this multiplication and accumulation has been combined using a hybrid type of Carry Save Adder (CSA).So the performance will be improved. A Carry Look ahead Adder is inserted in the CSA tree to reduce the number of bits in the final adder. In booth multiplication, when two numbers are multiplied some portion of the data may be zero. By neglecting those data, power has been reduced. For this purpose Spurious Power Suppression Technique (SPST) is used to remove useless portion of the data in addition process. In this modified architecture, the overall process is three stages to produce the result. The modified MAC operation is coded with Verilog and simulated using Xilinx 12.1. Keywords----DSP, MAC, Radix-4, Modified Booth Encoding, CSA, CLA, SPST, Verilog. I. INTRODUCTION For Digital Signal Processing (DSP) like Fast Fourier Transform (FFT), Finite Impulse Response (FIR) filters, convolution etc required high speed and low power MAC. The multiplier and Multiplier and Accumulator (MAC) [1] are the important blocks of the processor and have a great impact on the speed of the processor. MAC is the necessary element of the digital processing system such as filtering, convolution and inner products. Many researchers are designing MAC for high performance and low power consumption. An efficient multiplier should have following characteristics: Accuracy- A good multiplier should give correct result, Speed-Multiplier should perform operation at high speed. Area-multiplier should occupy less number of slices and LUT s, Power-Multiplier should consume less power.basically Multiplier consists of three basic steps: Booth Encoding, Partial product generation and final addition. For MAC additionally accumulator used. Modified Booth Algorithm (MBA) [4] is the method commonly used for achieving high speed multiplication and also reduce number of partial products.also by using radix-4, radix-8, radix-16 and radix-32 booth Encoding technique, the partial products are further reduced, which increases complexity and improves the performance [1]. Also by using MBA algorithm speed, has been increased. For this purpose many parallel multiplication have been analyzed. [2]-[4]. Elguibaly [5], proposed one of the advanced types of MAC for general-purpose digital signal processing. In this hybrid type of Carry Save Adder has been used for combining the partial product summation with accumulation. This CSA tree compresses partial products with accumulation process. In this adder for accumulation process was eliminated. In this paper, a new architecture for a high-speed MAC is proposed. The most effective way to increase the speed of a multiplier is to reduce the number of the partial products by Radix-4 modified booth multiplier. In this MAC, the computations of multiplication and accumulation are combined and a hybrid-type CSA structure is proposed to reduce the critical path and improve the output rate. Increasing the operands density is important aspect.this has been done with the help of a modified array structure.in this modified architecture a hybrid type of CSA is used.this CSA contais the Carry Look ahead Adder,Full Adder and Half Adder Also Intermediate calculation results are accumulated in the form of sum and carry instead of the final adder outputs. Then Spurious Power Suppression Technique has been used to achieve less power consumption.whenever two numbers are multiplied,some of the rows will be zero.by neglecting those datas power has been reduced. II. OVERVIEW OF MAC UNIT In digital signal processing,the basic operation is multiplication and accumulation.the MAC unit provides the http://www.giapjournals.org/ijsrtm.html 258

operations such as high speed multiplication,multiplication with cumulative addition and subtraction,saturation,clear operations. Hence, If a MAC is working under high speed operation,it can be able to support multiple operations and parallel MAC is comprises of three important sections: 1.Adder, 2.Multiplier 3.Accumulator. from step 2 instead of that from step 3, step 3 does not have to be run until the point at which the result for the final accumulation is needed. III. PROPOSED MAC ARCHITECTURE In this modified architecture, Radix-4 Modified Booth Encoding (MBE) is used to produce the partial products. In booth multiplication, when two numbers are multiplied some portion of the data may be zero. By neglecting those data, power has been reduced. For this purpose Spurious Power Suppression Technique (SPST) is used to remove ineffective portion of the data in addition process. In this modified architecture the accumulation is carried out along with partial product summation. So that number of stages will get reduced. Multiplier (N bits) and multiplicand (N bits) will be given to the Radix-4 Modified Booth Encoder. To multiply multiplier and multiplicand Radix-4 booth algorithm starts from grouping multiplicand by three bits and encoding them into one of (-2,-2, 0, 1, 2). (i.e). Then these three bits will be converted into single bit. Here 4 rows of partial products will be generated. Fig: 1 Basic MAC unit Fig: 3 Architecture of Proposed MAC Fig:2 Basic arithmetic steps of multiplication and accumulation (3 stages) If an operation to multiply two N bit numbers and accumulates into a 2N-bit number is considered, the critical path is determined by the 2N -bit accumulation operation.since accumulation is carried out using the result By using Spurious Power Suppression Technique (SPST) ineffective portion of the data is removed for reducing the power consumption. This technique is controlled by detection unit. The detection unit has one of the operands as its input to check unwanted portion. http://www.giapjournals.org/ijsrtm.html 259

Steps: 1. Radix-4 Modified Booth Encoder 2. CSA and Accumulator 3. Final addition If the accumulator has been eliminated, the critical path is then determined by the final adder in the multiplier. The basic method to improve the performance of the final adder is to decrease the number of input bits. In order to reduce this number of input bits, the multiple partial products are compressed into a sum and a carry by CSA. A 2-bit CLA is used to add the lower bits in the CSA. In addition, to increase the output rate when pipelining is applied, the sums and carries from the CSA are accumulated instead of the outputs from the final adder in the manner that the sum and carry from the CSA in the previous cycle are inputted to CSA. IV. RADIX-4 BOOTH ENCODING Booth multiplication is a technique that allows for smaller, faster multiplication circuits, by recoding the numbers that are multiplied. It is possible to reduce the number of partial products by half, by using the technique of radix-4 Booth recoding. Grouping starts from the LSB, and the first block only uses two bits of the multiplier. Figure 4 shows the grouping of bits from the multiplier term for use in modified booth encoding.each block is decoded to generate the correct partial product. The encoding of the multiplier Y, using the modified booth algorithm, generates the following five signed digits, -2, -1, 0, +1, +2. Each encoded digit in the multiplier performs a certain operation on the multiplicand, X, as illustrated in Table 1. V. SPURIOUS POWER SUPPRESSION TECHNIQUE The SPST technique is basically depend on the radix 4 modified booth algorithm. It deals with the recoding of the given multiplicand and reduces the number of the intermediate stages in the multiplication operation which maintains the speed of the process at the same time the power consumed will be reduced.the SPST uses a detection logic circuit to detect the effective data range of arithmetic units, e.g., adders or multipliers. When a portion of data does not affect the final computing results, the data controlling circuits of the SPST latch this portion to avoid useless data transitions occurring inside the arithmetic units. The basic idea is that, instead of shifting and adding for every column of the multiplier term and multiplying by 1 or 0, we only take every second column, and multiply by ±1, ±2, or 0, to obtain the same results. The advantage of this method is the halving of the number of partial products. To Booth recode the multiplier term, we consider the bits in blocks of three, such that each block overlaps the previous block by one bit. Fig 4: Grouping of bits from the multiplier term Table 1: Radix-4 Booth Recoding Block Recoded Digit 000 0 0 001 +1 +1 010 +1 +1 011 +2 +2 100-2 -2 101-1 -1 110-1 -1 111 0 0 Operation on X Fig:5 Spurious power suppression technique http://www.giapjournals.org/ijsrtm.html 260

VI. CSA ARCHITECTURE Hybrid type of CSA architecture works with the help of proposed MAC is shown in Fig.6.It performs 8*8 operation (i.e) 8 bit operation. In that architecture, Ni is used to compensate 1 s complement number. Then Si is used to simplify the sign extension. From Booth encoder partial products are generated. Partial products are P0 [7:0], P1 [7:0], P2 [7:0], P3 [7:0].Only four rows of partial products generated instead of eight rows. By using this architecture, five FA adder rows needed. One more row is for accumulation process. For any N*N bit operation, the total number of CSA is (n/2+1) levels.in fig.6, for Full Adder operation, White Square Fig 6:CSA and Accumulator architecture box is used, the gray square box is used for Half Adder operation and rectangular symbol is used for 2 bit carry Look ahead Adder operation. This CLA contains five inputs with carry input. VII.FINAL ADDER This stage is also crucial for any MAC because in this stage addition of large size operands is performed so in this stage fast carry propagate adders like Carry-look Ahead Adder or Carry Skip Adder or Carry Select Adder and other adders such as Carry Save Adder can be used as per requirement. But analyzing all the adders, Carry Look ahead Adder gives low power and speed of http://www.giapjournals.org/ijsrtm.html 261

operation. Also it provides least area-delay product. This adder is augends and addend if a higher order carry is generated. based on the principle of looking at the lower order bits of the VIII.RESULTS AND DISCUSSION Fig 7: Architecture of CSA using Microwind tool Fig: 8 Simulation Results for 8*8 Booth multiplier Fig: 9Simulation Results for 16*16 Booth multiplier http://www.giapjournals.org/ijsrtm.html 262

In figure 7, the architecture of (8*8) multiplication and accumulation has been performed using Microwind tool. By using Radix-4 Booth algorithm the partial products are generated. In this architecture half adders, full Adders and Carry look ahead adders are used for addition process. By passing partial products into the adder blocks 16 bit MAC output has been obtained. The simulation results for 8-bit and 16-bit booth multiplier using Radix-4 Modified Booth algorithm shown in figure 8 and 9. By using Radix-4 Booth algorithm the partial products are reduced by n/2. From figure 8, 4 rows of partial products only produced. From figure 9, 8 rows of partial products only produced. Also delay has been analyzed. Then these partial products are added and produced the 16 bit and 32 bit multiplied output. From figure 8 to 9, results have been obtained with the help of Xilinx 12.1 ISE design suite. Table 2:Delay analysis 8*8 Booth Multiplier 16.234 ns 16*16 Booth Multiplier 23.610 ns [4] N. Ohkubo, M. Suzuki, T. Shinbo, T. Yamanaka, A. Shimizu, K.Sasaki, and Y. Nakagome, A 4.4 ns CMOS 54 54 multiplier usingpass-transistor multiplexer, IEEE J. Solid-State Circuits, vol. 30, no.3, pp. 251 257, Mar.1995. [5] F. Elguibaly, A fast parallel multiplier accumulator using the modifiedbooth algorithm, IEEE Trans. Circuits Syst., vol. 27, no. 9, pp.902 908, Sep. 2000. [6] M.Young,TheTechnicalWriter'sHandbook.MillValley,CA:University Science,1989. [7] Young-Ho Seo ; Kwangwoon Univ., Seoul, South Korea; Dongwook Kim. (2010), A New VLSI Architecture of Parallel Multiplier Accumulator Based on Radix-2 Modified Booth Algorithm, Very Large Scale Integration (VLSI) Systems, IEEE Transactions, vol.18, Issue.2, pp.201 208. [8] AvisekSen, ParthaMitra, DebarshiDatta. (2013), Low Power MAC Unit for DSP Processor, International Journal of Recent Technology and Engineering (IJRTE) Volume.1, Issue.6, pp. 93 95. [9] M.Jayaprakash, M.PeerMohamed,Dr.A.Shanmugam. (2013), Low Power and Area Efficient Multiplier for MAC, International Journal of Advanced Research in Electronics and Communication Engineering (IJARECE) Volume 2, Issue 11,pp.888-893. IX. CONCLUSION Higher radix MBA and partial reduction technique gives good result in terms of speed as well as area. Radix-4 modified booth algorithm reduces the number of partial products, which improves the speed. By removing independent accumulation process that has the largest delay and merging it to the compression process of the partial products, the overall MAC performance has been improved. Also by using Spurious Power Suppression Technique, power has been reduced by eliminating ineffective portion of data. REFERENCES [1] A. R. Omondi, Computer Arithmetic Systems. Englewood Cliffs, NJ: Prentice-Hall, 1994. [2] G. Goto, T. Sato, M. Nakajima, and T. Sukemura, A 54*54 regularstructured tree multiplier, IEEE J. Solid-State Circuits, vol. 27, no. 9,pp. 1229 1236, Sep. 1992. [3] J. Fadavi-Ardekani, M*N Booth encoded multiplier generator usingoptimizedwallace trees, IEEE Trans. Very Large Scale Integr. (VLSI)Syst., vol. 1, no. 2, pp. 120 125, Jun. 1993. ABOUT THE AUTHORS R.Mohanapriya is currently pursuing her M.E degree in VLSI Design from Anna University, Chennai in Knowledge Institute of Technology, Salem. She has received her B.E degree in Electronica and Communication from Anna University, Chennai in Gnanamani college of Technology,Namakkal. She is a member in IEEE. K.Rajesh,Assistant Professor in the Department of ECE in Knowledge Institute of Technology,Salem.He received received his B.E (ECE) in Sona College of Technology, Salem and M.E (VLSI Design) in Bannari Amman Institute of Technology, Sathyamangalam. Research interest in device modeling in VLSI,Anolgdesign.He is a member in IEEE. http://www.giapjournals.org/ijsrtm.html 263