An Efficient Baugh-WooleyArchitecture forbothsigned & Unsigned Multiplication

Similar documents
Modified Design of High Speed Baugh Wooley Multiplier

HIGH PERFORMANCE BAUGH WOOLEY MULTIPLIER USING CARRY SKIP ADDER STRUCTURE

Design of Baugh Wooley Multiplier with Adaptive Hold Logic. M.Kavia, V.Meenakshi

SIGNED PIPELINED MULTIPLIER USING HIGH SPEED COMPRESSORS

DESIGN OF FIR FILTER ARCHITECTURE USING VARIOUS EFFICIENT MULTIPLIERS Indumathi M #1, Vijaya Bala V #2

High Speed Vedic Multiplier Designs Using Novel Carry Select Adder

Tirupur, Tamilnadu, India 1 2

A Fixed-Width Modified Baugh-Wooley Multiplier Using Verilog

An Optimized Wallace Tree Multiplier using Parallel Prefix Han-Carlson Adder for DSP Processors

COMPARISION OF LOW POWER AND DELAY USING BAUGH WOOLEY AND WALLACE TREE MULTIPLIERS

AREA EFFICIENT DISTRIBUTED ARITHMETIC DISCRETE COSINE TRANSFORM USING MODIFIED WALLACE TREE MULTIPLIER

CHAPTER 4 ANALYSIS OF LOW POWER, AREA EFFICIENT AND HIGH SPEED MULTIPLIER TOPOLOGIES

Keywords , IJARCSSE All Rights Reserved Page Lecturer, EN Dept., DBACER,

High Speed IIR Notch Filter Using Pipelined Technique

Faster and Low Power Twin Precision Multiplier

IJCSIET--International Journal of Computer Science information and Engg., Technologies ISSN

Design of an optimized multiplier based on approximation logic

DESIGN OF LOW POWER MULTIPLIER USING COMPOUND CONSTANT DELAY LOGIC STYLE

Design and Simulation of Convolution Using Booth Encoded Wallace Tree Multiplier

Comparative Study of Different Variable Truncated Multipliers

Reconfigurable High Performance Baugh-Wooley Multiplier for DSP Applications

CHAPTER 1 INTRODUCTION

Design of a Power Optimal Reversible FIR Filter ASIC Speech Signal Processing

[Krishna, 2(9): September, 2013] ISSN: Impact Factor: INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY

International Journal Of Scientific Research And Education Volume 3 Issue 6 Pages June-2015 ISSN (e): Website:

Innovative Approach Architecture Designed For Realizing Fixed Point Least Mean Square Adaptive Filter with Less Adaptation Delay

Design and Implementation of Wallace Tree Multiplier Using Kogge Stone Adder and Brent Kung Adder

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET) STUDY ON COMPARISON OF VARIOUS MULTIPLIERS

A New Architecture for Signed Radix-2 m Pure Array Multipliers

Implementation of Parallel Multiplier-Accumulator using Radix- 2 Modified Booth Algorithm and SPST

Mahendra Engineering College, Namakkal, Tamilnadu, India.

International Journal of Advanced Research in Biology Engineering Science and Technology (IJARBEST)

Review On Design Of Low Power Multiply And Accumulate Unit Using Baugh-Wooley Based Multiplier

A Novel High Performance 64-bit MAC Unit with Modified Wallace Tree Multiplier

Design and Implementation of High Speed Area Efficient Carry Select Adder Using Spanning Tree Adder Technique

An Efficient Design of Parallel Pipelined FFT Architecture

Implementation of 32-Bit Unsigned Multiplier Using CLAA and CSLA

JDT LOW POWER FIR FILTER ARCHITECTURE USING ACCUMULATOR BASED RADIX-2 MULTIPLIER

High Speed Binary Counters Based on Wallace Tree Multiplier in VHDL

Modified Booth Encoding Multiplier for both Signed and Unsigned Radix Based Multi-Modulus Multiplier

Design A Redundant Binary Multiplier Using Dual Logic Level Technique

Design and Characterization of 16 Bit Multiplier Accumulator Based on Radix-2 Modified Booth Algorithm

DESIGN OF LOW POWER MULTIPLIERS

A Review on Different Multiplier Techniques

ISSN Vol.03,Issue.02, February-2014, Pages:

Keywords: Column bypassing multiplier, Modified booth algorithm, Spartan-3AN.

Design and Analysis of Row Bypass Multiplier using various logic Full Adders

IMPLEMENTATION OF UNSIGNED MULTIPLIER USING MODIFIED CSLA

International Journal of Scientific & Engineering Research Volume 3, Issue 12, December ISSN

REALIZATION OF FPGA BASED Q-FORMAT ARITHMETIC LOGIC UNIT FOR POWER ELECTRONIC CONVERTER APPLICATIONS

JDT EFFECTIVE METHOD FOR IMPLEMENTATION OF WALLACE TREE MULTIPLIER USING FAST ADDERS

Performance Analysis of an Efficient Reconfigurable Multiplier for Multirate Systems

Design of Area and Power Efficient FIR Filter Using Truncated Multiplier Technique

Implementation of Booths Algorithm i.e Multiplication of Two 16 Bit Signed Numbers using VHDL and Concept of Pipelining

FPGA Implementation of Wallace Tree Multiplier using CSLA / CLA

VLSI Design and FPGA Implementation of N Binary Multiplier Using N-1 Binary Multipliers

A New High Speed Low Power Performance of 8- Bit Parallel Multiplier-Accumulator Using Modified Radix-2 Booth Encoded Algorithm

International Journal of Modern Trends in Engineering and Research

Performance Analysis of Multipliers in VLSI Design

An Optimized Implementation of CSLA and CLLA for 32-bit Unsigned Multiplier Using Verilog

VHDL based Design of Convolutional Encoder using Vedic Mathematics and Viterbi Decoder using Parallel Processing

DESIGN OF MULTIPLE CONSTANT MULTIPLICATION ALGORITHM FOR FIR FILTER

Comparative Analysis of Multiplier in Quaternary logic

High performance Radix-16 Booth Partial Product Generator for 64-bit Binary Multipliers

Design of 8-4 and 9-4 Compressors Forhigh Speed Multiplication

Design and Implementation of Truncated Multipliers for Precision Improvement and Its Application to a Filter Structure

Comparative Analysis of different Algorithm for Design of High-Speed Multiplier Accumulator Unit (MAC)

FOR HIGH SPEED LOW POWER APPLICATIONS USING RADIX-4 MODIFIED BOOTH ENCODER

Area Efficient and Low Power Reconfiurable Fir Filter

Reduced Complexity Wallace Tree Mulplier and Enhanced Carry Look-Ahead Adder for Digital FIR Filter

An Analysis of Multipliers in a New Binary System

Implementation of 32-Bit Carry Select Adder using Brent-Kung Adder

Design and Implementation Radix-8 High Performance Multiplier Using High Speed Compressors

A New network multiplier using modified high order encoder and optimized hybrid adder in CMOS technology

Design of Roba Mutiplier Using Booth Signed Multiplier and Brent Kung Adder

An Optimized Design for Parallel MAC based on Radix-4 MBA

Efficient Serial Multiplier Design using Ripple Counters,Kogge-Stone Adder and Full Adder

Globally Asynchronous Locally Synchronous (GALS) Microprogrammed Parallel FIR Filter

INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY

Design and Simulation of 16x16 Hybrid Multiplier based on Modified Booth algorithm and Wallace tree Structure

Digital Integrated CircuitDesign

Design and Implementation of Scalable Micro Programmed Fir Filter Using Wallace Tree and Birecoder

CHAPTER 2 LITERATURE SURVEY

Implementation and Performance Analysis of different Multipliers

Design and Analysis of CMOS Based DADDA Multiplier

AS growing demands on portable computing and communication

High Performance 128 Bits Multiplexer Based MBE Multiplier for Signed-Unsigned Number Operating at 1GHz

AN ADVANCED VLSI ARCHITECTURE OF PARALLEL MULTIPLIER BASED ON HIGHER ORDER MODIFIED BOOTH ALGORITHM

Class Project: Low power Design of Electronic Circuits (ELEC 6970) 1

A Survey on Power Reduction Techniques in FIR Filter

PERFORMANCE COMPARISION OF CONVENTIONAL MULTIPLIER WITH VEDIC MULTIPLIER USING ISE SIMULATOR

Design and implementation of LDPC decoder using time domain-ams processing

Sno Projects List IEEE. High - Throughput Finite Field Multipliers Using Redundant Basis For FPGA And ASIC Implementations

Design of a High Speed FIR Filter on FPGA by Using DA-OBC Algorithm

PIPELINED VEDIC MULTIPLIER

AREA AND POWER EFFICIENT CARRY SELECT ADDER USING BRENT KUNG ARCHITECTURE

VLSI Designing of High Speed Parallel Multiplier Accumulator Based On Radix4 Booths Multiplier

Reducing the Computation Time in Two s Complement Multipliers A. Hari Priya 1 1 Assistant Professor, Dept. of ECE,

ISSN: [Hamid* et al., 7(4): April, 2018] Impact Factor: 5.164

NOVEL HIGH SPEED IMPLEMENTATION OF 32 BIT MULTIPLIER USING CSLA and CLAA

Transcription:

An Efficient Baugh-WooleyArchitecture forbothsigned & Unsigned Multiplication PramodiniMohanty VLSIDesign, Department of Electrical &Electronics Engineering Noida Institute of Engineering & Technology 2011-2012 Email-id:Pramodini_swain@yahoo.co.in Pamina2009@gmail.com Abstract: RashmiRanjan VLSI Design, Department of Electrical & Electronics Engineering Noida Institute of Engineering & Technology 2011-2012 This project presents an efficient implementation of a high speed multiplier using the shift and adds method of Baugh-Wooley Multiplier. This parallel multiplier useslesser adders and lesser iterative steps. As a result of which they occupy lesser space as compared to the serial multiplier. This is very important criteria because in the fabrication of chips and high performance system requires components which are as small as possible. Experimental result demonstrate that the proposed circuit not only improves the accurate performance but also reduces the hardware complexity and also less power consumption that is dynamic power of 15.3mW and maximum clock period of 3.912ns is required which is very efficient as compared to the reference paper. Keywords: Baugh-Wooley Multiplier, Pipeline resister, PowerEfficient, Carry Save Adder. 1 Introduction Multiplication involves 2 basic operations: the generation of the partial product and their accumulation [5].Therefore, there are Possible ways to speed up themultiplication: reduces the complexity, and as a result reduces the time needed to accumulate the partial products.both solutions can be applied simultaneously. Baugh-WooleyTwo s Compliment Signed Multiplier:Two scompliments is the most popular method in representing signed integers in Computer sciences.it is also an operation of negation(converting positive to negative numbers or vice versa) in computers which represent negative numbers using two s compliments.its use is so wide today because it does not require the addition and subtraction circuitry to examine the signs of the operands to determine whether to add or subtract. Two s compliment and one s complimentrepresentations are commonly used since arithmetic units are simpler to design.fig 1 shows Two s compliment and one s complimentrepresentations. FIG1: Two s compliment & one s compliment representation ISSN : 2229-3345 Vol. 3 No. 4 April 2012 94

Baugh-Wooley Two s compliment Signed numbers: Baugh-Wooley Two s compliment Signed multipliers is the best known algorithm for signed multiplication because it maximizes the regularity of the multiplier and allow all the partial products to have positive sign bits[3].baugh Wooley technique was developed to design direct multipliers for Two s compliment numbers [9].When multiplying two s compliment numbers directly, each of the partial products to be added is a signed numbers. Thus each partial product has to be sign extended to the width of the final product in order to form a correct sum by the Carry Save Adder (CSA) tree. According to Baugh-Wooleyapproach, an efficient method of adding extra entries to the bit matrix suggested to avoid having deal with the negatively weighted bits in the partial product matrix. In fig1 (a) & (b)partial product arrays of 5*5 bits Unsigned and Signed bits are shown: FIG1 (a): 5*5 unsignedmultiplications FIG1 (b): 5*5 Signed Multiplication Figure 1 (c) shows how this algorithm works inthe case of a 5x5 multiplication. The first three rows are referred to as PM (partial products withmagnitude part) and generated by one NAND and three AND operations. The fourth row is called as PS (partial products with sign bit) and generated by one AND andthree NAND operations with a sign bit. Consider the partialproducts of PM. Suppose b 2 = b 0 in figure1 (c). Then the third row can be obtained by shifting the first rowby 2 bits. Likewise, shift operation can be used to obtain a partial product of different bit level as in sign magnitude multiplication. FIG1 (c): 5*5 Multiplication Example of Baugh-WooleyArchitecture Baugh-Wooley schemes becomean area consuming when operands are greater than or equal to 32 bits. The rest of the paper is organised as follows. The baugh-wooley architecture is explained in section 2. Implementation results in terms of power, area, and speed 4 bit multipliers and comparison are presented. 2 Baugh-Wooley Architecture Hardware architecture for Baugh-Wooleymultiplier is shown in fig 2.It follows left shift ISSN : 2229-3345 Vol. 3 No. 4 April 2012 95

algorithm. Through mux we can select which bitwill multiply. Fig 2: Signed 2 s-complement Baugh-Wooley Hardware Multiplier Suppose we are adding +5 and -5 in decimal we get 0. Now, represent these numbers in 2 s complement form, and then we get +5 as 0101 and -5 as 1011. On adding these two numbers we get 10000. Discard carry, then the number is represented as 0. Baugh-WooleyMultiplier [5]: Baugh-Wooley Multiplier is used for both unsigned and signed number multiplication. Signed Number operands which are represented in 2 s complemented form. Partial Products are adjusted such that negative sign move to last step, which in turn maximize the regularity of the multiplication array. Baugh-Wooley Multiplier operates on signed operands with 2 s complement representation to make sure that the signs of all partial products are positive. Fig 2: Block diagram of a 4*4 Baugh-Wooley multiplier Here are using fewer steps and also lesser adders. Here a0, a1, a2, a3& b0, b1, b2, b3 are the inputs. I am getting the outputs that are p0, p1... p7. As I am using pipelining resister in this architecture,so it will take less time to multiply large number of 2 s compliment but less than 32 bit.above 32 bit Modified Baugh-Wooley Multiplier is used. ISSN : 2229-3345 Vol. 3 No. 4 April 2012 96

Multiplier Architecture Number of LUTS Fan out Clock- Period (ns) Power Dissipation (mw) Add-and- Shift Baugh- Wooley 74 18 8.939 68.89 32 104 15.029 67.84 Table 1: Synthesis results of different 4-bit pipelined multiplier architectures Multiplier Architecture Add-and- Shift Baugh- Wooley Number of LUTS Fan out Clock- Period (ns) 74 18 8.939 68.89 30 46 3.921 15.3 Power Dissipation (mw) Table 2: My Synthesis results of different 4-bit pipelined multiplier architectures 3 Implementation Results and Comparisons In this study, I use 4-bit pipelined multipliers and are implemented in VHDL and logic simulation is done by using ModelSim Designer and the synthesis is done using Xilinx ISE 8.2i of Device 4VFX20FF672-12.The synthesis result of 4-bit pipelined Baugh-Wooley architecture is shown in table above of the reference paper and my paper. In my paper I amusing Brent-Kung adder (BK adder), an advanced design prefixadder, which is a very good balance between area and power cost and also it will present better performance. This adder has a complex carry and inverse carry tree. A tree can be divided into 2 types that is a tree and an inverse tree. The upper tree based on periodic power of 2. The inverse tree is offset 1, beginning from the bottom of the matrix and expanding outwards at powers of 2. Fig 3: Synthesis Report of Baugh-Wooley Architecture The results show that the Baugh-Wooley multiplierhas increased speed since clock period isonly15.861ns. Pipeline stages further improve thebaugh - Wooley architecture speed. Number of LUTs represents the area required for implementation. Thenumber oflutsrequiredin Baugh-Wooley architecture is 30 compared to 32. The fan-out of the multiplier architectureis also given which directly gives the possibility of the multiplier to form large circuits. This can be extended to the pipelined multiplier architecture also to verify the parameters. Latency and speed are the ISSN : 2229-3345 Vol. 3 No. 4 April 2012 97

important factors with pipelining under consideration. The synthesis results of 4-bit pipelined multipliers are shown in Table 2.Power consumption in Baugh-Wooley multipliers isminimum compared to other conventionalmultiplier units. So it clears that the signed binary multiplicationthrough Baugh- Wooleymultiplication is suited for large multiplier implementation. Theimprovements in constraint can be used to make Baugh-Wooley multiplier more efficient.the fan-out of the multiplier architecturesare also given which directly gives the possibility of themultiplier to form large circuits. This can be extended tothe pipelined multiplier architecture also to verify theparameters. Latency and speed are the important factors with pipelining under consideration. The synthesis resultsof 4-bitpipelined multipliers are shown in Table 2. Thepipeline constraint increases the speed of the multiplier considerably with a increase in powerconsumption. For the Baugh-Wooley multipliers, the clock period reduces to 3.321ns as a result of pipelineregisters implemented. This improves the speed whichmay reduce due to the BK adder which I used in my architecture. The maximum delay for this architecture is 2.143ns.i am using 65 Flip Flop out of 17088 and maximum frequency is 527.037MHZ which is a good sign.the incorporation of the pipeline multipliers thus canbe effectively done to make the chip efficientlyreconfigurable among the two reconfiguration modes andthis work is in progress. The possibility of other reconfiguration constraints is under work and theimplementation of the reconfiguration modes accordingto these constraints are the future work. Output Simulation Result For Baugh- Wooley Architecture: Simulation Result For Both Unsigned Numbers Multiplication ISSN : 2229-3345 Vol. 3 No. 4 April 2012 98

Simulation Result For Unsigned and Signed Number Multiplication 4 Conclusion An efficient multiplier todeal with the latency problemis proposed. This paperpresents acomparison between various multiplierarchitectures with area, speed and power as the mainconstraints. It is observed that the Baugh- Wooleyarchitecturegives optimized values for variousconstraints andhence suited for long bitmultiplication up to less than 32 bit. The pipelining resister techniques are used to improve themultiplier characteristics. REFERENCES: [1] Power Reduction Techniques for Ultra-Low-Power Solutions by Virage Logic Corporation. [2] R.M.Badghare, S.K.Mangal, R.B.Deshmukh, R.M.Patrikar (2009), Design of Low Power Parallel Multiplier, Journal of Low Power Electronics, Volume 5, Number 1, April 2009, 31-39. [3] A. Dandapat, S. Ghosal, P. Sarkar, D. Mukhopadhyay (2009), A 1.2- ns16 16-Bit Binary Multiplier Using. High Speed Compressors, International Journal of Electrical, Computer, and Systems Engineering, 2009, 234-239. [4] K.Z. Pekmestzi, "Complex Number Multipliers" IEE Proceedings (Computers and Digital Technology), Vol. 136, No. 1, 1989, pp. 70-75. [5] Jin-HaoTu and Lan-Da Van, Power-Efficient Pipelined Reconfigurable Fixed-Width Baugh-Wooley Multipliers IEEE Transactions on computers, vol. 58, No. 10, October 2009. [6] C. R. Baugh and B. A. Wooley, A Two s Complement Parallel Array Multiplication Algorithm, IEEE Transactions on Computers, vol. 22, pp. 1045 1047, December 1973. [7] H. Eriksson, P. Larsson-Edefors, M.Sheeran, M.Själander, D. Johansson, and M.Schölin, Multiplier Reduction Tree with Logarithmic Logic Depth and Regular Connectivity, in IEEE International Symposium on Circuits and Systems, May 2006. [8] E.E.Swartzlander, Jr., Truncated multiplication with approximate rounding, in Proc. 33rd Asilomar Conference on Signals, Systems, and Computers, 1999, vol. 2, pp. 1480-1483 [9] J. Di and J. S. Yuan, Run-time reconfigurable power-aware pipelined signed array multiplier design, in Proc. IEEE International Symposium on Signals Circuits, and Systems, July 2003, vol. 2, pp. 405-406 [10] S.Krithivasan, M. J. Schulte, and J. Glossner, A sub word-parallel multiplication and sum-of-squares unit, IEEE Comp. society Annual Symposium on VLSI, pp. 273-274, Feb. 2004 ISSN : 2229-3345 Vol. 3 No. 4 April 2012 99