Efficient Shift-Add Multiplier Design Using Parallel Prefix Adder

Similar documents
Implementation of Parallel Multiplier-Accumulator using Radix- 2 Modified Booth Algorithm and SPST

An Optimized Wallace Tree Multiplier using Parallel Prefix Han-Carlson Adder for DSP Processors

Design Of 64-Bit Parallel Prefix VLSI Adder For High Speed Arithmetic Circuits

ISSN:

FOR HIGH SPEED LOW POWER APPLICATIONS USING RADIX-4 MODIFIED BOOTH ENCODER

A LOW POWER MULTIPLIER USING ENCODING AND BYPASSING TECHNIQUE

A Survey on A High Performance Approximate Adder And Two High Performance Approximate Multipliers

Design of Roba Mutiplier Using Booth Signed Multiplier and Brent Kung Adder

Performance Analysis of Multipliers in VLSI Design

An Optimized Design of High-Speed and Energy- Efficient Carry Skip Adder with Variable Latency Extension

Design of an optimized multiplier based on approximation logic

LOW POWER MULTIPLIER USING BYPASSINGZERO ARCHITECTURE

SIGNED PIPELINED MULTIPLIER USING HIGH SPEED COMPRESSORS

VLSI Design and FPGA Implementation of N Binary Multiplier Using N-1 Binary Multipliers

A New network multiplier using modified high order encoder and optimized hybrid adder in CMOS technology

High Speed Binary Counters Based on Wallace Tree Multiplier in VHDL

Modified Booth Encoding Multiplier for both Signed and Unsigned Radix Based Multi-Modulus Multiplier

Design of Efficient 32-Bit Parallel PrefixBrentKung Adder

Design and Implementation of High Speed Carry Select Adder

By Dayadi Lakshmaiah, Dr. M. V. Subramanyam & Dr. K. Satya Prasad Jawaharlal Nehru Technological University, India

Design of High Speed Carry Select Adder using Spurious Power Suppression Technique

A Novel Approach For Designing A Low Power Parallel Prefix Adders

Design of Low Power Column bypass Multiplier using FPGA

A High Speed Wallace Tree Multiplier Using Modified Booth Algorithm for Fast Arithmetic Circuits

A Novel High-Speed, Higher-Order 128 bit Adders for Digital Signal Processing Applications Using Advanced EDA Tools

Design of Baugh Wooley Multiplier with Adaptive Hold Logic. M.Kavia, V.Meenakshi

VLSI IMPLEMENTATION OF ARITHMETIC OPERATION

An Optimized Design for Parallel MAC based on Radix-4 MBA

Pipelined Linear Convolution Based On Hierarchical Overlay UT Multiplier

Comparison of Conventional Multiplier with Bypass Zero Multiplier

MODIFIED UNIVERSAL SHIFT REGISTER BASED LOW POWER MULTIPLIER ARCHITECTURE

DESIGN OF HIGH PERFORMANCE MODIFIED RADIX8 BOOTH MULTIPLIER

Design and Analysis of Row Bypass Multiplier using various logic Full Adders

CHAPTER 4 ANALYSIS OF LOW POWER, AREA EFFICIENT AND HIGH SPEED MULTIPLIER TOPOLOGIES

An Efficient Design of Low Power Speculative Han-Carlson Adder Using Concurrent Subtraction

Power Efficient Weighted Modulo 2 n +1 Adder

Parallel Prefix Han-Carlson Adder

On Built-In Self-Test for Adders

A Novel Approach to 32-Bit Approximate Adder

Design and Implementation of High Speed Carry Select Adder Korrapatti Mohammed Ghouse 1 K.Bala. 2

Reduced Complexity Wallace Tree Mulplier and Enhanced Carry Look-Ahead Adder for Digital FIR Filter

PROMINENT SPEED ARITHMETIC UNIT ARCHITECTURE FOR PROFICIENT ALU

Design and Implementation of Truncated Multipliers for Precision Improvement and Its Application to a Filter Structure

International Journal of Advanced Research in Computer Science and Software Engineering

DESIGN OF PARALLEL MULTIPLIERS USING HIGH SPEED ADDER

Design and Comparative Analysis of Conventional Adders and Parallel Prefix Adders K. Madhavi 1, Kuppam N Chandrasekar 2

2. URDHAVA TIRYAKBHYAM METHOD

Anitha R 1, Alekhya Nelapati 2, Lincy Jesima W 3, V. Bagyaveereswaran 4, IEEE member, VIT University, Vellore

A Novel 128-Bit QCA Adder

A Taxonomy of Parallel Prefix Networks

A NOVEL IMPLEMENTATION OF HIGH SPEED MULTIPLIER USING BRENT KUNG CARRY SELECT ADDER K. Golda Hepzibha 1 and Subha 2

Implementation of Efficient 16-Bit MAC Using Modified Booth Algorithm and Different Adders

Design and Implementation of 64-bit MAC Unit for DSP Applications using verilog HDL

Low-Power Multipliers with Data Wordlength Reduction

DESIGN OF LOW POWER HIGH SPEED ERROR TOLERANT ADDERS USING FPGA

Area and Delay Efficient Carry Select Adder using Carry Prediction Approach

DESIGN OF EFFICIENT MULTIPLIER USING ADAPTIVE HOLD LOGIC

A New High Speed Low Power Performance of 8- Bit Parallel Multiplier-Accumulator Using Modified Radix-2 Booth Encoded Algorithm

Design of Signed Multiplier Using T-Flip Flop

ISSN: X International Journal of Advanced Research in Electronics and Communication Engineering (IJARECE) Volume 1, Issue 5, November 2012

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

ENHANCING SPEED AND REDUCING POWER OF SHIFT AND ADD MULTIPLIER

Implementation and Performance Analysis of different Multipliers

Binary Adder- Subtracter in QCA

Design and Implementation of High Speed Area Efficient Carry Select Adder Using Spanning Tree Adder Technique

PERFORMANCE COMPARISON OF HIGHER RADIX BOOTH MULTIPLIER USING 45nm TECHNOLOGY

FPGA Implementation of Wallace Tree Multiplier using CSLA / CLA

Modified Partial Product Generator for Redundant Binary Multiplier with High Modularity and Carry-Free Addition

Design and Implementation of a delay and area efficient 32x32bit Vedic Multiplier using Brent Kung Adder

AREA AND DELAY EFFICIENT DESIGN FOR PARALLEL PREFIX FINITE FIELD MULTIPLIER

Key words High speed arithmetic, error tolerant technique, power dissipation, Digital Signal Processi (DSP),

AN ADVANCED VLSI ARCHITECTURE OF PARALLEL MULTIPLIER BASED ON HIGHER ORDER MODIFIED BOOTH ALGORITHM

A Low Complexity and Highly Robust Multiplier Design using Adaptive Hold Logic Vaishak Narayanan 1 Mr.G.RajeshBabu 2

Design and Implementation of High Radix Booth Multiplier using Koggestone Adder and Carry Select Adder

Design and Implementation of Complex Multiplier Using Compressors

Efficient Implementation of Parallel Prefix Adders Using Verilog HDL

Design and Implementation of Hybrid Parallel Prefix Adder

Analysis of Parallel Prefix Adders

Design and implementation of Parallel Prefix Adders using FPGAs

II. Previous Work. III. New 8T Adder Design

HIGH SPEED FIXED-WIDTH MODIFIED BOOTH MULTIPLIERS

Design and Characterization of Parallel Prefix Adders using FPGAs

High performance Radix-16 Booth Partial Product Generator for 64-bit Binary Multipliers

Keywords: Column bypassing multiplier, Modified booth algorithm, Spartan-3AN.

Compressor Based Area-Efficient Low-Power 8x8 Vedic Multiplier

Implementation of 256-bit High Speed and Area Efficient Carry Select Adder

Implementation of a High Speed and Power Efficient Reliable Multiplier Using Adaptive Hold Technique

High Performance 128 Bits Multiplexer Based MBE Multiplier for Signed-Unsigned Number Operating at 1GHz

Area Delay Efficient Novel Adder By QCA Technology

Review Paper on an Efficient Processing by Linear Convolution using Vedic Mathematics

Modified Booth Multiplier Based Low-Cost FIR Filter Design Shelja Jose, Shereena Mytheen

An Area Efficient FFT Implementation for OFDM

Tirupur, Tamilnadu, India 1 2

SYNTHESIS OF ADDER CIRCUIT USING CARTESIAN GENETIC PROGRAMMING

An Design of Radix-4 Modified Booth Encoded Multiplier and Optimised Carry Select Adder Design for Efficient Area and Delay

Design and Implementation of Wallace Tree Multiplier Using Kogge Stone Adder and Brent Kung Adder

Design A Redundant Binary Multiplier Using Dual Logic Level Technique

COMPARISION OF LOW POWER AND DELAY USING BAUGH WOOLEY AND WALLACE TREE MULTIPLIERS

Ajmer, Sikar Road Ajmer,Rajasthan,India. Ajmer, Sikar Road Ajmer,Rajasthan,India.

International Journal Of Scientific Research And Education Volume 3 Issue 6 Pages June-2015 ISSN (e): Website:

Transcription:

IJCTA, 9(39), 2016, pp. 45-53 International Science Press Closed Loop Control of Soft Switched Forward Converter Using Intelligent Controller 45 Efficient Shift-Add Multiplier Design Using Parallel Prefix Adder Rohan Pinto* and Kumara Shama* Abstract : The overall performance of any DSP system depends on the performance of the arithmetic unit. Multiplication is one of the pivotal operation in it. The speed and area of the multiplier is always a matter of concern for the better performance of any processor. In this paper a simple and efficient multiplier for 8-bit and 16-bit have been proposed. The proposed structure is a modified version of Bypass Zero, Feed A Directly (BZ-FAD) multiplier. It has been implemented on FPGA, Spartan 6 device. This structure has low power and area because of its uniqueness in using the proposed parallel prefix structure for addition of partial products in multiplier. The proposed multiplier lowers the switching activity by 55% and area by 64%. Keywords : BZ-FAD; Parallel prefix adder; Ling adder; FPGA implementation. 1. INTRODUCTION Multiplication and addition are most common and heavily used arithmetic operations that figure out to be important in all digital signal processing applications. Multiplication is hardware concentrated. Major part of the researcher s interest lies in its high speed, low power and low area. Multiplication is repeated form of shifting and addition operation. The main concern in shift-add multiplication is to speed up the partial product addition. Variety of multiplication algorithms and designs have been discussed in the past. Shen and Chen [1] proposed a low power 16-bit multiplier. It reduced the switching activities taking place inside the multiplier than the conventional multiplier with a reasonable increase in area. Chen et al. [2] proposed 16-bit row based, column based and hybrid based multipliers that dissipate less power. Partial product count were reduced by booth encoding which in turn reduced the switching activities. Wang et al. [3] developed a fixed width multiplier using left to right algorithm. It reduced the partial product leading to low power. Wang and Sung [4] proposed 8-bit low power multiplier using bypassing technique. Power saving was upto 75% at the cost of large area. Huang and Milos [5] designed different structures of linear array multipliers. The structures led to high performance and low power. Chen and Chu [6] applied spurious power suppression technique (SPST) on multiplier which led to low power performance and high speed as compared to other multipliers. Mottaghi et al. [7] proposed a structure, Bypass zero, feed A directly to reduce the switching activities during multiplication. The structure had low power and area. Marimuthu and Thangaraj [8] implemented BZ-FAD structure using latches and flip-flops to reduce the switching activities in multiplier. Vijaykumar and Sumathy [9] designed an error tolerant 8-bit shift-add multiplier. It was a low power and high speed structure. Valan and Baulkani [10] developed a shift-add multiplier structure using modified universal shift register and Johnson counter. It reduced the switching activities in the multiplier as compared to conventional multiplier. Liu et al. [11] proposed an approximate multiplier for high performance application. It reduced the power dissipation and critical path delay. D Nikolos et al. [12] presented a hybrid prefix adder that combined conventional and ling carry computation. Giorgos and Nikolos [13] proposed a high speed parallel prefix adder. Ling adder equations were modified * Manipal Institute of Technology, Manipal-576 104, Karnataka, India E-Mail: rohan.pinto@yahoo.co.in

Mux M2 46 Rohan Pinto and Kumara Shama and incorporated in the structure. Poornima and Kanchana [14] developed a novel structure by combining two other prefix adders proposed by Ladner-Fischer [15] and Kogge-Stone [16]. In this paper a modified BZ-FAD structure based on shift-add multiplication is proposed. This structure has marginally increased speed with low power and area because of the elimination of some of the components from the conventional BZ-FAD structure [7]. The proposed structure also has an efficient adder being used for addition operation which increases the speed. The proposed 8-bit and 16-bit multipliers have been implemented using XILINX spartan 6 FPGA and it has been noted that the proposed adder outperforms other adders in terms of speed, area and power dissipation. Rest of the paper is organized as follows: section 2 give the details of shift add multipliers. Section 3 introduces parallel prefix adders used for shift-add multiplication. Section 4 discusses the results, and conclusions are drawn in section 5. 2. SHIFT-ADD MULTIPLIERS A. Bypass Zero, Feed A Directly (BZ-FAD) Multiplier Mottaghi et al. [7] proposed bypass zero, feed A directly (BZ-FAD) multiplier structure depicted in Fig. 1. Here the switching activities in the multiplier were reduced which led to low power. The conventional shift-add multiplier which multiplies two number X and Y undergoes six switching activities. 1) The switching activity when the multiplier Y is shifted 2) Switching activity in the counter 3) Switching activity in the adder 4) Switching activity in the multiplexer that selects either 0 or X for addition 5) Switching activity in the multiplexer select line which is controlled by 0 th bit of Y 6) Switching activity due to shifting of partial product. A Adder A Mux M3 Bypass Mux M1 D-FF PP(k to 1) PP(0) Ring Counter Feeder P-Latch Clock Figure 1: Bypass Zero, Feed A Directly (BZ-FAD) Multiplier [7] BZ-FAD structure reduced the switching activities in the multiplier. Instead of shifting the multiplier bits to right every time as done in conventional multiplier to check whether the 0 th bit of multiplier is 0 or 1, in BZ-FAD one hot encoded bus selector chooses the hot bit of Y in every cycle. A low power ring counter was used to select the required bits in each cycle. This reduced the switching activity in multiplier.

Efficient Shift-Add Multiplier Design Using Parallel Prefix Adder When the 0 th bit of the multiplier is zero, then zero is added to the previous partial product and if it is one then multiplicand X is adder to previous partial product in conventional multipliers, this increases the switching activity in the adder, instead in BZ-FAD addition of zero was skipped when multiplier 0 th bit was zero using feeder and bypass register. After the generation of each partial product in every cycle the 0 th bit of the partial product is not processed further. It forms the respective final product bit. P-latch was used to store the LSB bits of the product which were obtained in the first few cycles. A low power ring counter was used to open up the respective latch to store the product LSB bits. MSB bits of the product were stored in the feeder register. This eliminated the process of shifting the partial product in every cycle for processing as done in conventional multiplier. A. Proposed Bypass Zero, Feed A Directly Multiplier The proposed structure as shown in Fig. 2 further reduces the switching activity taking place in the multiplication operation thereby reducing power. Few of the components are eliminated from the conventional BZ-FAD Multiplier, thereby reducing the delay. The process of reducing the switching activity is explained below. Shifting of multiplier bits : In the structure proposed by Mottaghi et al. [7] two multiplexer, D flip flop and low power ring counter was used to select the 0 th bit of the multiplier. These components can be eliminated and instead AND gates can be used to check the content of the multiplier. This reduces the area to a larger extent. Shifting of multiplier bits are eliminated thereby reducing the switching activity. Constant 47 Add Mux Prod(0) Prod(k to 1) Prod reg Register 1 Multiplier Figure 2: Modified Bypass Zero, Feed A Directly (BZ-FAD) Multiplier Activities of adder : In BZ-FAD feeder and bypass registers were used to skip the addition of zero to the previous partial product when the multiplier 0 th bit was zero. In proposed structure only one register is used to store the intermediate result. The register contents are fed back to the multiplexer when the 0 th bit of the multiplier is zero and when the 0 th bit of the multiplier is one then the register contents are fed to the adder. This reduces the activity in the adder where addition is done only when the 0 th bit of the multiplier is one.

48 Rohan Pinto and Kumara Shama Shifting of partial product : Here in the proposed structure, ring counter existing in Fig. 1 is eliminated that opens up the latch to store the LSB of product register. Instead 0 th bit of the partial product moves to the respective position in the product register and remaining bits of the partial product are shifted and moved back to register for further processing. This completely removes the shifting of partial product as done in conventional shift-add multiplier thereby reducing the switching activities in the partial product. 3. PARALLEL PREFIX ADDER Multiplication can be done by cumulative partial product and successively adding it to properly shifted term. Addition is the fundamental operation in multiplication. A fast and area efficient multiplier is highly influenced by the performance of the adder. Hence an efficient parallel prefix 8-bit and 16-bit adder based on modified ling equation is proposed to be used in the modified BZ-FAD multiplier. This reduces the area and power of the proposed multiplier. Ling [17] proposed a modified equation of carry look ahead to attain a significant saving in the hardware. The technique depends on calculating pseudo carry H i instead of conventional carry c i. This technique saves one logic level at each bit position. Although computation of pseudo carry is simpler compared to conventional carry, but the sum calculation is complicated. Hence ling equation is modified to generate the real carry c i out of the pseudo carry H i, so that the final sum calculation be reduced to simple XOR operation. Consider the pseudo carry equation for 4 th bit position. H 4 = g 4 + g 3 + p 3. g 2 + p 3. g 1 + p 3. p 1. g 0 (1) Since, g i = g i. p i, eq. (1) can be rewritten as H 4 = (g 4 + g 3 ) + p 3. (g 2 + g 1 ) + p 3. p 1. p 0. g 0 * * Let, G i and P i be the intermediate generate and intermediate propagate bits respectively [13] given as: * * G i = g i + g i + 1 and P i = p i. p i 1, 0 i n 1. * * With g 1 = p 1 = 0 and G k = P k = 0, for k < 0; H 4 = G 4:3 + P 3:2. G 2:1 + P 3:2. P 1:0. G 0: 1 The associative operator associates pairs of generate and propagate bits as (g i, p i ) (g i, p i ) = (g i + p i. g i, p i. p i ). Therefore H 4 can now be rewritten using the associative operator as H 4 = (G 4:3, P 3:2 ) (G 2:1, P 1:0 ) (G 0: 1, P 1: 2 ) * (G 4*, P 3 ) (G 2*, P 1* ) (G 0*, P * ) (2) 1 The pseudo carries of even H i and odd H i + 1 indexed bit position are given as H i = (G i*, P * ) i 1 (G*, i 2 P* )... (G, P i 3 0* * ) 1 H i + 1 = (G *, P * ) (G *, i + 1 i i 1 P* )... (G *, P i 2 1 0* ) The real carries are now expressed as: c 0 = H 0. p 0 c 1 = H 1. p 1 c 2 = H 2 c 3 = H 3. p 3 c 4 = (G 4:3 + P 3:2. G 2:1 ) p 4 c 5 = (G 5:3 + P 4:3. G 3:0 ) p 5 c 6 = (G 6:3 + P 5:2. G 2:1 ) p 6 c 7 = (G 7:3 + P 6:3. G 3:0 ) p 7 The proposed 8-bit parallel prefix adder based on modified ling equation is shown in Fig. 3.

Efficient Shift-Add Multiplier Design Using Parallel Prefix Adder The white square in Fig. 3 calculates generate bit, propagate bit and half sum bit as shown in Fig. 4(a). Black square provides intermediate generate and propagate bits shown in Fig. 4(b). Group generate and group propagate bits are calculated from intermediate generate and propagate bits as shown in Fig. 5. The proposed prefix cell which calculates the real carry is shown in Fig. 6(a). The white circle with an alphabet A in it is the prefix cell which generates carry for the lower order bits from 0 to 3 for 8-bit adder. This is shown in Fig. 6(b). 49 Figure 3: Proposed 8-bit parallel prefix adder (a) (b) Figure 4: (a) Generate, propagate and half sum computing node. (b) Intermediate generate and propagate computing node [13]

50 Rohan Pinto and Kumara Shama Figure 5: Group generate and propagate computing node [18] (a) (b) Figure 6: (a) Proposed prefix cell for computing real carries. (b) Prefix cell for computing real carry [19] Proposed 16-bit parallel prefix adder based on modified ling equation is shown in Fig. 7. These adders are area and power efficient. This increases the efficiency of the multiplier to further extent. 4. RESULTS The proposed multiplier was modelled in VHDL, simulated by ISE simulator (ISim) and synthesized using Xilinx Synthesis Technology (XST) tool. Power dissipation was analyzed using Xpower Analyzer. The obtained results were compared with 8-bit BZ-FAD multiplier structure proposed by Marimuthu and Thangaraj [8] and Conventional shift-add multiplier in Table 1. It shows that the proposed structure is efficient in all parameters. Table 1 Area, power and delay comparison of 8-bit multipliers Structure Delay (ns) Power(mW) Area(slices) Proposed Structure 8-bit 21.74 14.0 119 BZ-FAD, 8-bit [8] 48.81 105.0 334 Conventional multiplier, 8-bit [8] 151.1 151.1 662

Efficient Shift-Add Multiplier Design Using Parallel Prefix Adder The proposed structure was also compared with other multipliers proposed by Valan and Baulkani [10], 16-bit BZ-FAD structure implemented in FPGA [9], error tolerant shift-add multiplier proposed by Vijaykumar and Sumathy [9] and high performance error tolerant multiplier proposed by Liu et al. [11]. Table 2 and Table 3 give power and delay comparison of 8-bit and 16-bit multipliers respectively. Comparison results show that the proposed structure is area and power efficient. a 15 b 15 a 14 b 14 a 13 b 13 a 12 b 12 a 11 b 11 a 10 b 10 a 9 b 9 a 8 b 8 a 7 b 7 a 6 b 6 a 5 b 5 a 4 b 4 a 3 b 3 a 2 b 2 a 1 b 1 a 0 b 10 51 Figure 7: Proposed 16-bit parallel prefix adder Table 2 Power and delay comparison of 8-bit multipliers Structure Power (mw) Delay (ns) Proposed Structure 8-bit 14.0 21.74 Valan [10] 172.0 Vijaykumar [9] 228.0 49 BZ-FAD [9] 271.0 61 Conventional Multiplier [9] 295.0 95 Table 3 Power and delay comparison of 16-bit multipliers Structure Power (mw) Delay (ns) Proposed Structure 16-bit 14.0 46.5 Valan [10] 177.0 BZ-FAD [10] 271.0 Liu et al. [11] 150.0 13.9 Table 4 shows the area, delay and power of proposed 8-bit and 16-bit modified BZ-FAD multiplier.

52 Rohan Pinto and Kumara Shama Table 4 Area, power and delay of proposed 8-bit and 16-bit multipliers 5. CONCLUSION Multiplier bits Area(slices) Delay(ns) Power(mW) 8-bit 119 21.7 14.0 16-bit 494 46.5 14.0 In this paper an area-power efficient modified BZ-FAD multiplier for 8-bit and 16-bit is proposed. The proposed multiplier is fast and efficient because of the parallel prefix adder design which does the carry propagation quickly. Here 8-bit and16-bit adder design based on modified ling equation is also proposed. When compared to conventional BZ-FAD 8-bit multiplier [8] power saving is significant and delay is lowered by 55%. 6. REFERENCES 1. Nan-Ying Shen and Oscal T-C. Chen, Low-power multipliers by minimizing switching activities of partial products, Proc. IEEE Int. Symp. Circuits and Systems (ISCAS), Vol. 4, pp. 93-96, 2002. 2. Oscal T-C Chen, Sandy Wang and Yi-Wen Wu, Minimization of Switching Activities of Partial Products for Designing Low-Power Multipliers, IEEE Trans. Very Large Scale Integration (VLSI) Systems, Vol. 11, No. 3, pp. 418 433, 2003. 3. Jinn-Shyan Wang, Chien-Nan Kuo and Tsung-Han Yang, Low-power fixed width array multipliers, Proc. IEEE Int. Symp. Low Power Electronics and Design, ISLPED 04, pp. 307 312, 2004. 4. Chua-Chin Wang and Gang-Neng Sung, Low-Power Multiplier Design Using a Bypassing Technique, J Sign Process System, Vol. 57, pp. 331 338, 2009. 5. Zhijun Huang and Milos D. Ercegovac, High-Performance Low-Power Left-to-Right Array Multiplier Design, IEEE Trans. Computers, Vol. 54, No. 3, pp. 272 283, 2005. 6. Kuan-Hung Chen and Yuan-Sun Chu, A Low-Power Multiplier with the Spurious Power Suppression Technique, IEEE Trans. on Very Large Scale Integration (VLSI) Systems, Vol. 15, No. 7, pp. 846 850, 2007. 7. M. Mottaghi-Dastjerdi, A. Afzali-Kusha and M. Pedram, BZ-FAD: A Low-Power Low-Area Multiplier Based on Shift-and-Add Architecture, IEEE Trans. Very Large Scale Integration (VLSI) Systems, Vol. 17, No. 2, pp. 302 306, 2009. 8. C. N. Marimuthu and P. Thangaraj, Low Power Multiplier Design Using Latches and Flip-Flops, J. Computer Science, Vol. 6, No. 10, pp. 1117-1122, 2010. 9. K. N. Vijaykumar and V. Sumathy, Design of Low- Power High-Speed Error Tolerant Shift and Add Multiplier, J. Computer Science, Vol. 7, No. 12, pp. 1839-1845, 2011. 10. S. P. Valan Arasu and Dr. S. Baulkani, Modified Universal Shift Register Based Low Power Multiplier Architecture, J. Theoretical and Applied Information Technology, Vol. 65, No. 1, 2014. 11. Cong Liu, Jie Han and Fabrizio Lombardi, A Low-Power, High-Performance Approximate Multiplier with Configurable Partial Error Recovery, IEEE Conf. Design, Automation & Test in Europe Conference & Exhibition (DATE), pp. 1 4, 2014. 12. C. Efstathiou, H.T. Vergos and D. Nikolos, Ling Adders in Standard CMOS Technologies, Proc. IEEE Int. Conf. Electronics, Circuits and Systems (ICECS), vol. 2, pp. 485-48, 2002. 13. Dimitrakopoulos Giorgos and Dimitris Nikolos, High-Speed parallel-prefix VLSI ling adders, IEEE Trans. Computers, Vol. 54, No. 2, pp. 225-231, 2005.

Efficient Shift-Add Multiplier Design Using Parallel Prefix Adder 14. Poornima N and V S Kanchana Bhaaskaran, Area efficient hybrid parallel prefix adders, Procedia Materials Science, vol. 10, pp. 371-380, 2015. 15. R.E. Ladner and M.J. Fisher, Parallel prefix Computation, J. ACM, Vol. 27, No. 4, pp. 831-838, 1980. 16. P.M. Kogge and H.S. Stone, A parallel algorithm for the efficient solution of a general class of recurrence equations, IEEE Trans. Computers, Vol. 22, No. 8, pp. 786-793, 1973. 17. Huey Ling, High-speed binary adder, IBM J. R&D, Vol. 25, No. 3, pp. 156-166, 1981. 18. R. P. Brent and H.T. Kung, A regular layout for Parallel adders, IEEE Trans. Computers, Vol. 31, No. 3, pp. 260-264, 1982. 19. Tso-Bing Juang, Pramod Kumar Meher and Chung-Chun Kuan, Area-Efficient parallel-prefix ling adders, Proc. IEEE Asia Pacific Conf. Circuits and Systems (APCCAS), Kuala Lumpur, pp.736-39, 2010. 53