Low Power 3-2 and 4-2 Adder Compressors Implemented Using ASTRAN

Similar documents
Design and Implementation of Complex Multiplier Using Compressors

Design of an Energy Efficient 4-2 Compressor

International Journal for Research in Applied Science & Engineering Technology (IJRASET) Design A Power Efficient Compressor Using Adders Abstract

II. Previous Work. III. New 8T Adder Design

Implementation of Carry Select Adder using CMOS Full Adder

High Performance Low-Power Signed Multiplier

A New network multiplier using modified high order encoder and optimized hybrid adder in CMOS technology

AN EFFICIENT APPROACH TO MINIMIZE POWER AND AREA IN CARRY SELECT ADDER USING BINARY TO EXCESS ONE CONVERTER

Abstract. 2. MUX Vs XOR-XNOR. 1. Introduction.

High Speed Vedic Multiplier Designs Using Novel Carry Select Adder

An Optimized Wallace Tree Multiplier using Parallel Prefix Han-Carlson Adder for DSP Processors

Implementation of High Performance Carry Save Adder Using Domino Logic

International Journal of Advance Engineering and Research Development

Design of Two High Performance 1-Bit CMOS Full Adder Cells

Investigation on Performance of high speed CMOS Full adder Circuits

CHAPTER 4 ANALYSIS OF LOW POWER, AREA EFFICIENT AND HIGH SPEED MULTIPLIER TOPOLOGIES

DESIGN AND IMPLEMENTATION OF AREA EFFICIENT, LOW-POWER AND HIGH SPEED 128-BIT REGULAR SQUARE ROOT CARRY SELECT ADDER

Implementation of Efficient 5:3 & 7:3 Compressors for High Speed and Low-Power Operations

PERFORMANCE ANALYSIS OF LOW POWER FULL ADDER CELLS USING 45NM CMOS TECHNOLOGY

[Krishna, 2(9): September, 2013] ISSN: Impact Factor: INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY

SIGNED PIPELINED MULTIPLIER USING HIGH SPEED COMPRESSORS

High Speed Binary Counters Based on Wallace Tree Multiplier in VHDL

Electronic Design Automation at Transistor Level by Ricardo Reis. Preamble

Two New Low Power High Performance Full Adders with Minimum Gates

Two New Low Power High Performance Full Adders with Minimum Gates

Multiplier and Accumulator Using Csla

Design of New Full Swing Low-Power and High- Performance Full Adder for Low-Voltage Designs

DESIGN OF LOW POWER MULTIPLIER USING COMPOUND CONSTANT DELAY LOGIC STYLE

Design of 8-4 and 9-4 Compressors Forhigh Speed Multiplication

Low power high speed hybrid CMOS Full Adder By using sub-micron technology

ENHANCING SPEED AND REDUCING POWER OF SHIFT AND ADD MULTIPLIER

Performance Analysis Comparison of 4-2 Compressors in 180nm CMOS Technology

Design and Analysis of Row Bypass Multiplier using various logic Full Adders

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP ( 1

2 Assoc Prof, Dept of ECE, George Institute of Engineering & Technology, Markapur, AP, India,

A New Architecture for Signed Radix-2 m Pure Array Multipliers

A NOVEL IMPLEMENTATION OF HIGH SPEED MULTIPLIER USING BRENT KUNG CARRY SELECT ADDER K. Golda Hepzibha 1 and Subha 2

A Low Power Array Multiplier Design using Modified Gate Diffusion Input (GDI)

Design and Implementation of High Radix Booth Multiplier using Koggestone Adder and Carry Select Adder

LOW POWER NOVEL HYBRID ADDERS FOR DATAPATH CIRCUITS IN DSP PROCESSOR

JDT LOW POWER FIR FILTER ARCHITECTURE USING ACCUMULATOR BASED RADIX-2 MULTIPLIER

Design of an optimized multiplier based on approximation logic

Implementation of 1-bit Full Adder using Gate Difuision Input (GDI) cell

Implementation and Performance Analysis of a Vedic Multiplier Using Tanner EDA Tool

DESIGN AND ANALYSIS OF LOW POWER 10- TRANSISTOR FULL ADDERS USING NOVEL X-NOR GATES

An Optimized Implementation of CSLA and CLLA for 32-bit Unsigned Multiplier Using Verilog

Impact of Logic and Circuit Implementation on Full Adder Performance in 50-NM Technologies

DESIGN OF EXTENDED 4-BIT FULL ADDER CIRCUIT USING HYBRID-CMOS LOGIC

A Novel Hybrid Full Adder using 13 Transistors

Efficient FIR Filter Design Using Modified Carry Select Adder & Wallace Tree Multiplier

Design and Implementation of combinational circuits in different low power logic styles

An energy efficient full adder cell for low voltage

High Speed NP-CMOS and Multi-Output Dynamic Full Adder Cells

A Review on Low Power Compressors for High Speed Arithmetic Circuits

Ciência e Natura ISSN: Universidade Federal de Santa Maria Brasil

Power Efficient adder Cell For Low Power Bio MedicalDevices

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Comparison of Multiplier Design with Various Full Adders

A Survey on A High Performance Approximate Adder And Two High Performance Approximate Multipliers

A Low-Power High-speed Pipelined Accumulator Design Using CMOS Logic for DSP Applications

Performance Analysis of High Speed CMOS Full Adder Circuits For Embedded System

A SUBSTRATE BIASED FULL ADDER CIRCUIT

A High Speed Wallace Tree Multiplier Using Modified Booth Algorithm for Fast Arithmetic Circuits

DESIGN AND ANALYSIS OF VEDIC MULTIPLIER USING MICROWIND

A Novel High-Speed, Higher-Order 128 bit Adders for Digital Signal Processing Applications Using Advanced EDA Tools

Design of 32-bit Carry Select Adder with Reduced Area

By Dayadi Lakshmaiah, Dr. M. V. Subramanyam & Dr. K. Satya Prasad Jawaharlal Nehru Technological University, India

Design of Delay-Power Efficient Carry Select Adder using 3-T XOR Gate

Full Adder Circuits using Static Cmos Logic Style: A Review

Design and Implementation of High Speed Carry Select Adder Korrapatti Mohammed Ghouse 1 K.Bala. 2

A new 6-T multiplexer based full-adder for low power and leakage current optimization

Tiago Reimann Cliff Sze Ricardo Reis. Gate Sizing and Threshold Voltage Assignment for High Performance Microprocessor Designs

A NOVEL WALLACE TREE MULTIPLIER FOR USING FAST ADDERS

CHAPTER 3 NEW SLEEPY- PASS GATE

A New Configurable Full Adder For Low Power Applications

Design and Analysis of Low-Power 11- Transistor Full Adder

Design & Analysis of Low Power Full Adder

Comparative Analysis of Multiplier in Quaternary logic

Design and Performance Analysis of High Speed Low Power 1 bit Full Adder

ASIC Implementation of High Speed Area Efficient Arithmetic Unit using GDI based Vedic Multiplier

An Design of Radix-4 Modified Booth Encoded Multiplier and Optimised Carry Select Adder Design for Efficient Area and Delay

Wave Pipelined Circuit with Self Tuning for Clock Skew and Clock Period Using BIST Approach

Domino CMOS Implementation of Power Optimized and High Performance CLA adder

Performance Analysis of a 64-bit signed Multiplier with a Carry Select Adder Using VHDL

LOW POWER & LOW VOLTAGE APPROXIMATION ADDERS IMPLEMENTATION FOR DIGITAL SIGNAL PROCESSING Raja Shekhar P* 1, G. Anad Babu 2

Ultra Low Power MUX Based Compressors for Wallace and Dadda Multipliers in Sub-threshold Regime

Efficient Dedicated Multiplication Blocks for 2 s Complement Radix-2m Array Multipliers

A NOVEL 4-Bit ARITHMETIC LOGIC UNIT DESIGN FOR POWER AND AREA OPTIMIZATION

Comparative Analysis of Various Adders using VHDL

Design of Two New High-Performance Full Adders in Sub-threshold Region for Ultra-Low Power Applications

PUBLICATIONS OF PROBLEMS & APPLICATION IN ENGINEERING RESEARCH - PAPER CSEA2012 ISSN: ; e-issn:

Novel Low-Overhead Operand Isolation Techniques for Low-Power Datapath Synthesis

ISSN Vol.03,Issue.02, February-2014, Pages:

HIGH PERFORMANCE BAUGH WOOLEY MULTIPLIER USING CARRY SKIP ADDER STRUCTURE

A Compact Design of 8X8 Bit Vedic Multiplier Using Reversible Logic Based Compressor

National Conference on Emerging Trends in Information, Digital & Embedded Systems(NC e-tides-2016)

Low-Power Multipliers with Data Wordlength Reduction

A Low-Power 12 Transistor Full Adder Design using 3 Transistor XOR Gates

Low Power Design for Systems on a Chip. Tutorial Outline

Analysis of Different Full Adder Designs with Power using CMOS 130nm Technology

Transcription:

XXVII SIM - South Symposium on Microelectronics 1 Low Power 3-2 and 4-2 Adder Compressors Implemented Using ASTRAN Jorge Tonfat, Ricardo Reis jorgetonfat@ieee.org, reis@inf.ufrgs.br Grupo de Microeletrônica (GME) PGMICR UFRGS, Porto Alegre, RS, Brasil Abstract This paper presents two adder compressors architectures addressing high-speed and low power. Adder compressors are used to implement arithmetic circuits such as multipliers and digital signal processing units like the Fast Fourier Transform (FTT). To address the objective of high-speed and low power, it is well known that optimization efforts should be applied in all abstraction levels. In this paper are combined optimizations at logic, electrical and physical level. At the logic level, the circuit is optimized by using multiplexers instead of gates to reduce delay, power and area. At the electrical level, this work presents an architecture that generate the and XNR signals simultaneously, this reduce internal glitches hence dynamic power as well. And finally at the physical level, and automatic layout generation tool (ASTRAN) is used to make the adder compressors layouts. This tool has proved to reduce power consumption and delay due to the smaller input capacitances of the complex gates generated compared to manual-designed layouts. 1. Introduction Nowadays, it is advised an enormous development of mobile electronic gadgets ready for multimedia environments such as smartphones, portable video games and tablets. The main features of these devices are low power consumption and high performance. These two characteristics are contradictory. Hence we must find the best trade-off between power and delay to design such devices. Process technology scaling seems to complicate this situation. Thus design solutions are the main mechanism to keep the power consumption budget in control. The design optimization effort should be applied in all abstraction levels to obtain better results [1]. Traditional design flows as the standard cell approach limits the optimization effort because cell's library limits the number of logical functions as well as the limited sizing versions of these functions. In this context, the library-free approach appears as a possible solution [2]. The ASTRAN [3] (Automatic Synthesis of Transistor Networks) tool is used in this work to generate layouts of CMS cells from its transistor level netlist description. Results from [4] show the effectiveness in terms of power and delay reduction compared to manual-designed layouts from a standard cell library. This paper is organized as follows: In Section 2 is discussed previous works on adder compressors and is presented the architectures analyzed in this work. Subsequently in section 3 is presented the ASTRAN tool used in this work to automatic generate the layouts of the circuits. Simulation results for both compressors are shown in section 4 and finally the conclusions are presented in section 5. 2. Adder Compressor Architectures Adder compressors have been used to implement arithmetic and digital signal processing (DSP) circuits for low power and high performance applications [5]. Compressors are also used in multiplier architectures. Multipliers are structured into three functions: partial-product generation, partial-product accumulation and final addition. The main source of power, delay and area came from the partial-product accumulation stage [6]. Compressors usually implement this stage because they contribute to the reduction of the partial products (reducing the number of adders at the final stage) and also contribute to reduce the critical path which is important to maintain the circuit's performance. Fig. 1 -XNR module. (b)-xnr implemented with CMS logic. Compressors are composed basically of two types of modules: -XNR complex gates and multiplexers (MUX). A CMS logic implementation of both modules is shown in Figure 1(b) and Figure 2(b).

2 XXVII SIM - South Symposium on Microelectronics A design space exploration for 4-2 and 5-2 compressors is presented in [6]. It shows the different possibilities to implement adder compressors based on different logic styles but with the same architecture. The results show some combinations of logic styles for the -XNR and MUX modules that achieve better results in terms of delay and power. For the -XNR module, the circuit from Figure 3 presents better features than others. And for the MUX module, a MUX based on transmission gates with an output buffer (Figure 3(b)) presents best results. A B S MUX Fig. 2 Multiplexer (MUX) with two outputs. (b) Multiplexer (MUX) implemented with CMS logic. A new architecture is presented in [7]. In this architecture the emphasis lay on the use of multiplexers instead of gates. This is because the use of multiplexers improves the speed when placed in the critical path [8]. For the -XNR module, this work uses the traditional CMS logic style and for the MUX module they use a combination of a traditional CMS logic style with a transmission gate logic style that is only used in the internal paths of the adder due to the limited driving capability. A S B (b) Fig. 3 -XNR module proposed in [6] (b) Multiplexer (MUX) implemented with transmission gates. Based on previous works presented in [6] and [7], 3-2 and 4-2 Compressors are implemented in this work using ASTRAN. 2.1. 3-2 Compressor In Figure 4 is presented the 3-2 adder compressor. As shown in the truth table, the operation is the same of the full adder. It takes 3 inputs A, B, C to generate 2 outputs, the sum and the carry bits.

XXVII SIM - South Symposium on Microelectronics 3 A B C Sum Carry 0 0 0 0 0 0 0 1 1 0 0 1 0 1 0 0 1 1 0 1 1 0 0 1 0 1 0 1 0 1 1 1 0 0 1 1 1 1 1 1 Fig. 4 3-2 adder compressor. (b) Truth table for the 3-2 adder compressor. This compressor is governed by the following equation: + + = + 2 (1) In Figure 5 are shown the two architectures implemented in this work. The traditional architecture is shown in Figure 5 and is implemented using the CMS logic style of the -XNR and MUX modules. This architecture is governed by the following equations: = (2) = ( ) + ( ) (3) Fig. 5 Traditional architecture of 3-2 Compressors. (b) Enhanced architecture for a 3-2 Compressor. In Figure 5(b) is shown the enhanced version of the 3-2 compressor architecture. This architecture uses the -XNR module presented in 3 and the transmission gate version of the MUX module presented in Figure 3(b). This architecture is governed by the following equations: = ( ) + ( ) (4) = ( ) + ( ) (5) 2.2. 4-2 Compressor The 4-2 Compressor has 5 inputs A, B, C, D and Cin to generate 3 outputs Sum, Carry and Cout as shown in Figure 6. The 4 inputs A, B, C and D and the output Sum have the same weight. The input Cin is the output from a previous lower significant compressor and the Cout output is for the compressor in the next significant stage. The conventional approach to implement 4-2 compressors is with 2 full adders connected serially as shown in Figure 6(b).

4 XXVII SIM - South Symposium on Microelectronics A B C D Full Adder Cin Cout Full Adder (b) Sum Carry Fig. 6 4-2 adder compressor. (b) 4-2 adder compressor implemented with full adders. The 4-2 compressor is governed by the following equation: + + + + = + 2 ( + ) (6) In Figure 7 are shown the two architectures implemented in this work. The traditional architecture is shown in Figure 7 and uses the CMS logic style of the -XNR and MUX modules similarly to the 3-2 compressor. This architecture is governed by the following equations: = (7) = ( ) + ( ) (8) = ( ) + ( ) (9) A B C D MUX Cin Cout MUX Sum Carry Fig. 7 Traditional architecture of 4-2 Compressors. (b) Enhanced architecture with the -XNR and MUX modules of a 4-2 Compressor. In Figure 7(b) is shown the enhanced version of the 4-2 compressor architecture. This architecture uses the -XNR module presented in Figure 3 and the transmission gate version of the MUX module presented in Figure 3(b). This architecture is governed by the following equations: = ( ) ( ) + ( ) ( ) + ( ) ( ) + ( ) ( ) (10) = ( ) + ( ) (11) = ( ) + ( ) (12) 3. Automatic Layout Generation using ASTRAN ASTRAN [3] is a cell synthesis tool for automatic layout generation of CMS cells from its transistor level netlist description, allowing different transistor sizes and no restrictions on the transistor network organization. The tool receives as input a file in SPICE format with the netlist of the cells (with their respective and individually sized transistors), a configuration file (which defines the layout topology and control parameters to the generator), and a technology file (which contains a description of the design rules). For a given transistor network, the objective is to place and route the transistors using the proposed layout style in such a way that the

XXVII SIM - South Symposium on Microelectronics 5 cell width is minimized. At the end, the circuit is compacted to produce a layout in CIF, GDSII and LEF formats. ASTRAN currently only supports designs in technology node equal or above 350nm, so in this work the AMS 350nm CMS technology is used. In Figures 8 and 8(b) are shown the layouts generated. These layouts correspond to the enhanced architectures of the adder compressors. (b) Fig. 8 3-2 compressor layout. (b) 4-2 compressor layout. 4. Simulation Results 4.1. Simulation Environment Simulations are performed by the HSPICE simulator. The simulation environments are shown in Figure 9 Each input is driven by buffers and each output is loaded by buffers to provide a realistic environment. For the 4-2 compressor environment, two compressors are placed in parallel because the critical path may cross adjacent compressors. The left most compressor is analyzed because it is most probable to have the longest delay. For both compressors, two configurations are proposed. The first is Architecture 1 and is composed by the CMS logic style of the -XNR and MUX modules and the traditional architecture for each compressor. The Architecture 2 is composed by the modified -XNR module in Figure 3 and the transmission gate version of MUX with the enhanced architectures for each compressor. Fig. 9 3-2 Compressor simulation environment. (b) 4-2 Compressor simulation environment. 4.2. Simulation Results for 3-2 Compressor In Table 1 are shown the results for the 3-2 compressor. As expected, the changes in the logic style and architecture reduce the power while the delay is slightly affected. The power-delay product (PDP) at different supply voltages indicates that architecture 2 is more power efficient. Tab.1 - Delay, Power and Power-delay product (PDP) comparison for the 3-2 compressor. Delay (ns) 1.2 V 1.8 V 2.5 V 3.3 V Architecture 1 6.80 2.62 1.61 1.21 Architecture 2 7.59 2.45 1.43 1.05 Power (uw) 1.2 V 1.8 V 2.5 V 3.3 V Architecture 1 10 23.3 45.75 81.61 Architecture 2 8.15 19.45 38.42 71.35 PDP (fj) 1.2 V 1.8 V 2.5 V 3.3 V Architecture 1 68 61.05 73.66 98.75 Architecture 2 61.86 47.65 54.94 74.92 4.3. Simulation Results for 3-2 Compressor In Table 2 are shown the results for the 4-2 compressor. Analyzing the power results, it seems that architecture 1 is better in terms of power consumption, but the PDP shows that when the delay is considered, architecture 2 offers better results.

6 XXVII SIM - South Symposium on Microelectronics Tab.2 - Delay, Power and Power-delay product (PDP) comparison for the 4-2 compressor. Delay (ns) 1.2 V 1.8 V 2.5 V 3.3 V Architecture 1 10.01 3.85 2.36 1.76 Architecture 2 9.97 3.26 1.88 1.37 Power (uw) 1.2 V 1.8 V 2.5 V 3.3 V Architecture 1 15.78 38.68 76.23 136.1 Architecture 2 16.7 39.9 79.71 144.3 PDP (fj) 1.2 V 1.8 V 2.5 V 3.3 V Architecture 1 157.96 148.92 179.90 239.54 Architecture 2 166.50 130.07 149.85 197.69 Both circuits show that the better trade-off in terms of power and delay appears when the supply voltage is lower than the nominal (3.3V), around 1.8 V. 5. Conclusions and Future Work The architectures of 3-2 and 4-2 compressors are analyzed using 2 logic styles for the -XNR and MUX modules. The layouts for these compressors are automatic generated with the ASTRAN tool. According to previous works [4], the layouts generated by ASTRAN contribute to reduce the power consumption of the circuits. ptimization techniques for low power should be applied in all abstraction levels, and this work the logic, electrical and physical levels are explored. As future works we are already working on applying these optimizations on a 65 nm process, and analyze the circuit sensitivity to PVT (Process, Voltage and Temperature) variations. 6. Acknowledgment The authors would like to thank CNPq and CAPES agencies for their financial support. 7. References [1] J. M. Rabaey, Low Power Design Essentials. New York: Springer, 2009. [2] R. Reis, Design Automation of Transistor Networks, a New Challenge, in Circuits and Systems (ISCAS), 2011 IEEE International Symposium on, may 2011, pp. 2485 2488. [3] A. Ziesemer, C. Lazzari, R. Reis Transistor Level Automatic Layout Generator for noncomplementary CMS Cells, in Very Large Scale Integration, 2007. VLSI - SoC 2007. IFIP International Conference on, oct. 2007, pp. 116 121. [4] G. Posser, et.al., A Study on Layout Quality of Automatic Generated Cells, in Electronics, Circuits, and Systems (ICECS), 2010 17th IEEE International Conference on, dec. 2010, pp. 651 654. [5] M. Fonseca, et. al., Design of Pipelined Butterflies from Radix-2 FFT with Decimation in Time Algorithm using Efficient Adder Compressors, in Circuits and Systems (LASCAS), 2011 IEEE Second Latin American Symposium on, feb. 2011, pp. 1 4. [6] C.-H. Chang, et. al., Ultra low-voltage Low-Power CMS 4-2 and 5-2 Compressors for Fast Arithmetic Circuits, Circuits and Systems I: Regular Papers, IEEE Transactions on, vol. 51, no. 10, pp. 1985 1997, oct. 2004. [7] S. Veeramachaneni, et. al., Novel Architectures for High-Speed and Low-Power 3-2, 4-2 and 5-2 Compressors, in VLSI Design, 2007. Held jointly with 6th International Conference on Embedded Systems., 20th International Conference on, jan. 2007, pp. 324 329. [8] R. Zimmermann, W. Fichtner, Low-Power Logic Styles: CMS Versus Pass-Transistor Logic, Solid- State Circuits, IEEE Journal of, vol. 32, no. 7, pp. 1079 1090, jul 1997.