Fully Parallel 6T-2MTJ Nonvolatile TCAM with Single-Transistor-Based Self Match-Line Discharge Control

Shoun Matsunaga 1,2, Akira Katsumata 2, Masanori Natsui 1,2, Shunsuke Fukami 1,3, Tetsuo Endoh 1,2,4, Hideo Ohno 1,2, and Takahiro Hanyu 1,2
1 Center for Spintronics Integrated Systems, Tohoku University
2 Research Institute of Electrical Communication, Tohoku University
3 NEC Corporation
4 Center for Interdisciplinary Research, Tohoku University
Acknowledgment: This research is supported by the Japan Society for the Promotion of Science (JSPS) through its "Funding Program for World-Leading Innovative R&D on Science and Technology (FIRST Program)."
Symposia on VLSI Technology and Circuits, June 17, 2011.

Outline
- Background & Purpose
- 6T-2MTJ-Based NV-TCAM Cell and Test Chip Fabrication
- Design Example of Low-Power NV-TCAM Using Three-Level Segmented Match-Line Scheme
- Conclusions
Slide 1

Background
TCAM (Ternary Content-Addressable Memory)
- Fully parallel equality search
- High-speed pattern matching
- Applications: databases, virus checkers, network routers, etc.
- Demands: large capacity, low power consumption
[Figure: a search word is applied in parallel to stored words made of 0, 1, and X (don't-care) bits; matching words output "Hit". Each TCAM cell combines 2-bit volatile storage with comparison logic and suffers from leakage current.]
Problems:
- High device count per TCAM cell: large cell area
- Increasing leakage current: high standby power
Purpose: Realize a compact and low-power TCAM.
Slide 2

Merits of MTJ-Based Nonvolatile Logic-in-Memory
[Figure: hardware structure with MTJ devices in the MTJ layer (storage function) stacked on the CMOS layer (logic function).]
- Nonvolatile storage: the power supply can be cut off completely, giving low standby power.
- 3D-stacking structure: greatly reduces the area overhead of storage, making the circuit compact.
- CMOS/MTJ-hybrid logic: merges storage and logic functions, making the circuit more compact.
The nonvolatile logic-in-memory structure is suitable for a compact and low-standby-power TCAM.
Slide 3

Approaches for a Compact and Low-Power TCAM
For large capacity (small cell size): utilize nonvolatile logic-in-memory
- 3D-stacking structure
- CMOS/MTJ-hybrid logic
This yields a compact (and nonvolatile) NV-TCAM cell.
For low power (while maintaining search speed):
- Eliminate wasted cell activation (low active power)
- Power off in the standby state using nonvolatility (low standby power)
[Figure: match-line (ML) comparison. An unsegmented ML has 100% activity, a large capacitive load, and high power. A segmented ML has a small-load (fast) segment, a medium-load (medium-speed) segment, and disabled (standby) segments; activity is decreased (low power), and because the total load is the same, the speed is comparable.]
Slide 4

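Conventional TCAM Cell
[Figure: cell circuit with two memory elements, Mem. (b1) and Mem. (b2), a comparison circuit driven by the search lines S and its complement, and the match line (ML) to GND.]
- 2-bit volatile memory (SRAM cells)
- Large cell size (16 Tr. or 12 Tr.)
- High leakage
Truth table
Stored data B    (b1, b2)   Input S   Matched result ML
0                (0, 1)     0         1 (Hit)
0                (0, 1)     1         0 (Miss)
1                (1, 0)     0         0 (Miss)
1                (1, 0)     1         1 (Hit)
X (don't care)   (0, 0)     0         1 (Hit), masked
X (don't care)   (0, 0)     1         1 (Hit), masked
$\mathrm{ML} = \overline{b_1 \cdot \overline{S} + b_2 \cdot S}$
It is desirable to realize a compact and nonvolatile TCAM cell.
Slide 6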
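The truth table on this slide can be checked with a short behavioral sketch. It assumes the encoding 0 = (0, 1), 1 = (1, 0), X = (0, 0) and treats ML = 1 as Hit; the function names are illustrative and are not taken from the slides.

```python
# Behavioral sketch of one conventional TCAM cell and of a whole word.
# Assumed encoding (from the truth table): 0 -> (0, 1), 1 -> (1, 0), X -> (0, 0).
ENCODING = {"0": (0, 1), "1": (1, 0), "X": (0, 0)}


def cell_match(stored: str, s: int) -> int:
    """Return ML for one cell: 1 (Hit) or 0 (Miss)."""
    b1, b2 = ENCODING[stored]
    # A mismatch occurs when b1 = 1 and S = 0, or when b2 = 1 and S = 1.
    mismatch = (b1 & (1 - s)) | (b2 & s)
    return 1 - mismatch


def word_match(stored_word: str, search_word: str) -> bool:
    """A word is a Hit only if every cell in it is a Hit."""
    if len(stored_word) != len(search_word):
        raise ValueError("stored word and search word must have equal length")
    return all(cell_match(b, int(s)) for b, s in zip(stored_word, search_word))


if __name__ == "__main__":
    for stored in "01X":
        for s in (0, 1):
            print(stored, s, cell_match(stored, s))
```

A stored X cell hits for either input, which is what makes the word comparison ternary.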

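Proposed NV-TCAM Cell
6T-2MTJ NV-TCAM cell
[Figure: cell schematic with word lines WL1 and WL2, search line SL (S) and its complement, bit line BL and its complement, a load biased by BIAS from VDD, the nonvolatile storage (b1, b2), the comparison circuit, and the ML voltage keeper (diode switch) connected to ML. The Hit condition is labeled with the comparison expression in b1, b2, and S.]
[Figure: measured I-V characteristic, I_CELL [µA] (0 to 120) versus V_CELL [V] (0 to 1.2), with the PMOS load curve and the operating points V_CELL_H (Hit) and V_CELL_L (Miss); 0.35 V is marked between V_CELL-L and V_CELL-H.]
Slide 7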

Mechanism of ML Voltage Keeper
Minimum voltage detection
[Figure: in each 1-bit cell, the comparison circuit output V_CELL is connected to the match line (ML) through an ML voltage keeper; V_ML drives OUT.]
- V_CELL = V_CELL_H (Hit) or V_CELL_L (Miss)
- V_ML is almost the same as the minimum of the V_CELL voltages.
- If full-bit Hit, V_ML: H. If Miss, V_ML: L.
Easy to sense.
Slide 8
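The minimum-voltage-detection idea can be illustrated with an idealized model in which V_ML simply equals the lowest V_CELL on the line. The voltage levels below are hypothetical placeholders, and the model ignores the swing degradation with word length that the later slides discuss.

```python
# Idealized sketch of minimum-voltage detection on a match line.
# The two levels are hypothetical values, not measurements from the chip.
V_CELL_H = 0.75  # assumed cell voltage for a Hit [V]
V_CELL_L = 0.40  # assumed cell voltage for a Miss [V]


def ml_voltage(cell_hits):
    """Return the idealized V_ML: the minimum V_CELL over all cells."""
    if not cell_hits:
        raise ValueError("a word needs at least one cell")
    return min(V_CELL_H if hit else V_CELL_L for hit in cell_hits)


def word_is_hit(v_ml, threshold=(V_CELL_H + V_CELL_L) / 2):
    """Sense V_ML against a midpoint threshold (an assumed sensing rule)."""
    return v_ml > threshold


if __name__ == "__main__":
    full_hit = [True] * 32
    one_miss = [True] * 31 + [False]
    print(ml_voltage(full_hit), word_is_hit(ml_voltage(full_hit)))
    print(ml_voltage(one_miss), word_is_hit(ml_voltage(one_miss)))
```

A single Miss cell is enough to pull the modeled V_ML to the low level, which is the worst case the sense amplifier has to resolve.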

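Self Match-Line Discharge Control in Word Circuit
ML is precharged to V_DD.
[Figure: word circuits between ML and GND with cells in the Hit (H) or Miss (M) state. The panels are annotated V_ML > V_CELL_H for a full-bit Hit, V_ML = V_CELL_H, and V_ML approaching V_CELL_L when a Miss cell is present.]
[Figure: V_ML versus time, starting from V_DD. The Hit case shows a slight degradation toward V_CELL_H; the Miss case falls toward V_CELL_L. The difference between the two cases is the ML voltage swing.]
Slide 9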

Match-Line Voltage Swing
1-bit miss detection (worst case)
[Figure: one cell at V_CELL-L among cells at V_CELL-H along the word length, with the ML voltage keepers operating in weak inversion. The diode current I_Diode is plotted against V_ML: the current into the miss cell (exponential scale) and the reversed current from the hit cells (linear scale). V_ML-L and V_ML-H are marked between V_CELL-L and V_CELL-H; the word-length dependence is on a logarithmic scale.]
Sufficient ML voltage swing can be obtained even in a longer word circuit.
Slide 10

Fabricated NV-TCAM Test Chip
[Figure: chip micrograph showing the column decoder, row decoder, 2-kb TCAM cell array, and sense amplifiers, with dimensions of 226 µm and 174 µm marked, and a fabricated TCAM cell measuring 3.08 µm x 3.36 µm.]
Process:              90 nm 1P5M CMOS/MTJ
Cell structure:       6T-2MTJ
MTJ size:             100 nm x 200 nm
Cell size:            10.35 µm^2
Array configuration:  32 bits x 64 words
Match delay:          0.29 ns
Supply voltage:       1.2 V
Slide 11

Match-Line Voltage Swings (Simulated and Measured)
[Figure: match-line voltage swing [V] (0 to 0.5) versus word length [bit] (1 to 256, doubling per tick). Curves: MRR = 400%, MRR = 250%, MRR = 172% (typical), MRR = 172% (10% Vth, 10% MRR), MRR = 100%. A line marks the criterion for sufficient voltage swing (easy to detect).]
- Measured ML voltage swing: 0.23 V @ 32 bits
- Estimated ML voltage swing: 0.19 V @ 144 bits
- The swing degrades logarithmically with word length.
Sufficient match-line voltage swing is obtained.
Slide 12

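Chip Measurements
[Figure: measured MTJ resistance R_MTJ [kΩ] (0 to 3.5) versus I_MTJ [µA] (-400 to 400), showing R_AP = 3.0 kΩ and R_P = 1.1 kΩ.]
$\mathrm{MRR} = \frac{R_{AP} - R_P}{R_P} \times 100 = 172\%$
[Figure: measured waveforms of CLK, SL (input key), and OUT versus time; P: precharge, E: evaluate; time scale 100 ns, voltage scale 1.2 V. Input keys 1 and 0 are applied and OUT shows the Hit result.]
Basic behaviors of the fabricated MTJ device and NV-TCAM have been successfully confirmed.
Slide 13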
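As a quick consistency check on the resistance values quoted on this slide, the magnetoresistance ratio can be recomputed with the usual definition (R_AP - R_P) / R_P. The helper name is illustrative only.

```python
def mrr_percent(r_ap_kohm: float, r_p_kohm: float) -> float:
    """Magnetoresistance ratio in percent: (R_AP - R_P) / R_P * 100."""
    if r_p_kohm <= 0:
        raise ValueError("R_P must be positive")
    return (r_ap_kohm - r_p_kohm) / r_p_kohm * 100.0


if __name__ == "__main__":
    # Values quoted on the slide: R_AP = 3.0 kOhm, R_P = 1.1 kOhm.
    print(f"MRR = {mrr_percent(3.0, 1.1):.1f}%")
```

With the rounded resistances the result falls between 172% and 173%, in line with the 172% stated on the slide.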

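Measured Waveforms of Instant-ON/OFF
[Figure: measured waveforms of V_DD, CLK, SL (input key), and OUT versus time; voltage scale 1.2 V, time scale 1 µs. V_DD is switched off repeatedly (Power OFF) between searches. For input key 1 and for input key 0, the outputs are the same before and after the power-off periods.]
Instant ON/OFF of the fabricated chip has been successfully confirmed.
Slide 14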

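Approach for Low-Power NV-TCAM
Three-Level Segmented Match-Line Scheme
[Figure: each word's match line is divided into a 1st, 2nd, and 3rd segment. A word whose 1st segment contains a Miss cell has its later segments disabled; a word whose segments all Hit is evaluated through to the last segment.]
If the current segment is a Miss, then the next segment is disabled.
Decreased activity rate: low power.
Slide 16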
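The gating rule of the segmented match line (a segment is evaluated only if every earlier segment of the same word was a Hit) can be sketched behaviorally as follows. This is a software illustration only; the function name and the string encoding with "X" as don't-care are assumptions, not the circuit itself.

```python
def segmented_search(stored_word, search_word, seg_lengths):
    """Evaluate one word segment by segment.

    Returns (hit, active_cells), where active_cells counts the cells in
    segments that were actually evaluated before the search stopped.
    """
    if len(stored_word) != len(search_word):
        raise ValueError("stored word and search word must have equal length")
    if sum(seg_lengths) != len(stored_word):
        raise ValueError("segment lengths must add up to the word length")

    active_cells = 0
    pos = 0
    for length in seg_lengths:
        active_cells += length  # this segment is enabled and consumes energy
        stored_seg = stored_word[pos:pos + length]
        search_seg = search_word[pos:pos + length]
        # A cell misses when it stores a definite bit that differs from the key.
        if any(b != "X" and b != s for b, s in zip(stored_seg, search_seg)):
            return False, active_cells  # later segments stay disabled
        pos += length
    return True, active_cells


if __name__ == "__main__":
    print(segmented_search("10X1", "0011", (1, 1, 2)))
    print(segmented_search("10X1", "1011", (1, 1, 2)))
```

Short early segments reject most non-matching words after only a few cells have been activated, which is where the power saving comes from.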

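Image of Cell Array Activity
m-word x n-bit cell array, with each word divided into segments of i bits, j bits, and n - i - j bits
Hit probability of a cell (p):
- 2/3 (using masked bits)
- 1/2 (not using masked bits)
1st segment (i bits): active in all m words; p^i m words Hit and (1 - p^i) m words Miss.
2nd segment (j bits): enabled (active) in the words that Hit in the 1st segment, disabled (inactive) in the others.
3rd segment (n - i - j bits): most words are disabled (inactive); only words that Hit in both earlier segments are enabled (active).
Ultra-low activity.
Slide 17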
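One way to turn this picture into a number is to count the expected active cells per segment. The derivation below is a sketch that assumes cell hits are independent with probability p, which the slide does not state explicitly; the symbols N_active and A are introduced here for illustration. With n = 144, i = 3, j = 7, and p = 1/2, it gives about 2.8%, the activity figure quoted on the following slides.

```latex
% Assumption: each cell hits independently with probability p.
% N_active and A(i, j) are illustrative symbols, not from the slides.
\begin{align}
N_{\text{active}} &= m\,i + p^{i} m\,j + p^{\,i+j} m\,(n - i - j) \\
A(i, j) &= \frac{N_{\text{active}}}{m\,n}
         = \frac{i + p^{i} j + p^{\,i+j}\,(n - i - j)}{n} \\
A(3, 7)\Big|_{n = 144,\; p = 1/2}
        &= \frac{3 + \tfrac{7}{8} + \tfrac{134}{1024}}{144} \approx 0.0278
\end{align}
```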

Cell Activity Rate
[Figure: 3D plot of the activity rate of cells [%] (0 to 10) versus the word length of the 1st segment [bit] (1 to 8) and the word length of the 2nd segment [bit] (2 to 9).]
Minimum activity rate: 2.8% @ 1st 3-bit / 2nd 7-bit segmentation
Slide 18
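The location of the minimum can be reproduced with a small grid search over the two segment lengths. This is a sketch under stated assumptions: independent cell hits with probability p = 1/2, a 144-bit word, and an expected-activity model in which a segment is active only if all earlier segments hit. The function names and search ranges are illustrative.

```python
def activity_rate(i: int, j: int, n: int = 144, p: float = 0.5) -> float:
    """Expected fraction of active cells for segment lengths i, j, n - i - j.

    Assumes each cell hits independently with probability p.
    """
    if i < 1 or j < 1 or i + j >= n:
        raise ValueError("segment lengths must be positive and leave a 3rd segment")
    first = i                           # 1st segment is always active
    second = (p ** i) * j               # active only if the 1st segment hits
    third = (p ** (i + j)) * (n - i - j)  # active only if both earlier segments hit
    return (first + second + third) / n


def best_segmentation(n: int = 144, p: float = 0.5, max_i: int = 8, max_j: int = 9):
    """Return ((i, j), rate) with the lowest expected activity on the grid."""
    candidates = (
        ((i, j), activity_rate(i, j, n, p))
        for i in range(1, max_i + 1)
        for j in range(1, max_j + 1)
    )
    return min(candidates, key=lambda item: item[1])


if __name__ == "__main__":
    (i, j), rate = best_segmentation()
    print(f"1st segment: {i} bits, 2nd segment: {j} bits, activity: {rate * 100:.1f}%")
```

Under these assumptions the search lands on a 3-bit first segment and a 7-bit second segment at roughly 2.8% activity, matching the slide; a different hit probability would shift the optimum.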

Performance Evaluations
Array structure:                144-bit x 256-word
Cell activity [%]:              2.8
Search energy [fJ/bit/search]:  1.04 (comparable to CMOS-based TCAM, within a few fJ/bit/search)
Standby power [W]:
- Sleep mode: 0 (@ power-OFF)
- Search mode: negligible (@ 2.8% activity)
HSPICE simulation under a 90 nm CMOS technology @ 1.2 V
A low-standby-power nonvolatile TCAM is successfully realized with search energy comparable to that of a CMOS-based one.
Slide 19

Conclusions
We have proposed and demonstrated a 6T-2MTJ-based fully parallel NV-TCAM.
Cell circuit techniques:
- Fewest transistor count with nonvolatility
- Bit-parallel equality-search capability in a long word, based on a 1-transistor ML voltage keeper array
Word circuit techniques:
- Eliminate wasted cell activation based on the three-level segmented match-line scheme
Negligible standby power is achieved with search energy comparable to that of a CMOS-based TCAM.
Slide 21