Scan Side Channel Analysis: a New Way for Non-Invasive Reverse Engineering of a VLSI Device

Similar documents
Data Compression via Logic Synthesis

EC 1354-Principles of VLSI Design

CS 61C: Great Ideas in Computer Architecture Finite State Machines, Functional Units

Design For Test. VLSI Design I. Design for Test. page 1. What can we do to increase testability?

Design for Testability & Design for Debug

Contemplation of Synchronous Gray Code Counter and its Variants using Reversible Logic Gates

Recursive Pseudo-Exhaustive Two-Pattern Generator PRIYANSHU PANDEY 1, VINOD KAPSE 2 1 M.TECH IV SEM, HOD 2

A BIST Circuit for Fault Detection Using Recursive Pseudo- Exhaustive Two Pattern Generator

VLSI Design Verification and Test Delay Faults II CMPE 646

PROCESS-VOLTAGE-TEMPERATURE (PVT) VARIATIONS AND STATIC TIMING ANALYSIS

EECS150 - Digital Design Lecture 28 Course Wrap Up. Recap 1

COMBINATIONAL and SEQUENTIAL LOGIC CIRCUITS Hardware implementation and software design

Digital Signal Processing

Digital Integrated CircuitDesign

2010 HSC Software Design and Development Marking Guidelines

First Name: Last Name: Lab Cover Page. Teaching Assistant to whom you are submitting

Sno Projects List IEEE. High - Throughput Finite Field Multipliers Using Redundant Basis For FPGA And ASIC Implementations

PROPOSED SCHEME OF COURSE WORK

CHAPTER 1 INTRODUCTION

Overview ECE 553: TESTING AND TESTABLE DESIGN OF DIGITAL SYSTES. Motivation. Modeling Levels. Hierarchical Model: A Full-Adder 9/6/2002

An Design of Radix-4 Modified Booth Encoded Multiplier and Optimised Carry Select Adder Design for Efficient Area and Delay

Keywords: Adaptive filtering, LMS algorithm, Noise cancellation, VHDL Design, Signal to noise ratio (SNR), Convergence Speed.

A Novel Low-Power Scan Design Technique Using Supply Gating

THE INTERNATIONAL JOURNAL OF SCIENCE & TECHNOLEDGE

ECOM 4311 Digital System Design using VHDL. Chapter 9 Sequential Circuit Design: Practice

Performance Enhancement of the RSA Algorithm by Optimize Partial Product of Booth Multiplier

Path Delay Test Compaction with Process Variation Tolerance

SPIRO SOLUTIONS PVT LTD

Run-Length Based Huffman Coding

Single Event Transient Effects on Microsemi ProASIC Flash-based FPGAs: analysis and possible solutions

Interconnect-Power Dissipation in a Microprocessor

Policy Perspective: The Current and Proposed Security Framework

Testing Digital Systems II

VLSI System Testing. Outline

METHODS FOR TRUE ENERGY- PERFORMANCE OPTIMIZATION. Naga Harika Chinta

Testing Digital Systems I

Chapter 3 LEAST SIGNIFICANT BIT STEGANOGRAPHY TECHNIQUE FOR HIDING COMPRESSED ENCRYPTED DATA USING VARIOUS FILE FORMATS

Computer Arithmetic (2)

Design and Analysis of Improved Sparse Channel Adder with Optimization of Energy Delay

UNIVERSITY OF MASSACHUSETTS Dept. of Electrical & Computer Engineering. Digital Computer Arithmetic ECE 666

Novel Low-Overhead Operand Isolation Techniques for Low-Power Datapath Synthesis

Design of Modified Shannon Based Full Adder Cell Using PTL Logic for Low Power Applications

Lecture 16: Design for Testability. MAH, AEN EE271 Lecture 16 1

FAULT SIMULATION AND TEST GENERATION FOR SMALL DELAY FAULTS. A Dissertation WANGQI QIU

DIGITAL ELECTRONICS QUESTION BANK

Research Statement. Sorin Cotofana

Review of Booth Algorithm for Design of Multiplier

A New network multiplier using modified high order encoder and optimized hybrid adder in CMOS technology

Fast Statistical Timing Analysis By Probabilistic Event Propagation

Lecture 20 November 13, 2014

Chapter 3 Digital Logic Structures

Super-Resolution and Reconstruction of Sparse Sub-Wavelength Images

Testing Digital Systems II. Problem: Fault Diagnosis

Adder (electronics) - Wikipedia, the free encyclopedia

Dr. Nicola Nicolici COE/EE2DI4 Midterm Test #1 Oct 18, 2006

EE 434 ASIC & Digital Systems

Challenges of in-circuit functional timing testing of System-on-a-Chip

Design of High Speed Power Efficient Combinational and Sequential Circuits Using Reversible Logic

CHAPTER 5 DESIGN OF COMBINATIONAL LOGIC CIRCUITS IN QCA

AREA OPTIMIZED ARITHMETIC AND LOGIC UNIT USING LOW POWER 1-BIT FULL ADDER

EECS 427 Lecture 21: Design for Test (DFT) Reminders

Gates and Circuits 1

Design of a Power Optimal Reversible FIR Filter ASIC Speech Signal Processing

AN EFFICIENT DESIGN OF ROBA MULTIPLIERS 1 BADDI. MOUNIKA, 2 V. RAMA RAO M.Tech, Assistant professor

SYNCHRONOUS stream ciphers are lightweight

CS302 Digital Logic Design Solved Objective Midterm Papers For Preparation of Midterm Exam

FULL ADDER USING MULTIPLEXER

Introduction to CMOS VLSI Design (E158) Lecture 5: Logic

Mixed Synchronous/Asynchronous State Memory for Low Power FSM Design

Low-Power Multipliers with Data Wordlength Reduction

Proc. IEEE Intern. Conf. on Application Specific Array Processors, (Eds. Capello et. al.), IEEE Computer Society Press, 1995, 76-84

TECHNOLOGY scaling, aided by innovative circuit techniques,

A Low Power Array Multiplier Design using Modified Gate Diffusion Input (GDI)

CS302 - Digital Logic Design Glossary By

UNIT-II LOW POWER VLSI DESIGN APPROACHES

A Scan Shifting Method based on Clock Gating of Multiple Groups for Low Power Scan Testing

Exam #2 EE 209: Fall 2017

Globally Asynchronous Locally Synchronous (GALS) Microprogrammed Parallel FIR Filter

Multiple Transient Faults in Combinational and Sequential Circuits: A Systematic Approach

Bricken Technologies Corporation Presentations: Bricken Technologies Corporation Corporate: Bricken Technologies Corporation Marketing:

Reversible data hiding based on histogram modification using S-type and Hilbert curve scanning

Chapter 4 MASK Encryption: Results with Image Analysis

Implementation of Parallel Multiplier-Accumulator using Radix- 2 Modified Booth Algorithm and SPST

Outline. Communications Engineering 1

A Novel High-Speed, Higher-Order 128 bit Adders for Digital Signal Processing Applications Using Advanced EDA Tools

Design and Characterization of 16 Bit Multiplier Accumulator Based on Radix-2 Modified Booth Algorithm

Design of Parallel Prefix Tree Based High Speed Scalable CMOS Comparator for converters

A High Performance Variable Body Biasing Design with Low Power Clocking System Using MTCMOS

Design of Area-Delay-Power Efficient Carry Select Adder Using Cadence Tool

Video Enhancement Algorithms on System on Chip

Required Course Numbers. Test Content Categories. Computer Science 8 12 Curriculum Crosswalk Page 2 of 14

Types of Control. Programmed Non-programmed. Program Counter Hardwired

A Low-Power High-speed Pipelined Accumulator Design Using CMOS Logic for DSP Applications

DIGITAL INTEGRATED CIRCUITS A DESIGN PERSPECTIVE 2 N D E D I T I O N

Objective Questions. (a) Light (b) Temperature (c) Sound (d) all of these

Testability Synthesis for Jumping Carry Adders

Formal Hardware Verification: Theory Meets Practice


Lecture 1: Digital Systems and VLSI

The Design of a Low Power Asynchronous Multiplier

Transcription:

Scan Side Channel Analysis: a New Way for Non-Invasive Reverse Engineering of a VLSI Device Leonid Azriel Technion Israel Institute of Technology May 6, 2015 May 6, 2015 1

Side Channel Attacks Side Channel Attack: Exploits Weaknesses in Algorithm Implementation May 6, 2015 2

Side Channel Attacks Side Channel Attack: Exploits Weaknesses in Algorithm Implementation Simple/Differential Power Analysis (SPA/DPA) key May 6, 2015 3

Side Channel Attacks Side Channel Attack: Exploits Weaknesses in Algorithm Implementation Simple/Differential Power Analysis (SPA/DPA) Timing key May 6, 2015 4

Side Channel Attacks Side Channel Attack: Exploits Weaknesses in Algorithm Implementation Simple/Differential Power Analysis (SPA/DPA) Timing EM Radiation key May 6, 2015 5

Side Channel Attacks Side Channel Attack: Exploits Weaknesses in Algorithm Implementation Simple/Differential Power Analysis (SPA/DPA) Timing EM Radiation Temperature key May 6, 2015 6

Side Channel Attacks Side Channel Attack: Exploits Weaknesses in Algorithm Implementation Simple/Differential Power Analysis (SPA/DPA) Timing EM Radiation Temperature Test Circuits (Scan) key May 6, 2015 7

The Scan Technique May 6, 2015 8

The Scan Technique Sequential Cells (FFs / Latches) May 6, 2015 9

The Scan Technique Scan Insertion May 6, 2015 10

The Scan Technique 01010101 ATE Scan Shift May 6, 2015 11

The Scan Technique ATE 0 1 1 1 0 1 1 0 1 1 1 0 0 1 0 1 Capture May 6, 2015 12

The Scan Technique ATE 0 1 1 1 0 1 1 0 1 1 1 0 0 1 0 1 Shift-Out May 6, 2015 13

Exploiting Scan - Retrieving Secrets May 6, 2015 14

Exploiting Scan Altering the Flow May 6, 2015 15

Reverse Engineering of an ASIC Phase 1 Invasive ASIC Circuit Delayering SEM Nanoscale Imaging Cross-section Phase 2 Algorithmic Circuit Spec FSM Extraction Model Checking SAT May 6, 2015 16

Reverse Engineering of an ASIC Phase 1 Invasive ASIC Circuit Delayering SEM Nanoscale Imaging Cross-section Phase 2 Algorithmic Circuit Spec FSM Extraction Model Checking SAT Scan Side Channel makes Phase 1 non-invasive May 6, 2015 17

Unfolding Sequential Circuits with Scan Combinational Logic Scan turns the ASIC to a stateless circuit Mapped to the Boolean Function Learning problem: {0,1} n {0,1} n Exhaustive Search: Extract the Truth Table by running queries for all inputs Exponential Size May 6, 2015 18

Unfolding Sequential Circuits with Scan 0 1 0 0 0 Combinational Logic F = 1 1 0 0 1 0 1 1................. Scan turns the ASIC to a stateless circuit Mapped to the Boolean Function Learning problem: {0,1} n {0,1} n Exhaustive Search: Extract the Truth Table by running queries for all inputs Exponential Size: 2Number of Registers May 6, 2015 19

Unfolding Sequential Circuits with Scan 0 1 0 0 0 Combinational Logic F = 1 1 0 0 1 0 1 1................. Scan turns the ASIC to a stateless circuit Mapped to the Boolean Function Learning problem: {0,1} n {0,1} n Exhaustive Search: Extract the Truth Table by running queries for all inputs Exponential Size: 2 n May 6, 2015 20

Shannon Effect Shannon Effect: almost all Boolean functions have a complexity close to the maximal possible (~O(2 n )) for the uniform probability distribution Corollary: For large n, almost all Boolean functions are not realizable in VLSI technology 22 n functions Search space for realizable digital circuits May 6, 2015 21

Limited Transitive Fan-in In practice, logic cones have limited number of inputs: Transitive Fan In = K May 6, 2015 22

Algorithm for Limited Transitive Fan-in Suppose F(0) = 0 (simple extension to any F) Example for K = 3: Testing all values of input v with Hamming Weight 3 or less covers all combinations of {a,b,c} v 0 0 a 0 b 0 c 0 0 0 Runtime ~ n K F i May 6, 2015 23

Transitive Fan-in Statistics for ITC 99 benchmark 400 100 Number of registers and outputs 200 75 50 25 Cumulative Percentage 0 0 50 100 150 200 250 300 0 Fan In May 6, 2015 24

Locality Hierarchical structure loose connectivity between blocks: clustering Physical locality: adjacent registers in the chain are likely to belong to the same function Often the same sub-circuit is shared by a few logic cones May 6, 2015 25

Incremental K-Bounded Search K=0 K=1 K=K init = Boolean cube May 6, 2015 26

Incremental K-Bounded Search K=0 K=1 K=K init = Boolean cube = Implicant: a cube, for which F i =1 for some i May 6, 2015 27

Incremental K-Bounded Search K=0 K=1 K=K init K=K init +K step May 6, 2015 28

Incremental K-Bounded Search K=0 K=1 K=K init K=K init +K step K=K init +i*k step Continue while there is a change May 6, 2015 29

Example: Arithmetic Circuits Number of probes 10 15 10 10 10 5 10 0 ESoTT KSoTT/CSoTT ISoTT 10 15 20 25 30 35 40 10 10 ESoTT 10 5 10 0 KSoTT/CSoTT ISoTT 0 5 10 15 20 25 30 Space 10 15 10 10 10 5 ESoTT KSoTT (bound) CSoTT/ISoTT 10 15 10 10 10 5 ESoTT KSoTT (bound) CSoTT/ISoTT 10 0 10 15 20 25 30 35 40 N Adder 10 0 0 5 10 15 20 25 30 N Multiplier May 6, 2015 30

Pipelined Accumulator + 10 20 10 15 ESoTT KSoTT/CSoTT ISoTT 10 20 10 15 ESoTT KSoTT (bound) CSoTT/ISoTT + Number of probes 10 10 10 5 Space 10 10 10 5 + 10 0 10 20 30 40 50 N (~Width*Depth) Runtime 10 0 10 20 30 40 50 N (~Width*Depth) Space May 6, 2015 31

Summary Reverse Engineering can be non-invasive Scan Side Channel is a security and IP protection threat Polynomial time reconstruction is possible thanks to the limited transitive fan-in Additional accuracy is achieved with incremental heuristic based algorithm May 6, 2015 32

Future Work Enhancing the algorithms Machine Learning PAC learning Overcoming Practical Limitations Compression, Masking, Protection Methods Hide the function without sacrificing testability Finding Hardware Trojans Detecting mismatches with scan May 6, 2015 33

Thanks! May 6, 2015 34