Low-Power Communications and Neural Spike Sorting

Similar documents
EE M255, BME M260, NS M206:

A 130-µW, 64-Channel Spike-Sorting DSP Chip

This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore.

From Antenna to Bits:

ENCOR-Phase 2. Enabling Methods for Dynamic Spectrum Access and Cognitive Radio

2015 The MathWorks, Inc. 1

Implementation and Complexity Analysis of List Sphere Detector for MIMO-OFDM systems

Modernised GNSS Receiver and Design Methodology

A/D Conversion and Filtering for Ultra Low Power Radios. Dejan Radjen Yasser Sherazi. Advanced Digital IC Design. Contents. Why is this important?

Wideband Down-Conversion and Channelisation Techniques for FPGA. Eddy Fry RF Engines Ltd

A to nj/bit/iteration Scalable 3GPP LTE Turbo Decoder with an Adaptive Sub-Block Parallel Scheme and an Embedded DVFS Engine

A10-Gb/slow-power adaptive continuous-time linear equalizer using asynchronous under-sampling histogram

Software Radio Satellite Terminal: an experimental test-bed

ni.com The NI PXIe-5644R Vector Signal Transceiver World s First Software-Designed Instrument

VLSI implementation of the discrete wavelet transform

What s Behind 5G Wireless Communications?

Supplemental Slides: MIMO Testbed Development at the MPRG Lab

Ph.D. Qualifying Exam

A Hardware Design for In-Brain Neural Spike Sorting

Towards an ADC for the Liquid Argon Electronics Upgrade

EE 434 ASIC and Digital Systems. Prof. Dae Hyun Kim School of Electrical Engineering and Computer Science Washington State University.

Fast and Accurate RF component characterization enabled by FPGA technology

Design of Multiplier Less 32 Tap FIR Filter using VHDL

Datorstödd Elektronikkonstruktion

High Performance DSP Solutions for Ultrasound

Technology Timeline. Transistors ICs (General) SRAMs & DRAMs Microprocessors SPLDs CPLDs ASICs. FPGAs. The Design Warrior s Guide to.

ULTRA-LOW POWER PLATFORMS FOR HUMAN ENHANCEMENT

ELT Radio Architectures and Signal Processing. Motivation, Some Background & Scope

Digital Systems Design

Real-Time Testing Made Easy with Simulink Real-Time

Introduction. EE216B: VLSI Signal Processing. Prof. Dejan Marković EE216B Elevator Pitch. Area/energy-efficient mapping

100 Gb/s: The High Speed Connectivity Race is On

Overview and Challenges

SiNANO-NEREID Workshop:

DESIGN OF LOW POWER SAR ADC FOR ECG USING 45nm CMOS TECHNOLOGY

Bi-directional brain computer interface with brain implantable Arm-based SoCs Joseph Fernando Principal architect

BPSK_DEMOD. Binary-PSK Demodulator Rev Key Design Features. Block Diagram. Applications. General Description. Generic Parameters

Cognitive Radio Platform Technology

2002 IEEE International Solid-State Circuits Conference 2002 IEEE

Integrated Solutions for Testing Wireless Communication Systems

A Two- Bit- per- Cycle Successive- Approximation ADC with Background Offset Calibration

SV2C 28 Gbps, 8 Lane SerDes Tester

FPGA based Prototyping of Next Generation Forward Error Correction

FUNDAMENTALS OF ANALOG TO DIGITAL CONVERTERS: PART I.1

Merging Propagation Physics, Theory and Hardware in Wireless. Ada Poon

Chapter 2 Energy-Efficient Digital Processing for Neural Action Potentials

Wideband Spectral Measurement Using Time-Gated Acquisition Implemented on a User-Programmable FPGA

UCB Picocube A modular approach to miniature wireless 1 cm μw P avg

Copyright 2007 Year IEEE. Reprinted from ISCAS 2007 International Symposium on Circuits and Systems, May This material is posted here

Satellite Tuner Single Chip Simulation with Advanced Design System

22. VLSI in Communications

Low-Power CMOS VLSI Design

Pipeline vs. Sigma Delta ADC for Communications Applications

(VE2: Verilog HDL) Software Development & Education Center

CHAPTER 4 FUZZY BASED DYNAMIC PWM CONTROL

SpectraTronix C700. Modular Test & Development Platform. Ideal Solution for Cognitive Radio, DSP, Wireless Communications & Massive MIMO Applications

Hardware-based Image Retrieval and Classifier System

AC : ORTHOGONAL FREQUENCY DIVISION MULTIPLEX- ING (OFDM) DEVELOPMENT AND TEACHING PLATFORM

Mobile Radio Transformation in the Age of 5G: A Perspective on Opportunities for SOI

26.8: A 1.9GHz Single-Chip CMOS PHS Cellphone

Fully Automated Network- Centric Spectrum Analysis and Signal Intelligence System

EE241 - Spring 2004 Advanced Digital Integrated Circuits. Announcements. Borivoje Nikolic. Lecture 15 Low-Power Design: Supply Voltage Scaling

Software Design of Digital Receiver using FPGA

ULTRASCALE DDR4 DE-EMPHASIS AND CTLE FEATURE OPTIMIZATION WITH STATISTICAL ENGINE FOR BER SPECIFICATION

Application of PC Vias to Configurable RF Circuits

Radio Testbeds Using BEE2

Nutaq Radio420X Multimode SDR FMC RF transceiver PRODUCT SHEET

Challenges of 5G mmwave RF Module. Ren-Jr Chen M300/ICL/ITRI 2018/06/20

Another way to implement a folding ADC

In this lecture, we will look at how different electronic modules communicate with each other. We will consider the following topics:

A 1.7-to-2.2GHz Full-Duplex Transceiver System with >50dB Self-Interference Cancellation over 42MHz Bandwidth

Fixed-Point Aspects of MIMO OFDM Detection on SDR Platforms

Audio Sample Rate Conversion in FPGAs

Nutaq Radio420X I MONTREAL I NEW YORK I. Multimode SDR FMC RF transceiver PRODUCT SHEET. RoHS. nutaq.com QUEBEC

DESIGN OF A MEASUREMENT PLATFORM FOR COMMUNICATIONS SYSTEMS

Wireless Communication Systems: Implementation perspective

FEATURES DESCRIPTION BENEFITS APPLICATIONS. Preliminary PT4501 Sub-1 GHz Wideband FSK Transceiver

Prototyping Next-Generation Communication Systems with Software-Defined Radio

OQPSK COGNITIVE MODULATOR FULLY FPGA-IMPLEMENTED VIA DYNAMIC PARTIAL RECONFIGURATION AND RAPID PROTOTYPING TOOLS

Research on Flexible radios. Dr. Krishnamurthy Soumyanath Intel Fellow Director, Communications Circuits Lab

Hot Topics and Cool Ideas in Scaled CMOS Analog Design

Proposing. An Interpolated Pipeline ADC

BPSK System on Spartan 3E FPGA

CHAPTER 4 FIELD PROGRAMMABLE GATE ARRAY IMPLEMENTATION OF FIVE LEVEL CASCADED MULTILEVEL INVERTER

Multiple Reference Clock Generator

Mohit Arora. The Art of Hardware Architecture. Design Methods and Techniques. for Digital Circuits. Springer

Low Power Design Part I Introduction and VHDL design. Ricardo Santos LSCAD/FACOM/UFMS

RTT TECHNOLOGY TOPIC January G DSP

Design and FPGA Implementation of an Adaptive Demodulator. Design and FPGA Implementation of an Adaptive Demodulator

PAR4CR: THE DEVELOPMENT OF A NEW SDR-BASED PLATFORM TOWARDS COGNITIVE RADIO

Computer Aided Design of Electronics

Does The Radio Even Matter? - Transceiver Characterization Testing Framework

Real-time FPGA Implementation of Transmitter Based DSP

An All CMOS, 2.4 GHz, Fully Adaptive, Scalable, Frequency Hopped Transceiver

RF and Microwave Test and Design Roadshow Cape Town & Midrand

Some Radio Implementation Challenges in 3G-LTE Context

A SWITCHED-CAPACITOR POWER AMPLIFIER FOR EER/POLAR TRANSMITTERS

Development of a 20 GS/s Sampling Chip in 130nm CMOS Technology

Spectrum Detector for Cognitive Radios. Andrew Tolboe

Short Range UWB Radio Systems. Finding the power/area limits of

Transcription:

CASPER Workshop 2010 Low-Power Communications and Neural Spike Sorting CASPER Tools in Front-to-Back DSP ASIC Development Henry Chen henryic@ee.ucla.edu August, 2010

Introduction Parallel Data Architectures group @ UCLA EE Prof. Dejan Marković + 14 grad students http://icslwebs.ee.ucla.edu/dejan/researchwiki/ Effective handling of power efficiency and design complexity issues in scaled CMOS technologies Emphasis on DSP algorithms for communications and neuroscience

Energy-Delay Design Tradeoff suboptimal infeasible

Energy-Delay Optimization

Design Freedoms Algorithm Architecture (parallelize, pipeline, time-multiplex) Circuit (static/dynamic, DVFS, high/low V T, etc.) Design choices have cross-level implications

Hierarchical Optimization

Multi-Step Development Cycle Multiple engineering teams & design descriptions Algorithm (Matlab, C, C++) Architecture (Verilog, VHDL) Circuit (RTL, schematic/layout, standard cells) Test (pattern generator, logic analyzer) Each step requires translation & re-verification I.e., floating-point to fixed-point

Approach Unified Matlab/Simulink description Algorithm design and verification FPGA hardware emulation Optimized ASIC implementation ASIC test and verification Sounds like a familiar tune

Front-to-Back Solution Architectural & algorithmic exploration/optimization High-performance/high-throughput computation Closed-loop test environment

Neural Spike Sorting Design Optimization for khz-rate DSP Sarah Gibson, Vaibhav Karkare

Spike Sorting Neural microelectrodes often pick up signals from multiple neurons surrounding probe Each action potential needs to be resolved to its originating neuron Single-unit activity needed for basic neuroscience and biomedical applications

Spike Sorting Process Absolute value threshold Nonlinear energy operator Stationary wavelet transform product Principle component analysis Discrete wavelet transform Discrete derivatives Integral transform

On-chip Spike Sorting Conventional method: wired, software post-process Proposed: implantable wireless spike sorting

Need for khz-rate DSPs Signal bandwidths ~20kHz in biomedical apps. Implantable chips need to be << 800µW/mm 2 to prevent tissue damage Must have long battery life (~5-10yrs) or scavenge energy

Algorithm Selection Spike Detection MOPS Area (mm 2 ) Absolute Val. 0.4806 0.06104 NEO 4.224 0.02950 SWTP 86.75 56.70 Feature Extraction MOPS Area (mm 2 ) PCA 1.265 0.2862 DWT 3.125 0.06105 DD 0.1064 0.04725 IT 0.05440 0.03709

Interleaved Architecture

Multi-mode Spike Sorting Chip Power 130µW for 64 channels 52µW for 16 channels Area Die: 7.07mm 2 Core: 4mm 2 Classification accuracy: Over 90% for SNR > 0dB Power Density: 30µW/mm 2 91% data reduction

Flexible Radio Digital Front-Ends Chip Testing for GHz-rate DSP Rashmi Nanda

Why Flexible DFEs? Multi-radio integration (cellular, WiFi, GPS) Cognitive radios Evolving wireless standards WiMAX Cellular LTE

Conventional RX Architecture Downsides: No reconfigurability for multi-standard support Analog components don t get same power/area benefits from technology scaling

Replacing the RX Front-End

RX DFE Convert from ADC frequency F S1 to modem frequency F S2 with negligible SNR degradation

ASIC Test @ BWRC, ca. 2006

ASIC Testing w/ ROACH

Test Setup Matlab 1GbE PowerPC QDR SRAM BRAM FPGA LVDS IO ASIC Test Board ASIC

Testing Requirements High TX clock rate (400MHz target) Beyond practical limits of IBOB s V2P Long test vectors (~4Mb) Asynchronous clock domains for TX and RX

Asynchronous Clock Domains Manually merged separate designs for test vector and readback datapaths Fixed 60MHz RX 255-315 MHz TX

Results Tested to 315MHz (1.89GHz RF) w/ loadable vectors in QDR Up to 340MHz (2.04GHz RF) with pre-compiled vectors in ROMs Tech. / V DD 0.13µm/1.5V 65 nm / 1.0V Area??? 0.4 mm 2 Maximum 104 MHz 2.52 GHz input sample rate Maximum 5 MHz 20 MHz signal bandwidth Current consumption 21 ma 13 ma Maximum SNR 84 db (5MHz bandwidth) 55 db (20 MHz bandwidth)

Limitations DDR output FF critical path @ 340MHz (clock out) QDR SRAM bus interface critical path @ 315MHz Output clock jitter?

Summary Unified description for DSP design allows multi-level iterative optimization Can be applied to any power/performance specs Matlab + BORPH/KATCP provides great test environment for DSP applications

Thank you!