AI Application Processing Requirements

Similar documents
Embedding Artificial Intelligence into Our Lives

Vision with Precision Webinar Series Augmented & Virtual Reality Aaron Behman, Xilinx Mark Beccue, Tractica. Copyright 2016 Xilinx

Harnessing the Power of AI: An Easy Start with Lattice s sensai

RANA: Towards Efficient Neural Acceleration with Refresh-Optimized Embedded DRAM

23270: AUGMENTED REALITY FOR NAVIGATION AND INFORMATIONAL ADAS. Sergii Bykov Technical Lead Machine Learning 12 Oct 2017

Neural Networks The New Moore s Law

Proposers Day Workshop

Multiband NFC for High-Throughput Wireless Computer Vision Sensor Network

Low Power Microphone Acquisition and Processing for Always-on Applications Based on Microcontrollers

Transer Learning : Super Intelligence

ARMY RDT&E BUDGET ITEM JUSTIFICATION (R2 Exhibit)

A.I in Automotive? Why and When.

Vision & Industry 4.0: Towards smarter sensors. Dr. Amina Chebira Vision Embedded Systems, CSEM SA October 4 th, 2016

Designing with STM32F3x

NCCT IEEE PROJECTS ADVANCED ROBOTICS SOLUTIONS. Latest Projects, in various Domains. Promise for the Best Projects

INTRODUCTION. What is the LSN50

Energy autonomous wireless sensors: InterSync Project. FIMA Autumn Conference 2011, Nov 23 rd, 2011, Tampere Vesa Pentikäinen VTT

GPU ACCELERATED DEEP LEARNING WITH CUDNN

SiNANO-NEREID Workshop:

Teleoperated Robot Controlling Interface: an Internet of Things Based Approach

A Flying-Domain DC-DC Converter Powering a Cortex-M0 Processor with 90.8% Efficiency

* Intelli Robotic Wheel Chair for Specialty Operations & Physically Challenged

Enhancing Shipboard Maintenance with Augmented Reality

DTMF Controlled Robot

Visvesvaraya Technological University, Belagavi

AST Catania Lab STMicroelectronics

Control Systems Overview REV II

How Machine Learning and AI Are Disrupting the Current Healthcare System. Session #30, March 6, 2018 Cris Ross, CIO Mayo Clinic, Jim Golden, PwC

Deep Learning Overview

AIS Annual Investor Day 2016 Digital Transformation at AIS. 18 November 2016

5G R&D at Huawei: An Insider Look

REAL TIME DIGITAL SIGNAL PROCESSING. Introduction

SPY ROBOT CONTROLLING THROUGH ZIGBEE USING MATLAB

A SPAD-Based, Direct Time-of-Flight, 64 Zone, 15fps, Parallel Ranging Device Based on 40nm CMOS SPAD Technology

DTP4700 Next Generation Software Defined Radio Platform

HeroX - Untethered VR Training in Sync'ed Physical Spaces

Application Areas of AI Artificial intelligence is divided into different branches which are mentioned below:

Lineup for Compact Cameras from

GPU Computing for Cognitive Robotics

Unit level 5 Credit value 15. Introduction. Learning Outcomes

The Key to the Internet-of-Things: Conquering Complexity One Step at a Time

CALL FOR PAPERS. embedded world Conference. -Embedded Intelligence- embedded world Conference Nürnberg, Germany

RIGEL RESM SYSTEM NAVAL

SHAPING THE FUTURE OF IOT: PLATFORMS FOR CO-CREATION, RAPID PROTOTYPING AND SUCCESSFUL INDUSTRIALIZATION

Leverage always-on voice trigger IP to reach ultra-low power consumption in voicecontrolled

SpiNNaker SPIKING NEURAL NETWORK ARCHITECTURE MAX BROWN NICK BARLOW

32-bit ARM Cortex-M0, Cortex-M3 and Cortex-M4F microcontrollers

CMOS MT9D111Camera Module 1/3.2-Inch 2-Megapixel Module Datasheet

New Export Requirements for Emerging and Foundational Technologies

Advances and Perspectives in Health Information Standards

A NEW NEUROMORPHIC STRATEGY FOR THE FUTURE OF VISION FOR MACHINES June Xavier Lagorce Head of Computer Vision & Systems

Computer Vision at the Edge and in the Cloud: Architectures, Algorithms, Processors, and Tools

Dr. Ashish Dutta. Professor, Dept. of Mechanical Engineering Indian Institute of Technology Kanpur, INDIA

Definitions and Application Areas

MACHINE LEARNING Games and Beyond. Calvin Lin, NVIDIA

Implementing Physical Capabilities for an Existing Chatbot by Using a Repurposed Animatronic to Synchronize Motor Positioning with Speech

Service Robots in an Intelligent House

Fpga Implementations Of Neural Networks Springer

Institute of Computer Technology

Portfolio. Swaroop Kumar Pal swarooppal.wordpress.com github.com/swarooppal1088

STM RH-ASIC capability

Software Computer Vision - Driver Assistance

Architecting Systems of the Future, page 1

What Is And How Will Machine Learning Change Our Lives. Fair Use Agreement

Building Perceptive Robots with INTEL Euclid Development kit

GNSS in Autonomous Vehicles MM Vision

Implementation of a Self-Driven Robot for Remote Surveillance

THE VISIONLAB TEAM engineers - 1 physicist. Feasibility study and prototyping Hardware benchmarking Open and closed source libraries

An energy-efficient coarse grained spatial architecture for convolutional neural networks AlexNet

Pushing Ultra-Low-Power Digital Circuits

Smart-M3-Based Robot Interaction in Cyber-Physical Systems

Model-Based Design for Sensor Systems

24GHz Modules Industrial Radar Solution

5 th Generation Non-Orthogonal Waveforms for Asynchronous Signaling. Final Review. Brussels, Work Package 5

ARTIFICIAL ROBOT NAVIGATION BASED ON GESTURE AND SPEECH RECOGNITION

Wireless Broadband Solutions for Unmanned Aerial Systems

Intelligent Technology for More Advanced Autonomous Driving

MEMS Accelerometer sensor controlled robot with wireless video camera mounted on it

Presentation at Salland Engineering Test Symposium Sept 14 th 2018

ADVANCED EMBEDDED MONITORING SYSTEM FOR ELECTROMAGNETIC RADIATION

Exploring Computation- Communication Tradeoffs in Camera Systems

Pathbreaking robots for pathbreaking research. Introducing. KINOVA Gen3 Ultra lightweight robot. kinovarobotics.com 1

Lecture Perspectives. Administrivia

USING EMBEDDED PROCESSORS IN HARDWARE MODELS OF ARTIFICIAL NEURAL NETWORKS

COSC343: Artificial Intelligence

New export control and CFIUS restrictions on emerging technologies becoming a reality

RFID Integrated Teacher Monitoring

11/13/18. Introduction to RNNs for NLP. About Me. Overview SHANG GAO

Artificial Intelligence and Deep Learning

Deep Learning for Autonomous Driving

IRIDA Labs Your eyes to the future

Artificial Intelligence The next big growth driver for the semiconductor industry

Accelerating Collective Innovation: Investing in the Innovation Landscape

Humanification Go Digital, Stay Human

Lecture 30. Perspectives. Digital Integrated Circuits Perspectives

Sensor Network Platforms and Tools

Team Description Paper: HuroEvolution Humanoid Robot for Robocup 2014 Humanoid League

Due to the absence of internal nodes, inverter-based Gm-C filters [1,2] allow achieving bandwidths beyond what is possible

The Key to the Internet-of-Things: Conquering Complexity One Step at a Time

Transcription:

AI Application Processing Requirements 1 Low Medium High Sensor analysis Activity Recognition (motion sensors) Stress Analysis or Attention Analysis Audio & sound Speech Recognition Object detection Computer Vision Multiple Objects Detection/Classification/Tracking Speech Synthesis STM32 From IP embedded in MCU/MPU to dedicated SOC Audio use cases with individual commands Classic motion sensor use cases Mandatory to support advanced Audio and Video complex use cases.

ST Solutions for Artificial Intelligence 2 ST Developed tool Neural Network Converter & Code Generator DCNN SoC AI specific IC Prototype Demonstration Product Demonstration STM32 Microcontrollers (Bare/RTOS) Roadmap STM32 + DCNN enhanced architecture (Bare/RTOS) DCNN = Deep Convolutional Neural Network

ST Solutions for Artificial Intelligence 3 ST Developed tool Neural Network Converter & Code Generator DCNN SoC AI specific IC Prototype Demonstration Product Demonstration STM32 Microcontrollers (Bare/RTOS) Roadmap STM32 + DCNN enhanced architecture (Bare/RTOS) DCNN = Deep Convolutional Neural Network

ST Enables AI on STM32 4 Benefits of Deep Learning now available across all STM32 portfolio Input your Framework dependent, Pre-Trained neural network into STM32CubeMX.AI Automatic and fast generation of an STM32- optimized library STM32CubeMX.AI guarantees interoperability with state-of-the-art Deep Learning Design Frameworks

Neural Networks Ecosystem 5 Benefits of Deep Learning now available across all STM32 portfolio How to create a Neural Network? 1. Define a problem 2. Collect/find/label data 3. Define topology 4. Design with Off-the-Shelf Deep Learning Design Frameworks are available to train and create Artificial Neural Networks Pre-Trained Neural Network Representation (Creation tool specific format)

Neural Networks Available Now for STM32 6 Benefits of Deep Learning now available across all STM32 portfolio Off-the-shelf : Pre-trained Neural Network Model Deep Learning Framework dependent Embedded Solution Optimized Neural Network Code generated for STM32 Deep Learning SW Solution... brings your AI-based innovation to the existing STM32 Portfolio

: Architecture 7 Benefits of Deep Learning now available across all STM32 portfolio Off-the-shelf : Pre-trained Neural Network Model Deep Learning Framework dependent Neural Network Exporter DL Framework Independent Neural Network Representation Code Generator Embedded Solution Optimized Neural Network Code generated for STM32 NN Layers Library for STM32 Neural Networks API s This optimized STM32 neural network model can be included into the user project (using KEIL, IAR, OpenSTM32) and can be compiled and ported onto the final device for field trials

Artificial Intelligence is Everywhere 8 Gaming Mobile Surveillance Virtual/augmented Reality Smart home Drone Relationship robot Companion Robot Domestic Robot Security/Eye tracking Retail

Artificial Intelligence for Everything 2 Where am I? Scene classification (audio, video, environmental sensors) Which objects are in the scene, where are they? Video object detection/classification What am I doing? Activity recognition (audio, video, inertial sensors) What s happening? Analysis Event recognition (audio, video, inertial sensors, environmental sensors). User Interaction Command detection (audio) Speech Recognition (audio) Gesture Recognition (inertial sensors, video) User identification and mood detection (audio, video) Continuous Learning How can I detect unpredictable, unclassified events in dynamic environments? Recurrent networks (audio, video, inertial sensors, environmental sensors)

Distributed Artificial Intelligence is a Must NODES to Increase Systems Efficiency 10 Processing Connectivity Sensor Data Processing Connectivity Sensor Data Processing Connectivity Security Sensing & Actuating Actions Security Actions Security 1 Sensor 100 Sensors 10,000 Sensors Processing Requirements Connectivity/Bandwidth Requirements

Neural Networks are Key for Intelligent Nodes 11 Artificial Intelligence Machine Learning Deep Learning Subset of Machine Learning algorithms based on Artificial Neural Networks (DNN, CNN, RNN, SOANN, ect) What is an Artificial Neural Network? Models for input-output transfer function approximation inspired by biological neural networks. Capable of modeling and processing highly time varying and non linear relationships between inputs and outputs. Exponentially faster and more efficient than traditional computer processing models for typical AI uses like detection, classification, prediction. How to design ANN s? Off-the-Shelf Deep Learning Design Frameworks are available to design and train Artificial Neural Networks Example: A neural network trained to classify an object in a picture. Pre-Trained Neural Network Representation (Creation tool specific format)

ST Enables AI @ the Edge 12 Benefits Code for ARM Cortex-M Microcontroller Available today Runs on any STM32 Typical use cases Pre-Trained Neural Network Representation (Creation tool specific format) Automatic ST Tool Neural Network Optimizer & Code Generator STM32 Microcontrollers (Bare/RTOS) Smart Industry: preventive maintenance Wearable: Human activity recognition Consumer : Entry level IoT node Audio : Intelligent microphones Benefits Performance/power optimized Intensive data & processing applications Typical use cases Code for ST AI-Specific IC ULTRA HIGH POWER/AREA EFFICIENCY NEURAL NETWORK SOC Smart Driving: In-vehicle driver monitoring Smart surveillance cameras Consumer : AR, Drones, Robots

HW Accelerated Deep Learning SoC 6 6.2mm A configurable, scalable and design time parametric Convolutional Neural Network Processing Engine OTP 5.6mm High Speed Camera IF PLL Chip To Chip M4 (DSP) Cores And Local Memories Coprocessors Subsystem Global Memory Subsystem DCNN Convolutional Layers accounts for more than 90% DCNN operations, hence 8 Convolution HW Accelerators allow high efficiency in area vs GOPS vs power consumption In addition to ARM Cortex M4, 8 DSP Clusters allow both programmability and flexible mapping of diversified, custom DCNN s Embedded Memory enables further reduction of power consumption required by IOT applications.

DCNN SoC Test Chip Main Features 7 (*) 1 MAC defined as 2 OPS (ADD + MUL) Technology FD-SOI 28nm Package FBGA 15x15x1.83 Clock freq 200MHz 1.175GHz Supply voltages 0.575V 1.1V digital 1.8V I/O On-chip RAM 4x1 MB (Global), 8x192 KB (DSP), 128 KB (Host) Host ARM Cortex -M4 DSPs Nr 16 Peak DSP performance (1.175GHz, 1.1V) Convolutional Accelerators Nr 8 CA size (including local memory) 0.27 sqmm Max Tot CAs performance (1.175GHz, 1.1V) Power (**) @200MHz, 0.6V, 2 CAs Power (**) @1GHz, 1V, 8 CAs CAs Peak Efficiency @200Mhz, 0.575V (Alexnet) 75 GOPS (dual 16b MAC loop) 676 GOPS 37.5mW @ 10 FPS 600mW @ 60FPS (not optimized yet) 2.9 TOPS/W