Non-uniform Selective Way Cache の動的制御による組込みプロセッサの省エネルギー化

Similar documents
ジェスチャ併用型 Voice-to-MIDI システムの提案 第五回知識創造支援システムシンポジウム報告書 : 本著作物の著作権は著者に帰属します

科学研究費助成事業 ( 科学研究費補助金 ) 研究成果報告書

23 May 2018, Galveson, TX Science of Team Science 2018 Conference Ge WANG 1,3 and Ken-ichi SATO 2,3

[1] 大橋和也, 森拓哉, 古関隆章 運転整理時における乗車率に応じた旅客行動の変化のモデル化 電気学会論文誌 D,J-Rail 2013 特集,2015,pp

A Software Technique to Improve Yield of Processor Chips in Presence of Ultra-Leaky SRAM Cells Caused by Process Variation

先進情報科学特別講義 Ⅱ,Ⅳ 高スループット無線通信システムに関する研究動向. Research Trends on High Throughput Wireless Communication Systems

Reducing Dynamic Power and Leakage Power for Embedded Systems

情Propagation Characteristics of 700MHz Band V2X Wireless Communication*

An adaptive protocol for distributed beamforming Simulations and experiments

(Osaka Industrial Technology - Platform)

A Software Technique to Improve Yield of Processor Chips in Presence of Ultra-Leaky SRAM Cells Caused by Process Variation

JSPS Science Dialog Program Kofu Higashi High School

電子回路論第 6 回 Electric Circuits for Physicists

Conventional 4-Way Set-Associative Cache

Service Research and Innovation in Japan

Electrical Engineering

Real-Time Task Scheduling for a Variable Voltage Processor

Impedance Inverter Z L Z Fig. 3 Operation of impedance inverter. i 1 An equivalent circuit of a two receiver wireless power transfer system is shown i

Study in Patent Risk and Countermeasures Related to Open Management in Interaction Design

IEEE. s Magazine 電子情報通信学会誌 電気学会誌第 123 巻 4 号 年 4 月. IEEE Photonics Tech. Lett.,

Grotesqueness and Cruelty in George MacDonald's The Princess and the Goblin

車載カメラにおける信号機認識および危険運転イベント検知 Traffic Light Recognition and Detection of Dangerous Driving Events from Surveillance Video of Vehicle Camera

Ramon Canal NCD Master MIRI. NCD Master MIRI 1

A Co-worker Robot PaDY" for Automobile Assembly Line

Low-Power VLSI. Seong-Ook Jung VLSI SYSTEM LAB, YONSEI University School of Electrical & Electronic Engineering

社団法人 電子情報通信学会 THE THEINSTITUTE OF ELECTRONICS, OF ELECTRONICS, INFORMATION AND COMMUNICATION AND COMMUNICATION ENGINEERS ENGINEERS 信学技報 IEICE Technica

RUDOLF MODLEY'S CONTRIBUTION TO THE STANDARDIZATION OF GRAPHIC SYMBOLS

Ductility is Damage People Don t want to live in Damaged Buildings Akira Wada Professor Emeritus Tokyo Institute of Technology

Onboard Antenna for 700 MHz Band V2X Communication *

Multi-bit Sigma-Delta TDC Architecture for Digital Signal Timing Measurement

Big Data and High Performance Computing

安全動作領域を確保するゲート制御型電力半導体デバイスの設計法

CACTI 5.1. Shyamkumar Thoziyoor, Naveen Muralimanohar, Jung Ho Ahn, and Norman P. Jouppi HP Laboratories, Palo Alto HPL April 2, 2008*

Ecological Characteristics of Information and Its Scientific Research 1

FY 2013 Briefing Session for JST Strategic Basic Research Programs (CREST, PRESTO)

Radio Equipment Approval in Japan. November 17, 2017 ANF Seminar

Recent Science and Technology Policy in JAPAN and S&T Related Activities of MEXT

What to discuss about data?

The seven pillars of Data Science

Bus Serialization for Reducing Power Consumption

Title of the body. Citation. Issue Date Conference Paper. Text version author. Right

Modeling and Control of Bidirectional DC-DC Converters for DC Power Systems with Renewable Energy

Antonello da Messina : A Study of Iconography and Influences

Optimal Choice of FinFET Devices for Energy Minimization in Deeply-Scaled Technologies

Title inside of Narrow Hole by Needle-Typ. Issue Date Journal Article. Text version author.

研究開発評価に関する国際的な視点や国際動向

Life Science Innovation

Present Status of SMEs I

Measuring the performance of Knowledge Transfer from Universities to Industry in China. ZHONG Wei Renmin Univ

SanjigenJiten : Game System for Acquiring New Languages Visually 三次元辞典 : 第二言語学習のためのゲームシステム. Robert Howland Emily Olmstead Junichi Hoshino

SS 32. October 12, Yuji NAKAKITA. Land Mobile Communications Division, Radio Department Ministry of Internal Affairs and Communications, Japan

Challenge for Analog Circuit Testing in Mixed-Signal SOC

Chapter 4 Radio Communication Basics

The Electronic Design Automation (EDA) Lab

l Reef in Ishigaki Island- Author(s) a, Tanouchi, Hiroki, Nasu, Seigo

An Energy-Efficient Heterogeneous CMP based on Hybrid TFET-CMOS Cores

Global Standards Collaboration (GSC) 14. ITS Radiocommunication Systems in Japan

M. Khosarvy, M.R. Asharif, K. Yamashita, AN EFFICIENT ICA BASED APPROACH Multi-Carrier System, Vol. 41, pp.47-56, 2009

アルゴリズムの設計と解析. 教授 : 黄潤和 (W4022) SA: 広野史明 (A4/A8)

Overview of Issues and Discussions in Regulatory Science and Engineering over the Past Four Years in Global Arena

Hierarchical Modulation & SFN

博士学位論文. Doctoral Thesis 内容の要旨 審査結果の要旨. Thesis Abstracts and Summaries of the Thesis Review Results. The Twelfth Issue. The University of Aizu

特集 Circuit Specifications for Radio Noise Reduction in Vehicle-mounted Communication Networks* Specification Development Using Inverse Calculation

A Feed-Foreward Dynamic Voltage Frequency Management by Workload Prediction for a Low Power Motion Video Compression

Inventions and the innovation process in Japan and the US: Highlights from the US-Japan Inventor Survey

Embedded System Hardware - Reconfigurable Hardware -

Summer School on GNSS 2015

Creation of Digital Archive of Japanese Products Design process

ISSCC 2001 / SESSION 11 / SRAM / 11.4

NINJA LASER INNOVATORS BY DESIGN SINCE 1770

日独学長シンポジウムと日仏高等教育改革シンポジウムが開催されました.

レーダー流星ヘッドエコー DB 作成グループ (murmhed at nipr.ac.jp) 本規定は レーダー流星ヘッドエコー DB 作成グループの作成した MU レーダー流星ヘッド エコーデータベース ( 以下 本データベース ) の利用方法を定めるものである

Low Power High Performance 10T Full Adder for Low Voltage CMOS Technology Using Dual Threshold Voltage

Run-time Power Control Scheme Using Software Feedback Loop for Low-Power Real-time Applications

Hardware-Software Codesign. 0. Organization

科学技術 学術審議会大型プロジェクト作業部会 2015 年 12 月 22 日 永野博

Low Power Design Part I Introduction and VHDL design. Ricardo Santos LSCAD/FACOM/UFMS

Seong-Ook Jung VLSI SYSTEM LAB, YONSEI University

国際会議 ACM CHI ( ) HCI で生まれた研究例 2012/10/3 人とコンピュータの相互作用 WHAT IS HCI? (Human-Computer Interaction (HCI)

Investigation of VHF Argon Plasma at High Pressure by Balanced Power Feeding Using Laser Thomson Scattering

2.3 Preferred Topology An interesting topology called totem-pole dualboost PFC 8) is presented in Fig. 3. This circuit

Communications Policy and ITS in Japan

Lesson 5 What The Last Supper Tells Us

電子回路論第 7 回 Electric Circuits for Physicists #7

Keio EDGE Program. Kane Ishibashi Project Assistant Professor, Graduate School of System Design and Management

電子回路論第 7 回 Electric Circuits for Physicists

特集 米国におけるコンシューマ向けブロードバンド衛星サービスの現状

樊晉源簡歷 元智大學 / 工業工程與管理研究所 / 博士 (2005/06/30~2009/06/30) 大葉大學 / 事業經營研究所 / 碩士 (2001/06/30~2003/06/30) 科技政策研究與資訊中心政策研究組副研究員 (2014/01/01~ 迄今 )

SMART Manufacturing Technologies - A Chinese Approach to Sustainable Manufacturing-

CLIPPER: Counter-based Low Impact Processor Power Estimation at Run-time

Implementation as a Trickle-down Process of Knowledge and Technology to a Local Community

Shyamkumar Thoziyoor, Naveen Muralimanohar, and Norman P. Jouppi Advanced Architecture Laboratory HP Laboratories HPL October 19, 2007*

九州工業大学学術機関リポジトリ. Title with Hole Pockets by Bosch Deep Tre. Author(s) Ichiro. Issue Date

Sho Kobayashi Minseok Kim Jun-ichi Takada. Tokyo Institute of Technology. FINJAP Wrap-up seminar. December 13, 2012

MTCMOS Post-Mask Performance Enhancement

Low Power Design of Successive Approximation Registers

Performance Evaluation of Multi-Threaded System vs. Chip-Multi-Processor System

Share patents, and they shall be given you: An empirical study on consequences of patent commons

NINJA Experiment : Neutrino Interaction research with Nuclear emulsion and J-PARC Accelerator

A 0.9 V Low-power 16-bit DSP Based on a Top-down Design Methodology

Transcription:

九州大学学術情報リポジトリ Kyushu University Institutional epository Non-uniform Selective Way Cache の動的制御による組込みプロセッサの省エネルギー化 石飛, 百合子九州大学大学院システム情報科学府 石原, 亨九州大学システム LSI 研究センター 安浦, 寛人九州大学 Ishitobi, Yuriko Graduate School of Inf. Sci. & E.E., Kyushu University 他 http://hdl.handle.net/2324/13843 出版情報 : 電子情報通信学会技術研究報告. 108 (464), pp.13-18, 2009-03-05. 電子情報通信学会バージョン :accepted 権利関係 :

THE INSTITUTE OF ELECTONICS, INFOMATION AND COMMUNICATION ENGINEES TECHNICAL EPOT OF IEICE. Non-uniform Selective Way Cache 819 0395 744 LSI 814 0001 3 8 33 812 8581 6 10 1 E-mail: ishitobi@c.csce.kyushu-u.ac.jp, ishihara@slrc.kyushu-u.ac.jp, yasuura@c.csce.kyushu-u.ac.jp Non-uniform Selective Way Cache(NSWC) NSWC 2 NSWC 7% 20% A Dynamic Management Technique of a Non-uniform Selective Way Cache for educing the Energy Consumption of Embedded Processors Yuriko ISHITOBI, Tohru ISHIHAA, and Hiroto YASUUA Graduate School of Inf. Sci. & E.E.,Kyushu University Motooka 744, Nishi-ku, Fukuoka-shi, 819 0395 Japan System LSI esearch Center,Kyushu University Momochihama 3 8 33, Sawara-ku, Fukuoka-shi, 814 0001 Japan Kyushu University Hakozaki 6 10 1, Higashi-ku, Fukuoka-shi, 812 8581 Japan E-mail: ishitobi@c.csce.kyushu-u.ac.jp, ishihara@slrc.kyushu-u.ac.jp, yasuura@c.csce.kyushu-u.ac.jp Abstract This paper proposes a dynamic management technique of Non-uniform Selective Way Cache(NSWC) for reducing the total energy consumption of a CPU core, cache memories, and off-chip memories. NSWC has a way uses low supply(vdd) and low threshould(vth). In our approach, we decide insert points of instructions to change available ways in the Non-uniform Selective Way Cache. Experiments using parameters of a commercial embedded processor and an off-chip SDAM demonstrate that our algorithm reduces the energy consumption of the processor system by 7%-20% compared to the result of a processor with a same size set associative cache memory. Key words embedded processor, energy reduction, cache memory 1. AM920T TM 44% StrongAM 27% [1] [3] 1

[7] [7] Selective Way Cache(SWC) [6] SWC Non-uniform Selective Way Cache(NSWC) NSWC 2 SWC NSWC SP DP DP 2 3 NSWC 4 5 2. 2. 1 Selective Way Cache(SWC) David H. Albonesi [6] David H. Albonesi SWC 1 SWC Cache Way Select #103254 "! #%$'& # $'&,/-,.- $)(+* # &1(56 1 SWC egister(cws) CWS [6] Way Placement Timothy M. Jones [7] Timothy M. Jones (Normal access) Normal area 1 (Way placement access) Way placement area 2 Way placement access Way placement area 1 Way placement area 2. 2 Way Placement SWC SWC Way Placement Way Placement [7] Way placement access Normal access Normal access SWC 2

* +,!#"$%%& ')( @BACEDGF &EḦ I J$KML KON N KQP J$KUL KUN N KTP KQS K KON N KTP KTS K KUN N KQP - &).0/0132547698:<; &#132"4 6#/ 7.*8:979<;=6#/.<8>?. @AB 8 993; @0AB 8" $. " # $ %! JKML KUN N KTP $= KQS K KON N KTP JKML KUN N KQP 5#?> KTS K KUN N KQP &('*) & '-).0/ '*+,.0/ 2 SWC & )+"5 Way Placement SWC 2 A D 2 A B 1 B C 2 A C 3 1 2 Way Placement 3 A C C Normal area Way placement access SWC SWC SAM NSWC(Nonuniform Selective Way Cache) 3. NSWC 3. 1 NSWC 3. 1. 1 NSWC NSWC 3 NSWC SWC NSWC ( SP ) 3 NSWC ( DP ) DP SP DP DP SP DP NSWC CWS CWS CWS 3. 1. 2 NSWC T E total = T E NSW C + T E DCACHE + T E main + T E logic (1) (1) T E NSW C T E DCACHE T E main T E logic NSWC T E NSW C T E NSW C = n DP E DP + n SP E SP +n wch E wch + n missi E missi +t all (N DP P DP leak + N SP P SP leak ) (2) (2) n DP n SP DP SP E DP E SP DP SP n wch NSWC E wch n missi NSWC E missi NSWC P DP leak 3

P SP leak DP SP N DP N SP NSWC DP SP t all (1) T E DCACHE T E DCACHE = n dcache E dcache +n missd E missd +t all P leakd (3) (3) n dcache E dcache n missd E missd P leakd (1) T E logic T E main P logic P main E main n main T E logic = t all P logic (4) T E main = t all P main + n main E main (5) t all t all = n hit T hit + n missi T miss + α (6) n hit T hit T miss NSWC 3. 2 SWC 1 2 3 1 2 1 3 3. 2. 1 4 4 CFG CFG 5! / 0211-!3 04115- "!# $&%'(!)+*, %-. 78& 29 %70;:-< =3 >? 49 %@0 :3< =! >?"! # 02113!-& 0211-3 "!6 $%'!)+*, %3. CFG 5 nop nop nop 4 1 1 1 1 1 1 3. 2. 2 [5] 4

4 #%$ &(')+*-,/.0 #1$ &+' )(*-,.0 6! " 2354687 6 1 1 1 3. 2. 3 (1) 4. 4. 1 (SA) [7] Way Placement (WP) SWC (SWC) (NSWC) Way Placement SWC NSWC (1) EEMBC 4. 2 MeP SAIF Synopsys Power Compiler MICON Mobile DD SDAM [10] 4KB 2 32 AM CACTI5.1 [9] CACTI HP(High Performance) LSTP(Low Standby Power) LOP(Low Operating Power) 3 SP DP LSTP LOP CWS AM CWS 2 4. 3 7 (SA) (NSWC) 7% 20% 5

65, B %$ '& + 1 34 2 /0 -. TS g\hi UWVXGY Z\[^] i VakjWlGemTnIoWpIqCrIsCtkub _ÌaIbIcbGdIe f v w w VaIj\lTemTx ykz {k }k~crisctkukb hs S U VŽk IŠeW GŒ\ G k T Cz VŽk IŠe IŒ niowpiqirisctub bgo rgsctub v hs V Ge o ƒ}~irisctkub S S U V IbIˆ! IŠWe IŒW}~krCsItkub " # 7 8 9;: 8 9=< >??: 8A@CB D : 8 EF@GB D : 8 E?H D @IB?HDD 8 J @CB K : 8 E @IB K : 8 E @IB K 8 J EMLN O E @GP Q LN O < B N ()* 7 1 LSTP LOP V dd [V] 1.2 0.8 V th [mv] 554 315 ead energy[pj] 52.43 25.06 Leak power[mw] 0.0118 1.431 Access time [ns] 1.93704 1.0999 2 CWS 15[ns] 48.34896[pJ] 1% Way Placement (WP) 2% SWC (SWC) 2.3% WP SWC NSWC nop SWC NSWC SWC NSWC NSWC 1 LOP DP LSTP SP 1 DP NSWC 5. NSWC NSWC ( ). (JST) (CEST) [1] Ching-Long Su and Alvin Despain Cache Design Tradeoffs for Power and Performance Optimization: A Case Study In Proc. of ISLPED pp.63-68 August 1995. [2] Patrick Hicks Matthew Walnock and obert Michael Owens Analysis of Power Consumption in Memory Hierarchies In Proc. of ISLPED pp.239-242 August 1997. [3] S.Seger Low Power Design Techniques for Microprocessors ISSCC Tutorial note February 2001. [4] Lars Wehmeyer, Peter Marwedel, Fast, Efficient and Predictable Memory Accesses, Springer [5] Hiroyuki Tomiyama and Hiroto Yasuura Optimal Code Placement of Embedded Software for Instruction Cachea In Proc. of European Design and Test Conference pp.96-101 March 1996. [6] David H. Albonesi, Selective Cache Ways: On-Demand Cache esource Allocation, In Proc of the 32nd Annual IEEE/ACM International Symposium on Microarchitecture, pp. 248-259, 1999. [7] Timothy M. Jones, Sandro Bartolini, Bruno De Bus, John Cavazos, Michael F.P. O Boyle, Instruction Cache Energy Saving Through Compiler Way-Placement, In Proc. of Design Automation and Test in Europe, pp. 1196-1201, March 2008. [8], DA 2008 pp. 13-18. [9] Shyamkumar Thoziyoor, Naveen Muralimanohar, Jung Ho Ahn, Norman P. Jouppi, CACTI5.1, Technical eports, HP Labs, http://www.hpl.hp.com/techreports/2008 [10] 128Mb: x16, x32 Mobile DD SDAM Features, http://www.micron.com/ 6