Outline Single Cycle Processor Design Multi cycle Processor. Pipelined Processor Design. Overall clock period. Analyzing performance 3/18/2015

Similar documents
IF ID EX MEM WB 400 ps 225 ps 350 ps 450 ps 300 ps

IF ID EX MEM WB 400 ps 225 ps 350 ps 450 ps 300 ps

7/11/2012. Single Cycle (Review) CSE 2021: Computer Organization. Multi-Cycle Implementation. Single Cycle with Jump. Pipelining Analogy

Chapter 4. Pipelining Analogy. The Processor. Pipelined laundry: overlapping execution. Parallelism improves performance. Four loads: Non-stop:

Computer Hardware. Pipeline

Go over Quizzes 2 and 4. Introduction to pipelining Maybe hazards

RISC Design: Pipelining


Technology Trends & Issues in High-Speed Digital Systems

Lecture Topics. Announcements. Today: Pipelined Processors (P&H ) Next: continued. Milestone #4 (due 2/23) Milestone #5 (due 3/2)

Computer Architecture

Suggested Readings! Lecture 12" Introduction to Pipelining! Example: We have to build x cars...! ...Each car takes 6 steps to build...! ! Readings!

ECE 2300 Digital Logic & Computer Organization. More Pipelined Microprocessor

CS420/520 Computer Architecture I

ECE473 Computer Architecture and Organization. Pipeline: Introduction

EE 457 Homework 5 Redekopp Name: Score: / 100_

7/19/2012. IF for Load (Review) CSE 2021: Computer Organization. EX for Load (Review) ID for Load (Review) WB for Load (Review) MEM for Load (Review)

CSE 2021: Computer Organization

x O O 3 O 05. Questions on Conditional Probability Q1. The probability that it will rain on a day in June is 0.

L A-B-C dei Segnali Spread-Spectrum

How to Shorten First Order Unit Testing Time. Piotr Mróz 1

LECTURE 8. Pipelining: Datapath and Control

Chapter 14: Bandpass Digital Transmission. A. Bruce Carlson Paul B. Crilly 2010 The McGraw-Hill Companies

Generating Polar Modulation with R&S SMU200A

Pipelined Processor Design

Communications II Lecture 7: Performance of digital modulation

Chapter 2 Introduction: From Phase-Locked Loop to Costas Loop

= f 8 f 2 L C. i C. 8 f C. Q1 open Q2 close (1+D)T DT 2. i C = i L. Figure 2: Typical Waveforms of a Step-Down Converter.

Notes on the Fourier Transform

CS 110 Computer Architecture Lecture 11: Pipelining

Lecture 5: DC-DC Conversion

Network Performance Metrics

EECS150 - Digital Design Lecture 2 - Synchronous Digital Systems Review Part 1. Outline

Investigation and Simulation Model Results of High Density Wireless Power Harvesting and Transfer Method

Synchronization of single-channel stepper motor drivers reduces noise and interference

EECE 321: Computer Organiza5on

4.5 Biasing in BJT Amplifier Circuits

ISSCC 2007 / SESSION 29 / ANALOG AND POWER MANAGEMENT TECHNIQUES / 29.8

ECMA st Edition / June Near Field Communication Wired Interface (NFC-WI)

ECE 4750 Computer Architecture, Fall 2016 T09 Advanced Processors: Superscalar Execution

6.S084 Tutorial Problems L19 Control Hazards in Pipelined Processors

SystemC-AMS Hands-On Lab Part 2

FAA/EUROCONTROL ATM Seminar 2015 Model for Longitudinal Uncertainty during Controlled Time of Arrival Operations

Simulation Series Termination

CMP 301B Computer Architecture. Appendix C

Teacher Supplement to Operation Comics, Issue #5

ECMA-373. Near Field Communication Wired Interface (NFC-WI) 2 nd Edition / June Reference number ECMA-123:2009

EE 40 Final Project Basic Circuit

Optical Short Pulse Generation and Measurement Based on Fiber Polarization Effects

Single-Cycle CPU The following exercises are taken from Hennessy and Patterson, CO&D 2 nd, 3 rd, and 4 th Ed.

Phase-Shifting Control of Double Pulse in Harmonic Elimination Wei Peng1, a*, Junhong Zhang1, Jianxin gao1, b, Guangyi Li1, c

Will my next WLAN work at 1 Gbps?

M2 3 Introduction to Switching Regulators. 1. What is a switching power supply? 2. What types of switchers are available?

4 20mA Interface-IC AM462 for industrial µ-processor applications

Precise State Recovery. Out-of-Order Pipelines

PMOS Testing at. Rochester Institute of Technology. Dr. Lynn Fuller

MX629. DELTA MODULATION CODEC meets Mil-Std DATA BULLETIN. Military Communications Multiplexers, Switches, & Phones

ECE-517 Reinforcement Learning in Artificial Intelligence

Pipelining A B C D. Readings: Example: Doing the laundry. Ann, Brian, Cathy, & Dave. each have one load of clothes to wash, dry, and fold

Question 1 TELE4353. Average Delay Spread. RMS Delay Spread = = Channel response (2) Channel response (1)

EE368/CS232 Digital Image Processing Winter Homework #1 Released: Monday, January 8 Due: Wednesday, January 17, 1:30pm

10. The Series Resistor and Inductor Circuit

Analog/Digital Communications Primer

(This lesson plan assumes the students are using an air-powered rocket as described in the Materials section.)

Lab 3 Acceleration. What You Need To Know: Physics 211 Lab

Software solutions to mitigate the EM emission of power modules

Pointwise Image Operations

MEASUREMENTS OF VARYING VOLTAGES

EECE 301 Signals & Systems Prof. Mark Fowler

Square Waves, Sinusoids and Gaussian White Noise: A Matching Pursuit Conundrum? Don Percival

Family of Single-Inductor Multi-Output DC-DC Converters

Errata and Updates for ASM Exam MLC (Fourteenth Edition) Sorted by Page

Proceedings of International Conference on Mechanical, Electrical and Medical Intelligent System 2017

Product Specifications

Announcement. Allowed

A Harmonic Circulation Current Reduction Method for Parallel Operation of UPS with a Three-Phase PWM Inverter

Examination Mobile & Wireless Networking ( ) April 12,

Social-aware Dynamic Router Node Placement in Wireless Mesh Networks

Universal microprocessor-based ON/OFF and P programmable controller MS8122A MS8122B

Lecture 11. Digital Transmission Fundamentals

A New Voltage Sag and Swell Compensator Switched by Hysteresis Voltage Control Method

Analog Circuits EC / EE / IN. For

Potato IC. Contact Potato Semiconductor for IP or detail. PotatoSemi High frequency noise cancellation technology.

Introduction to Orthogonal Frequency Division Multiplexing (OFDM)

MATLAB/SIMULINK TECHNOLOGY OF THE SYGNAL MODULATION

Lecture 13: Capacity of Cellular Systems

A B C D. Ann, Brian, Cathy, & Dave each have one load of clothes to wash, dry, and fold. Time

EECS 470 Lecture 4. Pipelining & Hazards II. Winter Prof. Ronald Dreslinski h8p://

Performance Analysis of High-Rate Full-Diversity Space Time Frequency/Space Frequency Codes for Multiuser MIMO-OFDM

Chapter 16 - Instruction-Level Parallelism and Superscalar Processors

FROM ANALOG TO DIGITAL

Dimensions. Model Number. Electrical connection emitter. Features. Electrical connection receiver. Product information. Indicators/operating means

High Throughput Parallel-Pipeline 2-D DCT/IDCT Processor Chip

Solution of ECE 342 Test 2 S12

RISC Central Processing Unit

I2C Demonstration Board I 2 C-bus Protocol

Lecture 14: Datapath Functional Units Adders

ECE ANALOG COMMUNICATIONS - INVESTIGATION 7 INTRODUCTION TO AMPLITUDE MODULATION - PART II

The design of an improved matched filter in DSSS-GMSK system

Lecture 4. EITN Chapter 12, 13 Modulation and diversity. Antenna noise is usually given as a noise temperature!

Transcription:

3/8/5 Pipelined Processor Design. Sahu CSE, T Guwahai Please be updaed wih hp://ainga.iig.erne.in/~asahu/c/ Ouline Single Cycle Processor Design Muli cycle Processor Merging M and, emoving dder and dder Synchronized : nroducing Mux and egiser Pipelined Processor (Daa Pah) Pipeline : nroducion and Performance Cos Hazards and emoval nalyzing performance Componens Delay egiser dder LU MUX nsrucion ory Daa ory m Bi manipulaion or Overall :, SW :, M max LW :, M Beq :,, J :, max M nalyzing performance Componens Delay Example egiser ns dder ns LU 5ns MUX ns 3ns nsrucion 6ns ory Daa ory m 6ns Bi manipulaion ns NS SW LW Beq J Delay 7ns ns 3ns ns 6ns Problems wih single cycle design Slowes rucion pulls down he frequency esource uilizaion is poor There are some rucions which are impossible o be implemened in his manner bi more complex NST Floaing poinnst

3/8/5 Clock in single cycle design class Muli cycle Processor 7 Clock in muli cycle design class Muli cycle Timing nalysis ssume Cycle ime 6 ns Given a Program wih N nsr NS MX : 6%, LW %, SW %, BEQ=5%, J=5% =, LW=5, SW=, BEQ= or 3, J= Compared o Single Cycle Design Cycle ime 3ns (Max ime of all ypes) Time o execue Single Cycle : 3*N = 3 N Muli Cycle: No benefis in erm of ime N* 6*(.6*.*5.*.5*.5.5*) = N*6(...5.5)=N*6*3.975= 3.85N mproving resource uilizaion Merging M & : muli cycle design Can we eliminae wo ders? How o share (or reuse) a resource (say LU) in differen cycles? class Sore resuls in regisers. Of course, more muliplexing may be required! esources in his design:, LU, MEM. M ge used in s Cycle Dm ge used in h Cycle Why o keep wo separae ory?

3/8/5 emoving Firsdder: muli cycle design emoving nd dder: muli cycle design class class DDE ge used in s Cycle LU ge used in 3 rd Cycle Why o keep boh LU and DDE? DDE ge used in nd Cycle LU ge used in 3 rd Cycle GN Why o keep boh LU and DDE? Muli cycle Daa Pah dd eg & Mux : Mulicycle Daa Pah [5 ] 8 [3 8] a[3 ] [5 ] 8 a[3 ] [5 ] r rd [ 6] r [5 ] [5 ] 6 LU [5 ] [ 6] [5 ] D [5 ] r rd r 6 B [3 8] 3 LU es 6 Pipe Pipeline Pipelined Processor Pipeline 3

3/8/5 Car ssembly Pipeline Car ssembly Pipeline Single Cycle Poor esource Uilizaion, TC >= long nsr laency Muli Cycle TC > Longer Sage, Beer Uilizaion, Sill performance need o improve using pipeline When Decoding NS i you can Fech NS i Pipeline Pipeline Design nsrucion Pipeline Time or Cycle 3 5 6 7 8 NS NS NS 3 NS NS 5 ll he Sages work in parallel, No resource can be shared by sages Performance: rucion per Cycle Clock T nsrucion Pipeline Time To Execue N nsrucion : Toal Time= T*N*5 Toal Time= T*(N) No Pipeline Pipeline 3 Performance ssume Cycle ime 6 ns Given a Program wih N nsr NS MX : 6%, LW %, SW %, BEQ=5%, J=5% =, LW=5, SW=, BEQ= or 3, J= Pipeline Every nsrucion is 5 cycle bu pipelined manner deal Condiion Time o execue Single Cycle : 3*N = 3 N Muli Cycle: N*6*(.6*.*5.*.5*.5.5*) = N*6(...5.5)=N*6*3.975= 3.85N Pipelined : N cycles

3/8/5 Single cycle daapah (absrac) Pipelined daapah F D EX F/D D/EX EX/ / M r rd LU M r w rd rd LU Don share resources in Sages Pu back muliplexers n Muli Cycle Design LU used for and Offsedding Used for sdder and nd dder egiser FLE is used in nd and h Cycle n Pipeline Use Separae resource sdder, nd dder & LU egiser FLE is accesses s Half of nd Cycle and nd Half of h Cycle F D EX M F/D D/EX EX/ / W r r rd LU 7 Correcion for sage F D EX F/D D/EX EX/ / bsrac: dding conrol conrol ol M r r rd W LU M r r rd crl LU 5

3/8/5 Conrol signals wih delays Correcion for wrie signal conrol ol conrol ol M r r rd W crl LU M r r rd crl LU Correcion for wrie signal Degree of overlap conrol ol Serial Pipelined M r r rd crl LU Overlapped Super Pipelined Pipeline Deph Shallow Superscalar Pipeline: Penium Single Pipeline Deep Pipeline Pipeline Pipeline3 Fech 3 rucions Decode 3 rucions Execue 3 nsrucions 36 6

3/8/5 Difficulies in Pipeline Hazards in Pipelining esource conflics => Srucural hazards use of same resource in differen sages Daa dependencies => Daa hazards W (re afer wrie) W (wrie afer re) WW (wrie afer wrie) Procedural dependencies => Conrol hazards condiional and uncondiional branches, calls/reurns Srucural Hazards Srucural Hazards Caused by esource Conflics Use of a hardware resource in more han one cycle Differen sequences of resource usage by differen rucions B C B C B C D B C C B D Non pipelined muli cycle resources F D X X F D X X Pipeline wih Shared esources Used LU for : Mulicycle Daa Pah F/D D/EX EX/ / [5 ] 8 a[3 ] [5 ] [ 6] [5 ] D [5 ] r rd r 6 B [3 8] 3 LU es 7