Instructor: Randy H. Katz hap://inst.eecs.berkeley.edu/~cs61c/fa13. Fall Lecture #20. Warehouse Scale Computer
|
|
- Irma Ross
- 5 years ago
- Views:
Transcription
1 CS 61C: Geat Ideas in Compute Achitectue Contol and Pipelining Instucto: Randy H. Katz hap://inst.eecs.bekeley.edu/~cs61c/fa13 11/5/13 Fall Lectue #20 1 So0wae Paallel Requests Assigned to compute e.g., Seach Katz Paallel Theads Assigned to coe e.g., Lookup, Ads Paallel InstucVons >1 one Vme e.g., 5 pipelined instucvons Paallel Data >1 data one Vme e.g., Add of 4 pais of wods Hadwae descipvons All one Vme Pogamming Languages You Ae Hee! Haness Paallelism & Achieve High Pefomance Hadwae Today s Lectue Waehouse Scale Compute Coe Memoy Input/Output InstucVon Unit(s) Main Memoy Coe 2 Compute (Cache) Coe FuncVonal Unit(s) A 0 +B 0 A 1 +B 1 A 2 +B 2 A 3 +B 3 Smat Phone Logic Gates 1
2 Machine Intepeta4on Levels of RepesentaVon/ IntepetaVon High Level Language Pogam (e.g., C) Compile Assembly Language Pogam (e.g., MIPS) Assemble Machine Language Pogam (MIPS) Hadwae Achitectue DescipCon (e.g., block diagams) Achitectue Implementa4on Logic Cicuit DescipCon (Cicuit SchemaCc Diagams) temp = v[k]; v[k] = v[k+1]; v[k+1] = temp; lw $t0, 0($2) lw $t1, 4($2) sw $t1, 0($2) sw $t0, 4($2) Anything can be epesented as a numbe, i.e., data o instucvons ! 3 InstucVon Level Paallelism (ILP) Anothe paallelism fom to go with Request Level Paallelism and Data Level Paallelism RLP e.g., Waehouse Scale CompuVng DLP e.g., SIMD, Map- Reduce ILP e.g., Pipelined InstucBon ExecuBon 5 stage pipeline => 5 instucvons execuvng simultaneously, one at each pipeline stage 4 2
3 Pipelined ExecuVon Pipelined Datapath Agenda Stuctual and Data Hazads Contol Hazads 5 Pipelined ExecuVon Pipelined Datapath Agenda Stuctual and Data Hazads Contol Hazads 6 3
4 Review: Single- Cycle Pocesso Five steps to design a pocesso: 1. Analyze instucvon set à Pocesso datapath equiements Contol 2. Select set of datapath Memoy components & establish Datapath clock methodology 3. Assemble datapath meevng the equiements: e- examine fo pipelining 4. Analyze implementavon of each instucvon to detemine semng of contol points that effects the egiste tansfe. 5. Assemble the contol logic Fomulate Logic EquaVons Design Cicuits 7 Input Output Pipeline Analogy: Doing Laundy Ann, Bian, Cathy, Dave each have one load of clothes to wash, dy, fold, and put away Washe takes 30 minutes Dye takes 30 minutes Folde takes 30 minutes Stashe takes 30 minutes to put clothes into dawes A B C D 8 4
5 SequenVal Laundy 6 PM AM T a s k O d e A B C D Time SequenVal laundy takes 8 hous fo 4 loads 9 Pipelined Laundy 12 2 AM 6 PM T a s k O d e A B C D Pipelined laundy takes 3.5 hous fo 4 loads! Time 10 5
6 T a s k O d e 6 PM A B C D Pipelining Lessons (1/2) Time Pipelining doesn t help latency of single task, it helps thoughput of enve wokload MulVple tasks opeavng simultaneously using diffeent esouces PotenVal speedup = Numbe pipe stages (4 in this case) Time to fill pipeline and Vme to dain it educes speedup: 8 hous/3.5 hous o 2.3X v. potenval 4X in this example 11 T a s k O d e 6 PM A B C D Pipelining Lessons (2/2) Time Suppose new Washe takes 20 minutes, new Stashe takes 20 minutes. How much faste is pipeline? Pipeline ate limited by slowest pipeline stage Unbalanced lengths of pipe stages educes speedup 12 6
7 Pipelined ExecuVon Pipelined Datapath Agenda Stuctual and Data Hazads Contol Hazads 13 Review: RISC Design Pinciples A simple coe is a faste coe ReducVon in the numbe and complexity of instucvons in the ISA à simplifies pipelined implementavon Common RISC stategies: Fixed instucvon length, geneally a single wod (MIPS = 32b); Simplifies pocess of fetching instucvons fom memoy Simplified addessing modes; (MIPS just egiste + offset) Simplifies pocess of fetching opeands fom memoy Fewe and simple instucvons in the instucvon set; Simplifies pocess of execuvng instucvons Simplified memoy access: only load and stoe instucvons access memoy; Let the compile do it. Use a good compile to beak complex high- level language statements into a numbe of simple assembly language statements 14 7
8 Review: Single Cycle Datapath op s t immediate Data Memoy {R[s] + SignExt[imm16]} = R[t] RegDst= Rd RegW= busw 32 npc_sel= 1 clk Rs 5 5 Rw imm16 Rt 0 Ra Rb RegFile 16 clk Rt 5 ExtOp= Extende busa busb inst fetch unit Rs Rt Rd Imm16 zeo ct= MemtoReg= Sc= InstucVon<31:0> <21:25> = 32 Data In clk <16:20> MemW= 32 <11:15> WEn Ad <0:15> Data Memoy Steps in ExecuVng MIPS 1) IF: InstucVon Fetch, Incement PC 2) ID: InstucVon Decode, Read Registes 3) EX: ExecuVon Mem- ef: Calculate Addess Aith- log: Pefom OpeaVon 4) Mem: Load: Read Data fom Memoy Stoe: Wite Data to Memoy 5) WB: Wite Data Back to Registe 16 8
9 Redawn Single- Cycle Datapath PC instucvon memoy d s t egistes Data memoy +4 imm 1. InstucVon Fetch 2. Decode/ Registe Read 3. Execute 4. Memoy 5. Wite Back 17 Pipelined Datapath PC instucvon memoy d s t egistes Data memoy +4 imm 1. InstucVon Fetch 2. Decode/ Registe Read 3. Execute 4. Memoy 5. Wite Back Add egistes between stages Hold infomavon poduced in pevious cycle 5 stage pipeline; clock ate potenval 5X faste 18 9
10 Moe Detailed Pipeline Registes named fo adjacent stages, e.g., IF/ID 19 IF fo Load, Stoe, Highlight combinavonal logic components used + ight half of state logic on ead, le~ half on wite 20 10
11 ID fo Load, Stoe, 21 EX fo Load 22 11
12 MEM fo Load 23 WB fo Load Has Bug that was in 1 st edivon of textbook! Wong egiste numbe 24 12
13 Coected Datapath fo Load Coect egiste numbe 25 Pipelined ExecuVon RepesentaVon Time IF ID EX Mem WB IF ID EX Mem WB IF ID EX Mem WB IF ID EX Mem WB IF ID EX Mem WB IF ID EX Mem WB Evey instucvon must take same numbe of steps, also called pipeline stages, so some will go idle somevmes 26 13
14 Gaphical Pipeline Diagams PC instucvon memoy d s t egistes Data memoy +4 imm 1. InstucVon Fetch 2. Decode/ Registe Read 3. Execute 4. Memoy 5. Wite Back Use datapath figue below to epesent pipeline IF ID EX Mem WB 27 I n s t. O d e Gaphical Pipeline RepesentaVon (In Reg, ight half highlight ead, lev half wite) Time (clock cycles) Load Add Stoe Sub O I$ Reg I$ Reg I$ D$ Reg I$ 28 Reg D$ Reg I$ Reg D$ Reg Reg D$ Reg D$ Reg 14
15 Pipeline Pefomance Assume Vme fo stages is 100ps fo egiste ead o wite 200ps fo othe stages What is pipelined clock ate? Compae pipelined datapath with single- cycle datapath Inst Inst fetch Registe ead op Memoy access Registe wite Total time lw 200ps 100 ps 200ps 200ps 100 ps 800ps sw 200ps 100 ps 200ps 200ps 700ps R-fomat 200ps 100 ps 200ps 100 ps 600ps beq 200ps 100 ps 200ps 500ps 29 Student RouleAe? Pipeline Pefomance Single- cycle (T c = 800ps) Pipelined (T c = 200ps) 30 15
16 Pipeline Speedup If all stages ae balanced i.e., all take the same Vme Time between instucvons pipelined = Time between instucvons nonpipelined Numbe of stages If not balanced, speedup is less Speedup due to inceased thoughput Latency (Vme fo each instucvon) does not decease 31 Pipelined ExecuVon Pipelined Datapath Agenda Stuctual and Data Hazads Contol Hazads 32 16
17 Hazads SituaVons that pevent stavng the next logical instucvon in the next clock cycle 1. Stuctual hazads Requied esouce is busy (e.g., stashe is studying) 2. Data hazad Need to wait fo pevious instucvon to complete its data ead/wite (e.g., pai of socks in diffeent loads) 3. Contol hazad Deciding on contol acvon depends on pevious instucvon (e.g., how much detegent based on how clean pio load tuns out) Stuctual Hazads Conflict fo use of a esouce In MIPS pipeline with a single memoy Load/Stoe equies memoy access fo data InstucVon fetch would have to stall fo that cycle Causes a pipeline bubble Hence, pipelined datapaths equie sepaate instucvon/data memoies In eality, povide sepaate L1 instucvon cache and L1 data cache 34 17
18 I n s t. O d e 1. Stuctual Hazad #1: Single Memoy Time (clock cycles) Load Inst 1 Inst 2 Inst 3 Inst 4 I$ Read same memoy twice in same clock cycle Reg D$ Reg 35 I n s t. 1. Stuctual Hazad #2: Registes (1/2) sw Inst 1 Time (clock cycles) O d e Inst 2 Inst 3 Inst 4 I$ Reg D$ Reg Can we ead and wite to egistes simultaneously? 36 18
19 1. Stuctual Hazad #2: Registes (2/2) Two diffeent soluvons have been used: 1) RegFile access is VERY fast: takes less than half the Vme of stage Wite to Registes duing fist half of each clock cycle Read fom Registes duing second half of each clock cycle 2) Build RegFile with independent ead and wite pots Result: can pefom Read and Wite duing same clock cycle Data Hazads An instucvon depends on complevon of data access by a pevious instucvon add $s0, $t0, $t1 sub $t2, $s0, $t
20 Fowading (aka Bypassing) Use esult when it is computed Don t wait fo it to be stoed in a egiste Requies exta connecvons in the datapath 39 Coected Datapath fo Fowading? 40 20
21 Fowading Paths Chapte 4 The Pocesso 41 Load- Use Data Hazad Can t always avoid stalls by fowading If value not computed when needed Can t fowad backwad in Vme! 42 21
22 Stall/Bubble in the Pipeline Stall inseted hee Chapte 4 The Pocesso 43 Pipelining and ISA Design MIPS InstucVon Set designed fo pipelining All instucvons ae 32- bits Easie to fetch and decode in one cycle x86: 1- to 17- byte instucvons (x86 HW actually tanslates to intenal RISC instucvons!) Few and egula instucvon fomats, 2 souce egiste fields always in same place Can decode and ead egistes in one step Memoy opeands only in Loads and Stoes Can calculate addess 3 d stage, access memoy 4 th stage Alignment of memoy opeands Memoy access takes only one cycle 44 22
23 Why Isn t the DesVnaVon Registe Always in the Same Field in MIPS ISA? op s t d shamt funct Need to have 2 pat immediate if 2 souces and 1 desvnavon always in same place bits 26 5 bits 21 5 bits 16 5 bits 5 bits 6 bits 0 op s t immediate 6 bits 5 bits 5 bits 16 bits SPUR pocesso (A poject Dave PaAeson and Randy woked on togethe) Contol Hazads Banch detemines flow of contol Fetching next instucvon depends on banch outcome Pipeline can t always fetch coect instucvon SVll woking on ID stage of banch BEQ, BNE in MIPS pipeline Simple soluvon OpVon 1: Stall on evey banch unvl have new PC value Would add 2 bubbles/clock cycles fo evey Banch! (~ 20% of instucvons executed) 46 23
24 I n s t. O d e beq Inst 1 Inst 2 Inst 3 Inst 4 Stall => 2 Bubbles/Clocks Time (clock cycles) I$ Reg D$ Reg Whee do we do the compae fo the banch? Contol Hazad: Banching OpVmizaVon #1: Inset special banch compaato in Stage 2 As soon as instucvon is decoded (Opcode idenvfies it as a banch), immediately make a decision and set the new value of the PC Benefit: since banch is complete in Stage 2, only one unnecessay instucvon is fetched, so only one no- op is needed Side Note: means that banches ae idle in Stages 3, 4 and
25 Coected Datapath fo BEQ/BNE? 49 Student RouleAe? One Clock Cycle Stall Time (clock cycles) I n s t. O d e beq Inst 1 Inst 2 Inst 3 Inst 4 I$ Reg D$ Reg Banch compaato moved to Decode stage
26 Pipelined ExecuVon Pipelined Datapath Agenda Stuctual and Data Hazads Contol Hazads Contol Hazads OpVon 2: Pedict outcome of a banch, fix up if guess wong Must cancel all instucvons in pipeline that depended on guess that was wong Simplest hadwae if we pedict that all banches ae NOT taken Why? 52 26
27 3. Contol Hazad: Banching OpVon #3: Redefine banches Old definivon: if we take the banch, none of the instucvons a~e the banch get executed by accident New definivon: whethe o not we take the banch, the single instucvon immediately following the banch gets executed (the banch- delay slot) Delayed Banch means we always execute inst a0e banch This opvmizavon is used with MIPS Contol Hazad: Banching Notes on Banch- Delay Slot Wost- Case Scenaio: put a no- op in the banch- delay slot BeAe Case: place some instucvon peceding the banch in the banch- delay slot as long as the changed doesn t affect the logic of pogam Re- odeing instucvons is common way to speed up pogams Compile usually finds such an instucvon 50% of Vme Jumps also have a delay slot 54 27
28 Example: Nondelayed vs. Delayed Banch Nondelayed Banch Delayed Banch o $8, $9, $10 add $1, $2, $3 add $1, $2, $3 sub $4, $5, $6 beq $1, $4, Exit xo $10, $1, $11 sub $4, $5, $6 beq $1, $4, Exit o $8, $9, $10 xo $10, $1, $11 Exit: Exit: 55 Delayed Banch/Jump and MIPS ISA? Why does JAL put PC+8 in egiste 31? 56 28
29 Code Scheduling to Avoid Stalls Reode code to avoid use of load esult in the next instucvon C code fo A = B + E; C = B + F; stall stall lw $t1, 0($t0) lw $t2, 4($t0) add $t3, $t1, $t2 sw $t3, 12($t0) lw $t4, 8($t0) add $t5, $t1, $t4 sw $t5, 16($t0) 13 cycles lw $t1, 0($t0) lw $t2, 4($t0) lw $t4, 8($t0) add $t3, $t1, $t2 sw $t3, 12($t0) add $t5, $t1, $t4 sw $t5, 16($t0) 11 cycles 58 Pee InstucVon I. Thanks to pipelining, I have educed the time it took me to wash my one shit. II. Longe pipelines ae always a win (since less wok pe stage & a faste clock). A)(oange) I is Tue and II is Tue B)(geen) I is False and II is Tue C)(pink) I is Tue and II is False 59 29
30 And, in Conclusion, Pipelining impoves pefomance by inceasing instucvon thoughput: exploits ILP Executes mulvple instucvons in paallel Each instucvon has the same latency Key enable is placing egistes between pipeline stages Subject to hazads Stuctue, data, contol Stalls educe pefomance But ae equied to get coect esults Compile can aange code to avoid hazads and stalls Requies knowledge of the pipeline stuctue 61 30
CS 61C: Great Ideas in Computer Architecture Pipelining. Anything can be represented as a number, i.e., data or instrucvons
CS 61C: Geat Ideas in Compute Achitectue Pipelining 4/8/12 Instucto: David A. Pa@eson h@p://inst.eecs.bekeley.edu/~cs61c/sp12 Sping 2012 - - Lectue #21 1 Paallel Requests Assigned to compute e.g., Seach
More informationPipelining and ISA Design
Pipelined instuc.on Execu.on 1 Pipelining and ISA Design MIPS Instuc:on Set designed fo pipelining All instuc:ons ae 32- bits Easie to fetch and decode in one cycle x86: 1- to 17- byte instuc:ons (x86
More informationCS 61C: Great Ideas in Computer Architecture. Pipelining Hazards. Instructor: Senior Lecturer SOE Dan Garcia
CS 61C: Geat Ideas in Compute Achitectue Pipelining Hazads Instucto: Senio Lectue SOE Dan Gacia 1 Geat Idea #4: Paallelism So9wae Paallel Requests Assigned to compute e.g. seach Gacia Paallel Theads Assigned
More informationCS61C : Machine Structures
inst.eecs.bekeley.edu/~cs61c CS61C : Machine Stuctues Lectue 29 Intoduction to Pipelined Execution Lectue PSOE Dan Gacia www.cs.bekeley.edu/~ddgacia Bionic Eyes let blind see! Johns Hopkins eseaches have
More informationCS61C : Machine Structures
Election Data is now available Puple Ameica! inst.eecs.bekeley.edu/~cs61c CS61C : Machine Stuctues Lectue 31 Pipelined Execution, pat II 2004-11-10 Lectue PSOE Dan Gacia www.cs.bekeley.edu/~ddgacia The
More informationCS 110 Computer Architecture Lecture 11: Pipelining
CS 110 Computer Architecture Lecture 11: Pipelining Instructor: Sören Schwertfeger http://shtech.org/courses/ca/ School of Information Science and Technology SIST ShanghaiTech University Slides based on
More informationCMSC 611: Advanced Computer Architecture
CMSC 611: Advanced Compute Achitectue Pipelining Some mateial adapted fom Mohamed Younis, UMBC CMSC 611 Sp 2003 couse slides Some mateial adapted fom Hennessy & Patteson / 2003 Elsevie Science Pipeline
More information7/11/2012. Single Cycle (Review) CSE 2021: Computer Organization. Multi-Cycle Implementation. Single Cycle with Jump. Pipelining Analogy
CSE 2021: Computer Organization Single Cycle (Review) Lecture-10 CPU Design : Pipelining-1 Overview, Datapath and control Shakil M. Khan CSE-2021 July-12-2012 2 Single Cycle with Jump Multi-Cycle Implementation
More informationChapter 4. Pipelining Analogy. The Processor. Pipelined laundry: overlapping execution. Parallelism improves performance. Four loads: Non-stop:
Chapter 4 The Processor Part II Pipelining Analogy Pipelined laundry: overlapping execution Parallelism improves performance Four loads: Speedup = 8/3.5 = 2.3 Non-stop: Speedup p = 2n/(0.5n + 1.5) 4 =
More informationLecture Topics. Announcements. Today: Pipelined Processors (P&H ) Next: continued. Milestone #4 (due 2/23) Milestone #5 (due 3/2)
Lecture Topics Today: Pipelined Processors (P&H 4.5-4.10) Next: continued 1 Announcements Milestone #4 (due 2/23) Milestone #5 (due 3/2) 2 1 ISA Implementations Three different strategies: single-cycle
More informationPipelining A B C D. Readings: Example: Doing the laundry. Ann, Brian, Cathy, & Dave. each have one load of clothes to wash, dry, and fold
Pipelining Readings: 4.5-4.8 Example: Doing the laundry Ann, Brian, Cathy, & Dave A B C D each have one load of clothes to wash, dry, and fold Washer takes 30 minutes Dryer takes 40 minutes Folder takes
More informationA B C D. Ann, Brian, Cathy, & Dave each have one load of clothes to wash, dry, and fold. Time
Pipelining Readings: 4.5-4.8 Example: Doing the laundry A B C D Ann, Brian, Cathy, & Dave each have one load of clothes to wash, dry, and fold Washer takes 30 minutes Dryer takes 40 minutes Folder takes
More informationPipelined Processor Design
Pipelined Processor Design COE 38 Computer Architecture Prof. Muhamed Mudawar Computer Engineering Department King Fahd University of Petroleum and Minerals Presentation Outline Pipelining versus Serial
More informationCMSC 611: Advanced Computer Architecture
CMSC 611: Advanced Computer Architecture Pipelining Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted from Hennessy & Patterson / 2003 Elsevier Science
More informationLecture 4: Introduction to Pipelining
Lecture 4: Introduction to Pipelining Pipelining Laundry Example Ann, Brian, Cathy, Dave each have one load of clothes to wash, dry, and fold Washer takes 30 minutes A B C D Dryer takes 40 minutes Folder
More informationLECTURE 8. Pipelining: Datapath and Control
LECTURE 8 Pipelining: Datapath and Control PIPELINED DATAPATH As with the single-cycle and multi-cycle implementations, we will start by looking at the datapath for pipelining. We already know that pipelining
More informationECE473 Computer Architecture and Organization. Pipeline: Introduction
Computer Architecture and Organization Pipeline: Introduction Lecturer: Prof. Yifeng Zhu Fall, 2015 Portions of these slides are derived from: Dave Patterson UCB Lec 11.1 The Laundry Analogy Student A,
More informationYou are Here! Processor Design Process. Agenda. Agenda 10/25/12. CS 61C: Great Ideas in Computer Architecture Single Cycle MIPS CPU Part II
/26/2 CS 6C: Great Ideas in Computer Architecture Single Cycle MIPS CPU Part II /25/2 ructors: Krste Asanovic, Randy H. Katz hcp://inst.eecs.berkeley.edu/~cs6c/fa2 Fall 22 - - Lecture #26 Parallel Requests
More informationEECE 321: Computer Organiza5on
EECE 321: Computer Organiza5on Mohammad M. Mansour Dept. of Electrical and Compute Engineering American University of Beirut Lecture 21: Pipelining Processor Pipelining Same principles can be applied to
More informationCS420/520 Computer Architecture I
CS42/52 Computer rchitecture I Designing a Pipeline Processor (C4: ppendix ) Dr. Xiaobo Zhou Department of Computer Science CS42/52 pipeline. UC. Colorado Springs dapted from UCB97 & UCB3 Branch Jump Recap:
More information7/19/2012. IF for Load (Review) CSE 2021: Computer Organization. EX for Load (Review) ID for Load (Review) WB for Load (Review) MEM for Load (Review)
CSE 2021: Computer Organization IF for Load (Review) Lecture-11 CPU Design : Pipelining-2 Review, Hazards Shakil M. Khan CSE-2021 July-19-2012 2 ID for Load (Review) EX for Load (Review) CSE-2021 July-19-2012
More informationCSE 2021: Computer Organization
CSE 2021: Computer Organization Lecture-11 CPU Design : Pipelining-2 Review, Hazards Shakil M. Khan IF for Load (Review) CSE-2021 July-14-2011 2 ID for Load (Review) CSE-2021 July-14-2011 3 EX for Load
More informationInstruction Level Parallelism. Data Dependence Static Scheduling
Instruction Level Parallelism Data Dependence Static Scheduling Basic Block A straight line code sequence with no branches in except to the entry and no branches out except at the exit Loop: L.D ADD.D
More informationRISC Design: Pipelining
RISC Design: Pipelining Virendra Singh Associate Professor Computer Architecture and Dependable Systems Lab Department of Electrical Engineering Indian Institute of Technology Bombay http://www.ee.iitb.ac.in/~viren/
More informationIF ID EX MEM WB 400 ps 225 ps 350 ps 450 ps 300 ps
CSE 30321 Computer Architecture I Fall 2011 Homework 06 Pipelined Processors 75 points Assigned: November 1, 2011 Due: November 8, 2011 PLEASE DO THE ASSIGNMENT ON THIS HANDOUT!!! Problem 1: (15 points)
More informationCS429: Computer Organization and Architecture
CS429: Computer Organization and Architecture Dr. Bill Young Department of Computer Sciences University of Texas at Austin Last updated: November 8, 2017 at 09:27 CS429 Slideset 14: 1 Overview What s wrong
More informationSuggested Readings! Lecture 12" Introduction to Pipelining! Example: We have to build x cars...! ...Each car takes 6 steps to build...! ! Readings!
1! CSE 30321 Lecture 12 Introduction to Pipelining! CSE 30321 Lecture 12 Introduction to Pipelining! 2! Suggested Readings!! Readings!! H&P: Chapter 4.5-4.7!! (Over the next 3-4 lectures)! Lecture 12"
More informationComputer Architecture
Computer Architecture An Introduction Virendra Singh Associate Professor Computer Architecture and Dependable Systems Lab Department of Electrical Engineering Indian Institute of Technology Bombay http://www.ee.iitb.ac.in/~viren/
More informationIF ID EX MEM WB 400 ps 225 ps 350 ps 450 ps 300 ps
CSE 30321 Computer Architecture I Fall 2010 Homework 06 Pipelined Processors 85 points Assigned: November 2, 2010 Due: November 9, 2010 PLEASE DO THE ASSIGNMENT ON THIS HANDOUT!!! Problem 1: (25 points)
More informationAsanovic/Devadas Spring Pipeline Hazards. Krste Asanovic Laboratory for Computer Science M.I.T.
Pipeline Hazards Krste Asanovic Laboratory for Computer Science M.I.T. Pipelined DLX Datapath without interlocks and jumps 31 0x4 RegDst RegWrite inst Inst rs1 rs2 rd1 ws wd rd2 GPRs Imm Ext A B OpSel
More informationAn Efficient Control Approach for DC-DC Buck-Boost Converter
2016 Published in 4th Intenational Symposium on Innovative Technologies in Engineeing and Science 3-5 Novembe 2016 (ISITES2016 Alanya/Antalya - Tukey) An Efficient Contol Appoach fo DC-DC Buck-Boost Convete
More informationSelected Solutions to Problem-Set #3 COE 608: Computer Organization and Architecture Single Cycle Datapath and Control
Selected Solutions to Problem-Set #3 COE 608: Computer Organization and Architecture Single Cycle Datapath and Control 4.1. Done in the class 4.2. Try it yourself Q4.3. 4.3.1 a. Logic Only b. Logic Only
More informationDesign and Implementation of 4 - QAM VLSI Architecture for OFDM Communication
Design and Implementation of 4 - QAM VLSI Achitectue fo OFDM Communication R. Achitha 1, S. Bhagyalakshmi 2, V. Jaya Suthi 3, D. T. Menakadevi 4 U.G. Students, Depatment of ECE, Adhiyamaan College of Engineeing,
More informationN2-1. The Voltage Source. V = ε ri. The Current Source
DC Cicuit nalysis The simplest cicuits to undestand and analyze ae those that cay diect cuent (DC). n this note we continue ou study of DC cicuits with the topics of DC voltage and cuent souces, the idea
More informationStatement of Works Data Template Version: 4.0 Date:
Statement of Woks Data Template Vesion: 4.0 Date: 16.08.17 This Statement of Woks (SoW) Data Template is to be completed by Distibution Netwok Opeatos (DNOs) in ode to povide National Gid the equied data
More informationRISC Central Processing Unit
RISC Central Processing Unit Lan-Da Van ( 范倫達 ), Ph. D. Department of Computer Science National Chiao Tung University Taiwan, R.O.C. Spring, 2014 ldvan@cs.nctu.edu.tw http://www.cs.nctu.edu.tw/~ldvan/
More informationABSTRACTT FFT FFT-' Proc. of SPIE Vol U-1
Phase econstuction stategies in phase-locking system based on multi-apetue wave font sensos P.A. Semenov, S.D. Pol skikh Shvabe-Reseach, Moscow, Russian Fedeation; e-mail: pite@bk.u ABSTRACTT System of
More informationExperimental Investigation of Influence on Non-destructive Testing by Form of Eddy Current Sensor Probe
Expeimental Investigation of Influence on Non-destuctive Testing by Fom of Eddy Cuent Senso Pobe Fengyun Xie * and Jihui Zhou School of Mechanical and Electonical Engineeing, East China Jiaotong Univesity,
More informationVLSI Implementation of Low Complexity MIMO Detection Algorithms
, Impact Facto :.643 eissn : 349-000 & pissn : 394-4544 Intenational Jounal of Reseach and Applications (Ap-Jun 015 Tansactions) (6): 309-313 Intenational Confeence on Emeging Tends in Electonics & Telecommunications
More informationEECS150 - Digital Design Lecture 2 - Synchronous Digital Systems Review Part 1. Outline
EECS5 - Digital Design Lecture 2 - Synchronous Digital Systems Review Part January 2, 2 John Wawrzynek Electrical Engineering and Computer Sciences University of California, Berkeley http://www-inst.eecs.berkeley.edu/~cs5
More informationInvestigation. Name: a About how long would the threaded rod need to be if the jack is to be stored with
Think Unit bout 6 This Lesson Situation 1 Investigation 1 Name: Think about the design and function of this automobile jack. Use the uto Jack custom tool to test ou ideas. a bout how long would the theaded
More informationProposal of Circuit Breaker Type Disconnector for Surge Protective Device
Poposal of Cicuit Beake Type Disconnecto fo Suge Potective Device MASAO SHIBAYAMA, HITOSHI KIJIMA Polytechnic Univesity 2-32-1 Ogawanishi, Kodaia, Tokyo, 187-0035 JAPAN hkijima@uitec.ac.jp Abstact: - A
More informationOPTIMUM MEDIUM ACCESS TECHNIQUE FOR NEXT GENERATION WIRELESS SYSTEMS
9 th Febuay. Vol. 3 No. 5 - JATIT & LLS. All ights eseved. ISSN: 99-5 www.jatit.og E-ISSN: 7-395 OPTIMUM MEDIUM ACCESS TECHNIQUE FOR NEXT GENERATION WIRELESS SYSTEMS N.AMUTHA PRABHA, V.MANIANDAN VIT UNIVERSITY,
More informationwhere and are polynomials with real coefficients and of degrees m and n, respectively. Assume that and have no zero on axis.
function whee is an unknown constant epesents fo the un-modeled dynamics The pape investigates the position contol of electical moto dives that can be configued as stuctue of Fig 1 This poblem is fomulated
More informationLecture 2: Review of Pipelines
The Instction Set: a Citical Inteface softwae Lecte 2: Review of Pipelines instction set hadwae AP Sp. 98 UCB 1 Lec 1.2 Instction Set Achitecte... the attibtes of a [compting] system as seen by the pogamme,
More informationPipelined Beta. Handouts: Lecture Slides. Where are the registers? Spring /10/01. L16 Pipelined Beta 1
Pipelined Beta Where are the registers? Handouts: Lecture Slides L16 Pipelined Beta 1 Increasing CPU Performance MIPS = Freq CPI MIPS = Millions of Instructions/Second Freq = Clock Frequency, MHz CPI =
More informationECEN326: Electronic Circuits Fall 2017
ECEN36: Electonic Cicuits Fall 07 Lectue 4: Cascode Stages and Cuent Mios Sam Palemo Analog & Mixed-Signal Cente Texas A&M Univesity Announcements HW3 due 0/4 Exam 0/9 9:0-0:0 (0 exta minutes) Closed book
More informationSliding Mode Control for Half-Wave Zero Current Switching Quasi-Resonant Buck Converter
Sliding Mode Contol fo Half-Wave Zeo Cuent Switching Quasi-Resonant Buck Convete M. Ahmed, Student membe, IEEE, M. Kuisma, P. Silventoinen Abstact This pape focuses on the pactical implementation of sliding
More informationAnalysis and Implementation of LLC Burst Mode for Light Load Efficiency Improvement
Analysis and Implementation of LLC Bust Mode fo Light Load Efficiency Impovement Bin Wang, Xiaoni Xin, Stone Wu, Hongyang Wu, Jianping Ying Delta Powe Electonics Cente 238 Minxia Road, Caolu Industy Zone,
More informationSTACK DECODING OF LINEAR BLOCK CODES FOR DISCRETE MEMORYLESS CHANNEL USING TREE DIAGRAM
H PRASHANTHA KUMAR et. al.: STACK DECODING OF LINEAR BLOCK CODES FOR DISCRETE MEMORYLESS CHANNEL USING TREE DIAGRAM DOI:.297/ijct.22.69 STACK DECODING OF LINEAR BLOCK CODES FOR DISCRETE MEMORYLESS CHANNEL
More informationChapter 9 Cascode Stages and Current Mirrors
Chapte 9 Cascode Stages and Cuent Mios 9. Cascode Stage 9. Cuent Mios CH 9 Cascode Stages and Cuent Mios Boosted Output Impedances S O S m out E O E m out g g Bipola Cascode Stage [ g ( )] out m O O O
More informationWeek 5. Lecture Quiz 1. Forces of Friction, cont. Forces of Friction. Forces of Friction, final. Static Friction
Lectue Quiz 1 Week 5 Fiction (Chapte 5, section 8) & Cicula Motion (Chapte 6, sections 1-) You hae a machine which can acceleate pucks on fictionless ice. Stating fom est, the puck taels a distance x in
More informationMinimizing Ringing and Crosstalk
Minimizing Ringing and Cosstalk By Glen Dash, Ampyx LLC, GlenDash at alum.mit.edu Copyight 1998, 26 Ampyx LLC When viewed on a schematic, a wie is just a wie. Howeve, when isetimes shink to a few nanoseconds
More informationEfficient Power Control for Broadcast in Wireless Communication Systems
Efficient Powe Contol fo Boadcast in Wieless Communication Systems A. T. Chonopoulos Compute Science Depatment Univesity of Texas at San Antonio San Antonio, TX Email:atc@cs.utsa.edu P. Cotae Depatment
More information6.S084 Tutorial Problems L19 Control Hazards in Pipelined Processors
6.S084 Tutorial Problems L19 Control Hazards in Pipelined Processors Options for dealing with data and control hazards: stall, bypass, speculate 6.S084 Worksheet - 1 of 10 - L19 Control Hazards in Pipelined
More informationConfigurable M-factor VLSI DVB-S2 LDPC decoder architecture with optimized memory tiling design
RESEARCH Open Access Configuable M-facto VLSI DVB-S2 LDPC decode achitectue with optimized memoy tiling design Gabiel Falcao 1,2*, Maco Gomes 1,2, Vito Silva 1,2, Leonel Sousa 3,4 and Joao Cacheia 2 Abstact
More informationTHE UNIVERSITY OF NEW SOUTH WALES. School of Electrical Engineering & Telecommunications
THE UNIESITY OF NEW SOUTH WAES School of Electical Engineeing & Telecommunications EE97 POWE EETONIS FO ENEWABE AND DISTIBUTED GENEATION EXAMINATION Session (Supplementay Exam) TIME AOWED: 3 hous TOTA
More informationA multichannel Satellite Scheduling Algorithm
A multichannel Satellite Scheduling Algoithm J.S. Gilmoe and R. Wolhute Depatment of Electonic Engineeing Univesity of Stellenbosch 7600 Stellenbosch, South Afica Email: jgilmoe@dsp.sun.ac.za, wolhute@sun.ac.za
More information10! !. 3. Find the probability that a five-card poker hand (i.e. 5 cards out of a 52-card deck) will be:
MATH 0(001 Fall 2018 Homewok 2 Solutions Please infom you instucto if you find any eos in the solutions 1 Suppose that thee ae duck huntes, each with a pefect shot A flock of ducks fly ove, and each hunte
More informationA New Buck-Boost DC/DC Converter of High Efficiency by Soft Switching Technique
A New Buck-Boost D/D onvete of High Efficiency by Soft Switching Technique Dong-Kul Kwak, Seung-Ho Lee, and Do-Young Jung Pofessional Gaduate School of Disaste Pevention, Kangwon National Univesity, 45-711,
More informationAnalysis of Occurrence of Digit 0 in Natural Numbers Less Than 10 n
meican Intenational Jounal of Reseach in Fomal, pplied & Natual Sciences vailable online at http://www.iasi.net ISSN (Pint): 2328-3777, ISSN (Online): 2328-3785, ISSN (CD-ROM): 2328-3793 IJRFNS is a efeeed,
More informationE /11/2018 AA
B-3 CU 1 CU 2 CU 3 EVISIONS BY GFI,WP GFI AHU 1 AHU 2 AHU 3 GFI,WP MECH. WOMEN'S 6 MEN'S 7 OFFICE 3 8 TEACE 14 OFFICE 4 9 TC 10 PNL 'A' PNL 'B' A-31 GFI A-33 F A-30 A-28 OFFICE 5 11 GFI A-35 OFFICE 2 5
More informationDiscussion #7 Example Problem This problem illustrates how Fourier series are helpful tools for analyzing electronic circuits. Often in electronic
Discussion #7 Example Poblem This poblem illustates how Fouie seies ae helpful tools fo analyzing electonic cicuits. Often in electonic cicuits we need sinusoids of vaious fequencies But we may aleady
More informationAssignment 0/0 2 /0 8 /0 16 Version: 3.2a Last Updated: 9/20/ :29 PM Binary Ones Comp Twos Comp
* Dynamic Memoy *Big O Notation*Stacks *Exteme Pogamming*Selection Sot*Insetion Sot*Watefall Model Sting*Aays*AayList*Client Seve*Atificial Intelligence*Inheitance*Files*Video Games*Shot cicuit evaluation*
More informationECE 2300 Digital Logic & Computer Organization. More Pipelined Microprocessor
ECE 2300 Digital ogic & Computer Organization Spring 2018 ore Pipelined icroprocessor ecture 18: 1 nnouncements No instructor office hour today Rescheduled to onday pril 16, 4:00-5:30pm Prelim 2 review
More informationLecture 23. OUTLINE BJT Differential Amplifiers (cont d) Reading: Chapter
Lectue 23 OUTLINE BJT Diffeential Amplifies (cont d) ascode diffeential amplifies ommon mode ejection Diffeential pai with active load eading: hapte 0.4 0.6. EE05 Sping 2008 Lectue 23, Slide Pof. Wu, U
More informationDesign and Characterization of Conformal Microstrip Antennas Integrated into 3D Orthogonal Woven Fabrics
Design and Chaacteization of Confomal Micostip Antennas Integated into 3D Othogonal Woven Fabics Xin Wang 1, Lan Yao 1, Fujun Xu 2, Dongchun Zhou 1, Yiping Qiu 1 1 College of Textiles, Donghua Univesity,
More informationComputer Hardware. Pipeline
Computer Hardware Pipeline Conventional Datapath 2.4 ns is required to perform a single operation (i.e. 416.7 MHz). Register file MUX B 0.6 ns Clock 0.6 ns 0.2 ns Function unit 0.8 ns MUX D 0.2 ns c. Production
More informationOptimization of the law of variation of shunt regulator impedance for Proximity Contactless Smart Card Applications to reduce the loading effect.
Optimization of the law of vaiation of shunt egulato impedance fo Poximity Contactless Smat Cad Applications to educe the loading effect. Catheine Maechal, Dominique Paet. Laboatoie LIT ESIGETEL, ue du
More informationIEEE Broadband Wireless Access Working Group < Modifications to the Feedback Methodologies in UL Sounding
Poject Title Date Submitted IEEE 802.16 Boadband Wieless Access Woking Goup Modifications to the Feedback Methodologies in UL Sounding 2008-01-14 Souce(s) Fed Vook, Jeff Zhuang,
More informationHYBRID FUZZY PD CONTROL OF TEMPERATURE OF COLD STORAGE WITH PLC
Jounal of Theoetical and Applied Infomation Technology 28 th Febuay 2013. Vol. 48 No.3 2005-2013 JATIT & LLS. All ights eseved. ISSN: 1992-8645 www.jatit.og E-ISSN: 1817-3195 HYBRID FUZZY PD CONTROL OF
More informationINCREMENTAL REDUNDANCY (IR) SCHEMES FOR W-CDMA HS-DSCH
ICREMETAL REDUDACY (IR) SCHEMES FOR W-CDMA HS-DSCH Amitava Ghosh 1, Kenneth Stewat, Rapeepat Ratasuk 1, Eoin Buckley, and Raa Bachu 1 Advanced Radio Technology, GTSS, Motoola, Alington Heights, IL, USA
More informationGRADE 6 FLORIDA. Division WORKSHEETS
GRADE 6 FLORIDA Division WORKSHEETS Mental division stategies invese opeations As we know, multiplication and division ae invese opeations. 8 9 = 7 This means they do the evese of each othe: 7 9 = 8 We
More informationCSE502: Computer Architecture CSE 502: Computer Architecture
CSE 502: Computer Architecture Out-of-Order Execution and Register Rename In Search of Parallelism rivial Parallelism is limited What is trivial parallelism? In-order: sequential instructions do not have
More informationShort-Circuit Fault Protection Strategy of Parallel Three-phase Inverters
Shot-Cicuit Fault Potection Stategy of Paallel Thee-phase Invetes Hongliang Wang, Membe, IEEE, Xuejun Pei, Membe, IEEE, Yu Chen, Membe, IEEE,Yong Kang College of Electical and Electonics Engineeing Huazhong
More informationSpectrum Sharing between Public Safety and Commercial Users in 4G-LTE
1 Spectum Shaing between Public Safety and Commecial Uses in 4G-LTE Haya Shajaiah, Ahmed Abdel-Hadi and Chales Clancy Badley Depatment of Electical and Compute Engineeing Viginia Tech, Alington, VA, 22203,
More informationDevelopment of Corona Ozonizer Using High Voltage Controlling of Produce Ozone Gas for Cleaning in Cage
Moden Envionmental Science and Engineeing (ISSN 333-58) July 07, Volume 3, No. 7, pp. 505-509 Doi: 0.534/mese(333-58)/07.03.07/0 Academic Sta Publishing Company, 07 www.academicsta.us Development of Coona
More informationSurface Areas of Cylinders ACTIVITY: Finding Surface Area. ACTIVITY: Finding Area. How can you find the surface area of. a cylinder?
9.3 Suface Aeas of Cylindes a cylinde? How can you find the suface aea of A cylinde is a solid that has two paallel, identical cicula s. lateal suface h 1 ACTIVITY: Finding Aea Wok with a patne. Use a
More informationOptimal Design of Smart Mobile Terminal Antennas for Wireless Communication and Computing Systems
Optimal Design of Smat Mobile Teminal Antennas fo Wieless Communication and Computing Systems Autho Lu, Junwei, Yang, Shiyou Published 2007 Confeence Title 2007 4th Intenational Symposium on Electomagnetic
More informationSynopsis of Technical Report: Designing and Specifying Aspheres for Manufacturability By Jay Kumler
OPTI 51 Synopsis (Gad Requiement #1) G. Desoches Synopsis of Technical Repot: Designing and Specifying Asphees fo Manufactuability By Jay Kumle Novembe 1, 007 Reviewed by: Gead Desoches Abstact Since asphees
More informationCSE502: Computer Architecture CSE 502: Computer Architecture
CSE 502: Computer Architecture Out-of-Order Execution and Register Rename In Search of Parallelism rivial Parallelism is limited What is trivial parallelism? In-order: sequential instructions do not have
More informationAn Ultra Low Power Segmented Digital-to-Analog Converter
An Ulta Low Powe Segmented Digital-to-Analog onvete Manoj Kuma Univesity Institute of Engineeing and Technology, Mahashi Dayanand Univesity, Rohtak-4, Hayana, India. Raj Kuma Pofesso and Diecto, Mata Raj
More informationInternational Journal of Advance Engineering and Research Development. Implementation of Vector Oriented Control for Induction Motor Using DS1104
Scientific Jounal of Impact Facto (SJIF): 4.72 Intenational Jounal of Advance Engineeing and Reseach Development Volume 4, Issue 1, Januay -2017 e-issn (O): 2348-4470 p-issn (P): 2348-6406 Implementation
More informationComputer Science 246. Advanced Computer Architecture. Spring 2010 Harvard University. Instructor: Prof. David Brooks
Advanced Computer Architecture Spring 2010 Harvard University Instructor: Prof. dbrooks@eecs.harvard.edu Lecture Outline Instruction-Level Parallelism Scoreboarding (A.8) Instruction Level Parallelism
More informationNovel Analytic Technique for PID and PIDA Controller Design. Seul Jung and Richard C. Dorf. Department of Electrical and Computer Engineering
Novel Analytic Technique fo PID and PIDA Contolle Deign Seul Jung and Richad C. Dof Depatment of Electical and Compute Engineeing U niveity of Calif onia; Davi Davi, CA 9566 e-mail:jung@ece.ucdavi.edu,
More informationLow-Complexity Time-Domain SNR Estimation for OFDM Systems
Low-Complexity Time-Domain SR Estimation fo OFDM Systems A. jaz, A.B. Awoseyila and B.G. Evans A low-complexity SR estimation algoithm fo OFDM systems in fequency-selective fading channels is poposed.
More informationMultiagent Reinforcement Learning Dynamic Spectrum Access in Cognitive Radios
Sensos & Tansduces 204 by IFSA Publishing, S L http://wwwsensospotalcom Multiagent Reinfocement Leaning Dynamic Spectum Access in Cognitive Radios Wu Chun, 2 Yin Mingyong, 2 Ma Shaoliang, Jiang Hong School
More informationBLACKBOARD SYSTEM AND TOP-DOWN PROCESSING FOR THE TRANSCRIPTION OF SIMPLE POLYPHONIC MUSIC. Juan Pablo Bello and Mark Sandler
BLACKBOARD SYSTEM AND TOP-DOWN PROCESSING FOR THE TRANSCRIPTION OF SIMPLE POLYPHONIC MUSIC Juan Pablo Bello and Mak Sandle Depatment of Electonic Engineeing, King s College London, Stand, London WC2R 2LS,
More information6.1 Reciprocal, Quotient, and Pythagorean Identities
Chapte 6 Tigonometic Identities 1 6.1 Recipocal, Quotient, and Pthagoean Identities Wam-up Wite each epession with a common denominato. Detemine the estictions. a c a a) b d b) b c d c) a 1 c b c b a Definition
More information1 Performance and Cost
Pefomance and Cost Analysis and Reseach of Ai-Cooled Using Small Diamete Coppe Tubes Wu Yang, Li Changsheng and Deng Bin Abstact Replacing coppe tubes with aluminum tubes and using coppe tubes with smalle
More informationAn Improved Implementation of Activity Based Costing Using Wireless Mesh Networks with MIMO Channels
Poceedings of the 10th WSEAS Intenational Confenence on APPLIED MATHEMATICS, Dallas, Texas, USA, Novembe 1-3, 2006 262 An Impoved Implementation of Activity Based Costing Using Wieless Mesh Netwoks with
More informationPERFORMANCE OF TOA ESTIMATION TECHNIQUES IN INDOOR MULTIPATH CHANNELS
PERFORMANCE OF TOA ESTIMATION TECHNIQUES IN INDOOR MULTIPATH CHANNELS Xinong Li, Kaveh Pahlavan, and Jacques Beneat Cente fo Wiele Infomation Netwo Studies Electical and Compute Engineeing Depatment, Woceste
More informationFigure Geometry for Computing the Antenna Parameters.
Spheical Coodinate Systems Definitions Figue 1.2.1 Geomety fo Computing the Antenna Paametes. Antenna Radiation Patten: The distibution of adiated enegy fom an antenna ove a suface of constant adius centeed
More informationOptimized Fuzzy Controller Design to Stabilize Voltage and Frequency Amplitude in a Wind Turbine Based on Induction Generator Using Firefly Algorithm
Intenational Reseach Jounal of Management Sciences. Vol., 3 (3), 12-117, 215 Available online at http://www.ijmsjounal.com ISSN 2147-964x 215 Optimized Fuzzy Contolle Design to Stabilize Voltage and Fequency
More informationFigure 1-1 Sample Antenna Pattern
1.0 ANTENNAS 1.1 INTRODUCTION In EE 619 we discussed antennas fom the view point of antenna apetue, beam width and gain, and how they elate. Moe specifically, we dealt with the equations and 4 Ae G (1-1)
More informationDynamic Scheduling I
basic pipeline started with single, in-order issue, single-cycle operations have extended this basic pipeline with multi-cycle operations multiple issue (superscalar) now: dynamic scheduling (out-of-order
More informationAudio Engineering Society. Convention Paper. Presented at the 120th Convention 2006 May Paris, France
Audio Engineeing Society Convention Pape Pesented at the 120th Convention 2006 May 20 23 Pais, Fance This convention pape has been epoduced fom the autho's advance manuscipt, without editing, coections,
More information77 GHz ACC Radar Simulation Platform
77 GHz ACC Rada Simulation Platfom Camilla Känfelt, Alain Péden, Ali Bazzi, Ghayath El Haj Shhadé, Mohamad Abbas, Thiey Chonavel and Fantz Bodeeau Abstact The development of a system simulation platfom
More informationDistributive Radiation Characterization Based on the PEEC Method
Title Distibutive Radiation haacteization Based on the PEE Method Autho(s) ao, Y; Jiang, L; Ruehli, AE itation The IEEE Intenational Symposium on Electomagnetic ompatibility (EM), Raleigh, Noth aolina,
More informationOptimal Strategies in Jamming Resistant Uncoordinated Frequency Hopping Systems. Bingwen Zhang
Optimal Stategies in Jamming Resistant Uncoodinated Fequency Hopping Systems by Bingwen Zhang A Thesis Submitted to the Faculty of the WORCESTER POLYTECHNIC INSTITUTE in patial fulfillment of the equiements
More information