Subword Permutation Instructions for Two-Dimensional Multimedia Processing in MicroSIMD Architectures

Ruby B. Lee
Princeton University

Abstract

MicroSIMD architectures incorporating subword parallelism are very efficient for application-specific media processors as well as for fast multimedia information processing in general-purpose processors. This paper addresses the unsolved problem of the need to permute the subwords packed in registers for maximum parallelism performance, especially for two-dimensional (2-D) multimedia algorithms. We propose a new systematic approach for identifying the fundamental data rearrangement needs in current and future 2-D pixel processing programs based on the hierarchical decomposition of frames and objects into atomic 2-D structures. We define new subword permutation instructions, Check, Excheck, Exchange, and Permset, that achieve these data rearrangements across multiple registers. We also define an alphabet of subword permutation primitives, including these new instructions and the Mix instruction defined for PA-RISC MAX-2 and IA-64, which supports the data rearrangement needs of 2-D frames and objects. We show the sufficiency and efficiency of this alphabet for achieving all possible permutations of hierarchical 2-D blocks.

1. Introduction

Multimedia information processing can be considered an increasing part of the general-purpose workload or a special-purpose application area. In this paper, we consider new instructions for accelerating multimedia processing in any programmable processor, whether general-purpose or application-specific. The focus is on simple, single-cycle instructions, which can be used to construct any type of permutation needed in two-dimensional (2-D) multimedia processing.

Multimedia extensions have been added to general-purpose processors to accelerate the processing of different media types [1-7, 15, 16]. The types of application-specific processors we target in this paper are designed to execute various multimedia programs, rather than just one. They include digital signal processors [8], video signal processors [9, 10], and mediaprocessors [11, 12].

Subword parallelism [1, 4] is now widely deployed by multimedia instructions in microprocessor architectures [1-7, 15, 16] and in media processors [12] to accelerate the processing of lower-precision data, like 16-bit audio samples or 8-bit pixel components. We also call this microSIMD architecture [13], since it applies SIMD (Single Instruction Multiple Data) parallel processor techniques [14] within a single processor. A subword-parallel (or microSIMD) instruction performs the same operation in parallel on multiple pairs of subwords packed into two registers, which are typically 32 to 128 bits wide in today's microprocessors and mediaprocessors (see Figure 1). For example, a 64-bit word-oriented datapath can be partitioned into eight 8-bit subwords, or four 16-bit subwords, or two 32-bit subwords. Substantial performance improvements have been realized using subword-parallel instructions, at very low cost compared to other forms of parallelism, like superscalar, VLIW or parallel processor organizations, for the same degree of operation parallelism [13].

[Figure 1: MicroSIMD subword parallelism leveraging word datapaths. Blocks shown: Register File, Subword Arithmetic Unit, Shift/Permute Unit. Multiple subword arithmetic and shift/permute functional units can be implemented.]

With packed subwords in registers, we now need to be able to re-arrange subwords within a register, and between registers. This is necessary to achieve the maximum parallelism for subsequent processing. Unfortunately, subword permutation operations are not understood as clearly as subword arithmetic operations. They require moving several fields (subwords) in parallel. Conventional shift and rotate instructions move all the bits in a register by the same amount. Extract and deposit instructions, found in instruction-set architectures like PA-RISC [17], move one field using one or two instructions. Early subword permutation instructions like mix and permute [4] in the PA-RISC MAX-2 multimedia instructions are a first attempt to find efficient and general-purpose subword permutation primitives. However, the sufficiency or efficiency of these permutation primitives in achieving any arbitrary permutation has not been demonstrated.

The problem is further complicated by the fact that image, video or graphics processing requires mapping two-dimensional objects onto subwords in multiple registers, and then permuting these subwords between registers. In addition, since permutations have not been easily achieved by programmable processors, algorithm designers may not have optimized algorithms using permutations. Hence, one cannot just comb through all the common multimedia algorithms to determine what permutations are used and the performance impact they have: one would often need to re-think algorithms to see if efficient permutations would help improve the performance.
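To make the subword-parallel model of this section concrete, the following is a minimal Python sketch of a packed add on a 64-bit word. The helper names unpack, pack and padd are hypothetical and chosen for illustration only; they are not instruction names from any architecture discussed here, and modular (wrap-around) arithmetic is assumed.

```python
WORD_BITS = 64  # register width assumed for this sketch

def unpack(word, size):
    """Split a word into subwords of `size` bits, leftmost subword first."""
    n = WORD_BITS // size
    return [(word // 2 ** (size * (n - 1 - i))) % 2 ** size for i in range(n)]

def pack(subwords, size):
    """Pack subwords (leftmost first) into one word, truncating each to `size` bits."""
    word = 0
    for s in subwords:
        word = word * 2 ** size + (s % 2 ** size)
    return word

def padd(x, y, size):
    """Subword-parallel add: each pair of subwords is added modulo 2**size,
    so no carry crosses a subword boundary."""
    sums = [a + b for a, b in zip(unpack(x, size), unpack(y, size))]
    return pack(sums, size)

x = pack([1, 2, 0xFFFF, 4], 16)
y = pack([10, 20, 1, 40], 16)
print(unpack(padd(x, y, 16), 16))   # [11, 22, 0, 44]

# A conventional shift moves every bit by the same amount,
# so all subwords move together by one position here.
shifted = (x * 2 ** 16) % 2 ** WORD_BITS
print(unpack(shifted, 16))          # [2, 65535, 4, 0]
```

The same 64-bit value can be viewed as eight, four or two subwords simply by changing the size argument, which is the partitioning described above.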
In designing subword permutation primitives, we also need to project the permutation needs of future, yet-to-be-defined multimedia algorithms, and this seems to be an intractable problem.

In this paper, we propose a systematic solution to this unsolved problem of finding generic subword permutation primitives for both current and future algorithms for processing two-dimensional multimedia data. We also define a small set of subword permutation primitives, and show that this is both a sufficient and an efficient set.

In section 2, we describe how 2-dimensional frames can be mapped into the packed subwords of microSIMD architectures. We also show that two-dimensional objects can be decomposed into smaller blocks or polygons, and ultimately into atomic 2x2 matrices and triangles. In section 3, we review the subword permutation instructions that have been defined in the multimedia instructions MAX-2 for PA-RISC processors [4] and for IA-64 EPIC processors [15], especially the mix instruction. We show an example of how a permutation on a 2-D object can be decomposed into hierarchical permutations on 2x2 matrices. In section 4, we investigate the subword permutation needs of atomic 2-D structures, and postulate that these are generic primitives since all 2-D frames and objects can be decomposed into these atomic 2-D structures. In section 5, we propose small subsets of subword permutation primitives that are sufficient and also efficient for different performance and cost levels. Section 6 summarizes and concludes the paper.

2. Mapping and decomposition of 2-D blocks

To use microSIMD architectures for maximum performance, we need to map multimedia data into packed subwords in a way that permits maximum parallel execution, SIMD style. Pixel-oriented multimedia data in images, graphics, video or animation are two-dimensional (2-D) in nature. How should 2-D blocks of data be mapped into the packed subwords of microSIMD architectures?

A 2-D array of pixels in memory is normally stored in row-major format: elements of row one are stored sequentially in successive memory locations, followed by elements of row two, and so forth. When words are loaded into registers from memory, this translates into mapping the first row into a set of registers, mapping the second row into another set of registers, and so forth. This is called area-mapping [13] of a 2-D block: different rows of the 2-D block are held in different registers.

A 2-D image or frame is easily decomposed into smaller 2-D blocks. The smallest 2-D block is a 2x2 block (or matrix). A 2-D object within a frame can also be decomposed into smaller blocks, where again the smallest 2-D rectangular block is a 2x2 matrix of pixels. For example, an 8x8 matrix used in DCT or IDCT can be decomposed into four 4x4 matrices, each stored in four 64-bit registers, as shown in Figure 2a, where each element is a 16-bit subword. Each such 4x4 matrix can be further decomposed into four 2x2 matrices (Figure 2b). Matrices with dimensions that are a power of two can be successively decomposed into smaller matrices, and ultimately into the smallest 2x2 matrix.

Figure 2: Area mapping and decomposition of 2-D blocks
(a) Area mapping of a 4x4 matrix: each matrix row is held in one of the registers R1 to R4.
(b) Decomposition into four 2x2 matrices:
    R1 = a00 a01 b00 b01
    R2 = a10 a11 b10 b11
    R3 = c00 c01 d00 d01
    R4 = c10 c11 d10 d11

Non-rectangular objects may more accurately be decomposed into non-rectangular polygons, the smallest of which is a triangle. Since all 2-D frames and objects can be decomposed into atomic 2-D units like the 2x2 matrix and the triangle, we postulate that if we can determine the permutation needs of these atomic units, they can serve as permutation primitives for the entire frame or object. At the lowest level, we permute the four pixels of a 2x2 matrix. At the next higher level, we again permute a 2x2 matrix, where each element is now itself a 2x2 matrix.
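The area mapping and hierarchical decomposition described in this section can be sketched in Python as follows. This is an illustrative model only: each register is a plain list of subword labels, the function names are hypothetical, and a row is treated as one logical register even when (as for an 8x8 matrix of 16-bit elements) it would span two 64-bit registers.

```python
def area_map(matrix):
    """Area mapping: hold each row of a 2-D block in its own (logical) register."""
    return [list(row) for row in matrix]

def decompose(regs):
    """Split an area-mapped n x n block (n a power of two) into four
    n/2 x n/2 blocks: top-left, top-right, bottom-left, bottom-right."""
    h = len(regs) // 2
    top, bottom = regs[:h], regs[h:]
    return [[r[:h] for r in top], [r[h:] for r in top],
            [r[:h] for r in bottom], [r[h:] for r in bottom]]

def atomic_blocks(regs):
    """Recursively decompose a block down to its atomic 2x2 matrices."""
    if len(regs) == 2:
        return [regs]
    return [blk for quarter in decompose(regs) for blk in atomic_blocks(quarter)]

# The 4x4 matrix of Figure 2, one row per register.
m4 = area_map([["a00", "a01", "b00", "b01"],
               ["a10", "a11", "b10", "b11"],
               ["c00", "c01", "d00", "d01"],
               ["c10", "c11", "d10", "d11"]])
for blk in atomic_blocks(m4):
    print(blk)

# An 8x8 block yields four 4x4 blocks and sixteen atomic 2x2 matrices.
m8 = area_map([[(r, c) for c in range(8)] for r in range(8)])
print(len(decompose(m8)), len(atomic_blocks(m8)))   # 4 16
```

The recursion mirrors the two levels of permutation named above: the innermost call sees four pixels, and each outer level sees a 2x2 arrangement whose elements are themselves blocks.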

3. Mix permutation instruction

For microprocessor multimedia instructions, only PA-RISC MAX-2 [4], IA-64 [15] and the PowerPC AltiVec [16] have a few instructions designed for general-purpose subword permutation. Because our focus in this paper is on 2-D multimedia processing, and area-mapped 2-D objects span at least two registers, we seek permutation primitives that reorder subwords from two source registers. We describe mix, defined by the author for MAX-2 and IA-64, which is currently the only subword permutation instruction with two source registers.

3.1. Definition of Mix instruction

The mix operation selects either all even elements, or all odd elements, from the two source registers [4, 15]. The pair of mixl and mixr operations is defined as follows:

mixl: interleave the corresponding even elements from the two source registers, starting from the leftmost elements in each register
mixr: interleave the corresponding odd elements from the two source registers, ending with the rightmost elements in each register

Table 1 defines these mix instructions, for three different subword sizes: 8 bits, 16 bits and 32 bits. Each letter in the register contents represents an 8-bit subword, and each register holds a total of 64 bits.

Table 1: Definition of Mix instruction

Register contents:    R1 = a b c d e f g h
                      R2 = A B C D E F G H

Instruction           Definition
mixl,8  R1,R2,R3      R3 = a A c C e E g G
mixr,8  R1,R2,R3      R3 = b B d D f F h H
mixl,16 R1,R2,R3      R3 = a b A B e f E F
mixr,16 R1,R2,R3      R3 = c d C D g h G H
mixl,32 R1,R2,R3      R3 = a b c d A B C D
mixr,32 R1,R2,R3      R3 = e f g h E F G H

3.2. Example of decomposable subword permutations

A common permutation of a 2-D object is the matrix transpose, where the matrix is flipped along its diagonal: rows become columns, and columns become rows. This is a decomposable permutation. For example, an 8x8 matrix of 16-bit elements stored in 16 registers can be decomposed into four 4x4 matrices (Figure 2a), each of which can be further decomposed into four 2x2 matrices (Figure 2b).
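The mix definitions of Table 1 can be modeled with a short Python sketch, and the same two operations applied at two subword sizes transpose an area-mapped 4x4 matrix. The list-based register model, the elem_bits parameter and the function transpose4x4 are assumptions made for illustration; they are not part of the MAX-2 or IA-64 definitions.

```python
def fields(reg, size, elem_bits):
    """View a register (list of elements of elem_bits each) as fields of `size` bits."""
    k = size // elem_bits
    return [reg[i:i + k] for i in range(0, len(reg), k)]

def mix(r1, r2, size, elem_bits, start):
    """Interleave corresponding fields of r1 and r2, taking every second field."""
    f1, f2 = fields(r1, size, elem_bits), fields(r2, size, elem_bits)
    out = []
    for i in range(start, len(f1), 2):
        out += f1[i] + f2[i]
    return out

def mixl(r1, r2, size, elem_bits=8):
    return mix(r1, r2, size, elem_bits, 0)   # even fields, starting at the left

def mixr(r1, r2, size, elem_bits=8):
    return mix(r1, r2, size, elem_bits, 1)   # odd fields, ending at the right

R1, R2 = list("abcdefgh"), list("ABCDEFGH")
for size in (8, 16, 32):
    print(size, " ".join(mixl(R1, R2, size)), "|", " ".join(mixr(R1, R2, size)))

def transpose4x4(r1, r2, r3, r4):
    """Transpose a 4x4 matrix of 16-bit elements held in four 64-bit registers."""
    t1, t2 = mixl(r1, r2, 16, 16), mixr(r1, r2, 16, 16)   # transpose the 2x2 blocks
    t3, t4 = mixl(r3, r4, 16, 16), mixr(r3, r4, 16, 16)
    return [mixl(t1, t3, 32, 16), mixl(t2, t4, 32, 16),   # transpose the 2x2 of blocks
            mixr(t1, t3, 32, 16), mixr(t2, t4, 32, 16)]

m = [[4 * r + c for c in range(4)] for r in range(4)]
print(transpose4x4(*m))
```

The transpose uses eight mix operations in total: four at the element size and four at twice that size.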
By transposing each of the 2x2 matrices, then transposing the larger 2x2 matrix, where each element is itself one of these 2x2 matrices, we obtain the matrix transpose of a 4x4 matrix (see Figure 3). The mix instructions can perform these hierarchical 2x2 matrix transpositions. The mixl and mixr instructions are used in pairs at the level of subword size equal to the matrix element size. Then, they are used at the size of subwords that are twice as large. Repeating this on each of the four 4x4 matrices completes the transpose of the original 8x8 matrix.

Figure 3: Hierarchical Decomposition of Matrix Transpose Permutation

    Initial contents:
    r1 = a00 a01 b00 b01
    r2 = a10 a11 b10 b11
    r3 = c00 c01 d00 d01
    r4 = c10 c11 d10 d11

    mixl,16 r1,r2,t1      t1 = a00 a10 b00 b10
    mixr,16 r1,r2,t2      t2 = a01 a11 b01 b11
    mixl,16 r3,r4,t3      t3 = c00 c10 d00 d10
    mixr,16 r3,r4,r4      r4 = c01 c11 d01 d11

    mixl,32 t1,t3,r1      r1 = a00 a10 c00 c10
    mixl,32 t2,r4,r2      r2 = a01 a11 c01 c11
    mixr,32 t1,t3,r3      r3 = b00 b10 d00 d10
    mixr,32 t2,r4,r4      r4 = b01 b11 d01 d11

4. Fundamental data rearrangements in 2-D blocks

We propose that a systematic approach to finding a set of permutation primitives for current and future 2-D multimedia programs can be based on decomposing images and objects into atomic units, then finding the permutations desired for these 2-D building blocks. The subword permutation instructions for these 2-D building blocks are also defined for larger subword sizes at successively higher hierarchical levels.

We propose studying the permutations of a 2x2 matrix, and the permutations of the four triangles contained within this 2x2 building block. What are the useful data rearrangements in a 2x2 matrix and its four embedded triangles (section 4.1)? What are permutation primitives that can perform these data rearrangements (section 4.2)? Are these permutation primitives sufficient and efficient (section 4.3)? Can they be generalized (section 4.4)?

4.1. Characterization of 2-D data rearrangements

The first set of data rearrangements likely to be needed in a 2x2 matrix is to be able to swap elements vertically, horizontally and diagonally. This is based on observing that nearest neighbor interactions are perhaps the most common 2-D pixel operations. The eight nearest-neighbor movements for a pixel in a 2-D frame are shown in Figure 4a. Figure 4b expresses the 9-element matrix of Figure 4a as four 2x2 matrices (outlined in bold). Here, an element of a 2x2 matrix can move to its right (or left) neighbor, its downward (or upward) neighbor, or its diagonal right (or left) neighbor. Figure 4c shows all possible nearest neighbor movements, for one or two pairs of elements of a 2x2 matrix.

[Figure 4: Nearest Neighbor Permutations. (a) Nearest neighbor movement for 2-D blocks. (b) Nearest neighbor movement for 2x2 matrices. (c) Nearest neighbor moves for a 2x2 matrix: up-down, right-left, diagonal-antidiagonal, identity, matrix transpose.]

The four elements of a 2x2 matrix can also be rotated clockwise by 1, 2 or 3 positions (Figure 5a). This is equivalent to rotating counter-clockwise by 3, 2 or 1 position. Also, rotating by 2 positions is equivalent to swapping both the diagonal and anti-diagonal elements, as already covered in Figure 4c. Hence, we need only consider rotating clockwise or anticlockwise by 1 position.

[Figure 5: Rotations of a 2x2 matrix and its embedded triangles. (a) Rotation of a 2x2 matrix, clockwise and anti-clockwise; rotating by 2 elements equals swapping the diagonal and anti-diagonal elements. (b) Rotations of the four embedded triangles.]

A 2x2 matrix contains four triangles, each of which can be rotated clockwise or anticlockwise by 1 position. This results in 8 different permutations of the 2x2 matrix, as shown in Figure 5b. Triangles are useful for representing non-rectangular shapes.
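The rotation equivalences stated above are easy to check mechanically. The following Python sketch is illustrative only: it represents a 2x2 matrix as the tuple (top-left, top-right, bottom-left, bottom-right), and the rotate function with its fixed parameter (the corner left out of a triangle) is a hypothetical helper, not an instruction.

```python
# Flat indices are (tl, tr, bl, br) = (0, 1, 2, 3).
# Visiting the corners clockwise gives tl, tr, br, bl.
CW = [0, 1, 3, 2]

def rotate(m, steps=1, fixed=None):
    """Rotate the elements of a 2x2 matrix clockwise by `steps` positions.
    If `fixed` names a corner index, that corner stays put and the
    remaining three elements (an embedded triangle) are rotated."""
    cycle = [p for p in CW if p != fixed]
    out = list(m)
    for i, p in enumerate(cycle):
        out[cycle[(i + steps) % len(cycle)]] = m[p]
    return tuple(out)

M = ("a", "b", "A", "B")   # rows: a b / A B

print(rotate(M, 1))    # ('A', 'a', 'B', 'b')  clockwise by 1
print(rotate(M, 3))    # ('b', 'B', 'a', 'A')  same as anticlockwise by 1
print(rotate(M, 2))    # ('B', 'A', 'b', 'a')  swaps a with B and b with A

# Four triangles, each rotated clockwise or anticlockwise by 1 position.
triangles = {rotate(M, s, fixed=c) for c in range(4) for s in (1, -1)}
print(len(triangles))  # 8
```

Rotation by 2 exchanges the diagonal pair (a, B) and the anti-diagonal pair (b, A), which is why only rotations by 1 position need separate treatment.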

We postulate that these permutations of 2x2 matrices and triangles should be efficiently supported, at all subword sizes (powers of 2), for use in decomposable permutations of 2-D objects. What subword permutation instructions can achieve these common data rearrangements?

4.2. Check, Exchange and Excheck operations

To achieve these common data rearrangements, we only need to define three new subword permutation primitives (see Table 2). The check instruction allows the downward and upward swapping of elements, the exchange instruction allows the right and left movement, while the excheck instruction allows the rotation of a triangle of three elements within a 2x2 matrix. The mixl and mixr operations, defined earlier, achieve the swapping of diagonal elements.

The check instruction performs a checkerboard pattern: it selects alternately from the corresponding subwords in the two source registers, for each position in the result register (see Table 2). Exchange is an operation on a single source register: it swaps adjacent subwords in each pair of consecutive subwords. Excheck can be described as a composite operation: it performs a check on the two source registers, followed by an exchange operation on the result.

Table 2: Definition of Check, Exchange and Excheck

Register contents:      R1 = a b c d e f g h
                        R2 = A B C D E F G H

Instruction             Definition
check,8     R1,R2,R3    R3 = a B c D e F g H
check,16    R1,R2,R3    R3 = a b C D e f G H
check,32    R1,R2,R3    R3 = a b c d E F G H
exchange,8  R1,R3       R3 = b a d c f e h g
exchange,16 R1,R3       R3 = c d a b g h e f
exchange,32 R1,R3       R3 = e f g h a b c d
excheck,8   R1,R2,R3    R3 = B a D c F e H g
excheck,16  R1,R2,R3    R3 = C D a b G H e f
excheck,32  R1,R2,R3    R3 = E F G H a b c d

4.3. Sufficiency and efficiency of permutation primitives

In Table 3, we systematically enumerate the permutations of area-mapped 2x2 matrices, to verify that the subword permutation instructions defined above can indeed perform all these permutations efficiently. R1 and R2 contain four 2x2 matrices. It is easier to follow just the leftmost matrix (in bold), which is labeled as in Figures 4-6, initially a b in R1 and A B in R2.
The permutations are enumerated as follows: each of the 4 elements in a resulting 2x2 matrix can be in the top left corner in R3. Thereafter, each of the 3 remaining elements can be in the top right corner in R3. This gives 12 possibilities for the top row, which is used for the numeric numbering of the cases. The two remaining elements of each 2x2 matrix are in the bottom row in R4, and their two possible orderings give the (a) and (b) numbering in Table 3.
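This enumeration can be checked mechanically. The sketch below is a hedged Python model of the 8-bit forms of the five primitives (the 16-bit and 32-bit forms of Table 2 are omitted for brevity); the list-based registers and the dictionary names are assumptions made for illustration. It confirms that every one of the 24 arrangements of the leftmost 2x2 matrix needs at most one operation per result register.

```python
from itertools import permutations

R1, R2 = list("abcdefgh"), list("ABCDEFGH")

def mixl(x, y):     # interleave even-positioned subwords
    return [s for i in range(0, 8, 2) for s in (x[i], y[i])]

def mixr(x, y):     # interleave odd-positioned subwords
    return [s for i in range(1, 8, 2) for s in (x[i], y[i])]

def check(x, y):    # checkerboard: alternate between x and y
    return [x[i] if i % 2 == 0 else y[i] for i in range(8)]

def exchange(x):    # swap the two subwords of each adjacent pair
    return [x[i ^ 1] for i in range(8)]

def excheck(x, y):  # check followed by exchange
    return exchange(check(x, y))

# Every way to produce one result register with at most one operation.
candidates = {"R1": R1, "R2": R2,
              "exchange(R1)": exchange(R1), "exchange(R2)": exchange(R2)}
for name, op in (("mixl", mixl), ("mixr", mixr),
                 ("check", check), ("excheck", excheck)):
    candidates[f"{name}(R1,R2)"] = op(R1, R2)
    candidates[f"{name}(R2,R1)"] = op(R2, R1)

# Index the candidates by the row they produce for the leftmost 2x2 matrix.
by_row = {tuple(reg[:2]): name for name, reg in candidates.items()}

# For each arrangement (top-left, top-right, bottom-left, bottom-right),
# look up one operation for R3 (top row) and one for R4 (bottom row).
table = {p: (by_row[p[:2]], by_row[p[2:]]) for p in permutations("abAB")}

print(len(by_row), len(table))        # 12 24
print(table[("a", "A", "b", "B")])    # ('mixl(R1,R2)', 'mixr(R1,R2)')
```

Because R3 and R4 are produced independently of each other, the two operations of any case could issue in the same cycle on a processor with two permutation units.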

Table 3: All Permutations of Four area-mapped 2x2 Matrices

Operand registers:    R1 = a b c d e f g h
                      R2 = A B C D E F G H

Case    Result registers         Instructions used      Type of data movement

(a at top left)
1(a)    R3 = a b c d e f g h     R3 = R1                identity permutation
        R4 = A B C D E F G H     R4 = R2
1(b)    R3 = a b c d e f g h     R3 = R1                swap bottom row elements right-left
        R4 = B A D C F E H G     R4 = exchange(R2)
2(a)    R3 = a B c D e F g H     R3 = check(R1,R2)      swap right column elements up-down
        R4 = A b C d E f G h     R4 = check(R2,R1)
2(b)    R3 = a B c D e F g H     R3 = check(R1,R2)      rotate bottom-right triangle anticlockwise
        R4 = b A d C f E h G     R4 = excheck(R2,R1)
3(a)    R3 = a A c C e E g G     R3 = mixL(R1,R2)       swap diagonal elements = transpose
        R4 = b B d D f F h H     R4 = mixR(R1,R2)
3(b)    R3 = a A c C e E g G     R3 = mixL(R1,R2)       rotate bottom-right triangle clockwise
        R4 = B b D d F f H h     R4 = mixR(R2,R1)

(b at top left)
4(a)    R3 = b a d c f e h g     R3 = exchange(R1)      swap top row elements right-left
        R4 = A B C D E F G H     R4 = R2
4(b)    R3 = b a d c f e h g     R3 = exchange(R1)      swap both rows' elements right-left
        R4 = B A D C F E H G     R4 = exchange(R2)
5(a)    R3 = b B d D f F h H     R3 = mixR(R1,R2)       rotate top-right triangle anticlockwise
        R4 = A a C c E e G g     R4 = mixL(R2,R1)
5(b)    R3 = b B d D f F h H     R3 = mixR(R1,R2)       rotate anti-clockwise 1 element
        R4 = a A c C e E g G     R4 = mixL(R1,R2)
6(a)    R3 = b A d C f E h G     R3 = excheck(R2,R1)    rotate top-left triangle anticlockwise
        R4 = a B c D e F g H     R4 = check(R1,R2)
6(b)    R3 = b A d C f E h G     R3 = excheck(R2,R1)
        R4 = B a D c F e H g     R4 = excheck(R1,R2)

(A at top left)
7(a)    R3 = A a C c E e G g     R3 = mixL(R2,R1)       rotate top-left triangle clockwise
        R4 = b B d D f F h H     R4 = mixR(R1,R2)
7(b)    R3 = A a C c E e G g     R3 = mixL(R2,R1)       rotate clockwise 1 element
        R4 = B b D d F f H h     R4 = mixR(R2,R1)
8(a)    R3 = A b C d E f G h     R3 = check(R2,R1)      swap left column elements up-down
        R4 = a B c D e F g H     R4 = check(R1,R2)
8(b)    R3 = A b C d E f G h     R3 = check(R2,R1)      rotate bottom-left triangle clockwise
        R4 = B a D c F e H g     R4 = excheck(R1,R2)
9(a)    R3 = A B C D E F G H     R3 = R2                swap left and right column elements up-down
        R4 = a b c d e f g h     R4 = R1
9(b)    R3 = A B C D E F G H     R3 = R2
        R4 = b a d c f e h g     R4 = exchange(R1)

(B at top left)
10(a)   R3 = B a D c F e H g     R3 = excheck(R1,R2)    rotate top-right triangle clockwise
        R4 = A b C d E f G h     R4 = check(R2,R1)
10(b)   R3 = B a D c F e H g     R3 = excheck(R1,R2)
        R4 = b A d C f E h G     R4 = excheck(R2,R1)
11(a)   R3 = B b D d F f H h     R3 = mixR(R2,R1)       rotate bottom-left triangle anticlockwise
        R4 = a A c C e E g G     R4 = mixL(R1,R2)
11(b)   R3 = B b D d F f H h     R3 = mixR(R2,R1)       swap anti-diagonal elements
        R4 = A a C c E e G g     R4 = mixL(R2,R1)
12(a)   R3 = B A D C F E H G     R3 = exchange(R2)
        R4 = a b c d e f g h     R4 = R1
12(b)   R3 = B A D C F E H G     R3 = exchange(R2)      swap diagonal and anti-diagonal elements
        R4 = b a d c f e h g     R4 = exchange(R1)      = rotate clockwise by 2

The subword permutation instructions used to achieve each of the 2x2 block permutations are shown. Only the 5 subword permutation primitives defined earlier are needed: mixl, mixr, exchange, check, and excheck. If the processor has at least two permutation units, then each case in Table 3 can be executed in one cycle, since there are no dependencies in generating R3 and R4. This establishes the efficiency of these permutation primitives.

Each 2x2 matrix permutation is also labeled with one of the 20 data movements (including identity) described in Figures 4c, 5a and 5b. There are four permutations in Table 3 that are not labeled with a data movement described earlier. They correspond to more esoteric data rearrangements of a 2x2 matrix, described best as changing rows into diagonals, and changing diagonals into columns (Figure 6). Even though these four permutations were not initially identified as data rearrangements to be supported, the permutation primitives we defined efficiently support them. This supports the thesis that if we can define permutation primitives that somehow form a basis set, they can be used to implement other permutations that may be needed in algorithms yet to be invented.

[Figure 6: Four unlabeled permutations of a 2x2 matrix. Panels: identity, changing rows to diagonals, changing diagonals to columns.]

4.4. Repeating permutations on smaller subsets of subwords

The exchange instruction can be replaced by a more general permset instruction, which repeats a permutation on a subset of elements over the rest of the elements in the register. Permset is also a generalization of the permute instruction in MAX-2 [4]. The subwords in the source register are numbered, and permute specifies the new ordering desired in terms of this numbering. The mux instruction in IA-64 [15] and the vperm instruction in AltiVec [16] are similar. Table 4 gives examples of this permute instruction on 8-bit and 16-bit subwords.
Table 4: Examples of Permute Instruction on 8-bit and 16-bit Subwords

Operand register:    R1 = a b c d e f g h

Permute instruction         Result register contents    Type of permutation
permute,8, R1, Rt           Rt = a b c d e f g h        identity permutation
permute,8, R1, Rt           Rt = b a d c f e h g        exchange
permute,8, R1, Rt           Rt = g g g g g g g g        broadcast
permute,8, R1, Rt           Rt = h g f e d c b a        reverse
permute,8, R1, Rt           Rt = f c h g d e            arbitrary permutation
permute,8, R1, Rt           Rt = f f d g g              permutation with repetitions
permute,16,0213 R1, Rt      Rt = a b e f c d g h        permuting four 16-bit subwords

There is a limit to the efficiency of the permute instruction for permuting many subwords, since the control bits quickly exceed the number of bits permuted. Permuting four subwords requires only 8 control bits, which can be encoded in the permute instruction itself [4, 15]. Beyond four elements and up to sixteen elements, any arbitrary permutation can still be performed with one instruction, by providing the control bits for the permutation in a second source register [16], rather than in the 32-bit instruction. Permuting 32 elements requires 160 bits, and permuting 64 elements requires 384 bits (n*log n bits, with the logarithm taken to base 2). Hence, permuting more than 16 elements cannot be achieved by a single instruction with two source registers, using this method of specifying permutations.

To permute more subwords without increasing the number of control bits required, we define a new permset instruction which permutes a subset of m subwords, where m is less than the number of subwords in the register. The same permutation is repeated on consecutive subsets of m subwords. If the total number of subwords in the register is not a multiple of m, we can pad this last set of subwords with zeros.

Table 5: Replacing 8-element Permute with 4-element Permset instructions

Permute example         Equivalent Permset instructions    Type of permutation
permute,8, R1, Rt       permset,8,4,0123  R1, Rt           identity
permute,8, R1, Rt       permset,8,4,1032  R1, Rt           exchange
permute,8, R1, Rt       permset,8,4,2222  R1, Rt           broadcast
                        permset,16,4,2222 Rt, Rt
permute,8, R1, Rt       permset,8,4,3210  R1, Rt           reverse
                        permset,16,4,2301 Rt, Rt

A permute instruction can be turned into a permset instruction by inserting a new parameter which specifies the number of elements to be permuted in each set. In Table 5, this is the second parameter, inserted between the two existing parameters of subword size and permutation control bits. Using this new permset instruction, the first four permutations in Table 4 can also be specified as permutations on sets of 4 elements, as shown in Table 5. The identity and exchange operations can be replaced by exactly one such permset instruction. The broadcast and reverse operations each need two permset instructions, with 4-element permute sets. The next two permute instructions in Table 4 cannot be accomplished in 1 or 2 instructions, because of the lack of symmetry in the permutation done on consecutive sets of 4 elements.
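The control-bit count and the permset semantics described above can be sketched in Python. This is a minimal model under stated assumptions: a register is a list of eight 8-bit subwords, the control field is given as a string of digits, the register length is a multiple of the set size (so the zero-padding case is not modeled), and the function names are hypothetical.

```python
def control_bits(n):
    """Bits needed to specify an arbitrary permutation of n subwords
    (n a power of two): n * log2(n)."""
    return n * (n.bit_length() - 1)

def permset(reg, size, m, order, elem_bits=8):
    """Apply the same m-element permutation `order` to each consecutive
    set of m subwords of `size` bits."""
    k = size // elem_bits
    fields = [reg[i:i + k] for i in range(0, len(reg), k)]
    idx = [int(c) for c in order]
    out = []
    for s in range(0, len(fields), m):
        chunk = fields[s:s + m]
        for i in idx:
            out += chunk[i]
    return out

print([control_bits(n) for n in (4, 16, 32, 64)])   # [8, 64, 160, 384]

R1 = list("abcdefgh")
print("".join(permset(R1, 8, 4, "1032")))           # badcfehg  (exchange)

# Broadcast and reverse each take two permset steps, as in Table 5.
t = permset(R1, 8, 4, "2222")
print("".join(permset(t, 16, 4, "2222")))           # gggggggg  (broadcast)
t = permset(R1, 8, 4, "3210")
print("".join(permset(t, 16, 4, "2301")))           # hgfedcba  (reverse)
```

With 4-element sets the control field stays at 8 bits regardless of how many subwords the register holds, which is the saving the permset definition is after.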
While the permset instruction with 4-element sets is not as general as the full permute instruction on 8 elements, it can specify all possible permutations of 2x2 matrices, with lower implementation cost.

5. Alphabet of Subword Permutation Primitives

An alphabet is a small set of basic primitives from which words, phrases, sentences, paragraphs and stories can be built. Many of these stories and words were not even conceived when the alphabet was designed. We propose an alphabet of fundamental permutation primitives, which are simple yet powerful enough to express all data rearrangement needs of current and future 2-D media processing programs.

The mix operations appear to be truly fundamental, selecting fairly between elements across the width of both source registers, embodying the powerful even-odd paradigm. Although the check instruction can be derived from the mix operation, it can also be considered a fundamental permutation since it embodies the checkerboard pattern. The exchange operation, while a useful permutation primitive in itself, can be replaced by the more general permset instruction, as described above.

An initial alphabet of subword permutation primitives is shown in Figure 7, including mixl, mixr, permset, check and excheck, defined on 8, 16 and 32-bit subwords. For very low cost implementations, at slightly reduced performance, a minimal alphabet could exclude check and excheck. Check may be excluded from a minimal set, because a Shift_Left of the second operand, followed by a mixl instruction, can accomplish it. Excheck is the composition of check followed by exchange, so it may also be omitted from a minimal set of fundamental permutations. They are included in the initial alphabet for efficiency and uniformity in performance, so that every permutation of a basic 2x2 matrix, as enumerated in Table 3, can be done in a single cycle (or single step).

Figure 7: Alphabet of Subword Permutation Primitives

    Minimal alphabet:
        mixl, mixr on 8, 16 and 32-bit subwords
        permset on 8, 16 and 32-bit subwords, with 4-element sets
    Additional primitives:
        check on 8, 16 and 32-bit subwords
        excheck on 8, 16 and 32-bit subwords

The minimal set of mixl, mixr and permset may be further reduced depending on the size of the registers in the processor. For example, if registers are only 64 bits wide, then permutation instructions for the two 32-bit subwords may not be needed, since they can easily be specified as permutations on the four 16-bit subwords. These permutation instructions may also be extended down to subwords of 4 bits, 2 bits and 1 bit, especially if it is also desired to support permutations for cryptography efficiently.

6. Summary

MicroSIMD architecture incorporating subword parallelism is very efficient for application-specific media processors, as well as for fast multimedia information processing in general-purpose microprocessors. This is because, in the large majority of cases, microSIMD architectures can exploit the data-parallelism present in multimedia programs as efficiently as other more expensive parallel architectures. The reduced complexity in register ports and register bypassing in microSIMD architectures results in faster cycle times, less area and less design complexity for the same degree of parallelism as other parallel architectures like VLIW, superscalar, or conventional SIMD or MIMD parallel processor architectures [13].
We pose the problem of finding a small set of fundamental subword permutation operations that can be used efficiently for current and future two-dimensional multimedia programs. Such a subword permutation instruction rearranges data between subword tracks in microSIMD architectures, performing a function like that of interconnection networks which move data between parallel processors in conventional SIMD or MIMD parallel processor architectures. While this initially appears to be an intractable problem, this paper describes a novel approach to solving this problem systematically.

We first describe how 2-dimensional objects are loaded into registers as packed subwords in area-mapped format, corresponding to how 2-dimensional data is usually stored in memory. We use the 2x2 matrix as a basic building block to which 2-dimensional frames of pixels and 2-D objects can be hierarchically decomposed. We then characterize the interesting permutation operations of this basic 2x2 matrix, as well as the four triangles that it contains. These are vertical, horizontal, diagonal, and rotational rearrangements of various kinds.

We define new subword permutation primitives: check, exchange, excheck, and permset. The check instruction allows the downward and upward swapping of elements, the exchange instruction allows the right and left movement, while the excheck instruction allows the rotation of triangles. The mixl and mixr operations defined earlier [4] achieve the swapping of diagonal elements. Permset allows the permutation of a smaller set of subwords to be repeated on other subwords in the source register, enabling symmetric permutations to be specified on many more elements, without increasing the number of permutation control bits. Exchange is one example of the permset instruction.

We then define an initial alphabet (Alphabet A) of subword permutations which contains mix, permset, check and excheck. Processors designed for high performance can implement Alphabet A, while very cost sensitive processors can choose to implement an even smaller set: a minimal alphabet of only mix and permset instructions. The omitted instructions, check and excheck in Alphabet A, can be composed from mix and permset. That this minimal set is essentially equivalent to the set consisting of mix and permute in MAX-2 is a partial validation of the sufficiency of the subword permutation instructions chosen for MAX-2 [4].

We verify that all the 24 permutations of a 2x2 matrix can be obtained using only instructions from Alphabet A, in a single cycle, in a processor with at least two permutation units.

Just as subword parallelism is useful beyond multimedia processing for accelerating all forms of data-parallel computations on lower precision data, we expect that subword permutations will be equally useful. The problem is that there are so many possible rearrangements of a rectangular grid of pixels of arbitrary size that it is extremely difficult to select a set of fundamental permutation primitives, from which all other permutations can be built. This paper has proposed a systematic approach to solving this problem, and has proposed a very small alphabet of fundamental subword permutation primitives for existing and future two-dimensional processing in microSIMD architectures.

7. References

1. Ruby Lee, "Accelerating Multimedia with Enhanced Microprocessors", IEEE Micro, Vol. 15 No. 2, April 1995, pp.
2. Marc Tremblay, J. O'Connor, V. Narayanan, and L. He, "VIS Speeds New Media Processing", IEEE Micro, Vol. 16 No. 4, August 1996, pp.
3. Alex Peleg and Uri Weiser, "MMX Technology Extension to the Intel Architecture", IEEE Micro, Vol. 16 No. 4, August 1996, pp.
4. Ruby Lee, "Subword Parallelism with MAX-2", IEEE Micro, Vol. 16 No. 4, August 1996, pp.
5. Ninth Annual Microprocessor Forum, October 21-24, 1996, San Jose, California (Mips and Alpha multimedia).
6. Ruby Lee, "Multimedia Extensions for General-Purpose Processors", IEEE Workshop on Signal Processing Systems SiPS97 Design and Implementation, November 3-5, 1997, Leicester, United Kingdom, pp.
7. S. Oberman, F. Weber, N. Juffa, G. Favor, "AMD 3DNow! Technology and the K6-2 Microprocessor", Hot Chips 10 Symposium on High-Performance Chips, August 16-18, 1998, Palo Alto, California, pp.
8. J. Golston, "Single-Chip H.324 Videoconferencing", IEEE Micro, Vol. 16 No. 4, August 1996, pp.
9. S. Dutta, K. O'Connor, W. Wolf, and A. Wolfe, "A Design Study of a 0.25-um Video Signal Processor", IEEE Transactions on Circuits and Systems for Video Technology, Vol. 8 No. 4, August
10. Karl Guttag et al, "A Single-Chip Multiprocessor for Multimedia: the MVP", IEEE Computer Graphics and Applications, Vol. 12 No. 6, November 1992, pp.
11. P. Foley, "The Mpact Media Processor Redefines the Multimedia PC", Proc. Compcon, IEEE Computer Society Press, Los Alamitos, Calif., 1996, pp.
12. C. Basoglu, W. Lee, J. O'Donnell, "The MAP1000A VLIW Mediaprocessor", Equator Technologies Inc.
13. Ruby Lee, "Efficiency of microSIMD architectures and index-mapped data for media processing", Proceedings of IS&T/SPIE Symposium on Electronic Imaging: Media Processors 99, January 1999, San Jose, California.
14. Michael Flynn, "Very High-Speed Computing Systems", Proceedings of IEEE, Vol. 54 No. 12, 1966, pp.
15. IA-64 Application Developer's Architecture Guide, Intel Corporation, Order Number: , May
16. AltiVec Extension to PowerPC Instruction Set Architecture Specification, Motorola, Inc., May
17. Ruby Lee, "Precision Architecture", IEEE Computer, Vol. 22, No. 1, Jan 1989, pp.


More information

Student Book SERIES. Fractions. Name

Student Book SERIES. Fractions. Name D Student Book Nme Series D Contents Topic Introducing frctions (pp. ) modelling frctions frctions of collection compring nd ordering frctions frction ingo pply Dte completed / / / / / / / / Topic Types

More information

DIGITAL multipliers [1], [2] are the core components of

DIGITAL multipliers [1], [2] are the core components of World Acdemy of Science, Engineering nd Technology 9 8 A Reduced-Bit Multipliction Algorithm for Digitl Arithmetic Hrpreet Singh Dhillon nd Ahijit Mitr Astrct A reduced-it multipliction lgorithm sed on

More information

Implementation of Different Architectures of Forward 4x4 Integer DCT For H.264/AVC Encoder

Implementation of Different Architectures of Forward 4x4 Integer DCT For H.264/AVC Encoder Implementtion of Different Architectures of Forwrd 4x4 Integer DCT For H.64/AVC Encoder Bunji Antoinette Ringnyu, Ali Tngel, Emre Krulut 3 Koceli University, Institute of Science nd Technology, Koceli,

More information

On the Description of Communications Between Software Components with UML

On the Description of Communications Between Software Components with UML On the Description of Communictions Between Softwre Components with UML Zhiwei An Dennis Peters Fculty of Engineering nd Applied Science Memoril University of Newfoundlnd St. John s NL A1B 3X5 zhiwei@engr.mun.c

More information

& Y Connected resistors, Light emitting diode.

& Y Connected resistors, Light emitting diode. & Y Connected resistors, Light emitting diode. Experiment # 02 Ojectives: To get some hndson experience with the physicl instruments. To investigte the equivlent resistors, nd Y connected resistors, nd

More information

Student Book SERIES. Patterns and Algebra. Name

Student Book SERIES. Patterns and Algebra. Name E Student Book 3 + 7 5 + 5 Nme Contents Series E Topic Ptterns nd functions (pp. ) identifying nd creting ptterns skip counting completing nd descriing ptterns predicting repeting ptterns predicting growing

More information

CS 135: Computer Architecture I. Boolean Algebra. Basic Logic Gates

CS 135: Computer Architecture I. Boolean Algebra. Basic Logic Gates Bsic Logic Gtes : Computer Architecture I Boolen Algebr Instructor: Prof. Bhgi Nrhri Dept. of Computer Science Course URL: www.ses.gwu.edu/~bhgiweb/cs35/ Digitl Logic Circuits We sw how we cn build the

More information

First Round Solutions Grades 4, 5, and 6

First Round Solutions Grades 4, 5, and 6 First Round Solutions Grdes 4, 5, nd 1) There re four bsic rectngles not mde up of smller ones There re three more rectngles mde up of two smller ones ech, two rectngles mde up of three smller ones ech,

More information

Use of compiler optimization of software bypassing as a method to improve energy efficiency of exposed data path architectures

Use of compiler optimization of software bypassing as a method to improve energy efficiency of exposed data path architectures Guzm et l. EURASIP Journl on Emedded Systems 213, 213:9 RESEARCH Open Access Use of compiler optimiztion of softwre ypssing s method to improve energy efficiency of exposed dt pth rchitectures Vldimír

More information

Regular languages can be expressed as regular expressions.

Regular languages can be expressed as regular expressions. Regulr lnguges cn e expressed s regulr expressions. A generl nondeterministic finite utomton (GNFA) is kind of NFA such tht: There is unique strt stte nd is unique ccept stte. Every pir of nodes re connected

More information

Solutions to exercise 1 in ETS052 Computer Communication

Solutions to exercise 1 in ETS052 Computer Communication Solutions to exercise in TS52 Computer Communiction 23 Septemer, 23 If it occupies millisecond = 3 seconds, then second is occupied y 3 = 3 its = kps. kps If it occupies 2 microseconds = 2 6 seconds, then

More information

Dataflow Language Model. DataFlow Models. Applications of Dataflow. Dataflow Languages. Kahn process networks. A Kahn Process (1)

Dataflow Language Model. DataFlow Models. Applications of Dataflow. Dataflow Languages. Kahn process networks. A Kahn Process (1) The slides contin revisited mterils from: Peter Mrwedel, TU Dortmund Lothr Thiele, ETH Zurich Frnk Vhid, University of liforni, Riverside Dtflow Lnguge Model Drsticlly different wy of looking t computtion:

More information

Sequential Logic (2) Synchronous vs Asynchronous Sequential Circuit. Clock Signal. Synchronous Sequential Circuits. FSM Overview 9/10/12

Sequential Logic (2) Synchronous vs Asynchronous Sequential Circuit. Clock Signal. Synchronous Sequential Circuits. FSM Overview 9/10/12 9//2 Sequentil (2) ENGG5 st Semester, 22 Dr. Hden So Deprtment of Electricl nd Electronic Engineering http://www.eee.hku.hk/~engg5 Snchronous vs Asnchronous Sequentil Circuit This Course snchronous Sequentil

More information

MAXIMUM FLOWS IN FUZZY NETWORKS WITH FUNNEL-SHAPED NODES

MAXIMUM FLOWS IN FUZZY NETWORKS WITH FUNNEL-SHAPED NODES MAXIMUM FLOWS IN FUZZY NETWORKS WITH FUNNEL-SHAPED NODES Romn V. Tyshchuk Informtion Systems Deprtment, AMI corportion, Donetsk, Ukrine E-mil: rt_science@hotmil.com 1 INTRODUCTION During the considertion

More information

Color gamut reduction techniques for printing with custom inks

Color gamut reduction techniques for printing with custom inks Color gmut reduction techniques for printing with custom inks Sylvin M. CHOSSON *, Roger D. HERSCH * Ecole Polytechnique Fédérle de usnne (EPF) STRCT Printing with custom inks is of interest oth for rtistic

More information

To provide data transmission in indoor

To provide data transmission in indoor Hittite Journl of Science nd Engineering, 2018, 5 (1) 25-29 ISSN NUMBER: 2148-4171 DOI: 10.17350/HJSE19030000074 A New Demodultor For Inverse Pulse Position Modultion Technique Mehmet Sönmez Osmniye Korkut

More information

Multi-beam antennas in a broadband wireless access system

Multi-beam antennas in a broadband wireless access system Multi-em ntenns in rodnd wireless ccess system Ulrik Engström, Mrtin Johnsson, nders Derneryd nd jörn Johnnisson ntenn Reserch Center Ericsson Reserch Ericsson SE-4 84 Mölndl Sweden E-mil: ulrik.engstrom@ericsson.com,

More information

Example. Check that the Jacobian of the transformation to spherical coordinates is

Example. Check that the Jacobian of the transformation to spherical coordinates is lss, given on Feb 3, 2, for Mth 3, Winter 2 Recll tht the fctor which ppers in chnge of vrible formul when integrting is the Jcobin, which is the determinnt of mtrix of first order prtil derivtives. Exmple.

More information

SOLVING TRIANGLES USING THE SINE AND COSINE RULES

SOLVING TRIANGLES USING THE SINE AND COSINE RULES Mthemtics Revision Guides - Solving Generl Tringles - Sine nd Cosine Rules Pge 1 of 17 M.K. HOME TUITION Mthemtics Revision Guides Level: GCSE Higher Tier SOLVING TRIANGLES USING THE SINE AND COSINE RULES

More information

Three-Phase Synchronous Machines The synchronous machine can be used to operate as: 1. Synchronous motors 2. Synchronous generators (Alternator)

Three-Phase Synchronous Machines The synchronous machine can be used to operate as: 1. Synchronous motors 2. Synchronous generators (Alternator) Three-Phse Synchronous Mchines The synchronous mchine cn be used to operte s: 1. Synchronous motors 2. Synchronous genertors (Alterntor) Synchronous genertor is lso referred to s lterntor since it genertes

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 2013 http://cousticlsociety.org/ ICA 2013 Montrel Montrel, Cnd 2-7 June 2013 Signl Processing in Acoustics Session 4SP: Sensor Arry Bemforming nd Its Applictions

More information

Spiral Tilings with C-curves

Spiral Tilings with C-curves Spirl Tilings with -curves Using ombintorics to Augment Trdition hris K. Plmer 19 North Albny Avenue hicgo, Illinois, 0 chris@shdowfolds.com www.shdowfolds.com Abstrct Spirl tilings used by rtisns through

More information

EE Controls Lab #2: Implementing State-Transition Logic on a PLC

EE Controls Lab #2: Implementing State-Transition Logic on a PLC Objective: EE 44 - Controls Lb #2: Implementing Stte-rnsition Logic on PLC ssuming tht speed is not of essence, PLC's cn be used to implement stte trnsition logic. he dvntge of using PLC over using hrdwre

More information

Homework #1 due Monday at 6pm. White drop box in Student Lounge on the second floor of Cory. Tuesday labs cancelled next week

Homework #1 due Monday at 6pm. White drop box in Student Lounge on the second floor of Cory. Tuesday labs cancelled next week Announcements Homework #1 due Mondy t 6pm White drop ox in Student Lounge on the second floor of Cory Tuesdy ls cncelled next week Attend your other l slot Books on reserve in Bechtel Hmley, 2 nd nd 3

More information

CHAPTER 3 AMPLIFIER DESIGN TECHNIQUES

CHAPTER 3 AMPLIFIER DESIGN TECHNIQUES CHAPTER 3 AMPLIFIER DEIGN TECHNIQUE 3.0 Introduction olid-stte microwve mplifiers ply n importnt role in communiction where it hs different pplictions, including low noise, high gin, nd high power mplifiers.

More information

INSTITUTE OF AERONAUTICAL ENGINEERING (Autonomous) Dundigal, Hyderabad

INSTITUTE OF AERONAUTICAL ENGINEERING (Autonomous) Dundigal, Hyderabad Hll Ticket No Question Pper Code: AEC009 INSTITUTE OF AERONAUTICAL ENGINEERING (Autonomous) Dundigl, Hyderd - 500 043 MODEL QUESTION PAPER Four Yer B.Tech V Semester End Exmintions, Novemer - 2018 Regultions:

More information

10.4 AREAS AND LENGTHS IN POLAR COORDINATES

10.4 AREAS AND LENGTHS IN POLAR COORDINATES 65 CHAPTER PARAMETRIC EQUATINS AND PLAR CRDINATES.4 AREAS AND LENGTHS IN PLAR CRDINATES In this section we develop the formul for the re of region whose oundry is given y polr eqution. We need to use the

More information

A Development of Earthing-Resistance-Estimation Instrument

A Development of Earthing-Resistance-Estimation Instrument A Development of Erthing-Resistnce-Estimtion Instrument HITOSHI KIJIMA Abstrct: - Whenever erth construction work is done, the implnted number nd depth of electrodes hve to be estimted in order to obtin

More information

Computing Logic-Stage Delays Using Circuit Simulation and Symbolic Elmore Analysis

Computing Logic-Stage Delays Using Circuit Simulation and Symbolic Elmore Analysis Computing Logic-Stge Delys Using Circuit Simultion nd Symolic Elmore Anlysis Clyton B. McDonld Rndl E. Brynt Deprtment of Electricl nd Computer Engineering Crnegie Mellon University, Pittsurgh, PA 15213

More information

arxiv: v1 [cs.cc] 29 Mar 2012

arxiv: v1 [cs.cc] 29 Mar 2012 Solving Mhjong Solitire ords with peeking Michiel de Bondt rxiv:1203.6559v1 [cs.cc] 29 Mr 2012 Decemer 22, 2013 Astrct We first prove tht solving Mhjong Solitire ords with peeking is NPcomplete, even if

More information

(1) Non-linear system

(1) Non-linear system Liner vs. non-liner systems in impednce mesurements I INTRODUCTION Electrochemicl Impednce Spectroscopy (EIS) is n interesting tool devoted to the study of liner systems. However, electrochemicl systems

More information

CS2204 DIGITAL LOGIC & STATE MACHINE DESIGN SPRING 2005

CS2204 DIGITAL LOGIC & STATE MACHINE DESIGN SPRING 2005 CS2204 DIGITAL LOGIC & STATE MACHINE DESIGN SPRING 2005 EXPERIMENT 1 FUNDAMENTALS 1. GOALS : Lern how to develop cr lrm digitl circuit during which the following re introduced : CS2204 l fundmentls, nd

More information

The Math Learning Center PO Box 12929, Salem, Oregon Math Learning Center

The Math Learning Center PO Box 12929, Salem, Oregon Math Learning Center Resource Overview Quntile Mesure: Skill or Concept: 300Q Model the concept of ddition for sums to 10. (QT N 36) Model the concept of sutrction using numers less thn or equl to 10. (QT N 37) Write ddition

More information

Exercise 1-1. The Sine Wave EXERCISE OBJECTIVE DISCUSSION OUTLINE. Relationship between a rotating phasor and a sine wave DISCUSSION

Exercise 1-1. The Sine Wave EXERCISE OBJECTIVE DISCUSSION OUTLINE. Relationship between a rotating phasor and a sine wave DISCUSSION Exercise 1-1 The Sine Wve EXERCISE OBJECTIVE When you hve completed this exercise, you will be fmilir with the notion of sine wve nd how it cn be expressed s phsor rotting round the center of circle. You

More information

Software for the automatic scaling of critical frequency f 0 F2 and MUF(3000)F2 from ionograms applied at the Ionospheric Observatory of Gibilmanna

Software for the automatic scaling of critical frequency f 0 F2 and MUF(3000)F2 from ionograms applied at the Ionospheric Observatory of Gibilmanna ANNALS OF GEOPHYSICS, VOL. 47, N. 6, Decemer 2004 Softwre for the utomtic scling of criticl frequency f 0 F2 nd MUF(3000)F2 from ionogrms pplied t the Ionospheric Oservtory of Giilmnn Michel Pezzopne nd

More information

Seven Sisters. Visit for video tutorials

Seven Sisters. Visit   for video tutorials Seven Sisters This imge is from www.quiltstudy.org. Plese visit this website for more informtion on Seven Sisters quilt ptterns. Visit www.blocloc.com for video tutorils 1 The Seven Sisters design cn be

More information

Genetic Representations for Evolutionary Minimization of Network Coding Resources

Genetic Representations for Evolutionary Minimization of Network Coding Resources Genetic Representtions for Evolutionry Minimiztion of Network Coding Resources Minkyu Kim 1, Vrun Aggrwl 2, Un-My O Reilly 2, Muriel Médrd 1, nd Wonsik Kim 1 1 Lortory for Informtion nd Decision Systems

More information

Aquauno Select MINUTES. (duration) FREQUENCY LED. OFF 8h AQUAUNO SELECT 5 MIN FREQUENCY. the timer is being programmed;

Aquauno Select MINUTES. (duration) FREQUENCY LED. OFF 8h AQUAUNO SELECT 5 MIN FREQUENCY. the timer is being programmed; Aquuno Select Pg. INSTALLATION. Attch the timer to cold wter tp, following these simple instructions. Do not instll the timer in pit or vlve ox, elow ground level or indoors. Do not use the timer with

More information

Design and Development of 8-Bits Fast Multiplier for Low Power Applications

Design and Development of 8-Bits Fast Multiplier for Low Power Applications IACSIT Interntionl Journl of Engineering nd Technology, Vol. 4, No. 6, Decemer 22 Design nd Development of 8-Bits Fst Multiplier for Low Power Applictions Vsudev G. nd Rjendr Hegdi, Memer, IACSIT proportionl

More information

Convolutional Networks. Lecture slides for Chapter 9 of Deep Learning Ian Goodfellow

Convolutional Networks. Lecture slides for Chapter 9 of Deep Learning Ian Goodfellow Convolutionl Networks Lecture slides for Chpter 9 of Deep Lerning In Goodfellow 2016-09-12 Convolutionl Networks Scle up neurl networks to process very lrge imges / video sequences Sprse connections Prmeter

More information

Module 9. DC Machines. Version 2 EE IIT, Kharagpur

Module 9. DC Machines. Version 2 EE IIT, Kharagpur Module 9 DC Mchines Version EE IIT, Khrgpur esson 40 osses, Efficiency nd Testing of D.C. Mchines Version EE IIT, Khrgpur Contents 40 osses, efficiency nd testing of D.C. mchines (esson-40) 4 40.1 Gols

More information

MATH 118 PROBLEM SET 6

MATH 118 PROBLEM SET 6 MATH 118 PROBLEM SET 6 WASEEM LUTFI, GABRIEL MATSON, AND AMY PIRCHER Section 1 #16: Show tht if is qudrtic residue modulo m, nd b 1 (mod m, then b is lso qudrtic residue Then rove tht the roduct of the

More information

EQ: What are the similarities and differences between matrices and real numbers?

EQ: What are the similarities and differences between matrices and real numbers? Unit 4 Lesson 1 Essentil Question Stndrds Objectives Vocbulry Mtrices Mtrix Opertions Wht re the similrities nd differences between mtrices nd rel numbers? M.ALGII.2.4 Unit 4: Lesson 1 Describe how you

More information

April 9, 2000 DIS chapter 10 CHAPTER 3 : INTEGRATED PROCESSOR-LEVEL ARCHITECTURES FOR REAL-TIME DIGITAL SIGNAL PROCESSING

April 9, 2000 DIS chapter 10 CHAPTER 3 : INTEGRATED PROCESSOR-LEVEL ARCHITECTURES FOR REAL-TIME DIGITAL SIGNAL PROCESSING April 9, 2000 DIS chpter 0 CHAPTE 3 : INTEGATED POCESSO-LEVEL ACHITECTUES FO EAL-TIME DIGITAL SIGNAL POCESSING April 9, 2000 DIS chpter 3.. INTODUCTION The purpose of this chpter is twofold. Firstly, bsic

More information

Operation Manual. Addendum. Embroidery Machine. Product Code: 884-T13

Operation Manual. Addendum. Embroidery Machine. Product Code: 884-T13 Emroidery Mchine Opertion Mnul Addendum Product Code: 884-T13 Be sure to red this document efore using the mchine. We recommend tht you keep this document nery for future reference. ABOUT THIS MANUAL ABOUT

More information

Automatic Synthesis of Compressor Trees: Reevaluating Large Counters

Automatic Synthesis of Compressor Trees: Reevaluating Large Counters Automtic Snthesis of Compressor Trees: Reevluting Lrge Counters Aj K. Verm AjKumr.Verm@epfl.ch Polo Ienne Polo.Ienne@epfl.ch Ecole Poltechnique Fédérle de Lusnne (EPFL) School of Computer nd Communiction

More information

METHOD OF LOCATION USING SIGNALS OF UNKNOWN ORIGIN. Inventor: Brian L. Baskin

METHOD OF LOCATION USING SIGNALS OF UNKNOWN ORIGIN. Inventor: Brian L. Baskin METHOD OF LOCATION USING SIGNALS OF UNKNOWN ORIGIN Inventor: Brin L. Bskin 1 ABSTRACT The present invention encompsses method of loction comprising: using plurlity of signl trnsceivers to receive one or

More information

Localization of Latent Image in Heterophase AgBr(I) Tabular Microcrystals

Localization of Latent Image in Heterophase AgBr(I) Tabular Microcrystals Interntionl ymposium on ilver Hlide Technology Locliztion of Ltent Imge in Heterophse AgBr(I) Tulr Microcrystls Elen V. Prosvirkin, Aigul B. Aishev, Timothy A. Lrichev, Boris A. echkrev Kemerovo tte University,

More information

Two-layer slotted-waveguide antenna array with broad reflection/gain bandwidth at millimetre-wave frequencies

Two-layer slotted-waveguide antenna array with broad reflection/gain bandwidth at millimetre-wave frequencies Two-lyer slotted-wveguide ntenn rry with rod reflection/gin ndwidth t millimetre-wve frequencies S.-S. Oh, J.-W. Lee, M.-S. Song nd Y.-S. Kim Astrct: A 24 24 slotted-wveguide rry ntenn is presented in

More information

Application Note. Differential Amplifier

Application Note. Differential Amplifier Appliction Note AN367 Differentil Amplifier Author: Dve n Ess Associted Project: Yes Associted Prt Fmily: CY8C9x66, CY8C7x43, CY8C4x3A PSoC Designer ersion: 4. SP3 Abstrct For mny sensing pplictions, desirble

More information

Network Theorems. Objectives 9.1 INTRODUCTION 9.2 SUPERPOSITION THEOREM

Network Theorems. Objectives 9.1 INTRODUCTION 9.2 SUPERPOSITION THEOREM M09_BOYL3605_13_S_C09.indd Pge 359 24/11/14 1:59 PM f403 /204/PH01893/9780133923605_BOYLSTAD/BOYLSTAD_NTRO_CRCUT_ANALYSS13_S_978013... Network Theorems Ojectives Become fmilir with the superposition theorem

More information

Patterns and Relationships

Patterns and Relationships Series Techer Ptterns nd Reltionships opyright 009 3P Lerning. All rights reserved. First edition printed 009 in Austrli. A ctlogue record for this ook is ville from 3P Lerning Ltd. ISBN 978-1-91860-3-4

More information

ABOUT THIS MANUAL ABOUT THIS MANUAL

ABOUT THIS MANUAL ABOUT THIS MANUAL ABOUT THIS MANUAL ABOUT THIS MANUAL This mnul provides detils on IQ Designer, which is ville with the upgrde. Mke sure tht the mchine hs een upgrded to the most recent version. When you find this icon

More information

ISSCC 2006 / SESSION 21 / ADVANCED CLOCKING, LOGIC AND SIGNALING TECHNIQUES / 21.5

ISSCC 2006 / SESSION 21 / ADVANCED CLOCKING, LOGIC AND SIGNALING TECHNIQUES / 21.5 21.5 A 1.1GHz Chrge-Recovery Logic Visvesh Sthe, Jung-Ying Chueh, Mrios Ppefthymiou University of Michign, Ann Aror, MI Boost Logic is chrge-recovery circuit fmily cple of operting t GHz-clss frequencies

More information

Engineer-to-Engineer Note

Engineer-to-Engineer Note Engineer-to-Engineer Note EE-236 Technicl notes on using Anlog Devices DSPs, processors nd development tools Contct our technicl support t dsp.support@nlog.com nd t dsptools.support@nlog.com Or visit our

More information

Francis Gaspalou Second edition of February 10, 2012 (First edition on January 28, 2012) HOW MANY SQUARES ARE THERE, Mr TARRY?

Francis Gaspalou Second edition of February 10, 2012 (First edition on January 28, 2012) HOW MANY SQUARES ARE THERE, Mr TARRY? Frncis Gslou Second edition of Ferury 10, 2012 (First edition on Jnury 28, 2012) HOW MANY SQUARES ARE THERE, Mr TARRY? ABSTRACT In this er, I enumerte ll the 8x8 imgic sures given y the Trry s ttern. This

More information

Asynchronous Data-Driven Circuit Synthesis

Asynchronous Data-Driven Circuit Synthesis Asynchronous Dt-Driven Circuit Synthesis Sm Tylor, Doug Edwrds, Luis A Pln, Senior Memer, IEEE nd Luis A. Trzon D., Student Memer, IEEE Astrct A method is descried for synthesising synchronous circuits

More information

Efficient and Resilient Key Discovery based on Pseudo-Random Key Pre-Deployment

Efficient and Resilient Key Discovery based on Pseudo-Random Key Pre-Deployment Efficient nd Resilient Key Discovery sed on Pseudo-Rndom Key Pre-Deployment p. 1 Efficient nd Resilient Key Discovery sed on Pseudo-Rndom Key Pre-Deployment Roerto Di Pietro, Luigi V. Mncini, nd Alessndro

More information

Lecture 20. Intro to line integrals. Dan Nichols MATH 233, Spring 2018 University of Massachusetts.

Lecture 20. Intro to line integrals. Dan Nichols MATH 233, Spring 2018 University of Massachusetts. Lecture 2 Intro to line integrls Dn Nichols nichols@mth.umss.edu MATH 233, Spring 218 University of Msschusetts April 12, 218 (2) onservtive vector fields We wnt to determine if F P (x, y), Q(x, y) is

More information

THE STUDY OF INFLUENCE CORE MATERIALS ON TECHNOLOGICAL PROPERTIES OF UNIVERSAL BENTONITE MOULDING MATERIALS. Matej BEZNÁK, Vladimír HANZEN, Ján VRABEC

THE STUDY OF INFLUENCE CORE MATERIALS ON TECHNOLOGICAL PROPERTIES OF UNIVERSAL BENTONITE MOULDING MATERIALS. Matej BEZNÁK, Vladimír HANZEN, Ján VRABEC THE STUDY OF INFLUENCE CORE MATERIALS ON TECHNOLOGICAL PROPERTIES OF UNIVERSAL BENTONITE MOULDING MATERIALS Mtej BEZNÁK, Vldimír HANZEN, Ján VRABEC Authors: Mtej Beznák, Assoc. Prof. PhD., Vldimír Hnzen,

More information

Performance Comparison between Network Coding in Space and Routing in Space

Performance Comparison between Network Coding in Space and Routing in Space Performnce omprison etween Network oding in Spce nd Routing in Spce Yunqing Ye, Xin Hung, Ting Wen, Jiqing Hung nd lfred Uwitonze eprtment of lectronics nd Informtion ngineering, Huzhong University of

More information

Direct Current Circuits. Chapter Outline Electromotive Force 28.2 Resistors in Series and in Parallel 28.3 Kirchhoff s Rules 28.

Direct Current Circuits. Chapter Outline Electromotive Force 28.2 Resistors in Series and in Parallel 28.3 Kirchhoff s Rules 28. P U Z Z L E R If ll these pplinces were operting t one time, circuit reker would proly e tripped, preventing potentilly dngerous sitution. Wht cuses circuit reker to trip when too mny electricl devices

More information

Digital Design. Sequential Logic Design -- Controllers. Copyright 2007 Frank Vahid

Digital Design. Sequential Logic Design -- Controllers. Copyright 2007 Frank Vahid Digitl Design Sequentil Logic Design -- Controllers Slides to ccompny the tetook Digitl Design, First Edition, y, John Wiley nd Sons Pulishers, 27. http://www.ddvhid.com Copyright 27 Instructors of courses

More information

Analysis of Coding-aware MAC Protocols based on Reverse Direction Protocol for IEEE based Wireless Networks using Network Coding*

Analysis of Coding-aware MAC Protocols based on Reverse Direction Protocol for IEEE based Wireless Networks using Network Coding* Anlysis of oding-wre MA Protocols sed on Reverse irection Protocol for IEEE 8.-sed Wireless Networks using Network oding* Rul Plcios, Htegereil Kssye Hile, Jesus Alonso-Zrte nd Frizio Grnelli {plciostrujillo,

More information

ABB STOTZ-KONTAKT. ABB i-bus EIB Current Module SM/S Intelligent Installation Systems. User Manual SM/S In = 16 A AC Un = 230 V AC

ABB STOTZ-KONTAKT. ABB i-bus EIB Current Module SM/S Intelligent Installation Systems. User Manual SM/S In = 16 A AC Un = 230 V AC User Mnul ntelligent nstlltion Systems A B 1 2 3 4 5 6 7 8 30 ma 30 ma n = AC Un = 230 V AC 30 ma 9 10 11 12 C ABB STOTZ-KONTAKT Appliction Softwre Current Vlue Threshold/1 Contents Pge 1 Device Chrcteristics...

More information

Domination and Independence on Square Chessboard

Domination and Independence on Square Chessboard Engineering nd Technology Journl Vol. 5, Prt, No. 1, 017 A.A. Omrn Deprtment of Mthemtics, College of Eduction for Pure Science, University of bylon, bylon, Irq pure.hmed.omrn@uobby lon.edu.iq Domintion

More information

LATEST CALIBRATION OF GLONASS P-CODE TIME RECEIVERS

LATEST CALIBRATION OF GLONASS P-CODE TIME RECEIVERS LATEST CALIBRATION OF GLONASS P-CODE TIME RECEIVERS A. Fos 1, J. Nwroci 2, nd W. Lewndowsi 3 1 Spce Reserch Centre of Polish Acdemy of Sciences, ul. Brtyc 18A, 00-716 Wrsw, Polnd; E-mil: fos@c.ww.pl; Tel.:

More information

Polar Coordinates. July 30, 2014

Polar Coordinates. July 30, 2014 Polr Coordintes July 3, 4 Sometimes it is more helpful to look t point in the xy-plne not in terms of how fr it is horizontlly nd verticlly (this would men looking t the Crtesin, or rectngulr, coordintes

More information

Lab 8. Speed Control of a D.C. motor. The Motor Drive

Lab 8. Speed Control of a D.C. motor. The Motor Drive Lb 8. Speed Control of D.C. motor The Motor Drive Motor Speed Control Project 1. Generte PWM wveform 2. Amplify the wveform to drive the motor 3. Mesure motor speed 4. Mesure motor prmeters 5. Control

More information

Development and application of a patent-based design around. process

Development and application of a patent-based design around. process Authors: Yeh-Ling Hsu, Po-Er Hsu, Yung-Chieh Hung, Y-Dn Xio (--4); recommended: Yeh-Ling Hsu (-6-9). Note: his pper is presented t the st Interntionl Conference on Systemtic Innovtion, Hsinchu, iwn, Jnury.

More information

Design and implementation of a high-speed bit-serial SFQ adder based on the binary decision diagram

Design and implementation of a high-speed bit-serial SFQ adder based on the binary decision diagram INSTITUTE OFPHYSICS PUBLISHING Supercond. Sci. Technol. 16 (23) 1497 152 SUPERCONDUCTORSCIENCE AND TECHNOLOGY PII: S953-248(3)67111-3 Design nd implementtion of high-speed it-seril SFQ dder sed on the

More information

Experiment 3: Non-Ideal Operational Amplifiers

Experiment 3: Non-Ideal Operational Amplifiers Experiment 3: Non-Idel Opertionl Amplifiers Fll 2009 Equivlent Circuits The bsic ssumptions for n idel opertionl mplifier re n infinite differentil gin ( d ), n infinite input resistnce (R i ), zero output

More information

EET 438a Automatic Control Systems Technology Laboratory 5 Control of a Separately Excited DC Machine

EET 438a Automatic Control Systems Technology Laboratory 5 Control of a Separately Excited DC Machine EE 438 Automtic Control Systems echnology bortory 5 Control of Seprtely Excited DC Mchine Objective: Apply proportionl controller to n electromechnicl system nd observe the effects tht feedbck control

More information

Re: PCT Minimum Documentation: Updating of the Inventory of Patent Documents According to PCT Rule 34.1

Re: PCT Minimum Documentation: Updating of the Inventory of Patent Documents According to PCT Rule 34.1 C. SCIT 2508 00 August 10, 2000 Re: PCT Minimum Documenttion: Updting of the Inventory of Ptent Documents According to PCT Rule 34.1 Sir, Mdm, The current version of the Inventory of Ptent Documents for

More information

Fitting & User Instructions

Fitting & User Instructions Issue 03 Alexnder Universl Furniture Risers Note: These frmes require the ddition of n ttchment. (See pges 4 - ) Bse Frme Mrk Bse Frme Mrk Fitting & User Instructions These instructions pply to the rising

More information

Understanding Basic Analog Ideal Op Amps

Understanding Basic Analog Ideal Op Amps Appliction Report SLAA068A - April 2000 Understnding Bsic Anlog Idel Op Amps Ron Mncini Mixed Signl Products ABSTRACT This ppliction report develops the equtions for the idel opertionl mplifier (op mp).

More information

Basic Embroidery. Chapter 5 COMBINING PATTERNS BEFORE EMBROIDERING PREPARING THE FABRIC SELECTING PATTERNS...

Basic Embroidery. Chapter 5 COMBINING PATTERNS BEFORE EMBROIDERING PREPARING THE FABRIC SELECTING PATTERNS... Chpter 5 Bsic Emroidery BEFORE EMBROIDERING... 196 Emroidery Step y Step...196 Attching Emroidery Foot W+ with LED pointer...197 Attching the Emroidery Unit...197 Aout the Emroidery Unit... 197 Removing

More information

A New Algorithm to Compute Alternate Paths in Reliable OSPF (ROSPF)

A New Algorithm to Compute Alternate Paths in Reliable OSPF (ROSPF) A New Algorithm to Compute Alternte Pths in Relile OSPF (ROSPF) Jin Pu *, Eric Mnning, Gholmli C. Shoj, Annd Srinivsn ** PANDA Group, Computer Science Deprtment University of Victori Victori, BC, Cnd Astrct

More information

Experiment 8 Series DC Motor (II)

Experiment 8 Series DC Motor (II) Ojectives To control the speed of loded series dc motor y chnging rmture voltge. To control the speed of loded series dc motor y dding resistnce in prllel with the rmture circuit. To control the speed

More information

Improved Ensemble Empirical Mode Decomposition and its Applications to Gearbox Fault Signal Processing

Improved Ensemble Empirical Mode Decomposition and its Applications to Gearbox Fault Signal Processing IJCSI Interntionl Journl of Computer Science Issues, Vol. 9, Issue, No, Novemer ISSN (Online): 9- www.ijcsi.org 9 Improved Ensemle Empiricl Mode Decomposition nd its Applictions to Gerox Fult Signl Processing

More information
