Stochastic Policy Gradient Reinforcement Learning on a Simple 3D Biped

Size: px
Start display at page:

Download "Stochastic Policy Gradient Reinforcement Learning on a Simple 3D Biped"

Transcription

1 Stochastc Polcy Gradent Renforcement Learnng on a Smple 3D Bped Russ Tedrake Computer Scence & Artfcal Intellgence Lab, Center for Bts & Atoms Massachusetts Insttute of Technology Cambrdge, MA 0239 Emal: russt@a.mt.edu Teresa Weru Zhang Department of Mechancal Engneerng Bran & Cogntve Scences Massachusetts Insttute of Technology Cambrdge, MA 0239 Emal: resa@mt.edu H. Sebastan Seung Howard Hughes Medcal Insttute Bran & Cogntve Scences Center for Bts & Atoms Massachusetts Insttute of Technology Cambrdge, MA 0239 Emal: seung@mt.edu Abstract We present a learnng system whch s able to quckly and relably acqure a robust feedback control polcy for 3D dynamc walkng from a blank-slate usng only trals mplemented on our physcal robot. The robot begns walkng wthn a mnute and learnng converges n approxmately 20 mnutes. Ths success can be attrbuted to the mechancs of our robot, whch are modeled after a passve dynamc walker, and to a dramatc reducton n the dmensonalty of the learnng problem. We reduce the dmensonalty by desgnng a robot wth only 6 nternal degrees of freedom and 4 actuators, by decomposng the control system n the frontal and sagttal planes, and by formulatng the learnng problem on the dscrete return map dynamcs. We apply a stochastc polcy gradent algorthm to ths reduced problem and decrease the varance of the update usng a state-based estmate of the expected cost. Ths optmzed learnng system works quckly enough that the robot s able to contnually adapt to the terran as t walks. I. INTRODUCTION Recent advances n bpedal walkng technology have produced robots capable of leavng the laboratory envronment to nteract wth the unknown and uncertan envronments of the real world. Despte our best efforts, t s unlkely that we wll be able to preprogram these robots for every possble stuaton wthout sacrfcng performance. Endowng our robots wth the ablty to learn from experence and adapt to ther envronment seems crtcal for the success of any real world robot. Dynamc bpedal walkng s dffcult to learn for a number of reasons. Frst, walkng robots typcally have many degrees of freedom, whch can cause a combnatoral exploson for learnng systems that attempt to optmze performance n every possble confguraton of the robot. Second, detals of the robot dynamcs such as uncertantes n the ground contact and nonlnear frcton n the jonts are dffcult to model well n smulaton, makng t unlkely that a controller optmzed n a smulaton wll perform optmally on the real robot. Snce t s only practcal to run a small number of learnng trals on the real robot, the learnng algorthms must perform well after obtanng a very lmted amount of data. Fnally, learnng algorthms for dynamc walkng must deal wth dynamc dscontnutes caused by collsons wth the ground and wth the problem of delayed reward - torques appled at one tme may have an effect on the performance many steps nto the future. Although there s a great deal of lterature on learnng control for dynamcally walkng bpedal robots, there are relatvely few examples of learnng algorthms actually mplemented on the robot or whch work quckly enough to allow the robot to adapt onlne to changng terran. Some researchers attempt to learn a controller n smulaton that s robust enough to run on the real robot [], treatng dfferences between the smulaton and the robot as dsturbances. The Unversty of New Hampshre bpeds were two early examples of onlne learnng whch acqured a basc gat by tunng parameters n a hand-desgned controller ([2], [3]). In ths paper we generalze these results to obtanng a controller from scratch nstead of tunng an exstng controller. Learnng control has also been successfully mplemented on Sony s quadrupedal robot AIBO (.e., [4]). The learned controllers for AIBO are open-loop trajectores, but trajectory feedback s essental for robust, dynamc, bpedal walkng. In order to study learnng feedback control for walkng, we performed our ntal experments on a smplfed robot whch captures the essence of dynamc walkng but whch mnmzes many of the complcatons. Our robot has only 6 nternal degrees of freedom and 4 actuators. The mechancal desgn of our robot s based on a passve dynamc walker ([5], [6]). Ths allows us to solve a porton of the control problem n the mechancal desgn, and makes the robot mechancally very stable; most polces n our search space result n ether stable walkng or faled walkng where the robot ends up smply standng stll. The learnng on our robot s performed by a polcy gradent renforcement learnng algorthm ([7], [8], [9]). The goal of ths paper s to descrbe our formulaton of the learnng problem and the algorthm that we use to solve t. We nclude our expermental results on ths smplfed bped, and dscuss the possblty of applyng the same algorthm to a more complcated walkng system. The standard for 3D bpeds s to have at least 2 nternal degrees of freedom and 2 actuators n the legs

2 II. THE ROBOT Fg.. The robot on the left s a smple passve dynamc walker. The robot on the rght s our actuated verson of the same robot. The passve dynamc walker shown on the left n Fgure represents the smplest machne that we could buld whch captures the essence of stable dynamc walkng n three dmensons. It has only a sngle passve pn jont at the hp. When placed at the top of a small ramp and gven a push sdeways, the walker wll begn fallng down the ramp and eventually converge to a stable lmt cycle trajectory that has been compared to the waddlng gat of a pengun [0]. The energetcs of ths passve walker are common to all passve walkers: energy lost due to frcton and collsons when the swng leg returns to the ground s balanced by the gradual converson of potental energy nto knetc energy as the walker moves down the slope. The mechancal desgn of ths robot and some expermental stablty results are presented n []. We desgned our learnng robot by addng a small number of actuators to ths passve desgn. The robot shown on the rght n fgure, whch s also descrbed n [], has passve jonts at the hp and 2 degrees of actuaton (roll and ptch) at each ankle. The ankle actuators are poston controlled servo motors whch, when commanded to hold ther zero poston, allow the actuated robot to walk stably down a small ramp, smulatng the passve walker. The shape of the large, curved feet s desgned to make the robot walk passvely at 0.8Hz, and to take steps of approxmately 6.5 cm when walkng down a ramp of 0.03 radans. The robot stands 44 cm tall and weghs approxmately 2.9 kg, whch ncludes the CPU and batteres that are carred on-board. The most recent addtons to ths robot are the passve arms, whch are mechancally coupled to the opposte leg to provde mechancal yaw compensaton. When placed on flat terran, the passve walker waddles back and forth, slowly losng energy, untl t comes to rest standng stll. In order to acheve stable walkng on flat terran, the actuators on our learnng robot must restore energy nto the system that would have been restored by gravty when walkng down a slope. III. THE LEARNING PROBLEM The goal of learnng s to acqure a feedback control polcy whch makes the robot s gat nvarant to small slopes. In total, the system has 9 degrees of freedom 2, and the equatons of moton can be wrtten n the form where H(q) q + C(q, q) q + G(q) = τ + D(t), () q =[θ yaw, θ lp tch, θ bp tch, θ rp tch, θ roll, θ raroll, θ laroll, θ rap tch, θ lap tch ] T, τ =[0, 0, 0, 0, τ raroll, τ laroll, τ rap tch, τ lap tch ] T. H s the state dependent nertal matrx, C contans nteracton torques between the lnks, G represents the effect of gravty, τ are the motor torques, and D are random dsturbances to the system. Our shorthand lp tch, bp tch, and rp tch refer to left leg ptch, body ptch, and rght leg ptch, respectvely. raroll, laroll, rap tch, and lap tch are short for rght and left ankle roll and ptch. The actual output of the controller s a motor command vector u = [u raroll, u laroll, u rap tch, u lap tch ] T, whch generates torques τ = h(q, q, u). The functon h descrbes the lnear feedback controller mplemented by the servo boards and the nonlnear knematc transformaton nto jont torques. The robot uses a determnstc feedback control polcy whch s represented usng a lnear functon approxmator parameterzed by vector w and usng nonlnear features φ: u = π w (ˆx) = [ q q] w φ (ˆx), wth x =. (2) The notaton ˆx represents a nosy estmate of the state x. Before learnng, w s ntalzed to all zeros, makng the polcy outputs zero everywhere, so that the robot smulates the passve walker. To quantfy the stablty of our nonlnear, stochastc, perodc trajectory, we consder the dynamcs on the return map, taken around the pont where θ roll = 0 and θ roll > 0. The return map dynamcs are a Markov random sequence wth the probablty at the (n + )th crossng of the return map gven by f w (x, x) = P ˆX(n + ) = x ˆX(n) = x, W(n) = w. (3) f w (x, x) represents the probablty densty functon over the state space whch contans the dynamcs n equatons and 2 ntegrated over one cycle. We do not make any assumptons about ts form, except that t s Markov. Note that the element of f w representng θ roll s the delta functon, ndependent of 2 6 nternal DOFs and 3 DOFs for the robot s orentaton. We assume that the robot s always n contact wth the ground at a sngle pont, and nfer the robot s absolute (x, y) poston n space drectly from the remanng varables.

3 x. The return map dynamcs are represented as a Markov chan that depends on the parameter vector w nstead of the equvalent Markov decson process for smplfcaton because the feedback controller s evaluated many tmes durng a sngle step (our controller runs at 00 Hz and our robot steps at around 0.8 Hz). The stochastcty n f w comes from the random dsturbances D(t) and the state estmaton error, ˆx x. The cost functon for learnng uses a constant desred value, x d, on the return map: g(x) = 2 x xd 2. (4) Ths desred value can be consdered a reference trajectory on the return map, and s taken from the gat of the walker down a slope of 0.03 radans; no reference trajectory s requred for the lmt cycle between steps. For a gven trajectory ˆx = [ˆx(0), ˆx(),..., ˆx(N)], we defne the average cost G(ˆx) = N N g(ˆx(n)). (5) Our goal s to fnd the parameter vector w whch mnmzes lm E G(ˆx). (6) By mnmzng ths error, we are effectvely mnmzng the egenvalues of return map, and maxmzng the stablty of the desred lmt cycle. IV. THE LEARNING ALGORITHM The learnng algorthm s a statstcal algorthm whch makes small changes to the control parameters w on each step and uses correlatons between changes n w and changes n the return map error to clmb the performance gradent. Ths can be accomplshed wth a very smple onlne learnng rule whch changes w wth each step that the robot takes. The partcular algorthm that we present here was orgnally proposed by [7]. We present a thorough dervaton of ths algorthm n the next secton. The algorthm makes use of an ntermedate representaton whch we call the value functon, J(x). The value of state x s the expected average cost to be ncurred by followng polcy π w startng from state x: J(x) = lm N N g(x(n)), wth x(0) = x. Ĵ v (x) s an estmate of the value functon parameterzed by vector v. Ths value estmate s represented n another functon approxmator: Ĵ v (ˆx) = v ψ (ˆx). (7) Durng learnng, we add stochastcty to our determnstc control polcy by varyng w. Let Z(n) be a Gaussan random vector wth EZ (n) = 0 and EZ (n)z j (n ) = σ 2 δ j δ nn. Durng the nth step that the robot takes, we evaluate the controller usng the parameter vector w (n) = w(n) + z(n). The algorthm uses a storage varable, e(n), whch we call the elgblty trace. We begn wth w(0) = e(0) = 0. At the end of the nth step, we make the updates: δ(n) =g (ˆx(n)) + γĵv (ˆx(n + )) Ĵv (ˆx(n)) (8) e (n) =γe (n ) + b (n)z (n) (9) w (n) = η w δ(n)e (n) (0) v (n) =η v δ(n)ψ (ˆx(n)). () η w 0 and η v 0 are the learnng rates and γ s the dscount factor of the elgblty trace, whch wll be dscussed n more detal n the algorthm dervaton. b (n) s a boolean one step elgblty, whch s f the parameter w s actvated (φ (ˆx) > 0) at any pont durng step n and 0 otherwse. δ(n) s called the one step temporal dfference error. The algorthm can be understood ntutvely. On each step the robot receves some cost g(ˆx(n)). Ths cost s compared to cost that we expect to receve, as estmated by Ĵv(x). If the cost s lower than expected, then ηδ(n) s postve, so we add a scaled verson of the nose terms, z, nto w. Smlarly, f the cost s hgher than expected, then we move n the opposte drecton. Ths smple onlne algorthm performs approxmate stochastc gradent descent on the expected value of the average nfnte-horzon cost. V. ALGORITHM DERIVATION The expected value of the average cost, G, s gven by: EG(ˆx) = G(ˆx)P w ˆX = ˆxdˆx. The probablty of trajectory ˆx s ˆx P w ˆX = ˆx = P ˆX(0) N = ˆx(0) f w (ˆx(n + ), ˆx(n)). Takng the gradent of EG(ˆx) wth respect to w we fnd EG(ˆx) = G(ˆx) P w ˆX = ˆxdˆx w ˆx w = G(ˆx)P w ˆX = ˆx log P w ˆX = ˆxdˆx ˆx w =E G(ˆx) log P w ˆX = ˆx w ( N ) =E G(ˆx) log f w (ˆx(m + ), ˆx(m)) w Recall that f w (x, x) s a complcated functon whch ncludes the ntegrated dynamcs of the controller and the robot. Nevertheless, w log f w s smply: log f w (x (m + ), x(m)) = w [ log P w ˆX = x ˆX ] = x, W = w P w W = w = log P w W (m) = w = z (m) w σ 2

4 Substtutng, we have EG(ˆx) = w Nσ 2 E = Nσ 2 E ( N ) ( N ) g(ˆx(n)) z (m) n= N g(ˆx(n)) n b (m)z (m). Ths fnal reducton s based on the observaton that Eg(ˆx(n)) N m=n z (m) = 0 (nose added to the controller on or after step n s not correlated to the cost at step n). Smlarly, random changes to a weght that s not used durng the nth step (b (m) = 0) have zero expectaton, and can be excluded from the sum. Observe that the varance of ths gradent estmate grows wthout bound as N [8]. To bound the varance, we use a based estmate of ths gradent whch artfcally dscounts the elgblty trace: EG(ˆx) w N Nσ 2 E n g(ˆx(n)) γ n m b (m)z (m) = N Nσ 2 E g(ˆx(n))e (n), wth 0 γ. The dscount factor γ parameterzes the bas-varance trade-off. Next, observe that we can subtract any mean zero baselne from ths quantty wthout effectng the expected value of our estmate [2]. Includng ths baselne can dramatcally mprove the performance of our algorthm because t can reduce the varance of our gradent estmate. In partcular, we subtract a mean-zero term contanng an estmate of the value functon as recommended by [7]: lm EG(ˆx) w N lm N E g(x(n))e (n) = lm N E N g(x(n))e (n)+ N N ˆV π (x(n))z (n) N ( z (n) γ m n γ ˆV π (x(m + )) ˆV ) π (x(m)) m=n N = lm N E g(x(n))e (n)+ N ( γ ˆV π (x(n + )) ˆV ) π (x(n)) e (n) N = lm N E δ(n)e (n) By ths dervaton, we can see that the average of the weght update gven n equatons 8- s n the drecton of the performance gradent: lm N N E w (n) η lm VI. LEARNING IMPLEMENTATION w EG(ˆx). In our ntal mplementaton of the algorthm, we decded to further smplfy the problem by decomposng the control n the frontal and sagttal planes. In ths decomposton, the ankle roll actuators are responsble for stablzng the oscllatons of the robot n the frontal plane. The ankle ptch actuators cause the robot to lean forward or backward, whch moves the poston of the center of mass relatve to the ground contact pont on the foot. Because the hp jont on our robot s passve, f the center of mass s n front of the ground contact when the swng foot leaves the ground, then the robot wll begn to walk forward. The dstance of between the center of mass and the ground contact s monotoncally related to the step sze and to the walkng speed. Due to the smplcty of the sagttal plane control, we only need to learn a control polcy for the two ankle roll actuators whch stablze the roll oscllaton n the frontal plane. Ths strategy wll change as the robot walks at dfferent speeds, but we hope the learnng algorthm wll adapt quckly enough to compensate for those dfferences. Wth these smplfcatons n mnd, we constran the feedback polcy to be a functon of only two varables: θ roll and θ roll. The choce of these two varables s not arbtrary; they are the only varables that we use when wrtng a nonlearnng feedback controller that stablzes the oscllaton. We also constran the polcy to be symmetrc - the controller for the left ankle s smply a mrror mage of the controller for the rght ankle. Therefore, the learned control polcy only has a sngle output. The value functon s approxmated as a functon of only a sngle varable: θroll. Ths very low dmensonalty allows the algorthm to tran very quckly. The control polcy and value functons are both represented usng lnear functon approxmators of the form descrbed n Equatons 2 and 7, whch are fast and very convenent to ntalze. We use a non-overlappng tle-codng for our approxmator bass functons: 35 tles for the polcy (5 n θ roll 7 n θ roll ) and tles for the value functon. In order to make the robot explore the state space durng learnng, we hand-desgned a smple controller to place the robot n random ntal condtons on the return map. The random dstrbuton s based accordng to the dstrbuton of ponts that the robot has already experenced on the return map - the most lkely ntal condton s the state that the robot experenced least often. We use ths controller to randomly rentalze the robot every tme that t comes to a halt standng stll, or every 0 seconds, whchever comes frst. Ths heurstc makes the dstrbuton on the return map more unform, and ncreases the lkelhood of the algorthm convergng on the same polcy each tme that t learns from a blank slate.

5 VII. EXPERIMENTAL RESULTS When the learnng begns, the polcy parameters, w, are set to 0 and the baselne parameters, v, are ntalzed so that Ĵ v (x) g(x) γ. We typcally tran the robot on flat terran usng short trals wth random ntal condtons. Durng the frst few trals, the polcy does not restore suffcent energy nto the system, and the robot comes to a halt standng stll. Wthn a mnute of tranng, the robot acheves foot clearance on nearly every step; ths s the mnmal defnton of walkng on ths system. The learnng easly converges to a robust gat wth the desred fxed pont on the return map wthn 20 mnutes (approxmately 960 steps at 0.8 Hz). Error obtaned durng learnng depends on the random ntal condtons of each tral, and s therefore a very nosy stochastc varable. For ths reason, n Fgure 2 we plot a typcal learnng curve n terms of the average error per step. Fgure 3 plots a typcal trajectory of the learned controller walkng on flat terran. Fgure 4 dsplays the fnal polcy. learnng (w = 0) and after 000 steps. In general, the return map for our 9 DOF robot s 7 dmensonal (9 states + 9 dervatves - ), and the projecton of these dynamcs onto a sngle dmenson s dffcult to nterpret. The plots n Fgure 5 where made wth the robot walkng n place on flat terran. In ths partcular stuaton, most of the return map varables are close to zero throughout the dynamcs, and a two dmensonal return map captures the desred dynamcs. As expected, before learnng the return map llustrates a sngle fxed pont at θ roll = 0, whch means the robot s standng stll. After learnng, we obtan a sngle fxed pont at the desred value ( θ roll =.0 radans / second), and the basn of attracton of ths fxed pont extends over the entre doman that we tested. On the rare occason that the robot falls over, the system does not return to the map and stops producng ponts on ths graph. Fg. 5. Expermental return maps, before (left) and after (rght) learnng. Fxed ponts exst at the ntersectons of the return map (blue) and the lne of slope one (red). Fg. 2. A typcal learnng curve, plotted as the average error on each step. seconds Fg. 3. θ roll trajectory of the robot startng from standng stll. Fg. 4. Learned feedback control polcy u raroll = π w(ˆx). In Fgure 5 we plot the return maps of the system before To quantfy the stablty of the learned controller, we measure the egenvalues of the return map. Lnearzng around the fxed pont n Fgure 5 suggests that the system has a sngle egenvalue of 0.5. To obtan the egenvalues of the return map when the robot s walkng, we run the robot from a large number of ntal condtons and record the return map trajectores ˆx (n), 9 vectors whch represent the state of the system (wth fxed ankles) on the nth crossng of the th tral. For each tral we estmate ˆx ( ), the equlbrum of the return map. Fnally, we perform a least squares ft of the matrx A to satsfy the relaton [ˆx (n + ) ˆx ( )] = A[ˆx (n) ˆx ( )]. The egenvalues of A for the learned controller and for our hand-desgned controllers (descrbed n []) are: Controller Egenvalues Passve walkng 0.88 ± 0.0, 0.75, 0.66 ± 0.03, (63 trals) 0.54, 0.36, 0.32 ± 0.3 Hand-desgned 0.80, 0.60, 0.49 ± 0.04, 0.36, feed-forward (89 trals) 0.25, 0.20 ± 0.0, 0.0 Hand-desgned 0.78, 0.69 ± 0.03, 0.36, 0.25, feedback (58 trals) 0.20 ± 0.0, 0.0 Learned feedback 0.74 ± 0.05, 0.53 ± 0.09, 0.43, (42 trals) 0.30 ± 0.02, 0.5, 0.07 All of these experments were on flat terran except the passve walkng, whch was on a slope of radans. The

6 convergence of the system to the nomnal trajectory s largely governed by the largest egenvalues. Ths analyss suggests that our learned controller converges to the steady state trajectory more quckly that the passve walker on a ramp and more quckly than any of our hand-desgned controllers. Our stochastc polcy gradent algorthm solves the temporal credt assgnment problem n by accumulatng the elgblty wthn a step and dscountng elgblty between steps. Interestly, our algorthm performs best wth heavy dscountng between steps (0 γ 0.2). Ths suggests that our one dmensonal value estmate does a good job of solatng the credt assgnment to a sngle step. Whle t took a few mnutes to learn a controller from a blank slate, adjustng the learned controller to adapt to small changes n the terran appears to happen very quckly. The non-learnng controllers requre constant attenton and small manual changes to the parameters as the robot walks down the hall, on tles, and on carpet. The learnng controller easly adapts to these stuatons. VIII. DISCUSSION Desgnng our robot lke a passve dynamc walker changes the learnng problem n a number of ways. It allows us to learn a polcy wth only a sngle output whch controlled a 9 DOF system, and allows us to formulate the problem on the return map dynamcs. It also dramatcally ncreases the number of polces n the search space whch could generate stable walkng. The learnng algorthm works extremely well on ths smple robot, but wll the technque scale to more complcated robots? One factor n our success was the formulaton of the learnng problem on the dscrete dynamcs of the return map nstead of the contnuous dynamcs along the entre trajectory. Ths formulaton reles on the fact that our passve walker produces perodc trajectores even before the learnng begns. It s possble for passve walkers to have knees and arms [3], or on a more tradtonal humanod robot ths algorthm could be used to augment and mprove and exstng walkng controller whch produces nomnal walkng trajectores. As the number of degrees of freedom ncreases, the stochastc polcy gradent algorthm may have problems wth scalng. The algorthm correlates changes n the polcy parameters wth changes n the performance on the return map. As we add degrees of freedom, the assgnment of credt to a partcular actuator wll become more dffcult, requrng more learnng trals to obtan a good estmate of the correlaton. Ths scalng problem s an open and nterestng research queston and a prmary focus of our current research. IX. CONCLUSIONS We have presented a learnng formulaton and learnng algorthm whch works very well on our smplfed 3D dynamc bped. The robot begns to walk after only one mnute of learnng from a blank slate, and the learnng converges to the desred trajectory n less than 20 mnutes. Ths learned controller s quantfably more stable, usng the egenvalues of the return map, than any controller we were able to derve for the same robot by hand. Once the controller s learned, to robot s able to quckly adapt to small changes n the terran. Buldng a robot to smplfy the learnng allowed us to gan some practcal nsghts nto the learnng problem for dynamc bpedal locomoton. Implementng these algorthms on the real robot proved to be a very dfferent problem than workng n smulaton. We would lke to take two basc drectons to contnue ths research. Frst, we are removng many of the smplfyng assumptons used n ths paper (such as the decomposed control polcy) to better approxmate optmal walkng on ths smple platform and to test our learnng controller s ablty to compensate for rough terran. Second, we are scalng these results up to more sophstcated bpeds, ncludng a passve dynamc walker wth knees and humanods that already have a basc control system n place. ACKNOWLEDGMENTS Ths work was supported by the Davd and Luclle Packard Foundaton (contract 99-47), the Natonal Scence Foundaton (grant CCR-02249). Specal thanks to Mng-fa Fong and Derrck Tan for ther help wth desgnng and buldng the expermental platform. REFERENCES [] J. Mormoto and C. Atkeson, Mnmax dfferental dynamc programmng: An applcaton to robust bped walkng. Neural Informaton Processng Systems, [2] W. T. Mller, III, Real-tme neural network control of a bped walkng robot, IEEE Control Systems Magazne, vol. 4, no., pp. 4 48, Feb 994. [3] H. Benbrahm and J. A. Frankln, Bped dynamc walkng usng renforcement learnng, Robotcs and Autonomous Systems, vol. 22, pp , 997. [4] N. Kohl and P. Stone, Polcy gradent renforcement learnng for fast quadrupedal locomoton. IEEE Internatonal Conference on Robotcs and Automaton, [5] T. McGeer, Passve dynamc walkng, Internatonal Journal of Robotcs Research, vol. 9, no. 2, pp , Aprl 990. [6] M. J. Coleman and A. Runa, An uncontrolled toy that can walk but cannot stand stll, Physcal Revew Letters, vol. 80, no. 6, pp , Aprl 998. [7] H. Kmura and S. Kobayash, An analyss of actor/crtc algorthms usng elgblty traces: Renforcement learnng wth mperfect value functons. Internatonal Conference on Machne Learnng (ICML 98), 998, pp [8] J. Baxter and P. Bartlett, Infnte-horzon polcy-gradent estmaton, Journal of Artfcal Intellgence Research, vol. 5, pp , 200. [9] R. S. Sutton, D. McAllester, S. Sngh, and Y. Mansour, Polcy gradent methods for renforcement learnng wth functon approxmaton. Advances n Neural Informaton Processng Systems, 999. [0] J. E. Wlson, Walkng toy, Unted States Patent Offce, Tech. Rep., October [] R. Tedrake, T. W. Zhang, M. Fong, and H. S. Seung, Actuatng a smple 3d passve dynamc walker. IEEE Internatonal Conference on Robotcs and Automaton, [2] R. Wllams, Smple statstcal gradent-followng algorthms for connectonst renforcement learnng, Machne Learnng, vol. 8, pp , 992. [3] S. H. Collns, M. Wsse, and A. Runa, A three-dmensonal passve-dynamc walkng robot wth two legs and knees, Internatonal Journal of Robotcs Research, vol. 20, no. 7, pp , July 200.

Dynamic Optimization. Assignment 1. Sasanka Nagavalli January 29, 2013 Robotics Institute Carnegie Mellon University

Dynamic Optimization. Assignment 1. Sasanka Nagavalli January 29, 2013 Robotics Institute Carnegie Mellon University Dynamc Optmzaton Assgnment 1 Sasanka Nagavall snagaval@andrew.cmu.edu 16-745 January 29, 213 Robotcs Insttute Carnege Mellon Unversty Table of Contents 1. Problem and Approach... 1 2. Optmzaton wthout

More information

To: Professor Avitabile Date: February 4, 2003 From: Mechanical Student Subject: Experiment #1 Numerical Methods Using Excel

To: Professor Avitabile Date: February 4, 2003 From: Mechanical Student Subject: Experiment #1 Numerical Methods Using Excel To: Professor Avtable Date: February 4, 3 From: Mechancal Student Subject:.3 Experment # Numercal Methods Usng Excel Introducton Mcrosoft Excel s a spreadsheet program that can be used for data analyss,

More information

Learning Ensembles of Convolutional Neural Networks

Learning Ensembles of Convolutional Neural Networks Learnng Ensembles of Convolutonal Neural Networks Lran Chen The Unversty of Chcago Faculty Mentor: Greg Shakhnarovch Toyota Technologcal Insttute at Chcago 1 Introducton Convolutonal Neural Networks (CNN)

More information

IEE Electronics Letters, vol 34, no 17, August 1998, pp ESTIMATING STARTING POINT OF CONDUCTION OF CMOS GATES

IEE Electronics Letters, vol 34, no 17, August 1998, pp ESTIMATING STARTING POINT OF CONDUCTION OF CMOS GATES IEE Electroncs Letters, vol 34, no 17, August 1998, pp. 1622-1624. ESTIMATING STARTING POINT OF CONDUCTION OF CMOS GATES A. Chatzgeorgou, S. Nkolads 1 and I. Tsoukalas Computer Scence Department, 1 Department

More information

ANNUAL OF NAVIGATION 11/2006

ANNUAL OF NAVIGATION 11/2006 ANNUAL OF NAVIGATION 11/2006 TOMASZ PRACZYK Naval Unversty of Gdyna A FEEDFORWARD LINEAR NEURAL NETWORK WITH HEBBA SELFORGANIZATION IN RADAR IMAGE COMPRESSION ABSTRACT The artcle presents the applcaton

More information

Optimal Placement of PMU and RTU by Hybrid Genetic Algorithm and Simulated Annealing for Multiarea Power System State Estimation

Optimal Placement of PMU and RTU by Hybrid Genetic Algorithm and Simulated Annealing for Multiarea Power System State Estimation T. Kerdchuen and W. Ongsakul / GMSARN Internatonal Journal (09) - Optmal Placement of and by Hybrd Genetc Algorthm and Smulated Annealng for Multarea Power System State Estmaton Thawatch Kerdchuen and

More information

PRACTICAL, COMPUTATION EFFICIENT HIGH-ORDER NEURAL NETWORK FOR ROTATION AND SHIFT INVARIANT PATTERN RECOGNITION. Evgeny Artyomov and Orly Yadid-Pecht

PRACTICAL, COMPUTATION EFFICIENT HIGH-ORDER NEURAL NETWORK FOR ROTATION AND SHIFT INVARIANT PATTERN RECOGNITION. Evgeny Artyomov and Orly Yadid-Pecht 68 Internatonal Journal "Informaton Theores & Applcatons" Vol.11 PRACTICAL, COMPUTATION EFFICIENT HIGH-ORDER NEURAL NETWORK FOR ROTATION AND SHIFT INVARIANT PATTERN RECOGNITION Evgeny Artyomov and Orly

More information

Calculation of the received voltage due to the radiation from multiple co-frequency sources

Calculation of the received voltage due to the radiation from multiple co-frequency sources Rec. ITU-R SM.1271-0 1 RECOMMENDATION ITU-R SM.1271-0 * EFFICIENT SPECTRUM UTILIZATION USING PROBABILISTIC METHODS Rec. ITU-R SM.1271 (1997) The ITU Radocommuncaton Assembly, consderng a) that communcatons

More information

A Comparison of Two Equivalent Real Formulations for Complex-Valued Linear Systems Part 2: Results

A Comparison of Two Equivalent Real Formulations for Complex-Valued Linear Systems Part 2: Results AMERICAN JOURNAL OF UNDERGRADUATE RESEARCH VOL. 1 NO. () A Comparson of Two Equvalent Real Formulatons for Complex-Valued Lnear Systems Part : Results Abnta Munankarmy and Mchael A. Heroux Department of

More information

Control Chart. Control Chart - history. Process in control. Developed in 1920 s. By Dr. Walter A. Shewhart

Control Chart. Control Chart - history. Process in control. Developed in 1920 s. By Dr. Walter A. Shewhart Control Chart - hstory Control Chart Developed n 920 s By Dr. Walter A. Shewhart 2 Process n control A phenomenon s sad to be controlled when, through the use of past experence, we can predct, at least

More information

Research of Dispatching Method in Elevator Group Control System Based on Fuzzy Neural Network. Yufeng Dai a, Yun Du b

Research of Dispatching Method in Elevator Group Control System Based on Fuzzy Neural Network. Yufeng Dai a, Yun Du b 2nd Internatonal Conference on Computer Engneerng, Informaton Scence & Applcaton Technology (ICCIA 207) Research of Dspatchng Method n Elevator Group Control System Based on Fuzzy Neural Network Yufeng

More information

Chaotic Filter Bank for Computer Cryptography

Chaotic Filter Bank for Computer Cryptography Chaotc Flter Bank for Computer Cryptography Bngo Wng-uen Lng Telephone: 44 () 784894 Fax: 44 () 784893 Emal: HTwng-kuen.lng@kcl.ac.ukTH Department of Electronc Engneerng, Dvson of Engneerng, ng s College

More information

High Speed ADC Sampling Transients

High Speed ADC Sampling Transients Hgh Speed ADC Samplng Transents Doug Stuetzle Hgh speed analog to dgtal converters (ADCs) are, at the analog sgnal nterface, track and hold devces. As such, they nclude samplng capactors and samplng swtches.

More information

Application of Intelligent Voltage Control System to Korean Power Systems

Application of Intelligent Voltage Control System to Korean Power Systems Applcaton of Intellgent Voltage Control System to Korean Power Systems WonKun Yu a,1 and HeungJae Lee b, *,2 a Department of Power System, Seol Unversty, South Korea. b Department of Power System, Kwangwoon

More information

Enhanced Artificial Neural Networks Using Complex Numbers

Enhanced Artificial Neural Networks Using Complex Numbers Enhanced Artfcal Neural Networks Usng Complex Numers Howard E. Mchel and A. A. S. Awwal Computer Scence Department Unversty of Dayton Dayton, OH 45469-60 mchel@cps.udayton.edu Computer Scence & Engneerng

More information

Comparison of Two Measurement Devices I. Fundamental Ideas.

Comparison of Two Measurement Devices I. Fundamental Ideas. Comparson of Two Measurement Devces I. Fundamental Ideas. ASQ-RS Qualty Conference March 16, 005 Joseph G. Voelkel, COE, RIT Bruce Sskowsk Rechert, Inc. Topcs The Problem, Eample, Mathematcal Model One

More information

Analysis of Time Delays in Synchronous and. Asynchronous Control Loops. Bj rn Wittenmark, Ben Bastian, and Johan Nilsson

Analysis of Time Delays in Synchronous and. Asynchronous Control Loops. Bj rn Wittenmark, Ben Bastian, and Johan Nilsson 37th CDC, Tampa, December 1998 Analyss of Delays n Synchronous and Asynchronous Control Loops Bj rn Wttenmark, Ben Bastan, and Johan Nlsson emal: bjorn@control.lth.se, ben@control.lth.se, and johan@control.lth.se

More information

Priority based Dynamic Multiple Robot Path Planning

Priority based Dynamic Multiple Robot Path Planning 2nd Internatonal Conference on Autonomous obots and Agents Prorty based Dynamc Multple obot Path Plannng Abstract Taxong Zheng Department of Automaton Chongqng Unversty of Post and Telecommuncaton, Chna

More information

CHAPTER 29 AN EXAMPLE OF THE MONTECARLO SIMULATIONS: THE LANGEVIN DYNAMICS

CHAPTER 29 AN EXAMPLE OF THE MONTECARLO SIMULATIONS: THE LANGEVIN DYNAMICS CHAPTER 9 AN EXAMPLE OF THE MONTECARLO SIMULATIONS: THE LANGEVIN DYNAMICS Objectves After completng the readng of ths chapter, you wll be able to: Construct a polymer chan. Devse a smple MC smulaton program

More information

EE 508 Lecture 6. Degrees of Freedom The Approximation Problem

EE 508 Lecture 6. Degrees of Freedom The Approximation Problem EE 508 Lecture 6 Degrees of Freedom The Approxmaton Problem Revew from Last Tme Desgn Strategy Theorem: A crcut wth transfer functon T(s) can be obtaned from a crcut wth normalzed transfer functon T n

More information

1 GSW Multipath Channel Models

1 GSW Multipath Channel Models In the general case, the moble rado channel s pretty unpleasant: there are a lot of echoes dstortng the receved sgnal, and the mpulse response keeps changng. Fortunately, there are some smplfyng assumptons

More information

Adaptive Modulation for Multiple Antenna Channels

Adaptive Modulation for Multiple Antenna Channels Adaptve Modulaton for Multple Antenna Channels June Chul Roh and Bhaskar D. Rao Department of Electrcal and Computer Engneerng Unversty of Calforna, San Dego La Jolla, CA 993-7 E-mal: jroh@ece.ucsd.edu,

More information

Networks. Backpropagation. Backpropagation. Introduction to. Backpropagation Network training. Backpropagation Learning Details 1.04.

Networks. Backpropagation. Backpropagation. Introduction to. Backpropagation Network training. Backpropagation Learning Details 1.04. Networs Introducton to - In 1986 a method for learnng n mult-layer wor,, was nvented by Rumelhart Paper Why are what and where processed by separate cortcal vsual systems? - The algorthm s a sensble approach

More information

Sensors for Motion and Position Measurement

Sensors for Motion and Position Measurement Sensors for Moton and Poston Measurement Introducton An ntegrated manufacturng envronment conssts of 5 elements:- - Machne tools - Inspecton devces - Materal handlng devces - Packagng machnes - Area where

More information

Control of Chaos in Positive Output Luo Converter by means of Time Delay Feedback

Control of Chaos in Positive Output Luo Converter by means of Time Delay Feedback Control of Chaos n Postve Output Luo Converter by means of Tme Delay Feedback Nagulapat nkran.ped@gmal.com Abstract Faster development n Dc to Dc converter technques are undergong very drastc changes due

More information

A High-Sensitivity Oversampling Digital Signal Detection Technique for CMOS Image Sensors Using Non-destructive Intermediate High-Speed Readout Mode

A High-Sensitivity Oversampling Digital Signal Detection Technique for CMOS Image Sensors Using Non-destructive Intermediate High-Speed Readout Mode A Hgh-Senstvty Oversamplng Dgtal Sgnal Detecton Technque for CMOS Image Sensors Usng Non-destructve Intermedate Hgh-Speed Readout Mode Shoj Kawahto*, Nobuhro Kawa** and Yoshak Tadokoro** *Research Insttute

More information

Queen Bee genetic optimization of an heuristic based fuzzy control scheme for a mobile robot 1

Queen Bee genetic optimization of an heuristic based fuzzy control scheme for a mobile robot 1 Queen Bee genetc optmzaton of an heurstc based fuzzy control scheme for a moble robot 1 Rodrgo A. Carrasco Schmdt Pontfca Unversdad Católca de Chle Abstract Ths work presents both a novel control scheme

More information

MTBF PREDICTION REPORT

MTBF PREDICTION REPORT MTBF PREDICTION REPORT PRODUCT NAME: BLE112-A-V2 Issued date: 01-23-2015 Rev:1.0 Copyrght@2015 Bluegga Technologes. All rghts reserved. 1 MTBF PREDICTION REPORT... 1 PRODUCT NAME: BLE112-A-V2... 1 1.0

More information

NETWORK 2001 Transportation Planning Under Multiple Objectives

NETWORK 2001 Transportation Planning Under Multiple Objectives NETWORK 200 Transportaton Plannng Under Multple Objectves Woodam Chung Graduate Research Assstant, Department of Forest Engneerng, Oregon State Unversty, Corvalls, OR9733, Tel: (54) 737-4952, Fax: (54)

More information

Uncertainty in measurements of power and energy on power networks

Uncertainty in measurements of power and energy on power networks Uncertanty n measurements of power and energy on power networks E. Manov, N. Kolev Department of Measurement and Instrumentaton, Techncal Unversty Sofa, bul. Klment Ohrdsk No8, bl., 000 Sofa, Bulgara Tel./fax:

More information

problems palette of David Rock and Mary K. Porter 6. A local musician comes to your school to give a performance

problems palette of David Rock and Mary K. Porter 6. A local musician comes to your school to give a performance palette of problems Davd Rock and Mary K. Porter 1. If n represents an nteger, whch of the followng expressons yelds the greatest value? n,, n, n, n n. A 60-watt lghtbulb s used for 95 hours before t burns

More information

POLYTECHNIC UNIVERSITY Electrical Engineering Department. EE SOPHOMORE LABORATORY Experiment 1 Laboratory Energy Sources

POLYTECHNIC UNIVERSITY Electrical Engineering Department. EE SOPHOMORE LABORATORY Experiment 1 Laboratory Energy Sources POLYTECHNIC UNIERSITY Electrcal Engneerng Department EE SOPHOMORE LABORATORY Experment 1 Laboratory Energy Sources Modfed for Physcs 18, Brooklyn College I. Oerew of the Experment Ths experment has three

More information

熊本大学学術リポジトリ. Kumamoto University Repositor

熊本大学学術リポジトリ. Kumamoto University Repositor 熊本大学学術リポジトリ Kumamoto Unversty Repostor Ttle Wreless LAN Based Indoor Poston and Its Smulaton Author(s) Ktasuka, Teruak; Nakansh, Tsune CtatonIEEE Pacfc RIM Conference on Comm Computers, and Sgnal Processng

More information

Desensitized Kalman Filtering with Analytical Gain

Desensitized Kalman Filtering with Analytical Gain Desenstzed Kalman Flterng wth Analytcal Gan ashan Lou School of Electrc and Informaton Engneerng, Zhengzhou Unversty of Lght Industry, Zhengzhou, 45002, Chna, tayzan@sna.com Abstract: he possble methodologes

More information

Fast Code Detection Using High Speed Time Delay Neural Networks

Fast Code Detection Using High Speed Time Delay Neural Networks Fast Code Detecton Usng Hgh Speed Tme Delay Neural Networks Hazem M. El-Bakry 1 and Nkos Mastoraks 1 Faculty of Computer Scence & Informaton Systems, Mansoura Unversty, Egypt helbakry0@yahoo.com Department

More information

Micro-grid Inverter Parallel Droop Control Method for Improving Dynamic Properties and the Effect of Power Sharing

Micro-grid Inverter Parallel Droop Control Method for Improving Dynamic Properties and the Effect of Power Sharing 2015 AASRI Internatonal Conference on Industral Electroncs and Applcatons (IEA 2015) Mcro-grd Inverter Parallel Droop Control Method for Improvng Dynamc Propertes and the Effect of Power Sharng aohong

More information

Walsh Function Based Synthesis Method of PWM Pattern for Full-Bridge Inverter

Walsh Function Based Synthesis Method of PWM Pattern for Full-Bridge Inverter Walsh Functon Based Synthess Method of PWM Pattern for Full-Brdge Inverter Sej Kondo and Krt Choesa Nagaoka Unversty of Technology 63-, Kamtomoka-cho, Nagaoka 9-, JAPAN Fax: +8-58-7-95, Phone: +8-58-7-957

More information

MODEL ORDER REDUCTION AND CONTROLLER DESIGN OF DISCRETE SYSTEM EMPLOYING REAL CODED GENETIC ALGORITHM J. S. Yadav, N. P. Patidar, J.

MODEL ORDER REDUCTION AND CONTROLLER DESIGN OF DISCRETE SYSTEM EMPLOYING REAL CODED GENETIC ALGORITHM J. S. Yadav, N. P. Patidar, J. ABSTRACT Research Artcle MODEL ORDER REDUCTION AND CONTROLLER DESIGN OF DISCRETE SYSTEM EMPLOYING REAL CODED GENETIC ALGORITHM J. S. Yadav, N. P. Patdar, J. Sngha Address for Correspondence Maulana Azad

More information

Throughput Maximization by Adaptive Threshold Adjustment for AMC Systems

Throughput Maximization by Adaptive Threshold Adjustment for AMC Systems APSIPA ASC 2011 X an Throughput Maxmzaton by Adaptve Threshold Adjustment for AMC Systems We-Shun Lao and Hsuan-Jung Su Graduate Insttute of Communcaton Engneerng Department of Electrcal Engneerng Natonal

More information

Shunt Active Filters (SAF)

Shunt Active Filters (SAF) EN-TH05-/004 Martt Tuomanen (9) Shunt Actve Flters (SAF) Operaton prncple of a Shunt Actve Flter. Non-lnear loads lke Varable Speed Drves, Unnterrupted Power Supples and all knd of rectfers draw a non-snusodal

More information

NATIONAL RADIO ASTRONOMY OBSERVATORY Green Bank, West Virginia SPECTRAL PROCESSOR MEMO NO. 25. MEMORANDUM February 13, 1985

NATIONAL RADIO ASTRONOMY OBSERVATORY Green Bank, West Virginia SPECTRAL PROCESSOR MEMO NO. 25. MEMORANDUM February 13, 1985 NATONAL RADO ASTRONOMY OBSERVATORY Green Bank, West Vrgna SPECTRAL PROCESSOR MEMO NO. 25 MEMORANDUM February 13, 1985 To: Spectral Processor Group From: R. Fsher Subj: Some Experments wth an nteger FFT

More information

Reliability and Quality Improvement of Robotic Manipulation Systems

Reliability and Quality Improvement of Robotic Manipulation Systems Yaser Maddah, Al Maddah Relablty and Qualty Improvement of Robotc Manpulaton Systems Yaser Maddah Department of Mechancal and Manufacturng Engneerng Unversty of Mantoba Wnnpeg, MB R3T 5V6 CANADA Al Maddah

More information

A MODIFIED DIFFERENTIAL EVOLUTION ALGORITHM IN SPARSE LINEAR ANTENNA ARRAY SYNTHESIS

A MODIFIED DIFFERENTIAL EVOLUTION ALGORITHM IN SPARSE LINEAR ANTENNA ARRAY SYNTHESIS A MODIFIED DIFFERENTIAL EVOLUTION ALORITHM IN SPARSE LINEAR ANTENNA ARRAY SYNTHESIS Kaml Dmller Department of Electrcal-Electroncs Engneerng rne Amercan Unversty North Cyprus, Mersn TURKEY kdmller@gau.edu.tr

More information

arxiv: v1 [cs.lg] 8 Jul 2016

arxiv: v1 [cs.lg] 8 Jul 2016 Overcomng Challenges n Fxed Pont Tranng of Deep Convolutonal Networks arxv:1607.02241v1 [cs.lg] 8 Jul 2016 Darryl D. Ln Qualcomm Research, San Dego, CA 92121 USA Sachn S. Talath Qualcomm Research, San

More information

antenna antenna (4.139)

antenna antenna (4.139) .6.6 The Lmts of Usable Input Levels for LNAs The sgnal voltage level delvered to the nput of an LNA from the antenna may vary n a very wde nterval, from very weak sgnals comparable to the nose level,

More information

Particle Filters. Ioannis Rekleitis

Particle Filters. Ioannis Rekleitis Partcle Flters Ioanns Reklets Bayesan Flter Estmate state x from data Z What s the probablty of the robot beng at x? x could be robot locaton, map nformaton, locatons of targets, etc Z could be sensor

More information

The Performance Improvement of BASK System for Giga-Bit MODEM Using the Fuzzy System

The Performance Improvement of BASK System for Giga-Bit MODEM Using the Fuzzy System Int. J. Communcatons, Network and System Scences, 10, 3, 1-5 do:10.36/jcns.10.358 Publshed Onlne May 10 (http://www.scrp.org/journal/jcns/) The Performance Improvement of BASK System for Gga-Bt MODEM Usng

More information

Adaptive System Control with PID Neural Networks

Adaptive System Control with PID Neural Networks Adaptve System Control wth PID Neural Networs F. Shahra a, M.A. Fanae b, A.R. Aromandzadeh a a Department of Chemcal Engneerng, Unversty of Sstan and Baluchestan, Zahedan, Iran. b Department of Chemcal

More information

Define Y = # of mobiles from M total mobiles that have an adequate link. Measure of average portion of mobiles allocated a link of adequate quality.

Define Y = # of mobiles from M total mobiles that have an adequate link. Measure of average portion of mobiles allocated a link of adequate quality. Wreless Communcatons Technologes 6::559 (Advanced Topcs n Communcatons) Lecture 5 (Aprl th ) and Lecture 6 (May st ) Instructor: Professor Narayan Mandayam Summarzed by: Steve Leung (leungs@ece.rutgers.edu)

More information

Comparative Analysis of Reuse 1 and 3 in Cellular Network Based On SIR Distribution and Rate

Comparative Analysis of Reuse 1 and 3 in Cellular Network Based On SIR Distribution and Rate Comparatve Analyss of Reuse and 3 n ular Network Based On IR Dstrbuton and Rate Chandra Thapa M.Tech. II, DEC V College of Engneerng & Technology R.V.. Nagar, Chttoor-5727, A.P. Inda Emal: chandra2thapa@gmal.com

More information

Figure 1. DC-DC Boost Converter

Figure 1. DC-DC Boost Converter EE46, Power Electroncs, DC-DC Boost Converter Verson Oct. 3, 11 Overvew Boost converters make t possble to effcently convert a DC voltage from a lower level to a hgher level. Theory of Operaton Relaton

More information

Development of Neural Networks for Noise Reduction

Development of Neural Networks for Noise Reduction The Internatonal Arab Journal of Informaton Technology, Vol. 7, No. 3, July 00 89 Development of Neural Networks for Nose Reducton Lubna Badr Faculty of Engneerng, Phladelpha Unversty, Jordan Abstract:

More information

Chapter 2 Two-Degree-of-Freedom PID Controllers Structures

Chapter 2 Two-Degree-of-Freedom PID Controllers Structures Chapter 2 Two-Degree-of-Freedom PID Controllers Structures As n most of the exstng ndustral process control applcatons, the desred value of the controlled varable, or set-pont, normally remans constant

More information

Rejection of PSK Interference in DS-SS/PSK System Using Adaptive Transversal Filter with Conditional Response Recalculation

Rejection of PSK Interference in DS-SS/PSK System Using Adaptive Transversal Filter with Conditional Response Recalculation SERBIAN JOURNAL OF ELECTRICAL ENGINEERING Vol., No., November 23, 3-9 Rejecton of PSK Interference n DS-SS/PSK System Usng Adaptve Transversal Flter wth Condtonal Response Recalculaton Zorca Nkolć, Bojan

More information

Discrete Time Sliding Mode Control of Magnetic Levitation System with Enhanced Exponential Reaching Law

Discrete Time Sliding Mode Control of Magnetic Levitation System with Enhanced Exponential Reaching Law Dscrete Tme Sldng Mode Control of Magnetc Levtaton System wth Enhanced Eponental Reachng Law Dnçer Maden, İrfan Yazıcı Duzce Unversty, Department of Electrcal & Electroncs Engneerng, 86 Konuralp, Düzce,

More information

Performance Analysis of Multi User MIMO System with Block-Diagonalization Precoding Scheme

Performance Analysis of Multi User MIMO System with Block-Diagonalization Precoding Scheme Performance Analyss of Mult User MIMO System wth Block-Dagonalzaton Precodng Scheme Yoon Hyun m and Jn Young m, wanwoon Unversty, Department of Electroncs Convergence Engneerng, Wolgye-Dong, Nowon-Gu,

More information

Latency Insertion Method (LIM) for IR Drop Analysis in Power Grid

Latency Insertion Method (LIM) for IR Drop Analysis in Power Grid Abstract Latency Inserton Method (LIM) for IR Drop Analyss n Power Grd Dmtr Klokotov, and José Schutt-Ané Wth the steadly growng number of transstors on a chp, and constantly tghtenng voltage budgets,

More information

Side-Match Vector Quantizers Using Neural Network Based Variance Predictor for Image Coding

Side-Match Vector Quantizers Using Neural Network Based Variance Predictor for Image Coding Sde-Match Vector Quantzers Usng Neural Network Based Varance Predctor for Image Codng Shuangteng Zhang Department of Computer Scence Eastern Kentucky Unversty Rchmond, KY 40475, U.S.A. shuangteng.zhang@eku.edu

More information

The Spectrum Sharing in Cognitive Radio Networks Based on Competitive Price Game

The Spectrum Sharing in Cognitive Radio Networks Based on Competitive Price Game 8 Y. B. LI, R. YAG, Y. LI, F. YE, THE SPECTRUM SHARIG I COGITIVE RADIO ETWORKS BASED O COMPETITIVE The Spectrum Sharng n Cogntve Rado etworks Based on Compettve Prce Game Y-bng LI, Ru YAG., Yun LI, Fang

More information

High Speed, Low Power And Area Efficient Carry-Select Adder

High Speed, Low Power And Area Efficient Carry-Select Adder Internatonal Journal of Scence, Engneerng and Technology Research (IJSETR), Volume 5, Issue 3, March 2016 Hgh Speed, Low Power And Area Effcent Carry-Select Adder Nelant Harsh M.tech.VLSI Desgn Electroncs

More information

Graph Method for Solving Switched Capacitors Circuits

Graph Method for Solving Switched Capacitors Circuits Recent Advances n rcuts, ystems, gnal and Telecommuncatons Graph Method for olvng wtched apactors rcuts BHUMIL BRTNÍ Department of lectroncs and Informatcs ollege of Polytechncs Jhlava Tolstého 6, 586

More information

Digital Transmission

Digital Transmission Dgtal Transmsson Most modern communcaton systems are dgtal, meanng that the transmtted normaton sgnal carres bts and symbols rather than an analog sgnal. The eect o C/N rato ncrease or decrease on dgtal

More information

A NSGA-II algorithm to solve a bi-objective optimization of the redundancy allocation problem for series-parallel systems

A NSGA-II algorithm to solve a bi-objective optimization of the redundancy allocation problem for series-parallel systems 0 nd Internatonal Conference on Industral Technology and Management (ICITM 0) IPCSIT vol. 49 (0) (0) IACSIT Press, Sngapore DOI: 0.776/IPCSIT.0.V49.8 A NSGA-II algorthm to solve a b-obectve optmzaton of

More information

An Alternation Diffusion LMS Estimation Strategy over Wireless Sensor Network

An Alternation Diffusion LMS Estimation Strategy over Wireless Sensor Network Progress In Electromagnetcs Research M, Vol. 70, 135 143, 2018 An Alternaton Dffuson LMS Estmaton Strategy over Wreless Sensor Network Ln L * and Donghu L Abstract Ths paper presents a dstrbuted estmaton

More information

Parameter Free Iterative Decoding Metrics for Non-Coherent Orthogonal Modulation

Parameter Free Iterative Decoding Metrics for Non-Coherent Orthogonal Modulation 1 Parameter Free Iteratve Decodng Metrcs for Non-Coherent Orthogonal Modulaton Albert Gullén Fàbregas and Alex Grant Abstract We study decoder metrcs suted for teratve decodng of non-coherently detected

More information

ROBUST IDENTIFICATION AND PREDICTION USING WILCOXON NORM AND PARTICLE SWARM OPTIMIZATION

ROBUST IDENTIFICATION AND PREDICTION USING WILCOXON NORM AND PARTICLE SWARM OPTIMIZATION 7th European Sgnal Processng Conference (EUSIPCO 9 Glasgow, Scotland, August 4-8, 9 ROBUST IDENTIFICATION AND PREDICTION USING WILCOXON NORM AND PARTICLE SWARM OPTIMIZATION Babta Majh, G. Panda and B.

More information

Network Theory. EC / EE / IN. for

Network Theory.   EC / EE / IN. for Network Theory for / / IN By www.thegateacademy.com Syllabus Syllabus for Networks Network Graphs: Matrces Assocated Wth Graphs: Incdence, Fundamental ut Set and Fundamental rcut Matrces. Soluton Methods:

More information

Generalized Incomplete Trojan-Type Designs with Unequal Cell Sizes

Generalized Incomplete Trojan-Type Designs with Unequal Cell Sizes Internatonal Journal of Theoretcal & Appled Scences 6(1): 50-54(2014) ISSN No. (Prnt): 0975-1718 ISSN No. (Onlne): 2249-3247 Generalzed Incomplete Trojan-Type Desgns wth Unequal Cell Szes Cn Varghese,

More information

Markov Chain Monte Carlo Detection for Underwater Acoustic Channels

Markov Chain Monte Carlo Detection for Underwater Acoustic Channels Markov Chan Monte Carlo Detecton for Underwater Acoustc Channels Hong Wan, Rong-Rong Chen, Jun Won Cho, Andrew Snger, James Presg, and Behrouz Farhang-Boroujeny Dept. of ECE, Unversty of Utah Dept. of

More information

Simplified Motion Modeling for Snake Robots

Simplified Motion Modeling for Snake Robots Smplfed Moton Modelng for Snake Robots Floran Enner, Davd Rollnson and Howe Choset Abstract We present a general method of estmatng a snake robot s moton over flat ground usng only knowledge of the robot

More information

Understanding the Spike Algorithm

Understanding the Spike Algorithm Understandng the Spke Algorthm Vctor Ejkhout and Robert van de Gejn May, ntroducton The parallel soluton of lnear systems has a long hstory, spannng both drect and teratve methods Whle drect methods exst

More information

Passive Filters. References: Barbow (pp ), Hayes & Horowitz (pp 32-60), Rizzoni (Chap. 6)

Passive Filters. References: Barbow (pp ), Hayes & Horowitz (pp 32-60), Rizzoni (Chap. 6) Passve Flters eferences: Barbow (pp 6575), Hayes & Horowtz (pp 360), zzon (Chap. 6) Frequencyselectve or flter crcuts pass to the output only those nput sgnals that are n a desred range of frequences (called

More information

Fall 2018 #11 Games and Nimbers. A. Game. 0.5 seconds, 64 megabytes

Fall 2018 #11 Games and Nimbers. A. Game. 0.5 seconds, 64 megabytes 5-95 Fall 08 # Games and Nmbers A. Game 0.5 seconds, 64 megabytes There s a legend n the IT Cty college. A student that faled to answer all questons on the game theory exam s gven one more chance by hs

More information

ECE315 / ECE515 Lecture 5 Date:

ECE315 / ECE515 Lecture 5 Date: Lecture 5 Date: 18.08.2016 Common Source Amplfer MOSFET Amplfer Dstorton Example 1 One Realstc CS Amplfer Crcut: C c1 : Couplng Capactor serves as perfect short crcut at all sgnal frequences whle blockng

More information

Introduction to Coalescent Models. Biostatistics 666

Introduction to Coalescent Models. Biostatistics 666 Introducton to Coalescent Models Bostatstcs 666 Prevously Allele frequences Hardy Wenberg Equlbrum Lnkage Equlbrum Expected state for dstant markers Lnkage Dsequlbrum Assocaton between neghborng alleles

More information

Efficient Large Integers Arithmetic by Adopting Squaring and Complement Recoding Techniques

Efficient Large Integers Arithmetic by Adopting Squaring and Complement Recoding Techniques The th Worshop on Combnatoral Mathematcs and Computaton Theory Effcent Large Integers Arthmetc by Adoptng Squarng and Complement Recodng Technques Cha-Long Wu*, Der-Chyuan Lou, and Te-Jen Chang *Department

More information

TECHNICAL NOTE TERMINATION FOR POINT- TO-POINT SYSTEMS TN TERMINATON FOR POINT-TO-POINT SYSTEMS. Zo = L C. ω - angular frequency = 2πf

TECHNICAL NOTE TERMINATION FOR POINT- TO-POINT SYSTEMS TN TERMINATON FOR POINT-TO-POINT SYSTEMS. Zo = L C. ω - angular frequency = 2πf TECHNICAL NOTE TERMINATION FOR POINT- TO-POINT SYSTEMS INTRODUCTION Because dgtal sgnal rates n computng systems are ncreasng at an astonshng rate, sgnal ntegrty ssues have become far more mportant to

More information

A Novel Optimization of the Distance Source Routing (DSR) Protocol for the Mobile Ad Hoc Networks (MANET)

A Novel Optimization of the Distance Source Routing (DSR) Protocol for the Mobile Ad Hoc Networks (MANET) A Novel Optmzaton of the Dstance Source Routng (DSR) Protocol for the Moble Ad Hoc Networs (MANET) Syed S. Rzv 1, Majd A. Jafr, and Khaled Ellethy Computer Scence and Engneerng Department Unversty of Brdgeport

More information

A Preliminary Study on Targets Association Algorithm of Radar and AIS Using BP Neural Network

A Preliminary Study on Targets Association Algorithm of Radar and AIS Using BP Neural Network Avalable onlne at www.scencedrect.com Proceda Engneerng 5 (2 44 445 A Prelmnary Study on Targets Assocaton Algorthm of Radar and AIS Usng BP Neural Networ Hu Xaoru a, Ln Changchuan a a Navgaton Insttute

More information

Space Time Equalization-space time codes System Model for STCM

Space Time Equalization-space time codes System Model for STCM Space Tme Eualzaton-space tme codes System Model for STCM The system under consderaton conssts of ST encoder, fadng channel model wth AWGN, two transmt antennas, one receve antenna, Vterb eualzer wth deal

More information

An Effective Approach for Distribution System Power Flow Solution

An Effective Approach for Distribution System Power Flow Solution World Academy of Scence, Engneerng and Technology nternatonal Journal of Electrcal and Computer Engneerng ol:, No:, 9 An Effectve Approach for Dstrbuton System Power Flow Soluton A. Alsaad, and. Gholam

More information

Prevention of Sequential Message Loss in CAN Systems

Prevention of Sequential Message Loss in CAN Systems Preventon of Sequental Message Loss n CAN Systems Shengbng Jang Electrcal & Controls Integraton Lab GM R&D Center, MC: 480-106-390 30500 Mound Road, Warren, MI 48090 shengbng.jang@gm.com Ratnesh Kumar

More information

Discussion on How to Express a Regional GPS Solution in the ITRF

Discussion on How to Express a Regional GPS Solution in the ITRF 162 Dscusson on How to Express a Regonal GPS Soluton n the ITRF Z. ALTAMIMI 1 Abstract The usefulness of the densfcaton of the Internatonal Terrestral Reference Frame (ITRF) s to facltate ts access as

More information

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. 13, NO. 12, DECEMBER

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. 13, NO. 12, DECEMBER IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. 3, NO. 2, DECEMBER 204 695 On Spatal Capacty of Wreless Ad Hoc Networks wth Threshold Based Schedulng Yue Lng Che, Student Member, IEEE, Ru Zhang, Member,

More information

New Approach to Achieving Stand Alone GPS Attitude Determination using Dual Short Baselines for Small-Satellite

New Approach to Achieving Stand Alone GPS Attitude Determination using Dual Short Baselines for Small-Satellite SSC99-XI- New Approach to Achevng Stand Alone GPS Atttude Determnaton usng Dual Short Baselnes for Small-Satellte S. Purvgrapong, M.S. Hodgart,Y. Hashda, M.J. Unwn Surrey Space Centre, UK Abstract Ths

More information

Figure 1. DC-DC Boost Converter

Figure 1. DC-DC Boost Converter EE36L, Power Electroncs, DC-DC Boost Converter Verson Feb. 8, 9 Overvew Boost converters make t possble to effcently convert a DC voltage from a lower level to a hgher level. Theory of Operaton Relaton

More information

A study of turbo codes for multilevel modulations in Gaussian and mobile channels

A study of turbo codes for multilevel modulations in Gaussian and mobile channels A study of turbo codes for multlevel modulatons n Gaussan and moble channels Lamne Sylla and Paul Forter (sylla, forter)@gel.ulaval.ca Department of Electrcal and Computer Engneerng Laval Unversty, Ste-Foy,

More information

Figure.1. Basic model of an impedance source converter JCHPS Special Issue 12: August Page 13

Figure.1. Basic model of an impedance source converter JCHPS Special Issue 12: August Page 13 A Hgh Gan DC - DC Converter wth Soft Swtchng and Power actor Correcton for Renewable Energy Applcaton T. Selvakumaran* and. Svachdambaranathan Department of EEE, Sathyabama Unversty, Chenna, Inda. *Correspondng

More information

Adaptive Fuzzy Sliding Controller with Dynamic Compensation for Multi-Axis Machining

Adaptive Fuzzy Sliding Controller with Dynamic Compensation for Multi-Axis Machining J. Software Engneerng & Applcatons, 009, : 88-94 do:10.436/jsea.009.4037 Publshed Onlne November 009 (http://www.scrp.org/journal/jsea) Adaptve Fuzzy Sldng Controller wth Dynamc Compensaton for Mult-Axs

More information

The Dynamic Utilization of Substation Measurements to Maintain Power System Observability

The Dynamic Utilization of Substation Measurements to Maintain Power System Observability 1 The Dynamc Utlzaton of Substaton Measurements to Mantan Power System Observablty Y. Wu, Student Member, IEEE, M. Kezunovc, Fellow, IEEE and T. Kostc, Member, IEEE Abstract-- In a power system State Estmator

More information

Multi-sensor optimal information fusion Kalman filter with mobile agents in ring sensor networks

Multi-sensor optimal information fusion Kalman filter with mobile agents in ring sensor networks Mult-sensor optmal nformaton fuson Kalman flter wth moble agents n rng sensor networs Behrouz Safarneadan *, Kazem asanpoor ** *Shraz Unversty of echnology, safarnead@sutech.ac.r ** Shraz Unversty of echnology,.hasanpor@gmal.com

More information

Introduction to Coalescent Models. Biostatistics 666 Lecture 4

Introduction to Coalescent Models. Biostatistics 666 Lecture 4 Introducton to Coalescent Models Bostatstcs 666 Lecture 4 Last Lecture Lnkage Equlbrum Expected state for dstant markers Lnkage Dsequlbrum Assocaton between neghborng alleles Expected to decrease wth dstance

More information

Time-frequency Analysis Based State Diagnosis of Transformers Windings under the Short-Circuit Shock

Time-frequency Analysis Based State Diagnosis of Transformers Windings under the Short-Circuit Shock Tme-frequency Analyss Based State Dagnoss of Transformers Wndngs under the Short-Crcut Shock YUYING SHAO, ZHUSHI RAO School of Mechancal Engneerng ZHIJIAN JIN Hgh Voltage Lab Shangha Jao Tong Unversty

More information

Cooperative localization method for multi-robot based on PF-EKF

Cooperative localization method for multi-robot based on PF-EKF Scence n Chna Seres F: Informaton Scences 008 SCIENCE IN CHINA PRESS Sprnger www.scchna.com nfo.scchna.com www.sprngerln.com Cooperatve localzaton method for mult-robot based on PF-EKF WANG Lng, WAN JanWe,

More information

Topology Control for C-RAN Architecture Based on Complex Network

Topology Control for C-RAN Architecture Based on Complex Network Topology Control for C-RAN Archtecture Based on Complex Network Zhanun Lu, Yung He, Yunpeng L, Zhaoy L, Ka Dng Chongqng key laboratory of moble communcatons technology Chongqng unversty of post and telecommuncaton

More information

Review: Our Approach 2. CSC310 Information Theory

Review: Our Approach 2. CSC310 Information Theory CSC30 Informaton Theory Sam Rowes Lecture 3: Provng the Kraft-McMllan Inequaltes September 8, 6 Revew: Our Approach The study of both compresson and transmsson requres that we abstract data and messages

More information

Performance Study of OFDMA vs. OFDM/SDMA

Performance Study of OFDMA vs. OFDM/SDMA Performance Study of OFDA vs. OFD/SDA Zhua Guo and Wenwu Zhu crosoft Research, Asa 3F, Beng Sgma Center, No. 49, Zhchun Road adan Dstrct, Beng 00080, P. R. Chna {zhguo, wwzhu}@mcrosoft.com Abstract: In

More information

Open Access Node Localization Method for Wireless Sensor Networks Based on Hybrid Optimization of Differential Evolution and Particle Swarm Algorithm

Open Access Node Localization Method for Wireless Sensor Networks Based on Hybrid Optimization of Differential Evolution and Particle Swarm Algorithm Send Orders for Reprnts to reprnts@benthamscence.ae The Open Automaton and Control Systems Journal, 014, 6, 61-68 61 Open Access Node Localzaton Method for Wreless Sensor Networks Based on Hybrd Optmzaton

More information

A TWO-PLAYER MODEL FOR THE SIMULTANEOUS LOCATION OF FRANCHISING SERVICES WITH PREFERENTIAL RIGHTS

A TWO-PLAYER MODEL FOR THE SIMULTANEOUS LOCATION OF FRANCHISING SERVICES WITH PREFERENTIAL RIGHTS A TWO-PLAYER MODEL FOR THE SIMULTANEOUS LOCATION OF FRANCHISING SERVICES WITH PREFERENTIAL RIGHTS Pedro Godnho and oana Das Faculdade de Economa and GEMF Unversdade de Combra Av. Das da Slva 65 3004-5

More information

Journal of Chemical and Pharmaceutical Research, 2016, 8(4): Research Article

Journal of Chemical and Pharmaceutical Research, 2016, 8(4): Research Article Avalable onlne www.ocpr.com Journal of Chemcal and Pharmaceutcal Research, 2016, 8(4):788-793 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCPRC5 Vrtual Force Coverage Enhancement Optmzaton Algorthm Based

More information