A Self-Organizing Multiple Model Architecture for Motor Imitation. Axel Tidemann* and

Size: px
Start display at page:

Download "A Self-Organizing Multiple Model Architecture for Motor Imitation. Axel Tidemann* and"

Transcription

1 Int. J. Intelligent Information and Database Systems, Vol. 4, No., xxxx A Self-Organizing Multiple Model Architecture for Motor Imitation Axel Tidemann* and Pinar Öztürk Self-Organizing Systems Group, Department of Computer Science, Norwegian University of Science and Technology Sem Sælandsvei 7-9, 749 Norway Fax: tidemann@idi.ntnu.no pinar@idi.ntnu.no Corresponding author Abstract: Learning by imitation allows humans to easily transfer motor knowledge between individuals. Our research is aimed towards equipping robots with imitative capabilities, so humans can simply show a robot what to do. This will greatly simplify how humans program robots. To achieve imitative behaviour, we have implemented a selforganizing connectionist modular architecture on a simulated robot. Motion tracking was used to gather data of human dance movements. When imitating the dance movements, the architecture self-organizes the decomposition of movements into submovements, which are controlled by different modules. The modules both collaborate and compete for control during the movement. The trajectory recorded during motion tracking was repeated, revealing recurrent neural activation patterns of the inverse models (i.e. controllers), indicating that the modules specialize on specific parts of the trajectory. Keywords: Human Robot Interaction, Agent Architectures, Neural Networks, Imitation Learning Reference to this paper should be made as follows: Axel Tidemann and Pinar Öztürk (xxxx) A Self-Organizing Multiple Model Architecture for Motor Imitation, International Journal of Intelligent Information and Database Systems, Vol. 4, No., pp.xxx xxx. Biographical Notes: Axel Tidemann is a PhD student at the Norwegian University of Science and Technology (NTNU), where he obtained his MSc in Computer Science. His research focus is on learning by imitation, and how this mechanism can be used to model human expressiveness. Pinar Öztürk is an associate professor at Norwegian University of science and Technology (NTNU). She has an MSc in Computer Science from the University of Oslo and PhD in artificial intelligence from NTNU. Her current research interests are at the intersection between artificial intelligence, cognitive psychology, neuroscience, and control theory. She is particularly interested in understanding how AI hypotheses Copyright c x Inderscience Enterprises Ltd.

2 Axel Tidemann and Pinar Öztürk and systems can be informed by findings from other relevant research disciplines. A special focus in her research is imitative learning and the involved internal representations. She has previously worked with knowledge modeling and case-based reasoning, and multi-agent systems. Introduction The ability to learn motor skills by imitating another human being is regarded as an important part of human behaviour. Meltzoff and Moore [997] believe this is an innate mechanism for humans, since it is observable in neonates only minutes after birth. Piaget [96] relates imitation to the adaptation of sensory-motor schemas to the external world. Rizzolatti et al. [996] discovered recurring neural activation both when observing and producing the same action. These neurons were dubbed mirror neurons, and are hypothesized to be building blocks of imitation [Schaal, 999], language [Arbib, ] and mind reading [Gallese and Goldman, 998]. Within the AI community, research on imitation learning can be divided into two groups; solving the correspondence problem, which is the transformation from an extrinsic to an intrinsic coordinate system [Nehaniv and Dautenhahn, ], or making the assumption that sensory input has already been transformed into a meaningful representation of the observer, and is ready to be used in an actionperception system [Schaal, 999]. Model-based learning has been regarded as the most interesting approach to implement imitation learning [Schaal, 999]. A modelbased approach consists of pairing an inverse model (i.e. controller) with a forward model (i.e. predictor). This approach has been taken by Demiris and Hayes [], Demiris and Khadhouri [6] and Wolpert et al. [3]. Besides computational neuroscience and robotics, the use of coupled inverse and forward models is also wellknown in the control literature [Jordan and Rumelhart, 99]. Wolpert et al. [998] argue that the cerebellum contains inverse/forward model pairings, and draws upon biological inspiration to develop an architecture that uses inverse/forward models. Our architecture is deeply influenced by the work of Demiris and Wolpert, and is implemented using Echo State Networks. We have selected dance as the current field of study, since it clearly demonstrates the need for an imitative capability. Students watch the teacher, and use nothing other than visual input to guide their own motor systems. Apart from imitating the desired movement, our architecture self-organizes the decomposition of movements into submovements. These submovements are learned by modules that become experts of that particular part of the trajectory. Since these decompositions of the movement leads to specialized modules, we regard these decompositions as motor primitives. These motor primitives form the basis of the observed complex movements, and we try to understand how the architecture carves out these niches in the motor control space. The Multiple Paired Models Architecture Our multiple paired models architecture (abbreviated MPMA) is inspired by Demiris HAMMER [Demiris and Khadhouri, 6] and Wolpert s MOSAIC [Wolpert

3 A Self-Organizing Multiple Model Architecture for Motor Imitation 3 et al., 3] architectures. Both HAMMER and MOSAIC employ multiple paired inverse/forward models, and are architectures for motor learning and control used in an imitation setting. The MPMA seeks to combine the best of both architectures. The HAMMER architecture has a consistent inverse/forward pairing both when observing and executing an action. MOSAIC has a different ordering for observation and execution of an action: when observing an action, the ordering is the same as that of HAMMER. The difference arises when MOSAIC is used for executing an action; then the output of all the inverse models are summed and given as input to the forward models, and the prediction errors of the forward models are used to give responsibilities to the inverse models. This subtle difference is not present in the MPMA, since it complicates the architecture by having two modus operandi. Both HAMMER and MOSAIC have mechanisms to arbitrate between inverse models. The HAMMER architecture uses a confidence measure, based on the prediction error of the corresponding forward models. The MOSAIC architecture uses the same approach, but adds a responsibility predictor that uses context information to determine which inverse model is appropriate for controlling the robot prior to movement. We regard the responsibility predictor as an important part of building an architecture that can be predictive (i.e. knowing what to do before any movement is initiated) as opposed to the more reactive approach of continuously comparing the predicted state with the actual state (which can only be done after the actual state has occurred). The use of multiple paired inverse/forward models (i.e. splitting motor control over different neural networks) is similar to Jacobs mixture of experts [Jacobs et al., 99]. Furthermore, by spreading motor knowledge across several models the architecture can code redundancy, an important part of robust intelligent systems [Pfeifer and Scheier, ]. This approach can also avoid the effects of catastrophic forgetting [Ans et al., 4], where learning new concepts destroys those previously stored, since the representational capacity of the architecture can be augmented by adding more models (although this is not done in the current implementation). The modular architecture allows for the study of low-level neural signals from different models, and further insight into how and why the models divide the movement to be imitated into smaller submovements. Farrar and Zipser [999] propose that a modular structure where different neural networks are responsible for different submovements can efficiently deal with the complexity that arises when controlling 3D movements. The MPMA has the advantage of both local and distributed representations. The local representation corresponds to an inverse/forward model coupling, and by examining the input/output relationship of the inverse/forward models, it is possible to tell where different concepts are stored. The neural networks used to implement the inverse/forward models are distributed representations, tolerant of noise and faulty network nodes. The architecture will be explained in more detail in the following section. In the following text, the term module is used to group three models together: the inverse model, forward model and the responsibility predictor.

4 4 Axel Tidemann and Pinar Öztürk. The Models in the Architecture The MPMA consists of several modules, each with a paired inverse/forward model, and a responsibility predictor. The architecture is shown in figure. Imitation of movement involves activation of different modules in different points of the movement. The dashed arrows show the error signals for all models, demonstrating how all the neural networks start from a random state and are trained based on the self-organization of the architecture. The models will now be explained in more detail. The inverse model is a motor controller or behaviour. It has two input signals: the current state x t and the desired state x t+ of the system. The state x t describes the position of the system in a certain coordinate system. The task of the inverse model is to issue the motor commands u i t that will achieve the desired state, given the current state. The error signal is given by the feedback motor error command, u feedback. The u feedback is a simple way to pull the system towards the desired state when the inverse models issue bad motor commands [Kawato, 99], typically during early stages of training. The corrective motor command u feedback is based on the difference between the target state x t+ and actual state x t+, even though motor commands and state are not in the same coordinate system (state is typically coordinates, whereas motor commands are joint velocities or torques). There are many ways to achieve a desired state in redundant robot systems [Jordan and Rumelhart, 99]. The advantage of using the u feedback is that it guarantees to find a solution to training the inverse model. Since it is based on the difference between the desired and actual state, its influence will decrease as the performance of the system increases. Another advantage is the increase in robustness of the system, since it acts as an online correction controller. If there are sudden perturbations or noise that the inverse and forward models are unable to cope with, the u feedback will pull the system towards the desired state. The forward model is a predictor. Its inputs are the current state x t and the motor commands u i t of its paired inverse model. The forward model predicts the consequences of the motor commands applied to the current state of the robot, i.e. the next state of the system ˆx i t+. The difference between the actual next state and the predicted state is used as the error signal to train the forward model. The responsibility predictor (RP) is another predictor. Based on the context signal y t, the RP predicts the suitability p i t of the module to control the robot prior to movement. How does the forward model and RP differ? The forward model predicts the consequences of the motor commands, issued by its paired inverse model. The RP uses the context signal (i.e. something other than the state of the system and the motor commands) to predict how well its own module will perform, before any motor commands are issued. Wolpert uses the following example: if there are different inverse models for lifting a cup depending on whether it is full or empty, the context (full/empty) can aid the system to pick the correct inverse model. Without the RP, the system would not be able to tell which of the inverse models were most appropriate for the task, and might have to try several before finding the best inverse model. Such an approach will have lesser performance compared to a situation where the best inverse model is selected immediately. We hypothesize this model is crucial for building a predictive architecture. Its error signal is taken from the λ t vector (explained shortly).

5 A Self-Organizing Multiple Model Architecture for Motor Imitation 5 A module outputs a confidence signal λ i t representing how much control it should exert over the robot. This is calculated by multiplying the p i t with the likelihood lt. i The likelihood represents a scalar value based upon the predictive performance of the forward model (which is a vector), by summing the difference between the predicted next state and the actual next state. The lt i assumes the presence of Gaussian noise, as shown in equation (). () lt i = e x t ˆx σ πσ i t All the confidence signals are normalized into the final λ t vector. This vector is multiplied with the corresponding inverse model output before all the motor commands are summed and applied to the robot. Modules that make good predictions will have more influence over the robot than modules with bad predictions. This is how the architecture implements switching of control between modules. Furthermore, it allows for multiple modules to collaborate to control the robot. The λ signal also promotes modules with good predictions, since it gates the error signal of each module. This is achieved by multiplying the error signal with the corresponding λ value, which allows good modules to further refine their models.. Input/output of the MPMA The desired state was the 3D coordinates of the elbow and wrist position of both arms of the demonstrator. The elbow coordinates were normalized to the range [, ], with the shoulder as the origo. The wrist position was normalized to the range [, ], with the elbow as origo. The state of the robot was defined in the same way to overcome the correspondence problem [Nehaniv and Dautenhahn, ]. Findings in neuroscience anticipate a geometric stage where sensory input is transformed to postural trajectories that are meaningful to the motor system of the observer [Torres and Zipser, 4]. The same approach has been taken in previous work [Demiris and Hayes,, Tidemann and Öztürk, 7], but with joint angles as input instead of 3D coordinates. Using coordinates as opposed to joint angles presents a bigger challenge to the inverse models, since the transformation from coordinates to joint angle velocities is harder to learn than the transformation from joint angles to joint angle velocities (which is simply the derivative). The modules use the context signal to determine which module is more suitable to control the robot prior to movement. The context signal in the current experiment is the melody playing while dancing. The dancer must continuously listen to the music while dancing, thus the melody is an appropriate context signal. The MPMA outputs motor commands to the robot in the form of joint angle velocities. Real world robots use forces, but the simulator uses joint angle velocities as motor commands. Since the direction and speed of the motor commands are defined, replacing the simulator with an inverse dynamics controller that calculates forces to be applied on a real-world robot should be straightforward. Using joint angle velocity as part of the state is common in many robot systems, however this is not done in the MPMA since the inverse/forward models are dynamic systems with memory (see the next section), capable of representing the changes in coordinates (i.e. velocity) internally.

6 6 Axel Tidemann and Pinar Öztürk 3 Experimental Setup The human dance movement to be imitated was the dance to the song YMCA by The Village People, see figure. The dance was chosen for two reasons: ) the song (and dance) is well-known and easy to explain verbally (e.g. as forming the letters YMCA with your arms) and ) it is complex enough to make it an interesting imitation task. To track human dance movements, a Pro Reflex tracking system was used. By placing fluorescent markers on the body of the dancer, the system is able to track the movement in three dimensions over time using five infrared cameras, with a sampling frequency f = Hz. Markers were put on the wrist, elbow and shoulder of both arms of the dancer. After tracking, each th sample was extracted from the noisy data (in other words, the models had to predict.5 seconds into the future) and used as desired state to the MPMA. Since we wanted to examine to which extent the modules capture specific parts of the movement, we repeated the desired state three times (i.e. the movement to be imitated consisted of the YMCA trajectory four times) and added small amounts of noise (%) during training. If the modules became experts on particular parts of the trajectory, their activations should exhibit a repeating pattern. We implemented a four degree of freedom (DOF) model of a human arm [Tolani and Badler, 996] as the robot simulator, since the experiment consisted solely of imitating arm movements. Each arm had a 3DOF spherical shoulder joint and a DOF revolute elbow joint; the entire simulated robot was described with 8DOF. All the models (inverse/forward/rp) in the architecture were implemented using Echo State Networks (ESNs) [Jaeger and Haas, 4]. An ESN is a recurrent neural network that is characterized with two features: ) a large, sparsely connected hidden layer and ) only the output layer weights are modified during training. When the ESN is created, the input weights are randomly generated, and not changed during training. The desired output sequence can then be found simply by linear regression. Traditional backpropagation networks modify all the layers during training, and therefore require a lot more computations to converge. The input/output of each model will now be specified. The inverse model had 4 inputs. signals defined the current state of the simulated robot; recall that both elbow and wrist positions were given as 3D coordinates in the range [, ] for each arm. The desired state was defined the same way. The inverse model had 8 outputs in the range [, ], corresponding to the degrees of freedom for the robot, as explained earlier. The forward model had input signals: signals for the current state, and 8 signals from the paired inverse model. The forward model had output signals (range [, ]), describing the predicted next state. The RP had four inputs representing the context signal (defined as the melody playing while dancing, see figure ). It had a single output, representing the predicted suitability of the module to control the robot, and was in the range [, ]. We wanted to test the performance of the system with different number of nodes in the hidden layer; there were four different configurations for all the networks in the architecture: 5,, and 4 nodes in the hidden layer. Common to all the networks were the spectral radius α, defining the length of the memory (range [, ]). All the networks in all configurations had α =., enabling fast memory. The noise level v =. for all networks, adding % noise to the internal

7 A Self-Organizing Multiple Model Architecture for Motor Imitation 7 state of the network. These parameters were found by experimental validation, as recommended by Jaeger [5]. With such a high dimensional system, it is crucial to have good error signals to ensure convergence. The arm model of Tolani and Badler allow joint angle rotations to be found analytically from the positions of the elbow and wrist. This allows differences in desired and actual state to be expressed as differences in rotational angles, which the feedback controller adds to the final motor command to pull the system towards the desired state. Recall that the u feedback also serves as training signal for the inverse models; since it is found analytically from differences in desired and actual state it is an accurate error signal, guiding the inverse models towards learning the correct input/output relationship. To further exploit the good error signals and ensure quick convergence, the output gain K of the feeback error controller was stronger than the output gain L of the inverse models during the early stages of training. For the first training epoch, K =, L =., which would force the system towards the desired trajectory based almost exclusively on the feedback error signal. Since all the networks are generated at random, they will not produce the desired behaviour at the first epoch. L increased and K decreased linearly as the performance of the system increased. The training would not stop until L = and K <.5, to ensure that most of the motor control came from the inverse models of the architecture, but still allowing online corrective control from the feedback controller. The likelihood li t quantifies prediction performance as a scalar, based on a prediction error vector. li t should thus give a high probability to predictions that have a relatively low error. Through trial and error, we found a rule of thumb that σ should be in the range of -5% of the maximum error signal. The likelihood follows the Gaussian distribution; an error within one σ from µ = (i.e. no error) will be rewarded fairly high, since N(σ, σ)/n(, σ) =.665. The forward model has outputs in this experiment with range [, ], yielding the maximum summed error as 4 (e.g. desired state was all, actual state all ). σ = 3 (.5% of the maximum error) is thus a good choice. If the σ value is too small, the architecture becomes unstable, since only perfect predictions will achieve a high likelihood. If the σ value is too large, the performance of the forward model will not matter at all, since all predictions will be given a relatively high likelihood. It is therefore important to define an appopriate σ value; our rule of thumb should be a good starting point. There were two stopping criteria related to the performance of the architecture: ) the output p t i of the RP with respect to the normalized λt i had to be less than 3%, and ) the trajectory produced by the system could not differ more than 3% from the desired trajectory. There were four modules in the architecture. We intended that the architecture would decompose the system in accordance with the context signal, i.e. one module would control the movement for each of the letters in YMCA (see figure ), coinciding with the melody playing. In other words, the context signal represents our intention of how the movement should be decomposed in terms of motor primitives. Haruno et al. [] implemented an earlier version of the MOSAIC architecture using three modules corresponding to three different objects that were to be moved. In the current experiment there are four letters to be learned (i.e.

8 8 Axel Tidemann and Pinar Öztürk one letter corresponds to the trajectory required to form a letter with the arms), thus four modules would seem to be an appropriate choice. We also wanted to examine how the responsibility predictor influenced the architecture. Recall that the novelty with the MPMA is the consistent inverse/forward ordering (present in HAMMER and in MOSAIC during action observation; this is also similar to the approach by Jordan and Rumelhart [99]) with the use of the responsibility predictor. Two extra conditions were examined: ) after training the architecture the RP was disabled, and the architecture was set to imitate the movement it had learned during training, and ) the architecture was trained without the use of a RP. These tests will show how important the RP is as a part of a predictive architecture. Without the RP, the likelihood is the only way to differentiate between modules when controlling the robot, this is similar to the confidence measure used in HAMMER. The system was implemented in MatLab. 4 Results Each of the network configurations (5,, and 4 nodes in the hidden layer) were run times. Table shows how the architecture performed with the different network configurations. A hidden layer size of might be best with regards to the number of epochs required to train the architecture and the Σu feedback /Σu t ratio, however performance-wise the networks do not differ much. An example of the imitative performance of the architecture can be seen in figure 3; the strict stopping criteria ensured a close match between desired and actual trajectory (see figures 4 and 5). The Σu feedback /Σu t ratio shows how much control the feedback motor controller exerted over the simulated robot at the last epoch. Even though the gain of the u feedback was less than.5 (inverse model gain was ), it still has a significant influence (slightly less than /4 on average). Even though the inverse models of the architecture control most of the output, the u feedback is needed to ensure robustness. Table also shows the average number of active modules when the simulated robot was to imitate the YMCA. We defined a module to be active during one of the letters when λ i t >. for at least 5% of the context signal, excluding small bursts of activity, but including persisting small contributions. The result is an indication of how much the modules would collaborate or dominate when controlling the robot. An example can be seen in figure 6, demonstrating collaboration and domination between modules, and how the architecture successfully self-organizes the decomposition of the target trajectory into submovements. To complement figure 6, we examined the neural activation of each module. Figure 7 shows the actions of each of the inverse models. Although the motor output activation patterns might be hard to interpret in terms of physical actions since they show 8 rotational velocities, it presents another important result: the recurrent neural activation patterns confirm that modules become experts on specific parts of the trajectory. The next step consisted of disabling the RP of the trained networks, to see how well the architecture performed without it. Table shows the performance of the different networks. For all network configurations, the performance error mean is more than twice as high as for the original experiment. Even if the error is twice as high, it is still around 6.5%, which is a fairly low error. The feedback error ratio

9 A Self-Organizing Multiple Model Architecture for Motor Imitation 9 reveals that the architecture performs worse without the RP; the ratios are about double compared to the experiments done with RP. These high ratios explain the low performance error, since the feedback error motor controller successfully pulls the system in the right direction when the architecture issues bad motor commands. Furthermore, the number of active modules have risen for all experiments. Instead of separating control of different parts of the movement, all modules contribute to the movement at the same time. This indicates that the RP plays an important role in dividing the motor space between the modules. However, since the architecture was trained with RP, it is not too surprising that the removal of the RP leads to worse performance. This is the reason why we trained the architecture without the RP as well. Initially, all parameters were kept the same when training the networks without RP. However, none of the networks would converge to the stopping criteria of a performance error less than 3%. The stopping criteria had to be tripled to 9%, and even then some of the network configurations would not converge. Table 3 shows the results of experiments where the architecture was trained without RP. Even with the relaxed stopping criteria, all of the networks with 5 and nodes in the hidden layer did not converge. Only % of the networks with nodes in the hidden layer converged; 75% converged with 4 nodes in the hidden layer. Both the performance error and feedback error ratios are much higher than the experiments where the RP was disabled after training; similarily, all modules were active during all parts of the movement. The decrease in performance and increase in motor error ratio is a testament to the importance of the RP. When training with RP, it clearly aids the separation and specialization of modules, since the performance error and feedback error ratios were lower when the RP was disabled but used during training, compared to when it was not used during training at all. There is also an indication that the RP allows for neural resources to be used more efficiently: when trained without RP, the networks with 5 and nodes in the hidden layer did not converge, and only % of the networks with nodes in the hidden layer. When trained with the RP, the networks with 5 and nodes in the hidden layer performed on par with the networks with bigger hidden layers. 5 Discussion Previous work has shown that the modules were capable of decomposing the movement to be imitated into different submovements [Tidemann and Öztürk, 7], albeit in a much simpler environment: there were only four degrees of freedom for the robot; state was described by four joint angles as opposed to 3D coordinates, and the neural networks were standard backpropagation recurrent neural networks with fewer nodes in the hidden layer. The previous work did not establish to what extent the modules captured different motor primitives. When the movement to be imitated includes a repetition of a certain trajectory, we have shown how each module becomes experts on specific parts of the trajectory. This is confirmed by the repeated domination/collaboration when controlling the robot (figure 6) and the repeated neural activation patterns (figure 7). The results show that a single module tends to dominate control during the Y and A parts of the movement. For all experiments, the control during the M and

10 Axel Tidemann and Pinar Öztürk C letters tends to be collaborative, i.e. modules share control of the robot. The Y and A letters are symmetrical and are mostly controlled by the shoulder joints. On the other hand, the elbow joints become involved during control of the letters M and C. The tendency to share responsibility during these letters could then be due to the increase in number of joints moving to form the letters (figure 7 shows how the motor outputs of the elbows become active during the M and C letters). Table also shows that most modules collaborate during control of the letter C. The letter C has movements both along the shoulder joints and the right elbow joint. It is also the only letter which is not symmetrical in its movement, unlike Y, M and A. This indicates that the letter C is harder to learn due to its lack of symmetry in addition to moving both shoulder and elbow joints, and therefore more modules collaborate to control the robot. The claim that the increase in complexity (and the reduction in symmetry) may be the reason why modules tend to collaborate, is backed up by neuroscientific findings, where nonsymmetric action in bimanual movements interfere and takes longer time to execute than symmetrical movements [Diedrichsen et al., ]. Asymmetric bimanual movements also demonstrate less interhemispheric correlations compared to symmetrical ones, indicating that the reduction in coupling between areas in the brain is why asymmetric movements are harder to do [Cardoso de Oliveira et al., ]. Is there an explanation to why sharing of motor control occurs? According to Cardoso de Oliveira [], both shared and separate motor codes exist in the brain, and these can explain why different degrees of coupling in neural activity are observed when the subject is performing symmetric (strong correlation) and asymmetric (weak correlation) movements. The model also fits nicely with the observation of active modules in the MPMA, i.e. some motor control is shared (collaboration), whereas some is separate (domination). For all experiments when the architecture is trained and tested with RP, the modules tend to switch in accordance with the context signal. This happens regardless of whether the modules dominate or collaborate (see figure 6). We designed the switches in the context signal according to our cognitive grouping of the movement (i.e. we set the boundaries in accordance with the letters formed), but it is the modules that determine how to represent these trajectories in the motor control space. We speculate that these trajectories on which modules become specialized can be called motor primitives. In the MPMA, these motor primitives are represented similarily to the muscle synergy notion explained by d Avella and Bizzi [5]. A motor primitive is not necessarily represented by a single module but is often distributed to several modules, meaning that the representation is grounded on muscle- or joint-synergies. The ability to spread motor primitives across modules can be seen as a desirable property of the MPMA; Davidson and Wolpert [4] has suggested that MOSAIC should be revised to allow for share of motor control between modules. 6 Conclusion We have implemented a connectionist self-organizing architecture for motor learning and control that builds upon the HAMMER and MOSAIC architectures. We have shown through studies of low-level neural activation how the architecture

11 A Self-Organizing Multiple Model Architecture for Motor Imitation self-organizes the decomposition of the movement into submovements, represented by specific modules. We have also shown how Wolpert s idea of introducing a responsibility predictor is crucial to make the neural resources specialize into different parts of the movement. This specialization was also shown to increase the performance of the architecture. Furthermore, the connections between theories and experimentally verified claims in psychology and neuroscience and what is observed in the MPMA is appealing. However, it is important to keep focus on the scope of the architecture. The MPMA is nowhere near the complexity of the brain. But observing these phenomena in an architecture that was inspired from how the brain works is an indication that the architecture inhibits certain desirable properties, and that it is a good starting point for further work when it comes to develop agent architectures for motor control and learning. 7 Future Work Future work will investigate how the MPMA scales with increasing lengths and complexities of movements to be imitated, to see if there are saturation points in the architecture where more neural resources must be added. Related to this is investigating how the MPMA functions with different number of modules. Another focus is investigating how the MPMA captures attractors through self-organization [Kuniyoshi et al., 3], along with methodology for evaluating which modules capture what in the motor control space. We are also interested in how the architecture reacts to perturbations of the target state. 8 Acknowledgements The authors wish to thank Ruud van der Weel for using the Pro Reflex system. References Bernard Ans, Stéphane Rousset, Robert M. French, and Serban Musca. Selfrefreshing memory in artificial neural networks: learning temporal structures without catastrophic forgetting. Connection Science, 6():7 99, June 4. Michael Arbib. Imitation in animals and artifacts, chapter The Mirror System, Imitation, and the Evolution of Language, pages 9 8. MIT Press, Cambridge,. S. Cardoso de Oliveira. The neuronal basis of bimanual coordination: Recent neurophysiological evidence and functional models. Acta Psychologica, :39 59,. S. Cardoso de Oliveira, A. Gribova, O. Donchin, H. Bergman, and E. Vaadia. Neural interactions between motor cortical hemispheres during bimanual and unimanual arm movements. European Journal of Neuroscience, 4():88 896,. ISSN 95386X.

12 Axel Tidemann and Pinar Öztürk Andrea d Avella and Emilio Bizzi. Shared and specific muscle synergies in natural motor behaviors. PNAS, (8):376 38, 5. doi:.73/pnas.599. URL P.R. Davidson and D.M. Wolpert. Internal models underlying grasp can be additively combined. Experimental Brain Research, 55(3):334 34, 4. Yiannis Demiris and Gillian Hayes. Imitation in animals and artifacts, chapter Imitation as a dual-route process featuring predictive and learning components: a biologically-plausible computational model, pages MIT Press, Cambridge,. Yiannis Demiris and Bassam Khadhouri. Hierarchical attentive multiple models for execution and recognition of actions. Robotics and Autonomous Systems, 54: , 6. Jorn Diedrichsen, Eliot Hazeltine, Steven Kennerley, and Richard B. Ivry. Moving to directly cued locations abolishes spatial interference during bimanual actions. Psychological Science, (6): ,. doi:./ URL D.S. Farrar and D. Zipser. Neural network models of bilateral coordination. Biol Cybern, 8(3):5 5, 999. Vittorio Gallese and Alvin Goldman. Mirror neurons and the simulation theory of mind-reading. Trends in Cognitive Sciences, (), 998. Masahiko Haruno, Daniel M. Wolpert, and Mitsuo Kawato. MOSAIC model for sensorimotor learning and control. Neural Comp., 3():,. URL Robert A. Jacobs, Micheal I. Jordan, Steven J. Nowlan, and Geoffrey E. Hinton. Adaptive mixtures of local experts. Neural Computation, 3:79 87, 99. Herbert Jaeger. Tutorial on training recurrent neural networks, covering BPPT, RTRL, EKF and the echo state network. Technical report, German National Research Institute for Information Technology, 5. Herbert Jaeger and Harald Haas. Harnessing Nonlinearity: Predicting Chaotic Systems and Saving Energy in Wireless Communication. Science, 34(5667):78 8, 4. doi:.6/science.977. URL Michael I. Jordan and David E. Rumelhart. Forward models: Supervised learning with a distal teacher. Cognitive Science, 6:37 354, 99. Mitsuo Kawato. Feedback-error-learning neural network for supervised motor learning. In R. Eckmiller, editor, Advanced neural computers, pages , 99. Yasuo Kuniyoshi, Yasuaki Yorozu, Yoshiyuki Ohmura, Koji Terada, Takuya Otani, Akihiko Nagakubo, and Tomoyuki Yamamoto. From humanoid embodiment to theory of mind. In Samuel Pierre, Michel Barbeau, and Evangelos Kranakis, editors, Embodied Artificial Intelligence, volume 865 of Lecture Notes in Computer Science, pages 8. Springer, 3. ISBN

13 A Self-Organizing Multiple Model Architecture for Motor Imitation 3 Andrew N. Meltzoff and M. Keith Moore. Explaining facial imitation: A theoretical model. Early Development and Parenting, 6:79 9, 997. Chrystopher L. Nehaniv and Kerstin Dautenhahn. Imitation in Animals and Artifacts, chapter The Correspondence Problem, pages MIT Press, Cambridge,. Rolf Pfeifer and Christian Scheier. Understanding Intelligence. MIT Press, Cambridge, MA, USA,. ISBN 6665X. Illustrator-Isabelle Follath. Jean Piaget. Play, dreams and imitation in childhood. W. W. Norton, New York, 96. ISBN Giacomo Rizzolatti, Luciano Fadiga, Vittorio Gallese, and Leonardo Fogassi. Premotor cortex and the recognition of motor actions. Cognitive Brain Research, 3: 3 4, 996. Stefan Schaal. Is imitation learning the route to humanoid robots? Trends in Cognitive Sciences, 3(6):33 4, 999. Axel Tidemann and Pinar Öztürk. Self-organizing multiple models for imitation: Teaching a robot to dance the YMCA. In IEA/AIE, volume 457 of Lecture Notes in Computer Science, pages 9 3. Springer, June 7. Deepak Tolani and Norman I. Badler. Real-time inverse kinematics of the human arm. Presence, 5(4):393 4, 996. Elizabeth B. Torres and David Zipser. Simultaneous control of hand displacements and rotations in orientation-matching experiments. J Appl Physiol, 96(5): , 4. doi:.5/japplphysiol URL Daniel M. Wolpert, R. Chris Miall, and Mitsuo Kawato. Internal models in the cerebellum. Trends in Cognitive Sciences, (9), 998. Daniel M. Wolpert, Kenji Doya, and Mitsuo Kawato. A unifying computational framework for motor control and social interaction. Philosophical Transactions: Biological Sciences, 358(43):593 6, 3.

14 4 9 Axel Tidemann and Pinar O ztu rk Figures yt x i pt RESPONSIBILITY PREDICTOR i λt lit xt xt uit x't+ - x it+ INVERSE MODEL NORMALIZATION LIKELIHOOD λt FORWARD MODEL x - FEEDBACK CONTROLLER + ufeedback PLANT ut xt+ Figure The multiple paired models architecture, inspired from Wolpert et al. [3] and Demiris and Khadhouri [6]. See the text for details I 48 I 4 I I Figure The movement to be imitated by the simulated robot. This is the dance to the song YMCA by The Village People. The letters Y M C A are formed using arm movements. The numbers show at which timestep the letter begins, coinciding with the context signal (seen as the four-digit vector). The entire trajectory was repeated three times Figure 3 An example of the imitation performed by the simulated robot

15 A Self-Organizing Multiple Model Architecture for Motor Imitation 5.5 Target state Actual state Right elbow X Right elbow Y Right elbow Z Right wrist X Right wrist Y Right wrist Z Figure 4 Actual trajectory versus desired trajectory, showing the close match between actual and desired trajectory, right arm. The grey background shows when the context signal switches, along with the letter formed with the arms.

16 6 Axel Tidemann and Pinar Öztürk Left elbow X Left elbow Y Left elbow Z Left wrist X Left wrist Y Left wrist Z Figure 5 Same as for figure 5, this shows the match for the left arm.

17 A Self-Organizing Multiple Model Architecture for Motor Imitation 7 epoch.5 λ rp λ values and RP output, module λ values and RP output, module λ values and RP output, module λ values and RP output, module 4 Figure 6 An example of collaboration and domination between modules when imitating the YMCA (4 nodes in the hidden layer, same experiment as in figures 4 and 5). The plot shows how the MPMA self-organizes the decomposition of the movement into submovements, and how these submovements are represented by different modules. The overlap in RP output and λ demonstrates stability in the architecture, since the RP made accurate predictions of how much its module would end up controlling the robot. λ rp λ rp λ rp

18 8 Axel Tidemann and Pinar Öztürk. Y M C A Y M C A Y M C A Y M C Module A Module. Module Module 45 ufb.... Motor output, right shoulder θ() Motor output, right shoulder θ() Motor output, right shoulder θ(3) Motor output, right elbow θ Motor output, left shoulder θ() Motor output, left shoulder θ() Motor output, left shoulder θ(3) Motor output, left elbow θ Figure 7 Motor outputs of each module and u feedback as they are sent to the robot, same experiment as in figures 4-6. The recurring neural activation patterns show how the modules become experts on specific parts of the trajectory.

19 A Self-Organizing Multiple Model Architecture for Motor Imitation 9 Tables Table The performance error p e shows the difference (in percent) from the desired state to the actual state at the last epoch. The Σu fb /Σu t ratio shows how much the feedback error motor controller influenced the total motor command at the last epoch. The average number of active modules shows when and to what extent modules would dominate or collaborate when controlling the robot. µ and σ are parameters of the normal distribution. Nodes in Epochs p e Σu fb /Σu t Average active modules hidden layer µ/σ µ/σ µ/σ Y M C A 5 8/ %/.3%.3/ /5.7.73%/.7%.4/ / %/.7%./ /4.7.79%/.6%.3/ Table Experiments where the RP was used to train the architecture, but testing was performed where the RP was disabled. Same variables as in table. The architecture clearly performs worse than when trained with RP, as can be seen as the increase in p e and Σu fb /Σu t ratio. Nodes in p e Σu fb /Σu t Average active modules hidden layer µ/σ µ/σ Y M C A %/.65%.44/ %/.37%.45/ %/.3%.4/ %/.64%.43/ Table 3 Training and testing without the RP. Stopping criteria needed to be tripled to 9%, otherwise none of the experiments would converge. Still, the networks with 5 and nodes in the hidden layer did not converge. Performance of the converging networks are worse compared to both training and testing with RP (table ) and training with RP and disabling it during testing (table ). Nodes Conv. Epochs p e Σu fb /Σu t Average active modules exp. µ/σ µ/σ µ/σ Y M C A 5 % N/A N/A N/A N/A N/A N/A N/A % N/A N/A N/A N/A N/A N/A N/A % 5/.7 8.9%/.%.58/ % 7/ %/.8%.55/

Cognition & Robotics. EUCog - European Network for the Advancement of Artificial Cognitive Systems, Interaction and Robotics

Cognition & Robotics. EUCog - European Network for the Advancement of Artificial Cognitive Systems, Interaction and Robotics Cognition & Robotics Recent debates in Cognitive Robotics bring about ways to seek a definitional connection between cognition and robotics, ponder upon the questions: EUCog - European Network for the

More information

Robot Learning by Demonstration using Forward Models of Schema-Based Behaviors

Robot Learning by Demonstration using Forward Models of Schema-Based Behaviors Robot Learning by Demonstration using Forward Models of Schema-Based Behaviors Adam Olenderski, Monica Nicolescu, Sushil Louis University of Nevada, Reno 1664 N. Virginia St., MS 171, Reno, NV, 89523 {olenders,

More information

Evolving High-Dimensional, Adaptive Camera-Based Speed Sensors

Evolving High-Dimensional, Adaptive Camera-Based Speed Sensors In: M.H. Hamza (ed.), Proceedings of the 21st IASTED Conference on Applied Informatics, pp. 1278-128. Held February, 1-1, 2, Insbruck, Austria Evolving High-Dimensional, Adaptive Camera-Based Speed Sensors

More information

Implicit Fitness Functions for Evolving a Drawing Robot

Implicit Fitness Functions for Evolving a Drawing Robot Implicit Fitness Functions for Evolving a Drawing Robot Jon Bird, Phil Husbands, Martin Perris, Bill Bigge and Paul Brown Centre for Computational Neuroscience and Robotics University of Sussex, Brighton,

More information

Adaptive Humanoid Robot Arm Motion Generation by Evolved Neural Controllers

Adaptive Humanoid Robot Arm Motion Generation by Evolved Neural Controllers Proceedings of the 3 rd International Conference on Mechanical Engineering and Mechatronics Prague, Czech Republic, August 14-15, 2014 Paper No. 170 Adaptive Humanoid Robot Arm Motion Generation by Evolved

More information

EMERGENCE OF COMMUNICATION IN TEAMS OF EMBODIED AND SITUATED AGENTS

EMERGENCE OF COMMUNICATION IN TEAMS OF EMBODIED AND SITUATED AGENTS EMERGENCE OF COMMUNICATION IN TEAMS OF EMBODIED AND SITUATED AGENTS DAVIDE MAROCCO STEFANO NOLFI Institute of Cognitive Science and Technologies, CNR, Via San Martino della Battaglia 44, Rome, 00185, Italy

More information

Dipartimento di Elettronica Informazione e Bioingegneria Robotics

Dipartimento di Elettronica Informazione e Bioingegneria Robotics Dipartimento di Elettronica Informazione e Bioingegneria Robotics Behavioral robotics @ 2014 Behaviorism behave is what organisms do Behaviorism is built on this assumption, and its goal is to promote

More information

A Numerical Approach to Understanding Oscillator Neural Networks

A Numerical Approach to Understanding Oscillator Neural Networks A Numerical Approach to Understanding Oscillator Neural Networks Natalie Klein Mentored by Jon Wilkins Networks of coupled oscillators are a form of dynamical network originally inspired by various biological

More information

A neuronal structure for learning by imitation. ENSEA, 6, avenue du Ponceau, F-95014, Cergy-Pontoise cedex, France. fmoga,

A neuronal structure for learning by imitation. ENSEA, 6, avenue du Ponceau, F-95014, Cergy-Pontoise cedex, France. fmoga, A neuronal structure for learning by imitation Sorin Moga and Philippe Gaussier ETIS / CNRS 2235, Groupe Neurocybernetique, ENSEA, 6, avenue du Ponceau, F-9514, Cergy-Pontoise cedex, France fmoga, gaussierg@ensea.fr

More information

From exploration to imitation: using learnt internal models to imitate others

From exploration to imitation: using learnt internal models to imitate others From exploration to imitation: using learnt internal models to imitate others Anthony Dearden and Yiannis Demiris 1 Abstract. We present an architecture that enables asocial and social learning mechanisms

More information

Optic Flow Based Skill Learning for A Humanoid to Trap, Approach to, and Pass a Ball

Optic Flow Based Skill Learning for A Humanoid to Trap, Approach to, and Pass a Ball Optic Flow Based Skill Learning for A Humanoid to Trap, Approach to, and Pass a Ball Masaki Ogino 1, Masaaki Kikuchi 1, Jun ichiro Ooga 1, Masahiro Aono 1 and Minoru Asada 1,2 1 Dept. of Adaptive Machine

More information

The Digital Synaptic Neural Substrate: Size and Quality Matters

The Digital Synaptic Neural Substrate: Size and Quality Matters The Digital Synaptic Neural Substrate: Size and Quality Matters Azlan Iqbal College of Computer Science and Information Technology, Universiti Tenaga Nasional Putrajaya Campus, Jalan IKRAM-UNITEN, 43000

More information

Robots Learning from Robots: A proof of Concept Study for Co-Manipulation Tasks. Luka Peternel and Arash Ajoudani Presented by Halishia Chugani

Robots Learning from Robots: A proof of Concept Study for Co-Manipulation Tasks. Luka Peternel and Arash Ajoudani Presented by Halishia Chugani Robots Learning from Robots: A proof of Concept Study for Co-Manipulation Tasks Luka Peternel and Arash Ajoudani Presented by Halishia Chugani Robots learning from humans 1. Robots learn from humans 2.

More information

PREDICTION OF FINGER FLEXION FROM ELECTROCORTICOGRAPHY DATA

PREDICTION OF FINGER FLEXION FROM ELECTROCORTICOGRAPHY DATA University of Tartu Institute of Computer Science Course Introduction to Computational Neuroscience Roberts Mencis PREDICTION OF FINGER FLEXION FROM ELECTROCORTICOGRAPHY DATA Abstract This project aims

More information

Interaction rule learning with a human partner based on an imitation faculty with a simple visuo-motor mapping

Interaction rule learning with a human partner based on an imitation faculty with a simple visuo-motor mapping Robotics and Autonomous Systems 54 (2006) 414 418 www.elsevier.com/locate/robot Interaction rule learning with a human partner based on an imitation faculty with a simple visuo-motor mapping Masaki Ogino

More information

PeriPersonal Space on the icub

PeriPersonal Space on the icub EXPANDING SENSORIMOTOR CAPABILITIES OF HUMANOID ROBOTS THROUGH MULTISENSORY INTEGRATION : RobotCub Consortium. License GPL v2.0. This content is excluded from our Creative Commons license. For more information,

More information

Computing with Biologically Inspired Neural Oscillators: Application to Color Image Segmentation

Computing with Biologically Inspired Neural Oscillators: Application to Color Image Segmentation Computing with Biologically Inspired Neural Oscillators: Application to Color Image Segmentation Authors: Ammar Belatreche, Liam Maguire, Martin McGinnity, Liam McDaid and Arfan Ghani Published: Advances

More information

Behaviour-Based Control. IAR Lecture 5 Barbara Webb

Behaviour-Based Control. IAR Lecture 5 Barbara Webb Behaviour-Based Control IAR Lecture 5 Barbara Webb Traditional sense-plan-act approach suggests a vertical (serial) task decomposition Sensors Actuators perception modelling planning task execution motor

More information

Multi-Platform Soccer Robot Development System

Multi-Platform Soccer Robot Development System Multi-Platform Soccer Robot Development System Hui Wang, Han Wang, Chunmiao Wang, William Y. C. Soh Division of Control & Instrumentation, School of EEE Nanyang Technological University Nanyang Avenue,

More information

Booklet of teaching units

Booklet of teaching units International Master Program in Mechatronic Systems for Rehabilitation Booklet of teaching units Third semester (M2 S1) Master Sciences de l Ingénieur Université Pierre et Marie Curie Paris 6 Boite 164,

More information

Real-time human control of robots for robot skill synthesis (and a bit

Real-time human control of robots for robot skill synthesis (and a bit Real-time human control of robots for robot skill synthesis (and a bit about imitation) Erhan Oztop JST/ICORP, ATR/CNS, JAPAN 1/31 IMITATION IN ARTIFICIAL SYSTEMS (1) Robotic systems that are able to imitate

More information

Artificial Neural Networks. Artificial Intelligence Santa Clara, 2016

Artificial Neural Networks. Artificial Intelligence Santa Clara, 2016 Artificial Neural Networks Artificial Intelligence Santa Clara, 2016 Simulate the functioning of the brain Can simulate actual neurons: Computational neuroscience Can introduce simplified neurons: Neural

More information

SECOND YEAR PROJECT SUMMARY

SECOND YEAR PROJECT SUMMARY SECOND YEAR PROJECT SUMMARY Grant Agreement number: 215805 Project acronym: Project title: CHRIS Cooperative Human Robot Interaction Systems Period covered: from 01 March 2009 to 28 Feb 2010 Contact Details

More information

Robot Imitation from Human Body Movements

Robot Imitation from Human Body Movements Robot Imitation from Human Body Movements Carlos A. Acosta Calderon and Huosheng Hu Department of Computer Science, University of Essex Wivenhoe Park, Colchester CO4 3SQ, United Kingdom caacos@essex.ac.uk,

More information

Affordance based Human Motion Synthesizing System

Affordance based Human Motion Synthesizing System Affordance based Human Motion Synthesizing System H. Ishii, N. Ichiguchi, D. Komaki, H. Shimoda and H. Yoshikawa Graduate School of Energy Science Kyoto University Uji-shi, Kyoto, 611-0011, Japan Abstract

More information

CHAPTER 8: EXTENDED TETRACHORD CLASSIFICATION

CHAPTER 8: EXTENDED TETRACHORD CLASSIFICATION CHAPTER 8: EXTENDED TETRACHORD CLASSIFICATION Chapter 7 introduced the notion of strange circles: using various circles of musical intervals as equivalence classes to which input pitch-classes are assigned.

More information

ENHANCED HUMAN-AGENT INTERACTION: AUGMENTING INTERACTION MODELS WITH EMBODIED AGENTS BY SERAFIN BENTO. MASTER OF SCIENCE in INFORMATION SYSTEMS

ENHANCED HUMAN-AGENT INTERACTION: AUGMENTING INTERACTION MODELS WITH EMBODIED AGENTS BY SERAFIN BENTO. MASTER OF SCIENCE in INFORMATION SYSTEMS BY SERAFIN BENTO MASTER OF SCIENCE in INFORMATION SYSTEMS Edmonton, Alberta September, 2015 ABSTRACT The popularity of software agents demands for more comprehensive HAI design processes. The outcome of

More information

A Robust Neural Robot Navigation Using a Combination of Deliberative and Reactive Control Architectures

A Robust Neural Robot Navigation Using a Combination of Deliberative and Reactive Control Architectures A Robust Neural Robot Navigation Using a Combination of Deliberative and Reactive Control Architectures D.M. Rojas Castro, A. Revel and M. Ménard * Laboratory of Informatics, Image and Interaction (L3I)

More information

Cynthia Breazeal and Brian Scassellati

Cynthia Breazeal and Brian Scassellati Cynthia Breazeal and Brian Scassellati The study of social learning in robotics has been motivated by both scientific interest in the learning process and practical desires to produce machines that are

More information

! The architecture of the robot control system! Also maybe some aspects of its body/motors/sensors

! The architecture of the robot control system! Also maybe some aspects of its body/motors/sensors Towards the more concrete end of the Alife spectrum is robotics. Alife -- because it is the attempt to synthesise -- at some level -- 'lifelike behaviour. AI is often associated with a particular style

More information

Emergent imitative behavior on a robotic arm based on visuo-motor associative memories

Emergent imitative behavior on a robotic arm based on visuo-motor associative memories The 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems October 18-22, 2010, Taipei, Taiwan Emergent imitative behavior on a robotic arm based on visuo-motor associative memories Antoine

More information

MINE 432 Industrial Automation and Robotics

MINE 432 Industrial Automation and Robotics MINE 432 Industrial Automation and Robotics Part 3, Lecture 5 Overview of Artificial Neural Networks A. Farzanegan (Visiting Associate Professor) Fall 2014 Norman B. Keevil Institute of Mining Engineering

More information

Joint attention between a humanoid robot and users in imitation game

Joint attention between a humanoid robot and users in imitation game Joint attention between a humanoid robot and users in imitation game Masato Ito Sony Corporation 6-7-35 Kitashinagawa, Shinagawa-ku Tokyo, 141-0001, Japan masato@pdp.crl.sony.co.jp Jun Tani Brain Science

More information

Classifying the Brain's Motor Activity via Deep Learning

Classifying the Brain's Motor Activity via Deep Learning Final Report Classifying the Brain's Motor Activity via Deep Learning Tania Morimoto & Sean Sketch Motivation Over 50 million Americans suffer from mobility or dexterity impairments. Over the past few

More information

Hierarchical Controller for Robotic Soccer

Hierarchical Controller for Robotic Soccer Hierarchical Controller for Robotic Soccer Byron Knoll Cognitive Systems 402 April 13, 2008 ABSTRACT RoboCup is an initiative aimed at advancing Artificial Intelligence (AI) and robotics research. This

More information

AN IMPROVED NEURAL NETWORK-BASED DECODER SCHEME FOR SYSTEMATIC CONVOLUTIONAL CODE. A Thesis by. Andrew J. Zerngast

AN IMPROVED NEURAL NETWORK-BASED DECODER SCHEME FOR SYSTEMATIC CONVOLUTIONAL CODE. A Thesis by. Andrew J. Zerngast AN IMPROVED NEURAL NETWORK-BASED DECODER SCHEME FOR SYSTEMATIC CONVOLUTIONAL CODE A Thesis by Andrew J. Zerngast Bachelor of Science, Wichita State University, 2008 Submitted to the Department of Electrical

More information

TEMPORAL DIFFERENCE LEARNING IN CHINESE CHESS

TEMPORAL DIFFERENCE LEARNING IN CHINESE CHESS TEMPORAL DIFFERENCE LEARNING IN CHINESE CHESS Thong B. Trinh, Anwer S. Bashi, Nikhil Deshpande Department of Electrical Engineering University of New Orleans New Orleans, LA 70148 Tel: (504) 280-7383 Fax:

More information

Behavior Emergence in Autonomous Robot Control by Means of Feedforward and Recurrent Neural Networks

Behavior Emergence in Autonomous Robot Control by Means of Feedforward and Recurrent Neural Networks Behavior Emergence in Autonomous Robot Control by Means of Feedforward and Recurrent Neural Networks Stanislav Slušný, Petra Vidnerová, Roman Neruda Abstract We study the emergence of intelligent behavior

More information

Converting Motion between Different Types of Humanoid Robots Using Genetic Algorithms

Converting Motion between Different Types of Humanoid Robots Using Genetic Algorithms Converting Motion between Different Types of Humanoid Robots Using Genetic Algorithms Mari Nishiyama and Hitoshi Iba Abstract The imitation between different types of robots remains an unsolved task for

More information

A Novel Fuzzy Neural Network Based Distance Relaying Scheme

A Novel Fuzzy Neural Network Based Distance Relaying Scheme 902 IEEE TRANSACTIONS ON POWER DELIVERY, VOL. 15, NO. 3, JULY 2000 A Novel Fuzzy Neural Network Based Distance Relaying Scheme P. K. Dash, A. K. Pradhan, and G. Panda Abstract This paper presents a new

More information

This is a repository copy of Complex robot training tasks through bootstrapping system identification.

This is a repository copy of Complex robot training tasks through bootstrapping system identification. This is a repository copy of Complex robot training tasks through bootstrapping system identification. White Rose Research Online URL for this paper: http://eprints.whiterose.ac.uk/74638/ Monograph: Akanyeti,

More information

Robot Task-Level Programming Language and Simulation

Robot Task-Level Programming Language and Simulation Robot Task-Level Programming Language and Simulation M. Samaka Abstract This paper presents the development of a software application for Off-line robot task programming and simulation. Such application

More information

Hybrid architectures. IAR Lecture 6 Barbara Webb

Hybrid architectures. IAR Lecture 6 Barbara Webb Hybrid architectures IAR Lecture 6 Barbara Webb Behaviour Based: Conclusions But arbitrary and difficult to design emergent behaviour for a given task. Architectures do not impose strong constraints Options?

More information

Knowledge Representation and Reasoning

Knowledge Representation and Reasoning Master of Science in Artificial Intelligence, 2012-2014 Knowledge Representation and Reasoning University "Politehnica" of Bucharest Department of Computer Science Fall 2012 Adina Magda Florea The AI Debate

More information

Dissociating Ideomotor and Spatial Compatibility: Empirical Evidence and Connectionist Models

Dissociating Ideomotor and Spatial Compatibility: Empirical Evidence and Connectionist Models Dissociating Ideomotor and Spatial Compatibility: Empirical Evidence and Connectionist Models Ty W. Boyer (tywboyer@indiana.edu) Matthias Scheutz (mscheutz@indiana.edu) Bennett I. Bertenthal (bbertent@indiana.edu)

More information

Towards Strategic Kriegspiel Play with Opponent Modeling

Towards Strategic Kriegspiel Play with Opponent Modeling Towards Strategic Kriegspiel Play with Opponent Modeling Antonio Del Giudice and Piotr Gmytrasiewicz Department of Computer Science, University of Illinois at Chicago Chicago, IL, 60607-7053, USA E-mail:

More information

Schema Design and Implementation of the Grasp-Related Mirror Neuron System

Schema Design and Implementation of the Grasp-Related Mirror Neuron System Schema Design and Implementation of the Grasp-Related Mirror Neuron System Erhan Oztop and Michael A. Arbib erhan@java.usc.edu, arbib@pollux.usc.edu USC Brain Project University of Southern California

More information

Policy Forum. Science 26 January 2001: Vol no. 5504, pp DOI: /science Prev Table of Contents Next

Policy Forum. Science 26 January 2001: Vol no. 5504, pp DOI: /science Prev Table of Contents Next Science 26 January 2001: Vol. 291. no. 5504, pp. 599-600 DOI: 10.1126/science.291.5504.599 Prev Table of Contents Next Policy Forum ARTIFICIAL INTELLIGENCE: Autonomous Mental Development by Robots and

More information

USE OF WHITE NOISE IN TRACE/PARCS ANALYSIS OF ATWS WITH INSTABILITY

USE OF WHITE NOISE IN TRACE/PARCS ANALYSIS OF ATWS WITH INSTABILITY USE OF WHITE NOISE IN TRACE/PARCS ANALYSIS OF ATWS WITH INSTABILITY T. Zaki and P. Yarsky Nuclear Regulatory Commission Office of Nuclear Regulatory Research U.S. Nuclear Regulatory Commission, MS CSB-3A07M,

More information

Nonlinear System Identification Using Recurrent Networks

Nonlinear System Identification Using Recurrent Networks Syracuse University SURFACE Electrical Engineering and Computer Science Technical Reports College of Engineering and Computer Science 7-1991 Nonlinear System Identification Using Recurrent Networks Hyungkeun

More information

Robust Haptic Teleoperation of a Mobile Manipulation Platform

Robust Haptic Teleoperation of a Mobile Manipulation Platform Robust Haptic Teleoperation of a Mobile Manipulation Platform Jaeheung Park and Oussama Khatib Stanford AI Laboratory Stanford University http://robotics.stanford.edu Abstract. This paper presents a new

More information

Advanced Techniques for Mobile Robotics Location-Based Activity Recognition

Advanced Techniques for Mobile Robotics Location-Based Activity Recognition Advanced Techniques for Mobile Robotics Location-Based Activity Recognition Wolfram Burgard, Cyrill Stachniss, Kai Arras, Maren Bennewitz Activity Recognition Based on L. Liao, D. J. Patterson, D. Fox,

More information

Birth of An Intelligent Humanoid Robot in Singapore

Birth of An Intelligent Humanoid Robot in Singapore Birth of An Intelligent Humanoid Robot in Singapore Ming Xie Nanyang Technological University Singapore 639798 Email: mmxie@ntu.edu.sg Abstract. Since 1996, we have embarked into the journey of developing

More information

Simulating development in a real robot

Simulating development in a real robot Simulating development in a real robot Gabriel Gómez, Max Lungarella, Peter Eggenberger Hotz, Kojiro Matsushita and Rolf Pfeifer Artificial Intelligence Laboratory Department of Information Technology,

More information

IBM SPSS Neural Networks

IBM SPSS Neural Networks IBM Software IBM SPSS Neural Networks 20 IBM SPSS Neural Networks New tools for building predictive models Highlights Explore subtle or hidden patterns in your data. Build better-performing models No programming

More information

Designing Human-Robot Interactions: The Good, the Bad and the Uncanny

Designing Human-Robot Interactions: The Good, the Bad and the Uncanny Designing Human-Robot Interactions: The Good, the Bad and the Uncanny Frank Pollick Department of Psychology University of Glasgow paco.psy.gla.ac.uk/ Talk available at: www.psy.gla.ac.uk/~frank/talks.html

More information

Interference in stimuli employed to assess masking by substitution. Bernt Christian Skottun. Ullevaalsalleen 4C Oslo. Norway

Interference in stimuli employed to assess masking by substitution. Bernt Christian Skottun. Ullevaalsalleen 4C Oslo. Norway Interference in stimuli employed to assess masking by substitution Bernt Christian Skottun Ullevaalsalleen 4C 0852 Oslo Norway Short heading: Interference ABSTRACT Enns and Di Lollo (1997, Psychological

More information

Jane Li. Assistant Professor Mechanical Engineering Department, Robotic Engineering Program Worcester Polytechnic Institute

Jane Li. Assistant Professor Mechanical Engineering Department, Robotic Engineering Program Worcester Polytechnic Institute Jane Li Assistant Professor Mechanical Engineering Department, Robotic Engineering Program Worcester Polytechnic Institute Use an example to explain what is admittance control? You may refer to exoskeleton

More information

Surveillance and Calibration Verification Using Autoassociative Neural Networks

Surveillance and Calibration Verification Using Autoassociative Neural Networks Surveillance and Calibration Verification Using Autoassociative Neural Networks Darryl J. Wrest, J. Wesley Hines, and Robert E. Uhrig* Department of Nuclear Engineering, University of Tennessee, Knoxville,

More information

Levels of Description: A Role for Robots in Cognitive Science Education

Levels of Description: A Role for Robots in Cognitive Science Education Levels of Description: A Role for Robots in Cognitive Science Education Terry Stewart 1 and Robert West 2 1 Department of Cognitive Science 2 Department of Psychology Carleton University In this paper,

More information

YDDON. Humans, Robots, & Intelligent Objects New communication approaches

YDDON. Humans, Robots, & Intelligent Objects New communication approaches YDDON Humans, Robots, & Intelligent Objects New communication approaches Building Robot intelligence Interdisciplinarity Turning things into robots www.ydrobotics.co m Edifício A Moagem Cidade do Engenho

More information

Lecture IV. Sensory processing during active versus passive movements

Lecture IV. Sensory processing during active versus passive movements Lecture IV Sensory processing during active versus passive movements The ability to distinguish sensory inputs that are a consequence of our own actions (reafference) from those that result from changes

More information

Dynamic Throttle Estimation by Machine Learning from Professionals

Dynamic Throttle Estimation by Machine Learning from Professionals Dynamic Throttle Estimation by Machine Learning from Professionals Nathan Spielberg and John Alsterda Department of Mechanical Engineering, Stanford University Abstract To increase the capabilities of

More information

A developmental approach to grasping

A developmental approach to grasping A developmental approach to grasping Lorenzo Natale, Giorgio Metta and Giulio Sandini LIRA-Lab, DIST, University of Genoa Viale Causa 13, 16145, Genova Italy email: {nat, pasa, sandini}@liralab.it Abstract

More information

Chapter 2 Distributed Consensus Estimation of Wireless Sensor Networks

Chapter 2 Distributed Consensus Estimation of Wireless Sensor Networks Chapter 2 Distributed Consensus Estimation of Wireless Sensor Networks Recently, consensus based distributed estimation has attracted considerable attention from various fields to estimate deterministic

More information

CSC384 Intro to Artificial Intelligence* *The following slides are based on Fahiem Bacchus course lecture notes.

CSC384 Intro to Artificial Intelligence* *The following slides are based on Fahiem Bacchus course lecture notes. CSC384 Intro to Artificial Intelligence* *The following slides are based on Fahiem Bacchus course lecture notes. Artificial Intelligence A branch of Computer Science. Examines how we can achieve intelligent

More information

AI Principles, Semester 2, Week 1, Lecture 2, Cognitive Science and AI Applications. The Computational and Representational Understanding of Mind

AI Principles, Semester 2, Week 1, Lecture 2, Cognitive Science and AI Applications. The Computational and Representational Understanding of Mind AI Principles, Semester 2, Week 1, Lecture 2, Cognitive Science and AI Applications How simulations can act as scientific theories The Computational and Representational Understanding of Mind Boundaries

More information

An Autonomous Mobile Robot Architecture Using Belief Networks and Neural Networks

An Autonomous Mobile Robot Architecture Using Belief Networks and Neural Networks An Autonomous Mobile Robot Architecture Using Belief Networks and Neural Networks Mehran Sahami, John Lilly and Bryan Rollins Computer Science Department Stanford University Stanford, CA 94305 {sahami,lilly,rollins}@cs.stanford.edu

More information

Chapter 2 Direct-Sequence Systems

Chapter 2 Direct-Sequence Systems Chapter 2 Direct-Sequence Systems A spread-spectrum signal is one with an extra modulation that expands the signal bandwidth greatly beyond what is required by the underlying coded-data modulation. Spread-spectrum

More information

Prediction and Correction Algorithm for a Gesture Controlled Robotic Arm

Prediction and Correction Algorithm for a Gesture Controlled Robotic Arm Prediction and Correction Algorithm for a Gesture Controlled Robotic Arm Pushkar Shukla 1, Shehjar Safaya 2, Utkarsh Sharma 3 B.Tech, College of Engineering Roorkee, Roorkee, India 1 B.Tech, College of

More information

Navigation of Transport Mobile Robot in Bionic Assembly System

Navigation of Transport Mobile Robot in Bionic Assembly System Navigation of Transport Mobile obot in Bionic ssembly System leksandar Lazinica Intelligent Manufacturing Systems IFT Karlsplatz 13/311, -1040 Vienna Tel : +43-1-58801-311141 Fax :+43-1-58801-31199 e-mail

More information

Why we need to know what AI is. Overview. Artificial Intelligence is it finally arriving?

Why we need to know what AI is. Overview. Artificial Intelligence is it finally arriving? Artificial Intelligence is it finally arriving? Artificial Intelligence is it finally arriving? Are we nearly there yet? Leslie Smith Computing Science and Mathematics University of Stirling May 2 2013.

More information

The Māori Marae as a structural attractor: exploring the generative, convergent and unifying dynamics within indigenous entrepreneurship

The Māori Marae as a structural attractor: exploring the generative, convergent and unifying dynamics within indigenous entrepreneurship 2nd Research Colloquium on Societal Entrepreneurship and Innovation RMIT University 26-28 November 2014 Associate Professor Christine Woods, University of Auckland (co-authors Associate Professor Mānuka

More information

Live Feeling on Movement of an Autonomous Robot Using a Biological Signal

Live Feeling on Movement of an Autonomous Robot Using a Biological Signal Live Feeling on Movement of an Autonomous Robot Using a Biological Signal Shigeru Sakurazawa, Keisuke Yanagihara, Yasuo Tsukahara, Hitoshi Matsubara Future University-Hakodate, System Information Science,

More information

Evolutions of communication

Evolutions of communication Evolutions of communication Alex Bell, Andrew Pace, and Raul Santos May 12, 2009 Abstract In this paper a experiment is presented in which two simulated robots evolved a form of communication to allow

More information

A Hybrid Architecture using Cross Correlation and Recurrent Neural Networks for Acoustic Tracking in Robots

A Hybrid Architecture using Cross Correlation and Recurrent Neural Networks for Acoustic Tracking in Robots A Hybrid Architecture using Cross Correlation and Recurrent Neural Networks for Acoustic Tracking in Robots John C. Murray, Harry Erwin and Stefan Wermter Hybrid Intelligent Systems School for Computing

More information

Initialisation improvement in engineering feedforward ANN models.

Initialisation improvement in engineering feedforward ANN models. Initialisation improvement in engineering feedforward ANN models. A. Krimpenis and G.-C. Vosniakos National Technical University of Athens, School of Mechanical Engineering, Manufacturing Technology Division,

More information

INTELLIGENT CONTROL OF AUTONOMOUS SIX-LEGGED ROBOTS BY NEURAL NETWORKS

INTELLIGENT CONTROL OF AUTONOMOUS SIX-LEGGED ROBOTS BY NEURAL NETWORKS INTELLIGENT CONTROL OF AUTONOMOUS SIX-LEGGED ROBOTS BY NEURAL NETWORKS Prof. Dr. W. Lechner 1 Dipl.-Ing. Frank Müller 2 Fachhochschule Hannover University of Applied Sciences and Arts Computer Science

More information

Embedded Control Project -Iterative learning control for

Embedded Control Project -Iterative learning control for Embedded Control Project -Iterative learning control for Author : Axel Andersson Hariprasad Govindharajan Shahrzad Khodayari Project Guide : Alexander Medvedev Program : Embedded Systems and Engineering

More information

Digital image processing vs. computer vision Higher-level anchoring

Digital image processing vs. computer vision Higher-level anchoring Digital image processing vs. computer vision Higher-level anchoring Václav Hlaváč Czech Technical University in Prague Faculty of Electrical Engineering, Department of Cybernetics Center for Machine Perception

More information

Real Robots Controlled by Brain Signals - A BMI Approach

Real Robots Controlled by Brain Signals - A BMI Approach International Journal of Advanced Intelligence Volume 2, Number 1, pp.25-35, July, 2010. c AIA International Advanced Information Institute Real Robots Controlled by Brain Signals - A BMI Approach Genci

More information

Learning and Using Models of Kicking Motions for Legged Robots

Learning and Using Models of Kicking Motions for Legged Robots Learning and Using Models of Kicking Motions for Legged Robots Sonia Chernova and Manuela Veloso Computer Science Department Carnegie Mellon University Pittsburgh, PA 15213 {soniac, mmv}@cs.cmu.edu Abstract

More information

REINFORCEMENT LEARNING (DD3359) O-03 END-TO-END LEARNING

REINFORCEMENT LEARNING (DD3359) O-03 END-TO-END LEARNING REINFORCEMENT LEARNING (DD3359) O-03 END-TO-END LEARNING RIKA ANTONOVA ANTONOVA@KTH.SE ALI GHADIRZADEH ALGH@KTH.SE RL: What We Know So Far Formulate the problem as an MDP (or POMDP) State space captures

More information

Supplementary information accompanying the manuscript Biologically Inspired Modular Neural Control for a Leg-Wheel Hybrid Robot

Supplementary information accompanying the manuscript Biologically Inspired Modular Neural Control for a Leg-Wheel Hybrid Robot Supplementary information accompanying the manuscript Biologically Inspired Modular Neural Control for a Leg-Wheel Hybrid Robot Poramate Manoonpong a,, Florentin Wörgötter a, Pudit Laksanacharoen b a)

More information

- Basics of informatics - Computer network - Software engineering - Intelligent media processing - Human interface. Professor. Professor.

- Basics of informatics - Computer network - Software engineering - Intelligent media processing - Human interface. Professor. Professor. - Basics of informatics - Computer network - Software engineering - Intelligent media processing - Human interface Computer-Aided Engineering Research of power/signal integrity analysis and EMC design

More information

IOC, Vector sum, and squaring: three different motion effects or one?

IOC, Vector sum, and squaring: three different motion effects or one? Vision Research 41 (2001) 965 972 www.elsevier.com/locate/visres IOC, Vector sum, and squaring: three different motion effects or one? L. Bowns * School of Psychology, Uni ersity of Nottingham, Uni ersity

More information

Key-Words: - Neural Networks, Cerebellum, Cerebellar Model Articulation Controller (CMAC), Auto-pilot

Key-Words: - Neural Networks, Cerebellum, Cerebellar Model Articulation Controller (CMAC), Auto-pilot erebellum Based ar Auto-Pilot System B. HSIEH,.QUEK and A.WAHAB Intelligent Systems Laboratory, School of omputer Engineering Nanyang Technological University, Blk N4 #2A-32 Nanyang Avenue, Singapore 639798

More information

Transactions on Information and Communications Technologies vol 6, 1994 WIT Press, ISSN

Transactions on Information and Communications Technologies vol 6, 1994 WIT Press,   ISSN Application of artificial neural networks to the robot path planning problem P. Martin & A.P. del Pobil Department of Computer Science, Jaume I University, Campus de Penyeta Roja, 207 Castellon, Spain

More information

Ziemke, Tom. (2003). What s that Thing Called Embodiment?

Ziemke, Tom. (2003). What s that Thing Called Embodiment? Ziemke, Tom. (2003). What s that Thing Called Embodiment? Aleš Oblak MEi: CogSci, 2017 Before After Carravagio (1602 CE). San Matteo e l angelo Myron (460 450 BCE). Discobolus Six Views of Embodied Cognition

More information

New developments in the philosophy of AI. Vincent C. Müller. Anatolia College/ACT February 2015

New developments in the philosophy of AI. Vincent C. Müller. Anatolia College/ACT   February 2015 Müller, Vincent C. (2016), New developments in the philosophy of AI, in Vincent C. Müller (ed.), Fundamental Issues of Artificial Intelligence (Synthese Library; Berlin: Springer). http://www.sophia.de

More information

Learning Actions from Demonstration

Learning Actions from Demonstration Learning Actions from Demonstration Michael Tirtowidjojo, Matthew Frierson, Benjamin Singer, Palak Hirpara October 2, 2016 Abstract The goal of our project is twofold. First, we will design a controller

More information

Learning Reactive Neurocontrollers using Simulated Annealing for Mobile Robots

Learning Reactive Neurocontrollers using Simulated Annealing for Mobile Robots Learning Reactive Neurocontrollers using Simulated Annealing for Mobile Robots Philippe Lucidarme, Alain Liégeois LIRMM, University Montpellier II, France, lucidarm@lirmm.fr Abstract This paper presents

More information

Using Rank Order Filters to Decompose the Electromyogram

Using Rank Order Filters to Decompose the Electromyogram Using Rank Order Filters to Decompose the Electromyogram D.J. Roberson C.B. Schrader droberson@utsa.edu schrader@utsa.edu Postdoctoral Fellow Professor The University of Texas at San Antonio, San Antonio,

More information

GPU Computing for Cognitive Robotics

GPU Computing for Cognitive Robotics GPU Computing for Cognitive Robotics Martin Peniak, Davide Marocco, Angelo Cangelosi GPU Technology Conference, San Jose, California, 25 March, 2014 Acknowledgements This study was financed by: EU Integrating

More information

Cognitive Robotics 2017/2018

Cognitive Robotics 2017/2018 Cognitive Robotics 2017/2018 Course Introduction Matteo Matteucci matteo.matteucci@polimi.it Artificial Intelligence and Robotics Lab - Politecnico di Milano About me and my lectures Lectures given by

More information

FACE RECOGNITION USING NEURAL NETWORKS

FACE RECOGNITION USING NEURAL NETWORKS Int. J. Elec&Electr.Eng&Telecoms. 2014 Vinoda Yaragatti and Bhaskar B, 2014 Research Paper ISSN 2319 2518 www.ijeetc.com Vol. 3, No. 3, July 2014 2014 IJEETC. All Rights Reserved FACE RECOGNITION USING

More information

MULTI-LAYERED HYBRID ARCHITECTURE TO SOLVE COMPLEX TASKS OF AN AUTONOMOUS MOBILE ROBOT

MULTI-LAYERED HYBRID ARCHITECTURE TO SOLVE COMPLEX TASKS OF AN AUTONOMOUS MOBILE ROBOT MULTI-LAYERED HYBRID ARCHITECTURE TO SOLVE COMPLEX TASKS OF AN AUTONOMOUS MOBILE ROBOT F. TIECHE, C. FACCHINETTI and H. HUGLI Institute of Microtechnology, University of Neuchâtel, Rue de Tivoli 28, CH-2003

More information

Capturing and Adapting Traces for Character Control in Computer Role Playing Games

Capturing and Adapting Traces for Character Control in Computer Role Playing Games Capturing and Adapting Traces for Character Control in Computer Role Playing Games Jonathan Rubin and Ashwin Ram Palo Alto Research Center 3333 Coyote Hill Road, Palo Alto, CA 94304 USA Jonathan.Rubin@parc.com,

More information

CONTROL IMPROVEMENT OF UNDER-DAMPED SYSTEMS AND STRUCTURES BY INPUT SHAPING

CONTROL IMPROVEMENT OF UNDER-DAMPED SYSTEMS AND STRUCTURES BY INPUT SHAPING CONTROL IMPROVEMENT OF UNDER-DAMPED SYSTEMS AND STRUCTURES BY INPUT SHAPING Igor Arolovich a, Grigory Agranovich b Ariel University of Samaria a igor.arolovich@outlook.com, b agr@ariel.ac.il Abstract -

More information

Winner-Take-All Networks with Lateral Excitation

Winner-Take-All Networks with Lateral Excitation Analog Integrated Circuits and Signal Processing, 13, 185 193 (1997) c 1997 Kluwer Academic Publishers, Boston. Manufactured in The Netherlands. Winner-Take-All Networks with Lateral Excitation GIACOMO

More information