Asada, Minoru


On- and Off-Policy Monotonic Policy Improvement

arXiv.org Machine Learning

Monotonic policy improvement and off-policy learning are two desirable properties for reinforcement learning algorithms. In this paper, by lower bounding the performance difference of two policies, we show that monotonic policy improvement is guaranteed from a mixture of on- and off-policy samples. An optimization procedure that applies the proposed bound can be regarded as an off-policy natural policy gradient method. To support the theoretical result, we provide a trust region policy optimization method using experience replay as a naive application of our bound, and evaluate its performance on two classical benchmark problems.


Real-time face swapping as a tool for understanding infant self-recognition

arXiv.org Artificial Intelligence

To study infants' preference for contingency of movements and familiarity of faces during a self-recognition task, we built a real-time face-swapper for videos that acts as an accurate and instantaneous imitator. We present a non-constraint face-swapper based on 3D visual tracking that achieves real-time performance through parallel computing. Our imitator system is particularly suited for experiments involving children with Autistic Spectrum Disorder, who are often strongly disturbed by the constraints of other methods.


Between Frustration and Elation: Sense of Control Regulates the Intrinsic Motivation for Motor Learning

AAAI Conferences

Frustration has generally been viewed in a negative light, and its potential role in learning has been neglected. We propose a new approach to intrinsically motivated learning in which frustration is a key factor that allows exploration and exploitation to be balanced dynamically. Moreover, based on the results obtained from our experiment with older infants, we propose that a temporary decrease in learning from negative feedback can also be beneficial in fine-tuning a newly learned behavior. We suggest that this temporary indifference to the outcome of an action may be related to the sense of control, and that it results from the state of elation, that is, the experience of overcoming a very difficult task after prolonged frustration. Our preliminary simulation results serve as a proof of concept for our approach.


An Overview of RoboCup-2002 Fukuoka/Busan

AI Magazine

This article reports on the Sixth Robot World Cup Competition and Conference (RoboCup-2002) Fukuoka/Busan, which took place from 19 to 25 June in Fukuoka, Japan. It was the largest RoboCup since 1997 and held the first humanoid league competition in the world. Further, the first ROBOTREX (robot trade and exhibitions) was held with about 50 companies, universities, and institutes represented. A total of 117,000 spectators witnessed this marvelous event. To the best of our knowledge, this was the largest robotic event in history.


RoboCup-2001: The Fifth Robotic Soccer World Championships

AI Magazine

RoboCup-2001 was the Fifth International RoboCup Competition and Conference. It was held for the first time in the United States, following RoboCup-2000 in Melbourne, Australia; RoboCup-99 in Stockholm; RoboCup-98 in Paris; and RoboCup-97 in Osaka. This article discusses in detail each one of the events at RoboCup-2001, focusing on the competition leagues.


RoboCup-2000: The Fourth Robotic Soccer World Championships

AI Magazine

The Fourth Robotic Soccer World Championships (RoboCup-2000) was held from 27 August to 3 September 2000 at the Melbourne Exhibition Center in Melbourne, Australia. In total, 83 teams, consisting of about 500 people, participated in RoboCup-2000, and about 5000 spectators watched the events. RoboCup-2000 showed dramatic improvement over past years in each of the existing robotic soccer leagues (legged, small size, mid size, and simulation) and introduced RoboCup Jr. competitions and RoboCup Rescue and Humanoid demonstration events. The RoboCup Workshop, held in conjunction with the championships, provided a forum for the exchange of ideas and experiences among the different leagues. This article summarizes the advances seen at RoboCup-2000, including reports from the championship teams and overviews of all the RoboCup events.