Reinforcement Learning of Bimanual Robot Skills by Adrià Colomé & Carme Torras
Author:Adrià Colomé & Carme Torras
Language: eng
Format: epub
ISBN: 9783030263263
Publisher: Springer International Publishing
Now we briefly present two of the most popular PS algorithms found in literature: REPS and PI2, which have been used in the experiments throughout this monograph.
5.1.1.1 Relative Entropy Policy Search (REPS)
Formally, REPS [3, 13] finds the policy that maximizes the expected reward for a given task. The REPS algorithm uses Kullback-Leibler (KL) divergence [8], which is a non-symmetric indicator of the difference between two probability distributions p, q over a random variable x:
(5.5)
Given the previous policy , the new policy is obtained by adding a KL-Divergence bound between the newly obtained policy and the previous one to the optimization of the expected reward. The bound on the KL-Divergence limits the variation on the new policy and prevents the PS algorithm from being too greedy. Too greedy algorithms can be a wrong approach in some robotics applications, where a drastic change in the policy may result in an unpredictable, dangerous behavior of the robot. Such new policy is then computed as the solution of:
Download
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Algorithms of the Intelligent Web by Haralambos Marmanis;Dmitry Babenko(8309)
Test-Driven Development with Java by Alan Mellor(6798)
Data Augmentation with Python by Duc Haba(6715)
Principles of Data Fabric by Sonia Mezzetta(6462)
Learn Blender Simulations the Right Way by Stephen Pearson(6367)
Microservices with Spring Boot 3 and Spring Cloud by Magnus Larsson(6234)
Hadoop in Practice by Alex Holmes(5965)
Jquery UI in Action : Master the concepts Of Jquery UI: A Step By Step Approach by ANMOL GOYAL(5814)
RPA Solution Architect's Handbook by Sachin Sahgal(5636)
Big Data Analysis with Python by Ivan Marin(5399)
The Infinite Retina by Robert Scoble Irena Cronin(5323)
Life 3.0: Being Human in the Age of Artificial Intelligence by Tegmark Max(5159)
Pretrain Vision and Large Language Models in Python by Emily Webber(4363)
Infrastructure as Code for Beginners by Russ McKendrick(4132)
Functional Programming in JavaScript by Mantyla Dan(4044)
The Age of Surveillance Capitalism by Shoshana Zuboff(3964)
WordPress Plugin Development Cookbook by Yannick Lefebvre(3844)
Embracing Microservices Design by Ovais Mehboob Ahmed Khan Nabil Siddiqui and Timothy Oleson(3648)
Applied Machine Learning for Healthcare and Life Sciences Using AWS by Ujjwal Ratan(3621)
