First Congress of Greek Mathematicians by Ioannis Emmanouil Anargyros Fellouris Apostolos Giannopoulos Sofia Lambropoulou

Author: Ioannis Emmanouil, Anargyros Fellouris, Apostolos Giannopoulos, Sofia Lambropoulou
Language: eng
Format: epub
Publisher: De Gruyter
Published: 2020-03-23T15:11:16.808000+00:00


Reinforcement learning: a comparison of UCB versus alternative adaptive policies

Wesley Cowan Department of Computer Science, Rutgers University, 110 Frelinghuysen Rd., Piscataway, NJ, 08854, USA

Michael N. Katehakis Department of Management Science and Information Systems, 100 Rockafeller Road, Piscataway, NJ, 08854, USA

Daniel Pirutinsky Department of Management Science and Information Systems, 100 Rockafeller Road, Piscataway, NJ, 08854, USA

Abstract

In this paper, we consider the basic version of Reinforcement Learning (RL), which involves computing optimal data-driven (adaptive) policies for Markovian decision processes with unknown transition probabilities. We provide a brief survey of the state of the art in the area, and we compare the performance of the classic UCB policy of Burnetas and Katehakis [10] with that of a new policy developed herein, which we call MDP-Deterministic Minimum Empirical Divergence (MDP-DMED), and of a method based on posterior sampling (MDP-PS).


