mediaTUM - Media and Publication Server

User: Guest

INF 1 - Institut für Theoretische Informatik, Mathematik und Operations Research

Back
Back to start of result list
Permanent link for displayed object

Authors:: Moll, Maximilian; Schilling, Matthias; Pickl, Stefan
Document type:: Konferenzbeitrag / Conference Paper
Title:: Statistical analysis of reinforcement learning training
Collection editors:: Voigt, Guido; Fliedner, Malte; Haase, Knut; Brüggemann, Wolfgang; Hoberg, Kai; Meissner, Joern
Title of conference publication:: Operations Research Proceedings 2023
Subtitle of conference publication:: Selected Papers of the Annual International Conference of the German Operations Research Society (GOR), Germany, August 29 - September 1, 2023
Series title:: Lecture Notes in Operations Research (LNOR)
Place of publication:: Cham
Publisher:: Springer Nature
Year:: 2025
Pages from - to:: 447-452
Language:: Englisch
Abstract:: One of the most urgent challenges in Reinforcement Learning research is the lack of reproducibility. Therefore, to further the understanding of the training behavior of Reinforcement Learning agents, we analyze the training of agents playing the established baseline environment Taxi. In particular, we contrast results based on different forms of exploration. In addition, we can demonstrate that in this context penalization without termination is to be the preferred punishment for incorrect actions. «
One of the most urgent challenges in Reinforcement Learning research is the lack of reproducibility. Therefore, to further the understanding of the training behavior of Reinforcement Learning agents, we analyze the training of agents playing the established baseline environment Taxi. In particular, we contrast results based on different forms of exploration. In addition, we can demonstrate that in this context penalization without termination is to be the preferred punishment for incorrect actio... »
ISBN:: 978-3-031-58405-3
ISSN:: 2731-0418
DOI:: 10.1007/978-3-031-58405-3_57
URL:: https://doi.org/10.1007/978-3-031-58405-3_57
Department:: Fakultät für Informatik
Institute:: INF 1 - Institut für Theoretische Informatik, Mathematik und Operations Research
Chair:: Brattka, Vasco
Open Access yes or no?:: Nein / No
BibTeX

Occurrences:

Home / Alle Inhalte Publikationen Fakultäten (univ.)Fakultät für Informatik INF 1 - Institut für Theoretische Informatik, Mathematik und Operations Research