Averaging rewards as a first approach towards Interpolated Experience Replay
Author: Pilar von Pilchau, Wenzel
Abstract
Reinforcement learning, and deep reinforcement learning in particular, are research areas attracting growing attention. Interpolation is a mathematical method for estimating data points in a region where only neighboring samples are known, which makes it a promising extension for experience replay, a major component of many deep reinforcement learning methods. Interpolated experiences stored in the experience replay could speed up learning in the early phase and reduce the overall amount of exploration needed. A first approach, averaging rewards in a setting with an unstable transition function and very low exploration, is implemented and shows promising results that encourage further investigation.
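The abstract does not spell out how the averaging is carried out, so the following is only a minimal sketch of one possible reading: real transitions are stored as usual, and for every observed state-action pair a synthetic transition is generated whose reward is the mean of all rewards seen for that pair. The class name `AveragedRewardReplay`, its methods, and the choice to emit one synthetic transition per observed successor state are assumptions made for illustration and are not taken from the paper.

```python
import random
from collections import defaultdict


class AveragedRewardReplay:
    """Hypothetical replay buffer that adds averaged-reward transitions.

    Assumes discrete, hashable states and actions and an unbounded buffer.
    """

    def __init__(self, seed=None):
        self.real = []                          # observed transitions
        self.reward_sum = defaultdict(float)    # (s, a) -> sum of rewards
        self.count = defaultdict(int)           # (s, a) -> number of samples
        self.successors = defaultdict(dict)     # (s, a) -> ordered set of (s', done)
        self.rng = random.Random(seed)

    def add(self, state, action, reward, next_state, done):
        """Store a real transition and update the running reward statistics."""
        self.real.append((state, action, reward, next_state, done))
        key = (state, action)
        self.reward_sum[key] += reward
        self.count[key] += 1
        self.successors[key][(next_state, done)] = None

    def synthetic(self):
        """Build one averaged-reward transition per observed (s, a, s')."""
        out = []
        for (state, action), n in self.count.items():
            avg = self.reward_sum[(state, action)] / n
            for next_state, done in self.successors[(state, action)]:
                out.append((state, action, avg, next_state, done))
        return out

    def sample(self, batch_size):
        """Sample uniformly without replacement from real plus synthetic transitions."""
        pool = self.real + self.synthetic()
        return self.rng.sample(pool, min(batch_size, len(pool)))


# Two noisy outcomes for the same state-action pair (0, 1).
buf = AveragedRewardReplay(seed=0)
buf.add(0, 1, 0.0, 1, False)
buf.add(0, 1, 1.0, 2, True)
print(buf.synthetic())
```

In this sketch both synthetic transitions carry the mean reward 0.5, so a learner sampling from the combined pool sees a smoothed reward signal even though each real sample was observed only once. Whether synthetic and real experiences should be mixed uniformly, as done here, is a design choice left open by the abstract.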
- Citation
- BibTeX
Pilar von Pilchau, W. (2019). Averaging rewards as a first approach towards Interpolated Experience Replay. In: Draude, C., Lange, M. & Sick, B. (Eds.), INFORMATIK 2019: 50 Jahre Gesellschaft für Informatik – Informatik für Gesellschaft (Workshop-Beiträge). Bonn: Gesellschaft für Informatik e.V. (pp. 493-506). DOI: 10.18420/inf2019_ws53
@inproceedings{mci/Pilar von Pilchau2019,
  author = {Pilar von Pilchau, Wenzel},
  title = {Averaging rewards as a first approach towards Interpolated Experience Replay},
  booktitle = {INFORMATIK 2019: 50 Jahre Gesellschaft für Informatik – Informatik für Gesellschaft (Workshop-Beiträge)},
  year = {2019},
  editor = {Draude, Claude AND Lange, Martin AND Sick, Bernhard},
  pages = {493-506},
  doi = {10.18420/inf2019_ws53},
  publisher = {Gesellschaft für Informatik e.V.},
  address = {Bonn}
}
Files | Size | Format | View |
---|---|---|---|
paper11_03.pdf | 275.5Kb | PDF | View/ |
If no full text (PDF) is linked here, it may be available only in another digital library for various reasons (e.g. licenses or copyright). In that case, try accessing it via the linked DOI: 10.18420/inf2019_ws53
More Info
ISBN: 978-3-88579-689-3
ISSN: 1617-5468
Date: 2019
Language: English (en)
Content Type: Text/Conference Paper