INFORMATIK 2017 Lötzsch, Winfried; Vitay, Julien; Hamker, Fred
Recent advances in deep reinforcement learning methods have attracted a lot of attention, because of their ability to use raw signals such as video streams as inputs, instead of pre-processed state variables. However, the most popular methods (value-based methods, e.g. deep Q-networks) focus on discrete action spaces ...