Markov Decision Process
2022
K. Schweighofer, M.-c. Dinu, A. Radler, M. Hofmarcher, V. P. Patil, A. Bitto-Nemling, H. Eghbal-zadeh, and S. Hochreiter (2022) A Dataset Perspective on Offline Reinforcement Learning. Conference on Lifelong Learning Agents, Proceedings of Machine Learning Research, 199, 470-517, 2022-11-28. (more) (download)
F. Paischer, T. Adler, V. Patil, A. Bitto-Nemling, M. Holzleitner, S. Lehner, H. Eghbal-zadeh, and S. Hochreiter (2022) History Compression via Language Models in Reinforcement Learning. arXiv:2205.12258, 2022-05-24. (more) (download)