Markov Decision Process
2022
R. Siripurapu, V. P. Patil, K. Schweighofer, M.-C. Dinu, T. Schmied, L. E. F. Diez, M. Holzleitner, H. Eghbal-zadeh, M. K. Kopp, and S. Hochreiter (2022) InfODist: Online Distillation with Informative Rewards Improves Generalization in Curriculum Learning. Deep Reinforcement Learning Workshop at NeurIPS 2022, 2022-12-09.
K. Schweighofer, M.-C. Dinu, A. Radler, M. Hofmarcher, V. P. Patil, A. Bitto-Nemling, H. Eghbal-zadeh, and S. Hochreiter (2022) A Dataset Perspective on Offline Reinforcement Learning. Conference on Lifelong Learning Agents, Proceedings of Machine Learning Research, 199, 470-517, 2022-11-28.
F. Paischer, T. Adler, V. Patil, A. Bitto-Nemling, M. Holzleitner, S. Lehner, H. Eghbal-zadeh, and S. Hochreiter (2022) History Compression via Language Models in Reinforcement Learning. Proceedings of the 39th International Conference on Machine Learning, PMLR, 162, 17156-17185, 2022-06-28.