2020

V. P. Patil, M. Hofmarcher, M.-C. Dinu, M. Dorfer, P. M. Blies, J. Brandstetter, J. A. Arjona-Medina, and S. Hochreiter (2020) Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution. arXiv:2009.14108, 2020-09-29. (more) (download)

2019

J. A. Arjona-Medina, M. Gillhofer, M. Widrich, T. Unterthiner, J. Brandstetter, and S. Hochreiter (2019) RUDDER – Return Decomposition with Delayed Rewards. Advances in Neural Information Processing Systems 32 (NeurIPS 2019), 13566;  e-print also at arXiv:1806.07857v3, 2019-09-10. (more) (download)

©2023 IARAI - INSTITUTE OF ADVANCED RESEARCH IN ARTIFICIAL INTELLIGENCE

Imprint | Privacy Policy

Stay in the know with developments at IARAI

We can let you know if there’s any

updates from the Institute.
You can later also tailor your news feed to specific research areas or keywords (Privacy)
Loading

Log in with your credentials

Forgot your details?

Create Account