Mahmood, Ashique Rupam
Name: | Mahmood, Ashique Rupam |
Email: | |
Organization: | |
Webpage: | none |
Interest(s): | |
Publications: |
1. | H. Yu, A. Mahmood, R. Sutton. "On Generalized Bellman Equations and Temporal-Difference Learning". Journal of Machine Learning Research (JMLR), 19(48), pp 1-49, January 2018. |
2. | R. Sutton, A. Mahmood, M. White. "An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning". Journal of Machine Learning Research (JMLR), (ed: Shie Mannor), 17(73), pp 1-29, January 2016. |
3. | H. Seije, A. Mahmood, P. Pilarski, M. Machado, R. Sutton. "True Online Temporal-Difference Learning". Journal of Machine Learning Research (JMLR), 17(145), pp n/a, January 2016. |
4. | H. Seijen, A. Mahmood, P. Pilarski, R. Sutton. "An empirical evaluation of True Online TD(lambda)". European Workshop on Reinforcement Learning (EWRL), July 2015. |
Author List