Not Logged In



Publications by Mahmood, Ashique Rupam

In Journal (refereed)

1. H. Yu, A. Mahmood, R. Sutton. "On Generalized Bellman Equations and Temporal-Difference Learning". Journal of Machine Learning Research (JMLR), 19(48), pp 1-49, January 2018. PDFview
2. R. Sutton, A. Mahmood, M. White. "An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning". Journal of Machine Learning Research (JMLR), (ed: Shie Mannor), 17(73), pp 1-29, January 2016. view
3. H. Seije, A. Mahmood, P. Pilarski, M. Machado, R. Sutton. "True Online Temporal-Difference Learning". Journal of Machine Learning Research (JMLR), 17(145), pp n/a, January 2016. PDFview

In Conference (refereed)

4. H. Seijen, A. Mahmood, P. Pilarski, R. Sutton. "An empirical evaluation of True Online TD(lambda)". European Workshop on Reinforcement Learning (EWRL), July 2015. view
University of Alberta Logo AICML Logo