Not Logged In

Publications by Antos, Andras

In Journal (refereed)

1. A. Antos, C. Szepesvari, R. Munos. "Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path". Machine Learning Journal (MLJ), June 2007. PDFview

In Conference (refereed)

2. A. Antos, C. Szepesvari, R. Munos. "Value-Iteration Based Fitted Policy Iteration: Learning with a Single Trajectory". Symposium on Approximate Dynamic Programming and Reinforcement Learning, pp 330-337, April 2007. view
3. A. Antos, C. Szepesvari, R. Munos. "Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path". Conference on Learning Theory (COLT), January 2006. PDFview
University of Alberta Logo AICML Logo