Not Logged In

Effective sketching methods for value function approximation

Full Text: 81.pdf PDF

High-dimensional representations, such as radial basis function networks or tile coding, are common choices for policy evaluation in reinforcement learning. Learning with such high-dimensional representations, however, can be expensive, particularly for matrix methods, such as least-squares temporal difference learning or quasi-Newton methods that approximate matrix step-sizes. In this work, we explore the utility of sketching for these two classes of algorithms. We highlight issues with sketching the high-dimensional features directly, which can incur significant bias. As a remedy, we demonstrate how to use sketching more sparingly, with only a left-sided sketch, that can still enable significant computational gains and the use of these matrix-based learning algorithms that are less sensitive to parameters. We empirically investigate these algorithms, in four domains with a variety of representations. Our aim is to provide insights into effective use of sketching in practice.

Citation

Y. Pan, E. Azer, M. White. "Effective sketching methods for value function approximation". Conference on Uncertainty in Artificial Intelligence (UAI), (ed: Gal Elidan, Kristian Kersting, Alexander T. Ihler), pp n/a, August 2017.

Keywords:  
Category: In Conference
Web Links: UAI

BibTeX

@incollection{Pan+al:UAI17,
  author = {Yangchen Pan and Erfan Sadeqi Azer and Martha White},
  title = {Effective sketching methods for value function approximation},
  Editor = {Gal Elidan, Kristian Kersting, Alexander T. Ihler},
  Pages = {n/a},
  booktitle = {Conference on Uncertainty in Artificial Intelligence (UAI)},
  year = 2017,
}

Last Updated: February 25, 2020
Submitted by Sabina P

University of Alberta Logo AICML Logo