Not Logged In

Publications with keyword "Q-learning"

1.	R. Sutton, D. Precup, S. Singh. "Between MDPs and Semi-MDPs: A Framework for Temporal Abstractions in Reinforcement Learning". Artificial Intelligence (AIJ), 112, pp 181-211, January 1999.

2.	R. Sutton. "Integrated Modeling and Control Based on Reinforcement Learning and Dynamic Programming". January 1991.