Publications by Singh, Satinder

In Journal (refereed)

1.	R. Sutton, D. Precup, S. Singh. "Between MDPs and Semi-MDPs: A Framework for Temporal Abstractions in Reinforcement Learning". Artificial Intelligence (AIJ), 112, pp 181-211, January 1999.

2.	S. Singh, R. Sutton. "Reinforcement Learning With Replacing Eligibility Traces". Machine Learning Journal (MLJ), 22, pp 123-158, January 1996.

In Conference (refereed)

3.	D. Precup, R. Sutton, C. Paduraru, A. Koop, S. Singh. "Off-Policy Learning With Recognizers". Neural Information Processing Systems (NIPS), Vancouver, British Columbia, Canada, January 2005.

4.	M. Littman, R. Sutton, S. Singh. "Predictive Representations of State". Neural Information Processing Systems (NIPS), Vancouver, British Columbia, Canada, January 2001.

5.	D. Precup, R. Sutton, S. Singh. "Eligibility Traces for Off-Policy Policy Evaluation". International Conference on Machine Learning (ICML), Stanford University, pp 759-766, January 2000.

6.	R. Sutton, S. Singh, D. Precup, B. Ravindran. "Improved Switching Among Temporally Abstract Actions". Neural Information Processing Systems (NIPS), Denver, CO, USA, pp 1066-1072, January 1999.

7.	R. Sutton, D. McAllester, S. Singh, Y. Mansour. "Policy Gradient Methods for Reinforcement Learning With Function Approximation". Neural Information Processing Systems (NIPS), Denver, CO, USA, pp 1057-1063, January 1999.

8.	R. Sutton, D. Precup, S. Singh. "Intra-Option Learning About Temporally Abstract Actions". International Conference on Machine Learning (ICML), Madison, Wisconsin, USA, pp 556-564, January 1998.

9.	D. Precup, R. Sutton, S. Singh. "Theoretical Results on Reinforcement Learning With Temporally Abstract Options". European Conference on Machine Learning (ECML), Chemnitz, Germany, pp 382-393, January 1998.

10.	D. Precup, R. Sutton, S. Singh. "Planning with Closed-Loop Macro Actions". National Conference on Artificial Intelligence (AAAI), Providence, Rhode Island, pp 73-76, May 1997.

In Workshop

11.	P. Stone, R. Sutton, S. Singh. "Reinforcement Learning for 3 vs. 2 Keepaway". RoboCup, January 2001.

12.	A. McGovern, D. Precup, B. Ravindran, S. Singh, R. Sutton. "Hierarchical Optimal Control of MDPs". Yale Workshop on Adaptive and Learning Systems, pp 186-191, January 1998.

13.	R. Sutton, S. Singh. "On Bias and Step Size in Temporal-Difference Learning". Yale Workshop on Adaptive and Learning Systems, pp 91-96, January 1994.

Other Categories

14.	M. Littman, R. Sutton, S. Singh. "Predictive Representations of State". Predictive Representations of World Knowledge, January 2002.

15.	D. Precup, R. Sutton, S. Singh. "Notes". National Conference on Artificial Intelligence (AAAI), Providence, Rhode Island, January 1997.
