Publications by Precup, Doina
In Journal (refereed)
1. | R. Sutton, D. Precup, S. Singh. "Between MDPs and Semi-MDPs: A Framework for Temporal Abstractions in Reinforcement Learning". Artificial Intelligence (AIJ), 112, pp 181-211, January 1999. |
In Conference (refereed)
2. | D. Precup, R. Sutton, C. Paduraru, A. Koop, S. Singh. "Off-Policy Learning With Recognizers". Neural Information Processing Systems (NIPS), Vancouver, British Columbia, Canada, January 2005. |
3. | D. Precup, R. Sutton, S. Dasgupta. "Off-Policy Temporal-Difference Learning With Function Approximation". International Conference on Machine Learning (ICML), Williams College, pp 417-424, January 2001. |
4. | D. Precup, R. Sutton, S. Singh. "Eligibility Traces for Off-Policy Policy Evaluation". International Conference on Machine Learning (ICML), Stanford University, pp 759-766, January 2000. |
5. | R. Sutton, S. Singh, D. Precup, B. Ravindran. "Improved Switching Among Temporally Abstract Actions". Neural Information Processing Systems (NIPS), Denver, CO, USA, pp 1066-1072, January 1999. |
6. | R. Sutton, D. Precup, S. Singh. "Intra-Option Learning About Temporally Abstract Actions". International Conference on Machine Learning (ICML), Madison, Wisconsin USA, pp 556-564, January 1998. |
7. | D. Precup, R. Sutton. "Multi-Time Models for Temporally Abstract Planning". Neural Information Processing Systems (NIPS), Denver, CO, USA, pp 1050-1056, January 1998. |
8. | D. Precup, R. Sutton, S. Singh. "Theoretical Results on Reinforcement Learning With Temporally Abstract Options". European Conference on Machine Learning (ECML), Chemnitz, Germany, pp 382-393, January 1998. |
9. | D. Precup, R. Sutton. "Exponentiated Gradient Methods for Reinforcement Learning". International Conference on Machine Learning (ICML), Nashville, pp 272-277, July 1997. |
10. | D. Precup, R. Sutton. "Multi-Time Models for Reinforcement Learning". International Conference on Machine Learning (ICML), Nashville, July 1997. |
11. | D. Precup, R. Sutton, S. Singh. "Planning with Closed-Loop Macro Actions". National Conference on Artificial Intelligence (AAAI), Providence, Rhode Island, pp 73-76, May 1997. |
In Workshop
12. | A. McGovern, D. Precup, B. Ravindran, S. Singh, R. Sutton. "Hierarchical Optimal Control of MDPs". Yale Workshop on Adaptive and Learning Systems, pp 186-191, January 1998. |
Other Categories
13. | D. Precup, R. Sutton, S. Singh. "Notes". National Conference on Artificial Intelligence (AAAI), Providence, Rhode Island, January 1997. |
14. | D. Precup, R. Sutton. "Empirical Comparison of Gradient Descent and Exponentiated Gradient Descent in Supervised and Reinforcement Learning". Technical Report, January 1996. |