Not Logged In



Publications by Szepesvari, Csaba

In Journal (refereed)

1. A. Antos, C. Szepesvari, R. Munos. "Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path". Machine Learning Journal (MLJ), June 2007. PDFview
2. R. Munos, C. Szepesvari. "Finite Time Bounds for Sampling Based Fitted Value Iteration". Journal of Machine Learning Research (JMLR), March 2007. PDFview
3. P. Torma, C. Szepesvari. "Local Importance Sampling: A Novel Technique to Enhance Particle Filtering". Journal of Multimedia (JMM), 1, pp 32-43, January 2006. PDFview
4. L. Kocsis, C. Szepesvari. "Universal Parameter Optimisation in Games Based on SPSA". Machine Learning Journal (MLJ), 63, pp 249-286, January 2006. PDFview

In Conference (refereed)

5. N. Zolghadr, G. Bartok, R. Greiner, C. Szepesvari, A. Gyorgy. "Online Learning with Costly Features and Labels". Neural Information Processing Systems (NIPS), December 2013. PDFview
6. L. Li, B. Poczos, C. Szepesvari, R. Greiner. "Budgeted Distribution Learning of Belief Net Parameters". International Conference on Machine Learning (ICML), June 2010. PDFview
7. Y. Yu, Y. Li, C. Szepesvari, D. Schuurmans. "A general projection property for distribution families". Neural Information Processing Systems (NIPS), December 2009. view
8. A. Farhangfar, R. Greiner, C. Szepesvari. " Learning to Segment from a Few Well-Selected Training Images". International Conference on Machine Learning (ICML), June 2009. PDFview
9. B. Poczos, Y. Abbasi-Yadkori, C. Szepesvari, R. Greiner, N. Sturtevant. "Learning when to stop thinking and do something!". International Conference on Machine Learning (ICML), June 2009. PDFview
10. A. Farahmand, A. Shademan, M. Jagersand, C. Szepesvari. "Model-based and model-free reinforcement learning for visual servoing". IEEE International Conference on Robotics and Automation (ICRA), pp 2917-2924, May 2009. PDFview
11. Y. Li, C. Szepesvari, D. Schuurmans. "Learning exercise policies for American options". Artificial Intelligence and Statistics, April 2009. view
12. A. Isaza, C. Szepesvari, R. Greiner, V. Bulitko. "Speeding Up Planning in Markov Decision Processes via Automatically Constructed Abstractions". Conference on Uncertainty in Artificial Intelligence (UAI), pp 306--314, July 2008. PDFview
13. J. Audibert, R. Munos, C. Szepesvari. "Tuning bandit algorithms in stochastic environments". Algorithmic Learning Theory (ALT), October 2007. PDFview
14. G. Neu, C. Szepesvari. " Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods". Conference on Uncertainty in Artificial Intelligence (UAI), pp 295-302, July 2007. PDFview
15. A. Antos, C. Szepesvari, R. Munos. "Value-Iteration Based Fitted Policy Iteration: Learning with a Single Trajectory". Symposium on Approximate Dynamic Programming and Reinforcement Learning, pp 330-337, April 2007. view
16. P. Auer, R. Ortner, C. Szepesvari. "Improved Rates for the Stochastic Continuum-Armed Bandit Problem". Conference on Learning Theory (COLT), San Diego, CA, March 2007. PDFview
17. A. Farahmand, J. Audibert, C. Szepesvari. "Manifold-Adaptive Dimension Estimation". International Conference on Machine Learning (ICML), March 2007. PDFview
18. I. Biro, Z. Szamonek, C. Szepesvari. "Sequence prediction exploiting similarity information". International Joint Conference on Artificial Intelligence (IJCAI), Hyderabad, India, March 2007. PDFview
19. A. Gyorgy, L. Kocsis, I. Szabo, C. Szepesvari. "Continuous Time Associative Bandit Problems". International Joint Conference on Artificial Intelligence (IJCAI), Hyderabad, India, January 2007. PDFview
20. L. Kocsis, C. Szepesvari. "Bandit Based Monte-Carlo Planning". European Conference on Machine Learning (ECML), Berlin, Germany, pp 282-293, September 2006. PDFview
21. A. Antos, C. Szepesvari, R. Munos. "Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path". Conference on Learning Theory (COLT), January 2006. PDFview
22. C. Szepesvari, R. Munos. "Finite Time Bounds for Sampling Based Fitted Value Iteration". International Conference on Machine Learning (ICML), Bonn, Germany, pp 881-886, January 2005. PDFview
23. L. Gerencser, M. Rasonyi, Z. Vago, C. Szepesvari. "Log-optimal currency portfolios and control Lyapunov exponents". IEEE, pp 1764-1769, January 2005. PDFview
24. L. Kocsis, C. Szepesvari. "Reduced-Variance Payoff Estimation in Adversarial Bandit Problems". European Conference on Machine Learning (ECML), Porto, Portugal, January 2005. PDFview
25. P. Torma, C. Szepesvari. "Enhancing Particle Filters using Local Likelihood Sampling". European Conference on Computer Vision (ECCV), Prague, Czech Republic, January 2004. PDFview
26. C. Szepesvari, W. Smart. "Interpolation-based Q-learning". International Conference on Machine Learning (ICML), pp 791-798, January 2004. PDFview
27. A. Kocsor, K. Kovacs, C. Szepesvari. "Margin Maximizing Discriminant Analysis". European Conference on Machine Learning (ECML), Pisa, Italy, January 2004. PDFview
28. C. Szepesvari. "Shortest Path Discovery Problems: A Framework, Algorithms and Experimental Results". National Conference on Artificial Intelligence (AAAI), San Jose, California, USA, pp 550-555, January 2004. PDFview
29. P. Torma, C. Szepesvari. "Combining Local Search, Neural Networks and Particle Filters to Achieve Fast and Reliable Contour Tracking". IEEE, January 2003. PDFview
30. P. Torma, C. Szepesvari. "Sequential Importance Sampling for Visual Tracking Reconsidered". International Workshop on Artificial Intelligence and Statistics (AISTATS), pp 271-278, January 2003. PDFview
31. P. Torma, C. Szepesvari. "Towards Facial Pose Tracking". Hungarian Computer Graphics and Geometry Conference, pp 10-16, January 2002. PDFview

Other Categories

32. A. Farahmand, C. Szepesvari, J. Audibert. "Towards Manifold-Adaptive Learning". Workshop on Topology Learning, December 2007. view
33. P. Torma, C. Szepesvari. "On using Likelihood-adjusted Proposals in Particle Filtering: Local Importance Sampling". Symposium on Image and Signal Processing and Analysis, January 2005. PDFview
34. L. Kocsis, C. Szepesvari, M. Winands. "RSPSA: Enhanced Parameter Optimisation in Games". Advances in Computer Games (ACG), January 2005. PDFview
University of Alberta Logo AICML Logo