Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
1 | Andreas Vlachos 0001 |
An investigation of imitation learning algorithms for structured prediction. |
EWRL |
2012 |
DBLP BibTeX RDF |
|
1 | Sergiu Goschin, Ari Weinstein, Michael L. Littman, Erick Chastain |
Planning in Reward-Rich Domains via PAC Bandits. |
EWRL |
2012 |
DBLP BibTeX RDF |
|
1 | Michal Valko, Mohammad Ghavamzadeh, Alessandro Lazaric |
Semi-Supervised Apprenticeship Learning. |
EWRL |
2012 |
DBLP BibTeX RDF |
|
1 | Scott Sanner, Marcus Hutter (eds.) |
Recent Advances in Reinforcement Learning - 9th European Workshop, EWRL 2011, Athens, Greece, September 9-11, 2011, Revised Selected Papers |
EWRL |
2012 |
DBLP DOI BibTeX RDF |
|
1 | Ari Weinstein, Michael L. Littman, Sergiu Goschin |
Rollout-based Game-tree Search Outprunes Traditional Alpha-beta. |
EWRL |
2012 |
DBLP BibTeX RDF |
|
1 | Mayank Daswani, Peter Sunehag, Marcus Hutter |
Feature Reinforcement Learning using Looping Suffix Trees. |
EWRL |
2012 |
DBLP BibTeX RDF |
|
1 | Timothy A. Mann, Yoonsuck Choe |
Directed Exploration in Reinforcement Learning with Transferred Knowledge. |
EWRL |
2012 |
DBLP BibTeX RDF |
|
1 | Yevgeny Seldin, Csaba Szepesvári, Peter Auer, Yasin Abbasi-Yadkori |
Evaluation and Analysis of the Performance of the EXP3 Algorithm in Stochastic Environments. |
EWRL |
2012 |
DBLP BibTeX RDF |
|
1 | Cosmin Paduraru, Doina Precup, Joelle Pineau, Gheorghe Comanici |
An Empirical Analysis of Off-policy Learning in Discrete MDPs. |
EWRL |
2012 |
DBLP BibTeX RDF |
|
1 | David Silver |
Gradient Temporal Difference Networks. |
EWRL |
2012 |
DBLP BibTeX RDF |
|
1 | Jan Hendrik Metzen |
Online Skill Discovery using Graph-based Clustering. |
EWRL |
2012 |
DBLP BibTeX RDF |
|
1 | Marc Peter Deisenroth, Csaba Szepesvári, Jan Peters 0001 (eds.) |
Proceedings of the Tenth European Workshop on Reinforcement Learning, EWRL 2012, Edinburgh, Scotland, UK, June, 2012 |
EWRL |
2012 |
DBLP BibTeX RDF |
|
1 | Nicolas Heess, David Silver, Yee Whye Teh |
Actor-Critic Reinforcement Learning with Energy-Based Policies. |
EWRL |
2012 |
DBLP BibTeX RDF |
|
1 | Marc Peter Deisenroth, Csaba Szepesvári, Jan Peters 0001 |
Preface. |
EWRL |
2012 |
DBLP BibTeX RDF |
|
1 | Michael Castronovo, Francis Maes, Raphael Fonteneau, Damien Ernst |
Learning Exploration/Exploitation Strategies for Single Trajectory Reinforcement Learning. |
EWRL |
2012 |
DBLP BibTeX RDF |
|
1 | Seiya Kuroda, Kazuteru Miyazaki, Hiroaki Kobayashi |
Introduction of Fixed Mode States into Online Profit Sharing and Its Application to Waist Trajectory Generation of Biped Robot. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Christos Dimitrakakis |
Robust Bayesian Reinforcement Learning through Tight Lower Bounds. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Mauricio Araya-López, Olivier Buffet, Vincent Thomas, François Charpillet |
Active Learning of MDP Models. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Sylvie C. W. Ong, Yuri Grinberg, Joelle Pineau |
Goal-Directed Online Learning of Predictive Models. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Peter Stone |
Invited Talk: PRISM - Practical RL: Representation, Interaction, Synthesis, and Mortality. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Peter Auer |
Invited Talk: UCRL and Autonomous Exploration. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Edouard Klein, Matthieu Geist, Olivier Pietquin |
Batch, Off-Policy and Model-Free Apprenticeship Learning. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Cosmin Paduraru, Doina Precup, Joelle Pineau |
A Framework for Computing Bounds for the Return of a Policy. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Kyriakos C. Chatzidimitriou, Ioannis Partalas, Pericles A. Mitkas, Ioannis P. Vlahavas |
Transferring Evolved Reservoir Features in Reinforcement Learning Tasks. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Matthieu Geist, Bruno Scherrer |
ℓ1-Penalized Projected Bellman Residual. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Bruno Scherrer, Matthieu Geist |
Recursive Least-Squares Learning with Eligibility Traces. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Ioannis Lambrou, Vassilis Vassiliades, Chris Christodoulou |
An Extension of a Hierarchical Reinforcement Learning Algorithm for Multiagent Settings. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Tohgoroh Matsui, Takashi Goto 0004, Kiyoshi Izumi, Yu Chen 0007 |
Compound Reinforcement Learning: Theory and an Application to Finance. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Phuong Minh Nguyen, Peter Sunehag, Marcus Hutter |
Feature Reinforcement Learning in Practice. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Georgios Boutsioukis, Ioannis Partalas, Ioannis P. Vlahavas |
Transfer Learning in Multi-Agent Reinforcement Learning Domains. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Csaba Szepesvári |
Invited Talk: Towards Robust Reinforcement Learning Algorithms. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Boris Lesner, Bruno Zanuttini |
Handling Ambiguous Effects in Action Learning. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Matthew W. Hoffman, Alessandro Lazaric, Mohammad Ghavamzadeh, Rémi Munos |
Regularized Least Squares Temporal Difference Learning with Nested ℓ2 and ℓ1 Penalization. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Matthijs Snel, Shimon Whiteson |
Multi-Task Reinforcement Learning: Shaping and Feature Selection. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Charles Elkan |
Reinforcement Learning with a Bilinear Q Function. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Yuxi Li, Dale Schuurmans |
MapReduce for Parallel Reinforcement Learning. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Kristian Kersting |
Invited Talk: Increasing Representational Power and Scaling Inference in Reinforcement Learning. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Kazuteru Miyazaki, Masaaki Ida |
Proposal and Evaluation of the Active Course Classification Support System with Exploitation-Oriented Learning. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Pablo Samuel Castro, Doina Precup |
Automatic Construction of Temporally Extended Actions for MDPs Using Bisimulation Metrics. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Matthew W. Robards, Peter Sunehag |
Gradient Based Algorithms with Loss Functions and Kernels for Improved On-Policy Control. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Francis Maes, Louis Wehenkel, Damien Ernst |
Optimized Look-ahead Tree Search Policies. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Kfir Y. Levy, Nahum Shimkin |
Unified Inter and Intra Options Learning Using Policy Gradient Methods. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Francis Maes, Louis Wehenkel, Damien Ernst |
Automatic Discovery of Ranking Formulas for Playing with Multi-armed Bandits. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Munu Sairamesh, Balaraman Ravindran |
Options with Exceptions. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Anestis Fachantidis, Ioannis Partalas, Matthew E. Taylor, Ioannis P. Vlahavas |
Transfer Learning via Multiple Inter-task Mappings. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Christos Dimitrakakis, Constantin A. Rothkopf |
Bayesian Multitask Inverse Reinforcement Learning. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Nikolaos Tziortziotis, Konstantinos Blekas |
Value Function Approximation through Sparse Bayesian Modeling. |
EWRL |
2011 |
DBLP DOI BibTeX RDF |
|
1 | Matthieu Geist, Olivier Pietquin, Gabriel Fricout |
Bayesian Reward Filtering. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
Reinforcement Learning, Function Approximation, Bayesian Filtering |
1 | Kirill Dyagilev, Shie Mannor, Nahum Shimkin |
Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | Yuxi Li, Dale Schuurmans |
Policy Iteration for Learning an Exercise Policy for American Options. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | Carl Edward Rasmussen, Marc Peter Deisenroth |
Probabilistic Inference for Fast Learning in Control. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | Verena Heidrich-Meisner, Christian Igel |
Variable Metric Reinforcement Learning Methods Applied to the Noisy Mountain Car Problem. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | Thomas Degris, Olivier Sigaud, Pierre-Henri Wuillemin |
Exploiting Additive Structure in Factored MDPs for Reinforcement Learning. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | Noel Welsh, Jeremy L. Wyatt |
United We Stand: Population Based Methods for Solving Unknown POMDPs. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | Robby Goetschalckx, Scott Sanner, Kurt Driessens |
Reinforcement Learning with the Use of Costly Features. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | Boris Defourny, Damien Ernst, Louis Wehenkel |
Lazy Planning under Uncertainty by Optimizing Decisions on an Ensemble of Incomplete Disturbance Trees. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
Stochastic dynamic programming, Ensemble methods |
1 | Thomas Gabel, Martin A. Riedmiller |
Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvári, Shie Mannor |
Regularized Fitted Q-Iteration: Application to Planning. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | Sertan Girgin, Manuel Loth, Rémi Munos, Philippe Preux, Daniil Ryabko (eds.) |
Recent Advances in Reinforcement Learning, 8th European Workshop, EWRL 2008, Villeneuve d'Ascq, France, June 30 - July 3, 2008, Revised and Selected Papers |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | Sertan Girgin, Philippe Preux |
Basis Expansion in Natural Actor Critic Methods. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | José David Martín-Guerrero, Emilio Soria-Olivas, Marcelino Martínez-Sober, Antonio J. Serrano-López, José Rafael Magdalena Benedicto, Juan Gómez-Sanchís |
Use of Reinforcement Learning in Two Real Applications. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | Christos Dimitrakakis, Michail G. Lagoudakis |
Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | Jan Peters 0001, Jens Kober, Duy Nguyen-Tuong |
Policy Learning - A Unified Perspective with Applications in Robotics. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | Huizhen Yu, Dimitri P. Bertsekas |
New Error Bounds for Approximations from Projected Linear Equations. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | Daniele Loiacono, Pier Luca Lanzi |
Tile Coding Based on Hyperplane Tiles. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | Jean-François Hren, Rémi Munos |
Optimistic Planning of Deterministic Systems. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | Francis Maes, Ludovic Denoyer, Patrick Gallinari |
Applications of Reinforcement Learning to Structured Prediction. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | Jia Yuan Yu, Shie Mannor, Nahum Shimkin |
Markov Decision Processes with Arbitrary Reward Processes. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|
1 | Sarah Filippi, Olivier Cappé, Fabrice Clérot, Eric Moulines |
A Near Optimal Policy for Channel Allocation in Cognitive Radio. |
EWRL |
2008 |
DBLP DOI BibTeX RDF |
|