374 citations found.
L.P. Kaelbling, M.L. Littman, and Andrew Moore. Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4:237--285, 1996.

This paper is cited in the following contexts:

Evolutionary Reinforcement Learning in Relational Domains - Tijmen Joppe Muller   Self-citation (Kaelbling)

No context found.

Kaelbling, L., Littman, M., & Moore, A. (1996). Reinforcement learning: A survey. JAIR, 4, 237--285.


Algorithms for Planning under Uncertainty in Prediction .. - O'Kane, Tovar, Cheng.. (2005)

No context found.

L.P. Kaelbling, M.L. Littman, and Andrew Moore. Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4:237--285, 1996.


Learning in One-Shot Strategic Form Games - Alon Altman Avivit

No context found.

Kaelbling, L.P., Littman, M.L., Moore, A.W.: Reinforcement learning: A survey. Journal of AI Research 4 (1996) 237--285


Approximate Solutions to Factored Markov Decision Processes via - Greedy Search In

No context found.

Kaelbling, L. P.; Littman, M. L.; and Moore, A. W. 1996. Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4:237--285.


Neuro-fuzzy Learning of Strategies for Optimal Control Problems - Kaivan Kamali Lijun

No context found.

L. Kaelbling, M. Littman, and A. Moore. Reinforcement learning: A survey. J. of AI Research, 4(1):237--285, 1996.


Shuffling a Stacked Deck: The Case for Partially - Randomized Ranking Of

No context found.

L. P. Kaelbling, M. L. Littman, and A. P. Moore. Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4:237--285, 1996.


Approximation Algorithms for Orienteering and.. - Avrim Blum Shuchi (2003)

No context found.

L. P. Kaelbling, M. L. Littman, and A. W. Moore. Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4, 1996.


Learning and Using Models of Kicking Motions for Legged Robots - Sonia Chernova And

No context found.

L. P. Kaelbling, M. Littman, and A. Moore, "Reinforcement learning: A survey," Journal of Artificial Intelligence Research, vol. 4, pp. 237--285, 1996.


An Approach to the Design of Reinforcement Functions.. - Bonarini, Bonacina..

No context found.

L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement learning: a survey," Journal of Artificial Intelligence Research, vol. 4, pp. 237-285, 1996.


Learning Fuzzy Classifier Systems: Architecture and Exploration.. - Matteucci (2000)

No context found.

L. P. Kaelbling, M. L. Littman, and A. W. Moore. Reinforcement learning: a survey. Journal of Artificial Intelligence Research, 4:237-285, 1996.


Exploiting Multi-Agent Interactions - For Identifying The (2005)

No context found.

L. Kaelbling, M. Littman, and A. Moore. Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4:237--285, 1996.


Shuffling a Stacked Deck: The Case for Partially.. - Pandey, Roy.. (2005)

No context found.

L. P. Kaelbling, M. L. Littman, and A. P. Moore. Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4:237--285, 1996.


Spoken Dialogue Management Using Hierarchical Reinforcement.. - Cuayáhuitl (2005)

No context found.

Kaelbling, L. P., Littman, M., Moore, A. (1996). Reinforcement Learning: A Survey. In Journal of Artificial Intelligence Research, 4, pp. 237-285.


Part 1: POMDPs - Pomdps Markov Decision

No context found.

Kaelbling, L., M. Littman, & A. Moore (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4, 237--285.


A Probabilistic Framework for Model-Based Imitation Learning - Aaron Shon David

No context found.

Kaelbling, L. P., Littman, L. M., and Moore, A. W. (1996). Reinforcement learning: A survey. J. Artificial Intelligence Res., 4:237--285.


A Model-Based Goal-Directed Bayesian Framework for.. - Shon, Grimes.. (2004)

No context found.

Kaelbling, L. P., Littman, L. M., and Moore, A. W. (1996). Reinforcement learning: A survey. J. Artificial Intelligence Res., 4:237--285.


Model based Bayesian Exploration - Richard Dearden Department (1999)   (14 citations)

No context found.

Kaelbling, L. P., Littman, M. L. & Moore, A. W. (1996), `Reinforcement learning: A survey', Journal of Artificial Intelligence Research 4, 237--285.


Adaptive Online Time Allocation to Search Algorithms - Gagliolo, Zhumatiy.. (2004)

No context found.

L.P. Kaelbling, M.L. Littman, and A.W. Moore. Reinforcement learning: a survey. Journal of AI research, 4:237--285, 1996.


Variable Resolution Discretization in the Joint Space - Monson, Wingate, Seppi..

No context found.

L. P. Kaelbling, M. L. Littman, and A. P. Moore. Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4:237--285, 1996.


Reinforcement Learning in the Joint Space: Value Iteration in.. - Monson (2003)

No context found.

L. P. Kaelbling, M. L. Littman, and A. P. Moore, Reinforcement learning: A survey, Journal of Artificial Intelligence Research 4 (1996), 237--285.


Dynamic Algorithm Portfolios - And

No context found.

Kaelbling, L., Littman, M., Moore, A.: Reinforcement learning: a survey. Journal of AI research 4 (1996) 237--285


Learning Unknown Additive Normal Form Games - Mykel Kochenderfer Stanford (2001)

No context found.

Kaelbling, L. P.; Littman, M. L.; and Moore, A. P. 1996. Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4:237--285.


Heuristically Accelerated Q-Learning: a new approach to.. - Bianchi, Ribeiro, Costa

No context found.

L. P. Kaelbling, M. L. Littman, and A. W. Moore. Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4:237--285, 1996.


Reinforcement Learning for Parameter Control of Text Detection .. - Taylor, Wolf (2004)

No context found.

L. Kaelbling, M. Littman, and A. Moore. Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4:237--285, 1996.

CiteSeer.IST - Copyright Penn State and NEC