149 citations found.
Watkins, C. J. C. H. and Dayan, P.: "Technical Note: Q-Learning," Machine Learning, Vol. 8, pp. 55--68, 1992.

This paper is cited in the following contexts:

Q-learning and Sarsa Agents in Bargaining Game - Tokyo Institute Of

No context found.

Watkins, C. J. C. H. and Dayan, P.: "Technical Note: Q-Learning," Machine Learning, Vol. 8, pp. 55--68, 1992.


Journal of Machine Learning Research 7 (2006) 1789--1828 .. - Payoff Propagation Jelle

No context found.

C. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 8(3-4):279--292, 1992.


Online Adaptive Policies for Ensemble Classifiers - Dimitrakakis, Bengio (2004)

No context found.

C. J. Watkins, P. Dayan, Technical note Q-learning, Machine Learning 8 (1992) 279.


Resource Allocation in the Grid Using Reinforcement Learning - Aram Galstyan Karl (2004)   (4 citations)

No context found.

C. J. C. H. Watkins and P. Dayan. "Technical note: Q-learning". PhD thesis, 1992.


Exchanging Advice and Learning to Trust - Luís Nunes And (2003)

No context found.

Watkins, C.J.C.H., Dayan, P.D.: Technical note: Q-learning. Machine Learning 8 (1992) 279--292


Learning from Multiple Sources - Nunes, Oliveira (2004)

No context found.

C. J. C. H. Watkins and P. D. Dayan. Technical note: Q-learning. Machine Learning, 8(3):279--292, 1992.


Distributed Learning in Swarm Systems: A Case Study - Li

No context found.

C. J. C. H. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 8:279--292, 1992.


Best-Response Multiagent Learning in Non-Stationary.. - Weinberg, Rosenschein (2004)

No context found.

C. J. C. H. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 8(3):279--292, 1992.


Learning to Trade via Direct Reinforcement - Moody, Saffell (2001)   (6 citations)

No context found.

C. J. Watkins and P. Dayan, "Technical note: Q-Learning," Machine Learning, vol. 8, no. 3, pp. 279--292, 1992.


Some Approaches to Learning in Problem Solving - Millan

No context found.

C. J. C. H. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 8(3/4):279--292, May 1992.


A Cognitive Robot Architecture based on Tactile and.. - Kazunori Terada Takayuki

No context found.

C.J.C.H. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 8:39--46, 1992.


Multi-Agent Reinforcement Learning: a critical survey - Shoham, Powers, Grenager (2003)   (11 citations)

No context found.

C. J. C. H. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 8(3/4):279--292, May 1992.


Appendix A - Motor Schema Formulations

No context found.

C. Watkins and P. Dayan. Technical note: Q learning. Machine Learning, 8:279--292, 1992.


VQQL: Applying Vector Quantization to Reinforcement Learning - Fernandez, Borrajo (2000)

No context found.

C. J. C. H. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 8(3/4):279-292, May 1992.


VQQL: Applying Vector Quantization to Reinforcement Learning - Fernandez, Borrajo (1999)

No context found.

C. J. C. H. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 8(3/4):279-292, May 1992.


Reinforcement Learning: a brief overview. - Jeremy Wyatt School

No context found.

C.J.C.H. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 8(3/4):279-292, 1992.


Run the GAMUT: A Comprehensive Approach To.. - Nudelman.. (2004)   (2 citations)

No context found.

C. J. C. H. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 8(3/4), May 1992.


Cooperative Information Sharing to Improve Distributed.. - Partha Dutta Srinandan (2005)   (1 citation)

No context found.

C. J. C. H. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 8:279--292, 1992.


Self-Adjusting Reinforcement Learning - Der, Herrmann

No context found.

C. J. C. H. Watkins, P. Dayan (1992) Technical note: Q-learning. Machine Learning 8, 279-292.


Resource Allocation in the Grid Using Reinforcement Learning - Galstyan, Czajkowski.. (2004)   (4 citations)

No context found.

C. J. C. H. Watkins and P. Dayan. "Technical note: Q-learning". PhD thesis, 1992.


Distributed Learning in Swarm Systems: A Case Study - Li (2002)

No context found.

C. J. C. H. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 8:279--292, 1992.


Learning Connectedness in Binary Images - Erik Van Der

No context found.

C.J.C.H. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 8:279--292, 1992.


Mutual Learning By Autonomous Mobile - Robots Ian Kelly

No context found.

WATKINS, C.J. & DAYAN, P. "Technical note: Q-learning." Machine Learning. Vol. 8(3-4), pp. 279-292. 1992.

