S. Bhatnagar and S. Kumar. A simultaneous perturbation stochastic approximation based actorcritic algorithm for markov decision processes. IEEE Transactions on Automatic Control, 49(4): 592--598, 2004.

 Home/Search   Document Not in Database   Summary   Related Articles  

This paper is cited in the following contexts:
A Simulation-Based Algorithm for Ergodic Control of.. - Bhatnagar, Borkar, al. (2006)   Self-citation (Bhatnagar)   (Correct)

No context found.

S. Bhatnagar and S. Kumar. A simultaneous perturbation stochastic approximation based actorcritic algorithm for markov decision processes. IEEE Transactions on Automatic Control, 49(4): 592--598, 2004.

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC