| "Automatic Programming of Behavior-based Robots using Reinforcement Leraning", Sridhar Mahadevan and Jonathan Connell, Proceedings, AAAI-91, July 1991. |
....with the subsumption architecture. See [Brooks 90b] for a recent overview. Some of these robots have had learning capabilities. But they have not followed the reinforcement learning techniques like Q learning of [Watkins 89] as has the work, with physical robots, of [Kaelbling 90] and [Mahadevan and Connell 91] 2.3 Sensors and Action Real sensors are very noisy. They do not give the same sort of simple mapping from actual world state to a clean input vector that we are used to expecting after using simulated robots. Simple sensors have characteristics which distort the mapping from world state to ....
"Automatic Programming of Behavior-based Robots using Reinforcement Leraning", Sridhar Mahadevan and Jonathan Connell, Proceedings, AAAI-91, July 1991.
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC