| A. Linden and F. Weber. Implementing inner drive by competence reflection. In Proceedings of the 2nd International Conference on Simulation of Adaptive Behavior, Hawaii, 1992. |
....is how to combine these models. 3 Modeling with a single agent An agent is a general modeling process producing a y value for each x value, using the information in L. When the tuples (x; y) are chosen by the agent itself (and not, for example, by a teacher) the process is called exploration [29, 11]. The following quantities can be defined for an agent. Model accuracy. An agent is graded by the accuracy of its predictions. This may be estimated statistically by comparing a set of its predictions with events in the real world. The accuracy A ff (M) of an agent ff tested on a set M of (x; ....
A. Linden and F. Weber. Implementing inner drive by competence reflection. In Proceedings of the 2nd International Conference on Simulation of Adaptive Behavior, Hawaii, 1992.
....in some prespecified number of steps. In general, though, we will have to resort to some online process to decide what to try next. Some successful heuristic exploration strategies include trying to visit unvisited states [Schaal and Atkeson, 1994] trying to visit places where we perform poorly [Linden and Weber, 1993], taking actions that improved our performance in similar situations [Schmidhuber and Storck, 1993] or maintaining a heuristic confidence map [Thrun and M oller, 1992] Some researchers, in cases where the exploration is considered a secondary problem, provide the learner with a uniformly ....
A. Linden and F. Weber. (1993) Implementing Inner Drive by Competence Reflection, in H. Roitblat et al., eds., Proceedings of the 2nd International Conference on Simulation of Adaptive Behavior, MIT Press, Cambridge, MA.
....output y, and incorporate the new examples ( x; y) into its training set. The primary question of active learning is how to choose which x to try next. There are many heuristics for choosing x based on intuition, including choosing places where we don t have data, where we perform poorly [Linden and Weber, 1993], where we have low confidence [Thrun and Moller, 1992] where we expect it to change our model [Cohn et al., 1990] and where we previously found data that resulted in learning [Schmidhuber and Storck, 1993] In this paper we consider how one may select x optimally from a statistical ....
A. Linden and F. Weber. (1993) Implementing inner drive by competence reflection. In H. Roitblat et al., eds., Proc. 2nd Int. Conf. on Simulation of Adaptive Behavior, MIT Press, Cambridge.
....query, experiment, or action, depending on the research field and problem domain. The question we will be concerned with is how to choose which x to try next. There are many heuristics for choosing x, including choosing places where we don t have data (Whitehead, 1991) where we perform poorly (Linden Weber, 1993), where we have low confidence (Thrun Moller, 1992) where we expect it to change our model (Cohn, Atlas, Ladner, 1990, 1994) and where we previously found data that resulted in learning (Schmidhuber Storck, 1993) In this paper we will consider how one may select x in a statistically ....
Linden, A., & Weber, F. (1993). Implementing inner drive by competence reflection. In Roitblat, H. (Ed.), Proceedings of the 2nd International Conference on Simulation of Adaptive Behavior. MIT Press, Cambridge, MA.
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC