Publication View

Why Experimentation can be better than "Perfect Guidance" (1997)

Abstract
Many problems correspond to the classical control task of determining the appropriate control action to take, given some (sequence of) observations. One standard approach to learning these control rules, called behavior cloning , involves watching a perfect operator operate a plant, and then trying to emulate its behavior. In the experimental learning approach, by contrast, the learner first guesses an initial operation-to-action policy and tries it out. If this policy performs suboptimally, the learner can modify it to produce a new policy, and recur. This paper discusses the relative effectiveness of these two approaches, especially in the presence of perceptual aliasing, showing in particular that the experimental learner can often learn more effectively than the cloning one. 1 INTRODUCTION Many real-world tasks require controlling some type of "plant" (e.g., factory [DB95], grinding tool [Bro93], robot [Kae93]); in each case, using information received from the sensors to decide ...

Publication details
Download http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.55.8992
Source http://ki.cs.tu-berlin.de/~scheffer/papers/icml97.ps
Contributors CiteSeerX
Repository CiteSeerX - Scientific Literature Digital Library and Search Engine (United States)
Type text
Language English
Relation 10.1.1.12.4269, 10.1.1.121.7835, 10.1.1.79.8944, 10.1.1.11.1390