Publication View

Why experimentation can be better than perfect guidance (1997)

Abstract
The full version of this paper appeared at ICML-97. Many problems correspond to the classical control task of determining the appropriate control action to take, given some (sequence of) observations. One standard approach to learning these control rules, called behavior cloning, involves watching a perfect operator operate a plant, and then trying to emulate its behavior. In the experimental learning approach, by contrast, the learner first guesses an initial operation-toaction policy and tries it out. If this policy performs sub-optimally, the learner can modify it to produce a new policy, and recur. This paper discusses the relative effectiveness of these two approaches, especially in the presence of perceptual aliasing, showing in particular that the experimental learner can often learn more effectively than the cloning one. 1

Publication details
Download http://citeseerx.ist.psu.edu/viewdoc/summary?doi=?doi=10.1.1.27.8228
Source ftp://ftp.aifb.uni-karlsruhe.de/pub/mwi/scheffer2.ps.Z
Publisher Morgan Kaufmann
Contributors CiteSeerX
Repository CiteSeerX - Scientific Literature Digital Library and Search Engine (United States)
Type text
Language English
Relation 10.1.1.103.3872, 10.1.1.53.3725, 10.1.1.86.1260, 10.1.1.71.5843, 10.1.1.1.7100, 10.1.1.13.379, 10.1.1.11.1390, 10.1.1.12.5699, 10.1.1.121.7835, 10.1.1.79.8944