Learning to generate subgoals for action sequences

Research output: Chapter in Book/Report/Conference proceedingConference contribution

7 Scopus citations

Abstract

Summary form only given. It is pointed out that none of the existing learning algorithms for neural networks in time-varying environments addresses the problem of learning 'to divide and conquer'. It is argued that algorithms based on pure gradient descent or on adaptive critic methods are not suitable for dynamic control problems with long time lags between actions and consequences, and that there is a need for algorithms that perform 'compositional learning'. The author discusses a system which solves at least one problem associated with compositional learning. The system learns to generate subgoals. This is done with the help of 'time-bridging' adaptive models that predict the effects of the system's subprograms. An experiment (obstacle avoidance in a two-dimensional environment) illustrates the approach.
Original languageEnglish (US)
Title of host publicationProceedings. IJCNN - International Joint Conference on Neural Networks
PublisherPubl by IEEEPiscataway, NJ, United States
ISBN (Print)0780301641
StatePublished - Jan 1 1992
Externally publishedYes

Fingerprint

Dive into the research topics of 'Learning to generate subgoals for action sequences'. Together they form a unique fingerprint.

Cite this