Abstract
Summary form only given. It is pointed out that none of the existing learning algorithms for neural networks in time-varying environments addresses the problem of learning 'to divide and conquer'. It is argued that algorithms based on pure gradient descent or on adaptive critic methods are not suitable for dynamic control problems with long time lags between actions and consequences, and that there is a need for algorithms that perform 'compositional learning'. The author discusses a system which solves at least one problem associated with compositional learning. The system learns to generate subgoals. This is done with the help of 'time-bridging' adaptive models that predict the effects of the system's subprograms. An experiment (obstacle avoidance in a two-dimensional environment) illustrates the approach.
Original language | English (US) |
---|---|
Title of host publication | Proceedings. IJCNN - International Joint Conference on Neural Networks |
Publisher | Publ by IEEEPiscataway, NJ, United States |
ISBN (Print) | 0780301641 |
State | Published - Jan 1 1992 |
Externally published | Yes |