Evolino: Hybrid neuroevolution / optimal linear search for sequence learning

Jürgen Schmidhuber, Daan Wierstra, Faustino Gomez

Research output: Chapter in Book/Report/Conference proceedingConference contribution

96 Scopus citations

Abstract

Current Neural Network learning algorithms are limited in their ability to model non-linear dynamical systems. Most supervised gradient-based recurrent neural networks (RNNs) suffer from a vanishing error signal that prevents learning from inputs far in the past. Those that do not, still have problems when there are numerous local minima. We introduce a general framework for sequence learning, EVOlution of recurrent systems with LINear outputs (Evolino). Evolino uses evolution to discover good RNN hidden node weights, while using methods such as linear regression or quadratic programming to compute optimal linear mappings from hidden state to output. Using the Long Short-Term Memory RNN Architecture, the method is tested in three very different problem domains: 1) context-sensitive languages, 2) multiple superimposed sine waves, and 3) the Mackey-Glass system. Evolino performs exceptionally well across all tasks, where other methods show notable deficiencies in some.
Original languageEnglish (US)
Title of host publicationIJCAI International Joint Conference on Artificial Intelligence
Pages853-858
Number of pages6
StatePublished - Dec 1 2005
Externally publishedYes

Fingerprint

Dive into the research topics of 'Evolino: Hybrid neuroevolution / optimal linear search for sequence learning'. Together they form a unique fingerprint.

Cite this