TY - GEN
T1 - Continually adding self-invented problems to the repertoire: First experiments with POWERPLAY
AU - Srivastava, Rupesh Kumar
AU - Steunebrink, Bas R.
AU - Stollenga, Marijn
AU - Schmidhuber, Jürgen
N1 - Generated from Scopus record by KAUST IRTS on 2022-09-14
PY - 2012/12/1
Y1 - 2012/12/1
N2 - Pure scientists do not only invent new methods to solve given problems. They also invent new problems. The recent POWERPLAY framework formalizes this type of curiosity and creativity in a new, general, yet practical way. To acquire problem solving prowess through playing, POWERPLAY-based artificial explorers by design continually come up with the fastest to find, initially novel, but eventually solvable problems. They also continually simplify or speed up solutions to previous problems. We report on results of first experiments with POWERPLAY. A self-delimiting recurrent neural network (SLIM RNN) is used as a general computational architecture to implement the system's solver. Its weights can encode arbitrary, self-delimiting, halting or non-halting programs affecting both environment (through effectors) and internal states encoding abstractions of event sequences. In open-ended fashion, our POWERPLAY-driven RNNs learn to become increasingly general problem solvers, continually adding new problem solving procedures to the growing repertoire, exhibiting interesting developmental stages. © 2012 IEEE.
AB - Pure scientists do not only invent new methods to solve given problems. They also invent new problems. The recent POWERPLAY framework formalizes this type of curiosity and creativity in a new, general, yet practical way. To acquire problem solving prowess through playing, POWERPLAY-based artificial explorers by design continually come up with the fastest to find, initially novel, but eventually solvable problems. They also continually simplify or speed up solutions to previous problems. We report on results of first experiments with POWERPLAY. A self-delimiting recurrent neural network (SLIM RNN) is used as a general computational architecture to implement the system's solver. Its weights can encode arbitrary, self-delimiting, halting or non-halting programs affecting both environment (through effectors) and internal states encoding abstractions of event sequences. In open-ended fashion, our POWERPLAY-driven RNNs learn to become increasingly general problem solvers, continually adding new problem solving procedures to the growing repertoire, exhibiting interesting developmental stages. © 2012 IEEE.
UR - http://ieeexplore.ieee.org/document/6400843/
UR - http://www.scopus.com/inward/record.url?scp=84872854684&partnerID=8YFLogxK
U2 - 10.1109/DevLrn.2012.6400843
DO - 10.1109/DevLrn.2012.6400843
M3 - Conference contribution
SN - 9781467349635
BT - 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics, ICDL 2012
ER -