Mindstorms in natural language-based societies of mind

Mingchen Zhuge, Haozhe Liu, Francesco Faccio, Dylan R. Ashley, Robert Csordás, Anand Gopalakrishnan, Abdullah Hamdi, Hasan Abed Al Kader Hammoud, Vincent Herrmann, Kazuki Irie, Louis Kirsch, Bing Li, Guohao Li, Shuming Liu, Jinjie Mai, Piotr Piȩkos, Aditya A. Ramesh, Imanol Schlag, Weimin Shi, Aleksandar StanićWenyi Wang, Yuhui Wang, Mengmeng Xu, Deng Ping Fan*, Bernard Ghanem*, Jurgen Schmidhuber*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Inspired by Minsky's Society of Mind, Schmidhuber's Learning to Think, and other more recent works, this paper proposes and advocates for the concept of natural language-based societies of mind (NLSOMs). We imagine these societies as consisting of a collection of multimodal neural networks, including large language models, which engage in a 'mindstorm' to solve problems using a shared natural language interface. Here, we work to identify and discuss key questions about the social structure, governance, and economic principles for NLSOMs, emphasizing their impact on the future of AI. Our demonstrations with NLSOMs - which feature up to 129 agents - show their effectiveness in various tasks, including visual question answering, image captioning, and prompt generation for text-to-image synthesis.

Original languageEnglish (US)
Pages (from-to)29-81
Number of pages53
JournalComputational Visual Media
Volume11
Issue number1
DOIs
StatePublished - 2025

Keywords

  • large language models (LLMs)
  • learning to think
  • mindstorm
  • multimodal learning
  • society of mind (SOM)

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
  • Computer Graphics and Computer-Aided Design
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Mindstorms in natural language-based societies of mind'. Together they form a unique fingerprint.

Cite this