Recording First-person Experiences to Build a New Type of Foundation Model

Published 31 Jul 2024 in cs.AI, cs.HC, and cs.LG | (2408.02680v1)

Abstract: Foundation models have had a big impact in recent years, and billions of dollars are being invested in them in the current AI boom. The more popular ones, such as ChatGPT, are trained on large amounts of Internet data. However, it is becoming apparent that this data is likely to be exhausted soon, and technology companies are looking for new sources of data to train the next generation of foundation models. Reinforcement learning, RAG, prompt engineering and cognitive modelling are often used to fine-tune and augment the behaviour of foundation models. These techniques have been used to build chatbots that replicate people, such as Caryn Marjorie. Such chatbots are not based on people's actual emotional and physiological responses to their environment, so they are, at best, a surface-level approximation of the characters they are imitating. To address these issues, we have developed a recording rig that captures what the wearer is seeing and hearing as well as their skin conductance (GSR), facial expression and brain state (14-channel EEG). AI algorithms are used to process this data into a rich picture of the environment and internal states of the subject. Foundation models trained on this data could replicate human behaviour much more accurately than the personality models that have been developed so far. This type of model has many potential applications, including recommendation, personal assistance, GAN systems, dating and recruitment. This paper gives some background to this work and describes the recording rig and preliminary tests of its functionality. It then suggests how a new type of foundation model could be created from the data captured by the rig and outlines some applications. Data gathering and model training are expensive, so we are currently working on the launch of a start-up that could raise funds for the next stage of the project.

