Multi-Excitation Projective Simulation with a Many-Body Physics Inspired Inductive Bias
Abstract: With the impressive progress of deep learning, applications relying on machine learning are increasingly being integrated into daily life. However, most deep learning models have an opaque, oracle-like nature making it difficult to interpret and understand their decisions. This problem led to the development of the field known as eXplainable Artificial Intelligence (XAI). One method in this field known as Projective Simulation (PS) models a chain-of-thought as a random walk of a particle on a graph with vertices that have concepts attached to them. While this description has various benefits, including the possibility of quantization, it cannot be naturally used to model thoughts that combine several concepts simultaneously. To overcome this limitation, we introduce Multi-Excitation Projective Simulation (mePS), a generalization that considers a chain-of-thought to be a random walk of several particles on a hypergraph. A definition for a dynamic hypergraph is put forward to describe the agent's training history along with applications to AI and hypergraph visualization. An inductive bias inspired by the remarkably successful few-body interaction models used in quantum many-body physics is formalized for our classical mePS framework and employed to tackle the exponential complexity associated with naive implementations of hypergraphs. We prove that our inductive bias reduces the complexity from exponential to polynomial, with the exponent representing the cutoff on how many particles can interact. We numerically apply our method to two toy environments and a more complex scenario modelling the diagnosis of a broken computer. These environments demonstrate the resource savings provided by an appropriate choice of inductive bias, as well as showcasing aspects of interpretability. A quantum model for mePS is also briefly outlined and some future directions for it are discussed.
- P. P. Shinde and S. Shah, A review of machine learning and deep learning applications, in 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA) (2018) pp. 1–6.
- J. Liu and Y. Jin, A comprehensive survey of robust deep learning in computer vision, Journal of Automation and Intelligence (In Press, Corrected Proof), https://doi.org/10.1016/j.jai.2023.10.002 (2023).
- C.-J. H. Yao Li, Minhao Cheng and T. C. M. Lee, A review of adversarial attack and defense for classification methods, The American Statistician 76, 329 (2022), https://doi.org/10.1080/00031305.2021.2006781 .
- A. Saranya and R. Subhashini, A systematic review of explainable artificial intelligence models and applications: Recent developments and future trends, Decision Analytics Journal 7, 100230 (2023).
- H. J. Briegel and G. D. las Cuevas, Proective simulation for artificial intelligence, Sci. Rep. 2, 400 (2012).
- R. S. Sutton and A. G. Barto, Reinforcement learning: An introduction (MIT press, 2018).
- A. A. Melnikov, A. Makmal, and H. J. Briegel, Benchmarking projective simulation in navigation problems, IEEE Access 6, 64639 (2018a).
- A. Bretto, Hypergraph Theory - An Introduction (Springer Cham, 2013).
- Q. Dai and Y. Gao, Hypergraph Computation (Springer, 2023).
- A. Goyal and Y. Bengio, Inductive biases for deep learning of higher-level cognition, Proceedings of the Royal Society A 478, 20210068 (2022).
- M. Lutter, Inductive Biases in Machine Learning for Robotics and Control, Vol. 156 (Springer Nature, 2023).
- L. S. Rendsburg, Inductive Bias in Machine Learning, Ph.D. thesis, Eberhard Karls Universität Tübingen (2022).
- A. Ajit, K. Acharya, and A. Samanta, A review of convolutional neural networks, in 2020 International Conference on Emerging Trends in Information Technology and Engineering (ic-ETITE) (2020) pp. 1–5.
- S. Cong and Y. Zhou, A review of convolutional neural network architectures and their optimizations, Artificial Intelligence Review 56, 1905 (2023).
- H. Sun and I. Guyon, Modularity in deep learning: A survey, in Science and Information Conference (Springer, 2023) pp. 561–595.
- M. Amer and T. Maul, A review of modularization techniques in artificial neural networks, Artificial Intelligence Review 52, 527 (2019).
- M. Fabrizio, A Course in Quantum Many-body Theory: From Conventional Fermi Liquids to Strongly Correlated Systems (Springer Nature, 2022).
- P. Coleman, Introduction to many-body physics (Cambridge University Press, 2015).
- M. E. Peskin and D. V. Schroeder, An Introduction to quantum field theory (Addison-Wesley, Reading, USA, 1995).
- M. D. Schwartz, Quantum field theory and the standard model (Cambridge university press, 2014).
- B. R. Martin and G. Shaw, Nuclear and particle physics: an introduction (John Wiley & Sons, 2019).
- L. Zhou, A survey on contextual multi-armed bandits, arXiv preprint arXiv:1508.03326 (2015).
- D. Bouneffouf, I. Rish, and C. Aggarwal, Survey on applications of multi-armed and contextual bandits, in 2020 IEEE Congress on Evolutionary Computation (CEC) (IEEE, 2020) pp. 1–8.
- C. Vehlow, F. Beck, and D. Weiskopf, Visualizing dynamic hierarchies in graph sequences, IEEE Transactions on Visualization and Computer Graphics 22, 2343 (2016).
- R. M. Wald, General Relativity 2nd Ed. (University of Chicago Press, 2010).
- F. Flamini, N. Spagnolo, and F. Sciarrino, Photonic quantum information processing: a review, Reports on Progress in Physics 82, 016001 (2018).
- P. A. LeMaitre and M. Krumm, https://github.com/MariusKrumm/ManyBodyMEPS.
- I. M. Georgescu, S. Ashhab, and F. Nori, Quantum simulation, Rev. Mod. Phys. 86, 153 (2014).
- Z. Niu, G. Zhong, and H. Yu, A review on the attention mechanism of deep learning, Neurocomputing 452, 48 (2021).
- T. Müller and H. Briegel, A stochastic process model for free agency under indeterminism, Dialectica 72, 219 (2018).
- A. Barbu and S.-C. Zhu, Monte Carlo Methods (Springer Singapore, 2020).
- A. Ankan and A. Panda, Hands-on Markov models with python: Implement probabilistic models for learning complex data sequences using the Python ecosystem (Packt Publishing Ltd, 2018).
- S. K. Lam, A. Pitrou, and S. Seibert, Numba: A llvm-based python jit compiler, in Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in HPC (2015) pp. 1–6.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.