
What does it mean to understand a neural network?

Published 15 Jul 2019 in cs.LG, q-bio.NC, and stat.ML (arXiv:1907.06374v1)

Abstract: We can define a neural network that can learn to recognize objects in less than 100 lines of code. However, after training, it is characterized by millions of weights that contain the knowledge about many object types across visual scenes. Such networks are thus dramatically easier to understand in terms of the code that makes them than the resulting properties, such as tuning or connections. In analogy, we conjecture that rules for development and learning in brains may be far easier to understand than their resulting properties. The analogy suggests that neuroscience would benefit from a focus on learning and development.

Citations (41)

Summary

  • The paper argues that while neural networks are constructed with simple rules, their post-training complexity hinders complete interpretability.
  • It compares network compressibility to games like tic-tac-toe and Go, illustrating the difficulty of compressing complex systems into concise descriptions.
  • The paper proposes shifting focus from static, post-training states to learning and developmental processes to advance insights in AI and neuroscience.

Understanding Neural Networks: A Focus on Learning and Development

The paper "What does it mean to understand a neural network?" by Timothy P. Lillicrap and Konrad P. Kording presents a comprehensive examination of the challenges associated with comprehending neural networks, both artificial and biological. The central thesis posits that, while we can construct and describe neural networks through high-level programming codes, achieving a deep understanding of their complex workings post-training remains elusive. The authors suggest that the pathway to true understanding lies in focusing on the development and learning processes of these networks, rather than their intricate and opaque end-state properties.

The Complexity Challenge

A key argument is the contrast between the simplicity of how neural networks are specified and the complexity of their trained states. The rules and code for building powerful networks, such as those used for image recognition or game playing, can be condensed into a relatively small set of instructions; the resulting system, characterized by millions of parameters, is vastly more complex, as the sketch below illustrates. This gap substantially impairs our ability to give a compact, interpretable description of what a trained network does.
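To make the contrast concrete, here is a minimal sketch of the kind of system the authors have in mind (our illustration using PyTorch; the paper does not prescribe a framework or architecture). A handful of lines define a small image classifier, yet the definition instantiates millions of trainable weights:

```python
import torch.nn as nn

# A small convolutional classifier for 32x32 RGB inputs (CIFAR-10-sized).
# The architecture is illustrative, not taken from the paper.
model = nn.Sequential(
    nn.Conv2d(3, 64, kernel_size=3, padding=1), nn.ReLU(),
    nn.Conv2d(64, 128, kernel_size=3, padding=1), nn.ReLU(),
    nn.MaxPool2d(2),                      # 32x32 -> 16x16
    nn.Flatten(),                         # 128 * 16 * 16 = 32768 features
    nn.Linear(128 * 16 * 16, 256), nn.ReLU(),
    nn.Linear(256, 10),
)

# A dozen lines of code, but millions of adjustable weights:
n_params = sum(p.numel() for p in model.parameters())
print(f"{n_params:,} trainable parameters")  # ~8.5 million
```

Every line of the definition is human-readable; the roughly 8.5 million numbers the training procedure produces are not, which is precisely the asymmetry the paper highlights.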

The authors frame this in terms of compressibility, comparing systems of differing complexity. A simple game like tic-tac-toe can be fully described by a small rule set, whereas strong play at a complex game like Go admits no comparably concise description (a contrast sketched in code below). The analogy extends to neural networks: systems achieving human-like performance across diverse tasks are inherently less compressible, and current efforts to distill trained networks into smaller, interpretable models fall short of capturing how these networks actually compute.
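The compressibility point can be made tangible in code (again our illustration, not the paper's): the complete outcome rules of tic-tac-toe fit in a few lines, while nothing remotely this short specifies strong Go play.

```python
# The full "theory" of tic-tac-toe outcomes compresses into a few lines.
# A board is a 9-element tuple of 'X', 'O', or None, indexed row by row.
LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8),   # rows
         (0, 3, 6), (1, 4, 7), (2, 5, 8),   # columns
         (0, 4, 8), (2, 4, 6)]              # diagonals

def winner(board):
    """Return 'X' or 'O' if that player has three in a row, else None."""
    for a, b, c in LINES:
        if board[a] is not None and board[a] == board[b] == board[c]:
            return board[a]
    return None

board = ('X', 'X', 'X',
         'O', 'O', None,
         None, None, None)
print(winner(board))  # -> 'X'
```

Strong Go play, by contrast, has no known description anywhere near this short; systems like AlphaGo encode it in millions of network weights, which is the paper's point about incompressibility.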

Implications for Neuroscience

Offering a provocative view, the paper draws an analogy between artificial neural networks and the brain, highlighting the formidable challenge of understanding neural systems. The authors conjecture that, like artificial networks, the brain acquires its function through complex, distributed patterns of synaptic weight change, which makes the resulting learned representations and connections similarly resistant to interpretation. They suggest that neuroscience could benefit from emphasizing the underlying rules and developmental processes that shape brain function, rather than attempting to decode the static, learned patterns.

This perspective challenges the prevalent hope in neuroscience that an intermediate, compact descriptive model, akin to those of statistical physics, is achievable. The authors argue instead that for systems as complex and varied as the brain, such simplification is unlikely to succeed.

Future Directions

The paper concludes by outlining potential future research directions. One is to improve understanding through genetic and molecular analyses of brain development, which could elucidate architectures, learning rules, and anatomical organization. Another is to reconcile principles from artificial intelligence with those of neuroscience, exploring synergies that might yield new insights.

Overall, the authors make a compelling case for redirecting the discourse toward the formative processes underlying neural computation, whether in artificial systems or biological brains. This shift has significant implications for both AI research and neuroscience, potentially reshaping how each field approaches the understanding of complex neural systems. By advocating a focus on the mechanisms of development and learning, the paper places itself at a pivotal intersection of the two fields and opens concrete avenues toward a fuller comprehension of neural function.
