2000 character limit reached
Emergent Equivariance in Deep Ensembles
Published 5 Mar 2024 in cs.LG | (2403.03103v2)
Abstract: We show that deep ensembles become equivariant for all inputs and at all training times by simply using data augmentation. Crucially, equivariance holds off-manifold and for any architecture in the infinite width limit. The equivariance is emergent in the sense that predictions of individual ensemble members are not equivariant but their collective prediction is. Neural tangent kernel theory is used to derive this result and we verify our theoretical insights using detailed numerical experiments.
- On Exact Computation with an Infinitely Wide Neural Net. In Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc., 2019.
- Equivariant few-shot learning from pretrained models. arXiv preprint arXiv:2305.09900, 2023a.
- Equi-tuning: Group equivariant fine-tuning of pretrained models. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 37, pp. 6788–6796, 2023b.
- Roto-Translation Covariant Convolutional Networks for Medical Image Analysis. In Frangi, A. F., Schnabel, J. A., Davatzikos, C., Alberola-López, C., and Fichtinger, G. (eds.), Medical Image Computing and Computer Assisted Intervention – MICCAI 2018, Lecture Notes in Computer Science, pp. 440–448, Cham, 2018. Springer International Publishing. ISBN 978-3-030-00928-1. doi: 10.1007/978-3-030-00928-1˙50.
- A program to build E(N)-equivariant steerable CNNs. In International Conference on Learning Representations, 2022. URL https://openreview.net/forum?id=WE4qe9xlnQw.
- A Kernel Theory of Modern Data Augmentation. In Proceedings of the 36th International Conference on Machine Learning, pp. 1528–1537. PMLR, May 2019.
- Provably Strict Generalisation Benefit for Equivariant Models. In Proceedings of the 38th International Conference on Machine Learning, pp. 2959–2969. PMLR, July 2021.
- Nonperturbative renormalization for the neural network-QFT correspondence. Machine Learning: Science and Technology, 3(1):015027, March 2022. ISSN 2632-2153. doi: 10.1088/2632-2153/ac4f69.
- Optimization Dynamics of Equivariant and Augmented Neural Networks. (arXiv:2303.13458), March 2023. doi: 10.48550/arXiv.2303.13458.
- A Neural Tangent Kernel Perspective of GANs. In Proceedings of the 39th International Conference on Machine Learning, pp. 6660–6704. PMLR, June 2022. doi: 10.48550/arXiv.2106.05566.
- Ensemble deep learning: A review. Engineering Applications of Artificial Intelligence, 115:105151, October 2022. ISSN 09521976. doi: 10.1016/j.engappai.2022.105151.
- Equivariance versus Augmentation for Spherical Images. In Proceedings of the 39th International Conference on Machine Learning, pp. 7404–7421. PMLR, June 2022. doi: 10.48550/arXiv.2202.03990.
- Geometric deep learning and equivariant neural networks. Artificial Intelligence Review, June 2023. ISSN 1573-7462. doi: 10.1007/s10462-023-10502-7.
- Neural Tangent Kernel: A Survey. (arXiv:2208.13614), August 2022. doi: 10.48550/arXiv.2208.13614.
- Neural Networks and Quantum Field Theory. Machine Learning: Science and Technology, 2(3):035002, September 2021. ISSN 2632-2153. doi: 10.1088/2632-2153/abeca3.
- Few-shot Backdoor Attacks via Neural Tangent Kernels. In The Eleventh International Conference on Learning Representations, September 2022. doi: 10.48550/arXiv.2210.05929.
- Dynamics of Deep Neural Networks and Neural Tangent Hierarchy. In Proceedings of the 37th International Conference on Machine Learning, pp. 4542–4551. PMLR, November 2020. doi: 10.48550/arXiv.1909.08156.
- Neural Tangent Kernel: Convergence and Generalization in Neural Networks. In Advances in Neural Information Processing Systems, volume 31. Curran Associates, Inc., 2018.
- Equivariance with learned canonicalization functions. In International Conference on Machine Learning, pp. 15546–15566. PMLR, 2023.
- 100,000 histological images of human colorectal cancer and healthy tissue. April 2018. doi: 10.5281/zenodo.1214456.
- Simple and scalable predictive uncertainty estimation using deep ensembles. Advances in neural information processing systems, 30, 2017.
- Deep Neural Networks as Gaussian Processes. In International Conference on Learning Representations, February 2018.
- Wide Neural Networks of Any Depth Evolve as Linear Models Under Gradient Descent. In Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc., 2019. doi: 10.1088/1742-5468/abc62b.
- Enhanced Convolutional Neural Tangent Kernels. arXiv:1911.00809 [cs, stat], November 2019.
- Learning with invariances in random features and kernel models. In Proceedings of Thirty Fourth Conference on Learning Theory, pp. 3351–3418. PMLR, July 2021.
- Equivariant adaptation of large pre-trained models. arXiv preprint arXiv:2310.01647, 2023.
- Learning with Group Invariant Features: A Kernel Perspective. In Advances in Neural Information Processing Systems, volume 28. Curran Associates, Inc., 2015.
- Neal, R. M. Bayesian Learning for Neural Networks. Springer Science & Business Media, 1996. ISBN 978-1-4612-0745-0.
- Neural Tangents: Fast and Easy Infinite Neural Networks in Python. In Eighth International Conference on Learning Representations, April 2020.
- Frame averaging for invariant and equivariant network design. arXiv preprint arXiv:2110.03336, 2021.
- Local Group Invariant Representations via Orbit Embeddings. In Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, pp. 1225–1235. PMLR, April 2017.
- Fast, accurate antibody structure prediction from deep learning on massive set of natural antibodies. Nature communications, 14(1):2389, 2023.
- Hierarchical deep learning ensemble to automate the classification of breast cancer pathology reports by icd-o topography. arXiv preprint arXiv:2008.12571, 2020.
- Improved generalization bounds of group invariant / equivariant deep networks via quotient feature spaces. In Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, pp. 771–780. PMLR, December 2021. doi: 10.48550/arXiv.1910.06552.
- Tensor field networks: Rotation- and translation-equivariant neural networks for 3D point clouds. arXiv:1802.08219 [cs], May 2018.
- When and why PINNs fail to train: A neural tangent kernel perspective. Journal of Computational Physics, 449:110768, January 2022. ISSN 0021-9991. doi: 10.1016/j.jcp.2021.110768.
- General E(2)-Equivariant Steerable CNNs. In Conference on Neural Information Processing Systems (NeurIPS), 2019. URL https://arxiv.org/abs/1911.08251.
- Fashion-mnist: a novel image dataset for benchmarking machine learning algorithms, 2017.
- Yaida, S. Non-Gaussian processes and neural networks at finite widths. In Proceedings of The First Mathematical and Scientific Machine Learning Conference, pp. 165–192. PMLR, August 2020.
- Yang, G. Scaling Limits of Wide Neural Networks with Weight Sharing: Gaussian Process Behavior, Gradient Independence, and Neural Tangent Kernel Derivation. arXiv:1902.04760 [cond-mat, physics:math-ph, stat], April 2020.
- Feature Learning in Infinite-Width Neural Networks. (arXiv:2011.14522), July 2022. doi: 10.48550/arXiv.2011.14522.
- On the Neural Tangent Kernel Analysis of Randomly Pruned Neural Networks. In Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, pp. 1513–1553. PMLR, April 2023.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.