Efficacy of Neural Prediction-Based Zero-Shot NAS

Published 31 Aug 2023 in cs.LG and cs.AI | (2308.16775v3)

Abstract: In prediction-based Neural Architecture Search (NAS), performance indicators derived from graph convolutional networks have shown remarkable success. These indicators, achieved by representing feed-forward structures as component graphs through one-hot encoding, face a limitation: their inability to evaluate architecture performance across varying search spaces. In contrast, handcrafted performance indicators (zero-shot NAS), which use the same architecture with random initialization, can generalize across multiple search spaces. Addressing this limitation, we propose a novel approach for zero-shot NAS using deep learning. Our method employs Fourier sum of sines encoding for convolutional kernels, enabling the construction of a computational feed-forward graph with a structure similar to the architecture under evaluation. These encodings are learnable and offer a comprehensive view of the architecture's topological information. An accompanying multi-layer perceptron (MLP) then ranks these architectures based on their encodings. Experimental results show that our approach surpasses previous methods using graph convolutional networks in terms of correlation on the NAS-Bench-201 dataset and exhibits a higher convergence rate. Moreover, our extracted feature representation trained on each NAS benchmark is transferable to other NAS benchmarks, showing promising generalizability across multiple search spaces. The code is available at: https://github.com/minh1409/DFT-NPZS-NAS
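The two quantitative ideas in the abstract, a sum-of-sines encoding and a rank correlation between predicted and measured performance, can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the function names `sum_of_sines` and `kendall_tau` are hypothetical, the generic form sum_k a_k * sin(w_k * x + p_k) is not claimed to be the paper's exact encoding, and Kendall's tau is only one common choice of correlation (the abstract does not say which one is used). The authors' actual implementation is in the linked repository.

```python
import math
from itertools import combinations


def sum_of_sines(x, amplitudes, frequencies, phases):
    """Evaluate sum_k a_k * sin(w_k * x + p_k) at a scalar x.

    Hypothetical illustration of a sum-of-sines parameterisation; in a
    learnable setting the three parameter lists would be trained.
    """
    return sum(
        a * math.sin(w * x + p)
        for a, w, p in zip(amplitudes, frequencies, phases)
    )


def kendall_tau(predicted, actual):
    """Kendall tau-a between two equally long score lists.

    Counts concordant minus discordant pairs over all pairs; tied pairs
    count as neither. Assumed here as the ranking-quality metric.
    """
    if len(predicted) != len(actual):
        raise ValueError("score lists must have the same length")
    concordant = discordant = 0
    for i, j in combinations(range(len(predicted)), 2):
        sign = (predicted[i] - predicted[j]) * (actual[i] - actual[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    n_pairs = len(predicted) * (len(predicted) - 1) // 2
    return (concordant - discordant) / n_pairs if n_pairs else 0.0
```

A predictor whose scores order architectures exactly as their measured accuracies do would give `kendall_tau(...) == 1.0`; a fully reversed ordering gives `-1.0`.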
