Scholars@Duke publication: Neural Predictor for Neural Architecture Search

Neural Predictor for Neural Architecture Search

Publication , Conference

Wen, W; Liu, H; Chen, Y; Li, H; Bender, G; Kindermans, PJ

Published in: Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics

January 1, 2020

Published version (DOI)

Neural Architecture Search methods are effective but often use complex algorithms to come up with the best architecture. We propose an approach with three basic steps that is conceptually much simpler. First we train N random architectures to generate N (architecture, validation accuracy) pairs and use them to train a regression model that predicts accuracies for architectures. Next, we use this regression model to predict the validation accuracies of a large number of random architectures. Finally, we train the top-K predicted architectures and deploy the model with the best validation result. While this approach seems simple, it is more than 20 × as sample efficient as Regularized Evolution on the NASBench-101 benchmark. On ImageNet, it approaches the efficiency of more complex and restrictive approaches based on weight sharing such as ProxylessNAS while being fully (embarrassingly) parallelizable and friendly to hyper-parameter tuning.

Duke Scholars

Author Hai "Helen" Li Electrical and Computer Engineering

Author Yiran Chen Electrical and Computer Engineering

Published In

Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics

DOI

10.1007/978-3-030-58526-6_39

EISSN

1611-3349

ISSN

0302-9743

Publication Date

January 1, 2020

Volume

12374 LNCS

Start / End Page

660 / 676

Related Subject Headings

Artificial Intelligence & Image Processing
46 Information and computing sciences

Citation

APA

Chicago

ICMJE

MLA

NLM

Wen, W., Liu, H., Chen, Y., Li, H., Bender, G., & Kindermans, P. J. (2020). Neural Predictor for Neural Architecture Search. In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics (Vol. 12374 LNCS, pp. 660–676). https://doi.org/10.1007/978-3-030-58526-6_39

Wen, W., H. Liu, Y. Chen, H. Li, G. Bender, and P. J. Kindermans. “Neural Predictor for Neural Architecture Search.” In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 12374 LNCS:660–76, 2020. https://doi.org/10.1007/978-3-030-58526-6_39.

Wen W, Liu H, Chen Y, Li H, Bender G, Kindermans PJ. Neural Predictor for Neural Architecture Search. In: Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics. 2020. p. 660–76.

Wen, W., et al. “Neural Predictor for Neural Architecture Search.” Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, vol. 12374 LNCS, 2020, pp. 660–76. Scopus, doi:10.1007/978-3-030-58526-6_39.

Wen W, Liu H, Chen Y, Li H, Bender G, Kindermans PJ. Neural Predictor for Neural Architecture Search. Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics. 2020. p. 660–676.

Published In

Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics

DOI

10.1007/978-3-030-58526-6_39

EISSN

1611-3349

ISSN

0302-9743

Publication Date

January 1, 2020

Volume

12374 LNCS

Start / End Page

660 / 676

Related Subject Headings

Artificial Intelligence & Image Processing
46 Information and computing sciences