DistDNAS: Search Efficient Feature Interactions within 2 Hours

Publication, Conference

Zhang, T; Wen, W; Fedorov, I; Liu, X; Zhang, B; Han, F; Chen, WY; Han, Y; Yan, F; Li, H; Chen, Y

Published in: Proceedings 2024 IEEE International Conference on Big Data Bigdata 2024

January 1, 2024

Search efficiency and serving efficiency are two major axes in building feature interactions and expediting model development in recommender systems. Searching for the optimal feature interaction design on large-scale benchmarks is costly because the workflow runs sequentially over a large volume of data. In addition, fusing interactions of various sources, orders, and mathematical operations introduces potential conflicts and redundancy in recommender models, leading to sub-optimal trade-offs between performance and serving cost. This paper presents DistDNAS, a solution for fast and efficient feature interaction design. DistDNAS builds a supernet that incorporates interaction modules of varying orders and types as its search space. To optimize search efficiency, DistDNAS distributes the search across data from different dates and aggregates the choices of optimal interaction modules, achieving a speed-up of over 25× and reducing the search cost from 2 days to 2 hours. To optimize serving efficiency, DistDNAS introduces a differentiable cost-aware loss that penalizes the selection of redundant interaction modules, making the discovered feature interactions more efficient to serve. We extensively evaluate the best models crafted by DistDNAS on the 1TB Criteo Terabyte dataset. Experimental evaluations demonstrate a 0.001 AUC improvement and 60% FLOPs savings over current state-of-the-art CTR models.
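The two mechanisms named in the abstract can be illustrated with a minimal sketch. This is not the paper's formulation, which the abstract does not spell out. It assumes a common differentiable-search pattern: each candidate interaction module has an architecture logit, the penalty is the softmax-weighted expected FLOPs, and the per-date searches are combined by averaging their logits. The function names, the `cost_weight` parameter, and the averaging rule are all hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cost_aware_loss(task_loss, arch_logits, module_flops, cost_weight):
    """Task loss plus a penalty on the expected serving cost.

    Assumption: the penalty is the softmax-weighted FLOPs of the candidate
    modules, which stays differentiable with respect to the logits.
    """
    probs = softmax(arch_logits)
    expected_flops = sum(p * f for p, f in zip(probs, module_flops))
    return task_loss + cost_weight * expected_flops


def aggregate_choices(per_date_logits):
    """Combine searches that ran independently on different data dates.

    Assumption: logits are averaged across dates and the module with the
    highest mean logit is selected. Returns the index of that module.
    """
    n_dates = len(per_date_logits)
    mean_logits = [sum(column) / n_dates for column in zip(*per_date_logits)]
    return max(range(len(mean_logits)), key=mean_logits.__getitem__)


# Hypothetical usage: two candidate modules, two data dates.
loss = cost_aware_loss(1.0, [0.0, 0.0], [2.0, 4.0], cost_weight=0.1)
chosen = aggregate_choices([[1.0, 0.0], [0.0, 3.0]])
```

Because each date is searched independently in this sketch, the per-date runs could execute in parallel, which is the intuition behind replacing a sequential pass over all of the data with a distributed one.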

Duke Scholars

Author: Yiran Chen, Electrical and Computer Engineering

Published In

Proceedings 2024 IEEE International Conference on Big Data Bigdata 2024

DOI

10.1109/BigData62323.2024.10825061

Publication Date

January 1, 2024

Start / End Page

1492 / 1499

Citation

APA

Zhang, T., Wen, W., Fedorov, I., Liu, X., Zhang, B., Han, F., … Chen, Y. (2024). DistDNAS: Search Efficient Feature Interactions within 2 Hours. In Proceedings 2024 IEEE International Conference on Big Data Bigdata 2024 (pp. 1492–1499). https://doi.org/10.1109/BigData62323.2024.10825061

Chicago

Zhang, T., W. Wen, I. Fedorov, X. Liu, B. Zhang, F. Han, W. Y. Chen, et al. “DistDNAS: Search Efficient Feature Interactions within 2 Hours.” In Proceedings 2024 IEEE International Conference on Big Data Bigdata 2024, 1492–99, 2024. https://doi.org/10.1109/BigData62323.2024.10825061.

ICMJE

Zhang T, Wen W, Fedorov I, Liu X, Zhang B, Han F, et al. DistDNAS: Search Efficient Feature Interactions within 2 Hours. In: Proceedings 2024 IEEE International Conference on Big Data Bigdata 2024. 2024. p. 1492–9.

MLA

Zhang, T., et al. “DistDNAS: Search Efficient Feature Interactions within 2 Hours.” Proceedings 2024 IEEE International Conference on Big Data Bigdata 2024, 2024, pp. 1492–99. Scopus, doi:10.1109/BigData62323.2024.10825061.

NLM

Zhang T, Wen W, Fedorov I, Liu X, Zhang B, Han F, Chen WY, Han Y, Yan F, Li H, Chen Y. DistDNAS: Search Efficient Feature Interactions within 2 Hours. Proceedings 2024 IEEE International Conference on Big Data Bigdata 2024. 2024. p. 1492–1499.
