Scholars@Duke publication: NDSEARCH: Accelerating Graph-Traversal-Based Approximate Nearest Neighbor Search through Near Data Processing

NDSEARCH: Accelerating Graph-Traversal-Based Approximate Nearest Neighbor Search through Near Data Processing

Publication , Conference

Wang, Y; Li, S; Zheng, Q; Song, L; Li, Z; Chang, A; Li, HLH; Chen, Y

Published in: Proceedings International Symposium on Computer Architecture

January 1, 2024

Approximate nearest neighbor search (ANNS) is a key retrieval technique for vector database and many data center applications, such as person re-identification and recommendation systems. It is also fundamental to retrieval augmented generation (RAG) for large language models (LLM) now. Among all the ANNS algorithms, graph-traversal-based ANNS achieves the highest recall rate. However, as the size of dataset increases, the graph may require hundreds of gigabytes of memory, exceeding the main memory capacity of a single workstation node. Although we can do partitioning and use solid-state drive (SSD) as the backing storage, the limited SSD I/O bandwidth severely degrades the performance of the system. To address this challenge, we present NDSEARCh, a hardware-software co-designed near-data processing (NDP) solution for ANNS processing. NDSeARCH consists of a novel in-storage computing architecture, namely, SEARSSD, that supports the ANNS kernels and leverages logic unit (LUN)-level parallelism inside the NAND flash chips. NDSEARCH also includes a processing model that is customized for NDP and cooperates with SearSSD. The processing model enables us to apply a two-level scheduling to improve the data locality and exploit the internal bandwidth in NDSearch, and a speculative searching mechanism to further accelerate the ANNS workload. Our results show that NDSEARCH improves the throughput by up to 31.7 ×, 14.6 ×, 7.4 × 2.9 × over CPU, GPU, a state-of-the-art SmartSSD-only design, and DeepStore, respectively. NDSEARCH also achieves two orders-of-magnitude higher energy efficiency than CPU and GPU.

Duke Scholars

Author Yiran Chen Pierre R. Lamond Department of Electrical and Computer Engin ...

Author Hai "Helen" Li Pierre R. Lamond Department of Electrical and Computer Engin ...

Published In

Proceedings International Symposium on Computer Architecture

DOI

10.1109/ISCA59077.2024.00035

EISSN

2575-713X

ISSN

1063-6897

Publication Date

January 1, 2024

Start / End Page

368 / 381

Citation

APA

Chicago

ICMJE

MLA

NLM

Wang, Y., Li, S., Zheng, Q., Song, L., Li, Z., Chang, A., … Chen, Y. (2024). NDSEARCH: Accelerating Graph-Traversal-Based Approximate Nearest Neighbor Search through Near Data Processing. In Proceedings International Symposium on Computer Architecture (pp. 368–381). https://doi.org/10.1109/ISCA59077.2024.00035

Wang, Y., S. Li, Q. Zheng, L. Song, Z. Li, A. Chang, H. L. H. Li, and Y. Chen. “NDSEARCH: Accelerating Graph-Traversal-Based Approximate Nearest Neighbor Search through Near Data Processing.” In Proceedings International Symposium on Computer Architecture, 368–81, 2024. https://doi.org/10.1109/ISCA59077.2024.00035.

Wang Y, Li S, Zheng Q, Song L, Li Z, Chang A, et al. NDSEARCH: Accelerating Graph-Traversal-Based Approximate Nearest Neighbor Search through Near Data Processing. In: Proceedings International Symposium on Computer Architecture. 2024. p. 368–81.

Wang, Y., et al. “NDSEARCH: Accelerating Graph-Traversal-Based Approximate Nearest Neighbor Search through Near Data Processing.” Proceedings International Symposium on Computer Architecture, 2024, pp. 368–81. Scopus, doi:10.1109/ISCA59077.2024.00035.

Wang Y, Li S, Zheng Q, Song L, Li Z, Chang A, Li HLH, Chen Y. NDSEARCH: Accelerating Graph-Traversal-Based Approximate Nearest Neighbor Search through Near Data Processing. Proceedings International Symposium on Computer Architecture. 2024. p. 368–381.

Published In

Proceedings International Symposium on Computer Architecture

DOI

10.1109/ISCA59077.2024.00035

EISSN

2575-713X

ISSN

1063-6897

Publication Date

January 1, 2024

Start / End Page

368 / 381