Skip to main content

NDSEARCH: Accelerating Graph-Traversal-Based Approximate Nearest Neighbor Search through Near Data Processing

Publication ,  Conference
Wang, Y; Li, S; Zheng, Q; Song, L; Li, Z; Chang, A; Li, HLH; Chen, Y
Published in: Proceedings - International Symposium on Computer Architecture
January 1, 2024

Approximate nearest neighbor search (ANNS) is a key retrieval technique for vector database and many data center applications, such as person re-identification and recommendation systems. It is also fundamental to retrieval augmented generation (RAG) for large language models (LLM) now. Among all the ANNS algorithms, graph-traversal-based ANNS achieves the highest recall rate. However, as the size of dataset increases, the graph may require hundreds of gigabytes of memory, exceeding the main memory capacity of a single workstation node. Although we can do partitioning and use solid-state drive (SSD) as the backing storage, the limited SSD I/O bandwidth severely degrades the performance of the system. To address this challenge, we present NDSEARCh, a hardware-software co-designed near-data processing (NDP) solution for ANNS processing. NDSeARCH consists of a novel in-storage computing architecture, namely, SEARSSD, that supports the ANNS kernels and leverages logic unit (LUN)-level parallelism inside the NAND flash chips. NDSEARCH also includes a processing model that is customized for NDP and cooperates with SearSSD. The processing model enables us to apply a two-level scheduling to improve the data locality and exploit the internal bandwidth in NDSearch, and a speculative searching mechanism to further accelerate the ANNS workload. Our results show that NDSEARCH improves the throughput by up to 31.7 ×, 14.6 ×, 7.4 × 2.9 × over CPU, GPU, a state-of-the-art SmartSSD-only design, and DeepStore, respectively. NDSEARCH also achieves two orders-of-magnitude higher energy efficiency than CPU and GPU.

Duke Scholars

Published In

Proceedings - International Symposium on Computer Architecture

DOI

EISSN

2575-713X

ISSN

1063-6897

Publication Date

January 1, 2024

Start / End Page

368 / 381
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Wang, Y., Li, S., Zheng, Q., Song, L., Li, Z., Chang, A., … Chen, Y. (2024). NDSEARCH: Accelerating Graph-Traversal-Based Approximate Nearest Neighbor Search through Near Data Processing. In Proceedings - International Symposium on Computer Architecture (pp. 368–381). https://doi.org/10.1109/ISCA59077.2024.00035
Wang, Y., S. Li, Q. Zheng, L. Song, Z. Li, A. Chang, H. L. H. Li, and Y. Chen. “NDSEARCH: Accelerating Graph-Traversal-Based Approximate Nearest Neighbor Search through Near Data Processing.” In Proceedings - International Symposium on Computer Architecture, 368–81, 2024. https://doi.org/10.1109/ISCA59077.2024.00035.
Wang Y, Li S, Zheng Q, Song L, Li Z, Chang A, et al. NDSEARCH: Accelerating Graph-Traversal-Based Approximate Nearest Neighbor Search through Near Data Processing. In: Proceedings - International Symposium on Computer Architecture. 2024. p. 368–81.
Wang, Y., et al. “NDSEARCH: Accelerating Graph-Traversal-Based Approximate Nearest Neighbor Search through Near Data Processing.” Proceedings - International Symposium on Computer Architecture, 2024, pp. 368–81. Scopus, doi:10.1109/ISCA59077.2024.00035.
Wang Y, Li S, Zheng Q, Song L, Li Z, Chang A, Li HLH, Chen Y. NDSEARCH: Accelerating Graph-Traversal-Based Approximate Nearest Neighbor Search through Near Data Processing. Proceedings - International Symposium on Computer Architecture. 2024. p. 368–381.

Published In

Proceedings - International Symposium on Computer Architecture

DOI

EISSN

2575-713X

ISSN

1063-6897

Publication Date

January 1, 2024

Start / End Page

368 / 381