Overview
Member of Duke Systems Group
Research interests are (1) datacenter and cloud computing and (2) machine learning systems.
Current Appointments & Affiliations
Assistant Professor of Computer Science · 2020 - Present
Computer Science, Trinity College of Arts & Sciences
Recent Publications
LLM.265: Video Codecs are Secretly Tensor Codecs
Conference Proceedings of the Annual International Symposium on Microarchitecture Micro · October 17, 2025
As the parameter size of large language models (LLMs) continues to expand, the need for a large memory footprint and high communication bandwidth have become significant bottlenecks for the training and inference of LLMs. To mitigate these bottlenecks, var ...
Can Large Language Models Verify System Software? A Case Study Using FSCQ as a Benchmark
Conference HOTOS 2025 Proceedings of the 2025 Workshop in Hot Topics in Operating Systems · June 6, 2025
Large language models (LLMs) have demonstrated remarkable coding capabilities. They excel in code synthesis benchmarks across diverse domains and have become ubiquitous in coding tools. Recently, they have also shown promise in generating mathematical proo ...
Rethinking RPC Communication for Microservices-based Applications
Conference HOTOS 2025 Proceedings of the 2025 Workshop in Hot Topics in Operating Systems · June 6, 2025
Fast and efficient RPCs are key to the performance of applications based on microservices. But RPC communication suffers from significant overhead today because it relies on the standard, layered protocol stack and loose coupling between the end host and i ...
Recent Grants
CAREER: OS-Managed Remote Procedure Call for Datacenter Applications
Research · Principal Investigator · Awarded by National Science Foundation · 2023 - 2028
Collaborative Research: NeTS: Medium: Application Defined Networking
Research · Principal Investigator · Awarded by National Science Foundation · 2024 - 2027
CC* Integration-Large: Scaling Scientific Workloads on Distributed Commodity GPUs and Storage through Campus-level RDMA Networking
Research · Principal Investigator · Awarded by National Science Foundation · 2025 - 2027
Education, Training & Certifications
University of Washington · 2019 · Ph.D.