Publications
You can also find my articles on my Google Scholar profile.
[1]
KVDrive: A Holistic Multi-Tier KV Cache Management System for Long-Context LLM Inference
ACM SIGMOD/PODS International Conference on Management of Data (SIGMOD), 2026
[2]
DIRECTOR: Accelerating Distributed MoE Serving via Online Proactive Expert Placement
IEEE International Conference on Computer Communications (INFOCOM), 2026
[3]
PPAI: Enabling Personalized LLM Agent Interoperability for Collaborative Edge Intelligence
IEEE International Conference on Computer Communications (INFOCOM), 2026
[4]
MELL: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache Management
IEEE International Conference on Computer Communications (INFOCOM), 2025
