Full Publication List

Conference Paper

One-Size-Fits-None: Understanding and Enhancing Slow-Fault Tolerance in Modern Distributed Systems
Ruiming Lu, Yunchi Lu, Yuxuan Jiang, Guangtao Xue, Peng Huang
NSDI 2025    [Preprint]    [Software]

CSAL: the Next-Gen Local Disks for the Cloud
Yanbo Zhou, Erci Xu, Li Zhang, Kapil Karkra, Mariusz Barczak, Wayne Gao, Wojciech Malikowski, Mateusz Kozlowski, Łukasz Łasek, Ruiming Lu, Feng Yang, Lilong Huang, Xiaolu Zhang, Wenrui Li, Jinhu Li, Keqiang Niu, Jiaji Zhu, Jiesheng Wu
Eurosys 2024    [Software]
Perseus: A Fail-Slow Detection Framework for Cloud Storage Systems
Ruiming Lu*, Erci Xu*, Yiming Zhang, Fengyi Zhu, Zhaosheng Zhu, Mengtian Wang, Zongpeng Zhu, Guangtao Xue, Jiwu Shu, Minglu Li, Jiesheng Wu (*Co-first)
FAST 2023   (Best Paper Award, Inivited to Appear in USENIX ;login:, Fast-tracked to ToS)
[PDF]   [Slides]   [Video]   [Dataset]
Press   [AliCloud]   [CitiNews]

NVMe SSD Failures in the Field: the Fail-Stop and the Fail-Slow
Ruiming Lu*, Erci Xu*, Yiming Zhang, Zhaosheng Zhu, Mengtian Wang, Zongpeng Zhu, Guangtao Xue, Minglu Li, Jiesheng Wu (*Co-first)
ATC 2022 [PDF]   [Slides]   [Video]   [Dataset]
Press   [ChinaSys]   [Shanghai Computer Association - Storage]

Journal and Magazine Articles

MasterPlan: A Reinforcement Learning Based Scheduler for Archive Storage
Xinqi Chen, Erci Xu, Dengyao Mo, Ruiming Lu, Haonan Wu, Dian Ding, Guangtao Xue
ACM Transactions on Architecture and Code Optimization, Volume 22, Issue 1 (March 2025)

From Missteps to Milestones: A Journey to Practical Fail-Slow Detection
Ruiming Lu, Erci Xu, Yiming Zhang, Fengyi Zhu, Zhaosheng Zhu, Mengtian Wang, Zongpeng Zhu, Guangtao Xue, Jiwu Shu, Minglu Li, Jiesheng Wu
ACM Transactions on Storage, Volume 19, Issue 4 (November 2023)

Detecting Fail-Slow Failures in Large-Scale Cloud Storage Systems
Ruiming Lu, Erci Xu, Yiming Zhang, Fengyi Zhu, Zhaosheng Zhu, Mengtian Wang, Zongpeng Zhu, Guangtao Xue, Jiwu Shu, Minglu Li, Jiesheng Wu
USNNIX ;login: Online, Feb 9, 2023