Zherui Liu
Publications by Year
Research Areas
Cloud Computing and Resource Management, IoT and Edge/Fog Computing, Topic Modeling, Parallel Computing and Optimization Techniques, Natural Language Processing Techniques
Most-Cited Works
- → Lyra: Elastic Scheduling for Deep Learning Clusters(2023)46 cited
- → MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs(2024)24 cited
- → Predicting GPU Failures With High Precision Under Deep Learning Workloads(2023)4 cited
- → Robust LLM Training Infrastructure at ByteDance(2025)1 cited
- → Understanding Stragglers in Large Model Training Using What-if Analysis(2025)