Yuxiong He
Seoul National University (KR), Bellevue Hospital Center (US)
Research Areas
Advanced Neural Network Applications, Topic Modeling, Cloud Computing and Resource Management, Parallel Computing and Optimization Techniques, Distributed and Parallel Computing Systems
Most-Cited Works
- → ZeRO: Memory Optimizations Toward Training Trillion Parameter Models (2020) 697 cited
- → DeepSpeed (2020) 672 cited
- → Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model (2022) 299 cited
- → OpenFold: retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization (2024) 252 cited
- → DeepSpeed-Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale (2022) 208 cited
- → ZeRO-Infinity (2021) 205 cited
- → The Cilkview scalability analyzer (2010) 108 cited