Conglong Li
Microsoft Research (United Kingdom)
Research Areas
Topic Modeling, Advanced Neural Network Applications, Adversarial Robustness in Machine Learning, Machine Learning and Algorithms, Machine Learning and Data Classification
Most-Cited Works
- ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers (2022), 72 citations
- DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale (2022), 55 citations
- DeepSpeed Data Efficiency: Improving Deep Learning Model Quality and Training Efficiency via Efficient Data Sampling and Routing (2024), 18 citations
- DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales (2023), 10 citations
- DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies (2023), 8 citations
- LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs (2024), 5 citations
- DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention (2023), 4 citations
- The Stability-Efficiency Dilemma: Investigating Sequence Length Warmup for Training GPT Models (2022)