Elton Zheng
Publications by Year
Research Areas
Advanced Neural Network Applications, Topic Modeling, Medical Imaging Techniques and Applications, Advanced Graph Neural Networks, Domain Adaptation and Few-Shot Learning
Most-Cited Works
- → DeepSpeed- Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale(2022)208 cited
- Deep Learning Inference Service at Microsoft(2019)
- → DeepSpeed Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale(2022)8 cited
- Accelerating Large Scale Deep Learning Inference through DeepCPU at Microsoft.(2019)
- → ZeroQuant-HERO: Hardware-Enhanced Robust Optimized Post-Training Quantization Framework for W8A8 Transformers(2023)1 cited
- Deep Learning at Microsoft – presented at TVM19(2019)