Stephen Youn
Microsoft Research (United Kingdom)(GB)
Publications by Year
Research Areas
Topic Modeling, Advanced Neural Network Applications, Natural Language Processing Techniques, Ferroelectric and Negative Capacitance Devices, Speech Recognition and Synthesis
Most-Cited Works
- → Exploring Post-training Quantization in LLMs from Comprehensive Study to Low Rank Compensation(2024)16 cited
- → ZeroQuant-V2: Exploring Post-training Quantization in LLMs from Comprehensive Study to Low Rank Compensation(2023)15 cited
- → FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design(2024)3 cited
- → ZeroQuant-HERO: Hardware-Enhanced Robust Optimized Post-Training Quantization Framework for W8A8 Transformers(2023)1 cited
- → ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks(2023)1 cited