Vijay Anand Korthikanti
Publications by Year
Research Areas
Topic Modeling, Multimodal Machine Learning Applications, Parallel Computing and Optimization Techniques, Interconnection Networks and Systems, Formal Methods in Verification
Most-Cited Works
- → Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model(2022)299 cited
- → Synthesizing geometry constructions(2011)91 cited
- → Reducing Activation Recomputation in Large Transformer Models(2022)52 cited
- → Towards optimizing energy costs of algorithms for shared memory architectures(2010)51 cited
- → Reasoning about MDPs as Transformers of Probability Distributions(2010)40 cited
- → Efficient large-scale language model training on GPU clusters using megatron-LM(2021)38 cited
- → Analysis of Parallel Algorithms for Energy Conservation in Scalable Multicore Architectures(2009)36 cited
- Efficient Large-Scale Language Model Training on GPU Clusters(2021)
- → Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning(2023)23 cited