Quentin Anthony
The Ohio State University (US)
Research Areas
Advanced Neural Network Applications, Topic Modeling, Natural Language Processing Techniques, Stochastic Gradient Optimization Techniques, Parallel Computing and Optimization Techniques
Most-Cited Works
- GPT-NeoX-20B: An Open-Source Autoregressive Language Model (2022), 383 cited
- RWKV: Reinventing RNNs for the Transformer Era (2023), 281 cited
- GEMS: GPU-Enabled Memory-Aware Model-Parallelism System for Distributed DNN Training (2020), 45 cited
- Performance Characterization of DNN Training using TensorFlow and PyTorch on Modern Clusters (2019), 35 cited
- Emergent and Predictable Memorization in Large Language Models (2023), 26 cited
- Accelerating MPI All-to-All Communication with Online Compression on Modern GPU Clusters (2022), 26 cited
- RedPajama: an Open Dataset for Training Large Language Models (2024), 17 cited
- Adaptive and Hierarchical Large Message All-to-all Communication Algorithms for Large-scale Dense GPU Systems (2021), 14 cited