Chengruidong Zhang
Research Areas
Topic Modeling, Advanced Neural Network Applications, Natural Language Processing Techniques, Multimodal Machine Learning Applications, Domain Adaptation and Few-Shot Learning
Most-Cited Works
- PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation (2023), 19 citations
- LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens (2024), 15 citations
- MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention (2024), 4 citations
- RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval (2024), 1 citation
- SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling (2026)
- RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference (2025)
- Region-Adaptive Sampling for Diffusion Transformers (2025)
- MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention (2025)
- SCBench: A KV Cache-Centric Analysis of Long-Context Methods (2024)