Yangtian Zi
Northeastern University(US)
Publications by Year
Research Areas
Software Engineering Research, Topic Modeling, Machine Learning and Data Classification, Natural Language Processing Techniques, Software Testing and Debugging Techniques
Most-Cited Works
- → StarCoder: may the source be with you!(2023)192 cited
- → MultiPL-E: A Scalable and Polyglot Approach to Benchmarking Neural Code Generation(2023)90 cited
- → SantaCoder: don't reach for the stars!(2023)51 cited
- → How Beginning Programmers and Code LLMs (Mis)read Each Other(2024)42 cited
- → MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation(2022)13 cited
- → StudentEval: A Benchmark of Student-Written Prompts for Large Language Models of Code(2024)11 cited
- → ``I Would Have Written My Code Differently': Beginners Struggle to Understand LLM-Generated Code(2025)2 cited
- → More Than a Score: Probing the Impact of Prompt Specificity on LLM Code Generation(2025)