Guillaume Wenzek
Publications by Year
Research Areas
Natural Language Processing Techniques, Topic Modeling, Multimodal Machine Learning Applications, Text Readability and Simplification, Speech and dialogue systems
Most-Cited Works
- → Unsupervised Cross-lingual Representation Learning at Scale(2020)539 cited
- → Beyond English-Centric Multilingual Machine Translation(2020)468 cited
- → No Language Left Behind: Scaling Human-Centered Machine Translation(2022)360 cited
- → CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data(2019)242 cited
- → The Flores-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation(2022)176 cited
- → Scaling neural machine translation to 200 languages(2024)70 cited
- → CCMatrix: Mining Billions of High-Quality Parallel Sentences on the Web(2021)61 cited
- → Seamless: Multilingual Expressive and Streaming Speech Translation(2023)39 cited