Lucile Saulnier
Research Areas
Topic Modeling, Natural Language Processing Techniques, Multimodal Machine Learning Applications, Scientific Computing and Data Management, Machine Learning and Data Classification
Most-Cited Works
- Mistral 7B (2023), 260 citations
- Mixtral of Experts (2024), 115 citations
- The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset (2023), 65 citations
- OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents (2023), 48 citations
- What Language Model to Train if You Have One Million GPU Hours? (2022), 6 citations
- Pixtral 12B (2024), 4 citations
- Distributed Deep Learning in Open Collaborations (2021), 4 citations
- Magistral (2025), 1 citation
- Voxtral (2025)
- Training Transformers Together (2022)