Sokhar Samb
Publications by Year
Research Areas
Natural Language Processing Techniques, Topic Modeling, Multilingual Education and Policy, Language and cultural evolution, Text Readability and Simplification
Most-Cited Works
- → Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets(2022)163 cited
- → IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models(2024)3 cited
- → Findings from the Bambara - French Machine Translation Competition (BFMT 2023)(2023)2 cited
- → INJONGO: A Multicultural Intent Detection and Slot-filling Dataset for 16 African Languages(2025)