Niklas Muennighoff
Stanford University (US)
Research Areas
Topic Modeling, Natural Language Processing Techniques, Multimodal Machine Learning Applications, Software Engineering Research, Advanced Graph Neural Networks
Most-Cited Works
- MTEB: Massive Text Embedding Benchmark (2023), 335 citations
- Crosslingual Generalization through Multitask Finetuning (2023), 312 citations
- StarCoder: may the source be with you! (2023), 192 citations
- SGPT: GPT Sentence Embeddings for Semantic Search (2022), 56 citations
- SantaCoder: don't reach for the stars! (2023), 51 citations
- A large-scale audit of dataset licensing and attribution in AI (2024), 47 citations
- OLMo: Accelerating the Science of Language Models (2024), 46 citations
- Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model (2024), 43 citations
- Vilio: State-of-the-art Visio-Linguistic Models applied to Hateful Memes (2020), 40 citations