0 citations

Tokenizer Choice For LLM Training: Negligible or Crucial?

2024pp. 3907–3924

Citations Over TimeTop 1% of 2024 papers

Abstract

Mehdi Ali, Michael Fromm, Klaudia Thellmann, Richard Rutmann, Max Lübbering, Johannes Leveling, Katrin Klug, Jan Ebert, Niclas Doll, Jasper Buschhoff, Charvi Jain, Alexander Weber, Lena Jurkschat, Hammam Abdelwahab, Chelsea John, Pedro Ortiz Suarez, Malte Ostendorff, Samuel Weinbach, Rafet Sifa, Stefan Kesselheim, Nicolas Flores-Herr. Findings of the Association for Computational Linguistics: NAACL 2024. 2024.

Related Papers

→ Training Systems Concept for the Armored Family of Vehicles with Consideration of the Roles of Embedded Training and Stand-Alone Training Devices(1988)1 cited
Using DataGrid Control to Realize DataBase of Querying in VB6.0(2000)
Susquehanna Chorale Spring Concert "Roots and Wings"(2017)
→ ИСПОЛЬЗОВAНИЕ ПОТЕНЦИAЛA СОЦИAЛЬНЫХ ПAРТНЕРОВ В ПОДГОТОВКЕ БУДУЩИХ ПЕДAГОГОВ(2024)