Richard Rutmann
Publications by Year
Research Areas
Natural Language Processing Techniques, Topic Modeling, Scientific Computing and Data Management, Metal Alloys Wear and Properties, Artificial Intelligence in Law
Most-Cited Works
- → Tokenizer Choice For LLM Training: Negligible or Crucial?(2024)23 cited
- → Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs(2024)2 cited
- → Modalities, a PyTorch-native Framework For Large-scale LLM Training and Research(2026)
- → Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models(2025)
- → Data Processing for the OpenGPT-X Model Family(2024)