Michael Kuchnik
Publications by Year
Research Areas
Advanced Data Storage Technologies, Advanced Neural Network Applications, Parallel Computing and Optimization Techniques, Machine Learning and Data Classification, Natural Language Processing Techniques
Most-Cited Works
- → File systems unfit as distributed storage backends(2019)89 cited
- → The Case for Custom Storage Backends in Distributed Storage Systems(2020)16 cited
- → Croissant: A Metadata Format for ML-Ready Datasets(2024)16 cited
- The Atlas Cluster Trace Repository.(2018)
- → Revisiting Reliability in Large-Scale Machine Learning Research Clusters(2025)10 cited
- → Efficient Augmentation via Data Subsampling(2018)10 cited
- → Plumber: Diagnosing and Removing Performance Bottlenecks in Machine Learning Data Pipelines(2021)9 cited
- → Progressive compressed records(2021)9 cited
- → Validating Large Language Models with ReLM(2022)8 cited