Zejiang Shen
Massachusetts Institute of Technology(US)
Publications by Year
Research Areas
Topic Modeling, Natural Language Processing Techniques, Handwritten Text Recognition Techniques, Advanced Text Analysis Techniques, Image Processing and 3D Reconstruction
Most-Cited Works
- → LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis(2021)107 cited
- → A Design Space for Intelligent and Interactive Writing Assistants(2024)94 cited
- → The Semantic Scholar Open Data Platform(2023)54 cited
- → A Large Dataset of Historical Japanese Documents with Complex Layouts(2020)53 cited
- → Deep Learning based Framework for Automatic Damage Detection in Aircraft Engine Borescope Inspection(2019)38 cited
- → Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research(2024)35 cited
- → VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups(2022)30 cited
- → Multi-LexSum: Real-World Summaries of Civil Rights Lawsuits at Multiple Granularities(2022)20 cited
- → PAWLS: PDF Annotation With Labels and Structure(2021)12 cited