Aiding Large Language Models Using Clinical Scoresheets for Neurobehavioral Diagnostic Classification From Text: Algorithm Development and Validation
JMIR AI2025Vol. 4, pp. e75030–e75030
Kaiying Lin, Abdur Rasool, Saimourya Surabhi, Cezmi Mutlu, H. H. Zhang, Dennis P. Wall, Peter Washington
Abstract
Current LLM-based chatbots, when prompted naively, underperform on psychiatric and behavioral diagnostic tasks compared to specialized machine learning models. Clinical assessment scales might modestly aid chatbot performance, but more sophisticated prompt engineering and domain integration are likely required to reach clinically actionable standards.
Related Papers
- → In which I suggest a preprint archive for clinical trials(2010)5 cited
- → The Preprint Peer Reviewer's Toolkit: How to post a peer review of a preprint(2022)1 cited
- Susquehanna Chorale Spring Concert "Roots and Wings"(2017)
- → This preprint has been removed(2020)
- → ИСПОЛЬЗОВAНИЕ ПОТЕНЦИAЛA СОЦИAЛЬНЫХ ПAРТНЕРОВ В ПОДГОТОВКЕ БУДУЩИХ ПЕДAГОГОВ(2024)