Real-World Performance of Open-Source Large Language Models in Diabetes Diagnosis
Mar 9, 2026·

,,
Shu-Ting YANG
1st Author
,Sujie Liu
Co-1st Author
,Yuxi Ma
Co-1st Author
Bao-Wen GAI
Junwei Liu
Liansheng Wang
Co-corresponding Author
Feng GAO
Co-corresponding Author
,Zhiguang Zhou
Co-corresponding Author
·
0 min readAbstract
This study evaluated how diverse open-source large language models perform on diabetes diagnosis from unstructured clinical text in a real-world Chinese cohort. Across 11,329 adult patients, the models achieved strong performance for complex diabetes subtype classification, but remained less reliable for rule-based diagnoses such as diabetic kidney disease and metabolic syndrome. The results highlight the promise of open-source LLMs as clinical decision-support tools while also underscoring their current limitations in procedural reasoning tasks.
Type
Publication
Frontiers in Endocrinology

Authors
Physician
An endocrinologist passionate about artificial intelligence and diabetes, focusing on applying machine learning and bioinformatics to diabetes heterogeneity research.

Authors
PhD Student
I am a PhD student working on AI methods for colorectal cancer diagnosis and prognosis.

Authors
Professor
My research leverages AI and big data to improve diagnostics, prognostics, and ultimately, outcomes in cancer and other biomedical fields.