Real-World Performance of Open-Source Large Language Models in Diabetes Diagnosis

Mar 9, 2026·

Shu-Ting YANG

1st Author

Sujie Liu

Co-1st Author

Yuxi Ma

Co-1st Author

Bao-Wen GAI

Junwei Liu

Liansheng Wang

Co-corresponding Author

Feng GAO

Co-corresponding Author

Zhiguang Zhou

Co-corresponding Author

· 0 min read

DOI

Abstract

This study evaluated how diverse open-source large language models perform on diabetes diagnosis from unstructured clinical text in a real-world Chinese cohort. Across 11,329 adult patients, the models achieved strong performance for complex diabetes subtype classification, but remained less reliable for rule-based diagnoses such as diabetic kidney disease and metabolic syndrome. The results highlight the promise of open-source LLMs as clinical decision-support tools while also underscoring their current limitations in procedural reasoning tasks.

Type

Journal article

Publication

Frontiers in Endocrinology

Last updated on Mar 9, 2026

Journal Article Diabetes Large Language Models Clinical AI Real-World Study

Authors

Shu-Ting YANG

Physician

An endocrinologist passionate about artificial intelligence and diabetes, focusing on applying machine learning and bioinformatics to diabetes heterogeneity research.

Authors

Bao-Wen GAI

PhD Student

I am a PhD student working on AI methods for colorectal cancer diagnosis and prognosis.

Authors

Feng GAO

Professor

My research leverages AI and big data to improve diagnostics, prognostics, and ultimately, outcomes in cancer and other biomedical fields.

← Multi-omics driven immune classification of colorectal cancer: Implications for immunotherapy efficacy prediction and enhancement with WNT signaling inhibition Mar 11, 2026

Lactylation of SLC26A3 in the acidic tumor microenvironment promotes malignant progression of colorectal carcinoma Jan 21, 2026 →