Search CORE

1 research outputs found

Evaluation of ChatGPT-Generated Differential Diagnosis for Common Diseases With Atypical Presentation: Descriptive Research

Author: Fumina Orihara
Fumio Otsuka
Gemmei Iizuka
Hirofumi Kimura
Hiroki Tamura
Hiromizu Takahashi
Kiyoshi Shikino
Koichi Nakashima
Kotaro Kunitomo
Masaki Tago
Midori Tokushima
Morika Suzuki
Satoshi Watanuki
Sayaka Aoyama
Shintaro Kosaka
Takashi Watari
Takuma Saito
Taro Shimizu
Teiko Kawahigashi
Tomohiro Matsumoto
Toru Morikawa
Toshinori Nishizawa
Yasuharu Tokuda
Yoji Hoshina
Yosuke Sasaki
Yu Yamamoto
Yuichiro Matsuo
Yuki Otsuka
Yuto Unoki
Publication venue: JMIR Publications
Publication date: 01/06/2024
Field of study

Abstract BackgroundThe persistence of diagnostic errors, despite advances in medical knowledge and diagnostics, highlights the importance of understanding atypical disease presentations and their contribution to mortality and morbidity. Artificial intelligence (AI), particularly generative pre-trained transformers like GPT-4, holds promise for improving diagnostic accuracy, but requires further exploration in handling atypical presentations. ObjectiveThis study aimed to assess the diagnostic accuracy of ChatGPT in generating differential diagnoses for atypical presentations of common diseases, with a focus on the model’s reliance on patient history during the diagnostic process. MethodsWe used 25 clinical vignettes from the Journal of Generalist Medicine ResultsChatGPT’s diagnostic accuracy decreased with an increase in atypical presentation. For category 1 (C1) cases, the concordance rates were 17% (n=1) for the top 1 and 67% (n=4) for the top 5. Categories 3 (C3) and 4 (C4) showed a 0% concordance for top 1 and markedly lower rates for the top 5, indicating difficulties in handling highly atypical cases. The χ2χ1Pχ1P ConclusionsChatGPT-4 demonstrates potential as an auxiliary tool for diagnosing typical and mildly atypical presentations of common diseases. However, its performance declines with greater atypicality. The study findings underscore the need for AI systems to encompass a broader range of linguistic capabilities, cultural understanding, and diverse clinical scenarios to improve diagnostic utility in real-world settings

Directory of Open Access Journals