JMIR Preprints #67270: Assess the capabilities of AI-based large language models (AI-LLMs) in interpreting histopathological slides and scientific figures: performance evaluation study

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)

Assess the capabilities of AI-based large language models (AI-LLMs) in interpreting histopathological slides and scientific figures: performance evaluation study

Khanisyah Erza Gumilar;
Grace Ariani;
Priangga Adi Wiratama;
Rimbun Rimbun;
Tri Hartini Yuliawati;
Hong Chen;
Ibrahim Haruna Ibrahim;
Cheng-Han Lin;
Tai-Yu Hung;
Dewanti Anggrahini;
Arya Satya Rajanagara;
Zih-Ying Yu;
Yu-Cheng Hsu;
Erry Gumilar Dachlan;
Jer-Yen Yang;
Li-Na Liao;
Ming Tan

ABSTRACT

Background:

Interpreting histopathology slides and scientific figures requires specialized skills and knowledge. Pathologists analyze various tissues and cells, while the general population often struggles with the technical information in scientific figures. Artificial intelligence-based large language models (AI-LLMs) can simplify these processes by providing clearer explanations.

Objective:

This study explores the capabilities AI-LLMs in interpreting histopathology slides and scientific figures. The objective is to assess the value of AI LLMs in medical applications and scientific education.

Methods:

The study was divided into two parts: interpreting histopathology slides and scientific figures. Six histopathology images and six scientific figures were tested on each of the three most frequently used chatbots (ChatGPT-4, Gemini Advanced, and Copilot). Responses from the chatbots were coded and blindly examined by expert raters using five parameters—relevance, clarity, depth, focus, and coherence—on a 5-point Likert scale. Statistical analysis included one-way ANOVA and multiple linear regression.

Results:

ChatGPT-4 outperformed Gemini Adv and Copilot in both histopathology and scientific image interpretation, with significantly higher scores across all parameters (P<.001). High homogeneity among raters validated these findings. ChatGPT-4's superior performance may be due to its advanced algorithms, extensive training data, specialized modules, and user feedback.

Conclusions:

ChatGPT-4 excels in interpreting histopathology and scientific images, which may lead to improving diagnostic accuracy, clinical decision-making, and reducing pathologists' workload. It also benefits education by enhancing students' understanding of complex images and promoting interactive learning. ChatGPT-4 shows a significant potential to improve patient care and enrich student learning.

Citation

Please cite as:

Gumilar KE, Ariani G, Wiratama PA, Rimbun R, Yuliawati TH, Chen H, Ibrahim IH, Lin CH, Hung TY, Anggrahini D, Rajanagara AS, Yu ZY, Hsu YC, Dachlan EG, Yang JY, Liao LN, Tan M

Assess the capabilities of AI-based large language models (AI-LLMs) in interpreting histopathological slides and scientific figures: performance evaluation study

JMIR Preprints. 07/10/2024:67270

DOI: 10.2196/preprints.67270

URL: https://preprints.jmir.org/preprint/67270

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Currently submitted to: Journal of Medical Internet Research

Date Submitted: Oct 7, 2024

Open Peer Review Period: Oct 9, 2024 - Dec 4, 2024

(currently open for review)

Assess the capabilities of AI-based large language models (AI-LLMs) in interpreting histopathological slides and scientific figures: performance evaluation study

ABSTRACT

Citation

Copyright