当前位置: X-MOL 学术Clin. Genitourin. Cancer › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Urological Cancers and ChatGPT: Assessing the Quality of Information and Possible Risks for Patients
Clinical Genitourinary Cancer ( IF 3.2 ) Pub Date : 2024-01-05 , DOI: 10.1016/j.clgc.2023.12.017
Faruk Ozgor , Ufuk Caglar , Ahmet Halis , Hakan Cakir , Ufuk Can Aksu , Ali Ayranci , Omer Sarilar

OpenAI has created ChatGPT, an artificial intelligence language model that has gained considerable recognition for its capacity to produce text responses resembling human language. Consequently, this study seeks to evaluate the effectiveness of ChatGPT's responses in addressing publicly accessible queries related to prostate, kidney, bladder, and testicular cancers. A comprehensive compilation of frequently asked questions (FAQs) pertaining to prostate, bladder, kidney, and testicular cancers was gathered from diverse sources. Additionally, the recommendations outlined in the European Association of Urology (EAU) 2023 Guideline Oncology were consulted. The chosen questions for evaluation were presented to the ChatGPT 4.0 premium version. The quality of ChatGPT responses was appraised using the global quality score (GQS). Each ChatGPT response was independently reviewed by a panel of physicians, who assigned a GQS score to assess its overall quality. For prostate cancer, 64.6% of the questions had a GQS score of 5, compared to 62.9 % for bladder, 68.1% for kidney, and 63.9% for testicular cancers, whereas none of the responses had a GQS score of 1. Meanwhile, the category with the lowest proportion of responses, with a GQS score of 5 for each disease, was prognosis and follow-up. The mean GQS score of the answers given to EAU guideline questions was statistically significantly lower than the average score of the answers given to FAQs. ChatGPT is a valuable tool for addressing general inquiries regarding urological cancers, boasting commendable accuracy rates. Nonetheless, its performance in responding to questions aligned with the EAU guideline was deemed unsatisfactory.

中文翻译:

泌尿系统癌症和 ChatGPT:评估信息质量和患者可能面临的风险

OpenAI 创建了 ChatGPT,这是一种人工智能语言模型,因其生成类似于人类语言的文本响应的能力而获得了广泛认可。因此,本研究旨在评估 ChatGPT 在解决与前列腺癌、肾癌、膀胱癌和睾丸癌相关的公开查询方面的响应效果。从不同来源收集了有关前列腺癌、膀胱癌、肾癌和睾丸癌的常见问题解答 (FAQ) 的综合汇编。此外,还参考了欧洲泌尿外科协会 (EAU) 2023 年肿瘤学指南中概述的建议。所选的评估问题将提交给 ChatGPT 4.0 高级版本。 ChatGPT 响应的质量使用全局质量得分 (GQS) 进行评估。每个 ChatGPT 回复均由一组医生独立审核,并分配 GQS 评分来评估其整体质量。对于前列腺癌,64.6% 的问题的 GQS 得分为 5,而膀胱癌、肾癌和睾丸癌的 GQS 得分分别为 62.9%、68.1% 和 63.9%,而没有一个答案的 GQS 得分为 1。反应比例最低的类别是预后和随访,每种疾病的 GQS 评分为 5 分。 EAU 指南问题答案的平均 GQS 分数在统计上显着低于常见问题解答的平均分数。 ChatGPT 是解决有关泌尿系统癌症的一般询问的宝贵工具,其准确率值得称赞。尽管如此,其在回答符合 EAU 指南的问题方面的表现仍不能令人满意。
更新日期:2024-01-05
down
wechat
bug