ChatGPT's competence in responding to urological emergencies

Ortaç, Mazhar; Ergül, Rıfat Burak; Yazılı, Hüseyin Burak; Özervarlı, Muhammet Firat; Tonyalı, Şenol; Sarılar, Omer; Özgör, Faruk


Ulus Travma Acil Cerrahi Derg. 2025; 31(3): 291-295 | DOI: 10.14744/tjtes.2024.03377

ChatGPT's competence in responding to urological emergencies

Mazhar Ortaç¹, Rıfat Burak Ergül¹, Hüseyin Burak Yazılı², Muhammet Firat Özervarlı¹, Şenol Tonyalı¹, Omer Sarılar², Faruk Özgör²
¹Department of Urology, Istanbul Faculty of Medicine, Istanbul University, Istanbul-Türkiye
²Urology Clinic, Haseki Training and Research Hospital, Istanbul-Türkiye

AIM: In recent years, artificial intelligence (AI) applications have been used as a source of information in medicine and in many other fields. This study is the first to evaluate ChatGPT's performance on urological emergencies (UE).
MATERIALS AND METHODS: The study includes questions frequently asked by the public (FAQs) about urological emergencies, together with questions on urological emergencies formulated by reviewing the European Association of Urology (EAU) guidelines. The FAQs were selected from questions asked by the public on social media (Facebook, Instagram, and X) or on doctor and hospital web pages. All questions were put to ChatGPT 4 (premium version) in English, and the answers were recorded. Two urologists rated the responses from 1 to 5 on the Global Quality Score (GQS) scale.
RESULTS: Of a total of 73 responses, 53 (72.6%) had a GQS score of 5 and only 2 (2.7%) had a GQS score of 1. The responses with a GQS score of 1 concerned priapism and urosepsis. The topic with the highest GQS score (82.3%) was urosepsis, while the lowest scores were in renal trauma (66.7%) and postrenal acute kidney injury (66.7%). There were 42 questions formulated from the EAU guideline; 23 (54.8%) of the responses to these questions received a GQS score of 5 from the physicians. The mean GQS score for responses to FAQs was 4.38±1.14, which was statistically higher than the mean GQS score for EAU guideline-based questions (3.88±1.47) (p=0.009).
CONCLUSION: This study showed for the first time that ChatGPT answered about three quarters of FAQs accurately and satisfactorily. By contrast, ChatGPT's accuracy and competence decreased considerably when answering guideline-based questions about UE.

Keywords: Artificial intelligence, ChatGPT, urological emergencies.

ChatGPT's competence in responding to urological emergencies

Mazhar Ortaç¹, Rıfat Burak Ergül¹, Hüseyin Burak Yazılı², Muhammet Firat Özervarlı¹, Şenol Tonyalı¹, Omer Sarılar², Faruk Özgör²
¹Department of Urology, Istanbul Faculty of Medicine, Istanbul University, Istanbul-Türkiye
²Department of Urology, Haseki Training and Research Hospital, Istanbul-Türkiye

BACKGROUND: In recent years, artificial intelligence (AI) applications have been increasingly used as sources of medical information, alongside their applications in many other fields. This study is the first to evaluate ChatGPT's performance in addressing urological emergencies (UE).
METHODS: The study included frequently asked questions (FAQs) by the public regarding UE, as well as UE-related questions formulated based on the European Association of Urology (EAU) guidelines. The FAQs were selected from questions posed by patients to doctors and hospital accounts on social media platforms (Facebook, Instagram, and X) and on websites. All questions were presented to ChatGPT 4 (premium version) in English, and the responses were recorded. Two urologists assessed the quality of the responses using a Global Quality Score (GQS) on a scale of 1 to 5.
RESULTS: Of the 73 total FAQs, 53 (72.6%) received a GQS score of 5, while only two (2.7%) received a GQS score of 1. The questions with a GQS score of 1 pertained to priapism and urosepsis. The topic with the highest proportion of responses receiving a GQS score of 5 was urosepsis (82.3%), whereas the lowest scores were observed in questions related to renal trauma (66.7%) and postrenal acute kidney injury (66.7%). A total of 42 questions were formulated based on the EAU guidelines, of which 23 (54.8%) received a GQS score of 5 from the physicians. The mean GQS score for FAQs was 4.38±1.14, which was significantly higher (p=0.009) than the mean GQS score for EAU guideline-based questions (3.88±1.47).
CONCLUSION: This study demonstrated for the first time that nearly three out of four FAQs were answered accurately and satisfactorily by ChatGPT. However, the accuracy and proficiency of ChatGPT's responses significantly decreased when addressing guideline-based questions on UE.
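
The proportions reported in RESULTS can be checked from the stated counts. The following is a minimal Python sketch for that arithmetic only; the helper name `pct` and the dictionary labels are illustrative assumptions and are not taken from the study, and the sketch does not attempt to reproduce the reported p-value, since the abstract does not name the statistical test used.

```python
def pct(count: int, total: int) -> float:
    """Return count/total as a percentage rounded to one decimal place."""
    return round(100 * count / total, 1)


# (count, total, percentage stated in the abstract)
reported = {
    "FAQ responses with GQS 5": (53, 73, 72.6),
    "FAQ responses with GQS 1": (2, 73, 2.7),
    "EAU guideline-based responses with GQS 5": (23, 42, 54.8),
}

for label, (count, total, stated) in reported.items():
    computed = pct(count, total)
    status = "matches" if computed == stated else "differs from"
    print(f"{label}: {count}/{total} = {computed}% ({status} stated {stated}%)")
```

All three recomputed percentages agree with the values given in the abstract.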

Keywords: Artificial intelligence, ChatGPT, urological emergencies.

Corresponding Author: Mazhar Ortaç, Türkiye
Article Language: English
