Tetik parmak tedavisinde s�k sorulan sorulara ChatGPT'nin yan�tlar�n�n kalitesinin de�erlendirilmesi

Gezer, Mehmet Can; Armangil, Mehmet

Cilt : 31 Say� : 4 Y�l : 2025

31/4Gelecek Say� Ar�iv Kapaklar Pop�ler Makaleler

ICMJE COI Form

H�zl� Arama

Tetik parmak tedavisinde s�k sorulan sorulara ChatGPT'nin yan�tlar�n�n kalitesinin de�erlendirilmesi [Ulus Travma Acil Cerrahi Derg]

Ulus Travma Acil Cerrahi Derg. 2025; 31(4): 389-393 | DOI: 10.14744/tjtes.2025.32735

Tetik parmak tedavisinde s�k sorulan sorulara ChatGPT'nin yan�tlar�n�n kalitesinin de�erlendirilmesi

Mehmet Can Gezer¹, Mehmet Armangil²
¹Mamak Devlet Hastanesi, Ortopedi ve Travmatoloji Klini�i, Ankara-T�rkiye
²Ankara �niversitesi T�p Fak�ltesi, Ortopedi ve Travmatoloji Anabilim Dal�, El Cerrahisi �nitesi, Ankara-T�rkiye

AMA�: Bu �al��ma, tetik parmak ile ilgili hasta sorular�na yan�t vermede Generative Pre-trained Transformer'in (ChatGPT; OpenAI, San Francisco, CA) do�ruluk ve g�venilirli�ini de�erlendirmeyi ama�lamaktad�r. Bu de�erlendirme, tedavi �ncesinde hasta e�itimini geli�tirme potansiyeline sahiptir ve yapay zeka tabanl� sistemlerin hasta e�itim s�recindeki rol�n� ayd�nlatmay� hedeflemektedir.
GERE� VE Y�NTEM: Tetik parmak ile ilgili en s�k sorulan on soru, hasta e�itimine y�nelik web sitelerinden ve literat�r taramas�ndan derlenmi� ve ChatGPT'ye y�neltilmi�tir. Yan�tlar, iki ortopedi uzman� taraf�ndan JAMA Benchmark kriterleri ve DISCERN arac� kullan�larak de�erlendirilmi�tir. Ek olarak, yan�tlar�n okunabilirli�i Flesch-Kincaid s�n�f seviyesi ile analiz edilmi�tir.
BULGULAR: ChatGPT'nin tetik parmak ile ilgili sorulara verdi�i yan�tlar i�in DISCERN puanlar� 35 ile 47 aras�nda de�i�mi� ve ortalama 42 olarak bulunmu�tur, bu da "orta" kaliteye i�aret etmektedir. Yan�tlar�n %60'� tatmin edici bulunurken, %40'�nda eksiklikler tespit edilmi�tir. JAMA Benchmark kriterlerine g�re, bilimsel referans eksikli�i �nemli bir dezavantaj olarak �ne ��km��t�r. Ortalama okunabilirlik seviyesi �niversite d�zeyindedir, bu da d��k sa�l�k okuryazarl��na sahip hastalar i�in bilgiyi anlamay� zorla�t�rmaktad�r. Yan�tlar�n daha geni� bir hasta kitlesi i�in eri�ilebilir ve anla��labilir hale getirilmesi gerekmektedir.
SONU�: Bulgular�m�z, bildi�imiz kadar�yla, tetik parmak ba�lam�nda ChatGPT kullan�m�n� ara�t�ran ilk �al��ma oldu�unu g�stermektedir. ChatGPT, tetik parmak hakk�nda genel bilgiler sa�lama konusunda makul bir ba�ar� g�stermektedir; ancak, hasta e�itimi i�in birincil kaynak olarak kullan�lmadan �nce uzman denetimi gereklidir.

Anahtar Kelimeler: Tetik parmak, ChatGPT, DISCERN, hasta e�itimi, yapay zek�.

Assessing the quality of ChatGPT's responses to commonly asked questions about trigger finger treatment

Mehmet Can Gezer¹, Mehmet Armangil²
¹Department of Orthopedics and Traumatology, Mamak State Hospital, Ankara-T�rkiye
²Department of Orthopedics and Traumatology, Hand Surgery Unit, Ankara University Faculty of Medicine, Ankara-T�rkiye

BACKGROUND: This study aims to evaluate the accuracy and reliability of Generative Pre-trained Transformer (ChatGPT; OpenAI, San Francisco, California) in answering patient-related questions about trigger finger. This evaluation has the potential to enhance patient education prior to treatment and provides insight into the role of artificial intelligence (AI)-based systems in the patient educa-tion process.
METHODS: The ten most frequently asked questions regarding trigger finger were compiled from patient education websites and a literature review, then posed to ChatGPT. Two orthopedic specialists evaluated the responses using the Journal of the American Medical Association (JAMA) Benchmark criteria and the DISCERN instrument (A Tool for Judging the Quality of Written Consumer Health Information on Treatment Choices). Additionally, the readability of the responses was assessed using the Flesch-Kincaid Grade Level.
RESULTS: The DISCERN scores for ChatGPT's responses to trigger finger questions ranged from 35 to 47, with an average of 42, indicating "moderate" quality. While 60% of the responses were satisfactory, 40% contained deficiencies. According to the JAMA Benchmark criteria, the absence of scientific references was a significant drawback. The average readability level corresponded to the university level, making the information difficult to understand for patients with low health literacy. Improvements are needed to enhance the accessibility and comprehensibility of the content for a broader patient population.
CONCLUSION: To the best of our knowledge, this is the first study to investigate the use of ChatGPT in the context of trigger finger. While ChatGPT shows reasonable effectiveness in providing general information on trigger finger, expert oversight is necessary before it can be relied upon as a primary source for patient education.

Keywords: Trigger finger, ChatGPT, DISCERN, patient education, artificial intelligence.

Sorumlu Yazar: Mehmet Can Gezer, T�rkiye
Makale Dili: �ngilizce

ATIF KOPYALA

Tam Metin PDF At�f dosyas� indir RIS EndNote BibTex Medlars Procite Reference Manager Yazara e-posta g�nder Benzer makaleler PubMed Google Scholar