ChatGPT Yields a Passing Score on a Pediatric Board Preparatory Exam but Raises Red Flags.
OBJECTIVES: We aimed to evaluate the performance of a publicly available online artificial intelligence program (OpenAI's ChatGPT-3.5 and -4.0, August 3 versions) on a pediatric board preparatory examination, the American Academy of Pediatrics (AAP) 2021 and 2022 PREP® Self-Assessments.
METHODS: We entered 245 questions and answer choices from the Pediatrics 2021 PREP® Self-Assessment and 247 questions and answer choices from the Pediatrics 2022 PREP® Self-Assessment into OpenAI's ChatGPT-3.5 and ChatGPT-4.0, August 3 versions, in September 2023. The ChatGPT-3.5 and -4.0 scores were compared with the advertised passing score (70%+) for the PREP® exams and with the average scores (74.09% and 75.71%, respectively) for all 10 715 and 6825 first-time human test takers.
RESULTS: For the AAP 2021 and 2022 PREP® Self-Assessments, ChatGPT-3.5 answered 143 of 243 (58.85%) and 137 of 247 (55.46%) questions correctly on a single attempt. ChatGPT-4.0 answered 193 of 243 (79.84%) and 208 of 247 (84.21%) questions correctly.
CONCLUSION: Using a publicly available online chatbot to answer pediatric board preparatory examination questions yielded a passing score but demonstrated significant limitations in the chatbot's ability to assess some complex medical situations in children, posing a potential risk to this vulnerable population.
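The comparison described in METHODS reduces to simple percentage arithmetic: a fraction of correctly answered questions checked against the advertised 70% passing threshold. The sketch below illustrates that calculation only; the question IDs, answer key, and model answers are hypothetical placeholders, not the study's data or the authors' workflow.

```python
# Minimal sketch of the pass/fail comparison described in METHODS.
# All question IDs and answers below are made up for illustration.

PASSING_THRESHOLD = 0.70  # advertised PREP passing score (70%+)


def score_exam(model_answers: dict[str, str], answer_key: dict[str, str]) -> float:
    """Return the fraction of questions answered correctly."""
    correct = sum(
        1
        for qid, answer in model_answers.items()
        if answer_key.get(qid) == answer
    )
    return correct / len(answer_key)


if __name__ == "__main__":
    # Toy stand-in for a PREP question set (hypothetical).
    answer_key = {"Q1": "A", "Q2": "C", "Q3": "B", "Q4": "D"}
    model_answers = {"Q1": "A", "Q2": "C", "Q3": "D", "Q4": "D"}

    fraction = score_exam(model_answers, answer_key)
    verdict = "pass" if fraction >= PASSING_THRESHOLD else "fail"
    print(f"Score: {fraction:.2%} ({verdict} at the 70% threshold)")
```

The same fraction can then be set against the reported human averages (74.09% and 75.71%) for a relative comparison, as the abstract does.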