An Interactive Annotated World Bibliography of Printed and Digital Works in the History of Medicine and the Life Sciences from Circa 2000 BCE to 2024 by Fielding H. Garrison (1870-1935), Leslie T. Morton (1907-2004), and Jeremy M. Norman (1945- ) Traditionally Known as “Garrison-Morton”

16066 entries, 14153 authors and 1947 subjects. Updated: December 29, 2024

KUNG, Tiffany Hsiang

1 entry
  • 14106

Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models.

medrxiv.org/content/10.1101/2022.12.19, 2022.

Abstract: "We evaluated the performance of a large language model called ChatGPT on the United States Medical Licensing Exam (USMLE), which consists of three exams: Step 1, Step 2CK, and Step 3. ChatGPT performed at or near the passing threshold for all three exams without any specialized training or reinforcement. Additionally, ChatGPT demonstrated a high level of concordance and insight in its explanations. These results suggest that large language models may have the potential to assist with medical education, and potentially, clinical decision-making."

"In the past three weeks, a new AI model called ChatGPT captured significant attention due to its ability to perform a diverse array of natural language tasks [9]. ChatGPT is a general Large Language Model (LLM) developed recently by OpenAI. While the previous class of AI models have primarily been Deep Learning (DL) models, which are designed to learn and recognize patterns in data, LLMs are a new type of AI algorithm trained to predict the likelihood of a given sequence of words based on the context of the words that come before it. Thus, if LLMs are trained on sufficiently large amounts of text data, they are capable of generating novel sequences of words never observed previously by the model, but that represent plausible sequences based on natural human language. ChatGPT is powered by GPT3.5, an LLM trained on the OpenAI 175B parameter foundation model and a large corpus of text data from the Internet via reinforcement and supervised learning methods. Anecdotal usage indicates that ChatGPT exhibits evidence of deductive reasoning and chain of thought, as well as long-term dependency skills" (from the paper).

https://www.medrxiv.org/content/10.1101/2022.12.19.22283643v2.full-text

Order of authorship: Kung, Cheatham, ChatGPT...Tseng.



Subjects: Artificial Intelligence in Medicine, DIGITAL RESOURCES › Digital or Digitized Periodicals Online, Education, Biomedical, & Biomedical Profession